日本語フィールド
著者:Koki Yoshioka and Hiroshi Dozono題名:The Classification of the Documents Based on Word2Vec and 2-Layer Self Organizing Maps発表情報:International Journal of Machine Learning and Computing 巻: Vol.8 号: No.3 ページ: pp.252-255キーワード:Self Organizing Map (SOM), Word2Vec, documents classification.概要:Abstract—Due to popularization of SNS and increase of use of WEB, people have to deal with large number of text data. However, it is difficult to process huge text data manually. For this problem, the classification methods based on machine learning is considered to be applicable. As a method of document classification, WEBSOM and its variations can visualize the relations among the documents as the similar documents are classified closely on the 2 dimension plane, and they will present good usability to the user because of their visualization ability. In this paper, the document classification method based on SOM and Word2Vec model, which can reduce the computational costs as to be executed on personal computers and can enhance the visualization ability. The performance of the proposed method is examined in the experiments using general collection of documents, and DNA sequences as the sample data.抄録:英語フィールド
Author:Koki Yoshioka and Hiroshi DozonoTitle:The Classification of the Documents Based on Word2Vec and 2-Layer Self Organizing MapsAnnouncement information:International Journal of Machine Learning and Computing Vol: Vol.8 Issue: No.3 Page: pp.252-255Keyword:Self Organizing Map (SOM), Word2Vec, documents classification.An abstract:Abstract—Due to popularization of SNS and increase of use of WEB, people have to deal with large number of text data. However, it is difficult to process huge text data manually. For this problem, the classification methods based on machine learning is considered to be applicable. As a method of document classification, WEBSOM and its variations can visualize the relations among the documents as the similar documents are classified closely on the 2 dimension plane, and they will present good usability to the user because of their visualization ability. In this paper, the document classification method based on SOM and Word2Vec model, which can reduce the computational costs as to be executed on personal computers and can enhance the visualization ability. The performance of the proposed method is examined in the experiments using general collection of documents, and DNA sequences as the sample data.