Citation
Abstract
Finding an effective feature extraction technique has been the focus of many researchers seeking to improve the performance of classification methods, for example in sentiment analysis. Many have shown interest in using word embeddings, especially Word2Vec, as features for text classification tasks. Its ability to model high-quality distributional semantics among words has contributed to its success in many of these tasks. Despite this success, Word2Vec features are high-dimensional, which increases the complexity of the classifier. In this paper, an effective feature extraction method based on Word2Vec is proposed for sentiment analysis. The method discovers polarity clusters of the terms in the vocabulary using Word2Vec and an opinion lexical dictionary. The feature vector for each text is constructed from the polarity clusters, which leads to a lower-dimensional vector representing the text. This paper also investigates the effect of two opinion lexical dictionaries on the performance of sentiment analysis; one of the dictionaries is created based on SentiWordNet. The effectiveness of the proposed method is evaluated on the IMDB dataset with two classifiers, namely Logistic Regression and Support Vector Machine. The results are promising, showing that the proposed method can be more effective than the baseline approaches.
Download File
Full text not available from this repository.
Official URL or Download Paper: https://ejournal.um.edu.my/index.php/MJCS/article/...
Additional Metadata
Item Type: | Article |
---|---|
Divisions: | Faculty of Computer Science and Information Technology |
DOI Number: | https://doi.org/10.22452/mjcs.vol33no3.5 |
Publisher: | University of Malaya * Faculty of Computer Science and Information Technology |
Keywords: | Sentiment analysis; SentiWordNet; Word2Vec; Word embeddings |
Depositing User: | Nurul Ainie Mokhtar |
Date Deposited: | 07 Sep 2023 00:24 |
Last Modified: | 07 Sep 2023 00:24 |
Altmetrics: | http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.22452/mjcs.vol33no3.5 |
URI: | http://psasir.upm.edu.my/id/eprint/85796 |