UPM Institutional Repository

Improving sentiment classification accuracy of financial news using n-gram approach and feature weighting methods


Citation

Yazdani, Sepideh Foroozan and Azmi Murad, Masrah Azrifah and Mohd Sharef, Nurfadhlina and Abdul Latiff, Ahmed Razman (2015) Improving sentiment classification accuracy of financial news using n-gram approach and feature weighting methods. In: 2nd International Conference on Information Science and Security (ICISS 2015), 14-16 Dec. 2015, Seoul, South Korea. .

Abstract

Sentiment classification of financial news deals with the identification of positive and negative news so that they can be applied in decision support system to perform stock trend predictions. This paper explores several types of feature space as different datasets for sentiment classification of the news article. Experiments are conducted based on n-gram approach (unigram, bigram and the combination of unigram and bigram) used as feature extraction with different feature weighting methods, while, document frequency (DF) is used as feature selection method. We performed experiments to measure the classification accuracy of support vector machine (SVM) with two kernel methods of linear and Radial Basis Function (RBF). Results showed that an efficient feature extraction increased classification accuracy when it is used as a combination of unigram and bigram. Moreover, we also found that DF can be applied as a dimension reduction method to reduce the feature space without loss of accuracy.


Download File

[img]
Preview
Text (Abstract)
Improving sentiment classification accuracy of financial news using n-gram approach and feature weighting methods.pdf

Download (35kB) | Preview

Additional Metadata

Item Type: Conference or Workshop Item (Paper)
Divisions: Faculty of Computer Science and Information Technology
Putra Business School
DOI Number: https://doi.org/10.1109/ICISSEC.2015.7371008
Publisher: IEEE
Keywords: Sentiment classification; Financial news; Support vector machine; Radial basis function; Linear kernel; Document frequency; TF-IDF
Depositing User: Nabilah Mustapa
Date Deposited: 08 Apr 2019 08:32
Last Modified: 08 Apr 2019 08:32
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1109/ICISSEC.2015.7371008
URI: http://psasir.upm.edu.my/id/eprint/14342
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item