UPM Institutional Repository

Sentiment classification of financial news using statistical features


Citation

Yazdani, Sepideh Foroozan and Azmi Murad, Masrah Azrifah and Mohd Sharef, Nurfadhlina and Singh, Yashwant Prasad and Abdul Latiff, Ahmed Razman (2017) Sentiment classification of financial news using statistical features. International Journal of Pattern Recognition and Artificial Intelligence, 31 (3). ISSN 0218-0014; ESSN: 1793-6381

Abstract

Sentiment classification of financial news deals with the identification of positive and negative news so that they can be applied in decision support systems for stock trend predictions. This paper explores several types of feature spaces as different data spaces for sentiment classification of the news article. Experiments are conducted using N-gram models unigram, bigram and the combination of unigram and bigram as feature extraction with traditional feature weighting methods (binary, term frequency (TF), and term frequency-document frequency (TF-IDF)), while document frequency (DF) was used in order to generate feature spaces with different dimensions to evaluate N-gram models and traditional feature weighting methods. We performed some experiments to measure the classification accuracy of support vector machine (SVM) with two kernel methods of Linear and Gaussian radial basis function (RBF). We concluded that feature selection and feature weighting methods can have a substantial role in sentiment classification. Furthermore, the results showed that the proposed work which combined unigram and bigram along with TF-IDF feature weighting method and optimized RBF kernel SVM produced high classification accuracy in financial news classification.


Download File

[img]
Preview
Text (Abstract)
Sentiment classification of financial news using statistical features.pdf

Download (5kB) | Preview

Additional Metadata

Item Type: Article
Divisions: Faculty of Computer Science and Information Technology
DOI Number: https://doi.org/10.1142/S0218001417500069
Publisher: World Scientific Publishing
Keywords: Sentiment classification; Financial news; N-gram models; Traditional feature weighting methods; TF-IDF; Linear kernel SVM; RBF kernel SVM; Document frequency
Depositing User: Mohd Hafiz Che Mahasan
Date Deposited: 20 Aug 2018 06:34
Last Modified: 20 Aug 2018 06:34
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1142/S0218001417500069
URI: http://psasir.upm.edu.my/id/eprint/63198
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item