UPM Institutional Repository

Enhancing minority sentiment classification in gastronomy tourism: a hybrid sentiment analysis framework with data augmentation, feature engineering and business intelligence


Citation

Razali, Mohd Norhisham and Hanapi, Rozita and Chiat, Lee Wen and Abdul Manaf, Syaifulnizam and Salji, Mohd Rafiz and Nisar, Kashif (2024) Enhancing minority sentiment classification in gastronomy tourism: a hybrid sentiment analysis framework with data augmentation, feature engineering and business intelligence. IEEE Access. pp. 1-24. ISSN 2169-3536 (In Press)

Abstract

The gastronomy tourism industry plays an important role in boosting local economies, enhancing the travel experience, and preserving culinary traditions unique to specific places. In this context, comprehending customer sentiments is of paramount importance for business decisions making, menu choices offering, marketing strategies, and customer service improvements. Traditional sentiment analysis methods in gastronomy tourism tend to be time-consuming, prone to human error, and influenced by subjectivity. Furthermore, the absence of an effective visualization strategy hampers the reliability of sentiment analysis efforts. Compounding this, the data collected often lacks balance across sentiment classes, making it challenging to predict minority sentiments accurately. To address these challenges, our research introduces a hybrid approach, combining various lexicon-based sentiment and emotional analysis algorithms, thereby enhancing the reliability of customer review analysis in the gastronomy tourism sector. Subsequently, we optimize machine learning sentiment classification by employing data augmentation in conjunction with feature engineering strategies, with the goal of improving the recognition of minority sentiment classes. Additionally, we present a comprehensive business intelligence and visualization solution that personalized for the gastronomy tourism industry in Sarawak that offering real-time sentiment visualization. The optimization of sentiment classification, achieved through the integration of synonym augmentation and n-gram feature engineering in conjunction with kNN classifiers, has yielded impressive results. This approach attains optimal classification performance, boasting an accuracy rate of 0.98, an F1-score of 0.99, and a ROC-AUC score of 0.99. Notably, this methodology significantly enhances the recognition of minority sentiment classes within the dataset, addressing one of the main challenges in this research endeavor.


Download File

Full text not available from this repository.
Official URL or Download Paper: https://ieeexplore.ieee.org/document/10422746/

Additional Metadata

Item Type: Article
Divisions: Faculty of Computer Science and Information Technology
DOI Number: https://doi.org/10.1109/ACCESS.2024.3362730
Publisher: IEEE
Keywords: Classification (of information); Data mining; Data visualization; Learning algorithms; Learning systems; Reliability analysis; Sales; Support vector machines; Visualization; Business analytics; Cultural difference; Machine learning algorithms; Machine-learning; Sentiment analysis; Sentiment classification; Support vectors machine; Tourism industry; Sentiment analysis
Depositing User: Mr. Mohamad Syahrul Nizam Md Ishak
Date Deposited: 24 May 2024 09:39
Last Modified: 24 May 2024 09:39
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1109/ACCESS.2024.3362730
URI: http://psasir.upm.edu.my/id/eprint/106240
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item