UPM Institutional Repository

Feature selection techniques for enhancing app user review analysis


Citation

Salleh, Amran and Osman, Mohd Hafeez and Hassan, Sa’Adah and Said, Mar Yah (2025) Feature selection techniques for enhancing app user review analysis. IEEE Access, 13. pp. 85279-85293. ISSN 2169-3536

Abstract

The rapid growth of user-generated content, particularly app user reviews, presents a significant challenge in analyzing and extracting useful insights. The unstructured nature, inconsistent quality, and large volume of these reviews make it difficult to identify relevant information for app maintenance and updates. This study addresses this challenge by evaluating the impact of different feature selection techniques on the performance of machine learning models in multi-label classification tasks for app user review analysis. Our findings indicate that the subset attributes derived from a combination of Information Gain, GINI Index, and Correlation Matrix can improve the performance of machine learning models. Using a Support Vector Machine with proposed innovative score-based zero shot technique, promising results were achieved on average with 96.75% precision, 69.39% recall, and 80.81% F1-Score. Additionally, high-quality features, such as the percentage Difference can enhance performance in multi-label classification tasks, providing valuable insights for practitioners and researchers. The research implications and significance highlight the practical applications and strategic value of these findings, contributing to the advancement of knowledge and practice in the field of software engineering, particularly for multi-label classification tasks.


Download File

[img] Text
121576.pdf - Published Version
Available under License Creative Commons Attribution.

Download (2MB)
Official URL or Download Paper: https://ieeexplore.ieee.org/document/10962544/

Additional Metadata

Item Type: Article
Divisions: Faculty of Computer Science and Information Technology
DOI Number: https://doi.org/10.1109/ACCESS.2025.3558795
Publisher: Institute of Electrical and Electronics Engineers
Keywords: App user reviews; Feature selection; Machine learning; Multi-label classification; Text mining
Depositing User: MS. HADIZAH NORDIN
Date Deposited: 06 Nov 2025 04:04
Last Modified: 06 Nov 2025 04:04
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1109/ACCESS.2025.3558795
URI: http://psasir.upm.edu.my/id/eprint/121576
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item