UPM Institutional Repository

The influence of machine learning on the predictive performance of cross-project defect prediction: empirical analysis


Citation

Bala, Yahaya Zakariyau and Samat, Pathiah Abdul and Sharif, Khaironi Yatim and Manshor, Noridayu (2024) The influence of machine learning on the predictive performance of cross-project defect prediction: empirical analysis. Telecommunication Computing Electronics and Control, 22 (4). pp. 830-837. ISSN 1693-6930; eISSN: 2302-9293

Abstract

This empirical investigation delves into the influence of machine learning (ML) algorithms in the realm of cross-project defect prediction, employing the AEEEEM dataset as a foundation. The primary objective is to discern the nuanced influences of various algorithms on predictive performance, with a specific focus on the F1 score metric as evaluation criterion. Four ML algorithms have been carefully assessed in this study: random forest (RF), support vector machines (SVM), k-nearest neighbors (KNN), and logistic regression (LR). The choice of these algorithms reflects their prevalence in software defect prediction literature and their diversity. Through rigorous experimentation and analysis, the investigation unveils compelling evidence affirming the superiority of RF over its counterparts. The F1 score utilized as evaluation metric, capturing the delicate balance between precision and recall, essential in defect prediction scenarios. The nuanced examination of algorithmic efficacy provides practical insights for developers and practitioners navigating the challenges of cross-project defect prediction. By leveraging the rich and diverse AEEEEM dataset, this study ensures a comprehensive exploration of algorithmic influences across varied software projects. The findings not only contribute to the academic discourse on defect prediction but also offer practical guidance for real-world application, emphasizing the pivotal role of RF as a tool in enhancing predictive accuracy and reliability.


Download File

[img] Text
113069.pdf - Published Version
Available under License Creative Commons Attribution Share Alike.

Download (447kB)

Additional Metadata

Item Type: Article
Divisions: Faculty of Computer Science and Information Technology
DOI Number: https://doi.org/10.12928/TELKOMNIKA.v22i4.25916
Publisher: Universitas Ahmad Dahlan
Keywords: Ross-project; Defect prediction; Machine learning; Random forest; Software defect
Depositing User: Mr. Mohamad Syahrul Nizam Md Ishak
Date Deposited: 15 Nov 2024 06:59
Last Modified: 15 Nov 2024 06:59
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.12928/TELKOMNIKA.v22i4.25916
URI: http://psasir.upm.edu.my/id/eprint/113069
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item