UPM Institutional Repository

Cross-project software defect prediction


Citation

Bala, Yahaya Zakariyau and Abdul Samat, Pathiah and Sharif, Khaironi Yatim and Manshor, Noridayu (2022) Cross-project software defect prediction. Journal of Theoretical and Applied Information Technology, 100 (15). 4825 - 4833. ISSN 1992-8645; ESSN: 1817-3195

Abstract

The feasibility of building a software defect prediction (SDP) model in the absence of previous records has been increased by the introduction of the Cross-Project Defect Prediction (CPDP) method. Although this method overcomes the limitations of SDP in the absence of previous historical records, the predictive performance of the CPDP model is relatively poor due to distribution discrepancy between the source and the target datasets. To overcome this challenge, various studies have been published. This SLR was conducted after analyzing research articles published since 2013 in four digital libraries: Scopus, IEEE, Science Direct, and Google Scholar. In this work, five research questions covering the classification algorithms, dataset, independent variables, performance evaluation metrics used in CPDP studies, and as well as the performance of individual machine learning classification algorithms in predicting software defects across different software projects were addressed accordingly. To respond to outlined questions, 34 most relevant articles were selected after passing through quality assessment criteria. Through this work, it was discovered the majority of the selected studies used machine learning techniques as classification algorithms, and 64% of the studies used the combination of Object-Oriented (OO) and Line of Code (LOC) metrics. All the selected studies used publicly available datasets from NASA, PROMISE, SOFLAB, AEEEM, and Relink. The most commonly used evaluation metrics are F_measure and AUC. Best performing classifiers include Logistic Regression and SVM. Despite various efforts to improve the performance of the CPDP model, the performance is below the applicable level. Thus, there is a need for further study that will improve the performance of the CPDP model.


Download File

Full text not available from this repository.
Official URL or Download Paper: http://www.jatit.org/volumes/onehundred15.php

Additional Metadata

Item Type: Article
Divisions: Faculty of Computer Science and Information Technology
Publisher: Little Lion Scientific
Keywords: Software defect prediction; Cross-project; Machine learning techniques; Statistical techniques; Performance evaluation measure
Depositing User: Ms. Nur Faseha Mohd Kadim
Date Deposited: 16 Aug 2023 08:51
Last Modified: 16 Aug 2023 08:51
URI: http://psasir.upm.edu.my/id/eprint/100844
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item