UPM Institutional Repository

Improving cross-project software defect prediction method through transformation and feature selection approach


Citation

Bala, Yahaya Zakariyau and Abdul Samat, Pathiah and Sharif, Khaironi Yatim and Manshor, Noridayu (2023) Improving cross-project software defect prediction method through transformation and feature selection approach. IEEE Access, 11. pp. 2318-2326. ISSN 2169-3536

Abstract

In a practical situation where the project to be predicted is new, traditional software defect prediction cannot be employed. An alternative method is cross-project defect prediction, where the historical record of one project (source) is used to predict the defect status of another project (target). The cross-project defect prediction method solves the limitations of the historical records in the traditional software defect prediction method. However, the performance of cross-project defect prediction is relatively low because of the distribution differences between the source and target projects. Furthermore, the software defect dataset used for cross-project defect prediction is characterized by high-dimensional features, some of which are irrelevant and contribute to low performance. To resolve these two issues, this study proposes a transformation and feature selection approach to reduce the distribution difference and high-dimensional features in cross-project defect prediction. A comparative experiment was conducted on publicly available datasets from the AEEEM. Analysis of the results obtained shows that the proposed approach in conjugation with random forest as the classification model outperformed the other four state-of-the-art cross-project defect prediction methods based on the commonly used performance evaluation metric F1score.


Download File

[img] Text
Improving_Cross-Project_Software_Defect_Prediction_Method_Through_Transformation_and_Feature_Selection_Approach.pdf - Published Version

Download (1MB)

Additional Metadata

Item Type: Article
Divisions: Faculty of Computer Science and Information Technology
DOI Number: https://doi.org/10.1109/access.2022.3231456
Publisher: Institute of Electrical and Electronics Engineers Inc.
Keywords: Cross-project; Feature selection; Software defect; Transformation; Industry; Innovation and infrastructure
Depositing User: Mohamad Jefri Mohamed Fauzi
Date Deposited: 04 Sep 2024 03:57
Last Modified: 04 Sep 2024 03:57
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1109/access.2022.3231456
URI: http://psasir.upm.edu.my/id/eprint/110307
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item