Copyright belongs to Nigeria Computer Society (NCS)
Author Biographies
F.E. Usman-Hamza
Department of Computer Science, University of Ilorin, Ilorin, Nigeria
A.F. Atte
Department of Computer Science, University of Ilorin, Ilorin, Nigeria
A.O. Balogun
Department of Computer Science, University of Ilorin, Ilorin, Nigeria
H.A. Mojeed
Department of Computer Science, University of Ilorin, Ilorin, Nigeria
A.O. Bajeh
Department of Computer Science, University of Ilorin, Ilorin, Nigeria
V.E. Adeyemo
School of Computing and IT, Taylor’s University, Selangor, Malaysia
Main Article Content
Impact of feature selection on classification via clustering techniques in software defect prediction
F.E. Usman-Hamza
A.F. Atte
A.O. Balogun
H.A. Mojeed
A.O. Bajeh
V.E. Adeyemo
Abstract
Software testing using software defect prediction aims to detect as many defects as possible in software before the software release. This plays an important role in ensuring quality and reliability. Software defect prediction can be modeled as a classification problem that classifies software modules into two classes: defective and non-defective; and classification algorithms are used for this process. This study investigated the impact of feature selection methods on classification via clustering techniques for software defect prediction. Three clustering techniques were selected; Farthest First Clusterer, K-Means and Make-Density Clusterer, and three feature selection methods: Chi-Square, Clustering Variation, and Information Gain were used on software defect datasets from NASA repository. The best software defect prediction model was farthest-first using information gain feature selection method with an accuracy of 78.69%, precision value of 0.804 and recall value of 0.788. The experimental results showed that the use of clustering techniques as a classifier gave a good predictive performance and feature selection methods further enhanced their performance. This indicates that classification via clustering techniques can give competitive results against standard classification methods with the advantage of not having to train any model using labeled dataset; as it can be used on the unlabeled datasets.
AJOL is a Non Profit Organisation that cannot function without donations.
AJOL and the millions of African and international researchers who rely on our free services are deeply grateful for your contribution.
AJOL is annually audited and was also independently assessed in 2019 by E&Y.
Your donation is guaranteed to directly contribute to Africans sharing their research output with a global readership.
Once off donations here:
For annual AJOL Supporter contributions, please view our Supporters page.
Tell us what you think and showcase the impact of your research!
Please take 5 minutes to contribute to our survey so that we can better understand the contribution that African research makes to global and African development challenges. Share your feedback to help us make sure that AJOL's services support and amplify the voices of researchers like you.