An empirical analysis of the effectiveness of software metrics and fault prediction model for identifying faulty classes

dc.authoridkumar, lov/0000-0002-0123-7822
dc.authoridMisra, Sanjay/0000-0002-3556-9331
dc.authoridRath, Santanu/0000-0001-5641-8199
dc.authorscopusid56120791500
dc.authorscopusid56962766700
dc.authorscopusid55428272300
dc.authorwosidkumar, lov/N-4569-2017
dc.authorwosidMisra, Sanjay/K-2203-2014
dc.authorwosidRath, Santanu/O-6685-2017
dc.contributor.authorMısra, Sanjay
dc.contributor.authorMisra, Sanjay
dc.contributor.authorRath, Santanu Ku.
dc.contributor.otherComputer Engineering
dc.date.accessioned2024-07-05T15:29:05Z
dc.date.available2024-07-05T15:29:05Z
dc.date.issued2017
dc.departmentAtılım Universityen_US
dc.department-temp[Kumar, Lov; Rath, Santanu Ku.] Natl Inst Technol, Dept CSE, Rourkela, India; [Misra, Sanjay] Atilim Univ, Dept Comp Engn, Ankara, Turkeyen_US
dc.descriptionkumar, lov/0000-0002-0123-7822; Misra, Sanjay/0000-0002-3556-9331; Rath, Santanu/0000-0001-5641-8199en_US
dc.description.abstractSoftware fault prediction models are used to predict faulty modules at the very early stage of software development life cycle. Predicting fault proneness using source code metrics is an area that has attracted several researchers' attention. The performance of a model to assess fault proneness depends on the source code metrics which are considered as the input for the model. In this work, we have proposed a framework to validate the source code metrics and identify a suitable set of source code metrics with the aim to reduce irrelevant features and improve the performance of the fault prediction model. Initially, we applied a t-test analysis and univariate logistic regression analysis to each source code metric to evaluate their potential for predicting fault proneness. Next, we performed a correlation analysis and multivariate linear regression stepwise forward selection to find the right set of source code metrics for fault prediction. The obtained set of source code metrics are considered as the input to develop a fault prediction model using a neural network with five different training algorithms and three different ensemble methods. The effectiveness of the developed fault prediction models are evaluated using a proposed cost evaluation framework. We performed experiments on fifty six Open Source Java projects. The experimental results reveal that the model developed by considering the selected set of source code metrics using the suggested source code metrics validation framework as the input achieves better results compared to all other metrics. The experimental results also demonstrate that the fault prediction model is best suitable for projects with faulty classes less than the threshold value depending on fault identification efficiency (low - 48.89%, median- 39.26%, and high - 27.86%).en_US
dc.identifier.citation40
dc.identifier.doi10.1016/j.csi.2017.02.003
dc.identifier.endpage32en_US
dc.identifier.issn0920-5489
dc.identifier.issn1872-7018
dc.identifier.scopus2-s2.0-85014063875
dc.identifier.startpage1en_US
dc.identifier.urihttps://doi.org/10.1016/j.csi.2017.02.003
dc.identifier.urihttps://hdl.handle.net/20.500.14411/2855
dc.identifier.volume53en_US
dc.identifier.wosWOS:000401046900001
dc.identifier.wosqualityQ1
dc.language.isoenen_US
dc.publisherElsevieren_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectFeature selection techniquesen_US
dc.subjectArtificial neural networken_US
dc.subjectEnsemble methoden_US
dc.subjectSource code metricsen_US
dc.subjectCost analysis frameworken_US
dc.titleAn empirical analysis of the effectiveness of software metrics and fault prediction model for identifying faulty classesen_US
dc.typeArticleen_US
dspace.entity.typePublication
relation.isAuthorOfPublication53e88841-fdb7-484f-9e08-efa4e6d1a090
relation.isAuthorOfPublication.latestForDiscovery53e88841-fdb7-484f-9e08-efa4e6d1a090
relation.isOrgUnitOfPublicatione0809e2c-77a7-4f04-9cb0-4bccec9395fa
relation.isOrgUnitOfPublication.latestForDiscoverye0809e2c-77a7-4f04-9cb0-4bccec9395fa

Files

Collections