An Empirical Analysis of the Effectiveness of Software Metrics and Fault Prediction Model for Identifying Faulty Classes

dc.authorid kumar, lov/0000-0002-0123-7822
dc.authorid Misra, Sanjay/0000-0002-3556-9331
dc.authorid Rath, Santanu/0000-0001-5641-8199
dc.authorscopusid 56120791500
dc.authorscopusid 56962766700
dc.authorscopusid 55428272300
dc.authorwosid kumar, lov/N-4569-2017
dc.authorwosid Misra, Sanjay/K-2203-2014
dc.authorwosid Rath, Santanu/O-6685-2017
dc.contributor.author Kumar, Lov
dc.contributor.author Misra, Sanjay
dc.contributor.author Rath, Santanu Ku.
dc.contributor.other Computer Engineering
dc.date.accessioned 2024-07-05T15:29:05Z
dc.date.available 2024-07-05T15:29:05Z
dc.date.issued 2017
dc.department Atılım University en_US
dc.department-temp [Kumar, Lov; Rath, Santanu Ku.] Natl Inst Technol, Dept CSE, Rourkela, India; [Misra, Sanjay] Atilim Univ, Dept Comp Engn, Ankara, Turkey en_US
dc.description kumar, lov/0000-0002-0123-7822; Misra, Sanjay/0000-0002-3556-9331; Rath, Santanu/0000-0001-5641-8199 en_US
dc.description.abstract Software fault prediction models are used to predict faulty modules at the very early stage of software development life cycle. Predicting fault proneness using source code metrics is an area that has attracted several researchers' attention. The performance of a model to assess fault proneness depends on the source code metrics which are considered as the input for the model. In this work, we have proposed a framework to validate the source code metrics and identify a suitable set of source code metrics with the aim to reduce irrelevant features and improve the performance of the fault prediction model. Initially, we applied a t-test analysis and univariate logistic regression analysis to each source code metric to evaluate their potential for predicting fault proneness. Next, we performed a correlation analysis and multivariate linear regression stepwise forward selection to find the right set of source code metrics for fault prediction. The obtained set of source code metrics are considered as the input to develop a fault prediction model using a neural network with five different training algorithms and three different ensemble methods. The effectiveness of the developed fault prediction models are evaluated using a proposed cost evaluation framework. We performed experiments on fifty six Open Source Java projects. The experimental results reveal that the model developed by considering the selected set of source code metrics using the suggested source code metrics validation framework as the input achieves better results compared to all other metrics. The experimental results also demonstrate that the fault prediction model is best suitable for projects with faulty classes less than the threshold value depending on fault identification efficiency (low - 48.89%, median- 39.26%, and high - 27.86%). en_US
dc.identifier.citationcount 40
dc.identifier.doi 10.1016/j.csi.2017.02.003
dc.identifier.endpage 32 en_US
dc.identifier.issn 0920-5489
dc.identifier.issn 1872-7018
dc.identifier.scopus 2-s2.0-85014063875
dc.identifier.startpage 1 en_US
dc.identifier.uri https://doi.org/10.1016/j.csi.2017.02.003
dc.identifier.uri https://hdl.handle.net/20.500.14411/2855
dc.identifier.volume 53 en_US
dc.identifier.wos WOS:000401046900001
dc.identifier.wosquality Q1
dc.institutionauthor Mısra, Sanjay
dc.language.iso en en_US
dc.publisher Elsevier en_US
dc.relation.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.scopus.citedbyCount 59
dc.subject Feature selection techniques en_US
dc.subject Artificial neural network en_US
dc.subject Ensemble method en_US
dc.subject Source code metrics en_US
dc.subject Cost analysis framework en_US
dc.title An Empirical Analysis of the Effectiveness of Software Metrics and Fault Prediction Model for Identifying Faulty Classes en_US
dc.type Article en_US
dc.wos.citedbyCount 46
dspace.entity.type Publication
relation.isAuthorOfPublication 53e88841-fdb7-484f-9e08-efa4e6d1a090
relation.isAuthorOfPublication.latestForDiscovery 53e88841-fdb7-484f-9e08-efa4e6d1a090
relation.isOrgUnitOfPublication e0809e2c-77a7-4f04-9cb0-4bccec9395fa
relation.isOrgUnitOfPublication.latestForDiscovery e0809e2c-77a7-4f04-9cb0-4bccec9395fa

Files

Collections