A Paired Learner-Based Approach for Concept Drift Detection and Adaptation in Software Defect Prediction

Date

2021

Publisher

MDPI

Open Access Color

GOLD

Green Open Access

Yes

Publicly Funded

No
Impulse
Top 10%
Influence
Average
Popularity
Top 10%

Abstract

The early and accurate prediction of defects helps in testing software and therefore leads to an overall higher-quality product. Because software defect data drift over time, the performance of a prediction model may degrade. Very few earlier works have investigated the significance of concept drift (CD) in software defect prediction (SDP). Their results have shown that CD is present in software defect data and that it has a significant impact on the performance of defect prediction. Motivated by this observation, this paper presents a paired learner-based drift detection and adaptation approach for SDP that dynamically adapts to varying concepts by updating one of the learners in the pair. For a given defect dataset, a subset of data modules is analyzed at a time by both learners based on their learning experience from the past. The difference in the accuracies of the two learners is used to detect drift in the data. We evaluate the presented approach using defect datasets collected from the SEACraft and PROMISE data repositories. The experimental results show that the presented approach successfully detects the concept drift points and performs better than existing methods, as is evident from a comparative analysis using performance measures such as the number of drift points, ROC-AUC score, and accuracy, and from statistical analysis using the Wilcoxon signed-rank test.
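
The paired-learner idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the nearest-centroid classifier, the chunking scheme, the `threshold` value, and all function names below are assumptions chosen for illustration. A "stable" learner is trained on all modules seen since the last detected drift, a "reactive" learner is trained only on the most recent chunk, and drift is flagged when the reactive learner's accuracy on a new chunk exceeds the stable learner's by more than the threshold.

```python
def train_centroids(X, y):
    """Fit a toy nearest-centroid classifier: one mean vector per label."""
    sums, counts = {}, {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for i, value in enumerate(row):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}


def predict(centroids, row):
    """Return the label whose centroid is closest (squared Euclidean distance)."""
    return min(
        centroids,
        key=lambda label: sum((c - v) ** 2 for c, v in zip(centroids[label], row)),
    )


def accuracy(centroids, X, y):
    """Fraction of rows in (X, y) that the classifier labels correctly."""
    hits = sum(predict(centroids, row) == label for row, label in zip(X, y))
    return hits / len(y)


def detect_drift(chunks, threshold=0.2):
    """Return indices of chunks at which drift is flagged.

    chunks: chronological list of (X, y) batches, where X is a list of
    numeric feature vectors and y the matching list of labels.
    threshold: hypothetical accuracy gap above which drift is declared.
    """
    first_X, first_y = chunks[0]
    hist_X, hist_y = list(first_X), list(first_y)  # data backing the stable learner
    recent_X, recent_y = first_X, first_y          # data backing the reactive learner
    stable = train_centroids(hist_X, hist_y)
    reactive = train_centroids(recent_X, recent_y)
    drift_points = []

    for idx in range(1, len(chunks)):
        X, y = chunks[idx]
        # Both learners judge the new chunk using only past experience.
        gap = accuracy(reactive, X, y) - accuracy(stable, X, y)
        if gap > threshold:
            drift_points.append(idx)
            # Adapt: the stable learner forgets everything older than the
            # data the reactive learner was trained on.
            hist_X, hist_y = list(recent_X), list(recent_y)
        # Update both learners with the chunk that was just evaluated.
        hist_X += X
        hist_y += y
        recent_X, recent_y = X, y
        stable = train_centroids(hist_X, hist_y)
        reactive = train_centroids(recent_X, recent_y)

    return drift_points
```

In this sketch the reactive learner needs one full chunk from the new concept before it can outperform the stable learner, so a change is flagged one chunk after it occurs. A real study would substitute proper classifiers and tune the chunk size and threshold on the defect data.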

Description

Kumar, Dr Sandeep/0000-0003-0747-6776; Kumar, Sandeep/0000-0002-3250-4866; Mishra, Alok/0000-0003-1275-2050; Kumar, Sandeep/0000-0001-9633-407X

Keywords

concept drift, naive Bayes, random forest, software defect prediction, software quality assurance, Technology, T, Engineering (General). Civil engineering (General), TA1-2040, Biology (General), QH301-705.5, Physics, QC1-999, Chemistry, QD1-999

Fields of Science

02 engineering and technology, 0202 electrical engineering, electronic engineering, information engineering

WoS Q

Q2

Scopus Q

Q2
OpenCitations Citation Count
5

Source

Applied Sciences

Volume

11

Issue

14

Start Page

6663

PlumX Metrics
Citations

CrossRef : 5

Scopus : 4

Captures

Mendeley Readers : 19

OpenAlex FWCI
0.8399
