1 Introduction

The emergence of numerous project management tools and approaches can be attributed to increasing project complexity and team-based initiatives. The use of bug tracking tools is an important aspect of open-source project management. Bug reports and their debugging procedures have become an unavoidable part of software development over the previous few decades (Kamkar 1998). Software developers work hard to ensure that a software entity is bug-free (Fagan 2002). In practice, however, every software system encounters a large number of defect reports. For collecting, organising, and monitoring incoming bug reports, large software companies use bug tracking systems (Breu et al. 2010). Bug tracking systems (BTS) are also termed issue tracking systems (ITS); hence, the two terms are used interchangeably in this paper. Past works have produced a multitude of research endeavours to ensure genuine treatment of bugs. Most of these works are concerned with bug summary generation (Gupta and Gupta 2021), meta-field prediction (such as severity, priority, etc.) (Kumari et al. 2020; Sharma et al. 2021), duplicate identification (Neysiani et al. 2020; Isotani et al. 2021), developer recommendation (Goyal and Sardana 2016; Ye et al. 2020), bug reopening (Tagra et al. 2021), fixing-time estimation (Lee et al. 2020; Kumari et al. 2020), bug localization (Li et al. 2021), etc. These methods collect numerous bug report parameters/meta-fields from bug repositories and use them to create prediction models for specific tasks.

Fig. 1: The roadmap for the article

In a basic scenario, various stakeholders of a BTS (such as end users, developers, and testers) file the problems they find to the BTS (Anvik 2006). The bug triager checks the bug's existence and, if it is found to be valid, assigns the bug to a developer (Zhang et al. 2014). The developer uses the information supplied by the reporter in the bug report to reproduce the problem (Shokripour et al. 2015). However, if the developer is unable to replicate the bug, it is designated as Non-Reproducible (NR) (Joorabchi et al. 2014). In bug repositories, NR bug reports are a significant performance issue since they consume a significant amount of developer time and effort. NR bugs delay bug fixing and may even lead to the release of a software project with critical bugs (Rahman et al. 2020). Hence, the detection of NR bugs early in the bug life cycle is an open research problem requiring investigation.

Joorabchi et al. (2014) published the first characterization study on NR bugs. They addressed four research questions related to the quantitative and qualitative analysis of NR bugs. They manually mined the cause categories and transition patterns of about 1,600 NR bugs. Further, they studied the NR bugs which eventually got fixed. After conducting an exploratory investigation on 6 bug tracking repositories, they discovered that 17% of all bug reports are resolved as NR. The cause categories for 1,643 NR bugs are defined as Interbug Dependencies (45%), Environmental Differences (24%), Insufficient Information (14%), Conflicting Expectations (12%), Non-deterministic Behaviour (3%) and Others (2%). Furthermore, only around 2% of all NR bug reports eventually receive explicit code fixes, while the remaining fixed ones are repaired implicitly. This work sheds some light on the factors that lead to bugs being marked NR; however, it does not provide any mitigation strategy or any mechanism to improve the bug fixing process. Further, Goyal and Sardana (2017) presented a sentiment-analysis-based study of developers who worked on NR issue fixes. They discovered that developer comments posted in NR bug reports are more negative than those in standard defects. Machine learning classifiers were then used to forecast fixable issues from NR-flagged bugs. Our work differs from this work in that we do not study developer sentiments, since bug reports are technical documents consisting of technical keywords that lack any kind of sentiment. Secondly, the prediction model proposed by Goyal and Sardana (2017) deals with the prediction of reopened bugs, whereas our work deals with the prediction of new bugs. Hence, the work presented in this paper attempts to fill the research gap present in the literature: "to provide a mitigation strategy to early predict the NR bugs".

To the best of our knowledge, there does not exist any work on early prediction of NR bugs. A novel framework, NRPredictor, is proposed in this paper to forecast the fixability of bug reports. For fixability prediction, the proposed model combines feature selection and ensemble learning methods. Ensemble-based approaches use the capabilities of several different base classifiers to improve classification accuracy (Alzubi 2015). In this method, the training data is first separated into several disjoint subsets, and each subset is then used to train a base classifier. Feature selection algorithms reduce the complexity of the system by retaining only the most informative features.

Fig. 2: An example of an NR bug report

The following are the current work’s key research contributions (RC):

  1. The early fixability problem in bug reports has been examined. In this RC, the problem of predicting the bug type (R or NR) when a new bug is filed to the BTS has been examined.

  2. A novel framework, NRPredictor, based on feature selection and ensemble machine learning algorithms, has been proposed. In this RC, a novel framework has been proposed which predicts whether a new bug report will get fixed or will be marked as NR.

  3. Thirteen machine learning classifiers (Bayes Net, Naive Bayes, Naive Bayes Multinomial Text, Naive Bayes Updateable, IBk, Zero-R, JRip, OneR, PART, Decision Table, J48, Rep Tree and Random Tree) along with three ensemble learning techniques (Bagging, Boosting and Stacking) and one feature selection technique (Classifier Attribute Evaluator) have been utilized in the proposed framework, NRPredictor. In this RC, traditional and advanced machine learning algorithms have been utilized for the prediction of a newly reported bug as Fixable or NR.

  4. The proposed framework, NRPredictor, has been tested on three large-scale, well-known, long-lived, open-source Bugzilla repository projects, namely NetBeans,Footnote 1 Eclipse,Footnote 2 and Mozilla Firefox.Footnote 3 In this RC, bug reports from three long-lived software projects have been collected and processed to be fed into the NRPredictor framework for prediction purposes.

  5. Four evaluation metrics (Precision, Recall, F1-Score, Area under the Receiver Operating Characteristic Curve) have been used for comparison. The experimental findings reveal that the proposed framework, NRPredictor, consistently outperforms traditional machine learning techniques. F1-Scores of up to 88.3%, 87.8% and 87.4% have been obtained for the Mozilla Firefox, Eclipse and NetBeans projects, respectively. In this RC, performance evaluation of the proposed framework is conducted using various performance evaluation metrics.

The paper is organised as per the roadmap defined in Fig. 1. Section 2 goes through the background information, which includes the NR bug report structure, the bug report life cycle, and the ensemble and feature selection approaches used in this paper. The relevant past work across three domains (reproducibility, prediction models and ensemble techniques) is discussed in Sect. 3. The architecture of the proposed NRPredictor framework is detailed in Sect. 4. The experimental details are presented in Sect. 5. The results and analysis of the experimental evaluation are presented in Sect. 6. The threats to validity are discussed in Sect. 7. Finally, Sect. 8 concludes the work, and Sect. 9 discusses future research prospects.

Fig. 3: Life-cycle of a typical bug report in the Bugzilla repository

2 Background

This section covers the necessary background information for this research, such as the fundamental layout of a bug report, the typical life-cycle of an issue, and the various ensemble learning and feature selection methodologies used.

2.1 Bug report structure

A bug report is a record that contains complete information concerning a problem. It contains a number of bug meta-fields as well as some textual material. The meta-fields include bug id, product, component, platform, hardware, version, operating system, severity, priority, milestone, status, resolution, reporter's name, time-stamp of report submission, assignee, and so on. The textual information includes a quick summary or tagline, a detailed description of the error, and comments provided by the reporter, developers, or testers. Figure 2 displays an example of an NR bug report from the Eclipse project (Bug id: 13747).Footnote 4

In Fig. 2, the unique serial number assigned to every problem is referred to as the "Bug ID". The term "Product" refers to the broad area from which the bug sprang. The term "Component" refers to the product's next level of categorisation; one or more components can be found in a single product. The term "Version" refers to the software product version in which the defect was discovered. The "Status" parameter indicates where the bug is in its life cycle. The name of the developer who has been assigned the task of fixing the fault is referred to as "Assigned-to". The term "Summary" refers to a one-sentence explanation of the reported defect. "Description" refers to the bug report's complete, comprehensive specification, which is usually written by the reporter. The description usually consists of 3 main elements: observed behaviour, steps to reproduce, and expected software behaviour (Chaparro et al. 2017). The term "Comments" refers to an open-ended discussion among developers to find viable remedies for solving the bug.

Along with the particular meta-fields and textual contents, a bug report may contain attachments, URLs, and automatically produced notes. Extra information about the problem is commonly included in these fields, such as test cases, patch files, user-supplied screenshots, the URL of the website containing the issue, similar duplicate bugs, and so on.

2.2 Bug life-cycle

A bug progresses through various phases during its existence. Figure 3 shows the life-cycle of a bug report in the Bugzilla repository.Footnote 5 For different projects, the life-cycle stages may vary slightly, but the mainstream order remains the same. Initially, any bug's existence is UNCONFIRMED: a bug reporter has reported the problem, but its existence has yet to be validated. The existence of an unconfirmed issue is confirmed by the bug triager, who then labels the validated bug as NEW. Because a bug submitted by an expert is presumed to be real and existent, it may reach the NEW state immediately. The bug triager assigns a verified bug to a developer and labels its status as ASSIGNED. The assigned developer investigates the problem, reproduces it, and performs appropriate modifications to fix it.

Numerous bug report resolutions are available in the RESOLVED status, including fixed, duplicate, won't fix, worksforme (NR), invalid, remind, and later. The resolution of the problem is marked as fixed once the assigned developer has successfully made the relevant source code adjustments. However, the assigned developer is not always able to discover a valid remedy for the reported issue. A software developer may discover that the claimed problem is not unique when investigating a bug report: it might be a duplicate of an existing or fixed problem, or it could have the same basic cause as another bug. In this case, the bug's resolution is marked as duplicate (Sureka and Jalote 2010). The resolution of a bug report that outlines an issue which will not be rectified is set as won't fix. The problem is marked as NR or worksforme if it cannot be recreated using the information given in the bug report. When additional information is added to the NR bug, it may be reopened, which may aid in replicating the problem. A bug's resolution is marked as invalid when it is proven to be illegible or spam; invalid bugs are not considered real problems (Yuan et al. 2021). Bugs that would force third-party software or websites to make changes, for example, constitute a breach of legal and contractual obligations. If a bug requires further information and cannot be addressed immediately, it is marked with the resolution remind or later (Abou et al. 2021).

Table 1 Review of past works on non-reproducible bugs

2.3 Ensemble learning/ classification

Classification refers to the division or categorisation of objects in a system that organises them. Initially, manual classification of items was popular. Manual classification, however, has the drawbacks of being exceedingly time-consuming and fundamentally subjective in nature (Bauer et al. 1999; Alzubi et al. 2018). As a result, automatic classification algorithms were developed. Automatic classification is more objective, quicker, and scalable. It can be effective in more complicated, nuanced circumstances, such as business-specific material, because it provides companies with a more systematic and consistent classification. Artificial intelligence techniques are used extensively in computing for training, forecasting and evaluation purposes (Movassagh et al. 2021). Automatic document categorization can benefit from machine learning and artificial intelligence techniques to improve speed and efficiency.

Ensemble classification approaches are a type of meta machine learning algorithm that has recently gained popularity. To improve predictive performance, these strategies aggregate predictions from different learning algorithms (Dietterich et al. 2000). Distinct machine learning classifiers have different fundamental principles and training data sensitivities; as a result, different classification systems make different predictions on the same data. These various outcomes are used by ensemble machine learning algorithms to produce a superior prediction output (Alzubi et al. 2020). These strategies aim to reduce prediction model bias and variance while also attempting to improve on the prediction accuracy obtainable with any one of the constituent learning algorithms. Three alternative ensemble classification approaches were investigated in this paper; a minimal code sketch illustrating them is given after the list below.

  1. Bagging: Bagging, also referred to as "Bootstrap Aggregating", is a meta-estimator that uses several random subsets of the original dataset to fit a base classifier. The original dataset is re-sampled with replacement, and the predictions of the several learners are combined to generate the final result. Breiman demonstrates that the bagging approach is helpful for unstable learners (Breiman 1996).

  2. Boosting: This approach combines various weak classifiers to produce a powerful classifier. A model is deemed weak if it has a large error rate (0.5 or more for binary classification). In each iteration, the ensemble classifier is constructed to reduce the mistakes made in the previous step. Iterations are repeated until the maximum number of iterations is reached or until the whole training dataset is correctly predicted (Freund and Schapire 1995).

  3. Stacking: Stacked generalisation is an ensemble strategy which uses a training dataset to train several base classifiers and then uses these base classifiers to build a new dataset. This new dataset is then combined using a meta (combiner) machine learning approach (Wolpert 1992).
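The following is a minimal, illustrative sketch of the three ensemble strategies using scikit-learn. The base and meta learners shown (default decision trees, Gaussian Naive Bayes, logistic regression) are placeholder choices for exposition and are not the exact configuration used in NRPredictor.

```python
# Minimal sketch of bagging, boosting and stacking with scikit-learn.
# The base/meta learners are illustrative placeholders, not NRPredictor's configuration.
from sklearn.ensemble import AdaBoostClassifier, BaggingClassifier, StackingClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score

# Toy binary dataset standing in for the R/NR bug-report feature vectors
X, y = make_classification(n_samples=500, n_features=20, random_state=42)

# Bagging: bootstrap re-samples of the data, one base learner per sample,
# predictions combined by unweighted voting
bagging = BaggingClassifier(n_estimators=50, random_state=42)

# Boosting: each round re-weights the instances the previous learners got wrong
boosting = AdaBoostClassifier(n_estimators=50, random_state=42)

# Stacking: base learners' outputs form a new dataset for a meta-classifier
stacking = StackingClassifier(
    estimators=[("nb", GaussianNB()), ("tree", DecisionTreeClassifier())],
    final_estimator=LogisticRegression(),
)

for name, model in [("bagging", bagging), ("boosting", boosting), ("stacking", stacking)]:
    print(name, cross_val_score(model, X, y, cv=5, scoring="f1").mean())
```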

2.4 Feature selection techniques

Raw machine learning data is made up of a variety of attributes, some of which are useful for making predictions and others that are not. Feature selection approaches assist in identifying a set of relevant features from a large number of candidates. The Classifier Attribute Evaluator was used to select features in this paper. This attribute evaluator assesses the worth of each attribute (also known as a column or feature) in the dataset in relation to the output variable (i.e. the class).
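As a rough, illustrative analogue of classifier-based attribute evaluation (not the exact evaluator implementation used in this paper), each feature can be scored by how well a chosen classifier predicts the class from that feature alone, and only the highest-scoring features retained. The classifier, scoring metric and number of retained features below are placeholder choices.

```python
# Rough sketch of classifier-based attribute evaluation: score each feature by
# how well a classifier predicts the class from it alone, then keep the best ones.
import numpy as np
from sklearn.naive_bayes import GaussianNB
from sklearn.model_selection import cross_val_score
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=400, n_features=15, n_informative=5, random_state=0)

def attribute_scores(X, y, clf=None, cv=5):
    """Cross-validated F1 of a classifier trained on each single column."""
    clf = clf or GaussianNB()
    return np.array([
        cross_val_score(clf, X[:, [j]], y, cv=cv, scoring="f1").mean()
        for j in range(X.shape[1])
    ])

scores = attribute_scores(X, y)
top_k = 8                                   # number of features to retain (placeholder)
selected = np.argsort(scores)[::-1][:top_k]
X_reduced = X[:, selected]                  # reduced feature matrix fed to the learners
print("selected feature indices:", selected)
```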

3 Literature review

For the past two decades, the study of software flaws has been a hot topic of research. Perry and Stieg (1993) presented preliminary research on the investigation of reported problems in major software projects. The authors performed a survey to find out what kinds of difficulties users report, how they are discovered, and at what point of testing they are filed to the BTS. Since then, various studies have examined different buggy areas. This section reviews previous research in these areas, divided into three categories: reproducing bug reports, prediction models in bug fixing, and ensemble learning in bug fixing.

3.1 Reproducing bug reports

A bug report comprises 3 key elements: the procedure for replicating the problem, what the reporter expected to observe, and what the reporter actually observed (Chaparro et al. 2017). These 3 elements aid software developers in verifying, locating, and replicating the problematic scenario, as well as understanding the fundamental cause of the fault. After that, the assigned developer fixes the problem by making modifications to the source code. Reproducing a bug report is notoriously difficult since engineers are only given limited information about the failure, such as a memory dump. ReCrash, introduced by Artzi et al. (2008), is an automated approach to construct test cases that simulate a software failure. CRASHDROID was created by White et al. (2015) to automate the replication of problems for Android apps. Jin and Orso (2012) established BugRedux, an approach that gathers extra information from the field and transmits the collected information to developers for reproducing the failure circumstance. Despite the fact that studies exist to assist developers in recreating problem reports, their in-field performance is quite poor. RecrashJ, the Java version of ReCrash, for example, has a performance overhead of 13–64%.

If a developer is unable to replicate an issue, its resolution is marked as NR. It is often perplexing and time-consuming for engineers to manage NR problems. An empirical analysis of over 32,000 NR bugs was given by Joorabchi et al. (2014), who discovered that the resolution NR is assigned to 17% of all bug reports, and that just 3% of NR-assigned bug reports get repaired. Further, 1,643 NR issue reports were manually sorted into six different cause groups: inter-bug dependencies, environmental differences, insufficient information, conflicting expectations, non-deterministic bugs, and others. Goyal and Sardana (2017) conducted a sentiment study of developers who worked on NR issue fixes. They discovered that developer comments posted in NR bug reports are more negative than those in standard defects. Machine learning classifiers are also used to forecast fixable issues from NR-flagged bugs.

Table 1 presents the review of literature in the broad domain of NR bugs. From Table 1 it can be observed that these works shed some light on the factors that lead to bugs being marked NR; however, they do not provide any concrete mitigation strategies. Joorabchi et al. (2014) do not provide any mechanisms to improve the bug fixing process. Goyal and Sardana (2017) presented a sentiment-analysis-based model to forecast fixable issues from NR-flagged bugs. However, their work deals with the prediction of re-opened bugs, whereas the current manuscript deals with the prediction of new bugs. Hence, the work presented in this paper attempts to fill the research gap present in the literature: "to provide a mitigation strategy to early predict the NR bugs".

Table 2 Review of past works on prediction models in bug fixing

3.2 Prediction models in bug fixing

In the arena of software engineering research, debugging is a well-known concept. Many research and prediction models have been established for bug summarization (Koh et al. 2021), bug triaging (Mohan et al. 2016; Goyal and Sardana 2017), duplicate detection (Rocha and Carvalho 2021), fix time prediction (Yuan et al. 2021), blocking bug prediction (Cheng et al. 2020), reopened bugs (Shihab et al. 2013), etc. Garcia and Shihab (2014) developed a bug-blocking prediction model. They inspected the performance of decision tree, Naive Bayes, kNN, random forest, and Zero-R classifiers using 14 bug attributes to discriminate between blocking and non-blocking bug reports. Using 10-fold cross validation, they were able to reach an F-measure of \(15-42\%\). Xia et al. (2015) expanded on this research to address the problem of class imbalance in blocking bug prediction. It has been found that ensemble learners successfully handle the phenomenon of class imbalance and, as a result, may increase minority class prediction accuracy. Our forecasting is based on the same principles. Shihab et al. (2013) addressed reopened bug reports. For bug report classification, they employed 22 distinct characteristics divided into four categories: developer work habits, bug report, problem fix, and team. While predicting reopened bugs, they reported accuracy values of 52.1–78.6% and recall values of 0.5–94.1% for reopened bug finding. Comment text and last status were discovered to be the most important elements.

Fig. 4: Architecture of the proposed framework, NRPredictor

Hewett and Kijsanayothin (2009) built a model that anticipates how long it will take to fix software bugs. On a medical software system dataset, the suggested model has an accuracy of 93.44%. Guo et al. (2010) presented an architecture for predicting the fixability of a freshly discovered problem. On the Microsoft Windows Vista project, the suggested model achieved accuracy values of up to 68% and recall values of up to 64%. Zimmermann et al. (2012) analysed and evaluated reopened bug reports to determine likely causes of reopening and to assess their effect. Table 2 presents the review of literature related to prediction models in different phases of bug handling. Meta-heuristic algorithms such as AHP, TOPSIS, etc. are also used in bug handling processes nowadays (Goyal and Sardana 2017). Research is in progress to further optimise meta-heuristic algorithms (Agushaka et al. 2022; Abualigah et al. 2021, 2022; Oyelade et al. 2022; Sethuraman et al. 2019).

Various studies have shown that machine learning classifiers are successful in predicting different buggy areas (Garcia and Shihab 2014; Ahmed et al. 2021; Malhotra et al. 2021; Rashmi and Kambli 2020). In contrast to the many prediction models in past works relating to software debugging procedures, our research focuses on NR defects. The experiments were carried out on bug reports labelled with the classes Reproducible (R) and NR. The goal is to forecast which bug reports can be fixed.

3.3 Ensemble learning in bug fixing

In the machine learning classification literature, ensemble learning plays an essential role. Numerous ensemble-based classifiers have been suggested to increase the performance of traditional machine learning classifiers (López et al. 2013). The effectiveness of ensembling techniques may be attributed to the diversity of their base learners (Guo et al. 2008). As a result, ensemble classifiers use a group of base classifiers to build a prediction model. There are many different forms of ensemble models, e.g., bagging (Breiman 1996), boosting (Freund and Schapire 1995), stacking (Wolpert 1992), etc. The bagging approach uses the same base classifier to train several classifiers, which are then combined using an unweighted majority voting mechanism; the majority of votes determines the final forecast. Bagging approaches typically outperform single model algorithms by a wide margin, and bagging is rarely considerably worse, since it mitigates the classifiers' volatility (Phua et al. 2004). Boosting is an iterative method which assigns a weight to each instance of the training dataset in every iteration. During the first run, all weights are set equal (Freund and Schapire 1995). The weights of improperly categorised instances are raised with each repetition. This helps weak learners concentrate on the training set's difficult cases. Stacking combines numerous classifiers using a meta-classifier such as Logistic Regression (Wolpert 1992). Multiple base classifiers are used to classify a single test case, and the output of these base classifiers is fed into a meta-classifier, which produces the ultimate prediction.

In the software debugging literature, ensemble learners have been employed in a number of studies. A stacking ensemble approach for automated bug triaging was assessed by Jonsson et al. (2016). According to their findings, stacking beats standard machine learning techniques for the multi-class problem of developer selection. Goyal and Sardana (2019) provided an empirical study of bug triaging strategies using ensemble classification algorithms (bagging, boosting, majority voting, average voting, and stacking). According to the researchers, ensemble classifiers outperformed standard machine learning algorithms in the identification of an appropriate developer to handle a bug report.

Table 3 Feature/ parameter list used in NRPredictor framework

Limsettho et al. (2018) presented a SMOTE-based technique to predict cross-project defects. Laradji et al. (2015) found that ensemble learning has a favourable influence on software fault prediction. Xia et al. (2015) proposed ELBlocker, an ensemble learning approach for predicting blocking problems. They demonstrated that orthodox machine learning methods perform poorly on severely unbalanced datasets, but ensemble-based strategies aid in the development of stronger prediction models. They found that employing an ensemble-based method improved F1-Score by 14.69% when compared to traditional machine learning classifiers. Lal et al. (2017) introduced an ensemble-based model for logging predictions called ECLogger. To deal with the problem of class imbalance, they employed bagging and voting ensemble approaches. Using ensemble-based approaches, they were able to obtain better prediction performance. Motivated by these studies (Jonsson et al. 2016; Lal et al. 2017; Xia et al. 2015), the issue of fixability prediction has been addressed. Ensemble-based techniques along with feature selection techniques have been used to predict fixable and NR bugs optimally.

4 NRPredictor framework

This section describes the architecture of the proposed framework, NRPredictor, as shown in Fig. 4. The framework is divided into two phases: model building and prediction.

PHASE 1: model building phase

Initially, previous bug reports from the bug repository with known labels (R or NR) are used as input. Following that, different characteristics are retrieved and pre-processing techniques are applied. Machine learning is then used to learn multiple models based on the retrieved information. Finally, this phase generates hybrid models for forecasting the class of bug reports that have not been labelled. The following are the steps in the model building phase of the proposed framework, NRPredictor:

Fig. 5: General procedure of the proposed framework, NRPredictor

  1. Dataset acquisition: First, bug reports with class labels (R and NR) are collected from various projects of the Bugzilla repository in this stage. A bug report is considered NR if it is marked as NR or "worksforme" at any point during its life cycle.

  2. Feature extraction: Next, 9 different features are extracted from the collected bug reports. Among the 9 features, 8 are numerical (component, severity, priority, operating system, hardware, version, number of comments and cc count) and 1 is textual in nature. All of the features utilised in this paper, along with their descriptions, are listed in Table 3.

  3. Data pre-processing: The bug reports' textual contents are processed in this stage to build a feature vector containing critical keywords. This stage entails cleaning up the text content. The bug report's textual description is received first. The text is then tokenized, with all concatenated terms being broken up and converted to lower case. Stop words are eliminated because, although their frequency is high, they do not carry any information. The stop word list provided by the Python Natural Language ToolKit (NLTK) has been used. Porter stemming (supplied by Python NLTK) is then used to transform the remaining tokens to their root form. Stemming is the process of conflating closely related terms (for example, stemming converts the two words likes and likely to the root term like). A code sketch of this pre-processing pipeline is given after this list.

  4. Feature vector creation: Next, the frequency of each token is determined after data pre-processing, and a textual feature vector is constructed. In this stage, the top 100 tokens with the highest overall frequency derived from the textual content are considered as textual features.

  5. Feature selection/ reduction: Feature selection techniques help to obtain a set of relevant features from the list of all available features. In this work, the Classifier Attribute Evaluator has been used for feature selection. The attribute evaluator assesses each attribute in the dataset (also known as a column or feature) in relation to the output variable (i.e. the class).

  6. Classifier learning: Various NRPredictor models were created in this stage. First, 13 machine learning algorithms (Bayes Net, Naive Bayes, Naive Bayes Multinomial Text, Naive Bayes Updateable, IBk, Zero-R, JRip, OneR, PART, Decision Table, J48, Rep Tree and Random Tree) from four families (Bayes, trees, rules and lazy) were used to learn the models. An empirical investigation was conducted by comparing the performance of these 13 machine learning classifiers. Then, three ensemble learning strategies were applied on top of the 13 machine learning classifiers. Three ensemble-based strategies (bagging, boosting, and stacking) are utilised to create prediction models for fixability prediction.
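The following is a minimal sketch of the text pre-processing and feature-vector steps described above, assuming NLTK for tokenization, stop-word removal and Porter stemming. The helper names and the toy description strings are illustrative only and are not taken from the NRPredictor source.

```python
# Minimal sketch of the textual pre-processing and top-100 token feature vector.
# Helper names and sample text are illustrative; run nltk.download('punkt') and
# nltk.download('stopwords') once before use.
from collections import Counter
from nltk.tokenize import word_tokenize
from nltk.corpus import stopwords
from nltk.stem import PorterStemmer

STOP_WORDS = set(stopwords.words("english"))
STEMMER = PorterStemmer()

def preprocess(description: str) -> list:
    """Tokenize, lower-case, drop punctuation/stop words, and stem."""
    tokens = word_tokenize(description.lower())
    tokens = [t for t in tokens if t.isalpha() and t not in STOP_WORDS]
    return [STEMMER.stem(t) for t in tokens]

def build_vocabulary(descriptions, top_k=100):
    """Keep the top_k most frequent stemmed tokens over all training reports."""
    counts = Counter(tok for d in descriptions for tok in preprocess(d))
    return [tok for tok, _ in counts.most_common(top_k)]

def to_feature_vector(description, vocabulary):
    """Frequency of each vocabulary token in a single bug report description."""
    counts = Counter(preprocess(description))
    return [counts[tok] for tok in vocabulary]

# Toy usage with two fabricated bug descriptions
reports = ["Editor crashes when opening large file", "Crash on opening project files"]
vocab = build_vocabulary(reports, top_k=100)
print(to_feature_vector(reports[0], vocab))
```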

PHASE 2: prediction phase

This phase accepts as input a bug report whose class label has to be predicted. It then extracts the bug report's characteristics, applies pre-processing techniques, and predicts the label using the hybrid models created during the model building phase (Phase 1). The following step is involved in the prediction phase of the NRPredictor framework:

Classification: In this step, the 9 features of the test bug report are first extracted. Then, pre-processing is done by applying tokenization, case conversion, punctuation removal, stop word removal and stemming to the test bug report. The pre-processed feature vector is then supplied to the proposed framework, NRPredictor. On the basis of the various learned hybrid models, the test bug report is classified and a label (either R or NR) is predicted for it. Next, the label predicted by the proposed framework and the ground truth label are compared, and various evaluation metrics are computed to evaluate the prediction performance of the learned models and the NRPredictor framework.

5 Experimental details

The subject systems, implementation details, assessment measures, and research questions addressed in this paper are detailed in this section. The experimental setup comprises a MacBook Pro with 8 GB of memory and a 2.7 GHz Intel Core i5 processor running Mac OS X 10.13.1. However, a few machine learning algorithm combinations could not be run on this platform within a time threshold of 8 hours; such combinations were considered to require higher computing power. Hence, all such algorithm combinations were run on a GPU setup equipped with NVIDIA Tesla V100 cards with 16 GB RAM (5120 CUDA cores each), four of which were used in parallel for this work. For experimentation, the Python programming language has been used with Jupyter Notebook as the Integrated Development Environment (IDE). The scikit-learn machine learning library has been used for building conventional and ensemble classification models.

5.1 Subject systems

This research examined bug reports obtained from three projects hosted on the Bugzilla BTS: NetBeans, Eclipse, and Mozilla Firefox. NetBeans is an open-source development environment, tooling platform, and application framework for creating programs using modules.Footnote 6 It is a Java application. Eclipse is a framework that contains a number of tools for creating a Java integrated development environment (IDE).Footnote 7 It was introduced in 2001 and is the most extensively used Java IDE. It is primarily written in Java. Mozilla Firefox is an open-source web browser launched in 2002.Footnote 8 This popular web browser is available for a variety of systems and is written in C++.

Table 4 Distribution of bug reports in R and NR categories

Since a huge number of bug reports are submitted on a daily basis, the different projects of the Bugzilla dataset are suitable for assessing trends and evaluating the recommended techniques. While choosing experimental projects, an attempt was made to cover a wide variety of disciplines. These projects have been used frequently in past studies (Anvik and Murphy 2011; Bhattacharya and Neamtiu 2010; Tamrawi et al. 2011). The popularity and maturity of these projects make them ideal for researching any novel bug-handling technique. A total of 261,551 bug reports were gathered and divided into two groups (R and NR). The distribution of bug reports into R and NR categories is depicted in Table 4.

5.2 Implementation details

The proposed framework, NRPredictor, has been evaluated using bug reports obtained from three long-lived projects of the Bugzilla BTS: NetBeans, Eclipse, and Mozilla Firefox. The general implementation procedure is depicted in Fig. 5. First, the bug reports with known labels (R or NR) are collected from the bug repository and are considered as ground truth. Next, 8 numerical features (component, severity, priority, operating system, hardware, version, number of comments and cc count) and 1 textual feature are extracted from the bug reports. The textual feature is then pre-processed using five procedures: tokenization, case conversion, punctuation removal, stop word removal and stemming. Next, the frequency of each token is determined and the 100 most frequent tokens are considered for feature vector creation. Further, a feature selection technique, the Classifier Attribute Evaluator, is used to further reduce the complexity of the dataset. Then, various NRPredictor models are created using thirteen machine learning algorithms (Bayes Net, Naive Bayes, Naive Bayes Multinomial Text, Naive Bayes Updateable, IBk, Zero-R, JRip, OneR, PART, Decision Table, J48, Rep Tree and Random Tree) and three ensemble learning strategies (bagging, boosting, and stacking) for each considered project. Finally, the built models are evaluated by passing new bug reports as input. The proposed framework, NRPredictor, outputs a class label (R or NR) for each testing bug report. A rough sketch of such an end-to-end pipeline is given below.
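As a rough illustration of how the pieces described above fit together, the following sketch wires feature selection and a stacking ensemble into one scikit-learn pipeline. The specific estimators, the SelectKBest selector standing in for the Classifier Attribute Evaluator, and the toy data are assumptions made for exposition, not the exact NRPredictor implementation.

```python
# Illustrative end-to-end pipeline: feature selection followed by a stacking ensemble.
# SelectKBest stands in for the Classifier Attribute Evaluator; estimators are placeholders.
import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.ensemble import StackingClassifier, RandomForestClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Toy data: 8 numerical meta-fields + 100 textual token frequencies per bug report
rng = np.random.default_rng(0)
X = rng.random((300, 108))
y = rng.integers(0, 2, 300)          # 0 = R (fixable), 1 = NR

pipeline = Pipeline([
    ("select", SelectKBest(score_func=f_classif, k=50)),   # feature selection step
    ("stack", StackingClassifier(
        estimators=[
            ("nb", GaussianNB()),
            ("tree", DecisionTreeClassifier()),
            ("rf", RandomForestClassifier(n_estimators=100)),
        ],
        final_estimator=LogisticRegression(),               # meta-classifier
    )),
])

# 10-fold cross-validated F1-Score, mirroring the paper's evaluation protocol
print(cross_val_score(pipeline, X, y, cv=10, scoring="f1").mean())
```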

5.3 Evaluation metrics

Four common performance assessment criteria were utilised to estimate the predictive power of the NRPredictor framework: Precision, Recall, F1-Score, and Area under the Receiver Operating Characteristic (ROC) curve. All of these assessment criteria have been extensively utilised in the software debugging area (Hewett and Kijsanayothin 2009; Shihab et al. 2013; Xia et al. 2015). When utilising NRPredictor models to predict the class label, there are four possible outcomes: (1) a bug is predicted as NR when it is truly NR (True Positive: tp), (2) predicted as NR but truly R (False Positive: fp), (3) predicted as R when it is truly NR (False Negative: fn), or (4) predicted as R when it is truly R (True Negative: tn). The metrics precision, recall, F1-Score, and ROC are computed using these four values:

Table 5 Performance of conventional classifiers on NetBeans project
Table 6 Performance of conventional classifiers on Eclipse project
Table 7 Performance of conventional classifiers on Mozilla Firefox project
  1. Precision: It denotes the proportion of relevant occurrences found among the overall number of examples retrieved. Equation 1 shows the formula for precision.

     $$ {\text{Precision}} = \frac{tp}{tp + fp} $$
     (1)

  2. Recall: It denotes the percentage of relevant occurrences found out of all relevant examples. Equation 2 shows the formula for recall.

     $$ {\text{Recall}} = \frac{tp}{tp + fn} $$
     (2)

  3. F1-Score: There is a trade-off between the precision and recall measurements; a rise in one frequently results in a drop in the other. As a result, evaluating prediction performance using precision and recall alone is problematic. The F1-Score combines the advantages of the precision and recall metrics: it is the weighted harmonic mean of precision and recall. It is a frequently used metric for evaluating performance (Lal et al. 2017; Xia et al. 2015). Equation 3 shows the formula for F1-Score; a small worked example is given after this list.

     $$ {\text{F1-Score}} = \frac{(\beta^{2} + 1)*{\text{Precision}}*{\text{Recall}}}{\beta^{2}*{\text{Precision}} + {\text{Recall}}} $$
     (3)

     When \(\beta \) is equal to 1, the F1-Score is calculated as shown in Equation 4.

     $$ {\text{F1-Score}} = \frac{2*{\text{Precision}}*{\text{Recall}}}{{\text{Precision}} + {\text{Recall}}} $$
     (4)

  4. Area under ROC curve: The ROC (Receiver Operating Characteristic) curve is a graph of the true positive rate (tpr) versus the false positive rate (fpr). The area under the ROC curve measures the probability that an NR bug report is assigned a greater likelihood than an R bug report. The ROC value lies between 0 and 1; a larger value indicates better prediction performance of the developed NRPredictor model.
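For concreteness, a small worked example with assumed counts (tp = 80, fp = 20, fn = 10, values chosen purely for illustration) shows how the first three metrics are computed:

$$ {\text{Precision}} = \frac{80}{80 + 20} = 0.80, \quad {\text{Recall}} = \frac{80}{80 + 10} \approx 0.889, \quad {\text{F1-Score}} = \frac{2 * 0.80 * 0.889}{0.80 + 0.889} \approx 0.842 $$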

Table 8 Performance of bagging ensemble learning technique using conventional classifiers on NetBeans project
Table 9 Performance of bagging ensemble learning technique using conventional classifiers on Eclipse project
Table 10 Performance of bagging ensemble learning technique using conventional classifiers on Mozilla Firefox project

The effectiveness of the NRPredictor models was assessed using the cross validation approach. When only a small number of data examples are present, cross validation is employed to provide an unbiased estimate of the model's performance. In k-fold cross-validation, the data is separated into k equal-sized subsets. The model is then generated k times, each time utilising \((k-1)\) subsets of data examples for training the learning classifier and the remaining subset for testing predictions.
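A minimal sketch of this evaluation protocol is given below, assuming scikit-learn's built-in scorers for the four metrics; the classifier used is a placeholder, and any of the learned NRPredictor models could be substituted.

```python
# Minimal sketch of k-fold cross validation with the four evaluation metrics.
# The classifier is a placeholder for any learned NRPredictor model.
import numpy as np
from sklearn.model_selection import cross_validate, StratifiedKFold
from sklearn.ensemble import RandomForestClassifier
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=600, n_features=30, random_state=1)  # toy R/NR data

cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=1)
scores = cross_validate(
    RandomForestClassifier(n_estimators=100, random_state=1),
    X, y, cv=cv,
    scoring=["precision", "recall", "f1", "roc_auc"],
)

for metric in ["precision", "recall", "f1", "roc_auc"]:
    print(metric, round(np.mean(scores["test_" + metric]), 3))
```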

5.4 Research questions

The following set of research questions (RQs) are examined in this paper:

  • Research Question 1: What is the performance of traditional machine learning approaches for predicting bug report reproducibility?

  • Research Question 2: What is the performance of ensemble machine learning approaches for predicting bug report reproducibility?

  • Research Question 3: What is the performance of ensemble machine learning techniques after applying feature selection for predicting bug report reproducibility?

6 Results and analysis

This section discusses the results obtained for the three research questions addressed in this work.

RQ1: Performance of traditional machine learning approaches

Tables 5, 6 and 7 examine the performance of the thirteen base classifiers (Bayes Net, Naive Bayes, Naive Bayes Multinomial Text, Naive Bayes Updateable, IBk, Decision Table, Zero-R, JRip, OneR, PART, J48, Rep Tree and Random Tree) from four families (Bayes, lazy, rules and trees) to find the best fixability prediction classifier. Various evaluation metrics, namely Precision, Recall, F1-Score and ROC, have been computed for comparison purposes. In terms of F1-Score, the experimental findings show that the PART classifier surpasses all other classifiers. The PART classifier scored the greatest F1-Scores of 81.3%, 82.8%, and 85.6% on the NetBeans, Eclipse, and Mozilla Firefox projects, respectively.

Table 11 Performance of boosting ensemble learning technique using conventional classifiers on NetBeans project
Table 12 Performance of boosting ensemble learning technique using conventional classifiers on Eclipse project
Table 13 Performance of boosting ensemble learning technique using conventional classifiers on Mozilla Firefox project
Table 14 Performance of ensemble learning techniques using conventional classifiers on three considered projects

RQ2: performance of ensemble learning approaches

To address this RQ, ensemble learning models have been created using three techniques: Bagging, Boosting and Stacking. Tables 8, 9 and 10 present the results of the bagging ensemble models using the thirteen base classifiers on the three considered projects. The experimental results reveal that Bagging using the Random Forest algorithm performs better than the other classifiers when the F1-Score evaluation metric is considered for comparison. The highest F1-Scores of 83.1%, 84.4% and 86.6% were achieved by the Bagging ensemble learners on the NetBeans, Eclipse and Mozilla Firefox projects, respectively. Improvements of 1.8%, 1.6% and 1% were achieved by the Bagging models on the NetBeans, Eclipse and Mozilla Firefox projects, respectively, as compared to the best performing individual classifier (PART).

Table 15 Performance of ensemble learning techniques with feature selection using conventional classifiers on three considered projects

Tables 11, 12 and 13 present the results of the boosting ensemble models using the thirteen base classifiers on the three considered projects. The experimental results reveal that Boosting using the Random Forest algorithm performs better than the other classifiers when the F1-Score evaluation metric is considered for comparison. The highest F1-Scores of 84.8%, 86.2% and 87.2% were achieved by Boosting on the NetBeans, Eclipse and Mozilla Firefox projects, respectively. Improvements of 3.5%, 3.4% and 1.6% were achieved by the Boosting models on the NetBeans, Eclipse and Mozilla Firefox projects, respectively, as compared to the best performing individual classifier (PART).

For the stacking ensemble learning technique, all possible combinations of the thirteen base classifiers have been used to learn the models, starting from combinations of size three. In total, 8,100 different model combinations have been learned for each considered project. Logistic Regression has been used as the meta classifier. The performance and improvement of the best model combination for each project is reported in Table 14. The highest F1-Scores of 86.1%, 85.1% and 87.5% were achieved by the Stacking ensemble learning technique on the NetBeans, Eclipse and Mozilla Firefox projects, respectively. Improvements of 4.8%, 2.3% and 1.9% were achieved by the Stacking technique on the NetBeans, Eclipse and Mozilla Firefox projects, respectively, as compared to the best performing individual classifier (PART). For the NetBeans project, stacking with the base classifier combination of PART, Random Forest, REPTree, Naive Bayes and J48 obtained the best performance. For the Eclipse project, stacking with the base classifier combination of PART, REPTree, JRip, Naive Bayes and Decision Table obtained the best performance. For the Mozilla Firefox project, stacking with the base classifier combination of PART, REPTree, Naive Bayes and Decision Table obtained the best performance.

RQ3: performance of ensemble learning approaches with feature selection

To address this RQ, the Classifier Attribute Evaluator has been used for feature selection before applying the ensemble learning techniques for prediction. Table 15 summarizes the performance and improvement of the various ensemble learning algorithms combined with the feature selection technique. The highest F1-Scores of 83.1%, 84.4% and 86.6% were achieved by Bagging on the NetBeans, Eclipse and Mozilla Firefox projects, respectively. The highest F1-Scores of 84.8%, 86.2% and 87.2% were achieved by Boosting on the NetBeans, Eclipse and Mozilla Firefox projects, respectively. The highest F1-Scores of 87.4%, 87.8% and 88.3% were achieved by Stacking on the NetBeans, Eclipse and Mozilla Firefox projects, respectively. Improvements of 6.1%, 5% and 2.7% were achieved by Stacking on the NetBeans, Eclipse and Mozilla Firefox projects, respectively, as compared to the best performing individual classifier.

7 Threats to validity

Although the experiments were structured in such a way that there are few risks to validity, there are still a number of decisions that might impact the findings of this paper. The various threats to the validity of the reported work are examined in this section.

7.1 External validity

The generalizability of the generated outcomes is referred to as external validity. The bug reports included in this work's experimental assessments came from three open-source Bugzilla repository projects: NetBeans, Eclipse, and Mozilla Firefox. Bug reports from these projects may differ from those of other open-source and closed-source projects. As a result, the findings of the paper might not apply to other open-source and commercial software projects. Other open-source and closed-source projects, as well as those that employ other development approaches, will require further research. Although large open-source projects covering a wide range of topics have been investigated, there may be additional projects adopting diverse software practices. As a result, the conclusions may not generalise to every project.

7.2 Internal validity

The bias and mistakes in the experimental setting are referred to as internal validity. The data acquired from the bug repository was assumed to be ideal in this paper. However, there is a chance that the extracted data contains mistakes or noise, which might impact the paper's conclusions. To counteract this risk, the dataset included in this work has been drawn from widely used projects of the Bugzilla repository. These projects have a lengthy history and are continuously maintained; therefore, the retrieved data should be considered acceptable (if not optimal). The input parameters used to train the diverse machine learning models pose another threat to internal validity. In this paper, nine bug parameters were employed as input features for model training (eight numerical and one textual parameter). However, there may be an alternative collection of attributes that performs better for predicting NR fixability. A list of stop words offered by the Python NLTK toolkit was utilised to pre-process textual contents (http://www.nltk.org/). The Porter Stemmer tool from the Python NLTK toolkit was used to stem textual contents. This toolkit has been frequently utilised in the literature for comparable procedures. Other stop word lists and stemming tools, however, may have an impact on prediction accuracy. To reduce the risk of code and experimental setup problems, the source code and experimental setup have been double-checked; however, there is still a risk of mistakes. In the experiments, 10-fold cross validation was utilised to eliminate bias.

7.3 Construct validity

The experimental constructs or the adequacy of the assessment measures utilised in the study are referred to as construct validity. The F1-Score was reported in this paper. This measure has been extensively used in the literature to assess the performance of machine learning classifiers; hence, there is little concern about construct validity in this paper.

8 Conclusion

Bug management is an onerous task for software engineers due to the unpredictable nature of bug fixes. The complexity of this task is exacerbated by non-reproducible faults. To address non-reproducible bugs, a novel fixability prediction framework named NRPredictor is proposed in this paper. Thirteen traditional machine learning classifiers along with three ensemble learning approaches (Bagging, Boosting, and Stacking) and one feature selection technique have been leveraged in NRPredictor. The experimental evaluation shows that the traditional machine learning algorithm PART scored the greatest F1-Scores of 81.3%, 82.8% and 85.6% on the NetBeans, Eclipse, and Mozilla Firefox projects, respectively. The ensemble learning techniques outperform the traditional machine learning approaches, achieving F1-Scores of up to 86.1%, 85.1%, and 87.5% for the NetBeans, Eclipse, and Mozilla Firefox projects, respectively. Feature selection combined with ensemble learning techniques achieves F1-Scores of up to 87.4%, 87.8%, and 88.3% for the NetBeans, Eclipse, and Mozilla Firefox projects, respectively.

9 Future research directions

The performance of the proposed framework, NRPredictor, may be investigated in future on closed-source applications. Collaboration with firms that use open-source and closed-source bug repositories to analyse the proposed framework in an industrial context is also possible; this would aid in further generalisation of the findings. Text mining techniques such as topic modelling may also be integrated into the framework. In addition, a fix recommendation tool may be developed, which provides tokens for non-reproducible issues that might be solved. Another area of future study is the creation of a tool for software developers to aid in the prediction of NR bugs. Although the present study aids in the prediction of difficult-to-reproduce bugs that can be labelled as NR, NR issues continue to pose a significant barrier to the bug-fixing process; as a result, new ways for resolving NR-marked bug reports can be created.