Abstract
Is there a statistical difference between Naive Bayes and Random Forest in terms of recall, f-measure, and precision for predicting software defects? By utilizing systematic literature review and meta-analysis, we are answering this question. We conducted a systematic literature review by establishing criteria to search and choose papers, resulting in five studies. After that, using the meta-data and forest-plots of five chosen papers, we conducted a meta-analysis to compare the two models. The results have shown that there is no significant statistical evidence that Naive Bayes perform differently from Random Forest in terms of recall, f-measure, and precision.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ali, U., Iqbal, A., Aftab, S.: Performance analysis of machine learning techniques on software defect prediction using NASA datasets (2019)
Agee, J.: Developing qualitative research questions: a reflective process. Int. J. Qual. Stud. Educ. (2009)
Aleem, S., Capretz, L.F., Ahmed, F.: Benchmarking machine learning technologies for software defect detection (2015)
Borenstein, M., et al.: Identifying and quantifying heterogeneity. In: Introduction to Meta-analysis (Chap. 16), pp. 107–125 (2009). ISBN 9780470743386
Challagulla, V.U.B.: Empirical assessment of machine learning based software defect prediction techniques. ACP J. Club. (2005)
Petric, J., Bowes, D., Hall, T.: Software defect prediction: do different classifiers find the same defects? Softw. Qual. J. (2018)
DerSimonian, R., Laird, N.: Meta-analysis in clinical trials. Controlled Clin. Trials (1986)
Devnani-Chulani, S.: Modeling software defect introduction. In: Proceedings of California Software Symposium (1998)
Bing, L., Peng, H.: An empirical study on software defect prediction with a simplified metric set. Inf. Softw. Technol. (2015)
Akour, M., Alazzam, I., Alsmadi, I.: Software fault proneness prediction: a comparative study between bagging, boosting, and stacking ensemble and base learner methods (2017)
Jacob, S.G., et al.: Improved random forest algorithm for software defect prediction through data mining techniques. IJCA (2015)
Kumar, D., Mishra, K.K.: The impacts of test automation on software’s cost, quality and time to market (2016)
Leandro, G.: Meta-analysis in medical research: The handbook for the understanding and practice of meta-analysis. ACP J. Club. (2008)
Lewis, W.E.: Software Testing and Continuous Quality Improvement. CRC Press, Boca Raton (2017)
Li, M., et al.: Sample-based software defect prediction with active and semi-supervised learning. Autom. Softw. Eng. (2012)
Li, Z., Jing, X.-Y., Zhu, X.: Progress on approaches to software defect prediction. IET Softw. (2018)
Roberts, H., Petticrew, M.: Systematic reviews in the social sciences: a practical guide. ACP J. Club. (2008)
Jain, S., Kakkar, M.: Is open-source software valuable for software defect prediction of proprietary software and vice versa? (2018)
Bowes, D., Shepperd, M., Hall, T.: Researcher bias: the use of machine learning in software defect prediction (2016)
Miller, J.: Applying meta-analytical procedures to software engineering experiments. J. Syst. Softw. (2000)
Dubey, R., Khakhar, P.: The integrity of machine learning algorithms against software defect prediction (2020)
Soe, Y.N., Oo, K.K.: A comparison of Naıve Bayes and random forest for software defect prediction. In: ICCA (2018)
Khari, M., Son, L.H., Pritam, N.: Empirical study of software defect prediction: a systematic mapping. Symmetry (2019)
University of York: CRD’s guidance for undertaking reviews in health care. In: Introduction to Meta-analysis (Chap. 1), pp. 54–55 (2009)
Zaugg, H., et al.: Mendeley: creating communities of scholarly inquiry through research collaboration. TechTrends 55, 32–36 (2011)
Zuse, H.: Software Complexity: Measures and Methods. Walter de Gruyter GmbH & Co KG, Munich (2019)
Acknowledgements
This research is supported by the Russian Science Foundation, Grant No. 22-21-00494.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Awais, C.M., Gu, W., Dlamini, G., Kholmatova, Z., Succi, G. (2023). A Meta-analytical Comparison of Naive Bayes and Random Forest for Software Defect Prediction. In: Abraham, A., Pllana, S., Casalino, G., Ma, K., Bajaj, A. (eds) Intelligent Systems Design and Applications. ISDA 2022. Lecture Notes in Networks and Systems, vol 716. Springer, Cham. https://doi.org/10.1007/978-3-031-35501-1_14
Download citation
DOI: https://doi.org/10.1007/978-3-031-35501-1_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-35500-4
Online ISBN: 978-3-031-35501-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)