Skip to main content

A Meta-analytical Comparison of Naive Bayes and Random Forest for Software Defect Prediction

  • Conference paper
  • First Online:
Intelligent Systems Design and Applications (ISDA 2022)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 716))

  • 314 Accesses

Abstract

Is there a statistical difference between Naive Bayes and Random Forest in terms of recall, f-measure, and precision for predicting software defects? By utilizing systematic literature review and meta-analysis, we are answering this question. We conducted a systematic literature review by establishing criteria to search and choose papers, resulting in five studies. After that, using the meta-data and forest-plots of five chosen papers, we conducted a meta-analysis to compare the two models. The results have shown that there is no significant statistical evidence that Naive Bayes perform differently from Random Forest in terms of recall, f-measure, and precision.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 279.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://github.com/Muhammad0Awais/SDP_NB_RF_meta_analysis.

References

  1. Ali, U., Iqbal, A., Aftab, S.: Performance analysis of machine learning techniques on software defect prediction using NASA datasets (2019)

    Google Scholar 

  2. Agee, J.: Developing qualitative research questions: a reflective process. Int. J. Qual. Stud. Educ. (2009)

    Google Scholar 

  3. Aleem, S., Capretz, L.F., Ahmed, F.: Benchmarking machine learning technologies for software defect detection (2015)

    Google Scholar 

  4. Borenstein, M., et al.: Identifying and quantifying heterogeneity. In: Introduction to Meta-analysis (Chap. 16), pp. 107–125 (2009). ISBN 9780470743386

    Google Scholar 

  5. Challagulla, V.U.B.: Empirical assessment of machine learning based software defect prediction techniques. ACP J. Club. (2005)

    Google Scholar 

  6. Petric, J., Bowes, D., Hall, T.: Software defect prediction: do different classifiers find the same defects? Softw. Qual. J. (2018)

    Google Scholar 

  7. DerSimonian, R., Laird, N.: Meta-analysis in clinical trials. Controlled Clin. Trials (1986)

    Google Scholar 

  8. Devnani-Chulani, S.: Modeling software defect introduction. In: Proceedings of California Software Symposium (1998)

    Google Scholar 

  9. Bing, L., Peng, H.: An empirical study on software defect prediction with a simplified metric set. Inf. Softw. Technol. (2015)

    Google Scholar 

  10. Akour, M., Alazzam, I., Alsmadi, I.: Software fault proneness prediction: a comparative study between bagging, boosting, and stacking ensemble and base learner methods (2017)

    Google Scholar 

  11. Jacob, S.G., et al.: Improved random forest algorithm for software defect prediction through data mining techniques. IJCA (2015)

    Google Scholar 

  12. Kumar, D., Mishra, K.K.: The impacts of test automation on software’s cost, quality and time to market (2016)

    Google Scholar 

  13. Leandro, G.: Meta-analysis in medical research: The handbook for the understanding and practice of meta-analysis. ACP J. Club. (2008)

    Google Scholar 

  14. Lewis, W.E.: Software Testing and Continuous Quality Improvement. CRC Press, Boca Raton (2017)

    Google Scholar 

  15. Li, M., et al.: Sample-based software defect prediction with active and semi-supervised learning. Autom. Softw. Eng. (2012)

    Google Scholar 

  16. Li, Z., Jing, X.-Y., Zhu, X.: Progress on approaches to software defect prediction. IET Softw. (2018)

    Google Scholar 

  17. Roberts, H., Petticrew, M.: Systematic reviews in the social sciences: a practical guide. ACP J. Club. (2008)

    Google Scholar 

  18. Jain, S., Kakkar, M.: Is open-source software valuable for software defect prediction of proprietary software and vice versa? (2018)

    Google Scholar 

  19. Bowes, D., Shepperd, M., Hall, T.: Researcher bias: the use of machine learning in software defect prediction (2016)

    Google Scholar 

  20. Miller, J.: Applying meta-analytical procedures to software engineering experiments. J. Syst. Softw. (2000)

    Google Scholar 

  21. Dubey, R., Khakhar, P.: The integrity of machine learning algorithms against software defect prediction (2020)

    Google Scholar 

  22. Soe, Y.N., Oo, K.K.: A comparison of Naıve Bayes and random forest for software defect prediction. In: ICCA (2018)

    Google Scholar 

  23. Khari, M., Son, L.H., Pritam, N.: Empirical study of software defect prediction: a systematic mapping. Symmetry (2019)

    Google Scholar 

  24. University of York: CRD’s guidance for undertaking reviews in health care. In: Introduction to Meta-analysis (Chap. 1), pp. 54–55 (2009)

    Google Scholar 

  25. Zaugg, H., et al.: Mendeley: creating communities of scholarly inquiry through research collaboration. TechTrends 55, 32–36 (2011)

    Google Scholar 

  26. Zuse, H.: Software Complexity: Measures and Methods. Walter de Gruyter GmbH & Co KG, Munich (2019)

    Google Scholar 

Download references

Acknowledgements

This research is supported by the Russian Science Foundation, Grant No. 22-21-00494.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ch Muhammad Awais .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Awais, C.M., Gu, W., Dlamini, G., Kholmatova, Z., Succi, G. (2023). A Meta-analytical Comparison of Naive Bayes and Random Forest for Software Defect Prediction. In: Abraham, A., Pllana, S., Casalino, G., Ma, K., Bajaj, A. (eds) Intelligent Systems Design and Applications. ISDA 2022. Lecture Notes in Networks and Systems, vol 716. Springer, Cham. https://doi.org/10.1007/978-3-031-35501-1_14

Download citation

Publish with us

Policies and ethics