Skip to main content

Comparative Analysis of Ensemble Models for Software Effort Estimation

  • Conference paper
  • First Online:
Advances in Data and Information Sciences

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 522))

  • 281 Accesses

Abstract

Project management is an essential step for the success of any software project. One of the most significant tasks in software project management is estimating the cost and effort of software development at the start of the project. The primary purpose of this research is to study the impact of tuning the base learner hyperparameter with different machine learning/ensemble models to improve estimator accuracy. We used random forest, support vector regression, and elastic net as the base learners. In this study, Albrecht, Desharnais, and China datasets were used for experimentation. We also performed feature selection and considered only those features that have strong correlation with target feature, i.e., effort. The mean magnitude relative error (MMRE) and PRED(25) results demonstrate that utilizing elastic net as the base learner for AdaBoost outperforms the other models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Abdelali Z, Mustapha H, Abdelwahed N (2019) Investigating the use of random forest in software effort estimation. Procedia Comput Sci 148:343–352. https://doi.org/10.1016/j.procs.2019.01.042

    Article  Google Scholar 

  2. Pospieszny P, Czarnacka-Chrobot B, Kobylinski A (2018) An effective approach for software project effort and duration estimation with machine learning algorithms. J Syst Softw 137:184–196. https://doi.org/10.1016/j.jss.2017.11.066

    Article  Google Scholar 

  3. Rijwani P, Jain S (2016) Enhanced software effort estimation using multi layered feed forward artificial neural network technique. Procedia Comput Sci 89:307–312. https://doi.org/10.1016/j.procs.2016.06.073

    Article  Google Scholar 

  4. MacDonell SG, Shepperd MJ (2003) Combining techniques to optimize effort predictions in software project management. J Syst Softw 66:91–98. https://doi.org/10.1016/S0164-1212(02)00067-5

    Article  Google Scholar 

  5. Idri A, Hosni M, Abran A (2016) Systematic literature review of ensemble effort estimation. J Syst Softw 118:151–175. https://doi.org/10.1016/j.jss.2016.05.016

    Article  Google Scholar 

  6. Sharma M, Fotedar N (2014) Software effort estimation with data mining techniques—a review

    Google Scholar 

  7. Suherman IC, Sarno R (2020) Sholiq: implementation of random forest regression for COCOMO II effort estimation. In: 2020 international seminar on application for technology of information and communication (iSemantic), pp 476–481. https://doi.org/10.1109/iSemantic50169.2020.9234269

  8. Dejaeger K, Verbeke W, Martens D, Baesens B (2012) Data mining techniques for software effort estimation: a comparative study. IEEE Trans Softw Eng 38:375–397. https://doi.org/10.1109/TSE.2011.55

    Article  Google Scholar 

  9. Promise Datasets Page. http://promise.site.uottawa.ca/SERepository/datasets-page.html. Accessed 28 May 2022

  10. Effort Estimation: Albrecht | Zenodo. https://zenodo.org/record/268467#.YpIzBqhBy5c. Accessed 28 May 2022

  11. Yun FH (2010) China: effort estimation dataset. https://zenodo.org/record/268446. https://doi.org/10.5281/zenodo.268446

  12. Corazza A, Di Martino S, Ferrucci F, Gravino C, Sarro F, Mendes E (2010) How effective is Tabu search to configure support vector regression for effort estimation? In: Proceedings of the 6th international conference on predictive models in software engineering. Association for Computing Machinery, New York, NY, USA, pp 1–10. https://doi.org/10.1145/1868328.1868335

  13. Sharp T. An introduction to support vector regression (SVR). https://towardsdatascience.com/an-introduction-to-support-vector-regression-svr-a3ebc1672c2. Accessed 28 May 2022

  14. Bhattacharyya S. Ridge and Lasso regression: L1 and L2 regularization. https://towardsdatascience.com/ridge-and-lasso-regression-a-complete-guide-with-python-scikit-learn-e20e34bcbf0b. Accessed 28 May 2022

  15. Aljamaan HI, Elish MO (2009) An empirical study of bagging and boosting ensembles for identifying faulty classes in object-oriented software. In: 2009 IEEE symposium on computational intelligence and data mining, pp 187–194. https://doi.org/10.1109/CIDM.2009.4938648

  16. Solomatine DP, Shrestha DL (2004) AdaBoost.RT: a boosting algorithm for regression problems. In: 2004 IEEE international joint conference on neural networks (IEEE Cat. No. 04CH37541), vol 2, pp 1163–1168. https://doi.org/10.1109/IJCNN.2004.1380102

  17. Kocaguneli E, Menzies T, Keung JW (2012) On the value of ensemble effort estimation. IEEE Trans Software Eng 38:1403–1416. https://doi.org/10.1109/TSE.2011.111

    Article  Google Scholar 

  18. Anandhi V, Chezian RM (2014) Regression techniques in software effort estimation using COCOMO dataset. In: 2014 international conference on intelligent computing applications, pp 353–357. https://doi.org/10.1109/ICICA.2014.79

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shweta Meena .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Jawa, M., Meena, S. (2023). Comparative Analysis of Ensemble Models for Software Effort Estimation. In: Tiwari, S., Trivedi, M.C., Kolhe, M.L., Singh, B.K. (eds) Advances in Data and Information Sciences. Lecture Notes in Networks and Systems, vol 522. Springer, Singapore. https://doi.org/10.1007/978-981-19-5292-0_5

Download citation

Publish with us

Policies and ethics