Abstract
Project management is an essential step for the success of any software project. One of the most significant tasks in software project management is estimating the cost and effort of software development at the start of the project. The primary purpose of this research is to study the impact of tuning the base learner hyperparameter with different machine learning/ensemble models to improve estimator accuracy. We used random forest, support vector regression, and elastic net as the base learners. In this study, Albrecht, Desharnais, and China datasets were used for experimentation. We also performed feature selection and considered only those features that have strong correlation with target feature, i.e., effort. The mean magnitude relative error (MMRE) and PRED(25) results demonstrate that utilizing elastic net as the base learner for AdaBoost outperforms the other models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Abdelali Z, Mustapha H, Abdelwahed N (2019) Investigating the use of random forest in software effort estimation. Procedia Comput Sci 148:343–352. https://doi.org/10.1016/j.procs.2019.01.042
Pospieszny P, Czarnacka-Chrobot B, Kobylinski A (2018) An effective approach for software project effort and duration estimation with machine learning algorithms. J Syst Softw 137:184–196. https://doi.org/10.1016/j.jss.2017.11.066
Rijwani P, Jain S (2016) Enhanced software effort estimation using multi layered feed forward artificial neural network technique. Procedia Comput Sci 89:307–312. https://doi.org/10.1016/j.procs.2016.06.073
MacDonell SG, Shepperd MJ (2003) Combining techniques to optimize effort predictions in software project management. J Syst Softw 66:91–98. https://doi.org/10.1016/S0164-1212(02)00067-5
Idri A, Hosni M, Abran A (2016) Systematic literature review of ensemble effort estimation. J Syst Softw 118:151–175. https://doi.org/10.1016/j.jss.2016.05.016
Sharma M, Fotedar N (2014) Software effort estimation with data mining techniques—a review
Suherman IC, Sarno R (2020) Sholiq: implementation of random forest regression for COCOMO II effort estimation. In: 2020 international seminar on application for technology of information and communication (iSemantic), pp 476–481. https://doi.org/10.1109/iSemantic50169.2020.9234269
Dejaeger K, Verbeke W, Martens D, Baesens B (2012) Data mining techniques for software effort estimation: a comparative study. IEEE Trans Softw Eng 38:375–397. https://doi.org/10.1109/TSE.2011.55
Promise Datasets Page. http://promise.site.uottawa.ca/SERepository/datasets-page.html. Accessed 28 May 2022
Effort Estimation: Albrecht | Zenodo. https://zenodo.org/record/268467#.YpIzBqhBy5c. Accessed 28 May 2022
Yun FH (2010) China: effort estimation dataset. https://zenodo.org/record/268446. https://doi.org/10.5281/zenodo.268446
Corazza A, Di Martino S, Ferrucci F, Gravino C, Sarro F, Mendes E (2010) How effective is Tabu search to configure support vector regression for effort estimation? In: Proceedings of the 6th international conference on predictive models in software engineering. Association for Computing Machinery, New York, NY, USA, pp 1–10. https://doi.org/10.1145/1868328.1868335
Sharp T. An introduction to support vector regression (SVR). https://towardsdatascience.com/an-introduction-to-support-vector-regression-svr-a3ebc1672c2. Accessed 28 May 2022
Bhattacharyya S. Ridge and Lasso regression: L1 and L2 regularization. https://towardsdatascience.com/ridge-and-lasso-regression-a-complete-guide-with-python-scikit-learn-e20e34bcbf0b. Accessed 28 May 2022
Aljamaan HI, Elish MO (2009) An empirical study of bagging and boosting ensembles for identifying faulty classes in object-oriented software. In: 2009 IEEE symposium on computational intelligence and data mining, pp 187–194. https://doi.org/10.1109/CIDM.2009.4938648
Solomatine DP, Shrestha DL (2004) AdaBoost.RT: a boosting algorithm for regression problems. In: 2004 IEEE international joint conference on neural networks (IEEE Cat. No. 04CH37541), vol 2, pp 1163–1168. https://doi.org/10.1109/IJCNN.2004.1380102
Kocaguneli E, Menzies T, Keung JW (2012) On the value of ensemble effort estimation. IEEE Trans Software Eng 38:1403–1416. https://doi.org/10.1109/TSE.2011.111
Anandhi V, Chezian RM (2014) Regression techniques in software effort estimation using COCOMO dataset. In: 2014 international conference on intelligent computing applications, pp 353–357. https://doi.org/10.1109/ICICA.2014.79
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Jawa, M., Meena, S. (2023). Comparative Analysis of Ensemble Models for Software Effort Estimation. In: Tiwari, S., Trivedi, M.C., Kolhe, M.L., Singh, B.K. (eds) Advances in Data and Information Sciences. Lecture Notes in Networks and Systems, vol 522. Springer, Singapore. https://doi.org/10.1007/978-981-19-5292-0_5
Download citation
DOI: https://doi.org/10.1007/978-981-19-5292-0_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-5291-3
Online ISBN: 978-981-19-5292-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)