Hyperparameter Tuning in Random Forest and Neural Network Classification: An Application to Predict Health Expenditure Per Capita

Conference paper in: Data Intelligence and Cognitive Informatics

Part of the book series: Algorithms for Intelligent Systems (AIS)

Abstract

There is a lack of literature on how much hyperparameter tuning improves classification performance when predicting health expenditure per capita (HE). In this study, the effect of hyperparameter tuning on the classification performance of random forest (RF) and neural network (NN) classifiers is compared for grouping World Bank (WB) member countries in terms of HE. Data were gathered from 188 WB member countries for the year 2019. GDP per capita, mortality, life expectancy at birth, and the share of the population aged 65 years and over are used as predictors. The number of trees (RF) and the number of neurons in the hidden layer (NN) are varied from 5 to 100, while the k-fold cross-validation parameter is varied from 2 to 20. The dependent HE variable is transformed into binary categories, and the categories are well balanced (50%–50%). Classification performance is good for both learning techniques (AUC > 0.95), with RF (AUC = 0.9609) slightly superior to NN (AUC = 0.9596) in terms of average AUC over the hyperparameter-tuning grid.
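
The experimental design described above can be sketched as a small grid search. The following scikit-learn snippet is an illustrative reconstruction, not the authors' code: the synthetic dataset (`make_classification` standing in for the 188-country WB data), the specific grid points, and the random seeds are all assumptions; only the overall shape of the experiment (vary model size 5–100 and folds 2–20, compare mean ROC-AUC of RF vs. NN) follows the abstract.

```python
# Illustrative sketch of the tuning grid: vary model size and k-fold
# parameter, compare RF vs. NN by mean cross-validated AUC.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.model_selection import cross_val_score

# Hypothetical stand-in for the WB data: 188 countries, 4 predictors
# (GDP per capita, mortality, life expectancy, population 65+),
# balanced binary HE target as in the paper.
X, y = make_classification(n_samples=188, n_features=4, n_informative=4,
                           n_redundant=0, weights=[0.5, 0.5], random_state=0)

results = {}
for size in [5, 100]:        # number of trees (RF) / hidden neurons (NN)
    for k in [2, 20]:        # k-fold cross-validation parameter
        rf = RandomForestClassifier(n_estimators=size, random_state=0)
        nn = MLPClassifier(hidden_layer_sizes=(size,), max_iter=2000,
                           random_state=0)
        results[(size, k)] = (
            cross_val_score(rf, X, y, cv=k, scoring="roc_auc").mean(),
            cross_val_score(nn, X, y, cv=k, scoring="roc_auc").mean(),
        )

# Average AUC over the whole grid, as reported in the abstract.
rf_avg = float(np.mean([v[0] for v in results.values()]))
nn_avg = float(np.mean([v[1] for v in results.values()]))
print(f"average AUC  RF={rf_avg:.3f}  NN={nn_avg:.3f}")
```

In the paper the grid is denser (sizes 5 to 100, k from 2 to 20); the two values per axis here keep the sketch fast while preserving the structure of the comparison.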



Author information

Correspondence to Gulcin Caliskan.


Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper


Cite this paper

Caliskan, G., Cinaroglu, S. (2023). Hyperparameter Tuning in Random Forest and Neural Network Classification: An Application to Predict Health Expenditure Per Capita. In: Jacob, I.J., Kolandapalayam Shanmugam, S., Izonin, I. (eds) Data Intelligence and Cognitive Informatics. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-19-6004-8_62
