Abstract
Reduction of child mortality and improving child health are health priorities in underdeveloped and developing countries. Afghanistan has a high rank of child mortality. Therefore, the ability to predict child mortality is beneficial and a good preventive measure. This study aims to develop a child mortality predictive model by utilizing data mining classification algorithms and identify the most suitable classifier among the five popular machine learning techniques. These are K-Nearest Neighbors (K-NN), Naïve Bayes (NB), Neural Network (NN), Random Forest (RF), and Support Vector Machine (SVM). The dataset used is the Afghanistan Demographic and Health Survey (AfDHS). The well-known Correlation-based Feature Selection (CFS) algorithm is employed to select the top 13 attributes during the data preprocessing. The classification in this study comprises two categories, ‘Alive’ and ‘Dead’. Preparation of the dataset is carefully done to ensure well-balanced samples in each category. The validation of the predictive models is assessed by means of Accuracy, Precision, Recall, and Area Under the Receiver Operating Characteristic Curve (AUC). The study reveals that Random Forest is the best classifier. The result obtained from this study can be beneficial to child health improvement programs in Afghanistan as well as in policy-making, especially when resources are limited.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Prammer E, Martinuzzi A (2013) The millennium development goals (MDGs) and the post-2015 debate. Eur Sustain Dev Netw
You D, Hug L, Ejdemyr S, Idele P, Hogan D, Mathers C et al (2015) Global, regional, and national levels and trends in under-5 mortality between 1990 and 2015, with scenario-based projections to 2030: a systematic analysis by the UN inter-agency group for child mortality estimation. Lancet 386(10010):2275–2286. https://doi.org/10.1016/S0140-6736(15)00120-8
UN IGME Homepage, https://childmortality.org/wpcontent/uploads/2020/09/UNICEF-2020-Child-Mortality-Report.pdf. Accessed 10 Sept 2020
DHS Program Homepage, http://dhsprogram.com/pubs/pdf/FR323/FR323.pdf. Accessed 18 Oct 2020
Martin, dpicampaigns: Health. http://www.un.org/sustainabledevelopment/health/. Accessed 08 Oct 2020
Rasooly MH, Govindasamy P, Aqil A, Rutstein S, Arnold F, Noormal B, Way A, Brock S, Shadoul A (2014) Success in reducing maternal and child mortality in Afghanistan. Glob Public Health 9(sup1):S29-42. https://doi.org/10.1080/17441692.2013.827733
Delen D, Walker G, Kadam A (2005) Predicting breast cancer survivability: a comparison of three data mining methods. Artif Intell Med 34(2):113–127. https://doi.org/10.1016/j.artmed.2004.07.002
Ngo T (2011) Data mining: Practical machine learning tools and technique, third edition by ian h. witten, eibe frank, mark a. hell. Softw Eng Notes 36:51–52
Umadevi S, Marseline KSJ (2017) A survey on data mining classification algorithms. In: proceedings of 2017 International conference on signal processing and communication (ICSPC). pp 264–268. IEEE. https://doi.org/10.1109/CSPC.2017.8305851
Shah C, Jivani AG (2013) Comparison of data mining classification algorithms for breast cancer prediction. In: proceedings of 2013 Fourth international conference on computing, communications and networking technologies (ICCCNT). pp 1–4. IEEE. https://doi.org/10.1109/ICCCNT.2013.6726477
Han J et al (2014) Data mining: concepts and techniques. Morgan Kaufmann
Senthilkumar D, Paulraj S (2015) Prediction of low birth weight infants and its risk factors using data mining techniques. In: proceedings of the 2015 international conference on industrial engineering and operations management. pp 186–194
Lavangnananda K, Sawasdimongkol P (2012) Neural network classifier of time series: A case study of symbolic representation preprocessing for Control Chart Patterns. In: Proceedings of 2012 8th International conference on natural computation. pp 344–349. IEEE. https://doi.org/10.1109/ICNC.2012.6234651
Belgiu M, Drăguţ L (2016) Random forest in remote sensing: a review of applications and future directions. ISPRS J Photogramm Remote Sens 114:24–31
Ahmed Z, Kamal A, Kamal A (2016) Statistical analysis of factors affecting child mortality in Pakistan. J Coll Physicians Surg Pak 26(6):543–544
Al Kibria GM, Burrowes V, Choudhury A, Sharmeen A, Ghosh S, Mahmud A, Angela KC (2018) Determinants of early neonatal mortality in Afghanistan: an analysis of the demographic and health survey 2015. Glob Health 14 (1). https://doi.org/10.1186/s12992-018-0363-8
Tesfaye B, Atique S, Elias N, Dibaba L, Shabbir SA, Kebede M (2017) Determinants and development of a web-based child mortality prediction model in resource-limited settings: a data mining approach. Comput Methods Programs Biomed 140:45–51. https://doi.org/10.1016/j.cmpb.2016.11.013
Gawande R, Indulkar S, Keswani H, Khatri M, Saindane P (2019) Analysis and prediction of child mortality in India
Momand Z, Mongkolnam P, Kositpanthavong P, Chan JH (2020) Data mining based prediction of malnutrition in afghan children. In: Proceedings of 2020 12th International conference on knowledge and smart technology (KST). pp 12–17. IEEE. https://doi.org/10.1109/KST48564.2020.9059388
Alemu A, Berhanu Y (2018) Assessment of breastfeeding practices in Ethiopia using different data mining techniques
Hall MA Correlation-based feature selection for machine learning. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.455.4521&rep=rep1&type=pdf. Accessed 10 Apr 2020
Lavangnananda K, Chattanachot S (2017) Study of discretization methods in classification. In: Proceedings of 2017 9th International conference on knowledge and smart technology (KST). pp 50–55. IEEE. https://doi.org/10.1109/KST.2017.7886082
Shah P An introduction to weka. https://opensourceforu.com/2017/01/an-introduction-to-weka/., last accessed 2020/08/20
Hossin M, Sulaiman MN (2015) A review on evaluation metrics for data classification evaluations. Int J Data Min Knowl Manag Process 5(2):01–11
Gebreyohans G, Gandhi N (2018) Analyzing children’s data using machine learning: a case study in Ethiopia. Int J Comput Inf Syst Ind Manag Appl 10:154–163
Acknowledgements
The authors are grateful to the Higher Education Development Program (HEDP), Ministry of Higher Education of Afghanistan, and King Mongkut’s University of Technology Thonburi (KMUTT), Thailand for the Scholarship of Mr. Nasratullah Nasrat. The provision of computing facilities at the School of Information Technology, KMUTT, is gratefully acknowledged. Special gratitude is extended to the Demographic and Health Survey (DHS) program, Ministry of Public Health of Afghanistan, for granting the dataset used. Last but not least, the moral support from Dr. Patcharaporn Lavangnananda during the preparation of this manuscript is much appreciated.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Nasrat, N., Lavangnananda, K. (2022). Implementing Predictive Model for Child Mortality in Afghanistan. In: Yang, XS., Sherratt, S., Dey, N., Joshi, A. (eds) Proceedings of Sixth International Congress on Information and Communication Technology. Lecture Notes in Networks and Systems, vol 217. Springer, Singapore. https://doi.org/10.1007/978-981-16-2102-4_31
Download citation
DOI: https://doi.org/10.1007/978-981-16-2102-4_31
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-2101-7
Online ISBN: 978-981-16-2102-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)