Abstract
Classification is a predictive data mining task that now plays a pivotal role in medical diagnosis, particularly in early disease prediction. The aim of applying classification techniques to diseases such as cancer, diabetes, and kidney infections is not to undermine the doctor's decision; rather, classifier outcomes may support the choice of correct treatment. For doctors to trust them, classifiers developed for medical diagnosis should be validated on reliable results. In this work, the authors assess the classification accuracy of different classifiers on datasets taken from the UCI repository, using cross-validation. The classifiers used in the experiments are mainly SVM, logistic regression, multilayer perceptron, Naïve Bayes, fuzzy logic, k-nearest neighbours, random forest, and J48. Performance is measured in WEKA using accuracy, ROC curve, kappa statistic, MAE, RMSE, and model building time. The datasets chosen relate to liver disease, heart disease, and diabetes, which are among the most widespread life-threatening diseases. Experimental results show that random forest demonstrated the best classification and prediction capability among the classifiers on the chosen datasets.
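As an illustration of four of the performance measures named in the abstract (accuracy, kappa statistic, MAE, and RMSE), the following is a minimal sketch in plain Python. It is not the authors' WEKA setup: the function names, the toy labels, and the scores are hypothetical, and MAE and RMSE are computed here on the predicted probability of the positive class only, which is a simplification of an evaluation over full class probability distributions.

```python
import math
from collections import Counter


def accuracy(actual, predicted):
    """Fraction of instances whose predicted label equals the actual label."""
    return sum(a == p for a, p in zip(actual, predicted)) / len(actual)


def cohen_kappa(actual, predicted):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(actual)
    p_o = accuracy(actual, predicted)
    actual_counts = Counter(actual)
    predicted_counts = Counter(predicted)
    # Chance agreement: sum over classes of the product of marginal frequencies.
    p_e = sum(actual_counts[c] * predicted_counts[c] for c in actual_counts) / (n * n)
    if p_e == 1.0:
        # Degenerate case (a single class everywhere): treat as perfect agreement.
        return 1.0
    return (p_o - p_e) / (1.0 - p_e)


def mae(targets, scores):
    """Mean absolute error between 0/1 targets and predicted probabilities."""
    return sum(abs(t - s) for t, s in zip(targets, scores)) / len(targets)


def rmse(targets, scores):
    """Root mean squared error between 0/1 targets and predicted probabilities."""
    return math.sqrt(sum((t - s) ** 2 for t, s in zip(targets, scores)) / len(targets))


# Hypothetical toy data: 1 = disease present, 0 = disease absent.
actual = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
# Hypothetical predicted probabilities of the positive class.
scores = [0.9, 0.8, 0.6, 0.4, 0.3, 0.2, 0.1, 0.7, 0.2, 0.1]
# Hard labels obtained with an assumed 0.5 decision threshold.
predicted = [1 if s >= 0.5 else 0 for s in scores]

print("accuracy:", accuracy(actual, predicted))
print("kappa:", round(cohen_kappa(actual, predicted), 4))
print("MAE:", round(mae(actual, scores), 4))
print("RMSE:", round(rmse(actual, scores), 4))
```

Kappa is lower than accuracy on the same predictions because it discounts the agreement expected by chance given the class proportions, which is why it is often reported alongside accuracy on imbalanced clinical datasets.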
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Khatri, S., Kumar, N., Arora, D. (2021). Empirical Classification Accuracy Assessment of Various Classifiers for Clinical Diagnosis Datasets. In: Singh, V., Asari, V.K., Kumar, S., Patel, R.B. (eds) Computational Methods and Data Engineering. Advances in Intelligent Systems and Computing, vol 1257. Springer, Singapore. https://doi.org/10.1007/978-981-15-7907-3_29
DOI: https://doi.org/10.1007/978-981-15-7907-3_29
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-7906-6
Online ISBN: 978-981-15-7907-3
eBook Packages: Intelligent Technologies and Robotics (R0)