Skip to main content

Empirical Classification Accuracy Assessment of Various Classifiers for Clinical Diagnosis Datasets

  • Conference paper
  • First Online:
Computational Methods and Data Engineering

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1257))

Abstract

Classification is a predictive data mining task. Nowadays, it is also playing a pivotal role in the field of medical diagnostic towards early disease predictions. The aim of applying different classification techniques in diseases like cancer, diabetes, kidney infections, etc., is not to undermine the decision of doctor, but the outcomes determined from the classifiers might augment the correct treatment initiatives. The classifiers developed for medical diagnosis should be validated on reliable results to be trustworthy by doctors. In this research work, the authors attempted to assess the classification accuracy of different classifiers on datasets taken from UCI with cross-validation. Majorly, SVM, logistic regression, ML perceptron, Naïve Bayes, fuzzy logic, k-nearest neighbours, random forest, and J48 are used for experimentation purposes. The performance measures like accuracy, RO curve, kappa statistics, MAE, RMSE, and model building time are used on WEKA. The authors have chosen datasets specifically related to liver, heart, and diabetes among widely spread most life-threatening diseases. Experimental results show that random forest demonstrated the best classification and prediction capability over other classifiers and chosen datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Inza I, Merino M, Quiroga J, Larranaga P (2005) Feature selection in bayesian classifier for the prognosis of survival of cirrhotic patients treated with tips. J Biomed Inf 38(5):376–388

    Google Scholar 

  2. Mythili T, Mukherji D, Padalia N, Naidu A (2013) A heart disease prediction model using SVM, decision tree, logistic regression. Int J Comput Appl 68(16):11–15

    Google Scholar 

  3. Akhiljabbar M, Deekshatula BL, Chandra P (2013) Classification of heart disease using k-nearest neighbor and genetic algorithm. In: First international conference on computational intelligence: modeling techniques and applications, vol 10, pp 85–94

    Google Scholar 

  4. Ahmad F, Isa NA, Hussain Z, Osman MK (2013) Intelligent medical disease diagnosis using improved hybrid genetic algorithm—multilayer perceptron network. J Med Syst 37(2):9934

    Google Scholar 

  5. Sug H (2012) Better decision tree induction for limited data sets of liver disease. In: International conference on future generation information conference, FGIT 2012. Gangneug, pp 88–93

    Google Scholar 

  6. Gulia A, Vohra R, Rani P (2014) Liver patient classification using intelligent techniques. Int J Comput Sci Inf Technol 5(4):5110–5115

    Google Scholar 

  7. Srikanth P, Deverapalli D (2016) A critical study of classification algorithms using diabetes diagnosis. In: proc. of IEEE 6th International Conference on Advanced Computing (IACC), Bhimavaram, 245–249

    Google Scholar 

  8. Jabbar MA, Deekshatulu BL, Chandra P (2012) An evolutionary algorithm for heart disease prediction. In: 6th international conference on information processing. ICIP 2012, pp 378–389

    Google Scholar 

  9. Anooj PK (2012) Clinical decision support system: risk level prediction of heart disease using weighted fuzzy rules. J King Saud Univ Comput Inf Sci 24(1):27–40

    Google Scholar 

  10. Liang C, Peng L (2013) An automated diagnosis system of liver disease using artificial immune and genetic algorithms. J Med Syst 37(2):9932

    Google Scholar 

  11. Priya GN, Kannan A, Anandhakumar P (2013) An efficient classification analysis for multivariate coronary artery disease data patterns using distinguished classifier techniques. In: Fourth international conference on signal and image processing, vol 2, pp 385–394

    Google Scholar 

  12. Kumar S, Sahoo G (2014) Classification of heart using Naïve Bayes and genetic algorithm. In: International conference on CIDM, vol 2, pp 269–282

    Google Scholar 

  13. Bhuvaneswari C, Aruna P, Loganathan D (2014) A new fusion model for classification of the lung disease using genetic algorithm. Egypt Inf J 15:69–77

    Google Scholar 

  14. Bajaj P, Choudhary K, Chauhan R (2015) Prediction of occurrence of heart disease and its dependability on RCT using data mining techniques. In: Second international conference India 2015, vol 2, pp 851–858

    Google Scholar 

  15. Saravana Kumar NM, Eswari T, Sampath P, Lavanya S (2015) Predictive methodology for diabetic data analysis in big data. In: 2nd international symposium on big data and cloud computing, vol 50, pp 203–208

    Google Scholar 

  16. Pawar S, Sikchi S (2016) An extensive survey on diagnosis of diabetes mellitus in healthcare. In: International conference on data engineering and communication and technology, vol 1, pp 97–104

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sabita Khatri .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Khatri, S., Kumar, N., Arora, D. (2021). Empirical Classification Accuracy Assessment of Various Classifiers for Clinical Diagnosis Datasets. In: Singh, V., Asari, V.K., Kumar, S., Patel, R.B. (eds) Computational Methods and Data Engineering. Advances in Intelligent Systems and Computing, vol 1257. Springer, Singapore. https://doi.org/10.1007/978-981-15-7907-3_29

Download citation

Publish with us

Policies and ethics