Skip to main content

Feature Selection and Prediction of Heart Disease Using Machine Learning Approaches

  • Conference paper
  • First Online:
Proceedings of the 6th International Conference on Electrical, Control and Computer Engineering

Abstract

Heart Disease (HD) is the world's most serious illness that seriously impacts human life. The heart does not push blood to other areas of the body in cardiac disease. For the prevention and treatment of cardiac failure, accurate and timely diagnosis of heart disease is critical. The diagnosis of cardiac disease has been considered via conventional medical history. Non-invasive approaches like machine learning are effective and powerful to categorize healthy people and people with heart disease. In the proposed research, by using the cardiovascular disease dataset, we created a machine-learning model to predict cardiac disease. In this paper, it is capable of recognizing and classifying the heart disease patient from healthy people by using three standard machine learning algorithms: Random Forest (RF), Support Vector Machine (SVM) and K-Nearest Neighbor (KNN). In addition, the Area Under Curve (AUC) value is calculated for each classification algorithms. In the proposed scheme, we also used the feature selection algorithm to reduce dimensions over a qualified heart disease dataset. After that, the whole structure for the classification of heart disease has been created. On complete features and reduced features, the performance of the proposed approach has been verified. The decrease in features affects the accuracy and time of execution of the classifiers. With the selected features, the highest classification accuracy is obtained for the KNN algorithm is about 93%, with a sensitivity is 0.9750 and specificity is 0.8529. Therefore, with the complete features, the classification accuracy is about 91%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Bui AL, Horwich TB, Fonarow GC (2011) Epidemiology and risk profile of heart failure. Nat Rev Cardiol 8(1):30–41

    Article  Google Scholar 

  2. Durairaj M, Ramasamy N (2016) A comparison of the perceptive approaches for preprocessing the data set for predicting fertility success rate. Int J Control Theory Appl 9:256–260

    Google Scholar 

  3. López-Sendón J (2011) The heart failure epidemic. Medicographia 33(4):363–369

    Google Scholar 

  4. Mourão-Miranda J, Bokde ALW, Born C, Hampel H, Stetter M (2005) Classifying brain states and determining the discriminating activation patterns: support vector machine o functional MRI data. Neuroimage 28(4):980–995

    Article  Google Scholar 

  5. Al-Shayea QK (2011) Artificial neural networks in medical diagnosis. Int J Comput Sci Issues 8(2):150–154

    Google Scholar 

  6. Jui JJ, Molla MMI, Rashid M, Bari BS, Hasan MJ (2020) Flat price prediction using linear and random forest regression based on machine learning techniques. In: Embracing Industry 4.0, pp 205–217

    Google Scholar 

  7. Molla MMI, Jui JJ, Rashid M, Bari BS, Hasan MJ (2019) Cardiotocogram data classification using random forest based machine learning algorithm. In: 11th National Technical Seminar on Underwater System Technology. Springer, Singapore, pp 357–369

    Google Scholar 

  8. Shafi ASM, Molla MMI, Jui JJ, Rahman MM (2020) Detection of colon cancer based on microarray dataset using machine learning as a feature selection and classification techniques. SN Appl Sci 1243(2):1–8

    Google Scholar 

  9. L’opez-Send’on J (2011) The heart failure epidemic. Medicographia 33:363–369

    Google Scholar 

  10. Vanisree K, Singaraju J (2011) Decision support system for congenital heart disease diagnosis based on signs and symptoms using neural networks. Int J Comput Appl 19(6):6–12

    Google Scholar 

  11. Nazir S, Shahzad S, Septem Riza L (2017) Birthmark-based software classification using rough sets. Arab J Sci Eng 42(2):859–871

    Article  Google Scholar 

  12. Samuel OW, Asogbon GM, Sangaiah AK, Fang P, Li G (2017) An integrated decision support system based on ANN and Fuzzy_AHP for heart failure risk prediction. Expert Syst Appl 68:163–172

    Article  Google Scholar 

  13. Detrano R, Janosi A, Steinbrunn W (1989) International application of a new probability algorithm for the diagnosis of coronary artery disease. Am J Cardiol 64(5):304–310

    Article  Google Scholar 

  14. Gudadhe M, Wankhade K, Dongre S (2010) Decision support system for heart disease based on support vector machine and artificial neural network. In: Proceedings of International Conference on Computer and Communication Technology (ICCCT), Allahabad, India, September, pp 741–745

    Google Scholar 

  15. Kahramanli H, Allahverdi N (2008) Design of a hybrid system for the diabetes and heart diseases. Expert Syst Appl 35(1–2):82–89

    Article  Google Scholar 

  16. Palaniappan S, Awang R (2008) Intelligent heart disease prediction system using data mining techniques. In: Proceedings of IEEE/ACS International Conference on Computer Systems and Applications (AICCSA 2008), Doha, Qatar, March-April 2008, pp 108–115

    Google Scholar 

  17. UCI Machine Learning Repository: Heart Disease Data Set. Accessed 25 Mar 2021, http://archive.ics.uci.edu/ml/datasets/Heart+Disease

  18. Badbe V, Londhe V, Shirole G (2016) Analysis of heart disease by LVQ in neural network. Int J Recent Innov Trends Comput Commun 4:603–607

    Google Scholar 

  19. Putri NK, Rustam Z, Sarwinda D (2019) Learning vector quantization for diabetes data classification with chi-square feature selection. IOP Conf Ser Mater Sci Eng 546(5):052059

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Molla, M.M.I., Islam, M.S., Shafi, A.S.M., Alam, M.K., Islam, M.T., Jui, J.J. (2022). Feature Selection and Prediction of Heart Disease Using Machine Learning Approaches. In: Md. Zain, Z., Sulaiman, M.H., Mohamed, A.I., Bakar, M.S., Ramli, M.S. (eds) Proceedings of the 6th International Conference on Electrical, Control and Computer Engineering. Lecture Notes in Electrical Engineering, vol 842. Springer, Singapore. https://doi.org/10.1007/978-981-16-8690-0_83

Download citation

Publish with us

Policies and ethics