Abstract
Today, big data occupies a significant place in each one’s life. Various supervised and unsupervised machine learning algorithms exist which can be applied on this big data to get the desired information. A wide application of the classification algorithms is seen in analyzing the data of medicine or health where it has been useful in predicting the disease based on the given features. In this paper, the Cleveland database from the dataset of heart disease publicly available on UCI machine learning Web site was taken, and three different classification algorithms were applied on it to predict the presence or absence of heart disease in patients. The performance of the three well-known algorithms—support vector machine, decision tree and random forest—has been compared based on various metrics accuracy, sensitivity, and specificity and error rate. The experiment has been performed on R language, a programming language which is widely used for data visualization, and the result shows that random forest algorithm outperforms the rest of the two algorithms in successfully classifying healthy and unhealthy patients. For this experiment, fourteen different features like age, resting electrocardiographic results, resting blood pressure, chest pain type, etc., of an individual have been considered.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
A. Oussous, F.Z. Benjellun, A.A. Lahcen, S. Belfkih, Big data technologies: a survey. J. King Saud Univ. Comput. Inf. Sci. 30(4), 431–448 (2018)
P.P. Shinde, K.S. Oza, R.K. Kamat, Big data predictive analysis: using R analytical tool, in 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), Palladam (2017), pp. 839–842
A. Dey, Machine learning algorithms: a review. Int. J. Comput. Sci. Inf. Technol. 7(3), 1174–1179 (2016)
M. Alloghani, D. Al-Jumeily, J. Mustafina, A. Hussain, A.J. Aljaaf, A systematic review on supervised and unsupervised machine learning algorithms for data science, in Supervised and Unsupervised Learning for Data Science (Springer, Berlin, 2020), pp. 3–21
S.B. Kotsiantis, I.D. Zaharakis, P.E. Pintelas, Machine learning: a review of classification and combining techniques. Artif. Intell. Rev. 26, 159–190 (2006)
O. Simeone, A very brief introduction to machine learning with applications to communication systems. IEEE Trans. Cogn. Commun. Networking 4(4), 648–664 (2018)
Y. Yang, J. Li, Y. Yang, The research of the fast SVM classifier method, in 2015 12th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP), Chengdu (2015), pp. 121–124
M.A. Hearst, S.T. Dumais, E. Osuna, J. Platt, B. Scholkopf, Support vector machines. IEEE Intell. Syst. Appl. 13(4), 18–28 (1998)
S. Tong, D. Koller, Support vector machine active learning with applications to text classification. J. Mach. Learn. Res. 45–66 (2001)
H. Sharma, S. Kumar, A survey on decision tree algorithms of classification in data mining. Int. J. Sci. Res. (IJSR) 5(4) (2016)
M.V. Datla, Bench marking of classification algorithms: decision trees and random forests—a case study using R, in 2015 International Conference on Trends in Automation, Communications and Computing Technology (I-TACT-15), Bangalore (IEEE, 2015), pp. 1–7
B.R. Patel, K.K. Rana, A survey on decision tree algorithm for classification. IJEDR 2(1) (2014)
Y. Song, Y. Lu, Decision tree methods: applications for classification and prediction. Shanghai Arch. Psychiatry 27(2), 130–135 (2015)
A. Sarica, A. Cerasa, A. Quattrone, Random forest algorithm for the classification of neuroimaging data in Alzheimer's disease: a systematic review. Front. Aging Neurosci. (2017)
Machine Learning Databases. https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Maurya, P., Srivastava, N. (2021). Performance Evaluation of the Supervised Machine Learning Algorithms Using R. In: Bhateja, V., Satapathy, S.C., Travieso-González, C.M., Aradhya, V.N.M. (eds) Data Engineering and Intelligent Computing. Advances in Intelligent Systems and Computing, vol 1407. Springer, Singapore. https://doi.org/10.1007/978-981-16-0171-2_38
Download citation
DOI: https://doi.org/10.1007/978-981-16-0171-2_38
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-0170-5
Online ISBN: 978-981-16-0171-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)