Keywords

1 Introduction

When the heart beats, it pushes the blood against arteries with some force, which creates some pressure called as systolic blood pressure. The heart takes rest during the beats, while the pressure inside the arteries is called diastolic blood pressure. Most of the literature says that hypertension (HBP) or heart rate change leads to heart failure or stroke [1]. Recent health statistics show that men and women of age above 25 are prone to hypertension [3]. We consider blood pressure level, obesity condition, total cholesterol level as reported by laboratories and scientific studies as shown in Table 1.

Table 1 Ranges for BP, obesity, and cholesterol

In this research work, we address the classification methodology to predict whether a person is a victim of HBP or not. We have taken parameters like age, obesity level, and complete blood cholesterol level of a person.

The rest of the paper is organized as follows. A background of the factors influencing HBP is given in Sect. 2. Procedure for identifying a specific classifier based on the performance measures is given in Sect. 2.1, and Sect. 4 evaluates the performance of a classifier. Section 5 concludes and gives future outline of the paper.

2 Background Work

This section describes overview of the factors influencing HBP.

2.1 Factors Effecting BP

Blood pressure (BP) is mainly effected by the cardiac output (CO) and total peripheral resistance (TPR), which is calculated using (1).

$$\begin{aligned} BP = CO * TPR \end{aligned}$$
(1)

Here, CO is effected by increased venous return or stroke volume or heart rate and sympathetic activity. TPR is effected by the resistance that acts against the blood flow in the arteries. It may be due to a blood clot or fat in the blood vessels. CO effects the systolic blood pressure, whereas TPR effects the diastolic blood pressure [4]. So, CO is proportional to a number of heart beats per minute (HBM) and volume of blood (BV) pushed in each beat.

$$\begin{aligned} CO = HBM * BV \end{aligned}$$
(2)
$$\begin{aligned} BP = Systolic/ Diastolic \end{aligned}$$
(3)
$$\begin{aligned} Systolic / Diastolic = CO / TPR \end{aligned}$$
(4)

Sometimes, we may consider mean arterial blood pressure (MABP) which can be calculated as

$$\begin{aligned} MABP = (2 * (Diastolic + Systolic)) / 3 \end{aligned}$$
(5)

If MABP [2] is within the range, then all organs and tissues will get enough blood, oxygen, and nutrients. Our paper focuses on age, obesity, and cholesterol levels of a person in elevating the blood pressure.

2.2 Impact of Age, Obesity, and Cholesterol on Hypertension

Aging is inevitable although a person has a healthy diet and exercise regularly. As we age, arteries may become narrow and harden and the ability of body to process sodium in the diet decreases [6]. The person is said to be obese if his body mass index (BMI) value is more than 30. In obese people, there is increased fatty tissue which needs more blood to live. High blood cholesterol is one of the main reasons for fat deposits in arteries which may harden the arteries also. However, age, obesity, and high blood cholesterol are playing their role to elevate the blood pressure of a person.

2.3 Nervous System

It will monitor changes in the state of a human body and transmits signals to and from different parts of the body. It is of two types [5]: voluntary system (VS) and automatic nervous system (ANS).

2.3.1 Voluntary System

It deals with movement and sense. The reaction of VS happens, when we move our hand in any direction or close our eye. It is under the control of human sense.

2.3.2 Automatic Nervous System

It controls the heartbeat, digestive system, BP, etc. It is not under the control of human sense. It is of two types, namely sympathetic nervous system (SNS) and parasympathetic nervous system (PNS).

Sympathetic Nervous System:

When we experience a sudden fear or high anxiety, SNS helps to prepare the body to defend itself by sending more blood to brain, muscles. This may increase BP. If SNS is active for a longer period, it is not good for human body (Fig. 1).

Fig. 1
figure 1

Sympathetic action and its effects on circulation

Parasympathetic Nervous System:

PNS looks after the immune system to work effectively. PNS is active in the states like being calm, relaxed, and positive. It mostly works against the HBP.

2.4 How Brain Monitors BP

The BP management refers to heartbeat, arterial blood pressure, total blood volume, \(O_{2}\), and \(CO_{2}\) levels in the blood [7]. It uses the sensory system to know the state of the human system and update the same to the brain. Then brain converts this information to response signals via the peripheral nervous system, where the control listeners are waiting to take action on the received information. They may act to increase or decrease blood flow to various tissues by a change in systemic resistance. The response of the control is designed to maintain overall stable blood flow in the human body.

3 Proposed Methodology

In our work, we use a data mining classification technique. It borrows concepts from probability, statistics, fuzzy logic, neural networks to predict class labels of an object. The classification model will have two stages [8]: 1. training stage, where the model is trained with a set of records whose class labels are already known and 2. testing stage, where the model is going to predict class labels of a set of records whose class labels are unknown. There are various classifiers; we used a decision tree-based classifier named J48.

The classifier evaluation is most often based on prediction accuracy (the percentage of correct prediction divided by the total number of predictions). If accuracy is unsatisfactory, then to find the reason different factors must be observed like: 1. Relevant attributes of the problem are considered, 2. it may need more training records, 3. the dimensionality of the problem is too high, 4. algorithm that we select may not be suitable, and 5. parameter tuning is needed.

4 Experimental Result and Analysis

In our experiments, we use a classifier called J48-C 0.25-M2 (weka.classifiers.trees) to classify the tuples. The input data set is doctorcwekacsv-weka.filters.unsupervised.attribute.Remove-R2-weka.filters.unsupervised.attribute.Remove-R4. We have used 1045 number of records for our analysis and considered four attributes like age, obesity level, cholesterol, and BP for evaluation (Table 2).

Table 2 Performance measures of a classifier

TP rate, FP rate, precision, recall, F-measure, Matthews correlation coefficient (MCC), receiver operating characteristic curve (ROC), precision–recall curve (PRC) are calculated for each class for the test data as presented in Table 3.

Table 3 Detailed accuracy by class

We trained the algorithm using 66% of input records and used remaining 34% of records to test the algorithm. The time taken to build the model is 0.05 s and time taken to test the model is 0.05 s. Some instances predicted correctly, some are predicted incorrectly, and the accuracy of the classifier, values of different errors and error rate of a classifier are as presented in Table 2.

The matrix in Table 4 represents 120 records, whose class labels are YES and predicted as YES, but 17 records whose class labels are YES and predicted as NO. For class label NO, 24 records are predicted as YES, and 194 records are predicted as NO.

Table 4 Confusion matrix
Fig. 2
figure 2

Comparison of error rate

Fig. 3
figure 3

Comparison of error rate

Fig. 4
figure 4

Comparison of error rate

The total number of right and wrong predictions is drawn using decision tree-based J48 algorithm. Figure 2a represents the distribution of test samples predicted right, wrong for selected attribute obesity. Figure 2b represents the distribution of test samples predicted right, wrong for selected attribute age. Figure 3a represents the distribution of test samples predicted right, wrong for selected attributes age, cholesterol. Figure 3b represents the distribution of test samples predicted right, wrong for selected attributes age, obesity. Figure 4a represents the distribution of test samples predicted right, wrong for selected attribute cholesterol. Figure 4b represents the distribution of test samples predicted right, predicted wrong for selected attributes cholesterol, obesity. Accuracy and error comparison of different classifiers are shown in Table 5.

Table 5 Accuracy and error comparison of different classifiers

5 Conclusion and Future Work

This paper used a decision tree-based J48 algorithm to predict whether a person is prone to HBP or not. By analyzing the experimental results, we conclude that if the age is more than 30 and cholesterol is more than 205, then the possibility of a person prone to the HBP is high. For certain records, if age is between 35 and 50, cholesterol is less than 150, and even obesity is more than 30 then as per the experimental analysis, the person may not be prone to HBP. If the age of a person is more than 55 irrespective of cholesterol and obesity levels, then that person is prone to HBP. As part of the future work, we will consider parameters like anxiety, depression along with age, obesity, and cholesterol, to predict HBP of a person using multiple regression-based classifiers.