Abstract
In order to assist the diagnosis procedure of heart sound signals, this paper presents a new automated method for classifying the heart status using a rule-based classification tree into normal and three abnormal cases; namely the aortic valve stenosis, aortic insufficient, and ventricular septum defect. The developed method includes three main steps as follows. First, one cycle of the heart sound signals is automatically detected and segmented based on time properties of the heart signals. Second, the segmented cycle is preprocessed with the discrete wavelet transform and then largest Lyapunov exponents are calculated to generate the dynamical features of heart sound time series. Finally, a rule-based classification tree is fed by these Lyapunov exponents to give the final decision of the heart health status. The developed method has been tested successfully on twenty-two datasets of normal heart sounds and murmurs with success rate of 95.5%. The resulting error can be easily corrected by modifying the classification rules; consequently, the accuracy of automated heart sounds diagnosis is further improved.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
Heart diseases including valve disorders are still one of main leading causes of death. The cardiac auscultation can be used for early detection of abnormal function of the heart before recommending other diagnosis modalities. It is the technique of interpreting heart sounds, which result from electromechanical activity of the cardiac muscle during each heartbeat such as the atrioventricular (AV) valves closure and opening events [1].
Heart valves produce low frequency transient signals in case of normal heart sounds. For pathological cardiac sounds or murmurs [2], they result of turbulence in blood flow through stenosis or regurgitation through the cardiac valves. High-frequency noise-like sounds present heart murmurs [3]. In this study, we investigated normal heart sounds and three different cases of the heart abnormalities; namely the aortic valve stenosis (AS), aortic insufficient (AI), and ventricular septum defect (VSD).
Stethoscopes are the standard aid to acquire and hear the heart sounds [2]. However, the cardiologist needs extensive training and practical experience with good hearing ability to perform the auscultation technique successfully. Although electronic stethoscopes have been developed to overcome the drawbacks of traditional stethoscopes, but they sometimes need a professional adjustment to make clearly all portions of the heart signal audible. To avoid these complications of using stethoscopes, the phonocardiography is alternatively used for displaying the sounds produced by the heart in a graphical form similar to the electrocardiograms [4]. Therefore, the heart diseases can be visually appeared in the corresponding phonocardiogram (PCG) signals.
Non-stationary property and large variations of PCG signals make visual accurate diagnosis of the heart status a difficult task [5]. Hence, employing digital signal processing methods to automatically analyze PCG signals can ease and assist the diagnosis procedure of the heart status. Features extraction and classification are two major steps of PCG recognition algorithms [6]. In previous studies, characterized features of the PCG signals could be extracted based on signal transformation methods either in frequency or time-frequency domains such as Fourier transform, short time Fourier transform (STFT) [7], and wavelet transform [8].
The wavelet transform provides time-varying monitoring window of adjustable sizes for slow frequencies and narrow enough for tracking rapidly changing ranges. Various intelligent systems have been developed to classify heart sound abnormalities for symptom detection and computer-aided diagnosis, such that the wavelet transforms were applied for segmentation and feature extraction with supervised neural networks for classification of the PCG signals [6, 8]. But the coefficients of wavelet transform as representation of heart signal features are still limited, because they don’t provide information about the underlying nonlinear signal dynamics [9]. In this study, Lyapunov exponents of chaotic bio-signal time series [10] are integrated with discrete wavelet transform (DWT), in order to determine dynamical measures of PCG signals as useful parameters for clinically recognizing heart murmurs.
Multilayer perceptron (MLP) Neural networks and support vector machines (SVMs) have been widely used in many previous studies for classifying PCG signals [3, 5, 11]. These classifiers must be initially trained well by all possible heart sounds and murmurs data to give correct classification results. Practically, it is not available to have a standard data collection of PCG signals; consequently, supervised classifiers suffer from the lack of sufficient training PCG datasets, affecting the classification accuracy. On the other hand, advanced hardware resources with long time, e.g. several days, may be needed to perform the successful training initialization of supervised classifiers.
In order to avoid the drawbacks of classifier training phase, this study presents a rule-based classification tree (RCT) for unsupervised classification of PCG signals into normal, AS, AI, and VSD based on both the experts’ rules and PCG features extracted from Lyapunov exponents. The RCT model provides “if-then” rules which can easily read and interpret similar to the personal diagnosis procedure by physicians in medical applications [12].
Materials and methods
Datasets
Twenty-two sets of heart sounds and murmurs data have been used in this study. The datasets are taken from a public database produced by the CliniSurf, Faculty of Medicine, University of Bern, Switzerland. They include 3 sets for normal healthy heart and 19 sets represent three cases of heart abnormalities, which divided into 4, 10, and 5 datasets for AS, AI, and VSD, respectively. Each dataset contains about 15 heartbeat cycles with a sample rate of 44.1 kHz.
Segmentation of heartbeat cycles
Automatic segmentation of one heartbeat cycle is the first step of the heart sound signals analysis and diagnosis procedure. Some previous studies used the electro-cardiogram (ECG) recordings to assist in segmenting the heart sounds [1]. However, the corresponding synchronized ECGs are not available all the time with the PCG collections data as presented in this study.
Spectral analysis was successfully used to separate the heart sound signals into individual cycles, such that the original signal is down sampled by a factor of four to give the maximum power of the first and second heart sounds (S1 and S2) in each cycle using Daubechies four and five-coefficients (DUB4 and 5) wavelet, respectively [13]. Heart sounds were also segmented based on cycle frequency and dynamic clustering in time-frequency domains [14].
To avoid the complexity of segmenting PCG signals based on signal decomposition and reconstruction using time-frequency analysis, this study used a simple and effective algorithm to segment the heart sounds into cycles in time domain only [15, 16]. As one heart beat is characterized by four temporal states S1, systolic phase, S2, and diastolic phase; the dataset will be divided into segments each contains these four phases. From a previous study, it has been found that the maximum duration of S1 and S2 is within 150 ms [16], so a maximum value within 150 ms ranges is considered as a peak. Using the sampling frequency of the PCG datasets, that is equivalent to about 6000 samples. In addition, Shannon Energy technique [17] has been adopted where the energy of the heart sound signal has been calculated to attenuate the effect of low noise and maximize the S1 and S2 sounds.
The segmentation procedure can be summarized in the following steps:
-
Calculate and normalize Shannon Energy of the dataset using Eq. 1.
-
Detect all the peaks within a predefined threshold of the max value. In this paper, any threshold between 40% to 90% of the maximum value has been tested successfully.
-
Detect only the significant peaks of S1 and S2. A maximum value of an overshoot within 150 ms range is considered as a peak and its location was registered. Other peaks within this range are rejected.
-
Using the peak locations and the length of the whole dataset; determine the average cycle length and the number of cycles.
-
Divide the dataset into cycles using the determined average length.
The segmentation technique resulted in individual cycles that contain the four phases of the heartbeat. Moreover, the spectral characteristics of the resulted cycles were estimated using welch method and compared to that of the whole dataset to ensure the consistency of the segmentation procedure. No significant differences have been found between the spectrum of one heart sound cycle and that of the whole dataset.
Preprocessing of heart sounds using DWT
The DWT can be viewed as a constant filter bank with octave spacing between the centers of the filters such that the wavelet transform decomposes a discrete signal, x[n], into two sub-signals of half its length.
The DUB4 wavelet [18] is used here to preprocess the PCG recordings, because it is the most accurate type of wavelets to accomplish this task, as demonstrated by previous studies [19–21]. The third level of approximation of the heart sounds was selected to represent the tested datasets without losing the main features of original signals. Moreover, this approximation of the heart signals is smooth and noise free because of the wavelet filtering.
Features generation using lyapunov exponents
Calculating the largest Lyapunov exponent is the common test for chaos in data time series if it has a positive value [9]. The Lyapunov exponent is a statistical measure of divergence between two orbits starting with slightly different initial conditions. Assuming d0 and dn are initial divergence between two trajectories and divergence between These orbits after n steps, respectively, the value of largest Lyapunov exponent, λ, can be calculated by
In this study, ten largest Lyapunov exponents were calculated and normalized from the observed time series of PCG signals by dividing each preprocessed dataset into ten equal regions with d0 less than 10−5. A summary of the resulted Lyapunov exponents for all tested PCG dataset is illustrated in Table 1. The bold values of Lyapunov exponents indicate the distinguishable features of each category of all tested PCG datasets and used to construct the rules of the developed classification tree, as presented in the following section.
Building the RCT model of the heart health status
Tree models such as classification and regression trees (CART) have been widely used for predictive modeling and data mining [22]. The basic idea of a classification tree is to construct a binary decision tree sequentially by using splitting if-then rules based on variables to partition the data in such a way to reduce the conditional variation in response to these variables. Many algorithms have been proposed in the literature to automate growing the classification tree with the training data, e.g. diagnosing heart sounds [23]. However, in this study we developed a non-learning static RCT model using the largest Lyapunov exponents to directly decide the health status of the heart. The position and value of each Lyapunov exponent (LPE) are used to derive the following splitting rules, in order to give the correct output of RCT if the heart has either normal healthy condition or a specific disease, as shown in Fig. 1:
-
Rule 1: If LPE #2 ≥ 0.6 then the heart disorder is AI, else
-
Rule 2: If LPE #2 & #3 & #4 & # 8 ≤ 0.5 then the heart is normal (N), else
-
Rule 3: If LPE #9 < 0.5 then the heart disorder is AS, else
-
Rule 4: If LPE #9 ≥ 0.5 then the heart disorder is VSD, else the heart disorder is undefined.
Results and validation
Successful segmentation of four different datasets of the PCG signals into single heartbeat cycles are shown in Fig. 2. The original heart signals, corresponding normalized energy and final segmented cycle of the heart sounds are depicted for each PCG dataset. Fig. 3 shows a comparison between detected and real heartbeat cycles of all tested datasets, in order to evaluate the accuracy of the segmentation algorithm. All tested datasets have about fifteen cycles (or peaks). An error occurred only in dataset of VSD #4, because the expected time length of one heartbeat was long to show two heart cycles of this special VSD case. The cycles in the rest of PCG datasets are however correctly detected, achieving a success rate of 95.5%.
Cross validation was used to evaluate the performance of the developed RCT of PCG signals. It results in a confusion matrix with four possible outcomes; namely true positive (TP), false negative (FN), true negative (TN), and false positive (FP) [11, 24]. Table 2 illustrates classification results of all tested PCG datasets using the developed RCT. The dataset of VSD #4 is only misclassified due to the overlapped rule of recognizing the AS heart disorder, as depicted above in Fig. 1. Therefore, the developed RCT has been manually adjusted to overcome this misclassification error by adding new splitting rule (dashed line) as shown in Fig. 4.
Evaluation results of the developed PCG classification method is presented in Table 3. Measured classification performance of VSD and AI heart disorders showed accuracy of 95.45% because of one misclassified VSD dataset, but accurate results of 100% are validated for both tested normal and AS datasets.
Discussion
Validation of our developed method for automated diagnosis of PCG signals demonstrated the effectiveness and flexibility of both Lyapunov exponents to identify the important features of chaotic heart sounds and the unsupervised RCT to classify different heart disorders successfully.
Figure 3 shows the segmentation error of one heartbeat cycle in the dataset VSD #4, which has been manually corrected to continue the workflow of PCG classification method. However, automatic segmentation of PCG signals into heartbeat cycles was accurate for 21 of 22 tested datasets based on time-domain properties of the heart sounds, i.e. amplitude and distance thresholds.
The advantages of RCTs are exploited in this study to employ unsupervised classification of the PCG signals based on Lyapunov exponents. As illustrated in Tables 2 and 3, classification and evaluation results of the developed PCG classifier showed robust performance to assist the diagnosis procedure of the heart health status accurately. The misclassified error was only appeared in the dataset VSD #4, and it has been easily fixed by modifying the splitting rules, as depicted in Fig. 4.
Deriving if-then rules of the developed RCT presents the critical step to achieve correct classification results. It may require continuous updating because of increasing the total number of PCG datasets, or adding other heart diseases, or any unexpected error, as illustrated in Table 2. Therefore, automating this step using optimization methods can be investigated in the future work. Nevertheless, the performance of our developed classification method of PCG signals is still superior to the other classification methods, which depend mainly on the success of the training phase.
Conclusions
A new automated diagnosis method of PCG signals has been developed to classify different clinical cases of the heart health status. The developed diagnosis method is mainly based on normalized Lyapunov exponents and the RCT. Compared to the previous studies, our classification method is unsupervised, and can successfully cover large variations of twenty-two PCG datasets, without the need for any training data. Additionally, real-time application of automated heart sounds and murmurs diagnosis in the clinical field is an important prospect of this study.
References
Syed, Z., Leeds, D., Curtis, D., Nesta, F., Levine, R.A., and Guttag, J., A framework for the analysis of acoustical cardiac signals. Biomedical Engineering, IEEE Transactions on. 54(4):651–662, 2007. doi:10.1109/tbme.2006.889189.
Webster, J.G., The physiological measurement handbook. CRC Press, Boca Raton, 2014.
Safara, F., Doraisamy, S., Azman, A., Jantan, A., and Ranga, S., Wavelet packet entropy for heart murmurs classification. Adv. Bioinforma. 2012:6, 2012. doi:10.1155/2012/327269.
Brusco, M., and Nazeran, H., Development of an Intelligent PDA-based Wearable Digital Phonocardiograph. In: Engineering in Medicine and Biology Society, 2005. IEEE-EMBS 2005. 27th annual international conference of the, 17–18 Jan. 2006 2005, pp 3506–3509. doi:10.1109/iembs.2005.1617235.
Kao, W.-C., and Wei, C.-C., Automatic phonocardiograph signal analysis for detecting heart valve disorders. Expert Syst. Appl. 38(6):6458–6468, 2011. doi:10.1016/j.eswa.2010.11.100.
Reed TR, Reed NE, Fritzson P (2004) Heart Sound Analysis for Symptom Detection and Computer-Aided Diagnosis. Simulation Modeling Practice and Theory 12. doi:10.1016/j.simpat.2003.11.005
Chauhan, S., Wang, P., Sing Lim, C., and Anantharaman, V., A computer-aided MFCC-based HMM system for automatic auscultation. Comput. Biol. Med. 38(2):221–233, 2008. doi:10.1016/j.compbiomed.2007.10.006.
Debbal, S.M., and Bereksi-Reguig, F., Automatic measure of the split in the second cardiac sound by using the wavelet transform technique. Comput. Biol. Med. 37(3):269–276, 2007. doi:10.1016/j.compbiomed.2006.01.005.
Übeyli, E.D., Adaptive neuro-fuzzy inference system for classification of ECG signals using lyapunov exponents. Comput. Methods Prog. Biomed. 93(3):313–321, 2009. doi:10.1016/j.cmpb.2008.10.012.
Eckmann, J., Kamphorst, S.O., Ruelle, D., and Ciliberto, S., Liapunov exponents from time series. Phys Rev A Gen Phys. 34(6):4971–4979, 1986.
Maglogiannis, I., Loukis, E., Zafiropoulos, E., and Stasis, A., Support vectors machine-based identification of heart valve diseases using heart sounds. Comput. Methods Prog. Biomed. 95(1):47–61, 2009. doi:10.1016/j.cmpb.2009.01.003.
Melillo, P., Luca, N.D., Bracale, M., and Pecchia, L., Classification tree for risk assessment in patients suffering from congestive heart failure via long-term heart rate variability. IEEE Journal of Biomedical and Health Informatics. 17(3):727–733, 2013. doi:10.1109/jbhi.2013.2244902.
Faizan, J., Venkatachalam, P.A., and Ahmad Fadzil, M.H., A signal processing module for the analysis of heart sounds and heart murmurs. J. Phys. Conf. Ser. 34(1):1098, 2006.
Tang, H., Li, T., Qiu, T., and Park, Y., Segmentation of heart sounds based on dynamic clustering. Biomedical Signal Processing and Control. 7(5):509–516, 2012. doi:10.1016/j.bspc.2011.09.002.
Gill, D., Gavrieli, N., Intrator, N., Detection and identification of heart sounds using homomorphic envelogram and self-organizing probabilistic model. In: Computers in Cardiology, 2005, 25–28 Sept. 2005 2005, pp 957–960. doi:10.1109/cic.2005.1588267.
Ari, S., Kumar, P., Saha, G., On An Algorithm for Boundary Estimation of Commonly Occurring Heart Valve Diseases in Time Domain. In: 2006 Annual IEEE India conference, Sept. 2006, pp 1–5. doi:10.1109/indcon.2006.302758.
Choi, S., and Jiang, Z., Comparison of envelope extraction algorithms for cardiac sound signal segmentation. Expert Syst. Appl. 34(2):1056–1069, 2008. doi:10.1016/j.eswa.2006.12.015.
Daubechies, I., Orthonormal bases of compactly supported wavelets II. Variations on a theme. SIAM J. Math. Anal. 24(2):499–519, 1993. doi:10.1137/0524031.
Babaei, S., and Geranmayeh, A., Heart sound reproduction based on neural network classification of cardiac valve disorders using wavelet transforms of PCG signals. Comput. Biol. Med. 39(1):8–15, 2009. doi:10.1016/j.compbiomed.2008.10.004.
Gergely, S., Roman, M. N., Fort, C., Multirate Sampling in PCG Signal Correlation. In: Vlad, S., Ciupa, R. (Eds.), International Conference on Advancements of Medicine and Health Care through Technology. Vol. 36. IFMBE Proceedings. Springer Berlin Heidelberg, pp 198–201, 2011. doi:10.1007/978–3–642-22586-4_43
Karar, M. E., El-Brawany, M., Embedded heart sounds and murmurs generator based on discrete wavelet transform. In: 2016 Fourth International Japan-Egypt Conference on Electronics, Communications and Computers (JEC-ECC), May 31 2016–June 2 2016 2016, pp 34–37. doi:10.1109/jec-ecc.2016.7518962
Gray, J.B., and Fan, G., Classification tree analysis using TARGET. Computational Statistics & Data Analysis. 52(3):1362–1372, 2008. doi:10.1016/j.csda.2007.03.014.
Stasis, A.C., Loukis, E.N., Pavlopoulos, S.A., and Koutsouris, D., A multiple decision trees architecture for medical diagnosis: the differentiation of opening snap, second heart sound split and third heart sound. Comput. Manag. Sci. 1(3):245–274, 2004. doi:10.1007/s10287-004-0015-8.
Sokolova, M., and Lapalme, G., A systematic analysis of performance measures for classification tasks. Inf. Process. Manag. 45(4):427–437, 2009. doi:10.1016/j.ipm.2009.03.002.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Funding
There is no funding source.
Conflict of Interest
The authors declare that they have no conflict of interest.
Ethical Approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
This article is part of the Topical Collection on Image & Signal Processing
Rights and permissions
About this article
Cite this article
Karar, M.E., El-Khafif, S.H. & El-Brawany, M.A. Automated Diagnosis of Heart Sounds Using Rule-Based Classification Tree. J Med Syst 41, 60 (2017). https://doi.org/10.1007/s10916-017-0704-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10916-017-0704-9