Few-shot pulse wave contour classification based on multi-scale feature extraction

Lu, Peng; Liu, Chao; Mao, Xiaobo; Zhao, Yvping; Wang, Hanzhang; Zhang, Hongpo; Guo, Lili

doi:10.1038/s41598-021-83134-y

Few-shot pulse wave contour classification based on multi-scale feature extraction

Article
Open access
Published: 12 February 2021

Volume 11, article number 3762, (2021)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Few-shot pulse wave contour classification based on multi-scale feature extraction

Download PDF

Peng Lu^1,2,
Chao Liu^1,2,
Xiaobo Mao^1,2,
Yvping Zhao^2,3,
Hanzhang Wang^1,2,
Hongpo Zhang⁴ &
…
Lili Guo^5,6

1917 Accesses
5 Citations
Explore all metrics

Abstract

The annotation procedure of pulse wave contour (PWC) is expensive and time-consuming, thereby hindering the formation of large-scale datasets to match the requirements of deep learning. To obtain better results under the condition of few-shot PWC, a small-parameter unit structure and a multi-scale feature-extraction model are proposed. In the small-parameter unit structure, information of adjacent cells is transmitted through state variables. Simultaneously, a forgetting gate is used to update the information and retain long-term dependence of PWC in the form of unit series. The multi-scale feature-extraction model is an integrated model containing three parts. Convolution neural networks are used to extract spatial features of single-period PWC and rhythm features of multi-period PWC. Recursive neural networks are used to retain the long-term dependence features of PWC. Finally, an inference layer is used for classification through extracted features. Classification experiments of cardiovascular diseases are performed on photoplethysmography dataset and continuous non-invasive blood pressure dataset. Results show that the classification accuracy of the multi-scale feature-extraction model on the two datasets respectively can reach 80% and 96%, respectively.

Effective IoT Based Analysis of Photoplethysmography Waveforms for Investigating Arterial Stiffness and Pulse Rate Variability

Article 23 April 2024

Pulse-Wave-Pattern Classification with a Convolutional Neural Network

Article Open access 17 October 2019

Blood pressure estimation and classification using a reference signal-less photoplethysmography signal: a deep learning framework

Article 25 September 2023

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Introduction

The collection process of pulse wave signals is simple and non-invasive, enabling convenient acquisition of PWC data^1,2,3. The classification of PWC can be used to monitor the heart condition. However, no uniform standard exists for marking PWC⁴. The standard diagnostics of PWC is primarily based on an individual’s different clinical knowledge and experience to define the morphological features. The formed diagnostic results are subjective, which are difficult to be recognized by all experts.

Existing classification methods of PWC can be categorized into two: statistical methods and machine-learning methods^5,6. Statistical methods represented by artificial time-frequency domain feature extraction are suitable for stationary sequences⁷. Machine-learning methods represented by convolution neural networks^8,9 (CNN), recursive neural networks^10,11 (RNN), and support vector machines^12,13 are suitable for complex non-linear sequences. Clinically collected PWCs are non-stationary and time varying¹⁴, rendering it suitable for machine-learning methods.

Long short-term memory (LSTM), a variant of RNN, has the ability of mining long-distance time-series data information¹⁵. It is extensively used in machine translation^16,17, fault diagnosis^18,19, speech recognition^20,21, and electrocardiogram classification^22,23. In literature²⁴, the representation of speech signals from an original network is automatically learned by CNN, and then the temporal representation of features is learned by LSTM; In literature²⁵, the features of wearable sensor data is learned by CNN, and then the time dependence between actions are modeled by LSTM.

The common idea of the above methods is to extract high-dimensional features by CNN and subsequently obtain short-series features through LSTM, which can be used to predict and synthesize time series. PWC is a kind of few-shot data that difficultly meets the training needs of complex deep networks²⁶.

In the present study, a recursive-network unit structure based on LSTM is designed and found to be suitable for few-shot PWC. Compared with LSTM structure, it has fewer parameters and faster training rate. Adjacent unit information is transmitted through state variables, information is updated through a forgetting gate, and the long-term dependence of PWC is retained in the form of unit series. Simultaneously, a multi-scale feature-extraction model adapted to few-shot PWC is proposed. The multi-scale feature-extraction model is an integrated model that extracts features from three angles. The spatial features of single-period PWC and the rhythm features of multi-period PWC are extracted through CNN. The long-term dependence features of PWC are retained through RNN. All extracted features are then combined linearly for classification.

Cardiovascular-disease classification experiments are performed on photoplethysmography (PPG) dataset and continuous non-invasive blood pressure (CNBP) dataset. Experimental results show that the classification accuracy of the multi-scale feature-extraction model can reach 80% and 96%, respectively.

The remaining part is organized as follows. In the ’Materials & Methods’ section, the experimental data are stated, and multi-scale feature-extraction model are proposed. In the ’Results’ section, the process of data pre-processing and experiment are stated in detail. Finally, conclusions are drawn in the ’Discussion’ section.

Materials and methods

Data description

At present, the most commonly used methods of collecting pulse wave signals information primarily include PPG and pressure detection. PPG traces the pulsation state of blood vessels and measures pulse wave signals by measuring the attenuated light reflected and absorbed by human blood vessels and tissues²⁷. Pressure-detection method obtains pulse wave signals by directly detecting changes in the pressure of human arteries over time. The cuff is tied directly to the patient’s wrist, and the patient should maintain the same posture during measurement²⁸. Patients feel that the wrist is sore and uncomfortable during long-term measurement. Compared with the pressure-detection method, the operation of PPG is simpler and suitable for long-term measurement, but it is susceptible to environmental interference during the measurement process, and its measurement accuracy and stability are lower than those of the pressure-detection method. For pulse wave signals obtained using different acquisition methods, to verify the generalization ability of the model, this study conducts experiments based on PPG dataset and CNBP dataset.

Considering that the clinically collected pulse wave signals contain substantial noise, pre-processing steps such as data filtering, beat division, etc. are necessary. The specific pre-processing details are described in the ’Results’ section.

Dataset I: The PPG dataset for the non-invasive detection of cardiovascular diseases contains 657 pulse wave records from 219 subjects²⁹. The data covers the age range of 20–89 years old, and the sampling rate is 1000 Hz. Various diseases including hypertension and diabetes are recorded. The dataset provides four labels for normal blood pressure, pre-hypertension, and stage I/II hypertension. Table 1 shows the details of dataset I.
Table 1 Dataset I label and sample size.
Full size table
Dataset II: The CNBP dataset based on cardiovascular diseases is collected from the Fifth Affiliated Hospital of Zhengzhou University. The subjects who are 18–83 years old fill-up an informed-consent form before data collection. ZM-300 intelligent pulse wave signal collector is adopted for data acquisition, with a sampling frequency of 200 Hz. A pressure sensor based on a semiconductor strain gauge, with a sensitivity of 0.5 mV/g (bridge voltage of 6 V) and a pressure range of 0–1000 g, is used for the collection of pulse wave signals. The A/D converter is a 4-channel 12-bit converter. Dataset II contains 1326 pulse wave records from 221 subjects under six kinds of pulse pressures. It records various cardiovascular diseases, including hypertension and arrhythmia.

Moreover, the dataset is collected under the control of standard experimental conditions and specifications. All methods are carried out in accordance with relevant guidelines and regulations, and all experimental protocols are approved by the Ethics Committee of Drug Clinical Trials of the Fifth Affiliated Hospital of Zhengzhou University. Table 2 shows the details of dataset II.
Table 2 Dataset II label and sample size.
Full size table

Multi-scale feature-extraction model

Few pulse wave datasets of cardiovascular diseases are available. At the same time, the feature division of PWC is unclear, and no uniform marking standard exists. The machine needs to learn PWC features automatically. In view of the above problems, a solution is to establish a feature-extraction model based on few-shot PWC. In this paper, a multi-scale feature-extraction model is proposed.

The multi-scale feature-extraction model for few-shot PWC is shown in Fig. 1. N continuous PWC cycles are used as the network input, and then three network layers are connected in parallel to extract the multi-scale features of PWC. The recursive network layer is used to extract the length features of PWC; the periodic feature-extraction layer is used to extract the features of PWC within a single cycle; and the rhythm feature-extraction layer is used to extract PWC features between multiple cycles. Finally, features extracted from the three network layers are combined in the inference layer to classify PWC.

Recursive network layer

PWC is composed of multiple cycles of pulse fluctuations and belongs to a one-dimensional time series. RNN is a network which is good at dealing with time series. By modeling PWC through RNN, the correlation between several cycles of PWC can be obtained. PWC is segmented in chronological order and sequentially input into RNN. The output state of the hidden layer represents the long-term memory feature extracted as the recursive network layer.

Considering the number of sample points and the complexity of network training, a recursive-network unit structure for few-shot PWC is designed. The forgetting and saving of information at each moment are controlled by the forgetting gate. Compared with the LSTM unit structure, the parameters are fewer. The specific structure is shown in Fig. 2.

In the recursive-network unit for few-shot PWC, the unit state vector c is controlled by the forgetting gate. The number of unit states of the previous-cycle PWC retained in the current-cycle PWC is determined by the forgetting gate. The forward calculation formula of the forgetting gate is as follows:

$$\begin{aligned} ne{t_{f,t}}= & {} {W_f} \cdot [{h_{t - 1}},{x_t}] + {b_f} \end{aligned}$$

(1)

$$\begin{aligned} {f_t}= & {} \sigma (ne{t_{f,t}}) \end{aligned}$$

(2)

where the weighted input of the forgetting gate is represented by $ne{t_{f, t}}$, the weight matrix of the forgetting gate is represented by ${W_f}$, the operation of splicing two vectors into one vector is represented by $[{h_{t - 1}},{x_t}]$, the offset term of the forgetting gate is represented by ${b_f}$, the forgetting gate is represented by ${f_t}$, and $\sigma$ refers to sigmoid function.

The weight matrix ${W_f}$ is formed by splicing ${W_{fh}}$ and ${W_{fx}}$, corresponding to the previous-cycle PWC ${h_{t - 1}}$ and the current-cycle PWC ${h_t}$:

$$\begin{aligned} ne{t_{f,t}} = \left[ {\begin{array}{*{20}c} {W_{fh}}&{}{W_{fx}}\\ \end{array} } \right] \left[ {\begin{array}{*{20}c} {h_{t - 1}}\\ {x_t}\\ \end{array} } \right] + {b_f} = {W_{fh}}{h_{t - 1}} + {W_{fx}}{x_t} + {b_f} \end{aligned}$$

(3)

The input state ${{\tilde{c}}_t}$ of the current-cycle PWC is calculated from the unit output of the previous cycle and the input of the current cycle:

$$\begin{aligned} ne{t_{{\tilde{c}},}}_t= & {} {W_c} \times [{h_{t - 1}},{x_t}] + {b_c} = {W_{ch}}{h_{t - 1}} + {W_{cx}}{x_t} + {b_c} \end{aligned}$$

(4)

$$\begin{aligned} {{\tilde{c}}_t}= & {} tanh(ne{t_{{\tilde{c}},}}_t) \end{aligned}$$

(5)

where the weighted input of the PWC unit state is represented by $ne{t_{{\tilde{c}},t}}$; the weight matrix of the PWC unit state is represented by ${W_c}$, which is composed of two matrices ${W_{ch}}$ and ${W_{cx}}$; and the bias term of the forgetting gate is represented by ${b_c}$.

To update the cell state of the current cycle, the current memory ${{\tilde{c}}_t}$ needs to be combined with the long-term memory ${c_{t - 1}}$, and new information needs to be added whilst forgetting some information. The unit state ${c_{t - 1}}$ of the previous-cycle PWC is multiplied with the corresponding position element of the forgetting gate ${f_t}$, and the unit state ${{\tilde{c}}_t}$ of the current-cycle PWC is multiplied with the corresponding position element of $1 - {f_t}$ to obtain the state output of the current-cycle PWC ${c_t}$:

$$\begin{aligned} {c_t} = diag[{f_t}]{c_{t - 1}} + diag[1 - {f_t}]{{\tilde{c}}_t} \end{aligned}$$

(6)

The output is determined by the state of the PWC unit:

$$\begin{aligned} {h_t} = tanh({c_t}) \end{aligned}$$

(7)

As shown in Fig. 3, the recursive network layer is designed as a two-layer structure. The time of the main wave peak of the front PWC is used as the reference, and the single beat PWC of two adjacent troughs is used as the input of the unit. To mine the information that is opposite to the first-layer network-information transmission direction, a layer of reverse network is built on top of each layer of forward network, and the output of the first-layer network is directly used as the input of the reverse network. The output of the lower networks (${h_0}\sim {h_n}$) is used as the input of the reverse network after random screening by the upper layer:

$$\begin{aligned}&{r_i}\sim {Bernoulli(p)} \end{aligned}$$

(8)

$$\begin{aligned}&{x_t} = diag({r_t}) \times {h_t} \end{aligned}$$

(9)

where Bernoulli(p) represents the Bernoulli distribution according to probability p and has a value of 0 or 1; ${r_t} = [{r_0},{r_1},\ldots ,{r_n}]$, n are the dimensions of ${h_t}$, and ${x_t}$ is the input of the unit node at time t at the reverse network layer.

Periodic feature-extraction layer

The shape and change of PWC in a single cycle contain its main features. Extracting the spatial features of PWC in a single cycle is necessary. However, PWC is sparse and contains little information at a single moment. If all points of PWC are used as input sessions, the number of network parameters becomes too large. At the same time, the training efficiency and accuracy of the model are greatly reduced by too many features.

To effectively extract the features of PWC in a single cycle, one-dimensional CNN is used. The convolution module has the functions of local connection and weight sharing and can extract the spatial position relationship at different time points of a single-cycle PWC. At the same time, it can reduce the amount of network parameters and reduce the model complexity.

The single-cycle PWC is a signal segment with a length of 235, and it is used as the input. The specific structure of the single-period feature of PWC is obtained through two convolution modules, which each module containing a one-dimensional convolution layer and a maximum pooling layer. To reduce the amount of computation in the process of model training and improve the computation speed, ReLU function is used as the activation function. The specific structure is shown in Fig. 4. .

Rhythm feature-extraction layer

PWC is formed by the periodic and regular pulse of pressure in arteries. However, slight differences exist between PWC cycles, which contain rhythm information and are difficult to observe artificially. The rhythm features represent the feature relationship between cycles and reflects the envelope change of the pulse signal. These features have a classification effect and can improve classification accuracy. To extract the rhythm features of PWC, a two-dimensional CNN is used.

Compared with single-cycle signals, multi-cycle signals have larger data dimensions, and a suitable convolution module scale is more important. Whilst ensuring that features are not discarded, reducing the dimension of data as much as possible is necessary. N adjacent periodic signals are spliced into a feature map, which is used as the input. The specific structure of the single-period features of PWC is obtained through two convolution modules, with each module containing one two-dimensional convolution layer and one max pooling layer. Same as the periodic feature-extraction layer, ReLU function is used as the activation function. The specific structure is shown in Figure 5.

Reasoning layer

The reasoning layer contains two fully connected layers, each neuron is fully connected to all neurons in the previous layer, and the local information with category discrimination in the network layer is integrated. The activation function of the fully connected layer adopts the ReLU function, and the training algorithm adopts the error back-propagation algorithm. The output of the last layer is passed through the SoftMax classifier to obtain PWC classification result.

Given that the gate-unit activation function in the recursive network layer is selected as the Sigmoid function, when the training result is close to the true value, the gradient operator is extremely small, and the convergence speed of the model is slow. The cross-entropy loss function is a logarithmic function. When it is close to the upper boundary, it can still be maintained at a high gradient state without affecting the convergence speed of the model. To improve the convergence speed of the model, cross-entropy is used as the loss function:

$$\begin{aligned} loss = - {1 \over N}\sum \limits _{i = 1}^N {\left( { - \sum \limits _{j = 1}^M {{y_{ij}}\log ({p_{ij}})} } \right) } \end{aligned}$$

(10)

where N is the number of samples, M is the number of PWC categories, ${y_{ij}}$ is the label of PWC, and ${p_{ij}}$ is the prediction result corresponding to each label.

Results

Evaluation index

We use Accuracy(Acc), Sensitivity (Sen), Specificity(Spe), Precision(Pre), and F1-score for model evaluation. The calculation formula for the five evaluation indicators is as follows³⁰:

$$\begin{aligned} \mathrm{{Acc}}= & {} {{TP + TN} \over {TP + TN + FP + FN}} \end{aligned}$$

(11)

$$\begin{aligned} \mathrm{{ Sen}}= & {} {{TP} \over {TP + FN}} \end{aligned}$$

(12)

$$\begin{aligned} \mathrm{{Spe}}= & {} {{TN} \over {TN + FP}} \end{aligned}$$

(13)

$$\begin{aligned} \mathrm{{ Pre}}= & {} {{TP} \over {TP + FP}} \end{aligned}$$

(14)

$$\begin{aligned} F1 - score\mathrm{{ }}= & {} \mathrm{{ }}{{2 \times Pre \times Sen} \over {Pre + Sen}} \end{aligned}$$

(15)

where TP stands for true positive and is the number of samples predicting abnormal PWC; TN stands for true negative and is the number of samples predicting normal PWC as normal; FP stands for false positive and is the number of samples predicting normal PWC as abnormal; FN stands for false negative and is the number of samples predicting abnormal PWC as normal; Acc represents the overall classification accuracy of the overall model; Sen represents the proportion of abnormal PWC that are matched and measures the model’s ability to recognize abnormal PWC; Spe represents the proportion of normal PWC that are matched and measures the pairing of the classifier, i.e., the ability to recognize normal PWC; and Pre represents the proportion of PWC classified as abnormal that are actually marked as abnormal.

Preprocessing

Pulse wave signals collected in clinical settings is easily affected by noise, including motion artifacts, inherent noise of collecting instruments, and power-supply noise. Using end-to-end training directly reduces the classification accuracy. Therefore, the original signals need to be pre-processed before the experiment.

The frequency of pulse wave signals is primarily distributed between 0.5–2 Hz. Motion artifact is primarily caused by breathing, and the respiratory frequency of normal adults is about 0.2–0.3 Hz. The inherent noise of collecting instruments is above 90 Hz. The power supply noise is 50 Hz/60 Hz. Empirical mode decomposition (EMD) is a signal-processing method based on the time-scale features of the data itself without pre-setting any basis functions³¹. EMD has obvious advantages in dealing with non-stationary and non-linear data. According to the frequency-distribution features of the pulse wave signals, the EMD method is used to remove the noise.

In the process of EMD, cubic spline function is used to interpolate the maximum value sequence and the minimum value sequence to obtain the upper and lower envelopes. The cubic spline interpolation needs to use two adjacent points before and after. Therefore, there will be divergence at the two ends of the data, that is, the derivative of the intrinsic mode-function at the boundary increases, which makes the filtered signal has obvious distortion at the beginning and end. To avoid the transient phenomenon at the beginning and end of the filtered signal, the end points of the curve are added to the spline. The Pearson correlation degree is used to measure the degree of information loss before and after denoising, and the calculation method is shown in equation (16):

$$\begin{aligned} Pearson = {{\sum {XY} - {{\sum X \sum Y } \over N}} \over {\sqrt{\left( {\sum {{X^2}} - {{{{(\sum X )}^2}} \over N}} \right) \left( {\sum {{Y^2}} - {{{{(\sum Y )}^2}} \over N}} \right) } }} \end{aligned}$$

(16)

where X represents the set of elements arranged in time in the original PWC, Y represents the set of elements arranged in time after denoising, and N represents the number of sample points.

The value interval of Person is [− 1,1]. When a person is close to 1, X and Y have a strong linear correlation, showing that the information loss is relatively small after denoising. In the experiment, only data with a Person coefficient greater than 0.93 after filtering is retained to ensure that the filtered data retains the original information to the greatest extent. The calculation result of the signal Person coefficient in Fig. 6 is 0.99.

Signals collected clinically are actually the slice data of pulse wave signals. Few records of pulse wave database itself exist; not every record has all categories, and even some categories appear only on a few records. In this case, to ensure that categories are not omitted as far as possible, the PWC should be divided separately into beats³². In the experiment, the PWC is divided into cycles according to the trough position. The signal segment of the adjacent valleys is regarded as one PWC cycle. The specific pre-treatment process is shown in Fig. 6.

Setting of model parameters

The pre-processed PWC is randomly divided into training set and test set at a ratio of 4:1. Under the condition of a certain amount of data, the classification accuracy of the model no longer increases before the number of hidden nodes of the neural network increases to a certain extent³³. Therefore, the number of hidden nodes in the recursive network layer is set to 32, the number of kernels in the periodic feature-extraction layer is set to 32 and 64, the number of kernels in the rhythm feature-extraction layer is set to 32 and 64. After verification through multiple experiments, the network training parameters are set, as shown in Table 3.

Table 3 Network hyperparameters.

Full size table

Experiment

The number of PWC cycles input of the model is expressed as a unit cycle. An increase in amount of cycles of model input leads to an increase in depth of the recursive network. Furthermore, the proportion of features extracted by the periodic feature-extraction layer and the rhythm feature-extraction layer increase. At the same time, the network dimension increases leads to decreased training speed of the model. To determine the optimal number of unit cycles for a suitable PWC multi-feature scale model, experiments are conducted for different unit cycles. Training set II is used in the experiment, and each group is trained 50 times. The final results are shown in Table 4. According to the table, when the number of unit cycles is 5, the model has the optimum effect.

Table 4 The influence of different unit cycles on experimental results.

Full size table

On the premise of retaining the structure of the pre-processing layer and the reasoning layer, as well as the network parameters, the network layer is changed for comparison experiments. Zhan³⁴ et al. extracted features through CNN for classification. Liu³⁵ et al. extracted PWC timing features through parallel CNN structures, including signal segment features within a period and multi-period features representing cycle relationships; Ghosh³⁶ et al. extracted PWC features based on LSTM to predict systolic and diastolic blood pressure. The training process of different network models is shown in Figure 7.

Performance comparison with other neural network methods under different datasets is shown in Table 5. According to the table, the multi-scale model has good classification performance on dataset I and dataset II.

Table 5 Performance comparison of multi-scale model and other methods.

Full size table

Discussion

A small-parameter unit structure and a multi-scale feature-extraction model are proposed to obtain improved results under the condition of few-shot pulse wave contour classification. Cardiovascular-disease classification experiments were carried out on PPG dataset and CNBP dataset. To evaluate the classification performance of the multi-scale feature-extraction model, three different classification methods including single neural network model³⁴, parallel neural network model³⁵, and LSTM model³⁶ were introduced for comparison. Comparative studies show that the multi-scale feature-extraction model outperforms the other classification methods in terms of identification accuracy, stability, and sensitivity, and the multi-scale feature-extraction model consumes less time for training. For the proposed novel PWC classification approach, the model is notably sensitive to the number of unit cycles, and we find that the best unit cycle is five. Also, the multi-scale feature-extraction model depends on the division result of the PWC cycle.

Moreover, (a) the classification problem of few-shot PWC is well handled; (b) owing to the limitation of data volume, only binary classification is carried out; (c) PWC must be pre-processed (including EMD filtering and beat segmentation) before feature learning, which may lead to missing partial information. Finally, given the excellent performance of the multi-scale feature-extraction model in the few-shot PWC experiments, the classification of few-shot PWC is interesting and meaningful to study, and it will be further investigated in our future research.

References

Fan, F., Yan, Y., Tang, Y. & Zhang, H. A motion-tolerant approach for monitoring spo(2) and heart rate using photoplethysmography signal with dual frame length processing and multi-classifier fusion. Comput. Biol. Med. 91, 291–305 https://doi.org/10.1016/j.compbiomed.2017.10.017. (2017)
Article PubMed Google Scholar
Phillips, J. P., Hickey, M. & Kyriacou, P. A. Evaluation of electrical and optical plethysmography sensors for noninvasive monitoring of hemoglobin concentration. Sensors 12, 1816–1826 https://doi.org/10.3390/s120201816 (2018).
Article CAS Google Scholar
Papini, G. B. et al. Photoplethysmography beat detection and pulse morphology quality assessment for signal reliability estimation. In Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Seogwipo, Korea, Jul 11–15, 2017, 117–120, https://doi.org/10.1109/EMBC.2017.8036776 (IEEE, 2017).
Denisse et al. A review on wearable photoplethysmography sensors and their potential future applications in health care. Int. J. Biosens. Bioelectron. 4, 195–202 https://doi.org/10.15406/ijbsbe.2018.04.00125 (2018).
Article Google Scholar
Jermana, M. et al. Advances in photopletysmography signal analysis for biomedical applications. Sensors 18, 1894 https://doi.org/10.3390/s18061894 (2018).
Article Google Scholar
Srihari, A., Mousumi, P. & Srivatsa, K. A comparative study and analysis of time series forecasting techniques. SN Comput. Sci. 1, 175 https://doi.org/10.1007/s42979-020-00180-5 (2020).
Article Google Scholar
Pirhonen, M. & Vehkaoja, A. Fusion enhancement for tracking of respiratory rate through intrinsic mode functions in photoplethysmography. Biomed. Signal Process. Control 59, 101887 https://doi.org/10.1016/j.bspc.2020.101887 (2020).
Article Google Scholar
Makridakis, S., Spiliotis, E. & Assimakopoulos, V. Statistical and machine learning forecasting methods: Concerns and ways forward. Plos One 13, e0194889 https://doi.org/10.1371/journal.pone.0194889 (2018).
Article CAS PubMed PubMed Central Google Scholar
Poernomo, A. & Kang, D. K. Biased dropout and crossmap dropout: Learning towards effective dropout regularization in convolutional neural network. Neural Netw. 104, 60–67 https://doi.org/10.1016/j.neunet.2018.03.016 (2018).
Article PubMed Google Scholar
Sui, M., Wang, Y., Han, X. & Kloeden, P. E. Random recurrent neural networks with delays. J. Differ. Equ. 269, 8597–8639 https://doi.org/10.1016/j.jde.2020.06.008 (2020).
Article ADS MathSciNet MATH Google Scholar
Quax, S. C., D’Asaro, M. & van Gerven, M. A. J. Adaptive time scales in recurrent neural networks. Sci. Rep. 10, 11360 https://doi.org/10.1101/800540 (2020).
Article PubMed PubMed Central Google Scholar
Tang, X., Machimura, T., Li, J., Liu, W. & Hong, H. A novel optimized repeatedly random undersampling for selecting negative samples: A case study in an SVM-based forest fire susceptibility assessment. J. Environ. Manag. 271, 111014 https://doi.org/10.1016/j.jenvman.2020.111014 (2020).
Article Google Scholar
Wang, D., Chen, G., Xing, D. & Liang, D. Fault diagnosis method for switch control circuit based on SVM-AdaBoost. J. Meas. Sci. Instrum. 11, 251–257 https://doi.org/10.3969/j.issn.1674-8042.2020.03.008 (2020).
Article Google Scholar
Peralta, E., Lazaro, J., Bailon, R., Marozas, V. & Gil, E. Optimal fiducial points for pulse rate variability analysis from forehead and finger photoplethysmographic signals. Physiol. Meas. 40, 025007 https://doi.org/10.1088/1361-6579/ab009b (2019).
Article PubMed Google Scholar
Karim, F., Majumdar, S., Darabi, H. & Chen, S. Lstm fully convolutional networks for time series classification. IEEE Access 6, 1662–1669 https://doi.org/10.1109/ACCESS.2017.2779939 (2018).
Article Google Scholar
Alayba, A. M., Palade, V., England, M. & Iqbal, R. A combined CNN and LSTM model for Arabic sentiment analysis. Lecture Notes Comput. Sci. 11015, 179–191 https://doi.org/10.1007/978-3-319-99740-7_12 (2018).
Article Google Scholar
Wu, S. et al. Dependency-to-dependency neural machine translation. IEEE-ACM Trans. Audio Speech Lang. Process. 26, 2132–2141 https://doi.org/10.1109/TASLP.2018.2855968 (2018).
Article Google Scholar
Huang, D., Fu, Y., Qin, N. & Gao, S. Fault diagnosis of high-speed train bogie based on LSTM neural network. Sci. China Inf. Sci. 64, 119203 https://doi.org/10.1007/s11432-018-9543-8 (2020).
Article Google Scholar
Xue, Z. Y., Xiahou, K. S., Li, M. S., Ji, T. Y. & Wu, Q. H. Diagnosis of multiple open-circuit switch faults based on long short-term memory network for DFIG-based wind turbine systems. IEEE J. Emerg. Select. Top. Power Electron. 8, 2600–2610 https://doi.org/10.1109/JESTPE.2019.2908981 (2020).
Article Google Scholar
Wang, D. & Chen, J. Supervised speech separation based on deep learning: An overview. IEEE/ACM Trans. Audio Speech Lang. Process. 26, 1702–1726 https://doi.org/10.1109/TASLP.2018.2842159 (2018).
Article PubMed PubMed Central Google Scholar
Zhao, J., Mao, X. & Chen, L. Speech emotion recognition using deep 1d & 2d CNN LSTM networks. Biomed. Signal Process. Control 47, 312–323 https://doi.org/10.1016/j.bspc.2018.08.035 (2019).
Yildirim, O. A novel wavelet sequence based on deep bidirectional LSTM network model for ECG signal classification. Computers in Biology and Medicine 96, 189–202 https://doi.org/10.1016/j.compbiomed.2018.03.016 (2018).
Article PubMed Google Scholar
Tan, J. H. et al. Application of stacked convolutional and long short-term memory network for accurate identification of CAD ECG signals. Computers in Biology and Medicine 94, 19–26 https://doi.org/10.1016/j.compbiomed.2017.12.023 (2018).
Article PubMed Google Scholar
Dias, I., Fatih Demirci, M. & Adnan, Y. Speech emotion recognition with deep convolutional neural networks. Biomed. Signal Process. Control 59, 101894 https://doi.org/10.1016/j.bspc.2020.101894 (2020).
Article Google Scholar
Ordonez, F. J. & Roggen, D. Deep convolutional and LSTM recurrent neural networks for multimodal wearable activity recognition. Sensors 16, 115 https://doi.org/10.3390/s16010115 (2016).
Article Google Scholar
Xiao, X., Fu, D., Shi, Y. & Wen, J. Optimized mahalanobis-taguchi system for high-dimensional small sample data classification. Comput. Intell. Neurosci.https://kopernio.com/viewer?doi=10.1155%2F2020%2F4609423 (2020).
Guo, Y. et al. Mobile photoplethysmographic technology to detect atrial fibrillation. Journal of the American College of Cardiology 74, 2365–2375 https://doi.org/10.1016/j.jacc.2019.08.019 (2019).
Article PubMed Google Scholar
Senturk, U., Polat, K. & Yucedag, I. A non-invasive continuous cuffless blood pressure estimation using dynamic recurrent neural networks. Appl. Acoust. 170, 107534 https://doi.org/10.1016/j.apacoust.2020.107534 (2020).
Article Google Scholar
Liang, Y., Chen, Z., Liu, G. & Elgendi, M. A new, short-recorded photoplethysmogram dataset for blood pressure monitoring in China. Sci. Data 5, 180020 https://doi.org/10.1038/sdata.2018.20 (2018).
Article PubMed PubMed Central Google Scholar
Tjahjadi, H., Ramli, K. & Murfi, H. Noninvasive classification of blood pressure based on photoplethysmography signals using bidirectional long short-term memory and time-frequency analysis. IEEE Access 8, 20735–20748 https://doi.org/10.1109/ACCESS.2020.2968967 (2020).
Article Google Scholar
Abdulhay, E. et al. Computer-aided autism diagnosis via second-order difference plot area applied to EEG empirical mode decomposition. Neural Comput. Appl. 32, 10947–10956 https://doi.org/10.1007/s00521-018-3738-0 (2020).
Article Google Scholar
Ding, X., Cheng, F., Morris, R., Chen, C. & Wang, Y. Machine learning-based signal quality evaluation of single-period radial artery pulse waves: Model development and validation. JMIR Med. Inf. 8, e18134 https://doi.org/10.1038/sdata.2018.20 (2020).
Article Google Scholar
Duan, M., Li, K., Liao, X. & Li, K. A parallel multiclassification algorithm for big data using an extreme learning machine. IEEE Trans. Neural Netw. Learn. Syst. 29, 2337–2351 https://doi.org/10.1109/TNNLS.2017.2654357 (2018).
Article MathSciNet PubMed Google Scholar
Zhan, Q., Wang, W. & Haan, G. Analysis of CNN-based remote-PPG to understand limitations and sensitivities. Biomedical Optics Express 11, 1268–1283 https://doi.org/10.1364/BOE.382637 (2020).
Article PubMed PubMed Central Google Scholar
Liu, G. & Zhou, W. Aliasing classification of pulse wave characteristics in time-frequency domain based on convolutional neural network (in Chinese). J. Jilin Univ. (Eng. Technol. Edn.) 50, 1818–1825, https://doi.org/10.13229/j.cnki.jdxbgxb20190504 (2020).
Article Google Scholar
Ghosh, S., Banerjee, A., Ray, N., Wood, P. W. & Boulanger, P. Using accelerometric and gyroscopic data to improve blood pressure prediction from pulse transit time using recurrent neural network. In Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, CANADA, APR 15-20, 2018, 935–939, https://ieeexplore.ieee.org/document/8461959 (2017).

Download references

Acknowledgements

This research was supported by the National Key R&D Program of China (Grant no. 2020YFC2006100), the Key Project at Central Government Level (Grant no. 2060302), and the Outstanding Foreign Scientist Support Project in Henan Province (Grant no. GZS2019008).

Author information

Authors and Affiliations

School of Electrical Engineering, Zhengzhou University, Zhengzhou, 450001, China
Peng Lu, Chao Liu, Xiaobo Mao & Hanzhang Wang
Research Center for Intelligent Science and Engineering Technology of TCM, Zhengzhou, 450001, China
Peng Lu, Chao Liu, Xiaobo Mao, Yvping Zhao & Hanzhang Wang
China Academy of Chinese Medical Sciences, Beijing, 100000, China
Yvping Zhao
Internet Medical and Health Service Henan Collaborative Innovation Center, Zhengzhou, 450001, China
Hongpo Zhang
The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450001, China
Lili Guo
First Affiliated Hospital of Shihezi University Medical College, Shihezi, 832000, China
Lili Guo

Authors

Peng Lu
View author publications
You can also search for this author in PubMed Google Scholar
Chao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaobo Mao
View author publications
You can also search for this author in PubMed Google Scholar
Yvping Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Hanzhang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Hongpo Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Lili Guo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.L. conceived the experiments, C.L. and H.W. conducted the experiments, C.L. and H.W. analyzed the results, X.M., Y.Z., H.Z. and L.G. read and revised the paper. All authors reviewed the manuscript.

Corresponding author

Correspondence to Peng Lu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lu, P., Liu, C., Mao, X. et al. Few-shot pulse wave contour classification based on multi-scale feature extraction. Sci Rep 11, 3762 (2021). https://doi.org/10.1038/s41598-021-83134-y

Download citation

Received: 27 August 2020
Accepted: 14 January 2021
Published: 12 February 2021
DOI: https://doi.org/10.1038/s41598-021-83134-y
Springer Nature Limited

Few-shot pulse wave contour classification based on multi-scale feature extraction

Abstract

Similar content being viewed by others

Effective IoT Based Analysis of Photoplethysmography Waveforms for Investigating Arterial Stiffness and Pulse Rate Variability

Pulse-Wave-Pattern Classification with a Convolutional Neural Network

Blood pressure estimation and classification using a reference signal-less photoplethysmography signal: a deep learning framework

Introduction