Keywords

1 Introduction

During the last two decades, the Photoplethysmography (PPG) technique has become an emerging area of research for its wide application in clinical physiological monitoring [1]. PPG is a simple, noninvasive optoelectronic method to detect the blood volume changes on the peripheral body parts. The PPG sensor detects change in light intensity due to blood volume changes as a result of cardiac systole and diastole [2, 3] using near red and infrared wavelength light source and detector. It is found that PPG waveform can provide a lot of valuable clinical information such as blood pressure, heart rate, respiratory rate, and cardiac output [4,5,6].

However, for ubiquitous and real-time measurement system, PPG signals are easily prone to motion artifacts (MA) which could lead to inaccurate interpretation of the PPG waveform. The common sources of errors are ambient light at the photodetector, poor contact of the PPG sensor and the skin, patient’s movement, respiration, etc. [7, 8]. The motion artifacts which mix the raw PPG signal that resides in the same frequency range with the actual PPG data (0–2 Hz). In particular, the motion artifact reduction is the most challenging issue. Over the last few years, many researchers focused on the area of motion artifacts removal technique from PPG signal.

The adaptive filtering technique used in real-time application is a well-known method for motion artifacts removal. But it requires an additional sensor to provide reference signal which is the major drawback of this method [9, 10]. AS-LMS adaptive filter eliminates this drawback by generating the reference signal internally from the MA-corrupted PPG itself. Principal Component Analysis (PCA) is a popular signal processing tool that can be effectively used for noise elimination, signal separation, and feature extraction. By selecting suitable number of principal components, the dimension of the actual data can be reduced but also retaining maximum possible information of the actual data [11]. In [12], the frequency characteristics of the PPG signal are analyzed to characterize the clean PPG and motion artifact. It is observed that the raw PPG has fundamental, second and third harmonic structure caused by arterial blood flow and component caused by motion artifact. The raw signal is divided into different groups depending on the relative location of these frequency components. The motion artifact is composed of single frequency component. Another frequency domain analysis method named Empirical Mode Decomposition (EMD), that empirically extracts the oscillatory behavior from the signal and decompose a time series data into a number of intrinsic mode function (IMFs). In [13], Hilbert Hung Transform is used to generate instantaneous frequency for each IMF. The PPG signal is divided into six scales of different frequency bands. According to mean frequency, the scales 1–3 match with clean PPG signal and its harmonics and scales 4–6 correlate with the motion artifacts. An improved algorithm is presented in [14] which combine PCA with the EMD method to generate a smaller group of orthogonal variables. A fewer number of principal components are able to represent the clinical information of the signal, and this number is always less than the number of IMFs. In all of these frequency domain methods, the Fast Fourier Transform (FFT) is used to convert the signal from time domain to frequency domain. Wavelet transform is a powerful tool that has been widely used in ECG signal denoising. However, its application in PPG preprocessing is comparatively fewer [15, 16]. In [16], application of wavelet transform for MA reduction is described. However, the authors only evaluated the systolic peak amplitude variations and spectrum analysis and no detailed study on error parameters was done.

In this work, we used synthetic noisy PPG by imposing three different noise levels and decompose the signal using DWT. The performance of different mother wavelets and different thresholding rules on signal reconstruction quality was also investigated. Additionally, a detailed study on error parameters was done to evaluate the effectiveness of the proposed method.

2 Methodology

2.1 Data Collection Protocol

The PPG signal was recorded from forty male and female volunteers in the age group 20–45 years. Prior to data collection, they were requested to sit in a relaxed condition. A transmission type PPG sensor was placed on the left hand index finger, kept in stationary condition, while an accelerometer was attached on the right hand index finger. For our application, we considered only horizontal movement of the hand (X-axis). The right hand was moved from elbow at three predefined levels of motion (level, i.e., 10, 15, and 20 cm), each non-periodic. For every subject, data collection was done for 30 s. at 60 Hz using NI6009 DAQ device. The motion data was added with the clean PPG signal in different proportions to generate synthetic noisy signal. The PPG data was stored in calibrated voltage format against sampling time in <*.txt> formatted file.

2.2 Wavelet Denoising Approach

Because the PPG signal is non-stationary in nature, the discrete wavelet transform (DWT) is an appropriate signal processing tool for analyzing the time–amplitude information from a signal. It analyzes the signal at different resolutions by decomposing it into various successive frequency bands. The signal is passed through a series of high-pass filter and low-pass filter.

Since in DWT the signal is a discrete time function, it can be represented as x[n], where n is an integer. The signal is passed through a half band digital low-pass filter with impulse response h[n]. The convolution operation in discrete time is defined as follows:

$$ x[n] * h[n] = \sum\limits_{k = - \infty }^{\infty } {x[k] \cdot h[n - k]} $$
(1)

The half band low-pass filter removes all the frequencies that are above half of the highest frequency in the signal.

The denoising algorithm is given below:

  1. a.

    Decomposition—The input signal was decomposed up to level 5 (to get a frequency level around 1.7 Hz, that of PPG signal) by choosing appropriate wavelet function using Daubechies6 (Db6) as the basis wavelet. The decomposition structure is shown in Fig. 1.

    Fig. 1
    figure 1

    Wavelet decomposition structure of the PPG data

  2. b.

    Thresholding—In ‘soft’ thresholding, the coefficients below a threshold value are set to zero, while the magnitude of others is truncated by the same value. Mathematically, it is expressed as

    $$ \begin{aligned} \hat{d}_{j,\,k} & = sign(d_{j,\,k} )|d_{j,\,k} - t|,\;if\;|d_{j,\,k} | > t \\ & = 0,\;otherwise \\ \end{aligned} $$
    (2)

    where d j, k represents the kth wavelet coefficient in jth level, \( \hat{d}_{j,\,k} \) is the thresholded value, and t is the threshold. There are various soft thresholding rules, like fixed threshold (‘sqtwolog’), ‘rigrsure’ based on Stein’s unbiased risk estimate, ‘heursure’ a combination of first two rules, and ‘minimax’ based on mean square error [17].

    The soft thresholding rules were selected for our application, and it is applied in each level (A5, D5, …, D1) for selective truncation of the wavelet coefficients. The thresholded coefficients at different levels are shown in Fig. 2.

    Fig. 2
    figure 2

    Soft thresholding applied at different levels using Db6

  3. c.

    Reconstruction—After addition of the selected coefficients at each level, the reconstruction was done by the inverse discrete wavelet transform (IDWT).

3 Results and Discussions

For evaluating the denoising performance, the objective assessment was done by computing the following metrics which are common for any denoising operation.

$$ \begin{array}{*{20}l} {MSE\;(Mean\;Square\;Error) = \frac{1}{N}\sum\limits_{i} {[x(i) - \tilde{x}(i)]^{2} } } \hfill \\ {MAE\;(Maximum\;Absolute\;Error) = \hbox{max} [x(i) - \tilde{x}(i)]} \hfill \\ {PNR\;(Percentage\;Noise\;retention) = \frac{{P_{ds} - P_{cs} }}{{P_{cs} }} \times 100} \hfill \\ {SNR_{i} \;(Signal\;to\;noise\;ratio\;improvement) = 10\, \times \,\log \frac{{\sum\nolimits_{i} {\frac{{x^{2} (i)}}{{n^{2} (i)}}} }}{{\sum\nolimits_{i} {\frac{{x^{2} (i)}}{{[x(i) - \tilde{x}(i)]^{2} }}} }}} \hfill \\ \end{array} $$
(1)

where x = clean signal, \( \tilde{x} \) = reconstructed signal, Pcs = power of clean signal, N = number of samples, Pds = power of reconstructed signal, n = noisy signal, and P = 10 × log∑|x(i)|2. Among these, MSE provides the global error figure, since errors can be positive or negative. PNR provides the residual noise that is retained in the preprocessed signal. SNRi actually denotes the ratio of input and output SNR (to the preprocessing algorithm), where x(i)−\( \tilde{x} \)(i) is the residual noise. MAE is the maximum absolute sample-to-sample error after reconstruction and local residual noise. Table 1 shows these parameters for six arbitrarily chosen healthy volunteers’ data for three levels of motion using ‘rigrsure’ thresholding rule. Among the parameters, we observed that the MAE, averaged over the six volunteers, remains almost unchanged, while MSE and PNR increase slightly with enhanced noise, as expected. Computed over all volunteers’ data, for level 1 noise, the average MSE, PNR, MAE, and SNR are 0.104, 2.05, 0.89 mV, and 14.12, respectively. When the noise level increases to level 2, these values are 0.143, 2.44, 0.983 mV, and 11.45, respectively. For highest level 3, these values are 0.230, 2.855, 0.910 mV, and 12.54, respectively. In summary, with noise-level increase, the change in MSE, PNR, MAE, and SNR are 37%, 19%, 10%, and 18%, respectively, from level 1 to 2 and 60%, 16%, 7%, and 9.51%, respectively, for level 2 to level 3. Among these, the MAE is least changed, which means the local distortion could be kept lower in the denoising process.

Table 1 Denoising parameters for db6 mother wavelet function

The denoising performance is also pictorially represented using Fig. 3 for volunteer 1. The upper panel shows the original and reconstructed signal superimposed, with the residual, i.e., the sample-to-sample error in lowest panel.

Fig. 3
figure 3

Subjective assessment of denoising performance

Table 2 shows the denoising parameter values for different noise levels using various thresholding rules for volunteer #1 using Db6 wavelet. Here, we studied volunteer 1 data for different levels of noise. It is clearly observable from Table 1 and 2 that MSE value is minimum for ‘rigrsure’ thresholding rule, while ‘heursure’ provides the least performance. For local distortion assessment, the fixed thresholding provides worst performance, but its SNR improvement is very good (Fig. 4).

Table 2 Denoising parameters for different thresholding rules (Volunteer #1, Db6)
Fig. 4
figure 4

Deviation of clinical signatures in the reconstructed signal for five volunteers

In the next phase, four clinical features: systolic peak height, diastolic peak height, peak-to-peak time, and crest time of the reconstructed data were compared with the original signal to assess the percentage deviation for five volunteers as in Table 1. The used noise levels are level 1 at a proportion of 100 and 200%. The deviations are shown in Fig. 5. It is observed that for volunteer #1, there is no variation of peak-to-peak time event for 200% of noise, while the other features slightly increase. For volunteer #2, the crest time changes by 25% in denoised data. As a general observation, using level 1 noise at 200% proportion, most of the features could be retained within 15% of their original values.

Fig. 5
figure 5

Denoising performance of different mother wavelet functions

Table 3 shows the different denoising parameters for various mother wavelet functions for volunteer #1 and noise level 1. It is observed from Tables 1 and 3 that the denoising parameters show better result for Db6 mother wavelet. In terms of MSE, the Symlets 2, 3, and 4 provide the worst performance. The denoising performance is also graphically shown in Fig. 5. In the published literature using DWT-based PPG denoising [15, 16], no quantitative performance is provided. Only [9] reports SNR of 1.732 dB with horizontal motion of PPG sensor and [11] reports SNR improvement from 0.078 to 0.318 dB. The proposed method achieved an average SNRi of 7.40 dB.

Table 3 Denoising parameters for different mother wavelet functions (volunteer #1)

4 Conclusion

In this paper, we present the application of wavelet transform for reduction of motion artifact from PPG signal. Among all wavelets used in the study, the Daubechies wavelet provided best result using ‘rigrsure’ soft thresholding rule. Using three different intensity levels of noise, the reconstructed waveform could retain the most of clinical signatures within 15% of their respective values. We have also made a comparative study between different denoising parameters by using other mother wavelet functions.