Introduction

Arterial oxygen saturation reflects the relative amount of oxyhemoglobin in the blood. The most common method to measure it is based on pulse oximetry, whereby oxidized hemoglobin and reduced hemoglobin have significantly different optical spectra. Specifically, at a wavelength of about 660 nm, and a second wavelength between 805 and 960, there is a large difference in light absorbance between reduced and oxidized hemoglobin. A measurement of the percent oxygen saturation of blood is defined as the ratio of oxyhemoglobin to the total concentration of hemoglobin present in the blood. Pulse oximetry assumes that the attenuation of light is due to both the blood and bloodless tissue. Fluctuations of the PPG signal are caused by changes in arterial blood volume associated with each heartbeat, where the magnitude of the fluctuations depends on the amount of blood rushing into the peripheral vascular bed, the optical absorption of the blood, skin, and tissue, and the wavelength used to illuminate the blood.

The pulse oximeter signal contains not only the blood oxygen saturation and heart rate (HR) data, but also other vital physiological information. The fluctuations of photoplethysmogram (PPG) signals contain the influences of arterial, venous, autonomic and respiratory systems on the peripheral circulation. In the current environment where health care costs are ever increasing, a single sensor that has multiple functions is very attractive from a financial perspective. Moreover, utilizing a pulse oximeter as a multi-purpose vital sign monitor has clinical appeal, since it is familiar to the clinician and comfortable for the patient. Knowledge of respiratory rate3 and HR patterns would be provide more useful clinical information in many situations in which pulse oximeter is the sole monitor available.

Although there are many promising and attractive features of using pulse oximeters for vital sign monitoring, currently they are used on stationary patients. This is mainly because motion and noise artifacts (MNAs) result in unreliable HR and SpO2 estimation. Clinicians have cited motion artifacts in pulse oximetry as the most common cause of false alarms, loss of signal, and inaccurate readings.14 A smart watch with a PPG sensor is currently commercially available for monitoring HRs (www.mioglobal.com). However, MNA is a big source of problem for accurate vital extraction from PPG signals and prevents wide adoption of this potentially useful technology for mobile health.

In practice MNA are difficult to remove because they do not have a predefined narrow frequency band and their spectrum often overlaps that of the desired signal.27 Consequently, development of algorithms capable of reconstructing the corrupted signal and removing artifacts is challenging.

There are a number of general techniques used for artifact detection and removal. One of the methods used to remove motion artifacts is adaptive filtering.1,5,15,19,24 An adaptive filter is easy to implement and it also can be used in real-time applications, though the requirement of additional sensors to provide reference inputs is the major drawback of such methods.

There are many MNA reduction techniques based on the concept of blind source separation (BSS). BSS is attractive and has garnered significant interest since this approach does not require a reference signal. The aim of the BSS is to estimate a set of uncorrupted signals from a set of mixed signals which is assumed to contain both the clean and MNA sources.2 Some of the popular BSS techniques are independent component analysis (ICA),4 canonical correlation analysis (CCA),28 principle component analysis (PCA),13 and singular spectrum analysis (SSA).6

In ICA, the recorded signals are decomposed into their independent components or sources.4 CCA uses the second order statistics to generate components derived from their uncorrelated nature.8 PCA is another noise reduction technique which aims to separate the clean signal dynamics from the MNA data. A multi-scale PCA has been also proposed to account for time-varying dynamics of the signal and motion artifacts from PPG recordings.20

A promising approach that can be applied to signal reconstruction is the SSA. The SSA is a model-free BSS technique, which decomposes the data into a number of components which may include trends, oscillatory components, and noise (see historical reviews in Ref. 10). The main advantage of SSA over ICA is that SSA does not require user input to choose the appropriate components for reconstruction and MNA removal. Comparing PCA to SSA, SSA can be applied in cases where the number of signal components is more than the rank of the PCA covariance matrix. Applications of the SSA include extraction of the amplitude and low frequency artifacts from single channel EEG recordings,26 and removing heart sound dynamics from respiratory signals.9

In this paper, we introduce a novel approach to reconstruct a PPG signal from those portions of data that have been identified to be corrupted using the algorithm detailed in Part I of the companion paper. The fidelity of the reconstructed signal was determined by comparing the estimated SpO2 and HR to reference values. In addition, we compare the reconstructed SpO2 and HR values obtained via the time-domain ICA (TD-ICA) to our method. We have chosen to compare our method to TD-ICA since the latter has recently been shown to provide good reconstruction of corrupted PPG signals.18

Materials and Method

Experimental Protocol and Preprocessing

Three sets of data were collected from healthy subjects recruited from the student community of Worcester Polytechnic Institute (WPI). This study was approved by WPI’s institutional review board and all the subjects gave informed consent before data recording.

In the first experiment, 11 healthy volunteers were asked to wear a forehead reflectance pulse oximeter developed in our lab along with a reference Masimo Radical (Masimo SET®) finger transmittance pulse oximeter. PPG signals from the forehead sensor and reference (HR) derived from a finger pulse oximeter were acquired simultaneously. The HR and SpO2 signals were acquired at 80 and 1 Hz, respectively. After baseline recording for 5 min without any movement (i.e., clean data), motion artifacts were induced in the PPG data by the spontaneous movements in both horizontal and vertical directions of the subject’s head while the right middle finger was kept stationary. Subjects were directed to introduce the motions for specific time intervals that determined the percentage of noise within each 1 min segment, varying from 10 to 50%. For example, if a subject was instructed to make left–right movements for 6 s, an 1 min segment of data would contain 10% noise. Note that noise amplitudes varied among subjects due to their head movements.

The second dataset consisted of finger-PPG signals from the nine healthy volunteers in an upright sitting posture using an infrared reflection type PPG transducer (TSD200). An MP1000 pulse oximeter (BIOPAC Systems Inc., CA, USA) was also used to acquire finger PPG signals at 100 Hz. One pulse oximeter of each model was placed on the same hand’s index finger (one model) and middle finger (the other model) simultaneously. After baseline recording for 5 min without any movement (i.e., clean data), motion artifacts were induced in the PPG data by the left–right movements of the index finger while the middle finger was kept stationary to provide a reference. This was accomplished by placing the middle finger on top of the edge of a table while the index finger moved in the free space off the edge of the table. Similar to the first dataset, motion was induced at specific time intervals corresponding to 10–50% corruption duration in 1 min segments, i.e., the controlled movement was carried out five times per subject.

The third dataset consisted of data measurements from nine subjects with the PPG signal recorded from the subjects’ foreheads using our custom sensor (80 Hz) simultaneously with the reference ECG from a Holter Monitor at 180 Hz and reference HR and SpO2 derived from a Masimo (Rad-57) pulse oximeter at 0.5 Hz, respectively. The reference pulse oximeter provided HR and SpO2 measured from the subject’s right index finger, which was held steadily to their chest. The signals were recorded while the subjects were going through sets of walking and climbing up and down flights of stairs for approximately 45 min.

Once data were acquired, PPG signals from all three experiments outlined above were preprocessed offline using Matlab (MathWorks, R2012a). The PPG signals were filtered using a zero-phase forward-reverse 4th order IIR band-pass filter with cutoff frequency 0.5-12 Hz.

Motion Artifact Removal

To reconstruct the artifact-corrupted portion of the PPG signal that has been detected using the support vector machine approach provided in the accompanying paper, we propose a novel hybrid procedure using Iterative Singular Spectrum Analysis (ISSA) and a frequency matching algorithm. Henceforth, we will call these combined procedures the iterative motion artifact removal (IMAR) algorithm.

Singular Spectrum Analysis

The SSA is composed of two stages: singular decomposition and spectral reconstruction. The former is the spectral decomposition or eigen-decomposition of the data matrix whereas the latter is the reconstruction of the signal based on using only the significant eigenvectors and associated eigenvalues. The assumption is that given a relatively high signal-to-noise ratio of data, significant eigenvectors and associated eigenvalues represent the signal dynamics and less significant values represent the MNA components.

The calculation of the singular stage of the SSA consists of two steps: embedding followed by singular value decomposition (SVD). In essence, these procedures decompose the data into signal dynamics consisting of trends, oscillatory components, and MNA. The spectral stage of the SSA algorithm also consists of two steps: grouping and diagonal averaging. These two procedures are used to reconstruct the signal dynamics but without the MNA components. In the following section, we detail all four steps in the SSA algorithm.

Singular Decomposition

Embedding

Assume we have a nonzero real-value time series of length N samples, i.e., \( X = \{ x_{1} ,x_{2} , \ldots ,x_{N} \} \). In the embedding step, window length \( {{f_{s} } \mathord{\left/ {\vphantom {{f_{s} } {f_{l} }}} \right. \kern-0pt} {f_{l} }} < L < {N \mathord{\left/ {\vphantom {N 2}} \right. \kern-0pt} 2} \) is chosen to embed the initial time series, where \( f_{\text{s}} \) is the sampling frequency and \( f_{\text{l}} \) is the lowest frequency in the signal. We map the time series \( X \) into the \( L \) lagged vectors, \( X = \{ x_{i} ,x_{i + 1} , \ldots ,x_{i + L - 1} \} \) for \( i = 1, \ldots ,K \), where \( K = N - L + 1 \).10 The result is the trajectory data matrix \( T_{x} \) or vector \( X_{i} \) that is each row of \( T_{x} \) for \( i = 1, \ldots ,K \).

$$ T_{x} = \left[ \begin{gathered} X_{1} \hfill \\ X_{2} \hfill \\ \,\, \vdots \hfill \\ X_{K} \hfill \\ \end{gathered} \right] = \left[ {\begin{array}{*{20}c} {x_{1} } & {x_{2} } & \cdots & {x_{L} } \\ {x_{2} } & {x_{3} } & \cdots & {x_{L + 1} } \\ \vdots & \vdots & \ddots & \vdots \\ {x_{K} } & {x_{K + 1} } & \cdots & {x_{N} } \\ \end{array} } \right] $$
(1)

From Eq. (1), it is evident that the trajectory matrix, \( T_{x} \), is a Hankel matrix.

Singular Value Decomposition

The next step is to apply the SVD to the trajectory matrix \( T_{x} \) which results in eigenvalues and eigenvectors of the matrix \( T_{x} T_{x}^{T} \) where \( T_{i} \) for \( i = 1, \ldots ,L \) can be defined as \( T = USV^{T} \).10 \( U_{i} \) for \( 1 < i < L \) is a \( K \times L \) orthonormal matrix. S i for \( 1 < i < L \) is a diagonal matrix and \( V_{i} \) for \( 1 < i < L \) is an \( L \times L \) square orthonormal matrix, which is considered the principle component. In this step, \( T_{x} \) has \( L \) many singular values which are \( \sqrt {\lambda_{1} } > \sqrt {\lambda_{2} } > , \ldots ,\sqrt {\lambda_{L} } \). By removing components with eigenvalues equal to zero, the ith eigentriple of \( T_{i} \) can be written as \( U_{i} \times \sqrt {\lambda_{i} } \times V_{i}^{T} \) for \( i = 1,2, \ldots ,d \), in which \( d = \hbox{max} (i:\sqrt {\lambda_{i} } > 0) \) is the number of nonzero singular values of \( T_{x} \). Normally every harmonic component with a different frequency produces two eigentriples with similar singular values. So the trajectory matrix \( T_{x} \) can be denoted as10

$$ \begin{aligned} T_{x} & = T_{1} + T_{2} + \cdots + T_{d} \\ & = U_{1} \sqrt {\lambda_{1} } V_{1}^{T} + \cdots + U_{d} \sqrt {\lambda_{d} } V_{d}^{T} \\ & = \sum\limits_{i = 1}^{d} {U_{i} \sqrt {\lambda_{i} } V_{i}^{T} } \\ \end{aligned} $$
(2)

Projecting the time series onto the direction of each eigenvector yields the corresponding temporal principal component (PC).7

Spectral Reconstruction

The reconstruction stage has two steps: grouping and diagonal averaging. First the subgroups of the decomposed trajectory matrices are grouped and then a diagonal averaging step is needed so that a new time series can be formed.6

Grouping

The grouping step of the reconstruction stage decomposes the \( L \times K \) matrix \( T_{i} \) (\( i = 1,2, \ldots ,d \)) into subgroups according to the trend, oscillatory components, and MNA dynamics. The grouping step divides the set of indices \( \left\{ {1,2, \ldots ,d} \right\} \) into a collection of m disjoint subsets of \( I = \left\{ {I_{1} , \ldots ,I_{m} } \right\} \).12 Thus, \( T_{I} \) corresponds to the group \( I = \left\{ {I_{1} , \ldots ,I_{m} } \right\} \). \( T_{{I_{i} }} \) is a sum of \( T_{j} \), where \( j \in I_{i} \). So \( T_{x} \) can be expanded as

$$ T_{x} = \overbrace {{T_{1} + \cdots + T_{L} }}^{SVD} = \overbrace {{T_{{I_{1} }} + \cdots + T_{{I_{m} }} }}^{\text{Grouping}} $$
(3)

Diagonal Averaging

In the final step of analysis, each resultant matrix, \( T_{{I_{i} }} \) , in Eq. (3) is transformed into a time series of length \( N \). We obtain the new Hankel matrices \( \tilde{X}^{(i)} \) by averaging the diagonal elements of the matrix \( T_{{I_{i} }} \).7 Let \( H \) be denoted as the Hankel operator. So that we obtain the Hankel matrix \( \tilde{X}^{(i)} = HT_{{I_{i} }} \) for \( i = 1, \ldots ,m \).12 Under the assumption of weak separability and applying the Hankel procedure to all matrix components of Eq. (3), we obtain the following expansion

$$ X = \tilde{X}^{(1)} + \tilde{X}^{(2)} + \cdots + \tilde{X}^{(M)} $$
(4)

We can assert that \( \tilde{X}^{(1)} \) is related to the trend of the signal; however, harmonic and noisy components do not necessarily follow the order of \( \sqrt {\lambda_{1} } > \sqrt {\lambda_{2} } > \cdots > \sqrt {\lambda_{M} } \).

Iterative Motion Artifact Removal based on SSA

In order to reconstruct the MNA corrupted segment of the signal, an IMAR approach based on SSA was explained in the last section. The ultimate goodness of the reconstructed signal is determined by the accuracy of the estimated SpO2 and HR values. The top and bottom panels of Fig. 1 show clean and MNA corrupted signals, respectively. The most important part of the SSA is to choose the proper eigenvector components for reconstruction of the signal. Under the assumption of high SNR, the normal practice is to select only the largest eigenvalues and associated eigenvectors for signal reconstruction. However, most often it is difficult to determine the demarcation of the significant from non-significant eigenvalues. Further, the MNA dynamics can overlap with the signal dynamics, hence, choosing the largest eigenvalues does not necessarily result in an MNA-free signal.

Figure 1
figure 1

Typical infrared PPG signal; (a) clean, (b) corrupted with motion artifacts

To overcome the above limitations, we have modified the SSA approach. The first step of our modified SSA involves computing SVD on both a corrupted data segment and its most prior adjacent clean data segment. Under the assumption of a high SNR of the data, the second step is to retain only the top 5% of the eigenvalues and their associated eigenvectors. The third step is to replace the corrupted segment’s top 5% eigenvalues with the clean segment’s eigenvalues. The fourth step is to further limit the number of eigenvectors by choosing only those eigenvectors that have HRs between 0.66 Hz < f s < 3 Hz for both the clean and noise corrupted data segments. The two extreme HRs are chosen so that they account for possible scenarios that one may encounter with low and high HRs. With the remaining candidate eigenvectors resulting from step four, we further prune non-significant eigenvectors by performing frequency matching of the noise corrupted eigenvectors to those of the clean data segment’s eigenvectors, in the fifth step. Only those eigenvectors’ frequencies that match to those of the clean eigenvectors are retained from the pool of eigenvectors remaining from step four. For the remaining eigenvector candidates, we perform iterative SSA to further reduce MNA and match the dynamics of the clean data segments’ eigenvectors for the final step. For each iteration we perform the standard SSA algorithm. It is our experience that this convergence is achieved within 4 iterations.

Figure 2 shows an example of the iterative SSA procedure applied to candidate eigenvectors that have resulted from step four of the procedure for the modified SSA algorithm. Note that there may be several eigenvectors remaining after the fifth step, hence, this example shows an iterative SSA procedure performed on a particular set of candidate eigenvectors that may match most closely to an eigenvector of a clean data segment. The top row of panels of Fig. 2 represents one of the eigenvectors of the clean signal and the second row of panels represents the MNA corrupted signal’s candidate eigenvectors which have the same frequency as that of the clean signal’s eigenvector. The remaining lower panels of Fig. 2 represent the candidate eigenvectors after they have gone through four successive iterations of the SSA algorithm. For this portion of the SSA algorithm, we perform SVD on the trajectory matrix of Eq. (1) created from the candidate eigenvector and then reconstruct the eigenvectors based on SSA using only the first 3 largest eigenvalues obtained from the SVD. This process repeats iteratively until the shape of the reconstructed eigenvector closely resembles one of the clean eigenvectors with the same frequency. It can be seen from Fig. 2 that after 4 iterations the result shown in the (a) panel corresponds most closely to the clean signal’s eigenvector, hence, this eigenvector is selected rather than the eigenvectors shown in panels b–c. We calculate the discarding metric (DM) at each iteration and compare this value to the DM value of the corresponding clean component. The DM is calculated according to:

$$ DM = {\raise0.7ex\hbox{${\sum {\left| u \right|} }$} \!\mathord{\left/ {\vphantom {{\sum {\left| u \right|} } {L(u)}}}\right.\kern-0pt} \!\lower0.7ex\hbox{${L(u)}$}} $$
(5)

where u is the signal component, and \( \,\left| {\,.\,} \right| \), \( L(.) \) are absolute operator and component length, respectively. The entire procedure for the modified SSA algorithm is summarized in Table 1.

Figure 2
figure 2

Iterative reconstruction of a corrupted eigenvector with frequency of 0.967 Hz. Black font signals (top panels) represent the clean component with frequency of 0.967 Hz; Blue font signals (2nd rows) indicate the corrupted component with the same frequency; Pink font signals are related to iterative evolution of corrupted component to a clean oscillatory signal. (a) Reconstruction of 4th corrupted eigenvector compared to the corresponding clean component. The final pattern after 4 iterations resembles the black font clean component in the top panel. This component is chosen among the components with the same frequency, since it shows the most similarity to the black font clean component. (b) Reconstruction of 9th corrupted eigenvector compared to the corresponding clean component. (c) Reconstruction of 22nd corrupted eigenvector compared to the corresponding clean component

Table 1 Iterative motion artifact removal (IMAR) procedure

Results

Noise Sensitivity Analysis

To validate the proposed IMAR procedure, we added different SNR levels of Gaussian white noise (GWN) and colored noise to an experimentally collected clean segment of PPG signal. The purpose of the simulation was to quantitatively determine the level of noise that can be tolerated by the algorithm. Seven different SNR levels ranging from 10 dB to −25 dB were considered. For each SNR level, 50 independent realizations of GWN and colored noise were added separately to a clean PPG signal. The Euler–Maruyama method was used to generate colored noise.17

Figure 3 shows the results of these simulations with additive GWN. The left panels show pre- and post-reconstruction HR in comparison to the reference HR; the right panels show the corresponding comparison for SpO2. Tables 2 and 3 show the mean and standard deviation values of the pre- (2nd column) and post-reconstruction (4th column), and the reference (3rd column) HR and SpO2 values, respectively, for all SNR. The last columns of Tables 2 and 3 also show the estimated HR and SpO2 values obtained by the TD-ICA method.18 As shown in Fig. 3 and Tables 2 and 3, the reconstructed HR and SpO2 values using our IMAR approach were found to be not statistically different when compared to the reference values for all SNR except for −20 and −25 dB. However, the TD-ICA method fails and we obtain significantly different values to those of the reference HR and SpO2 values when the SNR is lower than −10 dB.

Figure 3
figure 3

(Left) HR estimated from reconstructed PPG for different additive white noise levels; (Right) SpO2 estimated from reconstructed PPG for different levels of additive white noise

Table 2 Comparison and statistical analysis of HR estimations from IMAR-reconstructed PPG for different levels of additive white noise
Table 3 Comparison & statistical analysis of estimations from IMAR-reconstructed PPG for different levels of additive white noise

Tables 4 and 5 show corresponding results to that of Tables 2 and 3 but with additive colored noise. Similar to the GWN case, the reconstructed HR and SpO2 values using the proposed IMAR approach are found to be not significantly different than the reference values for all SNR except for −20 and −25 dB. Moreover, the TD-ICA compares poorly compared to our IMAR as the HR and SpO2 values from the former method are found to be significantly different to the reference values for all SNR.

Table 4 Comparison and statistical analysis of HR estimations from IMAR-reconstructed PPG for different levels of additive colored noise
Table 5 Comparison and statistical analysis of SpO2 estimations from IMAR-reconstructed PPG for different levels of additive colored noise

Heart Rate and SpO2 Estimation from Forehead Sensor

As described in “Motion Artifact Removal” section, we collected PPG data under three different experimental settings so that our proposed approach could be more thoroughly tested and validated. For all three experimental settings, the efficacy of our IMAR approach for the reconstruction of the MNA-affected portion of the signal will be compared with the reference HR and SpO2 values for all experimental datasets. Unlike Masimo’s finger pulse oximeter, our custom-designed forehead pulse oximeter does not calculate moving average HR and SpO2 values. This is the reason why the standard deviations of HR and SpO2 values from the forehead sensor are much larger than those from Masimo’s finger pulse oximeter (see Tables 6, 7, 8, 9, 10).

Table 6 Comparison and statistical analysis of HR estimations from IMAR-reconstructed PPG for 10 different subjects (Head Experiment)
Table 7 Comparison and statistical analysis of SpO2 estimations from IMAR-reconstructed PPG for 10 different subjects (Head Experiment)
Table 8 Comparison and statistical analysis of HR estimations from IMAR-reconstructed PPG for 10 different subjects (Finger Experiment)
Table 9 Comparison and statistical analysis of HR estimations from IMAR-reconstructed PPG for 9 different subjects (Walking & Stair Climbing Experiment)
Table 10 Comparison and statistical analysis of SpO2 estimations from IMAR-reconstructed PPG for 9 different subjects (Walking & Stair Climbing Experiment)

For the error-free SpO2 estimation, Red and IR PPG signals with clearly separable DC and AC components are required. The pulsatile components of the Red and IR PPG signals are denoted as ACRed and DCRed, respectively, and the “ratio-of-ratio” \( R \) is estimated22,23 as

$$ R = \frac{{{{{\text{AC}}_{\text{Red}} } \mathord{\left/ {\vphantom {{{\text{AC}}_{\text{Red}} } {{\text{DC}}_{\text{Red}} }}} \right. \kern-0pt} {{\text{DC}}_{\text{Red}} }}}}{{{{{\text{AC}}_{\text{IR}} } \mathord{\left/ {\vphantom {{{\text{AC}}_{\text{IR}} } {{\text{DC}}_{\text{IR}} }}} \right. \kern-0pt} {{\text{DC}}_{\text{IR}} }}}} $$
(6)

Accordingly, SpO2 is computed by substituting the R value in an empirical linear approximate relation given by

$$ {\text{SpO}}_{2} (\% ) = (110 - 25R)(\% ) $$
(7)

After applying the proposed IMAR procedure to the identified MNA segment of the PPG signal, we estimate the SpO2 (using Eq. (6, 7)) and HR, and compare it to the corresponding reference and MNA contaminated segment values. As was the case with the “Noise Sensitivity Analysis” section, we compare the performance of the IMAR algorithm to the TD-ICA method. The top and bottom panels of Fig. 4 represent a representative HR and SpO2 comparison result, respectively. We can see from these figures that the estimated values for both HR (left panels) and SpO2 (right panels) from the IMAR (black font) track closely to the reference values recorded by the Masimo transmittance type finger pulse oximeter (red square line), while the estimated HR and SpO2 obtained from the TD-ICA method (green font) deviate significantly from the reference signal. Tables 6 and 7 show comparison of the IMAR and the TD-ICA reconstructed HR and SpO2 values, respectively, for all 10 subjects. As shown in Table 6, there was no significant difference between the finger reference HR and the IMAR reconstructed HR in 6 out of 10 subjects. However, there was significant difference between the finger reference HR and the TD-ICA reconstructed HR in all 10 subjects. Similarly, the reconstructed SpO2 values from the IMAR were found to be not significantly different than the finger reference values in 6 out of 10 subjects, but the TD-ICA method was found to be significantly different for all 10 subjects.

Figure 4
figure 4

(a) HR estimated from IMAR-reconstructed PPG compared to reference and corrupted PPG; (b) HR estimated from ICA-reconstructed PPG compared to reference and corrupted PPG; (c) SpO2 estimated from IMAR-reconstructed PPG compared to reference and corrupted PPG; (d) SpO2 estimated from ICA-reconstructed PPG compared to reference and corrupted PPG

PPG Signal Reconstruction Performance in Finger Experiment

The performance of the signal reconstruction of the proposed IMAR approach is compared to TD-ICA for the PPG data with an index finger moving left-to-right patterns. The pulse oximeter on the middle finger of the right hand, which was stationary, was used as the reference signal. Since the subjects were directed to produce the motions for 30 s within each 1-min segment, corresponding to 50% corruption by duration, the window length of both clean and corrupted segments were both set as half length of the signal. Table 8 compares the HR reconstruction results between the IMAR and TD-ICA methods for all 10 subjects. As shown in Table 8, the IMAR reconstructed HR values are not significantly different from the reference HR in 7 out of 10 subjects. However, the TD-ICA’s reconstructed HR is significantly different from the reference HR in 8 out of 10 subjects indicating poor reconstruction fidelity.

PPG Signal Reconstruction Performance for the Walking and Stair Climbing Experimental Data

The signal reconstruction of the MNA identified data segments of the walking and stair climbing experiments using our proposed IMAR and its comparison to TD-ICA are provided in this section. Detection of the MNA data segments was performed using the algorithm described in Part I of the companion paper. The reconstructed HR and SpO2 values using our proposed algorithm and TD-ICA are provided in Tables 9 and 10, respectively. For both HR and SpO2 reconstruction, the measurements were carried out using PPG data recorded from the head pulse oximeter. The right hand index finger’s PPG data was used as HR and SpO2 references. As shown in Table 9, 7 out of 9 subjects’ reconstructed HR values were found to be not significantly different from the reference HR values using our algorithm. While 2 subjects’ reconstructed HR values were found to be significantly different than the reference, the differences in the actual HR values are minimal. For TD-ICA’s reconstructed HR values, all values deviate significantly from the reference values.

For the reconstructed SpO2 values, our algorithm again significantly outperforms TD-ICA. All but one subject are not significantly different than the SpO2 reference values for TD-ICA. For our IMAR algorithm, only 4 out of 9 subjects do not show significant difference from the reference values. Note the zero standard deviation reference SpO2 values from Massimo’s pulse oximeter in 7 out of 9 subjects. This is because Massimo uses a proprietary averaging scheme based on several past values. Hence, it is possible that the significant difference seen with our algorithm in some of the subjects would turn out to be not significant if the averaging scheme were not used. While some of the SpO2 values from our algorithm are significantly different from the reference, the actual deviations are minimal and they are far less than with TD-ICA.

Discussion

In this study, a novel IMAR method is introduced to reconstruct MNA contaminated segments of PPG data. Detection of MNA using a support vector machine algorithm was introduced in the companion paper. The aim of the current paper is to reconstruct the MNA corrupted segments as closely as possible to the non-corrupted data so that accurate HRs and SpO2 values can be derived. The question is how to reconstruct the MNA data segments when there is no reference signal. To address this question, we use the most adjacent prior clean data segment and use its dynamics to derive the MNA contaminated segment’s HRs and oxygen saturation values. Hence, the key assumption with our IMAR technique is that signal’s dynamics do not change abruptly between the MNA contaminated segment and its most adjacent prior clean portion of data. Clearly, if this assumption is violated, the IMAR’s ability to reconstruct the dynamics of the signal will be compromised. We are currently working on a time-varying IMAR algorithm to address this issue.

There are hosts of algorithms available for MNA elimination and signal reconstruction. Various adaptive filter approaches to remove MNA have been proposed with good results but the test data to fully evaluate the algorithms are either limited or confined to laboratory controlled MNA involving simple finger or arm movements.11,19,24,25 Moreover, these adaptive filter methods work best when a reference signal is available.

For those methods that do not require a reference signal to remove MNA, there have been many algorithms developed based on variants of the ICA.16,18,21,25 Most of the ICA-based methods produced reasonably good signal reconstructions of the MNA contaminated data. However, most of these methods were validated on data that were collected using laboratory controlled MNA involving pre-defined simple side-to-side or up-and-down finger and arm movements.16,18,21,25

Given that ICA-based methods produced good signal reconstructions of the MNA contaminated data, we have compared our proposed approach to the TD-ICA method as described by Krishnan et al.18 using simulated data, laboratory controlled data as well as daily activity data involving both walking and stair climbing movements. Krishnan et al. proposed frequency-domain ICA and compared its performance to the TD-ICA but the improvement is marginal at the expense of higher computational complexity, hence, we used the latter method. Comparison of the performance of our method to TD-ICA was based on reconstruction of HR and SpO2 values since these measures are currently used by clinicians.

Comparing HR and SpO2 estimations of the reconstructed signal to the reference measurements using both simulation and experimental data have shown that the proposed IMAR method is a promising tool as the reconstructed values were found to be accurate. The simulation results from noise sensitivity analysis showed that SNR level down to −20 and −15 dB from additive white and colored noise, respectively, can be tolerated well by the application of the proposed IMAR procedure, compared to the SNR values of −10 and −15 dB for the TD-ICA method. Application of the proposed IMAR approach and the TD-ICA to three different sets of experimental data have also shown significantly better signal reconstruction performance with our IMAR algorithm. It is our opinion that ICA is not a good approach for signal reconstruction because its performance suffers from arbitrary scaling and random gain changes in the output signal; these would detrimentally affect the accuracy of SpO2 estimates.

The use of singular spectrum analysis (SSA) to a single channel EEG recordings to extract high amplitude and low frequency MNA has been previously performed.26 The main aim of the work by Teixeira et al. was to remove the artifacts in EEG signals, hence, an iterative approach to reconstruct the main dynamics of the signal was not implemented. The novelty of our approach is based on the use of SSA combined with an iterative approach to reconstruct the portion of the MNA contaminated data with the most likely true dynamics (i.e., non-MNA contaminated data) of the pulse oximeter signal. In summary, the advantages of the IMAR algorithm is that it can obtain accurate frequency dynamics and amplitude estimates of the signal, hence, the reconstructed HR and SpO2 estimates should not deviate much from the true uncorrupted values. The disadvantage of the IMAR algorithm is that because it is an iterative approach, the computational complexity is high, hence, in the current form, it is most suitable for offline data analysis. We are not aware of any previous applications of SSA-based algorithms for MNA reconstruction of pulse oximeter data. In conclusion, a scenario where a reference signal is not available to remove the MNA, the proposed IMAR algorithm is a promising new approach to accurately reconstruct HR and SpO2 values from MNA contaminated data segments.