Keywords

1 Introduction

As per the records of the WHO and the statistics of neurological disorders, 50 million people are affected by epilepsy globally and 24 million by Alzheimer’s and other disorders. As a result of neurological disorders, around 6.8 million people lose their lives every year [1, 2]. Physicians suggest continuous brain monitoring to diagnose the diseases like Alzheimer’s, epilepsy, and other neurological disorders.

There are several methods to acquire the cerebral activity: magnetoencephalography (MEG), functional magnetic resonance imaging (FMRI), near-infrared spectroscopy (NIRS), computed tomography (CT), electroencephalogram (EEG), etc. Though these methods can provide good spatial resolution, EEG is the powerful and frequently used tool in modern medical field and academia due to affordable price along with acceptable temporal resolution [3].

Of late, EEG-based home care systems such as Neuro Monitor and OPTIMI are being used rapidly to monitor the condition of individuals, particularly in care related to elderly, where minimal instrumentation and less computational resources are required [4, 5]. Single-channel EEG contains few electrodes and ambulatory system embedded with signal processing to process the EEG signals, which makes it more popular. EEG signal can be produced with the excitation of neurons in brain. The magnitude of the EEG signal approximately varies between 10 μV–100 μV and 0–64 Hz, respectively. The sub-bands of typical EEG brain rhythms are delta, theta, alpha, and beta, respectively, from low to high frequencies which are shown in Fig. 1.

Fig. 1
Four waves represent different E E G brain rhythms. The waves are labeled as beta 13 to 30 Hartz, alpha 8 to 13 Hartz, theta 4 to 8 Hartz, and delta 0.5 to 4 Hartz in a high-frequency to low-frequency sequence.

Typical EEG brain rhythms

The extra-biological artifacts like line interference and electrode noise can be removed by filtering, due to spectral separation between extra-biological artifacts and electroencephalography signal. However, due to the fact that the heartbeat, movement of eyes, and muscles coexist in the frequency range as EEG signal, considerable attention must be paid to eliminate the biological artifacts. There exists a significant damage to the data of an EEG, if the procedure of artifact elimination is not methodical [6].

Several efforts have been made, and various methods have been developed till date for the removal of artifacts. However, each of them lags behind with some practical issues such as requirement of high computational resources and computational time, and there is no complete solution yet, which leads to the development of simple artifact removal techniques.

Several techniques were projected to correct BAs from recordings of EEG, among which wavelet transform (DWT and SWT) with thresholding methods is being widely used [7, 8]. Identification of mother wavelet also plays a major role in the system’s overall performance.

Huang et al. developed EMD to evaluate nonstationary and nonlinear signals such as EEG [9, 10]. EMD decomposes the signal into several IMFs. The main advantage of EMD over WT is its ability to estimate suitable changes in the frequency of the signal. IMF interval thresholding is employed for correcting the artifacts resulting in a substantially cleaner EEG signal unlike using conventional EMD methods, which may result in loss of neural information at blink regions [11].

Frontal channels like F8, F7, FP2, and FP1 are likely to be dominant with respective to ocular artifacts. Thus, it is realistic to read the signals as corrupted or contaminated EEG signals from these electrodes. Figure 2 illustrates the blink artifacts of typical EEG signals. “eegmmidb” (EEG motor movement/imagery) dataset was used to carry the computations [12]. Comparison is made between the proposed method and the conventional EMD with concentration on considering standard metrics for performance like ARR, RMSE, ΔSNR, and CC.

Fig. 2
A graph of blink antiques of regular E E G signals. The vertical axis represents amplitude in microvolts. The horizontal axis represents the number of samples. The channels F7, F8, FP1, and FP3, are prevalent with an individual to viewed antiques.

Frontal electrode EEG signals with blink artifacts

The outline of this chapter is as narrated: A concise study of EMD algorithm is done in Sect. 2; Sect. 3 describes the identification of noisy IMFs and blink artifacts; the concepts of threshold function and IMF thresholding are discussed in Sect. 4; experimental verification and results of the proposed method are projected in Sect. 5; the conclusions are furnished in Sect. 6.

2 Empirical Mode Decomposition

EMD is an efficient approach for breaking down any complicated signal into finite intrinsic mode functions [13, 14]. The extremas and IMF zero crossings should be same or alter by one. The average of envelope stated by extrema must be zero at any point for each IMF. For a nonstationary signal x(t), the detailed steps of EMD are the following:

  1. (a)

    The maxima and minima points of the raw EEG signal are to be determined.

  2. (b)

    Using cubic spline interpolation, the signals lower envelope (elower) and upper envelope (eupper) are to be created.

  3. (c)

    To generate the first IMF, the mean envelope should be deducted from the signal x(t). h1(t) = x(t) − a(t), where a(t) represents the average of the envelope, i.e., \( a(t)=\frac{\left({e}_{\mathrm{lower}}+{e}_{\mathrm{upper}}\right)}{2} \).

  4. (d)

    To get a new residual signal r1(t), IMF1 is to be subtracted from x(t), i.e., r1(t) = x(t) − h1(t), and then r1(t) should be decomposed as done above to obtain the second IMF.

  5. (e)

    All the steps are to be made recurrent till no IMFs are derived. Reconstruction of original signal can be from IMFs like \( x(t)=\sum \limits_{i=1}^K{h}_k(t)+r(t) \). Figure 3 illustrates the flowchart of the EMD process [15]. The resulting IMFs by applying EMD for the FP2 EEG signal are as shown in Fig. 4.

Fig. 3
A flowchart of empirical mode decomposition. The flow chart starts with n equals 1; r of t equals x of t. At the step, is h of t an I M F? has two options yes and no. If No, x of t equals h of t. If Yes, n equals n plus 1, and r of t equals r of t minus h of t. Then, Is r of t a monotonic is checked. Then, x of t equals r of t is checked.

Flowchart depicting the process of EMD

Fig. 4
A graph of decomposition of FP2 E E G signals. The vertical axis represents amplitude in microvolts. The horizontal axis represents the number of samples. The I M F waves are labeled as 1 to 15 from a high-frequency to low-frequency sequence.

IMFs 1–15 due to decomposition of FP2 EEG

3 Identifications of Noisy IMFs and Blink Artifacts

3.1 Identification of Noisy IMFs

By performing thresholding to all IMFs, there may exist a loss of neural activity in the reconstructed signal [16]. Hence, it is necessary to identify the IMFs that belong to signal component or artifactual component and thresholding is performed to noisy IMFs that results in clearer EEG signal. The coefficients of correlation amid the IMFs and the raw EEG signal can be used to categorize the IMFs as either noise-dominant or signal-dominant modes. The magnitude and frequency of the EEG signal typically range from 10 μV to 100 μV and from 0 to 64 Hz, respectively, whereas ocular activity varies in the order of millivolts spread from 0 to 16 Hz sub-band. Hence, blink artifacts absolutely capture the neural signal with its occurrence, and mostly the EEG signal is noise dominant. So, the IMFs having the highest correlation coefficient with respect to raw EEG signal are considered to be dominant in noise, while the others are considered as signal dominant. However, if the signal contains a momentous noise, this method may cause some noise-dominant IMFs to be misjudged. Table 1 labels CC between the raw EEG and individual IMFs for FP2 EEG. Those IMFs with CC greater than 0.25, i.e., IMF5, IMF6, IMF7, IMF8, IMF9, and IMF10, are considered to be noise dominant.

Table 1 Correlation coefficient between raw EEG and IMFs

Spectral analysis of modes is used to classify the IMFs in this study. Figure 5 illustrates the spectral comparison of raw EEG signal and its IMFs. The IMFs with substantial power at lower frequencies are considered as noisy IMFs. From this figure, IMFs 11–15 might not be considered as signal-dominant IMFs due to their spectral distribution at lower frequencies. To resolve the issue, EEG signal is preprocessed by stationary wavelet transform, and the CC between preprocessed signal and its IMFs is derived.

Fig. 5
The power spectral density plots of I M F 1 to 15 where spectral comparison of E E G signals and its I M F are exhibited. The vertical axis of the plot represents magnitude in decibel. The horizontal axis represents frequency in Hertz. The I M F 11 to 15 has spectral distribution at lower frequencies.

Power spectral density plots of IMFs 1–15

Raw EEG signal of sampling frequency 160 Hz is decomposed to four levels by stationary wavelet transform. Daubechies (db8) wavelet is chosen; hence, its shape resembles that of ocular activity. The approximation coefficients at fourth decomposition level that are at OA range are set to zero, and reconstruction is performed. The preprocessed EEG signal and its power spectrum are shown, respectively, in Figs. 6 and 7. As shown in Fig. 6, the spectral distribution of signal plus noise is minimum at 0–16 Hz. EMD is employed to the preprocessed signal for extracting IMFs. Correlation coefficient is carried out over the preprocessed signal and the extracted IMFs, from which noise-free IMFs are estimated and summarized in Table 2. On observing Table 1, it can be concluded that IMFs 1–4 are to be considered as signal dominant due to their mutual correlation with preprocessed signal. IMFs 11–15 are not correlated to the preprocessed signal and hence considered as noise dominant.

Fig. 6
A graph of raw and preprocessed FP2 E E G signals by S W T. The vertical axis represents amplitude in microvolts. The horizontal axis represents the number of samples. Two high-frequency waves for raw EEG and Pre processed EEG, where both range from (0, 0) to (1600, 0) approximately.

Raw and preprocessed FP2 EEG signal by SWT

Fig. 7
A graph for raw E E G and Pre processed E E G signals. The vertical axis represents magnitude in decibel, and the horizontal axis represents frequency in Hertz. Two frequency waves for raw E E G and Pre processed E E G start around (0, 38) and (0, negative 30) and connect at (40, 26) and end at (260, 8). All the values are approximate.

Spectra of raw and preprocessed EEG signals

Table 2 Correlation coefficient between preprocessed EEG signals and its IMFs

3.2 Identification of Blink Artifacts

Krishnaveni et al. developed a wavelet-based approach to identify slowly varying artifacts and performed denoising for the zones identified, which preserves the cerebral information at artifact-free zones. Haar wavelet is used to recognize rising and falling edges of the blinks, i.e., artifact rising edge (ARE) and artifact falling edge (AFE); based on the location of ARE and AFE, blink zones are identified [17]. Figure 8 illustrates an EEG with identified artifacts.

Fig. 8
A graph of raw E E G signals with identified artifacts. The vertical axis represents amplitude in microvolts, and the horizontal axis represents the number of samples. The signal wave ranges from (0, 0) to (1800, 0) with the highest point at (690, 260) and the lowest point (1120, negative 120) approximately.

Raw EEG with identified artifacts

4 Threshold Function and IMF Thresholding

Threshold function and thresholding methods play a critical role in artifact correction. Many threshold functions are available in literature such as universal threshold, statistical threshold, minimax threshold, and SURE threshold. But universal threshold is more popular and widely being used for artifact correction of biological signals.

4.1 Universal Threshold (UT)

UT is a function approved globally in which the threshold values for each IMF are estimated by Eq. (1):

$$ {\lambda}_i=\sqrt{2{E}_i\ln N} $$
(1)

where E is energy and N is number of samples [18].

The energy of each IMF resulting from white Gaussian noise follows an exponential relationship defined by

$$ {E}_i=\frac{E_1^2}{\beta }{\rho}^{-\mathrm{i}}\kern2.25em i=2,3,\dots $$
(2)

Here, E1 is the first IMF energy expressed by Eq. (3), and ρ and β are the parameters to be analyzed with a huge quantity of noise realizations and IMFs as 2.01 and 0.719, respectively [19]:

$$ {E}_1={\left[\frac{\mathrm{Median}\mid {h}_1(n)-\overline{h_1(n)}\mid }{0.6745}\right]}^2 $$
(3)

h1(n) replicates the first IMF coefficients. For additive white Gaussian noise, the denominator value of 0.6745 is found to be a more suitable estimator.

4.2 IMF Thresholding

The development of EMD decomposed the signal to numerous IMFs. The IMFs with higher order contain noise at low frequencies and vice versa. Typically, ocular artifacts are disseminated at 0–16 Hz. Hence, in conventional EMD, denoising of the denoised signal is attained by excluding the higher order low-frequency IMFs while combining the lower order high-frequency IMFs. However, selection of noisy IMF plays a critical role in artifact removal process; moreover, excluding the IMF completely yields loss of neural activity in artifact-free zones.

In EMD-DT (EMD direct thresholding), the signal is reconstructed after performing thresholding to the noisy IMFs. Direct wavelet thresholding performed on IMFs results in the construction of EMD-DT. The signal reconstructed with modified IMFs can be denoted as

$$ x(t)=\sum \limits_{i={M}_1}^{M_2}\tilde{h}_i(t)+\sum \limits_{i={M}_2+1}^L{h}_i(t)+r(t) $$
(4)

where

$$ \tilde{h}_i(t)=\left\{\begin{array}{c}{h}_i(t)\kern5em \left|{h}_i(t)\right|>{\lambda}_i\\ {}0\kern6.75em \left|{h}_i(t)\right|\le {\lambda}_i\end{array}\right. $$
(5)

for hard thresholding

$$ \tilde{h}_i(t)=\left\{\begin{array}{c}\operatorname{sgn}\left({h}_i(t)\right)\left(\left|{h}_i(t)\right|-{T}_i\right)\kern1em \left|{h}_i(t)\right|>{\lambda}_i\\ {}0\kern11.75em \left|{h}_i(t)\right|\le {\lambda}_i\end{array}\right. $$
(6)

for soft thresholding.

M2 and M1, respectively, indicate the high-order, low-order IMFs. λi is the value of threshold at ith IMF. However, for the decomposition modes, directly applying the threshold can yield disastrous results to disturb the continuity of the signal reconstructed. Kopsinis and Mclanglin introduced and verified an innovative EMD-based denoising technique on different signals, where EMD-IT was studied [20,21,22]. Taking into note the two zero crossings (adjacent) \( {Z}_j^{(i)}=\left[{Z}_j^{(i)}\ {Z}_j^{\left(i+1\right)}\right] \) in the ith IMF, hard and soft thresholding for the IMF coefficients can be described mathematically by Eqs. (7) and (8), respectively:

$$ \tilde{h}_i\left({z}_j^i\right)=\left\{\begin{array}{c}{h}_i\left({z}_j^i\right)\kern11em \left|{h}_i\left({r}_j^i\right)\right|>{\lambda}_i\\ {}0\kern13.25em \left|{h}_i\left({r}_j^i\right)\right|\le {\lambda}_i\end{array}\right. $$
(7)
$$ \tilde{h}_i\left({z}_j^i\right)=\left\{\begin{array}{c}{h}_i\left({z}_j^i\right)\frac{\left(\left|{h}_i\left({r}_j^i\right)\right|-{T}_i\right)}{\left|{h}_i\left({r}_j^i\right)\right|}\kern3.5em \left|{h}_i\left({r}_j^i\right)\right|>{\lambda}_i\\ {}0\kern13.25em \left|{h}_i\left({r}_j^i\right)\right|\le {\lambda}_i\end{array}\right. $$
(8)

where hi is the IMF coefficient before decomposition, \( \tilde{h}_i \)is the estimated IMF coefficient after thresholding, \( {h}_i\left({z}_j^i\right) \) indicates the samples from \( {z}_j^i \) to \( {z}_j^{i+1} \), and \( {h}_i\left({r}_j^i\right) \) represents the maxima of the zero crossing interval. Hard thresholding is unstable and is sensitive to even minute modifications of the signal: i.e., the coefficients remain unchanged above the threshold and below are decreased to zero. The local properties of the signal are not modified by this method, but due to discontinuity, they lead to certain fluctuation in the reconstructed signal. Soft thresholding is more stable than hard thresholding, and the coefficients can be minimized towards zero. Hence, soft thresholding is selected throughout, for this study.

The modified IMF coefficients by direct and interval thresholding for IMF 5 are as shown in Fig. 9. T5 is the obtained threshold value of fifth IMF using universal threshold. The modified IMF coefficients are continued at zero crossing by direct thresholding, whereas those in the neighborhood of zero crossings are reduced to zero by interval thresholding, resulting in errors in the reconstructed signal. Hence, EMD direct thresholding is not a suitable choice for correcting artifacts of EEG signals.

Fig. 9
A graph of modified I M F coefficients. The vertical axis represents amplitude, and the horizontal axis represents the number of samples. Three high-frequency waves for I M F 5, E M D-I T, and E M D-D T range from (0, 0) to (1600, 0) approximately. A dashed line for T5 ranges from (0, 5) to (1600, 5) approximately.

Modified IMF coefficients by EMD-DT and EMD-IT techniques

4.3 Methodology

EMD is to be performed on the raw EEG signal for extracting the IMFs. The IMFs with higher order contain noise at low frequencies and vice versa; that is, the BAs are found to be distributed in the higher order IMFs.

The noisy IMFs are to be identified as explained in Sect. 3.1.

The threshold for each IMF should be calculated, and interval thresholding for the identified noisy IMFs is performed as described in Sect. 4.2.

Modified IMFs and the remaining signal IMFs are added for reconstructing the signal according to Eq. (4).

The metrics ARR, ΔSNR, RMSE, and CC amid clean and raw EEG signals are to be calculated. Figure 10 depicts the process of artifact correction.

Fig. 10
A flow chart. The steps are raw E E G signal, E M D, I M F s, identification of noisy I M F s, E M D-I T, signal reconstruction, and performance evaluation. Arrow between I M F s and identification of noisy I M F s connects to signal reconstruction through signal I M F s. Arrow labeled Noisy I M F s connect to E M D-I T.

Denoising approach based on EMD-IMF thresholding

5 Experimental Verification and Results

The quantitative metrics to evaluate the denoising techniques are defined by the following equations [23]:

$$ \Delta \mathrm{SNR}=10{\log}_{10}\left(\frac{\sigma_x^2}{\sigma_y^2}\right) $$
(9)
$$ ARR=\frac{\sum \limits_{i=1}^N{\left({x}_i(n)-{y}_i(n)\right)}^2}{\sum \limits_{i=1}^N{y}_i^2(n)} $$
(10)
$$ CC=\frac{\sum \limits_{n=1}^N\left(x\left[n\right]-\overline{x\left[n\right]}\right)\left(y\left[n\right]-\overline{y\left[n\right]}\right)}{\sqrt{\sum \limits_{n=1}^N{\left(x\left[n\right]-\overline{x\left[n\right]}\right)}^2\sum \limits_{n=1}^N{\left(y\left[n\right]-\overline{y\left[n\right]}\right)}^2}} $$
(11)
$$ \mathrm{RMSE}=\sqrt{\frac{1}{N}\sum \limits_{n=1}^N{\left(x\left[n\right]-y\left[n\right]\right)}^2} $$
(12)

where y(n) represents the clean EEG signal, x(n) is the contaminated EEG, and \( {\sigma}_y^2 \) and \( {\sigma}_x^2 \) are the variance of the clean and raw EEG signals, respectively.

5.1 Results and Discussions

In the conventional EMD-based denoising method, the denoised signal is the sum of selected IMFs by excluding the IMFs which have artifactual components. However, in EMD direct thresholding (EMD-DT), the signal is reconstructed after performing thresholding to the noisy IMFs. Figure 11 exemplifies the raw and reconstructed FP2 EEG signal with EMD-DT and conventional EMD denoising method of artifact correction. Artifacts in blink region are not suppressed by EMD-DT method, which can be done by conventional EMD method. However, the cerebral activity at artifact-free zones might be affected. Table 3 represents the quantitative metrics between the contaminated and clean EEG signals, using conventional EMD and EMD-DT techniques for the frontal EEG signals at non-blink regions, which are separately discussed in Sect. 3.2, with an average CC and RMSE of 0.4175, 0.84 and 21.55, 0.443 by conventional EMD and EMD-DT methods, respectively, over non-blink regions. Artifacts are largely attenuated by the conventional EMD method, but the neural information is greatly affected based on CC and RMSE.

Fig. 11
A clean E E G signal graph by the conventional E M D method. The vertical axis represents amplitude in microvolts, and the horizontal axis represents the number of samples. Three high-frequency waves for raw E E G, E M D-D T and conventional E M D, range from (0, 0) to (1600, 0) approximately.

Clean EEG signal by the conventional EMD method

Table 3 Quantitative metrics of artifact removal by Conv-EMD and EMD-DT

However, in the partial sum of IMFs, there might be the possibility of excluding high-frequency content in the signal or including the unwanted low-frequency oscillations of the signal. With this view to avoid any loss of data in the signal reconstructed, IMF interval thresholding is opted for artifact correction.

Figure 12 illustrates the time domain description of FP2 EEG signal before and after artifact removal by conventional EMD, EMD-DT, and EMD-IT methods. From this figure, it can be clearly understood that EMD-DT method is unable to reject the artifacts, whereas conventional EMD and EMD-IT methods minimize the artifacts to a greater extent.

Fig. 12
Two graphs. Graph 1 represents the clean E E G signal. Three high-frequency waves for raw E E G, E M D-D T, and E M D-I T range from (0, 0) to (1600, 0) approximately. Graph 2 represents the clean FP2 E E G signal. Two high-frequency waves for raw E M D-I T and conventional E M D range from (0, 0) to (1600, 0) approximately.

(a) Clean EEG signal by EMD-IT and EMD-DT techniques. (b) Clean FP2 EEG signal by Conv-EMD and EMD-IT techniques

Table 4 provides the average quantitative metrics between contaminated and reconstructed EEG signals using EMD-DT and EMD-IT techniques. The higher the values of ΔSNR and ARR, the better will be the rejection of artifacts, the higher the CC, and the lower the RMSE, which indicate that the neural information is protected. ΔSNR and ARR are estimated for the whole data, whereas CC and RMSE are estimated at non-blink regions. EMD-DT and EMD-IT methods have accounted for average ΔSNR, ARR, CC, and RMSE of 1.11, 1.35, 0.84, and 2.82 and 21.05, 11.15, 0.65, and 30.8, respectively, for the frontal EEG signals. According to ΔSNR and ARR, EMD-IT method has given better results than EMD-DT. Similarly, the performance of EMD-DT is found superior to EM-IT considering CC and RMSE. Universal threshold opted for DT was unable to shrink the IMF coefficients at each level, thereby being incapable to reject the artifacts. However, the same threshold function shrinks the coefficients better by the process of interval thresholding between zero crossings. CC and RMSE for EMD-IT are poor compared to EMD-DT but superior to conventional EMD method of artifact correction.

Table 4 ΔSNR, ARR, CC, and RMSE by EMD-DT and EMD-IT methods

Blink artifacts were found dominant in EEG recording and occupy the low-frequency band from 0 to 16 Hz. Artifact removal algorithms should selectively reduce the spectral power in the lower frequency bands without distressing the high-frequency bands. Hence, spectral analysis was carried out for the EEG signal to estimate the artifact correction in the frequency domain, which is implemented in this study using Welch method (pwelch). Figure 13 illustrates power spectral density (PSD) of the raw and reconstructed EEG signals using EMD-DT and EMD-IT techniques. Unlike the spectrum of FP2 EEG signal, the power drop is found restricted to lower frequencies, and the neural information at higher frequencies is preserved by EMD-IT method.

Fig. 13
A graph of raw and clean E E G signals. The vertical and horizontal axes represent magnitude in decibel and frequency in Hertz, respectively. Two frequency waves of raw E E G and clean E E G E M D-D T start (0, 37) and end at (260, 6). The frequency wave of clean E E G E M D-I T starts (0, 6) and ends at (260, 6). All the points are approximate.

Spectra of raw and clean EEG signals by EMD-DT and EMD-IT methods

6 Conclusion

In this chapter, IMF interval thresholding is proposed to correct blink artifacts of EEG signals. Noisy IMFs are detected based on CC and power spectral density estimate of the signals. Background data at artifact-free regions is conserved by performing thresholding to the noisy IMFs. Performance of the proposed method is compared with EMD-DT and conventional method of EMD denoising in terms of several standard metrics: ARR, ΔSNR, RMSE, and CC. Conventional EMD denoising method has shown extreme performance in artifact rejection, utterly affecting the neural activity. EMD-DT can preserve the neural information at artifact-free zones but is unable to correct the artifacts, whereas EMD-IT is good in correcting the artifacts while preserving the cerebral information at artifact-free regions. However, the neural information at non-blink regions is further improved by an optimized threshold function.