1 Introduction

Epilepsy is a chronic neurological disorder caused by abnormal and excessive neuronal activity in the brain, and the electroencephalogram (EEG) is the most commonly used and efficient clinical technique for assessing epilepsy owing to its low cost and availability (Zhang et al. 2017). Traditionally, the detection of epileptic seizures relies on visual inspection by neurologists, which is tedious, laborious and subjective (Martis et al. 2015). In addition, it requires expertise in the analysis of long-duration EEG signals (Scheuer and Wilson 2004). In application scenarios where experts are absent, for example in emergencies, computer-aided automatic detection of epileptic seizures becomes significant. To overcome the above-mentioned disadvantages, numerous seizure detection techniques involving signal processing and machine learning tools have been developed, such as support vector machines (SVM), extreme learning machines (ELM), random forests (RF) and deep learning (Zhang and Chen 2016; Song et al. 2016; Mursalin et al. 2017; Acharya et al. 2018; Ullah et al. 2018; Li et al. 2019; Subasi et al. 2019; Sharma et al. 2018; Sharma and Pachori 2017a, b; Bhati et al. 2017a, b; Bhattacharyya and Pachori 2017; Tiwari et al. 2016; Sharma et al. 2017; Bhattacharyya et al. 2017; Sharma and Pachori 2015; Kumar et al. 2015; Pachori and Patidar 2014; Bajaj and Pachori 2012, 2013; Pachori and Bajaj 2011; Pachori 2008; Pachori et al. 2015). However, automatic detection with high efficiency and accuracy in distinguishing normal, interictal and ictal EEG signals remains an open problem (Djemili et al. 2016). In an attempt to solve this problem, various algorithms have been developed. Since EEG signals are redundant discrete-time sequences, numerous methods combining time-domain, frequency-domain, time-frequency-domain and nonlinear analysis have been proposed (Acharya et al. 2013).
For the time-domain analysis, representative techniques such as linear prediction (Sheintuch et al. 2014), fractional linear prediction (Joshi et al. 2014), principal component analysis (PCA) based radial basis function neural network (Kafashan et al. 2017), etc, have been proposed for seizure detection and EEG classification. For the frequency-domain analysis, with an assumption that EEG signals are stationary, Fourier transform is usually employed to extract features for epileptic seizure detection. Samiee et al. (2015) applied the rational Discrete Short Time Fourier Transform (DSTFT) to extract features for the separation of seizure epochs from seizure-free epochs using a Multilayer Perceptron (MLP) classifier. Considering the non-stationary nature of EEG signals (Subasi and Gursoy 2010), for the time-frequency-domain analysis, a wavelet transform tool together with certain classifier has usually been used for the epileptic seizure detection. Hassan et al. (2016) decomposed the EEG signal segments into sub-bands using Tunable-Q factor wavelet transform (TQWT) and several spectral features were extracted. Then bootstrap aggregating was employed for epileptic seizure classification. For the nonlinear analysis, various nonlinear parameters extracted through different types of entropies (Acharya et al. 2015), Lyapunov exponent (Shayegh et al. 2014), fractal dimension (Zhang et al. 2015), correlation dimension (Sato et al. 2015), recurrence quantification analysis (RQA) (Timothy et al. 2017) and Hurst exponent (Lahmiri 2018) methods have been used for automatic detection of epileptic EEG signals. Aarabi and He (2017) developed a method on the fusion of features extracted from correlation dimension, correlation entropy, noise level, Lempel–Ziv complexity, largest Lyapunov exponent, and nonlinear interdependence for the detection of focal EEG signals.

Despite the fact that these previous approaches have demonstrated respectable classification accuracy, the potential of nonlinear methods has not been thoroughly investigated. The EEG signal is highly random, nonlinear, nonstationary and non-Gaussian in nature (Acharya et al. 2013), for which nonlinear features characterize the EEG more accurately than linear models (Wang et al. 2017). Considering these characteristics, several self-adaptive signal processing methods, such as empirical mode decomposition (EMD) (Huang et al. 1998; Huang and Kunoth 2013), local mean decomposition (LMD) (Park et al. 2011) and intrinsic time-scale decomposition (ITD) (Frei and Osorio 2007), can be employed to extract effective and predominant features from EEG signals (Li et al. 2013; Zahra et al. 2017). EMD decomposes a multi-component signal into a series of single components and a residual signal, while LMD decomposes any complicated signal into a series of product functions. However, these methods have some drawbacks: EMD suffers from over-enveloping, mode mixing, end effects and the unexplainable negative frequencies caused by the Hilbert transform (Chen et al. 2011), while LMD produces distorted components, suffers from mode mixing and requires time-consuming decomposition (Li et al. 2015). To address these problems, a new technique named ITD was recently introduced by Frei and Osorio (2007) for analyzing data from nonstationary and nonlinear processes. Compared with EMD, the ITD method exploits more local characteristic information of the signal. In addition, the negative frequencies caused by the Hilbert transform are completely eliminated (Feng et al. 2016). Furthermore, the computational efficiency is significantly improved.
With high decomposition efficiency and frequency resolution, ITD decomposes a complex signal into several proper rotation components (PRCs) and a baseline signal, which enables accurate extraction of the dynamic features of nonlinear signals. Meanwhile, the ITD method involves no spline interpolation or screening process and exhibits low edge effects (An et al. 2012; Xing et al. 2017). ITD can thus better preserve and extract the EEG system dynamics, which is effective for the classification of normal, interictal and ictal EEG signals. Phase space reconstruction (PSR) is another popular nonlinear tool for analyzing composite, nonlinear and nonstationary signals (Takens 1980; Xu et al. 2013; Lee et al. 2014; Chen et al. 2014; Jia et al. 2017). The principle of PSR is to transform the properties of a time series into the topological properties of a geometrical object embedded in a space wherein all possible states of the system are represented. Each state corresponds to a unique point, and the reconstructed space shares the same topological properties as the original space. The dynamics in the reconstructed state space are equivalent to the original dynamics; hence the reconstructed phase space is a very useful tool for extracting the nonlinear dynamics of a signal (Takens 1980; Xu et al. 2013; Lee et al. 2014; Chen et al. 2014; Jia et al. 2017). It is hypothesized that the EEG system dynamics of normal, interictal and ictal EEG signals are significantly different, which implies that PSR offers the potential to compute the difference and classify these EEG signals.

The novelty of this work lies in five aspects: (1) the ITD method is employed to measure the variability of EEG signals, and the first and second proper rotation components (PRCs) are extracted as the predominant PRCs, which contain most of the EEG signals’ energy; (2) the discrete wavelet transform (DWT) decomposes the predominant PRCs into different frequency bands, which are used to construct the reference variables; (3) the 3D phase space of the two PRC components is reconstructed, in which the properties associated with the EEG system dynamics are preserved; (4) the EEG system dynamics can be modeled and identified using neural networks, which employ the ED of the 3D PSR of the reference variables as features; (5) the difference in EEG system dynamics between normal, interictal and ictal EEG signals is computed and used for the discrimination between the three groups based on a bank of estimators. In the present study we propose a combined computational method from the areas of nonlinear analysis and machine learning for the classification of normal, interictal and ictal EEG signals. To explore the underlying dynamics of the three groups, neural networks together with ITD, DWT and PSR are implemented for this purpose. The complete algorithm encompasses four principal stages: (1) EEG signals are decomposed into a series of proper rotation components (PRCs) and a baseline signal using the ITD method. The first two PRCs of the EEG signals are extracted; they contain most of the EEG signals’ energy and are considered the predominant PRCs. (2) A four-level DWT is employed to decompose the predominant PRCs into different frequency bands, in which the third-order Daubechies (db3) wavelet function is selected for analysis. (3) The phase space of the db3 sub-bands of the PRCs is reconstructed, in which the properties associated with the nonlinear EEG system dynamics are preserved.
Three-dimensional (3D) PSR together with the Euclidean distance (ED) is utilized to derive features, which reveal significant differences in EEG system dynamics between normal, interictal and ictal EEG signals. (4) Neural networks are then used to model, identify and classify the EEG system dynamics of normal (healthy), interictal and ictal EEG signals.

Fig. 1

Flowchart of the proposed method for the classification of normal, interictal and ictal EEG signals using ITD, DWT, PSR, ED and neural networks

The rest of the paper is organized as follows. Section 2 introduces the details of the proposed method, including the Bonn dataset, data description, ITD, DWT, PSR, ED, feature extraction and selection, learning and classification mechanisms. Section 3 presents experimental results. Sections 4 and 5 give some discussions and conclusions, respectively.

2 Method

In this section, we propose a method to discriminate between normal, interictal and ictal EEG signals using the information obtained from nonlinear EEG dynamics. It is divided into a training stage and a classification stage and proceeds in the following steps. In the first step, ITD is applied to decompose the EEG signals into several PRCs, from which the predominant components are extracted. In the second step, DWT is employed to decompose the predominant PRCs into different frequency bands. In the third step, PSR is applied to extract the nonlinear dynamics of the EEG signals, and Euclidean distances are computed. Finally, the feature vectors are fed into neural networks for the modeling and identification of the EEG system dynamics. The difference in dynamics between normal (healthy), interictal and ictal EEG signals is then used for the classification task. The flowchart of the proposed algorithm is illustrated in Fig. 1.

2.1 EEG database

In the present study we use the open and publicly available Bonn University database (Andrzejak et al. 2001), consisting of five sets (Z, O, N, F and S), each of which contains 100 single-channel EEG segments of 23.6-s duration. All EEG signals were recorded at a sampling rate of 173.61 Hz using a 128-channel amplifier system with an average common reference, and band-pass filtered at 0.53–40 Hz. Hence each signal consists of 4097 samples. Sets Z and O contain surface EEG recordings carried out on five healthy subjects in a relaxed state: set Z was recorded with the subjects’ eyes open and set O with their eyes closed. Sets N, F and S contain intracranial recordings from depth and strip electrodes collected from five epileptic patients. Set N contains seizure-free intervals recorded from the hippocampal formation of the opposite hemisphere, set F contains seizure-free intervals recorded from the epileptogenic zone, and set S contains epileptic seizure segments originating from all channels. EEG recordings from the Z–O, N–F and S datasets are referred to as normal (healthy), interictal and ictal signals, respectively.

2.2 Intrinsic time-scale decomposition (ITD)

Intrinsic time-scale decomposition (ITD) is suitable for analyzing nonstationary and nonlinear signals such as EEG signals. Without resorting to spline interpolation of signal extrema or sifting for mono-component separation, it decomposes a signal into proper rotation components (PRCs), suitable for calculating instantaneous frequency and amplitude, based on a baseline defined via a linear transform. The resulting decomposition precisely preserves the temporal information of each component regarding signal critical points and riding waves, with a time resolution equal to the time scale of the occurrence of extrema in the raw signal (Feng et al. 2016). Based on single-wave analysis, it accurately extracts the inherent instantaneous amplitude and frequency/phase information and other relevant morphological features (Frei and Osorio 2007).

For a time series signal I(t), define an operator L that extracts the baseline signal from I(t); the residual signal is called the proper rotation component (PRC). The signal I(t) can then be decomposed as

$$\begin{aligned} I(t)=L I(t)+(1-L)I(t)=B(t)+H(t) \end{aligned}$$
(1)

where B(t) is the baseline signal and H(t) is the proper rotation.

The decomposition procedure of a nonlinear signal can be summarized by the following steps:

  • Step 1 Find the local extrema of the signal I(t), denoted by \(I_k\), and the corresponding occurrence time instant \(\tau _k, k=0,1,2,\ldots \). For convenience \(\tau _0=0\).

  • Step 2 Suppose the operators B(t) and H(t) are given over the interval \([0, \tau _k]\), and I(t) is available on the interval \(t\in [0, \tau _{k+2}]\). Then on the interval \([\tau _k, \tau _{k+1}]\) between two adjacent extrema \(I_k\) and \(I_{k+1}\), the piecewise baseline extraction operator is defined as

    $$\begin{aligned} LI(t)=B(t)=B_k+(\frac{B_{k+1}-B_k}{I_{k+1}-I_k})\times (I(t)-I_k), \quad t\in [\tau _k, \tau _{k+1}], \end{aligned}$$
    (2)

    where

    $$\begin{aligned} B_{k+1}=\beta [I_k+(\frac{\tau _{k+1}-\tau _{k}}{\tau _{k+2}-\tau _{k}})(I_{k+2}-I_k)]+(1-\beta )I_{k+1}, \end{aligned}$$
    (3)

    and \(0<\beta <1\), typically \(\beta =0.5\).

  • Step 3 After extracting the baseline signal, the operator \(\Theta \) for extracting the residual signal as PRCs is defined as

    $$\begin{aligned} \Theta I(t)\equiv (1-L)I(t)=I(t)-B(t) \end{aligned}$$
    (4)

According to the definition, a PRC is a riding wave with the highest frequency on the baseline. Therefore, ITD separates the PRCs in frequency order from high to low. In addition, each PRC is obtained directly by subtracting the baseline from the input signal, without resorting to any sifting within each decomposition iteration. Thus, ITD has low computational complexity and, more importantly, avoids the smoothing of transients and time-scale smearing caused by repetitive sifting (Feng et al. 2016).

Take the baseline B(t) as the input signal I(t), and repeat steps (1)–(3), until the baseline becomes a monotonic function or a constant. Eventually, the raw signal will be decomposed into PRCs and a trend (Feng et al. 2016)

$$\begin{aligned} I(t)=\sum \limits _{i=1}^\rho H^i(t)+B^\rho (t), \end{aligned}$$
(5)

where \(\rho \) is the decomposition level.
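The decomposition step of Eqs. (1)–(4) can be sketched as follows. This is a minimal NumPy sketch, not the authors’ implementation: the simple sign-change extrema test and the endpoint convention \(B_0=I_0\), \(B_{\mathrm{last}}=I_{\mathrm{last}}\) are assumptions made here for brevity.

```python
import numpy as np

def itd_step(x, beta=0.5):
    """One ITD step: returns (baseline B, proper rotation H = x - B).

    Sketch of Eqs. (1)-(4). Endpoints are treated as extrema and the
    baseline is pinned to the signal there (a convention of this sketch).
    """
    # locate local extrema (sign change of the first difference)
    ext = [0]
    for i in range(1, len(x) - 1):
        if (x[i] - x[i - 1]) * (x[i + 1] - x[i]) <= 0:
            ext.append(i)
    ext.append(len(x) - 1)
    tau = np.array(ext)
    I = x[tau]

    # baseline knot values B_{k+1}, Eq. (3)
    B = np.empty_like(I)
    B[0], B[-1] = I[0], I[-1]          # boundary convention of this sketch
    for k in range(len(tau) - 2):
        B[k + 1] = (beta * (I[k] + (tau[k + 1] - tau[k]) / (tau[k + 2] - tau[k])
                            * (I[k + 2] - I[k]))
                    + (1 - beta) * I[k + 1])

    # piecewise baseline between adjacent extrema, Eq. (2)
    base = np.empty_like(x)
    for k in range(len(tau) - 1):
        seg = slice(tau[k], tau[k + 1] + 1)
        denom = I[k + 1] - I[k]
        if denom == 0:
            base[seg] = B[k]
        else:
            base[seg] = B[k] + (B[k + 1] - B[k]) / denom * (x[seg] - I[k])
    return base, x - base
```

Repeating `itd_step` on the returned baseline until it becomes monotonic yields the full decomposition of Eq. (5).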

Samples of the ITD of EEG signals from the five sets are demonstrated in Fig. 2.

Fig. 2

Samples of ITD of EEG signals from five sets

2.3 Discrete wavelet transform (DWT)

The wavelet transform is an effective time-frequency tool for the analysis of non-stationary signals. The discrete wavelet transform (DWT) decomposes an input signal H(t) (here, a PRC of the EEG signal) into a set of functions, called wavelets, obtained by scaling and shifting a mother wavelet function. The decomposition, i.e., the set of wavelet coefficients, is thus formed.

To accomplish this, the signal H(t) can be reconstructed as a linear combination of wavelets weighted by the wavelet coefficients. Choosing an appropriate wavelet function and number of decomposition levels is of great importance for correctly reconstructing H(t). In order to extract the five physiological EEG bands, a four-level DWT with the third-order Daubechies (db3) wavelet function has been used (Table 1 presents the frequency distribution of the DWT-based coefficients of the PRCs of the EEG signals at 173.61 Hz); this choice of mother wavelet is supported by many works in the literature (Vavadi et al. 2010; Tawfik 2016; Li et al. 2017). Figure 3 shows sample EEG channels of the five sets and the decomposed frequency bands of their predominant PRCs. Since frequency components above 40 Hz are of little use in epilepsy analysis, the sub-bands D4 and A4 are selected for feature acquisition in order to reduce the feature dimension.
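The four-level decomposition can be sketched with hard-coded db3 filter taps and periodic boundary handling (the periodic extension is an assumption of this sketch; production code would typically rely on a wavelet library):

```python
import numpy as np

# Standard db3 scaling (low-pass) filter; the high-pass filter follows
# from the quadrature-mirror relation g[n] = (-1)^n h[L-1-n].
H_DB3 = np.array([0.33267055, 0.80689151, 0.45987750,
                  -0.13501102, -0.08544127, 0.03522629])
G_DB3 = H_DB3[::-1] * np.array([1.0, -1.0, 1.0, -1.0, 1.0, -1.0])

def dwt_step(x, filt):
    """One analysis step: periodic convolution with `filt`, then downsample by 2."""
    N = len(x)
    out = np.zeros(N // 2)
    for k in range(N // 2):
        for n, c in enumerate(filt):
            out[k] += c * x[(2 * k + n) % N]
    return out

def wavedec4(x):
    """Four-level decomposition into [A4, D4, D3, D2, D1], as applied to
    the predominant PRCs in this work."""
    details = []
    a = x
    for _ in range(4):
        details.append(dwt_step(a, G_DB3))   # detail band at this level
        a = dwt_step(a, H_DB3)               # approximation for next level
    return [a] + details[::-1]
```

Because the db3 filter bank is orthogonal, the periodized one-level split preserves the signal energy, which is a convenient sanity check on any implementation.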

Fig. 3

Samples of four levels DWT of PRC1 and PRC2 of the EEG signals from five sets

Table 1 Frequency bands of the PRCs of the EEG signal using four-level decomposition

2.4 Phase space reconstruction (PSR)

It is sometimes necessary to search for patterns not only in a time series but also in a higher-dimensional transformation of the time series (Sun et al. 2015). Phase space reconstruction is a method used to reconstruct the so-called phase space. The concept of phase space is a useful tool for characterizing any low- or high-dimensional dynamic system. A dynamic system can be described using a phase space diagram, which essentially provides a coordinate system whose coordinates are all the variables comprising the mathematical formulation of the system. A point in the phase space represents the state of the system at a given time (Lee et al. 2014; Sivakumar 2002). Each db3 sub-band of a PRC of the EEG signals can be written as a time series vector \(V=\{v_1,v_2,v_3, \ldots ,v_K\}\), where K is the total number of data points. The phase space can be reconstructed according to Lee et al. (2014):

$$\begin{aligned} Y_j=(V_j,V_{j+\tau },V_{j+2\tau },\ldots ,V_{j+(d-1)\tau }) \end{aligned}$$
(6)

where \(j=1,2, \ldots ,K-(d-1)\tau \), d is the embedding dimension of the phase space and \(\tau \) is a time lag. It is worthwhile to mention that the properties associated with the EEG dynamics are preserved in the reconstructed phase space.

The behaviour of the signal over time can be visualized using PSR (especially when \(d=2\) or 3). In this work, we confine our discussion to the embedding dimension \(d=3\) because of its visualization simplicity. In addition, different studies have found this value to best represent the attractor for human movement (Venkataraman and Turaga 2016; Som et al. 2016). For \(\tau \), one can either use the first zero crossing of the autocorrelation function of each time series or the average \(\tau \) value obtained from all the time series in the training dataset using the method proposed in Michael (2005). In this study, we set the time lag to \(\tau =1\) to test the classification performance. PSR with \(d=3\) is referred to as 3D PSR.

Reconstructed phase spaces have been proven to be topologically equivalent to the original system and therefore are capable of recovering the nonlinear dynamics of the generating system (Takens 1980; Xu et al. 2013). This implies that the full dynamics of the EEG system are accessible in this space, and for this reason, features extracted from it can potentially contain more and/or different information than the common features extraction method (Chen et al. 2014).

The 3D PSR is the plot of the three delayed vectors \(V_j\), \(V_{j+1}\) and \(V_{j+2}\), used to visualize the dynamics of the human EEG system. The Euclidean distance (ED) of a point \((V_j,V_{j+1},V_{j+2})\), i.e., its distance from the origin in the 3D PSR, is defined as (Lee et al. 2014)

$$\begin{aligned} ED_j=\sqrt{V_j^2+V_{j+1}^2+V_{j+2}^2} \end{aligned}$$
(7)

ED measures can be used in features extraction and have been studied and applied in many fields, such as clustering algorithms and induced aggregation operators (Merigó and Casanovas 2011).
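Eqs. (6) and (7) translate directly into code; the following is a small sketch (the function names are ours, not from the original work):

```python
import numpy as np

def psr(v, d=3, tau=1):
    """Delay embedding of Eq. (6): row j is Y_j = (v_j, v_{j+tau}, ..., v_{j+(d-1)tau})."""
    K = len(v)
    n = K - (d - 1) * tau                # number of reconstructed points
    return np.column_stack([v[i * tau: i * tau + n] for i in range(d)])

def euclidean_distance(Y):
    """Eq. (7): distance of each reconstructed point from the origin."""
    return np.sqrt((Y ** 2).sum(axis=1))
```

For example, a length-K series with d = 3 and τ = 1 yields K − 2 points, and `euclidean_distance` returns one ED value per point.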

2.5 Feature extraction and selection

In order to obtain more efficient features, this paper proposes the following extraction scheme.

(1) ITD of the EEG signals and derivation of the predominant PRCs. The decomposed signals obtained by the ITD method cannot be used directly for classification because of their high feature dimension. To solve this problem, Pearson’s correlation coefficient is calculated to measure the correlation between each of the first four PRCs and the original EEG signal. A PRC with a higher correlation coefficient is more strongly correlated with the original signal, which also means that the signal energy is mostly concentrated in this PRC. In the present study most of the energy is concentrated in the PRC1 and PRC2 components, which carry the most important information of the EEG signals and are considered the predominant PRCs (see Table 2).
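The selection step can be sketched as follows (an illustrative helper of ours, using NumPy’s sample correlation):

```python
import numpy as np

def predominant_prcs(signal, prcs, n_keep=2):
    """Rank PRCs by |Pearson correlation| with the original signal and
    return the indices of the n_keep most correlated ones (ascending order)."""
    r = [abs(np.corrcoef(signal, p)[0, 1]) for p in prcs]
    order = np.argsort(r)[::-1]          # most correlated first
    return sorted(order[:n_keep].tolist())
```

In this work the ranking consistently selects PRC1 and PRC2 across the five Bonn sets (Table 2).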

Table 2 The average correlation coefficients between each PRC and original EEG signals from five sets of the Bonn database

(2) A four-level DWT is employed to decompose the predominant PRCs into different frequency bands, in which the third-order Daubechies (db3) wavelet function is selected for analysis. The D4 and A4 sub-bands of the PRC1 and PRC2 signals are regarded as the reference variables \([PRC1^{D4},PRC1^{A4},PRC2^{D4},PRC2^{A4}]^T\) and are used for feature derivation.

(3) Reconstruct the phase space of the reference variables with selected values of d and \(\tau \);

(4) Compute ED of 3D PSR of the reference variables. Concatenate them to form a feature vector \([ED_{j}^{PRC1^{D4}},ED_j^{PRC1^{A4}},ED_j^{PRC2^{D4}},ED_j^{PRC2^{A4}}]^T\).

For the Bonn epileptic database, the EEG signals are analyzed and the signal dynamics are extracted using ITD, DWT and 3D PSR. First, the ITD of the EEG signals is exhibited in Fig. 2. The four-level DWT of PRC1 and PRC2 of the EEG signals from the five sets is demonstrated in Fig. 3. The D4 and A4 sub-bands of the first two PRCs are utilized to form the reference variables \([PRC1^{D4},PRC1^{A4},PRC2^{D4},PRC2^{A4}]^T\). Samples of the 3D PSR of the reference variables are exhibited in Fig. 4. After 3D PSR, the features \([ED_j^{PRC1^{D4}},ED_j^{PRC1^{A4}},ED_j^{PRC2^{D4}},ED_j^{PRC2^{A4}}]^T\) of the EEG signals of the five sets are derived through the ED computation, as demonstrated in Fig. 5. As analyzed before, significant differences in EEG system dynamics exist between the EEG signals of the five sets, which can also be seen clearly in Fig. 4.

Fig. 4

Samples of 3D PSR of the reference variables \([PRC1^{D4},PRC1^{A4},PRC2^{D4},PRC2^{A4}]^T\) of EEG signals from five sets

Fig. 5

Samples of the Euclidean distance of the 3D PSR of the reference variables \([PRC1^{D4},PRC1^{A4},PRC2^{D4},PRC2^{A4}]^T\) of EEG signals

2.6 Training and modeling mechanism based on selected features

In this section, we present a scheme for modeling and deriving the EEG system dynamics of normal, interictal and ictal EEG signals based on the above-mentioned features.

Consider a general nonlinear EEG system dynamics in the following form:

$$\begin{aligned} \dot{x}=F(x;p)+v(x;p) \end{aligned}$$
(8)

where \(x=[x_1,\ldots ,x_n]^T\in R^n\) is the system state, which represents the features \([ED_j^{PRC1^{D4}},ED_j^{PRC1^{A4}},ED_j^{PRC2^{D4}},ED_j^{PRC2^{A4}}]^T\), and p is a constant vector of system parameters. \(F(x;p)=[f_1(x;p),\ldots ,f_n(x;p)]^T\) is a smooth but unknown nonlinear vector representing the EEG system dynamics, and \(v(x;p)\) is the modeling uncertainty. Since the modeling uncertainty \(v(x;p)\) and the EEG system dynamics \(F(x;p)\) cannot be decoupled from each other, we consider the two terms together as one undivided term, and define \(\phi (x;p):=F(x;p)+v(x;p)\) as the general EEG system dynamics. Then, the following steps are taken to model and derive the EEG system dynamics via deterministic learning theory (Wang and Hill 2006, 2007, 2009).

In the first step, standard RBF neural networks are constructed in the following form

$$\begin{aligned} f_{nn}(Z)=\sum \limits _{i=1}^N w_is_i(Z)=W^TS(Z), \end{aligned}$$
(9)

where Z is the input vector, \(W=[w_1,\ldots ,w_N]^T\in R^N\) is the weight vector, N is the node number of the neural networks, and \(S(Z)=[s_1(\parallel Z-\mu _1\parallel ),..., s_N(\parallel Z-\mu _N\parallel )]^T\), with \(s_i(\parallel Z-\mu _i\parallel )=\exp [\frac{-(Z-\mu _i)^T(Z-\mu _i)}{\eta _i^2}]\) being a Gaussian function, \(\mu _i(i=1,...,N)\) being distinct points in state space, and \(\eta _i\) being the width of the receptive field.
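Evaluating Eq. (9) amounts to computing the Gaussian activations and taking an inner product with the weights (a short sketch; argument names are ours):

```python
import numpy as np

def rbf_output(Z, W, centers, eta):
    """Evaluate f_nn(Z) = W^T S(Z) of Eq. (9), where S(Z) collects Gaussian
    radial basis functions centred at `centers` with receptive-field width `eta`."""
    S = np.exp(-np.sum((centers - Z) ** 2, axis=1) / eta ** 2)
    return W @ S
```

At an input coinciding with a centre, the corresponding basis function equals one, so the network output is dominated by that neuron’s weight, which is the locality property exploited below.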

In the second step, the following dynamical RBF neural networks are employed to model and derive the EEG system dynamics \(\phi (x;p)\):

$$\begin{aligned} \dot{\hat{x}}=-A(\hat{x}-x)+\hat{W}_j^TS_j(x) \end{aligned}$$
(10)

where \(\hat{x}=[\hat{x}_1,\ldots ,\hat{x}_n]\) is the state vector of the dynamical RBF neural networks, \(A=diag[a_1,\ldots ,a_n]\) is a diagonal matrix, with \(a_i>0\) being design constants, localized RBF neural networks \(\hat{W}_j^TS_j(x)=\sum \nolimits _{i=1}^N w_{ij}s_{ij}(x)\) are used to approximate the unknown \(\phi (x;p)\), where \(\hat{W}_j=[w_{1j},\ldots ,w_{Nj}]^T\), \(S_j=[s_{1j},\ldots ,s_{Nj}]^T\), for \(j=1,\ldots ,n\).

The following law is used to update the neural weights

$$\begin{aligned} \dot{\hat{W}}_i=\dot{\tilde{W}}_i=-\Gamma _iS_i(x)\tilde{x}_i-\sigma _i\Gamma _i\hat{W}_i \end{aligned}$$
(11)

where \(\tilde{x}_i=\hat{x}_i-x_i\), \(\tilde{W}_i=\hat{W}_i-W_i^*\), \(W_i^*\) is the ideal constant weight vector such that \(\phi _i(x;p)={W_i^*}^TS_i(x)+\epsilon _i(x)\), with \(|\epsilon _i(x)|<\epsilon ^*\) being the neural network modeling error, \(\Gamma _i=\Gamma _i^T>0\), and \(\sigma _i>0\) is a small value.

With Eqs. (8)–(10), the derivative of the state estimation error \(\tilde{x}_i\) satisfies

$$\begin{aligned} \dot{\tilde{x}}_i=-a_i\tilde{x}_i+\hat{W}_i^TS_i(x)-\phi _i(x;p)=-a_i\tilde{x}_i+\tilde{W}_i^TS_i(x)-\epsilon _i \end{aligned}$$
(12)
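To make the mechanism of Eqs. (10)–(12) concrete, the following sketch identifies the second-state dynamics \(\phi _2(x)=-x_1\) of a toy harmonic oscillator along its circular trajectory. The toy system, all gains and the grid of centres are illustrative assumptions of this sketch, not the values used in the paper.

```python
import numpy as np

# Toy system: x1' = x2, x2' = -x1, with trajectory x(t) = (cos t, sin t).
# We identify phi_2(x) = -x1 using the estimator (10) and update law (11).
grid = np.arange(-1.5, 1.51, 0.3)
centers = np.array([[c1, c2] for c1 in grid for c2 in grid])   # RBF centres
eta, a, Gamma, sigma = 0.3, 5.0, 20.0, 1e-4                    # design constants
dt, steps = 0.005, 40000                                       # Euler integration

W_hat = np.zeros(len(centers))
x_hat = 0.0
err_hist = []
for i in range(steps):
    t = i * dt
    x = np.array([np.cos(t), np.sin(t)])                  # measured state
    S = np.exp(-np.sum((centers - x) ** 2, axis=1) / eta ** 2)
    x_tilde = x_hat - x[1]                                # state estimation error
    x_hat += dt * (-a * x_tilde + W_hat @ S)              # estimator, Eq. (10)
    W_hat += dt * (-Gamma * S * x_tilde - sigma * Gamma * W_hat)   # Eq. (11)
    err_hist.append(abs(x_tilde))

err_early = np.mean(err_hist[: steps // 4])
err_late = np.mean(err_hist[-steps // 4:])
```

As the weights of the neurons along the trajectory converge, \(\hat{W}^TS(x)\) approximates \(\phi _2\) there and the estimation error \(\tilde{x}_2\) shrinks, which is the behaviour exploited for classification below.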

In the third step, by using the local approximation property of RBF neural networks, the overall system consisting of dynamical model (12) and the neural weight updating law (11) can be summarized into the following form in the region \(\Omega _\zeta \)

$$\begin{aligned} \left[ \begin{array}{c} \dot{\tilde{x}}_i\\ \dot{\tilde{W}}_{\zeta i} \end{array} \right] = \left[ \begin{array}{cc} -a_i&{}S_{\zeta i}(x)^T\\ -\Gamma _{\zeta i}S_{\zeta i}(x)&{}0 \end{array} \right] \left[ \begin{array}{c} \tilde{x}_i\\ \tilde{W}_{\zeta i} \end{array} \right] + \left[ \begin{array}{c} -\epsilon _{\zeta i}\\ -\sigma _i\Gamma _{\zeta i}\hat{W}_{\zeta i} \end{array} \right] \end{aligned}$$
(13)

and

$$\begin{aligned} \dot{\hat{W}}_{\bar{\zeta }i}=\dot{\tilde{W}}_{\bar{\zeta }i}=-\Gamma _{\bar{\zeta }i}S_{\bar{\zeta }i}(x)\tilde{x}_i-\sigma _i\Gamma _{\bar{\zeta }i}\hat{W}_{\bar{\zeta }i} \end{aligned}$$
(14)

where \(\epsilon _{\zeta i}=\epsilon _i-\tilde{W}_{\bar{\zeta }i}^TS_{\bar{\zeta }}(x)\). The subscripts \((\cdot )_\zeta \) and \((\cdot )_{\bar{\zeta }}\) are used to stand for terms related to the regions close to and far away from the trajectory \(\varphi _\zeta (x_0)\). The region close to the trajectory is defined as \(\Omega _\zeta :=\{Z|\mathrm {dist}(Z,\varphi _\zeta )\le d_{\iota }\}\), where \(Z=x, d_\iota >0\) is a constant satisfying \(s(d_\iota )>\iota \), \(s(\cdot )\) is the RBF used in the network, \(\iota \) is a small positive constant. The related subvectors are given as: \(S_{\zeta i}(x)=[s_{j1}(x),\ldots ,s_{j\zeta }(x)]^T\in R^{N_\zeta }\), with the neurons centered in the local region \(\Omega _\zeta \), and \(W_\zeta ^*=[w_{j1}^*,\ldots ,w_{j\zeta }^*]^T\in R^{N_\zeta }\) is the corresponding weight subvector, with \(N_\zeta <N\). For localized RBF neural networks, \(|\tilde{W}_{\bar{\zeta }i}^TS_{\bar{\zeta i}}(x)|\) is small, so \(\epsilon _{\zeta i}=O(\epsilon _i)\).

By the convergence result, we can obtain a constant vector of neural weights according to

$$\begin{aligned} \bar{W}_i=mean_{t\in [t_a,t_b]}\hat{W}_i(t) \end{aligned}$$
(15)

where \([t_a,t_b]\) with \(t_b>t_a>0\) represents a time segment after the transient process. Therefore, we conclude that accurate identification of the function \(\phi _i(x;p)\) is obtained along the trajectory \(\varphi _\zeta (x_0)\) by using \(\bar{W}_i^TS_i(x)\), i.e.,

$$\begin{aligned} \phi _i(x;p)=\bar{W}_i^TS_i(x)+\epsilon _{i2} \end{aligned}$$
(16)

where \(\epsilon _{i2}=O(\epsilon _{\zeta i})\) and subsequently \(\epsilon _{i2}=O(\epsilon ^*)\).

2.7 Classification mechanism

In this section, we present a scheme to classify normal, interictal and ictal EEG signals.

Consider a training dataset consisting of EEG signal patterns \(\varphi _\zeta ^k\), \(k=1,\ldots ,M\), with the kth training pattern \(\varphi _\zeta ^k\) generated from

$$\begin{aligned} \dot{x}=F^k(x;p^k)+v^k(x;p^k), \quad x(t_0)=x_{\zeta 0} \end{aligned}$$
(17)

where \(F^k(x;p^k)\) denotes the EEG system dynamics, \(v^k(x;p^k)\) denotes the modeling uncertainty, \(p^k\) is the system parameter vector.

As demonstrated in Sect. 2.6, the general EEG system dynamics \(\phi ^k(x;p^k):=F^k(x;p^k)+v^k(x;p^k)\) can be accurately derived and preserved in constant RBF neural networks \(\bar{W}^{k^T}S(x)\). By utilizing the learned knowledge obtained in the training stage, a bank of M estimators is constructed for the training EEG signal patterns as follows:

$$\begin{aligned} \dot{\bar{\chi }}^k=-B(\bar{\chi }^k-x)+\bar{W}^{k^T}S(x) \end{aligned}$$
(18)

where \(k=1,\ldots ,M\) is used to stand for the kth estimator, \(\bar{\chi }^k=[\bar{\chi }_1^k,\ldots ,\bar{\chi }_n^k]^T\) is the state of the estimator, \(B=diag[b_1, \ldots , b_n]\) is a diagonal matrix which is kept the same for all estimators, x is the state of an input test EEG signal pattern generated from Eq. (8).

In the classification phase, by comparing the test EEG signal pattern (standing for a normal, interictal or ictal EEG signal pattern) generated from EEG system (8) with the set of M estimators (18), we obtain the following test error systems:

$$\begin{aligned} \dot{\tilde{\chi }}_i^k=-b_i\tilde{\chi }_i^k+\bar{W}_i^{k^T}S_i(x)-\phi _i(x;p),\quad i=1,\ldots ,n,~~k=1,\ldots ,M \end{aligned}$$
(19)

where \(\tilde{\chi }_i^k=\bar{\chi }_i^k-x_i\) is the state estimation (or synchronization) error. We compute the average \(L_1\) norm of the error \(\tilde{\chi }_i^k(t)\)

$$\begin{aligned} \Vert \tilde{\chi }_i^k(t)\Vert _1=\frac{1}{\mathrm {T}_c}\int _{t-\mathrm {T}_c}^t|\tilde{\chi }_i^k(\tau )|d\tau ,~~~t\ge \mathrm {T}_c \end{aligned}$$
(20)

where \(\mathrm {T}_c\) is the cycle of EEG signals.

The fundamental idea of the classification between normal, interictal and ictal EEG signals is as follows: if a test EEG signal pattern is similar to a trained EEG signal pattern \(s~(s\in \{1,\ldots ,M\})\), the constant RBF network \(\bar{W}_i^{s^T}S_i(x)\) embedded in the matched estimator s will quickly recall the learned knowledge by providing an accurate approximation to the EEG system dynamics. Thus, the corresponding error \(\Vert \tilde{\chi }_i^s(t)\Vert _1\) will become the smallest among all the errors \(\Vert \tilde{\chi }_i^k(t)\Vert _1\). Based on this smallest-error principle, the appearing test EEG signal pattern can be classified. We have the following classification scheme.

Classification scheme If there exist a finite time \(t^s\), \(s\in \{1,\ldots ,M\}\), and some \(i\in \{1,\ldots ,n\}\) such that \(\Vert \tilde{\chi }_i^s(t)\Vert _1<\Vert \tilde{\chi }_i^k(t)\Vert _1\) for all \(k\ne s\) and all \(t>t^s\), then the appearing EEG signal pattern can be classified as pattern s.
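The estimator-bank comparison of Eqs. (18)–(20) can be sketched as follows. For brevity, the learned RBF networks \(\bar{W}^{k^T}S(x)\) are replaced by plain callables and the sliding-window norm of Eq. (20) by a total time-integrated error; both are simplifying assumptions of this sketch.

```python
import numpy as np

def classify(x_traj, dt, dynamics, b=5.0):
    """Bank-of-estimators classifier in the spirit of Eqs. (18)-(20):
    each candidate model drives one estimator, and the pattern whose
    estimator attains the smallest time-integrated L1 synchronization
    error is selected."""
    M = len(dynamics)
    chi = np.full(M, x_traj[0])          # all estimators start at the test state
    err = np.zeros(M)
    for t in range(1, len(x_traj)):
        for k, f in enumerate(dynamics):
            # Euler step of estimator k, Eq. (18)
            chi[k] += dt * (-b * (chi[k] - x_traj[t - 1]) + f(x_traj[t - 1]))
            err[k] += abs(chi[k] - x_traj[t]) * dt
    return int(np.argmin(err))
```

The matched estimator synchronizes with the test trajectory, while mismatched dynamics leave a persistent error of roughly the dynamics mismatch divided by b, so the argmin recovers the generating pattern.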

3 Experimental results

Experiments are implemented in MATLAB and run on an Intel Core i7-6700K 3.5 GHz computer with 32 GB RAM. We assign feature vector sequences to all the EEG signals in the Bonn database. Based on the method described in Sect. 2.5, features are extracted from the EEG time series, so that the input of the RBF neural networks is \(x=[ED_j^{PRC1^{D4}},ED_j^{PRC1^{A4}},ED_j^{PRC2^{D4}},ED_j^{PRC2^{A4}}]^T\). In order to eliminate scale differences between features, all feature data are normalized to \([-1, 1]\).

Several experiments are carried out to verify the effectiveness of the proposed method. The classification results are evaluated with both the 10-fold and leave-one-out cross-validation styles, with the data divided into training and test subsets. For 10-fold cross-validation, the data set is divided into ten subsets; each time, one of the ten subsets is used as the test set and the other nine subsets are combined to form the training set. For leave-one-out cross-validation, each time one EEG signal pattern is selected for classification and the remaining patterns are used for training. This process is repeated K times (where K is the number of EEG signal patterns), and the leave-one-out classification accuracy is calculated as the average classification accuracy over all of the individually left-out patterns.
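Both validation styles can be expressed as splits of the pattern indices; leave-one-out is simply k-fold with k equal to the number of patterns. In the sketch below the round-robin fold assignment is illustrative only, since the paper does not specify how patterns are assigned to folds:

```python
def kfold_indices(n, k):
    """Split indices 0..n-1 into k disjoint test folds; each fold is
    paired with the remaining indices, which form the training set."""
    folds = [list(range(i, n, k)) for i in range(k)]
    for test in folds:
        test_set = set(test)
        train = [j for j in range(n) if j not in test_set]
        yield train, test

# 10-fold: every pattern appears in exactly one test fold
splits = list(kfold_indices(100, 10))
assert sum(len(test) for _, test in splits) == 100

# leave-one-out is the special case k == n
loo = list(kfold_indices(5, 5))
print(len(loo), len(loo[0][1]))  # -> 5 1
```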

For the evaluation, six performance parameters are used: the Sensitivity (SEN), the Specificity (SPF), the Accuracy (ACC), the Positive Predictive Value (PPV), the Negative Predictive Value (NPV) and the Matthews Correlation Coefficient (MCC) (Azar and El-Said 2014). To perform well, a classifier must have a high classification accuracy, a high sensitivity and a high specificity (Chu 1999). A larger MCC value indicates better classifier performance (Azar and El-Said 2014; Yuan et al. 2007).
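These six parameters follow the standard confusion-matrix definitions; a minimal sketch with a hypothetical binary confusion matrix:

```python
import math

def metrics(tp, tn, fp, fn):
    """Compute SEN, SPF, ACC, PPV, NPV and MCC from the
    entries of a binary confusion matrix."""
    sen = tp / (tp + fn)                     # sensitivity (true positive rate)
    spf = tn / (tn + fp)                     # specificity (true negative rate)
    acc = (tp + tn) / (tp + tn + fp + fn)    # overall accuracy
    ppv = tp / (tp + fp)                     # positive predictive value
    npv = tn / (tn + fn)                     # negative predictive value
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return sen, spf, acc, ppv, npv, mcc

# toy confusion matrix: 95 TP, 90 TN, 10 FP, 5 FN
sen, spf, acc, ppv, npv, mcc = metrics(95, 90, 10, 5)
print(round(acc, 3))  # -> 0.925
```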

In the past literature, various approaches have focused on the classification of EEG signals between Sets A and E. However, the effectiveness of classification between other combinations of datasets has not been investigated thoroughly. It is therefore desirable to examine the ability of the proposed method to classify EEG signals for different combinations of the datasets (Z, O, N, F and S). To address this issue, 11 classification problems are constructed from the aforementioned datasets. All experiments described in Table 3 focus on distinguishing normal, interictal and ictal EEG signals: cases 1 to 8 deal with binary classification, while cases 9 to 11 address multi-class classification.

Table 3 Different experimental cases in the present study
Table 4 Performance of the proposed classification approach evaluated by 10-fold cross-validation method with Case 1: Z–S
Table 5 Performance of the proposed classification approach evaluated by 10-fold cross-validation method with Case 2: O–S
Table 6 Performance of the proposed classification approach evaluated by 10-fold cross-validation method with Case 3: N–S
Table 7 Performance of the proposed classification approach evaluated by 10-fold cross-validation method with Case 4: F–S
Table 8 Performance of the proposed classification approach evaluated by 10-fold cross-validation method with Case 5: NF–S
Table 9 Performance of the proposed classification approach evaluated by 10-fold cross-validation method with Case 6: Z–F
Table 10 Performance of the proposed classification approach evaluated by 10-fold cross-validation method with Case 7: ZONF–S
Table 11 Performance of the proposed classification approach evaluated by 10-fold cross-validation method with Case 8: ZO–NFS
Table 12 Performance of the proposed classification approach evaluated by 10-fold cross-validation method with Case 9: Z–N–S
Table 13 Performance of the proposed classification approach evaluated by 10-fold cross-validation method with Case 10: ZO–NF–S
Table 14 Performance of the proposed classification approach evaluated by 10-fold cross-validation method with Case 11: Z–O–N–F–S

The classification results for the different cases are presented in Tables 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 and 14 for both the 10-fold and leave-one-out cross-validation styles. Our study demonstrates improved accuracy in differentiating between normal, interictal and ictal EEG signals. Overall, the classification approach achieves good performance, indicating that the proposed pattern recognition system can effectively differentiate between different classes of EEG signals using nonlinear features and neural-network-based classification tools.

4 Discussion

The experimental results of this study demonstrate that normal, interictal and ictal EEG signals could be detected automatically by means of hybrid feature extraction methods and neural networks. The proposed scheme focuses not only on providing evidence to support the claim that interictal and ictal EEG signals demonstrate altered dynamics compared to normal EEG signals, but also on providing an automatic and objective method to distinguish between the three groups of EEG signals.

Recently, various methods have been reported in the literature to automatically detect normal, interictal and ictal EEG signals. It should be noted that all of the recent methods summarized in Table 15 were evaluated using 10-fold cross-validation.

For case 1 (Z–S), Isik and Sezer (2012) used tools including the Wavelet Transform (WT), Multilayer Perceptron (MLP) and Elman artificial neural networks (ANN), and the achieved classification accuracy was 96%. Du et al. (2012) extracted principal component features by applying principal component analysis (PCA) to 15 higher-order spectra (HOS) features. Several classifiers, including ANN, MLP, RBF network, random forest, rotation forest, logistic regression, model trees, simple logistic regression and bagging, were then employed to evaluate the classification performance of the proposed features; simple logistic regression achieved the highest accuracy of 94.5%. Zhang et al. (2018) combined fuzzy distribution entropy with wavelet packet decomposition, the Kruskal–Wallis nonparametric one-way analysis of variance and a k-nearest neighbor (KNN) classifier to classify the EEG signals, and the best achieved accuracy was 100%. With our proposed method, the achieved accuracy is 99%.

Table 15 Summary of classification performance (10-fold cross-validation style) obtained for some cases using the same dataset in the literature

For cases 2–4, Tawfik (2016) achieved classification accuracies of 85%, 93.5%, and 96.5%, respectively, with a combination of weighted permutation entropy and SVM. More recently, with the development of deep learning, Ahmedt-Aristizabal et al. (2018) used Recurrent Neural Networks (RNNs) based on Long Short-Term Memory (LSTM) units and achieved classification accuracies of 94.75%, 97.25%, and 96.5% for these cases, respectively. However, the success of deep learning depends on efficient formulations (Goceri 2018), and parameters such as the batch size must be chosen carefully (Goceri and Gooya 2018). In comparison, the accuracy achieved by our proposed method for cases 2–4 is 99.5%, 98.5% and 99.5%, respectively.

For cases 5–7, our proposed method achieves classification accuracies of 98%, 99.5% and 98%, respectively. For case 5 (NF–S), Joshi et al. (2014) utilized the fractional linear prediction technique together with an SVM classifier, and the reported classification accuracy was 95.33%. For the same case, Diykh et al. (2017) used a complex networks approach and reported a classification accuracy of 97.8%. For case 6 (Z–F), Jaiswal and Banka (2017) employed the local neighbor descriptive pattern (LNDP) and the one-dimensional local gradient pattern (1D-LGP) together with an ANN for classification and reported an accuracy of 99.90%. Kaya et al. (2014) used the one-dimensional local binary pattern (1D-LBP) to extract histogram features and fed them into a BayesNet classifier; the reported accuracy was 99.50%. For case 7 (ZONF–S), Kumar et al. (2014) used DWT-based fuzzy approximate entropy to extract features and fed them into an SVM classifier, achieving a classification accuracy of 97.38%. Mursalin et al. (2017) used an improved correlation-based feature selection method (ICFS) together with a random forest classifier and reported a classification accuracy of 97.4%.

For case 8 (ZO–NFS), Kaya et al. (2014) reported an accuracy of 93%. Acharya et al. (2018) used a 13-layer deep convolutional neural network (CNN) and reported an accuracy of 88.7%. In comparison, the achieved accuracy of our proposed method is 95.2%.

Case 9 (Z–N–S) is a multi-class classification problem with three classes. Jaiswal and Banka (2017) reported accuracies of 98.22% and 97.06% using LNDP and 1D-LGP, respectively. Kaya et al. (2014) reported a classification accuracy of 95.67% with 1D-LBP. Li et al. (2017) used the dual-tree complex wavelet transform (DT-CWT) to decompose EEG signals into five constituent sub-bands, from which the nonlinear features of the Hurst exponent (H), Fractal Dimension (FD) and Permutation Entropy (PE) were extracted. Four classifiers, including SVM, KNN, random forest and rotation forest, were then employed, and the reported classification accuracy was 98.87%. In comparison, the achieved accuracy of our proposed method is 99%.

Case 10 (ZO–NF–S) is another three-class classification problem. Wang et al. (2011) reported an accuracy of 97.13% using wavelet packet entropy features together with an artificial neural network classifier. Acharya et al. (2012) reported a classification accuracy of 99% with wavelet packet decomposition and a Gaussian mixture model. In comparison, the achieved accuracy of our proposed method is 99.4%.

Case 11 (Z–O–N–F–S) is a five-class classification problem. Zahra et al. (2017) decomposed the EEG signals into their multiple intrinsic scales using the multivariate empirical mode decomposition algorithm. After removing the intrinsic mode functions (IMFs) associated with noise and other unwanted artifacts, classification was performed on the remaining IMFs by feeding a feature vector into an artificial neural network framework. The reported accuracy was 87.2%. In comparison, the achieved accuracy of our proposed method is 94%.

Different from the methods discussed above, this study proposes a hybrid method to extract effective features based on ITD, DWT, PSR and ED. These features are fed into dynamical estimators consisting of RBF neural networks to classify the different classes of EEG signals. A comparison of the classification performance with other state-of-the-art methods on the same database is given in Table 15. The proposed method provides an average classification accuracy of 98.15% over the eleven cases under 10-fold cross-validation, which makes the reported performance robust. The method studied in this paper has the potential to serve as a supportive technical means alongside other approaches, such as fMRI, for the diagnosis of epilepsy.

Because the feature dimension is 4 and the number of neurons used in this study is 83,521, the computational load is relatively high. The ITD and DWT computations are also time-consuming, which increases the overall complexity. However, with the development of computer technology, more powerful workstations and high-performance computers can be used to increase computational capacity and reduce computing time, making the ITD and DWT computations feasible for real-time applications. It was therefore acceptable to run the experiments on an Intel Core i7-6700K 3.5 GHz computer with 32 GB RAM in the present study. In future work, the authors will optimize the algorithm structure and adopt new computing technology and equipment to improve computational performance and further reduce the complexity.

In general, the experimental results show that the proposed method achieves high accuracy in epilepsy detection on two-class, three-class and five-class classification problems, which demonstrates that our scheme is appropriate for problems with multiple classes. Automated analysis of epileptic seizure activity has strong clinical potential, and developing mobile health technologies for the disease can be even more important (Goceri and Songul 2018). Another important property is the method's computational simplicity once high-performance computers are employed, which reduces the complexity and makes deployment in clinical applications possible. Consequently, this new approach can better meet clinical demands in terms of efficiency, functionality, universality and simplicity with satisfactory accuracy. These characteristics make the method an attractive alternative for actual clinical diagnosis.

Several factors in the proposed method work together to improve the classification performance. ITD extracts the most important information in the EEG signals through the predominant PRCs. DWT decomposes the predominant PRCs into different frequency bands, which are used to construct the reference variables. PSR plots the EEG system dynamics along the selected db3 sub-bands (D4 and A4) of the PRC trajectory in a 3D phase space diagram and visualizes the EEG system dynamics. ED measures and derives the features that are fed into the RBF neural networks for the modeling, identification and classification of EEG system dynamics among normal, interictal and ictal EEG signals. However, some limitations remain, such as the principle for selecting the embedding dimension and time lag, and the relationship between classification performance and the PSR parameters. It would be of interest to develop a strategy for adaptive selection of the PSR parameters that yields the best classification performance.

5 Conclusions

In this study, effective feature extraction techniques including ITD, DWT, PSR and ED have been introduced for epileptic EEG signal classification. All of these techniques extract informative features for classification and are computationally simple and easy to implement. The results of this study indicate that pattern classification of EEG signals can offer an objective method to assess the disparity of EEG system dynamics between normal, interictal and ictal EEG signals. However, some limitations, such as the relatively small size of the database and the principle for selecting the embedding dimension and time lag, still need to be addressed. In future research, features from other methods, such as complete ensemble empirical mode decomposition (CEEMD), various entropies, the Hurst exponent, mean-frequency (MF) and root-mean-square (RMS) bandwidth, Lempel–Ziv complexity, the largest Lyapunov exponent, fractal dimension and other nonlinear features, can also be explored within the proposed framework to evaluate its classification performance. The results of the present study can be improved further by using a wider database with more patients and more varied features. In addition, the future scope of this research will include identification of seizure stages in addition to seizure detection.