Keywords

1 Introduction

The World Health Organization (WHO) report states that more than 2% of the world population are suffering from epilepsy each year worldwide; out of them, nearly 90% of people belong to the developing countries [90]. The reasons may be lack of awareness, cure, diet, neutrinos, vitamins supplements, etc. It can be observed that the proper diet plan and meditation cure most types of seizures.

The human brain commands the human nervous system that controls most of the human activities. It is a cluster of billions of nerve cells or neurons that are connected via a very complex network. Usually, the information exchange among the neurons takes place through electrical and chemical impulse signals that have a swift response time. It collects information from the sensory organs and transfers to the muscles.When the neurons clusters are hyper-synchronously fired at a time, this sudden surge of electrical activity causes involuntary and temporary disturbance of regular neural activity which can lead to seizures, or in other sense, anything that disturbs the regular pattern of neuron activity can lead to seizures. The recurrent unprovoked seizures are known as epilepsy [10].

The abnormality in brain activity is not always a sign of a seizure. It may happen that those persons are diagnosed with some specific syndrome. Otherwise, in some conditions, a seizure-affected person may continue to show standard electrical activity patterns. Hence, the seizures can be classified depending on the starting area of the disturbance inside the brain and how far it spreads. A single seizure may occur due to high fever or head injury, and this does not indicate that a person has epilepsy. Only when a person suffers from two or more seizures, they may be considered as symptoms of epilepsy. Although the seizures are the predominant symptom of epilepsy, but having a seizure does not necessarily mean that a person is affected by epilepsy [57, 90].

Epilepsy can happen from a variety of causes such as brain tumors, traumatic brain injury, strokes, heart attacks, genetically, abnormal blood vessel formation, etc., which do not lead to epileptic seizures. In most cases, the seizures are controlled with diet, exercise, and medication as many types of seizures do not cause brain destruction. When the seizures are occurring very frequently, that may cause brain damage that leads to epilepsy. Those seizures cannot be controlled with medication [10, 57]. The area of the brain where seizures occur first is known as the focus area or epileptic zone area. By performing surgery, the doctors remove the defined focus area of the brain where seizures are originated. Before performing surgery, the epileptic person is monitored intensively to pinpoint the exact location in the brain as the focus area [33].

There are numerous neuroimaging techniques that are used to explore the dynamics of brain activities at the pre-surgical stage. The computed tomography (CT), single-photon emission computed tomography (SPECT), positron emission tomography (PET), functional magnetic resonance imaging (fMRI), diffusion optical tomography (DOT), magnetic resonance imaging (MRI), magnetoencephalogram (MEG), and more are employed to reveal the structural abnormalities of the brain. These techniques map the damaged areas or abnormalities that may be the origin of seizures. Despite of rapid advancement of the neuroimaging techniques, the analysis of electroencephalogram (EEG) signals is the most frequently used technique for diagnosis of neurological diseases [33, 90].

The EEG signals are recorded in two ways, namely, extracranial and intracranial. The extracranial EEG signals are recorded by noninvasive electrodes placed on the scalp with the international 10/20 system, whereas the small and deep penetrating electrodes are used to record the intracranial signals. The intracranial EEG signals having highly nonstationary and nonlinear behavior including impulses and sharp spikes as compared to the extracranial EEG signals. The prediction, detection, and occurrences of epilepsy by visualization of lengthy EEG signals are a soporific and time-consuming process for Neuro-experts. Hence, the prime challenge is to develop a signal processing algorithm which can extract substantial information about the nonlinear dynamics of the brain’s electrical activities. In this chapter, a computer-aided nonlinear higher-order statistics (HOS)-based method has been proposed that leads to an excellent seizure classification accuracy on the Bonn university EEG database [7].

The layout of the present chapter is as follows: Sect. 10.2 contains the literature study about the existing seizures classification algorithms. The studied database is briefly illustrated in Sect 10.3. Section 10.4 contains the proposed methodology which includes brief descriptions about higher-order spectra, feature extraction, dimension reduction algorithm, and the support vector machine (SVM) classifier. The discussion on the achieved results is given in Sect. 10.5, whereas the conclusion is included in Sect. 10.6.

2 Related Work

The literature reveals that many research papers have been published on automated classification of epileptic seizure based on the EEG signals.

Earlier the linear prediction (LP) technique is used for seizure classification. The LP algorithm computes the future values which depend on the present and past values behaviors. Pradhan et al. [63] used the LP algorithm for data compression, storage, and transmission of EEG signals. It is relevant to extract the significant features that reveal the underlying nonlinearity and complexity of the EEG signals. Altunay et al. [6] computed the LP error for the epileptic EEG signal detection. In the series of LP-based EEG signal classification algorithm, Joshi et al. [34] proposed the fractional LP method. The energy of the original signal and LP fractal model error are used as measurements for the classification of ictal and seizure-free EEG signals. However, the proposed features on the LP algorithm are not suitable to capture the required information about the nonstationary EEG signals, which yields a low classification accuracy. Some model-based algorithms are also proposed for the EEG signals classification [36, 37, 83]. The autoregressive (AR) model deals with the prediction of the future values based on the present and previous values, which also helps to compute the transfer function. The AR coefficients of the EEG signals can be used as inputs to different classifiers. Ubeyli et al. [83] used the least square-SVM (LS-SVM) classifier to classify the EEG signal based on AR coefficients.

Usually, the local binary pattern (LBP) is applied for image classification or in pattern recognition. It extracts some suitable patterns that explore the pattern characteristics [40]. Kaya et al. [35] extracted the uniform and non-uniform 1D-LBP features from the studied signals and employed those attributes as inputs to different classifiers to evaluate the performance efficiency. The authors used five different classifiers, namely, the BayesNet, artificial neural network (ANN), functional tree (FT), SVM, and logistic regression (LR). They got maximum classification accuracy with the BayesNet classifier. Tiwari et al. [80] employed the 1D-LBP for automated epilepsy detection. They detect the key-points features of the EEG signals by applying the difference of Gaussian (DoG) filter at multiple scales. The 1D-LBP algorithm is applied at these detected key-points, and the histogram of the LBPs are employed as features. The SVM classifier with radial basis (RBF) kernels is used to distinguish the seizure classes. The proposed algorithm achieved excellent classification accuracy in both binary and three seizures classes classification cases.

Many researchers proposed the spectral analysis techniques to explore the nonlinear dynamics of EEG signals. Polat and Gunes [60] used the non-parametric spectral analysis technique of EEG signals for the purpose of epilepsy detection. The spectral estimation had been performed with various methods, and the spectral coefficients are discriminated with the decision tree classifier. They extended their work with same parameters and classified them with the artificial immune recognition systems [61]. Faust et al. [19] proposed the frequency domain parametric method for automatic identification of epileptic seizure based on EEG signals.

The frequency domain techniques alone are not often suitable for analysis of non-stationary signals. As it does not reveal the time and frequency information simulteneously. To overcome this problem, many researchers proposed the time-frequency (TF) analysis methods that allow an analysis of both parameters (time and frequency) in a single frame. The TF techniques are used by numerous authors [32, 656975, 81, 82] for analysis of the EEG signals vis-a-vis the brain activities. The TF methods decompose the signal at time and frequency axes simultaneously. Samiee et al. [65] proposed the rational discrete short-time Fourier transform (DSTFT) for epileptic seizure classification. They measured five absolute values of the statistical parameters (standard deviation, mean, median, minimum value, maximum value) from the decomposed DSTFT coefficients. They achieved 98.1% classification accuracy with the multilayer perceptron (MLP) classifier to classify sezures and non-sezures EEG signals. In the DSTFT, a fixed-length window does not provide an accurate analysis of time in frequency frame and vice versa. Sharma and Pachori [69] proposed the improved eigenvalue decomposition of the Hankel matrix and Hilbert transform (IEVDHM-HT) technique for the TF representation. The studied signal is decomposed into sub-signals by using the IEVDHM with a specific criterion such as eigenvalue selection, number of iterations, etc. The Hilbert transform (HT) algorithm is applied on each sub-band to extract the instantaneous amplitude and frequency parameters to represent the studied signal on the TF plane. The Rényi entropy is measured from the TF matrix, and it is used as features for seizure classification. They achieved 99.41% classification accuracy for seizure-free and seizure classes when the length of the studied signals is 4080 points. Fu et al. [21] used the Hilbert marginal spectrum (HMS) analysis technique for seizure classification. The HMS is derived from the empirical mode decomposition (EMD) method which decompose the non-stationary multicomponent signals into a group of sub-band signals known as the intrinsic mode functions (IMFs). The spectral entropies and energy are measured from these sub-bands and used as inputs to the SVM classifier for EEG signal classification.

Iscan et al. [32] combine time domain and frequency domain attributes for seizures classification. The cross-correlation (CC) and the power spectral density (PSD) are computed from the studied signals. The four statistical features, namely, the peak value, equivalent width, centroid and mean square abscissa, are measured from the CC of the analysis signals, while the total power is computed for each EEG sub-band signal (δ, θ α, β, γ). Hence, the combined features are used for the classification of normal and epileptic EEG signals. Various classifiers, namely, the Naive Bayes (NB), k-nearest neighbor (KNN), SVM, least square support vector machine (LS-SVM), Parzen, Fisher discriminant analysis (FDA), quadratic classifiers, binary decision tree (BDT), and nearest mean, are used to classify the labeled combined attributes. They achieved 94.94%, 97.97%, 88.88%, 100%, 91.91%, 89.89%, 100%, 98.98%, and 91.91% classification accuracy, respectively. Tzallas et al. [81, 82] used the time-frequency (TF) analysis techniques for automatic EEG signal classification for different problems such as seizure classification with the ANN [81] and epileptic seizure detection [82].

With regard to the TF resolution property, the wavelet transform (WT) is a powerful tool for investigation of non-stationary signals. It provides a time-varying decomposition of EEG signals which makes possible to capture the transient features of the studied signals. Hence, various wavelet-based techniques and their variants are proposed for exploration of nonlinear dynamics of the EEG signals. Subasi used the discrete wavelet transform (DWT) for the EEG signals classification. The mixture of the expert model is used as a classifier to classify the seizures-labeled wavelet coefficients [77]. They extended their work and applied various data dimensionality reduction techniques such as the principal component analysis (PCA), independent component analysis (ICA), and linear discriminative analysis (LDA) for reducing the dimensionality of the wavelet coefficients.  The resultant wavelet coefficient matrix is fed to the SVM classifier to discriminate seizures classes [78]. Acharya et al. [1] computed different nonlinear features from the DWT coefficients that are evaluated by the SVM classifier. The multifractal analysis technique and WT had been proposed for classification of EEG signals [84], while the phase-space reconstruction (PSR) with Euclidean distance (on wavelet coefficients) methods is used to distinguish healthy and epileptic EEG signals [43]. Kumar et al. [39], Guo et al. [26], and Ocak et al. [50] computed the approximation entropy (ApEn) from the wavelet coefficients. The ANN and the optimal threshold value classifiers are used to classify the computed features. The results obtained in [50] reveal that the detail coefficients of the first level provided the best classification accuracy of 96.65% using the ApEn as a feature.

Guo et al. [24, 25] proposed various measurements for the seizures-labeled EEG signal classification. The relative wavelet energy [24] and line length [25] attributes are measured from the decomposed signals that are classified by the ANN classifier. The wavelet packet entropy in [86] and the mixed band wavelet chaos neural network in [22] are used for epileptic seizure detection. Orhan et al. [51] proposed the DWT for EEG signal decomposition into sub-bands. The K-mean clustering techniques are employed to separate the features according to the labeled classes. These labeled clustered data are classified using the MLP neural network (MPNN) model. Zhou et al. [95] decomposed the EEG signals by using the DWT method up to three, four, and five levels of decomposition. The different features, namely, lacunarity and fluctuation indexes, are measured from these decomposed scaled signals. The lacunarity is a scale-dependent feature that measures heterogeneity, while fluctuation index reflects the intensity fluctuation of EEG signals. The Bayesian linear discriminant analysis classifier is used for labeled feature classification. The proposed method achieved an excellent classification accuracy of the order of 96.67% for intracranial EEG signal classification.

Many authors proposed multiple variants of the WT, namely, the multiwavelet transform [26], orthogonal wavelet filter banks [11, 12], tunable Q-wavelet transform [14, 29, 58, 68], wavelet packet decomposition (WPD) [4], dual-tree complex wavelet transformation (DTCWT) [17, 59], empirical wavelet transform [13], flexible analytic wavelet transform [70], and new frequency slice wavelet transform [79], etc., for seizure classification based on the EEG signals. Many variants of the WT extract relevant information from the nonlinear and nonstationary EEG signals. However, the obtained results may vary with a change of the basis wavelet; also the computational complexity increases with an increase of decomposition levels.

Similar to the WT, the EMD is another nonlinear algorithm that identifies the subtle information regarding seizure classification and it is easy to use [5, 8, 20, 44, 52, 53, 67]. The EMD is a data decomposition method that decomposes an arbitrary signal into the IMFs. It is a data-dependent TF analysis method that is characterized by the Hilbert-Huang transform (HHT) algorithm [31]. Fu et al. [20] used the HHT-based TF representation (TFR) method for seizure and non-seizure EEG signal classification. The TFR is simulated as a time-frequency image (TFI), and the segmentation of these images has been done based on the frequency bands of rhythms presented in the EEG signals. The different statistical features up to fourth order are measured from the pixel intensity of the TFI. These attributes are evaluated by the SVM classifier with RBF kernel function. They reported 99.125% classification accuracy with the θ-rhythm of EEG signals. Alam et al. [5] computed the same features as in [20] directly from the IMFs. The collected features are classified by the ANN classifier that yields 100% classification accuracy. Bajaj and Pachori [8] proposed the EMD-based algorithm for epileptic seizure detection. The instantaneous signal area is computed from the trace of the windowed IMFs. They achieved 90% sensitivities with 24.25% error rate detection. Pachori [53] uses the EMD algorithm for ictal and seizure-free EEG signal classification, where the decomposed IMFs are transformed in terms of the Fourier Bessel (FB) expansion, and the mean frequency of FB expansion is considered as features.

Pachori et al. [54,55,56] extracted different features from IMFs and inputted them into various classifiers to classify labeled EEG signals. The second-order difference plot (SODP) is measured from the IMFs, and the area of 95% confidence ellipse (measured from the SODP) is considered as a feature [55]. They obtained 97.72% classification accuracy to discriminate epileptic seizure from seizure-free EEG signals. They extended their work in [56], where they measured two areas, first one corresponding to the SODP, while the other for the graph obtained when the IMFs of the analytic signal are represented in the complex plane. Sharma et al. [67] classify the epileptic seizures based on the phase-space representation (PSR) of IMFs. Oweis et al. [52] used the HHT algorithm to decompose the EEG signal into the IMFs, and the instantaneous frequency and amplitude are computed from each decomposed signal. They observed that the starting three or four modes are highly determinant by using the hypothesis testing. The hypothesis testing used the t-test with two different p-values that allow reducing the insignificant feature space. The proposed algorithm is evaluated on 25 healthy and 25 seizure EEG signals. The ANN is used as a classifier that yields 94% classification accuracy, while the 80% accuracy is obtained with the multivariate empirical mode decomposition (MEMD) method. However, the EMD does not correctly extract the low-energy components from the nonlinear EEG signals; also, the EMD suffered from mode mixing problem that can be temporarily reduced by the ensemble empirical mode decomposition (EEMD) [91]. As compared to the EMD and EEMD, the complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) gives an accurate reconstruction of the EEG signal without mode mixing that allows a better spectral separation of the mode functions. Hassan et al. [28] proposed the CEEMDAN method for decomposition of the EEG signals into the IMFs. The different statistical features, namely, the average, variance, and kurtosis are computed from these IMFs. The ANN classifier is used to separate the labeled classes. They achieved the classification accuracy of order 98.87% for normal (eye open) and non-seizure including normal eye closed and 86.37% for three categories, e.g., seizure, seizure-free, and healthy EEG signals. The variational mode decomposition (VMD) algorithm is proposed in [94] for automated seizure detection based on the EEG signals. The VMD can extract relevant low-energy component information from the raw EEG signals by decomposing it into a fixed number of band-limited IMFs (BLIMFs).

To explore the subtle information about the underlying nonlinear dynamics from the EEG signals, many authors used various nonlinear parameters extraction methods with different algorithms. The recurrence quantification analysis (RQA) visualizes the recurrence behavior of the phase-space trajectory of dynamical systems. Acharya et al. [3] used the RQA for automatic epilepsy EEG signal detection. They extended their work with another nonlinear parameter called the Hurst parameter [2]. The Hurst parameter customarily used to measure the long-term memory of the nonlinear signal. The short-term maximum Lyapunov exponent (STLmax) that measures the rate of divergence, is used to know the dynamic behavior of the EEG signals [23, 45]. To measure the complexity of the studied signals, various entropies, such as the ApEn, Shannon entropy, Fuzzy entropy Renyi entropy, log energy entropy, and more are used. Srinivasan et al. [76] and Zhang et al. [93] used the ApEn as a feature, and the labeled features are classified by the ANN and the SVM classifiers, respectively. Guo et al. [25] proposed the line length features derived from the DWT coefficients. The line length features are used to capture the changes in the waveform dimensionality. These are very sensitivity for signal amplitude and frequency variations. The EEG signals are decomposed up to five levels of decomposition with the DWT corresponding to various frequency bands. The line length features are measured from each sub-signal and classified by using the ANN classifier. The proposed algorithm achieved 99.60% for normal (eye open) and seizure, 97.75% for seizure and seizure-free including eye open, and 97.77% for seizure and non-seizure classification accuracy for different EEG signal sets.

Wang et al. [87] proposed the adaptive learning algorithm, whereas Shin et al. [73] used the sparse representation model for EEG signal classification. Besides the single-scale entropy measurement, the multiscale entropy is also registered in literature for the EEG signal based seizures classification. The single-scale entropy measurement is not able to quantify the underlying dynamics of the nonlinear signals typically when the signals are generated from the biological system. Labate et al. [41] proposed the multiscale entropy as a feature for measuring the complexity in Alzheimer’s disease. They extended the same concept for the EEG signal classification with the multiscale permutation entropy. The features are classified by the SVM classifier with various kernel functions such as linear, RBF, and sigmoidal, and they obtained 78%, 95%, and 91% classification accuracy, respectively [42]. Guo et al. [27] proposed the genetic programming (GP) algorithm for the EEG signal classification. The GP automatically extracted the features from the original feature database. It not only improves the performance of discrimination of the classifier but at the same time reduces the dimensionality of feature space. The DWT is applied on the raw EEG signals, and the mean, standard deviation, energy, curve length, and skewness are measured as original features from each sub-band signal that are fed to the GP algorithm. The GP generated the new dimension features having a substantial probability of discrimination capability. These GP generated features are classified with the KNN classifier which results in 99.20% classification accuracy. There are various learning based algorithms, namely, the random forest, extreme machine learning [92], ANN [88], recurrent neural network (RNN) [23], etc., that are also used for the EEG signal classification.

3 Database

In the study, the web-available EEG signal database has been considered that is recorded in the University of Bonn, Germany [7]. The database is the collection of five sets of EEG signals, namely, Z, O, N, F, and S. Each set contains 100 single-channel EEG time series of 23.6 sec duration. These signals are the subset of 128-channel EEG signals that are selected by the neuro experts. All these signals are digitalized at 173.61 Hz sampling rate using 12-bit resolution. A band-limited filter (0.53–40 Hz) is used to remove the artifact due to muscle activities. The EEG time series sets Z and O are recorded extracranially on five healthy persons during the relaxed state with eyes open (set Z) and eyes closed (set O) condition, respectively. However, the remaining time series sets N, F, and S are recorded intracranially from five epileptic patients in their pre-surgical evaluation of epilepsy. Sets N and F consist of seizure-free time series. The signals of set N are collected from the hippocampal formation of the opposite hemisphere of the brain, while the set F time series are recorded from the epileptogenic zone, whereas the intracranial time series of set S are collected from the epileptic zone during seizure activity. All signals are collected using the standard 10–20 electrode placement scheme.

In brief, the different EEG signals groups can be summarized as:

  • O – Normal signal set, persons are relaxed and in the awake state with eyes closed.

  • Z – Normal signal set, persons are relaxed and in the awake state with eyes open.

  • N – Seizure-free signal set recorded from the hippocampal formation of the opposite hemisphere of the brain.

  • F – Seizure-free signal set recorded from the epileptogenic zone.

  • S – Signal set collected from the epileptic zone during seizure activity.

In this chapter, the three classification problems have been considered as follows:

  • Case 1. Seizure (S) versus normal eyes-open (Z)

  • Case 2. Seizure (S) versus all non-seizure (FNZO)

  • Case 3. Seizure (S) versus seizure-free (FN) versus normal (ZO)

Figure 10.1 shows the EEG time series from each set. The Z and O are the healthy, F and N are the seizure-free, while S represents the seizure EEG signal, respectively. In the broad scene, S is known as the seizure EEG signal, while the remaining (ZOFN) is known as the non-seizure EEG signals.

Fig. 10.1
figure 1

Signals from each set of database

4 Proposed Methodology

In this study, a nonlinear higher-order spectra (HOS) based algorithm is proposed for the seizure-labeled EEG signal classification. Figure 10.2 depicts the block diagram representation of the proposed algorithm. The EEG signals are highly nonlinear in nature. Hence, to extract the relevant information from such signals, the higher-order spectruma, namely, bispectrum, is used. The bispectrum reveals the nonlinearity present in EEG signals and may better illustrate the dynamics of the nervous system. It helps to the diagnosis of brain diseases by understanding of neurological processes. The bispectrum contains the six symmetries along with various directions. Thus, the principal region has been extracted to avoid the irrelevant mathematical calculation. The numerous features are extracted from the principal region that capture the underlying dynamics of the studied signals. As the studied dataset contains five sets of different classes, each consisting of a hundred long EEG time series,hence, the dimension of the extracted features is much high; also the extracted features may contain some irrelevant features that may degrade the algorithm performance. A dimensionality reduction technique known as the locality sensitive discriminant analysis (LSDA) is used to moderate feature space dimensionality. It ranked the features according to the probability of separation. These ranked features are then subjected to the supervised learning SVM classifier with RBF kernel.

Fig. 10.2
figure 2

Block representation of proposed algorithm for epileptic seizure detection

4.1 Higher-Order Spectra

The nonlinearity analysis of a system operating under a random input has been quite extensively done for many years [15]. The HOS are initially generated for the ergodic random processes, and later, it has been expanded to the deterministic signals [15, 64]. In the real-time signal processing, the researchers are mainly dealing with the nonlinear, non-Gaussian, and nonstationary signals. The second- or lower-order statistics do not adequately analyze these types of signals. Also, the considered noise in signal processing is assumed to be Gaussian distributed and additive. The literature reveals that the HOS of order higher than two is commonly used for the analysis of nonstationary and non-Gaussian signal [15, 48, 64]. The HOS can reveal in terms of moments and cumulant functions. The cumulant function is defined as the logarithm of the moment function. For a zero mean signal, both are equal up to order three. The HOS have some unique properties that help for the analysis of non-Gaussian signals; such as, the higher-order (three or more) cumulants of the Gaussian distribution are equal to zero [47, 49]. Figure 10.3 illustrates the bispectrum of each set of the EEG signals and the principal region of bispectrum. The third-order cumulant (ToC) is infinitely differentiable and convex, that allows analysis of the non-minimum-phase and phase couple signals; also, it is scale- invariant. Hence, the ToC of a non-Gaussian signal with an additive uncorrelated Gaussian noise filters out the Gaussian noise part, and represents only the ToC of the signal [47]. Sharma et al. [71] used the residual-based higher-order statistics algorithm for the classification of focal and nonfocal EEG signals. The proposed method measured the disturbance in the labeled EEG signals with various statistical attributes. They extended their work and proposed a center slice algorithm of higher-order statistics for automated glaucoma detection based on fundus images [72].

Fig. 10.3
figure 3

Bispectrum magnitude corresponding to the various set of EEG signals and the principal region of the bispectrum

Let y(τ) be a zero mean random process where τ = 1, 2, …L. The ToC can be represented as follows [47]:

$$ {C}_y^3\left(a,b\right)=E\left[y{\left(\tau \right)}^{\ast }\ y\left(\tau +a\right)\ y\left(\tau +b\right)\right] $$
$$ \kern3.6em =E\left[y\left(\tau \right)\ y{\left(\tau +a\right)}^{\ast }\ y\left(\tau +b\right)\right] $$
(10.1)
$$ \kern3.6em =E\left[y\left(\tau \right)\ y\left(\tau +a\right)\ y{\left(\tau +b\right)}^{\ast}\right] $$

where E and “∗” denote the expectation operator and complex conjugate, respectively. The Fourier transform of ToC is defined as bispectrum [49] and is given below:

$$ B\left({\zeta}_1,{\zeta}_2\right)= FT\left[{C}_y^3\left(a,b\right)\right] $$
(10.2)
$$ or $$
$$ B\left({\zeta}_{1,}{\zeta}_2\right)=E\left[Y{\left({\zeta}_1\right)}^{\ast }\ Y\left({\zeta}_2\right)\ Y\left({\zeta}_1+{\zeta}_2\right)\right] $$

where FT denotes the Fourier transform, Y(ζ) is the discrete-time Fourier transform of y(τ), and ζ1and ζ2are the normalized frequency components that lie between 0 and 1. The bispectrum of a real signal is uniquely defined with the triangle 0 ≤ ζ2 ≤ ζ1 ≤ (ζ1 + ζ2) ≤ 1, known as the principal domain [64].

4.2 Feature Extraction

The suitable feature extraction is an essential part of any classification algorithm. Otherwise, it may lead to underperformance of the classifier. To explore the characteristics of EEG signals, various parameters, namely, the maximum value, energy, mean, variance, normalized bispectrum entropy (NbE) up to order three, interquartile distance (IqD), and spectral flatness are measured from the principal domain of the bispectrum [38, 66, 74]. These statistical features are very significant and less computationally complex.

Let zi be the random variable that denotes the elements of bispectrum principal domain, and let p(zi); i = 0, 1, …, η − 1 be the probability of occurrence of zi, where η is the number of distinct random values. Then, the features are defined as follow:

$$ Maximum\ value\ of\ {z}_i=\operatorname{Max}\ \left({\mathrm{z}}_i\right) $$
(10.3)
$$ Energy=\sum \limits_{i=1}^{\eta }{\left|{z}_i\right|}^2 $$
(10.4)
$$ Mean=\sum \limits_{i=1}^{\eta }{z}_i\ p\left({z}_i\right) $$
(10.5)
$$ Variance=\sum \limits_{i=1}^{\eta }{\left({z}_i\hbox{-} \mathrm{Mean}\right)}^2\ p\left({z}_i\right) $$
(10.6)
$$ NbE1=-\sum \limits_i{p}_i\ \log \left({p}_i\right);\kern1em {p}_i=\frac{\left|B\left({\zeta}_1,{\zeta}_2\right)\right|}{\sum_{\Omega}\left|B\left({\zeta}_1,{\zeta}_2\right)\right|}\kern0.75em $$
(10.7)
$$ NbE2=-\sum \limits_i{q}_i\ \log \left({\mathrm{q}}_i\right);\kern1em {\mathrm{q}}_i=\frac{{\left|B\left({\zeta}_1,{\zeta}_2\right)\right|}^2}{\sum_{\Omega}{\left|B\left({\zeta}_1,{\zeta}_2\right)\right|}^2}\kern0.75em $$
(10.8)
$$ NbE3=-\sum \limits_i{r}_i\ \log \left({\mathrm{r}}_i\right);\kern1em {\mathrm{r}}_i=\frac{{\left|B\left({\zeta}_1,{\zeta}_2\right)\right|}^3}{\sum_{\Omega}{\left|B\left({\zeta}_1,{\zeta}_2\right)\right|}^3}\kern0.75em $$
(10.9)

The IqD is a measure of variability, based on dividing the dataset into quartiles [74] and can be expressed as follows:

$$ IqD=\gamma -\nu $$
(10.10)

where \( P\left(\gamma \right)=\frac{3}{4} \), \( P\left(\nu \right)=\frac{1}{4} \), and P is the sample cumulative distribution function of the diagonal of the center slice.

$$ Spectral\ flatness=\frac{GM\left({z}_i\right)}{AM\left({z}_i\right)} $$
(10.11)

where GM and AM represent geometric and arithmetic mean, respectively [74].

4.3 Locality Sensitive Discriminant Analysis (LSDA)

The LSDA is a supervised dimensional reduction technique. It is based on the concept of maximizing the local margin between different classes while minimizing the local margin within the classes. It splits the data into the within-class and between-class graphs according to its class labels using the nearest neighbor graph (NNG) [16]. The transformation or mapping from the original feature space to a new feature space is allowed to maximize the probability of separation among different classes. Hence, each separated class data is mapped by using a linear transformation matrix such that it preserves the local as well as discriminant neighborhood information. Let l data points {x1, x2, …, xl} ⊂ Rbe sampled from a large dataset. Assume that Ns(xi) is a group of neighbors having the same label, while Nd(xj) is a group of neighbors having different labels such that Ns(xi) ∩ Nd(xj) = ϕ; where ϕ is the null space and Ns(xi) ∪ Nd(xj) = N(xl). The LSDA ranked the resultant mapped features and arrange them from the highest probability of discrimination to the lowest probability of discrimination.

4.4 Support Vector Machine

The SVM is a most popular supervised machine learning technique commonly used for data classification and regression. It maximizes the distance between the support vectors such that the probability of data points error becomes minimum. The slack variable is introduced for compensating the data points error. The support vector of a class represents the boundary that passes through the far most data point that is well separated from another class data [18, 30, 85]. The hyperplane is a virtual perpendicular plane between the support vectors. The SVM is the most popular machine learning algorithm having high generalization performance without any additional a priori knowledge even in high dimensional input space. Different linear or nonlinear kernels are used to map the old feature into a new feature space to achieve the lowest probability of misclassification error [89].

Let Xj be a training vector such that Xj ∈ Rn; j = 1, 2, …, l in two classes, labeled asy ∈ {+1, −1}t. The SVM separates the training vector such as [18]:

$$ {\displaystyle \begin{array}{l}\kern1.5em \underset{\omega, b,\xi }{\min}\kern2em \frac{1}{2}{\omega}^T\omega \kern0.5em +\kern0.5em \mathrm{C}\ \sum \limits_{j=1}^l{\xi}_j\\ {} subject\ to\kern0.75em {\mathrm{y}}_j\left({\omega}^T\phi \left({X}_j\right)+\mathrm{b}\right)\ge 1\hbox{-} {\xi}_j\end{array}} $$
(10.12)

where ϕ is defined as mapping space with an error cost,C ≻ 0, Xj ≥ 0. The above equation is a constraint optimization problem that can be solved with the Lagrangian method. Hence, the optimization function can be written as follows:

$$ {\displaystyle \begin{array}{l}\kern1em \underset{\lambda }{\min}\kern1.25em \mathrm{f}\left(\lambda \right)=\frac{1}{2}{\lambda}^T Q\lambda \kern0.5em \hbox{-} \kern0.5em {\mathrm{e}}^T\lambda \\ {} subject\ to\kern0.75em 0\le {\lambda}_j\le \mathrm{C};\kern1.25em {\mathrm{y}}^T\lambda =0\end{array}} $$
(10.13)

where Qjk ≡ yj yk ϕ(Xj)T ϕ(Xk) and e is the vector of all ones. Let K(Xj, Xk) = ϕ(Xj)T ϕ(Xk) be the kernel function; then, the generalized function of the SVM classifier can be defined as [85]:

$$ {\beta}_{\kappa }=\mathit{\operatorname{sign}}\left[\sum \limits_{j\in l}{\lambda}_j\ {\mathrm{y}}_j\ K\left({X}_j,{X}_k\right)+\mathrm{b}\right] $$
(10.14)

where λj is a nonzero Lagrange multiplier and b denotes the bias.

5 Results and Discussion

In this study, a nonlinear third-order spectrum, namely, bispectrum algorithm, is proposed for seizures classification based on the EEG signals. To explore the seizure dynamics, the EEG signals undergo the bispectrum analysis, and the bispectrum of the EEG signal reveals that significant alteration takes place with the EEG signals classes. The magnitude of the bispectrum of seizure-EEG signal (S) is more spread than other classes bispectrum magnitude, also its magnitude is of the order of 109 which is high as compared to the bispectrum magnitude graph of seizure-free and normal EEG signals. It can be illustrated that spectra magnitude of standard EEG signal has significant high peaks at a higher frequency as compared to the spectra of seizure-free EEG signals.

From the principal region of bispectrum-magnitude of the EEG signals, several statistical features are computed to extract the relevant information about the brain electrical activities. In this study, two binary-classes classification and one three- classes classification have been performed. The extracted features are subjected to the LSDA algorithm to reduce the feature matrix dimension and also to rank the features from the higher value to lower value subjected to the probability of discrimination. The ranked features are fed to the SVM classifier with RBF and ten-fold cross-validation strategy. The SVM classifier with RBF kernels achieved the maximum accuracy in all cases which are 98.67% for S-Z, 96.30% for S-FN-ZO, and 97.53% for S-FNZO.

In literature, various algorithms are listed for seizure classification. Table 10.1 illustrates various seizure classification methods to investigate brain electrical activities. Peker et al. [59] used the DT-CWT for decomposition of the EEG signals into low-dimensional wavelet coefficients. These features are discriminated with the complex-valued neural network. Their proposed technique is suited for binary as well as three-classes classification. They achieved 100% for S-Z, 99.15% for S-FNOZ classes, and 98.28% and 99.30% classification accuracy for three-classes, namely, S-FN-ZO and S-F-Z, respectively.

Table 10.1 A comparison of performances of the various methods for seizure classification

Bao et al. [9] employed the power spectral features, Higuchi fractal dimension, Petrosian fractal dimension, and Hjorth parameters as features that were classified with probabilistic neural network (PNN) classifier. They registered 97.00% classification accuracy for differentiating S-from NF-labeled signals. The LP method for seizure classification is used by Joshi et al. [34] and Altunay et al. [6]. The LP energy is used as a feature in [6], while the fractional LP error energy and fractal model signal energy are computed from the EEG signals to be used as features in [34]. They obtained 94% and 95.33% classification accuracy for seizures and seizure-free (S-NF) datasets with the threshold value classifier and the SVM classifier, respectively. The permutation entropy (PE) is used as a measure to distinguish the epileptic EEG signals from the seizure-free EEG signals [46]. They classified subsets S and N with the SVM classifier to the obtained classification accuracy of order 88.83%. Polat and Gunes [60] proposed the non-parametric spectral estimation method for the EEG signal classification. They reported 98.72% classification accuracy with the decision tree (DT) classifier, while the same group achieved a maximum classification accuracy of 100% when the principal component analysis (PCA) algorithm is implemented for dimension reduction of the fast Fourier transform (FFT) coefficient with the artificial immune recognition system classifier [61]. In [62], the authors proposed the AR model for EEG signal representation. The PSD vector is considered as a feature. The distance-based data reduction (DBDR) algorithm is applied to reduce the dimension of the attribute vectors. The obtained features are classified with the C4.5 decision tree classifier. They achieved 99.32% classification accuracy for S-O datasets. Tiwari et al. [80] proposed the 1D-LBP algorithm to extract the relevant information for seizure classification. They obtained 99.45% for S-NF, 99.31% for S-FNZO, and 98.80% for ZO-FN-S classification accuracy when the sparse LBP features are classified by the SVM classifier. Kaya et al. [35] proposed the uniform and non-uniform 1D-LBP feature extraction method from the raw EEG signals. These extracted features are applied to different classifiers to evaluate algorithm performance. They obtained 99.50% classification accuracy with the functional tree (FT) classifier for S-O classes classification.

The TQWT is proposed in Sharma and Pachori [68] and Bhattacharyya et al. [14] for the seizure EEG signal classification. Sharma and Pachori [68] computed the signal fractal dimension as a feature on each sub-band. These features are subjected to the LS-SVM classifier for the seizure classification. They achieved 100% classification accuracy for the individual and combine normal classes with seizure class (S-O, S-Z, and S-ZO), and 99.67% classification accuracy for S-NF while 99.60% for the binary class (S-FNZO).

Bhattacharyya et al. [14] measured the multiscale KNN entropy as features for seizure classification on the TQWT decomposed sub-bands. The multiscale entropy measures the complexity of multivariate EEG signals over different scales. These mixed labeled features are classified by the SVM classifier and obtained 98.60% to 100% classification for different seizure classes. The Burg AR coefficients are used for the EEG signal classification [83]. They reported 99.56% classification accuracy. Bhati et al. [11] proposed a TF localized three-band wavelet filter-bank technique for the EEG signal classification. The proposed technique is implemented by using the semidefinite relaxation and nonlinear least square method. They obtained classification accuracy of the order of 99.66% for seizure and seizure-free (S-NF) classes with the MLP neural network (MLPNN) classifier.

Subasi [77] used the DWT for EEG signal decomposition. The author computed four statistical features. Three out of four features, namely, the mean, average power, and the standard deviation, are computed from each sub-band, while the fourth feature is the ratio of the absolute mean values of the adjacent sub-bands. These four statistical features from the DWT coefficients were classified by a modular neural network called the mixture of experts (MEs) and reported classification accuracy of order 95.00%. The author extended the wavelet-based signal classification technique [78]. In this study, the different data dimension reduction techniques such as the principal component analysis (PCA), independent component analysis (ICA), and linear discriminant analysis (LDA) are used to reduce the dimension of the wavelet coefficients, and the resultant coefficients were classified by the SVM classifier. They achieved 98.85% using PCA, 99.5% using ICA, and 100% using LDA, classification accuracy with the SVM classifier. Acharya et al. [4] used the wavelet packet decomposition (WPD) algorithm for the seizure classification. The method decomposed the EEG signals into higher and lower frequency bands simultaneously, and the PCA is simulated on computed feature vectors from each sub-band. For the binary (S-NFZO) seizure classification, they reported 99.31% classification accuracy with the Gaussian mixture model classifier.

Tzallas et al. [81] used the TF analysis technique for automated EEG signal classification. The fractional energy is computed as features from the selected segments obtained with the TF-EEG signal decomposition method. The ANN classifier is used to classify the computed features, and has obtained from 97.72% to 100% classification accuracy for separating different seizure classes. Samiee at al. [65] used the DSTFT for EEG signal decomposition on the TF axis. They computed different statistical features, namely, the average, median, maximum, standard deviation, and minimum, from the rational DSTFT coefficients. They obtained 98.10% for S-NFZO and 99.30% for S-Z, the order of accuracy using the MLP classifier. The TF feature-based classification methodology is also proposed in Srinivasan et al. [75]. They achieved classification accuracy of 99.60% for S-Z classes separation with the recurrent neural network (RNN) classifier. Orhan et al. [51] measured various nonlinear parameters from the DWT coefficients that capture the subtle information from the EEG signals. They reported 96.67% to 100% accuracy with the MLPNN classifier for separating the different classes of seizures. The authors recorded 99.20% to 99.50% classification accuracy with the features extracted from the genetic programming, while these features are subjected to the KNN classifier to separating the seizures [27]. The ApEn used to compute the complexity of the nonlinear signal. Ocak et al. [50] computed the ApEn from the DWT coefficients for the S-ZNF seizure classification. They achieved 96.65% classification accuracy with the optimal threshold value-based classification algorithm.

The EMD method-based seizure classification algorithms are listed in the literature. Sharma et al. [67] used the EMD algorithm that utilized the Hilbert-Huang transform for signal decomposition into IMFs for complexity computation at high scales. They measured 95% confidence area of 2D and 3D PSR of each IMF. These computed areas are used as features to register 98.67% classification accuracy for binary (S-NF) seizure classification with the LS-SVM classifier. Fu et al. [20] registered 99.13% binary (S-Z) classification accuracy with the EMD algorithm and the SVM classifier.

6 Conclusion

As epilepsy occurs, due to episodic nature of the seizures, the detention and prediction of the seizures with visualization of the EEG signals by Neuro-experts are a very tedious and time-consuming process. The EEG signals are practically very lengthy. Hence, the prime challenge is to develop signal processing algorithms to explore the underlying behavior of the nonstationary and nonlinear EEG signals. Many nonlinear classification methods have been proposed that are able to capture the significant underlying information about the dynamics of the brain’s electrical activities. In this study, an elegant and convenient nonlinear HOS algorithm is proposed to extract relevant information from the EEG signals. The various statistical features are measured from the bispectrum of the EEG signals. These extracted measurements are input to the LSDA for discrimination of features according to the class label. The SVM with the different kernels (to maximize the probability of separation) are used for signal classification. The proposed method has achieved an excellent classification accuracy to classify seizures in various categories. The achieved results show the effectiveness of the proposed algorithm.