1 Introduction

Indeed, machine learning plays a crucial role in a variety of fields and has been widely applied. Ones of the key research fields are biology and medicine. Computational techniques and tools derived from machine learning concepts and theories have been applied to obtain better diagnosis and prognosis of diseases, treatments, and also health monitoring systems. Epilepsy, one of the most common neurological disorders, has been a challenging subject and gained a great attention from researchers. Epilepsy is characterized by recurrent seizures that are physical reactions to sudden, usually brief, excessive electrical discharges in clusters of nerve cells [21]. Approximately 50 million people worldwide have epilepsy, and most of the people with epilepsy live in low- and middle-income countries [21].

An electroencephalogram (EEG) that is recorded using electrodes placed on the scalp is the most common diagnosis test for epilepsy [13]. The EEG that quantifies electrical activity of the brain provides ability to detect abnormalities in the brain. Epileptic seizure classification and detection are a crucial task of epilepsy diagnosis where specific features and patterns of the EEG such as monomorphic waveforms, polymorphic waveforms, spike and sharp wave complexes, or periods of reduced electrocerebral activity [8, 17, 20] are needed to be identified and detected. The scalp EEG is very sensitive to signal attenuation and artifacts, and also has poor spatial resolution. An intracranial EEG or an electrocorticogram (ECoG) is an alternative approach to detect the electrical activity of the brain by placing electrodes on the cortex. The intracranial EEG therefore provides a better characteristic of brain activity; however, it is a more complicated and expensive diagnostic test. Previously most of studies on epileptic seizure classification and detection examine intracranial EEG data.

A number of quantitative features extracting from either scalp EEG or intracranial EEG including time-domain features, frequency-domain or transform-domain features, and also nonlinear features have been applied and shown to be useful for epileptic seizure classification and detection. Several of those quantitative features have been applied to both scalp and intracranial EEGs for epileptic seizure classification and detection. Some common time-domain features [4, 9, 14, 18] are Hjorth parameters, zero crossing, root mean square (RMS), line length, number of local maxima and minima, and various statistical values such as variance. Average power, total power, powers of spectral subbands, peak frequency, mean frequency, and median frequency are common frequency-domain features that are typically obtained from the discrete Fourier transform, power spectral density (PSD), and the discrete wavelet transform [9, 12, 15]. In particular, the time-domain features of single-channel EEGs that provide the best epileptic seizure classification [9] with respect to the area under receiver operating characteristic (ROC) curve are line length, nonlinear energy, variance, power, and maximum. Recently, several nonlinear quantitative measures [6, 7, 14] such as Shannon entropy, approximate entropy, fractal dimension, maximum Lyapunov exponent, and spectral exponent are common nonlinear features applied for epileptic seizure classification and detection.

Various computational techniques and classifiers have been used to classify EEG into corresponding physiological and pathological states associated with epilepsy such as ictal state (EEG associated with an epileptic seizure event), interictal, pre-ictal, and post-ictal states. Machine learning methods are a popular choice recently applied for classifying a set of multiple quantitative features of EEG. Support vector machine (SVM) classifiers are applied to scalp EEG data [12, 1618]. However, in [4], a linear discriminant classifier is applied for neonatal seizure detection. Evolutionary neural networks are also applied for epileptic seizure classification [7]. Epileptic seizure classification and detection can be performed using either patient-independent- or patient-dependent-based algorithms. A wide range of success on epileptic seizure classification and detection has been reported. In general, patient-dependent-based algorithms provide better performance on epileptic seizure classification. Also, the performance on epileptic seizure classification and detection depends on seizure morphologies [12].

In particular, for epileptic seizure classification and detection using scalp EEGs, sixty-five quantitative features of multichannel scalp EEGs were examined and utilized in Ref. [9]. All sixty-five quantitative features including features derived from time-domain analyses, the discrete wavelet transform, the continuous wavelet transform, and discrete Fourier transform have been previously applied in various studies. It was shown that there is a performance trade-off between the sensitivity and the specificity. The best performance on seizure classification is achieved using the relative power obtained from the discrete wavelet transform corresponding to a 12.5–25 Hz subband with the sensitivity of 71.32% and the specificity of 79.67% when the optimal threshold is used [9]. On the other hand, averaged and integrated powers of multichannel bipolar EEGs focusing on the 2.5–12 Hz subband were applied for epileptic seizure detection in Ref. [5]. The sensitivity of epileptic seizure detection was 87.3% for subjects with temporal lobe epilepsy (TLE) and extra-temporal lobe epilepsy (ETLE).

The energies determined from the coefficients of discrete wavelet transform corresponding to four subbands of multiple-channel scalp EEGs are used for patient-dependent epileptic seizure detection in Ref. [17]. Overall, 131 of 139 epileptic seizure events are detected for the patient-dependent epileptic seizure detection while there are 15 false detections [17]. In Ref. [12], seven quantitative features of 19-channel scalp EEGs including subband powers obtained from the continuous wavelet transform and various time-domain features and an SVM classifier were applied for epileptic seizure detection where the prior knowledge on diversity of seizure morphologies was taken into account. The high correct detection rate (between 85 and 100%) and low false alarm rates (between 0.2 and 0.4 per hour) were achieved [12]. A large set of quantitative features of scalp EEGs were applied in Ref. [7] for patient-dependent epileptic seizure classification. The main features examined belonged to morphological-based features, time-domain features, frequency-domain, features, and a nonlinear feature. The average sensitivity and specificity obtained were 89.01 and 94.71%, respectively.

In this study, performances on epileptic seizure classification using wavelet-based features are examined by, in particular, focusing on patient-dependent classification. The number of channels of scalp EEGs is minimized which yields a small number of wavelet-based features applied to epileptic seizure classification. The wavelet-based features denoted by \(\lambda _l\) are given by the logarithm to base 2 of variance of detail and approximation coefficients of scalp EEGs. Such wavelet-based features are computationally relevant to the power spectral density (PSD) [1]. The wavelet-based approach, however, allows an unbiased estimate [1]. SVM is used as a binary classifier to discriminate scalp EEG epochs associated with epileptic seizure event from scalp EEG epochs associated with pre-ictal and post-ictal states, i.e., non-seizure period. The patient-dependent epileptic seizure classification using wavelet-based features is evaluated using two schemes. First, the wavelet-based features of scalp EEG epochs corresponding to the first epileptic seizure event are used as a training data. This approach complies with the actual application when it is applied to real-time or online epileptic seizure classification and detection. Furthermore, k-fold cross-validation is applied to the patient-dependent epileptic seizure classification using wavelet-based features in the second scheme. The performance on patient-independent epileptic seizure classification is also evaluated using k-fold cross-validation.

2 Materials and methods

2.1 Data and subjects

Scalp EEG data of subjects with intractable seizures examined in this study were obtained from the CHB-MIT Scalp EEG Database (available online at http://www.physionet.org/pn6/chbmit/). The database was collected at the Children’s Hospital Boston [3]. All subjects were monitored for up to several days following withdrawal of anti-seizure medication in order to characterize their seizures and assess their candidacy for surgical intervention [3]. All protected health information (PHI) in the original recordings was replaced with surrogate information in order to protect the privacy of the subjects [3]. The scalp EEG recordings were acquired using a sampling rate of 256 Hz with 16 bit resolution [3]. The international 10–20 system of EEG electrode positions and nomenclature was used for the recordings [3].

There are 24 cases of scalp EEG recordings, referred to as chb01, chb02, chb03, and so on. The first 23 cases, excluding the chb24 case, were recorded from 22 subjects (5 males, ages 3–22 years old, and 17 females, ages 1.5–19 years old) [3]. The chb01 and chb21 cases were obtained from the same subject. There are a total of 198 epileptic seizure events. Further details on scalp EEG data and cases can be obtained at http://www.physionet.org/pn6/chbmit/.

2.2 Wavelet-based features of scalp EEGs

The discrete wavelet transform is a representation of a signal using a countably infinite set of wavelets that constitutes an orthonormal basis [11]. The wavelet transform can be interpreted as a generalized filter bank [22] as the so-called mother wavelet is typically associated with a bandpass filter. Also, the wavelet transform can be interpreted in the context of multiresolution analysis (MRA) [10]. The multiresolution analysis generally consists of a sequence of successive approximation spaces [19]. Furthermore, the multiresolution analysis leads to a hierarchical scheme for the computation of the wavelet coefficients of a given function [19].

A signal x[n] is decomposed into approximations and details using the scaling and wavelet functions that, respectively, correspond to lowpass halfband filter and highpass halfband filter. This can be expressed as

$$\begin{aligned} x[n]&= \sum _k a_0[k]\phi _{0,k}[n] \end{aligned}$$
(1)
$$\begin{aligned}&= \sum _k a_1[k]\phi _{1,k}[n]+\sum _kd_1[k]\psi _{1,k}[n] \end{aligned}$$
(2)

where the scaling function \(\phi _{1,k}[n]\) and the wavelet function \(\psi _{1,k}[n]\) are, respectively, an orthonormal basis for the space \(V_1\) and the orthogonal complement of \(V_1\), denoted by \(W_1\), and the space \(V_0=V_1\oplus W_1\). The approximation coefficients \(a_1[n]\) and the detail coefficients \(d_1[n]\) can be obtained by

$$\begin{aligned} a_1[n]&= \sum _k a_0[k]h[k-2n] \end{aligned}$$
(3)
$$\begin{aligned} d_1[n]&= \sum _k a_0[k]g[k-2n] \end{aligned}$$
(4)

where h[n] and g[n] are, respectively, the impulse response of lowpass halfband filter and highpass halfband filter.

For a single-level discrete wavelet decomposition at level l, the approximation coefficients \(a_l[n]\) can be obtained by convolving the approximation coefficients \(a_{l-1}[n]\) with the time-reversed filter of h[n], i.e., \(\tilde{h}[n]\), followed by the downsampling and, similarly, the detail coefficients \(d_l[n]\) can be obtained by convolving the approximation coefficients \(a_{l-1}[n]\) with the time-reversed filter of g[n], i.e., \(\tilde{g}[n]\), followed by the downsampling.

From the L-level discrete wavelet decomposition, there are L detail coefficients, i.e., \(d_1\), \(d_2\), \(\ldots\), \(d_L\), and one approximation coefficients, i.e., \(a_L\), obtained. Wavelet-based features proposed for epileptic seizure classification in this study are determined by taking the logarithm to base 2 of variance of detail coefficients and approximation coefficients. The wavelet-based features obtained from the detail coefficients \(d_l\) are denoted by \(\lambda _l\), and the wavelet-based feature obtained from the approximation coefficients \(a_L\) is denoted by \(\lambda _{L+1}\). Therefore, the wavelet-based features are given by

$$\begin{aligned} \lambda _l&= \log _2\left( \mathrm {var}\left( d_l\right) \right) , \quad \text {where }l=1,2,\ldots ,L\text { and } \end{aligned}$$
(5)
$$\begin{aligned} \lambda _{L+1}&= \log _2\left( \mathrm {var}\left( a_L\right) \right) \end{aligned}$$
(6)

2.3 Data analysis and classification

Segments of single-channel scalp EEGs around epileptic seizure events (12 min before seizure onset and 12 min after seizure offset, unless limited by the beginning, the end of recording, or the contiguous epileptic seizure events) are used in this study. The scalp EEG segments are divided into epochs with length of 512 samples (2 s) and with 50% overlap. Such short length of scalp EEG epochs is chosen to be able to capture characteristics of a brief lapse of epileptic seizure event. Scalp EEG epochs associated with epileptic seizure event are categorized as an SZ class while scalp EEG epochs associated with pre-ictal and post-ictal states are categorized as an NS class. The number of SZ and NS epochs for each case is summarized in Table 1.

Table 1 Details of scalp EEG epochs
Table 2 Statistical values (Mean\(\pm \hbox {SD}\)) of wavelet-based features of scalp EEG epochs
Table 3 Performance on patient-dependent epileptic seizure classification using 2 wavelet-based features
Table 4 Performance on patient-dependent epileptic seizure classification using 3 wavelet-based features
Table 5 Performance on patient-dependent epileptic seizure classification using 4 wavelet-based features

The second-order Daubechies wavelets are used for the discrete wavelet decomposition. The Daubechies wavelet family, one of the most commonly used wavelet families, has several nice characteristics including orthogonality and finite compact support. Higher-order Daubechies wavelets corresponds to higher regularity and also a number of vanishing moments. Scalp EEG epochs are decomposed into 7 levels that are the maximum level of wavelet decomposition using the second-order Daubechies. Accordingly, seven detail coefficients, i.e., \(d_1\), \(d_2\), \(d_3\), \(d_4\), \(d_5\), \(d_6\) and \(d_7\), and one approximation coefficients, i.e., \(a_7\), are obtained. The coefficients \(d_1\), \(d_2\), \(d_3\), \(d_4\), \(d_5\), \(d_6\), \(d_7\), and \(a_7\) correspond approximately to 64–128, 32–64, 16–32, 8–16, 4–8, 2–4, 1–2, and 0–1 Hz subbands, respectively.

A feature vector applied for epileptic seizure classification is composed of all combinations of wavelet-based features of scalp EEG epochs, ranging from two wavelet-based features, i.e., \(\left( \lambda _i,\lambda _j\right)\) where \(i\ne j\), to seven wavelet-based features, i.e., \(\left( \lambda _i,\lambda _j,...,\lambda _n,\lambda _o\right)\) where \(i\ne j\ne k \ne l\ne m\ne n\ne o\). Feature vectors of scalp EEG epochs are classified using support vector machine (SVM). The radial basis function (RBF) kernel is used to train an SVM classifier. Scalp EEG epochs of both SZ and NS classes obtained from the first seizure event of each subject are used as training data sets. The classification and the performance evaluation are performed by a case-by-case basis, i.e., patient-dependent epileptic seizure classification. Furthermore, to validate a generalized performance of wavelet-based features on both patient-dependent and patient-independent epileptic seizure classifications, tenfold cross-validations are applied using feature vectors composing of two wavelet-based features, i.e., \(\left( \lambda _i,\lambda _j\right)\). For each case, the feature vectors of scalp EEG epochs associated with SZ and NS classes are randomly divided into ten subsets. Nine subsets of feature vectors are used as a training set while another subset of feature vectors being used as a testing set. This process is repeated ten times with each of the ten subsets of feature vectors being used once as the training set. The performance of tenfold cross-validation is determined from all ten classifications.

The performance of epileptic seizure classifications is evaluated using three conventional classification performance measures: accuracy, sensitivity, and specificity. The accuracy (Ac), the sensitivity (Se), and the specificity (Sp) are given, respectively, by

$$\begin{aligned} \mathrm{Ac}&= \frac{\mathrm {TP}+\mathrm {TN}}{\mathrm {TP}+\mathrm {TN}+\mathrm {FP}+\mathrm {FN}} \\ \mathrm{Se}&= \frac{\mathrm {TP}}{\mathrm {TP}+\mathrm {FN}}, \text { and} \\ \mathrm{Sp}&= \frac{\mathrm {TN}}{\mathrm {TN}+\mathrm {FP}} \end{aligned}$$

where \(\mathrm {TP}\), \(\mathrm {TN}\), \(\mathrm {FP}\), and \(\mathrm {FN}\) denote a number of true positives, a number of true negatives, a number of false positives, and a number of false negatives, respectively. All channels of bipolar scalp EEG data are analyzed and examined in this study. Nevertheless, only results that are obtained from the channel providing the best performance on epileptic seizure classification for each case with respect to the product of sensitivity and specificity are presented.

In addition, the performance of patient-dependent epileptic seizure classification using the wavelet-based features is compared to that using the best five time-domain features [9], i.e., line length, nonlinear energy, variance, power, and maximum, in terms of the area under ROC curve (AUC). Also, the accuracy, the sensitivity and the specificity obtained from patient-dependent epileptic seizure classification using SVM are compared. The line length \(f_1\), nonlinear energy \(f_2\), variance \(f_3\), power \(f_4\), and maximum \(f_5\) are, respectively, defined as follows [9]:

$$\begin{aligned} f_1&= \sum _{n=1}^{N-1} \big |x[n-1]-x[n]\big | \end{aligned}$$
(7)
$$\begin{aligned} f_2&= \frac{1}{N-2}\sum _{n=1}^{N-2} x^2[n]-x[n-1]x[n+1] \end{aligned}$$
(8)
$$\begin{aligned} f_3&= \frac{1}{N}\sum _{n=0}^{N-1} \left( x[n]-\bar{x}\right) ^2 \end{aligned}$$
(9)
$$\begin{aligned} f_4&= \frac{1}{N}\sum _{n=0}^{N-1} x^2[n] \end{aligned}$$
(10)
$$\begin{aligned} f_5&= \max (x[n]) \end{aligned}$$
(11)

where N denotes the length of EEG signal x[n] and \(\bar{x}\) denotes the mean of x[n].

3 Results

3.1 Characteristics of wavelet-based features

Means and standard deviations of all wavelet-based features, i.e., \(\lambda _1\), \(\lambda _2\), \(\lambda _3\), \(\lambda _4\), \(\lambda _5\), \(\lambda _6\), \(\lambda _7\), and \(\lambda _8\), of both SZ and NS epochs for each case are summarized in Table 2. The characteristics of wavelet-based features vary corresponding to cases and also subbands. In general, wavelet-based features \(\lambda _i\) of SZ epochs tend to be higher than those of NS epochs. From the results of two-sample t-tests (p-value of 0.0001), it is suggested that for all cases there is at least one wavelet-based feature that associates with the significant difference between the means of corresponding wavelet-based features of both SZ and NS epochs. The means of any wavelet-based feature, i.e., \(\lambda _1\), \(\lambda _2\), \(\lambda _3\), \(\lambda _4\), \(\lambda _5\), \(\lambda _6\), \(\lambda _7\), and \(\lambda _8\), of SZ epochs are significantly different from those of NS epochs in 13 cases (chb01, chb03, chb05, chb07, chb08, chb09, chb11, chb17, chb18, chb21, chb22, chb23, and chb24). On the contrary, for the case chb06, there is a significant difference between the means of only wavelet-based feature \(\lambda _4\) of both SZ and NS epochs.

Table 6 Performance on patient-dependent epileptic seizure classification using 5 wavelet-based features
Table 7 Performance on patient-dependent epileptic seizure classification using 6 wavelet-based features
Table 8 Performance on patient-dependent epileptic seizure classification using 7 wavelet-based features
Table 9 Best performance on patient-dependent epileptic seizure classification

All wavelet-based features of all SZ and NS epochs obtained from all subjects are compared in box plots shown in Fig. 1. This obviously shows the tendency of higher values of wavelet-based features of SZ epochs compared to those of NS epochs. In addition, Figs. 2 and 3, respectively, compare all wavelet-based features of all SZ and NS epochs obtained from the case chb09 posing the best performance on epileptic seizure classification and the case chb16 posing the worst performance on epileptic seizure classification. Figure 2 shows that wavelet-based features of SZ epochs are substantially higher than those of NS epochs in the case chb09. On the other hand, even though wavelet-based features of SZ epochs tend to be higher than those of NS epochs in the case chb16, wavelet-based features of SZ epochs are in the spans of wavelet-based features of NS epochs.

Table 10 Performance on patient-dependent epileptic seizure classification using tenfold cross-validation
Table 11 Performance on patient-independent epileptic seizure classification using tenfold cross-validation

3.2 Performance of patient-dependent epileptic seizure classification

The performances on patient-dependent epileptic seizure classification using the feature vectors composing of 2, 3, 4, 5, 6, and 7 wavelet-based features of scalp EEG epochs corresponding to the first epileptic seizure event as the training data set are shown in Tables 3, 4, 5, 6, 7, and 8, respectively. The EEG channels and the wavelet-based features that provide the best performance on corresponding epileptic seizure classification are individually reported for each case. Remark that the EEG channel slightly changes from case to case. The performance on epileptic seizure classification tends to decrease as the number of wavelet-based features used as the feature vector increases. The wavelet-based features providing the best performance on epileptic seizure classification also vary from case to case.

Table 12 Area under ROC curve of patient-dependent epileptic seizure classification
Table 13 Best performance on patient-dependent epileptic seizure classification using time-domain features

The best overall performance on patient-dependent epileptic seizure classification is obtained using the feature vector composing of 2 wavelet-based features. There are 19 cases that the feature vector composing of 2 wavelet-based features provides the best performance on epileptic seizure classification. The best performance on patient-dependent epileptic seizure classification is obtained using the feature vector composing of 3 wavelet-based features for the cases chb10, chb16, chb17, and chb21. The feature vector composing of 4 wavelet-based features is required to obtain the best performance on patient-dependent epileptic seizure classification for the case chb12. The best performances on patient-dependent epileptic seizure classification for each case are summarized in Table 9.

The best performance on patient-dependent epileptic seizure classification is obtained at the case chb09 which corresponds to the accuracy of 0.9978, the sensitivity of 0.9806, and the specificity of 0.9987 using the wavelet-based features \(\lambda _2\) and \(\lambda _4\) as the feature vector. The worst performance on patient-dependent epileptic seizure classification is obtained at the case chb16 which corresponds to the accuracy of 0.9937, the sensitivity of 0.1930, and the specificity of 0.9983 using the wavelet-based features \(\lambda _1\), \(\lambda _3\) and \(\lambda _4\) as the feature vector. The distributions of wavelet-based features of both SZ and NS epochs corresponding to each epileptic seizure event of cases chb09 and chb16 are compared in box plots shown in Figs. 4 and 5, respectively. These obviously give a justification for the corresponding performances. The wavelet-based features of SZ and NS epochs in the case chb09 slightly vary from one epileptic seizure event to another for all four epileptic seizure events. There are, however, considerable variations on wavelet-based features of SZ and NS epochs within ten epileptic seizure events for the case chb16. Furthermore, in the case of chb16 the range of wavelet-based features of NS epochs is over that of wavelet-based features of SZ epochs. This makes such epileptic seizure classification unfeasible.

Fig. 1
figure 1

Comparison between the wavelet-based features of scalp EEG epochs associated with non-seizure periods and those associated with epileptic seizure events of all subjects

Fig. 2
figure 2

Comparison between the wavelet-based features of scalp EEG epochs associated with non-seizure periods and those associated with epileptic seizure events of case chb09

Fig. 3
figure 3

Comparison between the wavelet-based features of scalp EEG epochs associated with non-seizure periods and those associated with epileptic seizure events of case chb16

3.3 Performance evaluation using tenfold cross-validation

Using the same corresponding EEG channels and wavelet-based features providing the best performance reported in Table 3, the results of tenfold cross-validation on patient-dependent epileptic seizure classification are summarized in Table 10. In general, the accuracy and the specificity of patient-dependent epileptic seizure classification are remarkably high. Both accuracy and specificity are higher than 0.90 for all cases. The highest and lowest accuracies are, respectively, 0.9959 and 0.9281 while the highest and lowest specificities are, respectively, 0.9997 and 0.9910. On the contrary, the wide range of sensitivity is obtained. The best and worst sensitivities are 0.9295 and 0.0174, respectively. The performance on patient-dependent epileptic seizure classification evaluated using tenfold cross-validation is in general lower than those using the wavelet-based features of scalp EEG epochs corresponding to the first epileptic seizure event as the training data set. The better performance on patient-dependent epileptic seizure classification evaluated using tenfold cross-validation is achieved in the cases chb04, chb10, chb11, chb14, and chb22.

Fig. 4
figure 4

Comparison of the wavelet-based features of SZ and NS epochs of case chb09 corresponding to each epileptic seizure event

Fig. 5
figure 5

Comparison of the wavelet-based features of SZ and NS epochs of case chb16 corresponding to each epileptic seizure event

The results of tenfold cross-validation on the patient-independent epileptic seizure classification corresponding to each pair of wavelet-based features are summarized in Table 11. The same EEG channels providing the best performance for all cases reported in Table 3 are used. The highest product of sensitivity and specificity is 0.2004 that is obtained from the patient-independent epileptic seizure classification using the wavelet-based features \(\lambda _1\) and \(\lambda _5\) as the feature vector. The corresponding accuracy, sensitivity, and specificity are 0.9579, 0.2011, and 0.9965, respectively. On the other hand, the worst performance on the patient-independent epileptic seizure classification is obtained using the wavelet-based features \(\lambda _7\) and \(\lambda _8\) as the feature vector with the accuracy, sensitivity, and specificity of 0.9518, 0.0259, and 0.9990, respectively.

3.4 Comparison of performance on patient-dependent epileptic seizure classification

The areas under ROC curve obtained from using the wavelet-based features and the time-domain features are compared in Table 12. The maximum area under ROC curve for each case is written in bold. In general, the wavelet-based features provide better performance on epileptic seizure classification compared to the time-domain features. The wavelet-based features, i.e., \(\lambda _3\), \(\lambda _4\), \(\lambda _5\), and \(\lambda _6\), provide the best classification performance regarding to the maximum area under ROC curve in 18 cases while the time-domain features, i.e., \(f_2\), \(f_4\), and \(f_5\) provide the best classification in the other 6 cases. The wavelet-based features \(\lambda _4\) and \(\lambda _5\) are the wavelet-based features providing the best classification performance among all eight wavelet-based features. The nonlinear energy, i.e., \(f_2\), is the time-domain feature providing the best classification performance among all five time-domain features.

Table 13 shows the best performance on patient-dependent epileptic seizure classification using the time-domain features and SVM. The best performance on patient-dependent epileptic seizure classification can be achieved using the feature vector composing of 2, 3, and 4 time-domain features for 13, 7, and 4 cases, respectively. The best performance is obtained at the case chb09 using the line length, the nonlinear energy, the variance, and the maximum as the feature vector while the worst performance is obtained at the case chb16 using the line length and the variance as the feature vector. In general, the wavelet-based features provide the better performance for epileptic seizure classification than the time-domain features. The better performance on patient-dependent epileptic seizure classification can be obtained using the wavelet-based features in 19 cases.

4 Discussion

From the computational results, it is shown that, in general, the wavelet-based features of scalp EEG epochs associated with epileptic seizure event and non-seizure period, i.e., pre-ictal and post-ictal states, are considerably different from each other corresponding to the same case. Furthermore, this suggests that at any spectral subband the power of scalp EEG epochs associated with epileptic seizure event is higher than those of scalp EEG epochs associated with non-seizure period. Such differences between the wavelet-based features lead to an excellent epileptic seizure classification. The average accuracy, sensitivity, and specificity of patient-dependent epileptic seizure classification are, respectively, 0.9680, 0.7200, and 0.9811. This performance is comparable with the computational results of the automated patient-dependent epileptic seizure classification [7] where the corresponding average sensitivity and specificity are 89.01 and 94.71%. However, a larger number of quantitative features were applied in Ref. [7]. The best accuracy, sensitivity, and specificity of patient-dependent epileptic seizure classification are 0.9978, 0.9806, and 0.9998, respectively.

A variety of wavelet-based features of scalp EEG epochs are incorporated in the feature vectors used to obtained the best performance on patient-dependent epileptic seizure classification for corresponding cases. The number of wavelet-based features of scalp EEG epochs used to obtained the best performance on patient-dependent epileptic seizure classification also varies among cases. Two wavelet-based features of scalp EEG epochs are required to obtain the best patient-dependent epileptic seizure classification in most cases, i.e., 19 cases out of 24. The wavelet-based features \(\lambda _4\), \(\lambda _2\), and \(\lambda _5\) are the wavelet-based features of scalp EEG epochs that are mostly used as feature vectors to obtain the best performance on patient-dependent epileptic seizure classification. These wavelet-based features, i.e., \(\lambda _2\), \(\lambda _4\), and \(\lambda _5\) correspond to the 32–64, 8–16, and 4–8 Hz subbands of scalp EEG epochs, respectively. The wavelet-based feature \(\lambda _8\) which corresponds to the 0–1 Hz subband of scalp EEG epochs is not associated with the best performance on patient-dependent epileptic seizure classification of any cases.

The best performance on patient-dependent epileptic seizure classification using the wavelet-based features of scalp EEG epochs corresponding to the first epileptic seizure event as the training data set is achieved with the accuracy of 0.9978, the sensitivity of 0.9806, and the specificity of 0.9987. The accuracy, sensitivity, and specificity of patient-dependent epileptic seizure classification using the wavelet-based features of scalp EEG epochs corresponding to the first epileptic seizure event as the training data set are, respectively, 0.9937, 0.1930, and 0.9983 for the worst performance. The wavelet-based features and the time-domain features provides both best and worst performances on patient-dependent epileptic seizure classification for the same cases. Superior performance of the wavelet-based features for epileptic seizure classification suggests that the distinctive characteristics of epileptic seizure manifest in some specific spectral bands of EEG signals rather than the whole bandwidth of EEG signals.

In the case with best performance on epileptic seizure classification, the consistent characteristics of scalp EEG epochs associated with epileptic seizure event and non-seizure periods are observed as their corresponding wavelet-based features are particularly steady among various epileptic seizure events. Furthermore, the durations of epileptic seizure events are sufficiently long. The average duration of 4 epileptic seizure events is 69 s. On the contrary, the characteristics of scalp EEG epochs associated with epileptic seizure event and non-seizure periods relatively vary from one epileptic seizure event to another in the case of worst performance on epileptic seizure classification. The durations of epileptic seizure events are also exceptionally brief. The longest and shortest durations of epileptic seizure events are 14 and 6 s, respectively. This leads to a very small number of scalp EEG epochs associated with epileptic seizure event applied to the classification.

When the tenfold cross-validation is applied, the performance on patient-dependent epileptic seizure classification tends to decrease. In most cases, i.e., 19 cases out of 24, the performance on patient-dependent epileptic seizure classification using tenfold cross-validations is lower than those using the wavelet-based features of scalp EEG epochs corresponding to the first epileptic seizure event. This suggests that the wavelet-based features of scalp EEG epochs obtained from a single event, i.e., around the first epileptic seizure event, may not well represent the whole classes due to the variation of characteristics of scalp EEG epochs associated with epileptic seizure event and non-seizure period from one epileptic seizure event to another. The performance on patient-independent epileptic seizure classification is relatively low compared to the performance on patient-dependent epileptic seizure classification. The variation of characteristics of scalp EEG epochs associated with epileptic seizure event and non-seizure period among cases or subjects is the primary factor of decreasing classification performance. The best performance on patient-independent is achieved using the wavelet-based features \(\lambda _1\) and \(\lambda _5\) which correspond to the 64–128 and 4–8 Hz subbands of scalp EEGs. The 64–128 Hz subband of EEGs is associated with high-frequency oscillations, i.e., gamma (30–80 Hz) and ripple (80–250 Hz) oscillations [23]. There is evidence that high-frequency oscillations, in particular, gamma and ripple oscillations, in scalp EEGs are clinically correlated with the seizure onset zone [2, 23].

5 Conclusions

The computational results suggest that the wavelet-based features obtained from the logarithm of variance of detail and approximation coefficients are a promising quantitative feature for epileptic seizure classification. Accompanied with SVM, an excellent performance on patient-dependent epileptic seizure classification can be achieved using wavelet-based features of a single channel of scalp EEG. Three key factors having an effect on the performance of epileptic seizure classification are consistency of features/patterns of epileptic seizure activity (epileptiform activity), duration of epileptic seizures, and amount of training data. In cases whose characteristics of scalp EEGs associated with epileptic seizure activity and non-seizure period are consistent, only two wavelet-based features of a single channel of scalp EEGs are required to achieve an excellent epileptic seizure classification. The performance on epileptic seizure classification can, however, be further improved by refining relevant parameters. This will be studied in future works. Also, the epileptic seizure classification can be potentially applied for real-time (online) epileptic seizure detection and monitoring system.