Introduction

Electroencephalogram shortly referred as EEG is used as a physiological indicator to assess the continuous neuronal activities of the brain. Of specific, epileptic seizures are considered as a chronic neurological disorder that occurs due to abnormal excessive or synchronous neuronal activity in the brain of patients suffers from epilepsy (Fisher et al. 2005; Gao et al. 2011; Pippa et al. 2016). Continuous monitoring of EEG is essential for the recognition/detection of seizures and such a procedure is tedious for the neurologist through qualitative visual inspection. Designing an automated system to detect epilepsy on time would be helpful and time-saving for the neurologist. Lately, numerous mechanized frameworks have been proposed for the forecast and recognition of epilepsy with distinctive time and frequency domain features, non-linear features with pattern classifiers (Gotman and Deng 1991; Acharya et al. 2012a, b, c; Bajaj and Pachori 2013; Wang et al. 2013; Venkataraman et al. 2014; Samiee et al. 2015; Faust et al. 2015; Du et al. 2016).

It is well known that the computerized seizure detection merely relies on the selection of appropriate EEG feature. Time domain, frequency domain, time–frequency, non-linear domain features were applied for deriving these feature from EEG recordings. Specifically, entropy based related studies have shown significant specificity and sensitivity quantitative measures for the recognition of epileptic seizures. Any EEG-based quantitative study towards epileptic seizure detection requires the appropriate application of filtering, segmentation, features selection and pattern classifier techniques. Although several attempts have been made by make use of entropy as a feature pattern to detect epileptic seizures from EEG recordings (Aydin et al. 2009; Pravin et al. 2010; Guo et al. 2010; Wang et al. 2013), there are still unsolved issues related to computational complexity and overall performance. The appropriate selection of feature and pattern classifier by considering its relative performance will confirm its suitability for the real time seizure detection. This specific research study makes use of wavelet packet based log and norm entropies along with recurrent Elman neural network (REN) for the classification of normal versus pre-ictal, normal versus epileptic and pre-ictal versus epileptic. The proposed work makes use of the University of Bonn, Germany, database (Andrzejak 2001) and three different entropies, log energy entropy, wavelet packet log energy and norm entropies were applied to extract entropy features from EEG recordings. The time–frequency characteristic of the EEGs and its non-linear nature is well exploited by making use of wavelet packet based on Haar mother wavelet with 5th level decomposition and entropy measure respectively. It has been reported that the physiological signals with disorders (abnormality) tend to yield high entropy values (Natwong et al. 2006; Wang et al. 2011; Das and Bhuiyan 2016). The features considered in our work were based on energy present in the signal which relatively increases the magnitude of the entropy value and shows a distinguishable difference between normal, pre-ictal and pure epileptic EEG. Further relative performance in terms of classification accuracy and computational complexity confirms the suitability of this proposed scheme for real-time seizure detection compared to the other techniques reported in the literature (Srinivasan et al. 2007; Pravin et al. 2010; Wang et al. 2011; Acharya et al. 2012a; Xiang et al. 2015).

Figure 1 shows the proposed approach schematic diagram.

Fig. 1
figure 1

Block diagram of the proposed epileptic detection system

During the visual inspection by the local specialist, the presence of line noise in the Bonn database was observed and same was evident through the periodogram and spectrogram analysis. An adaptive least mean square (LMS) algorithm was applied to remove the power line noise from the raw EEGs and then wavelet packet based entropies were estimated. The best basis was confirmed by the statistical analysis and the identified features were then applied as the input to REN classifier to perform the binary classification: normal-pre-ictal, normal-epileptic and pre-ictal-epileptic. The performance of the WPE- REN was compared with log energy entropy feature with REN classifier.

Related literature review

In early 1980’s, Gotman initiated an automated procedure for long duration EEG recordings (Gotman 1982). In 2007, Srinivasan et al. showed an attainment of 100 % classification accuracy by making use of approximate entropy (ApEn) with a REN classifier. A study proposed by Pravin et al. (2010) making use of sample, spectral and wavelet entropies with REN network showed a classification accuracy of 94.5 %. Zhou et al. (2013) have applied Lacunarity feature with Bayesian linear discriminant analysis (BLDA) classifier for epileptic seizure detection and showed a classification rate of 96.25 %. Samiee et al. (2015) introduced rational discrete short time Fourier transforms (DSTFT) with multilayer perception (MLP) neural network for classification of epileptic seizures from normal EEGs. Bajaj and Pachori (2013) have applied the Hilbert transformation of intrinsic mode functions to detect the focal temporal lobe epilepsy. Kumar et al. (2014) have investigated the discrete wavelets transform (DWT) based ApEn and multilayer back propagation neural network to discriminate normal and epileptic seizures. Koren et al. (2015) have done a detailed clinical investigation towards the recognition of non-convulsive seizures and non-convulsive status epileptics. Venkataraman et al. (2014) proposed an automated seizure detection algorithm by making use of adaptive short-term maximum Lyapunov exponent and adaptive Teager energy. Kelly et al. (2010) have made use of scalp based electrodes for the assessment of epileptic seizures. Three significant features, pattern-match regulation static, local maximum frequency and amplitude variation were used to quantify the seizures.

Pippa et al. (2016) proposed a clinical study to investigate the effect of psychogenic non epileptic seizures and vasodepressor syncope. Several time and frequency domain features with different pattern classifier were employed. The study revealed a classification accuracy of 95 % was achieved. Abualsaud et al. (2015) have proposed an EEG-based epileptic seizure framework that makes use of compressed sensing phenomenon and encoding technique for transmission of data followed by automated detection of seizure at the receiver side using linear and non-linear classifier. An attempt has been made by making use of smoothed pseudo-Wigner-Ville distribution to recognize the seizure from normal EEGs (Tzallas et al. 2007). Zeng et al. (2016) have employed compressive sensing principle for recognizing inter-ictal, pre-ictal and ictal EEG-based epileptic seizures. Sample and permutation entropies with Hurst index features were applied along with four variant classifiers. The performance results obtained were quite promising and confirm it’s suitable for tele-monitoring of epileptic patients. Best result was achieved using discriminant analysis (DA) for compressed sensing structure similarity feature (CS-SSIM). A focused study that makes use of four entropies namely ApEn, sample entropy (SampEn), phase entropy 1 (S1), and phase entropy 2 (S2) for automated detection of normal, pre-ictal and ictal conditions derived from EEGs was proposed. Seven different classifiers were employed and an overall accuracy of 98.1 % was achieved (Acharya et al. 2012a). A specific work that makes use of fuzzy entropy for epileptic seizures recognition was reported (Xiang et al. 2015). The fuzzy entropy was estimated based on the working principle of ApEn and the results were compared with the sample entropy. A grid optimization method was proposed to train the features using support vector machine (SVM) classifier. Gajic et al. (2014) have proposed classification algorithm using wavelets and statistical pattern recognition for the detection of epileptic seizures. Wang et al. (2010) have applied principal component analysis for dimension reduction to the décorrelation of epileptic and normal EEG. Then DWT combined with ApEn is performed and found distinct difference between the ApEn values of epileptic and normal EEG.

Faust et al. (2015) reported a detailed review of wavelet based technique for epileptic seizure detection. The study included wavelet decomposition followed by non-linear dynamic based entropy features. The importance of supervised and unsupervised pattern classifiers were also reported. Acharya et al. (2012c) have applied two entropies namely ApEn and SampEn along with fractal dimension and Hurst exponent as features with fuzzy classifier for the classification of normal and epileptic seizures from EEG recording. It was shown classification accuracy of 99.7 % attained using combined non-linear features. Srinivasan et al. (2007) and Pravin et al. (2010) have applied ApEn and SampEn, achieved 100 and 94.5 % classification accuracy respectively. It can be inferred that all such procedures yields good classification rate with high classification cost. Acharya et al. (2012c) make use of wavelet packet decomposition to extract Eigen values from the wavelet coefficient using principal component analysis. Significant features were selected using ANOVA test and 99 % classification accuracy was obtained using Gaussian mixture model (GMM) classifier with tenfold cross validation. Aydin et al. (2009) make use of log energy entropy with a multilayer neural network for distinguishing epileptic EEGs from normal. Our work makes use of WPT followed by log energy entropy to explore the low-frequency components during epileptic EEG signal and recorded best results compared to work done by Aydin et al.

It can be observed from the recent literature (Srinivasan et al. 2007; Pravin et al. 2010; Acharya et al. 2012a; Kumar et al. 2014) that the time–frequency domain features ensures good classification accuracy at extreme computational cost. Specifically, approximate entropy (Srinivasan et al. 2007) and sample entropy (Pravin et al. 2010) features showed a good classification accuracy but their computational cost related to feature estimation found to be very high and hence had its own limitation for real-time seizure detections. One can infer from the literature that the entropy-based features ensure good classification results compare to the other time–frequency domain features, there is a need to perform detailed research study to determine the trade-off between classification accuracy and computational complexity by making a selection of appropriate entropy-based features.

While considering the University of Bonn database for the experimental study, there is a symptom of the presence of line noise and to the best of authors knowledge, attempts have not been made for the removal of line noise as a preprocessing procedure.

To solve the above mentioned issues, the proposed approach make use of an adaptive filter, wavelet packet transform, log energy entropy, norm entropy, REN classifier and relative performance methods. The major contribution of this research work is the removal of line noise in the EEGs with appropriate recognition of the statistical variations during normal, pre-ictal and epileptic activities through wavelet based log and entropy features. Then, the relative performance in terms of classification accuracy and computational complexity ensures the suitability of the proposed method for real-time automated seizure detection.

Materials and methods

EEG dataset

For the proposed study, EEG datasets were obtained from University of Bonn, Germany. Three conditions, normal (set A), pre-ictal (set C) and epileptic seizures (set E) were considered and corresponding datasets were used. For further details, one can refer the work reported (Andrzejak 2001). Figure 2 shows the sample recording of sets A–C–E respectively. As reported earlier (Andrzejak 2001; Srinivasan et al. 2007; Kumar et al. 2014) the entire datasets were divided into 100 single channel segments with a total duration 23.6 s with a sampling rate of 173.6 Hz.

Fig. 2
figure 2

Sample recordings of a Normal EEG b Pre-ictal EEG c Epiliptic EEG

Line noise removal

It is well-known fact that the artifacts such as 50 Hz power line noise are added during EEG recordings. Such artifact leads to the wrong diagnosis during the clinical inspection by a specialist. In the proposed study, line noise was observed during the clinical interpretation by the specialist, which was also in evident through periodogram and spectrogram analysis. To eliminate the line noise, an adaptive Weiner filter was employed. (Tan and Jiang 2008). The filter removes the 50 Hz power line noise from the raw EEGs by making use of LMS algorithm (Haykin 1996). Figure 3 shows the structure used for the current study.

Fig. 3
figure 3

Structure of adaptive filter using LMS algorithm

The LMS algorithm is defined as below:

The corrupted EEG signal can be expressed as (Tan and Jiang 2008)

$$d\,[n] = s\,[n] + n\,[n]$$
(1)

An error signal is expressed as

$$e\,[n] = d\,[n] - y\,[n]$$
(2)

An error signal is approximately equal to desired line noise free EEG signal.

Output of the filter is

$$y\,[n] = w^{T} [n]\,x\,[n]$$
(3)

And filter coefficients by LMS algorithm is given by (Tan and Jiang 2008)

$$w_{n + 1} = w_{n} + 2\,\mu \,e\,[n]\,x\,[n]$$
(4)

where n =  sample index; s[n] = Actual signal (EEG signal); n[n] =  Noise signal (power line noise); d[n] = Corrupted signal (EEG + power line noise); e[n] =  Error signal (desired signal); x[n] = Reference input (50 Hz sinusoidal signal); y[n] = Adaptive filter output; μ = Convergence factor (step size); w n  = Filter coefficients.

Step size μcontrols how fast and how well the algorithm converges to the optimum filter coefficients so it needs to be selected properly to reach the best performance of the adaptive filter. If μis too larger, the algorithm will not converge, if μ is too small the algorithm converges slowly.

Wavelet packet decomposition

The wavelet packet transform (WPT) is considered as a classical derivative of wavelet transform (WT). Both satisfy the orthogonality condition. Given a non-stationary signal applying these transforms emphasizing the low pass and high pass filtering operations yields the sub-band tree decomposition to some desired level. It is well-known fact that the WT decomposes only approximation coefficients of the signal and successive detail are never considered and taking information located at higher frequency will be lost on the other hand WPT decomposes successive details and approximation, taking good and adjustable frequency resolutions at high frequencies (Adeli et al. 2003; Yang et al. 2006; Ocak 2009). It retains the key information located in higher frequency than WT for certain applications. The bottom level of WPT has better frequency resolution, whereas top level is considered as the time representation of the signal.

A typical 3-level WPT decomposition sub-band tree is depicted in Fig. 4. The two infinite filters shown correspond to wavelet, i.e. a low pass filter h(k) and high pass filter g(k) (Han et al. 2013). Using these two filters the wavelet packet function can be defined as (Mallat 1989)

$$\psi_{j + 1}^{2i - 1} (t) = \sqrt 2 \sum\limits_{k = - \infty }^{\infty } {h(k)\,\psi_{j}^{i} (2t - k)}$$
(5)
$$\psi_{j + 1}^{2i} (t) = \sqrt 2 \sum\limits_{k = - \infty }^{\infty } {g(k)\,\psi_{j}^{i} (2t - k)}$$
(6)

where ψ(t) is the mother wavelet function. The recursive relation between jth level and the (j + 1)th level for signal s(t)are (Mallat 1989)

$$s_{j + 1}^{2i - 1} (t) = \sum\limits_{k = - \infty }^{\infty } {h(k)\,s_{j}^{i} (2t - k)}$$
(7)
$$s_{j + 1}^{2i} (t) = \sum\limits_{k = - \infty }^{\infty } {g(k)\,s_{j}^{i} (2t - k)}$$
(8)
Fig. 4
figure 4

Decomposition tree for three-layer WPT

Thus, wavelet coefficients c j k at position k of level j can be expressed as (Mallat 1989)

$$c_{j}^{k} = \int\limits_{ - \infty }^{\infty } {s(t)\,\psi_{j}^{i} (t)\,dt}$$
(9)

In our study, wavelet packet transform is implemented by using Haar function as mother wavelet with 5th level decomposition from the previous study (Raghu et al. 2015).

Feature extraction

Log energy entropy

Entropy is a common in many fields, mainly in signal processing and communication. In biomedical signal processing, lengthy EEGs recording needs more time to do analysis and these lengthy data or signals can be viewed as entropy to differentiate seizures from normal EEGs. Entropy gives a measure of signal disorder or complexity and provides distinctive features about the signal (Shannon 1948). There are many types of entropy such as Shannon entropy, log energy entropy, norm entropy, threshold entropy, fuzzy entropy and approximation entropy that are employed in estimating information from signals. The two entropies, log energy and norm entropy investigated in this proposed study were not based on concept of a probability. These two entropies are estimated based on the energy present in the signal and entropy may result in higher values. The normalization of the signals is not performed in the case of wavelet entropy and this fact is well exploited in this proposed work.

EEG signal can be defined as the variation in amplitude w.r.t. time. The best way to acquire entire information from the signal is to square the signal. Squaring the EEG time series doesn’t lead to a decrement of signal energy instead, it results in higher values (Selik et al. 2001). s = (s 1s 2s 3,….., s n ), where s is a wavelet packet decomposed EEG signal with n samples, energy of the signal is calculated in following way

The log energy entropy is defined as (Coifman et al. 1990),

$$E1(s_{i} ) = \log (s_{i}^{2} )\,\,$$
(10)
$$E1(s) = \sum\limits_{i = \,1}^{n} {\log (s_{i}^{2} )}$$
(11)

E1(s) gives a log energy entropy value for a 1 s segment. The same procedure was employed for rest of the EEGs to estimate log energy entropy.

Norm entropy

The norm entropy neither uses normalization nor log function and it is based on the value ofpin the power. The norm entropy is defined as the concentration inl pnorm entropy with p ≥ 1

$$E2(s_{i} ) = \left| {s_{i} } \right|\,^{p}$$
(12)
$$E2(s) = \sum\limits_{i = 1}^{n} {\left| {s_{i} } \right|\,^{p} = \left\| s \right\|_{p}^{p} } \,$$
(13)

In this work, p value is chosen as 1.

Artificial neural network classification

Recurrent Elman neural network (REN) also referred as Elman neural network found to be a potential candidate for the pattern classification problems (Elman 1990; Srinivasan et al. 2007; Tang and Li 2011; Sriraam 2013). The typical architecture of that has been considered for the proposed study is shown in Fig. 5. The neural nodes 1-90-90-1 were considered with three additional context nodes for establishing feedback from hidden to input layers. Two hidden layers with ninety hidden neurons were found to be sufficient to train the network for attaining the convergence during the experimental study. For the input hidden layer, tan-sigmoidal activation function and hidden output layer, log sigmoidal activation function was employed. A resilient back-propagation (RP) learning algorithm with an adaptive learning rate of 0.6 was used to train the REN network. Table 1 shows the recurrent Elman neural network setting parameters. Nguyen- Windrow algorithm was employed during the initialization of hidden layer weights.

Fig. 5
figure 5

Recurrent Elman neural network architecture

Table 1 REN training parameters

The classification performance of REN classifier was evaluated by in terms of four parameters namely, specificity (SP), sensitivity1 (SE1), sensitivity2 (SE2) and classification accuracy (CA) (Srinivasan et al. 2007).

$$SP (\% ) = \frac{Total\,number\,of\,correctly\,detected\,normal\,patterns}{Total\,number\,of\,actuall\,normal\,patterns}\,\,\, \times \,100$$
(14)
$$SE1 (\% ) = \frac{Total\,number\,of\,correctly\,detected\,pre{-}ictal\,patterns}{Total\,number\,of\,actuall\,pre{-}ictal\,patterns}\,\,\, \times \,100$$
(15)
$$SE2 (\% ) = \frac{Total\,number\,of\,correctly\,detected\,epileptic\,patterns}{Total\,number\,of\,actuall\,epileptic\,patterns}\,\, \times \,100$$
(16)
$$CA (\% ) = \frac{Total\,number\,of\,correctly\,detected\,\,patterns}{Total\,number\,of\,applied\,patterns}\,\,\, \times \,100$$
(17)

Three cases were considered for classification purpose.

  • Case A: Normal-Pre-ictal

  • Case B: Normal-Epileptic

  • Case C: Pre-ictal-Epileptic

Performance evaluation

For the experimental study, set A, set C and set E reflecting normal eye open, pre-ictal and epileptic seizure EEG recordings were considered from University of Bonn database (Andrzejak 2001). An adaptive filter reported earlier was applied to remove line noise from raw EEGs. The convergence factor μ = 0.005 and order N = 3 was selected for adaptive filter implementation. The estimation of periodogram clearly reveals the presence and removal of noise as depicted in Fig. 6a–d respectively.

Fig. 6
figure 6

Power spectral density of a Normal EEG with line noise b Normal EEG without line noise c Epileptic EEG with line noise d Epileptic EEG without line noise

The spectrogram analysis also ensures the presence of line noise as shown in Fig. 7a–c. The spectrogram shows the time–frequency components of the EEG signal using short-term Fourier transform.

Fig. 7
figure 7

Spectrogram of a Normal EEG b Pre-ictal EEG c Epileptic EEG

In order to preserve the stationarity for non-linear EEGs, a 1 s windowing was introduced to the filtered recordings. For each 1 s segmented time series (say, EEG pattern) three entropies namely, log energy entropy (LogEn), wavelet packet log energy entropy (WPLogEn) and wavelet packet norm entropy (WPNormEn) were estimated. In this proposed study, Haar wavelet with 5th level decomposition has been applied to decomposed EEG (Raghu et al. 2015). Figure 8a–c show the pattern distribution for the different conditions using three entropy features.

Fig. 8
figure 8

Entropy values for normal, pre-ictal and epileptic EEGs using a Log En features b WPLogEn features c WPNormEn features

It can be observed from Fig. 8a–c that the entropies provide significance variation under the three conditions. LogEn values are higher in epileptic states than in normal and pre-ictal states. But these values are in same range for normal and pre-ictal states, it would be difficult to achieve good classification accuracy (refer Fig. 8a). A good discrimination was obtained by applying wavelet decomposition on EEGs before estimating entropy values. The influence of wavelet packet decomposition on EEG signals can be seen in Fig. 8b. Among the three features, WPLogEn feature shows a clear band difference compared to other features.

To recognize the importance of selected feature for its significance for the pattern classification, statistical analysis using ANOVA and Wilcoxon model was employed. Since EEG signal is a non-stationary signal, the Wilcoxon non-parametric test was applied for the analysis. Further Box plot analysis demonstrates the median value with interquartile distances with skewed distribution. The typical box plot has lines at the upper median and lower quartile values. Further Boxplot analysis (refer Fig. 9) confirms the suitability of the feature for pattern classification. The significance of wavelet packet decomposition on EEGs for WPLogEn gives good discrimination between normal and pre-ictal (refer Fig. 9a, b).

Fig. 9
figure 9

Box plot for normal, pre-ictal and epileptic EEGs using a LogEn features b WPLogEn features c WPNormEn features

The interesting observation made from Table 2 is that the mean value for pre-ictal EEGs is lower than normal EEGs for all three types of entropies. The WPLogEn is lower than LogEn due to the influence of wavelet packet decomposition on EEGs. Standard deviation is an increasing quantity for LogEn and WPLogEn from normal to pre-ictal to epileptic EEGs, whereas standard deviation for WPNormEn in pre-ictal is lesser than in normal EEGs. A p value is used for testing statistical hypothesis and it found to be less than 0.01 for all combinations of EEG cases except for Case B for LogEn. Decomposition of EEGs using wavelet packet transform results in p value lesser than 0.01 for Case B.

Table 2 Range (mean ± standard deviation) of entropy features and p value for normal, pre-ictal, and epileptic EEGs

In order to introduce the automated seizure detection procedure, REN classifier was employed with entropy features as input. By considering the typical generalization characteristics of the artificial neural network, only 58 % of the patterns were applied for training using the REN architecture. With the available 2360 patterns (for each condition), 1360 were used for training and 1000 for testing. Figure 10a–d show the performance of REN classifier and Tables 3, 4 and 5 shows the overall classification results using REN classifier with LogEn, WPLogEn, and WPNormEn. The REN classifier gives high CA rate of 99.70 % using WPLogEn with less time and number of iterations as compared to other two features for Case A (refer Table 3). The CA was found very less i.e. 67.10 % using LogEn and maximum CA obtained using WPLogEn for Case B (refer Table 4). For Case C the best accuracy recorded using WPLogEn and it uses less time and iterations to train REN classifier. Among all three entropies WPLogEn shows best results for all the cases.

Fig. 10
figure 10

REN classifier output for a LogEn for Case A, Case B, Case C b WPLogEn and WPNormEn for Case A c WPLogEn and WPNormEn for Case B d WPLogEn and WPNormEn for Case C

Table 3 Classification results using LogEn, WPLogEn and WPNormEn for Case A
Table 4 Classification results using LogEn, WPLogEn and WPNormEn for Case B
Table 5 Classification results using LogEn, WPLogEn and WPNormEn for Case C

Discussion

The time varying behavior and non-linear characteristics of EEG signals is well exploited for the studies related to epileptic seizure detection (Adeli et al. 2003; Srinivasan et al. 2007; Tzallas et al. 2007; Panda et al. 2010). The proposed study makes use of log energy, wavelet packet based log energy and norm entropies for the seizure detection. Following were the inference derived from the proposed study.

  1. 1.

    Although several works using the University of Bonn, Germany, database for EEG-based seizure detection was reported (Andrzejak 2001; Srinivasan et al. 2007; Pravin et al. 2010; Kumar et al. 2014; Chaurasiya et al. 2015), none of the work emphasize on the removal of power line noise from raw EEGs. The application of adaptive LMS algorithm showed a 2 % improvement in terms of classification accuracy compared to the raw EEGs without removal of line noise.

  2. 2.

    Entropies based detection of an epileptic seizure using EEGs have been studied and well reported in the recent past (Srinivasan et al. 2007; Pravin et al. 2010; Aydin et al. 2009; Wang et al. 2011; Gopan et al. 2015). Srinivasan et al. (2007) have employed ApEn feature with REN network as a classifier and have achieved 100 % CA. The computation cost for estimating ApEn by varying the parameters m, r and N were found to be very high and thus had constraints for real-time seizure detection. For REN, Gradient descent learning algorithm was applied to train the network. The proposed study makes use of RP algorithm and it was found to yield better results in terms of computation time. Attempts have been made using Gradient descent learning algorithm for training REN network and it was observed during the experimental study that the computation time was large compared to RP learning algorithm. Similarly, Pravin et al. (2010) have shown the influence of spectral, sample and wavelet entropies.

  3. 3.

    In the proposed study, the application of wavelet packet based log and norm entropies found proven for seizure detection due to the nature of WPT and its ability to detect varying component at a lower frequency. Figure 8a–c confirms these results. Further, the efficiency of wavelet packet based entropy found to show band variation compared to log entropy.

  4. 4.

    During the experimental study, it was observed that LogEn and norm entropy yielded the same result. Hence comparison analysis was given only with LogEn.

  5. 5.

    A preliminary study on the wavelet packet decomposition levels (Raghu et al. 2015) on trial and error basis provided the optimal selection of wavelet basis and decomposition level. The proposed study makes use of Haar wavelet basis with 5th level decomposition.

  6. 6.

    To confirm the suitability of the proposed features for classification problem, Wilcoxon non-linear statistical model was employed. Both p value and Box plot results show the significance of the wavelet packet features than log entropy features.

  7. 7.

    The selection of pattern classification is very important for the development of automated medical expert system. The proposed study makes use of the REN for classification. Being the feedback network, REN ensures effective training with early convergence and then ensuring the low computational requirement. With the MATLAB R13 environment, the network further yielded less computational time compared to the work reported earlier (Srinivasan et al. 2007; Pravin et al. 2010; Sriraam 2012, 2013). It can be inferred from the results shown in Fig. 10a–d show the wavelet packet entropy with REN proven to be excellent pattern recognition compared to the other neural network including SVM where computational burden found to be large.

  8. 8.

    The performance of the proposed method was evaluated in terms of new index parameter called relative performance (RP), which is defined as (Sriraam and Eswaran 2008; Sriraam and Shyamsunder 2011):

    $$RP = \frac{{Classification \, accuracy \left({CA} \right)}}{{Total \, coomputation \, time \, \left( {in \, seconds} \right)}}$$
    (18)

    The computation time was calculated based on the preprocessing, feature extraction, training and testing phase of the REN classifier. Higher the RP value shows the efficiency of the technique. Figure 11a–b show the plots for computational time and relative performance index using all the three feature parameters employed in the proposed study.

    Fig. 11
    figure 11

    a Total computation time b Relative performance of the proposed method

    It can be inferred from Fig. 11a–b that the RP clearly indicates the overall performance of the feature extraction parameter along with recurrent Elman neural network. Among the three entropies, WPLogEn outperforms compared to the other two techniques. The study reported by Srinivasan et al. (2007), Acharya et al. (2012a, c) have employed ApEn and SampEn where the computational cost of feature estimation were found to be high. The proposed technique overcomes such limitations (computational complexity and classification accuracy.

  9. 9.

    A similar work reported using non-normalization based Shannon entropy features obtained from best basis wavelet packet decomposition (Wang et al. 2011). The study was based on wavelet packet entropy computations using the Eq. (18) (Shannon 1948).

    $$E(s) = - \sum\limits_{i}^{{}} {s_{i}^{2} \log (s_{i}^{2} )}$$
    (19)

    where s i is coefficient of a signal in an orthonormal basis.

    Compared to the results obtained using Shannon entropy, the proposed wavelet packet log energy and norm entropies found to be superior in terms of computational complexity. The Shannon entropy based wavelet packet coefficients possess large value and due to this computational time were found to be large. The proposed technique overcomes this problem.

  10. 10.

    An overall comparison of the work proposed in the literature with the proposed study is shown in Table 6. All the work has made use of the University of Bonn, Germany, database for the seizure detection. From the Table 6, it can be observed that the proposed technique outperforms results reported by Srinivasan et al. (2007), Pravin et al. (2010).

    Table 6 A Comparison of overall classification accuracy obtained by proposed method and other’s method for EEG classification problems

    But due to the excellent computational processing facilities (Intel® core ™ i5) with R2013a MATLAB version, the proposed algorithm required less computational time with an overall classification accuracy of 99.70 %. It can be concluded that the proposed wavelet packet entropies with recurrent Elman network classifier classifier can be considered as the potential candidate for epileptic seizure detection problems. As a future step multi-classification will be considered between normal, pre-ictal and epileptic as future work with different mother wavelets and classifiers. The entire methodology will be implemented using SCILAB platform due to its open source availability option for clinical implementation.

Conclusion

The proposed research study suggested a combinational variant feature, wavelet packet log energy and norm entropies that possess time–frequency domain and non-linear domain characteristics to investigate the EEG time series. In order to distinguish normal, preictal and epileptic seizures, wavelet packet log energy and norm entropies based feature were deployed and its performance was compared with log energy entropy. Periodogram and spectrogram techniques were employed on EEG to test the presence of line noise and it was eliminated by applying Weiner adaptive filter. A recurrent Elman neural network classifier was employed to perform binary pattern classification. From the experimental study, ANNOVA test, Wilcoxon statistical test and Box plot results showed the distinguishable variation using the wavelet packet entropy features for normal, pre-ictal and epileptic conditions. It was found from the simulation results that the wavelet packet log entropy with REN classifier yields a good classification accuracy of 99.70 % for normal-pre-ictal, 99.70 % for normal-epileptic and 99.85 % for pre-ictal-epileptic respectively. The overall effectiveness of the proposed technique was evaluated in terms of relative performance index which proved wavelet packet log energy entropy feature with recurrent Elman network as the potential candidate for epileptic seizure classification from normal EEG as compared to the earlier methods reported in the literature.