Feature extraction and classification for EEG signals using wavelet transform and machine learning techniques

Amin, Hafeez Ullah; Malik, Aamir Saeed; Ahmad, Rana Fayyaz; Badruddin, Nasreen; Kamel, Nidal; Hussain, Muhammad; Chooi, Weng-Tink

doi:10.1007/s13246-015-0333-x

Feature extraction and classification for EEG signals using wavelet transform and machine learning techniques

Scientific Paper
Published: 04 February 2015

Volume 38, pages 139–149, (2015)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Australasian Physical & Engineering Sciences in Medicine Aims and scope Submit manuscript

Feature extraction and classification for EEG signals using wavelet transform and machine learning techniques

Download PDF

Hafeez Ullah Amin¹,
Aamir Saeed Malik¹,
Rana Fayyaz Ahmad¹,
Nasreen Badruddin¹,
Nidal Kamel¹,
Muhammad Hussain² &
…
Weng-Tink Chooi³

7702 Accesses
204 Citations
1 Altmetric
Explore all metrics

Abstract

This paper describes a discrete wavelet transform-based feature extraction scheme for the classification of EEG signals. In this scheme, the discrete wavelet transform is applied on EEG signals and the relative wavelet energy is calculated in terms of detailed coefficients and the approximation coefficients of the last decomposition level. The extracted relative wavelet energy features are passed to classifiers for the classification purpose. The EEG dataset employed for the validation of the proposed method consisted of two classes: (1) the EEG signals recorded during the complex cognitive task—Raven’s advance progressive metric test and (2) the EEG signals recorded in rest condition—eyes open. The performance of four different classifiers was evaluated with four performance measures, i.e., accuracy, sensitivity, specificity and precision values. The accuracy was achieved above 98 % by the support vector machine, multi-layer perceptron and the K-nearest neighbor classifiers with approximation (A4) and detailed coefficients (D4), which represent the frequency range of 0.53–3.06 and 3.06–6.12 Hz, respectively. The findings of this study demonstrated that the proposed feature extraction approach has the potential to classify the EEG signals recorded during a complex cognitive task by achieving a high accuracy rate.

EEG data classification using wavelet features selected by Wilcoxon statistics

Article 24 December 2014

Comparison of classification methods on EEG signals based on wavelet packet decomposition

Article 24 December 2014

Wavelet-based Machine Learning Techniques for ECG Signal Analysis

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Electroencephalography (EEG) is a neuroimaging technique for recording the brain’s electrical potentials, which are commonly used to study the dynamics of neural information processing in the brain, and diagnose brain disorders and cognitive processes. Large amounts of EEG data are recorded and it is not possible to analyze EEG data visually [1]. Therefore, there is a strong demand to extract relevant information from EEG recordings for the proper evaluation and understanding of the desired cognitive processes. The main steps in the process of extracting relevant information from EEG recordings include preprocessing, feature extraction and classification [2]. Extracting relevant features is among the most critical and significant steps for EEG data classification. The reason behind this is that the feature extraction step has a direct impact on the systems classification performance [3]. If the extracted features are not expressive for a certain problem, then the classification performance will not be satisfactory. In such a case, the classification method may be highly optimal for the problem but due to inadequate features, the method may not provide good classification results. Hence, extracting suitable features from EEG signals to get high classification performance is mandatory.

Recently, a multi-disciplinary research area—brain computer interface (BCI), involving researchers from neuropsychology, engineering, computer science, mathematics and neuroscience attracted a lot of interest as it has the potential to provide control capabilities and communication to people facing severe motor disabilities. BCI is a system that enables a physically disabled subject to utilize brain signals to control a device without using any muscle activity. In other words, it uses the brain signals and communicates with the external devices for control. Most of the research in BCI and related research has been performed using electroencephalogram (EEG) signals. Any good practical implementation of the BCI system demands an efficient brain signal processing scheme that could extract features and perform classification [4]. Several methods have been reported for feature extraction, which include time domain, frequency domain, and wavelet transform- (WT) based features [3]. However, WT-based analysis is highly effective, because it deals better with the non-stationary behavior of EEG signals than other methods. Wavelet-based features, including wavelet entropy [5], wavelet coefficients [2], and wavelet statistical features (mean, median, and standard deviations) have been reported for normal EEG analysis as well as in clinical applications [6, 7]. Details on the performance of time domain, frequency domain and wavelet-based techniques employed in EEG classification for cognitive tasks and/or BCI applications are provided in the related work section and the classification accuracy of these techniques are provided in the discussion section. Most of these studies have reported good results in discriminating cognitive tasks of different workloads in simulated and/or real EEG recordings [8]. Hence, the experimental design used in these studies could not find dynamics in the performance of a participant in a unique task with a constant cognitive workload. Therefore, this study presents EEG signal classification in a cognitive task with a constant workload from a baseline task using wavelet-based feature extraction and machine learning classifiers.

The purpose of this study has been to extract suitable wavelet-based features (relative wavelet energy) for the classification of EEG signals. The EEG dataset used for the validation of the proposed method consisted of two classes—EEG recordings in a complex cognitive task (class 1), and EEG recordings in a rest condition—eyes open (class 2). The paper is structured as follows: “Related work” section reviews the related previous studies of feature extraction methods, “Materials and methods” section describes the materials and methods, “Experimental results and discussion” section presents the results and discussion, and “Conclusion” section concludes the paper.

Related work

In literature, the time domain, frequency domain, and wavelet-based feature extraction techniques for classification of EEG signals have been reported [9–11]. These techniques use the time and frequency domain features in the classification models to determine the optimal feature set and combine with classifiers that gives the highest classification performance. Here, we present the recent related work of the time domain, frequency domain and wavelet-based feature extraction methods for classification of EEG signals in cognitive tasks. Time domain features mainly include sample entropy [12], approximate entropy [12], permutation entropy, fractal dimension, Hjorth parameters [13], Hurst component [10], and Lyapunov exponent [14]. Frequency domain features include EEG absolute power, relative power and power ratio in different frequency bands [15]. The time–frequency analysis include wavelet-based feature extraction and stockwell transform [16]. Hariharan et al. [16] have used the stockwell transform for feature extraction and the support vector machine (SVM) for classification of EEG signals of different cognitive tasks. The authors have reported classification accuracy between 84.72 and 98.95 %. Noshadi et al. [17] have used empirical mode decomposition and both time and frequency domain features for cognitive task classification. The authors have employed linear classifiers (k-nearest neighbor and linear discriminant analysis) and have reported 97.78 % classification accuracy. Guo et al. [8] have used weighted SVM with immune feature and classified cognitive tasks with 85.4–97.5 % accuracy. Zhang et al. [18] have reported 72.4–76.4 % classification accuracy using high frequency power and Fischer’s discriminant classifier for EEG classification in cognitive tasks. Hosni et al. [19] utilized the EEG power feature and SVM classifier with a radial basis function (RBF) kernel, and have classified three cognitive tasks with 70 % accuracy. Xue et al. [20] have used the wavelet packet transform for feature extraction with the RBF classifier, and have achieved 85.3 % accuracy. Zhiwei and Minfen [21] have used the wavelet pack entropy feature and SVM classifier and have shown 87.5–93 % accuracy for discriminating a baseline task from a cognitive task. The above cited studies have used a database of seven subjects who performed five tasks—baseline (eyes open) task, multiplication, visual counting, mental letter composing and geometric object rotation. The database was originally reported by Keirn and Aunon [22] at Colorado State University. The database consisted of only seven subjects and the experimental cognitive tasks were simple. Further, the majority of the studies utilized very few subjects for classification; for example, Zhiwei and Minfen [21] have used only two subjects’ data while Nai-Jen and Palaniappan [23] have used only four subjects’ data. Additionally, the database had a variable cognitive load in different tasks.

Many other studies have worked on EEG classification in cognitive tasks using different databases recorded by the authors or adopted from past studies. Such as, Lin and Hsieh [24] have classified cognitive tasks using EEG power features with neural network classifier and have reported 78.31 % accuracy. Rodrıguez-Bermudez et al. [25] have employed time, frequency and wavelet-based features with the SVM classifier and reported 67.96–80.71 % accuracy. This study has used four subjects’ EEG data and a linear classifier for discrimination of cognitive tasks. Karkare et al. [26] have used a scaling exponent and classified two groups who performed complex cognitive tasks using an artificial neural network, and have reported the classification performance at over 80 % accuracy. These studies reported low classification accuracy and most of them have used non-linear classifiers, such as neural network and kernel-based SVM, which are time consuming in building models for classification. The standard psychometric cognitive task, e.g., Raven the progressive metric test, has been reported by Jahidin et al. [27]. They have utilized the EEG power feature with the neural network classifier and achieved the 88.89 % accuracy. However, this study classified within the group classification for high and low cognitive processes. From the literature, we have found a gap for this study, i.e., efficient feature extraction and classification for EEG signals for an offline dataset as well as applicable for online EEG applications.

Materials and methods

This section describes the details of the materials and methods used during this study, which include the experimental tasks, dataset description, discrete wavelet transform and relative energy computation, description of classifiers and the discussion of their performance parameters.

Participants

All of the eight healthy participants were graduate students in Universiti Teknologi PETRONAS. They participated voluntarily in this study. All of them were male, right handed and aged between 24 and 32 years (28.6 ± 4.20) [28]. At the time of the experiment, they were free from any medication, drugs, neurological disorder, or head injury that may have affected the experimental results. They had normal or corrected to normal vision. Previously, they had not experienced the cognitive task used in this study.

Consent form and ethics approval

This research study was approved by the Research Coordination Committee of Universiti Teknologi PETRONAS, Malaysia [28]. All the participants had signed the informed consent form before starting the experiment. The consent document had a brief description of this research study concerning humans.

Experimental tasks

Eyes open task

In this task, there was no cognitive task to be performed. The participants were instructed to sit relaxed and try to think of nothing in particular. To maintain the concentration of the participants, they were asked to focus their attention on a point displayed at the center of a computer screen during the EEG recording. The EEG recording of this task was used as a baseline signal.

Raven’s advance progressive metric (RAPM)

RAPM is a standard psychometric test used to measure the intellectual ability. It consists of two sets (I&II). Set-I is used for practice, which contains 12 problems; Set-II consists of 36 problems used to measure the general cognitive ability. Each problem is a diagrammatic structure with some missing information and with eight multiple choices to complete the diagram’s missing part. Each correct answer has a score of ‘1’ and a score of ‘0’ is assigned for each incorrect answer. The score range is 36 and the administration time for Set-I and Set-II is 10 and 40 min, respectively (for more details about the RAPM, see [29, 30]).

Dataset

The dataset consisted of eight healthy volunteers’ EEG data, which were recorded while performing the RAPM test and eyes open condition [28, 31]. The details about the procedure of this RAPM, EEG data recording and preprocessing can be found in our previous studies [28, 31]. For feature extraction and classification, the dataset was organized into two classes. The class 1 represented all of the eight participants’ EEG data recorded during the RAPM task and the class 2 represented all of the eight participants’ EEG data recorded in the eyes open condition. In the RAPM task, each participant solved 36 problems. Thus, each participant was observed 36 times (maximum) in the RAPM task, resulting in 36 instances corresponding to a single participant. There were some un-attempted (not answered) problems with a few participants. The missing problems in all of the participants were excluded and we were left with a total of 280 instances for class 1 (8 participants × 36 problems = 288, excluding missing problems ⇒ 280 instances). Similarly, each participant’s eyes open EEG recording (class 2) was segmented according to the corresponding numbers of the problem solved in the RAPM task (class 1). Hence, we have balanced both the classes in terms of instances for classification, i.e., 280 instances for each class.

Discrete wavelet transform (DWT)

The DWT is widely used for the time–frequency analysis of biomedical signals [2, 32], especially in an EEG signal analysis due to its non-stationary characteristics. The DWT employs extensive time windows for low frequencies and short time windows for higher frequencies, resulting in good time–frequency analysis. The DWT decomposition of a signal uses successive high pass and low pass filtering of the time series and two down samplers by 2. The high pass filter g(n) is the discrete mother wavelet and the low pass filter h(n) is its mirror version [33]. The mother wavelet of the Daubechies wavelet (db4) and the corresponding scaling function are shown in Fig. 1.

The output of the first high pass and low pass filters are referred to as the approximation and detailed coefficients, represented by A1 and D1, respectively. The A1 is further disintegrated and the procedure is repeated till the specified number of decomposition levels is reached (see Fig. 2) [32, 33].

The dilation function $\varphi_{j,k} \left( n \right)$ is dependent on the low pass filter, and the wavelet function $\psi_{j,k} \left( n \right)$ is follows the high pass filter, which is denoted as follows.

$$\varphi_{\text{j,k}} \left( {\text{n}} \right) = 2^{{{\text{j}}/2}} {\text{h}}\left( {2^{\text{j}} {\text{n}} - {\text{k}}} \right)$$

(1)

$$\psi_{\text{j,k}} \left( {\text{n}} \right) = 2^{{{\text{j}}/2}} {\text{g}}\left( {2^{\text{j}} {\text{n}} - {\text{k}}} \right)$$

(2)

where n = 0, 1, 2,…, M−1; j = 0, 1, 2,…, J−1; k = 0, 1, 2,…, 2^j−1; J = log2(M); and M is the length of the signal [34].

The maximum level of decomposition is specified depending on the principal frequency components in the given signal [2]. The coefficients of the DWT are referred to as the dot product of the original time series and the designated basis functions. The approximation coefficients A _i and the detailed coefficients D _i in the ith level are denoted as [2]:

$${\text{A}}_{\text{i}} = \frac{1}{{\sqrt {\text{M}} }}\mathop \sum \limits_{\text{n}} {\text{x}}\left( {\text{n}} \right) \times \varphi_{\text{j,k}} \left( {\text{n}} \right)$$

(3)

$${\text{D}}_{\text{i}} = \frac{1}{{\sqrt {\text{M}} }}\mathop \sum \limits_{\text{n}} {\text{x}}\left( {\text{n}} \right) \times \psi_{\text{j,k}} \left( {\text{n}} \right)$$

(4)

where k = 0, 1, 2, …, 2^j−1 and M is the length of the EEG time series in the discrete points.

Relative and total wavelet sub-band energy

The wavelet energy at each decomposition level i = 1,…, L is computed as follows:

$${\text{E}}_{{{\text{D}}_{\text{i}} }} = \mathop \sum \limits_{{{\text{j}} = 1}}^{\text{N}} \left| {{\text{D}}_{\text{ij}} } \right|^{2} ,\quad {\text{i}} = 1,2,3, \ldots ,L$$

(5)

$${\text{E}}_{{{\text{A}}_{\text{i}} }} = \mathop \sum \limits_{{{\text{j}} = 1}}^{\text{N}} \left| {{\text{A}}_{\text{ij}} } \right|^{2} , \quad {\text{i}} = L$$

(6)

The ‘L’ is the maximum level of decomposition. Hence, from Eqs. 5 and 6, the total energy can be defined as:

$${\text{E}}_{\text{Total}} = \left( {\mathop \sum \limits_{{{\text{i}} = 1}}^{L} {\text{E}}_{{{\text{D}}_{\text{i}} }} + {\text{E}}_{{{\text{A}}_{L} }} } \right)$$

(7)

The normalized energy values represent the relative wavelet energy.

$${\text{E}}_{\text{r}} = \frac{{{\text{E}}_{\text{j}} }}{{{\text{E}}_{\text{Total}} }}$$

(8)

where ${\text{E}}_{\text{j}} = E_{{D_{i = 1, \ldots ,L} }} \,{\text{or}}\, E_{{A_{i = L} }}$

Feature extraction using DWT

The relevant information extraction from raw signals is a critical step in EEG pattern classification due to its direct influence on classification performance. The illustrative representation of the proposed feature extraction scheme is presented in Fig. 3. The band pass (1–48 Hz) EEG signal was decomposed into sub-band frequencies by using the discrete wavelet transformation with the Daubechies wavelet of order 4 up to level 4. The approximate and detailed coefficients were computed (see Fig. 4 as an example). Table 1 represents one channel’s sub-band percentage relative energy and its frequency range of a single subject. The total and relative sub-band energies were computed from the extracted wavelet coefficients. The relative wavelet energy $E_{rD1} ,E_{rD2} , \ldots ,E_{rA4}$ was calculated using Eq. 8.

Table 1 Frontal F3 channel’s sub-band percentages’ relative energy and their frequency range

Full size table

The relative energy features were computed for all of the participants and all of the channels’ data. Accordingly, the feature matrix of relative energy for a single participant in each EEG task and each sub-band (detailed or approximation) became as follows:

$${\text{Relative}}\, {\text{Energy}} \,{\text{Feature}}\, {\text{Matrix}} (\overrightarrow {{{\text{F}}_{\text{r}} }} ) = [{\text{E}}_{{{\text{rA}}4(280 \times 128)}} ]$$

(9)

where the number of channels was 128, number of instances in each class was 280, D₁~D₄ and A₄ were the detailed and approximation coefficients. Accordingly, the ${\text{E}}_{{{\text{D}}1(280 \times 128)}}$ represented the relative energy feature matrix of the first detailed coefficients for all of the eight participants in each class.

Classification methods

A classifier is a technique that utilizes various independent variable values (features) as input and predicts the corresponding class to which the independent variable belongs [12]. In the EEG signal analysis, the features can be any kind of extracted information from the signal, such as energy, entropy, power etc. and the class can be the type of task or the stimulus used during the recording. A classifier has a number of parameters that need to be learned from training data. The learned classifier is a model of the association between the features and the classes. For example, for a given feature x of a class y, the classifier is a function f that predicts the class y = f(x). After the learning, the classifier is able to predict new instances that have not been used in the training data. Thus, the performance of the classifier is tested on a different set of instances.

To demonstrate the effectiveness of the proposed feature extraction scheme in cognitive function classification, the SVM, multi-layer perceptron (MLP), K-nearest neighbor (K-NN) and Naïve Byes classifiers were used. The SVM used a kernel trick to transform the data points into a higher dimensional space and then separated them by a hyper-plane with a maximal margin. The MLP is a neural network-based method, which is commonly used for performing a different variety of detection and estimation tasks. The K-nearest neighbor works to find a testing sample’s class by the majority class of the k nearest training samples. The Naïve Bayes is a simple and efficient statistical method, which is based on Bayes’ theorem. For more details about SVM, MLP, K-NN and Naïve Bayes (see [1, 2, 35–37].

Experimental results and discussion

In this section, we present the experimental results for validation, and discuss them. We start with the experimental set-up used for these experiments.

Experimental setup

The classifiers were trained and tested for these extracted features using the tenfold cross validation method. The tenfold cross validation method divided the dataset into ten subsets of equal size and used nine subsets for the classifier training and one subset for the classifier testing. This process repeated ten times, each time leaving out one of the subsets from the training, which was used for testing. This method has the advantage that it utilized all of the instances in the dataset for both training and testing. The classifiers’ performances were computed using the most commonly used parameters, are the accuracy, sensitivity, specificity, precision, and Kappa statistic [38]. These parameters are defined as follows.

$${\text{Accuracy}} = \frac{{\text{Total}}\, {\text{no}}. \,{\text{of}}\, {\text{correctly}} \,{\text{classified}}\, {\text{instances}}} {{\text{Total}}\, {\text{numbers}} \,{\text{of}}\,{\text{instances}}} \times 100$$

(10)

$${\text{Sensitivity}} = \frac{{{\text{True}} \,{\text{Positive}}}}{{{\text{True}}\, {\text{Positive }} + {\text{False}} \,{\text{Negative}}}} \times 100$$

(11)

$${\text{Specificity}} = \frac{{{\text{True}}\, {\text{Negative}}}}{{{\text{True}} \,{\text{Negative}} + {\text{False}} \,{\text{Positive}}}} \times 100$$

(12)

$${\text{Precision}} = \frac{{{\text{True}}\, {\text{Positve}}}}{{{\text{True}}\, {\text{Positive}} + {\text{False}}\, {\text{Positive}}}} \times 100$$

(13)

$${\text{Kappa}} ({\text{k}}) = \frac{{\left( {{\text{P}}_{\text{o}} - {\text{P}}_{\text{e}}^{\text{C}} } \right)}}{{\left( {1 - {\text{P}}_{\text{e}}^{\text{C}} } \right)}}$$

(14)

where P _o represents the probability of the overall agreement of the label assignments between the classifier and the true process, and $P_{e}^{C}$ denotes the chance agreement over the labels—sum of the proportion of instances assigned to a class multiplies the proportion of true labels of that class in the dataset.

Classification results

The extracted relative wavelet energy features of D1–D4 and A4 were classified using SVM with RBF kernel, MLP with five hidden layers, K-NN with k = 1, and Naïve Bayes classifiers for both of the EEG conditions, i.e., eyes open and cognitive task. This classification process was implemented for the extracted features from all of the decomposition levels (D1–D4, and A4). However, the classification results were not prominent in all of the decomposition levels. The highest classification performance was found in the relative energy of the approximation coefficients and detailed coefficients of level 4, which reflected the low frequency (0.53–3.06 Hz) and above low frequency (3.06–6.12 Hz) dominations in the cognitive task (see Tables 2 and 3). A representative signal of 8 s from both the experimental tasks at the F3 electrode is presented in Fig. 5. The amplitude differences can be observed in both the 0.5–3 and 3–6 Hz frequency bands of the two experimental tasks.

Table 2 Classification results of the relative wavelet energy of the level 4 approximate coefficients (A4) for the cognitive task

Full size table

Table 3 Classification results of the relative wavelet energy of the level 4 detailed coefficients (D4) for the cognitive task

Full size table

The SVM classifier achieved 98.75 % accuracy and the MLP and K-NN classifiers achieved 98.21 %, accuracy in the classification using the relative energy of the approximation A4 coefficients. In the detailed D4 coefficients, the SVM and MLP achieved 98.21 and 98.57 % accuracy, respectively, as shown in Tables 2 and 3. The accuracy of the Naïve Bayes classifier was also found to be above 80 %. The values of the other performance parameters, such as sensitivity, specificity, precision and Kappa statistic were prominent. From these results, it seems that the relative wavelet energy in the low frequency band (0.53–3.06 Hz) and the above low frequency band (3.06–6.12 Hz) was a useful feature to classify the EEG brain patterns in eyes open and the complex cognitive task (i.e., RAPM).

Discussion

Comparison with existing techniques

A direct comparison of the results with the previous research in EEG signals was hard due to the variety of EEG datasets, wavelet types, decomposition levels, participants’ variability, and the cognitive tasks used. However, a brief comparison with the previous related studies is presented here. The information about the dataset, feature extraction methods, cognitive tasks, machine learning algorithm and the classification performance reported in previous studies are presented in Table 4. The list of studies in Table 4 have used the time domain, frequency domain, autoregressive (AR) coefficient and/or wavelet transform-based features for EEG classification in a cognitive task as mentioned in “Related work” section. The majority of the studies have used non-linear classifiers (e.g., ANN and kernel-based SVM), which are complex in nature and time consuming to build the classification model. In the case of using very few instances in the classification as mentioned in a few studies in Table 4, it may causes the over fitting problem in classification [39]. In this work, we used 280 instances for each class in the classification. Such a high number of instances in classification have not been reported in cognitive task classification using EEG. Hence, we have used both the linear (e.g., K-NN) as well as the non-linear classifiers (e.g., ANN) along with the 10-cross validation scheme. The benefit of the 10-cross validation process is that all instances in the sample are used for both training and validation exactly once [39]. Therefore, in the presence of a high number of instances, the use of multiple classifiers in the present study made it comparable with previous studies in terms of classification performance. The classification results of this study in both linear and non-linear classifiers were found to be better than related studies which used similar classifiers and the same nature of the cognitive tasks.

Table 4 Summary of existing feature extraction and classification techniques for EEG in cognitive tasks

Full size table

EEG low frequencies with cognitive neuroscience perspective

The EEG low frequency bands (delta and theta) have been reported by the cognitive neuroscientists as cognitive rhythms, and have been linked with cognitive and attention demanded tasks [43–45]. Especially, the event-related potential (ERP) studies have reported the most significant findings of the delta band related to cognitive processing [44], i.e., the associations of the P300 component with the cognitive process [46]. This relationship has been widely reported in the cognitive neuroscience literature. In brief, the delta band has been considered as the primary contributor to the P300 component of ERP [47]. Gennady et al. [48] reviewed the delta band relationship with cognitive processing and have confirmed that delta is linked with the cognitive process. Similarly, the theta rhythms are the most intensively studied in the cognitive neuroscience aimed at correlating the theta rhythms with cognitive processing [49–51]. Particularly, theta in the frontal regions is critical for attentional and cognitive processing in ERP tasks [50]. Most of these studies reported significant increase in delta and theta power in the cognitive tasks. This may be the reason, that we achieved high classification accuracy in the low frequency bands (0.53–3.06 and 3.06–6.12 Hz) for discriminating the RAPM and baseline—eyes open task. Hence, the results of this study reflect the previous studies’ findings in cognitive neuroscience research.

Conclusion

This paper has presented the use of relative discrete wavelet energy along with machine learning algorithms for the classification and the quantitative analysis of spontaneous EEG signals recorded during complex cognitive task. The EEG signals were split into sub-bands using DWT with Daubechies (db4) wavelets and the sub-bands’ relative energies were computed for all of the 128 channels of each subject’s EEG recording. For classification, four different classifiers (SVM, MLP, K-NN and Naïve Bayes) were employed and their performance was evaluated for cognitive task discrimination. The classification results of SVM and MLP demonstrated above 98 % accuracy with features extracted using A4 (0.53–3.06 Hz) and D4 (3.06–6.12 Hz) sub-bands. The wavelet energy is a useful feature to classify the EEG signals corresponding to complex cognitive tasks, and it will be helpful for EEG classification in clinical applications, such as epilepsy, depression, and stress diagnosis as it is capable of identifying variations in non-stationary EEG signals because of the localization characteristics of the wavelet transformation [5]. The low frequencies, especially in the range of the delta band, are perceived in cognitive neuroscience as the primary contributor to the cognitive processing. Hence, the proposed feature scheme has the clinical significance to be applied on real time EEGs in BCI applications for severe motor disabled patients to control external devices using cognitive power. This may be implemented in future work.

References

Übeyli ED (2009) Combined neural network model employing wavelet coefficients for EEG signals classification. Digit Signal Proc 19:297–308
Article Google Scholar
Orhan U, Hekim M, Ozer M (2011) EEG signals classification using the K-means clustering and a multilayer perceptron neural network model. Expert Syst Appl 38:13475–13481
Article Google Scholar
Iscan Z, Dokur Z, Demiralp T (2011) Classification of electroencephalogram signals with combined time and frequency features. Expert Syst Appl 38:10499–10505
Article Google Scholar
Liang N-Y, Saratchandran P, Huang G-B, Sundararajan N (2006) Classification of mental tasks from EEG signals using extreme learning machine. Int J Neural Syst 16:29–38
Article PubMed Google Scholar
Rosso OA, Blanco S, Yordanova J, Kolev V, Figliola A, Schürmann M et al (2001) Wavelet entropy: a new tool for analysis of short duration brain electrical signals. J Neurosci Methods 105:65–75
Article CAS PubMed Google Scholar
Yazdani A, Ebrahimi T, Hoffmann U (2009) Classification of EEG signals using Dempster Shafer theory and a k-nearest neighbor classifier. In: 4th International IEEE/EMBS conference on neural engineering, pp 327–330
Garry H, McGinley B, Jones E, Glavin M (2013) An evaluation of the effects of wavelet coefficient quantisation in transform based EEG compression. Comput Biol Med 43:661–669
Article PubMed Google Scholar
Guo L, Wu Y, Zhao L, Cao T, Yan W, Shen X (2011) Classification of mental task from EEG signals using immune feature weighted support vector machines. IEEE Trans Magn 47:866–869
Article Google Scholar
Taghizadeh-Sarabi M, Daliri MR, Niksirat KS (2014) Decoding objects of basic categories from electroencephalographic signals using wavelet transform and support vector machines. Brain Topogr 1–14
Acharya UR, Sree SV, Ang PCA, Yanti R, Suri JS (2012) Application of non-linear and wavelet based features for the automated identification of epileptic EEG signals. Int J Neural Syst 22:1250002
Article PubMed Google Scholar
Gandhi T, Panigrahi BK, Anand S (2011) A comparative study of wavelet families for EEG signal classification. Neurocomputing 74:3051–3057
Article Google Scholar
Richman JS, Moorman JR (2000) Physiological time-series analysis using approximate entropy and sample entropy. Am J Physiol Heart Circ Physiol 278:H2039–H2049
CAS PubMed Google Scholar
Vidaurre C, Krämer N, Blankertz B, Schlögl A (2009) Time domain parameters as a feature for EEG-based brain–computer interfaces. Neural Netw 22:1313–1319
Article PubMed Google Scholar
Übeyli ED (2010) Lyapunov exponents/probabilistic neural networks for analysis of EEG signals. Expert Syst Appl 37:985–992
Article Google Scholar
Thatcher RW, North D, Biver C (2005) EEG and intelligence: relations between EEG coherence, EEG phase delay and power. Clin Neurophysiol 116:2129–2141
Article CAS PubMed Google Scholar
Hariharan M, Vijean V, Sindhu R, Divakar P, Saidatul A, Yaacob S (2014) Classification of mental tasks using stockwell transform. Comput Electr Eng 40:1741
Article Google Scholar
Noshadi S, Abootalebi V, Sadeghi MT, Shahvazian MS (2014) Selection of an efficient feature space for EEG-based mental task discrimination. Biocybern Biomed Eng 34:159–168
Article Google Scholar
Zhang L, He W, He C, Wang P (2010) Improving mental task classification by adding high frequency band information. J Med Syst 34:51–60
Article PubMed Google Scholar
Hosni SM, Gadallah ME, Bahgat SF, AbdelWahab MS (2007) Classification of EEG signals using different feature extraction techniques for mental-task BCI. In: International conference on computer engineering & systems, 2007. ICCES’07, pp 220–226
Xue J-Z, Zhang H, Zheng C-x, Yan X-G (2003) Wavelet packet transform for feature extraction of EEG during mental tasks. In: International conference on machine learning and cybernetics, 2003, pp 360–363
Zhiwei L, Minfen S (2007) Classification of mental task EEG signals using wavelet packet entropy and SVM. In: 8th International conference on electronic measurement and instruments, 2007. ICEMI’07, pp 3-906–3-909
Keirn ZA, Aunon JI (1990) A new mode of communication between man and his surroundings. IEEE Trans Biomed Eng 37:1209–1214
Article CAS PubMed Google Scholar
Nai-Jen H, Palaniappan R (2004) Classification of mental tasks using fixed and adaptive autoregressive models of EEG signals. In: 26th Annual international conference of the IEEE engineering in medicine and biology society, 2004. IEMBS’04, pp 507–510
Lin C-J, Hsieh M-H (2009) Classification of mental task from EEG data using neural networks based on particle swarm optimization. Neurocomputing 72:1121–1130
Article Google Scholar
Rodríguez-Bermúdez G, García-Laencina PJ, Roca-González J, Roca-Dorda J (2013) Efficient feature selection and linear discrimination of EEG signals. Neurocomputing 115:161–165
Article Google Scholar
Karkare S, Saha G, Bhattacharya J (2009) Investigating long-range correlation properties in EEG during complex cognitive tasks. Chaos Solitons Fractals 42:2067–2073
Article Google Scholar
Jahidin A, Ali MM, Taib MN, Tahir NM, Yassin IM, Lias S (2014) Classification of intelligence quotient via brainwave sub-band power ratio features and artificial neural network. Comput Methods Programs Biomed 114:50–59
Article CAS PubMed Google Scholar
Amin HU, Malik AS, Subhani AR, Badruddin N, Chooi W-T (2013) Dynamics of scalp potential and autonomic nerve activity during intelligence test. In: Lee M et al (eds) Neural information processing, vol 8226. Springer, Berlin, pp 9–16
Chapter Google Scholar
Raven J (2000) The Raven’s progressive matrices: change and stability over culture and time. Cogn Psychol 41:1–48
Article CAS PubMed Google Scholar
Kunda M, McGreggor K, Goel A (2012) Reasoning on the Raven’s advanced progressive matrices test with iconic visual representations. In: 34th Annual conference of the cognitive science society pp 1828–1833
Amin HU, Malik AS, Badruddin N, Chooi W-T (2013) EEG mean power and complexity analysis during complex mental task. In: ICME International conference on complex medical engineering (CME) pp 648–651
Jahankhani P, Kodogiannis V, Revett K (2006) EEG signal classification using wavelet feature extraction and neural networks. In: IEEE John Vincent Atanasoff 2006 international symposium on modern computing, 2006. JVA’06. pp 120–124
Subasi A (2007) EEG signal classification using wavelet feature extraction and a mixture of expert model. Expert Syst Appl 32:1084–1093
Article Google Scholar
Gonzalez RC, Woods RE (2002) Digital image processing, 2nd edn. Prentice Hall, SL
Google Scholar
Hausfeld L, De Martino F, Bonte M, Formisano E (2012) Pattern analysis of EEG responses to speech and voice: influence of feature grouping. Neuroimage 59:3641–3651
Article PubMed Google Scholar
Ben-Hur A, Weston J (2010) A user’s guide to support vector machines. In: Data mining techniques for the life sciences, Springer, New York, pp 223–239
Pereira F, Mitchell T, Botvinick M (2009) Machine learning classifiers and fMRI: a tutorial overview. Neuroimage 45:S199–S209
Article PubMed Central PubMed Google Scholar
Subasi A, Gursoy MI (2010) EEG signal classification using PCA, ICA, LDA and support vector machines. Expert Syst Appl 37:8659–8666
Article Google Scholar
Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques. Morgan Kaufmann, Burlington
Google Scholar
Daud M, Yunus J (2004) Classification of mental tasks using de-noised EEG signals. In: 7th International conference on signal processing, pp 2206–2209
Guo L, Rivero D, Seoane JA, Pazos A (2009) Classification of EEG signals using relative wavelet energy and artificial neural networks. In: Proceedings of the first ACM/SIGEVO summit on genetic and evolutionary computation, pp 177–184
Diez PF, Mut V, Laciar E, Torres A, Avila E (2009) Application of the empirical mode decomposition to the extraction of features from EEG signals for mental task classification. In: Annual international conference of the IEEE engineering in medicine and biology society, 2009. EMBC 2009, pp 2579–2582
Dimitriadis SI, Laskaris NA, Tsirka V, Vourkas M, Micheloyannis S (2010) What does delta band tell us about cognitive processes: a mental calculation study. Neurosci Lett 483:11–15
Article CAS PubMed Google Scholar
Harper J, Malone SM, Bernat EM (2014) Theta and delta band activity explain N2 and P3 ERP component activity in a go/no-go task. Clin Neurophysiol 125:124–132
Article PubMed Central PubMed Google Scholar
Amin H, Malik AS (2013) Human memory retention and recall processes: a review of EEG and fMRI studies. Neurosciences 18:330–344
PubMed Google Scholar
Ergen M, Marbach S, Brand A, Başar-Eroğlu C, Demiralp T (2008) P3 and delta band responses in visual oddball paradigm in schizophrenia. Neurosci Lett 440:304–308
Article CAS PubMed Google Scholar
Demiralp T, Ademoglu A, Schürmann M, Basar-Eroglu C, Basar E (1999) Detection of P300 waves in single trials by the wavelet transform (WT). Brain Lang 66:108–128
Article CAS PubMed Google Scholar
Gennady GK (2012) EEG delta oscillations as a correlate of basic homeostatic and motivational processes. Neurosci Biobehav Rev 36:677–695
Article Google Scholar
Baijal S, Srinivasan N (2010) Theta activity and meditative states: spectral changes during concentrative meditation. Cogn Process 11:31–38
Article PubMed Google Scholar
Sakowitz O (2001) The selectively distributed theta system: functions. Int J Psychophysiol 39:197–212
Article PubMed Google Scholar
Nigbur R, Ivanova G, Stürmer B (2011) Theta power as a marker for cognitive interference. Clin Neurophysiol 122:2185–2194
Article PubMed Google Scholar

Download references

Acknowledgments

This research work has been supported by University Research Internal Funding (URIF: 0153AA-B26), Universiti Teknologi PETRONAS; the Fundamental Research Grant Scheme (Ref: FRGS/1/2014/TK03/UTP/02/1), Ministry of Education (MOE), Malaysia and by NSTIP strategic technologies programs, Grant number (12-INF2582-02), in the Kingdom of Saudi Arabia.

Author information

Authors and Affiliations

Department of Electrical & Electronic Engineering, Centre for Intelligent Signal & Imaging Research (CISIR), Universiti Teknologi PETRONAS, Bandar Seri Iskandar, 31750, Tronoh, Perak, Malaysia
Hafeez Ullah Amin, Aamir Saeed Malik, Rana Fayyaz Ahmad, Nasreen Badruddin & Nidal Kamel
Department of Computer Science, College of Computer and Information Sciences, King Saud University, Riyadh, 12372, Saudi Arabia
Muhammad Hussain
Advanced Medical and Dental Institute (AMDI), Universiti Sains Malaysia, Kepala Batas, 13200, Penang, Malaysia
Weng-Tink Chooi

Authors

Hafeez Ullah Amin
View author publications
You can also search for this author in PubMed Google Scholar
Aamir Saeed Malik
View author publications
You can also search for this author in PubMed Google Scholar
Rana Fayyaz Ahmad
View author publications
You can also search for this author in PubMed Google Scholar
Nasreen Badruddin
View author publications
You can also search for this author in PubMed Google Scholar
Nidal Kamel
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Hussain
View author publications
You can also search for this author in PubMed Google Scholar
Weng-Tink Chooi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Aamir Saeed Malik.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Amin, H.U., Malik, A.S., Ahmad, R.F. et al. Feature extraction and classification for EEG signals using wavelet transform and machine learning techniques. Australas Phys Eng Sci Med 38, 139–149 (2015). https://doi.org/10.1007/s13246-015-0333-x

Download citation

Received: 10 June 2014
Accepted: 29 January 2015
Published: 04 February 2015
Issue Date: March 2015
DOI: https://doi.org/10.1007/s13246-015-0333-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Feature extraction and classification for EEG signals using wavelet transform and machine learning techniques

Abstract

Similar content being viewed by others

EEG data classification using wavelet features selected by Wilcoxon statistics

Comparison of classification methods on EEG signals based on wavelet packet decomposition

Wavelet-based Machine Learning Techniques for ECG Signal Analysis

Introduction

Related work