RETRACTED ARTICLE: Automatic detection and classification of EEG artifacts using fuzzy kernel SVM and wavelet ICA (WICA)

Yasoda, K.; Ponmagal, R. S.; Bhuvaneshwari, K. S.; Venkatachalam, K.

doi:10.1007/s00500-020-04920-w

RETRACTED ARTICLE: Automatic detection and classification of EEG artifacts using fuzzy kernel SVM and wavelet ICA (WICA)

Methodologies and Application
Published: 12 April 2020

Volume 24, pages 16011–16019, (2020)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Soft Computing Aims and scope Submit manuscript

RETRACTED ARTICLE: Automatic detection and classification of EEG artifacts using fuzzy kernel SVM and wavelet ICA (WICA)

Download PDF

K. Yasoda¹,
R. S. Ponmagal²,
K. S. Bhuvaneshwari³ &
…
K. Venkatachalam⁴

1742 Accesses
58 Citations
Explore all metrics

This article was retracted on 12 August 2024

This article has been updated

Abstract

Electroencephalography (EEG) is almost contaminated with many artifacts while recording the brain signal activity. Clinical diagnostic and brain computer interface applications frequently require the automated removal of artifacts. In digital signal processing and visual assessment, EEG artifact removal is considered to be the key analysis technique. Nowadays, a standard method of dimensionality reduction technique like independent component analysis (ICA) and wavelet transform combination can be explored for removing the EEG signal artifacts. Manual artifact removal is time-consuming; in order to avoid this, a novel method of wavelet ICA (WICA) using fuzzy kernel support vector machine (FKSVM) is proposed for removing and classifying the EEG artifacts automatically. Proposed method presents an efficient and robust system to adopt the robotic classification and artifact computation from EEG signal without explicitly providing the cutoff value. Furthermore, the target artifacts are removed successfully in combination with WICA and FKSVM. Additionally, proposes the various descriptive statistical features such as mean, standard deviation, variance, kurtosis and range provides the model creation technique in which the training and testing the data of FKSVM is used to classify the EEG signal artifacts. The future work to implement various machine learning algorithm to improve performance of the system.

Automated and accurate focal EEG signal detection method based on the cube pattern

Article 01 February 2023

Review of EEG Signal Classification

A Preprocessing Method of EEG Based on EMD-ICA in BCI

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The brain signal is recorded with electroencephalography (EEG) method in which electrical activity of the cerebral cortex is monitored and different electrodes are placed on the scalp. Presently, noninvasively, an electroencephalography signals are recorded and monitored. Clinical diagnosis and sleep disorders are most widely identified by the EEG technique. Data preprocessing is required when the visual inspection artifact is not a final one, and these artifacts may go ahead with ambiguous results. Generally, the segmentation of the whole affected part with the artifacts is difficult to classify that may in turn leads irrelevant data and data loss. An automatic artifact removal from existence to real time processing is very difficult, so this paper proposes the idea to separate the artifacts automatically. Nowadays, the EEG has also ever-increasing concentration in brain computer interface applications (BCI). But EEG electrical activity signal is corrupted with many biological or natural, medical and societal artifacts (Mahajan and Morshed 2015; Shoker et al. 2005; Jung et al. 2000). Then, societal artifacts are originated and taken from external part (i.e., outer part of human body) because of the motor development and interference in EEG from outside devices like electric motor and potential power.

Obviously, these two artifacts are the obstacles of the BCI applications and clinical diagnosis. Conventionally, the artifact removal is done with linear regressions and filters, with respect to the time and frequency of the target artifacts (Gotman et al. 1973; Woestenburg et al. 1983). Filtering in time or frequency acquires the consistent loss of the brain activity due to the overlap between the signal artifacts and neurological activity (de Beer et al. 1995; Guerrero-Mosquera and Navia-Vazquez 2012). Wavelet using multi-resolution analysis is a more effective technique to remove the artifact, while saving the skeleton in EEG includes both time and frequency domains (Zhang et al. 2004; Mamun et al. 2013). The split target function is established with independent component analysis (ICA) where the set is partitioned into independent component (IC) with small set of blind partition (Jung et al. 2000; Mammone et al. 2012). The method uses spatial filters derived by the ICA algorithm and does not require reference channels for each artifact source. Once the independent time courses of different brain and artifact sources are extracted from the data, “corrected” EEG signals can be derived by eliminating the contributions of the artifactual sources. The ICA algorithm is highly effective at performing source separation in domains where (1) the mixing medium is linear and propagation delays are negligible, (2) the time courses of the sources are independent, and (3) the number of sources is the same as the number of sensors; the global or public artifacts are removed by using the technique called ICA with signals cutoff frequency bands (Gallois et al. 2006).

The EEG signal related with visual inspection is recorded by using the combined method of wavelet ICA, or to apply the manual defined function of artifact removal or random threshold value to classify the noisy element from EEG signals (Devipriya and Nagarajan 2018). Default threshold value can be unsuccessful to catch the artifact that is targeted that very near to the randomly characterized artifacts decision margin of the EEG signals. The false rate may get increased when the threshold value was defined manually.

This paper proposes the technique called a novel method of combining the wavelet ICA (WICA) with fuzzy kernel SVM for effective and robust process of artifacts removal of EEG signals and also gives an improved technique of artifact removal by selecting the training and testing the features or data. Both techniques allow the removal of artifacts with minimal error rate for the brain signals. Finally, test the recorded EEG signal in the publicly available dataset in EEGLAB (Delorme and Makeig 2004). In future discusses the ensemble method of many artifacts removal.

2 Literature review

A single event-related trial-based potentials are separated for the source of blind using 2nd order statistics for correcting the artifacts automatically (Chang et al. 2006). Event-related potentials are normally hided with a variety of artifacts. The properties of artifacts are attenuated with different techniques like epoch removal, electrooculogram (EOG) linear regression and feature extraction. Since the existing techniques are not focused on the automatic removal of artifacts from one ERP epoch, ICA integrating with nth-order form, i.e., higher-order statistics that require the huge volume of input samples to achieve the robust result, this proposed work deals with the technique of automatic identification of artifacts in given raw data by giving the purpose eligibility of different artifacts. Time domain signal amplitude acts as the base for the eligibility.

In Lee (1998), the problem of artifacts selection and extraction in EEG was given, then proposes a novel method to remove the artifacts with combined exercise of wavelet transform that is integrated with ICA. This is contrasted through wavelet denoising. An efficient artifact from EEG recordings using wavelet ICA is proposed. Mainly, four different kinds of waves identified and denoted as alpha, beta, delta and theta. The frequency range from 0 to 4 Hz denotes delta waves, and it is correlated by deep sleep stage. The frequency range from 4 to 8 Hz denotes theta wave, and it is related by drowsiness. The frequency range from 8 to 12 Hz denotes alpha wave, and it is related by relaxed stage. Finally, the frequency range above 12 Hz is related with beta wave which is in active stage.

In Mammone et al. (2012), a novel method for automatic ICA is proposed for removing the artifacts and to trough the AWICA, the performance is increased and the multichannel artifact removal from recorded EEG signal is automated. This provides combined form of wavelet ICA: it contains two major flow process of artifact removal: (1) Kurtosis, (2) entropy, both are synthesized and processed with proposed method. The main objective is to avoid the disadvantages of feature extraction technique of ICA.

In Lu et al. (2006), the EEG artifacts are removed by the effective method of independent component analysis (ICA). During eye blink, an important step is used to classify correctly and indentify the component which is artifacts within the independent component. This component automatically projects the eye blink artifacts depends on the template or pattern of the brain structure that could exemplified as a pattern identical technique. So only feature relevant with spatial is engaged in singleton away from the eye blink element, and this technique is proven to be effective and easier to execute. Finally, this method proves that artifacts are rejected when eyes are blinked while considering the brain activities.

When considering long time periods, multiple factors make it necessary to treat the EEG signal as a nonstationary, stochastic process, i.e., whose mean, correlation and higher-order moments are time varying (Blanco et al. 1995). Short intervals, on the other hand, can reasonably be considered stationary, that is of time-invariant statistical properties, the validity of which depends on the type of signal. Stationarity really depends on the recording conditions, with statistical tests revealing that EEG may be stationary for just a few seconds to several minutes.

3 Methodology

3.1 Limitations of ICA in an artifact removal

ICA can accept widely in which the given artifact is not statistically dependent from the rest of the signals. If the artifacts are considered to be an external, then the hypothesis is defined with clear way when it is in internal state is also selected because of the generating events initially started in the brain field at that time course of artifacts have no information about the triggered event. This is the main purpose to use the ICA for artifact removal. ICA normally carries artifactual information as individual element and many different time elements have non-artifactual knowledge and it can give data loss most of the time. However, the performances of the ICA can fully depend upon the dimension of the samples. When the dimension of an input get increases, the probability of the given number of input is conquered with total amount of channels because the amount of the communication is fixed. This case produces redundancy, and it is not mandatory to calculate the sources in an effective way. Otherwise, algorithm cannot possible to divide signal that is considered as artifacts from the sources. In controversy, the small amount of input samples will lead hard evaluation of arguments with ICA performance that will also get low. So as to overcome the difficulty of this technique, the proposed work includes wavelet ICA (WICA).

3.2 Wavelet ICA technique

According to the limitation of ICA in an artifacts extraction, the proposed WICA allows to increase the redundancy, then use various features/attributes of EEG signal artifacts of frequency domain.

The wavelet extraction related with mother wavelet ψ(t) and scaling function φ(t) of the input signal x(t)

$$ x\left( t \right) = \mathop \sum \limits_{k} m_{j0k} \varphi_{j0k} \left( t \right) + \mathop \sum \limits_{j = j0} \mathop \sum \limits_{k} n_{jk} \psi_{jk} \left( t \right) $$

(1)

Here, j0 is the random inputting scale. The 1st part of the Eq. 1 is not an exact value of the scale j0. Second part represents summation of inputs.

The approximation coefficients m_j0k are represented by

$$ m_{j0k} = \smallint x\left( t \right)\varphi_{j0k}^{*} \left( t \right){\text{d}}t $$

(2)

and

$$ \varphi_{j0k} = \frac{1}{{\sqrt {2^{j0 } } }}\varphi \left( {\frac{{t - k2^{j0} }}{{2^{jo} }}} \right) $$

(3)

Equations (2) and (3) are denoted as scaling functions and related coefficients that are defined with the following equation^{Footnote 1}

$$ n_{jk} = \smallint x\left( t \right)\psi_{jk}^{*} \left( t \right){\text{d}}t $$

(4)

and

$$ \psi_{jk} = \frac{1}{{\sqrt {2^{j } } }}\psi \left( {\frac{{t - k2^{j} }}{{2^{j} }}} \right) $$

(5)

Equations (4) and (5) are denoted as a wavelet functions.

An occurrence of artifact is detected in a particular channel that is used to divide the given input channel into particular amount of wavelet components (WCs), if an artifactual substances of the spectral content not fully focused on the levels of the decomposition, ICA will be applied in wavelet components and can take the advantages of an improved redundancy, because of the occurrence of the event that is visible to single channel is turned to be visible in more than that single channel

The block diagram of EEG artifacts removal using WICA is shown in Fig. 1. The first step is denoting the wavelet disintegration that used to separate the given dataset into four parts of cerebrum activity, for example, the taken input dataset into n-dimensional vector space in which the ICA is implemented. This new vector space contains scaling and wavelet functions denoted with n − 1 number of decomposition so that the scaling and wavelet function uniformly depends upon the given wavelet family. This paper proposes the Daubechies-4 wavelet family (Ten 1992), and the raw dataset is processed into n-feature/dimensional vector space once the ICA process is selected when the wavelet component is attached to the artifactual activity. The following terms represent the observed random vector in Eq. (6)

$$ {\mathbf{p}} = \, \left( {p_{1} , \, p_{2} , \ldots ,p_{n} } \right)^{\text{T}} $$

(6)

and

$$ {\text{Independent}}\;{\text{component}}\;\left( {I_{p} C_{s} } \right)_{k} $$

where k = 1, 2, …, n:

$$ p_{i} = \, a_{i,1} \left( {I_{p} C_{s} } \right)_{1} + \cdots a_{i,k} \left( {I_{p} C_{s} } \right)_{k} + \, \cdots a_{i,n} \left( {I_{p} C_{s} } \right)_{n} \; {\text{representing}}\;{\text{the}}\;{\text{mixing}}\;{\text{weights}}\;a_{i,k} . $$

The above equation is also represented in Eq. (7)

$$ p = \mathop \sum \limits_{k = 1}^{n} s_{k } a_{k} $$

(7)

$$ S_{k} = \left( {I_{p} C_{s} } \right)_{k} $$

the basic vector a_k represents the random vector p form the value of e.

$$ {\mathbf{e}} = {\mathbf{Mp}}\;{\text{where}}\;{\text{the}}\;{\text{case}}\;{\text{of}}\;{\text{noiseless}} $$

where p denotes n-dimensional vector; this has to be used to extract the I_pC_s. Here, p denotes the dataset which selects the wavelet component and e represents the estimation of I_pC_s and M denotes the matrix. The matrix M is estimated with respect to additional learning rule (Lee 1998). This rule is used to extract the effective artifacts removal such as eye blinks and line noise occurred when electrical signal happens from brain.

3.3 Fuzzy kernel support vector machine classification

Support vector machine (SVM) generally used as a binary classifier under machine learning technique of supervised learning (Chang and Lin 2011). The main objective is to build an optimal hyperplane by using training dataset as shown in Fig. 2 that is used to test two or more datasets in the classification for testing the dataset. Vapnik proposes the technique of SVM which is used to extend the study of classification and regression (Belousov et al. 2002). The best hyperplane is selected to calculate the maximal margin with the closet dataset which is different from support vectors. Increasing the general characteristics, maximal of margin is used to construct the capabilities. The optimization problem is quadratic when training the SVM.

The construction of SVM hyperplane is defined as

$$ {\mathbf{W}}^{\text{T}} {\mathbf{X}} + b = 0 $$

(8)

Here W is the weight vector and b is defined as offset parameter. In order to maximize the margin, the hyperplane and their closest point denoted as support vectors.

This type of decision boundary classification is called as linear SVM, and kernel trick used for decreasing the complexity of classification in a nonlinear margin is known as nonlinear SVM. Nonlinear margin function is defined as Eq. (9)

$$ f\left( x \right) = {\text{sign }}\left( {d\left( x \right)} \right) $$

(9)

$$ d\left( x \right) = \mathop \sum \limits_{n = 1}^{n} \alpha_{i} y_{i} \left( {x, x_{i}^{n} } \right) + b $$

where the d(x) represents the distance function, $ \alpha_{i} $ represents the Lagrange multiplier, n represents the amount of support vectors, and b denotes offset parameter.

The kernel function K $ \left( {x, x_{i}^{n} } \right) $ used to compute the nonlinear mapping function Eq. (10)

$$ x \to \varphi \left( x \right) $$

(10)

where the kernel K $ \left( {x, x_{i}^{n} } \right) $ function commonly used for BCI (brain computer interface) research represented as Gaussian or radial basis function (RBF) Kernel is denoted as

$$ K\left( {x, x_{i } } \right) = \exp \left( {\frac{{\left| {\left| {x - x_{i} } \right|} \right|^{2} }}{{2\sigma^{2} }}} \right) . $$

(11)

If RBF kernel is used by SVM, two main parameters are considered (1) kernel parameter ($ \sigma $) and (2) trade-off parameter. In traditional way, the above two parameters are optimized for good performance by using the n-fold cross-validation. All these calculation are taken place solely for training data.

3.4 Fussy kernel support vector machine (FKSVM)

The optimal hyperplane calculated by the SVM classifier depends upon the low amount of data. It leads to errors or outliers in training data. To overcome this problem, FKSVM is proposed with fuzzy SVM membership of input data. FKSVM (Devipriya and Nagarajan 2018) is also used to concentrate on maximization of margin like traditional SVM but at the same while considering the outliers with less membership prevents the noises to make a narrow data points in terms of higher probability (Lin and Wang 2004).

Suppose the task of binary classification with t training data represented as [X₁, y₁, m₁], …, [X_t, y_t, m_t]. For each and every training data set

$$ X_{i} \in \Re^{N} $$

(12)

Output label y_i ∈ {+ 1, − 1} and fuzzy membership m_i ∈ [$ \sigma $, 1].

i is represented as i = 1,…, m, and it is enough when $ \sigma > \, 0 $ the training data point equal to 0 represent for empty then it should deleted in the training data without affecting training dataset. Then, optimal hyperplane is determined with minimum error by the term used to measure error in SVM. To minimize error function using Eq. (13)

$$ \frac{1}{2}\varvec{W}.\varvec{W} + C \mathop \sum \limits_{i = 1}^{t} m_{i} \xi_{i} $$

(13)

With respect to

$$ \varvec{y}_{\varvec{i}} \left( {\varvec{W}.\varvec{x}_{\varvec{i}} + b} \right) \ge 1 - \xi_{i} $$

where

$$ i = 1, \ldots ,t $$

The best possible (optimal) hyperplane was calculated by finding the quadratic equation by using the Kuhn–Tucker conditions and Lagrange multiplier.

The method of Lagrange multipliers is a strategy for finding the local maxima and minima of a function subject to equality constraints (i.e., subject to the condition that one or more equations have to be satisfied exactly by the chosen values of the variables). The basic idea is to convert a constrained problem into a form such that the derivative test of an unconstrained problem can still be applied.

The Lagrange multiplier theorem roughly states that at any stationary point of the function that also satisfies the equality constraints, the gradient of the function at that point can be expressed as a linear combination of the gradients of the constraints at that point, with the Lagrange multipliers acting as Coefficients. The relationship between the gradient of the function and gradients of the constraints rather naturally leads to a reformulation of the original problem, known as the Lagrangian function.

To select the appropriate fuzzy memberships is a crucial step for designing the fuzzy SVM classifiers. The objective rule is to process a membership value which is appropriate to the input information points can only depend upon the corresponding information points of their own classes. The below section discusses the statistical measures (feature selection) projected to find out the accuracy of the performance system.

4 Results and discussion

In this research, used a dataset that is taken from the database available ftp://ieee.org/uploads/press/rangayyan/. This contains eight channels without artifacts signal of EEG. It is processed under sampling rate of frequency 100 Hz; time taken for recording the signal considered as 7.5 s. The scalp electrode position/placement of EEG is represented in Mammone et al. (2007). Delorme et al. (2007) simulated the different methods of artifacts described here: (i) eye blink, (ii) electrical power shift and (iii) temporal motor movement signal. This temporal random motor movement signal artifact is filtered with band pass filter with the frequency between 20 and 70 Hz and simultaneously filters the eye blink artifacts by random band pass filtered between 2 and 5 Hz.

To facilitate a visual artifactual EEG, correlate the one to one with artifacts of central and frontal electrodes placed in the scalp, therefore acquired four different datasets of eight channels brain signals degraded with artifacts.

4.1 Artifacts removal

MATLAB^® was used for implementing the artifact removal by applying the method of WICA and fuzzy kernel SVM. The frequency range is subdivided into various subbands relevant with the EEG measure because entire frequency range in the EEG is decomposed with four levels wavelet and this measure is considered as an enough to separate the frequency range into sub-bands. The EEG measure is categorized as delta: 0–4 Hz, theta: 4–8 Hz, alpha: 8–12 Hz, beta: 12 Hz and higher. The EEG measure used to extract the signal from the EEG artifact is described in the below algorithm

The EEG measures project the clear output such as delta and theta range inclined with wavelet components (Daubechies-4) of visual condition. So delta and theta waves are instructed to correlate with eye blinks. Very high frequency such as beta and alpha concentrate on the motor movement activity, and majorly beta measures are correlated with motor movement activity. Because of equal influence of the various waves, the decrease in artifact loss was also possible in order to obtain the efficiency of the artifacts removal; optimization is accomplished for further processing.

4.2 Automatic selection of artifact component correlated with wavelet

Various statistical measures were estimated for measuring the artifacts with the correlated wavelet components. The statistical measures such as: (µ) mean, (σ) standard deviation, (k) kurtosis and (E) entropy are described in the below section.

4.2.1 Mean (M)

This term mean refers to measure the central tendency in the data points

$$ \overline{x} = \frac{1}{N}\mathop \sum \limits_{n = 1}^{N} x\left( n \right) . $$

(14)

4.2.2 Standard deviation (SD)

SD is used to calculate the quantity of variation or dispersion of data, and this used a feature as time domain. It is regularly evaluated with the following expression

$$ \sigma_{x} = \frac{1}{N - 1}\mathop \sum \limits_{n = 1}^{N} \left( {x\left( n \right) - \overline{x} } \right)^{2} . $$

(15)

4.2.3 Hjorth parameters (HP)

HP is design to calculate the mobility, movement and density of the EEG signals (Hjorth 1975). This is estimated using the EEG signal, m(n) and 1st and 2nd orders are defined as m′(n) and m″(n) as follows

$$ \begin{aligned} & {\text{HP}}\;{\text{movement}} = \, \sigma_{x}^{2} \\ & {\text{HP}}\;{\text{mobility }} = \, \sigma_{{x^{\prime } }} / \, \sigma_{x} \\ & {\text{HP}}\;{\text{density}} = \left( {\sigma_{{x^{\prime \prime } }} /\sigma_{x\prime } } \right)/\left( {\sigma_{x\prime } /\sigma_{x} } \right) \\ \end{aligned} $$

4.2.4 Renyi entropy (RE)

RE is used to estimate the entropy of the Gaussian distribution G = (G₁, G₂, …, G_n) (Renyi 1960). RE is represented as

$$ {\text{RE}} = \frac{1}{1 - m}\log \left( {\mathop \sum \limits_{i = 1}^{n} p_{i}^{m} } \right) $$

(16)

where m denotes the array of Renyi entropy, when m is > 0, then m is not equal to 1.

4.2.5 Kraskov entropy (KE)

KE measures Shannon entropy used with maximum of n samples multiplied with m-dimensional random vectors x. KE is represented as follows:

$$ {\text{KE}} = \psi \left( k \right) + \psi \left( N \right) + \log (v_{m} ) + \frac{m}{N}\mathop \sum \limits_{i = 1}^{N} \log \left( {2r_{i} } \right) . $$

(17)

4.2.6 Organized spectral entropy (OSE)

OSE measured from power band normalization (Sabeti et al. 2009). It is estimated as follows

$$ {\text{NSE}} = \frac{1}{{\log_{2 } N_{f} }}\mathop \sum \limits_{{f = f_{1} }}^{{f_{2} }} S\left( f \right)\log_{2} \frac{1}{S\left( f \right)} $$

(18)

where f₁ and f₂ are the lower and upper frequency, S denotes power density, and N_f denotes total amount of frequencies between the range.

4.3 Minimum and maximum

The max() and min() functions return the maximum and minimum value in a vector.

4.3.1 Mode

The mode is another representative value that may be used to describe a group of numbers. It is the value that occurs most often in the group. The mode of a set of data is the number with the highest frequency.

4.3.2 Variance

The variance to be represented as

$$ s^{2} = \frac{1}{n - 1}\mathop \sum \limits_{i = 1}^{n} \left( {x - \overline{x} } \right)^{2} . $$

4.3.3 Kurtosis (K)

K is a measure unconditional form of the probability in the dataset, and it is defined as

$$ \left\{ {\frac{{n\left( {n + 1} \right)}}{{\left( {n - 1} \right)\left( {n - 2} \right)\left( {n - 3} \right)}}\sum \left( {\left( {x_{i } - \overline{x} |s} \right)} \right)^{4} } \right\} - \frac{{3\left( {n - 1} \right)^{2} }}{{\left( {n - 2} \right)\left( {n - 3} \right)}} . $$

(19)

The computation of the WCs correlated with EEG measures is shown in Fig. 3. Entropy detects the wavelet component 3 and wavelet components 4 successfully, simultaneously represent the motor movement activity and kurtosis detect wavelet component 11 and 12 to represent the electrical power. At end, kurtosis detects the wavelet component 1, 2, 9 and 10 to represent the visual inspection.

The performance of the WICA was compared with enhanced ICA methodology (Makarov and Castellanos 2006). The quantitative results of the (RMSE) root mean square error and correlation within the original artifact EEG signals with processed EEG signals are shown in Table 1.

Table 1 Different types of artifact value of RMSE and correlation

Full size table

4.3.4 Machine performance evaluation

The main four measures can be used to estimate the performance of the proposed work. Generally, classification parameters are defined as performances such as accuracy, sensitivity, specificity and confusion matrix as follows

$$ \begin{aligned} & {\text{Sensitivity}}\; \left( {\text{Sn}} \right) = \frac{\text{TP}}{{{\text{TP}} + {\text{FN}}}} \;\left( \% \right) \\ & {\text{Specificity }}\;\left( {\text{Sp}} \right) = \frac{\text{TN}}{{{\text{TN}} + {\text{FP}} }} \;\left( \% \right) \\ & {\text{Accuracy}} = \frac{{{\text{TP}} + {\text{TN}}}}{{{\text{TP}} + {\text{FN}} + {\text{TN}} + {\text{FP}}}} \;\left( \% \right) \\ \end{aligned} . $$

TP, TN, FP and FN represent true positives, true negatives, false positives and false negatives, respectively.

N-fold cross-validation is used for estimating machine parameters and to perform the machine performance. N denotes total amount of inputs. Each and every fold one input is used for testing; then other inputs are used for training and testing (validating) input. This type of procedure was iterated until n times. The average value of k-fold results is taken and visualized as a machine performance as shown in Fig. 4.

Classification performance for RMSE rate is depicted in Fig. 5.

Table 2 shows the FKSVM that reported the classification accuracy 86.1% as an output using an input dataset, and comparison of the performance of FKSVM is shown in Table 3

Table 2 Confusion matrix for classification

Full size table

Table 3 Classification accuracy

Full size table

5 Conclusions and future work

An identification and removal of artifacts of EEG signal implemented through a novel technique of fuzzy kernel SVM to classify the artifacts in WICA is proposed. The FKSVM continuously increases the identification of artifacts components and to give better classification accuracy mentioned as 86.1%. Moreover, different types of artifacts features are selected using the training data. Our proposed system automatically removes the artifacts of EEG signal from the raw dataset. To conclude, different kinds of features are taken for removal of artifacts of EEG signal and various system performance measures are considered for estimating the accuracy of the proposed work. Future work could find out the best feature, which is used to remove the artifacts while presenting the process of eye blink. The method dependency model is evaluated for the number of source components. It is reassuring that the performances of the methods, relative to each other, remain at a similar level for a wide range of numbers of components retained. This indicates that there are indeed true differences between the methods that do not strongly depend on whether a strict or mild cleaning policy is used.

Change history

12 August 2024
This article has been retracted. Please see the Retraction Notice for more detail: https://doi.org/10.1007/s00500-024-10017-5

Notes

The best possible (optimal) hyperplane was calculated by finding the quadratic equation by using the Kuhn–Tucker.

References

Belousov AI, Verzakov SA, von Frese J (2002) A flexible classification approach with optimal generalization performance: support vector machines. Chemometr Intell Lab Syst 64:15–25
Article Google Scholar
Blanco S, Garcia H, Quiroga RQ, Romanelli L, Rosso OA (1995) Stationarity of the EEG series. IEEE Eng Med Biol Soc 14:395–399
Article Google Scholar
Chang CC, Lin CJ (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2:27
Article Google Scholar
Chang CQ, Chan FH, Ting KH, Fung PC (2006) Automatic correction of artifact from single-trial event-related potentials by blind source separation using second order statistics only. Med Eng Phys 28(8):780–794
Article Google Scholar
de Beer NA, van de Velde M, Cluitmans PJ (1995) Clinical evaluation of a method for automatic detection and removal of artifacts in auditory evoked potential monitoring. J Clin Monit 11:381–391
Article Google Scholar
Delorme A, Makeig S (2004) EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J Neurosci Methods 134:9–21
Article Google Scholar
Delorme A, Sejnowski T, Makeig S (2007) Enhanced detection of artifacts in EEG data using higher-order statistics and independent component analysis. Neuroimage 34:1443–1449
Article Google Scholar
Devipriya A, Nagarajan N (2018) A novel method of segmentation and classification for meditation in health care systems. J Med Syst 42(10):193
Article Google Scholar
Gallois P, Vasseur C, Boudet S, Peyrodie L (2006) A global approach for automatic artifact removal for standard EEG record. In: Proceedings of IEEE engineering in Medicine and Biology Society (EMBC 2006), Orlando, FL, pp 5719–5722
Gotman J, Skuce DR, Thompson CJ, Gloor P, Ives JR, Ray WF (1973) Clinical applications of spectral analysis and extraction of features from electroencephalograms with slow waves in adult patients. Electroencephalogr Clin Neurophysiol 35:225–235
Article Google Scholar
Guerrero-Mosquera C, Navia-Vazquez A (2012) Automatic removal of ocular artefacts using adaptive filtering and independent component analysis for electroencephalogram data. IET Signal Proc 6:99–106
Article MathSciNet Google Scholar
Hjorth B (1975) Time domain descriptors and their relation to a particular model for generation of EEG activity. In: Dolce G, Künkel H (eds) CEAN: computerized EEG analysis. Gustav Fischer-Verlag, Stuttgart, pp 3–8
Google Scholar
Jung TP, Makeig S, Humphries C, Lee TW, McKeown MJ, Iragui V et al (2000) Removing electroencephalographic artifacts by blind source separation. Psychophysiology 37:163–178
Article Google Scholar
Lee T-W (1998) Independent component analysis—theory and applications. Kluwer Academic Publishers, New York
Book Google Scholar
Lin C, Wang S (2004) Training algorithms for fuzzy support vector machines with noisy data. Pattern Recogn Lett 25:1647–1656
Article Google Scholar
Lu Z, Li Y, Li Y, Ma Z (2006) Automatic removal of the eye blink artifact from EEG using an ICA-based template matching approach. Physiol Meas 27(4):425–436
Article Google Scholar
Mahajan R, Morshed BI (2015) Unsupervised eye blink artifact denoising of EEG data with modified multiscale sample entropy, kurtosis, and wavelet-ICA. IEEE J Biomed Health Inform 19:158–165
Article Google Scholar
Makarov VA, Castellanos NP (2006) Recovering EEG brain signals: artifact suppression with wavelet enhanced independent component analysis. J Neurosci Methods 158(2):300–312
Article Google Scholar
Mammone N, Morabito FC, Inuso G, La Foresta F (2007) Wavelet-ICA methodology for efficient artifact removal from electroencephalographic recordings. In: Proceedings of international joint conference neural networks (IJCNN 2007), Orlando, FL, pp 1524–1529
Mammone N, Foresta FL, Morabito FC (2012) Automatic artifact rejection from multichannel scalp EEG by wavelet ICA. IEEE Sens J 12:533–542
Article Google Scholar
Mamun M, Al-Kadi M, Marufuzzaman M (2013) Effectiveness of wavelet denoising on electroencephalogram signals. J Appl Res Technol 11:156–160
Article Google Scholar
Renyi A (1960) On measures of information and entropy. In: Proceedings of 4th Berkeley symposium on mathematical statistics and probability, pp 547–561
Sabeti M, Katebi S, Boostani R (2009) Entropy and complexity measures for EEG signal classification of schizophrenic and control participants. Artif Intell Med 47(3):263–274
Article Google Scholar
Shoker L, Sanei S, Chambers J (2005) Artifact removal from electroencephalograms using a hybrid BSS-SVM algorithm. IEEE Signal Process Lett 12:721–724
Article Google Scholar
Ten Daubechies I (1992) Lectures on wavelets. Society for Industrial and Applied Mathematics, Philadelphia
Google Scholar
Woestenburg JC, Verbaten MN, Slangen JL (1983) The removal of the eye-movement artifact from the EEG by regression analysis in the frequency domain. Biol Psychol 16:127–147
Article Google Scholar
Zhang JH, Janschek K, Bohme JF, Zeng YJ (2004) Multi-resolution dyadic wavelet denoising approach for extraction of visual evoked potentials in the brain. IEE Proc Vis Image Signal Process 151:180–186
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Biomedical Engineering, SNS College of Technology, Coimbatore, India
K. Yasoda
School of Computing, SRM Institute of Science and Technology, Kattankulathur, Kanchipuram, Tamilnadu, India
R. S. Ponmagal
Department of Computer Science and Engineering, Karpagam College of Engineering, Coimbatore, Tamilnadu, India
K. S. Bhuvaneshwari
School of CSE, Vellore Institute of Technology University Bhopal, Bhopal, India
K. Venkatachalam

Authors

K. Yasoda
View author publications
You can also search for this author in PubMed Google Scholar
R. S. Ponmagal
View author publications
You can also search for this author in PubMed Google Scholar
K. S. Bhuvaneshwari
View author publications
You can also search for this author in PubMed Google Scholar
K. Venkatachalam
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to K. Venkatachalam.

Ethics declarations

Conflict of interest

All author states that there is no conflict of interest.

Human and animals rights statement

Humans/animals are not involved in this work.

Additional information

Communicated by V. Loia.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article has been retracted. Please see the retraction notice for more detail: https://doi.org/10.1007/s00500-024-10017-5

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Yasoda, K., Ponmagal, R.S., Bhuvaneshwari, K.S. et al. RETRACTED ARTICLE: Automatic detection and classification of EEG artifacts using fuzzy kernel SVM and wavelet ICA (WICA). Soft Comput 24, 16011–16019 (2020). https://doi.org/10.1007/s00500-020-04920-w

Download citation

Published: 12 April 2020
Issue Date: November 2020
DOI: https://doi.org/10.1007/s00500-020-04920-w

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

RETRACTED ARTICLE: Automatic detection and classification of EEG artifacts using fuzzy kernel SVM and wavelet ICA (WICA)

Abstract

Similar content being viewed by others

Automated and accurate focal EEG signal detection method based on the cube pattern

Review of EEG Signal Classification

A Preprocessing Method of EEG Based on EMD-ICA in BCI

Explore related subjects

1 Introduction

2 Literature review

3 Methodology

3.1 Limitations of ICA in an artifact removal

3.2 Wavelet ICA technique

3.3 Fuzzy kernel support vector machine classification

3.4 Fussy kernel support vector machine (FKSVM)

4 Results and discussion

4.1 Artifacts removal

4.2 Automatic selection of artifact component correlated with wavelet

4.2.1 Mean (M)

4.2.2 Standard deviation (SD)

4.2.3 Hjorth parameters (HP)

4.2.4 Renyi entropy (RE)

4.2.5 Kraskov entropy (KE)

4.2.6 Organized spectral entropy (OSE)

4.3 Minimum and maximum

4.3.1 Mode

4.3.2 Variance

4.3.3 Kurtosis (K)

4.3.4 Machine performance evaluation

5 Conclusions and future work

Change history

12 August 2024

Notes

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Human and animals rights statement

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation