Better electrobiological markers and a improved automated diagnostic classifier for schizophrenia—based on a new EEG effective information estimation framework

Jing, Tianyu; Wang, Jiao; Guo, Zhifen; Ma, Fengbin; Xu, Xindong; Fu, Longyue

doi:10.1007/s10489-024-05669-7

Better electrobiological markers and a improved automated diagnostic classifier for schizophrenia—based on a new EEG effective information estimation framework

Published: 10 July 2024

Volume 54, pages 9105–9135, (2024)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Applied Intelligence Aims and scope Submit manuscript

Better electrobiological markers and a improved automated diagnostic classifier for schizophrenia—based on a new EEG effective information estimation framework

Download PDF

Tianyu Jing¹,
Jiao Wang ORCID: orcid.org/0000-0003-3131-7740¹,
Zhifen Guo¹,
Fengbin Ma¹^na1,
Xindong Xu¹^na1 &
…
Longyue Fu¹^na1

109 Accesses
Explore all metrics

Abstract

Advances in AI techniques have fueled research on using EEG data for psychiatric disorder diagnosis. Despite EEG’s cost-effectiveness and high temporal resolution, low Signal-to-Noise Ratio (SNR) hampers critical marker extraction and model improvement, while denoising techniques will lead to a loss of effective information in EEG. The aim of this study is to employ AI methods for the processing of raw EEG data. The primary objectives of the processing are twofold: first, to acquire more reliable markers for schizophrenia, and second, to construct a superior automatic classification for schizophrenia. To remove the noises and retain task-related (classification tasks) effective information mostly, we introduce an Effective Information Estimation Framework (EIEF) based on three key principles: the task-centered approach, leveraging 1D-CNNs’ test metrics to gauge effective information proportion, and feedback. We address a theoretical foundation by integrating these principles into mathematical derivations to propose the mathematical model of EIEF. In experiments, we established a paradigm pool of 66 denoising paradigms, with EIEF successfully identifying the optimal paradigms (on two datasets) for restoring effective information. Utilizing the processed dataset, we trained a 3D-CNN for automatic schizophrenia diagnosis, achieving outstanding test accuracies of 99.94$\%$ on dataset 1 and 98.02$\%$ on dataset 2 in subject-dependent evaluations, and accuracies of 89.85$\%$ on dataset 1 and 98.02$\%$ on dataset 2 in subject-independent evaluations. Additionally, we extracted 38 features from each channel of both processed and raw datasets, revealing that 20.86$\%$ (dataset 1) of feature distribution differences between the patients and the healthy exhibited significant changes after implementing the optimal paradigm. We enhance model performance and extract more reliable electrobiological markers. These findings have promising implications for advancing the field of the clinical diagnosis and pathological analysis of Schizophrenia.

Empowering precision medicine: AI-driven schizophrenia diagnosis via EEG signals: A comprehensive review from 2002–2023

Article 05 December 2023

Schizophrenia Diagnosis by Weighting the Entropy Measures of the Selected EEG Channel

Article 13 November 2022

Evaluating Ratio Indices Based on Electroencephalogram Brainwaves in Schizophrenia Detection

Article 12 February 2024

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Schizophrenia (SZ) is a chronic and intricate neuropsychiatric disorder characterized by symptoms such as blunted affect, hallucinations, and delusions. These debilitating symptoms lead to significant cognitive and, ultimately, social deficits [1], often resulting in individuals spending a substantial portion of their lives in psychiatric care facilities. The precise etiology of schizophrenia remains unclear, but it is believed to be influenced by a complex interplay of genetic, environmental, and psychosocial factors [2].

Electroencephalogram(EEG) is a biological signal characterized by high temporal resolution and low acquisition cost, allowing for the detection of changes in brain states [3]. Consequently, EEG has been widely utilized in the field AI for identifying various human mental states, such as drunkenness [4, 5], depression [6], and various emotions [7], etc. Clearly, the diagnosis of schizophrenia using EEG, the primary focus of this paper, is an ongoing area of research pursued by many researchers [8,9,10].

In the above context, when developing an automatic system for the diagnosis of SZ, we have several primary objectives. Firstly, our main goal is to create a system that can assist clinicians in diagnosing SZ effectively. Secondly, we aim to identify and validate interpretable features that have the potential to classify SZ patients accurately. Lastly, we endeavor to establish electrobiological markers, which are essentially features, for SZ using AI techniques. This paper will address all three of these objectives, striving to provide solutions and insights into the diagnosis of SZ.

Regrettably, EEG data’s inherently low SNR [11] poses a challenge to achieving the aforementioned objectives. Firstly, the presence of noise in EEG data dilutes the relevant information, consequently affecting the performance of trained classifiers. Besides, due to the limitation of the acquisition equipment, the scale of EEG datasets is generally not large. Thus, the situation easily happens that the noise components and the redundant components have different distributions in the two sub-datasets belonging to the healthy and the patients. For example, Fig. 1 shows the boxplots of the bubble entropies [12] of all the disjoint segments with 4 seconds in the dataset 1 mentioned below (the SZ segments and the healthy(HC) segments have their boxplot, respectively). It becomes evident that, in the raw data, the average and median entropy values for SZ patients are higher than those for healthy individuals. However, after the noise removal, these metrics for SZ patients drop below those for healthy individuals. Such a problem definitely hinders the achievement of identifying potential qualified features.

Moreover, training a automatic classification system using such datasets may mislead the system, causing it to extract noise as hidden layer features, ultimately leading to a decrease in the generalization ability of the system.

Because the EEG redundant signal has yet to be defined clearly, removing them is impossible at current. As for the noise, although more and more denoising approaches for EEG signals have been proposed in recent years [11, 13,14,15], these aforementioned issues still persist without complete resolution. Even worse, sometimes signal preprocessing operations significantly influence the outcomes of some specific classification tasks when applied to task-related datasets, potentially causing the loss of essential task-related information [16, 17].

To overcome the limitations of the aforementioned researches, this paper introduces an EEG effective information estimation framework(EIEF). EIEF is designed to be directly aligned with the classification task at hand and is tailored to a specific EEG dataset. The core mechanism of the framework is using the testing metrics of a trained end-to-end DNN to feed the stock and proportion of the effective information back to denoising approach selection. Let us take a testing metric as the objective function. Finding the optimal denoising paradigm for a specific EEG dataset and the corresponding optimal estimation of the SZ-related effective information of this dataset will become an optimization problem. Ideally, the optimal paradigm can remove a lot of noise components with the effective information retained, and enhance the reliability of the SZ electrobiological marker discovery and the generalization capacity of SZ classification systems based on the optimal dataset.

In this paper, the framework utilizes a 1D-CNN as the core component of the DNN, as introduced in [18]. But when applying the framework’s result to construct an automatic SZ diagnosis system, we opt for the 3D-CNN proposed by [19] with a more substantial scale and increased depth to enhance system performance.

To sum up, this paper focuses on the establishment and validation of EIEF, the assessment of changes in EEG feature disparities between patients and healthy individuals before and after implementing the optimal denoising paradigm, and the construction of an automated SZ diagnosis system by EIEF. The paper is structured into three main parts: theory and methodology, experiments, and verifications. In the part of the theory and methodology, starting with discussing the properties of the EEG effective information of classification tasks, then relying on the estimation for that information, we propose EIEF, comprising the objective function, constraints and solution methods. As for the method, according to the conditions and solution of EIEF, after specifying the two EEG datasets used, the research will create a paradigm pool consisting of 66 denoising paradigms, which is followed by the description of the two used CNNs. In the part of the experiments, for each dataset processed by each paradigm, we input segments of the processed dataset into the 1D-CNN for training and testing, using subject-independent cross-validation. We aim to find the two optimal paradigm for the two datasets within the pool, with test accuracy as the optimization objective. Subsequently, using the datasets processed by the optimal paradigms, we develop an automatic SZ diagnostic system based on the 3D-CNN and compare it with S.O.T.A. In the part of the verifications. Making the 3D-CNN replace the 1D-CNN, we will implement another search to verify the stability of the optimal paradigm and the strength of putting the 1D-CNN into the framework. Furthermore, the change in the features disparities between the two groups of individuals before and after the implementation is evaluated to elucidate the significance of the denoising paradigm.

2 Related Work

2.1 EEG noise removal techniques

EEG has a high resolution and the signals are prone to unwanted noise pollution, resulting in various artifacts [20]. Eye movement, blinking, heart activity, and muscle activity in EEG signals are the main physiological artifact types. Besides, there are many extrinsic artifacts existing, like the line noise and volume conduct artifact [11]. TTraditional artifact removal techniques encompass regression methods, wavelet transformations, blind signal separation (BSS) methods, filtering methods, among others. In recent years, several new approaches, such as AI-based methods and hybrid methods, have emerged, demonstrating improved performance and reduced computational demands. Sweeney, Ward, and McLoone assumed that each channel was the accumulation of pure EEG data and a certain proportion of artifacts. The estimated artifacts were then subtracted from the EEG [21]. Gianluca Di Flumeri proposed the regression-based eye correction algorithm(REBLINCA) with a higher ability to retain the EEG signal in the no-eye movement part. Besides, the method does not require an EOG channel compared to other regression-based methods. Compared to ICA-based algorithms, this requires fewer channels and facilitates the calculation [14]. Bigdely-Shamlo et al. Introduced a robust referrals algorithm that attempted to estimate the actual average of EEG channels after removing bad channel contamination. Their efforts were to develop a standardized early-stage preprocessing pipeline (the PREP pipeline) that detects and removes certain experimentally generated artifacts, such as eye blinks or muscle activations [13]. Banghua Yang proposed a novel blind source separation method called CCA-EEMD to remove EOG artifacts automatically as well as reserve more valuable information from raw EEG. A distinctive aspect of this method is that the identified EOG component is not removed directly but used to extract neural EEG data, which would keep more effective information [15]. Sadiq et al. used multiscale principal component analysis to decompose EEG signals, and employed Kaiser rule to select principal components to remove the noises [22]. Morteza Zangeneh Soroush introduced a novel method to detect artifactual components estimated by second-order blind identification (SOBI). Artifacts are detected using a mixture of well-established conventional classifiers and were removed employing stationary wavelet transform (SWT) to reserve neural information. This method combines signal processing techniques and machine learning algorithms, yielding significant results across various scenarios [23]. However, all the above methods are general and aren’t task-centered. In other words, they are open-loop methods without feedback.

2.2 EEG-based SZ classification

From the raw EEG data to the final classification results, researchers’ processing can be broadly categorized into the following steps. Firstly, data extraction is conducted, where commonly encountered EEG signals for disease classification include resting-state signals and task-related signals. Subsequently, data denoising is performed, as mentioned in the preceding section. Then, signal analysis, such as feature extraction, nonlinear signal decomposition, spectral analysis, and so forth. Finally, based on the analyzed data, classifiers are trained. Therefore, in this subsection, we will emphasize how researchers conduct signal analysis and classifier design in those prominent achievements.

More than a decade ago, Sabeti et al. [24] extracted Shannon entropy, spectral entropy, approximate entropy, Lempel-Ziv complexity and Higuchi fractal dimension from an EEG dataset (recorded data in resting-state with eyes opened), and achieved a classification accuracy of 86$\%$ and 90$\%$ obtained by LDA and Adaboost respectively. Parvinnia et al. [25] also used resting-state EEG signals with eyes opened to conduct research. After extracting fractal dimension, band power and autoregressive (AR) model, they applied weighted distance nearest neighbor (WDNN) for classification. And the accuracy was 95.3$\%$. Murphy et al. [26] collected a task-related dataset from duration deviant MMN tasks, and found adolescents with psychotic symptoms were characterised by a reduction in MMN amplitude at frontal and temporal regions compared to the controls through statistical analysis. During that period, researchers were directly extracting a few features from raw data and then using them to perform simply statistical analysis or train classical machine learning classifiers.

As research progresses, researchers are increasingly incorporating new technologies into the feature extraction process. For example, researchers can simultaneously extract features of several dozen different types at once and then use feature selection techniques to to select a suitable subset, and use this subset to train a classifier. Jahmunah et al. [27] total mined 157 features from the dataset, and select 14 features using Student’s t-test. Based on these feature, they implemented classification practice with various ML classifiers, DT, LD, KNN, PNN, and SVM with various kernels. And the average performance value is 92.91$\%$. Prabhakar et al. [28] first extracted 9 nonlinear features and then optimized the selection of the features by Artificial Flora (AF) optimization, Glowworm Search (GS) optimization, Black Hole (BH) optimization, and Monkey Search (MS) optimization. They also trained several classifiers by the optimized features and found SVM-RBF can reach the best performance of 97.54$\%$ (for normal cases) and 92.17$\%$ (for schizophrenia cases).

In recent years, various signal decomposition techniques have been widely employed for state recognition based on EEG. Sadiq et al. achieved a sensitivity, specificity and classification accuracy of 93$\%$, 92.1$\%$ and 91.4$\%$, respectively, on a motor imagery dataset by utilizing a robust and simple automated multivariate empirical wavelet transform (MEWT) to obtain joint instantaneous amplitude and frequency components [29]. And achieved an average classification accuracy of 99.8$\%$ by employing a multivariate variational mode decomposition (MVMD) method to obtain joint modes in frequency scale across all channels [30]. In the realm of schizophrenia recognition, researchers have also begun innovating feature engineering from the perspective of signal decomposition. Krishnan et al. [31] used Multivariate Empirical Mode Decomposition (MEMD) to decompose the EEG data into Intrinsic Mode Functions (IMF) signal. Then five entropy measures were measured from the IMF signals. And the subset of features was selected by Recursive Feature Elimination. Based on Radial Basis Function (SVM-RBF), they achieved the highest accuracy and F1-score of 93$\%$ with 95 features and obtained an AUC of 0.9831. Baygin [32] conducted feature extraction from 19-channel EEG signals with healthy and schizophrenia classes, using Tunable Q-Factor Wavelet Transform (TQWT) and statistical moment methods, and selected feature subset by the ReliefF method. He chose KNN to be the classifier and achieved an accuracy of 99.12$\%$. Khare et al. [33] used the Fisher score method to select the most discriminant channel, then used flexible tunable Q wavelet transform (F-TQWT) to decompose the EEG signal. After the decomposition, similar to the aforementioned researches, they extracted five features and employed the Kruskal-Wallis test to select a subset of features. Subsequently, this subset was fed into an flexible least square support vector machine (F-LSSVM) classifier. In their paper, a more innovative approach involved utilizing the grey wolf optimization algorithm to incorporate feedback from SVM results into the selection of Q-wavelets. An accuracy of 91.39$\%$, sensitivity, specificity, precision, F-1 measure, false positive rate and error of 92.65$\%$, 93.22$\%$, 95.57$\%$, 0.9306, 6.78$\%$ and 8.61$\%$ was achieved.

In addition to intensive research in feature engineering, with the continuous breakthroughs in deep learning technology, researchers have also begun to utilize various types of DNNs for schizophrenia recognition. One category of research involves directly feeding continuous or segmented EEG signals into the network, for example: Oh et al. [18] introduced a 1D-CNN model designed to analyze signals, automatically extract salient features, and perform classification. This model achieved a classification accuracy of 98.07$\%$ for subject-dependent (SD) evaluation and 81.26$\%$ for subject-independent(SI) evaluation. Sharma et al. [34] proposed a schizophrenia hybrid neural network (SzHNN), which is a combination of convolutional neural networks (CNNs) and long short-term memory (LSTM). They divided the original data of two EEG datasets into non-overlapping segments and used these segments to train the SzHNN. The performance is an accuracy of 99.9$\%$ on dataset 1 and an accuracy of 99.5$\%$ on dataset 2. Another category of research involves the fusion of feature engineering with DNNs. Leveraging the scale of DNNs, such papers often yield large-sized feature sets in their feature engineering, such as the visualized image of EEG signals. Shen et al. [35] developed an image feature, functional brain network, using a multivariate autoregressive model and coherence connectivity algorithm. And they used 3D-CNN to classify the SZ patients. The proposed 3D-CNN method achieved the performance of a 98.47 ± 1.47$\%$ in accuracy, 99.26 ± 1.07$\%$ in sensitivity, and 97.23 ± 3.76$\%$ in specificity. Similarly, Khare et al. [36] captured the instantaneous information of EEG signals in the time-frequency domain using MH-TFD, converted the information to two-dimensional plots, and fed the plots to the developed CNN model. The developed CNN is SchizoNET model. And the proposed model achieved an accuracy of 97.4$\%$, 99.74$\%$, and 96.35$\%$ on the three datasets, respectively. Zülfikar et al. [37] integrated Empirical Mode Decomposition (EMD) with the VGG16 pre-trained CNN. HS (Hilbert Spectrum) images of the first four Intrinsic Mode Functions (IMF) components obtained by applying EMD to EEG signals were fed into several famous CNN. They obtained the classification performance of 98.2$\%$ for Dataset I and 96.02$\%$ for Dataset II, using VGG16 network.

The innovation in the aforementioned articles includes the introduction of new features and new methods of feature acquisition (such as signal decomposition), the introduction of new feature subset selection methods, the introduction of new classifiers, etc. However, none of these articles focus on improving the reliability of existing features, which is the focal point of our research.

3 Theory and method

The primary objective of this chapter is to leverage mathematics to introduce an effective information estimation framework that can get the optimum among a given series of denoising paradigms, based on any end-to-end classification model and a certain dataset. Then according to the conditions of EIEF, after specifying the EEG dataset used, the research will create a paradigm pool consisting of 60 denoising paradigms for the following grid search for the optimum. And, the two classifiers inside EIEF and outside EIEF will be detailed, plus the classifier inside serves as the foundation for assessing the metrics used in the paradigm search, and the classifier outside is responsible for constructing the automated SZ diagnosis system.

Here, the flow chart from EIEF’s work to the construction and the use of the diagnosis system is illustrated in Fig. 2. And Table 1 is the list of symbols used in the thory part.

3.1 Hypotheses on property of effective information

Let $\varvec{Z \in \mathbb {R}^{c \times t}}$ denote the $\varvec{c \times t} $-dimensional matrix variable, and $\varvec{X \in \Theta \subset \mathbb {R}^{c \times t}}$ denote the observed EEG signal sample, where $\varvec{c}$ is the number of recorded channels, $\varvec{t}$ is the sample size and $\varvec{\Theta }$ is the sample space. The purpose of this subsection is to describe a common property of EEG effective information for any classification task with mathematic. So first, in order to denote the effective information, we shall explain and denote EEG signal objective components.

Table 1 List of Symbols

Better electrobiological markers and a improved automated diagnostic classifier for schizophrenia—based on a new EEG effective information estimation framework

Abstract

Similar content being viewed by others

Empowering precision medicine: AI-driven schizophrenia diagnosis via EEG signals: A comprehensive review from 2002–2023

Schizophrenia Diagnosis by Weighting the Entropy Measures of the Selected EEG Channel

Evaluating Ratio Indices Based on Electroencephalogram Brainwaves in Schizophrenia Detection

Explore related subjects

1 Introduction

2 Related Work

2.1 EEG noise removal techniques

2.2 EEG-based SZ classification

3 Theory and method

3.1 Hypotheses on property of effective information

Definition 1

Definition 2

Definition 3

Definition 4

Hypothesis 1

Hypothesis 2

Corollary 1

3.2 EEG effective information estimation framework

Definition 5

3.3 Datasets

3.3.1 Dataset 1

3.3.2 Dataset 2

3.4 Denoising paradigms

3.5 Model inside EIEF–1D-CNN

3.6 Model for diagnosis system–3D-CNN

4 Experiment and verification

4.1 Details of experiments and hyper-parameters adjustment

4.2 Platforms and softwares

4.3 Implementation of EIEF

4.4 Construction of SZ diagnosis system

4.5 Verification for EIEF mechanism

4.6 Comparison between two classifiers

4.7 Confirmation of improved electrobiological markers

4.7.1 Features extraction and analysis method

4.7.2 Analysis result

5 Discussion of advantages and limitations

5.1 Advantages

5.2 Limitations and future solutions

6 Conclusion

Availability of data and materials

Code Availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval

Consent to participate

Consent for publication

Conflicts of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation