Automated classification of multi-class sleep stages classification using polysomnography signals: a nine- layer 1D-convolution neural network approach

Satapathy, Santosh Kumar; Loganathan, D

doi:10.1007/s11042-022-13195-2

Automated classification of multi-class sleep stages classification using polysomnography signals: a nine- layer 1D-convolution neural network approach

1199: Computational Intelligence Revolution in Multimedia Data Analytics and Business Management
Published: 27 May 2022

Volume 82, pages 8049–8091, (2023)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Multimedia Tools and Applications Aims and scope Submit manuscript

Automated classification of multi-class sleep stages classification using polysomnography signals: a nine- layer 1D-convolution neural network approach

Download PDF

Santosh Kumar Satapathy^1,2 &
D Loganathan¹

1090 Accesses
12 Citations
Explore all metrics

Abstract

Sleep disorder diseases have one of the major health issues across the world. To handle this issue the primary step taken by most of the sleep experts is the sleep staging classification. The whole visual inspection process is carried out manually by the sleep experts, which can be a highly time-consumed task and creates a lot of annotation errors due to more human interventions. In this study, we introduce an efficient and robust approach to improve the sleep staging accuracy. In this paper, we proposed an automated deep nine-layer one-dimensional convolution neural network for multi-class sleep staging classification (9 L-1D-CNN-SSC) using polysomnography (PSG) signals. The proposed 9 L-1D-CNN-SSC model comprises eleven layers with learnable parameters: nine convolution layers and two fully connected layers. The main objective of designing such a model is to achieve higher classification accuracy for multiclass sleep stages classifications with reduced learnable parameters. The proposed network architecture is tested on two different subgroups recordings of ISRUC-Sleep datasets namely ISRUC-Sleep subgroup1 (ISR-SG-I), and ISRUC-Sleep subgroup3 (ISR-SG-III). The proposed model is compiled with eight different individual experiments based on a single-channel electroencephalogram (EEG), electrooculogram (EOG), electromyogram (EMG), and combinations of EEG + EOG+ EMG signals. The proposed 9 L-1D-CNN-SSC model achieved the highest classification accuracy of 99.03%, 99.50%, and 99.03% for three to five sleep stages classification, respectively with single-channel of EEG signals, similarly, the model achieved 98.93% for two-state sleep stage classification with EMG signals using the ISR-SG-I dataset. The same model achieved the highest classification accuracy of 98.88%, 98.76%, and 98.67% for three-five sleep stages classification with a single-channel EMG signal, and 99.24% for two-state sleep classification with single-channel EOG using ISR-SG-III dataset. It has been observed that the obtained results from the proposed 9 L-1D-CNN-SSC model give the best classification accuracy performance on multiclass sleep stages classification incomparable to the existing literature works. The developed 9 L-1D-CNN-SSC deep learning architecture is ready for clinical usage with high PSG data.

Automated Sleep Staging Using Convolution Neural Network Based on Single-Channel EEG Signal

Application of convolutional neural network-based biosensor and electroencephalogram signal in sleep staging

Article 14 March 2021

An automatic method using MFCC features for sleep stage classification

Article Open access 10 February 2024

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Nowadays it has been observed that the neurocognitive system directly decides the mental and cognitive performance in a particular task. It’s very difficult to determine the subject’s sleep behavior very accurately either numerically or any standard evaluation procedures [8]. Currently, sleep related diseases are an open challenge in the medical domain concerning different diseases such as neurology disorder, rehabilitation, and psychology-related disorders. It’s also very difficult to assess with a scenario of changes sleep characteristics in a known predictable manner. These types of diseases are more challenging concerning analysis and getting proper diagnosis solutions [23]. Sleep is one of the important ingredients for good human health and also responsible for maintaining the fitness and functioning of the different core systems of our body. It also put an impact on our proper functioning of mental and cognitive systems [12].

For human life, a total of one-third of its duration is constituted of the sleep cycle. It has been observed from several studies that sleep deficiency causes so many consequences like inability to solve the problem, not able to make proper decisions, not controlling the emotions, and reflected several changes in people [11, 70]. Sometimes the improper quality of sleep influenced different types of sleep-related disorders such as sleep apnea, insomnia, depression, narcolepsy, hypersomnia, breathing-related disorders, and circadian rhythm disorders [77]. Sometimes it has been seen that sleep deprivation is considered a stress-related disorder or sleep pathology, which causes high risk in performing some common cognitive risks such as workplace incidents, road accidents happened [25]. According to a report of the National Highway Traffic Administration of USA, due to drowsiness around one lakh, car accidents happened, as consequence more than 1500 death cases resulted and injuries cases reported around 71,000 annually [22].In this scenario, proper analysis of sleep stages is very important for identifying sleep-related irregularities. So that it is very essential to analyze the sleep stages behavior and accurate scoring of sleep states is a very crucial segment of the sleep staging process [27].

The polysomnography test is the primary step for any type of sleep-related disorder. It is a combination of different physiological signal which is useful during analyzing the sleep patterns of an individual subject. Several polysomnographic recording is recorded for sleep scoring: the EEG signal are used for monitoring the brain-behavior, the EOG signal used for monitoring eye movements and the EMG signals are tracked the changes behavior muscle tone. The entire sleep staging process is generally conducted through visualizing the sleep patterns of the subject during sleep periods by well-trained sleep experts according to two available standard sleep guidelines: the Rechtschaffen and Kales (R&K) [36] and the American Academy of Sleep Medicine (AASM) standards [78].As per AASM rules, the whole sleep stages are divided into five sleep stages: wake stage (W), non-rapid eye movement (NREM) sleep stage1 (N1), NREM sleep stage2 (N2), NREM sleep stage3 (N3), and rapid eye movement (REM) sleep stage. The main revision that occurred with AASM is, the merge of two R&K-defined sleep stages S3 and S4 into one single sleep stage, called N3.

Each defined sleep stage in AASM behaves differently during sleep periods. In this study, we are using an automated sleep stage classification system for these five sleep stages. Generally, one subject can go through all these sleep stages during its sleep cycle periods. One person can cover 3 to 5 sleep cycles, each of the time duration around 90–110 minutes during a full sleep [41]. During the first phase of the sleep cycle, the period of NREM stages is more and later part of the sleep cycles, periods REM stage increases. Earlier the sleep staging is done through visual inspection for a period of 2 to 5 hrs. in 8 hrs. of sleep for one subject. Traditionally, the sleep experts have segmented the entire sleep recordings into the 30-s interval, called one epoch, and each epoch labeled with one of the sleep stages through visualizing of its frequency and amplitude ranges, the characteristics of EEG waveforms, blinking of eye movements (EOG) and muscle movements (EMG) [76].

This traditional way of monitoring sleep stages methods has so many disadvantages such as requires more sleep experts to monitor the sleep recordings, time-consuming, and erroneous [10]. Due to more human interpretations during recording and it may not report good classification accuracy in the diagnosis of sleep stage classification [72]. Based on the above-mentioned drawbacks, automated classification of sleep stages is introduced, which ultimately gives benefits for quick diagnosis and also reported with increases of high classification accuracy [52, 55].

Sleep staging analysis and its scoring is a complicated procedure because of changes in sleep characteristics related to different sleep stages and also the non-stationary nature of the signal information [57].

The rest of the paper is presented as follows: In Section 2, the authors present the research contributions with related to sleep staging. Section 3 describes a brief overview on CNN model and its parameters. Section 4 describes briefly on the proposed methodology, which includes descriptions of the dataset used, proposed 9 L-1D-CNN-SSC model, model training and testing of 9 L-1D-CNN-SSC model are discussed. Section 5 presents the brief descriptions of experimental results of the proposed model. Section 6 discusses about the results and compares them with those by the state-of-the-art methods.Finally in Section 7 the concludes the paper and present the future directions.

2 Related work

Most of the authors are proposed an automatic sleep stage classification system for identifying the sleep patterns and diagnosis of several types of sleep-related disorders [29, 33, 34, 44]. In general sleep, staging procedures are conducted mainly on two strategies, one with single-channel input recording, and the other is multi-channel input recordings [60, 62, 63]. In the first approach, only one channel is considered for extracting the informative features about the sleep characteristics of the subjects. Similarly in a multi-channel system of recordings, a number PSG signal is used that is more than one EEG channel, EOG channel, and EMG channel [65, 66]. There is a standard procedure obtained for an automated sleep stage classification by most of the authors to their sleep staging experiments through five basic stages: 1) Signal the acquisition, 2) Pre-processing, 3) feature extraction,4) feature reduction, and 5) classification [79, 82]. The 3)feature extraction step is used for extracting the different characteristics parameter from preprocessed signal stage 2).These feature values can be extracted in frequency,time,time-frequency and non-linear domains [2].It has been seen that some of the s, one additional step used by authors that is feature reduction or dimensionality reduction stage. It is very helpful in screening the relevant features for the classification model. From our survey, it has reported that some of the feature selection algorithms used by different authors in their ASSC study are: principal component analysis (PCA) [46], relief algorithm [50], linear discriminate analysis (LDA) [58], minimum redundancy maximal relevance (mRMR) [71], and sequential forward and backward selection method [73, 74].Similarly in case of classification model used in recent automated sleep stage classification includes: support vector machine (SVM) [38, 53, 80], k-nearest neighbor (KNN) [104], k-means clustering [84], decision tree (DT) [96], bootstrap aggregating [7],random forest (RF) [20],naïve bayes [3],Gaussian mixture model (GMM) [97],adaboost [6], sparse auto encoders (SAE) [61],and artificial neural networks (ANNs) [93].

In recent research developments, deep learning techniques are becoming more popular in machine learning research applications and also used so many applications such as human-brain computing, computer vision, natural language processing, and speech recognition. Recently it has been found that deep learning concepts such as CNN [87, 99], RNN [28, 37], and LSTM [35] applied to the sleep staging approach. Currently, research on sleep staging plays an important role in NCP and Human-Machine interaction (HMI). There are several studies related to automated sleep staging using various physiological datasets and multimedia data, such as EEG, EMG, EOG, ECG, and audio, etc. One of the most popular contributions of sleep stage classification is the study of sleep behavior through human brain-computer- interaction (BCI) [102, 105].Till now also traditional machine learning techniques used for sleep staging, and recently in this research deep learning methods used in several contributions of automated recognition of sleep stages. We now look upon some of the recent contributions presented by different authors related to sleep staging using machine learning and deep learning concepts.

2.1 Polysomnography (PSG) based sleep staging using machine learning approaches

Most of the research contributions until now depend on the machine learning techniques for the recognition of sleep stages in an automated system. Krakovska et al. [35] used 6 EEG channels, 2 EOG channels, and 1 EMG channel used for recordings of the sleep behaviour and obtained features like variance, average amplitude, and spectral power. For classification of sleep stages obtained using quadratic discriminate analysis and the accuracy result reported about 74%.

In [54] the author considered multiple signals such as EEG, EOG, and EMG for the automated sleep scoring through the extraction of features like skewness, kurtosis, variance, entropy and used a dendrogram-based SVM (DSVM) classifier for classifying the sleep stages and reported accuracy for the model as 88%.

Zhu et al. [106] obtained graph-oriented features from single-channel EEG and used SVM classification techniques and the accuracy result was reported for six-state classification as 87.5%.

Hassan et al. [32] applied the EEMD algorithm for signal enhancement from single-channel EEG signal and extracted statistical features are forwarded into boosting techniques and the reported accuracy for two-six sleep stages is reported as 98.15%, 94.23%, 92.66%, 83.49%, and 88.07% respectively.

Silveria et al. [85] presented a six-state sleep staging approach using a discrete wavelet concept and obtained a random forest classifier, the model achieved 90% accuracy.

Rahman et al. [75] introduced a single-channel EOG sleep scoring approach and extracted statistical features by applying discrete wavelet transform techniques. The average accuracy reported for six state classifications through RUSBoost, RF and SVM is 90%, 91%, and 91.7%.

Memar et al. [59] proposed two-state sleep staging and the acquired signal decomposed into eight sub-bands, finally 13 features are extracted from each sub-band epoch. The suitable features are identified through the mRMR feature selection algorithm. The model achieved an overall accuracy of 95.31% through a random forest classifier.

Imtiaz et al. [42] presented automated sleep staging through home-based polysomnography signal and the model reported accuracy for training and testing dataset are 89% and 72% respectively through decision tree classification algorithm.

Dimitriadis et al. [16] proposed one channel EEG sensor ASSC techniques and estimated cross-coupling frequency (CFC) from each epoch and the system achieved an overall accuracy of 94% through multi-class Naïve Bayes classification techniques.

Sen et al. [81] proposed a sleep staging system using a single-channel EEG signal and obtained 41 multiple feature parameters. The relevant features were selected through different feature selection algorithms such as minimal redundancy maximal relevance (mRMR), ReliefF feature selection algorithm, fast correlation-based feature selection algorithm (FCBF), and Fisher score algorithm. Finally, the selected features were fed into five different classification algorithms such as support vector machine(SVM), decision tree(DT), random forest(RF), feed-forward neural network (FFNN), and radial basis function neural network (RBF) and the RF classification model reported accuracy of 97.03% for six sleep states classification problems.

Dikhya et al. [17] presented automated sleep staging system by obtaining the statistical time-domain features, structural graph similarity feature based on single-channel of EEG signal under R&K sleep scoring rules and the proposed model performed best using SVM classifier, an average accuracy of 95.53%.

T. Zhang et al. [103] proposed a novel mechanism for feature selection using the filter method with pairwise constraints. The author has obtained two different categories, one section of data having affected with mild sleep problem and the other category section, healthy controlled subjects. The whole recordings were collected from the S-EDF dataset. Finally, the model reported an accuracy of 97.66% and 93.57% with consideration of category-1 and category-2 data respectively.

Basha, A. J et al. [9] obtained the fuzzy-kernel SVM for classifying the sleep stages and extracted the statistical features and the selected features were fed into the recurrent neural network and the model reported an overall accuracy of 90.2%.

Shen, H et al. [83] proposed an improved model based on features with a combination of locality energy and state-space model for automated classifying the sleep stages based on single-channel electroencephalogram signals under the R&K and AASM sleep scoring guidelines. The model reached an overall classification accuracy of 92.04% and 78.92% using the S-EDF and Dreams dataset respectively. Similarly, the same model reported accuracy of 79.90% and 81.65% using Dreams and ISRUC-Sleep dataset.

Wang, Q. et al. [98] proposed a high-accuracy and high-efficiency automated sleep staging system using single-channel EEG data and extracted 30 features of types time, frequency, time-frequency features, and non-linear parameters. The selected features were classified through the ensemble learning stacking model and reported an overall accuracy of 96.67% for the five-class sleep stages classification task. It has been also seen that the multi-modal signals analysis also takes an important role during the diagnosis of different types of sleep-related diseases. Therefore several sleep studies were conducted by the different researchers using multi-channel signals.

Yan, R. et al. [100] develop an automated sleep staging system based on the eight combinations of the four multi-modality channels of PSG signals and obtained a total of 232 features of statistical, time, frequency, time-frequency entropy, fractal and non-linear parameters were extracted. The model reported an accuracy of 86.24% using a random forest classifier with the ReliefF selected features.

Ghimatgar, H. et al. [24] introduced a multi-modality approach using a deep learn-ing model and Hidden Markov Model (HMM) to improve the sleep staging performance using multi-channel EEG data. The experimental data collected from sixteen neonates having an age range of 38–40 weeks. The relevant features were screened using the MGCACO algorithm. The model is trained using bi-directional long-short time memory and post-processing done using HMM model. The proposed model reported an accuracy of 78.9% and 82.4% using the K-Fold cross-validation and LOOCV techniques respectively.

Cooray, N. et al. [14] presented a sleep staging system followed by Rapid Eye Movement sleep behavior disorder (RBD) detection. The extracted 156 features from EEG, EOG and EMG channels and forwarded them into the RF classifier. The model reported an accuracy of 92%.

Diykh, M et al. [17] presented the new sleep staging system based on the statistical features and weighted brain networks using multiple-channel of EEG signals under both R&K and AASM sleep scoring guidelines. The proposed model has been per-formed on the two most popular public datasets namely ISRUC-Sleep and S-EDF dataset. The model reported an average accuracy of 96.74% with C3-A2 channel under the AASM scoring standards and 96% with Pz-Oz channel under the R&K standards.

H. Korkalainen et al. [51] presented the deep learning approach for automatic sleep staging system and analyzed the severity of obstructive sleep apnea (OSA). The overnight polysomnography recordings obtained from S-EDF public dataset, both healthy and sleep apnea subjects were considered for analysis of the sleep behavior. The model reported an overall accuracy of 83.7% with single-channel EEG and 83.9% with single-channel EOG. Similarly, for the clinical data, the model achieved an accuracy of 82.9 with EEG signal and 83.8% with combinations of EEG and EOG data.

2.2 Polysomnography (PSG) based sleep staging using deep learning approaches

Nowadays the researchers are majorly focused on deep learning techniques for sleep staging because of its robustness, scalability, and adaptability with related to handle large amounts of signal recordings and it’s processing. Another important advantage related to deep learning models is, no need to require any explicit features for discriminating the subject’s sleep behaviour [30]. It has witnessed that deep learning techniques working well in different applications like image segmentation, recognition, detection, and natural language processing. It has also been observed that deep neural models are widely used in different fields of the biomedical research area. In recent research developments, it has found that notable increases happened with the use of the deep neural network in the field of biomedical signals (EEG, ECG, EMG, and EOG) [18].

Recently deep learning concepts proposed in many challenging applications by the different researchers with the input of biomedical signal data which includes epileptic seizures [18, 39, 45, 89], neurological disorders using CNN models [21, 40, 67, 94] and heart diseases using ECH channel [18, 30, 39, 51].some of the recent contributions conducted by different researchers using deep learning models for classifying sleep stages are described below here.

In [45], the author used a deep convolution neural network for automated sleep staging with the input of single-channel EEG. The model achieved an overall accuracy of 74%.

Sors A et al. [89] presented automatic sleep stages scoring for five-sleep states based on one-channel of EEG using the CNN model and the results reported for the proposed model are 87%.

Chambon et al. [94] introduced a deep learning model with the concept of multivariate signal analysis such as EEG, EOG, and EMG using KNN. The proposed model reached an overall accuracy of 80% with combinations of EEG + EOG + EMG.

In [40] the authors obtained five-layer convolution layers for classifying the sleep stages based on two-channels of EEG and EOG signal and one-channel of EMG signal and achieved result for the model is 83%.

Tripathy et al. [67] introduced a novel approach of sleep scoring based on coupling features of EEG data and RR time-series information using deep neural networks. The model resulted in an average accuracy of 95.71%, 94.03%, and 85.51% for the classification in between NREM vs REM, deep sleep vs light sleep, and sleep vs wake respectively.

Zhihong Cui et al. [21] proposed a sleep scoring system with input of 30s multi-channel signal information based on CNN and fine-grained properties, the model reported an average accuracy of 92.2% with the ISRUC-Sleep public dataset.

Supra Tk A et al. [56] designed a system of sleep scoring through extracted time-invariant information using CNN and find sleep stages transition information from the bidirectional LSTM network. The reported classification accuracy performance reached to 86.2%.

Akyol, K. et al. [5] presented a stacking ensemble learning model for analysis of the single-channel EEG signal and identifying the epileptic seizure detection. The clinical dataset was collected from the Bonn University. Finally, the author has com-pared the performance of the proposed model with the deep neural network (DNN) model. The model achieved an average accuracy value of 97.17%.

Yildirim, O. et al. [101] developed a deep learning model using a one-dimensional convolutional neural network (1D-CNN) using combinations of EEG, EOG, and EMG signal for classifying the six-two sleep stages classification problems. The model reported an overall accuracy of 91.00%, 91.22%, 92.36%, 94.64%, and 98.06% using the S-EDF dataset and similarly, the same model achieved an accuracy of 89.54%, 90.98%, 92.33%, 94.34%, and 97.62% using SE-EDF dataset for classifica-tion of six-two sleep classes problems.

Zhu, T. et al. [107] proposed a sleep staging system using a neural network with the implementation of the CNN concept and obtained inter and intra epoch features from the input signal. The model reached overall accuracies of 93.7% with S-EDF and 82.8% with SE-EDF datasets respectively.

Fernandez-Blanco, E et al. [19] proposed an ensemble technique for an automatic sleep scoring system using multiple-channel of EEG signals, and the model was trained using the convolutional neural network. The entire experiment work was done through a widely accepted bench-mark dataset as SE-EDF dataset and the test was carried out with help of leave-one-out cross-validation technique. The model achieved an accuracy of 92.67%.

C. Sun et al. [90] presented multi-class sleep stages classification using PSG signals based on a hierarchical neural network model. The model functioned in two phases, in the first phase using through feature learning stage and the second phase is the sequence learning stage. This study performed on 147 patients’ sleep recordings, which were obtained from the MASS dataset. Finally, the model reported an overall accuracy of 87.8% and an F1score of 81.8 respectively.

Chenglu Sun et al. [91] developed an automated sleep staging system based on a two-stage neural network model. During the first stage, the model learning hand-crafted features, and in the second stage the model is learned. The author also intro-duced the data augmentation techniques to resolve the class imbalance problem. The whole work was executed through two public datasets such as S-EDF and Sleep apnea dataset. The proposed model reported the result of F1score and Kappa score as 80.6% and 80% with healthy subjects, and 79% and 74% with sleep-disordered subjects.

Antoine Guillot et al. [26] proposed an automated sleep staging system using a deep learning model called as SimpleSleepNet, where the author retrieved two differ-ent categories of data from the Dreams dataset, one completely healthy controlled and other section of data are collected from the subjects who were affected by the sleep apnea subjects. The required data prepared by the five different sleep experts in the different sleep centers. Finally, the SimpleSleepNet framework achieved an average F1Score 89.9% with healthy controlled subjects and 88.3% with apnea subjects.

Mehdi Abdollahpour et al. [1] proposed the automated sleep staging system using the combinations of EEG and EOG signals. The extracted features from both the signals were separated into two different sets, one set contained the EEG features and the other set contained the fused feature of EEG + EOG. Each feature set is transformed into a horizontal visibility graph (HVG). The images of the HVG are classified by the convolutional neural network with the concept of transfer learning. The model has been performed on the two most popular datasets such as S-EDF and SE-EDF datasets. The model achieved an overall accuracy of 93.58% using the S-EDF dataset.

According to the existing contribution to sleep scoring, major challenges found that choosing the correct features which helps to distinguish the sleep stages. It has found that the maximum researchers extracted the time, frequency, and time-frequency features, then after finalizing the relevant features either manually or applied some conventional feature selection algorithm. In some cases, this selection algorithm increases the complexity factor and consumes more time. Another challenge related to feature selection is that some features are well fitted for some of the subjects but the same may not apply for another one.

The next challenge with earlier contribution is that, improper distributions of sleep epochs for all the sleep stages. This imbalance of sleep information may produce biased results with conventional machine learning algorithms. From the literature survey, it has found that the maximum researchers extracted the time, frequency, and time-frequency features, then the relevant features are selected either in manually or using some conventional feature selection algorithm, which takes the computational time and also increases the complexity factor, Another limitation regarding selected features, some of the features well suitable for classification for some of the subject cases may be the same many not applicable for other categories of subjects. This may create a problem to achieve higher classification accuracy.

Another challenge with subjects to sleep staging is that sometimes in the recorded data, it may see that the sleep epochs are not distributed equally for all the sleep stages. This imbalance of information may produce biased results with traditional machine learning algorithms. In most of the previous studies, these may not be properly addressed, so that reported classification accuracy performance is not up to the mark level. With consideration of all these issues, the authors obtained deep learning techniques for automated analysis and classification of sleep stages using polysomnography signals. Though our input data is in the form one-dimensional size. In this study, we propose a 9 L-1D-CNN-SSC for automated sleep stages classification. To recognize the sleep behavior, here the authors have proposed an end-to-end structure, without use any type of handcrafted features for learning. The proposed model learning the features automatically from the obtained layers of the model. Apart from this, other advantages of this architecture and addressing the multi-class sleep stage classification problems without changing any of its layers and its parameters for two to five sleep classes.

2.3 Contribution

The main contributions of our proposed research works are explained below:

1. The authors propose a 9 L-1D-CNN-SSC architecture for classifying multiple sleep classes based on multi-modality signal fusions under the AASM sleep scoring rules using two different categories of subjects’ sleep recordings.

2. The proposed architecture of 9 L-1D-CNN-SSC consists of a convolutional layer, pooling layer, batch normalization layer, and fully connected layer. The performance of the proposed model is also compared with the existing pre-trained model. The obtained features from the last fully connected layers of the 9 L-1D-CNN-SSC model were fed into the softmax activation function.

3. The complete sleep staging process was analyzed with the three different combinations of signals and each one executed in the individual experiment. The first three experiments were executed with the input of single-channel EEG, EMG, and EOG signal, and the final experiment is performed with combinations of the three signals, EEG + EMG + EOG.

4. The proposed methodology uses fewer parameters to train the model and extracting the prominent features from the input signal data automatically, which supports achieving the high classification accuracy incomparable to the earlier contributions. Concerning the earlier contribution of sleep staging using the CNN model by different researchers, our proposed model is well competitive with the results of existing sleep studies, which use even complex structure CNN architecture.

3 Background of CNN model

In recent years, deep learning techniques attempt excellent performance to learn the highly complicated behavior from the input biomedical signals through a designed hierarchical architecture model. Among the different modes in deep learning methods, the CNN model is a more effective technique in biomedical signals and image analysis and classification problems compared to traditional machine learning techniques. The network styles of the CNN model are quite similar to the conventional ANN model structure, a CNN model framed with compositions o the input and output layers, and a set of hidden layers. The hidden layers of the CNN model comprise a set of convolution, pooling, and fully connected layers, which extracts the highly commendable features from the input data automatically, which are more feasible in concerns to each and individual neuron representing in each layer. Like ANN, CNN also depends on the previous layer’s weight and bias information to get the final result. The typical structure of the CNN model for a one-dimensional signal is shown in Fig. 1.

The entire working procedures in CNN are implemented through two basic stages 1.feedforward stage, 2.backpropagation stage.

In the feedforward stage, the given input data are fed into the designed model, each input data are multiplied with the layer’s parameters of each layer and finally, the achieved feature map values are forwarded into the network output. During the backpropagation stage, the values of the weights are adjusted in each successive layer in the network to control and reduce the error values in the obtained model by implementing the proper loss function. Finally, the error rate determines by comparing actual and desired outcomes.

The CNN model consists of several network layers such as convolution layers (CONV), rectified linear unit (ReLU), batch normalization layers (BN), pooling layers, and fully connected layer, each layers description is described in detail below:

3.1 Convolution layer

In the CNN model, the convolution (CONV) layer is the basic core section of a CNN model. Generally, in one CNN architecture, there are more than convolutions layers are there, in this layer the given inputs are processed through the set of learnable parameters and with the number of filters with different dimension sizes. From each filter as an output, we generate a set of feature maps values, which are computed from dot product small unit of given input data and its weight values. It helps to learn the features from the input signal by learning the weight parameters of the filters. The given size of input data for a convolution layer is S_hXS_wXS_d, the required output volume size of the layer is S_h^newXS_w^newXS_d^new.The required output computed using four hyper parameters 1. Number of filters (N_F), 2. Size of filter (F_s), 3.Padding size ( P_s), 4.Stride information (S_I).The general form of computation the output volume from convolution layers as follows:

$$ {S_h}^{new}=\frac{S_h-{F}_S+2\ast {P}_S}{S_I}+1 $$

(1)

$$ {S_w}^{new}=\frac{S_w-{F}_S+2\ast {P}_S}{S_I}+1 $$

(2)

$$ {S_d}^{new}={N}_F $$

(3)

3.2 Batch normalization layer

The importance uses of batch normalization (BN) layer in the CNN model are normalizing the data present inside the network [4].It also supports to increase the training speed and reducing the internal changes of covariance values. The main intention to include BN layer in the model is to make confirm that how best the activation function distributes in a stable manner throughout the training procedure.

BN_n = {B₁, B₂, B₃……………B_m} Presents a minibatch size of m.

The batch normalization of BN_n is computed using this set of mathematical equations.

$$ \mu {BN}_n=\frac{1}{m}\sum \limits_{i=1}^m{B}_i $$

(4)

$$ {\sigma}^2{BN}_n=\frac{1}{m}\sum \limits_{i=1}^m{\left({B}_i-\mu {BN}_n\right)}^2 $$

(5)

$$ \hat{B_i}=\frac{\left({B}_I-\mu {BN}_n\right)}{\sqrt{\sigma^2{BN}_n+\in }} $$

(6)

$$ {B_I}^{BN_n}=\tau \ast {\hat{B}}_i+\vartheta $$

(7)

$$ \hat{\kern0.5em {B}_i}\ represent\ normalized\ input $$

$$ {B_I}^{BN}\ Batch\ normalization\ output\ for\ a\ minibatch\ {B}_n $$

Majorly BN_n layer deployed in between convolution layers and ReLU layers, which permits the users to fix the higher learning rates. The most important advantage with batch normalization layer is, it mainly controls over fit issue and increase the training speed.

3.3 ReLU layer

It is a general trend in CNN model to deploy ReLU layer after each successive convolution layers, the main intention to use of this activation function is establishes the nonlinear impact in the network. In the proposed work, we have obtained two different convolution functions are used 1.ReLU and 2.Softmax.

In general, the ReLU activation function converts the negative map values into zeros and maintains the positive values. The mathematically form of ReLU function defined as

$$ \varnothing (x)=\left\{\begin{array}{c}x, if\ the\ value\ of\ x\ge 0\\ {}0, if\ the\ value\ of\ x<0\end{array}\right\} $$

(8)

The role of softmax activation function is decides the probable classification of the output classes. So that softmax function used in the final fully connected layer, for predicting which input signal is belong to wake, N1, N2, N3 and REM sleep stages. The mathematically the equation defined as:

$$ {P}_i=\frac{e^{S_j}}{\sum \limits_1^k{e}^{S_k}}\kern0.5em for\ j=1,2,3\dots \dots ..k $$

(9)

Where S is the input to the network model.

P_i is represents the output value.

The output values are lies in between 0 to 1.

3.4 Pooling layer

The main purposes of using pooling techniques in CNN model is reducing the number of trainable parameters which helps indirectly reduce in complexity in execution. Finally it helps to control the overfitting problem. After each layer of convolution, we applied the max-pooling techniques for down sampling the feature map volume size and during this layer, no parameters are to be trained. In general architecture of CNN model, the pooling layer placed in between two convolution layers, for the purpose of down sampling the feature map size. From many of studies, it has found that maxpooling techniques are more effective with concern to CNN model. So that in our proposed model obtained max-pooling techniques.

3.5 Fully connected layer

Generally in a CNN model, there is one or more fully connected layers (FC) are used after successive convolution, ReLU and max-pooling layers. The operational procedure of FC layer is same with accordance to conventional neural network, where each neuron is associated to all the preceding layer neurons. The most important concern with related to FC layer is, it holds number of learnable parameters which indirectly leading computational overhead during training. In the current research work, used only one FC layer in our proposed model.

4 Methodology

The main intention of the proposed research work is to develop an artificial intelligence-based deep learning classification model to automatically classifying the sleep stages, which alternatively help for diagnosing the major types of sleep diseases This paper proposed a novel nine-layer one-dimensional convolutional neural network-based automated classification model for multi-class sleep staging classification (9 L-1D-CNN-SSC) using polysomnography (PSG) signals. The main intention is to make use of the effective information of multi-modality combinations of the signals to improve the sleep staging classification performance. The concept of the proposed research work is shown in Figure 2 by block diagram. The proposed methodology involves signal processing concepts and proposed 9 L-1D-CNN-SSC architecture followed by finding the discriminatory features to distinguish these multi-modal signals for training and testing. Mainly this research work executed through these four parts (i) input module, which takes PSG signals,(ii) pre-processing is performed for eliminating the irrelevant noise compositions which are contaminated in the recorded signals and passes them into the 9 L-1D-CNN-SSC model (iii) final decision, the output of the 9 L-1D-CNN-SSC model are fed into the fully connected layer to take the final predictions,(iv) validated the results of the proposed 9 L-1D-CNN-SSC architecture for two to five sleep stages classification problems with the existing state-of-the-art works.

4.1 Sleep stages classes

According to the AASM sleep standards, the sleep classes can be divided into two to five sleep stage classes. The only changes with the AASM sleep standards are N3 and N4 stages of the R&K standards are merged into one stage called as N3 stage. The proposed sleep staging procedure is executed under the AASM sleep scoring rules. The brief description of sleep classes considered in this proposed research work is shown in Table 1.

Table 1 The sleep class description considered in this proposed research work under the AASM standard

Automated classification of multi-class sleep stages classification using polysomnography signals: a nine- layer 1D-convolution neural network approach

Abstract

Similar content being viewed by others

Automated Sleep Staging Using Convolution Neural Network Based on Single-Channel EEG Signal

Application of convolutional neural network-based biosensor and electroencephalogram signal in sleep staging

An automatic method using MFCC features for sleep stage classification

Explore related subjects

1 Introduction

2 Related work

2.1 Polysomnography (PSG) based sleep staging using machine learning approaches

2.2 Polysomnography (PSG) based sleep staging using deep learning approaches

2.3 Contribution

3 Background of CNN model

3.1 Convolution layer

3.2 Batch normalization layer

3.3 ReLU layer

3.4 Pooling layer

3.5 Fully connected layer

4 Methodology

4.1 Sleep stages classes

4.2 Experimental data

4.3 Preprocessing

4.4 Proposed 9 L-1D-CNN-SSC model architecture

4.5 Model training and testing

5 Experiments and results

5.1 Experimental setup

5.2 Performance evaluation metrics

5.3 Results with the input of ISRUC-sleep subgroup-I dataset

5.3.1 Experiment-1

Using EEG signal

5.3.2 Experiment-2

Using EOG signal

5.3.3 Experiment-3

Using EMG signal

5.3.4 Experiment-4

Using combinations of EEG + EMG + EOG signals

5.4 Results with the input of ISR-SG-III data

5.5 Summary of experimental results

6 Discussion

7 Conclusion

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethics approval

Conflict of interest

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation