Biomedical Signal Processing and Artificial Intelligence in EOG Signals

López, Alberto; Ferrero, Francisco

doi:10.1007/978-3-031-23239-8_8

Alberto López⁴ &
Francisco Ferrero⁴

849 Accesses
1 Citations

Abstract

Electrooculography is a technique that detects and analyses eye movement based on electrical potentials recorded using electrodes placed around the eyes. The electrical signal recorded is named electrooculogram (EOG) and can be used as an alternative input for medical and human-computer interface systems. To implement an eye movement-based system, at least four main stages will be required: signal denoising, feature extraction, signal classification and decision-making. The first one after the EOG signal acquisition is signal denoising, which suppresses noise that could not be removed by the analogue filters. In this task, the slope of the signal edges, as well as the amplitudes of the signal to distinguish between different eye movements, must be preserved. After denoising, the second task is to extract the features of the EOG signal based mainly on the detection of saccades, fixations, and blinks. The next stage is the automatic identification of eye movements. This task, called signal classification, is essential for generating accurate commands, especially in real-time applications. This classification is carried out mainly using a combination of algorithms in artificial intelligence (AI). These types of algorithms are the most suitable for adaptive systems that require real-time decision-making supported by AI techniques. In some applications, EOG modelling, and compression are also applied as an additional signal processing stage.

Access provided by Autonomous University of Puebla. Download chapter PDF

Suppression of Artifacts from EEG Recordings Using Computational Intelligence

Mathematical Analysis of Electroencephalography Applied to Control Brain Machine Interfaces

EEG Based Automated Detection of Six Different Eye Movement Conditions for Implementation in Personal Assistive Application

Article 08 January 2022

Keywords

8.1 Introduction

8.1.1 EOG Fundamentals

The electrooculogram (EOG) is the electrical signal produced by the potential difference between the cornea (the positive pole) and the retina (the negative pole). This potential difference is measured by placing surface electrodes near the eyes, and the process of measuring the EOG is called electrooculography.

The eye movements can be either saccades, smooth pursuits, vergence or vestibule-ocular, which can further be reflex or voluntary. Saccades are eye voluntary movements used in clinical studies to analyse eye movements. Smooth pursuits are also voluntary but are slower tracking eye movements that keep a moving stimulus on the fovea. Vergence and vestibulo-ocular movements have involuntary origins [1], so they are usually removed in EOG applications.

EOG-based human−computer interfaces (HCIs) offer disabled people a new means of communication and control as eye movements can easily be interpreted in EOG signals. In recent years, different types of HCI systems have been developed such as controlling the computer cursor, virtual keyboards, electric wheelchairs, games, hospital alarm systems, television control systems, home automation applications and smartphones [2,3,4,5,6,7,8]. Diabetic retinopathy or refractive disorders such as hypermetropia and myopia can be diagnosed early based on the EOG results [9]. EOG also provides reliable information to identify sleep stages and detect anomalies [10, 11].

Figure 8.1 shows a block diagram of the main stages to develop an EOG system. For the development of widely used EOG-based applications in the real world, it is necessary to provide accurate hardware and efficient software to implement the tasks shown in Fig. 8.1, which will be introduced in this chapter.

A block diagram of the main stages which can make up a system. The involved steps are signal acquisition, digital processing, features processing, classification, and decision-making. — **Fig. 8.1**

8.1.2 EOG Signal Measurement

Saccadic eye movements are the most interesting as they are voluntary and easily identifiable on the EOG. The most basic movements are up, down, right, and left. To distinguish between these eye movement classes, two pairs of bipolar electrodes and a reference electrode are positioned around the eyes as shown in Fig. 8.1.

The amplitude of the EOG signals has a mean range of 50–3500 μV and the frequency is between zero and about 50 Hz. Another issue to consider is that muscle noise spreads along with the signal bandwidth almost constantly, which makes it very difficult to completely remove it. The amplitude of the signal obtained using two electrodes to record the differential potential of the eye is directly proportional to the angle of rotation of the eyes within the range ± 30°. Sensitivity is on the order of 15 μV/º [12].

Voluntary and involuntary blinks produce spikes in EOG signals that must be detected because they can be mistaken for saccades. Figure 8.2 shows EOG fundamentals by modelling the eye as a dipole and an electrooculogram where two typical saccades with different amplitudes depending on the gaze angle are represented.

An eye model and a graph. The model involves the electrode, retina, cornea, head orientation vector, gaze vector, and bioamplifier. The graph presents an electrooculogram where two typical saccades with different amplitudes depending on the gaze angle. — **Fig. 8.2**

Before any further analysis, a preprocessing step will be necessary to reduce mainly the base-line drift, the powerline interference and the electromyographic potential. For this task, analogue filters are usually used. Table 8.1 shows the most relevant commercial bio amplifiers used in EOG for experimental measurements.

Table 8.1 Most relevant commercial bio amplifiers used for EOG measurement along with their main characteristics

Full size table

On the other hand, several datasets are publicly available that offer signals which are already pre-processed and ready to be used. Some of the most widely used datasets in the literature are shown in Table 8.2, mostly related to sleep recordings.

Table 8.2 Summary of the most relevant EOG signal databases

Full size table

8.2 EOG Signal Denoising

After hardware acquisition and preprocessing, EOG signals are still contaminated by several noise sources that can mask eye movements and simulate eyeball events. Additional denoising must be done to remove unwanted spectral compo-nents to improve analogue filtering and remove other kinds of human biopotentials. Denoising must preserve the signal amplitudes and the slope of the EOG signals to detect blinks and saccades. In some cases, it is an additional feature while in others, it is considered an artefact that should be eliminated. In these latter cases, the blink artefact region is easily removed because blinking rarely occurs during saccades, instead, they usually occur immediately before and after the saccades. Another issue that needs to be considered in EOG noise removal is crosstalk or the interdependence between acquisition channels. Many changes in eye movements recorded by one channel generally appear in other EOG channels. The main strategy is to ignore those signals with a low amplitude.

Several methods are proposed in the literature to attenuate or eliminate the effects of artefacts on EOG signals. Digital filters are typically employed to reduce muscle artefacts and remove power line noise and linear trends. Adaptive filters, such as Kalman and Wiener filters are used to remove the effects of overlap frequencies over the EOG spectrum from electrocardiographic and electromyographic artefacts. In addition to linear filtering, the median filter is very robust in removing high-frequency noise, and preserving amplitude and the slope, without introducing any shift in the signal [11].

Regression methods can learn signal behaviour by modelling colour noise that distorts the EOG signal and subtracting it. Due to the relationship between nearby samples, these methods include the nearby noise samples for the prediction of the given sample. The noise distribution characteristics are not considered in the regression models.

The wavelet transform (WT) is a powerful mathematical tool for noise removal. The WT consists of chopping a signal into scaled and displaced versions of a wavelet that is called “mother wavelet”. From the point of view of signal processing, wavelets act as bandpass filters. The WT is a stable representation of transient phenomena, and therefore, conserves energy. In this way, the WT provides much more information about the signal than the Fourier transform because it allows to highlight its peculiarities by acting as a mathematical microscope.

Figure 8.3 shows the general scheme of the WT-based filtering procedure. The decomposition module is responsible for obtaining the wavelet coefficients of the EOG signal at the output of the conditioning stage. The thresholding stage consists of selecting an appropriate threshold for the wavelet coefficients so that those of lower values are eliminated because they correspond to noise and interference. Finally, the EOG signal is reconstructed through the coefficients not discarded in the previous stage. To do this, the reverse process of the wavelet transformation carried out in the first module is followed.

A process flow presents the general outline of W T-base filtering. The order of the flow is as follows. E O G, decomposition, thresholding, reconstruction, and E O G filtered. — **Fig. 8.3**

For a complete decomposition and reconstruction of the signal, it is necessary that the filters of the wavelet structures have a finite number of coefficients (finite impulse response filters) and that they are regular. In addition, it is also important to ensure that the filters have phase linearity, as this prevents the use of non-trivial orthogonal filters but allows the use of biorthogonal filters. It is very common to choose the Biorthogonal and Daubechies families for EOG signal processing [3, 24].

8.3 Compression

In some EOG applications, such as sleep studies, it is necessary to compress the signals because they can extend up to several gigabytes. Storage and transmission for remote health monitoring of this amount of data comes at a high cost. In these cases, compression techniques are needed for the transmission of the signal through the communication networks. An alternative is the Turning point compression algorithm reported in [25]. This algorithm reduces the effective sampling rate by half and saves the turning points that represent the peaks and valleys of the EOG signal. The main purpose of compressing data is to reduce the size while retaining the characteristic and useful features. Figure 8.4 shows the original, filtered, and compressed EOG signal using this technique.

A graph of voltage versus time. It depicts three curves for original, filtered, and compressed. The trend for original is more noisy. — **Fig. 8.4**

8.4 EOG Feature Processing

EOG feature processing consists of selecting how many and which signal features are the most relevant for the application. An important issue is that the chosen features must be independent of each other to prevent collecting redundant data. By evaluating the EOG signals, it is possible to conclude that fixations have a stable slope whereas saccades and blinks increase quickly. The same happens for smooth eye movements and other features such as average speed, range, variance, and signal energy. Fixations are the slowest eye movements and saccades are the fastest [26]. The processing of informative, discriminatory, and independent features is a key step in preparing an appropriate collection of values for a classifier.

8.4.1 Feature Extraction

Feature extraction is the process of extracting features from the processed signals to obtain significant discrimination on several independent features. The goal is to find a space that makes the extracted features more independent and where they can be discriminated against. Feature extraction techniques can fall in the time domain, frequency domain, time-frequency domain, and nonlinear domain [27].

8.4.1.1 Time-Domain Features

Time-domain features represent the morphological characteristics of a signal. They are simply interpretable and suitable for real-time applications. The most popular time-based parameters are compiled in Table 8.3.

Table 8.3 Summary of the main time-domain EOG features, where x refers to the input signal, n is the nth sample of the signal, and N is the total number of samples

Full size table

Statistical features are mean, variance, standard deviation, skewness, kurtosis, median and the 25th, and 75th percentile of the signal.
Hjorth features are activity, mobility, and complexity parameters to measure the variance of a time series, the proportion of standard deviation of the power spectrum and change in the signal frequency, respectively.
Zero crossing rate features refer to the number of times that a signal crosses the baseline. This parameter is very sensitive to additive noises.

Other time-domain EOG features for eye movements are amplitude, latency, deviation, velocity, slope, peak polarity, and duration.

8.4.1.2 Frequency Domain Features

The main features in the frequency domain are energy, power ratio, spectral frequency, duration ratio and power spectral density (PSD). The most popular methods to estimate the PSD are Autoregressive (AR), Moving average and Autoregressive moving average. In [28] AR was considered to extract features from the EOG signal. These methods are named the parametric method because the spectrum is estimated by the signal model. These approaches are suitable for signals with both low SNR and length. In the non-parametric methods, such as Periodogram and Welch, the PSD values are calculated directly from the signal samples in each signal window. In [29] a non-parametric statistical analysis is performed using Welch method. The features obtained by the Welch technique discriminate better due to the lower sensitivity of nonparametric methods to residual noise and motion artefacts compared to parametric and cumulant-based methods. The non-parametric methods based on the Fast Fourier transform are easy to implement. Another method used to extract the frequency domain features is the higher-order spectra, which represent the frequency content of a higher-order signal static.

8.4.1.3 Time-Frequency Features

EOG signals are non-stationary, and to transfer a signal from the time domain to the frequency domain, three main techniques are available:

Signal decomposition: The aim of signal decomposition is to decompose the signals into a series of basic functions, and the most common methods are Short-Time Fourier and WT [3]. The first one is simple and well-known, however, for EOG signals, the second one is the most widely used. Continuous wavelet transforms have more separable features and the coefficients are more redundant than the discrete wavelet transforms within the same period.
Energy distribution: Several methods are proposed for energy distribution: Choi–Williams distribution and Wigner-Ville distribution are the traditional non-linear time-frequency methods widely used to analyse non-stationary signals. Hilbert–Huang transform is a more recent method to obtain momentary regularity of nonlinear and non-stationary signals such as EOGs [4].
Modelling: The Gaussian mixture model (GMM) is used in some works to estimate the continuous probability density of the signal. The model parameters are estimated using the Expectation-Maximization algorithm such that the probability of observation is maximised. Figure 8.5 depicts the GMM model [8, 26, 30].

A diagram presents the G M M model. The input passes through Gaussian function and results in the output. — **Fig. 8.5**

8.4.1.4 Non-Linear Features

Non-linear methods employed for EOG signal feature extraction fall into two main groups:

Entropy and complexity-based methods. Complexity methods are used to estimate the nonlinear dynamic parameters of EOG, electroencephalographic (EEG) and electromyographic (EMG) signals. Among complexity methods, entropy-based algorithms are robust estimators for evaluating the regularity of signals. Shannon’s entropy method is the most famous one. However, in some cases, the data for the decision-making processes cannot be measured accurately and other methods have been proposed, such as Renyi’s, Sample, Tsallis, Permutation, Multi scale and Approximate entropy [31].
Fractal-based methods. They propose measuring the fractal dimension of the EOG irregular shape and determining the amount of self-similarity on the signal. The Correlation Dimension, Lyapunov exponent and Hurst exponent are examples of fractal-based methods. First, they map a signal into the phase space and then measure the self-similarity of its trajectory shape [32].

Both techniques are suitable for measuring the amount of roughness in the signal, in turn increasing the entropy of the signal with the irregularity. These techniques are only effective at detecting stage transitions, not for the signal bandwidth.

8.4.2 Feature Selection

After the feature extraction, feature selection techniques are applied to find a discriminative subset of features to reduce the number of features needed to feed and train subsequent classification models, for avoiding over-fitting and reducing the computational time [33].

Minimum redundancy maximum relevance is an algorithm for feature selection according to the criteria of minimum redundancy (least correlation between themselves) and maximum relevance (most correlation with the class). Redundancy can be computed by using Pearson’s R for continuous features or mutual information for discrete ones. Relevance can be calculated using F-distribution under the null hypothesis for continuous features or mutual information for discrete ones [34].

Another selection method is named the Clear based feature selection (CBFS). CBFS computes the distance between the objective sample and the centroid of each class. Then, the algorithm compares the class of the closest centroid with the class of the target sample. [35] reports an efficient classification of EOG signals using this algorithm.

8.4.3 Feature Normalization

Feature normalization can be applied to reduce the effects of the individual variability and is performed over values for each feature separately. This process can prevent extremely high or low values from influencing any conclusions. [36] reports the procedure for feature normalization in an automatic sleep staging method. Another example of normalization is shown in [37] in which the original EOG signal, managed by a dynamic threshold (includes a positive and a negative threshold), would be transformed into a series of rectangular pulses that have −1 or 1 in their amplitude.

8.5 Classification

The automatic identification of eye movements (classification) is essential to generate accurate commands, especially in real-time applications. Classification techniques based on static or dynamic thresholds are also not easily generalisable, so methods based on artificial intelligence (AI) are needed. Many conventional machine learning algorithms and recently deep learning due to the increased computational power are truly becoming virtual assistants for clinicians to classify EOG features and improve medical diagnosis. The most widely used classifiers are herewith briefly commented.

The main parameters associated with classification performance are accuracy, precision, sensitivity, specificity, recall, F1 and F2 score, true positive rate, false-positive rate and Genni’s or Mathew’s correlation coefficient. Confusion matrices are also commonly used to compare the performance of different classification methods and avoid misleading when data is unbalanced.

8.5.1 Machine Learning Techniques

Conventional machine learning techniques include a wide variety of algorithms. All of them have shown great performance in EOG features’ classification compared to threshold-based classification techniques proposed in some preliminary EOG-based systems. The main machine learning techniques used in EOG features’ classification are briefly described below.

K-Nearest Neighbor (K-NN) is an algorithm that finds the nearest observations to the one it is trying to predict and classifies the observation of interest according to the majority of the surrounding data. The only parameter to set is the number of neighbouring points to consider in the vicinity to classify the different classes that are already known in advance.

Support vector machines are other conventional hierarchical supervised classifiers. They involve the adoption of a nonlinear kernel function to transform the input data into an optimal hyperplane for separating the features.

Decision trees are non-parametric supervised learning techniques that require little preprocessing and have a good runtime performance to handle tasks in real-time. The goal is to create a model that predicts the value of an objective variable according to various input variables [38].

Random Forest (RF) is one of the best algorithms for classifying large data with accuracy. RF is an ensemble of predictor trees such that each tree depends on the values of a random vector. This random vector is tested independently and presents the same distribution for each tree. Each tree is grown through bootstrap training. Figure 8.6 shows the general structure of RF. The classification is made from the vote of each tree in the ensemble and by selecting the most popular class among them [26]. [39] reports an automatic scoring of sleep stages classification using EOG signals.

A flowchart of a random forest structure. The dataset leads to random vectors which divides into 3 datasets and passes through bootstrapping, and training, later combined for voting which in turn influences the prediction. — **Fig. 8.6**

Linear Discriminant Analysis (LDA) is a method of supervised classification in which two or more groups of variables are known a priori and new observations are classified into one of them according to their characteristics. The result is created on the nearest centre classifier applied to the LDA outputs. After training, the nearest centres calculate the distance between any point and each class. In [40], an LDA classifier was applied to EOG classification with good training and testing accuracy that could be used for disabled people.

Logistic Regression (LG) optimises a set of weights assigned to each input feature to provide the best classification performance using a training dataset. LG was used in the design of an omnidirectional robot controlled by eye movements because of its efficiency and the low computing resources needed [7].

GMM as a classifier learns the input features of each class and assigns a specific label to them. When a sample fits into the scheme of Fig. 8.5, the label that produces the highest probability is assigned. The GMM provides a framework to model unknown distributional shapes. The key issue is to estimate how many components to include in the model.

A Hidden Markov Model (HMM) is a statical model in which the parameters are unknown. The training is done using maximum likelihood. HMM assesses the transition and emission probabilities from the observation sequence to the state sequence. This classifier can tolerate time warping of the input data. [8, 41] report a wheelchair navigation system based on an HMM for people with restricted mobility. Figure 8.7 shows an example of the transition of states of HMM.

A diagram presents an example of the transition of states of Hidden Markov Model. — **Fig. 8.7**

Clustering is an unsupervised grouping classifier where the samples lack labels. The goal is to create groups with similar samples using criteria such as information, statistical measures, and distance metrics. Each eye movement has specific features; therefore, first grouping the signals into the two categories, centre gazes and non-centre gazes might be a useful step in some classification schemes. Figure 8.8 shows the application of this concept to the hierarchical clustering procedure for classifying eye movements.

A flowchart describes the hierarchical clustering procedure to classify eye movements. All eye movements divided into center gaze, and non center gaze which further divided into down and left, and up and right. — **Fig. 8.8**

Based on how the clusters are related to each other and the objects in the dataset, the first division of clustering algorithms can be established. In hard clustering, each object belongs to a single cluster, so the clusters would become a partition of the dataset. In soft (or fuzzy) clustering, the objects belong to the clusters according to a degree of trust or belonging (e.g., Fuzzy C-Means). Clustering can be classified by looking at how the object is related: flat clustering, hierarchical clustering, graph-based clustering, and density-base clustering. Examples of these clusters can be found in [26, 42].

Even EOG signals from the same eye movement can differ in amplitude and time and thus, produce errors in recognition. The Dynamic time wrapping algorithm can solve this problem by breaking the problem recursively into subproblems, storing the results and later using those results when needed. For large datasets, this algorithm employs a lot of time for training the model [8, 43].

Artificial neural network (ANN) has numerous applications for pattern classification in the medical field to easily interpret the EOG signals and diagnose the problem more accurately. An ANN comprises several highly interconnected processing elements called neurons, which are organized into layers. These layers have a geometry and functionality linked to the human brain. ANNs include three layers: input, hidden and output as depicted in Fig. 8.9 [44].

A diagram depicts the basic architecture of multilayer feed-forward A N N. It consists of three layers, namely input layer, hidden layer, and output layer. — **Fig. 8.9**

8.5.2 Deep Learning Techniques

Deep learning (DL) replicates the functioning of the human brain regarding sending information from one neuron to another and handling a great amount of data. DL produces more insight knowledge than machine learning techniques as it can learn multiple levels of representation from raw data using unsupervised learning and model more complex relationships. The nucleus of DL is the ANN with multiple nonlinear hidden layers. DL offers robust computing power and enormous datasets, as they generally use a greater number of recordings to develop and evaluate their methodologies than the traditional machine learning classification methods.

A convolutional neural network (CNN) is an ANN class composed of a convolution layer to filter the extraction of features, a pooling layer to reduce the size of the analysed data, a fully connected layer, and a loss function to calculate the errors between the current and the desired network output. Back propagation is applied to update weights for convolutional layers and pooling filters cascade. Figure 8.10 shows an example of a deep neural network for eye movement classification.

A schematic presents an example of a deep neural network for eye movement classification. It involves input signal, feature extraction, classification, and probabilistic distribution. — **Fig. 8.10**

CNN requires fewer parameters than the conventional neural network, therefore CNN can be applied for solving regression problems. For example, in [45] CNN was used to eliminate eye blinking artefacts, and in [46], the authors used CNN for drowsiness detection based on EOG signals.

The recurrent neural network (RNN) is basically an ANN developed under the premise that humans always consider the past when making decisions. RNN automatically stores past information through a loop within its architecture. Based on this fact, in [47], an RNN was considered for real-time eye blink suppression in EEG recordings.

The time distributed convolutional neural network (TDConvNet) is a DL model comprising two main stages: a one-dimensional CNN epoch encoder, to extract the time-invariant features from raw EOG signals, and another one-dimensional CNN stage, to infer labels from the sequence of epochs. TDConvNet was applied to classify the sleep stages of polysomnography signals [48].

Unsupervised pre-training algorithms initialise the parameters such that the optimisation process ends up with a higher speed of training. In [49, 50], two pre-training methods were presented for EOG signals: restricted Boltzmann machine (RBM) and deep belief networks (DBN). Figure 8.11a shows an example of the RBM system. The relation between the input and output layers allows the network to be trained much faster. RBM can be extended if the output layer of one RBM is the input layer for another RBM, as shown in Fig. 8.11b.

Two diagrams a and b. Diagram A depicts an example of the R B M system. Diagram B depicts an example of D B N system. It illustrates that R B M can be extended if the output layer of one R B M is the input layer for another R B M. — **Fig. 8.11**

Long short-term memory (LSTM) deep networks are developed to obtain long-term dependencies in the data. LSTM algorithm as a classifier uses three kinds of gates to configure the data entering a network: input, forget and output. The more important formations can be saved between data segments using two forward and backward LSTMs. This architecture of two LSTMs, named Bidirectional long short-term memory (Bi-LSTM), presents each forward and backward training sequence in two separate LSTM layers, connected to the equal output layer.

Since the large length of the data can cause a leakage gradient problem, Gated Recurrent Unit (GRU) networks can be used to learn the representation of the EOG signal. This recurrent neural layer not only allows the improvement of the memory capacity but also eases the training since they retain the information within the unit while a sequence flows in the gating unit [51].

8.6 Decision-Making

The major difficulty in classification and subsequent decision-making is the variability of the data. Hence the importance of having large datasets that allow the creation of generalisable models and the learning process. The classification methods must be adapted to each user based on the previous actions and the results derived from them. Eye movements and blinks (voluntary and involuntary) are considered commands in the EOG-based systems and are used as input in different medical diagnostic systems and operation interfaces, such as serious games, home automation or communication and mobility solutions.

Some of the EOG-based communication solutions also include text-to-speech modules. These multilingual speech synthesisers are one of the recent forms of AI. They convert the stream of digital text selected by eye movements and blinks into natural-sounding speech. EOG can also be found in the design of industry-oriented robotic arms. In these systems, decision-making based on the results of previous commands improves real-time usability to provide the user with a reasonable degree of control.

8.6.1 Intelligent Decision Support Systems

Intelligent decision support systems (IDSS) use AI tools to improve the decision-making related to complex problems that involve a large amount of data in real-time. ANNs, Fuzzy logic, Expert systems, Case-based reasoning (CBR), and Intelligent agents (IA) can be considered IDSS. This section is a brief introduction to some of these techniques related to bio-signals for medical applications.

Fuzzy logic is a very promising technology within the medical decision-making application. Its main challenge is obtaining the required fuzzy data, even more when one must produce such data from patients. Usually, Fuzzy logic is used for the classification of the EOG and EEG signals, but they need calibration parameters obtained previously during the user training. [52] is an example of a Fuzzy logic-based controller for wheelchair motion control using the EOG technique.

An Expert system tries to solve human problems by embedding human knowledge in the computer. A typical application of expert systems is to filter ocular artefacts hidden in EEG signals without affecting clinically important EEG information [53]. Another application is reported in [54] for multichannel sleep data analysis.

ANNs have the advantage of executing the trained network quickly, which is a key issue for signal processing applications. However, the ANN algorithm is iterative and suffers from convergence problems. ANN has many practical applications, for example, ANN can be used for the diagnosis of a subnormal eye through the analysis of EOG signals [44].

To solve a new problem, CBR compares that problem with earlier solved problems and adjusts their well-known solutions instead of starting from scratch [55]. A CBR problem requires recovering relevant cases from the memory of cases, choosing the best cases, developing a solution, assessing the solution, and storing in the memory the newly solved case. A CBR system was used to classify ocular artefacts in EEG signals [56].

Alexa and Siri are examples of IA. They gather data from the internet without the help of the user. In the field of biomedical application, IA is used to diagnose, treat, and manage problems associated with dementia and Alzheimer’s [57]. In these cases, the agents may be any methodologies with decision-making abilities such as patient analysts, signal processing, neural network models and Bayesian systems. The information of each agent can be shared with other agents.

In the last ten years, Multi-Agent System (MAS) has gained interest due to the advances in AI, wireless sensor networks and sensors. In MAS, a larger problem can be divided into smaller subproblems. A task can be delegated among different agents, and each agent produces the output according to its task. Then, all outputs are joined and converted into the final answer to the complete problem. The interaction between agents increases the speed of problem resolution. MAS is a research topic in complex medical applications [58].

8.6.2 Learning Approaches

Learning approaches try to overcome the problems related to training data of the classification models in many EOG studies. Another important challenge is to improve the automatic classification.

Two of these learning approaches are Transfer learning (TL) and Deep transfer learning. These are powerful methods that reuse previously trained models as the starting point. This approach avoids the needed large training dataset and saves time, while, at the same time, does not reduce the accuracy of the assessment. Figure 8.12 outlines the TL from the source to the target domains [59]. The base model is trained using the data from the source domain and then fitted to the data from the target domain to complete the transfer of knowledge and make EOG-based systems more reliable and accurate. Through transfer learning, the classification performance improves significantly in all learning cases for temporal models trained only on the target domains.

A diagram outlines the T L from the source to the target domains. It includes Source dataset, M L model for training, model of knowledge, M L model for finetune, and target dataset. — **Fig. 8.12**

Due to the limited ability of the EOG signals to adapt to the characteristics of each user, a Reinforcement Learning (RL) algorithm is included, which allows adapting the interface to the user. The RL algorithm allows the adaptation of the user’s commands to the responses in the interfaces controlled by EOG. This algorithm is usually implemented in computer serious games as a moderator of the intensity of user commands given experience [60].

A model-free Q-learning method was proposed in [61] for the planning of robot motion through the user EOG signals, including obstacles surrounding the robotic platform. Figure 8.13 depicts the navigation approach in a simplified way.

A flowchart depicts the navigation approach in a simplified way. It involves ocular event class, environment, robotic platform model, and reinforcement learning applied to robot motion planning. — **Fig. 8.13**

Learning vector quantization (LVQ) is a supervised classification algorithm frequently used to recognise eye movements in EOG-based systems. LVQ is an artificial neural network that lets us choose how many training instances to latch onto and learns exactly what those instances should look like. EOG features can be considered as training data to build a network for recognition. Despite not being particularly powerful compared to other methods, it is simple and intuitive for the recognition of eye movements [62].

8.7 Discussion

Considering the articles published in the last decade, we can say that for EOG signal denoising, wavelet transform is the most useful technique for data preprocessing because this mathematical tool is better focused on transient and high-frequency phenomena. For EOG feature selection, the CBFS allows reducing redundant features and increases the precision and accuracy of the neural network-based classifier. EOG compression improves the signal transmission with fewer data from the original signal. As a result, the size of memory is reduced, which is an important feature for large polysomnogram signals.

EOG signal classification can be done automatically using any conventional classification algorithm. K-NN is the typical classification algorithm based on supervised ML that offers better performance and simplicity. K-NN employs the complete dataset to train “every point”, which is why the required memory is higher than other classifiers. Therefore, K-NN is recommended for small datasets with fewer features.

CNN is a very efficient classification method in EOG signal processing, especially for EOG-based HCIs. CNN yields models of significantly higher correlation coefficients than the traditional K-NN classification algorithm for large datasets. The RL layer is used to help the user by selecting proper actions, and at the same time, learning from previous behaviours. For example, to prevent collisions in a robotic platform or improve wheelchair navigation. Deep transfer learning can be used for a relatively small amount of data for sleep stage classification and models.

The development of sophisticated AI-based models together with the availability of larger datasets will allow better interpretation of EOG. This will result in the design of more efficient systems that also present an improvement in the decision-making stage.

8.8 Conclusions

This chapter introduced and discussed signal processing in electrooculographic signals, which is a challenging problem due to the wide variability in the morphology and features of electrooculograms within the population. The key aspect is to find the technique that presents the best overall performance in each of the basic signal processing stages that are divided: denoising, feature extraction, classification, and decision-making. Some applications require the processing of large electrooculograms lasting several hours to monitor the health status of patients. Such scenarios also bring the need for powerful artificial intelligence-based techniques for classification and modelling, as well as compression of the signal for efficient decision and storage.

References

E. Kowler, Eye movements: The past 25 years. Vis. Res. 51(13), 1457–1483 (2011). https://doi.org/10.1016/j.visres.2010.12.014
Article Google Scholar
A. López, F.J. Ferrero, D. Yangüela, C. Álvarez, O. Postolache, Development of a computer writing system based on EOG. Sensors 17, 1505 (2017). https://doi.org/10.3390/s17071505
Article Google Scholar
A. López, M. Fernández, H. Rodríguez, F.J. Ferrero, O. Postolache, Development of an EOG-based system to control a computer serious game. Measurement 127, 481–488 (2018). https://doi.org/10.1016/j.measurement.2018.06.017
Article Google Scholar
G. Teng, Y. He, H. Zhao, D. Liu, J. Xiao, S. Rankumar, Design and development of human computer interface using electrooculogram with deep learning. Artif. Intell. Med. 102, 101765 (2021). https://doi.org/10.1016/j.artmed.2019.101765
Article Google Scholar
Q. Huang, S. He, Q. Wang, Z. Gu, N. Peng, K. Li, Y. Zhang, M. Shao, Y. Li, An EOG-based human–machine interface for wheelchair control. IEEE Trans. Biomed. Eng. 65(9), 2023–2032 (2017). https://doi.org/10.1109/tbme.2017.2732479
Article Google Scholar
R. Zhang, S. He, X. Yang, X. Wang, K. Li, Q. Huang, Z. Yu, X. Zhang, D. Tang, Y. Li, An EOG-based human-machine interface to control a smart home environment for patients with severe spinal cord. IEEE Trans. Biomed. Eng. 66(1), 89–100 (2018). https://doi.org/10.1109/tbme.2018.2834555
Article Google Scholar
F.D. Pérez-Reynoso, L. Rodríguez-Guerrero, J.C. Salgado-Ramírez, R. Ortega-Palacios, Human–machine interface: multiclass classification by machine learning on 1D EOG Signals for the Control of an Omnidirectional Robot. Sensors 21(17), 5882 (2021). https://doi.org/10.3390/s21175882
Article Google Scholar
F. Fang, T. Shinozaki, Electrooculography-based continuous eye-writing recognition system for efficient assistive communication systems. PLOS One 13(2), e0192684 (2018). https://doi.org/10.1371/journal.pone.0192684
Article Google Scholar
D. Kumar, K. Priyadharsini, Analysis of CNN model based classification of diabetic retinopathy diagnosis, in Proceedings of International Conference on Secure Cyber Computing and Communication (ICSCCC), (2021)
Google Scholar
A. López, F.J. Ferrero, O. Postolache, An affordable method for evaluation of ataxic disorders based on electrooculographic signals. Sensors 19, 3756 (2019). https://doi.org/10.3390/s19173756
Article Google Scholar
R.A. Becerra-García, R.V. García-Bermúdez, G. Joya-Caparrós, A. Fernández-Higuera, C. Velázquez-Rodríguez, M. Velázquez-Mariño, F.R. Cuevas-Beltrán, F. García-Lagos, R. Rodríguez-Labrada, Data mining process for identification of non-spontaneous saccadic movements in clinical electrooculography. Neurocomputing 250, 28–36 (2017). https://doi.org/10.1016/j.neucom.2016.10.077
Article Google Scholar
R.J. Leigh, D.S. Zee, The neurology of eye movements. Encyclopedia of biomedical engineering, 5th edn. (Oxford, New York USA, 2015)
Book Google Scholar
PowerLab 26 Series. ADInstruments. [Online]. Available: https://www.adinstruments.com/products/powerlab/35-and-26-series
BlueGain Cambridge Research Systems. BlueGain EOG Biosignal Amplifier. [Online]. Available: http://www.crsltd.com/tools-for-vision-science/eye-tracking/bluegain-eog-biosignal-amplifier/
ActiveTwo AD-box. Biosemi. [Online]. Available: https://www.biosemi.com/ad-box_activetwo.htm
g.USBAMP Research. G.Tec. [Online]. Available: https://www.gtec.at/product/gusbamp-research/
C. Velázquez-Rodríguez, R.V. García-Bermudez, F. Rojas-Ruiz, R. Becerra-García, Automatic glissade determination through a mathematical model in electrooculographic records, in Proceedings of Lecture Notes in Computer Science, (2017)
Google Scholar
A.L. Goldberger, L.A. Amaral, L. Glass, J.M. Hausdorff, P.C. Ivanov, R.G. Mark, J.E. Mietus, G.B. Moody, C.K. Peng, H.E. Stanley, Physiobank. Physiotool. Physionet Circulat. 101(23), e215–e220 (2000). https://doi.org/10.1161/01.cir.101.23.e215
Article Google Scholar
G.Q. Zhang, L. Cui, R. Mueller, S. Tao, M. Kim, M. Rueschman, S. Mariani, D. Mobley, S. Redline, The national sleep research resource: towards a sleep data communications. J. Am. Med. Inform. Assoc. 25(10), 1351–1358 (2008). https://doi.org/10.1093/jamia/ocy064
Article Google Scholar
C. O’Reilly, N. Gosselin, J. Carrier, T. Nielsen, Montreal archive of sleep studies: An open-access resource for instrument benchmarking & exploratory research. J. Sleep Res. 1(24), 628–635 (2014). https://doi.org/10.1111/jsr.12169
Article Google Scholar
A. Sterr, J.K. Ebajemito, K.B. Mikkelsen, M.A. Bonmati-Carrion, N. Santhi, C. Della Monica, L. Grainger, G. Atzori, V. Revell, S. Debener, D.J. Dijk, M. De Vos, Sleep EEG derived from behind-the-ear electrodes (ceegrid) compared to standard polysomnography: A proof of concept study. Front. Hum. Neurosci. 12(452), 1–9 (2018). https://doi.org/10.3389/fnhum.2018.00452
Article Google Scholar
M. Rezaei, H. Mohammadi, H. Khazaie, EEG/EOG/EMG data from a cross sectional study on psychophysiological insomnia and normal sleep subjects. Mendeley Data V4 (2017). https://doi.org/10.17632/3hx58k232n.4
N. Barbara, T.A. Camilleri, K.P. Camilleri, A comparison of EOG baseline drift mitigation techniques. Biomed. Sign. Proces. Cont. 57(540), 101738 (2020). https://doi.org/10.1016/j.bspc.2019.101738
Article Google Scholar
A. Banerjee, A. Konar, D.A. Timbarewala, R. Janarthanan, Detecting eye movement direction from stimulated electro-oculogram by Intelligent Algorithms, in Proceedings of International Conference on Computing Communication & Networking Technologies (ICCCNT), (2012)
Google Scholar
A. López, F.J. Ferrero, J.R. Villar, EOG signal compression using turning point algorithm, in Proceedings of IEEE International Instrumentation and Measurement Technology Conference (I2MTC), (2021)
Google Scholar
R. Boostani, F. Karimzadeh, M. Nami, A comparative review on sleep stage classification methods in patients and healthy individuals. Comput. Methods Prog. Biomed. 140, 77–91 (2017). https://doi.org/10.1016/j.cmpb.2016.12.004
Article Google Scholar
K. Mehta, A Review on different methods of EOG signal analysis. Internat. J. Innovat. Res. Sci. Eng. Technol. 5(2), 1862–1865 (2016). https://doi.org/10.15680/IJIRSET.2016.0502128
Article MathSciNet Google Scholar
A. Banerjee, S. Dattab, M. Palb, A. Konarb, Tibarewalaa, DN Janarthananc R Classifying electrooculogram to detect directional eye movements, in Proceedings of International Conference on Computational Intelligence: Modeling Techniques and Applications (CIMTA), (2013)
Google Scholar
S. D’Souza, N. Sriraam, Statistical based analysis of electrooculogram (EOG) signals: A Pilot Study. Internat. J. Biomed. Clin. Eng. 2(1), 12–25 (2013). https://doi.org/10.4018/ijbce.2013010102
Article Google Scholar
A. López, F.J. Ferrero, S.M. Qaisar, O. Postolache, Gaussian mixture model of saccadic eye movements, in Proceedings of Medical Measurements and Applications (MeMeA), (2022)
Google Scholar
H. Wang, C. Wu, T. Li, Y. He, P. Chen, A. Bezerianos, Driving fatigue classification based on fusion entropy analysis combining EOG and EEG. IEEE Access 7, 61975–61986 (2019). https://doi.org/10.1109/ACCESS.2019.2915533
Article Google Scholar
G.R.M. Babu, S. Gopinath, E. Arunkumar, An intelligent EOG system using fractal features and neural networks. Test Eng. Manag. 83, 9920–9925 (2020)
Google Scholar
S. Mala, K. Latha, Feature selection in categorizing activities by eye movements using electrooculograph signals, in Proceedings of Science Engineering and Management Research (ICSEMR), (2014)
Google Scholar
C. Ding, H. Peng, Minimum redundancy feature selection from microarray gene expression data. J. Bioinforma. Comput. Biol. 3(2), 185–205 (2005). https://doi.org/10.1142/S0219720005001004
Article Google Scholar
S. Mala, K. Latha, Efficient Classification of EOG using CBFS feature selection algorithm, in Proceedings of International Conference on Emerging Research in Computing Information, Communication and Applications (ERCICA), (2013)
Google Scholar
C.-E. Kuo, G.-T. Chen, Automatic Sleep staging based on a hybrid stacked LSTM Neural Network Verification Using Large-Scale Dataset. IEEE Access 8, 111837–111849 (2020). https://doi.org/10.1109/ACCESS.2020.3002548
Article Google Scholar
Z. Lv, X.-P. Wu, M. Li, D.-X. Zhang, Development of a human computer interface system using EOG. Health 1(1), 39–46 (2009). https://doi.org/10.4236/health.2009.11008
Article Google Scholar
P. Babita Syal, P. Kumari, Comparative analysis of KNN, SVM, DT for EOG based human computer interface, in Proceedings of International Conference on Current Trends in Computer, Electrical, Electronics and Communication (CTCEEC), (2017)
Google Scholar
M.M. Rahman, M.I.H. Bhuiyan, A.R. Hassanb, Sleep stage classification using single-channel EOG. Comput. Biol. Med. 102, 211–220 (2018). https://doi.org/10.17605/OSF.IO/SCGJX
Article Google Scholar
F. Anis, M. Mustafa, N. Sulaiman, M. Rashid, B. Sama, M. Islam, M. Hasan, N. Ali, The classification of Electrooculogram (EOG) through the application of Linear discriminant analysis (LDA) of selected time-domain signals, in Proceedings of Innovative Manufacturing, Mechatronics & Materials Forum (IM3F), (2020)
Google Scholar
F. Aziz, H. Arof, N. Mokhtar, M. Mubin, HMM based automated wheelchair navigation using EOG traces in EEG. J. Neural Eng. 11(5), 1–11 (2014). https://doi.org/10.1088/1741-2560/11/5/056018
Article Google Scholar
N. Flad, T. Fomina, H. Buelthoff, L. Chuang, Unsupervised clustering of EOG as a viable substitute for optical eye tracking, in Proceedings of Eye Tracking and Visualization (ETVIS), (2015)
Google Scholar
Z. Lv, X.P. Wu, M. Li, D. Zhang, A novel eye movement detection algorithm for EOG driven human computer interface. Pattern Recogn. Lett. 31(9), 1041–1047 (2010). https://doi.org/10.1016/j.patrec.2009.12.017
Article Google Scholar
L. Jia, N. Alias, Comparison of ANN and SVM for classification of eye movements in EOG signals. J. Phys. 971, 1–11 (2018). https://doi.org/10.1088/1742-6596/971/1/012012
Article Google Scholar
M. Jurczak, M. Kolodziej, A. Majkowski, Implementation of a convolutional neural network for eye blink artifacts removal from the electroencephalography signal. Front. Neurosci. 16, 782367 (2022). https://doi.org/10.3389/fnins.2022.782367
Article Google Scholar
X. Zhu, W.L. Zheng, B.L. Lu, X. Chen, S. Chen, C. Wang, EOG-based drowsiness detection using convolutional neural networks, in Proceedings of International Joint Conference on Neural Networks (IJCNN), (2014)
Google Scholar
A. Erfanian, B. Mahmoudi, Real-time ocular artifact suppression using recurrent neural network for electro-encephalogram based brain-computer interface. Med. Biol. Eng. Comput. 43, 296–305 (2005). https://doi.org/10.1007/BF02345969
Article Google Scholar
M. Dutt, M. Goodwin, C.W. Omlin, Automatic sleep stage identification with time distributed convolutional neural network, in Proceedings of International Joint Conference on Neural Networks (IJCNN), (2021)
Google Scholar
B. Xia, Q. Li, J. Jia, J. Wang, U. Chaudhary, A. Ramos-Murguialday, N. Birbaumer, Electrooculogram based sleep stage classification using deep belief network, in Proceedings of International Joint Conference on Neural Networks (IJCNN), (2017)
Google Scholar
P. Kawde, G.K. Verma, Deep belief network based affect recognition from physiological, in Proceedings of Uttar Pradesh Section International Conference on Electrical, Computer and Electronics (UPCON), (2017)
Google Scholar
I. Niroshana, X. Zhu, Y. Chen, W. Chen, Sleep stage classification based on EEG, EOG, and CNN-GRU deep learning model, in Proceedings of IEEE International Conference on Awareness Science and Technology (iCAST), (2019)
Google Scholar
N.M.M. Noor, S. Ahmad, Implementation of Fuzzy logic controller for wheelchair motion control based on EOG data. Appl. Mech. Mater. 661, 183–189 (2014). https://doi.org/10.4028/www.scientific.net/amm.661.183
Article Google Scholar
M.T. Hellyara, E.C. Ifeachora, D.J. Mappsa, E.M. Allen, N.R. Hudson, Expert system approach to electroencephalogram signal processing. Knowl.-Based Syst. 8(4), 164–175 (1995). https://doi.org/10.1016/0950-7051(95)96213-B
Article Google Scholar
T.G. Chang, J.R. Smith, J.C. Principe, An expert system for multichannel sleep EEG/EOG signal analysis. ISA Trans. 28(1), 45–51 (1989). https://doi.org/10.1016/0019-0578(89)90056-6
Article Google Scholar
I. Watson, F. Marir, Case-based reasoning: A review (Cambridge University, 2009)
Google Scholar
S. Barua, S. Begum, M.U. Ahmed, P. Funk, Classification of ocular artifacts in EEG signals using hierarchical clustering and case-based reasoning, in Proceedings of International Conference on Case-Based Reasoning (ICCBR), (2014)
Google Scholar
H.B. Abdessalem, A. Byrns, C. Frasson, Optimizing Alzheimer’s disease therapy using a neural intelligent agent-based platform. International Journal of Biosensors & Bioelectronics. Inter. J. Biosens. Bioelectron. 11(2), 70–96 (2021). https://doi.org/10.4236/ijis.2021.112006
Article Google Scholar
T.P. Filgueiras, P.B. Filho, Intelligent agents in biomedical engineering: a systematic review. Inter. J. Biosens. Bioelectron. 6(5), 123–128 (2020). https://doi.org/10.15406/ijbsbe.2020.06.00200
Article Google Scholar
H. Phan, O.Y. Chén, P.K. Zongqing, I. McLoughlin, A. Mertins, M. De Vos, Towards more accurate automatic sleep staging via deep transfer learing. IEEE Trans. Biomed. Eng. 68(6), 1787–1798 (2021). https://doi.org/10.1109/TBME.2020.3020381
Article Google Scholar
J. Perdiz, L. Garrote, G. Pires, U.J. Nunes, A Reinforcement learning assisted eye-driven computer game employing a decision tree-based approach and CNN classification. IEEE Access 9, 46011–46021 (2021). https://doi.org/10.1109/ACCESS.2021.3068055
Article Google Scholar
L. Garrote, J. Perdiz, G. Pires, U. Nunes, Reinforcement learning motion planning for an EOG-centered robot assisted navigation in a virtual environment, in Proceedings of IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), (2019)
Google Scholar
P. Zhang, M. Ito, S.-I. Ito, M. Fukumi, Implementation of EOG mouse using Learning vector quantization and EOG-feature based methods, in Proceedings of Conference on Systems, Process & Control (ICSPC), (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical, Electronic, Communications and Systems Engineering, University of Oviedo, Gijón, Spain
Alberto López & Francisco Ferrero

Authors

Alberto López
View author publications
You can also search for this author in PubMed Google Scholar
Francisco Ferrero
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Francisco Ferrero .

Editor information

Editors and Affiliations

Electrical and Computer Engineering Department, Effat University, Jeddah, Saudi Arabia
Saeed Mian Qaisar
Department of Electronic Engineering Faculty of Engineering and Green Technology, Universiti Tunku Abdul Rahman, Kampar, Malaysia
Humaira Nisar
Institute of Biomedicine Faculty of Medicine, University of Turku, Turku, Finland
Abdulhamit Subasi

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

López, A., Ferrero, F. (2023). Biomedical Signal Processing and Artificial Intelligence in EOG Signals. In: Qaisar, S.M., Nisar, H., Subasi, A. (eds) Advances in Non-Invasive Biomedical Signal Sensing and Processing with Machine Learning. Springer, Cham. https://doi.org/10.1007/978-3-031-23239-8_8

Download citation

DOI: https://doi.org/10.1007/978-3-031-23239-8_8
Published: 23 February 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-23238-1
Online ISBN: 978-3-031-23239-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Biomedical Signal Processing and Artificial Intelligence in EOG Signals

Abstract

Similar content being viewed by others

Suppression of Artifacts from EEG Recordings Using Computational Intelligence

Mathematical Analysis of Electroencephalography Applied to Control Brain Machine Interfaces

EEG Based Automated Detection of Six Different Eye Movement Conditions for Implementation in Personal Assistive Application

Keywords

8.1 Introduction

8.1.1 EOG Fundamentals

8.1.2 EOG Signal Measurement

8.2 EOG Signal Denoising

8.3 Compression

8.4 EOG Feature Processing

8.4.1 Feature Extraction

8.4.1.1 Time-Domain Features

8.4.1.2 Frequency Domain Features

8.4.1.3 Time-Frequency Features

8.4.1.4 Non-Linear Features

8.4.2 Feature Selection

8.4.3 Feature Normalization

8.5 Classification

8.5.1 Machine Learning Techniques

8.5.2 Deep Learning Techniques

8.6 Decision-Making

8.6.1 Intelligent Decision Support Systems

8.6.2 Learning Approaches

8.7 Discussion

8.8 Conclusions

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation