Abstract
The cortical pyramidal neurons in the cerebral cortex, which are positioned perpendicularly to the brain’s surface, are assumed to be the primary source of the electroencephalogram (EEG) reading. The EEG reading generated by the brainstem in response to auditory impulses is known as the Auditory Brainstem Response (ABR). The identification of wave V in ABR is now regarded as the most efficient method for audiology testing. The ABR signal is modest in amplitude and is lost in the background noise. The traditional approach of retrieving the underlying wave V, which employs an averaging methodology, necessitates more attempts. This results in a protracted length of screening time, which causes the subject discomfort. For the detection of wave V, this paper uses Kalman filtering and Cyclic Shift Tree Denoising (CSTD). In state space form, we applied Markov process modeling of ABR dynamics. The Kalman filter, which is optimum in the mean-square sense, is used to estimate the clean ABRs. To save time and effort, discrete wavelet transform (DWT) coefficients are employed as features instead of filtering the raw ABR signal. The results show that even with a smaller number of epochs, the wave is still visible and the morphology of the ABR signal is preserved.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Genetic predisposition, [post/peri]-natal factors, intrauterine environment, all have a part in the growth of childhood hearing impairment. Due to the presence of various categorization systems, the definition of hearing loss/deficit may fluctuate. In early development, being exposed to spoken language is essential. As a result, children with undiagnosed hearing loss, including mild and unilateral deficits, may experience significant delays in speech development as well as psychological and mental behavioral disorders, which can have an impact on their social and academic skills as well as their overall development [15, 18, 22].
The Auditory Brainstem Response (ABR) is a regularly used tool for evaluating neonatal auditory function. This method has long been considered the gold standard for neonatal diagnostic evaluation throughout the first six months of life. However, employing repeated averaging of the trials for meaningful ABR waveform, ABR is also utilized for detecting hearing loss in adults [1]. To find the lowest level that provokes a discernible response, the electroencephalogram (EEG) ABR waveform is sampled, averaged, and waveforms are recorded for stimuli of various intensities [13, 17]. Noise interferences can cause the morphology of the ABR to be distorted, affecting its accuracy. As a result, identifying the wave V can be a difficult task since noise can be emitted by equipment, circuits, or power sources. To correctly identify the presence of the ABR and differentiate it from physiologic noise, skilled clinical interpretation is required; unfortunately, subjective interpretation and the possibility of human mistake impair an objective physiologic measure [18]. For this reason, researchers like Wang et al. [23] suggested reasonable techniques to make hearing screening feasible. They suggested use a Kalman filter to adaptively extract sounds from ABR signals, reducing the number of trials required to enhance efficiency. Their work shows that with proper denoising using Kalman filter, fewer sweeps was required to obtain reliable ABR waveforms. In another related study, a wavelet based algorithm was introduced by [4] known as cyclic shift tree denoising (CSTD) method that technically gives a faster convergence on estimating the underlying ABR waveform compared to the conventional averaging methods [10, 21]. The CSTD method is computationally stable and it has uncomplicated algorithm which motivates us to apply the method in this study.
2 Materials and Methods
Stimulus such as click, tonal, or chirp can be used in ABR measurements [20]. Though click stimulus is widely used in ABR measurements, however the responses obtained are not from the whole cochlea but are rather thought to originate from basal regions (2–4 kHz) [12]. Therefore, the chirp stimulus was developed. This chirp purpose is to stimulate the entire cochlea simultaneously and provide effective neural synchronisation. Studies report that, since chirp stimulus has a specific sequence from low frequency to high, larger amplitude ABR waves can be formed than with click sounds and the whole cochlea can be stimulated at the same time [6, 7, 11]. A chirp is a quick sweep through frequencies which is either low to high or high to low frequency. The stimulus of Audio chirp was generated using a personal computer for each participant, which simultaneously processes and records the ABR signals. By simultaneously connecting output audio to both the trigger box and the g.PAH Programmable Attenuator Headphone buffer, the triggering procedure could be completed (Guger Technologies, Austria). The trigger box receives the computer’s audio signal and converts it to a square signal (trigger signal), which is utilized as a reference point for framing the EEG signal according to the provided stimulus. At the same time, the g.PAH attenuator sent the signal to the headphones such that every time the “click” played on the computer, the participant hears it. The device input power is 100–240 V with a maximum frequency of 50–60 Hz; thus, we used a bandpass filter with cutoff 100 Hz–1500 kHz for the recorded signal. Twenty chirps per second of stimulus rate were used to record the ABR signal with a sampling rate of 19.2 kHz and 24-bit resolution. Although, this sampling rate was empirically set supported by previous studies, which was found to be around 20 kHz. In this study, the intensity level of the stimulus was set in the range of 30–60 dBnHL with an increment of 10 dB. The EEG data were then recorded and averaged using sliding windows of 2000 frames. Each recorded epoch start-point is aligned with the start of the triggering signal and ends within a window of 40 ms (768 samples) for chirp and 20 ms (384 samples) for a click. The processing steps were implemented using MATLAB. Figure 1 shows the experiment setup and devices used for data collection of the ABR signals.
2.1 Wavelets Methods
The two wavelet denoising methods subjected to performance evaluation are as follows:
-
Wavelet Kalman Filter (WKF)
-
Cyclic Shift Tree Denoising (CSTD)
Figure 2, shows the flow of the two types wavelet based denoising methods, (a) Wavelet Kalman filter approach, (b) Cyclic Shift Tree Denoising.
A detail description of WKF method could be found in [19] while CSTD method could be found in [4]. This section only covered the process of modification. Due to the closeness in morphology between the ABR and the synthesis wavelet, the biorthogonal 5.5 wavelets were used as proposed by [8]. The signal was split into high-pass (HP) and low-pass (LP) components, which correspond to details and approximates components, respectively, using the discrete wavelet transform (DWT). The signal was dissected to level 5, at which point the ABR’s main frequencies were 200, 500, 900 Hz, respectively [8].
2.2 Wavelet Kalman Filter Approach
DWT is the digital form of continuous Wavelet transform (CWT) and can be used by setting discrete values of \(s = 2^j\) and \(u = n(2^j)\), where j and n are integer numbers, such that,
where \(S_{2^j} f (n)\) are the approximation coefficients (represent the low frequency sub-band), \(W_{2^j} f (n)\) are the detail coefficients (represent the high frequency sub- band), \(j,\;n,\; k\; \in Z\), Z is a set of all integer numbers. \(h_k\) and \(g_k\) are the low and high pass filters coefficients respectively. This study used wavelet coefficients for the observation model as suggested by [16] to reduce the state dimensions as shown in Fig. 2(a). The biorthogonal 5.5 wavelets were chosen as suggested in [16, 24] by considering the similarity in the morphology of the ABR with the synthesis wavelet. The improved signal was then decomposed with DWT to obtain the coefficients for the Kalman filter process. Algorithm 1 summarizes the recursive procedure of Kalman filter where H and C are reshaping matrices. When applying Kalman filter, the ABR estimation model was assumed as a linear additive model according to [5]. From the analysis, the minimum number of epochs sufficient to detect wave V was chosen based on the experimental result. This method was introduced to overcome the high computational complexity due to the high dimensional state vector. This data fusion algorithm gives a small computational requirement, elegant recursive properties and is the optimal estimator for one-dimensional linear systems with Gaussian error statistics. They typically used for smoothing noisy data and provide the estimated parameter of interest. It is also widely used for positioning system receivers, phase locked loops in radio equipment, smoothing the output from laptop track pads and more.
2.3 Cyclic Shift Tree Denoising
The flow chart in Fig. 2(b) shows CSTD method, an array of wavelet coefficients was created by performing a DWT on each frame and arranged as a successive original frame. Proceeding with the final average created by linearly averaging all the frames and applied the scale threshold to obtain a smooth ABR signal. The array of wavelet coefficients was then denoised using CSTD, with each denoised in a distinct way. To create a sequence of total N frames, linearly average all different denoised reordering of frames to which CSTD has been performed [3, 9]. Time domain samples were obtained by linearly averaging the N frames to generate one frame of wavelet coefficient and then reconstructing this average frame. This new estimation technique has a faster rate of convergence to the underlying signal than linear averaging, and it outperforms linear averaging in terms of performance [2, 8]. This current technique, however, has a number of drawbacks, including the inability to apply it to a single frame of data. Other constraints include the requirement for multiple measurements of the same signal. The CSTD algorithm requires a power of two number of initial frames, with the signal being estimated being constant between frames.
3 Results and Discussion
Wave V amplitudes and latencies, for example, were recognized as crucial components. In the acceptance area of latencies, the wave V was traced whether it existed or not. The average of 2048 epochs of data from each subject were used to calculate the delay. In addition, data collection from tainted data with noise interference was examined using a new modified methodology called MCSTD, which was recommended by the researchers. Female shows earlier latencies than male and larger amplitude than male. Table 1, below shows the average latency values of 11 normal adult’s subjects for chirp stimulus at 2048 sweep.
Wave V amplitudes and latencies were identified as key components. The wave V was traced in the acceptance zone of latencies, whether it exists or not. The average of 2048 epochs recorded data from each subject was used to calculate the delay. In addition, data collection from contaminated data with noise interference were analyzed using new suggested modified approach MCSTD. Female shows earlier latencies than male and larger amplitude than male. Table 1, below shows the average latency values of 11 normal adult’s subjects for chirp stimulus at 2048 sweep.
The inter-wave intervals for females are shorter due to the effect of negligible for a wave I and more pronounced for later waves. Females have smaller head size and less brain volume compared to male. Thus, the inter-wave latencies become shorter if the distance between the generators for each of the waves is shorter and the amplitude will be larger if the recording electrode is relatively closure to the wave generator. Figure 3, is the comparison between female’s and male’s signal.
3.1 Selection Minimum Number of Epochs
The signal was analyzed using averaging, KF, CSTD and MCSTD approaches. The data were recorded for four different intensity levels consist of 60, 50, 40 and 30 dBnHL. Wave V recognition between different intensity levels was related to their latency and frequency for each stimulus. Corona-Strauss et al. [5] stated that the latency value increased by decreasing the frequency content of chirps. Moreover, the latency of the responses evoked by higher intensity levels is shorter than the response evoked from a lower intensity level. Decreased the intensity level will longer the latency and smaller the amplitude.
Figure 4 (a) and (b) showed the results for the wave V detection at sweeps number of 16 and 8. When decreasing the sweep number, the signal amplitude increased. According to [16] the ABR signal amplitude is between 0.1 to 1.0 \(\upmu \text {V}\). However, ABR signal amplitude at 16 and 8 sweeps exceeded 1 \(\upmu \text {V}\) showed by the straight red line. In addition, wave V detection failed when decreasing the sweeps number to 16 and 8. Thus, 16 and 8 sweeps are not reliable for detecting wave V. The goal of this research is to determine the minimal trial that will be fulfilled in detecting the presence of wave V and denoising the tainted signals. By comparing the results of averaged data morphology, the performance of each methodology was assessed. Each recorded signal was averaged sweep by sweep and the sweep was divided into 2048, 1024, 512, 256, 128, 64 and 32 sweeps. When the signal is averaged over 2048 epochs, all techniques provide an accurate assessment of the ABR signal. When the number of epochs is reduced, however, wave V identification differs between techniques. As a result, when compared to other approaches, MCSTD performs admirably. When decreasing the epoch’s number to 16 and 8, the signal amplitude increases and exceeded 1 \(\upmu \text {V}\) in which ABR amplitude is between 0.1 to 1.0 \(\upmu \text {V}\) [14].
4 Conclusion
Analyzing EEG signal is very challenging due to their high complexity, low SNR, non-linearity and non-stationary. ABR is currently the most reliable method for hearing screening, however, the noise interference from environment, equipment, and subject movement can be a difficult task for wave V detection. The aim of this study is to reduce screening time by using a better signal processing approach, reducing the sweeping number and lowering the noise interference effects. This study focused on detecting wave V in normal adults by applying KF and CSTD. From the result, it is shown that better detection of wave V is obtained using WKF with the reduced number of epochs by considering the SNR and RMSE. Further work should be carried to see the performance based on Modified Cyclic Shift Tree Denoising.
References
Afsana, F., Asif-Ur-Rahman, M., Ahmed, M.R., Mahmud, M., Kaiser, M.S.: An energy conserving routing scheme for wireless body sensor nanonetwork communication. IEEE Access 6, 9186–9200 (2018)
Al Nahian, M.J., et al.: Towards an accelerometer-based elderly fall detection system using cross-disciplinary time series features. IEEE Access 9, 39413–39431 (2021)
Al Nahian, M.J., et al.: Social group optimized machine-learning based elderly fall detection approach using interdisciplinary time-series features. In: 2021 International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD), pp. 321–325. IEEE (2021)
Causevic, E., Morley, R.E., Wickerhauser, M.V., Jacquin, A.E.: Fast wavelet estimation of weak biosignals. IEEE Trans. Biomed. Eng. 52(6), 1021–1032 (2005)
Corona-Strauss, F.I., Schick, B., Delb, W., Strauss, D.J.: Notched-noise embedded frequency specific chirps for objective audiometry using auditory brainstem responses. Audiol. Res. 2(1), 30–38 (2012)
Dau, T., Wegner, O., Mellert, V., Kollmeier, B.: Auditory brainstem responses with optimized chirp signals compensating basilar-membrane dispersion. J. Acoust. Soc. Am. 107(3), 1530–1540 (2000)
Davis, M.H., et al.: Dissociating speech perception and comprehension at reduced levels of awareness. Proc. Natl. Acad. Sci. 104(41), 16032–16037 (2007)
De Silva, A., Schier, M.: Evaluation of wavelet techniques in rapid extraction of ABR variations from underlying EEG. Physiol. Meas. 32(11), 1747 (2011)
Dey, N., Rajinikanth, V., Fong, S., Kaiser, M., Mahmud, M.: Social-group-optimization assisted Kapur’s entropy and morphological segmentation for automated detection of Covid-19 infection from computed tomography images. Cogn. Comput. 12(5), 1011–1023 (2020)
Farhin, F., Kaiser, M.S., Mahmud, M.: Secured smart healthcare system: blockchain and Bayesian inference based approach. In: Kaiser, M.S., Bandyopadhyay, A., Mahmud, M., Ray, K. (eds.) Proceedings of International Conference on Trends in Computational and Cognitive Engineering. AISC, vol. 1309, pp. 455–465. Springer, Singapore (2021). https://doi.org/10.1007/978-981-33-4673-4_36
Freeman, W., Quiroga, R.Q.: Imaging Brain Function with EEG: Advanced Temporal and Spatial Analysis of Electroencephalographic Signals. Springer, Heidelberg (2012)
Gorga, M.P., Johnson, T.A., Kaminski, J.K., Beauchaine, K.L., Garner, C.A., Neely, S.T.: Using a combination of click-and toneburst-evoked auditory brainstem response measurements to estimate pure-tone thresholds. Ear Hear. 27(1), 60 (2006)
Kaiser, M.S., Chowdhury, Z.I., Al Mamun, S., Hussain, A., Mahmud, M.: A neuro-fuzzy control system based on feature extraction of surface electromyogram signal for solar-powered wheelchair. Cogn. Comput. 8(5), 946–954 (2016)
Maglione, J., Pincilotti, M., Acevedo, R., Bonell, C., Gentiletti, G.: Estimation of the auditory brainstem response’s wave V by means of wavelet transform. In: Proceedings of the 25th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (IEEE Cat. No. 03CH37439), vol. 3, pp. 2631–2634. IEEE (2003)
Mahmud, M., Kaiser, M.S., Hussain, A., Vassanelli, S.: Applications of deep learning and reinforcement learning to biological data. IEEE Trans. Neural Netw. Learn. Syst. 29(6), 2063–2079 (2018)
Mohseni, H.R., Nazarpour, K., Wilding, E.L., Sanei, S.: The application of particle filters in single trial event-related potential estimation. Physiol. Meas. 30(10), 1101 (2009)
Noor, M.B.T., Zenia, N.Z., et al.: Application of deep learning in detecting neurological disorders from magnetic resonance images: a survey on the detection of alzheimer’s disease, parkinson’s disease and schizophrenia. Brain Inform. 7(1), 1–21 (2020)
Noor, M.B.T., Zenia, N.Z., Kaiser, M.S., Mahmud, M., Al Mamun, S.: Detecting neurodegenerative disease from MRI: a brief review on a deep learning perspective. In: Liang, P., Goel, V., Shan, C. (eds.) Brain Informatics. LNCS, vol. 11976, pp. 115–125. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-37078-7_12
Omar, M.H.: Single trial estimation for auditory brainstem response signal analysis. Ph.D. thesis, Universiti Teknologi Malaysia (2013)
Purdy, S.C., Abbas, P.J.: ABR thresholds to tonebursts gated with Blackman and linear windows in adults with high-frequency sensorineural hearing loss. Ear Hear. 23(4), 358–368 (2002)
Sharma, M.K., Ray, K., Yupapin, P., Kaiser, M.S., Ong, C.T., Ali, J.: Comparative analysis of different classifiers on EEG signals for predicting epileptic seizure. In: Kaiser, M.S., Bandyopadhyay, A., Mahmud, M., Ray, K. (eds.) Proceedings of International Conference on Trends in Computational and Cognitive Engineering. AISC, vol. 1309, pp. 193–204. Springer, Singapore (2021). https://doi.org/10.1007/978-981-33-4673-4_17
Singh, A.K., Kumar, A., Mahmud, M., Kaiser, M.S., Kishore, A.: Covid-19 infection detection from chest X-ray images using hybrid social group optimization and support vector classifier. Cogn. Comput. 1–13 (2021)
Wang, X., et al.: The effects of random stimulation rate on measurements of auditory brainstem response. Front. Hum. Neurosci. 14 (2020)
Wilson, W.: The relationship between the auditory brain-stem response and its reconstructed waveforms following discrete wavelet transformation. Clin. Neurophysiol. 115(5), 1129–1139 (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Hussain, H. et al. (2021). Enhanced Signal Processing Using Modified Cyclic Shift Tree Denoising. In: Mahmud, M., Kaiser, M.S., Kasabov, N., Iftekharuddin, K., Zhong, N. (eds) Applied Intelligence and Informatics. AII 2021. Communications in Computer and Information Science, vol 1435. Springer, Cham. https://doi.org/10.1007/978-3-030-82269-9_12
Download citation
DOI: https://doi.org/10.1007/978-3-030-82269-9_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-82268-2
Online ISBN: 978-3-030-82269-9
eBook Packages: Computer ScienceComputer Science (R0)