
1 Introduction

People affected by severe neurodegenerative diseases (e.g., late-stage amyotrophic lateral sclerosis (ALS) or locked-in syndrome) eventually lose all muscular control and are no longer able to gesture or speak. They also cannot use traditional assistive communication devices that depend on muscle control, nor typical brain-computer interfaces (BCIs) that depend on visual stimulation or feedback [1,2,3]. For this population, auditory [4,5,6,7,8] and tactile BCIs [9, 10] are two of only a few remaining means of communication (see [11] for review).

While visual BCIs typically preserve the identity between the stimulus (e.g., a highlighted ‘A’) and the symbol the user wants to communicate (e.g., the letter ‘A’), all currently used auditory or tactile BCIs require a relatively artificial mapping between a stimulus (e.g., a particular but arbitrary sound) and a communication output (e.g., a particular letter or word). This mapping is easy to learn when there are only a few possible outputs (e.g., a yes or no command). However, as the number of possible outputs increases, such as with a spelling device, the mapping becomes arbitrary and complex. This makes most current auditory and tactile BCI systems cumbersome to learn and use.

Two avenues are being investigated to overcome this limitation. The first avenue is to directly decode expressive silent speech without requiring any external stimuli. In this approach, linguistic elements at different levels (e.g., phonemes, syllables, words and phrases) are first decoded from brain signals and then synthesized into speech. While recent studies have demonstrated this possibility [12,13,14,15,16], even invasive recording techniques (e.g., ECoG, LFPs, single-neuron recordings) are currently unable to capture the entire complexity of expressive speech. Consequently, silent speech BCIs are limited in the vocabulary that can be decoded directly from the brain signals. The second avenue is to replace unnatural stimuli that require an artificial mapping with speech stimuli that do not. In such a system, the user would communicate simply by directing attention to the speech stimulus that matches his/her intent. Previous studies that explored this avenue required the speech stimuli to be designed (e.g., altered and broken up [17]) such that they elicited a particular, discriminable evoked response. Such evoked responses can be readily detected in scalp-recorded electroencephalography (EEG) to identify the attended speech stimulus. However, such altered speech stimuli are difficult to understand, which makes such a BCI system difficult to use. More importantly, this approach does not scale well beyond two simultaneously presented speech stimuli.

Recent studies suggest that the envelope of attended speech is directly tracked by electrocorticographic (ECoG) signals in the gamma band (i.e., 70–170 Hz) [15, 18,19,20,21], effectively removing the need to ‘alter’ the speech stimuli. Further evidence shows that this approach can identify auditory attention to one speaker in a mixture of speakers, i.e., a ‘cocktail-party’ situation [22].

However, BCI systems that use this physiological mechanism for communication purposes have not been described yet. In this study, we explore this possibility by implementing a BCI2000-based real-time system that uses ECoG signals to identify the attended speaker.

2 Methods

2.1 Human Subject

The subject in this study was a 49-year-old, left-handed woman with intractable epilepsy who underwent temporary placement of subdural electrode arrays (see Fig. 1a) to localize seizure foci prior to surgical resection. A neuropsychological evaluation [23] revealed normal cognitive function and hearing (full-scale IQ = 97, verbal IQ = 91, performance IQ = 99), and a pre-operative Wada test [24] determined left-hemispheric language dominance.

Fig. 1

Implant. The subject had 72 subdural electrodes (1 grid and 3 strips in different configurations) implanted over left frontal, parietal, and temporal regions. a Photograph of the craniotomy and the implanted grids in this subject. b Cortical model of the subject’s brain, showing an \(8\,\times \,8\) grid over frontal/parietal cortex, and two strips

The subject had a total of 72 subdural electrode contacts (one \(8\times 8\) 64-contact grid with 3 contacts removed, two strips in \(1\,\times \,4\) configuration, and one strip in \(1\,\times \,3\) configuration). The grid and strips were placed over the left hemisphere in frontal, parietal and temporal regions (see Fig. 1b for details). The implants consisted of flat electrodes with an exposed diameter of 2.3 mm and an inter-electrode distance of 1 cm, and were implanted for one week. Grid placement and duration of ECoG monitoring were based solely on the requirements of the clinical evaluation, without any consideration of this study. The subject provided informed consent, and the study was approved by the Institutional Review Board of Albany Medical College.

We used post-operative radiographs (anterior-posterior and lateral) and computed tomography (CT) scans to verify the cortical location of the electrodes. We then used Curry software (Neuroscan Inc., El Paso, TX) to create subject-specific 3D cortical brain models from high-resolution pre-operative magnetic resonance imaging (MRI) scans. We co-registered the MRIs with the post-operative CT and extracted the electrode coordinates according to the Talairach Atlas [25]. These electrode coordinates are depicted on a Talairach template brain in Fig. 1b.

2.2 Data Collection

We recorded ECoG from the implanted electrodes using a g.HIamp amplifier/digitizer system (g.tec, Graz, Austria) and the BCI software platform BCI2000 [26,27,28], which sampled the data at 1200 Hz. Simultaneous clinical monitoring was implemented using a connector that split the cables coming from the patient into one set that was connected to the clinical monitoring system and another set that was connected to the g.HIamp devices. This ensured that clinical data collection was not compromised at any time. Two electrocorticographically silent electrodes (i.e., locations that were not identified as eloquent cortex by electrocortical stimulation mapping) served as electrical ground and reference, respectively.

Fig. 2

Experimental setup and methods. a Subjects selectively directed auditory attention to one of two simultaneously presented speakers. b We extracted the envelope of ECoG signals in the high gamma band, as well as the envelopes of the attended and unattended speech stimuli (i.e., JFK and Obama). c The correlation between the envelopes of the ECoG gamma band and the attended speech stimulus, accumulated over time, is markedly larger than the accumulated correlation between the envelopes of the ECoG gamma band and the unattended speech stimulus

2.3 Stimuli and Task

The subject’s task was to selectively attend to one of two simultaneously presented speakers in a cocktail party situation (see Fig. 2a). The two speakers were John F. Kennedy and Barack Obama, each delivering his presidential inauguration address. Both speeches were similar in their linguistic features, but were uncorrelated in their sound intensities (\(\mathrm{r} = -0.02, \mathrm{p} = 0.9\)). To create a cocktail party situation, we mixed the two (monaural) speeches into a binaural presentation in which one ear received a \(20\%:80\%\) volume mixture of the two speakers and the other ear received the complementary \(80\%:20\%\) mixture. This allowed us to manipulate the aural location of each speaker throughout the task. For the binaural presentation, we used in-ear monitoring earphones (AKP IP2, 12–23500 Hz bandwidth) that isolated the subject from any ambient noise in the room.
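As an illustration, the following minimal sketch shows how two mono speech waveforms of equal length and sampling rate could be combined into such a complementary binaural mixture. The function name and the ratio parameter are our own; this is not the experimental software itself.

```python
import numpy as np

def mix_binaural(speaker_a, speaker_b, ratio=0.2):
    """Mix two mono speech waveforms into one stereo stream.

    Each ear receives a complementary volume mixture of the two speakers
    (e.g., 20%:80% in the left ear and 80%:20% in the right), which places
    each speaker at a distinct aural location.
    """
    n = min(len(speaker_a), len(speaker_b))
    a, b = speaker_a[:n], speaker_b[:n]
    left = ratio * a + (1.0 - ratio) * b
    right = (1.0 - ratio) * a + ratio * b
    return np.stack([left, right], axis=1)  # shape: (samples, 2)
```

Swapping the roles of the two speakers in this mixture (e.g., by calling the function with the arguments exchanged) moves each speaker to the opposite aural location, which is how the location could be counter-balanced across trials.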

To create a trial structure, we broke these combined streams into segments of 15–25 s in length, resulting in a total of 10 segments with a combined length of 187 s. In the course of the experiment, we presented each segment four times to counter-balance the aural location (i.e., left and right) and the identity (i.e., JFK and Obama) of the attended speaker. Thus, over these four presentations, the subject had to attend to each of the two speakers at each of the two aural locations. This resulted in a total of 40 trials (i.e., 10 segments, each presented 4 times).

At the beginning of each trial, an auditory cue indicated the aural location (i.e., left or right) to which the subject should attend. Throughout the trial, a visual stimulus complemented the initial auditory cue by indicating the identity of the speaker at the attended aural location (e.g., ‘JFK in LEFT ear’). Each trial consisted of a 4 s cue, a 15–25 s stimulus, and a 5 s inter-stimulus period. The total stimulus duration across these 40 trials was 12.5 min. The subject performed the 40 trials in 5 blocks of 8 trials each, with a 3 min break between blocks.

2.4 Offline Analysis

In the offline analysis, we characterized the relationship between the neural response (i.e., the ECoG signals) and the attended and unattended speech streams, as shown in Fig. 2b. In particular, we were interested in two parameters of this neural response. The first parameter was the delay between the audio stream and the resulting cortical processing, i.e., the time from presentation of the audio stream to the observation of the cortical change. The second parameter was the cortical location that was most selective for the attended speech stream. These were the only two parameters needed later to configure the online BCI system.

To determine these two parameters, we extracted the high gamma band envelope at each cortical location and the envelopes of the covertly attended and unattended speech (i.e., JFK and Obama). We then correlated the high gamma band envelope at each cortical location, once with the attended and once with the unattended speech envelope. This resulted in two Spearman’s r-values for each cortical location. An example of this is shown in Fig. 2c. To determine the delay between the audio stream and the resulting cortical processing, we measured the neural tracking of the sound intensity across delays from 0 to 250 ms to identify the delay with the highest r-value.

2.4.1 Signal Processing

We first pre-processed the ECoG signals from the 72 channels to remove external noise. To do this, we high-pass filtered the signals at 0.5 Hz and re-referenced them to a common average reference that we composed from only those channels for which the 60 Hz line noise was within 1.5 standard deviations of the average.
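A minimal sketch of this pre-processing step is shown below, assuming the ECoG data are held in a NumPy array of shape (samples, channels). The filter order and the Welch-based estimate of 60 Hz power are our own assumptions, since the text does not specify how the line noise was measured.

```python
import numpy as np
from scipy.signal import butter, sosfiltfilt, welch

def rereference_car(ecog, fs=1200.0):
    """High-pass filter at 0.5 Hz and re-reference to a common average
    built only from channels whose 60 Hz line-noise power lies within
    1.5 standard deviations of the across-channel average."""
    sos = butter(2, 0.5, btype='highpass', fs=fs, output='sos')
    filtered = sosfiltfilt(sos, ecog, axis=0)

    # Estimate the 60 Hz power of each channel via Welch's method.
    freqs, psd = welch(filtered, fs=fs, nperseg=int(fs), axis=0)
    line_power = psd[np.argmin(np.abs(freqs - 60.0)), :]

    # Keep only channels whose line noise is close to the group average.
    ok = np.abs(line_power - line_power.mean()) <= 1.5 * line_power.std()
    car = filtered[:, ok].mean(axis=1, keepdims=True)
    return filtered - car
```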

Next, we extracted the signal envelope in the high gamma band using these pre-processed ECoG signals. For this, we applied an 18th order 70–170 Hz Butterworth filter and then extracted the envelope of the filtered signals using the Hilbert transform. Finally, we low-pass filtered the resulting signal envelope at 6 Hz (anti-aliasing) and downsampled the result to 120 Hz.

For each attended and unattended auditory stream, we extracted the time course of the sound intensity, i.e., the envelope of the signal waveform in the speech band. To do this, we applied an 80–6000 Hz Butterworth filter to each audio signal and then extracted the envelope of the filtered signals using the Hilbert transform. Finally, we low-pass filtered the speech envelopes at 6 Hz and downsampled them to 120 Hz.
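Both envelope computations follow the same pattern (band-pass filter, Hilbert envelope, 6 Hz low-pass, downsampling to 120 Hz), so a single helper can serve both. The sketch below makes our own assumptions about the low-pass filter order and the audio sampling rate (48 kHz, per the real-time system description further below).

```python
import numpy as np
from scipy.signal import butter, sosfiltfilt, hilbert, resample_poly

def band_envelope(x, fs, band, order=18, target_fs=120.0):
    """Band-pass filter a signal, extract its Hilbert envelope, low-pass
    the envelope at 6 Hz (anti-aliasing), and downsample to target_fs."""
    # butter() with a band-pass doubles the order, so order // 2 poles
    # per edge yields the stated 18th-order filter for the ECoG band.
    sos = butter(order // 2, band, btype='bandpass', fs=fs, output='sos')
    envelope = np.abs(hilbert(sosfiltfilt(sos, x, axis=0), axis=0))
    sos_lp = butter(4, 6.0, btype='lowpass', fs=fs, output='sos')
    envelope = sosfiltfilt(sos_lp, envelope, axis=0)
    return resample_poly(envelope, up=1, down=int(round(fs / target_fs)), axis=0)

# High gamma envelope of the pre-processed ECoG (sampled at 1200 Hz) and
# speech-band envelope of each audio stream (assumed 48 kHz), e.g.:
# ecog_env   = band_envelope(ecog,  fs=1200.0,  band=(70.0, 170.0))
# speech_env = band_envelope(audio, fs=48000.0, band=(80.0, 6000.0))
```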

2.4.2 Feature Extraction

We extracted features that reflect the neural tracking of the attended and unattended speech stream. We defined neural tracking of speech as the correlation between the gamma envelope (of a given cortical location) and the speech envelope. We calculated this correlation separately for the attended and unattended speech, thereby obtaining two sets of r-values labeled ‘attended’ and ‘unattended,’ respectively.
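In code, this neural tracking could be computed per cortical location as in the sketch below, where `ecog_env` is the 120 Hz high gamma envelope matrix of shape (samples, channels) from the previous step and `speech_env` is one speech envelope of matching length (both names are ours).

```python
import numpy as np
from scipy.stats import spearmanr

def neural_tracking(ecog_env, speech_env):
    """Spearman correlation between the high gamma envelope of each
    cortical location and one speech envelope; returns one r-value
    and one p-value per channel."""
    n_channels = ecog_env.shape[1]
    r = np.empty(n_channels)
    p = np.empty(n_channels)
    for ch in range(n_channels):
        r[ch], p[ch] = spearmanr(ecog_env[:, ch], speech_env)
    return r, p

# r_att, p_att     = neural_tracking(ecog_env, attended_env)
# r_unatt, p_unatt = neural_tracking(ecog_env, unattended_env)
```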

2.4.3 Selection of Cortical Delay and Location

We expected a delay between the audio presentation and resulting cortical processing, i.e., the time from presentation of the audio stimuli to the observation of the cortical change. To account for this delay, we measured the neural tracking of the attended speech stream across different delays (0–250 ms, see Fig. 3) and across all channels. Next, we determined the cortical location that was most selective of the attended speech stream. For this, we selected the cortical location that showed the largest difference between the ‘attended’ and ‘unattended’ r-values. Based on these results, we selected a delay of 150 ms and a cortical location over superior temporal gyrus (STG). We corrected for this delay by shifting the speech envelopes relative to the ECoG envelopes prior to calculating the correlation values.
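The following sketch illustrates this parameter search. For simplicity it selects both the delay and the channel by the largest attended-minus-unattended r difference, a slight simplification of the two-step procedure described above, and it reuses the `neural_tracking` helper from the previous sketch.

```python
import numpy as np

def select_delay_and_channel(ecog_env, attended_env, unattended_env,
                             fs=120.0, max_delay_s=0.25):
    """Scan delays from 0 to 250 ms, shift the speech envelopes relative
    to the ECoG envelopes, and pick the delay and channel with the
    largest difference between 'attended' and 'unattended' r-values."""
    best = (0, 0, -np.inf)  # (delay in samples, channel index, r difference)
    for lag in range(int(max_delay_s * fs) + 1):
        # The cortex lags the audio: align ECoG sample t with audio sample t - lag.
        ecog_seg = ecog_env[lag:, :]
        att_seg = attended_env[:len(attended_env) - lag]
        unatt_seg = unattended_env[:len(unattended_env) - lag]
        r_att, _ = neural_tracking(ecog_seg, att_seg)
        r_unatt, _ = neural_tracking(ecog_seg, unatt_seg)
        diff = r_att - r_unatt
        ch = int(np.argmax(diff))
        if diff[ch] > best[2]:
            best = (lag, ch, diff[ch])
    delay_ms = 1000.0 * best[0] / fs
    return delay_ms, best[1]
```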

Fig. 3

Lag between speech presentation and neural response. This figure shows the correlation between neural response and the attended speech (green) for the most selective cortical location, across corrected lags between 0 and 250 ms. This correlation peaks at 150 ms

2.4.4 Classification

In our approach, we assumed that the extracted features, i.e., the two r-values of the selected cortical location, were directly predictive of the ‘attended’ conversation. In other words, for the selected cortical location, a trial was classified correctly if the ‘attended’ r-value was larger than the ‘unattended’ r-value. To determine the performance as a function of the length of attention, we applied our feature extraction and classification procedure to data segments from 0.1 to 15 s in length.
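A sketch of this decision rule and of the segment-length sweep is given below; the trial data structure and the `stg_channel` name are hypothetical and only serve to show how the pieces fit together.

```python
import numpy as np
from scipy.stats import spearmanr

def classify_trial(ecog_env, attended_env, unattended_env, channel,
                   fs=120.0, length_s=5.0):
    """Classify one trial from its first length_s seconds: the speech
    stream whose envelope correlates more strongly with the high gamma
    envelope of the selected channel is taken to be the attended one.
    Returns True if the trial is classified correctly."""
    n = int(length_s * fs)
    ecog = ecog_env[:n, channel]
    r_att, _ = spearmanr(ecog, attended_env[:n])
    r_unatt, _ = spearmanr(ecog, unattended_env[:n])
    return r_att > r_unatt

# Accuracy as a function of segment length, given a list of trials,
# each a tuple (ecog_env, attended_env, unattended_env):
# for length in (0.1, 0.5, 1, 2, 5, 10, 15):
#     acc = np.mean([classify_trial(*t, channel=stg_channel, length_s=length)
#                    for t in trials])
```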

2.5 Real-Time System Verification

In the real-time verification, we evaluated the system performance on the data recorded during the first stage of this study. We configured this system with parameters (i.e., cortical location and delay) determined in the previously detailed offline analysis.

2.5.1 Real-Time System Architecture

We used the BCI software platform BCI2000 [26,27,28] to implement an auditory attention based BCI. For this, we expanded BCI2000 with the capability to process auditory signals in real time. In detail, we implemented signal acquisition from audio devices (e.g., a microphone) or pre-recorded files that is synchronized with the acquisition of the neural signals. Further, we implemented a signal correlation filter. For our evaluation, the two (monaural) speeches served as the audio input to the auditory attention based BCI (see Fig. 4).

In this system, BCI2000 filters the audio signals between 80 and 6000 Hz and the ECoG signals between 70 and 170 Hz. Next, a BCI2000 filter extracts the envelopes, decimates them to a common sampling rate of 200 Hz, and adjusts their timing for the cortical delay. A signal correlation filter then calculates the correlation values, i.e., the correlation between each of the two (monaural) speeches and the selected neural envelope, to determine to which speaker the user directs his/her attention. Finally, the feedback augmentation filter increases the volume of the attended speaker and decreases the volume of the unattended speaker to provide feedback to the subject. These processing steps are repeated every 50 ms to provide feedback in real time.
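A schematic of one such update cycle is sketched below. The actual system is implemented as BCI2000 filters in C++; the buffer handling and gain step here are our own illustrative choices.

```python
import numpy as np
from scipy.stats import spearmanr

def update_cycle(ecog_env_buffer, audio_env_buffers, volumes, gain_step=0.05):
    """One 50 ms update of the online loop: correlate the selected ECoG
    envelope with the envelope of each speech stream, infer the attended
    speaker, and adjust the playback volumes to enhance the attended
    stream and attenuate the unattended one."""
    r = np.array([spearmanr(ecog_env_buffer, env)[0] for env in audio_env_buffers])
    attended = int(np.argmax(r))
    # Attenuate every stream by one step, then boost the attended stream
    # by two steps, so it gains relative to the others.
    new_volumes = np.asarray(volumes, dtype=float) - gain_step
    new_volumes[attended] += 2 * gain_step
    return attended, np.clip(new_volumes, 0.0, 1.0)
```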

Fig. 4

Real-time system design. The auditory attention BCI is based on BCI2000 and simultaneously acquires and processes audio and ECoG signals. The audio signals from multiple conversations are sampled at 48 kHz and acquired from a low-latency USB audio amplifier (Tascam US-122MKII). The ECoG signals from the surface of the brain are sampled at 1200 Hz and acquired from a 256-channel bio-signal amplifier (g.HIamp, g.tec, Graz, Austria). In the next step, the signals are band-pass filtered (80–6000 Hz for audio, 70–170 Hz for ECoG) and their envelopes are extracted. The resulting signal envelopes are decimated to a common sampling rate of 200 Hz and adjusted for timing differences. One channel of the decimated ECoG signal envelope is then selected and correlated with each of the decimated audio signal envelopes. As the human subject perceives the mixture of conversations through earphones, the auditory attention BCI can then provide feedback by modifying the volume of the presented mixture, enhancing the volume of the attended conversation and attenuating the volume of the unattended one

Fig. 5

Neural tracking of attended (left) and unattended (right) speech. The tracking of the attended speech is both stronger and more widely distributed than the tracking of the unattended speech. In addition, there is only a marginal difference in spatial distribution between attended and unattended stimuli

3 Results

3.1 Neural Correlates of Attended and Unattended Speech

First, we were interested in visualizing the cortical areas that track the ‘attended’ and ‘unattended’ conversations. The results in Fig. 5 show the neural tracking of the ‘attended’ and ‘unattended’ speech in the form of an activation index. For each cortical location, this activation index expresses the negative logarithm of the p-value (−log(p)) of the correlation between the high gamma ECoG envelope and the attended or unattended speech envelope. The neural tracking is focused predominantly on areas on or around the superior temporal gyrus (STG) and middle temporal gyrus (MTG).
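As a sketch, this activation index could be computed per channel as follows, reusing the Spearman p-values from the feature extraction step; the base-10 logarithm and the guard against zero p-values are our assumptions.

```python
import numpy as np
from scipy.stats import spearmanr

def activation_index(ecog_env, speech_env):
    """Activation index per cortical location: the negative logarithm of
    the p-value of the correlation between the high gamma envelope and
    one speech envelope (attended or unattended)."""
    idx = np.empty(ecog_env.shape[1])
    for ch in range(ecog_env.shape[1]):
        _, p = spearmanr(ecog_env[:, ch], speech_env)
        idx[ch] = -np.log10(max(p, np.finfo(float).tiny))  # guard against p == 0
    return idx
```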

3.2 Relationship Between Segment Length and Classification Accuracy

Next, we were interested in determining the duration of attention needed to infer the ‘attended’ speech. For this, we examined the relationship between segment length and classification accuracy. The results in Fig. 6 show the classification accuracies for variable segment lengths (0.1–15 s). The accuracy improvements level off after 5 s, at 80–90% accuracy.

Fig. 6

Accuracy for different segment lengths. The classification accuracy generally increases with segment length. The red horizontal dashed line indicates chance accuracy

3.3 Interface to the Investigator

Finally, we evaluated the performance that the real-time system, configured with the parameters (i.e., the cortical location and delay) determined in the offline analysis, achieved on the data recorded during the first stage of this study. The screenshot in Fig. 7 shows the interface to the investigator. The interface presents the decimated and aligned ECoG and audio envelopes, their correlation with each other, and the inferred attention. The content of this interface is updated 20 times per second.

Fig. 7

Interface design. The interface to the investigator presents multiple panels. The bottom left panel presents the decimated and aligned ECoG and audio envelopes. The panels on the right show the correlation between the ECoG and the attended speech (top), the ECoG and the unattended speech (middle), and the difference between the two correlation values (bottom). The panel on the top left shows this correlation difference in the form of an analogue instrument in which the pointer (i.e., the needle) indicates the direction of attention. In this experiment, the subject was cued to attend to the particular speaker annotated with “Attended” in this panel

4 Discussion

We present the first real-time implementation of an auditory attention BCI that uses ECoG signals and natural speech stimuli. The configuration of this system requires only two parameters: the cortical location and the delay between the audio presentation and the cortical processing. Our results can guide the selection of these parameters. For example, our results indicate that the underlying physiological mechanism is primarily focused on the temporal lobe, specifically the STG and MTG areas. Further, the neural tracking of attended speech is stronger and more widely distributed than that of unattended speech. This confirms results from a previous ECoG study that investigated auditory attention [22]. Finally, our study shows that the cortical delay between the audio presentation and the cortical processing is approximately 150 ms.

The presented results indicate that such a system could support BCI communication. While invasive, it may be justified for those affected by severe neurodegenerative diseases (e.g., late-stage ALS, locked-in syndrome) who have lost all muscular control and therefore cannot use conventional assistive devices or BCIs that depend on visual stimulation or feedback. Most importantly, the results suggest that sufficient communication performance (\({>}70\%\), [29]) could be achieved with a single electrode placed over STG or MTG. This finding is important because placement of ECoG grids as used in this study requires a large craniotomy, whereas a single electrode could be placed through a burr hole [30]. Furthermore, the electrodes in this study were placed subdurally (i.e., underneath the dura). Penetration of the dura increases the risk of bacterial infection [31,32,33,34,35], whereas epidural electrodes (i.e., electrodes placed on top of the dura) provide signals of approximately comparable fidelity [36, 37]. A single epidural electrode could thus reduce risk, which should make this approach more clinically practical.

In this study, we focused on demonstrating that one cortical location is sufficient for providing BCI communication. However, it is likely that combining the information from multiple cortical locations could substantially improve the communication performance. Thus, recent advances in clinically practical recordings of ECoG signals from multiple cortical locations [38, 39] could improve the clinical efficacy of the presented approach.

In comparison to many other auditory BCIs, the present approach has the unique advantage of using natural speech without any alteration. This aspect may be particularly relevant for those who are already at a stage where learning how to use a BCI has become difficult.

5 Conclusion

In summary, our study demonstrates the function of an auditory attention BCI that uses ECoG signals and natural speech stimuli. The implementation of this system within BCI2000 lays the groundwork for future studies that investigate its clinical efficacy. Once clinically evaluated, such a system could provide communication without depending on other sensory modalities or on an artificial mapping between stimulus and communication intent. In the near future, this could substantially benefit people affected by severe motor disabilities who cannot use conventional assistive devices or BCIs that require some residual motor control, including eye movement.