Forward Masking in Cochlear Implant Users: Electrophysiological and Psychophysical Data Using Pulse Train Maskers

Adel, Youssef; Hilkhuysen, Gaston; Noreña, Arnaud; Cazals, Yves; Roman, Stéphane; Macherey, Olivier

doi:10.1007/s10162-016-0613-5

Forward Masking in Cochlear Implant Users: Electrophysiological and Psychophysical Data Using Pulse Train Maskers

Research Article
Published: 21 February 2017

Volume 18, pages 495–512, (2017)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of the Association for Research in Otolaryngology Aims and scope Submit manuscript

Forward Masking in Cochlear Implant Users: Electrophysiological and Psychophysical Data Using Pulse Train Maskers

Download PDF

Youssef Adel ORCID: orcid.org/0000-0002-8968-5995¹^nAff2,
Gaston Hilkhuysen¹,
Arnaud Noreña³,
Yves Cazals³,
Stéphane Roman⁴ &
…
Olivier Macherey¹

450 Accesses
11 Citations
Explore all metrics

Abstract

Electrical stimulation of auditory nerve fibers using cochlear implants (CI) shows psychophysical forward masking (pFM) up to several hundreds of milliseconds. By contrast, recovery of electrically evoked compound action potentials (eCAPs) from forward masking (eFM) was shown to be more rapid, with time constants no greater than a few milliseconds. These discrepancies suggested two main contributors to pFM: a rapid-recovery process due to refractory properties of the auditory nerve and a slow-recovery process arising from more central structures. In the present study, we investigate whether the use of different maskers between eCAP and psychophysical measures, specifically single-pulse versus pulse train maskers, may have been a source of confound.

In experiment 1, we measured eFM using the following: a single-pulse masker, a 300-ms low-rate pulse train masker (LTM, 250 pps), and a 300-ms high-rate pulse train masker (HTM, 5000 pps). The maskers were presented either at same physical current (Φ) or at same perceptual (Ψ) level corresponding to comfortable loudness. Responses to a single-pulse probe were measured for masker-probe intervals ranging from 1 to 512 ms. Recovery from masking was much slower for pulse trains than for the single-pulse masker. When presented at Φ level, HTM produced more and longer-lasting masking than LTM. However, results were inconsistent when LTM and HTM were compared at Ψ level. In experiment 2, masked detection thresholds of single-pulse probes were measured using the same pulse train masker conditions. In line with our eFM findings, masked thresholds for HTM were higher than those for LTM at Φ level. However, the opposite result was found when the pulse trains were presented at Ψ level.

Our results confirm the presence of slow-recovery phenomena at the level of the auditory nerve in CI users, as previously shown in animal studies. Inconsistencies between eFM and pFM results, despite using the same masking conditions, further underline the importance of comparing electrophysiological and psychophysical measures with identical stimulation paradigms.

Exploring the Use of Interleaved Stimuli to Measure Cochlear-Implant Excitation Patterns

Article Open access 08 March 2024

Effect of Pulse Rate and Polarity on the Sensitivity of Auditory Brainstem and Cochlear Implant Users to Electrical Stimulation

Article Open access 03 July 2015

Effect of Pulse Polarity on Thresholds and on Non-monotonic Loudness Growth in Cochlear Implant Users

Article 30 January 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

INTRODUCTION

Psychophysical forward masking (pFM) refers to the increase in detection threshold of a probe when presented after a masker, compared with the probe’s unmasked threshold. This effect can persist up to several hundreds of milliseconds in both normal-hearing (NH) and cochlear implant (CI) listeners and depends on the duration and level of the masker (Plomp 1964; Shannon 1990). The amplitude-normalized temporal course of masking was found to be similar for these two subject groups, and it has been thus hypothesized that pFM likely involves processes at or more central than the auditory nerve (Shannon 1990).

Contemporary CIs are able to record the composite auditory nerve response to electrical stimulation, a measure known as the electrically evoked compound action potential (eCAP). Using different recording paradigms, Brown et al. (1990) and Morsnowski et al. (2006) used this functionality to measure the eCAP response to a probe pulse presented after a single-pulse masker for several masker-probe intervals. Both studies used masker and probe levels at or just below the loudest acceptable levels. They showed that eCAP recovery from forward masking (eFM) was more rapid than what is typically found in pFM and that the masker had no effect on the probe response when the interval was larger than a few milliseconds. Morsnowski et al. (2006) reported absolute refractory periods and exponential recovery from masking with time constants in the range of a few hundreds of microseconds for human CI users. These findings agree well with single-fiber recordings in cats using a similar paradigm (Miller et al. 2001). Also, using single-pulse maskers, Nelson and Donaldson (2001) reported that psychophysical recovery from forward masking was dominated by a rapid-recovery process similar to that observed in physiological recovery functions and, therefore, suggested that this process reflects refractory properties of the auditory nerve.

Apart from Nelson and Donaldson (2001), most pFM studies in CI users employed maskers with long durations, i.e., pulse train maskers. For example, using 320-ms maskers, Nelson and Donaldson (2002) found slow recovery functions with a mean time constant greater than 50 ms, which is consistent with data from previous studies in both NH and CI listeners. They thus theorized that the slow-recovery pFM is mediated by more central processes in both subject groups. Lee et al. (2012) reached a similar conclusion when comparing younger with older CI users. Although eCAP recovery from single-pulse maskers showed no difference between these groups, psychophysical detection thresholds showed slower recovery in older CI users. Due to this difference in performance, the authors proposed that changes in the central auditory system may be the main contributors to slow recovery from pFM, rather than peripheral mechanisms.

However, the use of different maskers between eCAP and psychophysical measures, specifically single-pulse versus pulse train maskers, may have been a source of confound. Two animal studies showed that the amount of masking generated at the level of the auditory nerve depends on various properties of the masker, including its duration, pulse rate, and current level: Using 100-ms sinusoidal electrical maskers in guinea pigs, Killian et al. (1994) found eCAP recovery functions that were sometimes incomplete, even after 500 ms. More recently, Miller et al. (2011) measured single-fiber responses in cats following 300-ms pulse train maskers and found a decrease in the response probability to a probe stimulus for several hundreds of milliseconds. These observations are in contrast with the fast recovery obtained with single-pulse maskers. The first aim of the present study was to test whether such long eFM recovery could be observed at the level of the auditory nerve in human CI users when using pulse train maskers.

Miller et al. (2011) also found that the masker pulse rate had a significant effect on eFM recovery. Specifically, for a fixed current level, single units recovered faster after a 250-pps than after a 5000-pps pulse train masker. This observation has relevance for contemporary CI speech coding strategies, which continuously stimulate the auditory nerve at high stimulation rates up to 2500 pps per channel depending on the speech processing strategy (reviews in Loizou 1998; Zeng et al. 2008), yielding aggregate rates with possibly an order of magnitude higher. Such high stimulation rates are likely to induce longer adaptation than lower rates (Miller et al. 2008; Miller et al. 2011). The second aim of this study was to test whether recovery from eFM slowed down as a function of masker rate using the same low and high rates investigated by Miller et al. (2011). In human CI listeners, a direct comparison between low and high rates is further complicated by the fact that both stimuli do not elicit the same loudness percept when presented at the same current level. When increasing the stimulation rate in CI users, the current level needs to be decreased in order to maintain the same loudness (Kreft et al. 2004). This decrease in current level may in turn decrease the amount of adaptation (Miller et al. 2011). Therefore, changing the electrical stimulation rate in CI strategies may have two opposite effects on the amount of adaptation.

These study aims were addressed in experiment 1 where we compared eFM functions for low-rate and high-rate pulse train maskers in two cases: when presented at the same current level and when presented at loudness-balanced levels. A single-pulse masker served as reference. In experiment 2, we attempted to relate our eFM findings with perception in order to understand which part of the percept may be explained by the auditory nerve response. To that end, pFM data using the same stimuli as used for eFM are also reported.

EXPERIMENT 1: ELECTROPHYSIOLOGICAL FORWARD MASKING

Methods

Subjects

Experiment 1 included nine adult CI users (U01–U09). They were implanted with Cochlear CI24RE implants (Cochlear Limited, Sydney, Australia), except for U05 who had the Cochlear CI512 implant. All had Contour Advance electrode arrays with 22 intracochlear contacts. Table 1 provides more information on the subjects. They received financial compensation and reimbursement of their traveling costs. A local ethics committee approved this study (Eudract 2012-A00438-35).

TABLE 1 Subject information

Full size table

Stimuli

The eCAPs were measured in response to a probe stimulus, a single pulse set at a current level eliciting an N₁-P₂ amplitude response of approximately 50 μV when no masker (NM) was present. In addition, the probe response was measured in five masking conditions, where masker duration, pulse rate, and current level were varied. Maskers were either (1) a single-pulse masker (SPM), (2) a 300-ms low-rate pulse train masker (LTM) at 250 pps, or (3) a 300-ms high-rate pulse train masker (HTM) at 5000 pps. HTM was always presented at a current level evoking comfortable loudness (C level). LTM was presented either at the same physical current level (Φ) or at a level eliciting the same loudness as HTM, i.e., psychophysical level (Ѱ). SPM was also presented at two different current levels, equal to those of LTM. Besides masker levels, the masker-probe interval (MPI) was varied as an experimental condition. This interval was defined by the time between the offset of the masker and the onset of the probe and was 2ⁿ ms, where n is an integer in the range 0–9. Figure 1 provides an overview of the masking conditions.

All electrical pulses were biphasic, symmetric, and rectangular. They were presented in monopolar mode. The two phases had durations of 25 μs each and were separated by an 8-μs inter-phase gap. All masker stimuli (SPM, LTM, and HTM) had pulses with leading cathodic phases, while the leading polarity of the probe was alternated.

Procedures

Prior to data collection, we determined whether eCAP responses could be detected. If such was the case, the eCAP growth function was measured and the masker levels were established. Finally, forward masking was determined electrophysiologically by measuring eCAP recovery functions. These procedures are explained in the following subsections.

eCAP Detection

The eCAP detection procedure checked whether a response could be detected and determined the current level eliciting an N₁-P₂ amplitude response of approximately 50 μV. It started by choosing a stimulating electrode in the apical or middle range, with the external electrode (MP1) as reference. Initially, the recording electrode was two positions apical to the stimulation electrode, with the CI stimulator housing (MP2) as reference. The probe was presented at a rate of 80 pps. Its current level was increased in small steps, ranging from 0.16 to 0.78 dB, while monitoring loudness acceptability on a 10-point rating scale. The alternating polarity method (e.g., Miller et al. 1998) was used for artifact suppression. The procedure terminated once the probe evoked an N₁-P₂ amplitude response of approximately 50 μV. If no eCAP response was found or if no sufficient amplitude could be obtained before the CI user rated the loudness above C level, i.e., rating 7 on the 10-point scale, then recording gain, recording electrode position, or both stimulating and recording electrodes were varied and the detection procedure was started again. If the detection of an eCAP response was successful, the amplitude growth as a function of stimulation level was measured with a low probe rate of 14 pps to limit the effect of neural adaptation (Clay and Brown 2007). Again, subjective loudness acceptability was monitored during the procedure. Table 1 shows the electrodes that were used for each subject.

Establishing Masker Levels

To establish masker levels, LTM or HTM was presented at increasing current level while monitoring loudness perception on the 10-point rating scale to determine comfortable loudness levels. For each masker, the procedure terminated when a loudness rating of 8 (labeled as “loud”) was reached. Subsequently, the loudness of LTM was balanced to that of HTM using the following adjustment paradigm: Each loudness-balancing trial consisted of two pulse trains presented consecutively with a 500-ms inter-stimulus gap. The first pulse train was the reference and its level was fixed across the adjustment. The current level of the second stimulus pulse train was adjustable in step sizes 0.16, 0.31, or 0.47 dB with a graphical user interface provided to the subjects. They were asked to balance its loudness to the first pulse train and were encouraged to make over- and undershoots before deciding on the final level. First, HTM was the reference and was set to the current level invoking a rating of 6 (labeled as “most comfortable”), while LTM was initially set to the current levels invoking a rating of 3 or 7 (labeled as “soft” and “loud but comfortable,” respectively). This procedure was carried out once for each of the previously mentioned initial levels of LTM. Then, a new procedure was carried out with LTM as the reference. Its level was fixed at the average of the two final LTM levels adjusted to the loudness of HTM. The subject next adjusted the level of HTM, which was also initially set to the current levels invoking a rating of 3 or 7. The loudness-balanced level of LTM was calculated by

$$ {L}_B\left(\mathrm{LTM}\right)={L}_{r6}\left(\mathrm{HTM}\right)+\frac{1}{2}\times \left[\overline{L}\left(\mathrm{LTM}\right)-{L}_{r6}\left(\mathrm{HTM}\right)+\overline{L}\left(\mathrm{LTM}\right)-\overline{L}\left(\mathrm{HTM}\right)\right] $$

(1)

where L _B is the loudness-balanced level, L _r6 is the level invoking a loudness rating of 6, and $ \overset{-}{L} $ is the average of the two final levels set by the subject during the adjustment paradigm. L _B(LTM) was thus defined as having same loudness (i.e., psychophysical level Ѱ) as HTM at its fixed current level L _r6(HTM). Presenting LTM at L _r6(HTM) was defined as having the same physical current level (Φ) as HTM.

Forward Masking: eCAP Recovery Functions

Recovery functions were obtained with measurement sequences, each of which started with a 200-ms pulse train at a rate of 10,000 pps and at current level zero to power up the implant. A measurement sequence consisted of a masker stimulus, a defined MPI, and a probe pulse with a given leading polarity. After a 98-μs measurement delay, 32 points were recorded at a 20-kHz sampling rate. The gap between a probe and the onset of a subsequent masker was 400 ms. Artifact suppression was accomplished using a modified alternating polarity method, which compensated for the pulse train masker artifact. Ordinarily, the alternating polarity method measures responses for cathodic-leading (A ₁) and anodic-leading (A ₂) biphasic pulses. Inverting the polarity of the stimulus results in inverting the polarity of the artifact, whereas the polarity of the neural response remains the same. Averaging these two measurements cancels out most of the stimulus artifact. In this study, each measurement sequence had a preceding pulse train masker, which itself could generate an artifact. Thus, a third measurement (B) with the probe at current level zero was carried out. The neural response as a function of time (t) was finally calculated by

$$ \mathrm{eCAP}(t)=\frac{A_1(t)+{A}_2(t)}{2}- B(t). $$

(2)

The telemetry system allows one to average neural responses on the internal memory (so-called sweeps) before transferring the averaged data back to the computer. Each measurement sequence consisted of 8 sweeps and was performed 8 times, effectively resulting in 64 neural responses per experimental condition.

Measurement sequences were grouped in blocks with a defined order and set of experimental conditions. Each block had a fixed MPI of 2ⁿ ms, where n is an integer in the range 0–9. A block started with an NM condition as control, followed by five combinations of different masker types (SPM, LTM, and HTM) and stimulation levels (Φ and Ѱ) in a randomized order. After the last block within a session, another measurement was conducted in the NM condition.

Sessions

Given a particular position of the testing electrode, apical or middle, two sessions were conducted. In the first session, we determined whether neural responses could be measured using the eCAP detection procedure. After defining measurement electrodes, the eCAP growth function was measured. Masker loudness growth was determined and loudness balancing was conducted, thereby establishing masker levels. In the second session, we first confirmed the eCAP growth function and subject’s tolerance of the masking levels. Then, eCAP recovery functions were measured for different experimental conditions. Each session had a duration of approximately 3 h, whereas the second session was interrupted in case no forward masking was found for MPIs of 4 ms or less. At the start and end of each session, electrode impedances were checked.

Material

A Cochlear Pod served as a USB interface to the speech processors. The eCAP detection was conducted using the clinical software Cochlear Custom Sound EP 3.2 and the Cochlear SP12 speech processor. Masker loudness growth and loudness balancing were conducted using the APEX research platform (Laneau et al. 2005) and the Cochlear L34 speech processor. The eCAP forward masking measurement sequences were programmed in Python using the Nucleus Implant Communicator (NIC2) software interface and again the Cochlear SP12 speech processor.

Analysis

Analysis of eCAP responses was done in MATLAB (MathWorks, Inc., Natick, MA, USA). After artifact suppression using the modified alternating polarity method, each eCAP trace consisted of 32 points, sampled at 20 kHz. These traces were interpolated at the 10-fold sampling frequency using shape-preserving piecewise cubic interpolation (interp1 function in MATLAB). Then the minimum and maximum amplitude points corresponding to N₁ and P₂, respectively, were calculated using zero crossings of the first derivative. Their difference was defined as the eCAP amplitude. Estimates for each experimental condition were calculated from the mean of eight eCAP amplitudes, each consisting of eight sweeps (internal memory averages).

To validate eCAP responses and establish the noise floor, all traces were checked by visual inspection. Three judges blinded to the experimental condition of a given trace classified the response as “eCAP present” or “eCAP absent.” Each judge checked all traces three times, with the majority defined as their respective judgment, and the majority judgment defined as the final judgment. Due to the stochastic nature of eCAP generation, the noise floor was defined when 25 % or less traces were classified as eCAP present in the final judgment.

Finally, to quantify eFM over time, eCAP recovery functions were fitted to an exponential model adopted from Morsnowski et al. (2006)

$$ V( t)={V}_{\infty}\times \Big(1- \exp \left( c- t/\tau \right)\Big) $$

(3)

where V(t) is the eCAP amplitude for a given time interval t between the offset of the masker and the onset of the probe (i.e., MPI), V _∞ is the average of all measurements in the NM condition, τ is the time constant of the exponential decay, and c is a constant representing the absolute refractory period. The Nelder-Mead simplex algorithm (fminsearch function in MATLAB) was used to fit the parameters τ and c using unconstrained nonlinear optimization.

Additional statistical analysis was done using SPSS Statistics (IBM Corporation, Armonk, NY, USA) and MLwiN (Centre for Multilevel Modelling, University of Bristol, England).

RESULTS

Probe and Masker Levels

Probe levels determined in the eCAP detection procedure and established masker levels are shown in Table 2 for all subjects. When examining eFM data, one needs to consider the masker to probe level differences, which inherently affect the amount of forward masking. The eCAP detection procedure yielded a mean probe level of 53.4 dB re 1 μA. Psychophysically established HTM levels had a mean of 49.9 dB re 1 μA, which in 13/15 cases were lower than the respective probe level. LTM at Φ level was per definition at the same respective current level as HTM. LTM at its loudness-balanced Ѱ levels had a mean level of 54.3 dB re 1 μA, and in 9/15 cases was greater than the respective probe level. Loudness-balanced LTM always had current levels greater than HTM with a mean difference of +4.4 dB. This level difference as a function of stimulation rate is in accordance with previous findings (Kreft et al. 2004), showing that higher current levels are needed to achieve the same loudness percept with a low-rate pulse train (LTM at 250 pps) compared with a high-rate pulse train (HTM at 5000 pps).

TABLE 2 Probe and masker levels

Full size table

At these masker and probe levels, only subjects U01–U05 showed eFM at both electrode positions. Their results demonstrate the general trend of lower HTM (8/10 cases) and higher loudness-balanced LTM (9/10 cases) current levels than corresponding probe levels. For the remaining subjects U06–U09, no masking was found for MPI > 4 ms or no masking was evident at all. They are thus excluded from further analysis of eFM. For this subject group, it appears that the differences between established masker levels (for LTM and HTM) and respective probe levels were generally larger than those for U01–U05, i.e., established masker current levels were relatively lower (c.f. Table 2). To assess the contribution of the masker-probe current level difference to the probability of observing forward masking, we fitted the following binary logistic regression model to the data:

$$ \mathrm{logit}\ P(Y)={\beta}_{0, u}+{\beta}_1{X}_1+{\beta}_2{X}_2+{\beta}_3{X}_1{X}_2+\varepsilon $$

(4)

where X ₁ is the difference between masker and probe current levels (scalar predictor), X ₂ is the pulse train masker (categorical predictor with baseline LTM = 0 and HTM = 1), β _0 , u is a random intercept across CI users u ∈ {1, … , 9}, β ₁ is the main effect coefficient of X ₁, β ₂ is the main effect coefficient of X ₂, β ₃ is the interaction effect coefficient of X ₁ X ₂, and ε is the residual term. The categorical outcome measure Y was defined as “masking” = 1 when forward masking was found for MPIs greater than 4 ms; otherwise, it was defined as “no masking” = 0. Table 3 shows the fitted coefficients and their respective Wald statistic and odds ratio. The model was able to correctly predict 84.4 % of the observed outcomes. The Wald statistic confirms that both predictors X ₁ and X ₂ have significant contributions to the model (p = .016 and p = .044, respectively), which is not the case for their interaction nor for the random intercept. The odds ratio of β ₁ at 4.179 shows that a larger positive difference between masker and probe current levels increases the odds that the outcome measure “masking” occurs. This suggests that the negative differences between established masker levels and respective probe levels for subjects U06–U09 did contribute to the absence of forward masking.

TABLE 3 Fitting parameters of the binary logistic regression model

Full size table

The very large odds ratio of β ₂ indicates that the model saturates in the step from LTM = 0 (baseline) to HTM = 1, which can be traced to the fact that masking occurred in 9/30 cases with LTM (Φ and Ѱ) as opposed to 10/15 cases with HTM (Φ = Ѱ). To alleviate this problem, one could remove the corresponding variable from the model and collapse the data or just consider the baseline data, i.e., only LTM at Φ and Ѱ levels. Such a simplified model includes only the random intercept and the main effect of X ₁. Still, fitting this model showed a significant contribution (p = .019, data not shown here) of the predictor X ₁ and not the random intercept, which further underlines the importance of the masker-probe current level difference to the probability of forward masking in this CI subject group.

eCAP Amplitude Estimates

An illustrative example of eCAPs in response to the masked probe for subject U05, middle electrode, is shown in Figure 2. All maskers were presented at Ѱ level for MPI in the range of 1–512 ms. Note that the eCAP response starts to recover at MPI = 4 ms for SPM, which is indicated by the black triangle. This is in contrast to pulse train maskers where valid responses were first detected at MPI = 32 ms for LTM and at MPI = 128 ms for HTM. Furthermore, eCAP responses to SPM appear to rapidly reach full recovery, which was not the case for either pulse train masker.

The eCAP N₁-P₂ amplitudes were used as a measure of eFM and are shown as a function of MPI in Figures 3 and 4 for each subject and electrode and for each masker presented at Φ and Ψ level, respectively: white circles for SPM, gray squares for LTM, and black triangles for HTM. The average of all amplitude estimates in the NM condition is shown as a solid horizontal line and two times the standard deviation (σ) below that (i.e., 95.45 % confidence interval) as a dashed horizontal line. Masking was considered when amplitude estimates were lower than NM − 2σ. The NM condition range was 30.5–86.7 μV with a mean of 50.4 μV. For all subjects, eCAP measurements in NM condition over the course of the experimental sessions showed some variability but did not demonstrate any sign of neural fatigue (data not shown here).

All eCAP responses were also validated by visual inspection to establish the noise floor (see above). Krippendorff’s alpha coefficient (Hayes and Krippendorff 2007), a measure of inter-judge reliability, was α = 0.87. The probability of failure to achieve α _min = 0.90 was 0.73, which indicates very high judgment reliability. The noise floor, which was defined when 25 % or less traces were valid eCAP responses, was in the range of 9–12 μV. This range is consistent with noise floor values previously reported for the CI24RE device (McKay et al. 2013a). The noise floor is shown as a shaded area in each eCAP recovery plot. The mean dynamic range between NM level (average of all amplitude estimates in the NM condition) and respective noise floor was 13.4 ± 2.2 dB re 1 μV.

eCAP Recovery Functions

In cases where forward masking occurred, i.e., when eCAP amplitude estimates were lower than NM − 2σ, eCAP recovery functions were fitted to the exponential model described in Eq. 3. These are shown in Figures 3 and 4 as a function of MPI for each masker: dashed lines for SPM, dotted lines for LTM, and dash-dotted lines for HTM. Tables 4 and 5, respectively, show values for V _∞, which is the average of all measurements in the NM condition, and V _∞ − 2σ (two times the standard deviation). Fitted parameters τ and c are shown for each masking condition, in addition to intersections with the time axis T ₀ for SPM and intersections with the amplitude axis V ₀ for LTM and HTM.

TABLE 4 Fitting parameters of the eCAP recovery model, pulse train maskers at same physical current level (Φ)

Full size table

TABLE 5 Fitting parameters of the eCAP recovery model, pulse train maskers at same psychophysical loudness level (Ѱ)

Full size table

At Φ level (Fig. 3 and Table 4), HTM showed masking in 9/10 cases (exception was U02, middle electrode) with long recovery time constants τ in the range 85.9–428.6 ms. Except for one case (U05, middle electrode), none of the subjects tested showed masking for SPM or LTM at this current level. For the middle electrode of subject U05, LTM had τ = 128.3 ms, which was shorter than that for HTM with τ = 339.7 ms. Interestingly, this case showed relatively long forward masking for SPM with τ = 12.7 ms and the fitted recovery function had a V ₀ = 6.69 μV. This is contrary to what is otherwise observed for single-pulse maskers. Note, however, that this case had a high noise floor at 28.7 % re V _∞ and a high standard deviation for the NM condition at 20.35 % re V _∞.

At Ѱ level (Fig. 4 and Table 5), LTM showed masking in 8/10 cases (exception was U04, both electrodes), with recovery time constants τ in the range 128.5–286.6 ms, while two cases apparently did not recover from masking after 128 ms (U01, both electrodes) and one case even after 256 ms (U05, apical electrode). LTM generally showed less forward masking than HTM except in two cases (U01 and U02, middle electrodes), with mean τ = 176.85 ± 83.46 ms for LTM versus mean τ = 246.38 ± 116.70 ms for HTM. In one case (U01, middle electrode), LTM showed longer masking with τ = 208.6 ms compared with τ = 85.9 ms for HTM. Interestingly, in the one case where HTM showed no masking (U02, middle electrode), LTM achieved forward masking with τ = 40.5 ms. Both of these exceptions could not be traced back to differences in current level (c.f. Table 2). In all cases, SPM showed shorter forward masking than both pulse train maskers, with τ in the range 0.7–5.33 ms and mean τ = 3.11 ± 1.44 ms. Note that while having the same current level as LTM, SPM achieved more masking at MPI = 1 ms in six cases (U02, both electrodes; U03, apical electrode; U04, both electrodes; and U05, apical electrode), which is discussed below.

DISCUSSION

In experiment 1, eFM in CI users was found to be much longer with both pulse train maskers than with single-pulse maskers, with time constants over 100 ms in contrast to a few milliseconds, respectively. Single-pulse maskers were found to produce at least as much masking as pulse train maskers (low-rate, LTM; and high-rate, HTM) at the shortest masker-probe intervals. Data at the same physical current level (Φ) showed that high-rate pulse trains can mask more and longer than low-rate pulse trains or single pulses. Data at the same psychophysical loudness level (Ѱ) had variable results; high-rate pulse trains did not always produce more masking than low-rate ones. This underlines the importance of comparing these maskers at the same perceptual level. In the next experiment, we attempt to relate our eFM findings to perception in order to investigate whether psychophysical masking may be explained by neural masking at the level of the auditory nerve.

In the following, we discuss why long eFM was not observed in a subset of subjects for the given experimental conditions. Then, we examine potential limitations of the method used to extract the neural response from telemetry recordings. Finally, we discuss a possible explanation of how single pulses could mask more than pulse trains at the shortest masker-probe intervals.

Current Level Differences

In our subject group and for the given conditions, long eCAP recovery from forward masking was measurable in 5/9 subjects, i.e., 10/15 electrodes. Since the absence of forward masking in one electrode was an exclusion criterion, not all subjects were tested in two electrode positions. When considering all tested electrodes, the binary logistic regression model showed a significant contribution of the difference between masker and probe current levels to the outcome of forward masking at MPIs greater than 4 ms, where a larger positive difference increased the odds that forward masking occurred. The model also showed that the pulse train masker rate (LTM or HTM) had a significant contribution to that outcome.

Inspection of the loudness-balancing results (c.f. Table 2) revealed that the current level difference between LTM and HTM at equal loudness was larger on average for subjects who did not show masking (5.7 dB) than for subjects who showed masking (3.7 dB), with a significant group difference, two-tailed Student’s t test, p = .0048. It can be thus hypothesized that this subject group either had less tolerance to high current levels when determining the loudness of pulse train maskers or had more temporal integration of loudness for pulse trains. However, the additional variability of probe current levels between subjects makes it difficult to draw such conclusions from the present data.

Our study design determined the current level eliciting an N₁-P₂ amplitude response of approximately 50 μV and did not fix the probe current level across subjects (typically just below the loudest acceptable level), which was motivated by several reasons; in order to maximize the chances of measuring forward masking of the probe response, the current level had to be low enough to avoid recruiting too many nerve fibers. However, the eCAP amplitude range needed to be sufficiently above the noise floor to allow statistical analysis. Still, the large variance of eCAP amplitude growth functions between subjects and the low resolution of stimulation levels in CI made this difficult to achieve (results showed mean NM level of 50.4 μV, in the range 30.5–86.7 μV). More importantly, while eCAP studies typically use single-pulse probes at high current levels to obtain clear responses, these levels may not always have clinical relevance. When presented as a pulse train, such high current levels would most likely exceed acceptable loudness. Here, for the subject group that showed forward masking, probe current levels were generally higher than HTM (5000 pps) and lower than LTM at Ψ level (250 pps; c.f. Table 2). Consequently, these probe current levels can be assumed to be at or below comfortable loudness when presented at clinical stimulation rates. Our eCAP recovery functions for pulse train maskers thus confirm the presence of slow-recovery phenomena at the level of the auditory nerve in CI users, most probably at clinical stimulation rates and levels.

Probe Stimulus Polarity

Artifact suppression of eCAP recordings could have also been achieved using the forward-masking paradigm (Brown and Abbas 1990; Brown et al. 1990). This method adds a preceding masker stimulus to evoke a refractory response to the probe stimulus, which is in turn used to reduce the artifact from the probe response. The paradigm can be adjusted to use pulse train instead of single-pulse maskers (Abbas and Brown 2015). With increasing MPI, the forward-masking paradigm would record an eCAP equal to the difference between the unmasked and the masked probe responses (Miller et al. 2000), which can be used to derive the masked probe response. In the current experiment, we used a modified alternating polarity method which has at least two limitations. First, it has been shown that the averaged waveform (from both polarities) produces significantly smaller amplitudes and higher thresholds than that obtained with the forward-masking method for short MPIs (Baudhuin et al. 2016; Eisen and Franck 2004; Frijns et al. 2002). This can be traced back to previously observed differences in eCAP amplitude and latency for cathodic- versus anodic-leading biphasic stimulation (Macherey et al. 2008). In this regard, the second limitation of our method is the inability to separate differences in masking between polarities. In a guinea pig model, Matsuoka et al. (2000) found that the amount of adaptation measured with eCAPs was greater for anodic than for cathodic stimulation. It is, therefore, possible that the long recovery time constants observed were mainly due to one polarity.

To evaluate this possibility, we mathematically derived eCAP responses using the forward-masking paradigm from our existing data set, isolating the responses using either anodic- or cathodic-leading probe pulses. The results had overall more noise (data not shown here), which may be due to the calculation requiring more averages and/or the maskers being at suboptimal current levels with regard to the probe levels. Nevertheless, when the data could be analyzed, there was no evidence of one leading polarity showing more masking than the other in the derived forward-masking paradigm.

Neural Response Alternation

Another observation was that the single-pulse masker could sometimes produce more masking than pulse train maskers at MPI = 1 ms in six cases (see Fig. 4). A possible explanation for this unexpected result is the alternating response pattern observed in eCAP recordings to each pulse in a train (Hughes et al. 2012; Rubinstein et al. 1999). At specific pulse rates, individual eCAP responses sometimes alternate between higher and lower amplitudes for odd and even pulse train counts or vice versa. This pattern is thought to be the result of variance in absolute and relative refractory periods for different auditory nerve fibers. Since the pulse train maskers LTM and HTM each had an even number of pulses, it could be argued that a direct comparison with masking from SPM (single pulse, i.e., odd number of pulses) oversees the possible alternating response pattern.

In a control experiment, we collected additional eCAP data for all subjects (U01–U05, both electrode positions) and for all maskers (SPM, LTM, and HTM) at Ѱ level and at MPIs 1, 8, and 64 ms. The following experimental conditions were added for each pulse train masker: −1 pulse, ±0 pulse, and +1 pulse, i.e., removing a pulse, no change, and adding a pulse at the beginning of the pulse train, respectively. The results were similar to those measured in experiment 1 (data not shown here) and confirmed that the long time constants of eCAP recovery from LTM and HTM were independent from the parity of the pulse counts. As previously observed, SPM masked more than LTM at MPI = 1 ms in 3/10 cases and more than HTM in 4/10 cases but this did not depend on whether the maskers had an even or odd number of pulses. The absence of an effect for HTM can be explained by the so-called “stochastic independence” state for high-rate stimulation, where alternating response patterns are no longer observed when reaching a sufficiently high rate (Hughes et al. 2012). Low-rate stimulation below 200 pps has been shown to yield steady eCAP amplitudes across individual pulses (Wilson et al. 1997), where nerve fibers have the time to recover from depolarization. It is thus possible that LTM at 250 pps had a rate too low to show the alternating response pattern in these subjects.