Introduction

Exchange of the sound processor of cochlear implants (CI) allows existing implant recipients to take advantage of any advances in sound processor technology by exchanging or upgrading their current processor to a newer model [1, 2]. Funding of processor upgrade differs from one country to another. Considering the high prices of the processors, the benefit provided by new processors must be demonstrated.

In 2013, Advanced Bionics (AB, Stäfa, Switzerland) introduced the Naída CI Q70 (Naída CI) sound processor. As well as being compatible with the newest AB cochlear implant systems, it was also compatible with the existing HiRes 90K™ and CII™ cochlear implant systems and therefore existing recipients of AB devices, who were using older sound processor types, could be upgraded to the newer technology. In addition to the functions and sound-processing technology already available in the previous generation sound processors, the Naída CI introduced an acoustic signal-processing beamforming technology called UltraZoom, which was already used in Phonak hearing aids (Nyffeler, Reference Note 1). The intention was to help AB implant recipients to communicate more easily and effectively in noisy environments, which still remains a challenge, even for the best-performing recipients [3].

UltraZoom is an adaptive multi-channel dual-microphone beamformer that focuses on input originating from in front of the listener, while attenuating sounds coming from the sides and the rear (Fig. 1). It works by exploiting timing and phase differences in the signal arriving at two spatially separated front and back omnidirectional microphones, positioned on top of the processor. The inputs from the two microphones are subtracted from each other, after applying an appropriate delay, and a front-facing directionality pattern is created, reducing input from the rear hemisphere and creating a null point where sounds are completely attenuated. The adaptive nature of UltraZoom allows it to constantly change the directionality of the null, based on the loudest noise source in 33 separate channels, thus suppressing moving noise sources as well as static ones [4].

Fig. 1
figure 1

Polar plot showing UltraZoom performance on KEMAR left ear

Previous studies evaluating adaptive beamforming technology with CI devices have shown that it can significantly improve the perception of speech in noise [5,6,7,8,9]. Geißler et al. [4] tested UltraZoom as implemented in the Naída CI in ten subjects and showed significant improvement in speech perception in noise in a variety of challenging and realistic conditions, when compared with the Harmony sound processor. However, subjects had no take-home experience with the new sound processor and therefore it is not known if they would have been able to transfer these gains shown in the laboratory, into the real world. This is a potential issue for all beamforming technologies, as CI users report smaller subjective benefits than expected from laboratory testing [5]. In part, this may be due to the fact that listeners often find themselves in situations where speech and noise sources are not sufficiently spatially separated, particularly in reverberant environments, which results in cancelation of the speech signal as well as the noise and reduces the signal to noise advantages gained [10, 12]. In the previous studies where subjective measures have been reported, two failed to show a significant improvement in subjective performance with the beamforming technologies using the Speech Spatial Qualities questionnaire, even though the objective results did show a significant benefit [5, 10]. Only Mosnier et al. [1] showed a significant improvement in performance in both objective and subjective measures using the Abbreviated Profile of Hearing Aid Benefit (APHAB) [13], when subjects using Cochlear Ltd. devices were upgraded to the newer CP810 speech processor with additional directionality.

The objectives of this study were to compare the performance of a group of existing AB cochlear implant users, who were upgraded to the new Naída CI sound processor, in a test of speech perception in noise with and without UltraZoom and to compare their subjective performance with their current sound processor, to their subjective performance after upgrading to the new Naída CI sound processor.

Methods

Subjects

From February to November 2015, 34 adult subjects aged between 21 and 89 years (mean 52.8 ± 18.5) were prospectively enrolled in a single tertiary referral center. Subjects were required to have at least one CII/HiRes 90K cochlear implant, a postlingual onset of severe-to-profound hearing loss (≥6 years of age) and French as their first language. The demographic data of these subjects are presented in Table 1. Nine subjects were unilaterally implanted, 11 bilaterally implanted and 14 were bimodal users with a hearing aid on the contralateral ear. All subjects were experienced CI users (5–14.7 years, mean 6.9 ± 1.8) who were due to get a processor upgrade to the Naída CI as part of their routine clinical care. A repeated measures design was used, where subjects acted as their own controls.

Table 1 Demographic details of the population

Fitting

At the baseline visit, subjects were fitted with a loaner sound processor for the purposes of testing, identical to their current processor. This was to ensure that all microphones were new and working optimally. It was programmed with their current clinical program, including the speech enhancement algorithm ClearVoice™ [14] as well as the T-Mic™ microphone setting (microphone placed within the concha) [6, 15], if this was used on an everyday basis. They were then upgraded to a new Naída CI sound processor, programmed with the same current clinical program and an identical clinical program plus UltraZoom. The T-Mic microphone and ClearVoice algorithm continued to be used with the Naída CI if they had been used with the original processor. They were given a minimum of a 2 months take-home trial with the Naída CI sound processor, where they were encouraged to use UltraZoom in appropriate situations, where speech was coming from the front and noise from the back and sides of the recipient. The Advanced Bionics SoundWave™ programming software was used and all program parameters remained the same, unless the subject was not happy with the sound quality, in which case alterations to the current clinical program were made accordingly. All bilateral CI users except one were upgraded on both sides.

Speech perception measures

Speech understanding in quiet was evaluated with two lists of 17 monosyllabic words each (Lafon lists) presented at 60 dB SPL from a source based at 1 m in front of the subject. Speech understanding in noise was measured with the Matrix sentence test in French [16], which is an adaptive test based on the Oldenburg Sentence Test (OlSa) [17]. The subjects were asked to repeat semantically unpredictable sentences, which always had the same structure: name, verb, number, common name and color. A speech reception threshold (SRT) was automatically measured by adjusting signal to noise ratio until a 50% word understanding score was reached. A lower SRT means a better performance. Prior to testing, at least two practice lists (each containing 20 sentences) were presented to the subject to avoid training effects during the test.

Sentences were presented from a loudspeaker located 1 m in front of the subject (0° azimuth). Non-correlated stationary speech-shaped noise (SSN) was presented at a fixed level of 65 dB SPL simultaneously from all three loudspeakers positioned at ±90° and 180° to simulate a diffuse noise environment. The level of the speech signal was varied to adjust the signal to noise ratio. A low to moderately reverberant room was used, with a T60 of around 0.3 s.

Subjects were evaluated while listening with the technology that they utilized in their daily environments; participants with a Naída CI processor on one ear and a contralateral hearing aid were tested with both devices together, and bilateral participants (two Naída CI processors or one Naída CI processor with another processor type contralaterally) were tested with both devices turned on. The contralateral hearing aid was not fitted or changed during the follow-up period.

At the baseline visit, speech perception was measured with words in quiet with the current sound processor and sentences in noise with the current sound processor and the new Naída CI sound processor in the omnidirectional microphone mode, without UltraZoom. At the follow-up visit, 2 months later, speech perception was measured in quiet with the Naída CI processor without UltraZoom and in noise with the Naída CI sound processor with and without UltraZoom (Table 2). The order of the speech test lists and the test conditions was randomized using a randomization table prepared before the start of the study. At the end of the study, the subjects returned home with the new Naída CI sound processor.

Table 2 Tests conducted at each visit

Subjective testing

The subject’s self-assessment of their hearing with the different sound processors and programs was recorded using the APHAB [13]. This 24-item self-assessment inventory requires recipients to report the amount of trouble they are having with communication in various everyday situations. Benefit is calculated by comparing the recipient’s reported difficulty in listening in the specified scenarios. There are four subscales: ease of communication (EC), reverberation (RV), background noise (BN), and aversiveness (AV). Scores are given on a scale from A to G where A is “I always experience this” and G “I never experience this”. A percentage score from 1 to 99% is allocated to each category of response to give a mean percentage for each section. The average score for each subsection, recorded at baseline with the previous sound processor, was compared to the average score recorded at the 2-month follow-up visit with the Naída CI sound processor. A global score was also calculated, which is the mean of the scores for all the items in the three EC, RV and BN subscales.

Statistics

The results for each test session were compared independently. Scores for words in quiet and the Matrix sentence test in noise were not normally distributed, so a non-parametric Wilcoxon paired test was used. Individual scores in quiet were compared using the binomial model described by Thornton and Raffin [18]. The different subsections of the APHAB data were compared using a series of non-parametric Wilcoxon tests. A p value of less than 0.05 was considered to be significant, A power calculation showed that to detect a difference of 2 dB with sufficient power (80%) and at a significance level of p = 0.05, a minimum of 17 subjects were required for the objective testing.

Results

Speech perception testing

When subjects were tested in quiet, there was no difference in group performance between the previous sound processor(s) at baseline and the Naída CI processor(s) in omnidirectional mode after 2 months of use (median score of 53.8% ranging from 5 to 94% and median score of 52.5% ranging from 0 to 97%, respectively) (Wilcoxon paired test, Z = 0.37, p > 0.05) (Fig. 2). Analysis of individual scores using the binomial model, described by Thornton and Raffin (1978), showed that 4 out of the 34 subjects had a significant improvement in scores between the baseline and second test sessions (18). One subject saw a significant reduction in speech score in quiet in the second test session, but scores in noise between sessions did not reflect this. All other subjects had non-significant differences between scores of less than 20% (Table 3).

Fig. 2
figure 2

Performance score in quiet with the subject’s original sound processor(s) at baseline and with the Naída CI sound processor(s) in omnidirectional mode at the follow-up visit. The results are expressed as percentage of words correct for the lists of monosyllabic words in quiet for the 34 subjects. The box plots show the first and third quartile values and the central square, the median value. The whiskers indicate the non-outlier values for each group

Table 3 Individual scores for speech perception testing in quiet for Lafon words at baseline with the previous sound processor and at the follow-up visit with the Naída CI processor in omnidirectional microphone mode

Twenty-one out of the 34 subjects had sufficiently good performance in quiet with the previous sound processor(s) at the initial session to be able to perform the Matrix test in noise (median scores of 64% for monosyllabic words versus 23% in the group of 13 subjects who were not able to perform test in noise). The Matrix test at the follow-up session was only performed in this group of 21 patients. At the initial baseline session, there was no significant difference between the recipients’ previous sound processor(s) (median SRT of −1.1 dB) and the Naída CI sound processor(s) (median SRT of −1.2 dB) for performance in noise when using the omnidirectional microphone (Wilcoxon paired test, Z = 1.01, p > 0.05) (Fig. 3a, a lower SRT means a better performance).

Fig. 3
figure 3

Performance in noise for the 21 subjects who were able to perform Matrix test in noise. a At baseline with the subject’s original sound processor(s) and with the Naída CI sound processor(s) in omnidirectional mode (without UltraZoom); b at the follow-up visit with the Naída CI sound processor(s) in omnidirectional mode and with UltraZoom. Results are expressed as the speech reception thresholds (SRT, dB) for the Matrix sentence test in noise. A lower SRT means a better performance. The box plots show the first and third quartile values and the central square the median value. The whiskers indicate the non-outliers values for each group. The asterisks indicate a statistically significant difference in performance (***p < 0.0001)

At the follow-up session, after 2 months of experience with the Naída CI sound processor(s), the median SRT score with Naída CI with UltraZoom was −4 dB (range +4.8; −10.5 dB) compared to −0.45 dB (range +6.5; −8.0 dB) with the Naída CI in the omnidirectional mode (without UltraZoom). The use of UltraZoom significantly improved the median SRT by 3.6 dB (range +0.5; −7.8 dB) (Wilcoxon paired test, Z = 3.91, p < 0.0001) (Fig. 3b).

Subjective evaluation

APHAB questionnaires were completed by all 34 subjects. When the performance on the APHAB questionnaire was compared across the sessions, significant differences between the scores with the existing sound processor(s) at baseline and the Naída CI sound processor(s) for speech understanding in noisy environments (Wilcoxon paired test, Z = 3.57, p < 0.001), aversive situations (Wilcoxon paired test, Z = 2.10, p < 0.05) and globally (Wilcoxon paired test, Z = 2.19, p < 0.05) were obtained.

When looking at the APHAB outcomes for the group of 21 subjects who were able to perform the Matrix test, a significant improvement when using the Naída CI sound processor(s) compared to the previous processor(s) was found for speech understanding in noisy environments (Wilcoxon paired test, Z = 2.84, p < 0.01) and in aversive situations (Wilcoxon paired test, Z = 2.10, p < 0.05) (Fig. 4a). For the 13 subjects who were not able to perform the Matrix test at baseline, a significant improvement when using the Naída CI sound processor(s) compared to the previous processor(s) was also shown for speech understanding in noise (Wilcoxon paired test, Z = 2.13, p < 0.05) (Fig. 4b).

Fig. 4
figure 4

Median scores for the APHAB self-assessment questionnaire at baseline with the subject’s original sound processor(s) and at the follow-up visit with the Naída CI sound processor(s). a For the 21 subjects who were able to perform the Matrix test in noise; b for the 13 subjects who were not able to perform the Matrix test in noise. Scores are given for each of the four subsections and a global value for the average of the ease of communication, reverberation and background noise sections. The box plots show the first and third quartile values and the central square the median value. The whiskers indicate the non-outlier values for each subscale and each group. The asterisks indicate a statistically significant difference in performance (*p < 0.05, **p < 0.01)

Discussion

This study showed that for the 21 subjects who were able to complete the testing in difficult noisy conditions, the use of UltraZoom provided a significant improvement in performance of 3.6 dB SRT. The diffuse noise test conditions used in this study were designed to be challenging to represent the most common noise condition that CI users encounter in everyday life. The addition of some reverberation in the testing room also helped to simulate a real-world condition and is particularly relevant for beamforming technologies, as when the target and interfering noise become more spatially diffuse the beamforming performance can degrade [11, 12].

Our results were in line with the improvement seen by Geißler et al. [4] in a study evaluating ten adult Harmony users who had been converted to the Naída CI, but had no take-home experience with the new processor. Subjects had been evaluated using the same adaptive test as in our study, but in more challenging conditions with five loudspeakers used to create the noise environment and a higher reverberation time of 0.6 s. In our study, 26 out of the 34 subjects were using the T-Mic in standard condition which already provides some directionality [6]. Some subjects also used the ClearVoice static noise reduction technology, which in combination with UltraZoom has been shown to provide the greatest improvement in performance in noise [4, 14]. We chose to keep the use of ClearVoice and/or the T-Mic constant across all test conditions and for both sound processor types to have no impact on the results.

There is considerable variation in the degree of improvements reported for beamforming technology. In previous studies, when compared with the omnidirectional microphone or the T-Mic, UltraZoom improved speech reception thresholds in noise from 4 up to 9.8 dB in optimum conditions [4, 14] (Advanced Bionics, Reference Note 2). Many factors can explain this variation, such as the speech materials and noise type used, the configuration of the speaker array, the microphone and program configurations. In addition, the head alignment of the subject with the speech source can also affect the level of benefits of any adaptive beamformer. Even though instructions about head position were provided to subjects prior to testing, this is something which remained difficult to control over the whole duration of the session. However, these testing conditions reflected ‘real-life’ conditions and show the wide range of benefit a CI user could expect from using this new technology in daily life. Unfortunately, one of the limitations of adaptive SRT procedures is that a calculation of individual significant SRT differences cannot be made based on the binomial model. The other limitation is that the Matrix test was only performed in the group of the better performers, meaning that we cannot rule out that some poorer performers from the baseline session were finally able to do the test at the follow-up session. Therefore, no information can be provided on the percentage of subjects whose performance improved significantly when using the beamformer.

The purpose of providing beamforming to CI users is to improve their ability to communicate in the everyday noisy environments we all encounter. While many studies have shown the benefits of this technology in a laboratory setting [4, 5, 7, 8], a subjective evaluation by subjects is required to show that these benefits can be achieved in real-world scenarios. Moreover, the upgrade process was part of the routine clinical practice of the clinic, so both good and poor performers were enrolled. As a result, almost 40% of the subjects in our study who had poor speech comprehension score in quiet were unable to do the Matrix test in difficult noisy environment before upgrade, but may still have some subjective benefit of the speech processor upgrade. The APHAB results shown here indicate a significant subjective improvement for the listening in background noise, aversiveness and global sections when using the Naída CI sound processor. To check whether the poorer performers benefited from the new sound processor, the APHAB questionnaire was analyzed for this particular population and still showed a significant benefit in the background noise section. It is particularly interesting to observe this improvement in poorer performers, for whom the objective improvement could not be shown through the Matrix test. It highlights the importance of evaluating subjective feedback from CI recipients to assess their level of comfort in everyday life. Some previous studies using different subjective measures have been unable to show a subjective benefit alongside the laboratory benefits shown [5, 8]. Only Mosnier et al. [1] showed a significant benefit on the APHAB when subjects upgraded from the older Cochlear Esprit 3G and Freedom sound processors to the newer CP810. This lack of strong evidence for any subjective benefits of beamforming is not just an issue for its use in cochlear implants, but is also a criticism for its use with hearing aids [19]. The subjective results are limited by the fact that the APHAB, in common with most of the subjective measurement tools available, relies on asking subjects about predetermined situations, which may not be relevant or equally important to all subjects. An additional limitation of any study where subjects are upgraded to newer technology and cannot be blinded to the sound processor used is that responses may be biased toward the newer technology.

The results of the speech perception testing at baseline show that group performance with the new Naída CI sound processor in noise was the same as with the subjects’ existing sound processor when using the same programs and omnidirectional microphone settings. This provides clinicians with confidence that subjects can be upgraded to the Naída CI without a change in performance when used with the standard microphone settings and do not require any training period. However, the subjects recruited were all using the Harmony sound processor, so these findings can only be applied to recipients who are currently using this sound processor type.

The improvements in recipients’ use of beamforming in real-world environments may result from a better understanding by clinicians on how to use the technology and appropriately counsel recipients and better implementation of the beamforming algorithm, improving its robustness [9]. Indeed, appropriate counseling on the use of UltraZoom is crucial as recipients are required to manually change the program depending on the listening situation encountered. Therefore, it is important to provide recipients with concrete real-life examples of situations where this feature helps speech understanding. However, this might be less relevant with the newest generation of sound processors, which offer automatic selection of the microphone settings depending on the incoming signal, i.e., UltraZoom is switched on and off automatically depending on the environment.

To conclude, this study showed that all subjects were successfully upgraded to the new Naída CI Q70 sound processor. Once upgraded, subjects who were able to perform the French Matrix test in noise with their previous processor could take advantage of the UltraZoom beamforming technology on the new sound processor, so that their ability to communicate in noise was improved. Subjective results with the APHAB questionnaire confirmed these objective results, showing improvements in median scores in the whole group, but also in the group of poorer performers, for listening in background noise when using the Naída CI Q70 sound processor. This study highlighted the importance of upgrading CI recipients to new technology and of including adaptive tests in noise and subjective feedback evaluation as part of the process.