Keywords

1 Cochlear Implants and Aging

Cochlear implants (CI) have successfully restored partial hearing to over 400,000 deaf persons in the world and the rhythm of implantations is high in the world with a minimal rate of about 30,000 implantations [1].

Economically, CI is a good investment for the society. The machine is expensive (about $25,000), the surgery also and the follow up of the patient during the first year (speech therapy, audiology) pushes this figure to $50,000. Nevertheless, if we consider the cost of a handicapped persons during his life (education in dedicated centers, perturbed familial life, professional limitations and so on) fitting a deaf person with a CI is worthwhile to be done and many social systems cover the expense mostly when it is for children.

1.1 Reliability

Designed to last for more than 10 years, CI suffers from failures and some of them come from aging [2, 3] and one of the goal of engineering is to reduce their impact. In this paper we wish to discuss this issue.

Cochlear implants are biomedical prostheses which restore some hearing in profoundly deaf subjects. Basically, they are made with two main components, an external device which is for speech reception and processing, and an implant which decodes the information transmitted by the external device and distributes it to the ends of the auditory nerve in the inner ear. All the elements are subjected to aging and patients are deeply affected when failures occur in the system. There are two kinds of failure, the hard failure easy to detect as the machine breakdown is seen immediately and the soft failure when the failure is intermittent. Another kind of burden, more insidious, is the progressive loss of efficiency on one of the components of the machine. Among them the attack of the microphone is one of the hazards affecting the machine. Several alerts have been raised and some dramatic stories about alterations on the opening in hearing aids have issued warnings [4] which must not be ignored. A check of the microphone entrance often discovers debris accumulated in undesirable locations and the input sound pathway is more or less obliterated.

1.2 Microphone Aspect

The microphone repair issue is well ranked into the external failures of a cochlear implant [5, 6]. It comes in the fourth or fifth position in the classification after the headpiece (antenna), the battery compartment and the speech processor and it represents nearly 11% of the repairs.

But the progressive drift of the microphone properties is often not noticed and everybody should keep in mind that the microphone works in very adverse environments which can range from very cold and dry to very humid and hot, with a common exposure to sweat, dirt, dust, grease and so on. This is a very aggressive situation and moisture and grease are likely to affect the sensitivity of the microphone and its frequency response.

Consequently this behavior should be studied and this aspect of aging should not be ignored. In a previous approach Razza et al. [7, 8] indicated that speech tracking scores are correlated significantly with the loss of the microphone sensitivity. Responses curves were compared to those of fully working microphones and the consequences on speech recognition need to be studied.

This situation is not very simple as the use of cochlear implants occurs often in noise which is part of our usual environment.

1.3 Coding Strategies

CI allows several coding strategies which can be implanted on almost all the machines, and the choice of a relevant coding strategy is something which must be discussed.

Basically, in CI, the speech input spectrum is divided into frequency bands, and this “simplified” spectrum is distributed to the electrodes distributed along the cochlea which is the hearing organism.

Two main schemes are usually taken to program the speech processors by the CI manufacturers:

  • n-of-m: the speech spectrum is divided into m bands, and the n with the highest energy are kept and transmitted to the cochlea. This scheme is popular, because the presentation of selected acoustic features corresponds to an approach based on the phonetic theory. In this circumstance, only a subset of the available electrodes is stimulated. Also, as the number of active electrodes is limited, it is expected that the overlap between two successive stimulations is small, but this may be not true if two consecutive electrodes are selected [9].

  • CIS (Continuous Interleaved Sampling): it is a sound processing scheme designed to enhance the delivery of temporal fine structure to the implant, in order to beat the poor intelligibility of speech in a competing noise. All the band pass filters are kept and the number of electrodes is reduced, leading to a wide spacing, between two consecutive electrodes, which is likely to reduce the interaction. Also, the stimulation rate is high and it enhances the delivery of temporal fine structure to the implant, and this is an advantage in a competing noise [10].

1.4 Noise Influence

The choice of the noise is also a major issue. The effect of listening conditions determines the performances of the listeners. The signal to noise ratio (SNR) is an important parameter for the assessment of the recognition performances. Lowering the SNR spoils the incoming speech signal and this action is worthy to be measured. Also, with different values of the SNR it is easy to avoid the ceiling and the floor effects, enabling the determination of some characteristic parameters traditionally used in the evaluation of speech recognition [11].

In our work, the choice of the Fournier’s lists (a French equivalent to the spondee lists) led to recognition percentages ranging from 0 to 100%, according to the SNR, and it was suitable to the aims of our experiment.

1.5 NH Versus CI

Two aspects are generally considered when coding strategies are examined in cochlear implants: the recognition seen with patients using the coding schemes, and the behavior of normal hearing (NH) subjects facing the signal (a simulation is done for NH subjects). Both performances are complementary as in one case (CI users) we have the direct results with the group of persons concerned by the prosthesis and in the second group we can test the signal action using a more comprehensive approach [12]. In other words, one matter is more oriented to the patient and the other tests the signal. Both groups receive the same amount of information about speech, but the outcomes may be different or similar [13, 14]. Furthermore, the CI users group is very heterogeneous and the standard deviation in the measures is large making the statistical interpretation of the results more difficult. On the contrary, with the NH listeners, several strategies and microphone drift are evaluated on the same people and intra-subject studies can be done making the evaluation more accurate.

1.6 Organization of This Study

The aim of this study is to assess the behavior of two coding strategies (n-of-m and CIS-like) in front of the aging of the CI microphone, and to give some indication to the audiologists in charge of CI settings in the specialized clinical centers.

After an introduction of the CI issues about aging and the main elements which will be manipulated to carry out the study, the second section introduces the tools for the simulation (CI and microphone drift), the subjects who participated to the experiment and the mathematical fitting of the recognition results. Then in the third section, the results (recognition percentages and audiological features) obtained with the CI users and the NH subjects are presented.

The fourth section is a discussion about the different elements which were used in this work, The comparison of the performances of CI users and NH subjects, the choice of the noise and evaluation of the strategy and how to reduce the influence of the microphone drift with aging. Finally, a conclusion sums up the main contributions of this work.

2 Materials and Methods

2.1 Stimulation

2.1.1 Cochlear Implant Principles

We will not expose the details of signal processing in cochlear implants here, but only the essential principles (Fig. 1). Cochlear implant can be divided into two main parts:

Fig. 1
figure 1

Shematic view of a cochlear implant

  • The external block (signal processor) processes the signal and separates the sound in multiple frequency bands (channels).

  • The internal block (the implant) receives the information from the processor and distributes the energy of the channels to the electrodes along the cochlea.

The information is transmitted from the processor to the internal part through the skin by a radiofrequency signal.

Sounds in the environment are captured by the microphones that are very impacted by soiling and aging. Then the processor fits the signal received from the microphones to the patient’s physiology and separates the signal into multiple frequency bands. The frequency bands correspond to the treatment channels scaled using a logarithmic scale. We chose to use the Bark scale for our CI simulator [15].

Most of the existing sound coding strategies are based on two main principles: there are CIS-like strategies and “n-of-m” strategies. It is interesting to notice that, for the n-of-m strategies, if n = m it is very similar to CIS. So the value of n is important to be considered.

2.1.2 Vocoder

The CI simulator used for testing NH subjects is a basic vocoder fitted for the purpose of the study (Fig. 2).

Fig. 2
figure 2

Block diagram representing the signal processing performed by the n-of-m simulator

The different steps of the signal processing are:

  • The input signal goes through a pre-emphasis filter which is a high-pass filter (cutoff frequency 1.2 kHz and slope 3 dB/octave).

  • The signal is then sampled (16 kHz sampling frequency, 16-bit quantization), a short term fast Fourier transform (FFT) is applied to the samples, frame length is 128 points (8 ms), leading to 64 spectral beams (amplitude and phase), ranging from 0 to 8 kHz. The step between two beams is 125 Hz.

  • Then, the spectral beams are grouped into frequency bands which are logarithmically distributed, according to the ear physiology (Bark scale). Considering the usual values taken in cochlear implants we used 20 bands (leading to 20 channels).

  • In each band, the energy is calculated using the Parseval’s formula (the squares of the amplitude of each beam are added). In the CIS-like coding all the channels are taken. For the n-of-m coding only the 8 more energetic channels are kept. The value n = 8 is classical in cochlear implants. There is a frame overlap of 6 ms (75% overlap), and a set of pulses is calculated for each frame.

  • Each channel is represented by a narrow-band spectrum coming from a white noise spectrum. The amplitude of the narrow band follows the energy detected in the corresponding channel. The first two-channels, which are very narrow, were represented by sine waves. The output signal is obtained by summing the selected channels (8 for n-of-m; 20 for CIS-like).

2.1.3 Evaluation of the Frequency Drift of a Hearing Aid Microphone

In this study we tested speech recognition in noise with CI users, before and after a microphone cleaning process. For NH subjects we used a CI simulator integrating a low-pass filter mimicking the microphones frequency drift due to soiling. The low-pass filter approximated a measured median attenuation. We based our work on a study about microphone frequency drift due to soiling on conventional hearing-aids. Knowles electronics (Itasca, Illinois, USA) is the manufacturer which leads the market of microphones for hearing-aids and cochlear implants. That is why we considered that this study matched for cochlear implants’ microphones too.

The study of the microphones sensitivity loss had been conducted, during a routine maintenance appointment in a hearing care center, on regular hearing aids worn by n = 129 hearing impaired patients [16]. Two steps were necessary in order to check the microphone frequency drift. First the receiver and the ear mold were deeply cleaned leading to the state “clean receiver and dirty microphone”. Secondly, the microphone was carefully cleaned, leading to the state “clean earphone and clean microphone”. The subtraction of the two states gave the attenuation due to the microphone soiling.

Then the microphone transfer function was evaluated after a frequency warble sweep ranging from 200 to 8,000 Hz (at 60 dB SPL).

Four degrees of attenuation have been defined:

  • Case 1: clean microphone (no loss); the response was not modified,

  • Case 2: medium loss corresponding to the attenuation seen on 50% of the microphones (percentile 50%)

  • Case 3: strong loss, corresponding to the 20% worse microphones (percentile 80%),

  • Case 4: very strong loss, corresponding to the 10% worse microphones (percentile 90%).

The clean signal was modified by the different attenuation curves and then passed through the vocoder.

On Fig. 3, the three degrees of loss of sensitivity, corresponding to the 50, 80 and 90% percentiles, are indicated. Each degree was represented by two straight lines for simplification. The signal attenuation started at 800 Hz and then three slopes were introduced, respectively, 2 dB, 7 dB and 9 dB per decade.

Fig. 3
figure 3

Simulation of the microphone loss of sensitivity

2.2 Tests

2.2.1 Participants

The work presented in this paper follows a previous pilot study [17] and it was approved by the French Ethics Committee “Sud Est 2” (August, 27, 2014), under the supervision of the HCL (Civil Hospices of Lyon). All the participants filled an agreement form prior entering the study and all the recordings were made by an audiologist.

2.2.1.1 CI Users

Sixty-one CI patients were included in this study. Their ages ranged from 18 to 60 years (average 37 years old). Nineteen subjects were fitted with CIS like strategies and 42 had the n-of-m strategies.

The population of the CI users had been parted into four groups:

  • Subpopulation 1 (SP1): Med-El® and Advanced-Bionics® subjects (19 patients): “brush and blow” cleaning procedure and CIS-like coding strategies,

  • Subpopulation 2 (SP2): Neurelec® subjects (13 patients); “brush and blow” cleaning and n-of-m strategies,

  • Subpopulation 3 (SP3): Cochlear® subjects (14 patients): “brush and blow” cleaning and n-of-m strategies,

  • Subpopulation 4 (SP4): Cochlear® subjects (15 subjects): replacement of the protective filter and n-of-m strategies.

2.2.1.2 NH Subjects

Twenty NH subjects participated to this experiment. Their age ranged from 18 to 33 years (average 25 years). All the subjects had an ORL examination before entering the study, in order to eliminate previous pathologies or deafness which may corrupt the results. The auditory thresholds were measured; they were always below 20 dB HL for all the frequencies between 250 and 8,000 Hz. According to the BIAP (International Bureau for Audio-Phonology) all the subjects were considered as NH listeners.

2.2.2 Session

2.2.2.1 Material

In this work we used the French dissyllabic Fournier’s lists. These lists were well adapted to the patients’ recognition ability in this environment. They were uttered by a male voice and were the vocal part of the signal. They are similar to the spondees lists used in English. Each list contains 10 disyllabic words (for instance “le bouchon” = the cork). Forty lists are available, and the recognition unit was the syllable. So the recognition step was 5%.

A cocktail-party noise has been used in this study. This noise was a mix of voices from 8 speakers, 4 males and 4 females.

2.2.2.2 Test Sessions

The signal delivered to the subjects was made of the Fournier’s lists mixed with the cocktail-party noise. The mixing was managed by a Madsen Orbiter 922 audiometer and the signal to noise ratio (SNR) was perfectly adjusted.

*CI users

For CI users, tests were made in free field in an audiometric booth. In all cases only one ear was stimulated. When a subject was fitted with two implants, only the best ear was kept.

For NH subjects, the Fournier’s lists and the cocktail-party noise were first processed by the CI simulator and then passed through the audiometer and finally delivered on the right ear by a TDH 39 headset.

For the two groups of subjects the experiment consisted in a speech audiometry in noise. The maximum level delivered was below 65 dB SPL. According to the conditions requested by the ethics committee it did not exceed the 80 dB SPL limitation recommended for professional noises exposure.

The recognition scores of the CI users were collected before and after cleaning the microphone, leading to two different situations (“dirty and clean”). Our experiment took place at the beginning of the periodical clinical check and device setting occurring periodically at the CRIC (Cochlear Implant Setting Center) located in the ORL department of the Edouard-Herriot University hospital of Lyon. This check-up consists of an appointment with a speech therapist, a setting of the implant parameters and a clinical examination. Consequently, in our study the microphone cleaning occurred before the classical check-up. This device check follow-up is carried out at least once a year.

We used two different procedures to clean the microphones ports depending on the type of speech processor: A “Brush and Blow” technique (commonly used in French hearing care centers) and the replacement of the protective filters. The “brush and blow” procedure consists in brushing and blowing a flow of dry air on the microphones ports to remove dirt and dust.

The following tasks were realized in our work:

  • Verification of the patient’s medical file,

  • Short training session to help the patient to understand the instructions,

  • First test with the Fournier’s lists, before cleaning the microphone. The lists were presented to the patient with an increasing level of difficulty (SNR decreased from 18 dB to −3 dB; step −3 dB),

  • Microphone cleaning,

  • Second test with different Fourier’s lists after cleaning the microphone.

The full session lasted about 30 min. Sixteen Fourier’s lists were used in this experiment (8 before cleaning and 8 after) and the lists were not repeated.

*NH subjects

For NH subjects, stimuli were delivered to only one ear similar to the experiment with CI users. Furthermore, we chose to test the subjects on the right ear considering that the left brain auditory cortex is specialized in processing speech.

Eight conditions have been considered:

  • 2 coding schemes (CIS-like and n-of-m)

  • 4° of microphone sensitivity loss.

Five levels of SNR were tested for each condition leading to 40 (2 * 4 * 5) combinations. Each combination was assigned to a Fournier’s list so that the lists were not repeated. Each session started with a short training period to help the listener to understand the instructions. Then the 40 lists were randomly presented to each subject. The session lasted about 45 min.

2.3 Mathematical Representation

The recognition percentages versus the SNR can be represented by a sigmoid regression curve (Fig. 4).

Fig. 4
figure 4

Fitting of the recognition percentages by a sigmoid curve

Three features were considered here:

  • the SNR corresponding to 50% of the maximum recognition plateau denoted here x50%;

  • the “slope” (SNR interval, given in dB, between 25 and 75% of the maximum recognition) and denoted here Δ75–25%;

  • the top asymptote ymax shows the maximum recognition score.

These analytical values are represented on the sigmoid curve. The minimum recognition was 0% (seen for SNR = −3 dB). Thus, the sigmoid equation is:

$$ y = \frac{a}{{1 + e^{{ - b\left( {x - c} \right)}} }} $$

where:

  • y is the recognition percentage

  • x is the SNR

  • a is ymax

  • c is x50%

  • b is linked to the slope: b = 2.2/Δ75–25% => Δ75–25% = 2.2/b

3 Results

3.1 Percentage of Intelligibility

3.1.1 NH Subjects

Figure 5 shows the percentage of correctly repeated syllables, function of the SNR, for the simulated strategies (a) CIS-like (b) n-of-m. The intelligibility is represented for the 4° of simulated soiled microphones (no attenuation, 800 Hz low pass filtering at −2, −7 and −9 dB per decade)

Fig. 5
figure 5

Intelligibility with respect to the SNR in four cases of simulated soiled microphones, a for the CIS-like strategy, b for the n-of-m strategy

  • N-of-m coding scheme; ANOVA and post hoc analyses

A two-factor repeated measure ANOVA was used to analyze the results. It indicated a significant effect of SNR [p < 10−4] and of the degree of soiling [p < 10−4]. Furthermore a significant interaction was seen between them [p < 10−4].

Table 1 shows the results of Wilcoxon’s tests taken for the paired post hoc comparisons, between soiling conditions, with respect to the SNR; significant differences are highlighted.

Table 1 n-of-m, paired comparisons between the 4 cases with respect to the SNR

Largest differences were located at 9, 3 and 0 dB of SNR (Table 1). The results indicated that Case 1 was better than Cases 2 and 4 at the highest SNR (9 dB). For the SNR 3 dB the intelligibility for the Case 1 was significantly higher than for the Cases 3 and 4 and also Case 2 was better than Case 4. Finally, Case 1 was better than Cases 2, 3 and 4 for the SNR 0 dB.

  • CIS-like coding scheme; ANOVA and post hoc analyses

The two-factors repeated measures ANOVA indicated a significant effect of SNR [p < 10−4] and of the degree of soiling [p < 10−4]. Furthermore a significant interaction was seen between them [p < 10−4]. Table 2 shows the results of Wilcoxon’s tests for the paired post hoc comparisons with respect to the SNR. The significant differences are highlighted.

Table 2 CIS-like, intelligibility paired comparisons between the 4 cases with respect to the SNR

Largest differences are located between 3 and −3 dB of SNR (Table 2). The results indicate that Case 1 was better than Cases 3 and 4 and that Case 2 was better than Case 3 at the SNR 3 dB. For the SNR 0 dB the intelligibility for Case 1 was significantly higher than for the Cases 2, 3 and 4. Finally, Case 1 was better than Case 3 for the SNR −3 dB.

Strategies are compared on Fig. 6.

Fig. 6
figure 6

Intelligibility with respect to the SNR in four cases of simulated soiled microphones, comparison of CIS-like and n-of-m

3.1.2 CI Users

The effect of cleaning the microphone (“Brush and Blow” procedure) was evaluated on the recognition percentages; it is represented on Fig. 7 according to the strategies. It indicates the efficiency of cleaning. The heterogeneity in the results did not allow definitive conclusions and only indicated clues.

Fig. 7
figure 7

Intelligibility with respect to the SNR before and after cleaning the microphones’ port (“Brush and Blow” cleaning procedure), a CIS-like strategies, b n-of-m strategies

  • N-of-m coding schemes; ANOVA analysis

For the n-of-m strategies the improvements (Fig. 7b) were mostly noticed for the high values of the SNR (12, 15 and 18 dB) and there was an average improvement of 2.7% in syllable recognition after cleaning (corresponding to “½ syllable” per list).

The two-factors repeated measures ANOVA indicated a significant effect of SNR [p < 10−4] but no difference between before and after cleaning [p = 0.056].

  • CIS-like coding schemes; ANOVA analysis

For the CIS-like strategy (Fig. 7a), the best improvements were observed in the SNR range 0–9 dB (0, 3, 6, and 9 dB) and the average improvement after cleaning was 2.9%.

The two-factors repeated measures ANOVA indicated a significant effect of SNR [p < 10−4] but no difference between before and after cleaning [p = 0.12].

Protective filter replacement; ANOVA analysis

In the case of the replacement of the protective filter, there was an average improvement of 4.7% in syllable recognition after replacement (corresponding to one syllable per list); the results are represented on Fig. 8.

Fig. 8
figure 8

Intelligibility with respect to the SNR before and after cleaning the microphones’ port, a cochlear processors, protective filters replacement, b cochlear processors, “Brush and Blow” cleaning procedure

The two-factors repeated measures ANOVA indicated a significant effect of SNR [p < 10−4] but no difference was seen between before and after cleaning [p = 0.43] and no interaction occurred between the two factors [p = 0.91]

3.2 Audiological Values

3.2.1 NH Subjects

  • N-of-m coding schemes; ANOVA and post hoc analyses

A one-factor repeated measures ANOVA indicated a significant effect of the soiling degree on x50% [p < 10−4] but did not reveal a difference for the features Δ75–25% [p = 0.52] and ymax [p = 0.32].

We can see on Table 3 that the feature x50%was better for Case 1 compared to Cases 2, 3, 4. Also, x50% for Case 2 was significantly better than for Case 4.

Table 3 N-of-m, feature x50% paired comparisons between the 4 cases
  • CIS-like coding scheme; ANOVA and post hoc analyses

Again we used a one-factor repeated measure ANOVA. It indicated a significant effect of the soiling degree on the feature x50% [p < 10−4] but did not reveal a difference for the features Δ75–25% [p = 0.48] and ymax [p = 0.43].

We can see on Table 4 that the feature x50% was better for Case 1 compared to Cases 2, 3, 4. Also, x50% for Case 2 was significantly better than for Case 3.

Table 4 CIS-like, feature x50% paired comparisons between the 4 cases

3.2.2 CI Users

  • n-of-m coding schemes; Wilcoxon’s tests

The Wilcoxon’s tests indicated no significant effect of the cleaning on the curve features x50% [p = 0.66], Δ75–25% [p = 0.85] and ymax [p = 0.35].

  • CIS-like coding schemes; Wilcoxon’s tests

The Wilcoxon’s tests indicated no significant effect of the cleaning on x50% [p = 0.63], Δ75–25% [p = 0.29] and ymax [p = 0.41].

3.2.3 Filter Replacement Procedure

  • Wilcoxon’s tests for the procedure “filter replacement”

The Wilcoxon’s tests indicated no significant effect of the cleaning on the curve features x50% [p = 0.96], Δ75–25% [p = 0.62] and ymax [p = 0.19].

4 General Comments

4.1 About the Present Study

Several indications came out of this work:

  • The higher the SNR, the better the recognition (this is obvious). What is less obvious is the SNR for the ceiling effect. For NH subjects it was 6 dB for CIS-like, and n-of-m whatever the microphone drift. Also, the floor effect (0% recognition) was always reached at −3 dB of SNR. For CI users, the ceiling was seen for 15 dB, before and after cleaning for CIS-like, and n-of-m.

  • Cleaning improves the recognition (this is sensible). For NH subjects the improvement could not be seen for the extreme values of the SNR range (−3 dB and +9 dB). The importance of cleaning was maximum for the medium values of the SNR range (0 and 3 dB). For CI users, the results were similar to the NH subjects; only the improvement was reduced.

  • Considering the sound coding strategies, the recognition percentages obtained with CIS-like, are better than those seen with n-of-m, except for the extreme values (floor and ceiling effects).

Some exceptions can be pointed out:

For NH subjects, ceilings were not the same for CIS-like, and n-of-m, when the microphone was strongly dirty. For CI users when the microphone was clean, the n-of-m scheme led to better results than CIS-like, at the high values of the SNR. May be due to the heterogeneity of CI users, this result was not significant.

Consequently, the robustness of this finding has to be established and further studies should be done to clarify this aspect. So far we may suggest that the coding scheme could be adapted to the noisy environment. Also, a cleaning protocol should be inserted in the regular check of the machines.

Another aspect which was seen was the cleaning procedure: “Brush and Blow” compared to the microphone protective filter replacement.

Basically it seems that the replacement led to a better improvement of the recognition percentage than the “Brush and Blow” procedure. In our experiment, the “Brush and Blow” procedure was performed by a qualified audiologist; it may be different when audiologists which are not used to this procedure have to accomplish this task, mostly if they are worried about damaging the device. If the filter replacement is cheap and fast, it may be a safer approach to systematically change the head filter protecting the microphone, but it has to be seen in our work that it is not worthy to be done.

Going further, a replacement of the microphone can be suggested but other questions are open by this procedure. Obviously, a new microphone would be the best solution, but this is more costly and the connection can lead to contact problems; the junction can be fragile and could be a source of hard failure. Also, for reasonable and good values of the SNR, the difference of performances seen between a new microphone and a microphone with a small drift (50% of the users) is very light and not significant (result seen with the NH subjects). Also, cleaning the microphone in the case of CIS-like, users, did not lead to a dramatic increase of the performances (possibly, the microphones were only slightly soiled).

4.2 About the Microphone

Aging occurs for all components but the action is not the same for all. A breakdown is an acute failure, and it will be followed by an immediate repair. When the fault is irregular, it is more difficult to stop it, because no one knows when the incident will occur and this is a very worrying situation. When the trouble comes insidiously, it is often not perceived and the subject loses progressively his efficiency without being aware of it. This is more or less what happens during the life; someone gets older, in good health, and he hardly notices the decrease of this aptitude to perform many tasks in the life, mentally or physically. Slopes get steeper, mental activities get harder, but the subject can still cope, even if it is a little bit more difficult. This is typically what happens with the microphone; it does not break, but it gets less efficient and the user is not aware of it. This is why this evolution must be studied; what are the consequences and how we can beat it with rather inexpensive actions, in time and in money? The community surrounding a patient, a cochlear-implantee in our case, must be aware of this condition and should take the necessary steps to restore the good conditions attached to the machine. Several protective measures are easy to imagine. To get a good protection to moisture, grease, sweat, dust, debris, a head filter is a good shield for the microphone and it should be systematically checked by the audiologist each time the patient come for a follow up with the medical team. The degree of sensitivity reduction has its importance; this reduction can be beaten; we should lower the signal perturbation, with an attenuation and a systematic shift of the frequencies seen with an increase of the stiffness and of the dirt of the membrane [7].

A study, in simulation, can bring situations which are not often seen in real life; it emphasizes effects which are attached to extreme conditions and sheds more light on the issue we want to study. The influence of the modifications of the properties of the microphone membrane was tested in classical hearing aids checked by a hearing aid analyzer [16]. Then all the situations (coding scheme, microphone drift, signal to noise ratio) were presented to the same NH subjects, a rather homogenous group, and powerful paired student’s tests could be conducted. Obviously, the study in simulation cannot ignore the behavior and the performances obtained with CI users as this group is the target for this technology.

4.3 About the Listening Conditions

Three approaches have been done in this work, simulation with NH subjects, simulation of the microphone drift and addition of noise. Noise needs to be introduced in this kind of work for two reasons. First noisy situations are very common in the usual environment, for CI users and for NH subjects. Secondly, to get understandable results, one must avoid the floor and the ceiling effects; the range of the noise, associated with a good choice of the speech signal, allows to fulfill this requirement. Experimental conditions are a real issue, and their choice is far from being easy; pilot studies are more than necessary [17].

Then, the assessment of a listening condition cannot be done only by the comparison of recognition percentages. Other speech tests are worthy to be considered as they throw another glance on the performance. These parameters help to understand a scatter plot, and we have calculated the SNR for the mean recognition, the slope representing the action of the noise on the percentages, and the ceiling indicating the maximum performance reached in a given situation. In order to do so, a mathematical fitting was necessary, and finally it was worth to be done regarding the data synthesis it provided.

Practically, it is necessary for hearing aid management and rehabilitation of CI recipients that their devices are functioning well and are set by an audiologist during the check which is periodically done in all the clinics. The decrease of the microphone sensitivity is part of this clinical evaluation and it should be corrected, as much as possible. Nevertheless the correlation between the decline of the microphone sensitivity and its consequences on speech recognition scores was not found to be dramatic.

4.4 Strategy

As indicated above, CIS-like, strategy was less sensitive to the microphone drift than the n-of-m coding, mostly in the middle of the SNR range. This is an indication to the audiologists in charge of CI settings and to the manufacturers.

Is it only due to the stimulation speed, the spacing between the electrodes which lowers the overlapping? Is the spectrum coverage more useful to the brain than a selection of the phonetic features presented to the brain? For many researchers, CIs work, “because there is a brain behind” and we should not forget the efficiency of this premier organ of our body. We give information to the brain, then let it work.

Then other settings on the strategy have their influence: sampling rate, filters settings, length of the time frame (window), and they have been studied by many authors [1, 18,19,20]. Most of the studies concluded to an advantage of the CIS coding over n-of-m. Then, the Advanced Combination Encoder (ACE) scheme is very efficient; it uses a very fast updating of the data and therefore provides a fine temporal analysis of the signal, indicating that spectrum is not the only component of speech.

Then the results are subjected to the patient’s condition and mostly to the degree of nerve survival and to the intellectual background; it has consequences on the appropriate strategy for individual implant patient [21]. A good adjustment of a strategy to a subject is a goal to be reached, and it should be adapted throughout the time.

4.5 Noise

The robustness of the strategies should be also seen in noise. In a background noise, the performance of CI users is significantly better when using a full filter bank approach (CIS-like) in comparison to using a peak picking method. Consequently, when the signal is distorted (noise or microphone drift) CIS-like, coding seems to be more robust, but this is not unanimously accepted [22].

With CI users the performance in noise is very difficult to interpret and it is related to the patient’s characteristics and to the wide variability in this group of people. The degree of deterioration in noise was similar with each strategy.

Also the number of channels needed to understand speech in noise is still an open question [23]. More channels are needed to understand speech in noise than in quiet, both for CI users and NH listeners, even if results are better in simulation with NH subjects. Again, the number of electrodes (channels) must be kept low to reduce the overlap between the electrodes.

A recommendation which is classically delivered when hard of hearing people are concerned is to provide good listening conditions (high SNR) and to speak slowly and distinctively. This situation is not always easy to meet and we should reduce the noise level as much as we can. Consequently many studies have been done to lower the noise, in other words to increase the SNR [24,25,26]. This is attractive, but not always easy to implement as hypotheses should be made about the noise.

Aging is an important issue for human, animal, vegetal and machines. “Time flows and we follow”. Wisdom is to adapt and to find the best strategies to reduce the consequences; this is also true for the cochlear implants microphones and that was the aim of this dissertation: how can we lower the impact of time?

According to us, to lower that impact on the microphones several suggestions can be made to keep the microphone function:

  • “Stop the time flow” (unfortunately this is not possible),

  • Use the Brush a Blow cleaning procedure,

  • Make a head filter replacement,

  • Perform a microphone replacement.

The cheapest is to apply a “Bruch and Blow” cleaning procedure at every routine check and the results on the performance should be satisfactory. So we can recommend this procedure to the specialized teams in charge of the CI patient’s care.

5 Conclusion

In this work we have studied the effect of soiling and aging of the microphones of cochlear implants on the recognition of syllables in the French Fournier’s disyllabic lists of words.

Several parameters were studied in this work:

  • The influence of the coding strategies (CIS-like and n-of-m),

  • The action of the signal to noise ratio,

  • The effect of cleaning the microphones.

Experiments were conducted with CI users and NH listeners as the two approaches are complementary. Also, two procedures to clean the microphones were considered, the classical “Brush and Blow” and the replacement of the protective filter of the microphone. With CI users we were in the true situation, and with NH subjects we could see more analytically the issue and we could perform a more precise analyses using data taken on the same person.

Results indicated that:

  • The CIS-like coding strategies were more robust than the n-of-m strategies regarding the microphone drift; the difference could reach two syllables in a list (10%) and it was seen mainly in the middle of the SNR range,

  • The SNR effect was very important, the recognition percentages ranging from 0% (SNR = −3 dB) to 100% (SNR = 9 dB or higher). Results were generally lower with CI users than with NH subjects. In some circumstances, the difference could be very small,

  • Cleaning the microphones improved the percentages, mostly in the case of NH subjects in simulation as more extreme situations could be considered and they were tested on the same subjects (through paired comparisons).

  • With CI users, it was difficult to get significant results, probably due to the heterogeneity of the group, and this is very general when pathology is considered. Also, the protective filter replacement seemed to be more efficient than the classical “Brush and Blow” cleaning procedure, but this effect was not seen everywhere on the SNR range, and this result was not statistically proven.

Finally, we recommend the use of the CIS-like strategies over the n-of-m coding strategies whenever it is possible. Also, in the regular checkup which is programmed in the follow-up of CI users, we suggest that the audiologists perform a cleaning of the microphone (the “Brush and Blow” procedure is not aggressive) as the drift is very insidious and often not noticed by the subjects. In some cases (very dirty microphones) it would be helpful for the patient.

We cannot stop aging, but we can attenuate its effects and it was the aim of this work on the microphone drift for cochlear implants (“a pin hole in a wide issue”).