Introduction

Cochlear implantation (CI) has proven to be a successful procedure to restore hearing in patients who suffer from medium to severe hearing loss. Nevertheless, the level of restoration highly depends on patient-specific factors, such as the patient’s anatomy [1,2,3], making the prediction of CI performance a challenging process. Currently, statistical relations based on the patient’s deafness duration and residual hearing are mainly used to estimate the expected implant performance. Some authors have tried to identify other parameters that may affect the level of restoration and, thus, could be established as predictors of the implant outcomes [2,3,4]. Nevertheless, a clear association between pre-operative measurements and the patient’s CI outcome has not yet been observed, since a high number of factors, including pre-operative and implant information, contribute to a high variability on the implant performance [2]. Previous studies proposed the brain metabolic activity as a factor to effectively predict CI results before surgery [4, 5]. However, such clinical pre-operative data is not always available, and these hypotheses still need to be fully tested with further functional experiments [4]. Consequently, new prediction tools can play an important role in the implant performance assessment. The development of advanced computational tools can provide an estimation of the patient-specific CI outcome to directly support pre-operative decisions.

Although in silico studies have not been commonly applied into the clinical practice of CI, they have shown their potential to become valuable tools to compute predictive performance of the implantable device [6,7,8,9]. In the past, some authors assessed the resulting electric field accounting for different cochlear implant setups, such as stimulation protocols [10,11,12], or neural fiber conditions, contrasting both intact and degenerated fibers [8, 13]. Other authors focused their studies to improve finite element models by including the ear canal and middle ear [14] or the whole human head [15, 16], and to evaluate the neural excitation patterns [6, 17, 18]. However, these studies did not lead to a complete and automatic computational approach encompassing all stages, from the generation of a highly detailed patient-specific model of the cochlear anatomy to the neural response evaluation. Some previous studies considered volume conduction models from simplified [7, 16] and parametric [9] representations of the cochlear anatomy, built from guinea pig cochleae [19], or considered a more detailed geometrical model [20]. However, simplified or generic models limit the insight on CI performance, due to their high dependence on patient-specific factors, pointing out to the need for personalized detailed models [20].

The neural response of the auditory nerve fibers (ANF) to a CI stimulus has been paid special attention during the last decades [17, 18, 21, 22]. Dynamic range, place pitch, or spread of excitation on the auditory nerves has been a focus of interest due to their direct relation to the CI performance [6, 13, 17, 20, 23]. ANF response according to the intra-cochlear electrode position has also been investigated in several studies, in which different and often contradictory conclusions were reported [24,25,26,27,28,29]. To reduce the spread of excitation and obtain a better pitch discrimination, an electrode position closer to the modiolus has been reported to decrease the threshold in which the first response on the ANF is obtained [24, 26]. On the other hand, some authors observed that the distance between the electrode and the modiolus did not affect, or affected minimally, this neural response threshold, or that its relation could not be clearly determined [27, 30]. Generally, a better speech perception of CI patients is obtained when the electrode is completely placed on the scala tympani [28, 29]. Lateral wall electrode arrays provide a higher probability to maintain this intra-cochlear position, whereas perimodiolar electrodes are likely to be misplaced, affecting directly the residual hearing preservation [28]. This issue is still an open discussion, yet it plays an important role when the electrode design needs to be chosen; thus, further studies are required.

The cochlea bears a tonotopic mapping, meaning that there is a frequency-position relation along the basilar membrane of the cochlea, following Greenwood’s function [31]. This provides a frequency map that describes the related frequency ranges that a subpopulation of ANF bundles are sensitive to, according to their location in the cochlea. In particular, high frequencies are related to fibers at the base of the cochlea, and low frequencies to fibers at the apex. The optimal neural response caused by the electrode array is defined by its design, which determines the target frequency ranges. Thus, it is desired for each electrode inside the array to activate only a specific group of ANF bundles directly related to the frequency map by fitting the electrode parameters, such as stimulation amplitude or location, to obtain an optimal performance of the implant for the given patient. However, the characterization of the CI performance in terms of the computational ANF response has not been attempted before. Thus, it remains unclear whether the actual neural response can be used to provide an estimated measure for the further optimization of the implant performance.

In previous studies, we proposed an automatic framework for the generation of computational models of CI [32]. In this work, we further develop this model to compute the final neural response after CI. It includes a complete automatic in silico assessment of the ANF activation on an improved computational CI model. The main objective of this article is to provide an automatic framework able to estimate and quantify the neural response evoked by the predicted intra-cochlear potential generated by the implant. This framework allows accounting for patient-specific anatomy, as well as surgical and electrical stimulation parameters that affect the surgery outcome [32].

Here, we present the development of the complete in silico framework and the results obtained on specific CI scenarios. The methodology includes the generation of the finite element (FE) model (“Finite Element Model” section) and, following, the FE electrical simulations (“FE Simulation—Electric Conduction Model and Stimulation Parameters” section). The obtained electric field causes a response of the ANF (“Human Auditory Nerve Fiber Model” section) that provides information to assess the general intra-cochlear activation and quantify the overall CI performance on the specific patient (“Activation Map and Performance Evaluation” section). The framework has been applied to evaluate the effects of (1) ANF degeneration and (2) the electrode intra-cochlear position on the activation thresholds according to the stimulation protocol employed.

Methods

Finite Element Model

Creation of a patient-specific CI model can be approached in different ways. A common approach is to obtain a simplified spiral 3D model created from a 2D cross section of the cochlea and the electrode array [19, 22, 26]. In contrast, our model is based on high resolution anatomical images of the inner ear to create a detailed 3D model, rather than on geometrical simplifications. The step-by-step pipeline of the developed framework is shown in Fig. 1, starting from the clinical CT of the patient (Fig. 1, A), and resulting in the evaluation of the neural response (Fig. 1, I). All elements of the CI computational model are created by applying our automatic framework for the generation of personalized meshes for FE modeling [32]. The framework includes a statistical shape model, an electrode array virtual insertion simulation algorithm, and the generation of a volumetric mesh of the inner ear. Anatomical μCT images were used to construct the highly detailed statistical model, process described in more detail in [32, 33]. The creation of a statistical shape model of the inner ear allows us to generate virtual patients, by sampling the model. More importantly, it allows obtaining the patient-specific detailed anatomical cochlear shape from conventional low-resolution patient images, by aligning and fitting the model to the image using a non-rigid B-spline image registration [34] (Fig. 1, B).

Fig. 1
figure 1

Evaluation pipeline of the cochlear implant. From the clinical CT of the patient (A), the patient’s cochlear surface (B) is extracted by the statistical shape model fitting procedure. A virtual surgical insertion (C) is performed with the desired surgical parameters. By segmentation from CT and MRI data from open-access repositories (D), a generalized head model (E) is created to be coupled with the patient’s cochlear surface according to measurements extracted from MRI images of the cochlea localization. A full head tetrahedral finite element mesh (F) is thus generated, allowing us to perform computational electrical simulations. Then, the potential field created by the implant stimulation (G) is obtained and, through the human auditory nerve fiber model (H), the neural activation (I) is computed

Once the surface of the cochlea was obtained (Fig. 1, B), the virtual insertion of the CI electrode was performed (Fig. 1, C). This insertion was done by simulating the real scenario of the surgery using our surgical planning software, thus obtaining a realistic trajectory line of insertion, and then matching the centerline of the electrode array geometry with the trajectory line of the insertion [32, 35]. An electrode array design with 12 electrodes (19 contacts) was used based on Med-EL Flex28 design. The ANF around the cochlea were generated automatically according to the patient’s cochlear shape, considering different anatomical landmarks, such as the organ of Corti and the beginning of the basilar membrane [32].

Generalized models of the brain, scalp, and skull were extracted from VHP CT Dataset (University of Iowa, Magnetic Resonance Research Facility) (Fig. 1, D, E) and they were coupled with the cochlear surface and electrode array virtually inserted (Fig. 1, F). The reference electrode was located on the scalp, according to its prescribed placement on implant housing. The simulation of this reference electrode has proved to have an important role on the current paths predicted by the electrical conduction model and consequently, on the neural excitation patterns [15, 36]. All elements were merged and transformed into a single volumetric mesh (Fig. 1, F), leading to a mesh free of intersections of approximately 2·106 tetrahedral elements. The mesh quality was checked to ensure a good convergence on the FE simulation [32]. Temporal bone conductivity was updated based on measurements in patients of electric field imaging (EFI) [6]. For more details on the implementation of the FE simulations, see [32].

FE Simulation—Electrical Conduction Model and Stimulation Parameters

The electrical FE simulations were carried out considering an electrical conduction model and using the electrostatic solver of the open source Multiphysics software Elmer [37]. By solving the Poisson equation (Eq. 1) in a quasi-static approximation regime, we obtained the electric field created by the CI stimulation (Fig. 1, G):

$$ \nabla \cdot \sigma \nabla \phi =\frac{\partial \rho }{\partial t} $$
(1)

where ∇ is the gradient operator, ρ is the electric conductivity, σ the electric potential, and ϕ the total current density. Different stimulation strategies can be set up in a CI, regarding the configuration of active electrodes. Monopolar, bipolar, and tripolar protocols are the most common ones. Monopolar stimulation consists in configuring one intra-cochlear electrode as a source, while the reference is defined as an extra-cochlear electrode placed on the implant housing positioned on the bone [12]. This generates a higher voltage spread than other stimulation protocols [19, 38]. Bipolar stimulation places the reference on the intra-cochlear adjacent electrode, producing a more focused electric field, while tripolar stimulation uses two adjacent electrodes as reference. In this work, monopolar (MP) stimulation was used to assess the excitation spread in all ANF, considering the ground electrode in the implant housing located on the scalp (Fig. 2).The stimulus level delivered by the active electrode is directly related to the excitation current spread [17]. In previous studies, the amplitude of the stimulating current has been established in a broad range, which varies from 150 μA to 5 mA [6, 12, 13, 19, 26]. However, in the clinical practice, a narrower range (from 200 μA to 1 mA) of stimulation amplitudes are commonly selected. In the current study, the maximum stimulation amplitude corresponds to the maximum one which is clinically used. In this way, it is possible to assess the maximum spread of excitation observed in the patient’s cochlea (Fig. 1, G), and thus, to evaluate accordingly the response of the ANF in the patient-specific case.

Fig. 2
figure 2

Potential field distribution. From left to right, zoom in captures of the whole model that illustrate the potential field distribution from different perspectives when the monopolar stimulation strategy is used. In this example, the 12th electrode (E12) is activated, while the reference electrode is set to ground

Since the model was considered purely resistive, the potential field obtained for different stimulation amplitudes was proportional to the one generated by a current of 1 mA (see Fig. 4). By adapting this parameter, we determined the minimum amplitude needed to excite the desired range of nerve bundles. That is, we set the parameters to exclusively stimulate the target group of fibers that the specific electrode design aims to activate according to the tonotopic mapping of the cochlea. This allows evaluating, for instance, the cross-talk zones—activation of wrong frequencies. On those, the nerve fibers are stimulated by a higher voltage spread than the expected one, and consequently they have a non-desired neural response. The desired voltage spread can be defined as the one that exclusively activates the target nerve fiber bundles, avoiding any cross-talk (see “Activation Map and Performance Evaluation” section for further details).

To assess the response of the ANF, the temporal response of the stimulation needs to be computed in a transitory state. This stimulation is computed via the generation of pulses. Commonly, the MP strategy employs biphasic pulses in the clinical practice. Anodic pulses are defined when the return electrode is set to ground and the intra-cochlear electrode is the current source. On cathodic pulses, the configuration is reversed, the return electrode on the scalp acts as current source and the intra-cochlear electrodes as ground. In our experiments, each pulse had a duration of 100 μs with a biphasic cathodic-first pulse. The potential field obtained (Fig. 2) was used as an input to the nerve fiber model, needed to initiate the activation response [17,18,19, 21].

Human Auditory Nerve Fiber Model

A Hodgkin-Huxley (HH) compartment model was used to reproduce the membrane excitable behavior of the neural cells [39]. The model provides the kinetics for activation and inactivation of the membrane ionic channels. Since the original implementation, proposed in the late 1950s, and based on the behavior of the neural activation of a squid, several modifications have been proposed. Rattay etal. [18] presented a cable model of the human ANF, where each of the compartments that form the fiber was represented by an equivalent electrical circuit (see Fig. 1, H). Each compartment has an external stimulation—extracellular potential—corresponding in our case to the outputs obtained from the FE simulation. The model accounts for the temperature effect found in humans and incorporated a speed up factor for the faster opening probabilities of channels. The multi-compartment model with human ANF morphology was implemented according to the description of [22], which also considered previous descriptions reported in [18]. We modeled 330 nerve fiber bundles spaced from each other 100 μm along the organ of Corti of the inner ear. Along the human cochlea, there exist around 30,000 nerve fibers in the healthy cochlea and the mean length of the organ of Corti is 33.13 mm [40]. Thus, each fiber bundle computationally modeled contains 90 actual neural fibers, which retain enough frequency resolution along the cochlear duct [13].

The human ANF is composed by the peripheral axon, comprised by six internodes (which are removed in case of degeneration [18]), the pre-somatic region, the soma, and the central axon with 16 internodes (see Fig. 3). In total, a single fiber encompasses 47 compartments. The internodes of the peripheral and central processes have 40 and 80 layers of myelin, respectively, and the pre-somatic region soma, 4 layers [18, 22]. The internode membranes have zero conductivity due to the insulating layers of myelin. The leakage conductance is inversely proportional to the layer of myelin: the higher the insulation of the myelin, the smaller the leakage.

Fig. 3
figure 3

Illustration of an auditory nerve fiber. The impulse, or activation, created on the peripheral axon travels along the nerve fiber towards the central neural system (CNS) (Adapted from [22])

Following Rattay’s approach [18], the voltage potential of the neural cell membrane, V m, can be expressed in terms of the capacitance, C m, and the capacitive current, I m, of the compartment as follows:

$$ \frac{dV_{\mathrm{m}}}{dt}={I}_{\mathrm{m}}/{C}_{\mathrm{m}}. $$
(2)

The input and output currents (Fig. 1, H) of the current node are defined as follows:

$$ {I}_{\mathrm{in}}=\frac{\left({V}_{\mathrm{m},n-1}+{V}_{\mathrm{e},n-1}\right)-\left({V}_{\mathrm{m},n}+{V}_{\mathrm{e},n}\right)}{R_{n-1}/2+{R}_n/2} $$
(3)
$$ {I}_{\mathrm{out}}=\frac{\left({V}_{\mathrm{m},n}+{V}_{\mathrm{e},n}\right)-\left({V}_{\mathrm{m},n+1}+{V}_{\mathrm{e},n+1}\right)}{R_n/2+{R}_{n+1}/2} $$
(4)

where V e , n is the extracellular potential on the nth compartment and R n the corresponding resistance. According to Kirchoff’s law, expression (2) can be rewritten as:

$$ \frac{dV_{\mathrm{m}}}{dt}=\left(-{I}_{\mathrm{ion}}+{I}_{\mathrm{in}}-{I}_{\mathrm{out}}\right)/{C}_{\mathrm{m}} $$
(5)

where I ion is the ionic current from the sodium, potassium, and leakage channels (HH-model).

Activation Map and Performance Evaluation

Evaluating the global stimulation pattern of a cochlear implant is not a straightforward process. In the clinical practice, there exist telemetry measurements, such as evoked component action potential (eCAP) that allow analyzing the response of the ANF to an electrical stimulus. In computational models, a direct approach is to evaluate such a response assessing the action potential generation—neural response to a stimulus—when delivering the electrical pulses. If this stimulus is able to create a spike that passes the soma, the spike is assumed to reach afterwards the central neural system, and therefore, the nerve fiber is considered to be excited.

This is the common approach to assess the neural response in CI models. However, the quantification of the outcome has not been addressed before due to the complexity of the real parameters that influence the intra-cochlear electrical potential generated by the CI. Such quantification is nonetheless of high relevance to provide information that can help pre-operatively on CI electrode array design decisions. Thus, we propose an evaluation measure to assess the implant neural activation performance. For this, we first define a local measure to evaluate each individual simulation in relation to the stimulating current intensity and the neural response obtained. Then, we obtain the optimal stimulation by defining the required stimulus intensity levels to cause ideal activation of the ANF with minimal cross-talk. Finally, from this optimized activation, a global performance measure is computed. In the following, we explain this procedure in detail.

For the quantification of the neural response, a local activation map is computed. This map encompasses the neural response of the ANF of the cochlea for all 12 electrodes from the array (Fig. 1, I). As described above, each electrode has its own target frequency bandwidth according to the cochlear tonotopic mapping and its location inside the cochlear duct. Figure 4a shows an example of a desired activation map, stimulating selectively only the target ANF without any cross-talk. Using the proposed neural computational model, the obtained ANF activation is compared with the desired activation map (Fig. 4b). This leads to a mismatch activation map, which indicates the frequencies that have been properly or wrongly excited—true or false positives—or missed—false negatives (see Fig. 4c). Once the local activation map is computed, the neural response is optimized by defining the threshold level that produces the closest neural response to the desired one. To cover the range of stimulation amplitudes used on the clinical routine, a series of local activation maps for different stimulation amplitudes, from 200 μA to 1 mA, are obtained. The higher the amplitude, the wider the range of ANF activated.

Fig. 4
figure 4

Examples of activation maps (horizontal axis: stimulated electrode from the apex (E1) to the base (E12); vertical axis: frequency mapping of the cochlea). a Target frequency ranges according to the electrode array design (Flex28, Med-EL). b Example of a patient-specific activation map for a monophasic (cathodic) stimulation protocol. c Mismatch activation map

The excitation of the ANF outside the desired bandwidth, known as cross-talk, plays an important role since it impairs hearing perception. For this reason, a local activation performance measure is important to quantify the mismatch for each electrode and intensity of the stimulation (Fig. 5b) and eventually, assess the global activation performance. This local activation measure δ i , e quantifies the performance of the excitation profile, f i , e (x), obtained for the electrode e and the impulse amplitude i. The strict excitation limits defined on Fig. 4a—and modeled for the electrode array design—are unrealistic in clinical practice, since the real voltage spread does not provide such a delimited field. Assuming this, each target bandwidth was modeled as a modified Gaussian distribution in such a way that locations that need to be excited are assigned positive values and otherwise are negative. Thus, this weighting function, w e (x), defines the ideal activation profile of the ANF. For each activation map, the weighting function corresponding to the different electrodes is applied (see Fig. 5b). The weighting function penalizes the excitation out of the ideal bandwidth by defining w e (x) > 0 for x ∈ [μ e  − σ e , μ e  + σ e ] and w e (x) < 0 otherwise, where μ e is the central frequency target and σ e the bandwidth limits, both depending on the electrode.

Fig. 5
figure 5

Local performance measure for the 5th electrode when it is delivered an impulse of a 300 μA (δ = 91%) and b 800 μA (δ = 47%))

The local measure is defined by \( {\delta}_{i,e}={\int}_0^{\infty }{f}_{i,e}(x)\cdot {w}_e(x) \), where δ i , e  ∈ [p min,  p max] with \( {p}_{e,\min }={\int}_0^{\infty }{w}_e(x)-{\int}_{\mu_e-{\sigma}_e}^{\mu_e+{\sigma}_e}{w}_e(x)\kern0.5em \) and \( {p}_{e,\max }={\int}_{\mu_e-{\sigma}_e}^{\mu_e+{\sigma}_e}{w}_e(x)\kern0.5em \).

The midpoint of each electrode range [p e , min,  p e , max] was defined as 50% the performance, thus, mapping p e , min and p e , max , 0 and 100%, respectively. Measures with no ANF response were considered as a zero value. The best local performance measure was chosen for each electrode which is equivalent to select the amplitude threshold which provides the closest excitation to the desired one. Then, a global activation map, composed by the excitation profiles of the identified thresholds, was computed (Fig. 6a). The mismatch activation map with a higher likelihood between the actual and the ideal ANF excitation was obtained (see Fig. 4c). Finally, the global performance is computed as \( {\Phi}_i=\sum_{e=1}^N{\delta}_{i,e,\mathrm{optim}}/N \), where N is the number of electrodes in the array. This global measure and the optimized mismatch map enable to evaluate and compare easily the performance of CI simulation according to the electrode design, the patient, and surgical parameters.

Fig. 6
figure 6

Performance measures and activation maps for a an optimized case (Φ i  = 70 % ) and local ones for b 400 μA (Φ i  = 65 % ), c 600 μA (Φ i  = 51 % ), d 800 μA (Φ i  = 37 % ), and d 1000 μA (Φ i  = 32%)

Results

The developed computational framework was employed to analyze the CI induced neural activation patterns on four different cases with distinct intra-cochlear position of the electrode array (lateral wall or perimodiolar) and ANF conditions (normal, degenerated). Firstly, positions of perimodiolar and lateral electrode array overlaid in a single cochlear 3D representation are presented in Fig. 7a. Potential field distributions and the resulting ANF excitation are shown in panels b and c of Fig. 7, respectively. These results correspond to the excitation profiles caused by activation of electrode 4 (E4), located at the medial-apical part of the cochlea and targeting the activation of ANF tuned to lower frequencies (central frequency 0.8 Hz). The range of extracellular potential applied at the ANF model as input stimulus (Fig. 7c) was proportional to the one caused by 1 mA, shown in Fig. 7b. The perimodiolar case shows a higher non-specific potential field on the basal part, from the beginning of insertion to the first half turn. Both activation profiles show that the higher the stimulus amplitude, the higher the cross-talk, as the electrode activates non-specifically also ANF tuned to higher frequencies (central frequency 12 kHz, and 12 and 4 kHz for lateral wall and perimodiolar array, respectively), which are anatomically relatively close to the stimulating electrode due to the spiral shape of the cochlea. On the one hand, the perimodiolar position requires a higher stimulus intensity (500 μA) to reach the whole desired frequency range of excitation (central target frequency 0.8 Hz) while the lateral case requires lower intensity (300 μA). On the other hand, the lateral wall configuration shows an overall higher voltage spread at lower stimulation intensities, and thus, less specific ANF excitation (Fig. 7c). Importantly, this spread is narrowed by decreasing the stimulus amplitude delivered, effect also seen in Fig. 8.

Fig. 7
figure 7

a Models for perimodiolar and lateral wall position. Only cochlear and array meshes are displayed. b Potential filed distribution, E4 set as active source for 1 mA. c Activation profile E4 within an amplitude range [200, 1000 μA]

Fig. 8
figure 8

Activation maps for a lateral wall position and b periomodiolar position in healthy conditions; and lateral (c) and periomodiolar (d) for a case of degeneration of the ANF. Color scale shows the intensity of the stimulus

Results of the neural response of both intra-cochlear positions are presented in Fig. 8. As mentioned above, the activation map contains information about the global ANF excitation caused by all 12 electrodes of the implant. To achieve that, activation profiles for each electrode of the array were calculated, similarly to Fig. 7c, and obtained results were transformed in a single activation map representing the activation of ANF with a given characteristic frequency evoked by stimulation of a given electrode with specific current amplitude. The presented results show the effect of the ANF degeneration (Fig. 8c, d), combined with both location approaches (Fig. 8a, b). As expected, in all cases a higher stimulation amplitude causes a wider, less specific, activation of the ANF. Consequently, as the intensity decreases, the range of excitation is narrowed. Cross-talk effect is present in all cases, although reduced in degenerated ones. The simulation indicates that if the stimulation amplitude is reduced to minimize the cross-talk, the target frequencies might be missed. This can be demonstrated in the lateral wall position, non-degenerated case, for the electrodes E1 and E2, which require an amplitude of 600 μA to activate the target frequencies, from 150 to 700 Hz, but the non-specific neural response already started at high frequencies for an amplitude of 300 μA or below (Fig. 8a). This effect can be seen also in Fig. 8c, d for the degenerated cases. However, in these cases, higher intensities are required due to the degeneration of the ANF, thus a higher potential field is needed to obtain the desired excitation, which results in, as a consequence, the appearance of cross-talk. The perimodiolar approach for the case with ANF degeneration (Fig. 8d) does not cause cross-talk up to 500 μA. However, higher amplitudes from 600 μA are required to excite the desired range of ANF. On the other hand, Fig. 8c shows that lateral wall electrode provides a lower activation threshold for the electrodes located on the base, while on the apex cross-talk starts from 400 μA.

Figure 9 presents dependence of local activation performance of the electrodes—directly related to the specificity—with amplitudes of the currents delivered. The trend in all four cases is a non-monotonic decrease of the activation performance measure (δ i , e ) as the amplitude increases. In the degenerated cases (Fig. 9c, d), this measure is the highest for the threshold level, the smallest current for which non-zero neural response is obtained. On these, measure performances are generally decreased compared to the healthy case. This is also due to the penalization of the cross-talk effect. When there is a degeneration of the ANF, the optimal intensity is 100 to 500 μA higher than the healthy case (see values in Table1). This causes cross-talk on the optimized stimulus amplitude (Fig. 8c, d). In the lateral wall position and non-degenerated case (Fig. 8a), the local activation performance further decreases with increasing current amplitude below the threshold for electrodes E5 to E9, but does not change or slightly increases for other electrodes. This suggests that it is important to identify electrodes highly sensitive to stimulation current amplitude in terms of frequency specificity and not to overstimulate ANF with these, but rather to stay close to the threshold if specificity is the primary scope during the patient post-operative fitting. The electrodes in the perimodiolar and non-degenerated case (Fig. 8b) as well as the degenerated cases (Fig. 8c, d) do not seem to show such negative behavior for higher currents than threshold ones, and larger current stimulation does not have any detrimental effect in terms of frequency selectivity.

Fig. 9
figure 9

Local performance at a lateral wall and b periomodiolar position in healthy conditions, and at lateral (c) and periomodiolar (d) position for a case of degeneration of the ANF

Table 1 Optimized values (A optim, optimized amplitude (in mA); δ i , e , local performance measure)

The data from Fig. 9 are summarized in Table1. It shows that optimized amplitudes and their corresponding local performance values highly differ from one case to the other. In particular, degeneration of the ANF affects the CI performance by increasing the amplitude needed to achieve the excitation of the ideal frequency bandwidth. Some specific cases on the lateral electrode show decreases from 7% (E9) to 47% (E5) compared to the healthy case. For periomodiolar insertion, these values are decreased up to 79% (E4). The presented results show that the apical electrode (E1) provides higher spread, which provokes wider bandwidth activation. In case of neural degeneration, this excitation bandwidth is reduced and, in the same way, also the cross-talk at the base. This explains why for 300 μA (δ i , e = 73%), an improvement of a 23% in its performance is obtained, compared to the healthy case, for which the best measure is obtained at 200 μA (δ i , e = 50%). However, this is not observed with periomodiolar electrodes, where higher intensities are required, reducing their performance. Some cases, such as electrodes located at the base, show a decrease of the amplitude with a slight improvement.

The optimized stimulation intensities for the four cases presented in Table1 form each one a global activation map. By comparing to the target activation map (Fig. 4a), the mismatch map provides a fast visualization of the global CI performance (Fig. 10). ANF correctly activated or missed are clearly identifiable by means of the provided color code. Generally, mismatch maps with high values indicate that the cross-talk remains. This effect is penalized in the proposed performance measure (“Activation Map and Performance Evaluation” section). In particular, perimodiolar electrodes do not provide the assumed better performance in cases of ANF degeneration (Φ i  = 50% against Φ i  = 52% for the lateral case), while lateral electrodes decrease significantly the cross-talk effect. This suggests that perimodiolar electrodes may be located too close of the soma of basal ANF, thus, creating a high cross-talk due to the high intensities (from 0.7 to 1 mA) required to activate the target apical ANF. Figure 10c, d suggests that intra-cochlear position needs to be carefully selected when neural degeneration occurs. High cross-talk is present in both cases, and this reduces the ideal frequency bandwidth effectively excited. We believe that this is due to the short distance from the source electrode located on the apical part to the soma of the basal ANF.

Fig. 10
figure 10

Mismatch optimized activation map. a Lateral wall position and b periomodiolar in healthy conditions. Lateral (c) and periomodiolar (d) for a case of degeneration of the ANF

Discussion and Conclusion

We have presented a complete framework for the functional assessment of neural excitation with CI based on patient-specific factors, as well as on surgical and stimulation parameters. A new measure of CI performance is proposed to assess the resemblance of the actual stimulation with the target neural response. By evaluating the activation maps, we can set different stimulation parameters to optimize the activation and quantify the goodness of fit of the actual excitation provoked by the implant. The developed framework has been used to assess the neural response in different scenarios accounting for the electrode placement inside the cochlear scala tympani and the state of degeneration of the nerve fibers.

Initial results show that the framework provides reliable outcome results, which are consistent with previous findings in the literature and the clinical practice. Most importantly, similar potential fields were obtained emulating previously reported simulation parameters [15, 20]. Values obtained for the intra-cochlear voltage were in agreement with clinical measurements (within the range 0.19–0.35 V for a stimulus of 300 cu, equivalent to 300 μA approximately). Normalizing the intra-cochlear voltage with the current injected, obtained impedance values were in concordance with EFI (electric field image) measurements from patients reported on previous studies [41,42,43]. In addition, telemetry measurements, such as eCAP recordings, indicate that neural activation in ANF is initiated at 333 ± 114 cu [44]. This value, known as eCAP threshold, depends on the region of the cochlea where the stimulating electrode is located, and increases towards the base up to approximately 350 cu [27, 44]. This effect is also seen in our simulations in the case of the lateral wall position, when the first ANF response is obtained at 300 μA, while in the apical region neural responses at lower amplitude are obtained. However, this is not observed in degenerated cases, in which the apical fibers require higher stimulus amplitudes. This suggests that ANF located at the apex is more affected by the degeneration of the peripheral process in terms of performance.

The proposed global and mismatch activation map provides a useful optimization tool for pre-operative selection of optimal CI electrode array according to the patient anatomy and post-operative implant fitting of stimulation parameters for each patient. It allows us to visualize clearly the general neural activation pattern of the implant and localize easily cross-talk effects. The presented results show a high presence of cross-talk in the apical region, which is common on CI patients. This effect is penalized in the proposed performance measure, which presents a clear and intuitive approach to quantify the performance of the patient-specific model. The quantification of the CI performance had not been attempted before, due to the complexity of the CI system and the interplay of the many factors that affect its performance. Our proposed measure seems to provide a reliable overview of this CI outcome.

The perimodiolar electrodes were originally designed to obtain the same performance with a lower stimulus threshold—compared to the threshold needed for straight electrodes inserted in a lateral wall position. However, the perimodiolar approach uses an electrode design that may cause severe internal trauma due to the penetration of the array and translocation to scala vestibuli, while having the same efficacy as lateral electrodes in terms of eCAP measurements [28]. There is still controversy about whether the electrode-modiolus distance has an influence on the CI outcome [20, 24, 26, 27, 29, 45]. Some authors determined that this distance does not make a difference on the neural activation, while the insertion depth plays an important role [27]. Others suggested the need of more research and to find an optimized design to preserve the structure and achieve the best outcome [46]. Our results show that lateral wall electrodes perform better for the healthy ANF case, while only slight differences are obtained in case of ANF degeneration. These results reveal that, for the cases under study, intra-cochlear electrode position plays a role on the ANF excitation. However, further studies are required to establish a correlation between these parameters and their performance. We believe that the computational tools presented in this work can be valuable to perform such studies.

Some limitations have been identified in the proposed computational framework. Although the generation of the model has been improved regarding previous works, nerve fiber geometry needs further improvements. Part of our current work goes towards more realistic anatomical shape of the ANF, improving spatial distribution along the cochlea and neuron density, similarly to [6]. Moreover, obtaining more accurate clinical data to validate computational results, such as CI telemetry measurements, is of prime importance to fully test computational results. However, the extraction of some experimental data involves clinical testing, which prolongs surgical time and patient anesthetization period. Such measurements should be attempted in the future to validate the obtained optimization results.

It would be interesting to use the proposed framework to assess patient-specific models retrospectively, where the electrode position can be obtained from post-operative data. In addition, including the generation of eCAP response in the model could be an important improvement to directly compare the computational ANF activation with clinical measurements. For this, further development of the temporal neural response is required to interpret the refractory effect seen in clinical eCAP telemetry. It is believed that electrode encapsulation affects eCAP measurements, since it creates an important change on electrode impedance. Thus, adding the electrode-tissue interface could provide more accurate results and consequently, predict long-term CI performance. Overall, the work presented here contributes with an automatic tool able to provide pre-operative predictions of the outcomes of the CI. This information can be used to optimize surgical and stimulation parameters for a given patient, or to assess the outcomes on a population of patients accounting for uncertain and variability in parameters such as cochlear shape anatomy [3] or tissue electrical conductivity [47]. This work is under development and is expected to fully cover the whole spectrum of parameters that can affect the outcome of the CI.