Keywords

1 Introduction

Most cochlear implant (CI) systems allow for the recording of the auditory nerve’s response to an electric stimulus—the electrically evoked compound action potential (eCAP). This objective measure is an important clinical tool to assess the quality of the electrode-nerve interface of a CI recipient (Miller et al. 2008). Routinely, the lowest stimulation level that evokes an eCAP (‘threshold’) is determined. The morphology of suprathreshold eCAP waveforms is usually not evaluated.

1.1 Mathematical Model of the Compound Action Potential

The recorded compound action potential (CAP) is described as the convolution of the unit response (UR, i.e., the waveform resulting from a single action potential) with the compound discharge latency distribution (CDLD), the sum of spike events over time of all individual nerve fibres (Goldstein and Kiang 1958, see also Fig. 1).

$$\text{CAP}(t)=\int_{-\infty }^{\text{t}}{\text{CDLD( }\!\!\tau\!\!\text{ )UR(t}-\text{ }\!\!\tau\!\!\text{ )d }\!\!\tau\!\!\text{ }}$$
(1)

This mathematical model was validated with simultaneous recordings of acoustically evoked CAPs from the round window and single-fibre responses from the auditory nerve in guinea pigs (Versnel et al. 1992b).

Most mathematical models of the CAP are concerned with solving the forward problem, i.e. predicting the CAP by modelling the activation of single nerve fibres and assuming a convolution with the unit response (Teas et al. 1962; de Boer 1975; Elberling 1976b; Kiang et al. 1976; Versnel et al. 1992b; Frijns et al. 1996; Miller et al. 1999; Briaire and Frijns 2005).

A few studies addressed the inverse problem, i.e. predicting the firing probability by deconvolution of acoustically evoked CAPs with a unit response (Elberling 1976a; Charlet de Sauvage et al. 1987). To our knowledge no study investigated the inverse problem for eCAPs. In this study, we perform a deconvolution of eCAPs to examine CDLDs, presuming that they could provide information about the neural status of the auditory periphery. It has been shown that the eCAP in a guinea pig exhibits differences in its morphology depending on the neural health of the auditory nerve (Ramekers et al. 2014, 2015). Therefore, eCAPs from deafened guinea pigs are analysed to assess effects of the status of the auditory nerve on its firing probability. eCAPs from human CI users are evaluated for comparison to the effects observed in the animal model.

2 Methods

2.1 eCAP Recording System

eCAP measurements were performed with MED-EL PULSAR or SONATA CIs (for details see Neustetter et al. 2012). A charge-balanced biphasic pulse with an inter-phase gap of 2.1 μs and a phase duration of 30 μs was presented at an intra-cochlear stimulation electrode in a monopolar configuration. The stimulation amplitude was defined in current units (cu), where 1 cu corresponds to approximately 1 μA. The eCAP was recorded on a neighbouring electrode with a minimum delay of 125 μs from stimulus onset to reduce stimulation artefacts. The recording window was 1.7 ms. Single eCAP recordings were separated by at least 20 ms, assuming a relative refractory time below 10 ms (Brill et al. 2009; Ramekers et al. 2015). Any stimulation or recording artefact was removed using an alternating polarity and zero amplitude template paradigm, respectively (for details see Brill et al. 2009).

2.2 eCAP Data Sets

2.2.1 Guinea Pig

Data were acquired from 18 guinea pigs (Ramekers et al. 2014, 2015), which were divided into 3 groups of 6 animals: normal-hearing (NH), 2 weeks deaf (2WD) and 6 weeks deaf (6WD). Methods of deafening and eCAP recordings are described in detail in Ramekers et al. (2014).

Briefly, profound cochlear hair cell loss was induced by co-treatment of kanamycin and furosemide. It resulted in mild degeneration of the auditory nerve after 2 weeks (25 % loss of spiral ganglion cells, SGCs) and severe degeneration after 6 weeks (60 % loss of SGCs).

The eCAP recordings were performed in isoflurane-anaesthetized animals. eCAPs were evoked and recorded with two electrodes placed in the basal turn through a cochleostomy. In this study, three different stimulation intensities were analysed: just above threshold (first visible eCAP), halfway the input/output function (‘intermediate’), and at maximum stimulation intensity typically corresponding to saturation (800 cu). Waveforms were averaged over 900 iterations.

2.2.2 Human

From a multicentre study (Senn et al. 2012) eCAPs recorded post-operatively with the clinical system software MAESTRO (MED-EL GmbH, Innsbruck) in 52 awake human subjects were selected. The selection criteria were the availability of a pre-operative audiogram and eCAPs of at least 100 μV amplitude. The mean age at onset of hearing loss was 19 years (range: 0–72). The mean age at implantation was 46 years (range: 15–79).

The recordings were manually analysed by three experts. Similar to the animal model, three different stimulation levels were selected: threshold, intermediate, and maximum. In contrast to the animal model, maximum typically corresponded to the loudest acceptable presentation level. Intermediate corresponded to 50 % eCAP amplitude compared to maximum.

The eCAPs were obtained from stimulation electrodes in the apical (contact 2), middle (contact 4 or 5) and basal cochlear region (contact 9 or 10). Waveforms were averaged over 25 iterations and 5 kHz low-pass filtered (fifth-order Butterworth) to remove any remaining recording noise.

2.3 Deconvolution of the eCAP

Following Eq. (1) and Versnel et al. (1992a) the unit response UR is assumed constant and was modelled by Eqs. 2 and 3 with U N = 0.12 μV, σN = 0.12 ms describing the negative and U P = 0.045 μV, σ P = 0.16 ms describing the positive part. The cross point is defined with t 0 = ‑0.06 ms (Fig. 1a).

$$UR(t)=\frac{{{U}_{N}}}{{{\sigma }_{N}}}(t-{{t}_{0}}){{e}^{\frac{1}{2}-\frac{{{(t-{{t}_{0}})}^{2}}}{2{{\sigma }_{N}}^{2}}}},\ t<{{t}_{0}}$$
(2)
$$UR(t)=\frac{{{U}_{P}}}{{{\sigma }_{P}}}(t-{{t}_{0}}){{e}^{\frac{1}{2}-\frac{{{(t-{{t}_{0}})}^{2}}}{2{{\sigma }_{P}}^{2}}}},\ t\ge {{t}_{0}}$$
(3)

Having assumed the same unit response for all contributing fibres, the CDLD can be obtained directly from the recorded eCAP by deconvolution. The deconvolution was performed in the frequency domain with

$$CDLD(t)={{\mathcal{F}}^{-1}}\left( \frac{\mathcal{F}\left( eCAP(t) \right)}{\mathcal{F}\left( UR(t) \right)} \right)$$
(4)

where \(\mathcal{F}\) represents the Fourier transform and \({{\mathcal{F}}^{-1}}\) the inverse Fourier transform.

Fig. 1
figure 1

The assumed elementary unit response of a single nerve fibre is shown in a. Its convolution with a CDLD (b, thick gray line) results in the CAP waveform shown in c (red line). The monotonic extrapolation of the time-limited eCAP recording is shown in c (blue dashed line). The derived CDLDs obtained by Eq. 4 are shown in b for the full eCAP (red dashed line) and a simulation of the time-limited and extrapolated eCAP (thin blue line)

The baseline was estimated over the final 200 μs and, assuming that the eCAP decays monotonically to baseline before and after the recording window, the eCAPs were linearly extrapolated to baseline (Fig. 1c) before performing the deconvolution. The extrapolation only affected the CDLD before stimulus onset (see Fig. 1b). The deconvolution of the eCAP can be understood as applying an inverse low-pass filter, amplifying unwanted high-frequency noise. Therefore, the extrapolated eCAPs were low-pass filtered by a 50-point moving-average filter applied twice (once in forward and once in reverse direction) to achieve a zero-phase digital filtering. Any remaining high-frequency noise in the CDLD was removed by a 2.5 kHz fifth-order Butterworth low-pass filter.

2.4 Parametrisation of the CDLD

The CDLDs derived from eCAP responses from both data sets exhibited a skewed, quasi-bimodal, distribution (see Fig. 2). To perform further analysis, a parametrisation of the CDLD was performed by a two-component Gaussian mixture model (GMM).

$$CDLD={{a}_{1}}\mathcal{N}({{\mu }_{1}},\ {{\sigma }_{1}})+{{a}_{2}}\mathcal{N}({{\mu }_{2}},{{\sigma }_{2}})$$
(5)

The GMM was fitted to the CDLDs using a nonlinear least-squares regression (nlinfit, MATLAB; Mathworks, Natick, MA, USA) with initial start values being manually optimised if needed to achieve an adequate fit (R2 > 0.95). As outcome parameters we considered μ1 and μ2 (corresponding to the peak latencies), σ1 and σ2 (reflecting peak width), and the ratio of the components a2/a1.

Fig. 2
figure 2

Example eCAPs for maximum stimulation level (upper row) and the corresponding CDLDs and GMM fits (bottom row). The left three panels show examples of each of the three guinea pig groups; the right three panels are from each of the three cochlear regions in one human subject

3 Results

3.1 Guinea Pig Deafness Model eCAPs

Figure 2 shows eCAP and CDLD examples for NH (a), 2WD (b) and 6WD animals (c). In each animal the deconvolution of the recorded eCAPs resulted in CDLDs with a fast rising peak, typically narrow (< 0.5 ms width), in most cases followed by a shoulder-shaped component or a second peak. In those cases the CDLD could be well fitted with the two-component GMM (Eq. 5), remaining single-peak cases (8 % of all CDLDs) could be well fitted with a single Gaussian. Figure 3 and Table 1 show group averages of CDLD parameters. Two variables significantly varied with group (i.e., with degeneration of the auditory nerve): μ1 and a2/a1 (p < 0.05; rmANOVA with stimulation intensity as within factor, and group as between factor). With more nerve degeneration (6WD vs. 2WD and NH) the peak latency μ1 was shorter and a2/a1 was larger.

Table 1 CDLD parameters for guinea pig groups (upper section), and humans grouped by cochlear region (lower section), averaged over stimulation level

3.2 Human Clinical eCAPs

Figure 2 shows examples of eCAPs and corresponding CDLDs for an apical (d), middle (e) and basal (f) stimulation site in one human subject. Human CDLDs showed a morphology comparable to the guinea pig data, with 15 % of CDLDs being single-peaked. Figure 3 and Table 1 show averages of CDLD parameters for different cochlear regions. Assuming that any residual hearing correlates with the auditory nerve’s neural health, subjects were divided into two groups having pre-operative 0.5–4 kHz pure tone averages ≤ 105 dB hearing loss (HL) (24 subjects) or > 105 dB HL (28 subjects). No significant between-subject group effects were found (p = 0.054; rmANOVA with cochlear region and stimulation intensity as within factors, and group as between factor). A significant decrease of a2/a1 with increasing stimulation intensity (p < 0.001) and distance from the apex (apex vs. middle and base, p < 0.05) was found. The latency μ1 was significantly longer in the apical cochlear region compared to middle and base (p < 0.001). The width of the first Gaussian σ1 was significantly smaller in the apex vs. middle and base (p < 0.05) and with increasing stimulation intensity (p < 0.001).

Fig. 3
figure 3

Average CDLD parameters a2/a1, μ1 and σ1 shown for guinea pigs (left) and humans (right). THR threshold, INT intermediate, MAX maximum. Error bars are SEM

4 Discussion

4.1 Comparing Deconvolution Results from Guinea Pigs and Humans

We observed several similarities between the obtained CDLDs of guinea pigs and humans. First, the CDLDs could be described in the large majority of cases by two Gaussian distributions (Fig. 2). Second, the peak latency of the early component decreased with increasing neural degeneration, assuming for the human data more degeneration at the cochlear base than near the apex (Fig. 3, μ1). Notable differences were observed in particular with respect to the width of the early component, which was very robust in guinea pigs, and quite variable with stimulation intensity and cochlear region in humans.

4.2 Choice of UR Waveform

The main challenge of the eCAP deconvolution is the dependence of the CDLDs on the choice of UR. For instance, with a larger UP a2/a1 increases since the P2 component of eCAP can be reduced by increasing the N2. We have chosen our parameters based on experimental estimates of the UR in guinea pigs (Prijs 1986; Versnel et al. 1992a). Therefore, we can be fairly confident about our choice for the eCAP deconvolutions in the guinea pig. The UR in humans is thought to be similar (Briaire and Frijns 2005; Whiten 2007). However, the cochlear morphology is quite different in humans. The UR may change with stimulation site in the human cochlea and explain in part the cochlear region dependent variation of the CDLD seen in the human dataset. Also our assumption of the same UR across the nerve population might be too coarse (Westen et al. 2011), in particular considering location-dependent effects of degeneration, such as demyelination and cell shrinkage. To further address this point, a sensitivity analysis of the convolution model is needed.

4.3 Parametrisation of the CDLD

Single-fibre responses to electric pulses (estimated by post-stimulus time histograms) showing a fast rise and a slow decline (van den Honert and Stypulkowski 1984; Sly et al. 2007) may be modelled best with a gamma function (Versnel et al. 1992b). However, the distribution of response latencies between fibres may be Gaussian. Therefore, we tested fits of CDLDs with gamma functions and with Gaussian functions, and both in guinea pigs and humans we observed superior fits with Gaussian functions. Due to the large population size and assuming that the discharge jitter is smaller within than between fibres, the resulting convolution of the gamma function with the Gaussian distribution could explain the Gaussian-like CDLD.

4.4 Interpretation of CDLD Parameters

In most cases a2/a1 is significant, in both guinea pigs and humans. This suggests either two subpopulations of fibres and/or neural elements (peripheral vs. central site of excitation) with different latencies and proportions, or repeated firings of the same neurons. The latter option is supported by the following arguments (see also in Ramekers et al. 2015). First, the interval between the two components, μ2–μ1, is about 0.5 ms, which is around the absolute refractory period estimated from masker-probe responses. Second, the N2–N1 interval of the eCAP waveform has been found to correlate with recovery measures (Ramekers et al. 2015).

Alternatively, a reduced relative contribution of the second component with increasing stimulation intensity and more basal position within the cochlea as observed in humans could be explained by the former option. The late component can be ascribed to excitation peripheral to the cell body (Stypulkowski and van den Honert 1984) which is thought to occur to a larger extent near threshold (Briaire and Frijns 2005).

5 Conclusion

Obviously, there are multiple differences in experimental settings and biology between the guinea pig and human, which may contribute to different CDLD outcomes. Among others, cause of deafness, cochlear anatomy, duration of deafness, and rate of neural degeneration differ, and there is awake vs. anaesthetized state, chronic vs. acute cochlear implantation, and long vs. short electrode array. Taking these differences into account, a few characteristics appeared to be quite robust across species and experimental setting. Importantly, the deconvolution of the eCAP revealed two separate firing components, which could not easily be detected from the eCAP waveforms themselves. The ratio of the components and the latency of the early component came out as potential markers to assess the condition of the auditory nerve.