Introduction

Development of Human Cochlear Anatomy

The study of human cochlear development has focused on hair cell differentiation and innervation, hair-bundle development and organization, morphogenesis of the tectorial membrane, and maturation of the stria vascularis (Tanaka et al. 1979; Igarashi and Ishii 1980; Sánchez-Fernández et al. 1983; Pujol 1985; Pujol and Lavigne-Rebillard 1985; Lavigne-Rebillard and Pujol 1986, 1987, 1988; Lavigne-Rebillard and Bagger-Sjöbäck 1992; Lim and Rueda 1992). Many reports have also described gross developmental gradients, such as the maturation of the cochlear base before the apex, the maturation of inner before outer hair cells, and the innervation of sensory cells by afferent before efferent nerve fibers (e.g., Retzius 1884; Bast and Anson 1949; Bredberg 1968; Pujol and Hilding 1973).

Collectively, this anatomical work has established that the human cochlea is fully coiled by the 9th embryological week (Pujol and Lavigne-Rebillard 1985), though growth and elongation of the cochlear segment of the labyrinth continue through 17–19 gestational weeks, when it attains adult-like dimensions (Jeffery and Spoor 2004). By around 9–10 weeks, afferent nerve fibers enter the undifferentiated epithelia of the otocyst. Two weeks later, hair cells can be clearly distinguished (at least in the basal half of the cochlea; outer hair cells are not yet recognizable in the apex). Both afferent and efferent inner hair cell (IHC) patterns of innervation are nearly adult-like by the 14th gestational week; however, arrival of the medial efferent endings on the outer hair cells (OHCs) occurs much later (20–22 weeks) (Pujol 1985). Stereocilia formation begins on the IHCs by 11–12 weeks and a week or so later on OHCs. The stereocilia rapidly become appropriately graded in length, and the tip and lateral links are observable. However, not until 20–22 fetal weeks does an adult-like pattern of stereocilia form on OHCs (Igarashi 1980). The shape and innervation of the OHC also remain immature longer than that of the IHC; it assumes its characteristic elongated shape about the time that the large efferent synapses are formed, though post-synaptic specializations remain immature into the third trimester. The time point by which sensory cells of the human cochlea reach their terminal mitosis has not been well defined, but in other mammals, this milestone occurs early in gestation. For the basal mouse cochlea, sensory cells are postmitotic by embryological days 15–16 (Ruben 1967; Roberson and Rubel 1994; Lim and Brichta 2016).

In general, these anatomical studies indicate that the micromechanical elements of morphology giving rise to active cochlear processes are mostly complete by the end of the second trimester although changes to outer hair cells and their efferent innervation continue into the 3rd trimester.

Development of Human Cochlear Function

Over the last several decades, otoacoustic emissions (OAEs) have provided a tool for investigating the functional development of the human cochlea. However, immaturities in the acoustics of the newborn ear canal and anatomy of the middle ear, both of which are developing in early infancy, can confound OAE interpretation (see Abdala and Keefe 2012). An exception to this is the phase of the cubic (2f1f2) distortion-product OAE (DPOAE), which is not impacted by changes in the stimulus level over the range used for most measurements (L1 = 45 to 65 dB SPL; Abdala et al. 2011a). This makes DPOAE phase a particularly useful feature to gauge cochlear status in newborns without contamination from immaturities of the outer and middle ear.

DPOAE phase has been used as a noninvasive tool for assessing deviations from cochlear scaling symmetry in newborns and adults. In laboratory animals, deviations from perfect scaling are evident in the widths and shapes of neural tuning curves. Although tuning curves vary only slowly with characteristic frequency (CF) in the basal half of the cochlea, consistent with approximate local scaling, the variations become much more pronounced in the apical half (e.g., Temchin et al. 2008). When the cochlear tonotopic map is exponential, scaling implies that excitation patterns produced by tones of different frequencies are simply translated along the basilar membrane such that traveling waves accumulate the same phase regardless of where they peak along the cochlear partition (Zweig 1976). In the fixed-ratio DPOAE paradigm, the two stimulus frequencies (f1 and f2) used to evoke the DPOAE are changed while their ratio (f2/f1) is held constant. At or near an optimal ratio of 1.2, DPOAE phase remains approximately invariant across much of the frequency range, consistent with a scaled cochlea (Shera et al. 2000). However, for f2 frequencies below approximately 1–2 kHz, the phase of the distortion emission begins to increase as frequency decreases. The emergence of this non-zero phase slope is consistent with the existence of a mid-cochlear break in scaling within the human cochlea analogous to that seen in laboratory animals.

Although the location of this “break” frequency does not change during the human lifespan (Abdala et al. 2011b; Abdala and Dhar 2012), the slope of DPOAE phase below the break frequency does—newborns have a significantly steeper low-frequency slope than adults and a correspondingly longer phase-gradient delay by nearly 1 ms. This OAE immaturity cannot easily be attributed to the immature middle ear, both because the prolonged delay is too long to result from middle-ear transmission and because the phases of distortion-type emissions are largely insensitive to variations in the effective stimulus level entering the cochlea (Abdala et al. 2011a). Thus, prolonged DPOAE delays appear to indicate residual immaturities in the newborn cochlea. This hypothesis—that the non-adult-like DPOAE phase observed in newborns arises from immature geometry or material properties of the neonatal cochlear partition—provides the impetus for the current study.

Although the microanatomy of the developing cochlea has been well defined and appears to develop mostly in utero, no quantitative comparison of the architecture of the newborn and adult cochlea has been undertaken. The present study fills this void. Properties that control the mass and stiffness gradient of the basilar membrane and organ of Corti dictate the passive cochlear vibration defined by von Békésy (1960) and establish the tonotopic map. We hypothesize that immaturity of structure in the newborn cochlea could produce a non-adult-like tonotopic map along the cochlear partition, in particular in its apical half.

Materials and Methods

Temporal Bones

In this study, we evaluated the anatomy of human temporal bones from the collection at Massachusetts Eye and Ear (MEE). The temporal bones were prepared for light microscopy by standard techniques involving fixation, decalcification, celloidin-embedding, 20-μm horizontal sectioning, and hematoxylin and eosin staining of every 10th section (Merchant and Nadol 2010). The temporal bones were selected from a subset of cases previously screened for normalcy, image quality, and post-mortem autolysis (Makary et al. 2011; Amatuzzi et al. 2011). The adult temporal bone samples were from 17 individuals ranging from 17 to 58 years with a mean age of 40 years. Infant temporal bone samples were from 30 newborns, including 11 premature neonates born between 26 and 34 weeks gestational age (mean = 30 weeks) and 19 full-term neonates born at a mean gestational age of 40 weeks. The newborns survived for a mean of 4 weeks after birth and suffered to varying degrees from clinical conditions such as hypoxia, meconial aspiration, and sepsis (see Amatuzzi et al. 2011 for details). Because initial analysis of the data revealed no differences between term and preterm newborns, and because the data at each post-conceptional age were insufficient to form a maturational continuum, the two infant subgroups were combined for analyses.

Using differential interference contrast optics, we obtained images of mid-modiolar sections, which include up to five cross-sections of the cochlear duct that roughly correspond to tonotopic frequencies of 3.5, 1.2, 0.75, 0.28, and 0.15 kHz. These estimates are derived by converting the distance from the cochlear base in a standard 2-D cochlear reconstruction (e.g., Fig. 1.60 in Schuknecht’s Pathology of the Ear; Merchant and Nadol 2010) to frequency using the Greenwood map for the human cochlea (Greenwood 1990). By measuring all cochlear turns using one mid-modiolar section, we could not ensure an exactly orthogonal sectioning angle for all cochlear locations. In any given half-turn, deviations from an orthogonal sectioning angle affect measurements that include a radial dimension (e.g., basilar membrane or scalae width) but do not affect measurements in the opposite direction (e.g., organ of Corti height).Footnote 1 Errors in section selection will produce the largest effects in the apical turn, where the radius of the arc is smallest and the angular deviation of the sectioning angle from 90° could be the greatest. We can use geometry to estimate the possible magnitude of such effects: In the apical turn (where the typical radius is ~ 1 mm), even an “error” in section selection as large as ± 20% translates into only a ± 6% error in measured basilar membrane width; in the basal turn, the error would be only ± 3%. Importantly, there is no evidence that these errors would be age specific. All images were processed by manually mapping the positions of various anatomical structures, as detailed below, using the software package Fiji (Schindelin et al. 2012).

Measurements

Cochlear Scalae

 For each temporal bone, a low-magnification image (× 2 objective; 0.23 pixel/μm) captured multiple cross-sections through the endolymphatic (scala media) and perilymphatic spaces (scala tympani and scala vestibuli) of the cochlear duct. Polygons outlining the perimeters of the scalae for each cross-section are superimposed in Fig. 1a. Each polygon traces the boundary between the fluid-filled lumen and the lining epithelium. To outline the scala tympani (green in Fig. 1a), we followed the “underside” of the organ of Corti (OoC). With this convention, the OoC is excluded from the scala tympani although electrochemically it is within the perilymphatic space (Tasaki et al. 1954). On the other side of the OoC, the polygon outlining the scala media (blue) ignores the inner spiral sulcus and follows a straight line from the tip of the limbus to the estimated location of the inner hair cells. Finally, we approximated Reissner’s membrane, which separates scala media and scala vestibuli (red), as a straight line between its contacts at the limbus and the spiral ligament. We used the polygons to calculate the areas of the different scalae using Gauss’s area formula. In addition to these polygons, we used a straight line-segment (black, Fig. 1a) to estimate the orientation of the spiral lamina for each cochlear cross-section relative to the mid-modiolar axis. The outlines of the scalae in Fig. 2A have been rotated so that the spiral lamina is oriented horizontally.

FIG. 1
figure 1

Description of the measurements. a Image of a mid-modiolar, cochlear section from the left ear of an adult case. For each cross-section of the cochlear duct, the orientation of spiral lamina was approximated by a straight line (black), and the outlines of scala tympani (green), scala media (blue), and scala vestibuli (red) were described with polygons. Scale bar corresponds to 1 mm. b Composite image showing the entire cochlear partition from the basal turn in a. The partition consists of the spiral lamina and the BM. The spiral lamina includes an ossified portion (OSL) and a non-ossified portion, recently dubbed the “bridge” (Raufer et al. 2019). Scale bar, 100 μm. c Composite image showing the organ of Corti (OoC) from the basal turn in a. The outline of the OoC was described by a polygon (blue) that included the basilar membrane (BM; cyan) and the reticular lamina (RL; green) in the outer hair cell region. The BM consists of pars pectinata (pp) and pars tecta (pt), which are separated by the center of the outer pillar cell foot. The thickness (“height”) of BM was measured at two locations corresponding to the midpoints of pp and pt (vertical red lines). Both the BM and RL were fit with straight lines (yellow and red, respectively). The polygon outlining the OoC was also used to obtain the maximum OoC height (vertical black dashed line). The outline of the tunnel of Corti (ToC) was described by a polygon (white) that followed the inner and outer pillars. The scale bar corresponds to 20 μm. d The polygon describing the ToC (black) was used to calculate a deformation index. For this, the sides of the polygon were straightened out to form a triangle (blue). The areas between the polygon and this triangle were calculated (red), and the sum of these areas was divided by the area of the triangle. In this example, the deformation index is 0.2. Scale bar, 10 μm

FIG. 2
figure 2

Size and shape of cochlear scalae in adults versus newborns. (A) Overlays of the polygons describing the three scalae (SV = scala vestibuli; SM = scala media; ST = scala tympani) for adults (red) and newborns (cyan). The outlines are arranged in four columns that represent different cochlear locations, see the abscissa in (D). Outlines are oriented with the modiolus to the left, aligned on their centroids, and rotated so that the spiral lamina runs horizontally (see also Fig. 1a). The offsets between the different scalae are arbitrary and were chosen for visual clarity. The scale bar (1 mm) applies to all outlines. Scatter plots for the areas of (B) scala vestibuli, (C) scala media, and d scala tympani in the different cross-sections of the cochlear duct for adults (red diamonds) and newborns (cyan circles). The areas were obtained from the polygons using Gauss’s area formula. Data are grouped according to cochlear location. Horizontal bars indicate mean values. Gray horizontal bars indicate that the cochlear location was excluded from statistical analysis due to insufficient observations (N < 8)

Spiral Lamina and Bridge

Using a × 20 objective (2.25 pixel/μm), we imaged the cochlear partition, which divides scala vestibuli from scala tympani and comprises the spiral lamina and basilar membrane, at each cochlear location in multiple, partially overlapping images. At each location, the overlapping images were joined using the Fiji plugin “Stitching” (Preibisch et al. 2009) to show the entire partition (Fig. 1b). The spiral lamina is ossified near the modiolus, and the ossification extends progressively farther out towards the BM as the spiral progresses from apex to base. The (distal) non-ossified portion of the lamina, nearest the BM, has recently been dubbed the “bridge” (Raufer et al. 2019).

To measure the width of the osseous spiral lamina (OSL), we defined the distal endpoint as whichever of its bony shelves extended farthest from the modiolus. On the modiolar side, we defined the initial point as follows: we first identified the minimal OSL thickness (vertical white line in Fig. 1c) and then found the point closest to the modiolus at which the thickness was twice the minimum (vertical blue line in Fig. 1c). We also measured the width of the non-ossified spiral lamina (the “bridge”) as the distance between the distal end of the OSL and the proximal end of the BM.

Basilar Membrane

A third set of tiled images (× 100 oil-immersion objective; 11.36 pixel/μm) was stitched to visualize the OoC cochlear partition (Fig. 1c). In each case, the width of the basilar membrane (BM) was obtained using a polyline (cyan) from the center of the foot of the inner pillar cell on the modiolar side to the basilar crest on the side of the spiral ligament. The center of the outer pillar foot served as a landmark separating the BM into two adjoining segments: pars tecta (pt; inner pillar to outer pillar) and pars pectinata (pp; outer pillar to basilar crest). The widths of these two BM segments sum to the total width of the BM. The thickness (or height) of the BM was measured at the midpoints of pt and pp (vertical red line segments).

Organ of Corti

 In the same stitched image (Fig. 1c), a second polyline (blue), continuous with the polyline defining the BM, traces the boundary of the OoC. To define the reticular lamina (RL), we used the polyline’s vertices located over the apical aspects of the outer hair cells (green); these points were fit by a straight line (red). The slope of this line was compared to the slope of the straight line fit to the BM vertices (yellow) beneath the RL to obtain the angle between the RL and the BM. The combined polygon (cyan + blue) was used to calculate the total OoC area and determine the maximum OoC height (dashed black line). The latter was taken as the largest orthogonal distance between the BM and the upper margin of the OoC. Finally, we used a polygon to describe the tunnel of Corti (ToC; white).

Tunnel of Corti

 Within the sensory epithelium, the two rows of pillar cells form the tunnel of Corti (ToC). These cells are thought to form a semi-rigid structure, coupling BM vibrations to the hair cell stereocilia (Dallos 2003). We delineated the ToC in each cochlear cross-section with a polygon and used this to quantify the degree of compression (see Fig. 1d). After correction for compression artifacts, the ToC forms a triangle defined by four parameters: (1) its width at the base, (2) its height, (3) the angle at its apex, and (4) its skewness. The latter locates the apex relative to the base and varies between −1 (apex directly over the foot of the outer pillar cell) and +1 (apex directly over the foot of the inner pillar cell). A skewness of 0 indicates a symmetrical, equilateral triangle.

Analysis

As a result of post-mortem autolysis and subsequent tissue removal and preparation, artifacts can occur (e.g., Edge et al. 1998). One of their most prominent is a partial collapse of the tunnel of Corti (ToC), in which one or both pillar cells appear to buckle. To quantify this effect for each cross-sectional image, we used the polygon that describes the (buckled) ToC to calculate a “deformation index,” as illustrated in Fig. 1d. For this, the sides of the polygon were straightened without changing their lengths to form a triangle (blue). The deformation index is defined as the total area between the polygon and this triangle (red), normalized by the area of the triangle. The deformation index is 0 for an uncollapsed ToC and increases with the amount of buckling. The results could have been confounded by compression artifacts if they differed systematically with age and/or cochlear location. To address this possible confound, we analyzed the deformation index and found no effect of age (F(1,132) = 0.36, p = 0.71) or cochlear location (F(3,132) = 0.36, p = 0.55) and no significant interaction (F(3,132) = 1.28, p = 0.28). This suggests that compression effects, when present, did not exert a systematic bias on our results. We also recognize that fixed material is dehydrated and will provide artificially thinner measures of tissue height, for example (Edge et al. 1998); however, it is unlikely that this inevitable fixation artifact impacted the newborn more than the adult tissue.

The image analyses yielded 18 metrics for each cochlear location (see Table 1), although not all metrics could be determined for all temporal bones and/or locations. In most mid-modiolar sections, the scalae in the apical-most turn were in direct communication, due to the presence of the helicotrema within the cutting plane (e.g., Fig. 1a). In those regions, the scala perimeter was not measured. Also, when the OoC at a given cochlear location included many ruptured cells or a collapsed ToC, the area and height of the OoC were not measured. If the OHC region was damaged, an estimate of the angle between the RL and the BM was also omitted. In some cases, the BM boundaries were unclear, and measurements were not made. In general, the OoC was more disorganized in the apical turn; consequently, the number of observations decreases towards the apex.

Table 1 For each temporal bone, 18 morphological metrics were obtained. The columns labeled “age” and “cochlear location” give the F-statistic, within-factor, and error degrees of freedom (in parentheses) and the corresponding p values for the main effects from two-way ANOVAs. Significance (p < 0.05) is indicated by gray shading of the cells

The results detailed below describe the distributions and central tendencies for these 18 metrics and examine newborn-adult differences in cochlear morphology, as well as differences across cochlear location in both age groups. The effects of age and cochlear location on each metric were assessed using two-way ANOVAs (MATLAB). Although all data points are shown in the scatter plots, any location with fewer than eight observations in either age category was excluded from all statistical analyses and comparisons. Throughout the manuscript, we use p < 0.05 to indicate significance.

The two-way ANOVAs conducted on these individual metrics showed no significant interaction between age and cochlear location, except for BM thickness (pars tecta only). Hence, we consider the main effects of age and cochlear location on each metric independently. Because the primary aim of this study was a comparison between newborn and adult cochleae, we first present the main effects of age on each metric and the effects of cochlear location follow. Table 1 provides a summary of the statistical results.

Results

Age Effects

Cochlear Scalae

Figure 2A shows superimposed outlines of all adult and infant scalae, grouped by cochlear location. Qualitatively, newborn scalae appear slightly larger than those in adults. Scala areas calculated from these outlines are shown as scatterplots in Fig. 2B–D. A two-way ANOVA (age × cochlear location) found an age effect on the scala area for both scala vestibuli and scala tympani, with larger areas measured in newborns compared to adults; this is evident in the mean area values provided in Table 2 (rows 1 and 3) and the scatter plots of Fig. 2 (horizontal lines display mean values). The area of scala media was not significantly different between age groups at any cochlear location (also evident in Table 2, row 2).

Table 2 Mean values (top) and numbers of observations (below) for the 18 morphometric features in adult and newborn (newbn) specimens; SV scala vestibuli, SM scala media; ST scala tympani; OoC organ of Corti; RL reticular lamina; BM basilar membrane; OSL osseous spiral lamina; ToC tunnel of Corti

Spiral Lamina Widths

The spiral lamina is ossified near the modiolus, and the ossification extends progressively farther towards the BM when progressing down the spiral from apex to base. The (distal) non-ossified portion of the lamina, nearest the BM, has been dubbed the “bridge,” and has shown to undergo sound-induced vibrations similar to that seen in the pars tecta of the BM (Raufer et al. 2019). Measurements of the bridge width (Fig. 3) reveal an age effect, as newborns had a wider bridge segment than adults by between 30 and 81 μm (~ 10–24 %) across cochlear location; no such age effect was noted for osseous spiral lamina width (see mean values in Table 2, row 5). Also, in both age groups, the width of the bridge covaries with that of the BM and therefore with cochlear location (see Fig. 4).

FIG. 3
figure 3

Widths of the three segments of the cochlear partition. Scatter plots for the widths of (A) the osseous spiral lamina, (B) the bridge, (C) the basilar membrane (pt + pp), and (D) the complete cochlear partition (OSL + bridge + BM) in newborns (cyan circles) and adults (red diamonds) grouped by cochlear location

FIG. 4
figure 4

Correlation between bridge width and basilar-membrane width. Scatter plot of bridge width versus basilar membrane (BM) width for newborns (cyan circles) and adults (red diamonds), with straight-line fits (WBM = A Wbridge + B) to the data. The best-fitting parameter values (with respective 95% confidence intervals in parentheses) are as follows: newborn A = 0.56 (0.47–0.64), B = 94 (64–125) μm (r2 = 0.75); adult A = 0.78 (0.63–0.94), B = 33 (− 14–80) μm (r2 = 0.73). The gray dashed line indicates equality

Basilar Membrane

Passive mechanical properties of the BM (e.g., its stiffness) play a role in shaping the tonotopic map along the cochlear spiral. We hypothesized that newborn-adult differences in DPOAE phase-gradient delays might reflect underlying differences in the mechanical properties of the BM. We measured the width and the thickness of both the pars pectinata (pp) and pars tecta (pt), the two adjoining segments of the BM. The endpoints of the basilar membrane used to measure width are somewhat arbitrary. Given the large total width of the BM, however, small uncertainties (on the order of a few pixels) in estimating these two points contribute only marginally. Therefore, the considerable spread in the measures of BM width likely reflects natural variation rather than measurement errors. Both pp and pt show age effects (Fig. 5A, B) and are wider in infants than in adults at each cochlear location (see Table 2, rows 6 and 8). Consequently, the total BM is also wider in newborns than adults with an overall mean difference of 17.8 μm (~ 6%) between the two age groups (Table 2, row 10).

FIG. 5
figure 5

Width and thickness of the basilar membrane in adults versus newborns. Scatter plots for the width (A, B) and thickness (C, D) of pars pectinata and pars tecta, respectively. Data from adults are shown as red diamonds and newborns as cyan circles, while mean values are indicated by horizontal bars. The data are grouped according to cochlear location as indicated along the abscissa in (D). Gray horizontal bars indicate that the data were excluded from statistical analysis due to insufficient observations (N < 8) at that cochlear location

The scatter plots in Fig. 5C and D show the thickness (or height) of each BM segment (pp and pt) as a function of cochlear location. Because the BM thickness spans only a few tens of pixels in the images we utilized, an error of even a few pixels can significantly affect the estimated value. Indeed, quantization of the thickness values is evident in Fig. 5C and D. However, such measurement errors cannot account for the systematic differences observed between age groups. Moreover, BM thickness varies considerably in the radial direction at all cochlear locations, and this inherent variation further masks potential measurement errors. Additionally, our adult measures of BM thickness values shown in Fig. 5 are quantitatively and qualitatively similar to those previously described (Bhatt et al. 2001).

We found the pars pectinata to be significantly thinner in newborns than in adults (see Table 2, row 7). For pars tecta thickness, there was a marginal but significant interaction between age and cochlear location (F(3,111) = 2.91, p = 0.04). With the exception of the 0.75-kHz cochlear location, which likely produced the significant interaction (see Fig. 5D), pars tecta is also thinner in newborns than in adults by an average of 0.17 μm (~ 15 %; Table 2, row 9).

Organ of Corti

Figure 6A shows stitched images of the OoC from the basal turn (cochlear location = 3.5 kHz) for an adult (left) and a term newborn (right). In the newborn cochlea, the region of the OoC occupied by Hensen cells is less pronounced than it is in the adult cochlea, giving the neonatal OoC a flatter appearance. This difference exists throughout most of the cochlea as noted by the superimposed outlines shown in Fig. 3B. Figure 6C–E confirm this observation, showing scatter plots for the three metrics used to describe OoC shape. An ANOVA applied to each measure indicates differences between newborns and adults: newborns have a flatter OoC corresponding to a reduced maximum OoC height and a reduced total OoC area (Fig. 6C and D; Table 2, rows 12 and 13). The less pronounced Hensen cell area makes the RL run more parallel to the BM in newborns compared to adults, as quantified by the reduced angle between the two (Fig. 6E, Table 2, row 14).

Fig. 6
figure 6

Size and shape of the organ of Corti in adults versus newborns. (A) Composite images of the organ of Corti (OoC) for an adult (left) and a newborn (right) from the base of the cochlea (3.5 kHz). (B) Superimposed outlines of the OoC for adults (red) and newborns (cyan) for cross-sections at the five cochlear locations. Outlines are oriented with the modiolus to the left, aligned on their centroids, and rotated so that the basilar membrane runs horizontally. The yellow and magenta outlines are for the adult and newborn images in (A), respectively. The scale bar (100 μm) applies to all outlines. Scatter plots for (C) the maximum OoC height, (D) the area of the OoC, and (E) the angle between reticular lamina and basilar membrane. In the scatter plots, data are clustered by tonotopic location as indicated in abscissa (F). Adult data are shown as red diamonds and newborn data as cyan circles. Horizontal bars indicate mean values. Gray horizontal bars indicate that the cochlear location was excluded from statistical analysis due to insufficient observations (N < 8)

Tunnel of Corti

Although measured independently, the width of the ToC (Fig. 7A) matches the width of pars tecta, as expected (see Fig. 5B), and is wider in newborns than in adults as shown by the mean values in Table 2 (row 15). Because the newborn OoC is flatter than that of adults (see Fig. 6), it is not surprising that ToC height also shows a significant effect of age with a shorter ToC observed in the newborn ear by an average of 10 μm (~ 23%; see Table 2, row 16; also Fig. 7). There is an effect of age on the apical angle of the ToC as well (Fig. 7C); whereas the angle is acute (< 90°) in adults, the angle is obtuse in newborns, consistent with the flattened appearance of the infant OoC as evident from the mean values presented in Table 2, row 17. Finally, the skewness of the ToC is almost invariably positive, indicating that the tunnel of Corti “leans” towards the modiolus (Fig. 7D). Recall that skew values vary between −1 (apex directly over the foot of the outer pillar cell) and +1 (apex directly over the foot of the inner pillar cell) and a skewness of 0 indicates an equilateral triangle. The infant skew value is significantly closer to 0 than the adult value; hence, the newborn ToC is more symmetrical.

FIG. 7
figure 7

Shape parameters for the tunnel of Corti. Scatter plots for the four parameters that describe the tunnel of Corti as a triangle. (A) Width at base, (B) height, (C) angle at apex, and (D) skewness. The layout in each panel is the same with adults shown as red diamonds and newborns shown as cyan circles. Mean values are indicated by horizontal bars. For each of the metrics, the data at 0.15 kHz were excluded from the analysis because of insufficient observations (N < 8) for the adults. This is indicated by the gray horizontal bars

Effects of Cochlear Location

All but two of the morphological metrics vary with cochlear location (see Table 1). A well-established example is the width of the BM, which increases from base to apex in these adult and newborn cases (Fig. 3C) and in all mammals studied thus far (Guild 1927; Wever 1938; Cabezudo 1978). The increase in BM width along the cochlear spiral arises predominantly from the widening of pars pectinata, with only a small contribution from pars tecta (see Fig. 5). Similarly, previous studies have shown that BM thickness in the human cochlea decreases from base to apex (Bhatt et al. 2001), a result reproduced here for both pars tecta and pars pectinata (Fig. 5A, B).

The ossified and non-ossified portions of the human spiral lamina (Fig. 3A, B) also varied in width as a function of cochlear location: whereas the ossified portion decreases in width from base to apex, the non-ossified bridge segment increases. These opposing trends combine so that the total width of the cochlear partition (OSL + bridge + BM) remains constant along the cochlear spiral (Fig. 3D).

The effect of cochlear location on the scala area is easily observed for the scala tympani (Fig. 2D), which decreases in area from cochlear base to apex. For both scala vestibuli and scala media, dependence on cochlear location is less clear (Fig. 2B, C) and appears nonmonotonic. When going from base to apex, the areas alternately increase and decrease in size, resulting in a “zigzag” pattern visible in the scatter plots. Although less obvious, a similar zigzag pattern may also be present in the scatter plots of OoC height and area (Fig. 6C, D). As detailed in the “Discussion” section, this pattern reflects the difference in effective cutting angle for cochlear half-turns on opposite sides of the modiolus.

Summary of Results

Although compression artifacts appear in these temporal-bone preparations, their influence on the data appears uniformly distributed; hence, they do not bias the interpretation of age or cochlear-location effects. In the absence of significant interactions, the analysis reveals a main effect of age (i.e., significant differences between newborns and adults) for all but two metrics. In newborns compared to adults, the areas of scalae vestibuli and tympani are larger, the organ of Corti is smaller and flatter, the BM is wider and thinner, and the non-ossified bridge segment of the spiral lamina is also wider. Although the newborn cochlea appears fully formed with respect to the structures assessed here, most of these anatomical features continue to mature in a subtle but significant way. Our evidence for continuing maturation contrasts with the dogma that the human cochlea is entirely adult-like by the third trimester. All but two of the metrics also show a significant effect of cochlear location consistent with classic work on cochlear anatomy.

Discussion

In this study, we quantified morphological features of the human cochlear duct to assess potential differences between adults and newborns around the time of birth. Our motivating goal was to determine whether anatomical immaturities could help explain the functional immaturities seen in low-frequency DPOAE phase-gradient delays (Abdala et al. 2011b; Abdala and Dhar 2012). We derived 18 different metrics at several cochlear locations in histological preparations of adult and infant temporal bones and found significant effects of age and cochlear location on most of the metrics.

Nonmonotonic Dependence on Cochlear Location

An important factor influencing the metrics extracted here is the sectioning angle. The temporal bones in this study were cut in the horizontal plane, from dorsal to ventral, using extracochlear landmarks to fine-tune the tissue orientation, such that the final cutting angle through the inner ear is highly stereotyped. The axis around which the upper turns spiral is also in the horizontal plane, slightly tilted such that the apex is medial and inferior to the base (see Fig. 8). Furthermore, the spiral is stretched along its central axis so that the apex is several millimeters anterior to the base. Given this basic geometry, the horizontal plane tends to cut orthogonal to the cochlear half-turns on the medial side of the modiolus and at a more oblique angle on the lateral half-turns. We assume that this systematic alternation in the effective cutting angle on different half-turns along the duct causes the nonmonotonic dependence on cochlear location (“zigzag” pattern) observed in several of the scatter plots (Figs. 2B, C and 6C, D). Our observation that the angle between the spiral lamina and the modiolus exhibits a similar zigzag pattern provides additional support for the existence of a sectioning artifact (Fig. 9).

FIG. 8
figure 8

Temporal-bone model illustrates why sectioning angles differ for lateral vs medial half-turns along the cochlear spiral. The apposed red and white spiraling forms here are extracted from a 3-D model of the human temporal reconstructed from a serial-section set from the MEE collection, prepared identically to the ones studied here (Wang et al. 2006). The red and white surfaces correspond to the scala media and basilar membrane respectively. The frequency correlates of the mid-modiolar sections through several of the half-turns are shown

FIG. 9
figure 9

Angle between spiral lamina and modiolar axis. Scatter plots of the angle between spiral lamina and the modiolar axis for both adults (red diamonds) and newborns (cyan circles). The angles were measured on the side of scala tympani; values larger than 90° indicate that the spiral lamina “points” towards the apex as viewed from the modiolus. Observations from the same temporal bone are connected by lines. For visual clarity, each age group has an arbitrary vertical offset; the horizontal gray dashed lines indicate 90°; the scale bar corresponds to 20°

Morphology of the Newborn Cochlea

Age differences in the shape and size of the organ of Corti are among the most robust effects observed: overall, the newborn OoC is flatter and smaller. This age difference arises because the Hensen-cell region appears to be less highly developed in newborns. Since there is no evidence for postnatal cell proliferation in the sensory epithelium of the mammalian cochlea (e.g., Roberson and Rubel 1994), the eventual maturation of the Hensen-cell region presumably occurs due to an increase in cell size rather than an increase in cell number. However, this speculation cannot be tested with the images and metrics in the present study.

All but two of the metrics reveal an effect of age, indicating that newborn-adult differences extend well beyond the smaller and flatter shape of the neonatal organ of Corti. Although the human cochlea appears structurally complete at birth, and the length of the spiral has reached adult dimensions, many morphological features have yet to attain full maturity, and an ongoing refinement occurs postnatally. Although the residual immaturities cannot always be discerned from measurements at a given cochlear location, a clear pattern emerges from data considered across the cochlear spiral. For example, the total BM (pars tecta + pars pectinata) averages 17.8 μm wider (~ 6 %) in newborns than in adults (Fig. 3C). This age difference is small compared to the overall width of the BM (range, 175–600 μm) and is less than the variation in BM width at each cochlear location (mean SD = 28.5 μm), making it hard to detect without considering multiple cochlear locations.

Some of the age differences are counter-intuitive. For example, the areas of two of the three scalae are, on average, larger in infants. The origin of this result is unclear; however, developmental changes in the bony canals themselves cannot be ruled out. Bone density and growth are ongoing in the newborn otic capsule beyond the perinatal period (Anson and Bast 1958).

The non-ossified bridge segment of the spiral lamina, and the BM, increases in width from base to apex in both adults and infants, presumably contributing to the progressive decrease in stiffness and resonant frequency from base to apex. However, both the bridge and the BM are wider in newborns throughout the length of the cochlear spiral, resulting in a wider cochlear partition overall. This is consistent with the larger scala area in newborns since the cochlear partition defines a key scalar boundary (Fig. 1a).

Collectively, prior work suggests that the features supporting micromechanical function of the human cochlea are mature by term birth. Yet, no previous study has quantified the architecture of the human cochlea in newborns. Here, we show that, on a scale larger than individual cells and/or organelles, several structural properties are immature in newborns and undergo change after birth. These material and physical features contribute to the mass and stiffness gradients along the length of the cochlear spiral and help establish the frequency-place map. The anatomical immaturities reported here are likely to impact function.

Functional Consequences of Anatomical Immaturities

Many of the anatomical immaturities evident in the newborn cochlea presumably have functional consequences, some of which can be tentatively inferred from conventional conceptions of cochlear macromechanics. For example, both the BM and the adjacent spiral laminar bridge are wider in newborns than in adults, and the BM is thinner. These differences appear throughout the cochlea and suggest that the newborn partition is more compliant. Because transverse waves generally propagate with a speed inversely proportional to the square root of the effective compliance, larger compliance implies slower wave speeds, longer travel times, and greater delays. The observed anatomical differences therefore appear consistent with the longer low-frequency DPOAE phase-gradient delays previously measured in newborns.

Approximating the BM as a uniform rectangular beam (e.g., Olson and Mountain 1991) allows one to estimate the expected ratio of neonatal to adult delays more quantitatively. Taking the delay, τ, proportional to the square root of the volume compliance of the beam yields the relation τneoadult ≈ [(wneo/wadult)5(dadult/dneo)3]1/2, where w and d are the BM total width and mean thickness, respectively. All other relevant parameters, including the boundary conditions at the ends of the beam, are assumed unchanged between newborns and adults, and they therefore effectively cancel out. Using the anatomical measurements at the 0.75-kHz location (Table 2, rows 7, 9, and 10) gives the estimate τneoadult ≈ 1.8 ± 0.6 (SEM) for comparison with the ratio 1.5 ± 0.1 found for the distortion component of DPOAE delays at the same frequency (Abdala and Dhar 2012). Although surprisingly good, the agreement should be interpreted with caution given the many simplifying assumptions involved in the calculation, including the omission of possible dynamical effects of the “bridge” (Raufer et al. 2019).

Because low-frequency DPOAE delays in newborns are significantly longer than those in adults, while high-frequency DPOAE delays are similar, we initially hypothesized an immaturity confined to the apical half of the newborn cochlea. We were therefore surprised to find anatomical immaturities at both basal and apical cochlear locations studied here. Note, however, that unlike low-frequency DPOAE phase-gradient delays, which may depend at least in part on cochlear travel times, high-frequency phase-gradient delays measured using the fixed-f2/f1 DPOAE paradigm are always close to 0 as a result of the local scaling manifest by the basal half of the cochlea (Shera and Guinan 1999). Thus, even if newborn cochlear delays are indeed greater throughout the cochlea, as suggested by their immature anatomy, they may only affect fixed-ratio DPOAE delay measurements at low frequencies. Unfortunately, attempts to work around this limitation by using other types of OAEs (i.e., reflection-source OAEs) to probe neonatal cochlear delays at high frequencies are confounded by the level dependence of these emissions and the immaturities of the neonatal middle ear.

In addition to its effect on traveling-wave delays, a larger compliance of the neonatal cochlear partition would tend to shift the newborn tonotopic map towards lower frequencies compared to the adult. Although the smaller size (and thus mass) of the newborn organ of Corti might naively be expected to offset the greater compliance, perhaps acting to increase local resonant frequencies and shift the map in the opposite direction, any reduction in the volume of vibrating tissue would be countered by a compensating volume of vibrating fluid of almost equal density. Interestingly, any basal-ward shift in the tonotopic map that occurs during maturation due to increasing stiffness may subsequently be reversed during aging: the group delays of stimulus-frequency OAEs are longer in elderly adults, consistent with an age-related decrease in partition stiffness (Abdala et al. 2018).

Despite the macromechanical speculations offered above, the functional consequences of most of the observed anatomical immaturities are difficult to assess without extensive computational modeling in which details of cochlear micromechanics and related material properties are well represented. Good examples include the noted differences in the shape and size of the organ of Corti and the geometry of the tunnel. The measurements reported here, including their variation along the cochlear spiral, provide the necessary empirical foundation for future modeling studies, whether they aim to probe the maturation of cochlear function or understand the mechanical responses of the adult human cochlea.