Introduction

Brain development throughout childhood and adolescence is a dynamic process, with maturation of the white matter characterised by ongoing axonal development (LaMantia and Rakic 1990) and myelination (Yakovlev and Lecours 1967). These two integral components of the white matter, the axon and myelin sheath, contribute to the microstructural organisation of a white matter fibre bundle and should be assessed separately (Paus 2010). However, it has not been possible to assess, in vivo, the relative contribution of these features until the recent introduction of sophisticated diffusion-weighted imaging (DWI) acquisition and modelling techniques.

The corpus callosum is an integral part of the central nervous system, providing a physical pathway for information transfer between the two hemispheres of the brain. The corpus callosum is a highly organised bundle of fibres that converges at the midline, and projects out to the cortical surfaces. Axon count in the corpus callosum stabilises in the early post-natal period (LaMantia and Rakic 1990), quickly reaching adult levels and serving as a structural scaffold for further neuronal maturation. From this point, evidence suggests that continuing axon diameter growth and myelination is synchronised (Sherman and Brophy 2005; Nave 2010). A recent lifespan assessment of the g-ratio (the ratio of the inner axonal diameter to the total outer diameter) in the corpus callosum revealed a stable trajectory with age, further evidencing this coupled growth (Berman et al. 2017).

Fibre composition varies across the corpus callosum, with differing proportions of small and large diameter fibres across callosal subregions (Aboitiz et al. 1992). The posterior (splenium) and anterior (genu) segments are composed of a high density of small diameter fibres, whereas the body has a higher proportion of large diameter fibres. Differences in myelin composition are also apparent across the corpus callosum, as magnetic resonance imaging (MRI) derived estimates of myelin volume fraction are highest in the genu and lowest in the body (Bjornholm et al. 2017; Stikov et al. 2015).

In vivo estimates of white matter organisation are most commonly derived from diffusion tensor imaging (DTI). Most commonly, fractional anisotropy (FA) is used to estimate the degree of anisotropic water diffusion along a fibre bundle, where high FA can signify greater organisation of white matter fibre tracts. In studies of development, DTI applications have revealed that the white matter becomes more highly organised with age (Lebel and Beaulieu 2011), and that age-related development of the corpus callosum follows a known developmental course (Lebel et al. 2008, 2012). Whilst useful as a sensitive marker of microstructural alterations, DTI metrics are not specific enough to disentangle the physical properties of a fibre bundle. Crossing fibres, glial abundance/loss, partial voluming, and orientation dispersion can conflate the inference of axon and myelin ‘organisation’ (Jones et al. 2013; Beaulieu 2002).

In addition to expected age-related development, puberty is a particularly dynamic period that can influence white matter developmental trajectories during childhood and adolescent brain development (Herting et al. 2017; Byrne et al. 2017). However, current efforts to understand white matter microstructure during puberty have been limited to DTI studies, with the exception of our work (Genc et al. 2017b). Pubertal stage can partially explain sex differences and age-related development, therefore, investigating the influence of puberty on white matter microstructure with advanced analysis techniques is particularly pertinent.

Recent advancements in modelling and acquisition methods offer to disentangle multiple fibre properties using clinically feasible acquisitions. Neurite orientation dispersion and density imaging (NODDI) (Zhang et al. 2012) is a compartment modelling technique that estimates neurite density as the intra-cellular volume fraction (vic), as well as the bending and fanning of axons as the orientation dispersion index (ODI). Apparent fibre density (AFD) is an alternative metric that quantifies the total intra-axonal volume fraction, to estimate axon density per unit of tissue (Raffelt et al. 2012, 2017). Higher AFD in a given fibre bundle can reflect larger axon diameter, or higher local axon count. Using such metrics instead of DTI-based measures allow a more specific interrogation of axonal properties, which are useful as axonal conduction velocity is determined by axon diameter, whereby larger axons conduct faster (Horowitz et al. 2015). Estimating axon diameter, however, using current clinical MRI systems is not feasible for small axons, which make up a large proportion of the brain (Nilsson et al. 2017). Therefore, the quantification of white matter fibre density estimates are warranted.

We focus our efforts on examining corpus callosum development using several commonly derived and recently introduced diffusion MRI metrics. We first provide a detailed examination of how several diffusion metrics vary across the length of the corpus callosum, with comparison to previously reported profiles in adults. Subsequently, we examine how these relationships vary as a function of age, sex and pubertal stage during development. We utilise two independent developmental samples; (1) a developmental population (aged 4.1–18.9 years), and (2) an age-matched group of children at the cusp of pubertal onset (aged 9.6–11.9 years).

Methods

Cohorts

Two independent, typically developing cohorts of children and adolescents were used for the current study:

The Cincinnati MR Imaging of NeuroDevelopment (CMIND) study comprised a sample of children from early infancy to late adolescence, based in Cincinnati, USA. Data acquired at a single site, the Cincinnati Children’s Hospital Medical Center (CCHMC), were accessed from their online data repository (Holland SK 2015). Full description of the recruitment process and MRI scanning protocol is detailed online (https://cmind.research.cchmc.org/). Briefly, children were excluded from the study if they had a history of personal mental illness or head trauma. Only participants between the ages of 4–19 with DWI data were included for analysis in this study (N = 85, M 10.80 years, 45 female).

The Neuroimaging of the Children’s Attention Project (NICAP) study comprised a community sample of children aged 9–12 years, recruited from 43 socio-economically diverse primary schools distributed across the Melbourne metropolitan area, Victoria, Australia. A detailed protocol has been described previously (Silk et al. 2016). Briefly, children were excluded from the study if they had a neurological disorder, or serious medical condition (e.g. diabetes, kidney disease). Additionally, children underwent comprehensive assessment for attention-deficit/hyperactivity disorder (ADHD) via parent interview (Sciberras et al. 2013), and children were excluded from the current study if they received a positive diagnosis of ADHD.

This narrow age band of children included typically developing pre-pubertal (N = 44, M 10.41 years, 13 female) and pubertal children (N = 30, M 10.45 years, 18 female), as defined by the Pubertal Development Scale (PDS). For the current study, the primary caregiver was asked to rate their child’s physical development on a four-point scale. This included questions assessing the presence of characteristics phenotypical of pubertal onset such as deepening of voice and presence of facial hair in boys, and breast development and menarche for females. Pubertal groups were classified by first calculating a PDS-Shirtcliff (PDSS) score (Shirtcliff et al. 2009), followed by dividing the sample into two groups: pre-pubertal (children with PDSS = 1) and pubertal (children with PDSS ≥ 1.5). Only participants that had both DWI data and PDS data were included in the current study (N = 74, M 10.40 years, 31 female). This study was approved by The Royal Children’s Hospital Melbourne Human Research Ethics Committee (HREC #34071).

Image acquisition

CMIND children underwent MRI at 3.0 T on a Phillips Achieva TX. In brief, diffusion-weighted images were obtained with a spatial resolution of 2.0 × 2.0 × 2.0 mm, acquisition matrix = 112 × 109, bandwidth = 1753 Hz, 60 slices, along 61 directions. Two DWI shells were acquired, one b = 1000 s/mm2 shell [relaxation time (TR) = 6614 ms, echo-time (TE) = 81 ms] and one b = 3000 s/mm2 shell (TR = 8112 ms, TE = 104 ms). The total acquisition time for both shells was 21 m 27 s.

NICAP children underwent MRI at 3.0 T on a Siemens Tim Trio at The Melbourne Children’s Campus, Melbourne, Australia. Briefly, three diffusion-weighted shells were acquired: (1) b = 2800 s/mm2, 60 directions; (2) b = 2000 s/mm2, 45 directions; and (3) b = 1000 s/mm2, 25 directions. All shells had a voxel size of 2.4 × 2.4 × 2.4 mm, acquisition matrix = 110 × 100, bandwidth = 1758 Hz, TR = 3200 ms, TE = 110 ms. A multi-band acceleration factor of three significantly reduced the acquisition time (9 m 23 s). An additional reverse phase-encoded image was collected to assist with susceptibility induced distortion correction (total acquisition time = 10 m 33 s).

All images were visually inspected for motion artefact (assessed by the presence of venetian blinding artefact), and whole datasets were excluded if excessive motion was present. In addition, we calculated mean frame-wise displacement using the FSL software library (Smith et al. 2004). There was no effect of this motion parameter on our main findings (see Supplementary Information).

Data pre-processing and analysis

For CMIND, a total of 72 participants had sufficient data for DTI and NODDI analyses (M 10.42, SD 3.99, 36 female). Images were pre-processed and a tensor-based population-specific template (Zhang et al. 2006) was generated as previously described in Genc et al. (2017a). Subsequently, fractional anisotropy (FA) and mean diffusivity (MD) maps were generated in template space from the b = 1000 s/mm2 data using TVtool. NODDI maps (vic and ODI) were generated from the combined low and high b-value data (b = 1000, 3000 s/mm2) using the NODDI toolbox (Zhang et al. 2012), and then transformed to template space using DTI-TK (Zhang et al. 2006). For the AFD analysis, a total of 85 participants with high b-value data (b = 3000 s/mm2) were included for analysis in this study (M 10.80, SD 3.90, 45 female). Data were processed using MRtrix3 (https://github.com/mrtrix3), using a recommended fixel-based analysis pipeline (Raffelt et al. 2017). Data were denoised, corrected for distortions due to eddy currents and bias fields, normalised for variations in global intensity, and upsampled by a factor of 2. A fibre orientation distribution (FOD) map was generated for each participant, and subsequently a study-specific FOD template was generated using one male and one female for each age between 5 and 18 years (total of 28 participants). Each participant’s FOD map was transformed to template space, and the first volume of the FOD map was extracted to estimate apparent fibre density (AFD) in the corpus callosum.

For NICAP, a total of 74 participants had sufficient DWI data for further analyses (M 10.40 years, 31 female). A population-specific template was generated using the low b-value data (as above). FA and MD maps were generated from the b = 1000 s/mm2 data using TVtool. Using the combined b = 1000 and b = 2000 s/mm2 data, vic and ODI maps were generated using the NODDI toolbox and transformed to template space using DTI-TK. Following visual inspection, a total of 68 vic maps and 71 ODI maps were included for further analysis. Processing of the higher b-value data (b = 2800 s/mm2) for AFD estimation was performed on 74 children as described previously in (Genc et al. 2017b) and as above.

In summary, data used for the CMIND cohort comprised: FA, MD, vic and ODI data (N = 72, M 10.42 years, 36 female), and AFD data (N = 85, M 10.80, 45 female). Data used for the NICAP cohort comprised: FA, MD, and AFD data (N = 74, M 10.40 years, 31 female), vic data (N = 68, M 10.40 years, 30 female) and ODI data (N = 71, M 10.42 years, 31 female).

Corpus callosum parcellation

The corpus callosum segmentation was performed according to Aboitiz et al. (1992), whereby the mid-sagittal slice of the corpus callosum was divided along its length to generate ten segments: genu (G1, G2, G3), body (B1, B2, B3), isthmus (ISTH) and splenium (S1, S2, S3). This is represented as a colour-coded map in Fig. 1, and tract projections were also generated purely for visualisation (Fig. 3b). Regions of interest (ROIs) were drawn on the population-based tensor template for DTI and NODDI quantification, and on the FOD template for AFD quantification, for each respective cohort using mrview. A mean value per segment for each diffusion metric was calculated for statistical analysis.

Fig. 1
figure 1

Corpus callosum parcellation on mid-sagittal slice of FOD template, alongside heat-maps of diffusion metric variability across the corpus callosum. Brighter colours correspond to higher values

Statistical analyses

Corpus callosum profiles were examined using profile analysis via multivariate analysis of covariance (MANCOVA) (SPSS, version 24). Corpus callosum segment was entered as the within-subjects factor for each analysis. Age and sex were included as between-subjects covariates. Main effects and interactions are reported for CMIND and NICAP cohorts.

Statistically significant interactions involving segment were further investigated using univariate linear models separately for each segment (R, version 3.4.1). For age relationships, in addition to first order linear models, second and third order models were also computed and Akaike information criterion (AIC) values were reported. The coefficient of determination (adjusted R2) was computed to understand the proportion of variance explained by the model. Partial eta-squared (\(\eta _{{\text{p}}}^{2}\)) is reported as an estimate of effect size.

Multivariate profile analysis was also performed for the pubertal group comparisons in the NICAP cohort, as information on pubertal status derived from the PDS was available for this cohort. Sex and pubertal group were included as between-subjects factors. Follow-up univariate linear models were computed for each segment, to examine the relationship between pubertal group and AFD. In addition, we performed an exploratory group comparison to assess step-wise changes in AFD after generating a new variable, pubertal-type, which consisted: pre-pubertal male; pubertal male, pre-pubertal female, and pubertal female, as determined by the PDS scores.

All graphical visualisations were carried out in R (version 3.4.1). Coloured diffusion metric profiles were plotted as 95% confidence intervals (CIs) and adjusted for repeated measures across the ten corpus callosum segments (Fig. 2). In addition, CMIND data was divided into three equally distributed age groups, purely for graphical visualisation of diffusion metric variation across distinct age bands. Subsequent analysis of age-related differences in diffusion metrics were performed using age as a continuous variable.

Fig. 2
figure 2

Profiles of diffusion metrics across the ten segments of corpus callosum in a NICAP, and b CMIND cohorts. CMIND groups are split into three age groups for visualisation, where the coloured ribbon is darker with older age group. Coloured ribbons represent 95% CIs

We made use of a strict statistical threshold following Benjamin et al. (2018), and defined a significance level of 0.005. Given the sample size, and multiple comparisons, this represents an even balance between type I error and discovery of effects.

Results

Corpus callosum profiles

For both CMIND and NICAP datasets, the variability of diffusion metrics across the corpus callosum are visualised as heat-maps (Fig. 1), and as profiles plotted across the ten segments of the corpus callosum (Fig. 2). For the CMIND cohort, age-related development of the profiles are visualised by grouping data into three age groups (4.1–8.1; 8.3–12.3; 12.3–18.8 years).

In the CMIND cohort, a high–low–high profile was observed for both FA and vic across the ten segments, with profiles generally translating upwards suggesting an increase in FA and vic with age. MD followed a low–high–low profile, peaking in the isthmus. ODI peaked at B1, and followed a downward pattern reaching a minima at the splenium. AFD showed most marked differences between age groups, with an anterior–posterior increase peaking at B3.

Multivariate profile analysis for diffusion metrics in the CMIND cohort confirmed the profiles listed above. These results are reported in Table S1. In general, all metrics had a statistically significant main effect for segment, indicating variability across the segments. Main effect of age was significant for MD, F(1,69) = 29.1, p < .001, \(\eta _{{\text{p}}}^{2}\) = 0.30, ODI, F(1,69) = 21.5, p < .001, \(\eta _{{\text{p}}}^{2}\) = 0.76, and AFD, F(1,82) = 154.14, p < .001, \(\eta _{{\text{p}}}^{2}\) = 0.65. The segment by age interaction effect was statistically significant for ODI, F(9,61) = 4.94, p < .001, \(\eta _{{\text{p}}}^{2}\) = 0.42, and AFD, F(9,74) = 21.55, p < .001, \(\eta _{{\text{p}}}^{2}\) = 0.72, indicating that the profiles were not parallel across different ages.

The diffusion metric profiles over the corpus callosum in NICAP were similar to that of CMIND. We observed that FA and vic both exhibited a high–low–high profile across the ten segments, with a minimum value in the isthmus and maximum in the splenium. MD exhibited a low–high–low profile, peaking in the isthmus. We observed a smoother profile of low–high–low ODI from the genu to the isthmus, before reaching a minimum at S2 and rising again at S3. AFD had a low–high–low profile from the genu to S2, peaking at B3 and rising again at S3. Multivariate profile analysis for diffusion metrics in the NICAP cohort are reported in Table S2. There was no evidence for a significant main effect of age, sex, or interactions between segment and age, and segment and sex, for any of the diffusion metrics.

Given that AFD had the greatest interaction with age in the CMIND cohort, F(9,74) = 21.55, p < .001, \(\eta _{{\text{p}}}^{2}\) = 0.72, we subjected each corpus callosum segment to further analysis (below).

Age relationships

Linear models were computed to further investigate the relationship between AFD and age in each corpus callosum segment separately. For each segment, AFD was regressed onto age (with sex included as a covariate). As shown in Table 1, AFD in the body of the corpus callosum had the strongest relationship with age (B2: \(\eta _{{\text{p}}}^{2}\) = 0.60 and B3: \(\eta _{{\text{p}}}^{2}\) = 0.72). In addition to first order (linear) effects, we also modelled second order (quadratic) and third order (cubic) effects. Overall the quadratic and cubic models did not result in a better fit as indicated by both AIC and adjusted R2 values (Table S5), therefore, only the linear relationships are reported and visualised in Fig. 3a.

Table 1 Relationship between AFD with age and sex, across the ten segments of the corpus callosum
Fig. 3
figure 3

Relationship between AFD and age across the corpus callosum. a Linear relationship between AFD and age, where adjusted R2 values are adjusted for sex, and shaded areas around the line of fit represent 95% CIs. b Anatomical position and tract projections shown for each corpus callosum segment

Sex differences

For the CMIND cohort, there was a statistically significant main effect of sex across the callosal segments, F(1,82) = 15.62, p < .001, \(\eta _{{\text{p}}}^{2}\) = 0.16, suggesting that females had higher AFD than males (Fig S1a). The segment by sex interaction, however, was not statistically significant for AFD, F(9,74) = 0.61, p = .78, \(\eta _{{\text{p}}}^{2}\) = 0.07.

Overall, age had a stronger effect on AFD across the ten segments (0.22 < \(\eta _{{\text{p}}}^{2}\) < 0.72), compared to the relatively weaker effect of sex (0.06 < \(\eta _{{\text{p}}}^{2}\) < 0.14). The strongest evidence for sex differences was observed in G3 (p < .001, \(\eta _{{\text{p}}}^{2}\) = 0.13), and S1 (p < .001, \(\eta _{{\text{p}}}^{2}\) = 0.14) (Table 1).

Influence of puberty

We performed an investigation into the influence of pubertal group on AFD across the corpus callosum in the NICAP cohort. Multivariate profile analysis (Table S3) revealed a significant segment by pubertal group interaction for AFD, F(9, 61) = 3.03, p = .005, \(\eta _{{\text{p}}}^{2}\) = 0.31. Evidence of pubertal group differences was only apparent in S2, F(1,70) = 10.3, p = .002, \(\eta _{{\text{p}}}^{2}\) = 0.13, and S3, F(1,70) = 8.98, p = .004, \(\eta _{{\text{p}}}^{2}\) = 0.11.

In addition, we performed an exploratory group comparison to assess differences in AFD across pre-pubertal and pubertal males and females (pubertal-type). There was evidence for a segment by pubertal-type interaction, F(27,192) = 1.91, p = .007, \(\eta _{{\text{p}}}^{2}\) = 0.21. The main effects of pubertal-type were statistically significant in the regions S2, F(3, 70) = 5.1, p = .003, \(\eta _{{\text{p}}}^{2}\) = 0.18, and S3, F(3, 70) = 6.45, p = .001, \(\eta _{{\text{p}}}^{2}\) = 0.22 (Fig S1b), suggesting a step-wise progression of AFD.

Discussion

The current study presents a comprehensive investigation of the influence of age, sex, and puberty on corpus callosum development using a number of DWI modelling and analysis techniques in two independent developmental samples.

Corpus callosum profiles

The profiles across the corpus callosum for FA and MD are consistent with previous reports in adults (Caminiti et al. 2013; Bjornholm et al. 2017). The profile of vic followed a very similar pattern to FA, although vic did not show an interaction with age as each segment increased similarly with age. Since the corpus callosum is a coherently organised bundle of white matter fibres, it is possible that vic does not substantially improve the characterisation of neurite properties compared with FA (Genc et al. 2017a).

Upon visual inspection of the FA and vic profiles alone might lead to the interpretation that the genu and splenium have “more organised” white matter than the body. However, axons do not always follow straight courses, they bend and fan, which influences the quantification of DTI metrics such as FA and MD (Nilsson et al. 2012; Ronen et al. 2014). High orientation dispersion in the corpus callosum results from fibres intertwining and diverging at the midline, as evidenced by a recent histological and DWI validation study (Mollink et al. 2017). Consistent with these observations, we observe high ODI and low FA in the body, whereas regions with low ODI have high FA. These observations hand-in-hand might suggest that FA and vic are most sensitive to myelin architecture in single-fibre regions, as corroborated by a recent combined histological and MRI validation study in rodents (Chang et al. 2017). Therefore, caution needs to be exercised when interpreting any differences in FA as related to “organisation”, since these DTI metrics do not give us an indication of organisation when the fibres are bending and fanning.

Lastly, we observe that AFD follows this low–high–low pattern in the NICAP data, and to a lesser extent in the CMIND data. Compared with the NICAP profile (Fig. 4a), the age-matched CMIND group (Fig. 4b) has a local minimum in the genu, increase towards the body, and a slight decrease in S2 before peaking at S3. These slight profile differences might be explained by the absence of susceptibility distortion correction in CMIND. We additionally note a remarkable similarity between the NICAP AFD profile and an ex-vivo human study (Aboitiz et al. 1992) of fibre density for axons > 3 µm (Fig. 4c). This similarity might be explained by regional differences in membrane permeability, whereby larger axon diameters can lead to alterations in the radial DWI signal with subsequent regional differences in AFD (Raffelt et al. 2012). Myelin water fraction is highest in the body of the corpus callosum, likely due to the larger fibre diameters seen in this region (Liu et al. 2010; Bjornholm et al. 2017). Therefore, the bigger, more myelinated axons in the body may influence the AFD measurement in this region.

Fig. 4
figure 4

c Reprinted from Aboitiz et al. (1992) with permission from Elsevier

Profiles of fibre density across the corpus callosum. AFD profiles in a NICAP dataset; and b CMIND dataset in age-matched group. c Aboitiz et al. (1992) light microscopy fibre density estimates for axons > 3 µm.

In general, we observed a good agreement across all DTI, NODDI and AFD callosal profiles between the two cohorts, with any slight profile differences likely explained by differences in scanner acquisitions. Even in a highly organised bundle such as the corpus callosum, orientation dispersion and T2 relaxation can influence quantitative metrics, such that regions cannot be compared with one another in terms of their “organisation”. Therefore, we have chosen to focus our interpretation of these diffusion metrics in the context of their relationship with age and sex, rather than absolute comparisons between regions across the profile.

Age-related development

Age-related development of white matter has been established in terms of volumetric (Mills et al. 2016; Lenroot and Giedd 2006), DTI (Lebel and Beaulieu 2011; Lebel et al. 2008) and NODDI (Genc et al. 2017a; Mah et al. 2017; Chang et al. 2015) studies. In general, white matter volume and microstructural organisation increases with age in the majority of white matter tracts. Our results in the CMIND cohort are in line with these previous findings, as we report significant correlations between age and each of our diffusion metrics. AFD, however, exhibited the strongest relationship with age, relative to DTI and NODDI metrics, across the corpus callosum. Although we found no main effect of age across the diffusion metrics in the NICAP cohort, this was most likely due to the relatively narrow age range (9.6–11.9 years) in this cohort.

Upon further investigation within each callosal segment, we observed a high correlation between AFD and age in all segments with the strongest relationship with age in the posterior midbody. The posterior midbody is known to have the highest density of large fibres, evidenced by ex vivo adult data (Aboitiz et al. 1992). Given that larger axons conduct action potentials faster (Horowitz et al. 2015), this strong relationship with age might reflect the need for faster interhemispheric transfer times to enable rapid communication between developing cortices of the brain during this critical period. Higher AFD with older developmental age might, therefore, reflect bigger axon diameter (Raffelt et al. 2017), however, this would need to be verified by imaging developmental populations using a high gradient system (McNab et al. 2013) or an advanced acquisition protocol (Alexander et al. 2010) to estimate axon diameter.

Sex differences and pubertal processes

We observe higher AFD in females compared with males across the corpus callosum, most considerably in the anterior splenium and posterior genu. This is consistent with adult ex vivo data, whereby neurotypical females have greater fibre density in the genu and body, compared with males (Highley et al. 1999). This suggests that the observed differences in density (AFD) might be due to females having either: (1) more axons to begin with; or (2) larger diameter axons. Previous work has shown sex differences in the growth of axon and myelin in adolescents (Perrin et al. 2009), a developmental change that could be described by males having a higher proportion of large diameter fibres (Paus and Toro 2009). Therefore, it is possible that females have a higher proportion of small diameter axons.

Sex differences in brain development can be attributed to endocrine processes associated with puberty. We found that pubertal children have greater AFD in the posterior splenium compared with pre-pubertal children, consistent with our previous whole-brain analysis (Genc et al. 2017b). This relationship was not present with respect to NODDI or DTI metrics, which could be explained by the specificity of AFD to infer intra-axonal properties more closely related to pubertal processes. Comprehensive analysis of longitudinal development of white matter microstructure is warranted with larger sample sizes, to confirm whether these pubertal relationships exist across other microstructural metrics.

The pubertal period is accompanied by regulation and remodelling of white matter (Herting et al. 2017) due to rising adrenal (Maninger et al. 2009) and gonadal (Pangelinan et al. 2016) hormones [see Juraska and Willing (2017) for a recent review]. We additionally demonstrated a potential step-wise relationship between pubertal group and sex, with pre-pubertal males having the lowest AFD in the splenium, and pubertal females having the highest AFD in the splenium. This additional finding suggests that females are ahead in their axonal development, likely due to the rise in adrenal and gonadal hormones 1 year earlier than males (Grumbach and Styne 1998). Pubertal onset determined by the PDS is thought to be accompanied by increasing axon diameter (Genc et al. 2017b), likely driven by surges in testosterone during this critical period of neurodevelopment (Perrin et al. 2008; Pesaresi et al. 2015). Previous studies using the PDS as a marker of physical maturation support these complementary findings. Longitudinal change in PDS score predicts unique patterns of FA and MD change (Herting et al. 2017), and mid-pubertal stage is associated with greater volume of the callosal mid-body (Chavarria et al. 2014).

The organisational–activational hypothesis suggests that the brain reorganises with not only peripubertal, but perinatal exposure to testosterone (Schulz et al. 2009). In the context of our developmental data, rather than a divergence between female and male axonal development, we present evidence that they develop at the same rate. The finding of higher AFD in females over development is likely due to earlier exposure to testosterone, both perinatally and peripubertally. We do note, however that our reported sex differences are mostly driven by age-related development, so it is difficult to infer sex interactions with smaller relative effect sizes. Further work is required to investigate the longitudinal development of axon and myelin properties, particularly as clinical and typical populations may have different rates of development of axon or myelin.

Practical considerations and optimising study design

Deciding on whether to investigate myelin and/or axon depends on the question at hand. It may make more sense to quantify myelin in the case of a demyelinating disease, or axon in the case of traumatic brain injury recovery. As Jones et al. (2013) has argued, DTI is a sensitive but not specific marker of tissue microstructure. Problems only arise when making any claims about the tissue properties using DTI alone, therefore, more specific markers of tissue microstructure should be included to further inform interpretations.

Practical considerations of modelling diffusion metrics depend on the acquisition parameters of the data collected, if a retrospective analysis is planned. Estimating intra-cellular volume fraction (vic) obtained from NODDI requires multiple DWI shells for the compartment modelling, however, ODI can be estimated using one DWI shell (Zhang et al. 2012). AFD estimation is recommended using a higher b-value to optimise signal attenuation of the extra-axonal space (e.g. b = 3000 s/mm2). In areas of multiple fibre orientations (basically all areas outside of the corpus callosum) the fixel-based analysis framework should be implemented to perform fibre-specific statistical comparisons (Raffelt et al. 2015, 2017). We stress that investigations should be tailored based on the acquisition parameters of collected data—apart from the case of a prospective design whereby acquisition of multiple DWI shells can be planned in advance to quantify these parameters.

Limitations and future directions

A strength of the current study is the assessment of diffusion metrics in two independent developmental populations: a Melbourne, Australia population (NICAP dataset); and a Cincinnati, USA population (CMIND dataset). Analysing data collected from independent sites will inherently lead to mismatches in image acquisition parameters, which can affect processing and analysis. Despite differences in acquisition parameters (b-value and number of directions) and image pre-processing methods (no susceptibility distortion correction in CMIND), we observe good agreement between DTI and NODDI metrics (Fig. 2, Table S4).

The proximity of the ventricles to the inferior corpus callosum may induce partial-volume effects in DTI metrics. We have attempted to control for this confound by manually drawing the corpus callosum masks (on a high resolution template) and avoiding voxels that may be contaminated with cerebrospinal fluid (CSF). Despite our efforts, non-callosal CSF may contribute to the estimation of DTI metrics in particular, and future studies may consider including fibre bundle thickness as a covariate in statistical analyses (Vos et al. 2011).

Whilst we have attempted to mitigate the impact of myelination on axon development using AFD as a marker of intra-axonal volume fraction, T2 relaxation from myelin water can affect AFD measurements. We were not able to estimate myelin maps as part of the current study, however, we believe this is vital for understanding how axon and myelin develop together. Therefore, future studies should attempt to quantify myelin to achieve a complementary analysis of regional growth of axon and myelin over development.

Conclusion

We present evidence of a strong relationship between apparent fibre density and age, over childhood and adolescent corpus callosum development, compared with other commonly derived diffusion metrics. This is likely driven by increasing axon diameter over development. We additionally observe that females have higher fibre density compared with males, which may be explained by differences in axon composition resulting from early surges in testosterone in both the perinatal and peripubertal period.