Abstract
Background
Diffusion tensor parameters can be analysed by fitting regions of interest (ROIs) to selected brain structures. The clinical usefulness of these measurements is influenced by their reproducibility and validity.
Objective
To investigate the reproducibility of fractional anisotropy (FA) and mean diffusivity (MD) measurements.
Material and methods
Seventy-six infants were imaged once at term-equivalent age. We measured several brain regions. Reproducibility was assessed using intraclass correlation coefficient and Bland-Altman method.
Results
Intra-observer reproducibility was excellent for FA in the calcarine cortex (right) and frontal white matter (left), and for MD in the corpus callosum (anterior), internal capsule, corona radiata, putamen, frontal white matter, optic radiation (left), thalamus (right) and calcarine cortex (right). Inter-observer reproducibility was excellent for FA in the corpus callosum (posterior) and for MD in the internal capsule and corona radiata (right). Inter-observer reproducibility was poor for FA in frontal and posterior white matter (right) and for MD in the inferior colliculus (right). Reproducibility was fair to good in other areas. The Bland-Altman plots showed no considerable bias, and variance was independent of the mean value.
Conclusion
Reproducibility of ROI measurement was fair to good for both FA and MD.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
White matter pathways of the brain can be non-invasively visualized and quantitatively characterized using diffusion tensor imaging (DTI) [1]. Diffusion can be characterized using scalar quantities such as fractional anisotropy (FA), which tells the degree of diffusion anisotropy, and the mean diffusivity (MD), which is used to measure the orientationally averaged diffusivity [1, 2]. Maturation of brain white matter is characterized by increasing FA and decreasing MD [3, 4]. In previous studies, it has been reported that premature infants have lower anisotropy and higher MD in several brain areas compared with full-term infants at term-equivalent age [5–7]. Lower FA and higher MD compared with normal have been associated with neurological abnormalities and disabilities in later development [8, 9].
The typical way to analyse DTI images is to manually fit ROIs (regions of interest) to selected brain regions and measure parameter values [10–12]. The location of ROIs is usually based on anatomical information of the b0 images, the direction-coded anisotropy maps or conventional sequences obtained at the same session as the DTI imaging [4, 13–15]. The accuracy of the ROI location is based on anatomical knowledge of the researcher performing the measurements. In several studies, fixed-shape fixed-sized ROIs have been used. However, better intra-observer and inter-observer reproducibility of parameter value measurements has been reported using anatomically shaped ROIs [16]. The value of these measurements is highly influenced by the reproducibility and validity of the measurements. Another way to analyse DTI images is to use group-level methods such as TBSS (track-based spatial statistics) [17].
Reproducibility studies of DTI parameter value measurements was first done to examine the variability between FA and MD values when imaging was done with MRI scanners from different manufacturers, and significant variation of mean values was found [18]. Reproducibility has also been examined between repeat imaging sessions with the same equipment, e.g. in hippocampal areas, and excellent reproducibility was found [19]. However, parameter value reproducibility may have significant regional variability [20–22]. Most of the reproducibility studies of DTI parameter value measurements have been performed on DTI images of adult brains [19, 22]. However, reproducibility may vary in infants compared with adults as FA and MD measurements are more susceptible to partial volume artefacts in infants. This is because brain structures are smaller and still maturing. Reproducibility of FA and MD value measurement of the pyramidal tracts has been reported on term infants [23]. However, regional variability between different white matter tracts has not been reported in infants.
The aim of this study is to measure intra-observer and inter-observer reproducibility of ROI-based FA and MD measurements of the brain areas in prematurely born infants imaged at term. One DTI sequence was obtained and all measurements were performed on the same dataset. Intra-observer reproducibility was based on repeated ROI measurements performed by the same observer. Inter-observer reproducibility was investigated by comparing ROI measurements between two observers. Measurements were performed in several brain regions. In this study, we used known anatomy to draw ROIs.
Materials and methods
Subjects
This study is a part of the Development and Functioning in Very Low Birth Weight Infants from Infancy to School Age (PIPARI) study at Turku University Hospital. Inclusion criteria to the PIPARI study were birth at gestational age 31 weeks and 6 days or below, or birth weight < 1,500 g. Imaging was performed with a 1.5-T MRI scanner using a DTI sequence. Exclusion criteria were: death during neonatal period major congenital anomalies or recognized syndromes. The total number of infants included was 76. Gestational ages at birth ranged from 23 weeks and 3 days to 34 weeks and 3 days. MRI was performed once at term-equivalent age. The study protocol was approved by the Ethics Review Committee of the hospital. All families gave informed consent.
MRI
The MRI was performed with a 1.5-T MRI system (Gyroscan Intera CV Nova Dual; Philips Medical Systems, Best, The Netherlands) with a SENSE head coil. MRI was done during postprandial sleep without any pharmacological sedation. The infants were swaddled to calm them and to reduce movement artefacts. A pulse oximeter was routinely used during the scan. A physician attended the examination if necessary to monitor the infant. Ear protection was used (3M disposable ear plugs 1100; 3M, Brazil/Wurth hearing protector, no. 899 3000 232; Wurth, Böheimkirchen, Austria).
The sequence used for diffusion imaging was a single-shot, echo-planar imaging with SENSE. TR/TE, 2,264/68 ms. The axial slice thickness was 5 mm, with a gap between slices of 1 mm. A 200-mm-square field of view (FOV) was used. Imaging matrix was 112 × 89 and the reconstructed voxel size 0.78 mm × 0.78 mm. Number of signal averages was 2, SENSE reduction 2 and EPI factor 47. The b values used for diffusion weighting were 0, 600 and 1,200 s/mm2, with 15 directions. However, only images with b values 0 and 600 s/mm2 were used for the analysis. Fat suppression was done using spectral presaturation with inversion recovery (SPIR). In addition, the imaging protocol included conventional T1-weighted, T2-weighted and FLAIR seuences.
Processing and regions of interest
Image analysis was done using PRIDE V4 Fiber Tracking 4.1 beta 4 (Philips Medical Systems, Best, The Netherlands). ROIs were drawn based on anatomical structures on the directionally encoded anisotropy color maps. For anatomical reference, b0 images were used. ROI size and shape varied between different structures. The smallest ROIs were drawn to enclose the colliculus inferior and largest to enclose the calcarinus cortex or posterior white matter.
The ROIs were drawn to enclose the anterior and posterior part of corpus callosum, posterior part of capsula interna, corona radiata, putamen, thalamus, radiatio optica, colliculus inferior, cortex calcarinus and frontal and posterior white matter. Bilateral structures were measured individually. FA and MD were measured. ROIs were drawn twice by one observer (observer A, medical physicist supervised by neuroradiologist with 2 years’ experience in neuroradiology) to evaluate intra-observer reproducibility and once by a second observer (observer B, senior neuroradiologist with 13 years’ experience in neuroradiology) to evaluate inter-observer reproducibility. ROI placements are shown in Fig. 1. Observers scored the image quality independently. Only images that both observers agreed were of acceptable quality were used in this study. Images from seven infants were removed from this study due to movement artefact. Additionally, if image quality was insufficient in some slices of the whole DTI slice set, no measurements were made from those individual slices. Because of this, the number of ROIs varied among different structures.
Statistical analysis
Intra-observer and inter-observer reproducibility were assessed using the intra-class correlation coefficient (ICC) and Bland-Altman proposed limits of agreement [24]. We decided to use both methods since they can provide inconsistent results [25]. The ICC reflects relative homogeneity within the observers or measurements of one observer in relation to the total variation. ICCs were calculated between observers A and B and between first and second measurement of observer A using repeated measurements ANOVA. Reproducibility was classified as “excellent” when ICC is greater than 0.75, “fair to good” when ICC is more than 0.4 but less then 0.75 and “poor” when ICC is less than 0.4 [26]. Variation in ROI size was taken into account in the analyses and the association of ROI size and measurements were also assessed. In addition, the differences in ROI size between the observers were tested using repeated measurements ANOVA. Residuals were checked for justification of the analysis and logarithmic or power transformations of the variables were used in the analyses when appropriate. P < 0.05 was considered statistically significant. Statistical analyses were performed using SAS System for Windows, version 9.1.3. (SAS Institute, Cary, NC, USA).
Results
MRI was performed at term-equivalent age. Gestational ages at MRI ranged from 39.1 to 44.1 (mean ± SD, 40.1 ± 0.63) weeks.
The intra-observer reproducibility of FA measured with ICC was excellent bilaterally in the calcarine cortex and in the frontal white matter on the left side. In other structures, intra-observer reproducibility of FA was fair to good. The intra-observer reproducibility of MD was excellent in the anterior part of the corpus callosum and bilaterally in the posterior limb of the internal capsule, corona radiata, putamen and frontal white matter. In the optic radiation, the reproducibility was excellent on the left and in the thalamus and calcarine cortex on the right. In other structures, the intra-observer reproducibility of MD was fair to good. Results of the ICC measurements are summarized in Table 1. Measured values for FA and MD are shown in Figs. 2 and 3.
Inter-observer reproducibility of FA measured with ICC was excellent in the posterior part of the corpus callosum. In most of the structures, inter-observer reproducibility of FA was fair to good, but in the frontal and posterior white matter on the right, it was poor. The inter-observer reproducibility of MD was excellent in the posterior part of the corpus callosum, bilaterally in the posterior limb of the internal capsule and in the corona radiata on the right side. In most of the structures, inter-observer reproducibility of MD was fair to good. In the inferior colliculus on the right side the reproducibility was poor.
Bland-Altman analysis showed good results for measured structures based on visual reviewing of the Bland-Altman scatter plots. The scattering plots showed no considerable bias, and the variance was independent of the mean value. Limits of agreement varied between structures. The differences between measurements were smallest for FA in the calcarine cortex and for MD values in the internal capsule for both intra- and inter-observer comparisons. The largest differences between measurements were in the corpus callosum for FA for both intra-observer and inter-observer comparisons. For inter-observer comparisons, difference in MD values was largest in the posterior white matter and for intra-observer comparison in the posterior part of the corpus callosum. Scatter plots of the posterior limb of the internal capsule and the inferior colliculus are shown in Fig. 4. Limits of agreement for all measured structures are shown in Table 2.
ROI size varied significantly and systematically between observers. Observer A’s ROIs were always smaller than those of observer B’s. Observer A’s ROIs were smaller on the second measurement compared with the first for most of the structures. When intra-observer reproducibility was assessed using ICC, ROI size had a significant effect to FA values in the inferior colliculus and in the frontal white matter on the left side. For MD values, the effect was significant in the inferior colliculus bilaterally and in the posterior limb of the internal capsule on the left side. When inter-observer reproducibility was assessed using ICC, ROI size had a significant effect on FA in the anterior and posterior parts of the corpus callosum, the posterior limb of the internal capsule on the left side, the optic radiation on the right side and bilaterally in the inferior colliculus, putamen and thalamus. ROI size had a significant effect on MD values in the anterior and posterior parts of the corpus callosum, in the posterior limb of the internal capsule on the right side and bilaterally in the inferior colliculus.
Discussion
Our results suggest that FA and MD measurements of different brain areas are adequately reproducible at term-equivalent age when ROI fitting is done using prior anatomical knowledge and when variation in the ROI size is taken into account. Our results showed no systematic difference in reproducibility between white and grey matter areas or between the hemispheres. Both intra-observer and inter-observer reproducibility assessed with ICC and the Bland-Altman methods varied among structures and between FA and MD values. This reflects the facts that there are brain areas that can be anatomically delineated precisely, e.g. the internal capsule. In these areas, reliable measurements can be performed repeatedly. In frontal or posterior white matter, the most representative area for ROI measurements is more challenging to delineate repeatedly. Also, vicinity of ventricles can cause additional variation to measurements.
We found intra-observer reproducibility for MD generally better than for FA when assessed with ICC. Half of the reproducibility values for MD were classified as excellent and the rest as fair to good. Most values were classified as fair to good for FA. Different ROI size, shape and positioning relative to surrounding tissues may cause variation between measured FA and MD values. This may cause poorer reproducibility. The effect of ROI size is due in part to volume and spatial variation of parameter values within the brain. The effect of ROI size was quite systematic only in the inferior colliculus. One possible explanation is the small size of this structure.
Inter-observer reproducibility was similar for both parameters. In most structures reproducibility was fair to good when assessed with ICC. However, the inter-observer reproducibility was poor in the right frontal and posterior white matter for FA and in the right inferior colliculus for MD. The poor reproducibility values may be due to different ROI shapes or to positioning relative to white and grey matter, as the ROI size did not affect values of FA and MD in the frontal and posterior white matter. In addition to ROI positioning, poor reproducibility value of MD on the right side in the inferior colliculus may be because of the small size of the target and because of pulsatility. Pulsatile brain motion can artificially increase MD values or increase standard deviation in structures adjacent to ventricles and inferior to the corpus callosum [27–29]. This may cause spatial heterogeneity between voxels in these structure. However, it is unclear why the effect was not bilateral.
In most structures, both ICC and LA gave similar results for intra-observer and inter-observer reproducibility. Results were inconsistent in the thalamus, cortex calcarinus and frontal and posterior white matter for FA values. For MD values, results were inconsistent also in the corpus callosum and in the optic radiation.
The first studies to evaluate intra-observer and inter-observer reproducibility in adults revealed high reproducibility in the hippocampal area [19]. However, regional distribution of reported reproducibility values varies among studies. Compared with our results, higher intra-observer reproducibility of FA and MD values (CV ≤ 2.7% and ICC ≥ 0.96) and for inter-observer reproducibility (CV ≤ 2.7% and ICC ≥ 0.90) were reported in cerebral peduncle, anterior and posterior limb of internal capsule, genu of corpus callosum, superior corona radiate and cingulum [16]. However, a similar regional distribution to ours was observed in another study in corpus callosum, cortical spinal tract, internal capsules, basal ganglia and centrum semiovale [22]. Reproducibility varied from slight to substantial agreement [22]. Our results were similar to published reports in the internal capsule, but variation was larger in the corpus callosum [30]. In other structures, limits of agreement have not been published to our knowledge. Measured FA and MD values are similar to those reported in previous studies [15, 31, 32]. However, the deviation was greater. This is not unexpected as our patient population included infants with brain injury (including severe).
The main limitation of our study is image resolution, especially slice thickness. Decreasing slice thickness would benefit measurements in small structures in particular. Inter-slice gaps may have had a negative effect. Also, the ROI size may have caused variation between measured FA and MD values. This might have caused lower reproducibility. Another limitation is that we did not measure signal-to-noise. Decreasing signal-to-noise causes an upward bias in FA but no significant effect on MD [33]. In addition to pulsatile brain motion, this could cause spatial heterogeneity of parameters, especially FA, which might have led to greater variation with in measured values. Thus, different ROI positioning may have caused poorer reproducibility. It may be argued that stricter criteria for interpretation of correlations should have been used as the interpretation of ICC is not evidence-based. This fact and the possible effect of outliers might have led to overestimation of reproducibility.
Conclusion
Although parameter values are reported constantly for DTI studies, the reliability of these values needs to be evaluated. The reproducibility of anatomy-based ROI measurement of FA and MD was fair to good in this study. However, ranges for optimal ROI size in different brain regions and different patient groups might be useful for improving the intra-observer and inter-observer reproducibility. In future studies, the benefits and limitation of tractography-based ROI selection need to be investigated also in infants.
References
Pierpaoli C, Jezzard P, Basser PJ et al (1996) Diffusion tensor MR imaging of the human brain. Radiology 201:637–648
Pierpaoli C, Basser PJ (1996) Toward a quantitative assessment of diffusion anisotropy. Magn Reson Med 36:893–906
Berman JI, Mukherjee P, Partridge SC et al (2005) Quantitative diffusion tensor MRI fiber tractography of sensorimotor white matter development in premature infants. Neuroimage 27:862–871
Provenzale JM, Liang L, DeLong D et al (2007) Diffusion tensor imaging assessment of brain white matter maturation during the first postnatal year. AJR 189:476–486
Rose SE, Hatziaeorgiou X, Strudwick MW et al (2008) Altered white matter diffusion anisotropy in normal and preterm infants at term-equivalent age. Magn Reson Med 60:761–767
Huppi PS, Maier SE, Peled S et al (1998) Microstructural development of human newborn cerebral white matter assessed in vivo by diffusion tensor magnetic resonance imaging. Pediatr Res 44:584–590
Gimenez M, Miranda MJ, Born AP et al (2008) Accelerated cerebral white matter development in preterm infants: a voxel-based morphometry study with diffusion tensor MR imaging. Neuroimage 41:728–734
Krishnan ML, Dyet LE, Boardman JP et al (2007) Relationship between white matter apparent diffusion coefficients in preterm infants at term-equivalent age and developmental outcome at 2 years. Pediatrics 120:E604–E609
Arzoumanian Y, Mirmiran M, Barnes PD et al (2003) Diffusion tensor brain imaging findings at term-equivalent age may predict neurologic abnormalities in low birth weight preterm infants. AJNR 24:1646–1653
Bester M, Heesen C, Schippling S et al (2008) Early anisotropy changes in the corpus callosum of patients with optic neuritis. Neuroradiology 50:549–557
Lin Y, Wang J, Wu C et al (2008) Diffusion tensor imaging of the auditory pathway in sensorineural hearing loss: changes in radial diffusivity and diffusion anisotropy. J Magn Reson Imaging 28:598–603
Wu CM, Ng SH, Liu TC (2009) Diffusion tensor imaging of the subcortical auditory tract in subjects with long-term unilateral sensorineural hearing loss. Audiol Neurotol 14:248–253
Counsell SJ, Shen YJ, Boardman JP et al (2006) Axial and radial diffusivity in preterm infants who have diffuse white matter changes on magnetic resonance imaging at term-equivalent age. Pediatrics 117:376–386
Saksena S, Husain N, Malik GK et al (2008) Comparative evaluation of the cerebral and cerebellar white matter development in pediatric age group using quantitative diffusion tensor imaging. Cerebellum 7:392–400
Bartha AL, Yap KRL, Miller SP et al (2007) The normal neonatal brain: MR imaging, diffusion tensor imaging, and 3D MR spectroscopy in healthy term neonates. AJNR 28:1015–1021
Bonekamp D, Nagae LM, Degaonkar M et al (2007) Diffusion tensor imaging in children and adolescents: reproducibility, hemispheric, and age-related differences. Neuroimage 34:733–742
Smith SM, Jenkinson M, Johansen-Berg H et al (2006) Tract-based spatial statistics: voxelwise analysis of multi-subject diffusion data. Neuroimage 31:1487–1505
Pfefferbaum A, Adalsteinsson E, Sullivan EV (2003) Replicability of diffusion tensor imaging measurements of fractional anisotropy and trace in brain. J Magn Reson Imaging 18:427–433
Muller MJ, Mazanek M, Weibrich C et al (2006) Distribution characteristics, reproducibility, and precision of region of interest-based hippocampal diffusion tensor imaging measures. AJNR 27:440–446
Marenco S, Rawlings R, Rohde GK et al (2006) Regional distribution of measurement error in diffusion tensor imaging. Psychiatry Res 147:69–78
Bonekamp D, Nagae LM, Sullivan EV (2003) Diffusion tensor imaging in brain. Neuroimage 18:427–433
Ozturk A, Sasson AD, Farrell JAD (2008) Regional differences in diffusion tensor imaging measurements: assessment of intrarater and interrater variability. AJNR 29:1124–1127
Partridge SC, Mukherjee P, Berman JI et al (2005) Tractography-based quantitation of diffusion tensor imaging parameters in white matter tracts of preterm newborns. J Magn Reson Imaging 22:467–474
Bland JM, Altman DG (1986) Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1:307–310
Costa-Santos C, Bernardes J, Ayres-de-Campos D et al (2011) The limits of agreement and the intraclass correlation coefficient may be inconsistent in the interpretation of agreement. J Clin Epidemiol (in press)
Fleiss JL (1986) The design and analysis of clinical experiments. Wiley, New York, pp xiv, 432
Nunes RG, Jezzard P, Clare S (2005) Investigations on the efficiency of cardiac-gated methods for the acquisition of diffusion-weighted images. J Magn Reson 177:102–110
Brockstedt S, Borg M, Geijer B et al (1999) Triggering in quantitative diffusion imaging with single-shot EPI. Acta Radiol 40:263–269
Skare S, Andersson JLR (2001) On the effects of gating in diffusion imaging of the brain using single shot EPI. Magn Reson Imaging 19:1125–1128
Brander A, Kataja A, Saastamoinen A et al (2010) Diffusion tensor imaging of the brain in a healthy adult population: normative values and measurement reproducibility at 3 T and 1.5 T. Acta Radiol 51:800–807
Drobyshevsky A, Bregman J, Storey P et al (2007) Serial diffusion tensor imaging detects white matter changes that correlate with motor outcome in premature infants. Dev Neurosci 29:289–301
Rose J, Butler EE, Lamont LE et al (2009) Neonatal brain structure on MRI and diffusion tensor imaging, sex, and neurodevelopment in very-low-birthweight preterm children. Dev Med Child Neurol 51:526–535
Farrell JAD, Landman BA et al (2007) Effects of signal-to-noise ratio on the accuracy and reproducibility, of diffusion tensor imaging-derived fractional anisotropy, mean diffusivity, and principal eigenvector measurements at 1.5 T. J Magn Reson Imaging 26:756–767
Author information
Authors and Affiliations
Consortia
Corresponding author
Rights and permissions
About this article
Cite this article
Lepomäki, V.K., Paavilainen, T.P., Hurme, S.A.M. et al. Fractional anisotropy and mean diffusivity parameters of the brain white matter tracts in preterm infants: reproducibility of region-of-interest measurements. Pediatr Radiol 42, 175–182 (2012). https://doi.org/10.1007/s00247-011-2234-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00247-011-2234-9