Abstract
Objectives
In the neuroradiological work-up of Multiple Sclerosis (MS), the detection of “black holes” (BH) represent an information of undeniable importance. Nevertheless, different sequences can be used in clinical practice to evaluate BH in MS. Aim of this study was to investigate the possible impact of different sequences, resolutions, and levels of expertise on the intra- and inter-rater reliability identification of BH in MS.
Methods
Brain MRI scans of 85 MS patients (M/F = 22/63; mean age = 36.0 ± 10.2 years) were evaluated in this prospective single-center study. The acquisition protocol included a 3 mm SE-T1w sequence, a 1 mm 3D-GrE-T1w sequence from which a resliced 3 mm sequence was also obtained. Images were evaluated independently by two readers of different expertise at baseline and after a wash-out period of 30 days. The intraclass correlation coefficient (ICC) was calculated as an index of intra and inter-reader reliability.
Results
For both readers, the intra-reader ICC analysis showed that the 3 mm SE-T1w and 3 mm resliced GrE-T1w images achieved an excellent performance (both with an ICC ≥ 0.95), while 1 mm 3D-GrE-T1w scans achieved a moderate one (ICC < 0.90). The inter-reader analysis showed that each of the three sequences achieved a moderate performance (all ICCs < 0.90).
Conclusions
The 1 mm 3D-GrE-T1w sequence seems to be prone to a greater intra-reader variability compared to the 3 mm SE-T1w, with this effect being driven by the higher spatial resolution of the first sequence. To ensure reliability levels comparable with the standard SE-T1w in BH count, an assessment on a 3 mm resliced GrE-T1w sequence should be recommended.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
The detection and report of “black holes” (BH) is part of the standard neuroradiological evaluation in Multiple Sclerosis (MS). Indeed, given the correlation existing between BH and clinical disability [1,2,3], MS international guidelines recommend to always include their presence and number in neuroradiological reports [4, 5]. Although the presence of acute inflammation is related to the detection of transient BH [6, 7], the prognostic relevance relies on the presence of chronic BH, persisting for at least 6 months [8, 9] in absence of contrast-enhancement [7], which represent areas of severe tissue destruction, with irreversible axonal and neuronal loss [1, 3, 6].
Operationally, BH have been defined, more than 20 years ago, as T1-weighted (T1w) hypointense lesions with signal intensity comprised between the one of the gray matter (GM) and that of the cerebrospinal fluid (CSF) on Spin-Echo (SE) T1w images [10], corresponding to hyperintense lesions on T2w images [6, 11, 12]. Nonetheless, the last decades have seen the increase in acquisition of 3D T1w sequences not only in research settings (i.e., for brain atrophy quantitative assessment), but also in everyday clinical practice [13]. These sequences, with the Magnetization-Prepared RApid Gradient-Echo (MPRAGE) being the most representative among them, lead to a large variety of advantages, such as increased spatial resolution and decreased acquisition time, but are obviously characterized by a different tissue contrast compared to standard SE-T1w sequences [14]. Accordingly, it has been demonstrated that the evaluation of SE-T1w and 3D-Gradient-Echo (GrE)-T1w sequences leads to the identification of a different number of T1w hypointense lesions in MS [15]. So far, however, no information about intra-reader reproducibility, a crucial point in the evaluation of a condition such as MS in which seriate MRI scans are acquired, is available in the literature. Furthermore, no previous work has investigated inter-reader reproducibility between neuroradiologists with different years of expertise in MS. Indeed, it can be hypothesized that new generations of neuroradiologists might be more likely trained to evaluate 3D-GrE-T1w sequences, which are acquired always more widely and routinely.
Given this background, aim of this study was to investigate the possible impact of different sequences (SE-T1w or 3D-GrE-T1w), image resolution, and level of training on the intra- and inter-rater reliability of BH identification in MS. Finally, as different degrees of microstructural changes have been reported in SE-T1w compared to 3D-GrE-T1w hypointense lesions [16], to explore the clinical meaningfulness of different assessment approaches we tested correlations between BH identified on different sequences and disability.
Material and methods
Compliance with ethical standards
This study was approved by the local Ethics Committee, in accordance with the ethical standards of the institutional research committee and with the 1964 Helsinki Declaration and its later amendments. Written informed consent was obtained from all patients prior to enrolment.
Participants
In this single center study, MRI data from MS patients prospectively acquired from January 2019 to December 2021 in the context of a larger prospective MRI study were selected. To be included in this study, patients had to fulfill the following inclusion criteria: age ≥ 18 or ≤ 70 years; MS diagnosis according to the 2017 revision of the McDonald’s criteria [17] absence of any medical conditions associated with brain pathology other than MS; an Expanded Disability Status Scale (EDSS) obtained within one week from the MRI exam; a Relapsing–Remitting (RR-MS) course according to Lublin et al. [18]. The following exclusion criteria were then applied: unavailability of a T1w sequence acquired after gadolinium administration; unavailability of both SE-T1w and 3D-GrE-T1w sequences acquired in the same MRI session; images with poor quality (i.e. due to motion artifacts) or patients with exclusively large confluent lesions.
A flowchart showing the number of patients included and excluded from the study is available in Fig. 1.
Images acquisition
All brain MRI scans were acquired on the same 3T scanner (Trio, Siemens Medical Systems, Erlangen, Germany) using the same acquisition protocol, that included a 3D Fluid-Attenuated Inversion Recovery sequence (FLAIR; TR = 6000 ms, TE = 396 ms, TI = 2200 ms, voxel size = 1x1x1mm, 176 sagittal slices, no gap), a 2D SE-T1w sequence acquired before gadolinium administration (TR = 615 ms, TE = 8.5 ms, voxel size = 1x1x3mm, number of slices = 40, no gap) as well as a 3D-GrE-T1w volume (MPRAGE, TR = 2500 ms, TE = 2.8 ms, TI = 900 ms, voxel size = 1x1x1mm, 160 axial slices, no gap) before and after contrast administration.
All T1w sequences were acquired with the same bicommisural orientation along the AC-PC line, to minimize possible errors in image evaluation due to multiplanar reconstruction of the 3D-GrE-T1w volume.
MRI data analysis
Three different sequences were evaluated by the readers in this study, namely the SE-T1w, the 3D-GrE-T1w (with a slice thickness = 1 mm) and the resliced-(rs)GrE-T1w (with a slice thickness = 3 mm). Indeed, given the differences in resolution between the 2D-SE-T1w and the 3D-GrE-T1w volume, the latter was resliced to a 2D-GrE-T1w sequence with a slice thickness equal to the one of the SE-T1w (3 mm), in order to retain only the effect of different tissue contrasts on BH evaluation and evaluate the possible effect of the different spatial resolution.
All images were independently evaluated by two readers with different expertise: a neuroradiology fellow with 4 years of experience in the field of MS (Reader A) and a board-certified neuroradiologist with more than 10 years of experience in MS field (Reader B).
Both readers evaluated images a first time (T0), and after a wash-out period of 30 days (T1).
To provide data in a different order and minimize possible learning curve effects, a random alphanumeric identification code was assigned to each sequence at T0 and randomly changed at T1.
At all steps, images were evaluated with the readers being blinded to any clinical or demographic information.
According to the literature [7, 11, 19], BH were defined as non-enhancing T1w hypointense lesions, with a minimal diameter of 3 mm and an intensity comprised between the CSF and the GM and corresponding to FLAIR hyperintensities. An example of what has been defined as chronic BH on both SE-T1w and GrE-T1w sequences is shown in Fig. 2. Confluent or poorly defined lesions, as well as acute BH (i.e., those showing enhancement after gadolinium administration), were excluded from the lesion count. An example of confluent periventricular and acute BH lesions is shown in Fig. 3.
Statistical analysis
All statistical analyses were performed using R (v. 4.2.1). Descriptive statistics are reported for demographics and lesion count.
Intraclass correlation coefficient (ICC) and corresponding 95% confidence intervals (CI) were employed to assess intra- and inter-reader reliability separately for each of the three sequences. Inter-reader reliability was assessed between readers’ T0 evaluations to minimize the influence of possible learning curve effects. According to the study by Koo and colleagues [20], values greater than 0.9 indicated excellent reliability, values ranging from 0.75 to 0.9 and from 0.5 to 0.75 indicated good and moderate reliability respectively, while values of less than 0.5 were indicative of poor reliability.
Additionally, these analyses were also replicated using Cohen's kappa statistics (Supplementary Materials).
Possible correlations between BH number on each MRI sequence at T0 (Reader B) and patients’ clinical status, assessed via EDSS, were tested with Pearson correlation coefficient analysis.
Results
After the application of the inclusion and exclusion criteria, eighty-five MS patients were included in the analysis (M/F = 22/63; mean age = 36.0 ± 10.2 years; median EDSS = 2.0 [range: 2.0 – 3.0]).
Means, standard deviations and medians of BH counts for each reader, sequence and assessment session are reported in Table 1.
For both readers, the intra-reader ICC analysis showed that SE-T1w and rsGrE-T1w images achieved an excellent performance in terms of reliability, whereas 3D-GrE-T1w scans achieved a moderate one. In particular, when evaluating the intra-reader reliability for Reader A, the highest reliability was associated with SE-T1w images (ICC = 0.98, CI = 0.97—0.99), followed by rsGrE-T1w images (ICC = 0.95, CI = 0.92—0.97), while 3D-GrE-T1w images presented the lowest ICC value (ICC = 0.86, CI = 0.79—0.91) (Fig. 4). On the other hand, when evaluating the intra-reader reliability for reader B, the highest reliability was associated to rsGrE-T1w images (ICC = 0.94, CI = 0.91—0.96), followed by SE-T1w images (ICC = 0.91, CI = 0.86—0.95). 3D-GrE-T1w images presented the lowest ICC value also in this case (ICC = 0.86, CI = 0.78—0.91) (Fig. 4).
Finally, in the inter-reader ICC analysis between Reader A and Reader B assessments at T0, each of the three sequences achieved a moderate performance. Indeed, despite the highest reliability being associated with SE-T1w images (ICC = 0.84, CI = 0.76—0.89), followed by 3D-GrE-T1w (ICC = 0.83, CI = 0.74—0.89) and rsGrE-T1w images (ICC = 0.81, CI = 0.72—0.87), similar ICC values and respective confidence intervals were observed. Comparable results were also obtained when the Cohen's kappa analysis was carried out (Supplementary Materials).
For all sequences, a significant correlation was observed between BH number and EDSS score (SE-T1w: r = 0.25, p = 0.03, CI = 0.03–0.45; rsGrE-T1w: r = 0.30, p < 0.01, CI = 0.08–0.49; 3D-GrE-T1w: r = 0.28, p = 0.01, CI = 0.06–0.47).
Discussion
The present study demonstrates that, applying the traditional definition of BH, the 3D-GrE-T1w sequence is prone to a greater intra-reader variability compared to the SE-T1w, with this effect being driven by the higher spatial resolution of the 3D-GrE-T1w sequence. Indeed, when evaluating the latter sequence but resampled to a resolution comparable to the one usually acquired of the SE-T1w acquisitions (thus preserving the different contribution to tissue contrast only, minimizing the possible effects of voxel resolution), we observed a comparable reliability in comparison to the one achievable with the SE-T1w.
Over the last years, 3D-GrE-T1w sequences have been largely preferred to SE-T1w, allowing for the relatively fast acquisition of whole brain volumes indispensable in research settings for morphometric segmentation and GM volume and thickness quantitative evaluation, but also reducing acquisition times with direct impact on the everyday clinical practice [16, 21]. Furthermore, the higher sensitivity of these sequences in identifying small lesions and subtle differences in tissue contrast is well-known [15, 21], as in future they also might be use as inputs in machine-learning algorithms that might help improving their detection [22]. Regarding BH identification, the issue of reproducibility has gained a growing interest over time [15, 23]. Indeed, reliability in BH identification directly affects neuroradiological reports, consequently impacting the information provided to the neurologists. Our results indicate that, more than image contrast, spatial resolution is directly related to reliability. As such, the evaluation of a resliced 3D-GrE-T1w sequence in clinical practice could result in higher reproducibility in BH assessment, closer if not comparable to the one obtained with the SE-T1w images. Interestingly, when assessing the inter-observer reliability between a young reader and a more experienced neuroradiologist, we observed similar concordance for all sequences, with only a slightly higher agreement when the SE-T1w sequence was evaluated, suggesting that the different level of expertise could exert a similar influence on variability independently form the sequence. A possible explanation for this finding could be researched in the abovementioned widespread and increasing diffusion of 3D-GrE-T1w images in clinical practice, which could have led to the acquaintance with BH detection on this sequence also in less experienced raters. On the other hand, whereas the last update of MAGNIMS–CMSC–NAIMS guidelines on MRI protocols in MS focuses on acquiring high-resolution T1w sequences [13], it is noteworthy to mention that some MS centers kept the habit of acquiring SE-T1w sequences. Thus, further studies are warranted to explore the reliability of these different sequences in BH assessment by readers without a specific experience in the MS field, to better simulate the daily clinical setting.
When evaluating correlations between BH and disability, our results are in line with previous studies showing the correlation between BH numbers and EDSS for all three sequences [15, 16]. Indeed, we found a weak, although statistically significant, correlation between these variables, a result expected also given the small range of EDSS of our MS group. It is noteworthy to remember that while the pathological relevance of SE-T1w hypointense lesions is clear and well understood, changes underlying hypointense lesions on 3D-GrE-T1w sequences do not seem to be univocally clarified [24]. Indeed, while more severe microstructural changes characterize SE-T1w compared to 3D-GrE-T1w hypointense lesions [16], the latter might represent the sum of a wide range of pathologic features, part of which could be reversible, such as edema and inflammation [15, 25].
This investigation does present some limitations. In the first place, being a single center study, the relatively small sample size here included could have lowered the sensitivity of our analyses, also considering that our group of patients included only RR-MS phenotypes, while it is known that the BH are commonly found (although usually in a confluent manner) in progressive stages of the disease [26]. Secondly, we only explored possible correlations between BH number and EDSS, as it would be of interest to further confirm their prognostic role by proving the correlation with other known prognostic biomarkers of the disease. Furthermore, since we have analyzed only images acquired on a 3T scanner, this study lacks information about the reliability of these sequences at 1.5T. Despite the well know limitations in routine acquisitions associated with a lower magnetic field strength [27], 1.5T scanners are widely used in clinical neuroradiological practice and therefore it could be of interest to compare the reliability of these different sequences at 1.5T. Moreover, in this study we could not address the variability in BH assessment across the wide range of scanner vendors and platforms currently available, also in the light of the systematic differences that can be present in studies with consistent scanner field strength and manufacturer after protocol harmonization [28]: for these reasons, future multi-center perspective studies are warranted to evaluate whether the results here presented can be generalized to different scanner platforms and field strengths.
Despite being characterized by these limitations, our study suggests that to ensure reliability levels comparable with the standard SE-T1w in BH count, which is crucial in the neuroradiological workup of MS patients, an assessment on a resliced GrE-T1w sequence should be recommended.
Data availability
Data supporting the findings of the study will be made available from the corresponding author upon reasonable request.
Abbreviations
- MS:
-
Multiple Sclerosis
- BH:
-
Black Holes
- CSF:
-
Cerebrospinal Fluid
- EDSS:
-
Expanded Disability Status Scale
- GM:
-
Gray Matter
- ICC:
-
Intraclass Correlation Coefficient
- GrE:
-
Gradient-Echo
- rsGrE:
-
3 mm-resliced Gradient-Echo
- SE:
-
Spin-Echo
References
Barkhof F, McGowan JC, van Waesberghe JH, Grossman RI (1998) Hypointense multiple sclerosis lesions on T1-weighted spin echo magnetic resonance images: their contribution in understanding multiple sclerosis evolution. J Neurol Neurosurg Psychiatry 64(Suppl 1):S77-79
Bitsch A, Kuhlmann T, Stadelmann C et al (2001) A longitudinal MRI study of histopathologically defined hypointense multiple sclerosis lesions: Longitudinal Study of Hypointense T1 Lesions. Ann Neurol 49:793–796. https://doi.org/10.1002/ana.1053
van Walderveen MAA, Kamphorst W, Scheltens P et al (1998) Histopathologic correlate of hypointense lesions on T1-weighted spin-echo MRI in multiple sclerosis. Neurology 50:1282–1288. https://doi.org/10.1212/WNL.50.5.1282
Alessandrino F, Pichiecchio A, Mallucci G et al (2018) Do MRI Structured Reports for Multiple Sclerosis Contain Adequate Information for Clinical Decision Making? Am J Roentgenol 210:24–29. https://doi.org/10.2214/AJR.17.18451
ASNR (2018) Neuroradiology CDE Distribution Supporting Documentation https://www.asnr.org/wp-content/uploads/cde/ASNR%202018%20CDE%20Distribution%20Supporting%20Documentation.pdf
Siger M (2022) Magnetic Resonance Imaging in Primary Progressive Multiple Sclerosis Patients: Review. Clin Neuroradiol 32:625–641. https://doi.org/10.1007/s00062-022-01144-3
Sahraian MA, Radue E-W, Haller S, Kappos L (2009) Black holes in multiple sclerosis: definition, evolution, and clinical correlations: Black holes in MS. Acta Neurol Scand 122:1–8. https://doi.org/10.1111/j.1600-0404.2009.01221.x
Arnold DL, Matthews PM (2002) MRI in the diagnosis and management of multiple sclerosis. Neurology 58:S23–S31. https://doi.org/10.1212/WNL.58.8_suppl_4.S23
Bagnato F (2003) Evolution of T1 black holes in patients with multiple sclerosis imaged monthly for 4 years. Brain 126:1782–1789. https://doi.org/10.1093/brain/awg182
Filippi M, Preziosa P, Barkhof F et al (2021) Diagnosis of Progressive Multiple Sclerosis From the Imaging Perspective: A Review. JAMA Neurol 78:351. https://doi.org/10.1001/jamaneurol.2020.4689
Filippi M, Rovaris M, Rocca MA et al (2001) Glatiramer acetate reduces the proportion of new MS lesions evolving into “black holes.” Neurology 57:731–733. https://doi.org/10.1212/WNL.57.4.731
van Waesberghe JH, van Walderveen MA, Castelijns JA et al (1998) Patterns of lesion development in multiple sclerosis: longitudinal observations with T1-weighted spin-echo and magnetization transfer MR. AJNR Am J Neuroradiol 19:675–683
Wattjes MP, Ciccarelli O, Reich DS et al (2021) 2021 MAGNIMS-CMSC-NAIMS consensus recommendations on the use of MRI in patients with multiple sclerosis. Lancet Neurol 20:653–670. https://doi.org/10.1016/S1474-4422(21)00095-8
Brant-Zawadzki M, Gillan GD, Nitz WR (1992) MP RAGE: a three-dimensional, T1-weighted, gradient-echo sequence–initial experience in the brain. Radiology 182:769–775. https://doi.org/10.1148/radiology.182.3.1535892
Dupuy SL, Tauhid S, Kim G et al (2015) MRI detection of hypointense brain lesions in patients with multiple sclerosis: T1 spin-echo vs. gradient-echo. Eur J Radiol 84:1564–1568. https://doi.org/10.1016/j.ejrad.2015.05.004
Lapucci C, Romano N, Schiavi S et al (2020) Degree of microstructural changes within T1-SE versus T1-GE hypointense lesions in multiple sclerosis: relevance for the definition of “black holes.” Eur Radiol 30:3843–3851. https://doi.org/10.1007/s00330-020-06761-5
Thompson AJ, Banwell BL, Barkhof F et al (2018) Diagnosis of multiple sclerosis: 2017 revisions of the McDonald criteria. The Lancet Neurology 17:162–173. https://doi.org/10.1016/S1474-4422(17)30470-2
Lublin FD, Reingold SC, Cohen JA et al (2014) Defining the clinical course of multiple sclerosis. Neurology 83:278–286. https://doi.org/10.1212/WNL.0000000000000560
Fazekas F (2000) Apolipoprotein E genotype related differences in brain lesions of multiple sclerosis. J Neurol Neurosurg Psychiatry 69:25–28. https://doi.org/10.1136/jnnp.69.1.25
Koo TK, Li MY (2016) A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. J Chiropr Med 15:155–163. https://doi.org/10.1016/j.jcm.2016.02.012
Hu XY, Rajendran L, Lapointe E et al (2019) Three-dimensional MRI sequences in MS diagnosis and research. Mult Scler 25:1700–1709. https://doi.org/10.1177/1352458519848100
McKinley R, Wepfer R, Aschwanden F et al (2021) Simultaneous lesion and brain segmentation in multiple sclerosis using deep neural networks. Sci Rep 11:1087. https://doi.org/10.1038/s41598-020-79925-4
Bozsik B, Tóth E, Polyák I et al (2022) Reproducibility of Lesion Count in Various Subregions on MRI Scans in Multiple Sclerosis. Front Neurol 13:843377. https://doi.org/10.3389/fneur.2022.843377
Traboulsee A, Simon JH, Stone L et al (2016) Revised Recommendations of the Consortium of MS Centers Task Force for a Standardized MRI Protocol and Clinical Guidelines for the Diagnosis and Follow-Up of Multiple Sclerosis. AJNR Am J Neuroradiol 37:394–401. https://doi.org/10.3174/ajnr.A4539
Schiavi S, Petracca M, Sun P et al (2021) Non-invasive quantification of inflammation, axonal and myelin injury in multiple sclerosis. Brain 144:213–223. https://doi.org/10.1093/brain/awaa381
Truyen L, Van Waesberghe JHTM, Van Walderveen MAA et al (1996) Accumulation of hypointense lesions (“black holes”) on T 1 spin-echo MRI correlates with disease progression in multiple sclerosis. Neurology 47:1469–1476. https://doi.org/10.1212/WNL.47.6.1469
Schick F, Pieper CC, Kupczyk P et al (2021) 1.5 vs 3 Tesla Magnetic Resonance Imaging: A Review of Favorite Clinical Applications for Both Field Strengths-Part 1. Invest Radiol 56:680–691. https://doi.org/10.1097/RLI.0000000000000812
Shinohara RT, Oh J, Nair G et al (2017) Volumetric Analysis from a Harmonized Multisite Brain MRI Study of a Single Subject with Multiple Sclerosis. AJNR Am J Neuroradiol 38:1501–1509. https://doi.org/10.3174/ajnr.A5254
Funding
This work was supported in part by a grant by FISM—Fondazione Italiana Sclerosi Multipla – cod. 2020/R-Single/061 and financed or co-financed with the ‘5 per mille’ public funding.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Ethics approval
This study was approved by the local Ethics Committee, in accordance with the ethical standards of the institutional research committee and with the 1964 Helsinki Declaration and its later amendments.
Informed consent
Written informed consent was obtained from all patients prior to enrolment.
Competing interests
The Authors declare no competing interests.
Conflicts of interest
The Authors declare no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Tranfa, M., Scaravilli, A., Pastore, C. et al. The impact of image contrast, resolution and reader expertise on black hole identification in Multiple Sclerosis. Neuroradiology 66, 1345–1352 (2024). https://doi.org/10.1007/s00234-024-03310-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00234-024-03310-5