Introduction

Systemic sclerosis (SSc) is a complex disease leading to fibrosis of the skin and internal organs and in particular of the lung [1, 2]. Interstitial lung disease is the leading cause of death in systemic sclerosis [3, 4]. Its diagnosis may be achieved with high-resolution computed tomography (HRCT) as a non-invasive surrogate for gold standard biopsy [5,6,7]. It is well known that HRCT may reflect the features of non-specific interstitial pneumonia (NSIP—characterized by a greater prevalence ground-glass opacities (GGO) than coarse reticulation (RET)) or, less frequently, of usual interstitial pneumonia pattern (UIP—with sub-pleural basal predominance of RET and honeycombing (HC), with or without traction bronchiectasis) [8, 9]. In order to reduce radiation exposure, magnetic resonance imaging (MRI) was proposed as a radiation-free technique to detect morphological and functional findings of parenchymal lung disease [10]. Lung MRI has the potential to differentiate inflammation-predominant versus fibrosis-predominant lesions, and during the past decade, significant developments have been achieved in this field with promising preliminary data on its potential employment in the assessment of many pulmonary diseases. Lung MRI has been indeed performed in patients with ILD of different aetiologies, from connective tissue disease (CTD)–related ILD (including SSc) to idiopathic pulmonary fibrosis (IPF) and sarcoidosis [11, 12]. In these contexts, lung MRI was shown to be comparable or inferior to HRCT in the morphological evaluation of typical ILD changes (including ground-glass opacities (GGO), reticulation (RET), and honeycombing (HC)) [11]. Promising data are available for lung MRI to distinguish between lung segments with active and non-active inflammation; despite limited evidences being available for detection of inflammation in IPF and sarcoidosis, lung MRI has proven to be useful in the assessment of lung ventilation changes, as well as in the assessment of lymph nodes’ involvement [12]. Conversely, short tau inversion recovery (STIR) sequences can detect oedema with high specificity, and T1 mapping assessment can detect tissue abnormalities with high sensitivity, especially in CTD-related ILD [11].

However, at present, MRI use is limited compared to HRCT, in particular for technical issues due to vascular, respiratory, and body movements as well as a lower spatial resolution [13]; thus, it is not routinely employed for the evaluation of the lung, although it is considered the gold standard for non-invasive virtual histological discrimination of different tissues.

The aim of the present study was to compare the MRI signal of the lung with chest HRCT features of SSc patients, to analyse its correlation with other ILD clinical and instrumental parameters, and to evaluate the role of MRI in the prediction of ILD clinical progression.

Materials and methods

Patient population

This was a prospective study enrolling consecutive patients attending the Rheumatology outpatient clinic of the Azienda Ospedaliero-Universitaria Careggi, Florence, with the following inclusion criteria: (1) a diagnosis of SSc according to the 2013 ACR/EULAR classification criteria [14]; (2) a clinical indication to cardiac MRI evaluation; (3) a clinical indication to chest HRCT [5]; (4) MRI and HRCT done on the same day; (5) cardiac MRI axial acquisition on the same lung zones of HRCT. Data regarding demographic, clinical, serological, and instrumental features of the disease were collected from medical history charts, in particular regarding as follows: previous/ongoing tobacco smoke exposure, skin involvement according to modified Rodnan skin score (mRSS) [15], forced vital capacity (%FVC), lung diffusion of carbon oxide (%DLco), total lung capacity (%TLC), microvascular involvement [16], presence/history of digital ulcers [17], and the number of anterior and posterior B-lines on lung ultrasound [18]. Patients were clinically followed up from the date of MRI and HRCT: according to the evolution of their lung involvement, they were divided into stable or worsened. A group of 15 healthy individuals without SSc, undergoing clinically driven cardiac MRI which in the end excluded any cardiac condition, was considered the control group. The study was approved by the local Ethical Committee Area Vasta Nord-Ovest Toscana.

High-resolution computed tomography

HRCT were performed using a spiral CT/I high-speed GEMs (General Electric Medical Systems) machine carrying multidetector CT scanner with a 1.0–1.5-mm thick section taken at 1.0–1.25-mm intervals throughout the entire lung during inspiration in the prone position and with spiral technique in the supine position. HRCT images were semi-quantitatively scored by two thoracic radiologists, blinded to MRI as well as to other examinations and clinical findings, with specific expertise in pulmonary disease and belonging to an established multidisciplinary group consisting of pulmonology, radiology, pathology, rheumatology, and cardiology specialists (GIIP, Gruppo Interdisciplinare Interstiziopatie Polmonari) [19]. The presence of ILD was then visually scored according to the previously described semi-quantitative Scleroderma Lung Study (SLS) I score [9]. The SLS I score was performed in three zones (upper zone: extending from the apex to aortic arch; middle zone: from the aortic arch to inferior pulmonary vein; lower zone: from the inferior pulmonary veins to diaphragms) of each lung, as previously described (5). The SLS I score is a semi-quantitative scoring method where the extent of the pulmonary abnormality in each of six zones was graded using a scale from 0 to 4, as follows (0 = absent, 1 = 1–25%, 2 = 26–50%, 3 = 51–75%, and 4 = > 75%) to evaluate the four categories of parenchymal abnormalities: (1) pure GGO; (2) lung fibrosis including thickened RET, bronchiectasis, and bronchioloectasis (BB); (3) HC, (4) emphysema (EMP). The two readers scrolled all slices from the apex to the base of each HRCT scan and attributed to each of the 6 thoracic zones, the eventual presence of GGO, RET, BB, HC, or EMP, and a number from 0 to 4 according to the extent of the parenchymal abnormality.

Lung magnetic resonance imaging

MRI was performed on a 1.5-T system (SIGNA Excite HDxT, General Electronic Medical, Waukesha, WI, USA) with cardiac phased-array coil (8 channels). The lung was assessed at the end of a clinically driven cardiac MRI by axial acquisition on the same lung zones of HRCT in the supine position, with ECG gating, during inspiration. Images were acquired by:

  • Triple IR; TR (related to heart rate) about 2000 ms; TE 82 ms; FoV 38 cm; TI 180 ms; ThK 8 mm with gap variable depending on the size of the lung zone (15–20 cm); 224 × 224; Nex 1; asset 2; acquisition time about 8 s for section

  • T1 Map-MCine-IR (modified Cine-IR) spoiled gradient-echo sequences with TR 3 ms; TE 1.1 ms; FA° 8 ms ThK 8 mm with gap variable depending on the size of the lung zone (15–20 cm); 192 × 192, views per segment (VPS) = 32; Nex 1; asset 2; acquisition time about 8 s for section 8 s RR = 6 time required full recovery of the longitudinal magnetization

  • MCine-IR sequences are repeated and at 5, 10, and 15 min post gadolinium-chelate contrast agent (CA)

  • T1 Map with MCine-IR sequences (modified Cine-IR) spoiled gradient echo with TR 3 ms; TE 1.1 ms; FA° 8 ms ThK 8 with gap variable depending on the size of the lung zone (15–20 cm); 192 × 192, views per segment (VPS) = 32; Nex 1; asset 2; acquisition time about 8 s RR = 2 time required full recovery of the longitudinal magnetization

MRI images were analysed using the HIPPO SW software, developed in IDL 8.2.3, and validated by the Cardiovascular Magnetic Resonance Unit of the Fondazione Toscana G. Monasterio. With this software, it is possible to manually draw one or more region of interest (ROI) on a DICOM image at a particular inversion time (IT), allowing the detection of the single point within the ROI signal intensity and thus calculating an intensity signal/IT curve. The same protocol was applied to SSc patients and to the control group.

Comparison between HRCT and lung MRI

All patients underwent chest HRCT and lung MRI on the same day. On MRI, ROIs were positioned to cover all 6 thoracic zones from the apex to the diaphragm, whenever possible, according to the same scheme used for the HRCT (upper zone extending from the apex to aortic arch; middle zone extending from the aortic arch to inferior pulmonary vein; lower zone extending from the inferior pulmonary veins to diaphragms). ROIs were positioned on areas of parenchymal hyperintensity whenever present at visual estimation; otherwise, they were positioned on parenchymal areas of apparent normal intensity. ROI positioning and selection were done by the same operator on all exams, blinded to HRCT and all other patients’ data, apart from the initial diagnosis of SSc. ROIs’ signal intensities were then measured in STIR and T1 mapping pre- and post-gadolinium sequences at 5, 10, and 15 min after CA administration.

According to the combination of HRCT/MRI features, patients were clustered as follows:

  • Group 1: patients with both normal HRCT and MRI, identified as patients without SSc-related ILD signs on HRCT and no visual hyperintensity on MRI (normal, no ILD)

  • Group 2: patients without SSc-related ILD signs on HRCT but areas of visual hyperintensity on MRI (discordant, HRCT−/MRI+)

  • Group 3: patients with SSc-related ILD signs on HRCT and areas of visual hyperintensity on MRI, identified as patients with SSc-related ILD (abnormal, ILD)

Lung ultrasound

LUS was performed on the same the day of HRCT and MRI, as previously described (11) on the antero-lateral and posterior thorax, for a total of 58 scanning sites (28 anterior scanning sites +30 posterior scanning sites), using 2.5–3.5-MHz cardiac sector transducers (2.5 cm long) on commercially available ultrasonographic equipment (Vivid i, GE/Vingmed, Milwaukee, WI, USA). Convex (3.5–6.0 MHz) or linear transducers (7.0–10.0 MHz) were used when needed to better identify pleural alterations.

B-lines were defined as discrete laser-like vertical hyperechogenic reverberation artefacts that arise from the pleural line, extend to the bottom of the screen, and move synchronously with respiration and lung sliding [20]. In each scanning site, B-lines were quantified from 0 to 10 as previously described [21], and summed up to obtain a total number of B-lines [21, 22]. All LUS exams were done by a single operator with more than 10 years’ experience in LUS, blinded to HRCT and MRI data, and to all other clinical and instrumental information apart those clearly evident from inspection.

Follow-up

Patients were followed up at regular outpatient clinic appointments every 6–12 months by an attending rheumatologist [23]. Since there is no universally accepted definition of worsening SSc-related ILD, events were classified based on an integrated evaluation of the clinical picture combined with all available predictors of progression (i.e., ≥ 10% decrease in FVC, ≥ 15% decrease in DLCO, extent and characteristics of disease on HRCT scan) [18]. Two physicians, blind to lung MRI data, evaluated each case separately. Patients were censored at the time of the event or at the time of the last available visits).

Statistical analysis

Continuous variables were expressed as mean ± standard deviation or as median and inter-quartile range (IQR), as appropriate according to normal distribution. Two-sample comparisons were performed using t test if variables were normally distributed and the Mann-Whitney U test for non-normally distributed data. Differences between baseline and each experimental point were tested using one-way analysis of variance followed by Tukey’s post hoc test. Correlations between STIR and T1 signals and abovementioned clinical-instrumental parameters were tested with Pearson’s or Spearman’s rho test, as appropriate. Differences between groups were analysed with the ANOVA univariate test or Kruskal-Wallis test for independent samples, as applicable. The association of selected variables with the outcome was assessed by Cox’s proportional hazard model using univariable and multivariable procedures. The best threshold for STIR values to predict pulmonary worsening was determined using receiver operating characteristic (ROC) curve by the Youden index; the results are expressed in terms of area under the curve (AUC) and the 95% confidence interval for this area. The event rates were estimated with Kaplan-Meier curves and compared by the log-rank test. A p value of <0.05 was considered statistically significant. All statistical analyses were performed using the Statistical Package for the Social Sciences (SPSS Inc., version 22, Chicago, IL, version 20) and GraphPad Prism version 6 (GraphPad Software Inc., San Diego, CA, USA).

Results

Thirty SSc patients (28 women, median age 47 (IQR 37–54) years, median disease duration 3 (IQR 2–5) years) were enrolled: 11 had previous or ongoing smoking exposure, 13 showed a NYHA class ≥2, 3 presented diffuse cutaneous skin involvement, and 7 had a current or previous history of digital ulcers. Anti-nuclear, anti-centromere, and anti-topoisomerase I autoantibodies were positive in 29, 10, and 13 patients, respectively. Microvascular involvement on nail-fold videocapillaroscopy was characterized as 7 no specific, 11 early, 9 active, and 3 late patterns. More epidemiological, clinical, and instrumental details are presented in Table 1.

Table 1 Clinical characteristics of the study population

Comparison between HRCT and lung MRI

HRCT showed absence of ILD in 17 patients and signs of ILD in 13 patients. The SLS I score ranged from 0 to 16. From a visual point-of-view, the lung parenchyma qualitatively appeared with normal intensity on MRI in 10 cases (all corresponding to patients with absence of ILD at HRCT), whereas in 20 cases, MRI showed some degree of hyperintensity of the lung parenchyma. Among these patients, 13 had also evidence of ILD at HRCT, but 7 patients had a normal HRCT.

According to the previously presented combination of HRCT/MRI features, 10 patients were labelled as normal (group 1, both HRCT and MRI without ILD), 7 patients as discordant (group 2, negative HRCT but positive MRI), and 13 as abnormal (group 3, both positive HRCT and MRI, ILD present). Patients in the three groups were statistically different in terms of functional impairment (%FVC, %DLco, with trend towards statistical significance of %TLC), and ILD severity according to both SLS score on HRCT and number of B-lines on lung ultrasound (Table 2).

Table 2 Pulmonary functional, ultrasonographic, and MRI features of the patients’ groups

On MRI, a total of 135 ROIs were selected (mean of 4.5 ROIs per patient). Mean STIR values were significantly different among the 3 groups (Table 2, Fig. 1, and Supplementary Fig. 1). All T1 mapping pre- and post-CA values (at 5, 10, and 15 min) were also significantly different between the three groups (Table 2). In group 3, no significant differences in STIR or T1 values were found between the different ILD features (GGO, RET, and HC as seen on HRCT).

Fig. 1
figure 1

HRCT and MRI imaging in 3 different patients of groups 1, 2, and 3

Mean STIR values showed significant correlations with SLS I score (R = 0.79, p < 0.0001), total lung ultrasound B-lines (R = 0.73, p < 0.0001) (Supplementary Fig. 2), mRSS (R = 0.52, p = 0.003), %FVC (R = − 0.46, p = 0.022), and %DLco (R = − 0.63, p = 0.0001). Similarly, mean T1 mapping signal at T0 (pre-CA) showed a significant correlation with SLS I score (R = 0.52, p = 0.003), lung ultrasound B-lines (R = 0.59, p = 0.001), %TLC (R = − 0.65, p = 0.0001), and %DLco (R = − 0.45, p = 0.014). All the ROI of the subjects of the control group yielded a STIR value of 0.

Follow-up

Patients were followed up for a mean of 25 ± 20 months. During this time gap, 9 patients presented worsening of lung involvement (0/10 patient from group 1, 2/7 patients from group 2, and 7/13 patients from group 3). We found a significant difference in mean STIR values between patients with stable and worsening lung involvement, but not in T1 mapping values (both pre- and post-CA) (Table 3). At univariate analysis, mean STIR values, positivity of anti-topoisomerase I, and posterior ultrasound B-lines were significantly associated with events, with mean STIR values being the only independent predictors of events at multivariate analysis (HR 1.018; CI 1.005–1.031; p < 0.005—see Table 4). ROC analysis showed an AUC for STIR values of 0.85 (0.71–0.99; p = 0.002). A cut-off of STIR >90 ms was then determined to maximize the sum of sensitivity and specificity, with a sensitivity and specificity of 80%. A STIR value >90 ms was thus able to discriminate patients at a higher risk of worsening pulmonary involvement (HR 8.80; CI 1.81–42.74; p < 0.007), as confirmed by Kaplan-Meier curves (Fig. 2).

Table 3 Baseline MRI values according to worsening of lung involvement
Table 4 Univariate and multivariate regression to predict for worsening lung involvement
Fig. 2
figure 2

Kaplan-Meier curves to predict lung worsening during follow-up in patients with mean STIR value ≤ and > 90 ms

Discussion

Our data show that lung MRI STIR and T1 values are significantly different between patients with and without SSc-related ILD. STIR values, independently of HRCT appearance, are also able to predict worsening lung involvement over time. HRCT is the gold standard to assess and quantify morphological abnormalities in SSc-related ILD, in terms of both parenchymal and vascular changes [24]. However, in the last decade, other imaging modalities have been explored to assess lung involvement, such as lung ultrasound and lung MRI, which could yield the advantage to reduce radiation exposure [25]. In previous studies, 3-T lung MRI T1-weighted sequences have been analysed in IPF: pathological areas presented a different signal intensity both pre- and post-CA administration, both when compared to non-altered areas of IPF patients and to normal lung areas [26]. Previously, MRI has been tested in SSc-ILD showing interstitial modifications with a good inter- and intra-observer agreement for the measurement of ILD extent [27]. Our study confirms the potential role of lung MRI in the detection of ILD in SSc patients. In addition to previous results, we measured resonance signal intensity in STIR and T1-weighted sequences in anatomical sections corresponding to normal lung parenchyma and ILD. We found a significant difference, in either STIR or T1 values, between normal and pathological ILD tissue areas, with HRCT as the gold standard. This was further supported by the significant correlations with other functional (%FVC, %TLC, %DLco), imaging (lung ultrasound B-lines), and radiological (HRCT SLS I score) parameters reflecting SSc-ILD. The differences in T1 values of pathological and normal lung parenchyma could be explained by differences in proton density and tissue composition, likely related to the higher percentage of water and proteins in inflamed areas or increase of macro-molecule fraction and water decrease content in fibrotic areas [28].

Moreover, we identified a group of patients with apparently no signs of ILD on HRCT but with signal intensity abnormalities at lung MRI. This group, while not showing clear ILD features, does not seem completely normal, with intermediate values of STIR and T1 at MRI, as well as %FVC, %DLco, and lung ultrasound B-lines. This evidence could be related to a higher sensitivity but lower specificity of quantitative lung MRI and lung ultrasound when compared to visual estimation of HRCT. Another possible explanation which could account for the apparent discrepancy between HRCT and MRI results could be related to the transient accumulation of extravascular lung water in the dependent areas of the lung, due to the supine position, determining increased STIR and T1 values compared to the normally aerated lung parenchyma. In fact, Stadler et al. have shown that T1 value after CA administration was higher in posterior versus anterior lung areas, possibly related to the pooling of blood in the supine position [29]. In our patients, MRI was performed in full inspiration, with reduction of proton density within the ROIs and consequently lower signal-to-noise ratio in the normal lung. Similar findings were previously reported in 24 SSc patient with T2 sequence MRI: in the 2 cases of positive MRI but negative HRCT, the authors speculated on the possible detection by MRI of a very early non-HRCT-detectable GGO; however, they considered these two cases as false positive for MRI, given the presence of false positive also in the control group, differently from our cohort, where all control subjects without SSc had no increased STIR signal on the lung parenchyma [30].

Our data show significant correlations between MRI findings and ILD signs as assessed by other tools. Mean STIR and mean T1 values correlated with %TLC and %DLco. This is in agreement with the results of Pinal-Fernandez et al. who showed a significant correlation between MRI evaluation of ILD extent and respiratory functional parameters [27]. These data confirm also those of Ohno et al. who showed that mean T2 values are different when comparing connective tissue disease ILD and healthy subjects, with significant correlation with %DLco and severity of ILD on HRCT [31]. Moreover, the correlations found with lung ultrasound B-lines further support the role of non-radiating tools in evaluating lung involvement.

To the best of our knowledge, these are the first data showing promising results of the use of lung MRI in predicting future clinical worsening related to lung involvement. In SSc, the capacity to identify patients characterized by a progressive evolution still remains one of the main clinical unmet needs [23]. In our cohort, STIR values were the only parameter, among all the clinical and instrumental data available, which was able to predict further ILD development or worsening, irrespective of the presence of overt ILD at HRCT (group 2 with discordant normal HRCT and hyperintensity on MRI).

Our data should be considered preliminary because the study has some limitations. The study population is not large, and MRI was performed in the supine position only, in order to examine the myocardial status. This position is not ideal to discriminate dependent areas with increased signal due to the patient’s position. Moreover, some details concerning MRI methodology need to be tested and further verified on a much larger population to identify the best setting for the ILD investigation. Finally, our population presented at baseline relatively preserved lung function in terms of both FVC% and TLC%, while mean %DLco values already showed a borderline impairment. This could be representative of the early signs of ILD, without significant restrictive impairment both at baseline and during follow-up assessments, which may be more frequent in advanced and rapidly progressive cases.

In conclusion, lung MRI is a promising technique which might be able to discriminate patients with normal lung parenchyma from patients with SSc-ILD, and to identify altered signal intensities in lung parenchyma areas that appear normal on HRCT.

These preliminary data support the scientific effort to better understand the potentiality of lung MRI which, in the future, could help selecting patients for an early personalized treatment of SSc-related ILD [32, 33], as well as a more appropriate use of HRCT, in order to reduce radiation exposure [34], in particular in young fertile patients.