Introduction

Recurrent anterior dislocation of the humeral head causes wear, remodelling and bone loss of the antero-inferior margin of glenoid [1]. The reduction of the glenohumeral contact area can further increase joint instability, thus causing failure in soft tissue-only surgery [2, 3]. Therefore, pre-operative knowledge of glenoid bone loss is important in choosing the best surgical approach, in other words, whether to perform capsulolabral repair alone or also bone augmentation (i.e., Latarjet procedure, open or arthroscopic bone block) [1, 4]. Different glenoid bone loss threshold values have been reported, ranging from 10 to 25% [5]. A recent expert consensus statement stated that bone graft reconstruction should be considered when the glenoid bone deficit is greater than 20% [6].

Computed tomography (CT) was the first imaging modality to be used to assess glenoid bone loss: anterior glenoid flattening, decreased maximum glenoid width, and decreased maximum width-to-length ratio resulted the most useful parameters [7]. Later, PICO method was proposed to quantify the bone loss using the unaffected contralateral glenoid as a reference [8]. Then, the CT method was validated through arthroscopic comparison and magnetic resonance imaging (MRI) was also proposed as a valuable radiation-free alternative modality that demonstrates similar accuracy levels [9, 10]. The quantification of glenoid bone loss is still open to debate with regard to the radiological inter-reader discrepancy, the accuracy of MRI compared to CT, and the time-consuming processes of CT and MRI methods. Furthermore, nowadays because of the widespread use of both MR arthrography (MRA) to assess soft tissues and PICO method, acquiring CT images of both shoulders to assess glenoid bone loss, concerns have been raised regarding the economic impact on healthcare systems and radiation burden for patients.

In this context, the purpose of this multicentric study is to evaluate which method is the most easily reproduced and to establish agreement in the measurements carried out on CT and MRA and on unilateral and bilateral glenoid. According to our knowledge, there are no further multicentric and multi-reader studies in the literature that have compared different CT and MRA methods on a large sample of patients.

Material and methods

Study design

Campolongo Hospital Institutional review board approval was obtained. All participants of this study provided written informed consent for collection of data in order to use them for scientific purposes.

Patients

This study is concerned with the evaluation of a consecutive series of 110 patients with history of anterior shoulder dislocation who underwent shoulder MRA and unenhanced bilateral shoulder CT between January 2012 and April 2019 in three different hospitals (Campolongo Hospital, Medicanova Institute, Varelli Institute). All patients underwent both MRA and bilateral CT of the shoulder on the same day. Four patients were discarded, three of them for previous Latarjet surgery and one due to low quality MRA images related to motion artifacts. Six patients with bilateral glenoid bone deficiency were also excluded. Finally, a total of 100 patients (86 males, 14 females; mean age: 26.4 ± 8.7 years, range 16–52) were included in the study.

MRA and CT examination technique

All MRA were performed on a 1.5 T magnet (Siemens Magnetom Aera, Toshiba Titan, and Philips® Achieva). MRA protocol included at least a Turbo spin-echo (TSE) T1-weighted sequence on the oblique sagittal plane with a maximum field of view (FOV) of 18 cm and a maximum slice thickness of 4 mm. Any volumetric sequences have been deleted from the dicom folder, thus using only the acquired oblique sagittal plane in measurements. DOTAREM® 0.0025 mmol/ml was injected as the intra-articular contrast agent.

Simultaneous CT examination of both shoulders was performed on two different CT scanners (Toshiba Aquilion 16; Aquilion 64) using a maximum FOV of 18 cm and ≤ 1:1.75 pitch. The CT scanning plane extended from the acromion to just below the glenoid, with the patient's arms positioned along the chest wall and the palms pronated. In order to test the reader's variability in reconstructing the planes, only the native axial images were archived, and not the series of multiplanar reconstructed images.

MRA and CT analysis

All DICOM files were anonymized, renamed, and sent to 8 radiologists (2–30 years of musculoskeletal radiology experience) and 3 orthopedic surgeons (1–30 years of shoulder surgery experience). Of the latter, two of the orthopedists performed the calculations together with the reference radiologist, as would be done in clinical practice. Independently and blinded to clinical data, each reader or pair of readers (radiologist-orthopedic) calculated the glenoid bone loss of each patient using the following methods: best fit circle area on both MRA and CT images [10, 11] (Fig. 1a, b), maximum transverse glenoid width relative to diameter of best-fit circle on MRA and CT [10] (Fig. 2a, b), CT PICO technique [8] (Fig. 3a, b), ratio of the maximum glenoid fossa width to the maximum glenoid height on MRA and CT [7] (Fig. 4a, b) and length of flattening of the anterior glenoid curvature on MRA and CT [7] (Fig. 5a, b).

Fig. 1
figure 1

Turbo spin-echo T1-weighted a and CT b oblique images en face to glenoid. Glenoid bone loss is measured within best-fit circle. Glenoid bone loss is percentage reduction in osseous defect area (area y) relative to area of best-fit circle (area y + area x)

Fig. 2
figure 2

Turbo spin-echo T1-weighted a and CT b oblique images en face to glenoid. Maximum transverse width of the bony glenoid (black line) is compared with diameter of best-fit circle (black + white line). Glenoid bone loss is percentage reduction in maximum transverse glenoid width relative to diameter of best-fit circle

Fig. 3
figure 3

PICO method. CT oblique reformatted images en face to glenoid of nondislocating a and dislocating shoulder b. Glenoid bone loss is percentage reduction in osseous defect area (area y) relative to area of best-fit circle drawn on nondislocating shoulder (area x, left)

Fig. 4
figure 4

Turbo spin-echo T1-weighted a and CT b oblique images en face to glenoid. Glenoid bone loss is measured as ratio of the maximum glenoid fossa width to the maximum glenoid height

Fig. 5
figure 5

Turbo spin-echo T1-weighted a and CT b oblique images en face to glenoid. Glenoid bone loss is measured as length of flattening of the anterior glenoid curvature (dashed line)

All readers were free to use the processing software of their choice to visualize the DICOM files and to perform all measurements. They were also asked to calculate the time taken for each of the measurements performed on the 20th patient.

Data analysis

Statistical analysis was performed using MATLAB software (version R2020b). Data were preliminary processed to be expressed in common units. We used the Pearson’s Correlation Coefficient (PCC) to calculate: (i) the inter-reader agreement for each quantification method; (ii) the intra-reader inter-modality agreement between MRA and CT using the best fit circle area, maximum transverse glenoid width relative to diameter of best-fit circle, ratio of the maximum glenoid fossa width to the maximum glenoid height, and length of flattening of the anterior glenoid curvature; and (iii) the inter-modality agreement of bone loss area calculated with CT “best-fit circle” on unilateral glenoid and CT PICO method. The sequence index when computing PCCs is related to the patients and indices where at least one of the sequences had a missing value are deleted. The mean inter-reader discrepancy for each method was calculated, and the maximum error value was reported. Finally, the Mann–Whitney test was used to compare the time needed by the readers for bone loss quantification for each CT and MRA method. A p value < 0.05 was considered statistically significant.

Results

All inter-reader agreement PCC values are reported in Table 1. Inter-reader agreement PCC mean values were the following: 0.77 ± 0.08 (range 0.61–0.91) for CT best fit circle diameter; 0.72 ± 0.13 (0.35–0.91) for CT best fit circle area; 0.75 ± 0.10 (0.5–0.91) for CT PICO; 0.62 ± 0.1 (0.34–0.90) for CT anterior straight line; 0.43 ± 0.32 (0.15–0.86) for CT length-to-width ratio; 0.70 ± 0.1 (0.40–0.87) for MRA best fit circle diameter; 0.68 ± 0.13 (0.49–0.88) for MRA best fit circle area; 0.64 ± 0.11 (0.43–0.92), for MRA anterior straight line; 0.49 ± 0.22 (0.03–0.84) for MRA length-to-width ratio.

Table 1 Heatmap table shows the inter-reader agreement PCC values. The pair of readers are reported on the rows

All inter-modality agreement PCC values are reported in Table 2.

Table 2 Heatmap table shows the inter-modality agreement PCC values. The readers are reported on the rows

For inter-modality agreement, the PCC mean value was 0.9 ± 0.1 ranging from 0.68 to 1.00 when comparing CT and MRA best fit circle diameter, 0.9 ± 0.1 with 0.68–0.97 range when comparing CT and MRA best fit circle area, 0.62 ± 0.33 with 0.07–0.93 range when comparing CT and MRA anterior straight line, and 0.94 ± 0.06 with 0.80–0.99 range when comparing CT and MRA length-to-width methods.

When comparing unilateral (best fit circle area) and bilateral glenoid bone loss (PICO) CT methods, the PCC mean value was 0.8 ± 0.2, ranging from 0.51 to 1.00.

These results are summarized in Figs. 6 and 7.

Fig. 6
figure 6

Inter-reader agreement analysis. The results of inter-reader agreement analysis via normalized histograms of the fraction of reader’s pairs having a given level of agreement (measured in terms of PCC) when basing their assessments on a specific feature. The results are split in two subfigures for ease of reading, dividing the results related to CT A and MRA B. As an example, the second rightmost point of the green curve on the upper subfigure denotes that approximately 45% of the surgeon’s pair have high level of agreement (PCC close to 0.9) when basing their assessments on CT PICO. An apparent information extracted from the inter-reader agreement results is that the green curve is the one exhibiting its peaks toward the right side (closer to PCC = 1); then blue, red and yellow curves follow; and finally, the purple curves have their peaks toward the center (closer to PCC = 0). This means that reader shows higher level of agreement when basing their assessments on CT PICO (resp. length-to-width ratio), while the level of agreement is intermediate when based on the remaining imaging features. This behavior is confirmed for both CT-based and MRI-based features

Fig. 7
figure 7

Inter-modality agreement analysis. Results of the inter-modality agreement analysis via normalized histograms of the fraction of surgeons having a given level of agreement (measured in terms of PCC) when comparing their own assessments based on a pair of different-but-related bone loss quantification methods. As an example, the rightmost point of the purple curve on the figure denotes that approximately 35% of the readers have high level of agreement (PCC close to 0.9) when comparing their own assessments based on CT length-to-width ration and on MRI length-to-width ratio. An apparent information extracted from the intra-reader agreement results is that the green curve is the one exhibiting its peak toward the right side (closer to PCC = 1), while the purple curve has its peaks toward the center (closer to PCC = 0). This means that surgeons show higher (resp. lower) level of agreement when comparing their assessments on best fit circle diameter (resp. length-to-width ratio), while the level of agreement is intermediate when based on the remaining pairs of clinical features

The lowest level of inter-reader discrepancy was seen in CT PICO technique (4.16%), followed by CT and MRA best fit circle area methods (both 4.67%). Mean inter-reader discrepancy results are reported in Table 3.

Table 3 Mean inter-reader discrepancy (MID) with standard deviation (SD) and maximum discrepancy (D-MAX) values are reported

The average time taken by each reader for each method ranges from 31.33 s for MRA anterior straight line to 102.00 s for PICO technique (see Table 4). MRA best fit circle area method was significantly faster than the same method performed on CT (p = 0.031), while no significant difference was seen in CT/MRA best fit circle diameter (p = 0.666), CT/MRA anterior straight line (p = 0.340), and CT/MRA length-to-width methods (p = 0.730).

Table 4 Mean time to assess glenoid bone loss with different CT and MRA methods

Discussion

The main finding of this study is that CT PICO is the method with the highest inter-reader agreement and the lowest mean value of discrepancy in terms of glenoid bone loss percentage. Similarly, the calculation of the area and diameter with the best fit circle method in both MRA and CT also showed high inter-reader agreement. Furthermore, the greatest level of inter-modality agreement was reached by the best fit circle area CT and the PICO CT, followed by CT/MRA best fit circle area and best fit circle diameter.

Several studies have applied different methods and imaging modalities in the evaluation of glenoid bone loss in patients with anterior shoulder dislocation history [7, 9, 12, 13]. In fact, despite the established negative impact of large bone defects on the surgical outcome of glenohumeral soft-tissue stabilization interventions, there is no consensus on the gold standard technique for correctly quantifying glenoid bone loss pre-operatively [14]. It must be noted that the various methods are not interchangeable, given that threshold values used for critical glenoid bone loss cannot be compared. Consequently, high variability has been reported in the use of diagnostic methods in clinical practice with no consensus on threshold values for glenoid bone loss in surgical decision-making [15]. Hence, there has been an urgent request by the scientific community over recent decades for data from research studies regarding pre-operative imaging.

In this context, a reliable, reproducible, and easy-to-use imaging method for the assessment of glenoid bone loss is required for application in clinical practice. According to our data, CT PICO technique shows the greatest level of inter-reader agreement. In fact, CT is widely used to assess glenoid bone loss, using multiplanar reconstructions to build en face sagittal images of the glenoid with clear cortical contours. Methods besides PICO have been proposed to measure glenoid bone loss with interesting results, such as the best fit circle area or diameter, ratio of the maximum glenoid fossa width-to-height, and length of flattening of the anterior glenoid [9, 12]. All these measurements have the strong advantage of being measured on a single shoulder CT, without the use of bilateral CT acquisition, thereby reducing radiation exposure [7]. In our study in particular, the best fit circle area method showed optimal inter-reader agreement values and high inter-method agreement values with PICO. Our results suggest that the acquisition of data from both shoulders is not mandatory for glenoid bone loss measurement. We also found that all these measurements—except for the best fit circle area—can be obtained faster than with the PICO method. One strength of PICO and best fit circle area methods, however, is that these are independent from the site and morphology of glenoid bone loss which, in turn, affects the other CT measurements of glenoid bone loss [16]. Nevertheless, it is worth nothing that a tracing tool, which is not available on all PACS systems, is required to calculate bone loss with PICO method, whereas the CT measurements performed on a single shoulder can be easily carried out with routinely available linear measuring tools.

It is interesting to note that three-dimensional CT is strongly supported by several studies which report it as an accurate, reliable tool for surgical planning [17]. The disadvantage, however, is that it requires post-processing and presents some limitations. Using a 3D image to calculate the area might increase the risk of mistakenly including portions of bone located medial to the articular plane in the measurement. What is more, some studies have reported low correlation between 3D CT and arthroscopy data [18]. However, it should be noted that, although arthroscopy has been used extensively as a reference standard in previous imaging studies, arthroscopy tends to overestimate glenoid bone loss using the bare spot method whose inaccuracy has been demonstrated [19]. Arthroscopic measurements of bone loss also rely on the bare area as an anatomic landmark for the glenoid central point, despite the fact that it can be absent and its position is extremely variable [20, 21]

MRI is increasing under scrutiny as a radiation-free alternative to CT in the assessment of glenoid bone loss [22, 23]. The rationale lies in the fact that patients with glenohumeral dislocation are routinely subjected to MRA to assess labrum, capsule-ligamentous, chondral, and rotator cuff injuries [24], while CT is performed only to evaluate bone status. According to our results, MRA measurements are reliable in this setting, demonstrating high inter-reader agreement, particularly with the best fit circle area and diameter methods which also show high agreement with CT measurements. This is in line with previous studies that reported high inter-reader agreement values in evaluating glenoid bone loss on two-dimensional [22, 23] and three-dimensional MRI [25]. Moreover, MRI measurements have been shown to be accurate when compared with arthroscopy as reference standard [25, 26]. In this regard, it should be underlined that three-dimensional sequences are not always included in standard MRA protocols [27] and have the drawback of longer acquisition times. MRA measurements have the advantage of being slightly faster than those performed on CT, with a significant reduction of the time employed to calculate glenoid bone loss using MRA best fit circle area method, due to the immediate availability of sagittal images to perform all measurements and no need for multiplanar reconstructions. Furthermore, the recent introduction of Ultrashort echo time, Zero echo time and other CT-like bone contrast sequences promises further improvement in bone loss assessment [28,29,30].

This study has some limitations. First, only inter-reader and inter-methods agreement of these CT and MRA measurement were evaluated, while the clinical impact of these measurements on surgical decision-making, which was beyond the scope of the paper, was not tested. Neither was the use of three-dimensional CT images, but we believe that standardized surface measurements should be performed on multiplanar sagittal reconstructions of two-dimensional CT images. We did not evaluate operated patients. Finally, the time employed to perform all measurements may have been affected using different processing software to visualize the images.

In conclusion, both CT and MRA can be reliably used to assess glenoid bone loss in patients with a history of anterior shoulder dislocation. CT PICO is the most reliable imaging method, but other CT and MRA methods, particularly the best fit circle area, are valuable alternative measurement techniques that can be used to avoid the acquisition of data from both shoulders and limit pre-operative planning to the use of MRA only.