Abstract
Objectives
The recent European Society of Urogenital Radiology (ESUR) guidelines for evaluation and reporting of prostate multiparametric magnetic resonance imaging (mp-MRI) include the Prostate Imaging Reporting and Data System (PI-RADS). The aim of this study was to investigate the inter-reader agreement of this scoring system.
Methods
One hundred and sixty-four lesions in 67 consecutive patients with elevated prostate-specific antigen and previously negative trans-rectal ultrasound (TRUS)-guided biopsy were scored retrospectively by three blinded readers using PI-RADS. Mp-MRI was performed at 3 T using T2-weighted, diffusion-weighted and dynamic contrast-enhanced imagings (T2WI, DWI, DCE-MRI). Histology of all lesions was obtained by in-bore MRI-guided biopsy. Cohen’s kappa statistics were calculated for all readers.
Results
Inter-reader agreement for all lesions was good to moderate (T2WI, κ = 0.55; DWI, κ = 0.64; DCE-MRI, κ = 0.65). For tumour lesions it was good (T2WI, κ = 0.66; DWI, κ = 0.80; DCE-MRI, κ = 0.63) and for benign lesions moderate to good (T2WI, κ = 0.46; DWI, κ = 0.52; DCE-MRI, κ = 0.67). Using an overall PI-RADS score with a threshold of ≥10, we achieved a sensitivity of 85.7 %, and negative predictive value of 90.1 % for biopsied lesions.
Conclusion
PI-RADS score shows good to moderate inter-reader agreement and enables standardised evaluation of prostate mp-MRI, with high sensitivity and negative predictive value.
Key Points
• The European Society of Urogenital Radiology recently published guidelines for prostate MRI.
• We have evaluated inter-reader agreement of ESUR scoring for multiparametric prostate MRI.
• PI-RADS shows good to moderate inter-reader agreement and is clinically applicable.
• PI-RADS achieves in our series high sensitivity and negative predictive value for biopsied lesions.
• PI-RADS can be used as standardised scoring system in prostate cancer detection.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
Prostate cancer is the second most frequently diagnosed cancer worldwide (13.6 % of all diagnosed cancers) and the third most lethal cancer in men in the developed world [1]. However, the detection rate of prostate cancer has been found to be only around 25 % when detection is based on elevated prostate-specific antigen (PSA), suspicious PSA kinetics and digital rectal examination [2, 3]. Advances in multiparametric magnetic resonance imaging (mp-MRI) combining anatomical and functional data showed considerable advantages in the detection and characterisation of prostate cancer [4, 5]. Several studies have demonstrated that functional imaging techniques, such as diffusion-weighted imaging (DWI), dynamic contrast-enhanced MRI (DCE-MRI) and spectroscopic imaging (MRSI) clearly improve the accuracy of MRI for the detection and localisation of prostate cancer [6]. Various MRI protocols have been proposed [7–10].
Recently the European Society of Urogenital Radiology (ESUR) published prostate MRI guidelines in order to standardise the evaluation and reporting of prostate MRI [11]. One relevant part of these guidelines is a unified scoring system named PI-RADS (Prostate Imaging Reporting and Data System) comparable to the breast imaging reporting and data system (BI-RADS) [12, 13]. While this guideline is the first attempt to standardising prostate MRI, little to no evidence has been available on its accuracy and inter-reader agreement.
Thus, the purpose of this study was to assess the inter-reader agreement of the ESUR scoring system using histology obtained from MRI-guided biopsy as the reference standard.
Materials and methods
Patients
The study was approved by the local ethics committee. Between August 2011 and April 2012, 67 consecutive patients (mean age 66.8 ± 7.5 years, mean prostate volume 57 ± 26.1 ml, mean PSA value 10 ± 7.6 ng/ml) with increased prostate-specific antigen (PSA) levels (above 4 ng/ml) and at least one negative trans-rectal ultrasound (TRUS)-guided biopsy were included in this study. Each patient underwent mp-MRI for assessment of the prostate. In a second session MRI-guided biopsy of all described lesions—including suspicious and unsuspicious findings (n = 164 lesions in total)—was performed.
MRI protocol
Using 3-T MRI (Magnetom Trio; Siemens Healthcare, Forchheim, Germany) an mp-MRI of the prostate was performed with a six-channel phased-array body coil. To suppress bowel peristalsis all patients received 20 mg butylscopolamine (Buscopan; Boehringer, Ingelheim, Germany) intravenously and intramuscularly. An interval of at least 6 weeks was maintained between mp-MRI and the preceding TRUS biopsy. Mp-MRI of the prostate included T2-weighted imaging (T2WI), T1-weighted imaging (T1WI), DWI and DCE-MRI. T2-weighted turbo spin echo sequences were acquired in three standard orthogonal planes (axial, sagittal and coronal). Axial T1-weighted turbo spin echo images, single-shot spin echo echo-planar sequence using five b values (0, 250, 500, 750, 1,000 s/mm2) with five averages for DWI and volume-interpolated gradient echo sequence for the DCE-MRI were applied (Table 1). The imaging protocol was adapted according to the ESUR guidelines.
Scoring system
The ESUR guidelines recommend a standardised scoring system for evaluation and reporting of prostate MRI similar to the BI-RADS classification used by breast radiologists for X-ray mammography, breast ultrasound and breast MRI [12, 13]. The ESUR guidelines endorse a division of the prostate gland into 27 regions (minimum 16 regions). All lesions are rated on a score from 1 to 5 in each of the three MRI sequences (T2WI, DWI, DCE-MRI). For evaluating T2-weighted data sets, the location of the lesion either in the peripheral zone or the central zone has to be considered (Table 2) [11].
Scoring
Lesions (n = 164) were retrospectively evaluated by three blinded readers (D.B., M.Q. and L.S., with 4, 3 and 2 years of experience in reading prostate MRI, respectively) comprising the different MRI sequences (T2WI, DWI, DCE-MRI). Scoring was performed according to the ESUR guidelines (PI-RADS). Additionally, each lesion was given an overall score (3–15 points). All readers evaluated each lesion separately and were blinded with respect to the patients’ clinical data and the histology of the corresponding MRI-guided biopsy. All lesions were marked by a circle on the PACS workstation before starting the study evaluation (Fig. 1). Lesion documentation used a 27-region localisation scheme [4].
In-bore MRI-guided biopsy
The MRI-guided biopsies were performed on the same 3-T system (Magnetom Trio; Siemens Healthcare, Forchheim, Germany). Patients were placed in a prone position and a needle guide fixed to a portable biopsy device (DynaTRIM) was introduced rectally (Invivo, Gainesville, FL, USA). T2-weighted axial and sagittal images were acquired with body coils. Image data were transferred to a DynaCAD workstation (Invivo) for biopsy planning. Two cores were taken of each lesion with an MRI-compatible, 18-gauge, fully automatic biopsy gun (Invivo).
Statistics
The data were tested for normal distribution using the Kolmogorov–Smirnov test. Normally distributed parameters were compared using the independent sample t-test, non-parametric data were tested using the Mann–Whitney U test. All data are expressed as mean ± SD. Statistical analysis was performed using IBM SPSS Statistics 19 for Windows (SPSS, Chicago, IL, USA). Statistical significance was defined at a P value below 0.05. The inter-reader agreement was calculated using Cohen’s kappa statistics. The inter-reader agreement was defined excellent (κ > 0.81), good (κ = 0.61–0.80), moderate (κ = 0.41–0.60), fair (κ = 0.21–0.40) and poor (κ ≤ 0.20) [14]. Sensitivity, specificity, positive and negative predictive values were calculated for the recommended cut-off score of ≥10 and, additionally, for a cut-off ≥9 using MRI-guided biopsy as the reference standard.
Results
Patients
In 56 lesions in 28 patients (42 %) MRI-guided biopsy confirmed prostate cancer. Seventeen lesions had a Gleason score of 6, 35 lesions a Gleason score of 7, 1 lesion a Gleason score of 8 and 3 lesions a Gleason score of 9. The mean age of all patients with verified prostate cancer was 69.6 ± 8.4 years compared with 65.1 ± 6.5 years in patients without cancer (P < 0.05). The mean prostate volume in patients with prostate cancer was 42.1 ± 11.5 ml and 67.6 ± 27.2 ml in patients without histologically verified prostate cancer (P < 0.01). PSA values were 11.2 ± 10.3 ng/ml in patients with and 8.7 ± 4.8 ng/ml in patients without verified prostate cancer (P = 0.183).
PI-RADS
The mean PI-RADS score of all lesions (n = 168) for all readers (n = 3) was 3.5 ± 1 for T2WI, 3.9 ± 0.9 for DWI and 2.7 ± 1.3 for DCE-MRI. Tumour lesions had a mean score of 4.2 ± 0.8 (T2WI), 4.5 ± 0.7 (DWI) and 3.5 ± 1.4 (DCE-MRI). Benign lesions had a mean score of 3.0 ± 0.8 (T2WI), 3.5 ± 0.7 (DWI), and 2.4 ± 1.1 (DCE-MRI). The mean overall PI-RADS score of tumour lesions and benign lesions was 12.3 ± 2.1 and 9.0 ± 1.6, respectively (Table 3). Data analysis considering the reference standard resulted in a sensitivity of mp-MRI for the detection of prostate cancer of 85.7 %, a specificity of 67.6 %, a positive predictive value of 57.8 % and a negative predictive value of 90.1 % when applying the recommended cut-off value of 10 points. A cut-off value of 9 points resulted in a sensitivity of 92.9 %, a specificity of 41.7 %, a positive predictive value of 45.2 % and a negative predictive value of 91.8 % (Table 4).
Inter-reader agreement
Inter-reader agreement of all three readers was κ = 0.55 for T2WI, κ = 0.64 for DWI and κ = 0.65 for DCE-MRI. For malignant lesions kappa values were κ = 0.66 for T2WI, κ = 0.80 for DWI and κ = 0.63 for DCE-MRI. For benign lesions κ was 0.46 for T2WI, κ = 0.52 for DWI and κ = 0.67 for DCE-MRI using the PI-RADS score (Table 5).
Discussion
Based on unsatisfactory detection rates of clinically relevant prostate cancer by currently recommended diagnostic tools such as digital rectal examination, PSA and TRUS biopsy, using mp-MRI prostate cancer diagnostics can be significantly improved especially in patients with prior negative TRUS-guided biopsy [15–21]. The recently published ESUR recommendation on mp-MRI of the prostate standardises all aspects of mp-MRI, including implementation, evaluation and documentation. This ESUR guideline includes a scoring system (PI-RADS) to evaluate prostate lesions on high-resolution T2-weighted images and at least two functional MR sequences [11]. Our study investigated the inter-reader agreement of the PI-RADS score. The results show that the ESUR score used by different radiologists leads to good to moderate inter-reader agreement and to a detection rate of 42 % in our patient population with elevated PSA and previously negative TRUS-guided biopsy.
Our mp-MRI protocol does not include spectroscopy (MRSI), which the ESUR guideline defines as “optional”. MRSI has been reported to be a valid additional tool to detect prostate cancer but extends the examination time. In addition, spectroscopy has not been reliably implemented at 3 T and the use of an endorectal coil at 1.5 T reduces patient comfort [22, 23].
Studies published before the release of the ESUR score demonstrated high sensitivities and negative predictive values for the detection of prostate cancer by using different scoring systems for lesion characterisation according to high-resolution T2WI, DWI and DCE-MRI [24–26]. The PI-RADS score qualitatively evaluates lesions in the T2-weighted images according to the signal intensity separated by the peripheral zone (PZ) and the transition zone (TZ) with low signal appearance as a characteristic of malignancy. Well-defined lesions are assessed with a score of 2, whereas a score of 3 represents an intermediate, thus heterogeneous, appearance [11]. However, the inter-reader agreement for the T2-weighted images was only moderate. The main reason might be that characterisation of an area based on T2-weighted images alone is variable and subjective [27]. The DWI demonstrated better agreement between different readers. This is most likely due to evaluation of more than one parameter, namely a reduced apparent diffusion coefficient (ADC) in addition to a hyperintense signal on high b values (score 4 or 5). For DCE-MRI the scores are clearly defined by enhancement curves and, therefore, inter-reader agreement was better than for T2-weighted images [9, 28].
Focusing on histology, the inter-reader agreement was higher for malignant than for benign lesions. This clarifies the difficulty in determining and estimating benign lesions because of their various appearances. Recent studies show that tumours in the TZ are significantly less detected than tumours of the PZ [29]. They clearly suggest a different weight of the three MRI sequences, whereas DCE-MRI for the TZ obviously plays a minor part in a scoring system [30]. Nonetheless, the overall inter-reader agreement was good to moderate. An image atlas similar to the BI-RADS classification and also raising experience with the PI-RADS scoring system could further improve the inter-reader agreement.
Considering the threshold to be applied the ESUR guideline does not, as yet, provide any fixed threshold. A different study published a cut-off value of 9, whereas our data show better specificity and positive predictive value with only slightly lower sensitivity and negative predictive value when applying a cut-off value of 10. However, the accuracy data refer to in-bore MRI-guided biopsied data and therefore false-negative results may be present. Also, missing MRI biopsy correlation might be a limitation. These could only be excluded by histology from radical prostatectomy or a long follow-up. This lack of reference standard must be considered a limitation of this study. Nevertheless, the primary aim was to assess the inter-reader agreement of PI-RADS, because it is indispensable to use a uniform standardised score for the evaluation of mp-MRI of the prostate.
In conclusion, the PI-RADS score of the ESUR guideline shows good to moderate inter-reader agreement. The inter-reader agreement may be increased by a PI-RADS atlas with sample images, similar to the BI-RADS publications and growing experience with the PI-RADS score. Further studies have to prove whether a weighting of the MRI sequences should be implicated in the scoring system. With a standardised scoring system the evaluation of mp prostate MRIs results in a high sensitivity and negative predictive value using a cut-off value of 10.
Abbreviations
- ESUR:
-
European Society of Urogenital Radiology
- PI-RADS:
-
Prostate Imaging Reporting and Data System
- mp-MRI:
-
Multiparametric magnetic resonance imaging
- DCE:
-
Dynamic contrast-enhanced imaging
- MRSI:
-
Spectroscopic imaging
- DWI:
-
Diffusion-weighted imaging
- PSA:
-
Prostate-specific antigen
- TRUS:
-
Trans-rectal ultrasound
References
Ferlay J, Shin HR, Bray F, Forman D, Mathers C, Parkin DM (2010) GLOBOCAN 2008 v2.0, cancer incidence and mortality worldwide: IARC CancerBase No. 10. International Agency for Research on Cancer, Lyon, France. Available via http://globocan.iarc.fr. Accessed 15. January 2013
Mistry K, Cable G (2003) Meta-analysis of prostate-specific antigen and digital rectal examination as screening tests for prostate carcinoma. J Am Board Fam Pract 16:95–101
Heidenreich A, Aus G, Bolla M et al (2008) EAU guidelines on prostate cancer. Eur Urol 53:68–80
Dickinson L, Ahmed HU, Allen C et al (2011) Magnetic resonance imaging for the detection, localization and characterisation of prostate cancer: recommendations from a European consensus meeting. Eur Urol 59:477–494
Umbehr M, Bachmann LM, Held U et al (2009) Combined magnetic resonance imaging and magnetic resonance spectroscopy imaging in the diagnosis of prostate cancer: a systematic review and meta-analysis. Eur Urol 55:575–590
Mazaheri Y, Shukla-Dave A, Muellner A et al (2008) MR imaging of the prostate in clinical practice. MAGMA 21:379–392
Hoeks CM, Barentsz JO, Hambrock T et al (2011) Prostate cancer: multi-parametric MR imaging for detection, localization, and staging. Radiology 261:46–66
Heidenreich A, Bellmunt J, Bolla M et al (2011) EAU guidelines on prostate cancer. Part 1: screening, diagnosis, and treatment of clinically localised disease. Eur Urol 59:61–71
Portalez D, Rollin G, Leandri P et al (2010) Prospective comparison of T2w- MRI and dynamic-contrast-enhanced MRI, 3D-MR spectroscopic imaging or diffusion-weighted MRI in repeat TRUS-guided biopsies. Eur Radiol 20:2781–2790
Hricak H, Choyke PL, Eberhardt SC et al (2007) Imaging prostate cancer: a multidisciplinary perspective. Radiology 243:28–53
Barentsz JO, Richenberg J, Clements R et al (2012) ESUR prostate MR guidelines 2012. Eur Radiol 22:746–757
American College of Radiology (2003) Breast Imaging Reporting and Data System (BI-RADS), Breast Imaging Atlas, 4th edn. American College of Radiology (ACR), Reston
Molleran V, Mahoney MC (2010) The BI-RADS breast magnetic resonance imaging lexicon. Magn Reson Imaging Clin N Am 18:171–185
Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33:159–174
“Interdisziplinären Leitlinie der Qualität S3 zur Früherkennung, Diagnose und Therapie der verschiedenen Stadien des Prostatakarzinoms”. Available via http://www.awmf.org/leitlinien/detail/ll/043-022OL.html. Accessed 01. April 2013
Hara N, Okuizumi M, Koike H, Kawaguchi M, Bilim V (2005) Dynamic contrast-enhanced magnetic resonance imaging (DCEMRI) is a useful modality for the precise detection and staging of early prostate cancer. Prostate 62:140–147
Delongchamps NB, Rouanne M, Flam T et al (2011) Multiparametric magnetic resonance imaging for the detection and localization of prostate cancer: combination of T2-weighted, dynamic contrast-enhanced and diffusion-weighted imaging. BJU Int 7:1411–1418
Anastasiadis AG, Lichy MP, Nagele U et al (2006) MRI-guided biopsy of the prostate increases diagnostic performance in men with elevated or increasing PSA levels after previous negative TRUS biopsies. Eur Urol 50:738–748
Roethke M, Anastasiadis AG, Lichy M et al (2012) MRI-guided prostate biopsy detects clinically significant cancer: analysis of a cohort of 100 patients after previous negative TRUS biopsy. World J Urol 30:213–218
Franiel T, Stephan C, Erbersdobler A, Dietz E, Maxeiner A et al (2011) Areas suspicious for prostate cancer: MR-guided biopsy in patients with at least one transrectal US-guided biopsy with a negative finding—multiparametric MR imaging for detection and biopsy planning. Radiology 259:162–172
Hambrock T, Somford DM, Hoeks C et al (2010) Magnetic resonance imaging guided prostate biopsy in men with repeat negative biopsies and increased prostate specific antigen. J Urol 183:520–527
Kobus T, Vos PC, Hambrock T, De Rooij M, Hulsbergen-van de Kaa CA, Barentsz JO, Heerschap A, Scheenen TWJ (2012) Prostate cancer aggressiveness: in vivo assessment of MR spectroscopy and diffusion-weighted imaging at 3 T. Radiology 265:457–467
Futterer JJ, Scheenen TW, Heijmink SW et al (2007) Standardized threshold approach using three-dimensional proton magnetic resonance spectroscopic imaging in prostate cancer localization of the entire prostate. Invest Radiol 42:116–122
Arsov C, Quentin M, Rabenalt R, Antoch G, Albers P, Blondin D (2012) Repeat transrectal ultrasound biopsies with additional targeted cores according to results of functional prostate MRI detects high-risk prostate cancer in patients with previous negative biopsy and increased PSA—a pilot study. Anticancer Res 32:1087–1092
Arsov C, Blondin D, Rabenalt R, Antoch G, Albers P, Quentin M (2012) Standardised scoring of a multi-parametric 3-T MRI for a targeted MRI-guided prostate biopsy. Urologe 51:848–856
Quentin M, Arsov C, Röhlen S et al (2012) Inter-reader agreement of multi-parametric MR imaging for the detection of prostate cancer: evaluation of a scoring system. Röfo 184:925–929
Kirkham AP, Emberton M, Allen C (2006) How good is MRI at detecting and characterising cancer within the prostate? Eur Urol 50:1163–1174
Choi YJ, Kim JK, Kim N et al (2007) Functional MR imaging of prostate cancer. Radiographics 27:63–75
Bratan F, Niaf E, Melodelima C, Chesnais AL, Souchon R, Mège-Lechevallier F, Colombel M, Rouvière O (2013) Influence of imaging and histological factors on prostate cancer detection and localisation on multiparametric MRI: a prospective study. Eur Radiol. doi:10.1007/s00330-013-2795-0
Chesnais AL, Niaf E, Bratan F, Mège-Lechevallier F, Roche S, Rabilloud M, Colombel M, Rouvière O (2013) Differentiation of transitional zone prostate cancer from benign hyperplasia nodules: evaluation of discriminant criteria at multiparametric MRI. Clin Radiol 68:e323–e330
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Schimmöller, L., Quentin, M., Arsov, C. et al. Inter-reader agreement of the ESUR score for prostate MRI using in-bore MRI-guided biopsies as the reference standard. Eur Radiol 23, 3185–3190 (2013). https://doi.org/10.1007/s00330-013-2922-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00330-013-2922-y