Introduction

With the increasing use and improvement of cross-sectional abdominal imaging, the incidental detection of pancreatic cystic lesions, including intraductal papillary mucinous neoplasms (IPMNs), has increased [1, 2]. The reported prevalence of incidental pancreatic cysts on multidetector computed tomography (CT) or magnetic resonance imaging (MRI) with MR cholangiopancreatography (MRCP) ranges from 2.2 to 19.6% [2]. Since a small percentage of pancreatic cystic lesions progress to invasive cancers and the management strategy can differ based on the malignancy risk, it is important to accurately predict the malignant potential of pancreatic cysts on imaging.

The international consensus guidelines were established in 2006 and recently revised in 2017, suggesting high-risk stigmata and worrisome features on imaging to determine the malignant potential of IPMNs [3]. Moreover, this guideline recommends that only high-risk IPMNs should be resected, while low-risk IPMNs should be managed by surveillance to avoid overtreatment [3]. According to the guideline, endoscopic ultrasound (EUS), CT, and MR imaging with MRCP are recommended as diagnostic and surveillance imaging modalities for pancreatic cysts [3]. Although the usefulness of this guideline has been validated by previous reports [4, 5], the diagnostic performance for predicting the malignant potential of IPMNs by using CT or MR imaging findings before surgical resection is still limited, and the agreement between the preoperative imaging diagnosis and pathologic examination is inadequate in a substantial proportion (30–70%) of cases [6,7,8,9].

Recently, texture analysis has received much attention as a quantitative imaging biomarker in oncologic imaging by quantifying tissue heterogeneity and providing important information regarding tumor characterization and prognosis [10, 11]. In a previous study, CT texture analysis of pancreatic IPMNs was shown to be helpful in differentiating malignant IPMNs from benign IPMNs and selecting patients for surgical or nonsurgical management [12, 13]. To the best of our knowledge, however, there have been few studies regarding the usefulness of texture analysis of MR imaging for predicting the malignant potential of pancreatic IPMNs. Thus, the purpose of this study was to investigate the utility of MR findings and texture analysis for predicting the malignant potential of pancreatic IPMNs.

Materials and methods

Study population

This retrospective study was approved by our institutional review board, and the requirement of informed consent was waived. Between January 2006 and April 2018, 339 patients with pathologically proven IPMN at surgery were identified through a review of our pathologic database. Among these patients, we included 290 consecutive patients who fulfilled the following inclusion criteria: (1) patients who underwent preoperative pancreas protocol MR examinations including MRCP within 2 months before surgery and (2) patients without any history of local or systemic therapy for pancreatic lesions. We then excluded 41 patients for the following reasons: suboptimal MR image quality with severe motion artifacts or metallic artifacts (n = 12), segmentation failure of pancreatic lesions on MRCP source images due to severe motion artifacts (n = 27), and segmentation failure due to inability to identify IPMN lesions on MRCP (n = 2) (Fig. 1). Therefore, a total of 248 patients (mean age, 69.9 years ± 10.6 [standard deviation (SD)], 155 men and 93 women) were included in this study. The mean interval between MRI examination and surgery was 15.8 days (range, 1–59 days).

Fig. 1
figure 1

Flow diagram of enrolled patients. IPMN, intraductal papillary mucinous neoplasm; MRCP, MR cholangiopancreatography

MR imaging with MR cholangiopancreatography

Pancreatobiliary MR images were obtained using either a 1.5-T (n = 58) or 3.0-T (n = 190) MR scanner: Signa HDxt 1.5 T (GE Healthcare, n = 29); Discovery MR 750w 3.0 T (GE Healthcare, n = 3); Magnetom Verio, Magnetom Trio, Biograph mMR, or Skyra 3.0 T (Siemens Healthineers, n = 157); and Achieva or Ingenia 3.0 T (Philips Healthcare, n = 27) at our institution and various 1.5-T (n = 17) or 3.0-T (n = 15) scanners at outside hospitals. Routine pancreatobiliary MRI included the following sequences: a breath-hold transverse T2-weighted sequence with a single-shot fast spin echo (SSFSE) or a half-Fourier acquisition single-shot turbo spin-echo sequence, a T1-weighted in-phase and opposed-phase spoiled 3D gradient echo sequence, and diffusion-weighted images with b values of 0 s/mm2, 400 s/mm2, and 800 s/mm2. A dynamic study was performed after the administration of 1.0 mol/L gadobutrol (7.5 mL Gadovist; Bayer Healthcare) at a dose of 0.1 mmol/kg with an injection rate of 2 mL/s. Precontrast imaging was obtained before contrast material injection, and the arterial phase was obtained 8 s after contrast material arrival at the distal thoracic aorta. The portal phase was obtained 60 s after contrast media injection, and the delayed phase was obtained after 2 min, 3 min, and 5 min. MR cholangiography was performed with a coronal respiratory–triggered multisection 3D half-Fourier single-shot turbo spin-echo or single-shot fast spin-echo T2-weighted sequence. After that, 3D MRCP images were reconstructed using a maximum-intensity projection algorithm to produce oblique images rotated about the z-axis in 10° increments. All parameters for the MR sequences of the two main scanners are summarized in Supplementary Table 1.

Image analysis

Two abdominal radiologists (S.K.J. and J.Y., with 6 years of experience with abdominal MR imaging each) independently reviewed all MRI examinations using a picture archiving and communication system. Both readers were blinded to the detailed histologic diagnoses but were aware that the study population had pathologically proven IPMN at surgery. Each reviewer recorded whether each IPMN demonstrated any of the high-risk stigmata or worrisome features on MR imaging by referring to the revised 2017 international consensus guidelines (Table 1) [14]. The caliber of the MPD was measured at the maximal point of pancreatic duct dilation. The size of the branch-duct IPMN was measured as the longest diameter in any axial or coronal MRCP image. Enhancing mural nodules were defined as any papillary protruding lesions showing contrast enhancement within the cyst or dilated duct. A thickened and enhancing cyst wall was defined as a cyst wall greater than 2 mm in width with enhancement. All measurements were performed on either axial or coronal images. Lymphadenopathy was defined as enlarged peripancreatic lymph nodes with a maximum short diameter greater than 10 mm. Because this study only evaluated the findings from a single MR examination, the cyst growth rate was not evaluated. In patients with multiple identifiable branch-duct IPMNs, the most suspicious lesion was selected as the target lesion. The most suspicious lesion was defined as the lesion that demonstrated any high-risk stigmata or worrisome features, and if the lesion showed neither high-risk stigmata nor worrisome features, the largest lesion was selected.

Table 1 Revised 2017 international consensus Fukuoka guidelines for the management of suspected intraductal papillary mucinous neoplasm of the pancreas

Computerized texture analysis

Lesion segmentation was performed on the source images of MRCP by one radiologist (J.E.K., with 6 years of experience in abdominal imaging) using commercially available software (MEDIP, MEDICAL IP). The software allows semiautomatic identification of lesions with the assistance of the “Hounsfield unit (HU) or signal intensity (SI) threshold” function, which automatically draws all areas within the range of the specific SI defined by the radiologist. After loading the MRCP images into the software, the radiologist drew ROIs using the draw-cut method in a few, selected slice images as follows: (1) within the cyst and (2) outside of the cyst. When drawing (2), the radiologist was assisted by the draw-cut method based on the “SI threshold” function and which automatically drew all areas within the range of the specific SI defined by the radiologist. The software then automatically segmented the cyst. After that, the radiologist modified the extent and margin of the lesion and confirmed its segmentation (Fig. 2). To minimize measurement errors, we used the mean value of the texture parameters obtained from three respective segmentations of the same representative lesion. After segmenting the lesions, the software automatically calculated the texture features, including histogram-based 1st-order statistics, shape- and size-based geometric features, gray-level co-occurrence matrix (GLCM) texture features, and wavelet features [15].

Fig. 2
figure 2

MR texture analysis. a Semiautomatic lesion segmentation was performed on MR cholangiography (MRCP) source images. After that, the lesion was confirmed on the MRCP source image and maximum intensity projection (MIP) reconstruction image (a, b). c Subsequently, texture analysis was performed, and the texture features of the lesions were automatically extracted and calculated

Pathologic data collection

Information on the type of IPMN, World Health Organization (WHO) grade, and tumor location was collected from the pathologic reports. The type of IPMN was categorized as branch duct, main duct, or mixed type. WHO grade was classified as low-grade dysplasia, intermediate-grade dysplasia, high-grade dysplasia, or associated invasive carcinoma [16]. In this study, IPMNs with low- and intermediate-grade dysplasia were classified as benign IPMNs, whereas IPMNs with high-grade dysplasia and associated invasive carcinoma were regarded as malignant IPMNs [5, 17].

Statistical analysis

The prevalence of clinical or pathologic characteristics and MR findings between malignant IPMNs and benign IPMNs were compared by using the chi-square test or Fisher’s exact test for categorical variables and Student’s t test for continuous variables. Thereafter, logistic regression analyses with the forward projection method were performed to identify significant predictors for malignant IPMN. First, univariable analysis was performed for each image finding and texture parameter, and only variables with p values < 0.05 were selected as input variables for multivariable analysis. Receiver operating characteristic analysis was used for regression equations using image findings only as well as for regression equations using image findings and texture variables to evaluate their diagnostic accuracy for predicting malignant IPMN. To evaluate the interreader reliability of MR findings, the Cohen κ values were calculated and interpreted as follows: 0.81–1.00, excellent agreement; 0.61–0.80, good agreement; 0.21–0.40, fair agreement; and 0.00–0.20, poor agreement. To evaluate the intraexaminer reliability of texture analysis, the intraclass correlation coefficients (ICCs) of three measurement values of each texture variable were calculated and interpreted as follows: ≥ 0.90, excellent; ≥ 0.75 to < 0.90, good; ≥ 0.50 to < 0.75, moderate; and < 0.50, poor reliability [18]. All statistical analyses were performed by using SPSS software (version 25; IBM), and a p value < 0.05 was considered to be statistically significant.

Results

A total of 142 (57.3%) patients had benign IPMN (68 with low-grade dysplasia and 74 with intermediate-grade dysplasia), and 106 (42.7%) had malignant IPMN (39 with high-grade dysplasia and 67 with invasive carcinoma). There were significant differences in age, location of lesion, type of surgery, and type of IPMN between benign IPMN and malignant IPMN (all ps < 0.05) (Table 2).

Table 2 Clinico-pathologic characteristics of the study population

Comparison of MR imaging features and texture variables between benign and malignant IPMNs

The MR imaging findings between benign and malignant IPMNs are summarized in Table 3 and Supplementary Table 2. Malignant IPMNs showed significantly higher frequencies of enhancing mural nodules ≥ 5 mm, MPD sizes ≥ 10 mm or MPD sizes of 5 to 9 mm, and abrupt MPD changes with distal parenchymal atrophy from both reviewers (1 and 2) (all ps < 0.05) (Figs. 3 and 4). In addition, each MRI finding showed good or excellent interobserver agreement (κ = 0.620–0.829), except for thickened or enhancing cyst wall, which showed poor agreement (κ = 0.176). Table 4, Supplementary Table 3, and Supplementary Table 4 show the comparison of the texture parameters between benign and malignant IPMNs. Several texture variables, including entropy, volume, effective diameter, surface area, sphericity, compactness 2, and GLCM entropy, showed a significant difference between benign and malignant IPMNs (ps < 0.05).

Table 3 Comparison of MR imaging findings of benign and malignant IPMNs
Fig. 3
figure 3

Images from a 75-year-old woman who had intraductal papillary mucinous neoplasm (IPMN) with low-grade dysplasia. Maximal intensity projection (MIP) reconstruction image of MR cholangiography (MRCP) shows a 1.8-cm pleomorphic cyst in the pancreas body without significant pancreatic ductal dilatation. T1-weighted MRI obtained in the portal phase shows no definite enhancing mural nodules or thickened septum. MR texture analysis shows relatively low entropy and high compactness (arrow)

Fig. 4
figure 4

Images from a 60-year-old man who had intraductal papillary mucinous neoplasm (IPMN) with invasive carcinoma. MIP reconstruction image of MRCP shows a dilated main pancreatic duct up to 20 mm. T1-weighted MRI obtained in the portal phase shows an enhancing mural nodule measuring up to 13 mm (arrow). MR texture analysis shows relatively high entropy and low compactness

Table 4 Comparison of the texture variable of benign and malignant IPMNs

Important MR findings and texture variables for predicting malignant IPMN

In multivariable logistic regression analysis, an enhancing mural nodule size ≥ 5 mm (odds ratios (ORs), 6.697 and 6.968; 95% confidence intervals (CIs), 3.431–13.14 and 3.601–13.480; ps < 0.001 for reviewers 1 and 2, respectively), an MPD size ≥ 10 mm (ORs, 4.098 and 4.215; 95% CIs, 1.663–10.097 and 13.619 and 10.973; p = 0.002 and 0.003 for reviewers 1 and 2, respectively), an MPD size of 5 to 9 mm (ORs, 2.517 and 3.055; 95% CIs, 1.209–5.241 and 1.488–6.275; p = 0.014 and 0.002 for reviewers 1 and 2, respectively), greater entropy (ORs, 1.485 and 1.515; 95% CIs, 1.025–2.153 and 1.057–2.171; p = 0.037 and 0.024 for reviewers 1 and 2, respectively), and smaller compactness 2 (ORs, 0.981 and 0.977; 95% CIs, 0.964–0.999 and 0.960–0.995; p = 0.036 and 0.011 for reviewers 1 and 2, respectively) were found to be independent predictors for malignant IPMNs (Fig. 5). Table 5 and Supplementary Table 5 show the multivariate logistic regression analysis of MR findings and texture analysis variables to predict malignant IPMN.

Fig. 5
figure 5

Images from a 64-year-old man who had intraductal papillary mucinous neoplasm (IPMN) with invasive carcinoma. MIP reconstruction image of MRCP shows a 3-cm pleomorphic cyst in the pancreas head (arrow) without significant pancreatic ductal dilatation (less than 5 mm). T1-weighted MRI obtained in the portal phase shows no definite enhancing mural nodules or thickened septum within the cyst (arrow). MR texture analysis shows relatively high entropy and low compactness (arrow)

Table 5 Multivariate logistic regression analysis of MR findings and texture analysis variables to predict malignant IPMN

Using MRI features including the presence of an enhancing mural nodule ≥ 5 mm and an MPD size ≥ 10 mm and an MPD size of 5 to 9 mm, the diagnostic performance for predicting malignant IPMN was 0.80 (95% CI, 0.75–0.85) and 0.79 (95% CI, 0.73–0.84) for reviewers 1 and 2, respectively. When we applied both MR imaging findings and texture variables, including entropy and compactness, the diagnostic performance for predicting malignant IPMN was improved to 0.85 (95% CI, 0.80–0.89) and 0.85 (95% CI, 0.80–0.89) for reviewers 1 and 2 (p = 0.021 and 0.002), respectively.

Discussion

Our study demonstrated that an enhancing mural nodule size ≥ 5 mm, an MPD size ≥ 10 mm or an MPD size of 5–9 mm on MR imaging, greater entropy, and smaller compactness on MR texture analysis were significant predictors for malignant IPMNs in multivariable analysis (p < 0.05). When MR texture variables were added to the MRI findings, the diagnostic performance for diagnosing malignant IPMN improved from AUCs of 0.80 and 0.79 to 0.85 and 0.85 for reviewers 1 and 2, respectively.

Although the imaging features, including the high-risk stigmata and worrisome features suggested by international consensus guidelines, were validated as predictors for malignant IPMNs in many previous studies, the reported diagnostic performance for predicting malignant potential is inadequate (AUCs, 0.637–0.787) [6, 13]. In our study, the diagnostic performance for the prediction of malignant IPMNs was improved up to 0.85 from 0.79 when we applied both MR findings and texture variables. Quantitative image analysis of pancreatic IPMN may have better diagnostic value in predicting malignancy risk than using accepted qualitative radiologic assessment alone, which is also consistent with the results of previous studies using CT texture analysis [12, 13].

Our MR texture analysis results revealed that greater entropy (OR, 1.485–1.515) and smaller compactness (OR, 0.977–0.981) were significant predictors for malignant IPMNs. Entropy is the first-order statistic from the gray-level histogram, reflecting the uniformity of the distribution of the gray level and specifying the randomness in the image values. A greater entropy represents increased tumor heterogeneity, and previous studies have reported that increased tumor entropy suggests tumor aggressiveness and poorer prognosis [10, 12]. Moreover, increased entropy on CT texture analysis [19] and apparent diffusion coefficient histogram metrics [20] were also helpful for predicting malignant IPMNs in previous studies, which is consistent with our study results. Meanwhile, compactness, which is similar to sphericity, is a measure of how compact the shape of the tumor is relative to a sphere, which is considered the most compact. In our study, smaller compactness had a predictive value for malignant IPMNs. Our study results could be explained by the fact that the risk of malignancy in main duct–type IPMNs is higher than that of branch duct–type IPMNs [21], as the value of sphericity and compactness is larger in branch duct–type IPMNs than in main duct–type IPMNs. Therefore, more validation studies will be required for clinical application. In addition, a previous study reported that smaller sphericity or compactness was an important predictor of high-grade (grade 2 or 3) pancreatic neuroendocrine tumors showing tumor aggressiveness and heterogeneity [22], which is also similar to our result.

Similar to previous reports [6, 23], in our study, an enhancing mural nodule size ≥ 5 mm and an MPD size ≥ 10 mm, which are high-risk stigmata suggested by the 2017 international consensus guidelines, were independent factors for predicting malignant IPMNs in multivariable analysis. In previous studies, the presence of mural nodules has been one of the strongest predictors of malignant IPMN, and the risk of malignancy increases as the nodule size increases [24]. Our study results also demonstrated that an enhancing mural nodule size ≥ 5 mm was the strongest predictor for malignant IPMN, followed by an MPD size ≥ 10 mm.

Several limitations of our study need to be mentioned. First, as a retrospective study design, there may be potential selection bias. In addition, as our study population consisted of surgically confirmed IPMNs, it can be a source of selection bias. Because of this specific study population, we think that our study could present the performance of MR findings and texture analysis for predicting malignant IPMNs in patients who consider surgical resection rather than the performance of those findings for the entire population with IPMNs undergoing MRI surveillance. Although our study is meaningful in the prediction of the malignant potential of IPMNs with the pathologic result, which is a concrete reference standard, a large cohort study including all patients with IPMNs undergoing MRI surveillance would be required to validate our study result and reflect clinical practice. Second, the MR scanners used in our study were not uniform across all patients, and interscanner differences may affect the texture analysis results. So, we performed subgroup analysis using a single scanner, and the result of subgroup analysis (n = 126) was consistent with the result of total study population (n = 248) (Supplementary A, Supplementary Tables 4 and 5). However, further studies with a larger study population and uniform MR scanners and examination protocols are warranted to validate the findings of our study. Third, although tumor segmentation was performed semiautomatically, there may be a potential for measurement error. To minimize the measurement error, repeated measurement was performed in our study, and at intraexaminer reproducibility analysis, texture features showed moderate to high reproducibility (Supplementary Table 3). However, as interexaminer reproducibility was not performed in our study, further study showing interexaminer reproducibility of texture analysis is warranted. Finally, the texture analysis was only performed by using MRCP images. Due to the high lesion-to-background contrast and thin section thickness of MRCP source images, the volumetric lesion segmentation by using MRCP could be easy and stable. However, other MR images such as DWI or post-contrast images could potentially provide more information regarding malignant potential of IPMNs. Further studies of texture analysis of multiparametric MR images could be helpful. In conclusion, MRCP-derived texture features are useful for predicting the malignant potential of IPMNs, and the addition of texture analysis to MR imaging features may improve the diagnostic performance for predicting malignant IPMNs.