Robustness and performance of radiomic features in diagnosing cystic renal masses

Könik, Arda; Miskin, Nityanand; Guo, Yang; Shinagare, Atul B.; Qin, Lei

doi:10.1007/s00261-021-03241-2

Robustness and performance of radiomic features in diagnosing cystic renal masses

Kidneys, Ureters, Bladder, Retroperitoneum
Published: 11 August 2021

Volume 46, pages 5260–5267, (2021)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Abdominal Radiology Aims and scope Submit manuscript

Robustness and performance of radiomic features in diagnosing cystic renal masses

Download PDF

Arda Könik ORCID: orcid.org/0000-0002-9397-8600¹,
Nityanand Miskin²,
Yang Guo³,
Atul B. Shinagare⁴ &
…
Lei Qin¹

463 Accesses
7 Citations
2 Altmetric
Explore all metrics

Abstract

Purpose

We study the inter-reader variability in manual delineation of cystic renal masses (CRMs) presented in computerized tomography (CT) images and its effect on the classification performance of a machine learning algorithm in distinguishing benign from potentially malignant CRMs. In addition, we assessed whether the inclusion of higher-order robust radiomic features improves the classification performance over the use of first-order features.

Methods

230 CRMs were independently delineated by two radiologists. Through a combination of random fluctuations, dilation, and erosion operations over the original region of interests (ROIs), we generated four additional sets of synthetic ROIs to capture the inter-reader variability realistically, as confirmed by dice coefficient measurements and visual assessment. We then identified the robust features based on the intra-class coefficient (ICC > 0.85) across these datasets. We applied a tenfold stratified cross-validation (CV) to train and test the performance of the random forest model for the classification of CRMs into benign and potentially malignant.

Results

The mean area under the curve (AUC), sensitivity, specificity, positive predictive value, and negative predictive value were 0.87, 0.82, 0.90, 0.85, and 0.93, respectively. With the usage of first-order features alone, the corresponding values were nearly identical.

Conclusion

AUC ranged for the robust and uncorrelated features from 0.83 ± 0.09 to 0.93 ± 0.04 and for the first-order features from 0.84 ± 0.09 to 0.91 ± 0.04. Our study indicates that the first-order features alone are sufficient for the classification of CRMs, and that inclusion of higher-order features does not necessarily improve performance.

Clinical value of radiomics and machine learning in breast ultrasound: a multicenter study for differential diagnosis of benign and malignant lesions

Article Open access 21 May 2021

Interpretable multiphasic CT-based radiomic analysis for preoperatively differentiating benign and malignant solid renal tumors: a multicenter study

Article Open access 11 May 2024

Differentiation of benign from malignant solid renal lesions using CT-based radiomics and machine learning: comparison with radiologist interpretation

Article 12 November 2022

Discover the latest articles, news and stories from top researchers in related subjects.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Incidental findings of cystic renal masses (CRMs) are common on abdominal computed tomography (CT) images [1]. Majority of these lesions are benign, but some may be malignant. Bosniak classification [2] stratifies these lesions in five classes: Bosniak I and II are almost always benign, while Bosniak IIF, III, and IV can be potentially malignant with 10–20% [3], 50% [4], and 90% [5] malignancy rate, respectively.

While “Bosniak Classification version 2019” [2] aims to be more objective and reduce the inter-reader variability in characterization of CRMs, it still remains subjective. A further reduction in inter-reader variability can be obtained with the use of quantitative features extracted from the imaging data, referred to as radiomic features. Radiomics coupled with machine learning algorithm has been an emerging research area aiming to complement radiologists’ visual interpretation, enhance the overall diagnostic and prognostic accuracy, and help reducing the inter-reader variability [6,7,8].

In a previous study, a texture-based machine learning algorithm was created to potentially reduce inter-reader variability in distinguishing benign from potentially malignant CRMs on CT images [9]. A short list of 1st-order radiomic features (mean, standard deviation, entropy, skewness, and kurtosis) was employed, given their strengths in straightforward implementation and known low variability among different software packages [10]. 1st-order features are based on the histogram of the pixel intensity values within a region of interest (ROI) and are independent of the spatial distribution, unlike the higher-order features varying with spatial distribution of the pixel values [11].

In this work, as an extension to the previous study, we investigate the effect of inter-reader variability in delineating the ROI, which is considered one of the most important issues in radiomics and thus has been extensively studied for various disease cases and imaging modalities [12,13,14,15]. In addition, we investigate whether the inclusion of higher-order features can further improve the performance of the machine learning algorithm in classifying benign from potentially malignant CRMs, compared to the use of 1st-order features alone. For this purpose, we evaluate the performance of a set of robust features, determined based on the intraclass correlation coefficient (ICC), which is a widely used metric to assess the feature reliability among the readers [16].

Materials and methods

Patient cohort and demographics

In this IRB-approved and HIPAA-compliant retrospective study, we included 230 cystic renal masses (161 benign and 69 potentially malignant) with diameters larger than 1 cm derived from our prior study on the use of CT texture analysis for stratification of cystic renal masses into benign and potentially malignant lesions per Bosniak classification v2019 [9]. All exams included 3 mm sections reconstructed with a 50% overlap before and 100 s (nephrographic phase) after IV administration of 50–150 ml iodinated contrast material (300–370 mg iodine/ml). Two radiologists independently and manually delineated 2D ROIs (referred to as R1 and R2) of the CRMs at an axial CT slice at the same level used in the prior study. The Bosniak class of all lesions was previously established independently by three blinded radiologists [9].

Synthetic ROI

To increase the number of observers, we generated two synthetic ROIs (Syn1 and Syn2) for each of the original ROI (R1 or R2) by modifying them with the following steps:

Syn1: Random fluctuation + 4-pixel dilation + 3-pixel erosion

Syn2: Random fluctuation + 4-pixel dilation + 5-pixel erosion

Here, the first step introduces random variation about the original contour in the vertical direction. The noisy contour is then smoothed and filled up by applying a 4-pixel dilation. Finally, the application of 3-pixel and 5-pixel erosion operations creates on average slightly larger (Syn1) and smaller (Syn2) synthetic ROIs compared to the original ROI, respectively. Figure 1 illustrates these steps applied to a sample ROI. As a result, a total of 4 synthetic ROIs were obtained in addition to the two original ROIs. The synthetic ROIs derived from R1 are named as R1_Syn1 and R1_Syn2. Similarly, R2_Syn1 and R2_Syn2 were derived from R2.

To measure similarity and to assess how well the synthetic ROIs represent the degree of inter-reader variability, we used dice coefficients (Dcoeff), similar to the approach presented in [15]. The Dcoeff for a pair of ROIs is calculated as the ratio of the overlap area (A_o) to the total area (A_t) multiplied by two (i.e., Dcoeff = 2 \(\times\) A_o/A_t). We first obtained the Dcoeff between R1 and R2 to establish a baseline for the expected degree of inter-reader variability. Then, we compared the Dcoeffs of the synthetic and original ROIs to this baseline.

Radiomic features

We used an open source Matlab code by Vallières et al. [17] to extract a total of 169 features. 160 of these were higher-order features from gray-level co-occurrence matrix (GLCM), gray-level run-length matrix (GLRLM), gray-level size zone matrix (GLSZM), and neighborhood gray-tone difference matrix (NGTDM) for two different quantization levels (8 and 16 Gy-levels) and for two different quantization algorithms (equal and uniform). The remaining 9 were 1st-order features (size of the major axis, total number of pixels, solidity, eccentricity, mean, standard deviation, entropy, skewness, and kurtosis).

Robust features intraclass correlation (ICC)

We computed the ICC (single fixed raters) among the 6 observers (2 original and 4 synthetic) using the Pingouin python package. We considered the features that provided ICC > 0.85 as robust features, which was defined as “good reliability” in [16].

Building the machine learning model

For the classification of the cystic renal masses, we used a random forest decision tree-based machine learning algorithm (sklearn.ensemble.RandomForestClassifier), which is among the most commonly used classifier in radiomics field because of its favorable features such as computational efficiency, stability against data perturbation [18], and superior performance [19, 20]. We applied a tenfold stratified cross-validation (CV) to train and test the performance of the model. In this method, all the operations were strictly separated between the training and test datasets. First, the data were split in 10 randomly selected groups (folds). Then, ninefold of the data were selected to build the machine learning model and the remaining onefold was used to test the performance of the model. This process was repeated ten times so that all individual folds are used for the test. The steps in each CV iteration were as following: (1) selection of the robust features based on the ICC > 0.85 criterion; (2) data normalization by removing the mean and scaling to unit variance; (3) removal of highly correlated features (Pearson Coefficient > 0.85); (4) fivefold hyperparameter grid-search to tune the model training parameters including size and depth of the decision trees; (5) fit the model using tuned parameters from step 4 on all ninefold training data; and (6) test the performance of the model on the onefold test data to compute the quantitative metrics: area under curve (AUC), sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). The test results from these 10 iterations were then aggregated to obtain the mean ± standard deviation (SD). We also repeated these steps (except for the steps 1 and 3) using the nine 1st-order features only.

Results

Table 1 summarizes the dice coefficient percentages obtained between different pair of observers. The Dcoeff% for the original ROIs, R1 and R2, was 88.1 ± 6.7. The Dcoeff% among synthetic and original ROIs were close to this value indicating that the synthetic ROIs provided sufficient variation in contours mimicking the degree of variation seen among readers. Figure 2 presents six sample ROIs visually confirming realistic variations among original and synthetic ROIs. The dice coefficients between R1 and R2, Dcoeff(R1,R2), are presented in increasing order from left (61%) to right (95%) at the top row. The middle row shows the inter-reader variability between R1_Syn1 and R1_Syn2 and the bottom row shows the inter-reader variability between R2_Syn1 and R2_Syn2.

Table 1 Dice coefficient percentages among the original and synthetic ROIs

Full size table

Robust features

We used a total of 169 features in this study (36 GLCM + 52 GLSZM + 52 GLRM + 20 NGTDM + 9 first order) as listed in five separate feature groups in Fig. 3a with their ICC values (3 GLZSM and 2 GLRM features with negative ICC values were not listed). 78 of these were considered as robust features with ICC > 0.85 (16 GLCM + 26 GLSZM + 23 GLRM + 7 NGTDM + 6 first order). Note, for the purpose of this figure only, the ICC values are calculated from the entire dataset without using the CV folds. Hence, the ICC values in each CV fold can be slightly different than the ones shown in this figure. The number of features was further reduced when highly correlated features were dropped based on Pearson correlation threshold of 0.85. For example, for R2, the remaining total features were 22 (2 GLCM + 7 GLSZM + 8 GLRM + 0 NGTDM + 5 first order) as listed in Fig. 3b. In Fig. 4, the ICC distributions are summarized in box plots, where the median of the 1st-order features have higher ICC values. Figure 5 shows the feature value heatmaps for the robust and uncorrelated features obtained from the 2 original and 4 synthetic ROIs. R1 and R1_Syn1 had the lowest (17) and highest (25) number of features, respectively (R2 had 22 features as mentioned above). The feature values for the benign and potentially malignant cases are well separated in these heatmaps indicating good classification performance and reliability across the six sets of ROIs.

Performance of the machine learning algorithm

The performance metrics for the robust (ICC > 0.85) and uncorrelated (Pearson < 0.85) features (ranging from 17 to 25 features total) as well as for the 1st-order features (9 features total) averaged over 6 observers (2 original and 4 synthetic ROIs) obtained with the random forest classifier are summarized in Table 2 in terms of mean ± SD. Figure 6 shows the receiver operating characteristic (ROC) curves obtained from the 6 observers, each averaged over 10 CV folds. Figure 7 shows the first 10 most important features from these CV folds.

Table 2 Quantitative performance

Full size table

Discussion

We presented the use of a random forest machine learning algorithm for the classification of benign from potentially malignant CRMs seen on CT images. For this purpose, we studied the inter-reader variability in delineation of ROIs and determined stable (robust) 1st- and higher-order radiomic features under this variability. We then evaluated the classification performance of these robust features in comparison to the use of 1st-order features obtained from original and synthetic sets of ROIs.

Generating synthetic ROIs from the original ROIs is a common practice in radiomics as manual segmentation of large datasets is labor-intensive and time-consuming. This is typically performed through simple image operations such as smoothing, dilation and erosion [13, 21], and isotropic resizing [22, 23] over the original ROIs, initially created by one or few readers. In a comprehensive study, Zwanenburg et al. [24] investigated 18 combinations of image perturbations to determine feature robustness, based on noise addition, translation, rotation, volume resizing, and contour randomization. Many of these methods, however, may not capture the inter-reader variability in a realistic manner and may be highly correlated with the original ROIs as the overall shape remains very similar under these operations. An extensive study by Joscowicz et al. [25] has well illustrated the actual degree of inter-reader variability based on 11 radiologists’ manual delineation of lesion (brain hematoma) and organ (brain, lung, liver, and kidney) contours in CT. Therefore, in contrast to many prior studies, our synthetic ROIs were generated through a combination of random fluctuations, dilation, and erosion operations over the original ROIs, which realistically captured the inter-reader variability as presented quantitatively in Table 1 and qualitatively in Fig. 2. For comparison, we performed a sub-study by generating three more synthetic ROIs by applying simple 1-pixel, 2-pixel, and 3-pixel dilation operations on R1. The average dice coefficients between “R1 and dilation of R1”, DCoeff(R1, dil_R1), were 93% (for 1-pixel), 86% (for 2-pixel), and 81% (for 3-pixel), respectively. These results indicate quantitatively that 1-pixel and 3-pixel dilations do not capture the level of variability recorded between R1 and R2 (DCoeff(R1,R2) = 88%). While the average DCoeff with the 2-pixel dilation appears to be reasonable quantitatively, it does not represent a realistic variation visually as it follows the same contour (slightly larger in every directions) of the original ROI.

By computing ICC of these datasets (two original and four synthetic ROIs), we were able to determine the robust radiomic features from a larger sample, which improves the reliability of feature selection. We listed these features using the entire dataset in Fig. 3a. This list, however, was not directly used in the training data to prevent information leakage between the training and test data [26]. Instead, we performed ICC computations for robust feature selection and all the other operations strictly within each cross-validation fold.

The robust features extracted from these ROIs were used in the random forest algorithm for the classification of benign and potentially malignant CRMs, which resulted in high AUC (0.87), sensitivity (0.82), and specificity (0.90) averaged over six sets of ROIs as shown in Table 2. The quantitative results were considerably affected by different delineations among the six datasets. For example, AUC ranged for the robust and uncorrelated features from 0.83 ± 0.09 to 0.93 ± 0.04 and for the 1st-order features from 0.84 ± 0.09 to 0.91 ± 0.04.

As the AUC ranges show, the inclusion of higher-order features for this specific study did not improve the performance compared to the usage of the 1st-order features alone. In fact, a review paper by Traverso et al. [27] reported that the 1st-order features were overall more reproducible among the 41 research studies they reviewed. The same publication also stated that entropy was among the most stable 1st-order features, which was also the case in this study. As shown in Fig. 7, the entropy was the most important feature followed by the mean. Another advantage of using 1st-order feature is their simple implementation and low variability among different software packages.

The quantitative results presented here were overall consistent with the results of a previous study applying 1st-order features on a similar dataset [9]. One exception to this was the sensitivity, which was considerably higher in this study (0.82) compared to the previous one (0.67). This difference is likely due to the exclusion of renal masses smaller than 1 cm in size in the present study as they are often too small to fully characterize on CT and typically considered clinically inconsequential [28]. Also, the differences in the 1st-order feature list and steps in the train/test workflow (e.g., grid-search, inclusion of higher-order features, and ICC-based robust feature selection) may have contributed to the differences in results between the two studies. Another important difference between the two studies was the inclusion of a new set of ROIs (R2) created by a second radiologist and four more synthetic ROIs derived from R1 and R2 to investigate the inter-reader variability in delineating the ROI.

While the random forest algorithm used in this study demonstrated high discriminatory capability in distinguishing benign from potentially malignant CRMs, slight improvements in these results can be expected with the inclusion of features that were not considered here and/or with the usage of different machine learning algorithms. However, in a sub-investigation we performed, we were not able to improve the results significantly by a different machine algorithm (e.g., Gradient Boost), inclusion of more correlated features, and/or expanding the grid-search (unpublished data).

Another potential limitation was the usage of 2D ROI drawn on a single axial slice, which may not fully capture the information for the Bosniak classification compared to a volumetric approach using 3D ROI. Most current radiomics research, however, is based on single slice for practical reasons as 3D approach is more time-consuming and complicated. Additionally, there appears to be no strong evidence in literature indicating that the classification based on 3D method is better than that of 2D methods [29,30,31,32]. While typical 2D approach selects the center slice to represent the entire mass, in this work, the slice with the highest Bosniak class features was selected to better capture the characteristics of the cystic renal mass.

Conclusion

We presented the level of inter-reader variability in delineation of CRMs and its effect on the classification performance. Using random forest algorithm, we obtained high sensitivity and specificity in the prediction of the benign and potentially malignant cystic renal masses. We also demonstrated that the usage of the 1st-order features alone is sufficient for the classification of cystic renal masses and the inclusion of higher-order features does not necessarily improve the performance.

References

L. L. Berland et al., "Managing incidental findings on abdominal CT: white paper of the ACR incidental findings committee," J Am Coll Radiol, vol. 7, no. 10, pp. 754-73, Oct 2010.
Article Google Scholar
S. G. Silverman et al., "Bosniak Classification of Cystic Renal Masses, Version 2019: An Update Proposal and Needs Assessment," Radiology, vol. 292, no. 2, pp. 475-488, Aug 2019.
Article Google Scholar
N. M. Hindman, E. M. Hecht, and M. A. Bosniak, "Follow-up for Bosniak category 2F cystic renal lesions," Radiology, vol. 272, no. 3, pp. 757-66, Sep 2014.
Article Google Scholar
N. M. Hindman, "Cystic renal masses," Abdom Radiol (NY), vol. 41, no. 6, pp. 1020-34, Jun 2016.
Article Google Scholar
S. G. Silverman, Y. U. Gan, K. J. Mortele, K. Tuncali, and E. S. Cibas, "Renal masses in the adult patient: the role of percutaneous biopsy," Radiology, vol. 240, no. 1, pp. 6-22, Jul 2006.
Article Google Scholar
B. Kocak, E. S. Durmaz, E. Ates, and O. Kilickesmez, "Radiomics with artificial intelligence: a practical guide for beginners," Diagn Interv Radiol, vol. 25, no. 6, pp. 485-495, Nov 2019.
Article Google Scholar
M. Avanzo et al., "Machine and deep learning methods for radiomics," Med Phys, vol. 47, no. 5, pp. e185-e202, Jun 2020.
Article Google Scholar
R. Suarez-Ibarrola, M. Basulto-Martinez, A. Heinze, C. Gratzke, and A. Miernik, "Radiomics Applications in Renal Tumor Assessment: A Comprehensive Review of the Literature," Cancers (Basel), vol. 12, no. 6, May 28 2020.
N. Miskin et al., "Stratification of cystic renal masses into benign and potentially malignant: applying machine learning to the bosniak classification," Abdominal Radiology, 2020.
J. J. Foy, K. R. Robinson, H. Li, M. L. Giger, H. Al-Hallaq, and S. G. Armato, 3rd, "Variation in algorithm implementation across radiomics software," Journal of medical imaging, vol. 5, no. 4, p. 044505, Oct 2018.
M. E. Mayerhoefer et al., "Introduction to Radiomics," J Nucl Med, vol. 61, no. 4, pp. 488-495, Apr 2020.
Article CAS Google Scholar
F. Rizzetto et al., "Impact of inter-reader contouring variability on textural radiomics of colorectal liver metastases," Eur Radiol Exp, vol. 4, no. 1, p. 62, Nov 10 2020.
X. Zhang et al., "The effects of volume of interest delineation on MRI-based radiomics analysis: evaluation with two disease groups," Cancer Imaging, vol. 19, no. 1, p. 89, Dec 21 2019.
M. Pavic et al., "Influence of inter-observer delineation variability on radiomics stability in different tumor sites," Acta Oncol, vol. 57, no. 8, pp. 1070-1074, Aug 2018.
Article Google Scholar
C. Haarburger, G. Muller-Franzes, L. Weninger, C. Kuhl, D. Truhn, and D. Merhof, "Radiomics feature reproducibility under inter-rater variability in segmentations of CT images," Sci Rep, vol. 10, no. 1, p. 12688, Jul 29 2020.
T. K. Koo and M. Y. Li, "A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research," J Chiropr Med, vol. 15, no. 2, pp. 155-63, Jun 2016.
Article Google Scholar
M. Vallieres, C. R. Freeman, S. R. Skamene, and I. El Naqa, "A radiomics model from joint FDG-PET and MRI texture features for the prediction of lung metastases in soft-tissue sarcomas of the extremities," Phys Med Biol, vol. 60, no. 14, pp. 5471-96, Jul 21 2015.
Article CAS Google Scholar
C. Parmar, P. Grossmann, J. Bussink, P. Lambin, and H. Aerts, "Machine Learning methods for Quantitative Radiomic Biomarkers," Sci Rep, vol. 5, p. 13087, Aug 17 2015.
J. Uhlig et al., "Discriminating malignant and benign clinical T1 renal masses on computed tomography: A pragmatic radiomics and machine learning approach," Medicine (Baltimore), vol. 99, no. 16, p. e19725, Apr 2020.
C. Erdim et al., "Prediction of Benign and Malignant Solid Renal Masses: Machine Learning-Based CT Texture Analysis," Acad Radiol, vol. 27, no. 10, pp. 1422-1429, Oct 2020.
Article Google Scholar
S. A. Mattonen et al., "Bone Marrow and Tumor Radiomics at (18)F-FDG PET/CT: Impact on Outcome Prediction in Non-Small Cell Lung Cancer," Radiology, vol. 293, no. 2, pp. 451-459, Nov 2019.
Article Google Scholar
R. Liu et al., "Stability analysis of CT radiomic features with respect to segmentation variation in oropharyngeal cancer," Clin Transl Radiat Oncol, vol. 21, pp. 11-18, Mar 2020.
Article CAS Google Scholar
B. Kocak, E. Ates, E. S. Durmaz, M. B. Ulusan, and O. Kilickesmez, "Influence of segmentation margin on machine learning-based high-dimensional quantitative CT texture analysis: a reproducibility study on renal clear cell carcinomas," Eur Radiol, vol. 29, no. 9, pp. 4765-4775, Sep 2019.
Article Google Scholar
A. Zwanenburg et al., "Assessing robustness of radiomic features by image perturbation," Sci Rep, vol. 9, no. 1, p. 614, Jan 24 2019.
L. Joskowicz, D. Cohen, N. Caplan, and J. Sosna, "Inter-observer variability of manual contour delineation of structures in CT," Eur Radiol, vol. 29, no. 3, pp. 1391-1399, Mar 2019.
Article Google Scholar
M. C. F. Cysouw et al., "Machine learning-based analysis of [(18)F]DCFPyL PET radiomics for risk stratification in primary prostate cancer," Eur J Nucl Med Mol Imaging, Jul 31 2020.
A. Traverso, L. Wee, A. Dekker, and R. Gillies, "Repeatability and Reproducibility of Radiomic Features: A Systematic Review," Int J Radiat Oncol Biol Phys, vol. 102, no. 4, pp. 1143-1158, Nov 15 2018.
Article Google Scholar
M. M. Nguyen and I. S. Gill, "Effect of renal cancer size on the prevalence of metastasis at diagnosis and mortality," J Urol, vol. 181, no. 3, pp. 1020–7; discussion 1027, Mar 2009.
C. Shen et al., "2D and 3D CT Radiomics Features Prognostic Performance Comparison in Non-Small Cell Lung Cancer," Transl Oncol, vol. 10, no. 6, pp. 886-894, Dec 2017.
Article Google Scholar
L. Meng et al., "2D and 3D CT Radiomic Features Performance Comparison in Characterization of Gastric Cancer: A Multi-center Study," IEEE J Biomed Health Inform, vol. PP, Jun 16 2020.
S. Roy et al., "Optimal co-clinical radiomics: Sensitivity of radiomic features to tumour volume, image noise and resolution in co-clinical T1-weighted and T2-weighted magnetic resonance imaging," EBioMedicine, vol. 59, p. 102963, Sep 2020.
D. Arefan, R. Chai, M. Sun, M. L. Zuley, and S. Wu, "Machine learning prediction of axillary lymph node metastasis in breast cancer: 2D versus 3D radiomic features," Med Phys, vol. 47, no. 12, pp. 6334-6342, Dec 2020.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Imaging Department, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
Arda Könik & Lei Qin
Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
Nityanand Miskin
Department of Radiology, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
Yang Guo
Department of Radiology, Brigham and Women’s Hospital and Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
Atul B. Shinagare

Authors

Arda Könik
View author publications
You can also search for this author in PubMed Google Scholar
Nityanand Miskin
View author publications
You can also search for this author in PubMed Google Scholar
Yang Guo
View author publications
You can also search for this author in PubMed Google Scholar
Atul B. Shinagare
View author publications
You can also search for this author in PubMed Google Scholar
Lei Qin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Arda Könik.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Könik, A., Miskin, N., Guo, Y. et al. Robustness and performance of radiomic features in diagnosing cystic renal masses. Abdom Radiol 46, 5260–5267 (2021). https://doi.org/10.1007/s00261-021-03241-2

Download citation

Received: 22 April 2021
Revised: 22 April 2021
Accepted: 06 August 2021
Published: 11 August 2021
Issue Date: November 2021
DOI: https://doi.org/10.1007/s00261-021-03241-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Robustness and performance of radiomic features in diagnosing cystic renal masses