Abstract
Purpose
Radiomics is an emerging field that utilizes quantitative features extracted from medical images to predict clinically meaningful outcomes. Validating findings is crucial to assess radiomics applicability. We aimed to validate previously published magnetic resonance imaging (MRI) radiomics models to predict oncological outcomes in oral tongue squamous cell carcinoma (OTSCC).
Materials and methods
Retrospective multicentric study on OTSCC surgically treated from 2010 to 2019. All patients performed preoperative MRI, including contrast-enhanced T1-weighted (CE-T1), diffusion-weighted sequences and apparent diffusion coefficient map. We evaluated overall survival (OS), locoregional recurrence-free survival (LRRFS), cause-specific mortality (CSM). We elaborated different models based on clinical and radiomic data. C-indexes assessed the prediction accuracy of the models.
Results
We collected 112 consecutive independent patients from three Italian Institutions to validate the previously published MRI radiomic models based on 79 different patients. The C-indexes for the hybrid clinical-radiomic models in the validation cohort were lower than those in the training cohort but remained > 0.5 in most cases. CE-T1 sequence provided the best fit to the models: the C-indexes obtained were 0.61, 0.59, 0.64 (pretreatment model) and 0.65, 0.69, 0.70 (posttreatment model) for OS, LRRFS and CSM, respectively.
Conclusion
Our clinical-radiomic models retain a potential to predict OS, LRRFS and CSM in heterogeneous cohorts across different centers. These findings encourage further research, aimed at overcoming current limitations, due to the variability of imaging acquisition, processing and tumor volume delineation.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
Radiomics represents one of the most attractive fields in medicine that has taken hold in the last 10 years [1]. Radiomic features analysis from pretreatment images aimed at acquiring all possible data on cancer characteristics and prognosis prediction [2, 3]. Currently, oncological outcomes can be predicted only based on pathological and clinical tumor stage, leading to the need for innovative strategies to better foresee patients’ prognosis. Therefore, the quantitative information extracted from medical images, is the basis for radiomics to become unbiased and independent support in daily clinical practice. Despite the large number of published studies, the application of radiomics to clinical practice is not yet feasible because of the lack of radiomic validation in different cohorts [3,4,5].
In our study, we apply the radiomics workflow to oral tongue squamous cell carcinoma (OTSCC), the tumors that most frequently affect the oral cavity [6]. For preoperative staging, magnetic resonance imaging (MRI) is the gold standard imaging [7]. The first-choice therapeutic approach for OTSCC is surgery, and the prognosis reported in the literature remains 60% at 5 years [8, 9]. Survival ranges from 80% for early stages (I–II) to 30% for advanced stages (III–IV). There is an increasing awareness that these estimates may not adequately fit single patient's history, calling for a more tailored prediction approach. In precision medicine, accurate risk prediction is necessary to plan personalized therapeutic/follow-up schemes based on the individual survival curves, leading to the ability to predict “individual patient’s survival,” instead of a too general “global survival” [4].
To date, published studies have underlined how the hybrid clinical-radiomic predictive models are better than the clinical or radiomic models for predicting patients’ outcomes [10,11,12].
Validating a radiomic model should be mandatory [5, 13] to confirm its potential use in clinical settings. The validation cohort should be independent and external, based on subjects other than the training set cohort [14], as validations based on the reference model's internal cohort are generally less robust [5, 14,15,16].
We have recently published a study with clinical-radiomic models in patients with OTSCC capable of preoperatively predicting patients' prognosis, better than clinical models alone [10]. In this published analysis we assessed preoperative MRI of 79 patients with OTSCC, aiming to define potential prognostic biomarkers using radiomic features. Upon evaluating clinical and radiomic features, the radiomic score maintained statistical significance in almost all clinical-radiomic predictive models and apparent diffusion coefficient map (ADC) MRI provided the best fit to the models.
According to the concept set out in the previous paragraph on the importance of external validation of radiomics studies [14] to highlight stability and reproducibility, the aim of this paper is to externally validate our previously published clinical-radiomics models [10] through a different and independent cohort of patients.
Materials and methods
Clinical dataset
We performed a retrospective radiomic analysis on preoperative MRI of consecutive patients with OTSCC surgically treated in a multicentric setting: Surgery performed at the European Institute of Oncology (IEO) Milan, Italy with MRI performed in other Italian facilities collected in the online imaging storage (IEO EXT); Surgery and Imaging at Policlinico San Matteo, Pavia, Italy (PV); Surgery and Imaging at Spedali Civili, Brescia, Italy (BS).
The study was approved by the Ethics Committee of the IEO and by the IEO Radiomic Board (UID 2520), Spedali Civili of Brescia (120/Reg.IX) and Policlinico San Matteo of Pavia (96379/2020).
Inclusion criteria were: diagnosis of OTSCC; surgery performed between 2010 and 2019; preoperative MRI (≤ 4 weeks before surgery) including contrast-enhanced T1-weighted (CE-T1) and diffusion-weighted (DWI) sequences with at least two b values for the calculation of apparent diffusion coefficient map (ADC).
Exclusion criteria were: concurrent or previous cancer in head and neck (HN) region; inadequate follow-up information (irretrievable medical information data); inadequate MRI for tumor volume segmentation and quantitative analysis (i.e., cases were excluded when the primary tumor was not detectable or artifacts significantly degraded the images).
Clinical, pathological, treatment and follow-up information were collected from medical reports. All patients were re-staged according to 7th and 8th edition of AJCC TNM [17, 18]. To re-stage all patients using the AJCC 8th edition we re-evaluated and collected all histological and radiological Depth of Invasion (DOI). In both, histological specimen and radiological MRI DOI was measured perpendicularly from the line connecting the adjacent normal mucosal basement membrane to the deepest point of tumor invasion.
All patients were surgically treated and then the need of adjuvant therapy was discussed and defined within a multidisciplinary tumor board according to stage disease [19, 20].
For at least 5 years from the end of treatment, all patients underwent state-of-art clinical assessments and procedures of standardize oncological follow-up according to the NCCN guidelines [20]. The distant or local events as metastases or locoregional recurrence were recorded during the follow-up visits. Patients who did not attend scheduled follow-up appointments were phone interviewed to verify and update their medical information,
MRI acquisition parameters, segmentation, feature extraction
For the Training Group (TG), MRIs were performed at IEO on a 1.5-T system (Magnetom Avanto, Siemens Healthineers, Erlangen, Germany) with a dedicated 16-channel receive-only radiofrequency HN coil [10]. DWI sequences were obtained via single-shot spin-echo and echo-planar imaging (field of view 250 × 250 mm, TR/TE 5000/77 ms, slice thickness 5 mm, spacing between slices 1 mm, bandwidth 1565 Hz/pixel). Three different b values were used (b = 0.500 and 900 s/mm2) with diffusion-sensitizing gradients applied in three orthogonal directions to obtain trace-weighted images [10]. ADC maps derive from a mono-exponential analysis of diffusion-weighted images. The imaging protocol included post-contrast (Gadolinium 0.2 ml/kg) isotropic T1-w images (acquisition matrix 263 × 384 mm, field of view 187 × 240 mm, TR/TE 7.43/2.88 ms, slice thickness 0.6 mm) [10].
For the Validation Group (VG) at Policlinico San Matteo of Pavia, the MRI was performed on a 1.5-T system (Aera, Siemens Healthineers, Erlangen, Germany) with the dedicated 16-channel receive-only radiofrequency HN coil. DWI sequences were obtained via single-shot spin-echo and echo-planar imaging (260 × 260 mm, TR/TE 7075 /58 ms, slice thickness 3 mm, spacing between slices 3.3 mm, bandwidth 1540 Hz/pixel). To obtain trace-weighted images, two different b values were used (b = 50, 800 s/mm2) with diffusion-sensitizing gradients applied in three orthogonal directions. The imaging protocol included post-contrast (Gadolinium 0.2 ml/kg) isotropic T1-w images (acquisition matrix 256 × 256, field of view 260 × 260 mm, TR/TE 8.23/2.39 ms, slice thickness 1 mm).
For the VG at Spedali Civili of Brescia, the MRI was performed on a 1.5-T system (Aera, Siemens Healthcare Sector, Erlangen, Germany) with the dedicated 16-channel receive-only radiofrequency HN coil. DWI sequences were obtained via single-shot spin-echo and echo-planar imaging (field of view 250 × 250 mm, TR/TE 4000/60 ms, slice thickness 3 mm, spacing between slices 1 mm, bandwidth 1447 Hz/pixel). Two different b values were used (b = 0 and 1000 s/mm2) with diffusion-sensitizing gradients applied in three orthogonal directions to obtain trace-weighted images. The imaging protocol included post-contrast (Gadolinium 0.2 ml/kg) isotropic T1-w images (acquisition matrix 448 × 350, field of view 270 × 210 mm, TR/TE 8.2/3.16 ms, slice thickness 0.6 mm).
The MRI characteristics described above cannot be described in the same detail for patients treated at IEO with imaging performed in different external centers (IEO EXT). This is because the IEO imaging storage archives only the DICOM images of the MRIs performed in the external hospitals, leading to some lacking details.
Two dedicated HN radiologists [one senior radiologist (FR) with 7 years’ experience in HN and one junior (AN) with 1 year of experience in HN] manually segmented the entire tumor volume (region of interest, ROI) in CE-T1 sequences and ADC maps (Fig. 1). Inter-reader agreement was evaluated and all discrepancies were solved through discussion. The radiologists were unaware of the patient characteristics and their follow-ups status.
DICOM files and ROIs were extracted as radiotherapy (RT) structure files on AW Server 3.2 workstation (GE Healthcare, Milwaukee, WI) and exported in DICOM format. Features extraction was performed using the Image Biomarker Standardization Initiative (IBSI) compliant tool PyRadiomics v3.0.1 (Numpy version 1.20.3, SimpleITK version 2.0.2, PyWavelet version 1.1.1) with default settings. The radiomic features were extracted from all available filter classes (Laplacian of Gaussian, wavelet, logarithm, exponential, local binary pattern 2D, local binary pattern 3D, square, square root and gradient). In total, 1967 features were extracted from each segmentation.
Oncological outcomes endpoints
Overall survival (OS) was defined as the time from surgery until death from any cause or the last contact date if alive. Locoregional recurrence-free survival (LRRFS) was defined as the time from surgery until locoregional recurrence or the last contact date without locoregional recurrence. LRRFS includes relapses on tumor (T), lymph nodes (N) or T and N, no distant metastasis or second primary were included. Cause-specific mortality (CSM) was defined as the time from surgery until the date of death for OTSCC. In case of no death due to OTSCC, the observation was censored at the last follow-up visit or the date of death for other causes. The Kaplan–Meier was estimated for 10-year OS, LRRFS and CSM.
Statistical analysis
We performed a validation of ahead trained models on a different and independent cohort of patients (TG) [10]. Specifically, we applied the radiomic models, the pretreatment clinical models (including gender, age and clinical status of the lymph node), and the posttreatment clinical models (including state of margins, state of lymph nodes, presence extracapsular metastases (ECE) [10].
Frequency and percentages for categorical variables and median and interquartile range (IQR) for continuous variables were used to summarize patients’ characteristics in the validation cohort. Pearson's chi-square and Fisher's exact tests were used to test for differences between categorical variables. Wilcoxon's rank sum and Kruskal–Wallis tests were used to assess for a continuous variable difference between two or more groups, respectively. First, clinical differences and median follow-up between patients in the TG and VG were analyzed, as well as between the centers involved. Differences of the radiomic features between VG and TG were evaluated (Supplementary Tables 1 and 2). If significant differences among features were identified, these features were harmonized with the COMBAT method (“EZ. combat” library in R), considering each center as a batch variable [21]. The difference in OS, LRRFS and CSM between the TG and VG was tested with the Log-rank test. The centers involved in validation were compared in the OS, LRRFS and CSM. Risk estimates were quantified by hazard ratio (HR) and 95% confidence intervals (95%-CI).
Radiomic information was aggregated in the statistical analysis within a “radiomic score,” built with the coefficient used in the previous study [10]. For each patient, the radiomic score was defined as the sum of the product between the regression coefficients and their respective feature values. We evaluated the previously presented predictive models: clinical model (pre- and posttreatment), radiomic model and clinical-radiomic model (pre- and posttreatment). The models’ accuracy was tested by using the Concordance index (C-index), a measure of goodness of fit for survival outcomes, which ranges from 0 (poor predictive model) to 1 (hypothetical perfect predictive model). All analyses were repeated for CE-T1 and ADC. Lastly, a Bootstrap analysis with 5000 repetitions was performed to determine a 95%-CI for the C-index estimate. As an exploratory analysis, we used the VG as a new dataset to build a new model with new coefficients using the preselected clinical variables and the radiomic score obtained in the training model. Analyses were performed using R (4.1.1).
Results
The study included 112 patients as VG. TG models were built on 79 different patients [10].
Out of the 112 patients included in the study based on inclusion and exclusion criteria, after MRI quality images reviewed by the HN Radiologists, 108 patients were included in the VG for CE-T1 sequences (IEO EXT N = 25; PV N = 19; BS N = 64) and 83 patients for ADC from DWI sequences (IEO EXT N = 13; PV N = 7; BS N = 63).
Table 1 reports the clinical and pathological characteristics of patients in the two study groups: TG (79 patients), VG (112 patients) and Overall (191 patients). Sixty-nine percent of the entire population was male, with an average age of 62 (IQR: 47–70), with VG including significantly older patients (64 years, IQR: 52–73) than the TG (55 years, IQR: 41–67).
Clinical staging (c) showed differences between TG and VG in tumor (cT) classification (p = 0.03) and clinical Stage cTNM (p = 0.05) with higher stages found for patients in TG.
Postsurgical information: vascular invasion, perineural invasion, surgical margins (positive or close) and lymph node stage (pathological (p)N) were statistically different between the two groups TG and VG: < 0.001, < 0.001, 0.017, 0.008, respectively (Table 1).
No difference was found between the pTNM and postoperative treatments; p = 0.02 was found for the type of surgery, transoral (glossectomy type II) versus en block major surgery (glossectomies type III–V) [22].
The median follow-up time was 2.8 years IQR (1.84–5.44) for OS, 2.8 years (0.94–4.69) for LRRFS and 2.75 years (1.45–4.87) for CSM.
No significant differences were detected between the TG and VG for OS, LRRFS and CSM (all p values > 0.05) (Fig. 2).
Even among the patients collected by the three different clinical centers involved in the validation, there were no differences in survival: OS, LRRFS and CSM (Fig. 3).
Features selected in our previous model for OS, LRRFS and CSM were retrieved from the VG [10]. The radiomic score was calculated on the validation dataset with the previously defined coefficient.
Tables 2 and 3 summarize all the results of the validation analysis, comparing the C-Index in the training set and the fitted C-index (“C-index-validation”) in CE-T1 and ADC map for the pretreatment and posttreatment models, respectively. Radiomic models adequately fitted each endpoint: Table 2 depicts the comparison between the C-Index in the TG and the fitted C-Index (C-Index–VG) for ADC and CE-T1 sequences in the model base on pretreatment information.
Table 3 shows the comparison between the C-Index in the TG and the fitted C-Index (C-Index–VG) in ADC and CE-T1 sequences for the Model using posttreatment information. The C-Index was generally > 0.5 in VG: OS C-index for radiomic score were 0.59; LRRFS C-index for ADC and CE-T1were 0.43 and 0.52, respectively; CSM C-index for ADC and CE-T1 were 0.63 and 0.61, respectively. In Supplementary Tables 1 and 2 are reported the differences in radiomic features between VG and TG in CE-T1 and ADC sequences, respectively.
The C-indexes for the hybrid clinical-radiomic models in the validation cohort were lower than those in the training cohort but remained > 0.5 in most cases. CE-T1 sequence provided the best fit to the models: the C-indexes obtained were 0.61, 0.59, 0.64 (pretreatment model) and 0.65, 0.69, 0.70 (posttreatment model) for OS, LRRFS and CSM, respectively.
Other additional details are reported in the Supplementary Material (S1–S2).
Discussion
The application of radiomics in HN cancers is a barely explored field, especially when tongue tumors are considered [23]. To date, upon a non-systematic search in Pubmed and Embase with the keywords “radiomic” AND “tongue cancer,” few publications emerge: 11 on Pubmed and 15 on Embase. Of these, 9 are in common between the two search engines, one paper is written in Chinese, 3 are congress abstracts, one is focused on lung cancer and the latter is a review [24,25,26,27,28,29]. Of the remaining publications, one is a review, and five proposed radiomic models to predict the lymph node status in the neck, occult metastases or lymph node ECE [11, 30,31,32,33,34]. Two articles focused on predicting prognosis through MRI radiomic-based features, and others on tumor grading radiomic determination before histology [10, 35,36,37].
Regarding the use of MRI, the most recent article by Corti et al. reported how MRI-based radiomic signature could be a prognostic marker for OS in oral cavity cancer patients, comparing it with gene expression signatures [38].
Our manuscript presented the validation of clinical-radiomic models for prognosis prediction in mobile tongue tumors, considering OS, CSM and LRRFS.
The validation phase is crucial for radiomics application in clinical practice. As expected, the C-indexes for the clinical-radiomic models in the VG were lower than those in the TG. This is a common feature in validating studies mainly due to the unavoidable differences between the training and validating cohorts [39]. In the present analysis, the VG resulted to have some worse prognostic histological characteristics compared to the TG. However, the C-indexes still remained > 0.5 in most cases, predicting a good adherence of the hybrid model to reality.
In our previous publication, the combined clinical-radiomic models for prognosis prediction showed a strong association between clinical variables, radiomic features and oncological outcomes in OTSCC [10]. We focused our radiomic study on this specific patient group because it is known that HN cancers are very heterogeneous due to different risk factors, anatomical site and prognosis [40]. This heterogeneity is also reported within the same anatomical site: for example, oral cavity subsites are considered cheek, floor of the mouth, mobile tongue, maxillary tuber, mandible. Tumors of these subsites have different survival rates even within the same staging [40]. Furthermore, the gold standard preoperative imaging also varies by subsites: MRI for OTSCC or cheek; computed tomography for maxillary tuber and mandible. For these reasons and to maximize the models’ accuracy, the study group was selected as OTSCC patients and not general oral cavity cancer.
As already mentioned, MRI is the gold standard imaging for local evaluation in OTSCC [7]. The protocol for proper MRI in these patients includes T1- and T2-weighted sequences, CE-T1 with fat saturation and optionally DWI and ADC. MRI can add important information such as preoperative radiological DOI which has been demonstrated to be an independent preoperative predictor of oncological outcomes in OTSCC to and to better predict patients’ clinical stage [41]. This is in accordance with the idea that applying radiomics on MRI could represent an added value in prognosis prediction for OTSCC. In this study we selected only CE-T1 sequences and ADC maps because they represented respectively the most widespread and reproducible sequence (CE-T1) and the best fit (ADC) in the previous published models that we aim to externally validate [10].
Our data confirmed that among the clinical models for prognosis prediction, the models with posttreatment information showed on average better performance in oncological outcomes also in the combined clinical-radiomic models.
Notably, upon comparing the radiomic model (radiomic score) alone with the clinical model based on pretreatment information, we found an improvement in terms of C-index in the TG radiomic model. In the VG, the C-indexes values of the two models were very similar, with an average variation of 0.02, 0.001, suggesting that the radiomic score may have been specific to the population in which it was constructed (TG). The radiomic sequences studied and chosen in the TG for constructing the radiomic score were built with a brush on the TG itself. So, the radiomic score could overfit the TG but be less effective in predicting clinical outcomes in the VG.
Moreover, in the VG, all the MRIs studied belong to different hospitals, while the TG features were acquired on MRI performed at the same hospital (IEO) and some differences may be attributable to the differences between the two cohorts.
In the VG, comparing the clinical model built on posttreatment information and the radiomic model alone, we did not find a relevant advantage in the prognosis prediction accuracy of the radiomic model alone.
Focusing on the combined pre and posttreatment clinical-radiomic models in OS, LRRFS and CSM, we presented comparable C-indexes between groups without significant increases.
Our data underlined that the CE-T1 sequences were suitable for prognosis prediction (C-index > 0.5) in OTSCC. This sequence is also widespread in other hospitals, including non-tertiary centers as it is considered standard-of-care for tongue cancer evaluation, allowing the predictive model to be reproduced for all patients and across different facilities [10, 38]. Conversely, DWI sequences and ADC map are not performed in the clinical routine of all centers, thus limiting its potential application [10, 38].
This validation step on the previously created models underlined two different aspects. Firstly, it confirmed the potential ability of radiomics associated with clinical information to predict the prognosis of a single patient, even in heterogeneous cohorts. Secondly, it also highlighted the fragility of the technique due to the critical issues of validation, which is fundamental for models’ exportation and potential applicability to daily clinical practice [41]. The fact that the C-indexes of the VG were good but all lower than those obtained in the TG draws attention to how it could be difficult to speak a common language in this field [42, 43]. The discrepancy in predictive performance between the training and validation sets questions whether overfitting is an issue or if it is due to differences in external validation images.
Moreover, in our study not all patients presented both CE-T1 and ADC MRI sequences, after multidisciplinary and internal discussion we included all patients with CE-T1 data (108 patients), as represents the most widespread and reproducible MRI sequences: its greater number in the sample, with respect to ADC (83 patients), also reflects its potential greater external applicability.
Multiple variables can influence MRI-based-radiomics on OTSCC, including the different machines, ROIs manual segmentation, relatively small number of cohorts analyzed for oral cancers [43, 44]. These aspects partly explain why radiomics represents a still-developing discipline and in its beginning in HN cancers, especially in tongue cancer [13, 43, 45]. Also, the importance of comparing data across study methodologies and structure with other cancer types, in which radiomics is already at a more advanced state, is essential [43]. Our study is one of the few currently published with the intent to associate MRI radiomics with the ability to predict patients’ prognosis in OTSCC, and it presents strengths and limitations. The validation group, different from the TG cohort, should be considered the main strength of our work [46], followed by the completeness of all clinical and follow-up information of all patients. The major limitation is the suboptimal number of patients in the two cohorts. A more significant number of patients and prospective multicentric studies could increase the significance of these results, their validation and reproducibility. This paper focused on the quantitative data demonstrating the significance of the radiomic data alone thus, we did not measure the ADC value or included radiological information on surrounding structures invasion even if they could be important for tumor aggressiveness and prognosis prediction. Moreover, using a multi-parametric image fusion could lead to better results in terms of models’ performance, this could be the subject of a future work on the same expanded dataset.
Conclusion
Radiomics represents a promising noninvasive method for the implementation of precision medicine. Our results confirmed that the validation step is one of the main limitations in radiomics, which is more evident when external cohorts are at stake. Nevertheless, our results confirmed the potential added value of radiomics to refine individual patients’ prognosis, even in the context of rather heterogeneous populations. The challenges in external validation may derive from many reasons, including different types of imaging acquisition parameters and variability of tumor volume delineation. Upon solving these issues, radiomics could be a potential instrument to perform tailored prognosis prediction in OTSCC.
References
Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RG, Granton P, Zegers CM, Gillies R, Boellard R, Dekker A, Aerts HJ (2012) Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer 48(4):441–446. https://doi.org/10.1016/j.ejca.2011.11.036
Keek SA, Leijenaar RT, Jochems A, Woodruff HC (2018) A review on radiomics and the future of theranostics for patient selection inprecision medicine. Br J Radiol 91(1091):20170926. https://doi.org/10.1259/bjr.201709262
Volpe S, Mastroleo F, Krengli M, Jereczek-Fossa BA (2023) Quo vadis Radiomics? Bibliometric analysis of 10-year Radiomics journey. Eur Radiol 33(10):6736–6745. https://doi.org/10.1007/s00330-023-09645-6
Lambin P, Leijenaar RTH, Deist TM, Peerlings J, de Jong EEC, van Timmeren J, Sanduleanu S, Larue RTHM, Even AJG, Jochems A, van Wijk Y, Woodruff H, van Soest J, Lustberg T, Roelofs E, van Elmpt W, Dekker A, Mottaghy FM, Wildberger JE, Walsh S (2017) Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol 14(12):749–762. https://doi.org/10.1038/nrclinonc.2017.141
Shahzadi I, Zwanenburg A, Lattermann A, Linge A, Baldus C, Peeken JC, Combs SE, Diefenhardt M, Rödel C, Kirste S, Grosu AL, Baumann M, Krause M, Troost EGC, Löck S (2022) Analysis of MRI and CT-based radiomics features for personalized treatment in locally advanced rectal cancer and external validation of published radiomics models. Sci Rep 12(1):10192. https://doi.org/10.1038/s41598-022-13967-8
Ferlay J, Colombet M, Soerjomataram I, Dyba T, Randi G, Bettio M, Gavin A, Visser O, Bray F (2018) Cancer incidence and mortality patterns in Europe: estimates for 40 countries and 25 major cancers in 2018. Eur J Cancer 103:356–387. https://doi.org/10.1016/j.ejca.2018.07.005
Calabrese L, Bruschini R, Giugliano G, Ostuni A, Maffini F, Massaro MA, Santoro L, Navach V, Preda L, Alterio D, Ansarin M, Chiesa F (2011) Compartmental tongue surgery: long term oncologic results in the treatment of tongue cancer. Oral Oncol 47(3):174–179. https://doi.org/10.1016/j.oraloncology.2010.12.006
Brenner H (2002) Long-term survival rates of cancer patients achieved by the end of the 20th century: a period analysis. Lancet 360(9340):1131–1135. https://doi.org/10.1016/S0140-6736(02)11199-8
Zanoni DK, Montero PH, Migliacci JC, Shah JP, Wong RJ, Ganly I, Patel SG (2019) Survival outcomes after treatment of cancer of the oral cavity (1985–2015). Oral Oncol 90:115–121. https://doi.org/10.1016/j.oraloncology.2019.02.001
Mossinelli C, Tagliabue M, Ruju F, Cammarata G, Volpe S, Raimondi S, Zaffaroni M, Isaksson JL, Garibaldi C, Cremonesi M, Corso F, Gaeta A, Emili I, Zorzi S, Alterio D, Marvaso G, Pepa M, De Fiori E, Maffini F, Preda L, Benazzo M, Jereczek-Fossa BA, Ansarin M (2023) The role of radiomics in tongue cancer: a new tool for prognosis prediction. Head Neck 45(4):849–861. https://doi.org/10.1002/hed.27299
Wang F, Tan R, Feng K, Hu J, Zhuang Z, Wang C, Hou J, Liu X (2022) Magnetic resonance imaging-based radiomics features associated with depth of invasion predicted lymph node metastasis and prognosis in tongue cancer. J Magn Reson Imaging 56(1):196–209. https://doi.org/10.1002/jmri.28019
Park H, Lim Y, Ko ES, Cho HH, Lee JE, Han BK, Ko EY, Choi JS, Park KW (2018) Radiomics signature on magnetic resonance imaging: association with disease-free survival in patients with invasive breast cancer. Clin Cancer Res 24(19):4705–4714. https://doi.org/10.1158/1078-0432.CCR-17-3783
Liu Z, Wang S, Dong D, Wei J, Fang C, Zhou X, Sun K, Li L, Li B, Wang M, Tian J (2019) The applications of radiomics in precision diagnosis and treatment of oncology: opportunities and challenges. Theranostics 9(5):1303–1322. https://doi.org/10.7150/thno.30309
Kramer AA, Zimmerman JE (2007) Assessing the calibration of mortality benchmarks in critical care: the Hosmer–Lemeshow test revisited. Crit Care Med 35(9):2052–2056. https://doi.org/10.1097/01.CCM.0000275267.64078.B0
Moskowitz CS, Welch ML, Jacobs MA, Kurland BF, Simpson AL (2022) Radiomic analysis: study design, statistical analysis, and other bias mitigation strategies. Radiology 304(2):265–273. https://doi.org/10.1148/radiol.211597
Kazmierski M, Welch M, Kim S, McIntosh C, Rey-McIntyre K, Huang SH, Patel T, Tadic T, Milosevic M, Liu FF, Ryczkowski A, Kazmierska J, Ye Z, Plana D, Aerts HJWL, Kann BH, Bratman SV, Hope AJ, Haibe-Kains B (2023) Multi-institutional prognostic modeling in head and neck cancer: evaluating impact and generalizability of deep learning and radiomics. Cancer Res Commun 3(6):1140–1151. https://doi.org/10.1158/2767-9764.CRC-22-0152
Edge SB, Compton CC (2010) The American Joint Committee on Cancer: the 7th edition of the AJCC cancer staging manual and the future of TNM. Ann Surg Oncol 17(6):1471–4. https://doi.org/10.1245/s10434-010-0985-4
Brierley JD, Gospodarowicz MK, Wittekind C (eds) (2016) TNM classification of malignant tumours, 8th edn. Wiley-Blackwell, Hoboken
Alterio D, Preda L, Volpe S, Giannitto C, Riva G, KamgaPounou FA, Atac M, Giugliano G, Bruschini R, Ferrari A, Marvaso G, Cossu Rocca M, Verri E, Rossi D, Bellomi M, Jereczek-Fossa BA, Orecchia R, Ansarin M (2020) Impact of a dedicated radiologist as a member of the head and neck tumour board: a single-institution experience. Acta Otorhinolaryngol Ital 40(1):26–32. https://doi.org/10.14639/0392-100X-N0326
NCCN Guidelines Version 1.2024—October 9, 2023 Head and neck cancers. Accessed Oct 2023. https://www.nccn.org/professionals/physician_gls/pdf/head-and-neck.pdf9
Orlhac F, Lecler A, Savatovski J, Goya-Outi J, Nioche C, Charbonneau F, Ayache N, Frouin F, Duron L, Buvat I (2021) How can we combat multicenter variability in MR radiomics? Validation of a correction procedure. Eur Radiol 31(4):2272–2280. https://doi.org/10.1007/s00330-020-07284-9
Ansarin M, Bruschini R, Navach V, Giugliano G, Calabrese L, Chiesa F, Medina JE, Kowalski LP, Shah JP (2019) Classification of glossectomies: proposal for tongue cancer resections. Head Neck 41(3):821–827. https://doi.org/10.1002/hed.25466
van der Hulst HJ, Jansen RW, Vens C, Bos P, Schats W, de Jong MC, Martens RM, Bodalal Z, Beets-Tan RGH, van den Brekel MWM, de Graaf P, Castelijns JA (2023) The prediction of biological features using magnetic resonance imaging in head and neck squamous cell carcinoma: a systematic review and meta-analysis. Cancers 15(20):5077. https://doi.org/10.3390/cancers15205077
Ying Y, Xiaofeng T (2022) Value of MRI radiomics for predicting occult cervical lymph nodes metastases in early-stage oral tongue squamous cell carcinoma. Chin J Radiol 12:30–35
Elhalawani H, Fleming CW, Rao S, Chen Y, Hobbs B, Ma T, Kuzmin G, Woody NM, Joshi NP, Chute D, Ku J (2020) Prognostic impact of baseline and delta tumor radiomics features in patients with oropharyngeal cancer (OPC) treated with adaptive image guided radiotherapy (IGRT). Int J Radiat Oncol Biol Phys 108(3):e313
Traverso A, Hosni-Abdalaty A, Hasan M, Kim J, Ringash J, Cho J, Bratman S, Bailey A, Waldron J, Welch M, Irish J (2019) Investigating radiomics to predict positive lymph nodes in oral cavity squamous cell carcinoma (OSCC). Med Phys 46(6):e295
Elhalawani H, Volpe S, Barua S, Mohamed A, Yang P, Ng SP, Lai S, Hutcheson KA, Gunn GB, Rao A, Fuller CD (2018) Exploration of an early imaging biomarker of osteoradionecrosis in oropharyngeal cancer patients: case-control study of the temporal changes of mandibular radiomics features. Int J Radiat Oncol Biol Phys 100(5):1363–1364
Kirienko M, Cozzi L, Rossi A, Voulaz E, Antunovic L, Fogliata A, Chiti A, Sollini M (2018) Ability of FDG PET and CT radiomics features to differentiate between primary and metastatic lung lesions. Eur J Nucl Med Mol Imaging 45(10):1649–1660. https://doi.org/10.1007/s00259-018-3987-2
Abdel Razek AAK, Mansour M, Kamal E, Mukherji SK (2022) MR imaging of oral cavity and oropharyngeal cancer. Magn Reson Imaging Clin N Am 30(1):35–51. https://doi.org/10.1016/j.mric.2021.07.002
Lin Y, Tang M, Liu Y, Jiang M, He S, Zeng D, Cui MY (2022) A narrative review on machine learning in diagnosis and prognosis prediction for tongue squamous cell carcinoma. Transl Cancer Res 11(12):4409–4415. https://doi.org/10.21037/tcr-22-1669
Kubo K, Kawahara D, Murakami Y, Takeuchi Y, Katsuta T, Imano N, Nishibuchi I, Saito A, Konishi M, Kakimoto N, Yoshioka Y, Toratani S, Ono S, Ueda T, Takeno S, Nagata Y (2022) Development of a radiomics and machine learning model for predicting occult cervical lymph node metastasis in patients with tongue cancer. Oral Surg Oral Med Oral Pathol Oral Radiol 134(1):93–101. https://doi.org/10.1016/j.oooo.2021.12.122
Zhong YW, Jiang Y, Dong S, Wu WJ, Wang LX, Zhang J, Huang MW (2022) Tumor radiomics signature for artificial neural network-assisted detection of neck metastasis in patient with tongue cancer. J Neuroradiol 49(2):213–218. https://doi.org/10.1016/j.neurad.2021.07.006
Konishi M, Kakimoto N (2023) Radiomics analysis of intraoral ultrasound images for prediction of late cervical lymph node metastasis in patients with tongue cancer. Head Neck 45(10):2619–2626. https://doi.org/10.1002/hed.27487
Kudoh T, Haga A, Kudoh K, Takahashi A, Sasaki M, Kudo Y, Ikushima H, Miyamoto Y (2023) Radiomics analysis of [18F]-fluoro-2-deoxyglucose positron emission tomography for the prediction of cervical lymph node metastasis in tongue squamous cell carcinoma. Oral Radiol 39(1):41–50. https://doi.org/10.1007/s11282-022-00600-7. (Erratum In: Oral Radiol. 2022 Jun 11)
Pan X, Zhang T, Yang Q, Yang D, Rwigema JC, Qi XS (2020) Survival prediction for oral tongue cancer patients via probabilistic genetic algorithm optimized neural network models. Br J Radiol 93(1112):20190825. https://doi.org/10.1259/bjr.20190825
Ren J, Qi M, Yuan Y, Tao X (2021) Radiomics of apparent diffusion coefficient maps to predict histologic grade in squamous cell carcinoma of the oral tongue and floor of mouth: a preliminary study. Acta Radiol 62(4):453–461. https://doi.org/10.1177/0284185120931683
Yu B, Huang C, Xu J, Liu S, Guan Y, Li T, Zheng X, Ding J (2021) Prediction of the degree of pathological differentiation in tongue squamous cell carcinoma based on radiomics analysis of magnetic resonance images. BMC Oral Health 21(1):585. https://doi.org/10.1186/s12903-021-01947-9
Corti A, De Cecco L, Cavalieri S, Lenoci D, Pistore F, Calareso G, Mattavelli D, de Graaf P, Leemans CR, Brakenhoff RH, Ravanelli M, Poli T, Licitra L, Corino V, Mainardi L (2023) MRI-based radiomic prognostic signature for locally advanced oral cavity squamous cell carcinoma: development, testing and comparison with genomic prognostic signatures. Biomark Res 11(1):69. https://doi.org/10.1186/s40364-023-00494-5
Jha AK, Sherkhane UB, Mthun S, Jaiswar V, Purandare N, Prabhash K, Wee L, Rangarajan V, Dekker A (2023) External validation of robust radiomic signature to predict 2-year overall survival in non-small-cell lung cancer. J Digit Imaging 36(6):2519–2531. https://doi.org/10.1007/s10278-023-00835-8
Leemans CR, Snijders PJF, Brakenhoff RH (2018) The molecular landscape of head and neck cancer. Nat Rev Cancer 18(5):269–282. https://doi.org/10.1038/nrc.2018.11. (Erratum in: Nat Rev Cancer. 2018 Oct;18(10):662)
Lo Casto A, Cannella R, Taravella R, Cordova A, Matta D, Campisi G, Attanasio M, Rinaldi G, Rodolico V (2022) Diagnostic and prognostic value of magnetic resonance imaging in the detection of tumor depth of invasion and bone invasion in patients with oral cavity cancer. Radiol Med 127(12):1364–1372. https://doi.org/10.1007/s11547-022-01565-7
Park JE, Kim HS, Kim D, Park SY, Kim JY, Cho SJ, Kim JH (2020) A systematic review reporting quality of radiomics research in neuro-oncology: toward clinical utility and quality improvement using high-dimensional imaging features. BMC Cancer 20(1):29. https://doi.org/10.1186/s12885-019-6504-5
Jethanandani A, Lin TA, Volpe S, Elhalawani H, Mohamed ASR, Yang P, Fuller CD (2018) Exploring applications of radiomics in magnetic resonance imaging of head and neck cancer: a systematic review. Front Oncol 8:131. https://doi.org/10.3389/fonc.2018.00131
Zhao B, Tan Y, Tsai WY, Qi J, Xie C, Lu L, Schwartz LH (2016) Reproducibility of radiomics for deciphering tumor phenotype with imaging. Sci Rep 6:23428. https://doi.org/10.1038/srep23428
Salmanpour MR, Hosseinzadeh M, Rezaeijo SM, Rahmim A (2023) Fusion-based tensor radiomics using reproducible features: application to survival prediction in head and neck cancer. Comput Methods Programs Biomed 240:107714. https://doi.org/10.1016/j.cmpb.2023.107714
Noortman WA, Aide N, Vriens D, Arkes LS, Slump CH, Boellaard R, Goeman JJ, Deroose CM, Machiels JP, Licitra LF, Lhommel R, Alessi A, Woff E, Goffin K, Le Tourneau C, Gal J, Temam S, Delord JP, van Velden FHP, de Geus-Oei LF (2023) Development and external validation of a pet radiomic model for prognostication of head and neck cancer. Cancers 15(10):2681. https://doi.org/10.3390/cancers15102681
Funding
The institution of some authors (European Institute of Oncology) was partially supported by the Italian Ministry of Health with Progetto di Eccellenza, Ricerca Corrente and 5x1000 funds.
Author information
Authors and Affiliations
Contributions
All persons designated as authors participated sufficiently in the work to take public responsibility for the content of the manuscript: conception, design, analysis and interpretation of data, draft of the manuscript, revision for important intellectual content and final approval of the version to be published.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflicts of interest.
Ethical standards
This article does not contain any prospective studies with human participants or animals performed by any of the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Tagliabue, M., Ruju, F., Mossinelli, C. et al. The prognostic role of MRI-based radiomics in tongue carcinoma: a multicentric validation study. Radiol med 129, 1369–1381 (2024). https://doi.org/10.1007/s11547-024-01859-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11547-024-01859-y