Abstract
Objective
To identify the magnetic resonance imaging (MRI) features associated with epidermal growth factor (EGFR) expression level in lower grade gliomas using radiomic analysis.
Methods
270 lower grade glioma patients with known EGFR expression status were randomly assigned into training (n=200) and validation (n=70) sets, and were subjected to feature extraction. Using a logistic regression model, a signature of MRI features was identified to be predictive of the EGFR expression level in lower grade gliomas in the training set, and the accuracy of prediction was assessed in the validation set.
Results
A signature of 41 MRI features achieved accuracies of 82.5% (area under the curve [AUC] = 0.90) in the training set and 90.0% (AUC = 0.95) in the validation set. This radiomic signature consisted of 25 first-order statistics or related wavelet features (including range, standard deviation, uniformity, variance), one shape and size-based feature (spherical disproportion), and 15 textural features or related wavelet features (including sum variance, sum entropy, run percentage).
Conclusions
A radiomic signature allowing for the prediction of the EGFR expression level in patients with lower grade glioma was identified, suggesting that using tumour-derived radiological features for predicting genomic information is feasible.
Key Points
• EGFR expression status is an important biomarker for gliomas.
• EGFR in lower grade gliomas could be predicted using radiogenomic analysis.
• A logistic regression model is an efficient approach for analysing radiomic features.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
Gliomas are the most common primary brain malignancies in adults. Lower grade gliomas refer to World Health Organization (WHO) grade II and III gliomas [1], and the response to treatment and survival of these patients differs significantly due to genetic differences between lower grade gliomas [2]. Recently, the use of individualised cancer therapy has attracted considerable interest and has been extensively investigated. Molecular biomarkers reflect the genetic background of the tumour and provide potential targets for the development of individualised treatments for patients with gliomas.
Epidermal growth factor receptor (EGFR) is a key glioma molecular biomarker. EGFR overexpression can promote malignant proliferation of tumour cells, and a number of studies have focused on suppressing malignant proliferation by inhibiting its activity [3]. Nimotuzumab, one EGFR-targeted drug, has already been approved for the treatment of oesophageal cancer [4], advanced head and neck cancer [5,6,7] and glioma [8] in 30 countries worldwide [9]. Studies have shown that the EGFR variant III mutation may significantly influence tumour microenvironment changes and invasion. Hence, detection of EGFR overexpression is especially important for the personalised treatment of glioma patients. Conventional methods for detecting the EGFR expression status include immunohistochemistry and polymerase chain reaction, which are invasive assays based on craniotomy.
Magnetic resonance imaging (MRI) is capable of detecting brain malignancy characteristics non-invasively. Preliminary studies suggest that MRI features (e.g. relative cerebral blood volume and tumour blood flow) are associated with EGFR expression in gliomas [10, 11]. Radiogenomics is a new field investigating the relationships between radiological features and genomic data [12] and allows radiological data to be transformed into high-dimensional information and features, thereby greatly enriching the quantitative information of the radiological features. Consequently, this helps analyse the relationships between radiological features and gene phenotypes. For instance, the O6-methylguanine-DNA-methyltransferase methylation status in glioblastoma can be predicted by a set of radiological texture features [13, 14].
In the present study, the texture features of gliomas were extracted from MRI data, and radiological features predicting the EGFR expression status were identified.
Materials and methods
Patients
A total of 270 patients with pathologically confirmed grade II or III glioma and available MRI and EGFR expression status data who were treated at our hospital between August 2005 and August 2012 were included in this study. The inclusion criteria were as follows: (1) pathologically confirmed lower grade glioma with no previous diagnosis of any type of brain tumour; (2) available EGFR expression status based on immunohistochemistry; (3) high-resolution pre-surgical T2-weighted MR images; (4) available clinical characteristics. Patients who had undergone preoperative treatment, those with other pathological types, or those with incomplete EGFR, imaging or clinical data were excluded. The patients were randomly assigned into training (n=200) and validation sets (n=70). The study was reviewed and approved by the institutional review board.
Immunohistochemistry of EGFR
Identification of EGFR amplification was conducted using standard immunohistochemical methods. First, the glioma specimens were fixed in formalin, after which they were routinely processed and paraffin-embedded. Five-micron-thick sections were created, and antibodies specific for EGFR (Santa Cruz Biotechnology, Santa Cruz, CA, USA) were used to perform the immunohistochemical staining. All samples were analysed by two pathologists with experience in scoring central nervous system tumours. Less than 5% positive staining was identified as negative expression, while 6–100% positive staining was identified as positive expression. High and low EGFR expressions were defined as >30% and ≤30% positive cells, respectively [15].
Tumour segmentation
T2-weighted MR images were used for the extraction of radiomics features, as these images are well accepted in the identification of abnormal hyperintense signals that represent the involved regions of low-grade gliomas [16,17,18]. Most T2-weighted images were obtained with a Trio 3.0T scanner (Siemens, Erlangen, Germany). The T2-weighted image parameters were as follows: TR = 5800 ms; TE = 110 ms; flip angle = 150°; field of view = 240×188 mm2; voxel size = 0.6×0.6×5.0 mm3; matrix = 384×300. Lesions were delineated on the MR image by two experienced neurosurgeons who were blinded to the clinical information of the patients using MRIcron software (http://www.mccauslandcenter.sc.edu/mricro/mricron). Abnormal hyperintense signals on the T2-weighted images were identified as the tumour areas, while care was taken to avoid the cerebrospinal fluid signal, which may mask the tumour margins. A senior neuroradiologist re-evaluated the lesions afterwards. If the discrepancy was ≥5%, the senior neuroradiologist decided on the tumour borders [19].
Extraction of texture features
Quantitative radiological feature extraction was conducted, as previously described [20]. 431 texture features that could be assigned into four groups were extracted from each tumour (Fig. 1). Group 1 (first order statistics) consisted of 14 descriptors that quantitatively delineate the distribution of voxel intensities within the MR image through commonly used and basic metrics. Group 2 (shape- and size-based features) contained eight three-dimensional features that reflect the shape and size of the tumour region. Calculated from grey level run-length and grey level co-occurrence texture matrices, 33 textural features that can quantify intra-tumour heterogeneity differences were classified into group 3 (texture features). Finally, group 4 (wavelet features) included the intensity and texture features derived from wavelet transformation of the original image (group 1 and group 3 features). These four groups of features are listed in Supplementary Table 1.
Texture-based EGFR prediction
A logistic regression model (Y = X1×β1 + X2×β2 + … + Xn×βn + k) was created using MATLAB (2014a) software. Here, Y is the estimated value of EGFR status (high expression is defined as 1, while low expression is defined as 0); X represents the value of each radiological feature; β is the unknown coefficient; and k is the unknown constant. Receiver operating characteristic (ROC) curve analysis was conducted to evaluate the prediction results. The logistic model was repeated and the variable that had the smallest predictive value for the EGFR expression level was excluded each time, until the smallest number of features that provided an area under the curve (AUC) of at least 0.9 remained. These radiomics features, selected from the training set, were defined as a signature and were subsequently evaluated in the validation set.
Statistics
The logistic regression model was established using MATLAB (2014a) software. The clinical characteristics of the two groups were compared using Student’s t test and the Chi-square test. The differences were considered significant at p < 0.05. ROC curve analysis was utilised to illustrate the prediction performance of the radiomic signature. The optimal cutoff value was selected as the point when the sensitivity plus specificity was maximal. The AUC and prediction accuracy were calculated in both the training and validation sets.
Results
Patient demographic characteristics
High EGFR expression was detected in 49.0% (98/200) of patients in the training set, and in 48.6% (34/70) in the validation set. No significant difference was found between patients with low or high EGFR expression with respect to age, sex or WHO grade in both the training and validation sets (p > 0.05). The detailed patient demographic characteristics are listed in Table 1.
Texture analysis
The texture features were extracted through the method described above, resulting in 431 texture features being obtained from each patient, including 14 first-order statistics, eight shape- and size-based features, 33 textural features, and 376 wavelet features derived from the first-order statistics (group 1) and textural features (group 3). These are shown in Supplementary Table 1.
Texture-based EGFR prediction
Training set
The logistic regression model was utilised to perform linear regression of the 431 texture features of the 200 patients in the training set. Variables that contributed less to the regression equation than others were excluded one by one. Next, the estimated value of the regression equation was used to predict the EGFR expression status, and a ROC curve was delineated. When there were 41 textural features left, the distribution of the estimated values of patients could be differentiated significantly (Fig. 2a). In the ROC curve analysis, the AUC was 0.90, and the optimal cutoff value (0.3934) exhibited a sensitivity, specificity and accuracy of 94.9%, 70.6% and 82.5%, respectively (Fig. 2b).
The identified 41 EGFR-related radiological features could be regarded as a radiomic signature, comprising 25 first-order statistics or related wavelet features (such as range, standard deviation, uniformity, variance), one shape- and size-based feature (spherical disproportion), and 15 textural features or related wavelet features (such as sum variance, sum entropy, run percentage). All of these features are listed in Table 2.
Validation set
Next, these 41 features were applied to the validation set to predict the EGFR expression status. As a result, the distribution of the estimated values could be differentiated with high efficiency (Fig. 3a). In the ROC curve analysis, which was conducted for the evaluation of the prediction efficiency, the AUC was 0.95, and the optimal cutoff value (0.4334) exhibited a sensitivity, specificity and accuracy of 94.1%, 86.1% and 90.0%, respectively (Fig. 3b).
Discussion
In this study, EGFR-associated radiomic features were studied based on a large-scale imaging and genetics database. Through radiological feature extraction, feature selection and model construction, a set of radiological signatures that could reflect the EGFR expression level was established. A radiogenomic signature allowing for the prediction of the EGFR expression level in patients with lower grade glioma was identified. In the future, using non-invasive radiological information could provide crucial clinical information for the development of individualised treatment targeting EGFR in the future.
Radiogenomics is an emerging field that explores the associations between radiological features on MRI and genetic characteristics of tumours. Many oncological studies have successfully linked radiological features with gene expression patterns [21]. For example, an 11-feature radiomic signature that could successfully stratify the survival of glioblastoma patients was selected from 12,190 features extracted from multiparametric and multiregional tumour volumes [22]. Additionally, using a random forest algorithm and a total of 2,970 features extracted from multimodal MR images of glioma patients, the isocitrate dehydrogenase mutation status of the patients was effectively predicted in one previous study [23]. Furthermore, radiogenomics can be applied to many other types of tumours. In an early classical study, neural network machine learning was used in patients with liver cancer to determine the association between texture features extracted from enhanced CT scans and gene expression patterns [24].
Further, arterial spin-labelling cerebral blood flow maps have been found to relate to EGFR variant III expression in gliomas [25]; a higher median relative cerebral blood volume has been shown to be associated with high EGFR expression [10]; maximum tumour blood flow and relative tumour blood flow have been shown to significantly and positively correlate with EGFR expression status [11]; and relative tumour blood volume, an MR-perfusion-weighted imaging-derived parameter, has been found to be significantly related to the EGFR variant III expression status [26]. Taken together, these findings suggest that the EGFR expression status and MRI features are closely related. These previous studies, however, only indicated that these parameters were related to EGFR expression instead of predicting it, owing to the lack of independent validation sets.
In the current study, 41 out of a total 431 texture features were screened by using a logistic regression model. Consequently, the EGFR expression status could be predicted efficiently in the training set. Next, these 41 features were applied to the validation set, and the equation derived from the linear regression was found to still be effective in predicting the expression of EGFR. These results indicated that these 41 texture features correlate with the EGFR expression in lower grade gliomas. This is an improvement over the previous studies analysing individual imaging parameters associated with the EGFR expression status.
The 41 texture features obtained by the logistic regression model are also related to the expression status of EGFR theoretically. For example, as one component of this signature, Spherical Disproportion = A/4πR2. Here, A refers to the surface area of the tumour, while R is the radius of the sphere that has the same volume as the tumour [20]. This index quantitates the differentiation between the tumour shape and sphericity. Recently, EGFR expression status was demonstrated to correlate with this shape characteristic in lung cancers, with this shape characteristic showing a strong ability to identify patients with the strongest EGFR immunostaining.
Previous authors have speculated that the malignant proliferation of tumour cells and angiogenesis caused by high EGFR expression might contribute to the increased shape irregularity [27]. Additionally, as two group 1 features in this signature, the index Range refers to the range of voxel intensity values of the tumour, and the Variance refers to the voxel intensity value variability of the tumour. These two indicators may be associated with the non-uniformity levels within the tumour. Moreover, it has recently been reported that, in a murine glioma model, high EGFR expression related to increased tumour cell proliferation, reduced tumour cell adhesion to the extracellular substrate, and altered progenitor cell phenotype, suggesting that the EGFR expression status and tumour heterogeneity are indeed closely related [28]. Therefore, the Range and Variance may reflect the EGFR expression status. Of note, as glioblastomas (WHO grade IV) have distinct genetic characteristics from lower grade gliomas (WHO grade II and III), the radiomics features that associate with the EGFR expression level in glioblastomas are an interesting topic for further investigations.
A logistic regression model is a regression method in which the variables that contribute little to the linear model are eliminated. It is a simple and efficient method for reducing dimensionality. In this research, a single-mode quantitative imaging sequence was used for the prediction of the EGFR expression level. This radiomic-based prediction was found to be more effective than prediction based on traditional qualitative imaging, as the latter is descriptive rather than quantitative.
Lastly, the EGFR overexpression level in lower grade gliomas remains controversial. A previous study showed that EGFR amplification (a minimum average EGFR gene copy number of 7 per genome) was identified in 40–50% of glioblastomas and in about 10% of anaplastic gliomas, but was rare in WHO grade II gliomas [29]. On the other hand, another study showed that EGFR overexpression (defined as >25% positive cells on immunohistochemistry) was observed in 40% of grade II gliomas and 78.4% of grade III gliomas [30]; these proportions of high EGFR expression were similar to the proportion detected in the current study. The inconsistencies in the EGFR expression level may result from differences in the patient populations and the definition of EGFR expression used among the different previous studies.
We recognise some limitations to this study. First, our model was generated based on retrospectively collected data. The current study, therefore, needs to be prospectively evaluated. Second, it should be noted that multi-modal radiological data were not included in this research, and radiological information reflecting tumour perfusion (such as perfusion-weighted imaging, arterial spin-labelling, etc.) will be added for the optimisation of this prediction model in the future.
Abbreviations
- AUC:
-
Area under the curve
- EGFR:
-
Epidermal growth factor receptor
- MRI:
-
Magnetic resonance imaging
- ROC:
-
Receiver operating characteristic
References
Chen B, Liang T, Yang P et al (2016) Classifying lower grade glioma cases according to whole genome gene expression. Oncotarget 7:74031–74042
Fan X, Wang Y, Zhang C et al (2016) ADAM9 Expression Is associate with glioma tumor grade and histological type, and acts as a prognostic factor in lower-grade gliomas. Int J Mol Sci 17
Davis SC, Samkoe KS, O'Hara JA et al (2010) MRI-coupled fluorescence tomography quantifies EGFR activity in brain tumors. Acad Radiol 17:271–276
Ramos-Suzarte M, Lorenzo-Luaces P, Lazo NG et al (2012) Treatment of malignant, non-resectable, epithelial origin esophageal tumours with the humanized anti-epidermal growth factor antibody nimotuzumab combined with radiation therapy and chemotherapy. Cancer Biol Ther 13:600–605
Reddy BK, Lokesh V, Vidyasagar MS et al (2014) Nimotuzumab provides survival benefit to patients with inoperable advanced squamous cell carcinoma of the head and neck: a randomized, open-label, phase IIb, 5-year study in Indian patients. Oral Oncol 50:498–505
Rodríguez MO, Rivero TC, Del Castillo Bahi R et al (2014) Nimotuzumab plus radiotherapy for unresectable squamous-cell carcinoma of the head and neck. Cancer Biol Ther 9:343–349
Basavaraj C, Sierra P, Shivu J, Melarkode R, Monte E, Nair P (2014) Nimotuzumab with chemoradiation confers a survival advantage in treatment-naïve head and neck tumors over expressing EGFR. Cancer Biol Ther 10:673–681
Chong DQ, Toh XY, Ho IA et al (2015) Combined treatment of Nimotuzumab and rapamycin is effective against temozolomide-resistant human gliomas regardless of the EGFR mutation status. BMC Cancer 15:255
Xu S, Ramos-Suzarte M, Bai X, Xu B (2016) Treatment outcome of nimotuzumab plus chemotherapy in advanced cancer patients: a single institute experience. Oncotarget 7:33391–33407
Gupta A, Young RJ, Shah AD et al (2015) Pretreatment Dynamic Susceptibility Contrast MRI Perfusion in Glioblastoma: Prediction of EGFR Gene Amplification. Clin Neuroradiol 25:143–150
Yoo RE, Choi SH, Cho HR et al (2013) Tumor blood flow from arterial spin labeling perfusion MRI: a key parameter in distinguishing high-grade gliomas from primary cerebral lymphomas, and in predicting genetic biomarkers in high-grade gliomas. J Magn Reson Imaging 38:852–860
Bai HX, Lee AM, Yang L et al (2016) Imaging genomics in cancer research: limitations and promises. Br J Radiol 89:20151030
Drabycz S, Roldan G, de Robles P et al (2010) An analysis of image texture, tumor location, and MGMT promoter methylation in glioblastoma using magnetic resonance imaging. Neuroimage 49:1398–1405
Korfiatis P, Kline TL, Coufalova L et al (2016) MRI texture features as biomarkers to predict MGMT methylation status in glioblastomas. Med Phys 43:2835
Wang Y, Fan X, Zhang C et al (2014) Identifying radiographic specificity for phosphatase and tensin homolog and epidermal growth factor receptor changes: a quantitative analysis of glioblastomas. Neuroradiology 56:1113–1120
Ricard D, Kaloshi G, Amiel-Benouaich A et al (2007) Dynamic history of low-grade gliomas before and after temozolomide treatment. Ann Neurol 61:484–490
Kinoshita M, Sakai M, Arita H et al (2016) Introduction of High Throughput Magnetic Resonance T2-Weighted Image Texture Analysis for WHO Grade 2 and 3 Gliomas. PLoS One 11, e0164268
Wang Y, Fan X, Li H et al (2015) Tumor border sharpness correlates with HLA-G expression in low-grade gliomas. J Neuroimmunol 282:1–6
Wang YY, Zhang T, Li SW et al (2015) Mapping p53 mutations in low-grade glioma: a voxel-based neuroimaging analysis. AJNR Am J Neuroradiol 36:70–76
Aerts HJ, Velazquez ER, Leijenaar RT et al (2014) Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun 5:4006
Herold CJ, Lewin JS, Wibmer AG et al (2016) Imaging in the Age of Precision Medicine: Summary of the Proceedings of the 10th Biannual Symposium of the International Society for Strategic Studies in Radiology. Radiology 279:226–238
Kickingereder P, Burth S, Wick A et al (2016) Radiomic Profiling of Glioblastoma: Identifying an Imaging Predictor of Patient Survival with Improved Performance over Established Clinical and Radiologic Risk Models. Radiology 280:880–889
Zhang B, Chang K, Ramkissoon S et al (2016) Multimodal MRI features predict isocitrate dehydrogenase genotype in high-grade gliomas. Neuro-Oncology. doi:10.1093/neuonc/now121:now121
Segal E, Sirlin CB, Ooi C et al (2007) Decoding global gene expression programs in liver cancer by noninvasive imaging. Nat Biotechnol 25:675–680
Qiao XJ, Ellingson BM, Kim HJ et al (2015) Arterial spin-labeling perfusion MRI stratifies progression-free survival and correlates with epidermal growth factor receptor status in glioblastoma. AJNR Am J Neuroradiol 36:672–677
Tykocinski ES, Grant RA, Kapoor GS et al (2012) Use of magnetic perfusion-weighted imaging to determine epidermal growth factor receptor variant III expression in glioblastoma. Neuro Oncol 14:613–623
Apostolova I, Ego K, Steffen IG et al (2016) The asphericity of the metabolic tumour volume in NSCLC: correlation with histopathology and molecular markers. Eur J Nucl Med Mol Imaging. doi:10.1007/s00259-016-3452-z
Lindberg OR, McKinney A, Engler JR et al (2016) GBM heterogeneity as a function of variable epidermal growth factor receptor variant III activity. Oncotarget. doi:10.18632/oncotarget.12600
Hofer S, Lassman AB (2010) Molecular markers in gliomas: impact for the clinician. Target Oncol 5:201–210
Hu X, Miao W, Zou Y, Zhang W, Zhang Y, Liu H (2013) Expression of p53, epidermal growth factor receptor, Ki-67 and O6-methylguanine-DNA methyltransferase in human gliomas. Oncol Lett 6:130–134
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Guarantor
The scientific guarantor of this publication is Tao Jiang.
Conflict of interest
The authors of this manuscript declare no relationships with any companies whose products or services may be related to the subject matter of the article.
Funding
This study has received funding by the National Natural Science Foundation of China (No. 81601452).
Statistics and biometry
Mr Kaibin Xu kindly provided statistical advice for this manuscript.
Informed consent
Written informed consent was not required for this study because the data were collected retrospectively in the study.
Ethical approval
Institutional review board approval was obtained.
Methodology
• retrospective
• diagnostic or prognostic study
• performed at one institution
Additional information
Yiming Li and Xing Liu contributed equally to this work
Electronic supplementary material
Below is the link to the electronic supplementary material.
Supplementary Table 1
(DOCX 15 kb)
Rights and permissions
About this article
Cite this article
Li, Y., Liu, X., Xu, K. et al. MRI features can predict EGFR expression in lower grade gliomas: A voxel-based radiomic analysis. Eur Radiol 28, 356–362 (2018). https://doi.org/10.1007/s00330-017-4964-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00330-017-4964-z