Introduction

The development of the human cerebrum is a carefully orchestrated process whereby neuronal precursor cells migrate radially from the stem cell niche in the subventricular zone (SVZ) [1]. During neural development, SVZ is the site of extraordinary proliferation where nearly a quarter of a million new neural precursor cells are generated per minute [2]. As the neural precursors migrate centrifugally from the SVZ, they differentiate into distinct cell types and lose the capacity for self-renewal and multi-potency. Waves of radially migrating neural precursor and differentiated cells ultimately populate the cerebrum, forming layers of exquisitely intricate cyto-architecture [3]. After completion of neural development, the SVZ continues to maintain a niche that sustains neural stem cells, which continue to radially migrate and differentiate. In this way, cells located in proximity to the SVZ niche are more likely to harbor stem-like properties relative to those located distant to the SVZ.

The unique SVZ “stem” niche harbors implications related to glioblastoma, the most common form of adult brain cancer [4]. Many investigators have suggested that glioblastomas located in proximity to the SVZ may have arisen from the populations of cells that harbor stem or stem-like properties and retained these properties during neoplastic transformation. Supporting this hypothesis, some studies have reported that glioblastomas contacting the SVZ exhibit cancer stem-cell like properties, including aggressive clinical course [5], increased likelihood of recurrence [6], and poor survival [7,8,9]. However, these results have not been reproduced by others investigators [10, 11]. In general, these studies involved limited patient numbers and are confounded by the subjectivity in the determination of SVZ contact. Here, we utilized Iterative Probabilistic Voxel Labeling (IPVL), an automated tumor segmentation algorithm developed in our laboratory [12] to quantitatively determine glioblastoma location relative to the SVZ in 217 The Cancer Imaging Atlas (TCIA) patients. We then tested whether glioblastoma proximity to the SVZ is associated with survival pattern or gene signatures associated with the cancer stem cell state.

Methods

Data acquisition and Subtype Classification

In total, 217 subjects with artifact-free pre-operative T1 weighted MR images were downloaded from the TCIA (https://cancerimagingarchive.net) in November 2014. Level 3 probe collapsed Messenger RNA (mRNA) expression data: Affymetrix HT-HG-U133A GeneChip and RNAseq, was downloaded for available patients via the TCGA Data Portal (https://tcga-data.nci.nih.gov/tcga/). Corresponding copy number variation (CNV) and single nucleotide polymorphism (SNPs) were also down-loaded and analyzed as previously described [13]. Boxplot analysis was employed to discard outliers in the SNP and CNV data. Tumors with extreme CNVs in more than 1% of their genes are excluded in this context. Affymetrix expression data were normalized by robust multichip average (RMA) [14]. RNAseq data were RSEM normalized [15]. When not already available in published literature, genomic subtypes were determined for subjects employing single sample Gene Set Enrichment Analysis (ssGSEA) as described previously [13, 16].

Image preprocessing and registration

MR images were corrected for gradient nonlinearity using previously described methods [17]. Images were additionally preprocessed and intensity corrected utilizing N4 bias field correction [18], then affine and nonlinear registrations were performed to the Montreal Neurological Institute (MNI) 152 nonlinear 1 mm3 template employing methods from Advanced Normalization Tools (ANTS) [19]. Visual inspection of resulting images was performed by three independent reviewers (T.C.S, J.M.T, & K.S.P) to ensure successful and accurate preprocessing was completed for all subjects.

Tumor segmentation

Contrasting-enhancing (CE) regions of tumors were segmented using the iterative probabilistic voxel labeling (IPVL) algorithm developed in the laboratory [12] and the enclosed volume was filled using a 3D morphological open-close operation. Centroid density maps were generated using a 15-mm centroid of each filled CE volume. Visual inspection by two independent operators (T.C.S, J.M.T) was performed to ensure adequacy of region filling and centroid estimation.

Subventricular zone distance

To measure SVZ distance (Fig. 1) with respect to each tumor’s filled CE volume, the MNI template’s lateral ventricle segmentation was used as a basis of comparison. SVZ distances were calculated by taking the mean of the distance from the nearest MNI template ventricular border to the centroid point within a subject’s CE volume. Statistical analyses of SVZ distance with respect to patient survival was performed within the statistics software package SPSS (IBM Corp, New York, United States). When stratification was required, tumors were classified by the median of the SVZ distance: low SVZ (SVZ distance < 19.23 mm) and high SVZ (SVZ distance 19.23 mm).

Fig. 1
figure 1

Graphical illustration of the SVZ distance measurement. SVZ distance is color coded so that blue indicates shorter SVZ distances while red indicates high SVZ distances

ssGSEA and statistical analyses

ssGSEA was employed to calculate normalized enrichment scores for gene signatures according to established methods [16], using custom software in MATLAB (Statistics Toolbox Release 2012b, The MathWorks Inc., Natick, MA). Statistical analysis of SVZ distance, gene expression, and boot-strap analysis was performed in MATLAB. All mRNA expression analyses were performed in the Affymetrix HT-HG-U133A RMA normalized data. Gene expression or ssGSEA scores were compared between SVZlow and SVZhigh tumors. To mitigate type I error, 100,000 permutation tests were performed during hypothesis testing in our sample to determine the likelihood of the outcome measured occurring by random chance [20]. When exploring the relationship between SVZ distances, ssGSEA scores and SNP burden, we employed a linear regression and reported statistical correlation and p-value.

Results

SVZ distance and the stem-cell state

Demographic data of the 217 TCIA patients are as previously described [21] and are shown in Table 1. The cohort consisted of 59.8% (n = 128) male with a median age of 59.3 ± 14.1. Images from these patients were segmented using Iterative Probabilistic Voxel Labeling (IPVL) [12]. The centroid for each CE tumor volume was calculated. Distances from the tumor centroid to the lateral ventricular border were calculated (Fig. 1).

Table 1 Demographic data

CD133 is a glycoprotein expressed in neuronal and glial stem cells that reside in the SVZ [4, 22]. It is a key surface marker for isolation of normal and glioblastoma stem cells [23]. High expression of CD133 was associated with self-renewal [24], increased DNA repair capacity [25], and other properties associated with the stem-cell state and aggressive clinical course [26]. In this context, we tested whether glioblastomas with centroid in proximity to the SVZ harbored increased CD133 expression. For this analysis, each patient’s glioblastoma is classified as “high” or “low” for SVZ distance based on the median value of SVZ for the entire cohort. We found that glioblastomas located in proximity to the SVZ (low SVZ score) are associated with higher CD133 expression (P = 0.006, Table 2).

Table 2 Gene expression differences between SVZ proximal and SVZ distal samples

To confirm this finding, we tested whether the observed SVZ association can be recapitulated using an independent gene signature associated with glioblastoma “stemness”. Murat et al. reported a stem cell-related signature through analysis of clinical glioblastoma specimens [27]. Supporting our hypothesis, the ssGSEA score for stemness inversely correlated with SVZ distance (R2 = -0.4714, P = 0.0012, Table 3). Meaning, glioblastomas with centroid in proximity of SVZ are more likely to harbor higher expression of the glioblastoma stem cell signature as reported by Murat et al. [27].

Table 3 ssGSEA signature enrichment as a function of SVZ distance

In aggregate, these analyses suggest that glioblastomas with centroid located in proximity to the SVZ are more likely to harbor gene expression patterns with a stem-cell state.

SVZ distance and DNA repair capacity

The predominant mechanism of TMZ resistance in glioblastoma involves the expression of the DNA repair protein O6-Methyl-Guanine MethylTransferase (MGMT). MGMT encodes an evolutionarily conserved DNA repair protein whose primary function is to restore TMZ damaged guanine to their native undamaged state. High expression of MGMT has been consistently associated with TMZ resistance in glioblastoma xenografts [28] and in independent clinical trials [7,8,9, 29,30,31]. Importantly, induction or maintenance of glioblastoma stem cell state and MGMT expression appear coupled [32, 33]. That is, glioblastomas in a stem-cell state are more likely to exhibit higher expression of MGMT [32]. In this context, we tested whether glioblastoma with low SVZ score (centroid in proximity to the SVZ) is associated with increased expression of MGMT. Our analysis indicated that glioblastoma with a centroid located in proximity to the SVZ (low SVZ score) exhibited higher expression of MGMT relative to those located farther from the SVZ (high SVZ score, P = 0.01, Table 2). Of note, our previous study demonstrated that MGMT expression level was associated with MGMT promoter methylation status in TCGA glioblastomas [34].

While MGMT plays a central role in TMZ resistance, MGMT-independent DNA repair processes also contribute to TMZ resistance [35]. Using a signature derived from clinical glioblastoma specimens, Gaspar et al. showed that an MGMT independent repair process contributed to clinical temozolomide response [35]. We tested whether mRNA expression of genes in this signature also correlated with the SVZ distance of glioblastomas. Like MGMT, our analysis showed that the ssGSEA score for this gene signature inversely correlated with SVZ distance. Specifically, lower signature expression (indicative of TMZ resistance) was found in glioblastoma in closer proximity to SVZ (R2 = 0.4073, P = 0.0061, Table 3).

In aggregate, these analyses support the notion that glioblastomas located in proximity to the SVZ are more likely to be associated with increased DNA repair capacity and TMZ resistance.

SVZ distance and single nucleotide polymorphism

Glioblastoma cells intrinsically harbor high levels of oxidative stress and endogenous DNA damages [36]. Failure to repair these processes lead to genomic instability, resulting in the accumulation of single nucleotide polymorphisms (SNPs) in the genome. Our results indicate that glioblastomas in proximity of SVZ harbored higher capacity for DNA repair against these damages. Since faithful repair of these damages prevents mutation formation, a corollary of the increased repair capacity for SVZ proximal glioblastomas is that they should harbor fewer SNPs relative to those distant to the SVZ. We tested these hypotheses.

We normalized glioblastoma SNPs into a range of + 2 or − 2 values and tested the linear dependence between SVZ distances for patients age older than 65 versus normalized SNP (Fig. 2). This cut-off was selected because age > 65 is a commonly accepted threshold for age related research (https://www.who.int/healthinfo/survey/ageingdefnolder/en/). Moreover, the survival patterns for glioblastoma patients age > 65 significantly differ from those age ≤ 65 [37, 38]. The Pearson’s Correlation scores for this analysis was 0.38 (P = 0.003). This analysis points to direct correlations between SNP frequency and SVZ distance. Such correlation was not seen in patients age < 65, supporting the literature that the clinical behavior of glioblastoma in younger patients is distinct from that of the elderly [37].

Fig. 2
figure 2

The correlation between normalized SNP frequency and normalized SVZ distance. The normalized SNP frequency per patient has an uphill linear relationship (correlation = 0.38) with per patient normalized SVZ distance measures

SVZ distance and survival

Previous studies have demonstrated that patients afflicted with glioblastomas that contact SVZ exhibit poor survival relative to those with tumor without such contacts [5]. These results are consistent with our finding that glioblastoma in proximity to the SVZ tend to harbor properties associated with the cancer stem cell state, since these properties tend to render tumor progression or recurrence more likely. We wish to determine whether the previously reported survival associations [7,8,9] can be recapitulated in our dataset. For this analysis, each patient’s glioblastoma is again classified as “high” or “low” for SVZ distance based on the median value of SVZ distance for the entire cohort. Our analysis showed that patients with SVZ proximal (or lower than median SVZ distance) glioblastomas were associated with poor overall survival and progression free survival in comparison to patients with SVZ distal glioblastomas (Fig. 3a, b). We next determined whether the association between SVZ distance and survival remained robust after controlling for patient age. In a multi-variate model, age and SVZ distance independently associated with overall survival (Table 4).

Fig. 3
figure 3

a Association of SVZ distance with overall survival. Kaplan Meier Survival curve demonstrating association of lower SVZ distance and reduced overall survival for all glioblastomas (p = 0.02, log rank, n = 217). SVZ stratified as high or low based on median value. b Association of SVZ distance with progression-free survival. Kaplan Meier Survival curve demonstrating association of lower SVZ distance and reduced progression-free survival for all glioblastomas (p = 0.03, log rank, n = 196). SVZ stratified as high or low based on median value. c SVZ distance and LVd independently associate with overall survival. SVZ and LVd stratified as high or low based on median value. p = 0.02 (log rank, n = 217)

Table 4 Multivariate model of overall survival incorporating SVZd and Age variables

Finally, we examined the interaction between SVZ and, LVd, another MR variable that we previously published [39]. LVd is defined as the magnitude of displacement from the center of mass of the lateral ventricle volume in glioblastoma patients relative to that of a normal, reference brain. It is a measure of the mass effect associated with the glioblastoma tumor mass. We had previously demonstrated that higher LVd was associated with poor patient survival. In a bivariate model that incorporated LVd and SVZ, both variables independently associate with survival (Fig. 3c). In a multi-variate model that included age, LVd, and SVZ, all three variables independently contribute to survival (Table 5). We believe that this analysis further demonstrates the robust association between SVZ and overall survival.

Table 5 Multivariate model of overall survival incorporating SVZd, LVd, Age variables

Discussion

Making sense of the conflicting literature on glioblastoma phenotypes based on SVZ contact [7,8,9,10,11] is challenging. The published studies determined SVZ contact based on expert readers, and MR interpretation by the human eye is necessarily subjective [40]. For instance, would a one millimeter distance between the glioblastoma and SVZ be sufficient to disqualify contact? What image slice and in what orientation should be used for this determination? Should questionable contact on a single slice be sufficient to qualify determination of “contact”? These issues are further confounded by variability between readers in terms of the defining of the glioblastoma tumor border [41] as well as the limited number of subjects studied. Here, we address the subject matter by employing an automated segmentation algorithm [12] to quantitatively determine glioblastoma distance to the SVZ in 217 TCIA subjects and subsequently correlating these distances to the gene expression patterns and clinical survival. Our results suggest that glioblastomas in proximity to the SVZ exhibited increased expression of gene signatures associated with the cancer stem cell phenotype, including signatures associated with increased DNA repair capacity. Consistent with the aggressive behavior of glioblastoma stem cells, patients afflicted with glioblastomas in proximity to the SVZ exhibited from poor overall survival. Overall, our study supports the notion that glioblastoma in proximity to the SVZ are molecularly and phenotypically distinct from those in proximity to the cortex.

In the context of the current glioma literature, we propose two potential explanations to account for the differences between SVZ proximal and cortex proximal glioblastomas. First, the finding supports the hypothesis that glioblastomas located in proximity to the SVZ may have arisen from the populations of cells that harbor stem or stem-like properties and retained these properties during neoplastic transformation. In experimental models, gliomas formed from SVZ proximal cells are more invasive than those forming from non-SVZ cells [42]. Alternatively, the microenvironment of the SVZ may induce glioblastoma de-differentiation into a more stem-like state [43]. This notion is supported by the phenomenon of induced pluripotent state [44] and observation that the molecular physiology of gliomas may be modified by its microenvironment [45]. These observations, coupled with the capacity of the SVZ microenvironment to sustain neural stem cells [46] suggest the plausibility of the second hypothesis. Of note, these two hypotheses are not mutually exclusive and elements of both hypotheses may contribute to our observations.

The finding that SVZ proximal and cortex proximal glioblastomas differ in molecular physiology harbor significant therapeutic implications. The enriched stem-cell like properties of the SVZ proximal glioblastomas and associated resistance to DNA damage suggest that intensification of DNA damaging regimen for these tumors may improve clinical outcome. Several published studies have provided data in support of this notion [47,48,49,50]. Employment of such intensified regimens should be weighed in the context of increased risk for treatment related morbidities, including neuro-cognitive decline [51]. The distinct molecular physiology of SVZ-proximal and cortex-proximal glioblastomas further suggest that these tumors may exhibit differential response to targeted agents. For instance, SVZ-proximal glioblastomas may be more sensitive to agents targeting pathways required for maintaining the cancer stem cell state. In this context, SVZ distance may serve as a biomarker for response to select targeted agents.

A fundamental assumption of our study is that the bulk of the glioblastoma cells did not migrate from its native site to another region during the process of tumor formation. We believe that this assumption is reasonable for the following reasons. First, studies in murine glioblastoma models where lineage tracing afford definition of the cell of origin reveal no evidence of such mass tumor bulk migration [52]. While it is possible that a few tumor initiating cells migrated away from the SVZ, this hypothesis remains unsubstantiated [6]. Second, gliomas can be formed by the introduction of oncogenes into cortical astrocytes as well as astrocytes in proximity to the SVZ [53]. Thus, tumor formation in distinct regions of the cerebrum can occur in the absence of significant cell migration. Finally, mathematical modeling of glioblastoma growth based on serial MRIs also reveals no evidence of en bulk tumor migration during tumor growth [10, 54].

While a strength of our study is that we bypass reader subjectivity through application of an automated segmentation algorithm, no algorithm is perfect. Technical caveats of our IPVL segmentation algorithm have been reviewed in our previous articles [12, 21]. In addition, TCGA provides no information regarding the location within the tumor in which specimens were retrieved. As such, a potential spatial sampling bias exists which may limit the interpretation of our results. Finally, quantification of the extent of resection was not collected in most TCGA glioblastomas. As such, we were unable to tease out the contribution of surgical resection to the survival association reported here. Within the limit of these technical caveats, our results indicate distinct molecular physiology between SVZ-proximal and cortex-proximal glioblastomas. These findings bear significant implication in terms of clinical prognosis and therapeutic development.

In future works, analysis using an external dataset would serve to validate our results. While our work examined specific genes and gene signatures, future analyses may interrogate the expression landscape in an unbiased manner. In addition, trials targeting SVZ proximal glioblastomas through dose-dense TMZ or escalation of radiation therapy warrant consideration, since these measures may counteract the enhanced DNA-repair capacity.

Conclusions

Quantitative MR image analysis of 217 TCIA glioblastoma subjects in the context of gene expression profiles revealed that tumor proximity to the SVZ is associated with increased expression of gene signatures suggestive of enrichment in cancer stem cell state. Patients with SVZ-proximal glioblastomas exhibited poor clinical survival.