Introduction

Approximately 20,000 high-grade gliomas (HGG) are diagnosed each year in the US, and despite advances in chemotherapy (CT), radiotherapy (RT) and surgery, the prognosis of many of these patients is poor. Traditionally, prognostication for patients with HGGs has been based on factors such as age, performance status, cognitive function, tumor grade, and extent of resection [1, 2].

Silencing of the O6-methylguanine-DNA methyltransferase (MGMT) gene has been associated with longer overall survival (OS) in patients with HGG [3]. MGMT encodes a DNA-repair protein that counteracts the effect of treatment by removing alkyl groups from guanine, a target site for alkylating CT agents such as temozolomide (TMZ). MGMT has ubiquitous expression in human tissue with increased expression in some gliomas [4]. Silencing of the MGMT gene by epigenetic methylation of its promoter may disable this repair mechanism, thus increasing the cytotoxicity of CT, and potentially RT. Conversely, high MGMT activity may render cancer cells resistant to treatment. In addition, the presence of methyl groups on glioma DNA theoretically could increase radiosensitivity, irrespective of the level of MGMT expression [5].

The most reliable technique for evaluating MGMT status is controversial. The methylation-specific polymerase reaction (MSP), the most widely-used technique, is highly sensitive, but it is time-consuming and dependent on quality of tissue. Immunohistochemistry (IHC) is readily available in most diagnostic histopathology labs and is less expensive and time-intensive. A recent systematic review and meta-analysis evaluated the correlation between IHC and MSP in studies that compared MGMT promoter methylation by MSP with protein expression by IHC in the same patient cohorts; the study demonstrated that protein expression by IHC did not correlate with results obtained with MSP [6]. In light of these findings, MSP and IHC should not be used interchangeably. Therefore, in this meta-analysis, studies that measured protein expression by IHC alone were excluded.

Several clinical studies have demonstrated that methylation of the MGMT gene may have prognostic significance, though the magnitude of effect varies across studies, but is contradicted by others, and has not been demonstrated in all subgroups [3, 7, 8]. Precise estimates of the prognostic value of MGMT in all treatment groups could allow for refinement in clinical management, and would better inform clinicians and patients. The goal of this meta-analysis was to characterize the prognostic significance of MGMT silencing in patients with HGG, and to assess if MGMT silencing predicts survival differences that are specific to the form of therapy.

Materials and methods

Objective

The primary objective of this meta-analysis was to answer the question: “What is the prognostic value of MGMT silencing for OS in patients with HGG?”. Specifically, we sought to determine the hazard ratio (HR) for OS of patients with MGMT-silenced tumors, compared to patients with MGMT-non-silenced tumors, to evaluate whether this HR changes based on the type of treatment (i.e. is it a predictive marker [9]), and to assess the robustness of our estimate using sensitivity analyses. The analysis methods and inclusion criteria were specified a priori and were conducted with reference to MOOSE and PRISMA guidelines [10, 11].

Search strategy and inclusion criteria

Combinations of terms were used to search the MEDLINE and EMBASE electronic databases from inception until July 2010, relating to the following two concepts: (1) gliomas (“glioma”, “glioblastoma”, “GBM”, etc.) and (2) MGMT (“MGMT”, “O6-methylguanine-methyltransferase”, etc.). Both text and exploded Medical Subject Heading (MeSH) terms were used. Only English-language articles studying human subjects were included. A total of 610 unique studies were identified (MEDLINE n = 370; EMBASE n = 551, with some duplicates).

Studies that assessed OS by MGMT status in patients with HGG, treated with surgery, potentially with the addition of either: CT, RT, or chemoradiotherapy (CRT) were included. Studies that did not report an adjusted HR for OS after controlling for potential confounding clinical variables in a multivariable analysis (e.g. Cox regression analysis including important clinical factors such age, grade, and/or performance status) were excluded, since, the accuracy of HRs estimated from Kaplan–Meier OS curves without a multivariable analysis are uncertain [1214]. In addition, studies in which patients were assigned to different treatments based on MGMT status, studies which assessed MGMT status by protein expression, and case–control studies, were excluded.

Data abstraction

The data abstraction process is shown in Fig. 1. Title and abstract review of the selected reports were initially performed by one of two authors (RO or PB). Subsequently, the full text publication were independently screened by two of the authors (RO, DP, or PB), and their references checked for additional potential reports. Disagreements between reviewers were resolved by consensus.

Fig. 1
figure 1

Flow diagram of study selection

We developed a data abstraction spreadsheet based on the Cochrane Consumers and Communication Review Group’s data extraction template. One review author (RO) extracted data from included studies and the second author (DP) checked the extracted data. Disagreements were resolved by discussion between the two authors. In instances where studies reported their outcomes more than once, the most recently published update was used for data extraction, and preference was given to outcomes that adjusted for previously demonstrated prognostic variables (e.g. age, performance status, gender, and treatment). Where a multivariable analysis was performed but HR for MGMT not reported, we contacted authors to obtain this information.

The following information was abstracted from included studies: (1) characteristics of study participants (including age, tumour grade, primary or recurrent tumors); (2) study inclusion and exclusion criteria; (3) type of treatments received; (4) method used to determine MGMT silencing (e.g. promoter methylation or protein expression). Unfortunately, extent of surgical resection was either mixed, or not available for the majority of studies, and therefore could not be reliably assessed. Studies were categorized as “primary only”, if study subjects MGMT status and OS were obtained from their initial brain tumor presentation (Table 1). Conversely, if all study subjects were presenting with recurrent disease, with MGMT status and OS was obtained from timing of their recurrence, they were categorized as “salvage only” (Table 1). Studies containing a mixture of both salvage and primary subjects were categorized as both (Table 1). The choice of study treatment category (Table 1) was based on the treatment being assessed at the time of the each study, since OS for each study was based on study treatment dates rather than previous or future salvage treatments, which this meta-analysis cannot account for. Ideally, progression-free survival (PFS) would have been assessed, since it is a valuable surrogate endpoint, which can provide definitive results with shorter follow-up than OS, and is therefore especially useful in studies with long natural histories; however, this was not analyzed for three major reasons: (1) PFS was not reported in most of the studies, which would result in elimination of several studies from the analysis, (2) progression is very difficult to define in the context of GBM, as pseudo-progression and radiation necrosis can result in misclassification, and (3) PFS depends on frequency of imaging follow-up, and studies in which imaging is done less frequently could over-estimate PFS. OS is a much more definitive endpoint that does not have these limitations. Therefore, our principle summary measure was the HR for OS comparing outcome by MGMT status (MGMT silenced versus MGMT not-silenced).

Table 1 Study characteristics

If studies presented adjusted HR for patient subgroups by treatment, these data were extracted in addition to the overall study result. Conversely, if subgroup analyses were unadjusted (i.e. crude) [3, 15] then this data was not extracted. If the 95% CI interval was not reported, it was calculated from the HR and associated P value [16]. For one study the authors sent us their raw data, and granted us permission to generate the HR and 95% CI [17].

Assessment of potential publication bias

For each trial we plotted the log HR by the standard error, allowing for visual inspection for publication bias via a ‘funnel plot’; the spread expands at the bottom of the graph, as a result of decreasing accuracy among smaller studies. The graph should resemble a symmetrical inverted funnel, in the absence of bias; in the presence of bias, the plot should appear skewed and asymmetrical. Because graphical evaluation can be subjective, we also estimated the number of unpublished studies that would bring the P value above 0.05 significance level, using the ‘Classic fail-safe N’ method.

Assessment of heterogeneity

The Breslow–Day test was used to assess for heterogeneity. Furthermore, we present the I 2 statistic as a measure of inconsistency of the prognostic value of MGMT status, since it represents the percentage of total variation across studies due to heterogeneity, and does not inherently depend on the number of studies [18]. Meta-analysis was performed using random-effects modelling. Pre-defined subgroup analyses by grade, treatment type, type of CT, and treatment intent (newly diagnosed versus salvage) were specified. In two of the studies, adjusted subgroup analysis by treatments were reported, and therefore used when assessing if the prognostic value of MGMT status is associated with the treatment received (Table 1; Fig. 2) [7, 19, 20]. Analysis of variance (ANOVA) was used to test whether there was a relationship between the HR and the subgroup category explored. In other words, ANOVA was used to assess if there was a statistically significant difference in HR for OS among different subgroups of studies.

Fig. 2
figure 2figure 2

Hazard ratio of death with MGMT silenced versus MGMT non-silenced. a Surgery and CT only studies, b surgery and CRT studies, c surgery and RT studies, d surgery only studies, and e mixed treatment studies. The centre of each square is the HR for individual trials and the corresponding horizontal line is the 95% CI. The area of the square is proportional to the number of deaths in each trial. The black diamond is the pooled HR and the horizontal tip of the diamond is the 95% CI

Sensitivity analyses

Through sensitivity analyses, we examined if our pooled estimate of the prognostic value of MGMT status was largely influenced by studies including grade III tumors, non-CRT treatment, or salvage treatment. Data analysis was done using Comprehensive Meta-Analysis software (version 2.2.05, Englewood, NJ, USA).

Results

Search results

Twenty studies met the inclusion criteria for this meta-analysis, including a total of 2,018 patients (Table 1) [7, 17, 2037]. Two of these studies also reported subgroup analyses. All studies included patients treated for HGG with surgery, since this is necessary in order to obtain tissue for MGMT analysis. Most studies reported on patients who received either RT, CRT, or CT after surgery.

Meta-analysis

In the pooled analysis MGMT silencing was associated with improved OS (HR 0.436; 95% CI 0.333–0.571; P < 0.001). However, we detected significant heterogeneity [I 2 = 75.2%; Q = 76.7; degrees of freedom (df) = 19; P < 0.001], and therefore do not present the pooled analysis via a forest plot. We pre-specified examination of the heterogeneity by grade and treatment. We continued to detect significant heterogeneity in studies restricted to (1) grade IV gliomas (I 2 = 74.8; Q = 67.5; df = 17; P < 0.001), (2) primary treatment only (not salvage) (I 2 = 78.8; Q = 70.9; df = 15; P < 0.001), and (3) surgery and CRT only (I 2 = 70.3; Q = 40.4; df = 12; P < 0.001). In contrast, there was no significant heterogeneity within the other treatment groups, which supports our a priori hypothesis to explore whether the HR was dependent on treatment used.

On subgroup meta-analysis by treatment (Fig. 2), only treatment type was significantly associated with the prognostic utility of MGMT status (P = 0.001; Table 2). In other words, there is a highly significant difference in HR for OS by the treatment received. Furthermore, in studies which received mixed treatments, the HR for OS was within the average value (HR = 0.419; Fig. 2e). Since the surgery and CT or surgery studies were limited in number, an exploratory analyses comparing surgery and CRT versus surgery and RT was performed, suggesting that MGMT status was significantly more prognostic in the former group (P = 0.006; Table 2). As a measure of the robustness of the OS benefit associated with MGMT silencing, it is evident in all subgroups by the 95% confidence intervals (CI) in Table 2, with the exception of the surgery alone studies subgroup. In addition, sensitivity analyses confirm that our estimate of the overall HR of OS by MGMT status is robust across various scenarios (Table 3).

Table 2 Prognostic value of MGMT status by grade and study treatment characteristics
Table 3 Sensitivity analyses

Assessment for publication bias

Publication bias was initially assessed visually through plotting the log HR by the standard error (Fig. 3); a lack of complete symmetry in this figure suggests that a small number of missing studies with a log HR of 0 (i.e. HR = 1) may not have been published. It is estimated that 610 such null studies would have to be unpublished, in order to bring the P value for a pooled HR estimate to be larger than 0.05, using the Classic fail-safe N method.

Fig. 3
figure 3

Funnel plot of standard error by log HR to assess potential publication bias

Discussion

This meta-analysis of 20 studies, involving 2,018 patients, shows that MGMT gene silencing is associated with a survival advantage in patients undergoing treatment for HGG, and this survival advantage is of a large magnitude (HR of death of 0.436; 95% CI: 0.333–0.571), and similar to the subgroup of studies where all patients received surgery followed by concurrent CRT (HR 0.403; 95% CI 0.282–0.576). If the 2-year survival in patients with MGMT-non-silenced tumors is assumed to be 10%, a HR of 0.436 would correspond to a 2-year survival of 37% in patients with MGMT-silenced tumors, assuming an exponential survival function. This HR estimate has been generated using studies which have all adjusted for other major baseline prognostic factors through multivariable analysis, such as age, gender, and performance status. A benefit of MGMT silencing is evident across all groups of patients who received adjuvant CT, CRT or RT, and for patients treated initially or at recurrence.

This study also demonstrates a significant difference in the estimated effect of MGMT based on treatment type, indicating that MGMT is a predictive marker, and that it is preferable to use the HR specific to the treatment received, rather than a pooled estimate [9]. However, caution should be used interpreting these results given the small number of studies that were not in the CRT category. The HR of death associated with MGMT silencing was lowest in the surgery and CT studies, intermediate in the studies including CRT, and highest in patients treated with surgery alone (Table 2). This finding is consistent with the biological role of MGMT. Alkylating chemotherapies such as nitrosoureas or temozolomide induce cell death by forming cross-links between DNA strands through alkylation of the O6 position of guanine, and MGMT is a DNA repair enzyme that removes these alkyl groups and confers resistance to these agents [38]. Thus, it follows that the effect of MGMT silencing should be most pronounced where alkylating agents are the primary treatment modality.

Like alkylating CT, DNA damage from RT can be mediated through alkylation of DNA at the O6 position of guanine. In addition, MGMT activity has been demonstrated to correlate with tumor resistance in glioma cell lines and xenografts treated with ionizing radiation [39]. However, alkylation of DNA at the O6 position of guanine is just one, minor pathway of RT mechanism of action [5]; therefore, one would expect MGMT activity to have a lesser impact on survival in patients treated with radiation, than in those treated with alkylating CT. This indeed was the case in our meta-analysis (Table 2). Furthermore, MGMT status was not prognostic in patients treated with surgery alone, suggesting that MGMT status is only prognostic when the treatment used is targeted at the pathway MGMT repairs; however, this conclusion is based on data from only a subgroup of a single study, and therefore further studies assessing the prognostic utility of MGMT status in patient receiving only surgery are clearly warranted.

There were no significant differences observed in the HR of death associated with MGMT silencing across the other subgroups tested: type of CT, intent of treatment (primary vs. salvage), or grade (Table 2). Although there appeared to be a trend to a lower HR for studies of grade III only patients compared to studies with only grade IV studies, there was only one study with only grade III patients, which limits the interpretation (Tables 1, 2). The trend for studies with salvage only patients to have a lower HR for OS by MGMT status, is likely due to confounding, in that most patients treated for salvage were also treated with CT only (Tables 1, 2). Sensitivity analysis revealed that our pooled estimate of effect was robust and did not change appreciably in the various scenarios tested (Table 3).

In the era of molecularly targeted therapy, these results could have important therapeutic implications. If these differences in survival associated with MGMT silencing are indeed causal in nature (which cannot be proven here), then MGMT provides an attractive therapeutic target. If the enzyme could be silenced pharmacologically, then theoretically this large survival benefit associated with MGMT silencing could be conferred to patients with MGMT activity [40]. Currently, inhibitors of the MGMT enzyme are being evaluated in clinical trials of recurrent malignant gliomas [41]. Although the value of these drugs may not be known for several years, in the meantime, the results of this meta-analysis will have important implications for patients and physicians. Accurate estimates of survival are important to assist in decisions regarding aggressiveness of treatment. Patients and physicians must balance the potential benefits of treatment against the negative effects of such treatments on quality of life, especially when life expectancy is short. Better estimates of prognosis and life expectancy will assist in such decisions.

The results of this study should be considered in the context of its strengths and limitations. As a meta-analysis, it combines the power of numerous studies and large numbers of patients to provide more precise estimates of effects and to allow for better-powered subgroup analyses. However, as a tradeoff for improved precision, there are several inherent limitations, specifically regarding the risks of selection bias, and from combining data from studies that may be heterogeneous. Unpublished studies and studies not reporting adjusted HRs for OS do not contribute to the estimates of the effect of MGMT calculated herein. It is conceivable that negative studies were less likely to be published. Nonetheless, we calculated that over six hundred null studies would be needed in order to statistically negate the benefit of MGMT silencing that is reported here. Secondly, as a result of the aggressive nature of HGGs, most patients in these studies received treatment other than the study treatment, either previously (e.g. initial CRT in salvage CT studies [32]) or in the future (e.g. salvage CT in upfront studies [34]), which could influence their prognosis. Since individual patient data on OS from the other non-study treatments is not available, these heterogeneous treatments could not be accounted for. Thirdly, extent of surgery could not be assessed, since this data was often not available, or was mixed within study patients. Fourthly, not all studies adjusted the prognostic value of MGMT for the same baseline factors, and some factors (such as p53, Ki-67, epidermal growth factor receptor, cyclin-dependent kinases, allelic losses on chromosome arms 1p and 19q, isocitrate dehydrogenase 1 mutations, or other gene expression parameters) were rarely used to adjust the estimate of MGMT effect [37]. Ideally, all available baseline factors would be used to adjust the estimate of effect, but this is only possible in the setting of an individual patient data meta-analysis that contains the same variables. Studies also varied in methods of classifying tumors as MGMT-silenced. These important differences likely contributed to the amount of heterogeneity detected between studies.

We have several recommendations for future studies reporting on the prognostic value of MGMT. Investigators should report HRs for all variables included in their multivariable models, not only variables that are statistically significant. Clear reporting of MGMT detection techniques and a priori specification of cutoff thresholds is essential. Currently, the best method of MGMT detection is unknown. Comparison of different detection techniques with receiver-operator characteristic curves would allow for standardized detection techniques, and thereafter uniform detection procedures across studies. Finally, we support several authors’ recommendation that new RPA prognostic models should be constructed, accounting for MGMT status [15].

Conclusion

Epigenetic silencing of MGMT is associated with markedly improved survival in patients with HGG. The largest benefit is apparent in patients receiving CT as a component of adjuvant treatment, an intermediate benefit in those receiving adjuvant RT, and no evidence to support benefit in those receiving surgery alone. There was no difference in the effect of MGMT silencing based on type of CT, primary versus salvage treatment, or tumor grade.