Introduction

The incidence of Metastatic Brain Disease (MBD) in the general cancer population is approximately 10% [1, 2]. Depending on primary type, prognosis can vary from poor to dismal. Of the most devastating, the historical overall survival (OS) of melanoma MBD is 3–5 months, with up to 95% of mortality events in these patients is attributed to the brain metastases [3, 4]. Thus, optimization of MBD management is important in order to improve survival outcomes.

The rise of immunotherapy in recent years has shown great promise in treating many of the MBD-prone primary cancers such as melanoma, non-small cell lung cancer (NSCLC) and renal cell carcinoma, in particular by immune checkpoint inhibition (ICI) [5,6,7,8,9]. ICI aims to boost antitumor immune responses by preventing the down-regulation of T-cell activation. Currently there are two classes of ICI that have been approved by the Federal Drug and Administration (FDA) for use in treating MBD: anti-cytotoxic T-lymphocyte-associated protein 4 (CTLA-4) monoclonal antibody ipilimumab, and the anti-programmed cell death protein 1 receptor (PD-1) checkpoint inhibitors nivolumab and pembrolizumab [10,11,12].

There is emerging data that describes that synergistic combination of ICI with stereotactic radiosurgery (SRS) confers greater survival prognosis in MBD patients compared to SRS alone [13,14,15,16]. This is in part believed to be attributed to what is termed the ‘abscopal effect’. First reported in 1953, it describes the phenomenon of metastatic tumor regression outside the field of radiation focus, and has been linked functionally to the immune system [17, 18]. It is believed that the use of ICI may provide an avenue to enhance this effect, and improve overall metastatic control, thus improving survival outcomes. However, as the use of ICI in MBD management involving SRS continues to garner interest, it is currently unclear as to if the timing of ICI administration in relation to SRS can affect the survival outcomes of MBD.

The aim of this systematic review and meta-analysis (SRMA) was to evaluate the current literature investigating the pooled effect and quality of evidence regarding the timing of ICI administration in relation to SRS to manage MBD, in particular focusing on whether or not concurrent ICI administration affords different survival outcomes compared to ICI administered non-concurrently, either before or after with SRS management. There were no funding sources used in this study.

Methods

Search strategy

The strategy was designed around the PICOS question format—Do MBD patients who are managed by SRS (population) and concurrent ICI (intervention: concurrent group) when compared to those who received ICI before or after SRS (comparator: non-concurrent group), differ in survival metrics (outcome) based on comparative studies (study type)? The present review was conducted according to the preferred reporting items for systematic reviews and meta-analyses (PRISMA) guidelines and recommendations [19]. Electronic searches were performed using Ovid Embase, PubMed, SCOPUS, Cochrane Central Register of Controlled Trials (CCTR), Cochrane Database of Systematic Reviews (CDSR), American College of Physicians (ACP) Journal Club and Database of Abstracts of Review of Effectiveness (DARE) from their dates of inception to April 2018. The literature was searched by using the following string of MeSH terms to detect all possible studies: (brain metastases) AND (ipilimumab OR nivolumab OR pembrolizumab OR immunotherapy OR checkpoint) AND (radiosurgery OR radiation OR radiotherapy). Translations of search terms are provided in the Supplementary. The reference lists of all retrieved articles were reviewed independently by two investigators (V.M.L. and A.G.) for further identification of potentially relevant studies. All identified articles were then systematically assessed against the inclusion and exclusion criteria.

Selection criteria

The inclusion criteria used to screen all identified articles were (1) MBD patients, (2) treated by SRS, (3) and received immunotherapy targeting the checkpoint of cell cycle by CTLA-4 or PD-1 inhibition either concurrently or non-concurrently, (4) with reported survival outcomes in (5) patients > 18 years. Concurrent administration was defined as within 4 weeks (or less) of SRS management, and included cases where immunotherapy was interdigitated between multiple SRS cycles. The exclusion criteria applied to all identified articles were (1) case series, (2) involvement of immunotherapy targeting BRAF, and other non-checkpoint targets, (3) primary treatment radiation in the form of whole brain radiation therapy, and (4) no quantitative definition of concurrent administration.

When institutions published duplicate studies with accumulating numbers of patients or increased lengths of follow-up, or when studies reported multiple time courses of the same treated cohort, only the most complete reports were included for quantitative assessment at each time interval. All publications were limited to those involving human patients and in the English language. Reviews, abstracts, case reports, conference presentations, editorials and expert opinions were excluded to minimize potential publication bias and duplication of results.

Data extraction

All survival outcome data were extracted from article texts, tables and figures. Our primary outcome of interest was the OS metric at 12-months as inferred by Kaplan–Meier estimation specific to each study. Other outcomes of interest included local and distal progression free survival, local progression free survival (LPFS) and distant progression free survival (DPFS) respectively, at 12-months, obtained in a similar manner to the OS metric. The LPFS and DPFS were defined as the new progressive enhancement involving and exclusive to the treated intracranial metastatic regions respectively.

Meta-analysis

Odds ratio (OR) was the primary summary statistic of this study, with concurrent administration used as the reference cohort. The outcomes of the included studies were pooled together by meta-analysis of non-integer proportions using a random-effects (RE) model to provide an overall summary statistic. These were presented as forest plots, with the overall heterogeneity statistics provided in-graphic. The I2 statistic was used to estimate the percentage of total variation across studies, owing to heterogeneity rather than chance, with values > 50% considered as substantial heterogeneity [20].

Meta-regression was performed to analyze potential effect modification on overall prognostic trend by a number of study covariates when the outcome was reported by ≥ 4 studies: study year, study size, study duration, mean age, proportion of females, MBD histology (melanoma only vs other combinations), ICI choice (ipilimumab only vs other combinations) and definition of concurrent use (2 vs. 4 weeks). All p values were 2-sided with significance set at p < 0.05. Statistical analyses were conducted with STATA 14.1 (StataCorp, College Station, Texas).

Quality assessment

The strength of evidence for each outcome was evaluated by two independent assessors (V.M.L. and A.G.) using the Grading of Recommendations, Assessment, Development and Evaluations (GRADE) approach and presented as a summary of findings table to identify the certainty of all pooled outcomes [21]. Each study was then assessed against the Newcastle–Ottawa Scale (NOS) [22] which accounted for criteria such as selection, comparability, and outcome to evaluate the quality of its design.

Bias assessment

Publication and small-study biases were assessed through the generation of a funnel plot and assessed for asymmetry. Egger’s linear regression test and Begg’s correlation test were used to investigate suspect asymmetry for small-study bias [23, 24]. Sensitivity analysis by exclusion was performed for all outcomes to evaluate the risk of single-study bias. Finally, a trim-and-fill approach was used to recalculate pooled effect size if any bias was suspected [25].

Results

Search strategy

The search strategy identified 1210 articles for evaluation (Fig. 1). Removal of 321 duplicates resulted in title and abstract screening of 889 articles, which yielded 48 articles for full-text evaluation. The full-text of these were then assessed against the selection criteria, which identified 8 single-institution retrospective observational cohort studies [16, 26,27,28,29,30,31,32] for inclusion into this systematic review and meta-analysis (Table 1). Of these 8 studies, Schapira et al. [32] described outcomes for NSCLC metastases only and Chen et al. [16] described outcomes for a combination of NSCLC, melanoma and RCC metastases—these 2 studies were allocated to the Mixed subgroup for secondary analysis. The remaining 6 studies [26,27,28,29,30,31] described melanoma metastases only, and were allocated to the Melanoma subgroup for secondary analysis.

Fig. 1
figure 1

Results of the preferred reporting items for systematic reviews and meta-analyses (PRISMA) search strategy

Table 1 Design and demographic characteristics of included studies

Of the excluded 40 studies, the three studies require mentioning: Ahmed et al. [33] did not quantify OS proportion of their cohorts; An et al. [34] evaluated early ICI with the definition of ≤ 5.5 months of SRS management which was too broad for our purposes; and quantitative data by Skrepnik et al. [35] could not be clarified.

Demographics and clinical features

The pooled cohort included 408 MBD patients with 105/408 (26%) female, an average age of 60.8 years, and median of 2 metastases treated (Table 1). There were 181 (44%) patients who received ICI concurrent with their SRS management (67 within 2 weeks, 114 within 4 weeks), and 227 (56%) patients who received ICI either before or after their SRS management (Table 2). The most common metastatic primary and ICI agent reported was melanoma and ipilimumab (3 mg/kg for 4 doses) respectively. SRS was most commonly dosed at 20 Gy in a single fraction. Concurrent ICI was considered to be within 2 weeks of SRS in 3 studies [16, 26, 29], and the remaining 5 studies [27, 28, 30,31,32] a period of 4 weeks was used as the definition.

Table 2 Clinical features and outcomes of included studies

Overall survival

The median Kaplan Meier estimates for 12-month OS were 62% in the concurrent group and 50% in the non-concurrent group when all study data was pooled (Table 2). Overall, in the pooled cohort of all 8 studies, the 12-month OS was better in the concurrent ICI group, and this was statistically significant (OR = 1.74; 95% CI 1.13–2.69; I2 = 0.0%; p = 0.011, Fig. 2a). Meta-regression of study covariates did not detect any significant modifying trend on the observed pooled outcome (Supplementary).

Fig. 2
figure 2

Forest plots of the odd ratios (ORs) derived from Kaplan–Meier estimates and their corresponding 95% confidence intervals (95% CIs) of all studies investigating concurrent vs non-concurrent immune checkpoint inhibition (ICI) in metastatic brain disease (MBD) for outcomes (a) 12-month overall survival (OS), (b) 12-month local progression free survival (LPFS) and (c) 12-month distant progression free survival. The summary statistic, the 95% CI, and the relative weightings are represented by the middle of the square, the horizontal line, and the relative size of the square respectively. The heterogeneity and its significance of included studies is provided as an overall statistic for each plot

Furthermore, in the Mixed subgroup (n = 2), the trend and significance were maintained (OR = 2.71; 95% CI 1.06–6.91; I2 = 0.0%; p = 0.037), whereas in the Melanoma subgroup (n = 6), the trend was maintained, however significance was not (OR = 1.55; 95% CI 0.95–2.52; I2 = 0.0%; p = 0.070).

Local progression free survival

The median Kaplan Meier estimates for LPFS were 88% vs. 77% for concurrent vs non-concurrent cohorts respectively (Table 2). Overall, in a pooled cohort of 5 studies, superior 12-month LPFS trended in favor of concurrent ICI, however this was not statistically significant (OR = 2.09; 95% CI 0.76–5.73; I2 = 40.8%; p = 0.154) as shown in Fig. 2b. Meta-regression of study covariates did not detect any significant modifying trend on the observed pooled outcome (Supplementary).

In the Melanoma subgroup (n = 3), the trend and lack of significance were maintained (OR = 2.14; 95% CI 0.37–12.45; I2 = 67.1%; p = 0.399), and in the Mixed subgroup (n = 2), a similar result was obtained (OR = 2.30; 95% CI 0.69–7.72; I2 = 0.0%; p = 0.177).

Distal progression free survival

The median Kaplan Meier estimates for DPFS were 32.5% vs. 55.5% for concurrent vs non-concurrent cohorts respectively (Table 2). Overall, in a pooled cohort of 4 studies, superior 12-month DPFS trended in favor of non-concurrent ICI, however this was not statistically significant (OR = 0.88; 95% CI 0.23–3.29; I2 = 64.1%; p = 0.839) as shown in Fig. 2c. Meta-regression of study covariates identified study year (slope = 0.16; p = 0.039) as a covariate that exerted significant effect on overall trend (Supplementary). As slope < 1, it implied that newer studies were less likely to favor concurrent ICI compared to older studies. All other covariates did not significant modify the overall trend.

In the Melanoma subgroup (n = 3), the lack of significance was maintained, however, the trend was reversed in favor of concurrent ICI (OR = 1.34; 95% CI 0.27–6.69; I2 = 67.5%; p = 0.723). The resulting trend mirrored the overall one in the Mixed subgroup (n = 1) in term so trend favoring non-concurrent ICI (OR = 0.25; 95% CI 0.05–1.53; I2 = NA; p = 0.09).

Quality assessment

The certainty of effect estimates for pooled outcomes were evaluated by GRADE assessment. The certainty of the 12-month OS outcome was deemed to be low (+ 2), and very low for 12-month LPFS (+ 1), and 12-month DPFS (0) (Table 3). According to the NOS criteria, all included studies were of good quality with a median score of 8 (Supplementary).

Table 3 GRADE assessment of outcomes to assess the level of evidence contributing to each outcome. OR, odds ratio; CI, confidence interval

Bias assessment

Funnel plots were generated for each outcome described to assess for publication bias via asymmetry (Supplementary). Asymmetry and bias was suspected in the 12-month OS outcome, despite negative results of Egger’s (p = 0.217) and Begg’s (p = 0.266) tests, as well as a negative sensitivity analysis for single-study bias. A trim-and-fill analysis by RE modelling was performed to validate this concern, which resulted in 2 filled studies (Supplementary), however both trend and significance were maintained in favor of concurrent ICI (OR = 1.56; 95% CI 1.04–2.34; p = 0.030; n = 10). In the Melanoma subgroup (n = 6), sensitivity analysis detected one study [36] upon exclusion which resulted in a significant 12-month OS trend. Yet, trim-and-fill analysis resulted in 1 filled study, and maintained the lack of significance in the trend favoring concurrent ICI (OR = 1.40; 95% CI 0.88–2.24; p = 0.153; n = 7).

The funnel plot for 12-month LPFS outcome (n = 5) was suspected of asymmetry despite negative Egger’s test (p = 0.348) and Begg’s test (p = 0.308) results. Sensitivity analysis by leave-one-out approach identified one study [36] which upon its individual exclusion, significance of the pooled trend was achieved. However, trim-and-fill analysis was negative. The symmetry of the funnel plot for 12-month DPFS (n = 4) was unclear, with Egger’s test (p = 0.201) and Begg’s test (p = 0.308) non-significant. However, sensitivity analysis did not identify any outlying studies to affect significance, and trim-and-fill analysis was negative.

Discussion

The aim of this SRMA was to determine if the timing of ICI relative to SRS has the potential to influence survival metrics in MBD patients. Based on the current literature, pooled analysis indicates that, compared to non-concurrent ICI, concurrent ICI results in significantly better 12-month OS (OR = 1.75; p = 0.011), and comparable 12-month LPFS (OR = 2.09; p = 0.154) and DPFS (OR = 0.88; p = 0.839) outcomes. The trend and significance of the LPFS finding was reflected in the Melanoma subgroup. However bias assessment identified a number of concerns in current studies which limits the certainty of these results.

To date, the largest study reporting concurrent ICI with SRS outcomes is by Chen et al. [16], who investigated outcomes of 79 NSCLC, melanoma and RCC MBD patients managed by ICI. They found with multivariate Cox regression modelling that concurrent ICI conferred a significant OS benefit for MBD, compared to non-concurrent ICI (HR = 2.69; p < 0.01), and SRS alone (HR = 2.69; p < 0.01). This conclusion aligns with the overall finding of this study. Despite this finding, we did not observe corresponding significant trends with respect either LPFS or DPFS. The leading view on this finding is that concurrent ICI with SRS affords greater systemic, extracranial control of MBD, rather than superior intracranial control [31, 36]. Additionally, it may be possible that the number of metastases present at treatment can dictate the propensity for progression independent to that of OS [16]. There is need for enhanced clinical course reporting to assist in clarifying this further in the future.

From the analysis by Chen et al. [16], melanoma histology was identified to significantly correlate with worse OS outcome compared to NSCLC and RCC (p = 0.043). A large portion of the current literature in fact focuses on melanoma outcomes specifically, with the most recent study by Rahman et al. [31] the largest to date. Their conclusions were slightly more tempered however, with OS survival not significantly different between concurrent vs noncurrent ICI (p > 0.05), despite median OS being 17.8 vs. 11.6 months respectively. This correlates with the non-significant trend in 12-month OS in favor of concurrent ICI seen in our melanoma subgroup. Of clinical interest, is that with regards to DPFS, the melanoma subgroup trended in favor of concurrent ICI (OR = 1.34, > 1; n = 3), compared to the other subgroup which favored non-concurrent ICI (OR = 0.25, < 1; n = 1)—the possibility that melanoma achieves better DPFS with different ICI timing compared to other MBD histology is a novel concept, potentially related to superior biological abscopal effect which has been postulated [37], however this remains speculative at best until more studies are published given it is extremely underreported. Nonetheless, it is plausible that MBD histology may be a delineating factor when evaluating optimal time frame for defining concurrent administration of ICI with SRS.

There is evidence that demonstrates melanoma histology carries worse OS and PFS prognosis in MBD patients managed by SRS when compared to other histologies [38,39,40]. It is highly possible then that the addition of ICI to treatment does not change this [16]. The lack of 12-month OS significance with concurrent ICI in the Melanoma subgroup (p = 0.070) compared to the significance in the Mixed subgroup (p = 0.037) would support this notion. We attempted to investigate the effect of histology by means of meta-regression, and although the results were negative for trend modifying effect across all outcomes, we acknowledge that the analysis was most likely underpowered due to low study numbers, and lack of non-melanoma cohorts for control. Should a difference between MBD histology be validated in the future, it may necessitate different inclusion criteria for concurrent ICI with SRS based on histology.

Another potential delineation factor for concurrent ICI is the agent of choice. To date there remains a paucity of data directly comparing different ICI agents administered concurrently in MBD. Chen et al. [16] reported that anti-PD-1 inhibition resulted in a significantly superior OS compared to anti-CTLA-4 inhibition based on their multivariate Cox regression (HR = 0.24; p < 0.01). However, this was not taking into consideration the timing of ICI with respect to SRS, nor was there any separation of this outcome based on MBD histology. Pasquali et al. [41] calculated a similar significant trend favoring anti-PD-1 over anti-CTLA-4 inhibition with respect to PFS in metatastic melanoma, however this was not delineated by timing either. We note that our meta-regression did not detect trend modifying effect by ICI choice reported by each study for all outcomes, however this is compromised by the fact studies were designated as either ‘ipilimumab only’ or ‘mixed ICI’ studies, wherein the latter group involved other ICI agents as well as ipilimumab.

Whether concurrent ICI alters the risk of immune- or SRS-related complications after SRS in the treatment of MBD compared to non-concurrent ICI is currently underreported. The acceptable safety profile of ICI for MBD with SRS is discussed elsewhere in the literature [10,11,12], but whether or not concurrent ICI modulates this profile is unclear. Chen et al. [16] observed comparable incidence rates of immune-related adverse events between their concurrent vs non-concurrent cohorts (32% vs 30%). With respect to SRS-related complications, Diao et al. [36] reported radionecrosis (RN) incidences of 2/23 (9%) and 2/28 (7%) in their concurrent and non-concurrent ipilimumab cohorts with SRS for melanoma MBD respectively, with an overall median time to event of 6.6 months after SRS. Rahman et al. [31] delineated further, noting that between concurrent and non-concurrent ICI cohorts, rates of symptomatic RN were 4/35 (11%) vs. 5/39 (13%) respectively, and rates of pathological RN were 1/35 (3%) and 2/39 (5%) respectively. This anecdotal evidence would appear to advocate for the negligible modification to RN risk with concurrent ICI vs non-concurrent, however additional studies will be helpful in this regard.

Strengths and limitations

This SRMA adhered strictly to PRISMA guidelines and its selection criteria. We obtained ORs for survival metrics based on presented data in individual studies modelled by Kaplan–Meier estimation. The strength of this approach is that it accounts for potential loss by 12-month follow-up via censoring, to provide more consistent, comparable metrics across studies [42]. Finally, in all analyses, a random-effect (RE) model was tested in an attempt to minimize statistical interference by the possible clinical diversity and methodological variation between studies, as it assumes unequal variance between studies and distributes statistical weighting more conservatively [43]. This includes the use of various systemic therapies used prior/concurrently with the SRS and ICI combination, as well as the unique disease biology of different histology types, duration of follow-up, use of different ICI agents, baseline differences, and disease burden at presentation, all of which would be better accounted for by prospective study [16, 27, 31].

We applied a strict definition of ≤ 4 weeks within SRS for ‘concurrent’, which did exclude one study [34]. Although yet to be completely elucidated, it has been suggested that an optimal window within which ICI can be added to SRS exists, possibly within 6–15 days of SRS for ipilimumab [35]. This is based on the upregulation of SRS-induced cell death mechanisms within 7 days, and the half-life of ipilimumab being 14.9 days [44, 45]. Current studies are limited by their retrospective nature in this regard, which limits the scope for investigation of when exactly an OS ‘advantage’ of concurrent ICI diminishes in BM managing. In other words, after SRS, there may be a therapeutic window by which administration ICI has optimal effect. This may or may not be within 4 weeks, as An et al. [34] observed significant LPFS benefit of concurrent ICI within 5.5 months of SRS compared to outside that time period. Evaluation of this therapeutic window will prove clinically useful in the future, as it may widen the utility of ICI in the management of prolonged, complex MBD cases.

The number of studies in the literature is small. This is not particularly surprising given the novel nature of this topic, however limits the certainty by which we can advocate our findings due to the potential for underpowered analysis, as well as over-weighted, outlying studies. For example, our DPFS outcome appears to be affected by year of publication (slope = 0.16; p = 0.039), yet with n = 4 studies, it cannot be claimed that this trend is in fact clinically relevant until more studies are published.

Of particular note is the study by Diao et al. [27], whose exclusion in both 12-month OS and LPFS resulted in statistical significance in favor of concurrent ICI in the Melanoma subgroup, which was not the case with its inclusion. They noted that more frequent prior chemotherapy and higher doses of ipilimumab in the non-concurrent ICI cohort may have contributed to their inconsistent findings with the rest of the literature. This highlights that while it is attractive to assume concurrent ICI affords greater survival outcome benefits in theory, we must be wary that this does not guarantee effective translation into clinical practice.

Furthermore, potential selection bias cannot be controlled for in non-randomized studies. For example, patients into the non-concurrent ICI cohorts may have been recruited after insufficient outcomes following SRS were observed, predisposing these cases to poorer survival outcomes. Additionally, it is unclear if non-concurrent ICI administration before vs. after the SRS window may affect survival outcomes differently. Schapira et al. [32] reported that non-concurrent nivolumab/pembrolizumab before SRS trended towards superior OS and DPFS in NSCLC MBD compared to non-concurrent inhibition after SRS. Kiess et al. [28] reported a similar trend in their cohort of melanoma BM with ipilimumab. Greater factorial control by larger, randomized studies will assist in overcoming these limitations in the future.

Conclusion

It appears that ICI administration within 4 weeks of SRS for MBD confers a statistically significant superior 12-month OS when compared to administration outside that time period. However, the biological heterogeneity of MBD and ICI, and small number of retrospective clinical studies to date, limits the certainty of these results. The results of this systematic review and meta-analysis illustrate that it is conceivable there exists an optimal therapeutic window for ICI in MBD management to yield greatest OS, LPFS and DPFS benefit; when that is, and by what sequence, requires intensive delineation based on clinical parameters in future studies.