Introduction

Pituitary adenomas comprise close to 20% of brain tumors in adults in the United States with roughly 12,000 cases diagnosed each year [1]. Functioning pituitary adenomas comprise roughly 70–75% of these, and of these growth-hormone secreting pituitary adenomas account for approximately 10% of all functioning pituitary adenomas [2]. Growth-hormone secreting pituitary adenomas left untreated may result in acromegaly with subsequent symptoms such as macrognathia, forehead furrowing, enlarging nose and ears, and, in pediatric patients, gigantism with an associated two- to three- fold increase in mortality compared to age- and gender-adjusted controls [3, 4]. Also, both functioning and non-functioning pituitary adenomas may cause local mass effect on adjacent structures such as the optic nerve as well as optic chiasm and result in potential visual field deficits as well as hypopituitarism [5].

Generally, first-line management for growth-hormone secreting pituitary adenomas includes surgical resection typically via a trans-sphenoidal approach with or without somatostatin analogues such as octreotide or lanreotide [6, 7]. Options for management should the former interventions not be successful include radiotherapy in the form of fractionated stereotactic radiotherapy (RT) or stereotactic radiosurgery (SRS), which delivers an ablative dose of radiation typically in 1–5 fractions with submillimeter accuracy with some studies suggesting less toxicity and more rapid endocrine remission with SRS as compared to fractionated stereotactic radiotherapy (RT) [8, 9]. Prior single institution and multi-institutional studies as well as meta-analyses have noted encouraging local control (LC) and biochemical control following SRS. However, earlier manuscripts have had short and non-uniform follow-ups and subsequent studies have reported improved biochemical outcomes with longer term follow-up [9,10,11]. As such, we aimed to provide an updated systematic review and meta-analysis of biochemical control, LC, and subsequent toxicity rates following SRS for the management of growth-hormone secreting pituitary adenomas. We also examine any potential prognostic factors that were associated with biochemical control, LC, or toxicity rates.

Materials and methods

Literature selection

A literature search was performed using PubMed, EMBASE, and the Cochrane Library through March 1st, 2020. Inclusion criteria for selection was defined using the Population, Intervention, Control, Outcomes, Study Design (PICOS) method (Supplementary Table 1) [12,13,14]. Additionally, the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) selection algorithm (Supplementary Fig. 1) was designed [13]. Guidelines from the PRISMA checklist [15] (Supplementary Fig. 2) and the Meta-analysis of Observational Studies in Epidemiology (MOOSE) [16] (Supplementary Fig. 3) were followed.

Table 1 Studies examining clinical outcomes following SRS for the management of acromegaly

The following combination of keywords were searched: radiosurgery, SRS, radiation therapy, acromegaly, pituitary, fractionation, endocrine control, endocrine remission, local control, toxicity, GammaKnife, CyberKnife, LINAC, protons, and side-effects. Related articles as well as reference lists were reviewed of manuscripts that were initially found with additional relevant publications included. A total of 20 published studies were chosen for inclusion in both the qualitative and quantitative portions of the meta-analysis.

The inclusion criteria for the quantitative meta-analysis included studies with information on: (1) patients clinically or pathologically diagnosed with acromegaly secondary to a functioning pituitary adenoma; (2) either endocrine control (EC), endocrine remission (ER), local control (LC), or pituitary or visual toxicity rates; (3) dose and fractionation (4) patients treated with SRS either definitively or post-operatively (with SRS defined as being delivered in hypofractionated courses of 1–5 fractions). Definitions for each of these endpoints were based upon the individual studies themselves. Exclusion criteria included: (1) studies without information on ER, EC, LC, visual toxicity, or hypopituitarism rates; (2) studies that included patients without outcomes specific to those with functioning pituitary adenomas resulting in acromegaly; (3) studies without a minimum follow-up of at least 2 years and at least 15 patients (4) works involving patients included in more than one study; (5) works involving non-human subjects; (6) works not published in English; (7) unfinished manuscripts.

Data extraction

Independent authors (R.S., P.D., E.L.) conducted and reviewed extraction of relevant data. Information obtained included the primary and secondary outcomes, as well as patient, study, and treatment characteristics. Authors of relevant studies were contacted for missing data if such studies met inclusion criteria.

Outcome measures

The primary outcome of the study was 5- and 10-year ER and EC rates, with secondary outcomes of 10-year radiographic LC and both new visual toxicity rates secondary to cranial nerve (CN) II neuropathy or hypopituitarism secondary to SRS. The definition of ER across studies varied but generally was defined as a post-SRS GH < 1–2 ng/mL and/or normal IGF-1 levels for patient sex and age without the use of pharmacologic management/suppressive medications. The definition of EC was defined similarly to ER but with the use of pharmacologic management/suppressive medications.

Statistical analysis

Statistical analyses were conducted using R Studio Version 1.1.383 (Boston, MA) [17]. The Meta-Analysis for R (metafor) package version 2.0-0 was used to conduct the meta-analyses, meta-regressions, tests for heterogeneity, analysis of publication bias, and Wald-type tests [18]. The DerSimonian and Laird method was used to calculate between study variances [19]. Proportions were calculated for each of the outcome measures, where the denominator was the total number of patients in each study or arm. Weighted random effects models were used to determine an overall summary estimate for each of the outcome measures [20, 21]. Summary estimates for each of the outcome measures were depicted on forest diagrams with their associated 95% confidence interval.

Heterogeneity was assessed using both the I2 statistic [22] and Cochran Q-Test [23]. Significant heterogeneity was considered to be present if I2 > 50% and the p value of the Q-Test was < 0.10. An assessment of publication bias was performed using the Egger test [46]. Publication bias was considered to be present if the p-value of the Egger Test was < 0.05.

Meta-regression and the Wald-type test were used to compare summary effect sizes on each of the forest diagrams for ER, EC, LC, visual toxicity and hypopituitarism rates, where treatment category was used as a categorical covariate. The null-hypothesis was rejected for p < 0.05.

For dose response analyses, mixed effects meta-regression models utilizing an ordinary least squares (OLS) approach were used to estimate weighted linear relationships between ER, EC, and LC rates and median post-operative growth hormone (GH) and insulin-like growth factor-1 (IGF-1), gross tumor volume (GTV), and median margin prescription dose as well as between visual toxicity and new hypopituitarism rates and GTV and median margin prescription dose. The weight applied to a given study’s published effect estimate was the ratio of the number of patients analyzed in that study divided by the total number of patients over all studies used for the meta-estimate of that effect [20]. Results were summarized by slopes representing expected changes in independent variables based on per unit changes in independent variables.

Results

Characteristics of studies included for quantitative analysis

Among 20 published studies, 1533 pituitary patients with acromegaly treated with definitive, adjuvant, or salvage SRS were identified that met inclusion criteria [10, 11, 24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41]. Studies were published from 2001 to 2020 with patients from the United States, Italy, France, Turkey, Germany, Japan, Czech Republic, Taiwan, Hong Kong, China, Norway, the United Kingdom, Australia, and South Korea. Data on both the primary and secondary outcomes are listed for each study in Table 1. Other information regarding patient age, radiation technique, prescription dose and prescription isodose, GTV or planning target volume (PTV), EC, ER, LC, visual and pituitary toxicity rates, whether patients had received prior radiation therapy or had prior surgery, and other relevant information can also be found in Table 1. Among studies examining SRS, the majority of patients had received adjuvant SRS with a few patients in some studies having received definitive SRS. Median pre-SRS GH and IGF-1 levels were 10.53 ng/mL (range: 0–442) and 665.5 ng/mL (range: 18.5–2915), respectively. The median GTV was 2.0 cc (range: 0.1–49.5), and the median margin prescription dose was 24 Gy (range: 8.8–42 Gy). In nineteen of twenty studies, single fraction SRS was utilized with one experience reporting on fractionated SRS in 3 or 5 fractions [40].

Endocrine remission, endocrine control, and local control

With respect to ER, 17 studies with 1,298 patients had information on 5-year ER [10, 11, 24, 26, 28,29,30,31,32,33,34,35,36,37, 39,40,41], and 10 studies with 1,023 patients had information on 10-year ER [10, 11, 31, 34,35,36,37, 39, 41]. At 5-years, the estimated ER rate was 43.2% (95% CI 31.7–54.6%; Fig. 1), and at 10-years the estimated ER rate was 56.9% (95% CI 47.5–66.4%; Fig. 1). On meta-regression for evaluation of potential prognostic factors, median GH (p = 0.90), IGF-1 (p = 0.57), GTV (p = 0.55), and margin prescription dose (p = 0.43) were not found to be correlated with 5-year ER. Similarly, median GH (p = 0.78), IGF-1 (p = 0.90), GTV (p = 0.78), and margin prescription dose (p = 0.66) were not found to be correlated with 10-year ER. Egger’s test assessing publication bias for 5-year (p = 0.77) and 10-year ER (p = 0.28) were non-significant.

Fig. 1
figure 1

Forest plots examining 5- (a) and 10-year (b) endocrine remission (ER) rates following SRS

Upon examination of EC, 4 studies with 317 patients on 5-year EC [11, 30, 36, 39] and these same 4 studies with 317 patients had information on 10-year EC [11, 34, 36, 39]. At 5-years, the estimated EC rate was 55.0% (95% CI 27.6–82.4%; Fig. 2), and at 10-years, the estimated EC rate was 69.7% (95% CI 47.7–91.8%; Fig. 2), respectively. On meta-regression for evaluation of potential prognostic factors, median GH (p = 0.26) and margin prescription dose (p = 0.22) were not found to be correlated with 10-year EC. The potential correlation between median IGF-1 levels and GTV and 10-year EC were not able to be examined due to a lack of enough information on both of these variables. Egger’s test assessing publication bias for 5-year (p = 0.90) and 10-year EC (p = 0.73) were non-significant.

Fig. 2
figure 2

Forest plots examining 5- (a) and 10-year (b) endocrine control (EC) rates following SRS

Five studies with 360 patients in total had information on 10-year LC [29, 31, 37]. The estimated 10-year LC rate was 92.8% (95% CI 83.0–100%; Fig. 3). On meta-regression for evaluation of potential prognostic factors, neither GTV (p = 0.35) nor margin prescription dose (p = 0.45) were found to be correlated with 10-year LC. Egger’s test assessing publication bias for 10-year LC was non-significant (p = 0.10).

Fig. 3
figure 3

Forest plot examining 10-year local control (LC) following SRS

Visual toxicities and hypopituitarism

Across 17 studies, 1255 patients were identified has having information on visual toxicity rates following SRS secondary to CN II neuropathy [10, 24,25,26,27,28,29,30,31,32,33,34,35,36,37,38, 40]. The estimated rate of visual toxicity secondary to CN II neuropathy following SRS was 2.7% (95% CI 1.3–4.2%; Fig. 4). On meta-regression for evaluation of potential prognostic factors, margin prescription dose was found to be correlated with visual toxicity incidence with the model estimating a 0.41% increased risk of visual toxicity for every 1 Gy increase in margin prescription dose beyond approximately 17 Gy (Supplementary Fig. 4; p = 0.03). GTV at the time of SRS was not found to be associated with visual toxicity incidence (p = 0.40). Egger’s test assessing publication bias for visual toxicity was non-significant (p = 0.73).

Fig. 4
figure 4

Forest plots examining incidences of visual toxicity (a) and hypopituitarism (b) following SRS

With regards to rates of hypopituitarism, 19 studies with 1,512 patients had information on this outcome [10, 11, 24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40]. The estimated rate of hypopituitarism following SRS was estimated to be 26.8% (95% CI 16.9–36.7%; Fig. 4). On meta-regression for evaluation of potential prognostic factors, neither GTV (p = 0.29) nor margin prescription dose (p = 0.95) were found to be correlated with incidence of hypopituitarism. Egger’s test assessing publication bias for hypopituitarism rates were significant (p = 0.003).

Discussion

SRS is increasingly being utilized in both adjuvant and salvage settings for the management of acromegaly, and also provides an attractive option as definitive management for patients who are not surgical candidates and/or have poor biochemical control or symptoms secondary to mass effect should the adenoma be refractory to pharmacologic management. Multiple single and multi-institutional trials have reported on biochemical control, LC, and toxicity outcomes following SRS with variable follow-up. This study thus aimed to characterize ER, EC, LC, and visual and pituitary toxicities at time-specific to characterize both long-term efficacy and toxicity following SRS. Ten-year ER and EC was estimated to be approximately 56% and 70%, respectively, with 10-year LC exceeding 90% with visual and pituitary toxicities following SRS estimated to be < 3% and 30%, respectively. Visual toxicity was found to be associated with increasing margin prescription dose, with incidence rates with margin prescription doses of 20, 25, 30, and 35 Gy estimated to be approximately 1.2%, 3.3%, 5.3%, and 7.4%, respectively.

As noted above, improving biochemical control rates were noted over time, with 5- and 10-year estimated ER rates of 43.2% and 56.9% and 5- and 10 year EC rates of 55.0% and 69.7%. This temporal rise in both ER and EC was seen across all studies and may be indicative of a gradual benefit over time in biochemical control that requires follow-up over time to confirm. Of note, this finding is not limited to SRS, as a prior study by Minniti, et al., have also noted improving IGF-1 normalization rates over time of 23%, 42%, and 61% at 5-, 10-, and 15-years following conventionally fractionated radiation therapy to 45–50 Gy in 25–28 fractions for patients with persistent or recurrent acromegaly [42]. Moreover, the benefits of EC in acromegalic patients can translate to improvements in medical management, downstream effects on organ systems affected by acromegaly, and overall quality of life.

A prior meta-analysis by Abu Darbh, et al., aimed to compare outcomes following SRS for patients with acromegaly as compared to RT [9]. At last follow-up, the estimated ER rate was 52% with SRS as compared to 36% with RT, though this difference was not found to be significantly different (p = 0.14). These results are similar to the estimated 10-year ER rate of 56.9% (95% CI 47.5–66.4%) from our study. Interestingly, when looking at post-treatment IGF-1 levels, however, SRS was found to result in a statistically significantly higher reduction in IGF-1 levels (− 409.72 ng/mL) as compared to with RT (− 102 ng/mL) (p = 0.002). With regards to toxicity rates, lower rates of hypopituitarism were noted following SRS (32%) as compared to RT (51%), though did not meet statistical significance (p = 0.05). With respect to hypopituitarism rates, we noted a similar incidence of 26.8% (95% CI 16.9–36.7%) following SRS in studies included in our analysis. However, the authors discussed the possibility of selection bias in comparing outcomes with SRS as compared to RT as RT may be selected for larger or bilateral tumor remnants following resection with SRS most often utilized as a treatment modality for smaller tumors (often times < 2.5 cm) with adequate distance from the optic apparatus (generally 2–3 mm) [43]. A prior study examining whole sella SRS as compared to targeted SRS have noted higher rates of hypopituitarism (40.6% vs. 29.7%), albeit not achieving significance [44].

Prior studies have assessed whether certain clinical features may be prognostic with respect to biochemical control. Castinetti, et al., found that median pre-SRS GH (p = 0.01) and IGF-1 levels (p = 0.047) off of somatostatin analogous were significantly lower in patients that achieved biochemical remission (7.1 ng/mL and 495 ng/mL) as compared to patients that remained uncured (25.3 ng/mL and 673 ng/mL) [25]. Pre-SRS GH values have also been noted to be associated with ER by Ronchi, et al. [34]. A multi-institutional cohort study by Ding, et al., noted that cessation of IGF-1-suppressive medications was an independent predictor of durable ER (HR 2.49 (95% CI 1.21–5.11); p = 0.01) with factors found to be associated with recurrence after initial remission being prior resection (HR 0.21 (95% CI 0.06–0.74); p = 0.01) and maximum dose (HR 0.95 (95% CI 0.87–0.99); p = 0.01) [10]. Pollock, et al., have also found that cessation of IGF-1 and GH suppressive medications prior to SRS in addition to lower IGF-1 levels (< 2.25 × the upper normal limit adjusted for sex and age) were associated with improved rates of ER [32]. Another study by Pai, et al., previously noted that the absence of cavernous sinus invasion (p = 0.041) and lower baseline IGF-1 levels (p = 0.019) were significantly correlated with durable ER [30]. However, when examining the impact of pre-SRS IGF-1 levels or GH levels, we did not find that either were associated with durable ER. We were unable to assess the potential impact of the cessation of IGF-1/GH lower medications or cavernous sinus invasion as few studies reported these proportion of patients.

With regards to toxicities, Ding, et al., noted that targeting of the whole sella was associated with a higher risk of hypopituitarism following SRS (p = 0.01) and tumors treated with suprasellar extension were associated with an increased risk of optic apparatus injury (p = 0.02) [10]. Jezkova, et al., have also noted in their cohort of patients that no thyroid or sex hormone deficiencies were noted if the mean pituitary dose was less than 15 Gy and no cortisol deficiencies with mean pituitary doses less than 18 Gy [29]. Other studies have noted that new hypopituitarism may be associated with margin doses exceeding 25 Gy or GTVs greater than 2.5 cc [37]. In comparing SRS to RT, Knappe, et al., have noted lower rates of either thyreotropic of adenocorticotropic hypopituitarism following SRS as compared to RT (HR 0.54 (95% CI 0.3–1.00); p = 0.049), consistent with the findings of the meta-analysis by Abu Darbh, et al. [9, 11].

Of note, experiences thus far reporting on hypofractionated SRS for the management of acromegaly are quite limited [40]. Iwata, et al., have reported the largest study thus far on the use of hypofractionated SRS in 3 or 5 fractions for 52 patients with acromegaly [40]. The 5-year ER rate was noted to be 17.3%, somewhat lower than other series have reported following single fraction SRS, with 5- and 10-year LC of 100% and 82.5%. With regards to toxicities, only 1/52 patients (1.9%) noted SRS-induced hypopituitarism following treatment with no visual toxicities reported. However, further studies are required to better characterize both the efficacy and safety of hypofractionated SRS as compared to single fraction SRS. Other promising avenues potentially include the utilization of proton therapy, particularly with respect to improving the therapeutic ratio with respect to visual or pituitary toxicities, though results following proton therapy with large patient cohorts with robust follow-up are pending [45].

Study limitations

Our analysis has limitations that merit discussion. The studies included in this meta-analysis were conducted at many institutions with differing treatment guidelines resulting in heterogeneity with respect to patient selection criteria and patient-level data that could be not accounted for, such as pre-SRS GH and IGF-1 levels, duration of biochemical control or LC prior to SRS, whether patients had ceased using somatostatin analogues prior to SRS, extent of initial resection (either gross total or subtotal resection), receipt of either prior surgery or radiation therapy, age, performance status, prescription dose, GTV, and extent of disease that was treated (cavernous sinus invasion, optic apparatus contact, and/or suprasellar extension). There also was heterogeneity noted in the estimates of both primary and secondary outcomes. As only one study examined hypofractionated SRS, we were unable to compare clinical outcomes between single fraction and hypofractionated SRS. The definitions for LC, ER, and EC varied to some degree depending upon the era of the study and clinical team that published the work. Similarly, with respect to hypopituitarism, studies varied in defining general hypopituitarism or specific hormonal deficiencies, and as such were not able to provide estimates of hormone specific toxicities. Our analysis also included retrospective experiences of SRS, which also raises the possibility of bias in our effect estimates [46].

Conclusions

SRS resulted in gradually improving ER and EC rates over time that approached 60% and 70%, respectively, 10 years following SRS. Higher margin prescription doses were found to be associated with increased incidences of SRS-related visual toxicity. Further studies are warranted examining the use of SRS for the management of acromegaly patients either after failed resection or as a primary treatment, particularly given encouraging and improving ER and EC rates over time following SRS with longer follow-up.