Introduction

Growing evidence suggests the association of type 2 diabetes mellitus (T2DM) and an increased risk of cancer [1]. Although the underlying mechanisms are not fully understood, potential mechanisms involve overproduction of reactive oxygen species, insulin and insulin-like growth factor receptor-1 [2]. On the other hand, hyperglycemia and concurrent obesity also impair immunosurveillance and blunt antitumor responses [3]. Currently multiple classes of antidiabetic agents are available for the treatment of T2DM, and it is critical that antidiabetic drugs do not increase the risk of cancer.

Native glucagon-like peptide-1 (GLP-1), secreted from intestinal enteroendocrine L cells in response to meal challenges, augments the biosynthesis of insulin and stimulates insulin secretion in a glucose-dependent manner. Complementary to its insulinotropic effects, GLP-1 also suppresses postprandial glucagon excursions, delays gastric emptying and intestinal mobility, induces satiety, and promotes weight loss, exerting a potent glucose-lowering activity. Furthermore, a β-cell-preserving effect of GLP-1 was observed in animal studies [4]. Easily degraded by dipeptidyl peptidase-4, endogenous GLP-1 has a short elimination half-life. Two approaches were developed to potentiate GLP-1 action, including exogenous administration of GLP-1 receptor agonists (GLP-1RAs) with a prolonged duration of action, and pharmacological inhibition of proteolytic degradation [5].

GLP-1RAs, including exenatide, liraglutide, albiglutide, dulaglutide, semaglutide, lixisenatide, and taspoglutide, are a fairly new class of incretin-based antidiabetic drugs indicated for the treatment of T2DM, especially those with poor weight control, showing clinical advantages in improving glycemic control and weight management with minimal risks of hypoglycemia. However, some unfavorable findings in preliminary experiments as well as in signal generation analyses of adverse event reporting system (AERS) database, and a lack of adequate evidence of their long-term safety in humans raised concerns about their pancreatic and thyroid safety profiles [6, 7]. To address this issue, an updated meta-analysis aimed at evaluating the risk of overall cancer, including pancreatic and thyroid cancer respectively, associated with GLP-1RAs, was performed in this study, collecting data from randomized controlled trials.

Methods

This meta-analysis was performed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [8], and was registered with PROSPERO (number CRD42019133712). The PRISMA checklist is provided in Supplementary Table 1.

Data sources and searches

An extensive literature search was performed in PubMed, EMBASE, Cochrane Central Register of Controlled Trials (CENTRAL) and Web of Science, using the main terms “(exenatide OR liraglutide OR lixisenatide OR albiglutide OR dulaglutide OR taspoglutide OR semaglutide OR GLP-1 receptor agonist* OR GLP-1 analog* OR incretin-based therapy* OR incretin mimetic*) AND (T2DM OR insulin-independent diabetes mellitus) AND (neoplasm or neoplasms or cancer or cancers or carcinoma or carcinomas or tumor or tumorsor neoplasia or neoplasias or malignancy or malignancies) to search for relevant studies up to March 18, 2019. Detailed search strategy is listed in Supplementary Table 2. Reference lists of all retrieved trials, reviews, and conference abstracts were searched manually for eligibility to supplement the computer search.

Study selection

All available randomized controlled trials (RCTs) that compared GLP-1RAs with placebo or active comparators in all age groups of T2DM patients, and that reported the incidence of cancers were included. Inclusion criteria: (1) randomized clinical trials comparing a GLP-1RA with a non-GLP-1RA active comparator or/and a placebo, (2) recruiting patients with T2DM only, (3) with durations of at least 52 weeks, (4) providing an estimate on cancers associated with GLP-1RAs use, and (5) published in English. Studies using the fixed-ratio combination containing GLP-1RAs, and those comparing different formulations or subclasses of GLP-1RAs were excluded.

Data extraction and quality assessment

The primary outcome was overall incidence of any cancer defined as ‘Neoplasms benign, malignant and unspecified (including cysts and polyps)’ according to the MedDRA dictionary. Secondary endpoints included incidence of thyroid and pancreatic cancer. Cancer events reported in the publications served as the primary source of information. When multiple reports based on the same population was identified, the most recent or informative one was selected. All other sources, including relevant reviews and pooled analysis reporting results of individual studies, were searched for complementary information on results of published trials, when not available in publications. If cancer events were not reported in the manuscripts, data from the ‘Serious Adverse Events’ section on ClinicalTrials.gov were extracted, and assumed to be zero if not reported on ClinicalTrials.gov. Editorials, letters, articles without treatment-emergent adverse events, and animal experimental studies were excluded. The following data was extracted independently by two reviewers: the first author, publication year, NCT number, trial duration, characteristics of participants (sample size, background therapy, mean age and percentage of women), intervention (type and regimen of GLP-1RAs), comparators, and outcomes of interest. Any discrepancy was resolved by consensus or by the third investigator.

The Cochrane risk of bias tool was used to assess the risk of bias of each study, based on the following aspects: random sequence generation, allocation concealment, blinding, incomplete outcome data, selective outcome reporting, and other source of bias. Two independent reviewers evaluated each item as low, high, or unclear risk of bias.

Jadad score [9, 10] was used to evaluate the methodologic quality of the included studies. The quality scale included randomization (1 point), method to generate randomization sequence was described (computer-generated randomization, or tables of random numbers) (1 point), double blind (1 point), placebo (1 point), numbers and reasons for withdrawal (1 point), and analysis by intent-to-treat (1 point). When more than one reference was found for the same trial, methodological quality assessment was based on the total set of information. A score of ≥4 was considered of good quality.

Data synthesis and analysis

Odds ratios with 95% confidence interval (95% CI) were calculated for all the adverse events aforementioned, using the DerSimonian and Laird method with a random-effects model. We chose the random-effects model because of the relatively small number of component studies and resultant limited validity of tests of heterogeneity. Between-study heterogeneity was calculated by using the chi-square test and quantified by the I2 statistic, with a significance threshold set at p < 0.10. The following prespecified subgroup analyses were performed to explore the source of heterogeneity: (1) types of GLP-1RAs, (2) mean age (≥60 years vs. <60 years), (3) mean percentage of females (≥50% vs. <50%), (4) methodologic quality of study (Jadad score of ≥4 vs. <4), and (5) publication year (recent 5 years vs. recent 5–10 years vs. recent 10–15 years). For primary outcome, a sensitivity analysis was performed by removing one study at each time. Funnel plots and Egger’s linear regression method were used to screen for potential disclosure bias, and the analyses mentioned above were performed with STATA 12.0 (Stata Corporation, College Station, TX, USA).

Results

A total of 1218 citations were retrieved from Pubmed/Embase/CENTRAL/Web of Science. 890 studies remained after automatic and manual de-duplication. Of which, 266 potentially eligible studies were identified by reviewing the titles and abstracts. After retrieving the full text and searching on ClinicalTrials.gov, 37 eligible RCTs were finally identified (Supplementary Fig. 1) [11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47].

The study characteristics are shown in Supplementary Table 3. In total, 64817 subjects were randomly assigned to one of the GLP-1RAs or comparators. The majority of studies (30 out of 37) provided high-quality evidences. The risk of bias was summarized as follows (Supplementary Fig. 2): 15 RCTs provided adequate descriptions of random sequence generation; 28 RCTs reported adequate allocation concealment for treatment assignment; performance bias was high in 13 trials, which were open-label in the core trials; in 34 RCTs, drop-outs were basically balanced across groups and detailed reasons were provided; methods for intention-to-treat analyses and handling missing data were adequately described in 31 RCTs; finally, RCTs were judged as unclear for selective reporting in 20 RCTs, unless cancer events were included as a safety outcome (none of these studies included cancers as outcomes of interest).

Overall cancer

The 37 studies included a safety population of 63,594 patients with T2DM, with a total exposure of 150,001 patient × years (77,888 for GLP-1RAs and 72,113 for comparators), reporting 1342 and 1223 neoplasms in GLP-1RAs and comparators as treatment-emergent serious adverse events, corresponding to a crude yearly rate of 1.72% and 1.70% in GLP-1RA and comparator groups respectively.

Out of the 37 eligible studies, one trial reported zero events of the primary outcome. In the remaining 36 trials which reported at least one case of cancer, GLP-1RAs were not associated with an increased risk of overall cancer in comparison with comparators (OR 1.03 [95% CI 0.95–1.12]; p = 0.41) (Fig. 1). The results of subgroup analysis are shown in Table 1. When the OR for overall cancer with individual GLP-1RA was analyzed, albiglutide was significantly associated with a lower risk of cancer than comparators (OR 0.76 [95% CI 0.60–0.97]; p = 0.03), whereas no significant differences between other GLP-1RAs and comparators were observed in the risk of overall cancer. The heterogeneity among studies was low (I2 range 0–34.5%). In the sensitivity analysis, the results remained consistent to the removal of each study in turn (Supplementary Fig. 3). Furthermore, no obvious bias was revealed by the Egger’s regression test (p = 0.72) and the visual inspection of the funnel plot (Supplementary Fig. 4).

Fig. 1
figure 1

Forest plot of GLP-1 receptor agonists versus comparators on risk of overall cancer. OR odds ratio, CI confidence interval

Table 1 Pre-specified subgroup analyses of the effects of GLP-1 receptor agonists on the risk of overall cancer

Thyroid cancer

Of the 37 trials included, 22 reported zero events of thyroid cancer in both the GLP-1RA and comparator groups. In trials reporting at least one case of thyroid cancer (N = 15 trials), the overall risk of thyroid cancer was not different between GLP-1RAs and comparators (OR 1.49 [95% CI 0.83–2.66]; p = 0.18) (Fig. 2a). The Egger’s test showed no significant disclosure bias for thyroid cancer (p = 0.15).

Fig. 2
figure 2

Forest plot of GLP-1 receptor agonists versus comparators on risk of thyroid cancer (a) and pancreatic cancer (b). OR odds ratio, CI confidence interval

Pancreatic cancer

Sixteen out of 37 trials reported cases of pancreatic cancer. Forty-eight patients treated with GLP-1RAs were diagnosed with pancreatic cancer, compared with 41 patients in comparator groups, yielding an OR of 1.05 [0.68–1.60] (P = 0.83) for pancreatic cancer associated with GLP-1RAs use (Fig. 2b). No obvious disclosure bias was found for the pancreatic cancer, based on the Egger’s test (p = 0.89).

Discussion

Our results, pooling data from RCTs, demonstrated no increased risks of overall, thyroid or pancreatic cancer associated with GLP-1RA therapies, and confirmed the findings of a previous meta-analysis based on a smaller number of trials [48]. Subgroup analysis indicated a protective effect of albiglutide against overall cancer risk. Absence of sufficient studies did not guarantee a comprehensive cancer risk assessment, and therefore the results needs to be interpreted with caution.

Concerns regarding the association between GLP-1RAs use and thyroid cancer, medullary thyroid cancer (MTC) in particular, rose predominantly from murine models. However, species-specific differences in several aspects limited the extrapolation of preliminary findings to humans, including GLP-1 receptor system in C cells and the predisposition to MTC [6, 49, 50]. Moreover, analyses of sequential changes of calcitonin in 5000 participants identified no consistent pattern of relationship between liraglutide and calcitonin concentration [51]. Furthermore, given the rare incidence of MTC, the number needed to treat to appreciate the tumorigenic potential, if any, would be enormous to disclose the difference [52]. Despite the uncertainty, the apparent risk is minimal. On the other hand, MTC accounted only for a small fraction of all thyroid cancers in humans. With respect to other types of thyroid cancer, neither hyperplastic nor papillary human thyroid cancer expressed GLP-1 receptors [53]. Consistently, postmarketing reports did not indicate an elevated risk of thyroid cancer associated with incretin treatment [7].

Another safety concern was a potentially higher risk of pancreatic abnormality. Alarming signals coming from postmarketing reports based on the FDA AERS database, which indicated >6-fold and 2.9-fold excess of risks of pancreatitis and pancreatic cancer, respectively, in patients using exenatide compared with users of other hypoglycemic agents, from 2004–2009 [7]. Indeed, the AERS database exhibited several limitations intrinsic to spontaneous reporting analysis, restricting its clinical implication. Being incomplete and not all independently adjudicated, a reporting bias could not be ruled out. In addition, confounders including alcohol exposure, smoking, diet and comorbidities were not comprehensively taken into consideration in that study [7]. When the time-trend axis was incorporated into the analysis of FDA AERS database from 2004–2009, a significant disproportionality signal of pancreatitis for exenatide appeared only in the first quarter of 2008, right after the FDA alert on exenatide, highlighting a remarkable influence of notoriety bias [54]. For this reason, we also performed a subgroup analysis based on the time of publication, and the results remained consistent across publication year categories.

Evidence collected from randomized trials is generally considered superior to that coming from observational studies, because the process of randomization minimizes confounding bias. The baseline characteristics between groups were highly matched in trials included in this meta-analysis. In fact, concerns also came from autopsy studies, in which increased exocrine preneoplastic lesions and a potential for evolution to neuroendocrine tumors were observed in pancreas of incretin users [55]. However, not only the baseline characteristics of incretin users and nonusers in that study were not matched (e.g., age), but the confounding of comorbidities, limited sample sizes, and no evidence of a previously tumor-free pancreata also blurred the relationship between incretin use and the alteration of pancreatic histology. Moreover, though current data remained conflicting, most cohort studies revealed that incretin treatment was not associated with increased risks of pancreatic events [56, 57].

Though no signal of overall cancer risk associated with GLP-1RA use was detected, the small number of cases prevent further subgroup analyses on other specific types of cancer events. Population-based cohort studies did not indicate a tumor promoter effect of GLP-1RAs with regard to cholangiocarcinoma, colorectal and breast cancer [58,59,60]. Prospective studies with overall tumor or specific ones as primary outcomes would be needed for a reliable assessment of certain cancer risk.

The small sample size suggests caution in the interpretation of the results of the subgroup analysis. On the other hand, while no evidence existed regarding the antitumor effect of albiglutide, preliminary studies showed a growth-inhibiting and apoptosis-promoting effect of liraglutide on human pancreatic cancer cell lines in vitro and in tumor implantation experiments [61]. However, not only the pharmacokinetics, hypoglycemic effect and immunogenicity but also the tolerability and safety of individual GLP-1RA might vary with different molecular structures [62]. Considering the potential existence of mechanism-based safety profiles but a lack of adequate studies, no decisive conclusion can be drawn until further researches concentrated on the anti-tumor activities of albiglutide provide more information.

Our study had several limitations. First, individuals enrolled in RCTs are usually healthier and with less comorbidities than T2DM patients in the general population, being at a lower risk for developing the adverse events of cancer. Moreover, selection bias could not be readily ruled out for some cancer types, in particular MTC (a family or personal history of MTC is a specific exclusion criterion in most RCTs) and pancreatic cancer (patients with a history of pancreatitis or alcohol abuse, being at a higher risk or with low adherence, are usually excluded from trials). Second, none of the studies included oncology events as primary outcomes, resulting in an unclear risk of reporting bias. However, many studies set a panel of oncological experts, who were unaware of the treatment assignments, to adjudicate any cancer event, minimizing potential bias. Third, the durations of the included trials were not long enough for cancer surveillance. Finally, compared with cohort studies, the number of cancer events in this study was relatively small, owing to the nature of RCTs. Also, given the low incidence of cancer events (as is the case of MTC or even pancreatic cancer), the number of participants lost to follow-up is considerably higher than those reporting the outcome of interests, and therefore, an attrition bias could not be easily eliminated. Further studies that provide more conclusive information about the oncological safety of incretin-based therapies in diabetic patients are warranted.

In summary, when pooling data from randomized clinical trials, the risks of overall, thyroid or pancreatic cancer were not significantly different between GLP-1RAs and comparators in T2DM patients.