Introduction

Gastroesophageal cancers occur at a rate of approximately one million cases of gastric cancer and half a million cases of esophageal cancer each year and they represent the fourth and sixth leading cancer sites, respectively [1]. Gastric and esophageal cancers show poor 5-year survival rates of 28 and 17 %, respectively [2]. Esophageal adenocarcinoma is one of the most rapidly growing cancers in the United States [35], Europe [5], and Australia [5]. The poor prognosis of gastroesophageal cancers and the rapid increase in esophageal adenocarcinomas emphasize the need to identify potential measures for the primary prevention of gastroesophageal cancers, in particular esophageal adenocarcinomas.

Recent studies concluded that smoking cessation [6, 7], alcohol avoidance [8, 9], a healthy diet [1013], and obesity and diabetes prevention [1416] may prevent gastroesophageal cancer. Because physical activity is closely associated with those lifestyle factors, particularly with obesity and diabetes, physical activity may plausibly be linked to decreased risk of gastroesophageal cancer. However, the epidemiologic literature regarding a protective effect of physical activity is mixed, with five [1721] of twelve studies [1728] showing an inverse association between physical activity and esophageal cancer and eight [17, 23, 24, 27, 2932] of 21 studies [1719, 2227, 2940] yielding an inverse relation to gastric cancer.

Gastroesophageal cancers show distinct etiologies according to their anatomic site and tumor histologic type [41, 42], suggesting potential differences in their relations with physical activity. However, the associations between physical activity and gastroesophageal cancers according to anatomic site and tumor histologic type have not been comprehensively quantified in a meta-analysis. We therefore conducted a systematic review and meta-analysis of physical activity in relation to gastroesophageal cancers, examining potential variation by anatomic site and tumor histology. A further goal was to perform an exploratory dose–response meta-analysis in a first attempt to produce a physical activity recommendation for the primary prevention of gastroesophageal cancers.

Methods

Literature search

Our systematic review and meta-analysis followed the guidelines concerning preferred reporting items for systematic reviews and meta-analyses (PRISMA) [43]. Two authors (G.B. and M.F.L.) comprehensively searched the literature using PubMed (see Supplemental Material for PubMed search options) and Web of Science to identify published non-ecologic epidemiologic studies quantifying the relation between physical activity and gastroesophageal cancer incidence or mortality. We disregarded studies of cancer survivors. Our search was complemented by a scan of the reference lists of the identified studies. We considered all human research articles published in English through mid December 2013 not classified as review, meta-analysis, editorial, comment, letter, practice guideline, or news. Articles were eligible if they reported a relative risk estimate with a corresponding 95 % confidence interval (CI) or sufficient data to calculate them and if they accounted for age and, if applicable, for sex as potential confounding factors. Our search strategy included the terms physical activity, exercise, cardiorespiratory fitness, cardiovascular fitness, lifestyle, stomach cancer, stomach carcinoma, gastric cancer, gastric carcinoma, esophageal cancer, esophageal carcinoma, cancer, risk, incidence, and mortality (see Supplemental Material for Boolean PubMed search terms). That search yielded 2,209 articles. Ineligible articles were eliminated after screening titles and abstracts (n = 2,139) or full manuscripts (n = 47). Of the 23 remaining studies [1719, 2227, 29, 3140, 4446], two [44, 45] were removed because they were updated in a subsequent analysis [33] and one mortality study [46] was excluded because incidence data [24] from the same cohort were available. Three additional studies were found by manual search in the Web of Science [20, 21, 28] and one study [30] was found in the reference lists of the identified studies. Thus, a total of 24 studies [1740] were included in the meta-analysis.

Data extraction

To assess potential differences in physical activity relations according to tumor histologic type and cancer anatomic site, we preferably extracted risk estimates for gastroesophageal cancer subtypes even if that meant that not all cancer cases could be considered. Specifically, we were unable to include 99 gastric cancers with an unspecified subsite from one study [23], for which no relative risks were reported. Also, we did not include the 49 gastric cardia cancers from another study [38] because risk estimates were unavailable for those cases. We extracted risk estimates for men and women separately whenever possible because men and women were considered independent samples. If studies reported on more than one physical activity domain, estimates for all domains were collected. With respect to different types of physical activity assessments, preference was given for assessments of lifelong physical activity, for assessments of vigorous physical activity, for the most comprehensive physical activity assessment, and for quantitative physical activity assessments. When there was a choice among quantitative physical activity assessments, we used frequency of physical activity because that was the most common quantitative physical activity component measured.

Study quality score

Two previous systematic reviews on physical activity and cancer [47, 48] found that the study quality as assessed by a score proposed by Monninkhof et al. [47] affected the summary risk estimates. Thus, we employed that quality score to assess whether selection bias, misclassification, or confounding affected the summary risk estimates. In addition, we rated the degree of control for confounding by awarding points for including smoking, adiposity, and alcohol intake as major adjustment factors in the multivariate models of the underlying studies.

Random effects meta-analysis

The odds ratios and hazard ratios presented in individual studies were interpreted as relative risk estimates (RRi). These were log-transformed to log(RRi) and their corresponding standard errors were computed as si = di/1.96, with di representing the maximum of [log(upper 95 % CI bound of RRi) − log(RRi)] and [(log(RRi) − log(lower 95 % CI bound of RRi)] to take into account that the log-transformed 95 % confidence interval from one study [21] was not centred at the corresponding log-transformed relative risk. We employed a random effects model using the weighted average of the log(RRi) expressed as wi = 1/(s 2i  + t2), where si represented the standard error of log(RRi) and t2 represented the restricted maximum likelihood estimate of the overall variance. Heterogeneity of the risk estimates was assessed using the Q- and the I2-statistics [49]. Publication bias was tested using funnel plot diagnostics, Begg’s test [50], and Egger’s test [51].

Stratified analyses

We investigated whether the summary risk estimate was affected by cancer major anatomic site (esophageal cancer, gastric cancer), histologic type (esophageal squamous cell carcinoma, esophageal adenocarcinoma), and anatomic subsite (gastric cardia adenocarcinoma, gastric non-cardia adenocarcinoma). We also examined potential differences according to study design (cohort, case–control), study quality score (tertiles), gender (men, women), study geographic region (North America, Europe, Australia, Asia), physical activity domain (recreational, occupational), timing in life of physical activity (recent past physical activity, distant past physical activity, consistent physical activity over time), physical activity assessment (assessment of energy expenditure, assessment of activity duration, assessment of activity frequency, and qualitative physical activity assessments using descriptive categories such as ‘sedentary’, ‘light’, ‘moderate’, or ‘high’ physical activity), and adjustments for smoking (yes, no), adiposity (yes, no), and alcohol intake (yes, no).

Sensitivity analyses

In order to investigate whether our summary risk estimate was affected by the inclusion of multiple risk estimates from a given study, we conducted a sensitivity analysis that included only one risk estimate per gender from each study, choosing the risk estimate with the highest quality score and the gastroesophageal cancer endpoint with the largest number of cases. We conducted a further sensitivity analysis in which we omitted one study at a time from the analysis to examine whether results were affected by an individual study. Because studies used heterogeneous definitions of exposure and reference levels of physical activity, we investigated whether the summary risk estimate changed if the meta-analysis was restricted to studies with comparable definitions of exposure and reference categories. We also tested whether restriction to studies of gastroesophageal cancer incidence changed the summary risk estimate.

Non-linear dose–response meta-analysis

Most studies [17, 19, 24, 29, 30, 32, 34, 3840] based on quantitative physical activity assessments examined the association between frequency of moderate to vigorous recreational physical activity and gastroesophageal cancer risk. Five of those studies [24, 29, 32, 34, 38] provided sufficient information to be included in a non-linear dose–response meta-analysis [52].

All statistical analyses were performed in R [53] using the R-packages ‘metafor’ [54] and ‘mvmeta’ [55]. Risk estimates are reported with 95 % confidence intervals. Statistical significance was based on the 5 % significance level.

Results

Study characteristics

Table 1 presents the main characteristics of the nine cohort and 15 (non-nested) case–control studies of physical activity and gastroesophageal cancer included in the meta-analysis. Those studies comprised a total of 15,745 cases. Ten studies [1719, 2227, 29] examined more than one gastroesophageal endpoint, four studies [19, 23, 37, 39] investigated more than one physical activity domain, four studies [17, 20, 25, 31] presented results stratified by gender, one study [29] presented results stratified by geographic region, and one study [30] reported results stratified by family history of gastric cancer. Thus, the 24 studies reported 61 individual risk estimates.

Table 1 Characteristics of the 9 cohort studies and 15 case–control studies on physical activity and esophageal and gastric cancer risk included in the meta-analysis grouped by study design

All cohort studies examining recent past physical activity in relation to gastroesophageal cancer assessed the usual level of physical activity at baseline [23, 24, 26, 27, 31, 33, 37, 38], whereas case–control studies queried about physical activity performed at cancer incidence [18, 21, 22, 28, 34, 35, 39, 40], one year [32] or 2 years [29] before cancer incidence or before symptoms appeared [30]. By comparison, studies investigating distant past physical activity inquired about the level of physical activity ten or more years prior to baseline [36] (cohort study) or cancer incidence [17, 19, 20] (case–control studies).

Main analysis

We summarized associations according to gastroesophageal cancer anatomic site and histology (Fig. 1). High levels of physical activity showed statistically significant inverse relations to gastric non-cardia adenocarcinoma (RR = 0.72, 95 % CI = 0.62–0.84), gastric cardia adenocarcinoma (RR = 0.83, 95 % CI = 0.69–0.99), and esophageal adenocarcinoma (RR = 0.79, 95 % CI = 0.66–0.94). No statistically significant association was observed for esophageal squamous cell carcinoma (RR = 0.94, 95 % CI = 0.41–2.16). No overall differences in the relations of physical activity to esophageal squamous cell carcinoma, esophageal adenocarcinoma, gastric cardia adenocarcinoma, and gastric non-cardia adenocarcinoma were observed (p-difference = 0.71). Similarly, comparing the relation of physical activity to esophageal adenocarcinoma with that to esophageal squamous cell carcinoma (p-difference = 0.61) and comparing the relation of physical activity to gastric non-cardia adenocarcinoma with that to gastric cardia adenocarcinoma (p-difference = 0.26) revealed no statistically significant differences.

Fig. 1
figure 1

Forest plot of a random effects meta-analysis including 61 risk estimates of gastroesophageal cancer for a high versus low level of physical activity, grouped by anatomic site and histologic type. RR relative risk, CI confidence interval

Combining all 61 risk estimates in a random effects model, we observed a statistically significant 18 % reduction in gastroesophageal cancer risk when comparing high versus low levels of physical activity (RR = 0.82, 95 % CI = 0.74–0.90). A funnel plot (supplementary Figure S1), Begg’s test (p = 0.95), and Egger’s test (p = 0.19) suggested no publication bias. The funnel plot, though, identified one extreme risk estimate [28]. We detected statistically significant heterogeneity between studies (I2 = 76 %, p-heterogeneity < 0.001).

Stratified analyses

The relation of physical activity to gastroesophageal cancer was statistically significantly inverse in case–control studies (RR = 0.77, 95 % CI = 0.67–0.89), whereas it was statistically non-significant in cohort studies (RR = 0.89, 95 % CI = 0.78–1.01; p-difference = 0.18, Fig. 2).

Fig. 2
figure 2

Forest plot of a random effects meta-analysis including 61 risk estimates of total gastroesophageal cancer for a high versus low level of physical activity, grouped by study design. RR relative risk, CI confidence interval, ESCC esophageal squamous cell carcinoma, EA esophageal adenocarcinoma, EC esophageal cancer, GCA gastric cardia adenocarcinoma, GNCA gastric non-cardia adenocarcinoma, GA gastric adenocarcinoma

A direct comparison between women and men revealed a stronger inverse association between physical activity and gastroesophageal cancer in women (RR = 0.60, 95 % CI = 0.48–0.73) than men (RR = 0.87, 95 % CI = 0.77–0.98; p-difference = 0.01) (Fig. 3), but modelling the effect of gender did not materially attenuate between-study heterogeneity (I2 = 72 %, p-heterogeneity < 0.001). In contrast, no statistically significant differences in relations were observed for study quality score, study geographic region, physical activity domain, timing in life of physical activity, physical activity assessment, and adjustments for smoking, adiposity, or alcohol intake (all p-difference > 0.05) (Table 2).

Fig. 3
figure 3

Forest plot of a random effects meta-analysis including 61 risk estimates of total gastroesophageal cancer for a high versus low level of physical activity, grouped by gender. RR relative risk, CI confidence interval, ESCC esophageal squamous cell carcinoma, EA esophageal adenocarcinoma, EC esophageal cancer, GCA gastric cardia adenocarcinoma, GNCA gastric non-cardia adenocarcinoma, GA gastric adenocarcinoma

Table 2 Random effects summary estimates of total gastroesophageal cancer risk for a high versus low level of physical activity by selected study characteristics

In sub-analyses of total esophageal cancer (supplementary Table S1), the relations of distant past physical activity (RR = 0.53, 95 % CI = 0.42–0.67) and consistent physical activity over time (RR = 0.68, 95 % CI = 0.51–0.91) were statistically significant inverse, while that of recent past physical activity was not (RR = 1.01, 95 % CI = 0.70–1.47; p-difference = 0.04).

Sub-analyses examining the relation of physical activity to total gastric cancer (supplementary Table S2) revealed that the inverse association was statistically significant in case–control studies (RR = 0.77, 95 % CI = 0.72–0.82) but not in cohort studies (RR = 0.90, 95 % CI = 0.76–1.06; p-difference = 0.04). Similarly, the inverse association of physical activity to gastric cancer was statistically non-significant in men (RR = 0.95, 95 % CI = 0.84–1.08) but it was statistically significant in women (RR = 0.64, 95 % CI = 0.50–0.84; p-difference = 0.02).

Sensitivity analyses

In a sensitivity analysis, we generated a summary risk estimate that included only one risk estimate per study and gender, using the risk estimate with the highest quality score for the gastroesophageal cancer endpoint with the largest number of cases. The 30 selected risk estimates are printed in bold in Table 1. We found no difference between that summary risk estimate (RR = 0.82, 95 % CI = 0.70–0.95) and our summary risk estimate from the main analysis (RR = 0.82, 95 % CI = 0.74–0.90; p-difference = 0.96).

We tested for influential studies by removing one study at a time from the meta-analysis and observed no material differences in the summary risk estimates for total gastroesophageal cancer, for total esophageal cancer or for total gastric cancer. Similarly, we observed no material changes in the modifying effects of study design and gender on the physical activity and gastroesophageal cancer relation. After excluding the study by Dar et al. [28] from the main analysis, the previously observed between-study heterogeneity was substantially attenuated (I2 = 51 %, p-heterogeneity < 0.001) and the inverse association between physical activity and esophageal squamous cell carcinoma became statistically significant (RR = 0.66, 95 % CI = 0.46–0.96). After additionally including terms for study design and gender in the model, the heterogeneity between studies was no longer apparent (I2 = 0 %, p-heterogeneity = 0.08).

We identified four studies [23, 24, 32, 34] of recreational physical activity and gastroesophageal cancer whose exposure and reference levels could be considered comparable. Those studies compared participants engaging in 2 h per week or more of recreational physical activity with those not regularly engaging in recreational physical activity. The summary risk estimate combining those studies (RR = 0.79, 95 % CI = 0.69–0.91) did not statistically significantly differ from the risk estimate that was based on all studies of recreational physical activity (RR = 0.78, 95 % CI = 0.72–0.85; p-difference = 0.97) or from the risk estimate from our main analysis (RR = 0.82, 95 % CI = 0.74–0.90; p-difference = 0.80).

When we excluded two studies on gastroesophageal cancer mortality [17, 33] and three studies using fatal and non-fatal gastroesophageal cancers as a combined endpoint [26, 31, 36], we obtained a summary risk estimate of RR = 0.83 (95 % CI = 0.77–0.89), which was comparable to that of the main analysis (p-difference = 0.62).

Dose–response meta-analysis

We conducted a random-effects dose–response meta-analysis to further explore the relation between frequency of moderate to vigorous physical activity and gastroesophageal cancer (Fig. 4). The best fitting dose–response model was given by

$$ RR = { \exp }(a_{1} dose^{2} + a_{2} dose^{3} ), $$

where a 1 = −0.051, a 2 = 0.007, var(a 1) = 1 × 10−4, cov(a 1a 2) = −2 × 10−5, and \( var(a_{2} ) = 3 \times 10^{ - 6} \), and no heterogeneity between studies was observed (I2 = 8 %, p-heterogeneity = 0.36). The J-shaped dose–response relation indicated that a maximal reduction in gastroesophageal cancer risk of 33 % was attained by engaging in moderate to vigorous physical activity at a frequency of five times per week (RR = 0.67, 95 % CI = 0.58–0.79) as compared to not engaging in moderate or vigorous physical activity.

Fig. 4
figure 4

Dose-response analysis of frequency of moderate to vigorous physical activity in relation to total gastroesophageal cancer

Discussion

We conducted a random effects meta-analysis of physical activity in relation to gastroesophageal cancer and found that high versus low physical activity was associated with risk decreases of 17–28 % for esophageal adenocarcinoma, gastric cardia adenocarcinoma, and gastric non-cardia adenocarcinoma, whereas no relation was detected for esophageal squamous cell carcinoma. However, apparent differences in risk across tumor subtypes were not statistically significant. When combining gastroesophageal cancer subsites, we observed a statistically significant 18 % reduction in total gastroesophageal cancer risk with high versus low levels of physical activity. Our dose–response meta-analysis showed that the greatest risk reduction of 33 % was achieved by engaging in moderate to vigorous activity at a frequency of five times per week.

We initially observed substantial between-study heterogeneity, which was no longer apparent after removing an influential study [28] and additionally including terms for study design and gender in the model. The authors of that influential, large hospital-based case–control study from Kashmir, India [28] compared high versus low levels of occupational physical activity and reported an odds ratio of 5.65 (95 % CI = 3.49–9.12) for esophageal squamous cell carcinoma but speculated about the possibility of residual confounding by socioeconomic status. After excluding that study [28], we found that physical activity was related to a substantial risk reduction of 34 % for esophageal squamous cell carcinoma.

Numerous biologic mechanisms potentially mediate the observed inverse association between physical activity and gastroesophageal cancer. For example, chronic inflammatory processes, such as Helicobacter pylori infection [56], gastroesophageal reflux [57], Barrett’s esophagus [58], obesity [14] and type 2 diabetes mellitus [15, 16] support gastroesophageal carcinogenesis, while factors that reduce chronic inflammation, such as regular aspirin use [59, 60] and dietary fiber intake [10, 11] are associated with reduced gastroesophageal cancer risk. In a similar vein, physical activity may inhibit gastroesophageal carcinogenesis by reducing chronic inflammation [61] and alleviating metabolic disorders associated both with chronic inflammation and gastroesophageal cancer, such as insulin resistance [62] and elevated insulin-like growth factor 1 levels [63, 64]. Physical activity may also prevent gastroesophageal carcinogenesis by decreasing oxidative stress [65] and improving DNA repair [66].

The observed consistency in risk reduction with physical activity across gastroesophageal cancer subtypes suggests that physical activity represents a shared protective factor for these cancers. That gastroesophageal cancers show common risk factors despite distinct etiologies is supported by observations of quite similar risk increases across gastroesophageal cancer subtypes of 7–16 % for each increment of 10 pack-years of smoking [67] and risk decreases of about 35–45 % for regular aspirin use [59, 60] and intakes of dietary fiber [10, 11]. Also, exposure to alcohol intake [8, 9, 68] and pickled foods [69, 70] has been positively related to all gastroesophageal cancer subtypes.

We examined whether obesity mediated the inverse relation of physical activity to risk of gastroesophageal cancer by comparing risk estimates that were adjusted for adiposity with those that were not. We noted that the inverse association between physical activity and gastroesophageal cancer was only modestly attenuated when the meta-analysis was restricted to datasets that were adjusted for adiposity. This suggests that the biologic mechanisms by which physical activity decreases risk for gastroesophageal cancer are only partly mediated through its effects on weight control. That the etiologic pathways linking increased physical activity to decreased risk of gastroesophageal cancer are distinct from those associated with weight control is supported by widely divergent associations between obesity and gastroesophageal cancer histologic types. Notably, obesity shows an approximate 30–50 % reduced risk for esophageal squamous cell carcinoma [71], whereas it exhibits a substantial increased risk for esophageal and gastric cardia adenocarcinomas in the order of 100 % [14].

The inverse association between physical activity and gastroesophageal cancer was more pronounced in case–control studies than cohort studies. Possible explanations for this particular constellation of findings are preferential selection of physically active controls, physical activity under-reporting among cases, or superior physical activity assessment in case–control studies than cohort studies. The gastroesophageal cancer risk reduction of 11 % with high versus low physical activity obtained from cohort studies likely represents the more conservative risk estimate than the 23 % risk reduction generated by case–control studies.

We observed a stronger inverse association between physical activity and gastroesophageal cancer in women than men. The modifying effect of gender was statistically significant for total gastroesophageal cancer and for total gastric cancer but not for total esophageal cancer, for which case numbers were limited. Apart from chance, one possible explanation for this finding is inhibitory effects of estrogen on gastroesophageal cancer growth [72, 73]. Support for a protective role of circulating estrogen levels also comes from observational studies showing an inverse association between hormone replacement therapy and gastroesophageal cancer [74].

We noted that distant past physical activity and consistent physical activity over time showed stronger inverse relations with total esophageal cancer than recent past physical activity. Other than chance, one possible explanation for this observation is that distant past or long-term consistent physical activity may better capture the relation of physical activity to gastroesophageal cancer, if one exists. Future studies should confirm whether distant past or consistent physical activity has greater potential for esophageal cancer prevention than recent past physical activity.

One potential limitation of the present meta-analysis is that a causal relation for the observed inverse association between physical activity and gastroesophageal cancer could not be established because no intervention study was available for inclusion. In addition, the investigation of the relation of physical activity to gastroesophageal cancer by anatomic subsite and histologic type was limited because only nine [18, 20, 21, 2325, 28, 29, 38] of the 24 studies provided relevant data on cancer anatomic subsite and histologic type. Information on esophageal squamous cell carcinoma was provided by only five studies [18, 20, 21, 24, 28], which may partly explain the observed statistically non-significant inverse association with physical activity in the main analysis. Furthermore, we lacked sufficient information to assess whether the relation of physical activity to gastric cancer was modified by H. pylori infection status. One previous study [23] found no effect modification of the association between physical activity and gastric cancer by H. pylori infection status.

This first meta-analysis of physical activity and gastroesophageal cancer also has a number of important strengths. We included a large number of studies, which yielded a substantial number of cases and enabled us to evaluate different subtypes of gastroesophageal cancers according to anatomic site and tumor histologic type. We used standardized criteria for identifying pertinent studies and abstracting key information. Our meta-analysis differs from a previous meta-analysis of physical activity and gastric cancer [75] in that we (1) included five additional gastric cancer studies with 1,054 additional cases of gastric cancer [17, 33, 36, 39, 40]; (2) examined physical activity in relation to esophageal cancer and assessed differences according to gastroesophageal cancer histologic types; (3) generated a quantitative physical activity recommendation for the primary prevention of gastroesophageal cancers based on an exploratory dose–response meta-analysis; (4) used random effects meta-regression to identify and remove the sources of between-study heterogeneity in the published data; (5) explored for potential differences between studies of gastroesophageal cancer incidence and those of gastroesophageal cancer mortality; and (6) summarized risk estimates using comparable exposure and reference categories in a sensitivity analysis, thereby confirming the robustness of our primary findings.

In conclusion, our comprehensive meta-analysis provides support for an inverse relation of physical activity to risk of gastroesophageal cancer. We estimate that high versus low physical activity decreases the risk of gastroesophageal cancer by approximately 18 %. Our dose–response meta-analysis suggests that engaging in any amount of moderate to vigorous physical activity is beneficial and that a frequency of five times per week of physical activity is associated with a risk reduction of 33 %. Future research is required to discern which specific types and durations of physical activity are needed for gastroesophageal cancer risk reduction. High-quality epidemiologic studies that employ standardized physical activity assessments and uniform definitions of high versus low physical activity levels are warranted.