Introduction

Coffee is the second most common beverage in the world after tea. Thus, any health effect of coffee is an important issue of public health. Besides caffeine, coffee contains many bioactive compounds with potential effects on health [1], including minerals and antioxidants, mainly phenolic compounds (such as chlorogenic, caffeic, ferulic and cumaric acids), melanoidins and diterpenes (such as cafestol and kahweol). Coffee has been related with lower incidence of diabetes [2], stroke[3] and cancer at selected sites [48].

To date 25 prospective studies considered the quantitative relation between coffee and mortality for all causes [933]. Among these, 21 found an inverse or null association (including the most recent paper based on the large NIH-AARP Diet and Health Study[32]), one cohort study of US Seventh-day Adventists found a significant positive association [16], and three studies found a significant increased risk only in men [13, 26, 33].

An estimate of the relation between cardiovascular disease (CVD) mortality and coffee was provided by 22 studies [10, 11, 1416, 1822, 25, 27, 29, 30, 32, 3440]. Most of these found no relation or a moderate inverse association, significant in 4 studies [1820, 32], while 4 studies found an increased risk [10, 14, 16, 40]. The relation of coffee with mortality for coronary/ischemic heart disease (CHD/IHD) was considered in 17 studies, reporting inconsistent results [10, 11, 1416, 18, 19, 23, 26, 29, 30, 3540]. Of the 8 studies considering stroke mortality [10, 20, 26, 29, 30, 32, 39, 40], 5 found no relation [26, 29, 30, 32, 39].

Estimates of the relation between coffee and all cancers mortality have been provided from 13 prospective studies, overall reporting no relation [12, 15, 16, 20, 22, 23, 25, 27, 3032, 34, 41].

Since most studies included a few deaths and were possibly underpowered to detect an association, we combined all published data on the relation of coffee with mortality from all cause, CVD, CHD/IHD, stroke and all cancers, using a meta-analytic approach.

Methods

Data sources and search

We carried out a PubMed search in January 2013 using the string ‘(prospective OR cohort) AND (fatal OR death OR mortality) AND (hot beverages OR coffee OR caffeine)’, following the Meta Analysis of Observational Studies in Epidemiology (MOOSE) guidelines [42]. A similar strategy was used to search Embase. The searches were limited to human studies written in English. A flow chart of the procedure to select studies is shown in Fig. 1. From the 889 and 2,274 articles retrieved on Pubmed and Embase, three authors (SM, AT, FT) separately selected the studies reporting data on the association between coffee and all cause and/or selected cause-specific mortality, including deaths for CVD, CHD/IHD, stroke and all cancers. Discrepancies between review team members were discussed and resolved.

Fig. 1
figure 1

Flow-chart of the selection of studies included in the meta-analyses. RR relative risk

Study selection

Most studies were not focused on the relation between coffee and mortality, and were no longer considered. Of the 92 publications selected, 58 were not included for at least one of the following reasons: (a) did not report original results from cohort studies; (b) did not provide relative risks (RR) and the corresponding confidence intervals (CI) or information sufficient to calculate them; (c) considered together the disease incidence and mortality; (d) analyzed special populations (e.g. diabetics, people with heart conditions); (e) investigated patients with specific cancers (we included only studies considering total cancer mortality). The reference lists of the remaining 34 articles were checked to obtain other pertinent publications, and 2 additional report was identified [26]. Of the 36 articles selected, 8 were excluded since they reported information on caffeinated drinks [43, 44], or were duplicate reports on the same population [45, 46], or had a number of deaths below 100 [47], or had the upper cutpoint of the reference category too high [10, 12, 34], since for the present analyses we considered as reference category ≤1 cup/day of coffee consumption. No study was excluded a priori for weakness of design or data quality, and we did not assign quality scores to the studies.

Thus, the present analyses were based on 28 prospective studies whose main characteristics are reported in eTable 1 [9, 11, 1324, 2633, 35, 3741]. We summarized decisions and operations on original data for each study in the column “notes”.

Data extraction

We abstracted from the selected studies the following information in a standard format: study design, country, number of subjects (total number of deaths, total cohort size or the number of person-years of follow-up), in general and for each category of coffee consumption, duration of follow-up (years of enrolment and study end, and mean/median duration of follow-up), sex, RR for the categories of coffee intake and the corresponding CIs, type of coffee, outcomes considered and covariates adjusted in the model. Discrepancies between the authors were discussed and resolved.

When a study did not report the RR or the CI, but reported sufficient data to calculate them (i.e. the number of cases and non-cases for each category of coffee consumption), we estimated the RRs and the corresponding 95 % CIs from raw data [9, 11, 23, 26]. When a study reported the adjusted RR but not the corresponding CI, we calculated the CI of the crude RR, and related them to the adjusted RR [14, 18]. If a study provided more than one RR, the one adjusted for the largest number of confounding factors was included in the analysis. When a study reported data on both regular and decaffeinated coffee, data were collected separately [22, 24, 27, 32].

Data synthesis and analysis

All studies reported data on coffee consumption measured in cups/day. As reference category (defined as low drinking) we used either no consumption [17, 18, 21, 22, 25, 26, 30, 32, 37, 38], or the lowest category of coffee consumption considered in each paper, i.e. occasional drinking[20, 23, 27, 33, 40, 41] or drinking up to 1 cup/day [11, 14, 16, 19, 24, 28, 31, 35, 39].

For all cause mortality, we pooled the RRs for the study-specific highest category of coffee consumption versus low drinking, using random-effects models, which consider both within and between study variation. Since smoking is a major confounder of the possible relationship between coffee consumption and mortality for several causes, we excluded from all further analyses studies reporting RRs not adjusted for smoking [9, 11, 23, 26, 35]. Thus, analyses for cause-specific mortality were carried out considering only smoking-adjusting studies.

We selected studies reporting three or more categories of coffee consumption and computed the pooled RRs for an increment of coffee of 1 cup/day. To obtain the information on the dose–response relationship we used the method proposed by Greenland and Longnecker [48], which estimates the study specific RRs for the increment of one unit of consumption by relating the natural logarithm of the RR to the corresponding mean value of intake across exposure categories. This method takes into account that estimates of risk for successive levels of intake are correlated. Open-ended upper categories were considered of the same amplitude as the previous one. The overall RR for an increment of 1 cup/day of coffee was then obtained by pooling the corresponding study-specific RRs with random-effects models.

For all causes and CVD mortality, we computed the pooled RRs for consumption of coffee of ≤3 cups/day (moderate) and >3 cups/day (heavy) versus low drinking. When more than one RR estimate in a study fell in the range considered for moderate or heavy coffee drinking, we first pooled for each study the corresponding estimates using the Hamling et al. method [49]. For all cause and CVD mortality we also computed the pooled RRs including only studies whose highest category of coffee was of at least ≤4 cups/day [1315, 1719, 22, 24, 25, 2729, 31, 32, 37, 39].

Heterogeneity among studies was assessed using the χ2 test, and was defined as a p value <0.10. Moreover, potential heterogeneity was quantified through the I2 statistic, which describes the percentage of total variation across studies due to heterogeneity, rather than chance [50]. Usually low heterogeneity is defined by values of the I2 statistic <25 %, while values of such statistic >75 % are indicative of high heterogeneity.

Publication bias was assessed by funnel plot and by the Begg’s and Mazumdar’s [51], and Egger’s tests [52]. All the statistical analyses were performed using the STATA software (version 11; StataCorp, College Station, TX, USA).

Results

Mortality from all causes

Figure 2 shows the RRs of all cause mortality for the study-specific highest versus low (≤1 cup/day) coffee drinking categories for 23 prospective studies. The pooled RRs were 0.88 (95 % CI 0.84–0.93, p for heterogeneity <0.001, I2 = 68.7 %) for all studies, 0.88 (95 % CI 0.80–0.97) for the 17 studies considering men, and 0.86 (95 % CI 0.82–0.90) for the 13 studies reporting the relation in women.

Fig. 2
figure 2

Total and sex-specific relative risks (RRs) of the relation of all cause mortality with coffee intake. The pooled RRs were calculated using the random-effects models. RR for the study-specific highest coffee drinking category versus low drinking (<1 cup/day) from all the studies combined. *Non smoking adjusting studies

Table 1 shows the combined RRs of all cause mortality for the highest versus low coffee category in strata of selected study characteristics. The pooled RR from the 4 studies with RRs not adjusted for tobacco smoking was 0.91 (95 % CI 0.81–1.01). The corresponding RR for the 19 studies adjusting for tobacco smoking was 0.87 (95 % CI 0.82–0.93), similar in men and women. Among smoking adjusting studies, the pooled RRs were 0.92 (95 % CI 0.85–1.00) for studies conducted in North America, 0.85 (95 % CI 0.76–0.95) for those conducted in Europe, and 0.76 (95 % CI 0.63–0.91) for studies conducted in Japan (p for heterogeneity among RRs in geographic regions = 0.138). The pooled RR was 0.88 (95 % CI 0.83–0.93) for alcohol adjusting papers and 0.81 (95 % CI 0.61–1.08) for those non adjusting for alcohol (p for heterogeneity = 0.577). When we included in the meta-analysis only studies whose reference category was non-drinking, the pooled RR for the highest category of intake was 0.87 (95 % CI 0.83–0.94, p for heterogeneity = 0.42).

Table 1 Combined relative risks (RRs) for the highest versus low coffee drinking for all cause mortality and coffee drinking according to selected study characteristics

There was no evidence of publication bias in the 19 smoking adjusting studies, as tested using Egger’s (p = 0.114) and Begg’s (p = 0.118) tests.

The pooled RR for a consumption increment of 1 cup/day of coffee was 0.97 (95 % CI 0.96–0.98, p for heterogeneity <0.001, I2 = 80.7 %), obtained pooling the RRs of the 17 smoking adjusting studies considering at least three categories of coffee consumption (Fig. 3).

Fig. 3
figure 3

Total and sex specific relative risks (RRs) of the relation of all cause mortality with coffee intake, including only smoking adjusting studies. RR for an increment of coffee drinking of 1 cup/day. The pooled RRs were calculated using the random-effects models

Four studies considered the relation of total mortality with decaffeinated coffee [22, 24, 27, 32]. The pooled RRs for the study-specific highest versus low (≤1 cup/day) decaffeinated coffee intake were 0.90 (95 % CI 0.82–0.99) in both sexes combined, 0.86 (95 % CI 0.80–0.92) in men and 0.83 (95 % CI 0.76–0.91) in women.

Mortality from CVD

The meta-analysis on the relation of overall CVD mortality with coffee included 17 smoking adjusting studies reporting quantitative estimates for CVD and/or CHD/IHD and/or stroke mortality, whose main characteristics are reported in eTable 1. There was no evidence of publication bias overall, as tested using Egger’s (p = 0.963) and Begg’s (p = 0.981) tests.

The combined RR for the study-specific highest versus low (≤1 cup/day) categories of coffee was 0.89 (95 % CI 0.77–1.02, p for heterogeneity <0.001, I2 = 75.7 %) for all subjects, 0.92 in men and 0.84 in women (Fig. 4a). The pooled RR for a consumption increment of 1 cup/day of coffee was 0.98 (95 % CI 0.95–1.00, p for heterogeneity <0.001, I2 = 87.8 %), from 16 studies (Fig. 4b). The corresponding RRs were 0.99 (95 % CI 0.95–1.03) in men, and 0.95 (95 % CI 0.92–0.98) in women.

Fig. 4
figure 4

Total and sex specific relative risks (RRs) of the relation of cardiovascular (CVD) mortality with coffee intake, including only smoking adjusting studies. The pooled RRs were calculated using the random-effects models. a RR for the study-specific highest coffee drinking category versus low drinking (≤1cup/day). b RR for an increment of coffee drinking of 1 cup/day

Table 2 reports the pooled RRs on mortality for CHD/IHD and stroke separately, for the study-specific highest category of coffee drinking compared with low drinking and for an increment of 1 cup/day of coffee. The RRs for CHD/IHD were respectively 0.95 (95 % CI 0.78–1.15, 1.09 in men and 0.77 in women) and 1.00 (95 % CI 0.97–1.04). The corresponding RRs for stroke were 0.95 (95 % CI 0.70–1.29, 0.72 in men and 1.21 in women) and 0.97 (95 % CI 0.94–1.01). The pooled RRs for the study-specific highest category of coffee drinking compared with low drinking were 0.92 (95 % CI 0.73–1.14) for studies conducted in North America, 0.85 (95 % CI 0.73–0.99) for those conducted in Europe, and 0.89 (95 % CI 0.51–1.56) for studies conducted in Japan (p for heterogeneity among RRs in geographic regions = 0.846). The RRs were 0.84 (95 % CI 0.73–0.96) when we pooled the studies adjusting for alcohol intake, and 0.96 (95 % CI 0.77–1.20) among non alcohol adjusting studies (p for heterogeneity = 0.315).

Table 2 Combined relative risks (RRs) for the highest versus low coffee drinking and for the increment of 1 cup/day of coffee, in smoking adjusting studies considering mortality for cardiovascular disease (CVD), in strata of type of CVD and other study characteristics

Mortality from all cancers

The meta-analysis on the relation between all cancer mortality and coffee was based on 10 smoking adjusting studies, whose main characteristics are reported in eTable 1. There was small evidence of publication bias overall, as tested using Egger’s (p = 0.067) and Begg’s (p = 0.075) tests.

The pooled RR for the study-specific highest versus low (≤1 cup/day) categories of coffee was 1.03 (95 % CI 0.97–1.10), with no significant heterogeneity among studies (p for heterogeneity = 0.647, I2 = 0 %); the corresponding RRs were 1.07 in men and 0.98 in women (Table 3). Nine studies included three or more categories of coffee intake. Based on these studies, the combined RR for an increment of 1 cup/day of coffee was 1.00 (95 % CI 0.99–1.01), similar in men and women.

Table 3 Combined relative risks (RRs) of the relation between all cancers mortality and coffee drinking for the highest versus low coffee drinking and for the increment of 1 cup/day of coffee in smoking adjusting studies

The pooled RRs for the study-specific highest versus low coffee intake were 1.05 (95 % CI 0.99–1.13, p for heterogeneity = 0.844) among 4 studies conducted in North America, 0.79 (95 % CI 0.39–1.57, p for heterogeneity = 0.555) among 2 studies conducted in Europe, and 0.94 (95 % CI 0.80–1.09, p for heterogeneity = 0.357) among 4 studies conducted in Japan (p for heterogeneity among RRs in geographic regions = 0.329). We found pooled RRs for the study-specific highest versus low coffee intake of 1.05 (95 % CI 0.98–1.11, p for heterogeneity = 0.968) among 7 studies adjusting for alcohol intake, and 0.84 (95 % CI 0.60–1.16, p for heterogeneity = 0.128) among 3 studies non adjusting for alcohol intake (p for heterogeneity = 0.192).

Additional analyses on the dose-risk relation for all cause and CVD mortality

Based on smoking adjusting studies, compared with low drinking (≤1 cup/day), the pooled RRs for all cause mortality were 0.91 (95 % CI 0.88–0.94) for coffee drinking ≤3 cups/day and 0.87 (95 % CI 0.82–0.92) for drinking >3 cups/day (Table 4). When we selected studies whose highest category of coffee was at least ≤4 cups/day, the pooled RR was 0.87 (95 % CI 0.83–0.91, p for heterogeneity = 0.12) from 14 studies, with no differences between sexes. The corresponding RRs for CVD mortality were 0.87 (95 % CI 0.79–0.96, 0.92 in men and 0.80 in women) for drinking ≤3 cups/day, 0.88 (95 % CI 0.77–1.02, 0.92 in men and 0.80 in women) for drinking >3 cups/day. When we selected studies whose highest category of coffee was at least ≥4 cups/day, the pooled RR was 0.86 (95 % CI 0.77–0.97, 0.90 in men and 0.80 in women).

Table 4 Relative risks (RRs) for coffee drinking and mortality for all causes and CVD in smoking adjusting studies, according to level of coffee consumption

Discussion

The present meta-analysis, carried out on 23 prospective studies, found a 12 % decreased risk of all cause mortality for the highest coffee intake compared with an intake of up to 1 cup/day among smoking-adjusting studies. The risk was 3 % significantly lower for an increment of consumption of 1 cup/day of coffee, and the inverse association was stronger for heavy than moderate coffee drinking, suggesting a dose-risk relation. Among smoking-adjusting studies, only 4 studies included in the meta-analysis found RR >1 in men [13, 14, 33]. The RRs for the highest versus the lowest category of coffee were below unity, although not significantly, also in the two studies excluded from the meta-analysis because they had a reference category >1 cup/day. In those studies the RRs were 0.95 (95 % CI 0.86–1.05) for drinking ≥7 cups/day compared with ≤2 cups/day [12], and 0.82 (95 % CI 0.48–1.4) in men and 0.61 (95 % CI 0.33–4.15) in women for drinking ≥5 cups/day compared with <5 cups/day[10] Another study, excluded from the meta-analysis since it was based on 162 subjects and 53 deaths, [47] reported a RR of 0.35 (95 % CI 0.20–0.64) for the highest tertile of intake (>2 cups/week) compared to the lowest one (<1 cup/week), with a dose-risk relation.

The exclusion of any study from the analysis did not materially change the summary estimates, and p values from Begg’s and Egger’s tests, together with the absence of significant asymmetry in the funnel plot, indicate that publication bias is unlikely to have appreciably influenced our results. We could not stratify for smoking status, as only 4 papers reported separate estimates for never, former and current smokers [14, 17, 27, 32], and, of these, two studies did not report confidence intervals [14, 17]. The two papers reporting all the information showed a tendency towards a lower RR in never smokers than in current smokers [27, 32]. The existence of an inverse association was further supported by the consistent results of several sensitivity analyses and by the consistency of results across studies conducted in Asia and in North America/Europe, where coffee consumption is higher.

We found an inverse association of borderline significance between coffee and CVD, accordingly with the results of a meta-analysis of fatal and non fatal CHD [53], although the two meta-analyses shared only 7 studies. In fact, differently from the present meta-analysis, the previous one included also studies not adjusting for tobacco smoking and those considering as endpoints also incident for CVD. Moreover, the present meta-analysis included other 10 original studies, [20, 21, 25, 27, 29, 30, 32, 3840] 8 of which published after the Wu et al. [53] meta-analysis.

For CVD mortality, the favorable effect of coffee appeared similar to that for all cause mortality, although there was no clear dose-risk relation. For fatal stroke, we did not confirm the finding of a protective effect of coffee on stroke incidence reported in a few prospective studies [5456], though the summary RR was below unity in the present meta-analysis, too.

The effect of coffee on CVD is probably complex and possibly a combination of unfavorable and favorable effects of caffeine and other coffee components on the cardiovascular system. Among unfavorable effects, caffeine has inotropic effect on the heart, and induces a transient increase in blood pressure [57]. However, habitual coffee drinking of >3 cups/day was not associated with an increased risk of hypertension, compared with <1 cup/day [58], and also in hypertensive subjects coffee had no long-term effects and did not increase CVD risk [59]. As for cholesterol levels, coffee has been related to cholesterol increase in Nordic countries [60], but not in other populations [61], and a meta-analysis of 14 clinical trials showed increased serum levels of cholesterol and low density lipoprotein (LDL) only for boiled unfiltered coffee [62]. In fact, diterpenes (such as cafestol and kahweol), responsible for this effect, are mostly bound to coffee filters, and the small amount crossing the filters has no effects on serum lipids [62], including high density lipoproteins (HDL) and triglycerides [63]. Among favorable effects on CVD risk, coffee decreased the risk of diabetes incidence[2] and might decrease CVD risk in diabetic patients [64, 65]. Caffeine has been associated with lower body weight and with decreased platelet aggregation [66]. Coffee has been associated with a lower risk of incident gout [67], higher adiponectin levels in Japanese males [68], inhibition of inflammation in a prospective study of postmenopausal women [22], and beneficial effects on subclinical inflammation in a clinical trial [69]. Moderate doses of coffee have no unfavorable effect on heart failure [70], valvular heart disease [38], and cardiac arrhythmias [71]. Moreover, coffee is very rich in antioxidants[72] which may have a general favorable effect on health.

The favorable effect of coffee on total mortality is however not fully explainable through the effect of coffee on CVD mortality. Favorable effects of coffee on health may also include decreased risk of diabetes mortality [32], and of suicide or injury and accident mortality [17, 32, 73], through its favorable effects on well-feeling and alertness.

In agreement with most evidence, we found no association of coffee with all cancer mortality. In a sensitivity analysis, when considering the 6 studies whose highest category of consumption was at least ≥4 cups/day, we found a RR of 1.04 (95 % CI 0.98–1.11, 1.08 in men and 1.01 in women). An inverse association has been reported by a meta-analysis considering the relation of coffee with incidence of total cancer [74], and by meta-analyses considering incidence of cancer at selected sites, such as cancer at the oral cavity and pharynx [7, 8], liver [4], endometrium [5], which, however, are relatively infrequent. An inverse relation of coffee intake with incidence of colorectal cancer was found in case–control [75], but not in cohort studies [6, 76].

A significant heterogeneity across studies was found. A possible source of heterogeneity is that type of coffee powder, brewing methods, preparation and cup size vary widely in different populations and this may modify the relation, since coffee content of several antioxidants and lipids varies depending on the type of coffee powder (Arabica or Robusta), roasting, and beverage preparation [77]. However, most studies did not report such information. The amount of coffee intake was self-reported. However, recall of coffee drinking has been shown satisfactorily reproducible and valid [78, 79]. Other possible sources of variability are the arbitrary classification of consumption with different cut-points in various studies, and the fact that studies included in these meta-analyses did not consider genetic variants associated with coffee drinking, such as CYP1A2, which is responsible for more than 95 % of caffeine metabolism, and consequently of the effects of caffeine and coffee consumption [80]. However, no heterogeneity emerged in the analyses considering more homogeneous categories of coffee intake, such as non-drinking as reference category or drinking ≥4 cups/day as the highest category, in the absence of material variations of the pooled RRs. With reference to confounding, the major avoidable risk factors for all cause, CVD and cancer mortality also associated with coffee drinking are tobacco smoking and alcohol drinking [81]. Although we excluded from most analyses studies reporting risk estimates not adjusted for tobacco smoking, we cannot exclude residual confounding, which, however, should weaken any inverse association. The pooled RRs were similar in studies adjusting or not for alcohol intake, further suggesting a real inverse relation between coffee intake and total mortality. The lack of adjustment in a few early studies for other relevant confounding factors, such as education, body mass index, or physical activity, might also have somewhat affected our results.

We were unable to consider the relation between caffeine and mortality because most published studies reported only the number of cups of coffee, with no information on total caffeine intake. However, although caffeine intake depends on the variety of raw coffee and the preparation method, and it is also found in cola, energy drinks and several drugs, coffee drinking is its major source and total caffeine intake is strongly correlated with coffee intake in Europe and North America [82]. The inverse association found for decaffeinated coffee with total mortality suggests, however, that substances other than caffeine are responsible of the favorable effect of coffee on mortality. The observation of an inverse relation between decaffeinated coffee and total mortality is based on few studies [22, 24, 27, 32]; moreover, drinkers generally consume low amount of decaffeinated coffee, and in most studies it was not possible to separate drinkers of both regular and decaffeinated coffee from drinkers of decaffeinated coffee only.

A more relevant possible source of bias is reverse causation, since it is possible that healthier people consume more coffee than those with chronic diseases associated with higher mortality rates. However, against this possibility is the fact that the overall estimates were derived only from prospective studies with information collected long time before death, and in most original studies subjects with cancer or CVD at baseline were excluded.

Although bias and confounding were unlikely responsible for the observed association, given the consistent results in different countries, settings and time, the interpretation of the inverse association of coffee with all cause and CVD mortality remains open to discussion, also considering the lack of information on the relation with duration and other time-related factors.