Introduction

Colorectal cancer is the third most common cancer worldwide, with more than 1.93 million new cases of colorectal cancer incidence, and more than 935,000 deaths from colorectal cancer in 2020 [1]. In addition, the World Health Organization (WHO) estimated that in 2040, the global incidence of colorectal cancer rises to more than 3.15 million new cases and more than 1.62 million deaths from colorectal cancer [2]. The trends in colorectal cancer incidence and mortality are related to the current level of human development and might be due to adopting more Western diets and lifestyles [3].

Nuts and legumes may be considered key components of healthy diet patterns. Nuts and legumes play an important role in plant foods characterized by the Mediterranean diet due to their favorable nutrient profile [4]. Nuts included not only tree nuts, but also a wide range of nuts including peanuts. Peanuts are actually legumes, but they are often identified by consumers as part of the nuts. Nuts are a rich source of vegetable proteins, unsaturated fatty acids, vitamin B-6, vitamin E, selenium, fiber, folic acid, and other phytochemicals [5,6,7]. Legumes also are good sources of phytochemicals, protein fiber, and some micronutrients [4, 8]. In addition, legumes are generally low in fats, except for soybean [4]. Several previous observational studies reported that nuts and legumes were associated with a lower risk of colorectal cancer [9,10,11,12,13,14,15,16,17].

Three meta-analyses on nuts consumption and cancer risk have been previously conducted and included some results for the association between nuts consumption and colorectal cancer risk as part of subgroup analysis by cancer type [7, 18, 19]. However, no comprehensive meta-analysis was conducted to quantitatively assess the association between nuts consumption and the risk of colorectal cancer. For legumes consumption, there was a previous meta-analysis of colorectal cancer risk, which also included studies of legume fiber consumption [20]. In addition, the World Cancer Research Fund International/American Institute for Cancer Research (WCRF-AICR) conducted a meta-analysis of legumes consumption and colorectal cancer [21], which included a few studies only.

Thus, we systematically reviewed and performed a comprehensive meta-analysis of all observational studies to quantitatively evaluate the association between the consumption of nuts and legumes and risk of colorectal cancer.

Methods

Literature search and study selection

Studies published up to January 2022 were searched from PubMed and ISI Web of Science electronic databases, and the searches were limited to articles published as written in English and full-length. The search strategy included the following keywords: “(nut OR almond OR cashew OR tree nut OR peanut OR pecan OR pine nut OR pistachio OR macadamia OR hazelnut OR walnut OR brazil nut OR legume OR soy OR bean OR pea OR soybean OR tofu OR soymilk OR pulse OR lentils OR miso OR natto) AND (colorectal OR colon OR rectal OR rectum) AND (cancer OR neoplasm OR carcinoma OR tumor)”. In addition, we supplemented by a manual search of reference lists of retrieved articles and reviews to identify additional qualified studies. The searches were limited to articles published as written in English and full-length. Inclusion criteria were as follows: (1) case–control or cohort studies; (2) studies that reported the association between the consumption of nuts or legumes and the risk of colorectal cancer; (3) studies that reported relative risks (RRs) and confidence intervals (CIs). If more than one article reported the results from the same study, we selected the study which included more cases. In addition, we excluded a study that had no adjustment for any confounder [9].

Data extraction

Data extraction was conducted independently by two investigators (S.J. and Y.J.) using the meta-analysis of observational studies in epidemiology (MOOSE) guidelines [22], and any disagreements were addressed by checking the original reports and discussion. The following information was extracted from each study: first author’s last name, year of publication; country and study name; study design; follow-up period or study period; baseline age; sex; number of cases and controls/participants or person-time; each category of nuts or legumes consumption; RRs and 95% CIs for all categories of nuts and legumes consumption; adjustment for potential confounders. If studies reported several RRs, we used the RR that reflected maximally adjusted for potentially confounding variables. If some studies assessed more than one type of legumes products, we used the RR that was the most representative of overall legumes consumption and legumes that were the most commonly consumed.

Statistical analysis

To estimate pooled RRs and its 95% CIs for the highest versus lowest category of nuts and legumes consumption, we combined a natural logarithm of the RR from the original study, using the random-effects models by DerSimonian and Laird, which incorporate both within- and between-study variations [23]. If the study separately reported by cancer site, we combined the two results using a fixed-effect model to obtain an overall estimate of colorectal cancer or colon cancer first and then combined with other studies [10, 16, 17, 24,25,26,27,28]. In addition, if the study separately reported according to sex [11, 14, 16, 24, 26, 29,30,31,32,33], different ages [34], race [35] or family history [36, 37], we also combined the two results using a fixed-effect model to obtain an overall estimate first and then combined with other studies. To investigate whether the association between consumption of nuts and legumes and risk of colorectal cancer differed by study design (cohort/case–control), sex, cancer site (colon/proximal colon/distal colon/rectal), geographical region (America/Europe/Asia/Oceania), we conducted a subgroup analysis when separated data were available.

Linear dose–response analyses using the generalized least-squares trend (GLST) estimation method by Greenland and Longnecker were conducted to estimate study-specific slopes across categories of nuts and legumes consumption [38,39,40]. We used the median value for each exposure category of nuts and legumes consumption. If the upper category was open-ended, we assumed the same amplitude as the previous category. Studies with less than 3 exposure categories or missing data on the number of cases and participants for each exposure category were excluded from the dose–response analysis. For studies that reported exposure units other than grams per day, we defined one serving as 28 g for nuts and 100 g for legumes according to the standard of the previous study converting these into grams per day [18, 41].

Statistical heterogeneity across the included studies was assessed using the Q statistic [42], and inconsistency was quantified by I2 statistic [43]. Sensitivity analyses were conducted by excluding each study at a time. Potential publication bias was evaluated with Begg’s [44] and Egger’s tests [45]. To detect the effect of possible missing studies on the overall effect, we used Duval and Tweedie trim-and-fill methods [46]. A two-tailed P value < 0.05 was considered statistically significant. All statistical analyses were performed by using Stata/SE version 14.2 Software (StataCorp, College Station, TX, USA).

Results

Study characteristics

For nuts consumption, a total of 13 studies including 8 prospective cohort studies [12, 14, 29, 47,48,49,50] with 9546 cases and 5 case–control studies [10, 17, 34, 35, 51] with 2914 cases were included in the meta-analyses (Fig. 1). For legumes consumption, a total of 29 studies including 16 prospective cohort studies [12, 14, 15, 26, 28, 30,31,32, 36, 52,53,54,55,56,57] with 13,631 cases and 13 case–control studies [10, 11, 13, 16, 24, 25, 27, 33, 35, 37, 51, 58,59,60] with 7275 cases were included in the meta-analyses. The characteristics of the included studies are summarized in Tables 1 and 2. By geographic region, regarding nuts consumption, seven studies were performed in America, three studies in Europe, and three studies in Asia. Regarding legumes consumption, ten studies were performed in America, four studies in Europe, fourteen studies in Asia, and one study in Oceania. Most of the studies adjusted for age, total energy intake, alcohol consumption, smoking, body mass index (BMI, kg/m2), and physical activity.

Fig. 1
figure 1

Flow diagram of the study selection

Table 1 Characteristics of the prospective cohort/case–control studies included in the meta-analysis of nuts consumption and colorectal cancer
Table 2 Characteristics of the prospective cohort/case–control studies included in the meta-analysis of legumes consumption and colorectal cancer

Nuts consumption and colorectal cancer

A total of thirteen studies including 12,460 cases and 926,327 participants investigated the association between nuts consumption and risk of colorectal cancer (Table 1). The pooled RR for highest versus lowest categories of nuts consumption was 0.84 (95% CI: 0.71–0.99), with some evidence of heterogeneity (I2 = 82.4%, P < 0.001) (Fig. 2, Table 3). No significant associations were found when stratified by study design or sex, and the meta-regression analysis showed no significant differences (Pdifference > 0.5 for all comparisons). By cancer site, a significant inverse association was shown in colon (RR = 0.78; 95% CI: 0.63–0.96) and rectal cancer (RR = 0.71; 95% CI: 0.51–0.98). Based on the meta-regression analyses, there was no significant difference with cancer site (Pdifference > 0.7 for all comparisons). By geographic region, however, there was some difference in RRs. The inverse association between nut consumption and colorectal cancer risk was only observed in Asia (RR = 0.44; 95% CI: 0.29–0.68), while no association was found in America (RR = 1.01; 95% CI: 0.92–1.11) (Pdifference for America vs. Asia = 0.003) or Europe (RR = 1.02; 95% CI: 0.84–1.25) (Pdifference for Europe vs. Asia = 0.003). Furthermore, no heterogeneity was observed in the studies stratified by region. Regarding adjustment for confounders, there was no significant difference with energy intake, alcohol intake, smoking, BMI, physical activity, or dietary factors (Pdifference > 0.4 for all comparisons). Nine studies [12, 17, 29, 35, 47, 48, 50] were included in the dose–response analysis for nuts consumption and risk of colorectal cancer. A 28 g/day (1 serving/day) increment of nuts consumption was associated with a 33% lower risk of colorectal cancer (RR = 0.67; 95% CI: 0.45–0.98).

Fig. 2
figure 2

Forest plot of colorectal cancer risk for the highest versus lowest categories of nuts consumption

Table 3 Summary of pooled relative risks (RR) of colorectal cancer risk for nuts consumption

Legumes consumption and colorectal cancer

A total of twenty-nine studies including 20,906 cases and 1,688,603 participants investigated the association between legumes consumption and risk of colorectal cancer (Table 2). The pooled RR of colorectal cancer for the highest versus lowest categories of legumes intake was 0.90 (95% CI: 0.83–0.98), with some evidence of heterogeneity (I2 = 56.5%, P < 0.001) (Fig. 3, Table 4). In the stratified analysis by study design, we found no significant association, and the meta-regression analysis showed no significant differences (Pdifference = 0.31). By sex, a significant inverse association was found in women (RR = 0.86; 95% CI: 0.75–0.99), but not in men (RR = 0.93; 95% CI: 0.84–1.02) (Pdifference = 0.64). By cancer site, we found a significant inverse association in colon (RR = 0.89; 95% CI: 0.82–0.96) and rectal cancer (RR = 0.82; 95% CI: 0.70–0.94). By geographic region, there was a significant inverse association in Asia (RR = 0.88; 95% CI: 0.77–0.999) and Oceania (RR = 0.59; 95% CI: 0.35–0.98), while there was no significant association between in America (RR = 0.90; 95% CI: 0.78–1.04) and Europe (RR = 1.00; 95% CI: 0.85–1.19). In the meta-regression analysis, however, the pooled RRs were not significantly different with geographic region (Pdifference for America, Europe, or Oceania vs. Asia = 0.84, 0.30, and 0.24, respectively). By adjustment for confounders, we found no significant difference with energy intake, alcohol intake, smoking, BMI, physical activity, or dietary factors (Pdifference > 0.1 for all comparisons). Fourteen studies [12, 15, 16, 27, 28, 30, 35, 36, 53, 56,57,58, 60] were included in the dose–response analysis for legumes consumption and risk of colorectal cancer. A 100 g/day (1 serving/day) increment of legumes consumption was associated with a 21% lower risk of colorectal cancer (RR = 0.79; 95% CI: 0.64–0.97).

Fig. 3
figure 3

Forest plot of colorectal cancer risk for the highest versus lowest categories of legumes consumption

Table 4 Summary of pooled relative risks (RR) of colorectal cancer risk for legumes consumption

Publication bias

For the analysis of nuts consumption and risk of colorectal cancer, there was no evidence of publication bias in Egger’s (P value for bias = 0.06) and Begg’s test (P value for bias = 0.09). For the analysis of legumes consumption and risk of colorectal cancer, Egger’s test suggested some evidence of bias (P value for bias = 0.02), but not in Begg’s test (P value for bias = 0.21). When we used the trim-and-fill method to examine the influence of potential publication bias, the pooled RR did not alter, indicating that the results were not affected by publication bias.

Discussion

The present meta-analysis of observational studies assessed the association between nuts and legumes consumption and colorectal cancer risk. This meta-analysis indicated that people in the highest category of nuts and legumes consumption had a decreased risk of colorectal cancer by 16% and 10%, respectively, compared with those in the lowest category. In addition, we found the inverse association tends to be stronger in Asia, especially for nuts consumption. The results of the dose–response analysis also supported these associations.

A few previous meta-analyses have been conducted to assess the association between nuts consumption and cancer risk, and the association of nuts consumption with colorectal cancer risk was examined in the form of subgroup analysis only [7, 18, 19]. For legumes consumption, two meta-analyses were conducted to examine the association with colorectal cancer risk [20, 21]. One of the meta-analyses from the WCRF-AICR continuous update project included six studies, and due to the limited number of studies, it could not conduct a stratified analysis [21]. Another meta-analysis of legumes consumption also included studies of legume fiber consumption [20]. In the present meta-analysis, we included twenty-nine studies, and thus could conduct a stratified analysis. In addition, we included studies with legumes consumption only, and also conducted a dose–response analysis of nuts and legumes consumption.

There was some evidence of heterogeneity among the studies in the meta-analysis of nuts and legumes consumption and the risk of colorectal cancer overall. For the results of nuts consumption, the observed heterogeneity among the studies tended to disappear when stratified by geographic region. The International Nut and Dried Fruit Council Foundation (INC) reported that consumption of nuts varies by region, depending on the type of nuts [61]. Europe was the largest consumer of tree nuts, but in terms of peanut consumption, per capita, peanut consumption in Nigeria and China was much higher than in other countries [61]. In addition, in many countries, nuts were not depicted in food guides or not mentioned in brief guides or other available descriptions of food classification [62], so the consumption patterns of nuts in different countries would be different. We observed an association between nuts consumption and colorectal cancer risk only in Asia. Although it is difficult to make the definitive explanation for this result, it may be because Asians have a lower risk of nut allergy than people in Western countries [17, 63], which leads to different dietary patterns. For the type of nuts, American and European studies contained more variety of nut types than Asian studies. One Asian study only included peanut products [14], and the other Asian study only included pine nuts, peanuts, and almonds [17], while the study by Hoshiyama et al. did not mention the type of nuts [10]. The different types of nuts consumption among the studies might also explain the different results by region. However, the inverse association of nuts consumption found in Asian studies came from only three studies, and most of them were case–control studies, which are susceptible to methodological biases. For the results of legumes consumption, although the observed heterogeneity tends to disappear in colon cancer when stratified analysis by cancer site, heterogeneity could still be observed in rectal cancer. We cannot clearly explain the reasons for the observed heterogeneity, but we speculate that it may be due to the diversity of legumes. We have usually eaten dried legumes, which have matured and dried on the plant, but there are also several legumes eaten as a vegetable when they are green or sprouted, such as green beans, runner beans, and beansprouts, etc. [64]. In addition, legumes can be consumed as soy products such as tofu, soy milk, and natto, etc. Dry legumes are higher in protein than most other plant foods and are typically high in carbohydrates and dietary fiber [64], although legumes are low in fat, except for soybeans [4, 64]. On the other hand, it may be because the consumption of legumes varies by region. In East Asian diets, soybeans are the main source of phytoestrogen isoflavones, and the average isoflavone intake of the Asian population is almost 10 times higher than in Western countries [65, 66]. Therefore, more studies are needed to determine subgroups of nuts and legumes consumption in relation to colorectal cancer risk.

Several potential mechanisms of nuts and legumes consumption may reduce the risk of colorectal cancer. Nuts and legumes are a dietary source of fiber, and dietary fiber could bind and excrete potential luminal carcinogens, such as secondary bile acids and reduce fecal pH in the colon, thereby playing a critical role in colorectal cancer prevention [4, 5, 67]. In addition, high-fiber diets may decrease insulin resistance, which is a risk factor for colorectal cancer [64]. Nuts contain phytochemicals, such as resveratrol and quercetin, which can decrease the inflammatory process and prevent inflammation-induced tumors [5, 68]. Legumes also contain phytochemicals, which are isoflavones, representative anti-cancer components. Isoflavones have a structure similar to estrogen and selectively bind to estrogen receptors (ER) [66, 69]. When estrogen is deficient, phytoestrogens can exert an estrogenic effect [66]. Moreover, a previous meta-analysis reported that the ER-β protein protects against carcinogenesis and colorectal cancer development when activated by estrogen, indicating the potential for ER-β to act as a tumor suppressor [66, 70].

The present study had some advantages. It is the first meta-analysis to assess the association between nuts consumption and the risk of colorectal cancer. In addition, a relatively large number of studies allowed us to conduct subgroup analyses by study design, sex, cancer site, and geographic region, as well as the linear dose–response meta-analysis. The present meta-analysis included the most recent prospective cohort and case–control studies and the largest number of study participants. In addition, most of the studies included in the analysis were adjusted for confounding factors such as age, sex, energy intake, alcohol intake, smoking, BMI, and physical activity.

Despite these advantages, several potential limitations of our meta-analysis should be considered. First, the current meta-analysis included several case–control studies in addition to cohort studies, and therefore, potential methodological biases, including selection bias or recall bias, might be considered. The inverse association of nuts consumption found in three Asian studies came mostly from case–control studies, so we should be interpreted with caution. Second, some misclassification of nuts and legumes consumption may exist, which influences the results of individual studies and thus pooled estimates in this meta-analysis. In addition, the types of nuts and legumes consumed throughout each study varied widely. This wide range includes peanuts and peanut butter, which are legumes, but often identified by consumers as part of the nut, and some studies did not report detailed types of nuts and legumes consumption. Third, the cut-offs for the consumption of nuts and legumes in the highest and lowest categories varied among the studies. To address this limitation, we conducted a linear dose–response meta-analysis as well. Fourth, for the analysis of legumes consumption and risk of colorectal cancer, we observed evidence of bias from Egger’s test, but it was not observed in Begg’s test. The difference in the results obtained by the two methods might be because of the greater statistical power of the regression method [71]. In addition, we conducted the trim-and-fill method, which showed that the results were not affected by publication bias. Finally, when we conducted the stratified analysis by the adjustment for confounders, it tended to show weaker associations when limited to studies with strong adjustment. Moreover, the number of well-adjusted cohort studies are still limited at present.

In conclusion, the results of the current meta-analysis suggest that the evidence for an association between nuts and legumes consumption and the risk of colorectal cancer is currently weak. We observed that the association tended to be weak when stratified by adjustment for confounders. Thus, further well-designed prospective cohort studies, especially well-adjusted for confounders, on different types of nuts and legumes consumption and different types of study populations are warranted.