Introduction

The incidence of colorectal cancer (CRC) has increased in Europe in the last decades [1], being the second most diagnosed cancer in 2012 [2]. According to the scientific evidence, 40–50% of CRC cases are attributable to modifiable risk factors such as diet, physical activity and body weight [3, 4], providing major opportunities for prevention. The current evidence, points to a possible protective effect of foods containing dietary fiber and calcium against CRC [5, 6] and a detrimental effect of red and processed meat [6, 7] and alcohol consumption [6, 8, 9].

Despite the importance of these findings for individual foods, some authors suggest that the evaluation of the effects of full dietary patterns might be more appropriate, since it allows the exploration of the effect of food and nutrient interactions in disease [10,11,12]. Many indexes have been developed in the last years that evaluate dietary quality against predefined criteria [13, 14] and a recent metaanalysis found an inverse association between a high score for these indexes and cancer mortality and/or incidence [15]. However, these indexes are based on results in the area of cardiovascular disease and they refer to a theoretical diet that do not necessarily reflect the eating habits of a particular population. Moreover, moderate alcohol intake is positively considered in most of these indexes although alcohol consumption as low as one drink per day increases the risk of several tumors, including colorectal cancer [6]. In fact, some authors suggested that the lack of concordance of the results found for diet quality indexes and cancer might be due to their positive scoring for alcohol consumption [16]. As an alternative approach, dietary patterns that accurately represent the diet in a specific population can be identified with statistical methods like principal component analysis. These patterns also present the advantage of being extracted independently of disease associations, which allows exploration of the role of actual dietary habits in different health conditions. The scarce existing results for data-driven dietary patterns and CRC, indicate a possible protective effect of the so called Mediterranean/Healthy/Prudent dietary pattern [17,18,19,20,21,22] on CRC and a harmful effect of a pattern labelled as Western/Unhealthy diet [17,18,19,20,21,22,23], but the evidence is still insufficient.

A recent Spanish study on female breast cancer (BC)—EpiGEICAM—identified three data-driven dietary patterns [24] labelled as Western (associated with increased risk of BC), Prudent (not associated with BC) and Mediterranean (protective against BC). EpiGEICAM presents the novelty of being able to identify, over the same population, two different patterns (Prudent and Mediterranean) commonly interchanged in the literature [9, 18, 20, 21, 23, 25]. However, these patterns do not always represent the same dietary habits and the differences might be determinant in their association with disease risk, as it was the case for BC in the EpiGEICAM context [25]. Therefore, the replication of these patterns in different populations and the exploration of their association with tumors other than BC are of great scientific interest. In fact, the reproducibility of the patterns found has already been assessed in a different sample of Spanish women [26] and similar associations have been observed for other tumors and individuals. The detrimental effect of a high adherence to the Western dietary pattern has been corroborated for breast cancer [27] and also observed for gastric [28] cancer. These studies also show different results for the Prudent (null effect) and Mediterranean (protective) patterns in the case of breast [27], prostate [29] and gastric cancer [28].

The objective of the present study is to assess if the associations found between these dietary patterns—Western, Prudent and Mediterranean—with breast [24, 27], prostate [29] and gastric cancer [28] risk in our country are also observed for CRC risk, and to evaluate possible differences by sex and cancer site.

Patients and methods

The multicase-control MCC-Spain study [30] recruited between 2008 and 2013 histologically confirmed incident cases of five tumors: breast, prostate, colorectal, stomach and chronic lymphocytic leukemia. Cases were recruited in 23 hospitals from 12 provinces and a single set of population controls, frequency matched by age and sex with the overall distribution of cases in each province, were randomly selected from the lists of residents assigned to primary health-care centers belonging to the same catchment area of each collaborating hospital. MCC-Spain recruited 2140 histologically confirmed CRC cases and 3950 population-based controls from 11 of the 12 contributing provinces. These numbers, represented the 64% of the CRC cases and the 53% of controls invited to participate (supplementary material, Figure S1). Potential participants had to be able to answer the questionnaire, had to live in the study area for at least 6 months before the diagnosis (cases) or at recruitment (controls) and had to be 20–85 years. Cases were identified as soon as possible after the diagnosis, and histologically confirmed incident cases of colon (ICD10 codes C18: malignant neoplasm of colon and D01.0: Carcinoma in situ of colon) or rectum (ICD10 codes C19: Malignant neoplasm of rectosigmoid junction; C20: Malignant neoplasm of rectum; D01.1: Carcinoma in situ of rectosigmoid junction and D01.2: Carcinoma in situ of rectum) cancer with no prior history of the disease and diagnosed within the recruitment period were included. They were classified according to the localization of tumor in proximal colon (including caecum, ascending and transverse colon and hepatic and splenic flexures), distal colon (including descending and sigmoid colon) and rectal cancer. When more than one tumor in different locations were diagnosed at the same time, the site in which the tumor was more invasive was assigned.

Information on socio-demographic factors, lifestyle and personal/family medical history was collected with a questionnaire administered by trained personnel in a face-to-face interview. Subsequent telephone contact was made to complete missing values on key variables. Height and weight at different ages were self-reported and diet was assessed with a 154-items semi-quantitative food frequency questionnaire (FFQ), which was based on an instrument validated in Spain [31]. Dietary information referred to the previous year before diagnosis (cases) or before interview (controls).

In the present study, three dietary patterns identified in a previous Spanish case-control study (EpiGEICAM) on female breast cancer (BC) [24] are examined: A western dietary pattern positively associated with BC risk that is characterized by high intakes of high-fat dairy products, processed meat, refined grains, sweets, caloric drinks, convenience food and sauces and by low intakes of low-fat dairy products and whole grains; A Prudent pattern not related to BC that represented high intakes of low-fat dairy products, vegetables, fruits, whole grains and juices; and a Mediterranean pattern that seemed to be protective and denoted a high intake of fish, vegetables, legumes, boiled potatoes, fruits, olives and vegetable oil—mainly olive oil (72%), and olives (22%) in our context-, and a low intake of juices. These three dietary patterns were identified in the EpiGEICAM sample in 2014 by grouping the dietary intake information collected with a 117 FFQ into 26 inter-correlated food groups and applying principal components analysis (PCA) without rotation of the variance–covariance matrix over these 26 food groups [32]. This method defines a set of weights (pattern loadings) associated with each food group that represents the correlation between food consumption and the component/pattern scores. Pattern loadings can be used to reproduce such patterns in other samples as explained in detail elsewhere [25, 26]. Briefly, we grouped 146 of the 154 items of the MCC-Spain FFQ (excluding non-caloric and alcoholic beverages) into 26 food groups defined in the EpiGEICAM study (see Table 1 for detailed information on the composition of food groups and their weight in the patterns). Afterwards, the scores of adherence to the Western, Prudent and Mediterranean dietary patterns of the MCC-Spain participants were calculated as a linear combination of the pattern loadings for each food group and dietary pattern extracted from the EpiGEICAM study [24] (Table 1) and the food group consumption reported by the MCC-Spain participants.

Table 1 Composition of food groups based on the food frequency questionnaire of the MCC-Spain study and component loadings for each pattern identified in the EpiGEICAM study [24]

After describing the data, crude and adjusted associations between adherence to each dietary pattern and CRC risk were evaluated using logistic regression models with random province-specific intercepts. As fixed-effects terms, caloric and alcohol intake, self-reported body mass index (BMI) and physical activity [metabolic equivalents (METs)] during the 10 years before diagnosis (cases)/interview (controls), age, smoking status, education, family history of CRC and sex were considered as potential confounders. Scores of adherence were analyzed both, as categorical (grouping the scores of adherence into quartiles of their distribution among controls) and continuous variables (1-standard deviation increase taking into account the dispersion among controls). Heterogeneity of the effects by sex was tested including in the models an interaction term between the score of adherence and sex. Multinomial logistic regression models were used to evaluate the association of the adherence to the Western, Prudent and Mediterranean dietary patterns with proximal colon, distal colon and rectal cancer separately. These models were adjusted by the same set of variables described before but including province of residence as a fixed effect term.

Finally, assuming a causal relationship between the adherence to each of the patterns and CRC risk for all analyses, the population attributable fraction (PAF%) was calculated using Hanley’s J.A. formula [33] to estimate the proportion of total cancer in the population that hypothetically would not have occurred if all participants were in the optimal quartile of adherence to the dietary patterns (first quartile for Western and fourth quartile for Prudent and Mediterranean dietary patterns). Confidence intervals for PAF were computed using bootstrap with 500 iterations.

Analyses were performed using STATA/MP (version 14.1, 2015, StataCorp LP) and statistical significance was set at 2-sided p < 0.05.

Results

Among the initially recruited participants, 3509 (89%) controls and 1889 (88%) cases reported data on diet. Cases that provided dietary information later than 6 months after diagnosis were excluded (n = 260). Therefore, 1629 cases and 3509 controls aged 22–85 years were included in the study (supplementary Figure S1). The multivariable analyses were carried out over 1530 cases and 3240 controls, because data on either BMI (< 5%), physical activity (< 1%), smoking status (< 1%), total energy (< 2%) or alcohol intake (< 2%) was missing for 99 cases and 269 controls.

Compared to controls, CRC cases were more adherent to the Western (p < 0.001) and Mediterranean (p = 0.015) dietary patterns and reported higher energy (p < 0.001) and alcohol (p = 0.001) intake. CRC cases were also older (p < 0.001) and reported higher BMI (p < 0.001) and lower levels of physical activity (p < 0.001). The proportion of former smokers (p < 0.001), males (p < 0.001), participants with no formal education (p < 0.001) or with family history of CRC (p < 0.001) was also higher among cases (Table 2).

Table 2 Description of scores of adherence to Western, Prudent and Mediterranean dietary patterns and other baseline characteristics for colorectal cancer cases and controls

Results from Table 3 revealed a positive association between a high adherence to the Western dietary pattern and global CRC [ORfourth(Q4) vs. first(Q1)quartile (95% CI): 1.50 (1.20;1.87)] risk that was similar among males and females (p-interaction = 0.615). Once the difference in calorie intake was taken into account, a high adherence to the Mediterranean pattern appeared to be protective [ORQ4 vs. Q1 (95% CI): 0.65 (0.53;0.80)], with this effect slightly stronger among females [ORQ4 vs. Q1 (95% CI): 0.56 (0.40;0.77)] than among males [ORQ4 vs. Q1 (95% CI): 0.71 (0.55;0.92)], though the p value for the heterogeneity of the linear effects was not significant (p-interaction = 0.733). Assuming a causal relationship between diet and CRC risk, the estimations indicate that 1/4 and 1/5 of CRC cases could have been prevented if all the participants had been in the lowest category of adherence to the Western and in the highest category of adherence to the Mediterranean dietary patterns, respectively.

Table 3 Association between colorectal cancer incidence and the scores of adherence to Western, Prudent and Mediterranean dietary patterns and attributable fractions for all participants and by sex

Stratified results by tumor subtype (Table 4) also indicate a detrimental effect of a high adherence to the Western dietary pattern over CRC risk that seems to be confined to distal colon [ORQ4 vs. Q1 (95% CI): 2.02 (1.44;2.84)] and rectal tumors [ORQ4 vs. Q1 (95% CI): 1.46 (1.05;2.01); p-heterogeneity = 0.087], while the protective effect of the Mediterranean dietary pattern was similar for all tumor sites [Proximal colon: ORQ4 vs. Q1 (95% CI): 0.70 (0.51;0.97); Distal Colon: ORQ4 vs. Q1 (95% CI): 0.65 (0.48;0.89); Rectum: ORQ4 vs. Q1 (95% CI): 0.60 (0.45;0.81); p-heterogeneity = 0.746]. In agreement with these results, it was estimated that more than 1/3 of distal colon and 1/4 of rectum tumors could have been prevented if all the study participants were in the lowest quartile of adherence to the Western dietary pattern and 1/5 for distal colon and 1/4 for rectum tumors could have been prevented with the highest adherences to Mediterranean dietary pattern.

Table 4 Adjusted odds ratios for the association between proximal colon, distal colon and rectal cancer incidence and scores of adherence to Western, Prudent and Mediterranean dietary patterns

A high adherence to the prudent pattern did not show an association with CRC risk.

Discussion

The detrimental effect of a high adherence to the Western dietary pattern for breast [24, 27] and gastric [28] cancer and the differential effect of a high adherence to the Prudent (null) and to the Mediterranean (protective) dietary patterns over breast [24, 27], prostate [29] and gastric cancer [28] identified in the previous studies was also found for CRC in the present work. Concretely, we found that a high adherence to the Western dietary pattern might increase CRC risk in both males and females and that such risk might be confined to distal colon and rectal cancer. Also, a high adherence to the Mediterranean dietary pattern showed a general protective effect against CRC that was very similar among males and females and for all cancer sites. On the contrary, the adherence to the Prudent dietary pattern was not associated to CRC.

Some recent reviews and metaanalysis [9, 19, 22], also report a positive association between a high adherence to the Western dietary pattern and CRC risk and a protective effect of a diet rich in fruits, vegetables, legumes and/or fish. The studies published after these reviews, also report a positive association of a high adherence to the Western dietary pattern with global CRC risk [18, 20, 21, 23] and a possible protective effect of a Healthy diet against this tumor [18, 20, 21]. In agreement with our results, some authors conclude that the effect of the Western and Healthy diet might be stronger for distal colon and rectal cancer [21, 22] or indicate stronger effects of the Western diet in distal colon tumors [9]. Only a few of these studies provide information of a possible interaction between diet and sex [18, 20, 21] and none of them report significant differences. Similarly, the current evidence for index based dietary patterns point to a detrimental effect of pro-inflammatory diets (similar to our Western pattern) for CRC risk [34] and a protective effect of diets that share common characteristics with our Mediterranean pattern against this type of tumor [34, 35]. One of the most important findings of the present study is the difference in the associations found for Prudent and Mediterranean dietary patterns. To understand these differences, we explored the associations of CRC risk with individual food groups (supplementary Table S1). We believe that the protective effect of the Mediterranean pattern against the null effect of the Prudent might be greatly explained by the protective effect of oily fish, nuts and olives and olive oil, only present in the Mediterranean pattern, but also by the detrimental effect of juices intake, only included in the Prudent pattern, that might counteract the positive effect of a high consumption of fruits, vegetables and whole grains characteristic of this pattern.

Some biological mechanisms support the associations found. On the one hand, the “Western”-like diet high in fat, refined grains, red and processed meats and sweets has been associated with higher levels of inflammatory markers [36] and with inflammation-related chronic diseases [37]. Moreover, the high content of iron in meat products present in this pattern generates free radicals that attack DNA and damage the tissue [38]. In addition, processing meat at high temperatures produces carcinogens such as N-nitroso and polycyclic aromatic hydrocarbons [39]. On the other hand, the antioxidants from fruits, vegetables and legumes present in the Mediterranean pattern may reduce risk by quenching free radicals and reducing oxidative damage to DNA [40]. Furthermore, fiber dilutes faecal content, decreases transit time and increases stool weight [41] contributing to a healthier gastrointestinal tract. Different carcinogenic pathways in proximal and distal tumors have been suggested, based on their molecular differences [42]. In this sense, the higher effect of the Western dietary pattern (characterized by a low dietary fiber intake) in distal colon and rectal tumors, might reflect a higher susceptibility to dietary carcinogens due to a less mature phenotype and lower immune activity of dendritic cells involved in immunologic surveillance at this location [43]. Olive oil intake has also been suggested to inhibit colon cancer development by inducing apoptosis and down-regulating the expression of cyclooxygenase2 and Bcl-2 proteins that have a crucial role in colorectal carcinogenesis [44]. Finally, the gut microbiome seems to play an important role in colorectal carcinogenesis [45], and dietary habits strongly influence it [46]. Turnbaugh et al. [46] recently demonstrated in an animal model that changing from low-fat, plant based diets to high-fat, high-sugar diets can shift the structure of the microbiota, modify the representation of metabolic pathways in the microbiome, and alter microbiome gene expression.

Our results should be interpreted in the context of the study’s limitations. Recall bias is always a concern in case-control studies. Anticipating the existence of this bias, some questions about general dietary habits were included in the questionnaire and used to adjust the responses to the FFQ [47]. In addition, only cases that responded to the questionnaire within the 6 months following their diagnosis were included. On the other hand, the participation rates (64% among CRC cases and 53% among controls) might give rise to some concerns about a possible selection bias. In this sense, participating controls might have healthier lifestyles than the general population, resulting in an overestimation of the effects. However, no effect was found for the prudent pattern that includes consumption of products widely known as “Healthy”. Therefore, we believe that this bias, if it exists, would be non-differential. Finally, the biological plausibility of the associations found, their strength, their consistency across sex and tumor site, their consistency with the results from other studies on CRC [9, 17,18,19,20,21,22,23] and the reproducibility of the results across different studies and tumors [24, 27,28,29], deem it unlikely that our findings are a result of recall or selection bias.

One of the main strengths of the current research is the recruitment of histologically confirmed incident cases of CRC and population controls. Furthermore, the geographical variability of the recruited participants, coming from 11 provinces from the North, South, Center, West and East of the country, ensured the representation of the different diets coexisting within Spain. Also, the sample size allowed the evaluation of potential interactions of diet and sex and the exploration of the associations by tumor localization. We also carried out a sensitivity analysis to explore all the associations excluding 42 in situ cases and obtained very similar results that led to the same exact conclusions (supplementary material Tables S2 and S3). In addition, as mentioned before, we explored the associations of CRC risk with individual food groups to ensure the associations found for patterns are not only due to the presence in the pattern of one or two foods associated with this tumor (supplementary material Table S1). High consumers of high-fat dairy products, meats, refined grains and sweets (products characteristic of the Western Pattern) showed higher risk of CRC, while high consumers of oily fish, vegetables, fruits, nuts and olive oil (foods present in the Mediterranean pattern) seemed to be protected against this tumor. Therefore, most of the components of the two patterns associated with CRC were also individually associated with this tumor, making it unlikely that the associations found for the whole dietary patterns are due only to the association of CRC with some individual foods. Finally, the reproducibility [26] and applicability [25] of the Western, Prudent and Mediterranean dietary patterns applied here was previously tested, demonstrating that the scores of adherence to these patterns can be calculated following the exact same rules over different populations, resulting in different levels of adherence but still being valid, which is supported by the similitude of the results found for breast [24, 27], prostate [29] and gastric cancer [28] and the present results found for CRC.

Our results provide evidence about very specific associations between diet and CRC that could be useful to clinical practitioners and public health professionals to offer nutritional recommendations based on avoiding the Western dietary pattern and promoting the Mediterranean diet. Even though other risk factors are involved in the genesis of these type of tumors, diet is a key risk factor for colorectal cancer. In this sense, if a country like Spain, with a high compliance with the Mediterranean diet and a moderate adherence to the Western diet, can benefit from abandoning the latter in favor of the former, the benefit might be greater in countries with unhealthier diets.

Conclusion

A high consumption of fruits, vegetables and whole grains combined with a low dietary fat intake might not be enough to prevent CRC. A fair percentage of colorectal cancer cases could be reduced in the general population by providing dietary recommendations based in a decrease of the consumption of high-fat dairy products, red and processed meat, refined grains, sweets, caloric drinks, juices, convenience food and sauces in favor of an increase in the intake of whole fruits, vegetables, legumes, olive oil, nuts and fish, especially for distal colon and rectal tumors.