Introduction

Diet is one of the modifiable risk factors for breast cancer, and several single foods, macro- and micronutrients (e.g., vegetables, fiber, and vitamins) have been investigated in association with breast cancer risk [13]. Thus far, only alcohol, being overweight and weight gain have been established as risk factors for breast cancer [4]. In addition, dietary patterns received much attention during the last years. Dietary patterns take the whole complexity of diet into consideration when compared to investigations of single food items and thus can be used to also account for interactions between foods and nutrients in assessing the association between diet and disease risk [5].

Several studies investigating dietary patterns in relation to breast cancer risk showed inverse associations with the “prudent/healthy” pattern (usually including high vegetable, fruit, whole grains, and fish intake) and positive associations with the “western/unhealthy” pattern (generally characterized by high red and processed meat, refined grains, potatoes, and butter intake) in the whole population [69] or in subgroups, e.g., among smokers [10]. Some studies showed only a positive association with the “western/unhealthy” pattern but no association for any other pattern [11, 12], whereas other studies showed only a protective effect for the “prudent/healthy” pattern either in the total population or in certain subpopulations (e.g., smokers or BMI < 25) [1315]. Moreover, some studies found differing dietary patterns that were associated with breast cancer like the drinker pattern, the starch pattern, or the traditional patterns [1623], while others found no association in their overall study population [10, 14]. A recent meta-analysis on dietary patterns and breast cancer risk resulted in a significant decreased association for the “prudent/healthy” dietary pattern, no association for the “western/unhealthy” dietary pattern and a significant positive association for the “drinker” pattern [24]. Due to the inconsistent results so far, we aimed at assessing the association between dietary patterns and breast cancer risk in a German case–control study.

Dietary patterns can be assessed in several ways. Either an a priori approach can be used, in which dietary patterns are built based on existing knowledge about dietary constitutes, or an a posteriori approach, in which the existing data from the study (i.e., from food frequency questionnaires) are used to identify the dietary patterns. We used the latter variant in order to explore our data directly using factor analysis.

Besides an overall breast cancer risk assessment, we also performed subgroup analyses according to estrogen and progesterone receptor (ER/PR) status, body mass index, hormonal replacement therapy use, educational status, energy intake, and smoking habits.

Materials and methods

Study population

The population-based case–control study MARIE (Mammakarzinom Risikofaktoren Erhebung) was conducted from August 2002 to September 2005 in two German study regions (Hamburg and Rhine-Neckar-Karlsruhe) to assess menopausal hormone therapy and lifestyle factors in relation to breast cancer risk. The study has been previously described in great detail [25]. In brief, German-speaking women aged 50–74 years and diagnosed with histologically confirmed primary invasive or in situ breast cancer were identified through participating clinics and cancer registries. For each patient, two age- and study center-matched controls were randomly selected using inhabitant registries. In total, 3,919 cases (64% of eligible) and 7,421 controls (43% of eligible), of which 3,464 cases and 6,657 controls were postmenopausal, took part in the study. The study was approved by the ethics committee of both the University of Heidelberg and the University of Hamburg, and conducted in agreement with the Helsinki declaration. All participants gave written informed consent.

Women were defined as postmenopausal if their last menstrual bleeding was at least 12 months before age at diagnosis for cases, and at least 12 months before age at recruitment for controls, if they reported having a bilateral oophorectomy or cessation of menses due to radiation or chemotherapy, if they had unclear menopausal status due to a hysterectomy or hormone use, or if they were older than 55 years.

In-person interviews were performed to collect information on demographic factors and on breast cancer risk factors such as menstrual history, reproductive history, hormonal replacement therapy for menopausal disorders, family history of breast cancer, breastfeeding, smoking, and alcohol consumption. In addition, a self-administered, validated 176-item food frequency questionnaire (FFQ), which recorded the nutritional habits 1 year prior to diagnosis (for cases) or recruitment (for controls), was completed by 2,944 cases and 5,625 controls.

The FFQ was very similar to the one used in the European Prospective Investigation into Cancer and Nutrition (EPIC) [26, 27] and included six additional food items rich in phytoestrogens (i.e., soy milk, soy beans, tofu, soy spread, sesame/flaxseeds and sunflower-/pumpkinseeds). For each food item in the FFQ, the information about portion size and consumption frequency was used to calculate intakes in gram per day. Nutrients were estimated using the German food composition table BLS II.3 (Bundesinstitut für Gesundheitlichen Verbraucherschutz und Veterinärmedizin) [26, 27]. This FFQ was validated for food groups, energy, and nutrient intake [26, 27]. Exclusion criteria for the analyses comprised participants with missing data and participants in the top and bottom 1% energy range. Thus, 2,884 cases and 5,509 controls were included in the current analyses.

Definition of food groups

The 176 FFQ food items were assigned to one of 80 food classes using the EPIC-Soft software [28]. Further, 76 of the 80 food classes and six phytoestrogen-rich food items were grouped into food groups based on nutrient profiles or culinary usage, similar to other studies [10, 29, 30]. Four food classes were defined as ‘unclassified’ and thus were not classified. Food items not fitting into any of the food groups or representing separate food groups were retained as individual food groups (e.g., eggs, butter, cabbages, and potatoes). In total, 33 food groups were defined (Table 1).

Table 1 Food groups and food classes

Factor analysis

Factor analysis was performed on 33 food groups. Principal components and maximum-likelihood estimates were used to extract factors explaining the maximum proportion of the variance in the correlation matrix of the food groups. Principal component factor analysis (PCFA) was performed using SAS statistical software environment (version 9.1). The number of factors to be retained in the model was determined by eigenvalues of the correlation matrix (greater than 1 for PCFA), Scree plots, proportion of variance (>75%) explained by the identified factors, and the natural interpretability of each factor. An orthogonal transformation (varimax) was used to rotate the correlation matrix in order to get a simpler data structure with greater interpretability [31]. Each individual was assigned a factor score for each identified pattern using the SAS procedure score. Thus, individuals with high scores for a dietary pattern have a greater tendency to follow the pattern than individuals with a low score. Labeling of the factor (i.e., dietary pattern) was performed quantitatively using a cut-off of 0.30 of the factor loadings. Also, a subgroup analysis by study center was conducted to assess the reproducibility of the dietary patterns retrieved. Confirmatory identification of dietary patterns was also performed using maximum-likelihood factor analysis using R statistical software environment (version 2.9.2). However, PCFA was more appropriate for these data because of potentially non-normal distributions, and only results for PCFA are shown.

Statistical methods

Unconditional logistic regression was used to assess the association between the factor score and postmenopausal breast cancer risk. Odds ratios (ORs) and 95% confidence intervals (CIs) were calculated using the LOGISTIC procedure of the SAS statistical software package, version 9.1 (SAS Institute, Cary, NC). The factor score distribution among controls was used for quintile definition. Higher quintiles of the factor score were compared to the lowest quintile.

Univariate and multivariate odds ratios were estimated. All multivariate analyses were adjusted for year of birth (≤1934, 1935–1939, 1940–1944, 1945–1949, ≥1950), and study center (Rhine-Neckar-Karlsruhe, Hamburg), menopausal induction [natural with/without hormonal replacement therapy (HRT), bilateral oophorectomy (chemical or by radiation), hysterectomy, start of HRT], body mass index (BMI) (<22.5, 22.5–25, 25–30, ≥30 kg/m2), occupational level (low, simple, medium, higher, highest), first-degree family history of breast cancer (yes, no, unknown), history of benign breast disease (yes, no, unknown), number of pregnancies (≥28th week) (0, 1, 2, ≥3), age at menarche (<12, 12–14, ≥15), breastfeeding history (ever, never), total number of mammograms (0, 1–4, 5–9, ≥10, unknown), smoking habits (never, past, current), and total energy (kJ/day in continuous).

Since the association of dietary patterns with breast cancer risk may be modified by several risk factors, subgroup analyses were performed by BMI category (<25, ≥25 kg/m2), HRT use (never, ever), smoking habits (never, past, current), education level (low, medium, high) as a representative of socioeconomic status (dietary habits may differ between social groups), and energy intake (below and above median energy intake of the whole population). For the analysis of the dietary patterns by ER/PR status of the tumor (ER-positive/PR-positive, ER-positive/PR-negative or ER-negative/PR-positive, ER-negative-/PR-negative), polytomous logistic regression models were applied controlling for the above-mentioned adjustment variables in multivariate analyses. Further, a sensitivity analysis by exclusion of in situ carcinomas was performed.

To test for linear trend, median values of each factor score category were used as ordinal variables and entered in logistic regression models. Tests for heterogeneity between risk estimates were performed using the Q statistic. p values below 0.05 were considered significant.

Results

The 33 food groups included five non-alcohol beverage food groups as well as three groups on alcoholic beverages (liqueurs/spirits, wine, and beer), four groups on meat intake (poultry, processed, red meat, and offals), one group on fish intake, five groups on vegetable intake (cabbages, fruiting/leafy/other vegetables, legumes/soya, garlic/onion, potatoes), four groups on fat intake (butter, margarine, deep-frying fat, and oil and vinegar dressing), as well as eleven other food groups, as listed in Table 1. Baseline characteristics of the 2,884 cases and 5,509 controls included in the dietary pattern analyses are shown in Table 2. Cases and controls differed with respect to several characteristics: cases had fewer pregnancies, had less often breastfed, had more mammograms, and were more likely to have a non-natural induction of menopause, a first-degree relative with breast cancer, and a positive history of benign breast diseases when compared to controls. No differences in age at menarche, BMI, occupational level, smoking habits, and alcohol consumption were observed.

Table 2 Distribution of breast cancer risk factors and other baseline characteristics among postmenopausal cases (n = 2,884) and controls (n = 5,509)

Two major dietary patterns were identified using PCFA: (1) high fruit, fruiting, leafy and other vegetables, garlic/onion, vegetable oils, and mayonnaise intake (further referred to as the “healthy” pattern) explaining 48.2% of the variance; (2) high processed and red meat, garlic/onion, and deep-frying fat intake (further referred to as the “unhealthy” pattern) explaining 29.8% of the variance (Table 3). Some baseline characteristics and food and nutrient intakes in quintile 1 and 5 of both dietary patterns are shown in Table 4. Within the “healthy” pattern, subjects in the highest quintile were younger, had higher energy, fat, protein, fiber, vitamin C, cholesterol, and salt intake when compared to subjects in the lowest quintile but not different with respect to BMI, carbohydrates and alcohol intake, and polyunsaturated to saturated fatty acids ratio. Within the “unhealthy” pattern, energy, fat, protein, alcohol, cholesterol, and salt intake, as well as BMI were higher in the highest compared to lowest quintile, whereas carbohydrates intake was lower. Only marginal differences were observed for age, polyunsaturated to saturated fatty acids ratio, fiber, and vitamin C intake.

Table 3 Food group factor loadings for the “healthy” and “unhealthy” dietary pattern, identified using PCFA
Table 4 Baseline characteristics for the “healthy” and “unhealthy” dietary patterns

Table 5 shows the ORs for overall breast cancer risk and for subgroup analyses by ER/PR status. After adjustment for potential confounders, breast cancer risk did not differ among women in the highest compared to the lowest quintile of the “healthy” or the “unhealthy” dietary pattern. Also, no statistically significant trends were observed. Only for ER-/PR-negative tumors, a significant inverse association was observed for the “unhealthy” pattern (P for trend = 0.02). All other ER/PR subgroups did not show significant results for any pattern. For the “healthy” pattern, no heterogeneity was observed (P = 0.85), whereas the association of the “unhealthy” pattern was heterogeneous between the ER/PR subgroups (P = 0.05).

Table 5 Univariate and multivariate odds ratios for “healthy” and “unhealthy” dietary patterns in relation to postmenopausal breast cancer risk

Further, subgroup analyses by BMI, HRT use, smoking habits, education level, and energy intake did not show any statistically associations between the dietary patterns and breast cancer risk (data not shown). In addition, associations were similar for both study centers (data not shown).

Sensitivity analysis by exclusion of in situ carcinomas did not result in changes of the estimates either for the “healthy” pattern or for the “unhealthy” pattern and breast cancer risk association.

Discussion

We identified two dietary patterns by PCFA, a “healthy” and an “unhealthy” dietary pattern. However, none of these dietary patterns was statistically significantly associated with postmenopausal breast cancer risk.

Our null results for the “healthy” and “unhealthy” dietary patterns in relation to postmenopausal breast cancer risk are in line with findings from several other studies, for instance with the results from three cohorts in which premenopausal [14] and postmenopausal [10, 21] breast cancer risk was not associated with any dietary pattern. A recent systematic review and meta-analysis found a decreased breast cancer risk for the highest compared with the lowest categories of “prudent/healthy” dietary patterns, whereas no association for a “western/unhealthy” dietary pattern was found [24]. Different from the “prudent/healthy” dietary patterns associated with breast cancer risk in other studies, the “healthy” pattern of our study was not characterized by high intake of dairy products and low intake of mayonnaise. High intake of mayonnaise in our “healthy” pattern can be explained by the fact that mayonnaise intake was primarily from salad dressings. High intake of sweets and desserts as well as grains, which were usually part of the “unhealthy/western” pattern, did not apply to this pattern in our study population.

There may be several reasons for the null findings of our study. One explanation for not finding an association for the “unhealthy” pattern with breast cancer risk may be due to the fact that this pattern rather represents the traditional dietary pattern consumed in Germany, containing high amounts of potatoes, onions, garlic, and grains besides the mentioned food groups of processed meat, red meat, and deep-frying fat. Further, the onset of the “unhealthy” pattern is more likely to be in childhood than the initiation of the “healthy” pattern. Women reporting a “healthy” diet 1 year before diagnosis are likely to have had the “unhealthy” dietary pattern during their childhood and adolescence, too. It is unknown which time period is mostly relevant for breast cancer risk and whether this period is recorded by the FFQ. However, it was reported that dietary behavior in childhood and adolescence might have implications on the development of breast cancer in adulthood [32, 33]. Thus, it might be possible that the diet, which is mostly related to breast cancer risk, was not recorded by the FFQ (e.g., during childhood or adolescence). Further, over- and underreporting of certain foods and total energy intake is of concern, especially in obese women [3436]. This may have resulted in misclassification of women into categories of dietary patterns.

Results from studies conducted in the United States appear to be different from those conducted in Europe. In the United States, most studies have shown that the “prudent” pattern is associated with a breast cancer risk reduction and/or the “western” pattern is associated with an increased risk of breast cancer in the entire study population or at least in a subgroup [8, 10, 13]. Most European studies, on the other hand, observed no association for the “prudent/healthy” and “western/unhealthy” patterns [16, 18, 22, 23], and in some studies even a protective effect of the “Animal Products” or the “Pork, Processed meat and Potato” pattern was found [17, 20]. The differences between the two dietary patterns might be more extreme in the United States. “Western” patterns in the United States may contain really low amounts of vegetables and fruits, and “prudent” patterns may be completely opposite to the “western” pattern, whereas in Europe these dietary patterns might not be totally separated. Thus, the identified patterns in this study may not represent dietary patterns that are associated with breast cancer risk. Despite similar labels, the “prudent” and “western” dietary patterns in the United States and Europe are actually different, possibly reflecting cultural differences in eating habits. Also, differences in agricultural practices, food processing and other growing influences may result in differing nutrient content or different content of carcinogens present as food additives, pesticides, and preservatives.

Subgroup analyses by smoking habits, energy intake, educational status, BMI, and HRT use did not yield statistically significant association for any dietary pattern with breast cancer risk. Diet may have a stronger effect in ER-/PR-negative tumors where the role of estrogen may be smaller than in ER-positive breast cancer. There was no difference by ER/PR status for the “healthy” dietary pattern. However, the “unhealthy” pattern showed a significant inverse association with ER-/PR-negative breast cancer risk. Thus far, only three studies investigated dietary patterns in relation to ER status defined breast cancer risk, with inconsistent results [6, 10, 21]. We do not have an explanation for our finding in the ER-/PR-negative subgroup and the “unhealthy” pattern and cannot exclude the possibility of a chance finding as a consequence of testing multiple hypotheses. Therefore, further studies are necessary to clarify the association between dietary patterns and breast cancer defined by ER/PR status.

A general problem of factor analysis is that there are several subjective decisions (e.g., number of patterns to be retained, prior classification into food groups, rotation method used, naming of the factors) [37]. These facts were also critically described in a review on dietary patterns and breast cancer risk [38]. However, the dietary patterns we identified are similar to those identified by other studies, with the “healthy” pattern resembling the “prudent” pattern and the “unhealthy” pattern being similar to the “meat/western” pattern [7, 10, 12, 14, 20, 23].

Although we used a validated food frequency questionnaire, measurement error and recall bias could not be excluded in this study [39]. Especially, recall bias is very likely as there is a median time range of 5.6 months (inter-quartile range of 1.7–14.9 months) between breast cancer diagnosis and completion of the FFQ. However, results did not differ for cases completing the FFQ within or after 5.6 months of diagnosis (data not shown). In addition, the FFQ employed may not have sufficiently accounted for all relevant food items in certain food groups. Fish, for example, was not asked in enough detail to subgroup the fish intake. Similarly, vegetable oils were asked with respect to dressings and frying fats and therefore only a food group “oil and vinegar dressing” including all types of oil and a group “deep-frying fat” were available to represent oil intake. These two groups loaded, however, either for the “healthy” (oil and vinegar dressing) or for the “unhealthy” pattern (deep-frying fat). In addition, fat intake was covered by butter and margarine food groups, which did not load strong for any of the patterns. Moreover, like in every case–control study, selection bias cannot be excluded since part of the study subjects did not complete the FFQ. Although we adjusted for a broad range of confounders and breast cancer risk factors, we cannot exclude residual confounding by factors not recorded in our study.

In conclusion, we found neither a “healthy” nor an “unhealthy” dietary pattern to be associated with the risk of breast cancer in German postmenopausal women.