Introduction

The etiology of breast cancer is complex, involving interactions between genetic and environmental factors; and epigenetic modifications. The high-penetrant genetic mutations account for less than 10 % breast cancer cases (Hoskins et al. 1995). Several genetic polymorphisms have been explored across different pathways for possible association with breast cancer (Fachal and Dunning 2015). Among these pathways, the most widely investigated pathway is the one-carbon metabolic pathway, which was so named because of the several metabolic reactions involving the transfer of one-carbon moiety from one substrate to another to form several crucial metabolic precursors.

The dietary folate in the form of folyl polyglutamates is hydrolyzed by folate hydrolase (prostate-specific membrane antigen) 1 (FOLH1) to folyl monoglutamates, which are absorbed easily by the intestinal cells (Halsted 1989). Folate reductase catalyzes the reduction of folate to dihydrofolate (DHF) and tetrahydrofolate (THF). The THF of the blood stream is transported into RBC by the solute carrier family 19 (folate transporter), membrane 1 (SLC19A1) (Brzezińska et al. 2000). The methylene moiety from the serine is transferred to THF to form 5,10-methylene THF in the presence of serine hydroxymethyltransferase 1 (soluble) (SHMT1) (Stover and Schirch 1992). The 5,10-methylene THF is the common substrate for two rate-limiting enzymes, i.e., thymidylate synthase (TYMS) and methylenetetrahydrofolate reductase (NAD(P)H) (MTHFR), which catalyze the conversion of dUMP to dTMP and FAD-dependent reduction of 5,10-methylene THF to 5-methyl THF, respectively (Trinh et al. 2002). The 5-methyl THF is the cosubstrate for the remethylation of homocysteine to methionine in the presence of 5-methyltetrahydrofolate-homocysteine methyltransferase (MTR). The other cosubstrate, i.e., methyl cobalamin is formed due to reductive methylation of cobalamin by the 5-methyltetrahydrofolate-homocysteine methyltransferase reductase (MTRR). The MTR-MTRR holoenzyme complex, thus contributes toward the remethylation of homocysteine to methionine (Brot and Weissbach 1966). Methionine is the precursor for the synthesis of S-adenosylmethionine (SAM), which is a universal methyl donor that donates a methyl group to DNA, histones, and catecholamines. After donating a methyl group, SAM is converted to S-adenosyl homocysteine (SAH).

Since aberrant methylation of tumor suppressors and catechol estrogens, defective DNA synthesis and repair are the well-documented risk factors for breast cancer; several polymorphisms of one-carbon metabolic pathway were explored for the possible association with breast cancer. Limited studies are there on FOLH1 C1561T polymorphism in association with breast cancer (Mohammad et al. 2011; Naushad et al. 2011). Several other genetic variants of FOLH1 are also explored for their association with breast cancer (Divyya et al. 2013). SLC19A1 G80A was shown to exert breast cancer risk in Indian and Brazilian populations (Carvalho Barbosa Rde et al. 2011; Mohammad et al. 2011; Naushad et al. 2011), while no association was reported in the US population (Xu et al. 2007). SHMT1 C1420T polymorphism was shown to confer protection against breast cancer in Indian (Mohammad et al. 2011; Naushad et al. 2011), Chinese (Wu et al. 2014), and Brazilian (Carvalho Barbosa Rde et al. 2011) populations while null association was observed in the US and (Xu et al. 2007) Taiwanese (Yu et al. 2007) populations. TYMS 5′-UTR 28 bp tandem repeat polymorphism showed null association with breast cancer risk in six studies (Xu et al. 2007; Suzuki et al. 2008; Carvalho Barbosa Rde et al. 2011; Naushad et al. 2011). TYMS 3′-UTR ins6/del6 showed positive association with breast cancer in the Japanese population (Zhai et al. 2006), while it showed a null association in Indians (Naushad et al. 2011), Chinese (Zhai et al. 2006) and Germans (Justenhoven et al. 2005). MTHFR C677T is the most widely studied polymorphism of this pathway; the segregation of data based on ethnic group or populations revealed a strong association of this polymorphism in Turkey(Deligezer et al. 2005; Ozen et al. 2013), China (Cheng et al. 2008; Gao et al. 2009; Wu et al. 2012; Jiang-Hua et al. 2014), Syria (Lajin et al. 2012), Morocco (Diakite et al. 2012), and North America (Maruti et al. 2009; Bentley et al. 2010; Ramos-Silva et al. 2015). Two studies, i.e., one on multi-ethnic group (Le Marchand et al. 2004) and another on the Iranian population (Hosseini et al. 2011), showed a protective role of this polymorphism. In rest of the population (N = 10), the association of this polymorphism was either borderline or null (Shrubsole et al. 2004; Lee et al. 2004; Inoue et al. 2008; Ma et al. 2009; Hosseini et al. 2011; Prasad and Wilkhoo 2011; Akram et al. 2012; Awwad et al. 2015). MTR A2756G was investigated in ten different populations out of which it was identified as a risk factor in Iranian (Hosseini 2013) and Australian (Beetstra et al. 2008) populations. In two populations, i.e., Greece (Kakkoura et al. 2015) and China (He et al. 2014), this polymorphism showed an inverse association with breast cancer risk while in other population, it showed a null association (Platek et al. 2009; Weiwei et al. 2014; He et al. 2014). MTRR A66G was identified as a risk for breast cancer in a Russian population (Tao et al. 2009), while it was shown to have an inverse association with Thai (Sangrajrang et al. 2009; Sangrajrang et al. 2010) and Australian (Beetstra et al. 2008) populations. In another eight populations, MTRR A66G showed a null association (Kotsopoulos et al. 2008; Burcoş et al. 2010; Weiner et al. 2012).

The variations in genetic association across different populations might be attributed to gene-gene and gene-nutrient interactions which act as potential risk modulators. In the current study, we have aimed to provide a comprehensive overview of all the genetic association relevant to one-carbon metabolism as possible risk modulators for breast cancer using meta-analysis approach. This will help in identifying the genetic risk factors that are common across the different populations and also to emphasize the role of dietary and lifestyle patterns in dictating the breast cancer risk along with a given set of polymorphisms.

Materials and methods

Search strategy and selection criteria

We searched four electronic databases (PubMed, Google Scholar, Scopus, and Medline) to identify eligible studies that were published before December 2015. Articles were retrieved by using the following keywords: “GCPII/FOLH1,” “RFC1/SLC19A1,” “SHMT1,” “TYMS,” “MTHFR,” “MTR,” “MTRR,” “polymorphism,” and “breast cancer.” The reference list of the retrieved publications was also reviewed to identify additional relevant articles.

Inclusion and exclusion criteria

The inclusion criteria were the following: (a) case-control study involving unrelated individuals, (b) information on the raw data of genotypes, (c) information on ethnicity, and (d) the genotype distribution in accordance with Hardy-Weinberg equilibrium (HWE). The exclusion criteria were the following: (a) case only study, (b) meta-analysis, (c) only minor allele frequencies provided, and (d) duplication of data.

Data extraction

A standardized form was used by the investigators for independent extraction of data and for credibility of results. The following information was extracted from each published article: first author, year of publication, ethnicity (country), number of cases, number of controls, source of controls, and genotype distribution.

Segregation of data into ethnic groups and populations

The studies from the same ethnic group or population were pooled together to assess ethnicity or population-based risk. This analysis is useful in addressing the risk modulation based on specific dietary or lifestyle patterns. The number of ethnic groups, number of subjects in each group, and their association with each polymorphism were tabulated.

In silico analysis

In order to elucidate the structural and functional effects of the amino acid substitution, Polymorphism Phenotyping v2 (PolyPhen-2) (http://genetics.bwh.harvard.edu/pph2/) was used, which was based on the physical and comparative considerations. The score classifies the proteins into benign, partially damaging and highly damaging.

Statistical analysis

Using statpages.org, chi-square goodness of fit for each cases and control studies was calculated. p Value with 95 % confidence interval, odds ratio, and phi coefficient were included in the data. p Value with 0.05 or less was considered as significant, and breast cancer risk was assessed through OR. A computational tool (statdirect) was used to conduct meta-analysis of all the studies related to polymorphisms of one-carbon metabolism. The data were computed in the form of number of variant alleles and number of total alleles in cases and controls. The fixed effect model was generated using the Mantel-Haenszel and the Robins-Breslow-Greenland algorithm. These were based on conditional maximum likelihood. The random effects model was based on the DerSimonian-Laird algorithm. Non-combinability of studies was assessed based on the Cochran’s Q test. The effect of heterogeneity was quantified with the I 2 test. Publication bias was assessed based on the Horbold-Egger test.

Results

Characteristics of the included studies

According to search, 94 potentially relevant articles were identified. After applying the inclusion and exclusion criteria, we have chosen 62 published studies representing 17 populations involving 18,117 cases of breast cancer and 23,573 healthy controls (Flowchart 1).

Flowchart 1
figure 1

Preferred reporting items for systematic reviews and meta-analyses (PRISMA) flow diagram. This illustrates the selection process of the studies

Meta-analysis databases

There are three studies from India on FOLH1 C1561T polymorphism where one study was taken as representative which shows no association of this polymorphism with breast cancer (OR 0.74, 95 % CI 0.46–1.19) (Table 1). No other population-specific data were available on FOLH1 C1561T with relevance to breast cancer.

Table 1 Association of polymorphisms of one-carbon metabolic pathway with breast cancer risk

The data on SLC19A1 G80A was complied into three distinct populations, out of which Indian (OR 1.33, 95 % CI 1.02–1.74) and Brazilian (OR 1.63, 95 % CI 1.18–2.28) populations exhibited an increased risk for breast cancer. Null association was observed in US population (OR 1.04, 95 % CI 0.90–1.20). The pooled OR was significant in the fixed effect model (OR 1.16, 95 % CI 1.03–1.30). However, the random effects model showed no statistical significance (p = 0.08). Cochran’s Q tests showed evidence of heterogeneity in association (p = 0.02). There is no evidence of publication bias (p = 0.07) (Fig. 1).

Fig. 1
figure 2

Association of SLC19A1 G80A with breast cancer in three populations. This illustrates population based risk association in a fixed effects model and b random effects model

Studies on SHMT1 C1420T were segregated into four populations; out of these Indian (OR 0.75, 95 % CI 0.58, 0.97) and Brazilian (OR 0.64, 95 % CI 0.44–0.94) populations showed statistically significant protective role against breast cancer. Null association was observed in Chinese (OR 0.97, 95 % CI 0.73–1.29) and US (OR 0.97, 95 % CI 0.89–1.07) populations. Pooled OR was not statistically significant in both fixed effects and random effects models. The Cochran’s Q test of heterogeneity was positive (p = 0.047). However, there is no evidence of publication bias (p = 0.20) (Fig. 2). Altitudinal variation was observed in the association of SHMT1 C1420T with breast cancer (Fig. 3).

Fig. 2
figure 3

Association of SHMT1 C1420T with breast cancer in four populations. This illustrates population based risk association in a fixed effects model and b random effects model

Fig. 3
figure 4

Influence of altitude on SHMT1 C1420T association. The two population, i.e., Indians and Brazilians who showed protective role (light gray) and two other population, Chinese and US, who showed null association (dark gray) shared similar altitude

Studies on TYMS 5′UTR 28 bp repeat polymorphism were segregated into five populations, i.e., Chinese, Brazilian, US, Indian, and Japanese. Null association was observed in all the populations. Both fixed effects and random effects models showed null association with pooled data (p = 0.28). No evidence of heterogeneity was shown in Cochran’s Q test (p = 0.60). There was evidence for publication bias based on Horbold-Egger test (p = 0.02).

In four populations, the association of TYMS 3′UTR with breast cancer was investigated. Japanese population alone showed statistically significant risk with this polymorphism (OR 3.54, 95 % CI 1.86–6.76). However, the sample size of the study was very less. In other populations, i.e., Indian, Chinese, and Germans, null associations were observed with this polymorphism. Both fixed effects (p = 0.77) and random effects (p = 0.36) models showed null association of the pooled data. The Cochran’s Q test (p = 0.0003) showed evidence of heterogeneity in association. No publication bias was observed for this polymorphism (p = 0.20).

As shown in Table 2, the association of MTHFR C677T with breast cancer exhibits lot of heterogeneity across different studies. The Cochran’s Q test (Q 239.09, p < 0.0001) confirms this heterogeneity. Furthermore, the data was grouped based on populations. It was identified as a risk factor in North America, Turkey, China, Morocco, and Syria. In other populations, the association was not statistically significant. Both fixed effects (p < 0.0001) and random effects (p = 0.01) models showed significant risk for this polymorphism in the pooled analysis. The Cochran’s Q test (p < 0.0001) indicated heterogeneity in the association across different populations. No evidence of publication bias was observed (p = 0.34) (Fig. 4). Except for North America and China, all other countries showing MTHFR-mediated risk for breast cancer belong to Mediterranean origin (Fig. 5). As shown in Table 3, even within a geographical area, heterogeneity was observed with regard to association of MTHFR C677T with breast cancer, specifically among Chinese, Indians, and Turks. Subgroup analysis revealed MTHFR C677T-mediated breast cancer risk in post-menopausal women and in women with low dietary intake of folate (Tables 4 and 5).

Table 2 Association of MTHFR C677T with breast cancer risk across different studies
Fig. 4
figure 5

Association of MTHFR C677T with breast cancer across 17 populations. This illustrates population based risk association in a fixed effects model and b random effects model

Fig. 5
figure 6

Influence of altitude on MTHFR C677T association. Iran, Singapore, Germany, Brazil, and Australia showed odds ratio ≤1.01. Saudi Arabia, Thailand, Greece, and North America showed odds ratios between 1.06 and 1.13. India, Korea, Chinese, Jordan, and Pakistan showed odds ratios between 1.17 and 1.27. Japan, Turkey, Morocco, and Syria showed odds ratios between 1.35 and 1.68. The change in odds ratio was depicted in gray to black gradation in the world map

Table 3 Test for homogeneity between the studies on MTHFR C677T segregated based on geographical area
Table 4 Effect of folate in modulating MTHFR C677T-mediated breast cancer risk
Table 5 Effect of menopausal status on MTHFR C677T-mediated breast cancer risk

Out of the ten populations investigated for possible association of MTR A2756G with breast cancer, the Iranian and Australian populations showed positive association; the Chinese and Greece populations showed an inverse association; and the Russian, German, Japanese, Brazilian, Indian, and US populations showed a null association. The pooled data showed the protective role of this polymorphism in fixed effects models alone (p < 0.0001). However, the random effects model showed a null association (p = 0.96). The Cochran’s Q test (p < 0.0001) showed evidence of heterogeneity. There is evidence of publication bias (p = 0.04).

Eleven studies explored the association of MTRR A66G with breast cancer. Out of these, only Russian population showed increased risk for breast cancer. Australian population showed an inverse association with breast cancer. Other nine populations, namely Chinese, Romanian, Canadian, Syrian, Polish, Thai, Japanese, and Indian showed a null association. Both fixed effects (p = 0.41) and random effects models (p = 0.59) showed a null association. The Cochran’s Q test (p < 0.11) showed no evidence of heterogeneity in association. No publication bias was observed (p = 0.26).

As shown in Fig. 6, in silico studies revealed the potential damaging effects of SLC19A1 G80A, MTHFR C677T, and MTRR A66G, while no damage was observed with SHMT1 C1420T and MTR A2756G polymorphisms (Fig. 6).

Fig. 6
figure 7

In silico analysis of functional relevance of genetic polymorphisms. PolyPhen-based computational analysis was carried out to elucidate effect of SNPs on the respective proteins. The damage is represented in the form of color gradient from light gray to black suggesting no damage to severe damage

Discussion

The current study attempts to investigate the role of putative genetic polymorphisms in one-carbon metabolic pathway with the etiology of breast cancer. Here, we have pooled the data from different ethnic groups and populations. Two polymorphisms, i.e., SLC19A1 G80A and MTHFR C677T, were identified as potential risk factors for breast cancer in the pooled analysis. SHMT1 C1420T and MTR A2756G showed borderline protective role against breast cancer.

The functional analysis revealed the strong association of SLC19A1 G80A and MTHFR C677T polymorphisms with structural instability and damage of the respective proteins. MTHFR C677T was shown to induce thermolability in MTHFR protein, resulting in its dissociation into inactive monomers with loss of FAD-binding capacity (Yamada et al. 2001). The SHMT1 C1420T polymorphism has no deleterious effect on SHMT1 protein. The MTR A2756G was shown to induce benign damage to the protein. The MTRR A66G showed a deleterious effect on MTRR protein. Since MTR and MTRR act together in 1:1 stoichiometric ratio to form holoenzyme complex (Yamada et al. 2006), it is likely that MTR and MTRR variant alleles act in synergy in modulating breast cancer risk.

The SLC19A1 G80A was found to be a risk factor in Indians and Brazilians, but not in US population. This lack of association with US population can be attributed to folate fortification program in the USA. SLC19A1 expression was reported to be downregulated in conditions of folate deprivation (Ifergan et al. 2008) suggesting that availability of folate might act as an effect modifier. Decreased transcription and altered function might have deleterious impact, thus influencing intracellular folate levels.

The MTHFR C677T polymorphism was found to be a risk factor in Mediterranean populations. The risk is probably attributed to change in the dietary patterns from the conventional Mediterranean diet to the processed food resulting in the deficiency of folate and other vitamins (Castro-Quezada et al. 2014). This hypothesis was substantiated by the subgroup analysis showing MTHFR C677T-mediated breast cancer risk among women with low folate intake. Our results corroborate with a previous study, which demonstrated increased risk for post-menopausal breast cancer in carriers of MTHFR 677 T-allele despite having high plasma folate levels (Ericson et al. 2009). Methionine after the synthesis of SAM and its utilization as a methyl group donor forms SAH and thus contributes toward higher homocysteine levels. The remethylation of homocysteine depends on the bioavailability 5-methyl THF and the activity of MTR and MTRR complex. The MTR-MTRR complex requires methyl cobalamine as cofactor. Thus, the synthesis of 5-methyl THF is hampered in subjects harboring the variants. The other possible contributors for the population level variation in the association are complex interactions among MTR, MTHFR, and MTRR.

The induction of futile folate cycle by SHMT1C1420T might be contributing toward the maintenance of one-carbon homeostasis through formation of 5-formyl THF from 5,10-methylene THF (Stover and Schirch 1992) thus conferring protection against breast cancer. In a previous study, we have observed higher circulating folate levels in a subject with SHMT1 TT genotype when compared to those with SHMT1 CT and CC genotypes (Naushad et al. 2011). The in silico study showing active enzyme in the presence of this polymorphism supports this hypothesis further.

The results of our meta-analysis corroborated with other meta-analyses (Yu and Chen 2012; Liang et al. 2013; Castro-Quezada et al. 2014); (Li et al. 2014a; Rai 2014; Pooja et al. 2015; Kumar et al. 2015) in showing MTHFR C677T as a risk factor for breast cancer. The null association of MTR A2756G in random effects model and an inverse association with breast cancer in Caucasians were consistent with the meta-analysis of Zhong et al. (2013). The null association of MTRR A66G with breast cancer is in agreement with the meta-analysis of Hu et al. (2010). To date, there is no other meta-analysis that explored the association of SLC19A1 G80A with breast cancer risk.

Our study showed agreement with another study (Li et al. 2014b) in demonstrating the protective role of SHMT1 C1420T against breast cancer in Asians, while null association in Caucasians. In contrast with Wang et al. (2010), we have performed a meta-analysis of TYMS polymorphisms based on alleles. And hence, null association was observed. Only Japanese population exhibited positive association of TYMS 3′UTR ins6/del6 with breast cancer.

The limitations of the current study were the following: (i) inclusion of only published studies in the current meta-analysis and (ii) this meta-analysis was based on unadjusted odds ratios as it is difficult to retrieve information on the confounding factors from each study.

To summarize, MTHFR C677T and SLC19A1 G80A are considered to be a significant risk factors for breast cancer globally. The SHMT1 C1420T seems to confer borderline protective role. The ethnic and population level variations in genetic association could be due to gene-gene interaction, gene-nutrient interaction, and genome-epigenome interactions.