Introduction

Esophageal cancer (EC) is one of the most common causes of cancer-related mortality worldwide, and its incidence rate has significantly increased in recent years [1]. There are two different forms of EC, esophageal squamous cell carcinoma (ESCC) and esophageal adenocarcinoma (EA). There were 223,306 patients diagnosed with EC and 197,472 patients died in China in 2012. The incidence rate (7.3 %) was ranked fifth, and the mortality rate (9.0 %) was ranked fourth for both sexes [2, 3]. For both incidence and mortality, the rate of EC is much higher in men than in women, in rural areas than in urban areas, and peaks at age 80–84 years [4]. The majority of EC cases occur in developing countries, such as China. Our study focused on China’s main population, the Han Chinese.

More than half of the EC global cases were diagnosed in China, a high-incidence area for esophageal carcinoma [5]. The actual etiology of EC remains unclear, but extensive evidence collected in the past decades has demonstrated that tobacco smoking, hot beverage intake, and alcohol are prominent risk factors for this disease [69]. The risk gene variants may confer different magnitudes of increased risk in different populations for a variety of reasons, including differences in allele frequency and lifestyles and differences in genetic and environmental backgrounds that interact with the variants [10, 11]. Only a few of individuals that partake in risk factors such as smoking develop EC, indicating that an individual’s genetic makeup plays an important role in esophageal carcinogenesis [12].

Because the function of the gastrointestinal gland is closely related to the esophagus and since a recent study indicates that gastrointestinal cancer and EC may share genetic risk factors [13], we used susceptibility loci associated with gastrointestinal gland cancer to identify new loci associated with EC. We selected 26 single nucleotide polymorphisms (SNPs) from gastrointestinal gland cancer susceptibility loci to analyze their association with EC in the Chinese Han population.

Materials and methods

Study population

This study was conducted as a population-based case–control study. All participants were Han Chinese, had histologically confirmed EC including ESCC or EA, were at least 18 years old, and were in good mental condition. Exclusion criteria included self-reported cancer history and previous radiotherapy and/or chemotherapy. A total of 360 patients newly diagnosed with histologically confirmed EC were consecutively recruited between March 2011 and July 2014 in the First Affiliated Hospital of Xi’an Jiaotong University and in Tangdu Hospital, Xi’an City, China. We also recruited a random sample of unrelated healthy individuals between March 2011 and July 2014 from the health centers of Tangdu Hospital, Xi’an City, China. The control population was matched with the EC population based on age and gender. This minimized factors could influence mutation rate, thus maximizing the study’s power. None of the healthy control subjects had any chronic or severe endocrine, metabolic, or nutritional diseases. The study was comprised of 310 unrelated healthy subjects (113 females and 197 males with a mean age of 49.4 years) and 360 EC patients (72 females and 288 males with a mean age of 60.7 years). Basic population information is shown in Table 1.

Table 1 Characteristics of EC cases and controls in this study

Demographic and clinical data

For each participant, a standard questionnaire was used to collect demographic information, including age, sex, smoking status, alcohol use, education, and family history of cancer. Detailed clinical information was collected from treating physicians or medical chart reviews. Blood samples (5 ml) and signed informed consent forms were obtained from every participant enrolled. This protocol was approved by the Clinical Research Ethics Committee of Baoji Central Hospital. In addition, alpha-fetoprotein and plasma carcinoembryonic antigen levels were analyzed to ensure that no participants in the control group suffered from cancer.

SNP selection and genotyping

We selected 26 SNPs from previously published polymorphisms associated with gastrointestinal cancer susceptibility loci. Two SNPs (rs17401966 and rs455804) were selected from previous research of hepatocellular carcinoma [14]. Twenty SNPs (rs6736997, rs401681, rs6879627, rs9502893, rs9363918, rs4269383, rs3016539, rs7832232, rs10088262, rs10788473, rs12413624, rs10500715, rs1000589, rs1585440, rs9573163, rs9543325, rs4924935, rs225190, rs372883, and rs1547374) were selected from previous research of pancreatic cancer [1520]. Four SNPs (rs975334, rs10953615, rs13294589, and rs7504990) were selected from previous research of gallbladder cancer [21]. All of the SNPs in this study had minor allele frequencies (MAFs) of >5 % in the Asian Population HapMap and had previously been reported to be associated with gastrointestinal cancer. Basic information about the 26 SNPs is listed in Table 2. Genomic DNA was extracted from the peripheral blood using phenol–chloroform, and its concentration was measured using a DU 530 UV/VIS spectrophotometer (Beckman Instruments, Fullerton, CA, USA) according to the manufacturer’s protocol. MassARRAY Assay Design 3.0 Software (Sequenom, San Diego, CA, USA) was used to design Multiplex SNP MassEXTEND assays [22]. Genotyping was performed using the Sequenom MassARRAY RS1000 following a standard protocol recommended by the manufacturer [23], and data were analyzed using Sequenom Typer 4.0 Software (Sequenom, San Diego, CA, USA) [23, 24].

Table 2 Basic information of the 26 SNPs

Statistical analyses

The association of each of the SNPs with EC was evaluated using unconditional logistic regression models include dominant, co-dominant, recessive, and log-additive genetic models. In all analyses, the lower frequency allele was defined as the risk allele. For example, for the additive model, individuals were assigned with 0, 1, or 2 representing the number of risk alleles they possessed for that SNP; for the dominant model, individuals were coded as 1 if they carried at least one risk allele and 0 otherwise; and for the recessive model, individuals were coded as 1 if they were homozygous for the risk allele (two copies) and 0 otherwise.

We used Microsoft Excel and the SPSS 16.0 Statistical Package (SPSS, Chicago, IL, USA) to perform statistical analyses. In controls, each SNP was tested to determine whether it fit the Hardy–Weinberg equilibrium (HWE). Chi-square tests [25] and SNPStats Software from http://bioinfo.iconcologia.net/snpstats/start.htm [26] were used to test the association between genetic polymorphisms and EC. Odds ratios (ORs) and 95 % confidence intervals (CIs) were calculated using unconditional logistic regression analyses adjusted for age and gender [27], and the most common control homozygote was used as reference. Akaike information criterion (AIC) and Bayesian information criterion (BIC) were used to choose the best model for each SNP. All p values reported in this study were two-tailed, and p values less than 0.05 were considered statistically significant. Linkage disequilibrium (LD) of the candidate SNPs was analyzed using HaploView v4.2 [28]. Pairwise LD and haplotype constructions were performed using the SHEsis Software (http://analysis.bio-x.cn/myAnalysis.php) [29].

Results

In our initial analyses, we sought to determine whether the 26 selected SNPs fit the Hardy–Weinberg equilibrium. We found that all SNPs were in Hardy–Weinberg equilibrium in EC patients and controls (p > 0.05 after Bonferroni correction) except for rs6736997 (p = 5.53E−08) and rs10788473 (p = 0.010) in the controls. These two SNPs were excluded, and only the remaining 24 SNPs were used for subsequent association analysis. Detailed information of the observed heterozygosity (O-HET) and expected heterozygosity (E-HET) of the SNPs is listed in Table 2. Chi-square tests suggested that we found rs4269383 (OR = 1.429, 95 % CI = 1.087–1.878, p = 0.010), rs10953615 (OR = 1.418, 95 % CI = 1.022–1.968, p = 0.036), rs13294589 (OR = 0.690, 95 % CI = 0.479–0.996, p = 0.046), and rs4924935 (OR = 0.706, 95 % CI = 0.500–0.995, p = 0.046) are correlated with EC risk (Table 3).

Table 3 Pearson chi-square test of the 24 SNPs

In the genetic model analyses (Table 4), the minor allele “T” of rs401681 was associated with reduced EC risk, based on the results from the co-dominant (p = 0.01) and dominant (OR = 0.60, 95 % CI = 0.41–0.87, p = 0.0073) models. The minor allele “A” of rs10088262 was associated with reduced risk of EC, based on the co-dominant model (p = 0.045). The minor allele “C” of rs4924935 may reduce the risk of EC, based on the co-dominant (p = 0.023) and dominant (OR = 0.54, 95 % CI = 0.34–0.85, p = 0.008) model. In contrast, the minor allele A of rs4269383 was associated with increased risk of EC as revealed by the co-dominant model (p = 0.024) and the genotype “AA” may significantly increase EC risk in the recessive model (OR = 3.37, 95 % CI = 1.28–8.89, p = 0.009).

Table 4 SNP association with esophageal cancer (adjusted by sex and age)

Finally, we looked for interactions of the SNPs in the same chromosome from patients adjusted by age and gender and found that none of the analyzed SNPs correlated with age or gender (data not shown).

Discussion

In this Chinese Han population-based case–control study, we found that six susceptibility loci were associated with EC and first found that five loci (rs4269383 located in 6q25.3, rs10953615 located in 7q31.1, rs10088262 located in 8p11.22, rs13294589 located in 13q21.31, and rs4924935 located in 17p11.2) were associated with EC. We searched for these SNPs in electronic databases (PubMed, Medline, Web of Knowledge, CNKI, and Google Scholar) to identify eligible studies that were published before September 2014. We found that rs401681 was reported to be associated with EC, lung cancer, pancreatic cancer, and melanoma. rs4269383, rs10088262, and rs4924935 were reported to be associated with pancreatic cancer [18, 15], rs10953615 with gallbladder cancer [18], and rs455804 with hepatocellular carcinoma [14]. Previous research suggests that the minor allele “G” of rs49249935 is associated with increased risk of pancreatic cancer with OR = 1.37 (95 % CI = 1.19–1.58, p = 8.15E−06) under the dominant model [18]. The minor allele A of rs10088262 is associated with increased risk of pancreatic cancer with OR = 1.40 (95 % CI = 1.21–1.61, p = 4.30E−06) under the dominant model [18]. The minor allele T of rs4269383 is associated with reduced risk of pancreatic cancer with OR = 0.75 (95 % CI = 0.66–0.87 p = 8.5E−06) [18]. The rs401681 allele T is implicated in pancreatic cancer with the log-additive model; each additional copy of the minor allele T is associated with a 1.24-fold increased risk of pancreatic cancer (OR = 1.24, 95 % CI = 1.06–1.44, p = 5.61E−03), The minor allele T of rs401681 is also associated with reduced risk of EC and increased lung cancer [15, 19, 30, 31]. The minor allele C of rs10953615 is associated with increased risk of gallbladder cancer, with OR = 10.13 (95 % CI = 3.33–30.78, p = 4.43E−05) under the recessive model. The minor allele G of rs13294589 is associated with increased risk of gallbladder cancer with OR = 20.98 (95 % CI = 4.96–88.78, p = 3.55E−05) under the recessive model [18].

Of the six EC susceptibility loci, we identified and found that only the minor allele of rs10953615 was a similar risk factor for gallbladder and esophageal carcinoma and this allele was associated with increased risk for both cancers [21]. The other five SNPs showed results that were inconsistent with previous studies. For example, we found that rs40681, which mapped to the CLPTM1L minor allele A, was associated with a reduced risk of EC but previous reports linked it to an increased risk of pancreatic cancer. The function of CLPTM1L and its role in tumorigenesis is largely unknown. However, a recent study reported that CLPTM1L is a commonly overexpressed anti-apoptotic factor in lung cancer [30, 32]. Our study and previous research suggest that one SNP can be associated with two or more cancers; for example, rs401681 is associated with many cancers including pancreatic cancer, lung cancer, bladder cancer, and hepatocellular carcinoma [3335]. Further studies are needed to clarify the genetic mechanism of esophageal carcinogenesis by fine mapping the susceptibility region of the variants.

Our data demonstrate six loci that are associated with EC. Our assessment of the risk of the 24 investigated SNPs was based on the linear analysis model in the discovery set, not including further analysis of the differences between eating habits, family history, or age, making it neither prospective nor epidemiologically rigorous. Alternatively, our sample size (360 cases and 310 controls) could be too small to detect an association in the other SNPs. Moreover, we did not conduct a pathological classification of our EC patients. Due to the low incidence of salivary gland cancer, our loci were chosen based on their association with pancreatic cancer, hepatocellular carcinoma, and gallbladder cancer. Thus, we need to increase the sample size, investigate more SNP from the EC-associated loci, and further investigate whether these SNPs are involved in EC in the Han population.

In our intelligent research, initially, there was a link between these loci and EC but there was no more other evidence of a direct association between gastrointestinal cancer and EC. Gene expression analysis could be used to elucidate the pathogenesis of EC and the relationship between EC and digestive gland tumors. It is known that the contribution of risk alleles to EC risk may vary between populations; this phenomenon may be due to differences in allelic frequencies or specific LD structures or because of additional genetic factors or environmental backgrounds that may influence the effect of these genetic variants. Our research was limited to Han in Chinese population. Our conclusions may, thus, not extent to other populations. Our study did not distinguish between ESCC and EA, which may have different risk factors. Further analysis is required to elucidate the differences between these types of EC.

Conclusion

Our study provides strong evidence that the six SNPs described here may contribute to the risk of EC and other upper gastrointestinal cancers. rs401681 was previously reported to be associated with EC. The other five SNPs characterized here have not been previously shown to be associated with EC. We recommend that those who carry increased risk alleles should concentrate on developing healthy eating habits and receive regular physical examinations. Additionally, larger studies and in vitro or tissue-specific biological characterization are required to confirm our preliminary findings.