Introduction

Acute leukemia consists of a group of heterogeneous malignancies in which immature and dysfunctional hematopoietic progenitors proliferate and accumulate in the bone marrow (Huang et al. 2013). Acute leukemia is composed of acute lymphoblastic leukemia (ALL) and acute myeloid leukemia (AML), and there were 6020 and 18,860 new ALL and AML cases around the world in 2013 (Siegel et al. 2014 ). Both ALL and AML occur during all ages but with very different age distributions. ALL has a peak incidence during childhood and thereafter remains at a low incidence during adulthood (Redaelli et al. 2005), whereas AML is relatively uncommon in childhood and gradually increases with age (Deschler and Lübbert 2006), similar to most of other malignancies.

The etiology of acute leukemia is largely unknown. A few environmental risk factors have been described, such as ionizing radiation, benzene, pesticides, and illicit drug use (Belson et al. 2007; Eden 2010; Schnatter et al. 1996); however, they can only account for a minor proportion of incident cases. The opinion that acute leukemia may have a genetic basis has long been pursued through candidate gene-based association studies, identifying genes involved in xenobiotic metabolism (Krajinovic et al. 1999), DNA repair (Bănescu et al. 2013), cell-cycle regulation (Li et al. 2013), and oxidative stress response (Krajinovic et al. 2002). Despite differences in the pathogenesis, AML and ALL sometimes share some genetic factors. For example, XRCC1 (X-ray repair cross-complimenting group 1 gene) 399Gln allele could increase risk of both AML (Sorour et al. 2013) and ALL (Sorour et al. 2013; Pakakasama et al. 2007), and MTR (5-methyltetrahydrofolate-homocysteine methyltransferase) 2756GG genotype showed similar effect on both acute leukemia subtypes (Lightfoot et al. 2010).

So far, several GWASs have explored the association of genetic variations with acute leukemia risk, especially for ALL. However, some GWASs of ALL focused on specific subtypes, like B cell precursor (BCP) ALL, ETV6-RUNX1-rearranged ALL, and Ph-like ALL (Migliorini et al. 2013; Xu et al. 2013; Ellinghaus et al. 2012; Perez-Andreu et al. 2013), and one GWAS had a small sample size (50 cases and 50 controls) (Han et al. 2010). Only two GWASs focused on the risk of all subtypes of ALL with a relatively large sample size (Treviño et al. 2009; Papaemmanuil et al. 2009), which provided statistically robust evidence for the role of common genetic variants in the susceptibility to acute leukemia. Using the Affymetrix 500 K Mapping array and publicly available genotype data, Treviño et al. identified 18 single-nucleotide polymorphisms (SNPs) significantly associated with ALL risk among 317 European ALL patients and 17,958 European individuals without acute leukemia (Treviño et al. 2009). Papaemmanuil et al. conducted a GWAS of two European case–control series with a total of 907 ALL cases and 2398 controls using Illumina Infinium HD Human370 Duo BeadChips and performed a meta-analysis to derive joint effect of each SNP. Finally, three SNPs were found to be associated with ALL risk (Papaemmanuil et al. 2009). Subsequently, many replication studies confirmed that SNPs identified by these two GWASs could affect ALL susceptibility in different populations (Gutiérrez-Camino et al. 2013; Wang et al. 2013; Healy et al. 2010; Chokkalingam et al. 2013; Linabery et al. 2013 ); however, only one study focused on AML and found that some SNPs associated with ALL in previous GWASs were also associated with AML risk in European population (Rudant et al. 2013).

Here, we conducted a case–control study with 545 AML cases and 1,034 controls in a Chinese population to first evaluate whether these variants are associated with AML risk in Chinese population.

Materials and methods

Ethics statement

The present study was approved by the institutional review board of Nanjing Medical University. The design and performance of current study involving human subjects were clearly described in a research protocol. All participants were voluntary and would complete the informed consent in written before taking part in this research.

Study subjects

A total of 545 AML cases and 1034 cancer-free controls were included in the current study, and all of the cases and controls were unrelated ethnic Han Chinese. All of the AML patients were newly diagnosed according to French–American–British (FAB) diagnosis and typing standard and consecutively recruited between December 2007 and August 2011 from Wuxi People’s Hospital Affiliated to Nanjing Medical University. There was no restriction in terms of age, stage of disease, or histology, but those who had a history of cancer, chemotherapy, or hematopoietic stem cell transplantation (HSCT) were excluded from the case group. All of the controls were randomly selected from more than 30,000 participants in a community screening of non-communicable disease conducted in Jiangsu Province during the same period as the cases were recruited. All of the controls had no self-reported cancer history, and those who had a history of hematological system diseases were excluded. At the time of recruitment, informed consent was obtained from each subject, and each subject provided 5 ml of venous blood. This study was approved by the institutional review board guidelines, according to the Declaration of Helsinki.

SNP selection

We first selected 18 SNPs identified by Treviño et al. (rs10821936, rs10994982, rs11978267, rs10849033, rs10873876, rs11155133, rs12621643, rs1881797, rs2089222, rs2191566, rs6428370, rs7554607, rs9290663, rs1879352, rs563507, rs2242041, rs6509133, and rs2167364) (Treviño et al. 2009), and 3 SNPs identified by Papaemmanuil et al. (rs7089424, rs4132601, and rs2239633) (Papaemmanuil et al. 2009). Of all 21 SNPs, 3 SNPs were, respectively, in high linkage disequilibrium (LD) with other 3 SNPs (rs11978267 vs. rs4132601, rs10821936 vs. rs7089424, and rs2191566 vs. rs6509133) in Chinese Han population (CHB) according to HapMap database (HapMap Data Rel 27 Phase II + III, Feb09, on NCBI B36 assembly, dbSNP b126). Thus, we selected either one of each pairs. Besides, rs1881797 and rs563507 were excluded because the minor allele frequencies (MAF) were less than 0.05 in CHB. Finally, 16 SNPs were kept for genotyping.

Genotyping

DNA was extracted from venous blood using phenol/chloroform extracting method and stored in an environment of minus 20 degree centigrade. Genotyping was performed using the TaqMan allelic discrimination assay on the platform of 7900HT Real-Time PCR System (Applied Biosystems, Foster City, CA) without knowing the subjects’ status (case or control). Two negative controls included in each 384-well reaction plat were used for quality control, and the genotyping results were determined by using SDS 2.3 Allelic Discrimination software (Applied Biosystems).

Statistical analysis

Differences between the cases and controls on the demographic characteristics were analyzed by using the Student’s test (for continuous variables) and Chi-squared test (for categorical variables). Hardy–Weinberg equilibrium (HWE) was tested by a goodness-of-fit Chi-squared test to compare the observed genotype frequencies to the expected ones among the control subjects. The associations between these SNPs and AML risks were estimated by computing the odds ratio (ORs) and their 95 % confidence intervals (CIs) from multivariate logistic regression analysis. Likelihood ratio test was used for stratification analysis. Power was calculated using Power and Sample Size Calculation (PS) software (http://biostat.mc.vanderbilt.edu/twiki/bin/view/Main/PowerSampleSize, accessed Dec 14, 2010), as the type I error probability was set as 0.05, and OR was set as 1.5. Other statistical analyses were performed with SAS 9.1.3 software (SAS Institute, Cary, NC). P < 0.05 was the criterion of statistical significance, and all statistical tests were two sided.

Results

Demographic and clinical characteristics of the study subjects are described in Table 1. The age (44.1 ± 17.2 vs. 45.4 ± 9.6) and gender (male 53.4 vs. 56.7 %) were comparable (P > 0.05) between cases and controls. For clinical and cytogenetic characteristics, most cases were M2 AML (32.1 %), of myeloid lineage (69.4 %), with aberrant karyotypes (45.1 %), and without common fusion gene transcripts (55.6 %).

Table 1 Demographic and clinical characteristics of study subjects

The basic information of selected SNPs is presented in Table 2, including locations on chromosomes, genes, allele changes, genotype distributions in both cases and controls, MAF among all subjects, HWE test among controls, call rate of all samples, and power. Consistent with published GWASs (Treviño et al. 2009; Papaemmanuil et al. 2009), we set G allele of rs2239633 and rs2191566 as risk alleles, though rs2239633-A and rs2191566-T are variant alleles in Chinese population. When we assumed the OR as 1.5, the results of 13 SNPs achieved a statistical power of greater than 80 %, except for three SNPs (rs2242041, rs11155133, and rs1879352).

Table 2 Basic information of 16 selected SNPs

Then, we conducted the association analyses between selected SNPs and AML risk in additive model. As shown in Table 3, four SNPs were associated with an altered risk of AML (P < 0.05). Similar to the findings of published GWASs, risk alleles of rs2191566, rs9290663, and rs11155133 significantly increased the risk of AML (rs2191566: adjusted OR 1.46, 95 % CI 1.24–1.72, P < 0.001; rs9290663: adjusted OR 1.26, 95 % CI 1.05–1.50, P = 0.011; rs11155133: adjusted OR 1.32, 95 % CI 1.05–1.65, P = 0.017). However, the variant T allele of rs10873876 decreased the AML risk, which was in the opposite effect direction (OR 0.62, 95 % CI 0.52–0.75, P < 0.001). We also explored the associations between these SNPs and AML risk in other genetic models and found that other three SNPs (rs2239633, rs10821936, and rs2242041) achieved statistical significance in dominant or recessive models (Supplementary Table 1).

Table 3 Association of selected SNPs and AML risk in additive model

Furthermore, we examined the association of above seven SNPs with risk of AML stratified by selected variables including age, gender, lineage, karyotype, and molecular subtype. We observed heterogeneities of ORs for rs9290663 among different age groups (P = 0.048, Table 4), and interaction analysis showed significant multiplicative interaction between rs9290663 and age (≤45 years old and >45 years old; P int  = 0.009; Supplementary Table 2). No heterogeneity was found for the other SNPs among any of these subgroups.

Table 4 Stratification analysis of rs9290663 associated with AML risk in additive model

As the fusion gene AML1/ETO and PML/RARα, respectively, accounted for the majority of molecular subtypes of M2 AML and M3 AML, we then investigated the association of these two genes with the seven SNPs in M2 and M3 AML. As shown in Supplementary Table 3, rs2191566-G allele significantly increased the risk of M2 AML with AML1/ETO (OR 2.04, 95 % CI 1.20–3.48), but there was no such effect on M2 AML without common fusion gene transcripts (OR 1.06, 95 % CI 0.77–1.44; heterogeneity test: P = 0.038). Additionally, there is no heterogeneity for the other six SNPs in different groups.

Discussion

The two published GWASs have provided the unambiguous evidence that several common genetic variations influence the risk of ALL (Treviño et al. 2009; Papaemmanuil et al. 2009), although several replication studies in ALL were later reported; however, the role of these SNPs in the susceptibility to AML is rarely investigated (Sorour et al. 2013; Pakakasama et al. 2007; Lightfoot et al. 2010). In our study, we evaluated the association of these SNPs with AML risk in an independent case–control study with 545 AML cases and 1034 controls in a Chinese population and found that seven SNPs (rs2191566, rs10873876, rs9290663 and rs11155133 in additive model and rs2239633, rs10821936 and rs2242041 in other genetic models) were significantly associated with altered risk of AML.

rs2191566 located at 19q13.31 is in the intron of ZNF230 (zinc finger protein 230), which plays a role in mammalian spermatogenesis (Song et al. 2008; Xu et al. 2009; Qiu et al. 2003). However, no study has demonstrated the roles of ZNF230 in tumorigenesis. In our study, we observed that rs2191566-G significantly altered the risk of AML from myeloid lineage, but not for AML from other two lineages (myeloid and lymphoid, myeloid and monocytic). Besides, rs2191566-G could increase the risk of M2 AML with AML1/ETO, but not for M2 AML without common fusion gene transcripts. These findings suggested that rs2191566 affects the susceptibility to AML, and the mechanisms are variable in different AML subtypes.

rs9290663 (A>T) located at 3q26.32 is in the intron of KCNMB2 that encodes the β2 subunit of potassium large conductance calcium-activated channel. Our study found that rs9290663-T allele could increase AML risk, and there was interaction between rs9290663 and age, as people who are more than 45 years old and carry rs9290663-TT genotype will get the highest risk of AML; however, the reason is unclear so far.

rs2239633 (G>A) maps to the 5′ near region of CEBPE gene encoding CCAAT/enhancer-binding protein (C/EBP), epsilon. Interestingly, the Web-based tool of TFSEARCH 1.3 (http://www.cbrc.jp/research/db/TFSEARCH.html) showed that the A-to-G base change of another significant SNP (rs11155133) might affect the binding of C/EBP. We can infer that rs2239633 and rs11155133 alter the AML risk by affecting C/EBP binding, as C/EBP takes important roles in leukemogenesis (Papaemmanuil et al. 2009).

rs10821936 is in the intron of ARID5B gene, and s2242041 (C>G) at 7p12.1 is in the intron of DDC gene. Both ARID5B and DDC were found to be involved in some malignancies (Chang et al. 2008; Bourquin et al. 2006; Geomela et al. 2012; Avgeris et al. 2008). However, the exact mechanism of these SNPs in leukemogenesis was unknown.

Noteworthy, the variant T allele of rs10873876 showed a protective effect on AML risk, which was opposite to the result of GWAS (Treviño et al. 2009). There might be two reasons explaining this. First, the MAF of rs10873876-T allele in our control group was 0.278, while it was only 0.150 in European population, and such difference may lead to different associations. Second, the effects of rs10873876 and the relevant genes on ALL and AML may be different, because of different mechanisms of ALL and AML tumorigenesis, taking the different blood cell progenitors, for example (Mi et al. 2007). However, the detailed mechanism of such associations needs to be further explored.

There were two main limitations in our study. Firstly, the sample size might be relative small, as three SNPs (rs2242041, rs11155133, and rs1879352) did not achieve a statistical power of greater than 80 % when we assumed that the OR was 1.5. Secondly, we lacked environmental information like ionizing radiation and benzene, so that gene–environment interaction cannot be evaluated. Further studies with larger sample size and more environmental information were needed.

In summary, our study found that seven loci identified in population of European descent in ALL were also associated with AML in Chinese population. Further studies are warranted to clarify the biological mechanisms of these loci with AML risk and determine the causal variants in AML carcinogenesis.