Introduction

Alzheimer’s disease (AD) is the most prevalent neurodegenerative disease, which mainly leads to memory loss in the elderly population over 65 years old. Extracellular senile plaques (SP), which were mainly composed of amyloid β (Aβ) peptide and intracellular neurofibrillary tangles (NFT) formed by hyperphosphorylated tau protein, are the hallmarks observed in AD brain [1]. It has been widely accepted that AD can be divided into early-onset (EO-AD, <65 years) and late-onset Alzheimer’s disease (LO-AD, ≥65 years).

Genetic linkage studies have identified rare mutations in the amyloid precursor protein (APP), presenilin 1 and presenilin 2 (PSEN1 and PSEN2) genes in early-onset familial Alzheimer’s disease (EO-FAD) [2]. Most of these mutations are inherited in an autosomal dominant manner [3, 4], and can lead to an increased ratio of Aβ42/Aβ40 or Aβ aggregation [5, 6]. However, the etiology of LO-AD is extremely intricate and influenced by complicated interactions between genetic components and environmental risk factors. The ε4 allele of the apolipoprotein E gene (APOE) has been established unequivocally as a susceptible gene for LO-AD, which confers 4–15-fold risk to AD in ε4 allele carriers than in noncarriers [2]. In recent years, several genome-wide association studies (GWAS) were performed to identify risk gene/loci for LO-AD, and ten novel AD risk genes (BIN1, CLU, ABCA7, CR1, PICALM, MS4A6A, CD33, MS4A4E, CD2AP, and EPHA1) were identified [711]. Some of these genes were repeatedly identified as AD risk genes in subsequent independent studies [1214]. In 2011, two large GWASs of European origin populations identified another five potential AD risk genes (ABCA7, MS4A6A/MS4A4E, CD33, CD2AP, and EPHA1); meanwhile, these two GWASs successfully replicated the association between those previously reported loci (CLU, CR1, PICALM, and BIN1) and LO-AD [7, 8]. Collectively, these risk genes can explain around 50 % heritability of LO-AD and are involved in pathways of production, degradation, and clearance of Aβ, cholesterol metabolism, immunity, and cellular signaling [2].

It should be noted that these large-scale GWASs about AD were conducted in European and populations of European origin. Several case-control studies had been carried out to validate the potential association of these genes with AD in different populations [12, 13, 15, 16]. Recently, Tan et al. [17] assessed the association of GWAS-linked genes/loci with LO-AD in a Han Chinese population from North China, and they confirmed that the MS4A6A and CD33 genes were associated with LO-AD.

In this study, we aimed to test whether these GWAS-associated AD risk genes confer genetic susceptibility to AD in different Han Chinese populations from Southwest and East China. We screened 14 single-nucleotide polymorphisms (SNPs) of the GWAS-associated AD risk genes in two independent sample sets and verified the association between several loci of those GWAS-reported genes and AD in Han Chinese populations.

Materials and Methods

Subjects

Two independent samples from Southwest China and East China were recruited and analyzed in this study. The cohort from Southwest China was collected at the Mental Health Center of West China Hospital, which was composed of 333 unrelated sporadic LO-AD patients and 334 cognitively healthy controls. The cohort from East China was composed of 416 unrelated sporadic LO-AD patients and 426 cognitively healthy individuals, which was recruited from the Shanghai Mental Health Center and Tongde Hospital of Zhejiang Province. Around 67 % of the AD patient and control samples had been genotyped for the LRRK2 genetic polymorphisms in our previous study [18]. All participants were of Han Chinese origin. Patients were independently diagnosed by two psychiatrists according to the criteria of DSM-IV and NINCDS-ADRDA [19]. The healthy controls were confirmed as cognitively intact and neurologically normal. All patients were identified as sporadic LO-AD since none of their first-degree relatives had dementia. Written informed consents following the principles of the Declaration of Helsinki were obtained from each participant or guardian. This study was approved by the institutional review board of the Kunming Institute of Zoology, Chinese Academy of Sciences.

SNP Selection and Genotyping

A total of 14 SNPs, including 8 GWAS-reported variants (BIN1, rs744373; CLU, rs11136000; ABCA7, rs3764650; PICALM, rs3851179; MS4A6A, rs610932; CD33, rs3865444; MS4A4E, rs670139; CD2AP, rs9349407) and 6 tag SNPs (BIN1, rs1060743; CLU, rs9331942; ABCA7, rs3752237; PICALM, rs11234495, rs592297, and rs676733), were selected according to the linkage disequilibrium (LD) pattern of the respective genes based on data from HapMap (HapMap, http://hapmap.ncbi.nlm.nih.gov/, phase 3, CHB) and 1000 Genomes (http://www.broadinstitute.org/mpg/snap/). These tag SNPs had a r 2 value >0.8 with the other SNPs and/or were located in different LD blocks with the GWAS hit SNPs. For those genes that span a relatively short genomic region (MS4A6A, CD33, MS4A4E), or tag SNPs that were located in the same LD block with the GWAS hit SNPs (CD2AP), we did not analyze tag SNPs but only the GWAS hit SNPs. The detailed information of each SNP is shown in Table 1. We genotyped different alleles of the APOE gene, which was a well-established risk gene for LO-AD [20, 21], using the same approach described in our previous study [18].

Table 1 Detailed information of 14 SNPs in eight GWAS-linked genes

All these 14 SNPs were genotyped by using the SNaPshot assay, which comprising a multiplex PCR and a subsequent single-base extension process, according to the detailed step-by-step procedure in our previous studies [18, 22]. Briefly, multiplex PCR was carried out in a volume of 8-μL reaction solution containing 20–50 ng genomic DNA, 0.4 mM dNTPs, 0.2–0.5 μM of each primer (Supplementary Table S1), 2.0 mM MgCl2, and 1.0 U of FastStart Taq DNA Polymerase (Roche Applied Science). The amplification program was composed of a pre-denaturation cycle at 94 °C for 2 min; 40 amplification cycles at 94 °C for 30 s, 55 °C for 30 s, and 72 °C for 1 min; and ended with an incubation cycle at 4 °C. Multiplex PCR products were cleaned up at 72 °C for 40 min with 1.0 U of shrimp alkaline phosphatase (SAP) and 0.5 U of Exonuclease I (TaKaRa Biotechnology Co. Ltd., Dalian, China), followed by a final incubation at 96 °C for 10 min to inactivate the enzyme. The single-base extension was conducted in a total of 10-μL reaction solution containing 4-μL multiplex PCR products, 5-μL SNaPshot Multiplex Ready Reaction Mix, and 0.4–0.8-μM pooled SNP-specific oligonucleotide primers (Supplementary Table S1) according to the protocol of the ABI PRISM® SNaPshot® Multiplex Kit (Applied Biosystems). The single-base extension program contained 25 cycles of 96 °C for 10 s, 50 °C for 5 s, and 60 °C for 30 s. The products were then purified by SAP (1.0 U) at 37 °C for 40 min, followed by a heat inactivation at 75 °C for 20 min. A mixture of 4-μL purified products and 9 μL Hi-Di™ formamide was analyzed by the ABI PRISM™ 3730xl DNA analyzer (Applied Biosystems) at the Kunming Biodiversity Large-Apparatus Regional Center, Kunming Institute of Zoology. The GeneMarker software (http://www.softgenetics.com/GeneMarker.html) was employed to read the genotyping results..

Statistical Analysis and Meta-Analysis

The power analysis was calculated using the Quanto software [23]. The frequency of allele, genotype, and haplotype was analyzed using the PLINK software [24]. Binary logistic regression under the general model was performed by SPSS 16.0 (SPSS Inc., Chicago, Illinois) to assess the association between these SNPs and LO-AD, with an adjustment of APOE ε4 status (APOE ε4 + or APOE ε4 ). A P value <0.05 was regarded as statistically significant. When considering Bonferroni correction for multiple testing, a cutoff of P < 0.0035 (0.05/14) was set as statistically significant. The deviation from the Hardy-Weinberg equilibrium of each SNP was calculated for both the control and case populations by using the PLINK software [24]; SNPs with a P value less than 0.001 were regarded as departure from the Hardy-Weinberg equilibrium. LD structures of the BIN1, CLU, PICALM, and ABCA7 genes were reconstructed using Haploview software version 4.2 [25]. Haplotypes were reconstructed by using the PHASE 2.0 program [26]. All the above analyses were performed for the two independent cohorts separately. Considering the fact that these individuals are all of Han Chinese origin and a combined sample may increase the power of the statistical test [27], we pooled all the samples together to investigate the association between these GWAS hit SNPs and tag SNPs and AD risk even though there existed some potential genetic differences between populations from Southwest China and East China. In addition, we investigated whether these SNPs affect messenger RNA (mRNA) expression level of relevant genes in different tissues in the GTEx database (http://www.gtexportal.org/home/) [28].

We performed meta-analysis for SNPs rs3851179, rs3764650, rs11136000, rs744373, rs610932, rs3865444, and rs9331942 with AD by using the RevMan 5.2 software (http://www.cochrane.org/revman) based on data from this study and reported Asian populations (Supplementary Table S2). Those SNPs with no available data in Asian populations were not considered in the meta-analysis.

Results

Statistical Power and LD Pattern of Analyzed SNPs

As the minor allele frequency (MAF) of all SNPs in this study ranged from 11.9 to 46.4 %, assuming a false positive rate controlled as 0.05, the power to detect the odds ratio (OR) value as 1.5 for risk allele was expected to be from 72.8 to 95.5 % based on the samples from Southwest China. However, when we considered the OR value as 1.2, the power was much reduced, with a value range from 20.0 to 38.2 %.

Total genotyping call rate of all individuals was 99.8 %. The allele and genotype counts for each SNP were shown in Supplementary Table S3. All 14 SNPs analyzed in this study were in Hardy-Weinberg equilibrium in both AD patients and controls (Supplementary Table S3). We validated the genotyping results of SNaPshot assay by direct sequencing 5 % of total samples and obtained consistent results. The ε4 allele of the APOE gene showed a significant association with AD risk in both cohorts and in the combined Han Chinese sample (Supplementary Table S4). The LD patterns of these SNPs in the BIN1 (rs1060743-rs744373), CLU (rs9331942-rs11136000), PICALM (rs11234495-rs592297-rs676733-rs3851179), and ABCA7 (rs3764650-rs3752237) genes were similar between AD patients and controls (Fig. 1).

Fig. 1
figure 1

Linkage disequilibrium (LD) patterns of the BIN1 (a), CLU (b), ABCA7 (c), and PICALM (d) genes in AD cases and controls from Southwest China, East China, and combined population. The value in each square refers to r 2 × 100

SNPs in the PICALM Gene Were Associated with LO-AD in Han Chinese

Genotype frequencies of four SNPs of the PICALM gene were significantly different between the case and control populations from Southwest China (rs11234495, P = 0.013; rs592297, P = 0.035; rs676733, P = 0.0004; rs3851179, P = 0.001), and two of them also showed a significant difference at the allelic level (rs11234495, P = 0.016, OR = 0.767, 95 % confidence interval (CI) 0.618–0.951; rs676733, P = 0.030, OR = 0.782, 95 % CI 0.626–0.976) (Table 2). We failed to discern any positive association of these SNPs with LO-AD in samples from East China. When we combined these two cohorts together, the results remained to be significant (Table 2). Logistic regression analysis revealed that these SNPs were significantly associated with LO-AD in populations from Southwest China and the combined sample after adjustment for the APOE4 status (Table 2). However, after Bonferroni correction for multiple testing, only SNPs rs676733 and rs3851179 remained to be significantly associated with AD (Table 2) in the cohort from Southwest China. The inconsistence of association between the two independent cohorts might simply be an issue of low power of the study and/or potential region difference.

Table 2 Association between SNPs in GWAS-linked genes and AD in Han Chinese

Haplotype construction based on the four SNPs of the PICALM gene revealed that haplotype TTTG (SNP order: rs11234495-rs592297-rs676733-rs3851179) increased the risk of AD in the Southwest China cohort (P = 0.043, OR = 1.256, 95 % CI 1.013–1.558), haplotype CTCA increased the risk of AD in the East China cohort (P = 0.023, OR = 1.620, 95 % CI 1.070–2.454), and haplotype CCCA decreased the risk of AD in the combined population (P = 0.039, OR = 0.848, 95 % CI 0.726–0.991) (Table 3). However, all haplotype associations did not survive the Bonferroni correction for multiple testing, possibly due to the low power of this study.

Table 3 Haplotype frequencies of SNPs in the BIN1, CLU, PICALM, and ABCA7 genes in 333 AD cases and 334 healthy controls from Southwest China and in 416 AD cases and 426 normal controls from East China

Intriguingly, the GWAS hit variant rs3851179 and tag SNP rs592297 of the PICALM gene were correlated to its mRNA level in brain cingulate cortex (rs3851179, P = 0.01; rs592297, P = 0.01) according to expression data in the GTEx database (http://www.broadinstitute.org/gtex/), suggesting that these variants might be functional.

SNP rs3851179 of the PICALM Gene Confers Susceptibility to LO-AD in Asian Populations

SNP rs3851179 (PICALM) has been reported to be a susceptibility locus for AD in European populations [911], whereas recent studies in different Asian populations had controversial results. In three reported Han Chinese studies, rs3851179 was not associated with AD [15, 29, 30], but in one Japanese population, it showed a weak association [31].

To evaluate the association between rs3851179 and AD, we performed a meta-analysis of 7739 individuals from Asian populations, which were composed of the above-mentioned four validation studies and the two cohorts analyzed in this study (Supplementary Table S2). Results of the Cochran’s Q test indicated no significant heterogeneity among these populations. The allele test (Z = 2.97, P = 0.003, OR = 0.90, 95 % CI 0.84–0.96) indicated a significant association between rs3851179 and LO-AD (Fig. 2a).

Fig. 2
figure 2

Meta-analysis of SNPs rs3851179, rs744373, rs11136000, rs9331942, rs610932, rs3865444, and rs3764650 with AD in Asian populations. Allele tests of SNPs rs3851179 (a), rs744373 (b), rs11136000 (c), rs9331942 (d), rs610932 (e), rs3865444 (f), and rs3764650 (g) were performed by using the RevMan 5.2 software (http://www.cochrane.org/revman). The detailed information of the reported data from Asian populations is listed in the Supplementary Table S2

SNPs of the BIN1 (rs744373), CLU (rs9331942), MS4A4E (rs670139), and CD2AP (rs9349407) Genes Were Associated to LO-AD in Han Chinese

In Han Chinese cohort from East China, SNPs rs744373 (BIN1, P = 0.038), rs9331942 (CLU, P = 0.013), and rs670139 (MS4A4E, P = 0.022) were significantly different between cases and controls at the genotypic level. SNP rs744373 (BIN1, P = 0.026, OR = 1.256, 95 % CI 1.028–1.535) and rs9349407 (CD2AP, P = 0.037, OR = 1.256, 95 % CI 1.013–1.558) showed a significant difference at the allelic level (Table 2) in the East China cohort and the combined sample, respectively. However, we failed to discern any association between these SNPs (rs744373, rs9331942, rs670139, and rs9349407) and AD in Han Chinese from Southwest China. Note that a positive association of SNPs rs9331942 and rs670139 with LO-AD was observed in the combined Han Chinese populations both before and after adjustment for APOE4 status. Nonetheless, all these associations were no longer significant after Bonferroni correction for multiple testing.

Haplotype analysis showed that there was no significant difference of haplotype frequency of BIN1 and CLU SNPs between AD cases and controls in both cohorts and the combined population (Table 3). Notably, SNPs rs744373 (BIN1, P = 0.009) and rs670139 (MS4A4E, P = 0.03) were found to be significantly associated with their mRNA levels in pituitary, and rs670139 showed a weak association with mRNA level of the MS4A4E gene in brain cingulate cortex (P = 0.07) based on the GTEx database.

Meta-Analysis of the ABCA7, CLU, BIN1, MS4A6A, and CD33 Genes with LO-AD in Asian Populations

Meta-analysis of SNPs rs3764650 (ABCA7), rs11136000 (CLU), rs9331942 (CLU), rs744373 (BIN1), rs610932 (MS4A6A), and rs3865444 (CD33) by using the reported data from Asian populations and data from this study revealed that rs744373 (BIN1) and rs11136000 (CLU) were significantly associated with LO-AD risk in Asian populations (rs744373: Z = 3.22, P = 0.001, OR = 1.14, 95 % CI 1.05–1.24; rs11136000: Z = 2.05, P = 0.04, OR = 0.93, 95 % CI 0.87–1.00) (Fig. 2b, c), while no significant association was observed for other SNPs (Fig. 2d–g). It should be mentioned that the association pattern of the two CLU SNPs were different in our samples and in the result of meta-analysis: rs9331942 was not associated with AD in the meta-analysis but was positively associated with AD in Han Chinese, whereas rs11136000 had a reverse association pattern with rs9331942.

Discussion

GWAS is a burgeoning method to seek novel susceptibility genes/loci for AD. Recently, several GWASs had identified many novel AD susceptibility genes/loci in European and populations of European origin [711]. Independent replication studies in different populations are important for interpreting the case-and-effect relationship and the mechanism of genetic susceptibility.

In order to investigate whether the reported AD susceptibility loci confer risk to AD in Han Chinese populations, we screened GWAS hit and tag SNPs in eight GWAS-reported AD risk genes in two independent Han Chinese sample sets. Albeit the sample size of our sample was modest, we were able to validate the association of three SNPs (rs744373 of BIN1, rs3851179 of PICALM, and rs670139 of MS4A4E) with LO-AD in our Han Chinese samples, and the effect direction of these AD risk SNPs was as reported in previous GWAS [7, 8, 10, 11]. SNP rs9349407 of CD2AP was marginally associated with LO-AD in population from East China, with similar effect direction as reported in previous studies [7, 8]. The GWAS hit SNP rs11136000 in the CLU gene showed no association with AD in our study, but we observed a positive association of another SNP rs9331942 in this gene with LO-AD.

Of note, the selected PICALM tag SNPs in this study (rs11234495, rs592297, and rs676733) were all associated with LO-AD in our Han Chinese population from Southwest China and the combined population, indicating an important role of the PICALM gene in AD. Moreover, the association of rs3851179 and rs676733 of the PICALM gene with AD risk in the cohort from Southwest China (rs3851179, P = 0.002; rs676733, P = 0.002, with an adjustment of the APOE ε4 status) remained to be significant even after Bonferroni correction.

The PICALM gene encodes phosphatidylinositol-binding clathrin assembly protein (PICALM), which is a key component in clathrin-mediated endocytosis [32]. As APP must be internalized into cells through endocytosis before being cleaved to Aβ [33], PICALM may play a key role in the Aβ production and release and influence Aβ levels. In this study, all four PICALM SNPs (rs3851179, rs11234495, rs592297, and rs676733) were significantly associated with LO-AD. Several haplotypes determined by these SNPs also conferred AD risk. Note that this association did not survive the Bonferroni correction, which can be explained by the limited sample size of this study. The GWAS hit SNP rs3851179 (which is located at 88.5-kb upstream of PICALM) and tag SNPs rs11234495 and rs676733 (both are within the intron region) may be linked to some functional variants conferring the etiology of AD. SNP rs592297 is a synonymous variant in exon 5 that may potentially influence exon splicing, which was first reported to be associated with LO-AD in a large-scale GWAS, but the association did not reach statistical significance at the genome level (2 × 10−7) [11]. Furthermore, according to the GTEx database, SNPs rs3851179 and rs592297 were associated with mRNA expression level of the PICALM gene in brain cingulate cortex, which was identified to be involved in learning and memory process [34]. Though the association between rs3851179 and LO-AD has been widely identified in European populations [913] and was validated in this study, several replication studies produced inconsistent results in different populations [12, 30, 35]. Meta-analysis of rs3851179 based on the data of this study and previously reported studies [15, 2931] demonstrated that this SNP indeed conferred susceptibility to LO-AD in Asians, which was in accordance with a recent report by Liu and colleagues [36]. Our result and previous reports supported a notion that the PICALM gene was a common susceptibility gene for AD in different populations.

Gene expression data in the GTEx database indicated that the GWAS-associated SNPs in the BIN1 (rs744373) and MS4A4E (rs670139) genes may affect AD risk through their influence on gene expression. SNP rs744373 of BIN1 was confirmed to be associated with AD risk in different ethnic backgrounds including East Asian and Caucasian populations according to a meta-analysis based on several validation studies [37]. In this study, we also observed an association between rs744373 and AD risk. Furthermore, meta-analysis for rs744373 in Asian populations supported its role in AD. The BIN1 mRNA expression level was found to be significantly associated with different genotypes of rs744373 in pituitary region by re-analyzing expression data from the GTEx database, which was in line with previous evidence that variant rs59335482 (rs744373 is tightly linked to this SNP) could mediate AD risk by increasing cerebral expression of BIN1 [38]. In addition, we successfully replicated the association between rs670139 and LO-AD in this study. SNP rs670139 lies within an intergenic region between the MS4A4E and MS4A6A genes [7]. The gene expression data of the GTEx database indicated a potential association between rs670139 and MS4A4E mRNA level in both cingulate cortex and pituitary region. One recent study also demonstrated that rs670139 was associated with AD patient’s Braak tangle and Braak plaque score [39]. There may be some other functional variants that were tightly linked with rs670139 and affected the expression and function of nearby genes [40]. SNPs rs9331942 in the 3′ UTR of CLU and rs9349407 in the 5′ UTR of CD2AP showed an association with AD in the combined Han Chinese sample, which may participate in the transcriptional regulation of the related genes. Further experimental assay should be performed to solidify our speculations.

Failure in replicating GWAS results is a common issue in genetic studies of complex disease, which is influenced by genetic heterogeneity and environmental risk factors [41]. For instance, we recently failed to validate the GWAS hit risk loci for schizophrenia in Han Chinese, albeit these susceptibility loci were initially identified in the same ethnic populations [42]. We encountered similar conditions in this study, in which the association of certain GWAS hit SNPs with AD was inconsistent between the two case-control cohorts from Southwest China and East China (Table 2). The discrepancy may be attributed to several reasons. First, the weak statistical power due to relatively small sample size may account for the inconsistent results in different populations. Second, we only considered the GWAS top hit SNP in each locus and one to four tag SNPs for some of the top hit genes. As the GWAS hit SNPs are most probably not the functional variants, but themselves tag SNPs of the functional variants (in populations from European ancestry). Those top hit SNPs might not correctly tag the functional variants in Han Chinese because the LD pattern differs among populations. Third, the gene-gene and gene-environment interactions may also influence the effect of risk allele between different populations. Genetic heterogeneity of AD may imply more and more risk genes for this disease, as exemplified by the most recent meta-analysis that identified 11 new susceptibility loci [43].

In summary, we verified the association between the PICALM, CLU, MS4A4E, BIN1 genes and LO-AD in Han Chinese populations, whereas the association of the remaining GWAS hit genes with AD was not validated in our populations. Particularly, meta-analysis revealed that the PICALM, BIN1, and CLU genes were significantly associated with AD risk in Asian populations. One limitation of this study is that we lacked demographical data, such as age, sex, and education year of subjects for all individuals, which disabled us from retrieving more information in the association analyses. Further independent validating studies and essential functional assays are needed to solidify the current conclusions and to characterize the putative role of these genes concerning the production, transport, release, and clearance of Aβ from brain into blood and their participation in innate immunity in the central nervous system.