Introduction

Alzheimer’s disease (AD) is the most prevalent form of progressive dementia in the elderly population. Like several other neurodegenerative diseases, AD is influenced by both genetic and environmental factors. Genetic studies have shown that three major genes, APP, PSEN-1, and PSEN-2, are implicated in the familiar early-onset form of AD [1], whereas the apolipoprotein E (APOE) ε4 allele has been reported to be a risk factor for both familial AD (FAD) and sporadic AD (SAD) [2]. However, the APOE ε4 allele only accounts for a fraction of SAD patients, which indirectly suggests that other susceptibility genes may exist.

Genome-wide association studies (GWAS) may be useful for identifying novel genes and single-nucleotide polymorphisms (SNPs) that affect the risk of complex diseases. To identify SAD susceptibility genes, a series of GWAS using Caucasian populations as the study subjects were conducted in recent years, and multiple gene polymorphisms associated with AD have been reported. A GWAS in 2007 found that five loci located in the GALP (rs3745833), TNK1 (rs1554948), PCK1 (rs8192708), and PGBD1 (rs3800324) genes and a locus with unknown function in 14q32.13 (rs11622883) were associated with AD pathogenesis in British and American populations [3]. In addition, in 2007 Reiman et al. [4] conducted a GWAS and reported that six loci in the GAB2 gene were associated with the pathogenesis of AD4. Also, a GWAS conducted in 2008 based on Canadian and British populations showed that two polymorphic sites in the GOLPH2 gene, rs10868366 and rs7019241, were associated with onset of SAD [5]. There have been several other GWAS targeting populations of the United States, England, Canada, Belgium, Finland, Italy, and Spain which reported that a few polymorphic sites in the CLU, PICALM, LRAT, PCDH11X, CD33 genes were also associated with the pathogenesis of AD [614]. These reports are breakthrough studies on AD susceptibility genes. However, because different nations and regions have strong heterogeneity in genetics and phenotypes of SAD, as well as the fact that SNPs allele frequency distribution is different among diverse populations, population association studies on these SAD-related loci identified through GWAS have been carried out in different regions and ethnic groups [1517]. Nevertheless, there have been few reports related to the Asian population [1821]. As the follow-up and replication of GWAS findings is the only way to validate proposed GWAS signals [22], our study used samples from the North Chinese Han population to detect SNPs reported in the above GWAS (Table 1), in order to explore their association with SAD in the Chinese population.

Table 1 General information and primer sequences of the 19 SNPs

Materials and methods

Subjects

The study included 191 SAD patients (88 men and 103 women; mean age: 73.1 ± 8.9 years; mean age at onset: 66.7 ± 7.5 years) who were enrolled from the Xuan Wu Hospital of the Capital Medical University and from the Beijing Senile Hospital in Beijing City. Medical interviews, physical examinations, blood tests, brain magnetic resonance imaging (MRI), and neuropsychological assessments, which included the Mini-Mental State Examination (MMSE), Clinical Dementia Rating (CDR), Hachinski Ischemic Scale (HIS), Activity of Daily Living scales (ADL), and Hamilton Depression Scale (HDS), were performed. All patients were diagnosed as having “probable AD” according to the National Institute of Neurologic and Communicative Disorders and Stroke-AD and Related Disorders Association (NINCDS-ADRDA) criteria [23]. None of these patients reported a family history of AD. The control group consisted of 180 healthy subjects (85 men and 95 women; mean age: 74.3 ± 5.6 years) who were recruited from the outpatient department of Xuan Wu Hospital in Beijing. They underwent regular health examinations, and were confirmed to be healthy and neurologically normal by medical history, physical examination, and neuropsychological assessment including the MMSE, a clinical memory scale (CMS), and the HDS to rule out cognitive deficit. All the SAD patients and control subjects were unrelated Han Chinese residents from the North of China. Informed consent was obtained for each subject, either directly or from his or her guardian. The protocol of this study was approved by the Institute Ethical Committee.

SNP selection

We selected for this study AD-associated polymorphic sites reported in recent GWAS. Expecting loci in APOC1, SORL1, and other genes reported by our laboratory previously [24] and loci with a minor allele frequency (MAF) of less than 5% in the Chinese population, we chose a total of 19 loci in this study (Table 1).

Genotyping

Genomic DNA was isolated from peripheral blood samples by a salting-out procedure [25], and all DNA samples were normalized to 50 ng/μl. The genotyping was performed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry(MALDI-TOF MS) assay (MassArrayTM, Sequenom Inc., San Diego, CA, USA). Polymerase chain reaction (PCR) and extension primers were designed using the Sequenom MassARRAY Assay Design software (primer sequences shown in Table 1). Genotype calls were made using SEQUENOM Typer 4.0 software with the post-processing calling procedure. Ten percent of randomly selected DNA samples from both patients and controls were sequenced to validate the genotyping by MALDI-TOF MS.

All subjects were genotyped for APOE using methods previously described [26].

Statistical analyses

Hardy–Weinberg equilibrium (HWE) was tested at http://analysis.bio-x.cn/myAnalysis.php. Allelic and genotypic distributions in SAD patients and controls were assessed by chi-square test. Genotype and allele frequencies for each SNP were also stratified by the presence of the APOE ε4 allele. The results were adjusted for age, sex, and APOE ε4 status using multiple logistic regression models. Analyses were performed using the Statistical Package for the Social Science (SPSS) version 13.0. Linkage disequilibrium was checked using EH program and D′ and r 2 were calculated at http://analysis.bio-x.cn/myAnalysis.php. The odds ratio (OR) and 95% confidence intervals (CI) were calculated as estimates of the strength of the association between genotypes or haplotypes and SAD. A value of P < 0.05 on a two-sided test was considered significant.

Results

Eighteen of the 19 SNPs were in HWE. The genotype frequencies of SNP19 (SNP rs5984894 on Xq21.3) in male samples did not conform to the HWE due to its location on the X-chromosome. The control females were in HWE (χ2 = 2.280, P = 0.131), and the case females were in HWE (χ2 = 0.264, P = 0.607). This result is similar to that reported by Wu et al. [18].

As expected, the APOE ε4 allele in the SAD patients was significantly higher than in the controls (22.0 vs. 9.7%; χ2 = 24.708, P < 0.01).

For SNP1 (GOLPH2/rs10868366) and SNP2 (GOLPH2/rs7019241), there were significant differences in allele frequencies between SAD patients and controls (rs10868366: P = 0.026, rs7019241: P = 0.019), but not in genotype distribution (rs10868366: P = 0.081, rs7019241: P = 0.07). Higher frequencies of rs10868366 G and rs7019241 C carriers were observed in patients with SAD compared with control subjects (rs10868366 G: 52.4 vs. 44.2% P = 0.026, OR = 1.4, 95% CI: 1.0–1.9; rs7019241 C: 50.8 vs. 42.2% P = 0.019; OR = 1.4, 95% CI: 1.6–1.9). However, after stratification by age of onset, gender, and APOE ε4 status, there was no significant difference in the allele or genotype frequencies between the controls and SAD patients for the two SNPs (Table 2). The genetic models of the two SNPs were analyzed and the results remained no significant differences between the SAD patients and controls after gender, age, and APOE adjustment by logistic regression.

Table 2 Genotype and allele frequencies of three SNPs in SAD cases and controls stratified by APOEε4 status

Linkage disequilibrium between alleles at the two SNPs were studied and we found rs10868366 and rs7019241 to be in strong linkage disequilibrium (D′ = 0.983, r 2 = 0.900). The rs10868366T allele was in strong linkage disequilibrium with the rs7019241 T allele, and likewise there was strong linkage disequilibrium between the rs10868366 G and rs7019241 C alleles. Furthermore, the rs10868366 T/rs7019241 T haplotype occurred at a lower frequency in the AD patients than in the controls, suggesting that it is a possible protective factor for AD, whereas rs10868366 G/rs7019241 C is a risk factor (Table 3).

Table 3 Distribution of haplotype of GOLPH2 in SAD cases and controls stratified by APOE ε4 status

For SNP18 (SNP rs3826656 in near gene 5′ region of CD33), there were no significant differences in the distributions of either allele or genotype between SAD patients and controls. However, in subjects with APOE ε4 alleles, a significant difference in the distribution of alleles (χ2 = 5.929, df = 1, P = 0.015) and genotypes (χ2 = 8.324, df = 2, P = 0.016) between the SAD and control groups was seen, with A allele and AA genotype lower in the SAD group (Table 2). The A allele was associated with a reduced risk of SAD compared with the G allele (OR: 0.479; 95% CI: 0.263–0.870, P = 0.015), and the AA genotype was associated with a reduced risk of SAD compared with the genotype AG + GG (OR: 0.395; 95% CI: 0.158–0.659, P = 0.008).

There were no significant differences in the allele or genotype frequencies between the control and SAD subjects for the other 16 SNPs, even after stratification by age of onset, gender, and APOE ε4 status (data not shown). No statistical interaction was observed between APOE and any of the SNPs examined.

Discussion

Genome-wide association studies have opened a new era for the study of AD, allowing researchers to compare all variations in allele frequencies between patients and controls among the whole genome, and thereby discover sequence variations associated with the disease. However, one of the important issues concerning GWAS is enlarged type I error due to multiple hypothesis testing and false-positive association. Repeat of the GWAS results, especially in other ethnic cohorts, is necessary to identify the true SNPs associated with the disease [22]. Therefore, in this study we used North Chinese Han population samples to detect such loci reported in GWAS, in order to explore the association of SNPs with SAD in the Chinese population.

Using a cohort of 191 LOAD cases and 180 controls, we replicated in North Chinese Han population the association of the two SNPs (rs10868366 and rs7019241) in the Golgi phosphoprotein 2 gene (GOLPH2) that were reported to be associated with AD in Caucasians [5]. According to Li et al.’s study, rs10868366 TT + TG genotype and rs7019241 TT + CT genotype decreased the risk of AD when compared with rs10868366 GG genotype and rs7019241 CC genotype. Our results are in partial agreement with results of Li et al.; we found that lower frequencies of rs10868366 T and rs7019241 T carriers were observed in patients with SAD compared with control subjects. The rs10868366 T allele is in strong linkage disequilibrium with rs7019241 T allele, and likewise there is strong linkage disequilibrium between the rs10868366 G and rs7019241 C alleles. Furthermore, the rs10868366 T/rs7019241 T haplotype is a possible protective factor for SAD, while rs10868366 G/rs7019241 C is a risk factor. Our study provides further support for the association of SAD with the common variant rs10868366 and rs7019241 at the GOLPH2 locus. According to our data (Table 2), there was significant difference in allele frequencies at GOLPH2 gene loci rs10868366 and rs7019241 between SAD group and control group, and the frequencies of rs10868366 G and rs7019241 C carriers was higher in SAD group than in control group. Comparison of allele frequencies at the two loci showed no significant difference after stratification by APOE ε4 status. However, the data (Table 2) showed that, in subjects with APOE ε4 alleles, the frequencies of loci rs10868366 G and rs7019241 C carriers was higher in SAD group than in control group, demonstrating a trend similar to the overall allele frequency distribution prior to stratification. The relatively fewer number of cases in APOE ε4 positive group may be responsible for the insignificant statistical difference. In further research, it may be necessary to use larger sample size and stratified comparison.

The GOLPH2 gene is located in 9q21.33 with suggestive linkage to AD [27]. The Golgi phosphoprotein 2 gene is highly expressed in subfields of the hippocampal formation in mouse brain [28]. In addition, Inkster et al. [29] reported that AD patients with the rs10868366 higher risk GG genotype have less left prefrontal cortical gray matter volume relative to AD patients with the lower risk genotype. The SNPs rs10868366 and rs7019241 are located in the intron of the GOLPH2 gene and seems not to be associated with known functional roles, such as splicing or transcriptional regulation. Therefore, we presumed that identified SNPs may be in linkage disequilibrium with some causative loci of the GOLPH2 gene, which might affect GOLPH2 expression and impact the risk of AD. However, two recent studies reported that no significant association existed between GOLPH2 gene polymorphisms and the risk of AD [16, 30]. We used the same type of study subjects as Li et al. [5], i.e., probable AD patients, while Antúnez et al. [30] and Schjeide et al. [16] chose definite, probable, and possible AD patients as their study subjects. In addition, unlike the family-based study by Schjeide et al., Li et al., and our group selected unrelated individuals as study subjects. Also, variable sample sizes and experimental methods may have produced statistical bias. The number of cases in our study is relatively few, so future studies need to use a larger size sample of the Chinese population to detect the polymorphic sites in the promoter region, coding region, and 3′ end of the GOLPH2 gene to confirm the association of this gene with SAD.

In addition, we found that a SNP rs3826656 in near gene 5′ region of CD33 allele A and AA was associated with SAD in APOE carriers, which is different from the conclusion of Bertram et al. [10] in a family-based GWAS study that used the American population. According to Bertram and colleagues’s study, the increased risk for AD was found in the dominant model of the minor allele G, but their data were not based on APOE-stratified comparison. However, we found that the rs3826656 A allele and AA genotype may decrease the risk for development of SAD in APOE ε4 carriers in the north Chinese Han population. Meanwhile, there was also a study reporting that there was no association between SNP rs3826656 and AD in the Belgian population [31]. A possible explanation for the discrepancy might be the genetic heterogeneity of AD. The SNP rs3826656 may have contributed to AD pathology in Americans and Chinese, whereas in Belgians, other genetic factors may play a role in the development of AD.

SNP rs3826656, on 19q13.33, resides less than 2 kb proximal of the transcription initiation site of the CD33 gene. The CD33 gene encodes a cell surface receptor on cells of monocytic or myeloid lineage, and CD33 is a member of sialic-acid-binding immunoglobulin-like lectins [32]. Currently, there is no evidence that the CD33 gene may be involved in the pathogenesis of AD. GWAS of AD may identify more sequence variations in some unknown gene or chromosome regions that are associated with the disease. Such information may help us understand the disease biology in greater detail in the future. Hence, further experimental research is necessary to define the direct functional association between the SNP rs3826656 on chromosome 19q33 and the occurrence of SAD.

In this study, we failed to find a significant association between alleles, genotypes of the other 16 SNPs, and SAD in the North Chinese Han population. Stratification by APOE ε4 allele status or age at onset did not change the results. According to our data, the association of the 16 previously reported SNPs with AD, if it exists, was not statistically significant in our study population. In view of these inconsistent results, a possible explanation is that if these SNPs play a role in influencing risk for SAD, it is likely to be small and may only exist in certain defined populations. However, the discrepancies might also be explained by reduced statistical power of our study resulting from moderate sample size. We estimated the power of our study material and found that our sample size had a 87% power to detect a genotypic OR of 2.0 at P < 0.05 for a risk allele frequency of 0.20, assuming disease prevalence of 5%. Therefore, further studies in larger samples are necessary to decrease the chance of false negatives.

In summary, through an exploration of 19 candidate SNPs our results indicate that rs10868366 and rs7019241 in GOLPH2 and rs3826656 in near gene 5′ region of CD33 are significantly associated with SAD in the North Chinese Han population. Further studies in larger samples in Chinese populations and meta-analyses of genetic association studies will be the best tools for validating GWAS findings in the future.