Introduction

Chronic obstructive pulmonary disease (COPD) is one of the leading causes of morality worldwide, and is a major public health problem (Bossé 2012; Laratta and van Eeden 2014). Prevalence of the disease is estimated to be about 10% in the general population, which increases to 20–30% in the population aged above 40 years (Gagnon et al. 2014). COPD is a complex disease with multiple risk factors including environmental, smoking and genetic factors. COPD is defined as a preventable and treatable disease with major extrapulmonary effects. Previously, it was characterized by shortness of breath with exertion, wheezing and chronic cough (Mapel and Roberts 2012; Naik et al. 2014). Currently, a decrease in forced expiratory volume in 1 s (FEV1%) and the ratio of FEV1 to forced vital capacity (FEV1/FVC) are the diagnostic criteria of COPD (Berndt et al. 2012). Hypoxic vasoconstriction with advanced airflow limitation in COPD patients may result in pulmonary hypertension (Wells and Dransfield 2013). Vascular endothelial growth factor A (VEGF-A) is considered one of the most important regulatory factors in pulmonary hypertension. In recent years, case–control studies confirmed the relationship between single-nucleotide polymorphisms (SNPs) in VEGF-Aand increased risk of disease development, including lung cancer, acute lung injury, etc. (Meyer et al. 2012; Lin et al. 2013; Maeda et al. 2013). VEGF-A plays an important role in the migration, proliferation and survival of endothelial cells (Gong et al. 2011). Previous studies report that SNPs rs10434 and rs3025028 contained within VEGF-A are associated with lung function from infancy to adulthood, and play an important role in determination of airway function (Simpson et al. 2012). VEGF-A is located on chromosome 6p21.1 and comprises a coding region separated by seven introns. To our knowledge, there is little information on SNPs in VEGF-A related to COPD in patients. We designed a case–control study to determine whether alterations in VEGF-Aincrease susceptibility to COPD occurrence. The purpose of our study was to identify the relationship between SNPs in VEGF-Aand COPD.

Materials and methods

Study population

All cases and controls were restricted to Hainan province, located in the Hainan Provincial People’s Hospital. All patients with COPD were diagnosed according to the criteria established by the World Health Organization Global Initiative for Chronic Obstructive Lung Disease (GOLD) (Rabe et al. 2007) and they did not have history of other diseases. Case individuals were chosen regardless of age, gender, smoking status or COPD classification. Normal subjects were selected randomly from a native Hainan population while attending medical checkup at their community hospital, Hainan Provincial People’s Hospital. A total of 246 cases and 350 controls were recruited in this study. Basic characteristics of the participants such as gender, age and smoking status are listed in table 1 and the classification of COPD with FEV1/FVC levels is shown in table 2.

Table 1 Characteristics of controls and COPD cases in this study.
Table 2 Classification of severity of COPD.

Clinical data and demographic information

Personal data, including residential regions, age, smoking status, gender, education status and family history of cancer were collected using standard epidemiological questionnaire and in-person interview. The case information was collected through consultation with treating physicians or from medical chart review. All participants signed an informed consent agreement. The Human Research Committee for approval of research involving human subjects, Hainan Provincial People’s Hospital approved use of human tissue in this study.

Selection of SNPs and method of genotyping

We selected SNPs from published polymorphisms associated with COPD. SNPs with minor allele frequency (MAF) >5% in a HapMap CHB population were selected resulting in 18 SNPs in different genes. Genomic DNA was extracted from whole blood using phenol–chloroform (Köchl et al. 2005).

Genomic DNA was extracted from whole blood using the GoldMag-Mini whole blood genomic DNA purification kit (GoldMag, Xi’an, China). We used Sequenom Mass ARRAY assay design 3.0 software (Sequenom, San Diego, USA) to design the multiplexed SNP mass EXTEND assay. SNP genotyping was performed by Sequenom Mass ARRAY RS1000 using the standard protocol recommended by the manufacturer sequenom type r4.0 software was used to perform data management and analysis.

Statistical analysis

To perform statistical analyses, we used Microsoft Excel and SPSS 17.0 statistical package (SPSS, Chicago, USA). In our study, P≤ 0.05 is the threshold of statistical significance. The validation of each SNP frequency in control subjects was tested for departure from Hardy–Weinberg equilibrium (HWE) using an exact test. Chi-square test was done to calculate the genotype frequencies of case and control individuals (Adamec 1964; Köchl et al. 2005). Unconditional logistic regression analyses with adjusted for age, gender and smoking states was performed to test. (Bland and Altman 2000). The possibility of gender difference as a source of population substructure was evaluated by a genotype test for each SNP in male and female controls, and the number of significant results at 5% level was compared with the number expected by the chi-square test. The association of certain SNPs with the risk of COPD was tested in four genetic models (allele, codominant, dominant and log-additive); unconditional logistic regression analysis with adjustment for age, gender and smoking was done to calculate ORs and 95% CIs (Bland and Altman 2000; Purcell et al. 2007).

Results

From the 596 participants we detected 18 SNPs. Two SNPs (rs9951925 and rs8102683) were excluded at 5% HWE statistically significant P level. The association between SNP genotypes and the susceptibility of alleles to COPD was performed by chi-square analysis. All results are shown in table 3.

Table 3 Basic information for candidate SNPs in this study.

We assumed that the minor allele of each SNP was a risk factor compared to the wild-type allele. Three models: codominant, dominant and log-additive were applied for analysing the association between SNPs and COPD risk by using logistic tests. We discovered one SNP (rs10007052) in the RNF150 gene which was a risk factor for COPD in the dominant model (rs10007052, OR 1.60; 95% CI 1.13–2.26; P = 0.0076), codominant model (rs10007052, OR 1.55; 95% CI 1.08–2.21; P = 0.023) and log-additive model (rs10007052, OR 1.50; 95% CI 1.12–2.01; P = 0.006). We found two SNPs in VEGF-Awhich were risk factors for COPD (rs3025030, OR 1.40; 95% CI 1.05–1.87; P = 0.022; rs3025033, OR 1.38; 95% CI 1.03–1.84; P = 0.03). We also detected one SNP (rs9296092) using the allele model which had a similar relationship with the susceptibility to COPD (rs9296092, OR 2.63; 95% CI 1.27–5.45; P = 0.078). According to the results of the three models calculated by unconditional logistic regression analyses adjusted for age, gender and smoking status, we first found that only rs9296092 was still considered a risk factor (rs9296092, OR 3.53; 95% CI 1.12–11.11; P = 0.031) (table 4). Pairwise linkage disequilibrium (LD) analysis was performed for VEGF-Ausing the polymorphisms detected in this study. The pattern of LD was analysed using two parameters, r 2 and D . One main linkage block was observed across the locus (figure 1). This block was comprised of seven closely-linked SNPs: rs833068, rs833070, rs3024994, rs3024997, rs3025000, rs3025030 and rs3025033. Then, the association between inferred haplotypes and COPD risk among the individuals was analysed. We found the haplotype ‘CG’ between rs3025030 and rs3025033 associated with the risk of COPD (OR 1.39; 95% CI 1.04–1.86; P = 0.028) (table 5).

Figure 1
figure 1

Linkage disequilibrium of polymorphic sites in VEGF-A on chromosome 6. Block 1 includes rs833068, rs833070, rs3024994, rs3024997 and rs3025000. Block 2 includes rs3025030 and rs35025033. The LD between two SNPs is standardized \(D^{\prime }\) (red schemes).

Table 4 Logistic regression analysis of the association between SNPs and risk of COPD.
Table 5 Multiple genes haplotype frequencies and association with the risk of COPD.

Discussion

Environmental risk factors and effects of smoking may increase the occurrence of COPD. However, several genetic polymorphisms associated with susceptibility to COPD have been identified. Whereas each individual polymorphism may contribute to only a small relative risk of COPD, a combination of several implicated polymorphisms and other risk factors may increase the relative risk. In this study, we detected 18 SNPs from the case–control study in the Han Chinese population from Hainan city. We considered two SNPs in VEGF-A (rs3025030 and rs3025033) as risk factors for COPD. Our results suggest that VEGF-A may play an important role in the risk of COPD in Hainan individuals.

VEGF-Awhich is highly polymorphic and exists with multiple common SNPs, complicating the investigation for the genetic components of COPD occurrence. Previous studies have shown that genetic variability affects VEGF-A activity and expression (Fang et al. 2009). VEGF, coded for VEGF-A, is a key mediator of angiogenesis and vascular permeability (Su et al. 2011). VEGF-A overexpression or suppression causes a decisive effect on the function of VEGF, which may lead to a complex change in angiogenesis of tissue. The changes of pulmonary small vessels is one of pathological changes of COPD patients (Hogg and Timens 2009), in which VEGF-A plays a vital role. Additionally, a previous study reported that VEGF-Acan disrupt the pattern formation of pulmonary smooth muscle (Akeson et al. 2003), which may result in progression of pulmonary hypertension, further aggravating limited pulmonary ventilation. Moreover, previous literature identified several SNPs, including rs3025030 and rs3025033, as associated with colon or rectal cancer by influencing the expression of VEGF-A in the angiogenesis pathway (Slattery et al. 2014). In our case–control study, we found two VEGF-A polymorphisms (rs3025030, rs3025033) caused a more pronounced effect than the other single polymorphisms in VEGF-A. This finding suggests a novel idea that VEGF-A polymorphism combinations may be risk factors for COPD. SNPs may influence the expression of VEGF-A in the pathway, which may reduce beneficial effects of VEGF in lung tissue and increase the risk of COPD. The combination of two polymorphisms (rs3025030, rs3025033) may further increase the risk of COPD. Unfortunately, single SNP analysis of rs3025030 and rs3025033 calculated by unconditional logistic regression analyses adjusted for age, gender and smoking were nonsignificant. In our opinion, smoking may be one of the most important confounding factors and it may overshadow the genetic effect of VEGF-A polymorphisms in COPD. In future studies, researchers may identify the relative interaction between VEGF-A polymorphisms and COPD development.

A previous genomewide association study has not only proved that rs9296092 lies in the gene region between the zinc finger and BTB domain containing 9 and BCL2-antagonist/killer1 (ZBTB9-BAK1), but also is associated with age at smoking initiation (Siedlinski et al. 2011). We also found rs9296092, located near 6p21.1, predicts a 2.63-fold COPD risk by the allele model and a 3.53-fold disease susceptibility calculated by unconditional logistic regression analyses adjusted for confounding factors. We hypothesize that rs9296092 may be a risk factor for COPD, although the exact genotype of the loci is not identified.

We did not find either rs7937 or rs3733829 in EGLN2were associated with COPD or lung function in the Chinese Han population from Hainan province. The results included strong levels of D between rs7937 and rs3733829 observed without adjustment, which might be a relative factor with COPD patients. Overexpression of EGLN2 at a known COPD risk locus leads to occurrence of disease (Salit et al. 2014)

Two limitations should be considered in the current study. First, because our sample size was relatively small, we did not conduct population stratification of smokers, therefore, we have not confirmed that this locus is significant in smokers. We will increase the sample size in subsequent experiments. Second, the heterogeneity in smoking behaviours and comorbidities was not evaluated in this study, and evaluation of heterogeneity in smoking behaviours has contributed to elucidating the pathogenesis of COPD in other studies.

We performed Bonferroni correction in our statistical analysis and found no statistical significant associations between VEGF-A SNPs and COPD. This may be due to the relatively small sample size, the selection criteria of VEGF-A SNPs (MAF 0.5%) and the weakness of Bonferroni correction itself. Adjustments for multiple tests, such as Bonferroni correction analysis, are required for medical association studies, but also create more problems than they solve. The main weakness of Bonferroni correction is that the interpretation of a finding depends on the number of other tests performed. True important differences may be deemed nonsignificant since the likelihood of type II errors are also increased. However, Bonferroni corrections are considered acceptable while performing associations without preestablished hypotheses.

In conclusion, the combined actions of different confounding factors, such as small sample size and lack of stratification of smokers resulted in a nonsignificant phenomenon with adjustment. In our opinion, both rs3025030 and rs3025033 in gene VEGF-Amay be risk factors for COPD, and interactions between loci in VEGF-A may be more important than a single locus change. We also found a new link between rs9296092 and COPD risk.