Introduction

Lung cancer is the most frequently diagnosed cancer worldwide, accounting for 13 % (1.6 million) of all cancer cases and 18 % (1.4 million) of all cancer deaths in 2008 [1]. It is recognized as the leading cause of cancer death in males and the second one in females [1]. Cigarette smoking is the main necessary etiologic factor for lung cancer, accounting for 80 % of the worldwide lung cancer burden in males and at least 50 % of the burden in females [2, 3]. Tobacco smoke contains free radicals as well as 55 carcinogens that can generate reactive oxygen species and lead to mutations [4]. Among them, 20 compounds have been found convincingly to induce lung tumors in at least one animal species [5]. The carcinogens can react with human DNA and cause DNA damages, and if left unrepaired, such DNA damages can induce mutations and initiate tumorigenesis [6]. Nevertheless, only a small fraction of smokers eventually develop lung cancer, suggesting a wide inter-individual variability in susceptibility [7]. Polymorphisms of DNA repair genes may modulate DNA repair capacity, thus lead to genomic instability and contribute to inter-individual diversity in cancer susceptibility, including lung cancer [8, 9].

Over 130 DNA repair genes have been identified in the four major DNA repair pathways, including base excision repair (BER) pathway [10]. The BER pathway plays an important role in repairing small base lesions in DNA resulting from oxidation and alkylation damage by the specific DNA glycosylase [11]. Mammalian cells contains a series of DNA glycosylases including human 8-oxoguanine DNA glycosylase1 (hOGG1), which encodes the hOGG1 enzyme responsible for the excision of 8-oxoguanine, a mutagenic base byproduct resulting from exposure to reactive oxygen [12].

The hOGG1 gene is located at chromosome 3p26.2 and expressed as twelve alternatively spliced isoforms, among which only the 1α form contains a nuclear translocation signal [13]. A few coding region single nucleotide polymorphisms (SNPs) have been identified in the hOGG1 gene, including the most commonly investigated Ser326Cys polymorphism, which has been reported to be associated with a reduced enzyme activity in a bacterial complementation assay system [14]. Numerous studies have explored the association between hOGG1 Ser326Cys polymorphism and lung cancer susceptibility [1450]; however, the results were inconsistent and conflicting. To overcome the deficiency of small sample size included in each study and limited statistical power, we performed this meta-analysis from all eligible studies to derive a more precise estimation of the association under different genetic models, especially for the smoking status and smoking intensity.

Material and methods

Search strategy and selection criteria

We performed a comprehensive literature search from MEDLINE and EMBASE databases using the following key words: “hOGG1 or OGG1,” “polymorphism or variant or variation,” and “lung” (last search was updated on May 13, 2013). Additional studies and review articles were hands-on searched from references of related original studies or review articles. If more than one article was published using the same patient population or overlapping data from the same institutions, only the latest or the largest study would be included in the final meta-analysis. We also searched related investigations from Chinese Biomedical (CBM) database (http://cbmwww.imicams.ae.cn/cbmbin) with the combinations of “hOGG1” and “lung cancer” in Chinese to maximize the coverage and minimize the selection bias.

Studies included in the final meta-analysis had to meet the following inclusion criteria: a case–control design, an evaluation of the association between hOGG1 Ser326Cys and lung cancer risk, sufficient information to estimate odds ratios (ORs) and their 95 % confidence intervals (CIs), and independence from other studies.

Data extraction

Two authors (Zong-Bao Yin and Rui-Xi Hua) independently assessed the articles and extracted data for compliance with the inclusion criteria. Disagreements were resolved by discussions between the two authors until consensus on all of the eligibility items was reached. Disputes were resolved by an additional author, and the final decision was made by the majority of the votes.

The following data were extracted from each publication: the first author's surname, year of publication, country of origin, ethnicity, match, source of controls, smoking status, cigarette takes, genotyping methods, total number of cases and controls, and numbers of cases and controls with the Ser/Ser, Ser/Cys, and Cys/Cys genotypes. The stratification analysis was conducted by ethnicity (categorized as Asians, Caucasians, Africans, Latinos, or Hawaiian), control source (hospital-based and population-based), smoking status (never smokers and ever smokers), and smoking intensity.

Statistical methods

We evaluated the strength of association between hOGG1 Ser326Cys polymorphism and lung cancer susceptibility by crude ORs together with their corresponding 95 % CIs. We also calculated the pooled ORs and 95 % CIs for Ser326Cys by homozygous model (Cys/Cys vs. Ser/Ser), recessive model (Cys/Cys vs. (Ser/Cys+Ser/Ser)), and dominant model ((Ser/Cys+Cys/Cys) vs. Ser/Ser) as well as allele comparison (Cys vs. Ser). Chi square-based Q test was performed to assess the between-study heterogeneity. Additionally, the heterogeneity was also quantified using the I 2 values, and I 2 lies between 0 and 100 % with higher values indicating a greater degree of heterogeneity [51]. We used the fixed effects model (Mantel–Haenszel method) when the P value of the heterogeneity test was ≥0.10 [52]; otherwise, we used the random effects model (DerSimonian and Laird method) which tends to provide wider 95 % CIs because the constituent studies differ among themselves [53]. The potential publication bias was verified by standard error of log (OR) for each study plotted against its log (OR). Funnel plot asymmetry was assessed by Egger's linear regression test [54]. Sensitivity analyses were performed by excluding a single study individually each time and recalculating the ORs and the 95 % CIs.

We performed the false-positive report probability (FPRP) [55, 56] analysis for all the significant findings, and set 0.2 as FPRP threshold and assigned a prior probability of 0.1 to detect an OR of 1.50 (for risk effects) for associations with genotypes under investigation. Only the results with FPRP values less than 0.2 were considered as significant findings.

We performed all the analyses using the STATA software (version 11.0; Stata Corporation, College Station, TX, USA) and SAS software (version 9.1; SAS Institute, Cary, NC, USA). All P values were two sided, and P values less than 0.05 were considered statistically significant.

Results

Characteristics of the study

A total of 86 publications that examined the association between hOGG1 Ser326Cys and lung cancer risk were identified from MEDLINE and EMBASE, and additional 14 publications were identified from CBM using the search terms described previously (Fig. 1). Only 37 case–control studies which met the crude inclusion criteria were chosen for further analysis [1450]. Among them, four studies [43, 45, 47, 48] were excluded for data overlapping with another one [24] which covered the largest sample size, one [42] was covered by a later investigation [23], and another one [46] was covered by a former investigation with larger sample size [21]; three studies [44, 49, 50] were also excluded in the final analysis because they used the same samples as others [19, 22, 40]. The distribution of genotypes for the hOGG1 Ser326Cys polymorphism in the controls of all studies was in agreement with Hardy–Weinberg equilibrium (HWE), except for three studies [22, 27, 35]. Thus, the other polymorphisms in the controls were in agreement with HWE [22, 27], so these two were not excluded to enlarge the sample size; no further evidence from other polymorphisms was presented in the study of Liu et al. [35], so we performed the analysis with and without this one and found no substantial difference between the two analyses (data not shown), so this study was also included in the final analysis. The studies carried out by Le et al. [18] and Chang et al. [33] were extracted separately according to the ethnic groups. Overall, 28 publications with 31 investigations were included in the final meta-analysis (Table 1), consisting of 10,220 cases and 12,284 controls for the hOGG1 Ser326Cys polymorphism. Of the 31 investigations, sample sizes of cases ranged from 45 to 2,155 and from 42 to 2,163 for controls. There were 18 studies on Asians, 10 studies on Caucasians, and 1 study on African–Americans, Latinos, and Hawaiian, respectively. Of all the studies, 11 were population-based, 19 were hospital-based, and one was not available. As to the smoking status, we observed that there were 15 studies which provided the genotype frequency information for ever smokers and 12 for never smokers. Controls were mainly matched for sex and age. Almost all of the cases were histologically or cytologically confirmed. The detailed genotype frequency data for all the studies was shown in Supplemental Table 1; for the smoking status, in Supplemental Table 2; and for the smoking intensity, in Supplemental Table 3.

Fig. 1
figure 1

Flow chart for the process of selecting the final 31 investigations

Table 1 Characteristics of studies included in the meta-analysis

Meta-analysis results

When all the eligible studies for the Ser326Cys polymorphism and lung cancer risk were pooled together, we found that the hOGG1 Cys carriers were significantly associated with the overall lung cancer risk (homozygous model: OR = 1.24, 95 % CI = 1.05–1.47, P = 0.013; recessive model: OR = 1.22, 95 % CI = 1.05–1.41, P = 0.008; and allele comparing: OR = 1.11, 95 % CI = 1.02–1.21, P = 0.022), as shown in Table 2 and Fig. 2. In the stratified analysis by ethnicity, the comparisons showed that a statistically significant association was found for Asians (homozygous model: OR = 1.25, 95 % CI = 1.03–1.52, P = 0.022; recessive model: OR = 1.22, 95 % CI = 1.04–1.43, P = 0.017; and allele comparing: OR = 1.12, 95 % CI = 1.01–1.25, P = 0.035) and hospital-based studies (homozygous model: OR = 1.24, 95 % CI = 1.00–1.52, P = 0.047 and recessive model: OR = 1.20, 95 % CI = 1.01–1.42, P = 0.034). As to the smoking status and intensity, a statistically significant association was found for never smokers (homozygous model: OR = 1.27, 95 % CI = 1.02–1.58, P = 0.031; dominant model: OR = 1.18, 95 % CI = 1.01–1.39, P = 0.042; and allele comparing: OR = 1.13, 95 % CI = 1.02–1.25, P = 0.022), and more-cigarette takers (homozygous model: OR = 6.97, 95 % CI = 1.51–32.17, P = 0.013; recessive model: OR = 5.66, 95 % CI = 1.24–25.83, P = 0.025; dominant model: OR = 2.22, 95 % CI = 1.31–3.76, P = 0.003; and allele comparing: OR = 2.20, 95 % CI = 1.40–3.46, P = 0.001 for more than 34 pack-years, and homozygous model: OR = 2.70, 95 % CI = 1.40–5.23, P = 0.003 and recessive model: OR = 1.74, 95 % CI = 1.25–2.41, P = 0.001 for more than 40 pack-years).

Table 2 Meta-analysis of the association between hOGG1 Ser326Cys polymorphism and lung cancer risk
Fig. 2
figure 2

Forest plot of overall lung cancer associated with the hOGG1 Ser326Cys polymorphism in a recessive model (Cys/Cys vs. Ser/Cys+Ser/Ser) by the random effects for each of the 31 investigations. For each study, the estimates of OR and its 95 % CI are plotted with a box and a horizontal line. Diamond indicates pooled ORs and its 95 % CIs.

The FPRP values for significant findings from the Ser326Cys polymorphism and lung cancer risk at different prior probability levels are shown in Table 3. For a prior probability of 0.1, with a statistical power of 1.000, the FPRP values were 0.103, 0.065, and 0.168 for the homozygous model, recessive model, and allele comparing, respectively, with an increased risk of lung cancer for all individuals. Positive associations with the Cys/Cys genotype were observed in the subgroups of Asians at homozygous and recessive models, ≥40 pack-years at recessive and allele comparing, and never smokers at allele comparing were also considered as noteworthy findings, for their probability to be a false-positive finding was less than 20 %. In contrast, greater FPRP values were observed for other noteworthy findings between hOGG1 variants, and lung cancer risk may be ascribed to the reduced sample size in some subgroups, which need further validation in larger investigations.

Table 3 False-positive report probability values for associations between lung cancer risk and the frequency of genotypes of hOGG1 gene

Heterogeneity and sensitivity analyses

We used the random effects model to generated wider CIs for all genetics models, for substantial heterogeneities were observed among all the studies for hOGG1 Ser326Cys polymorphism and overall lung cancer risk (homozygous model: P < 0.001, I 2 = 60.4 %; recessive model: P < 0.001, I 2 = 62.2 %; dominant model: P = 0.004, I 2 = 58.1 %; and allele comparing: P < 0.001, I 2 = 72.6 %). Therefore, we found that no single study can alter the pooled ORs qualitatively by leave-one-out sensitivity analysis (data not shown).

Publication bias

The shapes of the funnel plots seemed symmetrical, indicating that there was no obvious publication bias for the association between hOGG1 Ser326Cys polymorphism and lung cancer risk (Fig. 3). The Egger's test further provided statistical evidence that no publication bias existed in this meta-analysis (homozygous model: P = 0.175; recessive model: P = 0.114; dominant model: P = 0.168; and allele comparing: P = 0.145).

Fig. 3
figure 3

Begg's funnel plot to detect publication bias in a recessive model. Each point represents an individual study for the indicated association

Discussion

In this updated meta-analysis including 10,220 lung cancer cases and 12,284 controls from a total of 31 investigations for the hOGG1 Ser326Cys polymorphism, we found that the Ser326Cys polymorphism was significantly associated with the overall lung cancer risk in the homozygous model, recessive model, as well as allele comparing. Furthermore, the stratification analysis showed that the risk was more prominent in studies of Asian subjects, hospital-based controls, never smokers, and more-cigarette takers. We also performed FPRP analysis and calculated the statistical power for all the significant findings. To our knowledge, there is no quite similar meta-analysis that has comprehensively evaluated the association between hOGG1 Ser326Cys polymorphism and lung cancer risk especially for smoking status and smoking intensity as we did here.

There were several meta-analyses which investigated the association between hOGG1 Ser326Cys polymorphism and lung cancer risk. In a previous meta-analysis by Li et al. [57] including 6,375 cases and 6,406 controls from 17 studies, no significant increased lung cancer risk was found for the hOGG1 Ser326Cys polymorphism in homozygous and recessive models; however, significant increased risk was found for Asians in a dominant model and for the population-based studies as well as never smokers. In another meta-analysis, Guan et al. [58] included a total of 7,592 patients and 8,129 controls from 18 studies and found that the hOGG1 Ser326Cys polymorphism was associated with the risk of lung cancer. In the subgroup analyses, this risk was more prone in Asians, squamous carcinoma and adenocarcinoma patients, and heavy smokers. In the third meta-analysis written in Chinese including 8,575 cases and 9,484 controls, 326Cys genotype can significantly increase the lung cancer risk especially for Asians and hospital-based studies [59]. In the fourth meta-analysis, Duan et al. [60] did not find any association between hOGG1 Ser326Cys polymorphism and the overall lung cancer risk; however, significant increased risk was found for adenocarcinoma. In the meta-analysis by Wei et al. [61], including 9,203 cases and 10,994 controls from 25 studies, significant risk was observed in homozygous and recessive models, and in the stratified analysis, significant increased risk was found for the population-based studies and non-smokers. Zhong et al. [62] performed an updated meta-analysis including 8,739 cases and 10,385 controls from 20 studies and found a significant increased risk for the Caucasians. Some of the findings were not repeated in the current meta-analysis, such as the Caucasians had a significant increased risk [62]; with two more additional studies, this risk disappeared. We noticed that some of the previous meta-analyses found significant increased risk for the population-based studies [57, 61], hospital-based studies [39], both hospital-based and population-based studies [58, 62], as well as no associations [60]. In the current one, we found significant increased risk for the hospital-based studies; thus, after we performed the FPRP analysis, this risk disappeared. Some of the discoveries may be false-positive findings due to the limited sample size in each stratum, so it is important to perform FPRP analysis to avoid them, especially when the sample size is not large enough. Though some of the previous meta-analysis paid attention to the smoking status [58, 61, 62], nearly none of them paid attention to smoking intensity, allele comparing as well as the statistical power and the opportunity to be false-positive findings.

BER pathway is initiated by the recognition and excision of small lesions such as oxidized or reduced bases, fragmented or nonbulky adducts, or byproducts of methylating agents [63]. The role of BER in carcinogenesis has been evaluated extensively, and polymorphisms of BER genes may be associated with cancer risk especially for those caused by tobacco use [33]. The hOGG1 gene is one of the major genes involved in the BER pathway [11], and polymorphisms of the hOGG1 gene may lead to the defective repair of 8-hydroxyguanine [14]. One of the most commonly investigated polymorphisms is Ser326Cys; thus, the conclusions from numerous studies were still inconsistent, for example, the frequency of the hOGG1 326Cys/Cys genotype was found to be significantly higher in the lung cancer patients when compared with controls [18, 20, 36, 37, 41] and significantly lower in some studies [27, 35]; however, no association was found by other studies. Overall, we found that the hOGG1 Cys carriers had significantly increased lung cancer risk when compared with the Ser carriers, which was also confirmed in the subgroup analysis for Asians, never smokers, and more-cigarette takers. There are several possible reasons for our findings. The Asians who carry the hOGG1 326Cys allele had an obviously higher risk than Caucasians which may be ascribed to the ethnicity difference. The never smokers with the hOGG1 326Cys were more prone to get lung cancer which may be ascribed to their genetic susceptibility even being exposed to a low level of tobacco smoke, as well as more susceptible to second-hand smoke. More-cigarette takers with the 326Cys had obviously higher risk than the fewer takers which may be due to the fact that tobacco smoke contains lots of carcinogens that can induce various kinds of DNA damage and may accumulate when more cigarettes are consumed.

Of note, several limitations in the current meta-analysis need to be addressed. Firstly, some studies provided insufficient genotype as well as different studies provided different definitions of smoking status and smoking intensity. Secondly, we found that most studies included in this meta-analysis had relatively small sample size for cases (<500) except for five studies [21, 23, 30, 32, 39], which were insufficient for genetic association study and may attenuate the statistical power. Thirdly, lacking detailed information, this meta-analysis was based on unadjusted estimates of ORs, while a more precise association should be performed if individual's age, sex, and occupation exposure data were available. Finally, some of the findings in stratification analysis may have been overestimated for there was only one trail available.

Despite these limitations, our meta-analysis strongly suggests that hOGG1 Ser326Cys polymorphism may be associated with lung cancer, especially for Asians, never smokers, and more-cigarette takers. However, well-designed prospective studies with larger sample sizes, more information for smoking status, as well as smoking intensity are required to validate our findings.