Introduction

Stroke is one of the most frequent contributors to disability and mortality in the world, including China. In China, according to a cross-sectional survey of stroke burden in 155 urban and rural centers in 31 provinces, the crude stroke incidence rate was 345.1/100,000 person-years [1]. Ischemic stroke (IS) is the most common type of stroke, which makes up approximately 80% of all stroke cases. The occurrence and development of most IS is the result of the interaction between genetic and environmental risk factors.

Phosphodiesterase 4D (PDE4D), which can specifically degrade cyclic adenosine monophosphate (cAMP), has been implicated in the pathogeny of IS. It is expressed in many cells such as immune cells (T lymphocytes, macrophages, and monocytes), endothelial cells, smooth muscle cells, and atrial myocytes [2,3,4]. A decrease in the cAMP concentration promotes vascular smooth muscle proliferation [5]. Conversely, an increase in cAMP attenuates the formation of neointima and suppresses vascular smooth muscle proliferation after arterial injury [6]. PDE4, including PDE4D, are the major enzymes of cAMP signal transduction pathway in inflammatory cells. A decline in cAMP leads to inflammation [7]. Inflammation may contribute to atrial fibrillation and atherosclerosis, both of which are risk factors for IS.

In 2002 and 2003, the deCODE Genetics group conducted linkage and association analyses in the Icelandic population and identified PDE4D as a susceptible gene of IS [8, 9]. Since then, whether the genetic variants of this gene are related to stroke has become a research hotspot. In Chinese populations, single nucleotide polymorphism (SNP) rs918592, lying in an intron near the 5’ end of PDE4D, has been investigated whether it was associated with IS risk. But the results of different studies were discordant. For instance, Tang (2007) [10] reported a negative result, while Xu et al. (2008) [11] and He et al. (2012) [12] reported positive results. Therefore, in order to further elucidate the correlation between SNP rs918592 and the risk of IS in Chinese populations, we carried out the present meta-analysis. No studies have explored the functions of SNP rs918592 to date, so we preliminarily analyzed it using bioinformatics approaches.

Methods

The present study was conducted following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 statement [13]. The PRISMA checklist can be found in Additional file 1.

Search strategy

We searched the PubMed, Embase, ISI Web of Science, Weipu, China National Knowledge Infrastructure (CNKI), Chinese Biomedical (CBM) and Wanfang databases from inception to November 30, 2023, using the following items: “phosphodiesterase 4D”, “PDE4D”, “rs918592”, “polymorphism”, “stroke”, “cerebral infarction”, “ischemic stroke”, “cerebrovascular disease”, and their synonyms. The references of the identified articles, as well as relevant reviews and meta-analyses, were also manually scanned for other potentially eligible studies.

Study selection

The selection of studies should be based on the following criteria: (a) case-control, nested case-control, or cohort studies; (b) assessment of the correlation between SNP rs918592 and IS risk in Chinese populations; and (c) using validated techniques to detect SNP rs918592. Articles were excluded if they (a) had no primary findings, (b) were reviews, editorials, case reports, case-only studies, or family-based studies, or (c) were duplicate studies.

Data extraction

The following information was collected from each qualified study: first author’s name, year of publication, ethnicity of participants, number of cases and controls, mean age of cases and controls, methods for detecting SNP rs918592, matching variables of controls, IS subtypes (if mentioned in the article), as well as number of alleles and genotypes.

Literature screening, data collection, and assessment of study quality were conducted independently by two researchers (X.Y. and G. Z.). The divergences that occurred through the process were settled by discussing with the corresponding author (R. L.).

Statistical analyses

Hardy-Weinberg equilibrium (HWE) was evaluated by a Chi-square test in the controls. Statistical analyses were performed using STATA 11.0 software (Stata Corporation, College Station, TX). The strength of association between SNP rs918592 and IS risk was calculated using pooled odds ratios (ORs) with 95% confidence intervals (CIs) for two comparisons between different genotypes (AG vs. AA, and GG vs. AA), as well as under dominant (GG + AG vs. AA), additive (G vs. A), and recessive (GG vs. AG + AA) genetic models.

The heterogeneity between studies was assessed with chi-square-based Q-test and I2 test. When P value was above 0.10, a fixed-effects model using the Mantel-Haenszel method was selected for data analysis; otherwise, a random-effects model using the DerSimonian-Laird method was conducted. I2 metric was used to show the degree of heterogeneity, where 0–25%, 25–50%, 50–75% and 75–100% meant no, moderate, large, and extreme heterogeneity, respectively. The underlying factors causing heterogeneity were explored by meta-regression analysis.

Sensitivity analyses by sequentially removing each study at a time were performed to test the stability of the results. Publication bias was estimated by Begg’s test with funnel plot and Egger’s linear regression test with publication bias plot.

In Silico bioinformatics analysis

To explore the functions of SNP rs918592, two online prediction websites were used for bioinformatics analysis: HaploReg (http://pubs.broadinstitute.org/mammals/haploreg/haploreg.php) and RegulomeDB (http://regulomedb.org/). HaploReg was applied to discover noncoding genomic annotations for variants and determine their underlying causal correlations with disease pathogenesis. RegulomeDB was utilized for the annotation of variants with regulatory elements by giving ranks. The lower the rank, the more likely it is to have a regulatory function.

Results

Eligible studies

After literature search and further screening, 12 articles in total met the inclusive criteria (Fig. 1). Four were subsequently excluded by careful reading of the full text. Xu’s (2008) [14], Bai’s (2011) [15] and Sun’s (2013) [16] studies overlapped with Xu’s (2008) [11], He’s (2012) [12] and Ma’s (2014) [17] studies, respectively, and then were excluded. In Zhang’s (2019) study, SNP87 was incorrectly labeled as rs918592 and actually rs2910829 [18]. Two articles investigated the association in independent populations, so each article was considered as two independent studies [17, 19]. Finally, 10 studies (in 8 articles involving 2,348 IS cases and 2,289 controls) were enrolled in the meta-analysis of the correlation between SNP rs918592 and IS risk (Table 1 and Additional file 2: Table S1) [10,11,12, 17, 19,20,21,22]. Each study design was case-control. The genotypic distribution of one study [22] deviated from HWE expectation in controls (Additional file 2: Table S2).

Fig. 1
figure 1

Flow diagram of the literature selection process

Table 1 Main characteristics of studies included in the meta-analysis of the relationship between SNP rs918592 and the risk of ischemic stroke

SNP rs918592 and ischemic Stroke risk

Significant association of SNP rs918592 with IS risk was observed in two comparisons and all the three genetic models (Table 2; Figs. 2, 3, 4, 5 and 6). The G allele was related to reduced risk of IS (G vs. A: OR 0.83, 95% CI 0.74–0.95, P = 0.005). Among the three genetic models, the dominant model had the smallest OR (GG + AG vs. AA: 0.74, 95% CI 0.61–0.90) and P value (0.003), and it might be the best-fitting model. Moderate or large heterogeneity was identified across all studies in the comparison between GG and AA, as well as under the dominant and additive models. Ethnicity, genotyping method, mean age, sample size, and HWE were not the main factors causing heterogeneity, while Song’s (2015) study [21] might be one of the factors causing heterogeneity. After excluding Song’s (2015) study, there was no heterogeneity between studies under the dominant and additive models (Table 2 and Additional file 2: Figs. S1-S5). Notably, all of the patients included in Song’s (2015) study had hypertension, which was markedly different from other studies. All the pooled OR values were not substantially altered after excluding Song’s (2015) study.

Fig. 2
figure 2

Forest plot for the relationship between the risk of ischemic stroke and SNP rs918592 (AG vs AA) (fixed effects)

Fig. 3
figure 3

Forest plot for the relationship between the risk of ischemic stroke and SNP rs918592 (GG vs AA) (random effects)

Fig. 4
figure 4

Forest plot for the relationship between the risk of ischemic stroke and SNP rs918592 under the dominant model (AG + GG vs AA) (random effects)

Fig. 5
figure 5

Forest plot for the relationship between the risk of ischemic stroke and SNP rs918592 under the additive model (G vs A) (random effects)

Fig. 6
figure 6

Forest plot for the relationship between the risk of ischemic stroke and SNP rs918592 under the recessive model (GG vs AG + AA) (fixed effects)

Table 2 Meta-analysis of the relationship between SNP rs918592 and the risk of ischemic stroke

Sensitivity analyses and publication bias

When any single study was removed, the pooled OR value was not significantly affected (Additional file 2: Figs. S6-S10), suggesting good stability of the results in the present study. Moreover, the results of Begg’s and Egger’s linear regression tests displayed no significant publication bias (Figs. 7, 8, 9, 10 and 11).

Fig. 7
figure 7

Begg’s funnel and Egger’s publication bias plots for the relationship between the risk of ischemic stroke and SNP rs918592 (AG vs AA) (Begg’s P = 0.371, Egger’s P = 0.283)

Fig. 8
figure 8

Begg’s funnel and Egger’s publication bias plots for the relationship between the risk of ischemic stroke and SNP rs918592 (GG vs AA) (Begg’s P = 0.592, Egger’s P = 0.418)

Fig. 9
figure 9

Begg’s funnel and Egger’s publication bias plots for the relationship between the risk of ischemic stroke and SNP rs918592 under the dominant model (AG + GG vs AA) (Begg’s P = 0.474, Egger’s P = 0.295)

Fig. 10
figure 10

Begg’s funnel and Egger’s publication bias plots for the relationship between the risk of ischemic stroke and SNP rs918592 under the additive model (G vs A) (Begg’s P = 1.000, Egger’s P = 0.537)

Fig. 11
figure 11

Begg’s funnel and Egger’s publication bias plots for the relationship between the risk of ischemic stroke and SNP rs918592 under the recessive model (GG vs AG + AA) (Begg’s P = 1.000, Egger’s P = 0.687)

Functional annotation

We analyzed the functional roles of SNP rs918592 and variants in strong linkage disequilibrium (LD) with it (defined as r2 ≥ 0.8 with rs918592 in the East Asian (CHB, JPT, and CHS) population) using HaploReg v4.1 (Additional file 2: Table S2). The results showed that SNP rs918592 and the correlated 20 variants mapped to PDE4D intronic regions. All of them might affect transcriptional regulatory element activity and be identified as expression quantitative trait loci (eQTL) for prostate androgen-regulated transcript 1 (PART1), whose 5’ end overlaps with the 5’ end of PDE4D and whose transcript is a long non-coding RNA, in thyroid tissue. Among the correlated variants, two might be located within the histone modification regions of enhancers and one in promoters; two were in DNase I-hypersensitive regions; two had the alteration in transcription factor (TF) binding; one (rs918590) was related to ubiquitin conjugating enzyme E2 E1 (UBE2E1) expression in peripheral blood monocytes; and two were located in evolutionarily conserved regions predicted to be functionally constrained according to SiPhy or GERP analysis. As a whole, SNP rs34168777 (r2 = 0.99) (with evidence of conserved region, enhancer histone mark, DNase I-hypersensitive region, TF-binding, any TF motif, and eQTL hit) might be worthiest of further functional study. The results of RegulomeDB v2.1 also showed that rs34168777 had a rank of 1b (eQTL + TF binding + any motif + DNase Footprint + DNase peak), which was the best ranking among the 21 SNPs (Additional file 2: Table S3). The rank of SNP rs918592 was 1f.

Discussion

The present meta-analysis revealed that SNP rs918592 was related to the risk of IS in Chinese populations. Although the sample size of the present study was not large, the results were stable in various comparisons and models. So far as we know, this study is the first meta-analysis of the relationship between SNP rs918592 and IS risk in Chinese populations.

In fact, we found only one (Song’s (2006)) study [23] on the correlation between SNP rs918592 and IS risk outside the Chinese populations during literature search. This study focused on early-onset IS in a female population, which showed the A allele of SNP rs918592 was the risk allele in African-Americans and Caucasians, similar to the present study. They also pointed out linkage disequilibrium existed between SNPs rs918592 and rs152312 (SNP 41, related to IS in Icelanders) among Caucasians (LD = 0.66) and African-Americans (LD = 1.0). But the deCODE Genetics group incorrectly labeled SNP41 as rs152312 in 2003 [9], which was corrected to rs12153798 in 2005. That is, actually, rs12153798, rather than rs152312, was associated with IS risk in the Icelandic population. Since both Song’s (2006) study and our meta-analysis suggest that SNP rs918592 is associated with IS risk, we suggest that future studies should be extended not only in the Chinese populations but also in others.

The deCODE Genetics group revealed the association of PDE4D variants with IS risk, particularly strong with the risk of cardioembolic (CE) and large artery atherosclerosis (LAA) stroke [9]. We tried to perform the analysis in IS subtypes. There were only two studies for LAA stroke (402 cases and 420 controls) and two studies for small vessel stroke (295 cases and 420 controls) (Additional file 2: Tables S4, S5). Therefore, there were not enough data for meta-analyses of IS subtypes. It is unclear which subtype of IS SNP rs918592 is mainly associated with. It is a shortcoming of this study.

SNP rs918592 is an intron variant of PDE4D and may be a causal variant or just a marker in LD with the causal variant. It is necessary to further investigate the functions of SNP rs918592 and variants in LD with it to find the true pathogenic variant. The results of functional prediction showed that SNP rs918592 and its linked 20 variants might have regulatory functions and SNP rs34168777 was the most likely causal variant among them. We only predicted their possible functions, but did not carry out experiments to validate them, which is another shortcoming of this study.

Conclusion

This study suggests that SNP rs918592 in PDE4D may contribute to IS risk in Chinese populations. It provides a better answer for the association of PDE4D SNP rs918592 with IS risk in Chinese populations. Larger and more refined studies will be conducive to elucidate this effect on IS, especially on CE and LAA stroke. Further functional studies are also required to identify the causal variant(s).