Introduction

Patatin-like phospholipase domain-containing 2 (PNPLA2), also called adipose triglyceride lipase (ATGL) [1], the ζ-isoform of a calcium-independent phospholipase A2 (iPLA2ζ) [2] and desnutrin [3], was identified simultaneously by several laboratories in 2004 as a key enzyme for adipose lipid mobilization. It catalyzes the first step in triglyceride hydrolysis in mammalian adipocyte and nonadipocyte cells [4, 5]. Porcine PNPLA2 gene is located in a region on porcine chromosome 2p17 (SSC2pter) [6], which harbours a quantitative trait locus for the trait of backfat thickness [7].

Previous studies have discovered functional mutations in PNPLA2 gene. In human, mutations in PNPLA2 gene were associated with degradation of lipid droplets and disorders related to fat deposition such as type II diabetes [8, 9]. In obesity model pig, our previous research had identified a missense mutation of G/A392 (R/H131) in porcine PNPLA2 gene and found the polymorphism was significantly associated with several economic traits detected, including subcutaneous fat thickness, viscera adipose tissue weight, lean meat percentage (LMP), loin eye traits and even rib numbers, where allele A was associated with decrease of fat deposition [10]. As expected, porcine PNPLA2 mRNA was expressed abundantly in backfat, mildly in heart, muscle and small intestine, and undetectable in lung, stomach, liver, spleen, kidney and ovary.

A variety of transcription factor binding sites were predicted in the upstream region of porcine PNPLA2 gene, such as CCAAT/enhancer binding protein (C/EBP), cyclic AMP response element-binding protein (CREBP1) and myogenic differentiation 1 (MyoD) binding sites [10]. Variations in the promoter region may affect the allele-specific gene expression patterns by altering the binding sites for original transcription factor [11]. We therefore selected the porcine PNPLA2 gene as a potential candidate gene for fat deposition and carcass traits, identified the mutations in the promoter region, determined their roles on fat deposition related traits in different pig breed populations by haplotype construction and association analysis, and evaluated their effects on transcriptional activity by dual-luciferase reporter assay.

Materials and methods

Animals and traits

To investigate the allele frequency, animals from five different pig breed populations were chosen, including 19 Chinese Shengxian spot (SXS) pigs, 24 Chinese Bamei (BM) pigs, 161 Duroc pigs, 63 Landrace pigs and 51 Yorkshire pigs. The association analyses were carried out in Duroc, Landrace and Yorkshire pigs, which were all backup boars from different sires and litters, and kept at eight Pig Performance Testing Stations in Experimental Farms of Zhejiang Academy of Agricultural Sciences under the same housing and feeding conditions from 30 to 100 kg body weight. The parameters measured by Feed Intake Record Equipment (FIRE, American Osborn) on individual pigs during the test were FCR and test daily gain (TDG). Off-test backfat thicknesses of Duroc, Landrace and Yorkshire pigs were measured between 3rd and 4th last lumbar vertebrae and 7 cm from the midline (BF1), and between 3rd and 4th last rib and 7 cm from the midline (BF2) with an A-mode ultrasound device (SFK-Technology, Piglog 105, Denmark). Muscle depths were also measured between 3rd and 4th last rib and 7 cm from the midline (MD) with the same device. Lean meat percentage is acquired by determining the percentage weight of the whole carcass, however, in live animals, the LMP was estimated by Piglog105 automatically according to the instruction, on the basis of different fat and meat depths (BF1, BF2, and MD).

PCR amplification and polymorphism screening of upstream sequence of porcine PNPLA2 gene

Genomic DNA was extracted from whole blood samples or ear tissues according to standard phenol/chloroform procedure. According to the working draft sequence of Sus scrofa chromosome 2 clone CH242-144M19 (GenBank: CU928976.18), oligonucleotide primers P1F and P1R were designed to amplify and verify the upstream sequence of porcine PNPLA2 gene (Table 1).

Table 1 List of primers for amplification and plasmid construction

PCR reaction systems were prepared in a final volume of 50 μl, containing 25 μl 2× PrimeSTAR™ GC buffer (Takara), 0.5 μl PrimeSTAR™ HS DNA polymerase (2.5 U/μl, Takara), 4 μl dNTPs (2.5 mM), 1 μl each primer (10 μM) and 100 ng genomic DNA. The touchdown PCR program was set as 10 s at 98 °C, followed by 13 cycles of 10 s at 98 °C, 10 s at 60–48 °C (annealing temperature was decreased 1 °C after each cycle), and 3.5 min at 72 °C. This touchdown step was followed by 28 cycles of 10 s at 98 °C, 10 s at 50 °C, and 3.5 min at 72 °C. After being checked by 1.5 % agarose gel electrophoresis, the PCR products were purified using QIAquick Gel Extraction Kit (Qiagen), and subjected to direct sequencing by an ABI 3100 Avant DNA sequencer (ABI).

In order to explore the genetic variations of the obtained 5′ flanking sequence of porcine PNPLA2 gene in the tested groups, 20 individuals (five individuals per breed) were sequenced and multiple alignments were analyzed using ClustalW (http://www.ebi.ac.uk/clustalw/index.html). The result indicated four single nucleotide polymorphisms (SNPs) in the fragment.

Bioinformatics analysis of the four SNPs

The obtained sequence covering the four SNPs was subjected to transcription factor prediction. The transcriptional binding sites were predicted using TESS (http://www.gene-regulation.com/ pub/databases.html), SignalScan (http://www-bimas.cit.nih.gov/molbio/signal/) and MatInspector (Genomatix Software GmbH).

Polymorphism genotyping

In order to perform genotyping, primers P2F and P2R (Table 1) were designed to amplify the fragment covering the polymorphic sites in 5′ flanking sequence of porcine PNPLA2 gene. Each reaction mixture (50 μl) contained 1.5 μl DNA, 10 μM primers and 25 μl GoTaq Green Master Mix (Promega). The PCR program was set as 2 min at 95 °C, 33 cycles of 30 s at 95 °C, 10 s at 58.5 °C, 50 s at 72°C, and then followed by 5 min at 72 °C. PCR products were purified and subjected to direct sequencing by an ABI 3100 Avant DNA sequencer (ABI). Genotypes were identified by visual inspection of both chromatograms with BioEdit [12]. The genotyping analyses of the four SNPs were conducted using all the 318 pigs in the study.

Haplotype construction and statistical analysis

Pairwise tests for linkage disequilibrium (LD) and haplotype construction were performed using the SHEsis software platform (http://analysis2.bio-x.cn/myAnalysis.php) [13]. Normalized LD coefficient (D′ value) and correlation coefficient between a pair of the alleles (r 2 value) were estimated. Association of diplotypes with economic traits were tested using the MIXED procedure of SAS (SAS Institute, Cary, NC), with a model including genotype as fixed effect, sire and litter as random effects, and off-test date or weight as a covariate. Significant level was set to be 0.05.

Plasmid construction

To construct the plasmids containing promoter region of porcine PNPLA2 gene, the 5′ end of primers were modified (highlighted in bold) to contain SacI and HindIII restriction sites in the forward primer (P3F, P4F) and reverse primer (P3R), respectively (Table 1). The 1143 bp length of promoter region (−848 to +295 nt), which harbouring the different haplotypes (H1, H2, H3), was amplified by primer pairs P3F/P3R, and 920 bp length of promoter region (−615 to +295 nt), which lacking the haplotypes (Del), was amplified by primer pairs P4F/P3R. PCR was performed under the same condition described above. The amplified fragments were purified and quantified, and then double-digested with SacI and HindIII and ligated into the pGL3-basic vector (Promega). All the recombinant plasmids were identified by DNA sequencing,and designated as pGL3-basic-H1, pGL3-basic-H2, pGL3-basic-H3 and pGL3-basic-Del, respectively (Fig. 1). A pGL3-promoter vector (Promega) driven by SV40 promoter served as a positive control, and a promoter-less pGL3-basic vector as a background control.

Fig. 1
figure 1

Different haplotype constructs of porcine PNPLA2 gene (left) and dual-luciferase reporter assays in HEK293 cell. Left: diagram of four promoter constructs showing three different haplotypes and haplotype deficiency as control. The number indicated the relative nucleotide position of the four SNPs from the transcriptional start site (TSS; position +1). Right: luciferase activity was presented as the value relative to that of pRL-TK. Bars with different lowercase letters (a, b, c, d and e) indicate significantly promoter activities (P < 0.05)

Transfection and dual-luciferase reporter assay

Human embryonic kidney (HEK) 293 cells were cultured in Dulbecco’s modified Eagle’s medium (HyClone) with 10 % fetal bovine serum (Biowest) and 1 % penicillin streptomycin (Invitrogen) under 5 % CO2 at 37 °C. The recombinant plasmids containing different DNA sequences of porcine PNPLA2 promoter were transfected into HEK293 cells using lipofectamine 2000 (Invitrogen). There were six groups in the experiment: positive control group (transfected by pGL3-promoter plasmid), negative control (transfected by pGL3-basic plasmid), control group (transfected by pGL3-basic-Del plasmid) and experimental groups (transfected by pGL3-basic-H1, pGL3-basic-H2 and pGL3-basic-H3 plasmid respectively). The plasmid pRL-TK (Promega) was co-transfected into all cells for normalization. Luciferase activity was measured using the Dual-Luciferase Reporter Assay System (Promega) and GloMax-Multi Detection System (Promega). All experiments were performed in three independent replication and each reaction was carried out in triplicate. One-way ANOVA analysis was conducted to compare the differences among different groups.

Results

Identification and in silico analysis of four SNPs

A length of 3107 bp upstream sequence of porcine PNPLA2 gene was obtained with primer pairs P1F/P1R and deposited in GenBank (JN615017). Four new SNPs were identified in this region, including A>G at −697 nt (SNP1), T>C at −690 nt (SNP2), G>C at −681 nt (SNP3)and C>A at −632 nt (SNP4), relative to the predicted transcription start site (+1). These were all submitted to the dbSNP (NCBI) and received SNP IDs as follows: ss#1985401161: NC_010444.3:g.161969 T>C (SNP1), ss#1985401160: NC_010444.3:g.161962 A>G (SNP2), ss#1985401159: NC_010444.3:g.161953 C>G (SNP3) and ss#1985401158: NC_010444.3:g.161904 G>T(SNP4).

In-silico analysis of the SNP loci using transcription factor binding site prediction tool TESS by searching TRANSFAC Matrices is presented in Table 2. Both C allele in SNP1 and T allele in SNP4 introduced a binding site for glucocorticoid receptor (GR), while G allele in SNP4 introduced a binding site for specificity protein 1 (Sp1). An allele in SNP2 introduced binding sites for CAC-binding protein and activating protein 2 (AP-2), while G allele introduced the binding site for estrogen receptor alpha (ERα). No different binding sites predicted for the different allele in SNP3.

Table 2 Prediction of transcription factors binding with porcine PNPLA2 gene

Genotyping of four SNPs in five pig populations

By direct sequencing of a 608 bp fragment amplified with primer pairs P2F/P2R (Table 1), the four SNPs were detected in 318 animals of five different pig breeds (Table 3). The allele distribution revealed SNP1 and SNP4 were occurred in a chain, while SNP2 and SNP3 were chain mutations. All breeds were polymorphic at the SNP2 (SNP3) locus except Landrace breed, and all four SNPs were in agreement with Hardy–Weinberg equilibrium in Duroc breed.

Table 3 Allele frequencies of four SNPs in porcine PNPLA2 gene in five populations

Linkage disequilibrium and haplotype construction

The LD between the four SNPs in all pig populations were estimated, which indicated that SNP1 and SNP4, SNP2 and SNP3 were in complete LD (D′ = 1 and r 2 = 1) in all pig breeds (Table 4).

Table 4 Estimated values of linkage equilibrium analysis for four SNPs in porcine PNPLA2 gene in studied populations

Haplotype-based analysis can provide higher power, precision and quality to assess the complex relationship between genetic variation and phenotypes compared to single SNP analysis [1417]. Through estimating SNP haplotype frequencies, three haplotypes were identified, including H1(CGGT), H2(TACG) and H3(CACT) (Table 5). In the five pig populations, three haplotypes had a moderate distribution in Yorkshire breed, with frequencies of 17.6 % for H1, 38.2 % for H2 and 44.2 % for H3. However, only two haplotypes had allele frequencies greater than 5 % in the other four populations. H1, H2 and H3 were the most abundant haplotypes in Duroc (0.75), Landrace (0.78) and Chinese indigenous breeds (>0.73), respectively. In the Duroc population analysed, three haplotypes and five diplotypes were identified, and H1 and H2 accounted for 99 % of the observations. The most prevalent haplotype, H1, accounted for 74.5 % of all haplotypes, and the most common diplotype (58.4 % of samples) comprised H1 homozygotes (Table 5). In Landrace population, H2 and H3 were identified with frequencies of 77.8 and 22.2 % respectively, and the most common diplotype was H2H2 (60.3 % of all investigated samples). Chinese indigenous breeds SXHZ and BM had similar haplotype distributions, which comprising a high frequent haplotype H3 and a low frequent haplotype H1, and the most common diplotype was H3H3.

Table 5 Diplotype and haplotype frequencies of the four SNPs in 5′ flanking region of porcine PNPLA2 gene in different populations

Haplotype association analysis

Associations of the three diplotypes (H1, H2 and H3) with fat deposition related traits were analyzed in three pig populations as shown in Table 6, in which rare diplotypes (frequency < 0.03) were excluded from further analyses.

Table 6 Effects of the different diplotypes in porcine PNPLA2 gene on economic traits

In Duroc breed, H1 and H2 were the main haplotypes. There were significant associations between the diplotypes and FCR and BF1 (P < 0.05), and Duroc individuals with H1H1 diplotype exhibited the lowest FCR (P < 0.05), while H2H2 had the thickest BF1 (P < 0.05). Duroc pigs with H1H1 diplotype had a lower FCR (−0.19 %, P < 0.05) but a thicker BF1 (+0.23 mm, P > 0.05) than pigs with H1H2 diplotype, and a lower FCR (−0.24 %, P > 0.05) and a thinner BF1 (−2.26 mm, P < 0.05) than pigs with H2H2 diplotype. Pigs with H1H2 diplotype had a similar FCR (−0.05 %, P > 0.05) but a significant decrease in BF1 (−2.49 mm, P < 0.05) compared with pigs with H2H2 diplotype. There were no significant differences among different diplotypes for the other traits including TDG, BF2, MD2 and LMP. H3 was present at too low a frequency (0.01) in Duroc breed population for subsequent analysis.

In Landrace breed analyzed, only two haplotypes (H2, H3) and three diplotypes were identified. There were significant associations between the diplotypes and BF1 and MD (P < 0.05), extremely significant with LMP (P < 0.05). Landrace individuals with H2H3 had lower BF1 (P < 0.05), higher MD (P < 0.05) and LMP (P < 0.05) than those with diplotype H2H2 and H3H3. No significant differences were observed among diplotypes for the other traits including TDG, FCR and BF2.

In Yorkshire breed analyzed, three haplotypes were identified and five diplotypes were selected for further association because of the rare frequency of H1H1. However, there were no significant differences among different diplotypes for all the traits measured.

Functional analysis of different haplotypes

To investigate the potential effects of different haplotypes on porcine PNPLA2 transcription, the 1143 bp fragment of 5′-flanking region from −848 to +295 nt was cloned into a luciferase reporter vector. Four constructs differing only at sites −697 (SNP1), −690 (SNP2), −681 (SNP3) and −632 (SNP4) were obtained, three of which harbouring the identified haplotypes, and the remaining control was short of the sequence from −848 to −615 nt. As shown in Fig. 1, functional analysis on PNPLA2 promoter haplotypes indicated that the DNA fragment of PNPLA2 promoter (−848 to +295 nt) harbouring the four SNPs had a strong activity of promoter, compared with positive control transtected with pGL3-promoter. The activity of PNPLA2 promoter was influenced by the polymorphisms of the four SNPs. The mean relative luciferase activity (RLA) of H2 (TACG) and H3 (CACT) haplotype were significantly higher than that of H1 (CGGT) haplotype (P < 0.05), but there was no significant difference with the RLA between H2 and H3 haplotype (P > 0.05). The RLA of pGL3-basic-H2 was the highest among the all vectors, even significantly higher than pGL3-promoter containing SV40 promoter and enhancer, which indicated H2 might cause a strong positive regulatory element. As an experimental control, the activity of pGL3-basic-Del (containing the sequence from −615 to +295nt but lacking the variation region) was significantly higher than pGL3-basic (P < 0.05), which indicated the DNA fragment of PNPLA2 promoter (from −615 to +295nt) also had a high activity of promoter. However, the activity of pGL3-basic-Del was lower than the other three recombinant vectors containing the variation region from −848 to −615nt, which indicated the DNA fragment (from −848 to −615 nt) had an activity of enhancer.

Discussion

PNPLA2 specifically catalyzes the first step in triglyceride hydrolysis in mammalian cells. Therefore, PNPLA2 gene has been an attractive candidate gene for fat deposition related traits in livestock species. Mutations in regions upstream of transcription initiation site may lead to variation in the level or timing of gene expression by destroying or increasing the number of transcription factor binding sites [11, 18, 19]. In this study, we first reported and confirmed four SNPs in the promoter region of PNPLA2 gene in 318 pigs by direct PCR amplification and DNA sequencing methods, revealed significant genetic effects between the mutations and growth and fat deposition traits of pigs, and identified the transcriptional activities for different haplotypes.

Comparison analysis revealed that there are four contiguous SNPs on the upstream of transcription initiation site of porcine PNPLA2 gene. It is considered that a haplotype analysis can be more practical for most complex traits, because a single SNP usually function in coordination with other SNPs to manifest an effect [20, 21]. In the study, haplotypes were constructed using four SNPs of porcine PNPLA2 gene, among which SNP1 and SNP4 are linked completely, so as for SNP2 and SNP3. H1 (CGGT) and H2 (TACG) occupy the whole Duroc population detected and carry the opposite alleles at all the four loci. H2 (TACG) and H3 (CACT) carry the same alleles at SNP2 and SNP3, but opposite at SNP1 and SNP4.

Statistical analysis is one of the ways to clarify the genetic effects. Significant associations between PNPLA2 haplotypes and backfat thickness were observed in Duroc and Landrace. Haplotype analysis showed in Duroc, individuals with H1 haplotype had the lower FCR and BF1 (P < 0.05), which indicated H1 was the preferential haplotype for FCR and backfat thickness traits in Duroc. Interestingly, the decrease effect of H1 on backfat thickness is reflected by the two predicted binding sites for GR and ERα introduced in SNP1 and SNP4. However, individuals with H1 were absent in Landrace, and it showed the preferential heterozygote of H2H3 for backfat thickness (BF2), muscle depth (MD) and LMP in this population. The genetic effects of these haplotypes were not consistent across breed studied. Although the genetic effects should be verified in a larger scale of pig populations, the associations between PNPLA2 promoter haplotypes and phenotypes suggest the possibility of differential promoter activity.

In vitro study, three haplotypes, which consists of four nucleotide variants from −848 to −615 nt in porcine PNPLA2 promoter, showed different basal transcriptional activities in HEK293 cells. It indicated that the haplotype H2 (TACG), which was the most frequent in Landrace, had the highest transcriptional activity, followed by H3(CACT) which mainly existed in Yorkshire, SXH and BM pigs, and then H1 (CGGT) which mainly existed in Duroc. This suggests that the different haplotypes caused by nucleotide variants could change binding affinity to transcription factors individually or interactively. From in vitro data, a lower transcriptional activity of H1 (compared to H3) could be due to differences in ERα binding site at G allele in SNP2 and CAC-binding protein and AP2 binding sites at A allele in SNP2, while a higher transcriptional activity of H2 (compared to H3) could due to differences in GR binding site at G allele in SNP1 (T allele in SNP4) and Sp1 binding sites at G allele in SNP4.

In H1 haplotype, two GR binding sites and an ERα binding site were predicted to be introduced. It has been demonstrated that PNPLA2 gene expression is induced by glucocorticoids (GCs), one of the hormones involved in the response of an organism to fasting [3]. Dexamethasone (GC) can directly stimulate lipolysis by up-regulating mRNA and protein levels of PNPLA2 in a dose and time responsive manner [22]. Therefore, in present study, the action of GCs on HEK293 cells may be mediated by GR, which is a widely expressed nuclear receptor in virtually all tissues. The GR homodimer binds to the glucocorticoid response elements (GREs) and then stimulates PNPLA2 expression [23, 24]. Fortunately, we found GREs were predicted to be introduced in H1 and H3 or abolished in H2 in the 5′-promoter region of porcine PNPLA2 gene due to the mutations. In vivo, GC can be induced in the stress condition such as being fasted, and then pigs with C allele both in SNP1 and SNP4 in PNPLA2 gene could be more easily induced by GC and GR, as a result, the enhanced lipolysis activities of PNPLA2 would gradually result in their thinner backfat thicknesses than pigs with T allele both in SNP1 and SNP4.

Furthermore, evidence has demonstrated a pivotal role of estrogens and estrogen receptors (ERs) in fat formation and metabolism [25]. Estrogen (E2) and ERα have been shown to inhibit adipogenesis [26], and the role of estrogen in adiposity is mediated mainly though ERα [27]. It has been demonstrated that E2 signals via ERα to regulate PNPLA2 and perilipin-mediated lipid metabolism and droplet size in adipocytes [28]. In our study, the ERα binding site introduced in H1 might be recognized by estrogenic complex and lead to the transcriptional activation of PNPLA2 in estrogen stimulation in vivo. Therefore, the GC-GR and E2-ERα actions in vivo might explain the reason why haplotype H1 had thinner backfat thickness than Duroc pigs with H2. Studies of tissue-specific differences in GC and estrogen metabolism and their receptors-mediated actions will be helpful to understand the effects of introduced GR and ERα binding sites on porcine PNPLA2 regulation.

However, while in vitro, HEK293 cells in the present study were not being fasted or treated with GC or estrogen. So there were no promotion effects of GR binding sites or ERα binding site on PNPLA2 transcription. This might explain the reason why haplotype H1 had a relatively low transcriptional activity in HEK293 cells. In vitro results might not represent for in vivo results under the environment with various hormones such as GC and E2.

There were other transcriptional factor binding sites predicted in H2 and H3, such as Sp1, AP-2 and CAC-binding proteins, which are broadly expressed transcription factors and capable of contributing to basal promoter activity. GR binding sites in H1 and H3 might not be induced by GC in HEK293 cells in vitro. Therefore, we considered it may explain the reason why H2 had the highest transcriptional activity in HEK293 cells.

Sp1, as a basal transcription factor, is a zinc finger transcription factor with important roles in inducing the expression of genes associated with lipid metabolism [2932]. It can activate the transcription of many cellular genes that contain putative CG-rich Sp-binding sites in their promoters [33].

AP-2 is an inducible transcription factor and represents a family of five sequence-specific DNA-binding proteins [34]. It can regulate genes containing AP-2 sites in their promoters and play roles in various important biological functions, such as apoptosis and cell growth [35, 36].

CAC box in the promoter could be recognized by CAC-binding proteins encoded by genes with 3′UTR containing motif GRGGSTGGG [37]. It was reported that deletion of a CAC box could result in a drastic decrease in transcription of the porphobilinogen deaminase (PBGD) promoter [37].

Moreover, in the upstream region of porcine PNPLA2 gene, a number of transcription factor binding sites were predicted such as CREBP1, MyoD, C/EBP, nuclear factor 1 (NF-1). These transcription factors might also be involved in the functioning of the PNPLA2 promoter besides the five transcription factors above.

Further biochemical and molecular characterization using electrophoretic mobility shift assays and DNAse I protection assays could elucidate key elements, as well as further research on the transcriptional level of PNPLA2 gene in pigs may provide useful information on lipometabolism and suggestions on marker-assisted selection as well.

Conclusion

In conclusion, four functional polymorphisms were identified in the promoter region of porcine PNPLA2 gene. The allele order of C–G–G–T, corresponding to SNP1-4 or H1, had consistent trends in decreasing FCR and backfat thickness in Duroc pigs, but had a lower basal transcriptional activity in HEK293 cells in vitro. These results suggest that the four SNPs were potential DNA markers for backfat thickness in Duroc pig breeding program and had a regulatory role in PNPLA2 transcription. The in vivo and in vitro experiments should be in more way in further study.