Abstract
The Dlk1-Dio3 imprinted domain is located on the cattle chromosome 21 and contains three paternally expressed protein-coding genes and a number of maternally expressed short or long noncoding RNA genes. We have previously obtained two maternally expressed long noncoding RNA genes, Meg8 and Meg9, from the cattle. In this study, we identified a novel noncoding RNA located between Meg8 and Meg9 known as LINC24061 according to the GENCODE annotated bibliography. Two alternatively spliced transcripts (LINC24061-v1 and LINC24061-v2) were obtained using RT-PCR and RACE, and the expression pattern of LINC24061-v1 and LINC24061-v2 was shown to be tissue-specific. The LINC24061-v1 splice variant was expressed in only three types of tissues: heart, kidney and muscle; in contrast, LINC24061-v2 was expressed in all eight tissues examined, including heart, liver, spleen, lung, kidney, skeletal muscle, subcutaneous fat, and brain of adult cattle. The allele-specific expression of LINC24061 was identified based on a single nucleotide polymorphism (SNP) in exon 2 of LINC24061. The results showed that LINC24061 exhibited monoallelic expression in all the examined cattle tissues.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
In mammals, nearly 1% of protein-coding genes show parent-of-origin monoallelic expression due to the different epigenetic modifications inherited by the zygote from the sperm and the egg; these parent-dependent epigenetic marks are defined as genomic imprinting (Barlow and Bartolomei 2014). To date, there are more than 300 imprinted genes identified in mouse and human (http://igc.otago.ac.nz), which often occur in clusters. The Dlk1-Dio3 domain is a large imprinted cluster located on mouse chromosome 12q and human chromosome 14q32. This domain is a 1 Mb region flanked by the paternally expressed protein-coding gene Dlk1 (delta-like homolog 1) and Dio3 (the type III iodothyronine deiodinase), with an interior containing maternally expressed long noncoding RNAs (lncRNAs): Meg3/Gtl2, Meg8/Rain, Meg9/Mirg, Peg11as, Irm, and numerous microRNAs and small nuclear RNAs (snoRNAs) (Charlier et al. 2001) (Fig. 1). The expression of the maternally expressed noncoding RNAs in the Dlk1-Dio3 locus has been associated with the development potential of induced pluripotent cells (iPSCs) (Liu et al. 2010; Stadtfeld et al. 2010).
In recent years, the international ENCODE (Encyclopedia of DNA Elements) and GENCODE project has uncovered over 98% of the human genome does not encode protein sequences, whilst at least 80% of genomic DNA is transcribed as noncoding RNAs (Harrow et al. 2012; Pennisi 2012). LncRNAs are non-protein-coding transcripts greater than 200 bp in length, or molecules longer than 2 kb with a coding potential of less than 100 amino acids. LncRNAs can be classified into antisense lncRNAs, intergenic lncRNAs, intronic lncRNAs, and enhancer lncRNAs. By chromatin signature analysis, over a thousand highly conserved large intergenic noncoding RNAs (lincRNAs) have been identified in the mouse (Guttman et al. 2009). Using public expressed sequence tag (EST) data from many developmental stages and tissues of the cattle, 23,060 cattle ncRNAs were predicted, with the majority (57%) of these ncRNAs being intergenic transcripts (Qu and Adelson 2012).
Recently, two novel intergenic long noncoding RNAs (AK044800 and B830012L14Rik) were identified between Meg8 and Meg9 genes in mouse Dlk1-Dio3 imprinted region (Han et al. 2013; Zhang et al. 2011). Previously, we have analyzed the gene structure and alternative splicing patterns of the cattle Meg8 and Meg9 genes (Hou et al. 2011; Zhang et al. 2014). Searching candidate imprinted genes in the cattle Dlk1-Dio3 domain in the NCBI database and UCSC Genome Browser revealed two EST sequences, CB457990 and CB460301, located 115 bp apart in the region between Meg8 and Meg9 (Fig. 1). The two EST sequences were determined to be part of the same transcript, and named LINC24061 according to the GENCODE annotated bibliography. The aim of the present work was to first obtain the full-length cDNA sequence of LINC24061, then analyze the expression of alternative splicing variants and determine the imprinting status of LINC24061.
Materials and methods
Animals and tissues
Samples from heart, liver, spleen, lung, kidney, skeletal muscle, subcutaneous fat, and brain from 32 dairy cattle (Holstein) were collected from a local abattoir and frozen at −70 °C for further analysis. Protocols involving the use of animals were approved by the Agriculture Research Animal Care Committee of the Agricultural University of Hebei.
RNA isolation and cDNA synthesis
Total RNA was extracted from tissue samples of heterozygous individuals stored at −70 °C using Trizol Reagent (Invitrogen, Carlsbad, CA, USA) according to manufacturer’s instructions. First-strand cDNA was synthesized using 20 μL reaction volume, it containing 1 μL total RNA, 1 μL oligo (dT)18 (0.5 μg/μL, Sangon, Shanghai, China), 1 μL dNTP (10 mM each), 4 μL 5 × M-MLV buffer, 0.5 μL (200 units) M-MLV reverse transcriptase (Promega, Madison, WI, USA), 0.5 μL RNase inhibitor (40 units/μL) and 12 μL RNase-free H2O, with incubation at 42 °C for 60 min.
Cloning of LINC24061
A 1050-bp LINC24061 cDNA sequence was first obtained by RT-PCR with primers LINC24061-F1 (5′-TCTAAATACTTGCCCGAG-3′) and LINC24061-R1 (5′-AGAGTTACAGAACCCGTG-3′) designed according to the EST sequences CB457990 and CB460301, respectively. RT-PCR was performed in a 25 μL volume containing 1 μL of first-strand cDNA (50 ng/μL), 12.5 μL 2 × Es Taq MasterMix (CWBIO, Beijing, China), 0.5 μL of forward and 0.5 μL of reverse primers (10 mM), 10.5 μL of ddH2O and using the following temperature cycle: 94 °C for 5 min, 30 cycles of 94 °C for 30 s, 55 °C for 30 s, 72 °C for 45 s and final extension at 72 °C for 10 min. The PCR product was separated on 1.5% agarose gel, purified with an E.Z.N.A.GelR Extraction Kit (Omega, Monsanto, USA), and sequenced (BGI, Beijing, China).
The 5′ RACE (rapid amplification of cDNA ends) and 3′ RACE reaction was performed to obtain the 5′ and 3′ ends of the LINC24061 cDNA sequence, respectively, using the SMARTer™ RACE cDNA Amplification Kit. The RACE reaction was performed by nested PCR according to the manufacturer’s protocol. Gene-specific primers were as follows: 5′ GSP1 (5′-ATGGAGCAGCATCTACAAAGTTCCGAGG-3′) and 5′ GSP2 (5′-GGTAGGACTTCCAGGTCAACAGATACGATG-3′), 3′ GSP1 (5′-TGCTTCTGGGGATTCTGGCTTTTCTAA-3′) and 3′ GSP2 (5′-CATGCTGGCAAAGTCACGGTGGGGGAAC-3′). The amplified products were purified using an E.Z.N.A.GelR Extraction Kit (Omega, USA), cloned into PMD18-T (TaKaRa, Shanghai, China), and sequenced (BGI, China).
RT-PCR
RT-PCR was performed to detect the expression patterns of the two transcripts of LINC24061 in different tissues, including heart, liver, spleen, lung, kidney, skeletal muscle, subcutaneous fat, and brain. Two forward primers and one reverse primer were as follows: LINC24061-F2 (5′-TGGCATCCGTGTCACT-3′) on exon 1, LINC24061-F3 (5′-CTCCAGGGAAGACACACT-3′) on exon 2 and LINC24061-R2 (5′-GCCCTGATGGTTACTTCTGGG-3′). An amplified fragment of the GAPDH (GenBank accession no.BTU85042) gene was used as an internal control with primers GAPDH-F (5′-GCACAGTCAAGGCAGAGAAC-3′) and GAPDH-R (5′-GGTGGCAGTGATGGCGTGGA-3′).
Strand-specific RT-PCR (ssRT-PCR) of LINC24061
To detected the LINC24061 mRNA is transcribed in sense or in antisense, samples of eight tissues (heart, liver, spleen, lung, kidney, skeletal muscle, subcutaneous fat, and brain) were performed strand-specific reverse transcription using using primer LINC24061-F4 (5′-AAGTAACCATCAGGGCT-3′) or LINC24061-R1, which were used to detect antisense and sense strand transcripts, respectively. A 721 bp PCR product was amplified using LINC24061-F4 and LINC24061-R3 (5′-GCATCTACAAAGTCCGA-3′) in a 25 μL volume.
DNA extraction and SNP identification
Genomic DNA was extracted from liver tissue of 32 adult Holstein cattle with a DNA Extraction kit (Sangon, Shanghai,China) according to the manufacturer’s instruction. The SNP site was identified by sequencing the PCR products directly. A 1050 bp fragment was amplified using the primers LINC24061-F1 and LINC24061-R1, which are located on a commonly shared exon 2 sequence of both transcripts. PCR was performed in a 25 μL volume containing: 1 μL Genomic DNA (100 ng/μL), 0.5 μL forward and reverse primers (10 μM/μL), 2 μL dNTP (2.5 nm/μL), 2.5 μL 10 × Taq Buffer, 0.3 μL Taq DNA polymerase (5 U/μL) and 18.2 μL ddH2O. The PCR procedure was as follows: 94 °C for 5 min, 35 cycles of 94 °C for 30 s, 55 °C for 30 s, 72 °C for 45 s and final extension at 72 °C for 10 min. The amplified products were purified using an E.Z.N.A.GelR Extraction Kit (Omega, monsanto, USA) and sequenced (BGI, China). The SNP site was identified by observing the double peaks in the sequencing chromatogram.
Allele-specific expression of LINC24061
The heterozygous animals with the identified SNP site were used to analyze the allele-specific expression of LINC24061. Total RNA prepared from the tissues of heterozygous animals was reverse transcribed into cDNA. RT-PCR was then performed using the primers LINC24061-F1 and LINC24061-R1, and the reaction volume and cycling conditions were the same as that of the SNP identification. The amplified products were purified and sequenced directly.
Results
Cloning and structure analysis of LINC24061 gene
The RT-PCR and RACE were used to determine the full-length cDNA sequences of LINC24061. A 1050 bp product containing partial sequence of both CB457990 and CB460301 was first obtained by RT-PCR using primers LINC24061-F1 and LINC24061-R1, indicating that the two EST sequences belong to the same transcript. The sequencing results of RACE indicated that two transcripts exist in the 5′ end and one in the 3′ end. The two splice variants of LINC24061 have been submitted to GenBank (KU870638 and KU870639). The LINC24061 molecular structure and two splice variants are shown in Fig. 2a. LINC24061-v1 has 2 exons, but LINC24061-v2 lacks exon 1 and only has a shorter exon 2 with 181 bp missing at the 5′ end. The open reading frames (ORFs) were detected using the Open Reading Frame Finder (www.ncbi.nlm.nih.gov/gorf.html), and multiple small ORFs were found in the two transcripts of LINC24061. However, none of the ATG start codons were consistent with the Kozak consensus sequence (ACCAUGG), suggesting that LINC24061 is a noncoding RNA gene (Fig. 2b). The results of ssRT-PCR assay indicated that LINC24061 is only transcribed in senses in the cattle tissues (Fig. 2c).
Expression profile of LINC24061
The expression patterns of the two splice variants of LINC24061 in eight tissues were determined using RT-PCR, including heart, liver, spleen, lung, kidney, skeletal muscle, subcutaneous fat, and brain (Fig. 3). Two forward primers (LINC24061-F2 and LINC24061-F3) and one reverse primer LINC24061-R2 were designed according to the two splice variants of the cattle LINC24061. The locations of the primers were shown in Fig. 2a. A 1123 bp fragment of LINC24061-v1 and a 598 bp fragment of LINC24061-v2 were obtained using RT-PCR. The LINC24061-v1 splice variant was expressed in only three types of tissues: heart, kidney, and muscle, whereas LINC24061-v2 was expressed in all eight tissues examined.
Identification of SNP
The SNP site was identified by direct sequencing of the PCR products. A 1050 bp fragment of LINC24061 was amplified from genomic DNA using the primers LINC24061-F1 and LINC24061-R1 (Fig. 4b). An A/C SNP was determined at nucleotide 1512 of LINC24061-v1 (accession number: KU870638). The sequencing results of three genotypes, heterozygous (A/C) and homozygous (A/A) and (C/C), were shown in Fig. 4b. In 32 cattle, six individuals were the A/C heterozygous genotype, three were the A/A homozygous genotype, and 23 were the C/C homozygous genotype.
Allele-specific expression analysis of LINC24061 in the cattle
The allele-specific expression analysis of LINC24061 was investigated by comparing the base of the heterozygosis site from the sequencing chromatograms between products of the genomic DNA PCR and RT-PCR from the same heterozygous animal. The sequencing results of RT-PCR products obtained from heart, liver, spleen, lung, kidney, skeletal muscle, subcutaneous fat, and brain tissues revealed that only one parental allele (C) was expressed at the A/C SNP locus (Fig. 4c), suggesting that LINC24061 is imprinted in cattle.
Discussion
Long noncoding RNAs (lncRNAs) are transcribed across most of the mammalian genome (Carninci et al. 2005), and the field of lncRNA research has been rapidly advancing in recent years. The ultra RNA deep sequencing analysis revealed that the number of lncRNA expressed genes is more than double of protein-coding genes in human (Iyer et al. 2015). LncRNAs can be classified into intergenic lncRNAs, antisense lncRNAs, intronic lncRNAs, and enhancer lncRNAs based on their location. Most lncRNAs play a functional role in genomic imprinting, with the exception of intronic lncRNAs (Kanduri 2016). In this study, we identified LINC24061 as a novel imprinted lncRNA in the Dlk1-Dio3 domain.
Alternative splicing is a normal phenomenon in eukaryotes, and can greatly add to the biodiversity without increasing the number of genes encoded by a genome (Black 2003). Approximately 95% of multi-exonic genes are alternatively spliced in humans (Pan et al. 2008). Previously, we obtained the alternative splices of three lncRNAs in the cattle Dlk1-Dio3 domain: six splice variants of Gtl2 gene (Su et al. 2011), 12 splice variants of Meg8 (Hou et al. 2011), and three splice variants of Meg9 (Zhang et al. 2014). In our analysis of LINC24061, two splice variants (LINC24061-v1 and LINC24061-v2) were obtained using RT-PCR and RACE method. Although multiple small ORFs were encoded by the two splice variants, neither contained a Kozak sequence for translation initiation, suggesting that LINC24061 plays a role as a noncoding RNA.
In this study, LINC24061 was identified a ncRNA with monoallelic expression in tissues of adult cattle. LncRNAs often show tissue- and cell-specific expression (Dinger et al. 2008; Mercer et al. 2008; Ravasi et al. 2006). In mouse, the intergenic lncRNA B830012L14Rik was primarily expressed in brain, heart, lung, and liver at embryonic day 15.5 (Zhang et al. 2011). In this study, the expression of an LINC24061-v1 variant was detected in three tissues, heart, kidney, and muscle, whereas LINC24061-v2 was detected in all eight tissues examined, including heart, liver, spleen, lung, kidney, skeletal muscle, subcutaneous fat, and brain. These expression patterns were similar to that of Meg9, with three splice variants showing tissue-specific expression (Zhang et al. 2014).
The function of LINC24061 is unknown, but several lines of evidence provide hints to its potential role. The potential promoter of LINC24061 was predicted using PromoterScan software. Analysis of the potential promoter region of LINC24061 revealed that two DNA elements, Bov-A2 and La2, were observed in the 5 kb upstream sequence of LINC24061. The Bov-A2 is a retroposon, that is one of the most common short interspersed nucleotide elements (SINEs) among the genomes of ruminants, and is generally present in the noncoding regions of several genes preferentially expressed during the cellular response to environmental stress or activation signals (Damiani et al. 2008). The DNA element, L2a, contains binding sites for two MAR (matrix-associated region)-interacting proteins (SATB1 and CDP), and functions as a silencer of CD8 gene encoding an important T cell co-receptor in mouse (Yao et al. 2010). Therefore, LINC24061 may play a role in the cell growth and differentiation.
References
Barlow DP, Bartolomei MS (2014) Genomic imprinting in mammals. Cold Spring Harb Perspect Biol 6:a018382. doi:10.1101/cshper-spect.a018382
Black DL (2003) Mechanisms of alternative pre-messenger RNA splicing. Annu Rev Biochem 72:291–336. doi:10.1146/annurev.biochem.72.121801.161720
Carninci P, Kasukawa T, Katayama S et al (2005) The transcriptional landscape of the mammalian genome. Science 309(5740):1559–1563. doi:10.1126/science.1112014
Charlier C, Segers K, Wagenaar D et al (2001) Human-ovine comparative sequencing of a 250 kb imprinted domain encompassing the callipyge (clpg) locus and identification of six imprinted transcripts: DLK1, DAT, GTL2, PEG11, antiPEG11, and MEG8. Genome Res 11:850–862. doi:10.1101/gr.172701
Damiani G, Florio S, Panelli FS, Capelli E, Cuccia M (2008) The Bov-a2 Retroelement played a crucial role in the evolution of ruminants. Riv Biol 101:375–404
Dinger ME, Amaral PP, Mercer TR et al (2008) Long noncoding RNAs in mouse embryonic stem cell pluripotency and differentiation. Genome Res 18:1433–1445. doi:10.1101/gr.078378.108
Guttman M, Amit I, Garber M et al (2009) Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature 458:223–227. doi:10.1038/nature07672
Han Z, Liu Q, Huang Z et al (2013) Expression and imprinting analysis of AK044800, a transcript from the Dlk1-Dio3 imprinted gene cluster during mouse embryogenesis. Mol Cells 35:285–290. doi:10.1007/s10059-013-2275-z
Harrow J, Frankish A, Gonzalez JM et al (2012) GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res 22:1760–1774. doi:10.1101/gr.135350.111
Hou XH, Li DJ, Su H, Hu JQ, Li N, Li SJ (2011) Molecular cloning, expression, and imprinting status of maternally expressed gene 8 (Meg8) in dairy cattle. Rus J Genet 47:994–998. doi:10.1134/S1022795411080096
Iyer MK, Niknafs YS, Malik R et al (2015) The landscape of long noncoding RNAs in the human transcriptome. Nat Genet 47:199–208. doi:10.1038/ng.3192
Kanduri C (2016) Long noncoding RNAs: lessons from genomic imprinting. Biochim Biophys Acta 1859:102–111. doi:10.1016/j.bbagrm
Liu L, Luo GZ, Yang W et al (2010) Activation of the imprinted Dlk1-Dio3 region correlates with pluripotency levels of mouse stem cells. Biol Chem 285:19483–19490. doi:10.1074/jbc.M110.131995
Mercer TR, Dinger ME, Sunkin SM, Mehler MF, Mattick JS (2008) Specific expression of long noncoding RNAs in the mouse brain. Proc Natl Acad Sci 105:716–721. doi:10.1073/pnas.0706729105
Pan Q, Shai O, Lee LJ, Frey BJ, Blencowe BJ (2008) Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat Genet 40(12):1413–1415. doi:10.1038/ng.259
Pennisi E (2012) Genomics. ENCODE project writes eulogy for junk DNA. Science 337:1159–1161. doi:10.1126/science.337.6099.1159.PMID22955811
Qu Z, Adelson DL (2012) Bovine ncRNAs are abundant, primarily intergenic, conserved and associated with regulatory genes. PLoS ONE 7:e42638. doi:10.1371/journal.pone.0042638
Ravasi T, Suzuki H, Pang KC et al (2006) Experimental validation of the regulated expression of large numbers of non-coding RNAs from the mouse genome. Genome Res 16:11–19. doi:10.1101/gr.4200206
Stadtfeld M, Apostolou E, Akutsu H et al (2010) Aberrant silencing of imprinted genes on chromosome 12qF1 in mouse induced pluripotent stem cells. Nature 465:175–181. doi:10.1038/nature09017
Su H, Li D, Hou X et al (2011) Molecular structure of bovine Gtl2 gene and DNA methylation status of Dlk1-Gtl2 imprinted domain in cloned bovines. Anim Reprod Sci 127:23–30. doi:10.1016/j.anireprosci.2011.07.002
Yao X, Nie H, Rojas IC et al (2010) The L2a element is a mouse CD8 silencer that interacts with MAR-binding proteins SATB1 and CDP. Mol Immunol 48:153–163. doi:10.1016/j.molimm.2010.08.014
Zhang FW, Zeng TB, Han ZB et al (2011) Imprinting and expression analysis of a non-coding RNA gene in the mouse Dlk1-Dio3 domain. J Mol Histol 42:333–339. doi:10.1007/s10735-011-9337-3
Zhang K, Li D, Wang M, Wu G, Shi Y, Li S (2014) The differential expression of alternatively spliced transcripts and imprinting status of MEG9 gene in cows. Anim Genet 45:660–664. doi:10.1111/age.12195
Acknowledgements
This study was supported by National Natural Science foundation of China (31372312 and 30972098) and Natural Science foundation of Hebei (C2016204092).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhang, M., Zhao, Y., Wang, G. et al. An imprinted long noncoding RNA located between genes Meg8 and Meg9 in the cattle Dlk1-Dio3 domain. Genetica 145, 1–7 (2017). https://doi.org/10.1007/s10709-016-9939-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10709-016-9939-5