De novo transcriptome analysis of Rhododendron molle G. Don flowers by Illumina sequencing

Xiao, Zheng; Su, Jiale; Sun, Xiaobo; Li, Chang; He, Lisi; Cheng, Shangping; Liu, Xiaoqing

doi:10.1007/s13258-018-0662-8

De novo transcriptome analysis of Rhododendron molle G. Don flowers by Illumina sequencing

Research Article
Published: 03 February 2018

Volume 40, pages 591–601, (2018)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Genes & Genomics Aims and scope Submit manuscript

De novo transcriptome analysis of Rhododendron molle G. Don flowers by Illumina sequencing

Download PDF

Zheng Xiao¹,
Jiale Su¹,
Xiaobo Sun¹,
Chang Li¹,
Lisi He¹,
Shangping Cheng¹ &
…
Xiaoqing Liu¹

739 Accesses
17 Citations
1 Altmetric
Explore all metrics

Abstract

Rhododendron molle G. Don occupies an important phylogenetic node in the genus rhododendron with unique yellow flower and medicinal functions. However, only limited genetic resources and their genome information are available for the generation of rhododendron flowers. The next generation sequencing technologies enables generation of genomic resources in a short time and at a minimal cost, and therefore provide a turning point for rhododendron research. Our goal is to use the genetic information to facilitate the relevant research on flowering and flower color formation in R. molle. In total, 66,026 unigenes were identified, among which 31,298 were annotated in the NCBI non-redundant protein database and 22,410 were annotated in the Swiss-Prot database. Of these annotated unigenes, 9490 and 18,680 unigenes were assigned to clusters of orthologous groups and gene ontology categories, respectively. A total of 7177 genes were mapped to 118 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database. In addition, 8266 simple sequence repeats (SSRs) were detected, and these SSRs will undoubtedly benefit rhododendron breeding work. Metabolic pathway analysis revealed that 32 unigenes were predicted to be involved in carotenoid biosynthesis. Our transcriptome revealed 32 engines that encode key enzymes in the carotenoid biosynthesis pathway, including PSY, PDS, LCYB, LCYE, etc. The content of β-carotene was much higher than the other carotenoids throughout the flower development. It was consistent with the key genes expression level in the carotenoid biosynthesis pathway by the Illumina expression profile analysis and the qRT-PCR analysis. Our study identified genes associated with carotenoid biosynthesis in R. molle and provides a valuable resource for understanding the flowering and flower color formation mechanisms in R. molle.

De novo transcriptome characterization of Lilium ‘Sorbonne’ and key enzymes related to the flavonoid biosynthesis

Article 12 October 2014

De novo sequencing of the Freesia hybrida petal transcriptome to discover putative anthocyanin biosynthetic genes and develop EST–SSR markers

Article 28 August 2018

Transcriptome-sequencing analyses reveal putative genes related to flower color variation in Chinese Rosa rugosa

Article 26 February 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Rhododendrons, which are commonly referred to as rosage, are decorative shrubs with beautiful flowers that are widespread around the world in countries characterized by mild climates (Chosson et al. 1998). While there is a remarkably broad range of rhododendron flower colors, such as red, pink, purple, and white, there is no yellow-flowered cultivar of evergreen rhododendron (Ureshino et al. 2016). A yellow-flowered evergreen rhododendron has been the dream and goal of many rhododendron breeders. Rhododendron molle is a deciduous Rhododendron species with unique yellow flowers. Variations in the morphological characteristics and pigments of wild evergreen rhododendron flowers have been studied in detail (Huyen et al. 2016; Mizuta et al. 2009; Nakatsuka et al. 2008). It has been reported that the flower color of Rhododendron japonicum with yellow-flower is attributed to the presence of a carotenoid (Miyajima et al. 2000). Carotenoids are 40-carbon isoprenoids with polyene chains, which are usually responsible for yellow-orange petal colors and play an important role in furnishing flowers and fruits with distinct colors that are designed to attract pollinators (Hirschberg 2001). During the past decade, nearly all of the genes involved in the carotenoid biosynthesis pathway have been identified and characterized (Pandurangaiah et al. 2016; Rezaei et al. 2016; Yang et al. 2016). The differential expression of these genes may lead to differences in petal color (Sagawa et al. 2016; Xin; Li et al. 2015). However, there are few genetic data concerning carotenoids of R. molle. Therefore, the study of transcrioptome sequencing will provide an important molecular basis for study of R. molle and understanding its flowering mechanisms.

In recent years, the number of EST data sets has rapidly increased (Han et al. 2017; Xia et al. 2017). However, rhododendrons have little data available currently in DNA or protein sequence databases. It hampers the molecular breeding research of rhododendron. Sequencing of large genome remains expensive, even using next generation sequencing technologies. On the contrary, de novo transcriptome sequencing has proved to be accurate and efficient with much higher throughput and relatively low cost, providing opportunities to discover massive expressed sequence data for model and non-model species, such as chrysanthemum (Wang et al. 2017), tea (Li et al. 2017), and lettuce (Zhang et al. 2016). Moreover, the transcriptome is a complete set of transcripts in a cell at a specific developmental stage, providing information on amino acid content of proteins, gene expression and gene regulation. Thus the transcriptome analysis is essential to interpret the functional elements of the genome and reveal the molecular constituents of cells and tissues (Liu et al. 2012; Wang et al. 2009). In addition, transcriptome sequencing can provide a simple and effective method to develop a large number of simple sequence repeats (SSRs) marker and improved the efficiency of molecular marker development in many non-model species (Chen et al. 2016; Guo et al. 2016; Wang et al. 2016). To our knowledge, this is the first report on the characterization of transcrioptome of R. mollle. Hence, the study of transcrioptome sequencing will provide an important molecular basis for study of R. molle and understanding its flowering mechanisms.

In the present study, we used the Illumina sequencing platform to conduct the transcriptome sequencing analysis of mixed RNA separately extracted from flowers at four developmental stages of R. molle flower. A total of 123,120 different transcripts and 66,026 unigenes were identified. Meanwhile, a large number of SSRs were determined. Particularly, the transcripts involved in carotenoid biosynthesis were sequenced, aiming to facilitate color-regulating mechanisms research in rhododendron.

Materials and methods

Plant materials and RNA extraction

Rhododendron molle was grown in the field in Nanjing (China). Samples of flowers at four developmental stages were collected in 2016 as follows: tight bud stage (S1), soft bud stage (S2), initial flowering stage (S3) and full flowering stage (S4). The samples were collected, frozen immediately in liquid nitrogen, and stored in − 80 °C for RNA extraction. Total RNA was extracted using the Quick RNA isolation kit (Bioteke Corporation, Beijing, China) following the manufacturer’s instructions. RNA quality and purity were checked using a Nanodrop 2000 spectrophotometer (Thermo, Wilmington, DE, USA) and Qubit RNA Assay Kit in Qubit 2.0 Fluorometer (Life Technologies, CA, USA). The RNA integrity was assessed using the RNA Nano 6000 Assay Kit of Agilent Bioanalyzer 2100 system (Agilent Technologies, CA, USA).

cDNA library construction and sequencing

A pooled sample containing equal amounts of purified total RNA from each of the four samples was sent to Biomarker Technology Co. Ltd (Beijing, China). Construction of the libraries and the RNA-Seq were performed using the Illumina HisSeq 2500 platform. Firstly, mRNAs were isolated using oligo (dT) magnetic beads and fragmented into small pieces using a fragmentation buffer. First strand cDNA was reverse transcribed by random hexamers, followed by second strand cDNA synthesis. The double-stranded cDNA was subjected to an end repair process and the addition of single nucleotide A. Then paired-end adapters were ligated to the ends of the cDNA fragments. Suitable fragments were selected as templates for polymerase chain reaction amplification. Finally, the library preparations were sequenced on an Illumina HiSeq 2500 platform.

De novo transcriptome assembly

Clean reads were obtained by removing reads with adapters, reads with unknown nucleotides larger than 5% and low quality reads from raw data. Transcriptome de novo was carried out by program Trinity (Grabherr et al. 2011). All transcripts were clustered and the longest transcript in each cluster was defined as the unigene for functional annotation. The initial assembly quality was evaluated using the total number of contigs, mean contig length, maximum contig length and N50 (Martin and Wang 2011). The Trinity utility was used to predict ORFs of at least 100 amino acids from the reconstructed contigs.

Gene annotation and pathway assignments

All assembled unigenes were searched against NCBI non-redundant protein sequences (Nr) to determine the putative mRNA functions using the BLAST algorithm with an E-value lower than 10⁻⁵ (Altschul et al. 1997). GO (Gene ontology) terms were extracted from the best hits obtained from the BLASTX against Nr using the Blast2GO program (Götz et al. 2008). The sequences with corresponding enzyme commission (EC) numbers obtained from blast2GO were mapped to the Kyoto Enzyclopedia of Genes and Genomes (KEGG) Web server (Kanehisa et al. 2008). The WEGO software (http://wego.genomics.org.cn/cgi-bin/wego/index.pl) was used to obtain GO functional classifications. In addition, Unigenes were also aligned to KOG/COG (Clusters of Orthologous Groups of proteins) database, Nt (NCBI non-redundant nucleotide database) and Swiss-Prot to predict and classify possible functions of unigenes (Tatusov et al. 2001).

Extraction and measurement of carotenoids

The samples of flowers at four development stages were collected from five trees. And each stage contained five flowers with three replicates. The carotenoids were extracted and measured according to the previous study (Yan et al. 2015). Frozen flower petals (0.8 g) were homogenized in 4 mL of acetone, and extracted for 24 h away from light at 4 °C. Then the extracts were centrifuged at 16,000×g for 10 min at 4 °C, and filtered through a 0.22 µm filter for analysis in the Agilent 1100 series HPLC system (Agilent, USA). Samples (20 µL of extract) were analyzed using a YMC-C30 chromatographic column coupled with a UV detector at 450 nm, using methyl tertiary butyl ether and methyl alcohol (V/V = 30:70) as solvents: the flow rate was 1.0 mL min⁻¹ and temperature was 25 °C. The lutein, zeaxanthin, α-carotene and β-carotene were quantified by comparing the peak areas.

Quantitative real-time PCR analysis

Quantitative real-time PCR (qRT-PCR) analysis was used to verify the expression levels of genes identified in sequencing. The RNA samples used for qRT-PCR assays were the same as the sequencing samples. Housekeeping gene actin was used as the control gene (Xiao et al. 2016). All primers are listed in Table S1.The qRT-PCR reactions were performed using a SYBR Premix Ex Taq Kit (TaKaRa, Dalian, China) following manufacturer’s instructions and was analyzed in the ABI 7500 Real-Time PCR system (Applied Biosystems, CA, USA). In a white 96-well plate (Axygen, CA, USA), 10 µL SYBR Master Mix and 0.4 µL of 10 µM forward primer and reverse primer was used in combination with 2 µL cDNA template, resulting in a total volume of 20 µL. Cycling conditions involved 10 s at 95 °C followed by 40 cycles of 5 s 95 °C and 34 s 60 °C. A melting dissociation curve was created to identify amplicon characteristics. The final threshold cycle values were average values. Each reaction was performed in triplicate. Analysis of the relative gene expression data was conducted using the 2^−ΔΔCt method.

Development of SSR markers

Simple sequence repeats (SSRs) were detected using the MISA software (http://pgrc.ipk-gatersleben.de/misa/). The parameters were adjusted for identification of perfect di-, tri-, tetra-, penta-, and hexa-nucleotide motifs with a minimum of 6, 5, 5, 5 and 5 repeats, respectively.

Results

Sequencing and de novo assembly

A total of 20,143,879 (5.07 G data) clean reads were obtained after removing duplicate reads, trimming adapters and low quality sequences. The Q30 percentage and GC-content were 88.47 and 47.69%, respectively. The sequencing raw data were deposited in the NCBI database and can be accessed in the Short Reads Archive (SRR6056607).

The trinity software generated 124,120 all-transcripts with length > 200 bp, mean length of 1093 bp and an N50 of 1862 bp (Table 1). 66,026 all-unigenes were achieved with an N50 of 1243 bp. There were 26,539 (40.19%) unigenes of 200–300 bp; 16,007 (24.24%) unigenes of 300–500 bp; 10,656 (16.14%) unigenes of 500–1000 bp; 8053 (12.20%) unigenes of 1000–2000 bp; and the remaining 4771 (7.23%) unigenes were > 2000 bp (Table 1). The length distribution of unigenes is shown in Fig. 1.

Table 1 Length distribution of assemble transcripts and unigenes

Full size table

Functional annotation and classification

All of the contigs generated by Illumina sequencing were aligned to public protein and gene databases. The overall functional annotation is shown in Table 2. The contigs were compared to the sequences in the NR database, using BLASTX (E-value < 1e−3). A total of 31,298 unigenes (47.40%) returned a significant BLASTX result. The contigs homologous to sequences in the NR database were annotated using Blast2GO. A total of 18,680 (28.29%) unigenes was assigned to 4658 GO term annotations, which could be classified into three categories: biological processes, molecular functions and cellular components (Fig. 2). Among biological processes, “oxidation–reduction process” represented the most abundant subcategory followed by “protein phosphorylation” and “regulation of transcription”. In the molecular function category, “ATP binding” and “Zinc ion binding” were the most represented GO terms. Regarding cellular components, the major GO terms were “plasma membrane” and “internal component of membrane”.

Table 2 Functional annotation of the R. molle transcriptome

Full size table

The KOG analysis allowed the functional classification of 17,739 unigenes (Fig. 3). The most frequently identified classes were “general function” (4455, 25.11%), followed by “posttranslational modification” (1976, 11.14%) and “signal transduction mechanisms” (1723, 9.71%).

KEGG pathway and predicted coding sequences (CDS)

A total of 7177 unigenes had significant matches in KEGG, mapped with 118 KEGG pathways (Table S2). Figure 4 summarizes the top 20 represented biological pathways. The most highly represented pathways were “Ribosome” (349 unigenes, 4.86%, ko03010) and “Oxidative phosphorylation” (268 unigenes, 3.73%, ko00190). Four pathways were found to be related to plant color: “Anthocyanin biosynthesis” (2 unigenes, ko00942), “Carotenoid biosynthesis” (32 unigenes, ko00906), “Flavone and flavonol biosynthesis” (7 unigenes, ko00944) and “Flavonoid biosynthesis” (38 unigenes, ko00941).

Based on the public protein database, a total of 55,456 CDSs was obtained. Among these CDSs, 47,695 were between 100 and 1000 bp, 5934 were between 1000 and 2000 bp, and 1827 were over 2000 bp (Fig. 5).

Key enzymes related to the carotenoid pathway

There were 32 unigenes annotated as encoding enzymes involved in carotenoid synthesis based on the KEGG pathway assignments while only 2 unigenes encoding enzymes of the anthocyanin biosynthesis pathway were identified (Table S2). Phytoene synthase (PSY), a rate-limiting enzyme in the carotenoid biosynthesis pathway, catalyzes a head-to-head coupling of two geranylgeranyl diphosphate (GGPP) molecules to produce phytoene (Fig. 6). Then phytoene is catalyzed to lycopene by phytoene desaturase (PDS), ζ-carotene desaturase (ZDS) and carotenoid isomerase (CRTISO). All of these enzymes were identified: c18470.graph_c0 annotated as PSY, c25193.graph_c0 as PDS, c21632.graph_c0 as ZDS and c29096.graph_c0 as CRTISO. Then the carotenoid biosynthetic pathway was split into two routes. In one route, lycopene ε-cyclase (LCYE) and lycopene β-cyclase (LCYB) catalyzed the cyclization of lycopene to produce α-carotene, which was subsequently converted to lutein (Chen et al. 2015). Transcript c30693.graph_c0 and c25695.graph_c0 were annotated as LCYB and LCYE, respectively. Enzymes in the other route were essential for the biosynthesis of abscisic acid (ABA). The route yielded β-carotene with catalysis by LCYB. Then β-carotene was catalyzed by β-ring hydroxylase (CHYB, c18642.graph_c0), zeaxanthin epoxidase (ZEP, c12244.graph_c0), and violaxanthin de-epoxidase (VDE, c11312.graph_c0) produced zeaxanthin and violaxanthin. Finally, violaxanthin was converted to ABA by neoxanthin synthase (NSY) and 9-cis-epoxycarotenoid dioxygenase (NCED, c24238.graph_c0, c18704.graph_c0, c46847.graph_c0 and c62319.graph_c0).

To study whether high carotenoid percentage results from high expression levels of carotenoid biosynthesis related genes in R. molle, the expression of transcripts encoding key enzymes for carotenoids was analyzed. Transcript c21632.graph_c0_seq1, annotated as Zeta-carotene desaturase (ZDS), reached the highest expression level (267.79 FPKM). PSY (C18470.graph_c0), PDS (c25193.graph_c0), CHYB (c18642.graph_c0) and ZEP (c12244.graph_c0) were 104.92 FPKM, 71.38 FPKM, 35.56 FPKM and 19.4 FPKM, respectively. Meanwhile, the expression of LCYB (c30693.graph_c0) was 26.01 FPKM, 3.22-fold higher than LCYE (c25695.graph_c0). These finds impel us to hypothesize that most of lycopenes are converted into β-carotenes by LCYB, and β-carotenes are further modified by hydroxylation or epoxidation, providing a variety of structural features.

Changes of carotenoid content during flower development

The results showed that there were a large number of carotenoids accumulated in the petal during the flower early development. The concentration ranges of lutein, zeaxanthin, α-carotene and β-carotene were 4.24–48.15 µg g⁻¹ FW, 1.72–18.71 µg g⁻¹ FW, 1.05–2.63 µg g⁻¹ FW, 46.22–95.25 µg g⁻¹ FW, respectively. The content of β-carotene was much higher than the other carotenoids during the flower development (Fig. 7). The α-carotene concentration remained low throughout flower development, and declined to the lowest at stage S3. The content of β-carotene tended to decline at stage S2 and rise to the highest at stage S4. On the contrary, the content of lutein and zeaxanthin tended to rise at stage S2 and decline to the lowest at stage S4. It was indicated that β-carotene played an important role in the coloring of R. molle flower.

Validation of Illumina sequencing result by qRT-PCR

To confirm the accuracy and reproducibility of the Illumina expression profiles, qRT-PCR analysis was carried out to analyze the expression level of five genes related to carotenoid catabolism (Fig. 8). The expression levels of each gene were measured through qRT-PCR and compared with its abundance from sequencing data. The qRT-PCR analysis showed that PSY (C18470.graph_c0) and PDS (c25193.graph_c0) were highly expressed during the flower development, followed by CHYB (c18642.graph_c0), LCYB (c30693.graph_c0), and LCYE (c25695.graph_c0). It is generally agreed well with the sequencing result, thus confirmed the Illumina expression profile analysis.

SSR discovery

Approximately 3–7% of expressed genes in the plant genome contain putative SSR motifs, which are mainly located in the untranslated regions of mRNA (Liu et al. 2012). SSRs are widely used for genetics, evolution and breeding research. To explore SSR profiles in the R. molle, the assembled sequences (≥ 1 kb) were mined for SSRs. In total, 8266 SSRs were obtained from 12,824 unigenes with 1707 unigene sequences contained more than one SSR. The di-nucleotide repeat motif was the most abundant, accounting for 56.15%, followed by tri-nucleotide repeat motif (15.56%), tetra-nucleotide (0.57%), penta-nucleotide (0.17%) and hexa-nucleotide repeat units (0.08%) (Table 3).

Table 3 Distribution of identified SSRs using the MISA software

Full size table

The distribution and frequency of classified SSRs were analyzed (Fig. 9). Di-nucleotide was the most common repeat motif with a frequency of 56.15%, followed by tri-nucleotide (15.56%), tetra-nucleotide (0.57%), penta-nucleotide (0.17%), and hexa-nucleotide (0.08%). Among the dinucleotide repeats, the TC/GA (27.44%) and AG/CT (24.28%) were the most frequency motifs, accounting for 92.11% of the total dinucleotide repeats, much higher than that of Dendrocalamus latiflorus Munro and Huperzia serrate Thunb (Liu et al. 2012; Luo et al. 2010). Among the trinucleotide repeats, GAA/CTT (1.44%) and AGA/TCT (1.16%) were the most frequent motifs. The SSRs mining from this database will be valuable resources for genetic manipulations in R. molle.

Discussion

The high-throughput sequencing technology is an efficient and reliable tool for the transcriptome analysis in organisms without a reference genome. Prior to our work, there was no sequencing information on R. molle public database. The cDNA library was constructed using pooled RNA samples. In this study, a total of 5.07 Gb clean data was generated and assembled into 66,026, 31,542 of which were annotated in public databases, and 32 carotenoid structural genes was found. In addition, 8266 SSRs were identified from 12,824 unigenes. This study provides valuable information for future molecular research on R. molle.

In total, 7177 unigenes were assigned to different pathways in the KEGG classification system. Four pathways, flavone and flavonol biosynthesis (ko00944), flavonoid biosynthesis (ko00941), anthocyanin biosynthesis (ko00942), and carotenoid biosynthesis (ko00906), involved in the color formation of plant flower were identified. The carotenoid biosynthesis pathway is important for the synthesis of plant pigments. The color of yellow flowers is usually caused by the presence of carotenoids. The carotenoid metabolic pathway has been extensively studied (Yuan et al. 2015). In pale-yellow petals of marigold, expression levels of nearly all the carotenogentic genes are relatively lower than those in orange petals (Ohmiya 2013). In chrysanthemum petals, no significant difference in the expression levels of carotenoid biosynthesis genes was detected between the white and yellow petals. However, carotenoids are degraded into colorless compounded for CmCCD4a highly expressed in the white-flowered chrysanthemum, which results in the white petal color (Ohmiya et al. 2006). The yellow color fades in azalea petals are also contributed by high expression of CCD4 (Ureshino et al. 2016). Disruption of CCD4 gene can convert flower color from white to yellow in Brassica species (Zhang et al. 2015). White petals of chrysanthemum turned yellow after RNAi construct of CmCCD4a was introduced (Ohmiya et al. 2006). However, the mechanisms that control carotenoid catabolism in R. molle flower are far from conclusive. Therefore, the estimation of candidate enzymes in the biosynthesis of carotenoids will facilitate further research on the underlying mechanisms. In this study, 32 unigenes belonging to the carotenoid synthesis were discovered. According to the sequencing result, the most highly expressed genes related to carotenoid biosynthesis were ZDS, followed by PSY and PDS while no unigene related to CCD4 was detected. In addition, the qRT-PCR results showed that the expression levels of PSY and PDS were low during the early flowering stage, and increased as the flowers developed. The expression level of LCYB was lower than LCYE during the early flowering stage, but higher than LCYE during the later stages. This result was consistent with the changes of carotenoid content in the flower petals of R. molle during the flower development. The content of β-carotene was much higher than the other carotenoids during the flower development, which declined at the stage S2 and rose to the highest at stage S4. The α-carotene concentration remained low throughout flower development. The identification of enzymes and related genes involved in the carotenoid biosynthesis and metabolic pathway would facilitate both color-regulating mechanisms and molecular breeding research in rhododendron.

SSRs were developed as co-dominant and multi-allelic, powerful molecular markers for comparative genetic mapping and genotyping since they are co-dominant, multi-allelic, highly polymorphic, locus-specific, and transportable among species within genera (Bruni et al. 2012; Varshney et al. 2005a, b). Six SSRs have been successfully used to analyze the genetic structure of rhododendron hybrid population for better characterize the evolution of hybridization events (Bruni et al. 2012; Gismondi and Canini 2013). However, only a few SSRs were developed in rhododendron species due to deficient in genomic information. Therefore, it is important to develop more SSR markers. In this study, a total of 8266 SSRs from the transcriptome of R. molle flower were identified. The dinucleotide repeat motifs were the most frequent SSR motifs. The TC/GA/AG/CT motifs were the most frequency motifs, accounting for 92.11% of the total dinucleotide SSRs. Among the di-nucleotide repeat motifs, GA repeats were the most common, which is different from that of Arabidopsis in which AG repeats were the most frequent (Lawson and Zhang 2006). It may due to the introduction of additional repeats during chromosome replication (Eckert et al. 2002). Furthermore, (GA)_n serve as regulatory elements that contains a series of overlapped GAG motifs involved in light regulation (Liu et al. 2012). The unigenes obtained from R. molle transcriptome have provided a valuable genetic resource for SSRs mining and applications in gene mapping, cultivar fingerprinting, and selection of desirable genotypes in rhododendron breeding programs.

We conducted Illumina sequencing to profile the transcriptome of R. molle. To the best of our knowledge, this study presents the first next-generation sequencing of R. molle. The resource information would lay an important foundation for future genetic or genomic studies on rhododendron species. Meanwhile, a large number of putative R. molle homologs that are probably involved in carotenoid biosynthesis pathway were identified. We have identified a large amount of SSRs that are ready for marker development.

References

Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402
Article PubMed PubMed Central CAS Google Scholar
Bruni I, De Mattia F, Labra M, Grassi F, Fluch S, Berenyi M, Ferrari C (2012) Genetic variability of relict Rhododendron ferrugineum L. populations in the Northern Apennines with some inferences for a conservation strategy. Plant Biosyst 146:24–32
Article Google Scholar
Chen L, Li LN, Dai YP, Wang XR, Duan YF, Yang GD (2015) De novo transcriptome analysis of Osmanthus serrulatus Rehd. flowers and leaves by Illumina sequencing. Biochem Syst Ecol 61:531–540
Article CAS Google Scholar
Chen H, Chen X, Tian J, Yang Y, Liu Z, Hao X, Wang L, Wang S, Liang J, Zhang L et al (2016) Development of gene-based SSR markers in rice bean (Vigna umbellata L.) based on transcriptome data. PLoS ONE 11:e0151040
Article PubMed PubMed Central CAS Google Scholar
Chosson E, Chaboud A, Chulia AJ, Rayhaud J (1998) Dihydroflavonol glycosides from Rhododendron ferrugineum. Phytochemistry 49:1431–1433
Article CAS Google Scholar
Eckert KA, Mowery A, Hile SE (2002) Misalignment-mediated DNA polymerase beta mutations: comparison of microsatellite and frame-shift error rates using a forward mutation assay. Biochemistry 41:10490–10498
Article PubMed CAS Google Scholar
Gismondi A, Canini A (2013) Microsatellite analysis of Latial Olea europaea L. cultivars. Plant Biosyst 147:686–691
Article Google Scholar
Götz S, García-Gómez JM, Terol J, Williams TD, Nagaraj SH, Nueda MJ, Robles M, Joaquín MT, Conesa DA (2008) High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res 36:3420–3435
Article PubMed PubMed Central CAS Google Scholar
Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29:644–652
Article PubMed PubMed Central CAS Google Scholar
Guo LN, Zhao XL, Gao XF (2016) De novo assembly and characterization of leaf transcriptome for the development of EST-SSR markers of the non-model species Indigofera szechuensis. Biochem Syst Ecol 68:36–43
Article CAS Google Scholar
Han Y, Wan HH, Cheng TR, Wang J, Yang WR, Pan HT, Zhang QX (2017) Comparative RNA-seq analysis of transcriptome dynamics during petal development in Rosa chinensis. Sci Rep 7:43382
Article PubMed PubMed Central CAS Google Scholar
Hirschberg J (2001) Carotenoid biosynthesis in flowering plants. Curr Opin Plant Biol 4:210–218
Article PubMed CAS Google Scholar
Huyen DTT, Ureshino K, Van DT, Miyajima I (2016) Co-pigmentation of anthocyanin-flavonol in the Blotch Area of Rhododendron simsii Planch. flowers. Hortic J 85:232–237
Article CAS Google Scholar
Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, Katayama T, Kawashima S, Okuda S, Tokimatsu T et al (2008) Kegg for linking genomes to life and the environment. Nucleic Acids Res 36:D480-D484
Google Scholar
Lawson MJ, Zhang LQ (2006) Distinct patterns of SSR distribution in the Arabidopsis thaliana and rice genomes. Genome Biol 7:R14
Article PubMed PubMed Central CAS Google Scholar
Li X, Lu M, Tang D, Shi Y (2015) Composition of carotenoids and flavonoids in narcissus cultivars and their relationship with flower color. PLoS ONE 10:e0142074
Article PubMed PubMed Central CAS Google Scholar
Li J, Lv X, Wang L, Qiu Z, Song X, Lin J, Chen W (2017) Transcriptome analysis reveals the accumulation mechanism of anthocyanins in ‘Zijuan’ tea (Camellia sinensis var. asssamica (Masters) kitamura) leaves. Plant Growth Regul 81:51–61
Article CAS Google Scholar
Liu MY, Qiao GR, Jiang J, Yang HQ, Xie LH, Xie JZ, Zhuo RY (2012) Transcriptome sequencing and de novo analysis for Ma Bamboo (Dendrocalamus latiflorus Munro) using the Illumina platform. PLoS ONE 7:e46766
Article PubMed PubMed Central CAS Google Scholar
Luo H, Sun C, Li Y, Wu Q, Song J, Wang D, Jia X, Li R, Chen S (2010) Analysis of expressed sequence tags from the Huperzia serrata leaf for gene discovery in the areas of secondary metabolite biosynthesis and development regulation. Physiol Plant 139:1–12
Article PubMed CAS Google Scholar
Martin JA, Wang Z (2011) Next-generation transcriptome assembly. Nat Rev Genet 12:671–682
Article PubMed CAS Google Scholar
Miyajima I, Ureshino K, Kobayashi N, Akabane M (2000) Flower color and pigments of intersubgeneric hybrid between white-flowered evergreen and yellow-flowered deciduous azaleas. J Jpn Soc Hortic Sci 69:280–282
Article CAS Google Scholar
Mizuta D, Ban T, Miyajima I, Nakatsuka A, Kobayashi N (2009) Comparison of flower color with anthocyanin composition patterns in evergreen azalea. Sci Hortic 122:594–602
Article CAS Google Scholar
Nakatsuka A, Mizuta D, Kii Y, Miyajima I, Kobayashi N (2008) Isolation and expression analysis of flavonoid biosynthesis genes in evergreen azalea. Sci Hortic 118:314–320
Article CAS Google Scholar
Ohmiya A (2013) Qualitative and quantitative control of carotenoid accumulation in flower petals. Sci Hortic 163:10–19
Article CAS Google Scholar
Ohmiya A, Kishimoto S, Aida R, Yoshioka S, Sumitomo K (2006) Carotenoid cleavage dioxygenase (CmCCD4a) contributes to white color formation in chrysanthemum petals. Plant Physiol 142:1193–1201
Article PubMed PubMed Central CAS Google Scholar
Pandurangaiah S, Ravishankar KV, Shivashankar KS, Sadashiva AT, Pillakenchappa K, Narayanan SK (2016) Differential expression of carotenoid biosynthetic pathway genes in two contrasting tomato genotypes for lycopene content. J Biosciences 41:257–264
Article CAS Google Scholar
Rezaei MK, Deokar A, Tar’an B (2016) Identification and expression analysis of candidate genes involved in carotenoid biosynthesis in chickpea seeds. Front Plant Sci 7:1867
Article PubMed PubMed Central Google Scholar
Sagawa JM, Stanley LE, LaFountain AM, Frank HA, Liu C, Yuan Y-W (2016) An R2R3-MYB transcription factor regulates carotenoid pigmentation in Mimulus lewisii flowers. New Phytol 209:1049–1057
Article PubMed CAS Google Scholar
Tatusov RL, Natale DA, Garkavtsev IV, Tatusova TA, Shankavaram UT, Rao BS, Kiryutin B, Galperin MY, Fedorova ND, Koonin EV (2001) The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res 29:22–28
Article PubMed PubMed Central CAS Google Scholar
Ureshino K, Nakayama M, Miyajima I (2016) Contribution made by the carotenoid cleavage dioxygenase 4 gene to yellow colour fade in azalea petals. Euphytica 207:401–417
Article CAS Google Scholar
Varshney RK, Graner A, Sorrells ME (2005a) Genic microsatellite markers in plants: features and applications. Trends Biotechnol 23:48–55
Article PubMed CAS Google Scholar
Varshney RK, Sigmund R, Borner A, Korzun V, Stein N, Sorrells ME, Langridge P, Graner A (2005b) Interspecific transferability and comparative mapping of barley EST-SSR markers in wheat, rye and rice. Plant Sci 168:195–202
Article CAS Google Scholar
Wang Z, Gerstein M, Snyder M (2009) RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 10:57–63
Article PubMed PubMed Central CAS Google Scholar
Wang P, Ma Y, Ma L, Li Y, Wang S, Li L, Yang R, Wang Q (2016) Development and characterization of EST-SSR markers for Catalpa bungei (Bignoniaceae). Appl Plant Sci 4:1500117
Article Google Scholar
Wang JJ, Wang HB, Ding L, Song AP, Shen F, Jiang JF, Chen SM, Chen FD (2017) Transcriptomic and hormone analyses reveal mechanisms underlying petal elongation in Chrysanthemum morifolium ‘Jinba’. Plant Mol Biol 93:593–606
Article PubMed CAS Google Scholar
Xia EH, Zhang HB, Sheng J, Li K, Zhang QJ, Kim C, Zhang Y, Liu Y, Zhu T, Li W et al (2017) The tea tree genome provides insights into tea flavor and independent evolution of caffeine biosynthesis. Mol Plant 10:866–877
Article PubMed CAS Google Scholar
Xiao Z, Sun X, Liu X, Li C, He L, Chen S, Su J (2016) Selection of reliable reference genes for gene expression studies on Rhododendron molle G. Don. Front Plant Sci 7:1547
PubMed PubMed Central Google Scholar
Yan J, Cai ZX, Shen ZJ, Zhang BB, Ma RJ, Yu ML (2015) Extraction and analytical methods of carotenoids in fruit of yellow flesh peach. J Fruit Sci 32:1267–1274
CAS Google Scholar
Yang LE, Huang XQ, Lu QQ, Zhu JY, Lu S (2016) Cloning and characterization of the geranylgeranyl diphosphate synthase (GGPS) responsible for carotenoid biosynthesis in Pyropia umbilicalis. J Appl Phycol 28:671–678
Article CAS Google Scholar
Yuan H, Zhang J, Nageswaran D, Li L (2015) Carotenoid metabolism and regulation in horticultural crops. Hortic Res 2:15036
Article PubMed PubMed Central CAS Google Scholar
Zhang B, Liu C, Wang Y, Yao X, Wang F, Wu J, King GJ, Liu K (2015) Disruption of a CAROTENOID CLEAVAGE DIOXYGENASE 4 gene converts flower colour from white to yellow in Brassica species. New Phytol 206:1513–1526
Article PubMed CAS Google Scholar
Zhang YZ, Xu SZ, Cheng YW, Ya HY, Han JM (2016) Transcriptome analysis and anthocyanin-related genes in red leaf lettuce. Genet Mol Res 15:gmr.15017023
Google Scholar

Download references

Acknowledgements

This work was supported by the National Science and Technology Program (2013BAD01B070403), National Natural Science Foundation of China (31600570), Jiangsu Natural Science Foundation (BK20150548), Jiangsu Forestry Sciences Innovation Fund (lykj[2017]48), and Jiangsu Agricultural Sciences Innovation Fund (CX(16)1005).

Author information

Authors and Affiliations

Institute of Leisure Agriculture, Jiangsu Key Laboratory for Horticultural Crop Genetic Improvement, Jiangsu Academy of Agricultural Sciences, Nanjing, Jiangsu, China
Zheng Xiao, Jiale Su, Xiaobo Sun, Chang Li, Lisi He, Shangping Cheng & Xiaoqing Liu

Authors

Zheng Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Jiale Su
View author publications
You can also search for this author in PubMed Google Scholar
Xiaobo Sun
View author publications
You can also search for this author in PubMed Google Scholar
Chang Li
View author publications
You can also search for this author in PubMed Google Scholar
Lisi He
View author publications
You can also search for this author in PubMed Google Scholar
Shangping Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoqing Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiaoqing Liu.

Ethics declarations

Conflict of interest

Zheng Xiao declares that he does not have conflict of interest. Jiale Su declares that he does not have conflict of interest. Xiaobo Sun declares that she does not have conflict of interest. Chang Li declares that she does not have conflict of interest. Lisi He declares that she does not have conflict of interest. Shangping Cheng declares that he does not have conflict of interest. Xiaoqing Liu declares that she does not have conflict of interest.

Research involving human and animal participants

This article does not contain any studies with human subjects or animals performed by any of the authors.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (DOCX 15 KB)

Supplementary material 2 (XLSX 43 KB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Xiao, Z., Su, J., Sun, X. et al. De novo transcriptome analysis of Rhododendron molle G. Don flowers by Illumina sequencing. Genes Genom 40, 591–601 (2018). https://doi.org/10.1007/s13258-018-0662-8

Download citation

Received: 27 October 2017
Accepted: 18 January 2018
Published: 03 February 2018
Issue Date: June 2018
DOI: https://doi.org/10.1007/s13258-018-0662-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

De novo transcriptome analysis of Rhododendron molle G. Don flowers by Illumina sequencing

Abstract

Similar content being viewed by others

De novo transcriptome characterization of Lilium ‘Sorbonne’ and key enzymes related to the flavonoid biosynthesis

De novo sequencing of the Freesia hybrida petal transcriptome to discover putative anthocyanin biosynthetic genes and develop EST–SSR markers

Transcriptome-sequencing analyses reveal putative genes related to flower color variation in Chinese Rosa rugosa

Introduction