Abstract
The chloroplast genomes have been utilized in many studies of evolution, population structure and molecular identification. The aim of this study was to assemble the chloroplast genome of Hancornia speciosa Gomes, an economically important fruit species from Brazil, using next-generation sequencing data in order to reconstruct the phylogenetic analysis of Rauvolfioideae. Twenty-four million Illumina short paired-end reads were used for de novo assembly, and the chloroplast contig was identified by BLAST. The chloroplast genome of H. speciosa contains 155,357 bp, with 25,755 bp of inverted repeat A, 25,654 bp of inverted repeat B, 85,702 bp of large single copy and 18,229 bp of small single copy. The coding regions contained 83 genes, eight rRNAs and 36 tRNAs. The phylogenomic analysis reported similar topologies as in previous studies, which used partial chloroplast genomes, and demonstrated that Rauvolfioideae is paraphyletic. In summary, we demonstrated: (1) the first complete chloroplast genome of H. speciosa; (2) phylogenies analyzed using the complete chloroplast genomes revealed a robust phylogenetic topology for Rauvolfioideae; and (3) the phylogenetic analysis showed that the subfamily Rauvolfioideae is paraphyletic.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The Apocynaceae family consists of the subfamilies Rauvolfioideae, Apocynoideae, Periplocoideae, Secamonoideae and Asclepiadoideae (Simões et al. 2007), among which the Rauvolfioideae includes the Hancornia speciosa Gomes species. The genus Hancornia is monotype, and for H. speciosa there have been described four botanical varieties: H. speciosa var. cuyabensis Malme, H. speciosa var. gardneri (A. DC.) Muell. Arg., H. speciosa var. speciosa Gomes and H. speciosa var. pubensces (Nees. and Martius) Muell. Arg. (Collevatti et al. 2018). Hancornia speciosa is known as mangabeira in the vernacular language and is of economic, ecological and social importance, utilized in agriculture and industry for human food (Silva-Júnior and Lêdo 2006; Silva et al. 2011). The phylogeny of the Apocynaceae is robust and contains monophyletic and paraphyletic subfamilies; however, some tribes and the paraphyletic Rauvolfioideae and Apocynoideae subfamilies need new studies for better understanding of paraphyly (Fishbein et al. 2018).
The chloroplasts are essential for plants and are important for photosynthesis, biosynthesis, and carbon sequestration and have been utilized for phylogenetic analysis with robust branch support in other taxa (Barrett et al. 2016). These cytoplasmic organelles have a genome independent of the nuclear genome, and it is generally inherited through the female line only. The chloroplast genome is organized in a quadripartite circular structure, with a 100–200 kbp size range, primarily composed of two large inverted repeats (IRs), which contain the ribosomal genes and other plastid genes, separated by a large single copy (LSC) region and a small single copy (SSC) region. The coding regions include 16S, 23S, 5S, and 27–31 tRNA genes, which are sufficient to translate all of the amino acids, including three genes for the RNA polymerase subunit (similar to the situation in prokaryotes) and most of the genes for photosystem I, photosystem II, cytochrome, and ATP synthesis (revised by Green 2011), totaling approximately 80 proteins (Huang et al. 2013).
The chloroplast genomes are haploid and highly conserved regarding their genetic content and quadripartite genomic structure. They have been widely used to study the evolutionary relationships of different taxonomic levels in plants (Ravi et al. 2008). The present study investigated three questions: (1) What is the structure of the H. speciosa chloroplast genome? (2) Is phylogeny using complete chloroplast genomes more robust than when using partial chloroplast sequences? (3) Is the evolution of the structure of IRs, LSC, and SSC different within the subfamily Rauvolfioideae? To answer these questions, we obtained the complete H. speciosa chloroplast genome, and an evolutionary analysis was constructed for the family using complete chloroplast genomes.
2 Materials and methods
Plant material, DNA isolation, and high-throughput DNA sequencing
– Hancornia speciosa var speciosa plant material was collected in the state of Pernambuco, Brazil, and total DNA was extracted (including nuclear, chloroplast, and mitochondrial DNA) from the leaves utilizing the cetyltrimethylammonium bromide (CTAB) extraction method (Doyle and Doyle 1987). The quality and quantity of the extracted DNA were verified by visualization on a 1% agarose gel and a spectrophotometer, respectively. The DNA sample was fragmented into 400–500 bp to construct the sequencing library. The fragments were ligated with adapters using the “Nextera DNA Sample Preparation” (Illumina), and we sequenced the 100 nt paired-end reads. The sequencing was performed at the Central Laboratory for High Performance Technologies in Life Sciences (LacTad-Laboratório Central de Tecnologias de Alto DesempenhoemCiências da Vida) at the State University of Campinas-UNICAMP, São Paulo.
Chloroplast genome assembly and annotation
– Twenty-four million reads were utilized for de novo assembly using Ray software (Boisvert et al. 2012). The de novo contigs for the chloroplast genomes were then manually merged using paired-end pair reads in the Geneious R9 software. Genome annotation was achieved using Catharanthus roseus (L.) (Rauvolfioideae) as the reference by the Geneious software. For annotation using Geneious, a minimum of 70% identity cutoff between the genomes was considered. The annotations were individually checked, and if necessary, were manually corrected for start and stop codons. The annotated genome was checked with cpGavas (Liu et al. 2012). A graphic representation of the plastomes was created using Geneious R9.
Phylogenetic analysis and genome comparison
– The complete chloroplast genome sequences were aligned using the program MAFFT v7.017 (Katoh and Standley 2013) implemented as the “Multiple align” tool in Geneious R9. The GTR model was determined using Bayesian Information Criterion Evolutionary implemented in MEGA7 software (Kumar et al. 2016). The evolutionary history was inferred by using the maximum likelihood method, and branch support was assessed with 1000 bootstrap replicates conducted in MEGA7 software. For analysis of the branches support, phylogenetic trees were performed using rbcL gene, rbcL + matK genes, rbcL + matK + trnH-psbA and complete chloroplast genome sequences. The linear comparison among chloroplast genomes of the Rauvolfioideae was performed using BLAST Ring Image Generator with the default software settings (Alikhan et al. 2011), and the number of interspecific SNPs was identified using Geneious.
3 Results and discussion
Chloroplast genome of H. speciosa
– Genetic studies based on NGS analyses are capable of unlocking information about the genome, transcriptome, and epigenome of any organism (Chaitankar et al. 2016). In these circumstances, the Illumina paired-end reads from H. speciosa var speciosa were de novo assembled, which produced contigs that allowed the entire genome to be included in the draft. The complete genome was finished by mapping reads in the draft genome, which corrected all errors. After obtaining the final genomes, the 24 million reads were mapped, resulting in an average coverage of 340×. The chloroplast genome of H. speciosa consisted of 155,357 bp (NCBI access number MG049918), with the typical conserved “quadripartite” structure, with a pair of inverted repeats (IRA and IRB) separated by LSC and SSC regions (Fig. 1). The IRA and IRB consisted of 25,755 bp and 25,654 bp, respectively, separated by 85,702 bp of LSC and 18,229 bp of SSC. The genome included 127 genes, 83 protein-coding, 36 tRNAs, and 4 rRNAs (Table 1 and Fig. 1), and all protein-coding genes used AUG as the start codon. The chloroplast genome of H. speciosa showed a typical structure, with the expected content of LSC, SSC, and two IRs, corroborating the findings of several other studies that have described the chloroplast genome as a conserved genome and that the genome size is largely due to indels in the LSC regions (Barrett et al. 2016; Chen et al. 2017). In the Rauvolfioideae subfamily, only three chloroplast genomes are available: Rhazya stricta Decne (Park et al. 2014), Catharanthus roseus (Ku et al. 2013), and Carissa macrocarpa (Eckl.) A. DC. (Jo et al. 2017). The genome sequences produced in the present study will contribute to future phylogenetics analysis using complete chloroplast genomes.
Comparative chloroplast genome analysis in Rauvolfioideae
– The complete chloroplast genomes for the Rauvolfioideae subfamily were utilized in the comparative analyses, and the results showed that regions IRA and IRB showed smaller abundance of SNPs when compared with the LSC and SSC regions (Fig. 2). The genome includes similar genes annotation, and the arrangements of these regions are exclusively collinear (Fig. 3). The phylogenetic analysis showed that H. speciosa is close to Catharanthus roseus with high support (100%), while Rhazya stricta and Carrissa macrocarpa were in another clade, with high support (Fig. 3). When the partial chloroplast genome sequences were used (rbcL gene, rbcL + matK genes and rbcL + matK + trnH-psbA), the results showed low support when compared to using complete chloroplast genome sequences (Fig. S1). The previous phylogenetic analysis in Apocynaceae indicated 100% support for many branches; however, there are many other branches with 90% or less than 90% support (Fishbein et al. 2018). In the present study, all branches showed 100% bootstrap support, suggesting that phylogeny using complete chloroplast genome is more robust than using partial chloroplast sequences. For SSR analysis, we observed motifs of di-, tri-, and tetra-nucleotides; however, tetra-nucleotides were found only in H. speciosa (Fig. 4). Among these SSR motifs, the di-nucleotides were more abundant in the four species, followed by tri-nucleotides, of which the number of di-nucleotides ranged from 8 to 14 among genomes and tri-nucleotides ranged from 1 to 5 (Fig. 4).
The Apocynaceae family contains 370 genera, organized into 25 tribes in five subfamilies, and although the phylogeny of Apocynaceae has been analyzed based on morphological characteristics and DNA sequences, the phylogenetics analysis has shown that Rauvolfioideae is first diverging lineage, while other subfamilies are derived (Lahaye et al. 2007; Simões et al. 2007; Livshultz 2010; Simões et al. 2016; Fishbein et al. 2018; Ollerton et al. 2018). The phylogenetic tree using complete chloroplast genomes corroborates the findings of prior studies (Simões et al. 2007; Livshultz et al. 2007; Fishbein et al. 2018), in which Rauvolfioideae is basal in the Apocynaceae; however, two clades have been observed, suggesting that Rauvolfioideae is paraphyletic as described in previous studies (Simões et al. 2007; Fishbein et al. 2018). The paraphyly (Rauvolfioideae and Apocynoideae) and monophyly (Periplocoideae, Secamonoideae and Asclepiadoideae) in the family happened mainly due to its wide distribution in the tropics around the world, which resulted in different modifications and adaptations between species. However, the paraphyly within Rauvolfioideae have not yet been analyzed for their chloroplast genome structure, but a comparison among genomes has shown that their gene order and content are highly conserved.
References
Alikhan NF, Petty NK, Ben Zakour NL, Beatson SA (2011) BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons. BMC Genom 12:402. https://doi.org/10.1186/1471-2164-12-402
Barrett CF, Baker WJ, Comer JR, Conran JG, Lahmeyer SC, Leebens-Mack JH, Li J, Lim GS, Mayfield-Jones DR, Perez L et al (2016) Plastid genomes reveal support for deep phylogenetic relationships and extensive rate variation among palms and other commelinid monocots. New Phytol 209:855–870. https://doi.org/10.1111/nph.13617
Boisvert S, Raymond F, Godzaridis E, Laviolette F, Corbeil J (2012) Ray Meta: scalable de novo metagenome assembly and profiling. Genome Biol 22:1–13. https://doi.org/10.1186/gb-2012-13-12-r122
Chaitankar V, Karakülah G, Ratnapriya R, Giuste FO, Brooks MJ, Swaroop A (2016) Next generation sequencing technology and genomewide data analysis: perspectives for retinal research. Prog Retin Eye Res 55:1–3. https://doi.org/10.1016/j.preteyeres.2016.06.001
Chen Z, Grover CE, Li P, Wang Y, Nie H, Zhao Y, Ang M, Liu F, Zhou Z, Wang X, Cai X, Wang K, Wendel JF, Hua J (2017) Molecular evolution of the plastid genome during diversification of the cotton genus. Mol Phylogenet Evol 112:268–276. https://doi.org/10.1016/j.ympev.2017.04.014
Collevatti RG, Rodrigues EE, Vitorino LC, Lima-Ribeiro MS, Chaves LJ, Telles MPC (2018) Unravelling the genetic differentiation among varieties of the Neotropical savanna tree Hancornia speciosa Gomes. Ann Bot 122:973–984. https://doi.org/10.1093/aob/mcy060
Doyle JJ, Doyle JL (1987) A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull 19:11–15
Fishbein M, Livshultz T, Straub SCK, Simões AO, Boutte J, McDonnell A, Foote A (2018) Evolution on the backbone: Apocynaceae phylogenomics and new perspectives on growth forms, flowers, and fruits. Am J Bot 105:495–513. https://doi.org/10.1002/ajb2.1067
Green BR (2011) Chloroplast genomes of photosynthetic eukaryotes. Plant J 66:34–44. https://doi.org/10.1111/j.1365-313X.2011.04541.x
Huang YY, Matzke AJM, Matzke M (2013) Complete sequence and comparative analysis of the chloroplast genome of coconut palm (Cocos nucifera). PLoS ONE 8:1–12. https://doi.org/10.1371/journal.pone.0074736
Jo S, Kim HW, Kim YK, Cheon S-H, Kim KJ (2017) The complete plastome sequence of Carissa macrocarpa (Eckl.) A. DC. (Apocynaceae). Mitochondrial DNA Part B 2:26–28. https://doi.org/10.1080/23802359.2016.1233468
Katoh K, Standley DM (2013) MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30:772–780. https://doi.org/10.1093/molbev/mst010
Ku C, Chung WC, Chen LL, Kuo CH (2013) The complete plastid genome sequence of Madagascar periwinkle Catharanthus roseus (L.) G. Don: plastid genome evolution, molecular marker identification, and phylogenetic implications in asterids. PLoS ONE 8:1–11. https://doi.org/10.1371/journal.pone.0068518
Kumar S, Stecher G, Tamura K (2016) MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol 33:1870–1874. https://doi.org/10.1093/molbev/msw054
Lahaye R, Klackenberg J, Källersjö M, Campo EV, Civeyrel L (2007) Phylogenetic relationships between derived Apocynaceae s.l. and within Secamonoideae based on chloroplast sequences. Ann Missouri Bot Gard 94:376–391
Liu C, Shi L, Zhu Y, Chen H, Zhang J, Lin X, Guan X (2012) CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences. BMC Genom 13:715. https://doi.org/10.1186/1471-2164-13-715
Livshultz T (2010) The phylogenetic position of milkweeds (Apocynaceae subfamilies Secamonoideae and Asclepiadoideae): evidence from the nucleus and chloroplast. Taxon 59:1016–1030
Livshultz T, Middleton DJ, Endress ME, Williams JK (2007) Phylogeny of Apocynoideae and the Apsa clade (Apocynaceae S.L.)1. Ann Missouri Bot Gard 94:324–359. https://doi.org/10.3417/0026-6493(2007)94%5b324:poaata%5d2.0.co;2
Ollerton J, Liede-Schumann S, Endress ME, Meve U, Rech AR, Shuttleworth A, Keller HA, Fishbein M, Alvarado-Cárdenas LO, Amorim FW et al (2018) The diversity and evolution of pollination systems in large plant clades: Apocynaceae as a case study. Ann Bot. https://doi.org/10.1093/aob/mcy127
Park S, Ruhlman TA, Sabir JSM, Mutwakil MHZ, Baeshen MN, Sabir MJ, Baeshen NA, Jansen RK (2014) Complete sequences of organelle genomes from the medicinal plant Rhazya stricta (Apocynaceae) and contrasting patterns of mitochondrial genome evolution across asterids. BMC Genom 15:1–18. https://doi.org/10.1186/1471-2164-15-405
Ravi V, Khurana JP, Tyagy AK, Khurana P (2008) An update on chloroplast genomes. Plant Syst Evol 271:101–122. https://doi.org/10.1007/s00606-007-0608-0
Silva-Júnior JF, Lêdo AS (2006) A cultura da mangaba. Embrapa Tabuleiros Costeiros, Aracaju, p 253
Silva AVC, Santos ARF, Wickert E, Silva Júnior JF, Costa TS (2011) Divergência genética entre acessos de mangabeira (Hancornia speciosa Gomes). Revista Brasileira de Ciências Agrárias 6:572–578
Simões AO, Livshultz T, Conti E, Mary E (2007) Phylogeny and systematics of the Rauvolfioideae (Apocynaceae) based on molecular and morphological evidence. Ann Missouri Bot Gard 94:268–297
Simões AO, Kinoshita LS, Koch I, Silva MJ, Endress ME (2016) Systematics and character evolution of Vinceae (Apocynaceae). Taxon 65:99–122. https://doi.org/10.12705/651.7
Acknowledgements
We would like to thank the Federal University of Alagoas for the laboratories and scientific support, the Fundação de Apoio à Pesquisa de Alagoas (FAPEAL) for funding this Project, and the National Council for the Improvement of Higher Education (CAPES).
Author information
Authors and Affiliations
Contributions
EJSO performed the DNA library, de novo assembles. AM and CA analyzed data and wrote the manuscript.
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that they have no conflicts of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Fig. S1
Molecular phylogenetic analysis by the maximum likelihood method, with supported values estimated by bootstrap. (A) Phylogenetic tree using complete chloroplast genome and rcbL gene; (B) Phylogenetic tree using complete chloroplast genome and rcbL + matK genes; (C) Phylogenetic tree using complete chloroplast genome and rcbL + matK + trnH-psbA (TIFF 33,752 kb)
Rights and permissions
About this article
Cite this article
da Silva Oliveira, E.J., Marques, A. & Almeida, C. The chloroplast genome of Hancornia speciosa Gomes: structural organization and phylogenomic studies in Rauvolfioideae (Apocynaceae). Braz. J. Bot 42, 449–455 (2019). https://doi.org/10.1007/s40415-019-00549-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40415-019-00549-8