Abstract
Two complete chloroplast genome sequences of Asteropyrum, as well as those of 25 other species from Ranunculaceae, were assembled using both Illumina and Sanger sequencing methods to address the structural variation of the cp genome and the controversial systematic position of the genus. Synteny and plastome structure were compared across the family. The cp genomes of the only two subspecies of Asteropyrum were found to be differentiated with marked sequence variation and different inverted repeat-single copy (IR-SC) borders. The plastomes of both subspecies contains 112 genes. However, the IR region of subspecies peltatum carries 27 genes, whereas that of subspecies cavaleriei has only 25 genes. Gene inversions, transpositions, and IR expansion-contraction were very commonly detected in Ranunculaceae. The plastome of Asteropyrum has the longest IR regions in the family, but has no gene inversions or transpositions. Non-coding regions of the cp genome were not ideal markers for inferring the generic relationships of the family, but they may be applied to interpret species relationship within the genus. Plastid phylogenomic analysis using complete cp genome with Bayesian method and partitioned modeling obtained a fully resolved phylogenetic framework for Ranunculaceae. Asteropyrum was detected to be sister to Caltha, and diverged early from subfamily Ranunculoideae.
Similar content being viewed by others
Introduction
In recent years, the use of whole chloroplast (cp) genome data for plant phylogenetic reconstruction has been greatly improved our understanding of evolutionary relationships of angiosperms at a wide range of taxonomic levels1,2,3,4. Chloroplast genome shows uniparental inheritance in most angiosperm species and has a size ranging from 115 to 165 kb5. It usually has a conserved circular structure containing a large single copy (LSC) and a small single copy (SSC), which are separated by two copies of inverted repeat (IR) regions. Rates of nucleotide substitution in the cp genome are relatively slow and therefore can provide resolution of plant phylogeny at generic and familial levels6,7,8,9,10,11,12,13. The gene number and arrangement of the cp genome are often well conserved in angiosperms14. However, variations in the structure of the cp genome, including inversions, transpositions of certain regions, and expansion of IRs, are not uncommon in many families, including Ranunculaceae, Fabaceae, and Asteraceae11,12,15,16,17,18,19,20. Recent studies of the cp genomes of Ranunculales showed that inversions, transpositions, and IR expansions may provide strong phylogenetic information, and plastid phylogenomic analysis can yield infra-familial phylogeny with high resolution3,11,12,15,20.
Asteropyrum Drumm. et Hutch. is a small but very distinctive genus in Ranunculaceae distributed predominantly in China. It is famous for its great pharmaceutical value and can be used as substitutes of goldthread (Coptis Salisb.) for curing icterus, hydroncus, and diarrhea in rural areas of southern China21. The plants of Asteropyrum are small perennial herbs with simple peltate leaves, white sepals and small golden yellow petals (Fig. 1). Traditionally, the genus was considered to have two species, A. peltatum (Franch.) Drumm. et Hutch. and A. cavaleriei (Lévl. et Vant.) Drumm. et Hutch. They have different sizes and leaf shapes21,22,23. Asteropyrum peltatum is often smaller (usually less than 10 cm tall) than A. cavaleriei with suborbicular to inconspicuously 5-sided peltate leaves (Fig. 1A,B). Whereas, scape of A. cavaleriei is often 12–20 cm tall and the leaf is also much larger with five-angled blade (Fig. 1C,D). However, intermediate forms are common in the overlap zones of these two taxa (Fig. 1E,F). Thus, A. cavaleriei has often been treated as a subspecies of A. peltatum24.
The systematic position of Asteropyrum within Ranunculaceae has long been an interesting issue and disputed for almost a century. Drummond & Hutchinson25 separated this genus from Isopyrum L. and reported that Asteropyrum may be closely related to Caltha L. and Trollius L. However, later authors argued that this genus may be closely related to Isopyrum26, Copis21,27, or Calathodes Hook. f. & Thoms.23, and its chromosome type was also interpreted differently24,28,29,30,31.
Recent molecular phylogenetic studies also did not clearly resolve the position of Asteropyrum. It was found to be closely related to the (Beesia, (Eranthis, Cimicifuga)) clade by a study using rbcL and 26s rDNA32, or sister to Caltha by a study using three plastid regions, one nuclear region, and morphological data33, or sister to tribe Cimicifugeae by a study using five plastid and one nuclear regions34, and sister to Callianthemum C. A. Mey. in a study using four plastid, three nuclear, and one mitochondrial regions35. However, these studies did not obtain a robust phylogenetic framework of Ranunculaceae, and the systematic position of Asteropyrum in the family still remains to be clarified.
Plastid phylogenomic studies may be particularly suitable for resolving the generic relationship within Ranunculaceae, and structural variations of the cp genome, e.g., gene inversion, gene transposition, and IR expansion-contraction, may provide important systematic information about the family3,11,12,15,20. Zhai et al. carried out a plastid phylogenomic study focusing on Ranunculaceae based on a well sampling scheme15. They also discussed structural variation of plastid genome sequences of the family. However, their study focused on tribal relationship of Ranunculaceae and only included one sample of Asteropyrum. In their study, they detected and reported five types of plastome sequences of the family based on gene inversions and transpositions but left IR expansion-contraction (widely distributed and highly diverged in the family) issue unaddressed. For phylogenomic analyses, they used two matrices (coding region and complete plastome sequences) and three phylogenetic reconstruction methods (Parsimony, Maximum Likelihood, and Bayes) with no partitioned modeling applied the Bayesian analysis. The results gave a better resolution of the family. However, position of Asteropyrum was still not well supported.
In this study, we reported the complete cp genome sequences of both subspecies of Asteropyrum, as well as 25 plastome sequences from other genera in Ranunculaceae using genome skimming data. We then compared the synteny and plastome structure across the family and conducted a plastid phylogenomic study using partitioned modeling method to resolve deep-level relationships and explore plastome structural evolution across Ranunculaceae. The aims of these analyses were to clarify variation in the cp genome across the genus Asteropyrum, to detect structural variation in the Asteropyrum cp genome in comparison with other genera in the family, to infer the phylogenetic position of Asteropyrum within the family, and to try to compare and reconstruct the deep-level relationships of Ranunculaceae using different plastome partitions.
Results
Plastome organization and features of Asteropyrum and its relatives
We obtained 3.0 Gb and 3.2 Gb Next-Generation Sequencing (NGS) clean datasets for Asteropyrum peltatum ssp. peltatum (Franch.) Drumm. & Hutch. (16,552,494 reads) and A. peltatum ssp. cavaleriei (Lévl. & Vant.) Q. Yuan & Q. E. Yang (17,655,994 reads), respectively. Blat analysis was used to select 133,486 putative plastid reads for ssp. peltatum and 144,722 reads for ssp. cavaleriei. The reads from ssp. peltatum were used to obtain two large contigs from the de novo assembly (51,204 bp, and 81,421 bp). One gap was bridged using Sanger sequencing (with primers: LSCF: TGCGATGCTCTAACCTCTGAG; LSCR: AGAGCAATCCTAACCAGAATCATCT). For the ssp. cavaleriei data, only one large contig (131,836 bp), which included complete LSC, IR, and SSC regions was derived. Information regarding the cp genome assembly for all of the other newly sequenced samples from Ranunculaceae is presented in Supplementary Table S1.
The complete cp genome sequences of ssp. peltatum and ssp. cavaleriei are 164,455 bp and 164,274 bp, respectively (Fig. 2, Table 1), and the rate of identical sites between the two plastome sequences was 98.1%. The two complete cp genome sequences had similar GC content (38.0% for ssp. cavaleriei and 37.9% for ssp. peltatum). Within the circular plastid genome of Asteropyrum, the IR region had the richest GC content, followed by the LSC region, while the SSC region had the lowest GC content (Table 1).
The LSC region of ssp. cavaleriei was found to be longer than that of ssp. peltatum (84,284 bp vs. 81,819 bp), whereas the IR region of ssp. cavaleriei was significantly shorter than that of ssp. peltatum (31,429 bp vs. 32,659 bp), because the LSC region of ssp. cavaleriei carried two more genes (rpoA and rps11) at the border of the LSC and IR than that of ssp. peltatum. At the same time, the gene number of the IR region of ssp. cavaleriei was two less than that of ssp. peltatum (Fig. 2).
Because the gene numbers in the IR regions of the two subspecies differed, we designed two pairs of primers to assess the stability of the IR-SC boundary of both subspecies using additional samples. We also tested the IR-SC boundary of an intermediate form as well. The results showed that the IR-SC boundaries within each subspecies are stable. The intermediate individual collected from Yibin, Sichuan had an IR-SC boundary that was identical with that of ssp. cavaleriei. The features of the other 25 newly sequenced cp genomes and their gene maps are presented in Supplementary Table S2 and Supplementary Fig. S1.
Structural variation of the chloroplast genome in Ranunculaceae
When the LAGAN alignment program was applied, the mVISTA results showed a large proportion of un-matched area in tribe Anemoneae and the genus Adonis (Fig. 3), which indicated inversions and gene transpositions in the cp genomes of these plants. The shuffle-LAGAN method was used to obtain a well-matched global alignment for all plastome sequences of Ranunculales (Supplementary Fig. S2). Inversions and transpositions in cp genomes of tribe Anemoneae have been reported previously11,12,15,20, and the cp genome of Adonis was found to have a large inversion in the LSC region between rps16 and trnT-UGU, which contains 35 genes (43,411 bp in total length), including 21 protein-coding genes and 14 tRNA genes (Supplementary Fig. S1-Adonis). The inversion within Adonis showed no phylogenetic relationship with that in tribe Anemoneae.
IR expansion-contraction was found to be very common in Ranunculaceae and its relatives (Table 2, Supplementary Fig. S3). After comparing a wide range of angiosperm cp genomes, we identified the cp genome of Amborella as the standard IR composition (with 17 genes) that is commonly shared by most of angiosperms. In Ranunculaceae, the cp genomes of Asteropyrum, Dichocarpum W. T. Wang & P. K. Hsiao, Hydrastis L., and tribe Anemoneae (Anemone L., Pulsatilla Mill., Hepatica Mill., Anemoclema (Franch.) W. T. Wang, Clematis L., Naravelia DC., etc.) showed IR expansion, whereas IR contraction was detected in Helleborus L. and Ceratocephala Moen. (detailed IR information for all tested Ranunculaceae samples is presented in Supplementary Table S3). The longest IR in Ranunculales was found to be that of Berberis amurensis Rupr., which carries 32 genes. Asteropyrum showed the longest IR regions within the family Ranunculaceae (Fig. 2, Table 2).
Plastid phylogenomic analyses
Seven datasets and three methods were used to obtain 21 phylogenetic frameworks for the family Ranunculaceae. All of these frameworks were largely similar to each other (presented in Supplementary Fig. S4). Topology conflicts among different datasets or methods were usually not well supported statistically.
The parameters of the parsimony analyses are presented in Table 3. This analysis yielded less resolved phylogenies in comparison with the ML and Bayesian methods. The phylogenetic inferences using the parsimony method showed largely congruent results among the seven datasets. The complete cp genome dataset obtained the best resolved and supported phylogenetic tree, in which Asteropyrum was sister to Nigella with weak bootstrap support (BS = 62, Supplementary Fig. S4). This result was different from ML and Bayesian analyses (Asteropyrum sister to Caltha).
The topologies from the Bayesian and ML analyses were almost the same for each dataset, but the ML analyses also yielded less resolved and supported trees than the Bayesian method. Among all the 21 phylogenetic trees, the one obtained from the complete plastome sequence dataset with Bayesian method showed a fully resolved phylogeny with all the branches supported by PP value of 1 (Fig. 4). Therefore, the phylogenetic relationships discussed below were mainly based on this result.
Discussions
After comparing the first diverged Amborella36 and other angiosperm species, we determined that a total number of 113 genes (with 17 genes in IR region) in the cp genome can be set as a primitive cp genome structure based on its prevalence in angiosperms. Then, we summarized the structural variation of the cp genomes (including gene composition, gene inversion and transposition, and IR expansion-contraction) across Ranunculaceae based on all the available data.
Structural variations of the cp genome within Ranunculaceae have been reported11,12,15,20,37,38. Using restriction site mapping, Johansson38 reported large inversions and gene loss in plastomes of Adonis species. In this study, one inversion in A. coerulea, which was identical with the A. vernalis inversion described by Johansson38, was confirmed and located in detail. Zhai et al. reported the same inversion in A. sutchuenensis15. We also detected one gene loss (rpl32) in A. coerulea (Supplementary Table S3). The inversion within Adonis plastomes could be a synapomorphy within the genus. Structural variations of cp genome have been reported within tribe Anemoneae by previous studies11,12,15,20,37. In comparison with Adonis, those cp genome structural variations of tribe Anemoneae differed in size and number. No evidence indicates that these inversions in the cp genomes of Adonis and tribe Anemoneae are phylogenetically related.
IR expansion-contraction is also an important process involved in cp genome variation within Ranunculaceae. Based on a broad comparison with other angiosperm species, ten genera (Asteropyrum, Clematis, Archiclematis, Naravelia, Pulsatilla, Anemone, Anemoclema, Hepatica, Dichocarpum, and Hydrastis) were found to exhibit IR expansion, whereas two genera (Helleborus and Ceratocephala) showed IR contraction. Some of these IR expansion events may be phylogenetically informative. For example, within the well supported tribe Anemoneae, a single IR-expansion, which may be a synapomorphy of the plastome variation in the tribe, was detected12. However, in other case, IR expansion-contraction may occur independently in some genera. For example, the cp genomes from two species of Dichocarpum showed different IR regions. One species, D. dalzielii, had a normal 17-gene IR, whereas the other, D. sutchuenense, carried expanded IR regions (19 genes) (Table 2). On the other hand, IR contraction and gene loss in Helleborus and Ceratocephala (Table 2) also evolved independently within Ranunculaceae because these two genera were separated phylogenetically (Fig. 4).
The genus Asteropyrum did not have gene inversions or transpositions in their cp genomes, but they showed significant IR expansion and carried the longest IR regions in the family (Table 2). Different gene compositions in IR regions of the two subspecies were detected (Fig. 2, Table 2). Two more genes (rpoA and rps11) were present in the IR region of ssp. peltatum, and this variation is stable within the subspecies. These phenomena suggested that plastome structural variation can occur not only among genera but also within a single species.
In the present study, an intermediate sample showed the same IR-SC boundary with A. peltatum ssp. cavaleriei. Morphologically, ssp. peltatum and ssp. cavaleriei can be easily distinguishable by their size and shape of leaves. Intermediate forms between the two subspecies in morphology and palynology are sometimes present at their overlapping zones24 and can be either more similar to ssp. peltatum or to ssp. cavaleriei. The intermediate individual sampled in this study was small in its leaf and scape size but with slightly five-angled leaves (Fig. 1E,F). According to Yuan & Yang24, it can be recognized as an intermediate form but more similar to ssp. cavaleriei. The IR-SC boundary indicated that this intermediate individual could be a hybrid one between the two subspecies with ssp. cavaleriei as its maternal parent.
A large number of molecular phylogenetic as well as several phylogenomic analyses have been conducted for Ranunculaceae33,34,35,39,40,41,42,43,44,45. However, these studies have suffered from poor resolution largely due to an insufficient phylogenetic signal or insufficient sampling. Zhai et al. carried out a well sampled plastid phylogenomic study on Ranunculaceae15, and they obtained a better phylogenetic framework of the family. However, this study did not separate the plastid sequence in detail and the substitution model was tested without partitioning. Thus, we still do not know the diversification and phylogenetic resolving ability of different partition of the plastid genome sequences.
In this study, phylogenetic relationships within Ranunculaceae were inferred from six separate datasets, as well as a complete plastome sequence data. Among all the six separate datasets, intergenic spacer data showed the highest rate of informative sites (45.37%). Whereas, the IR region showed the lowest rate of informative sites (8.50%), indicating that it is the most conserved region in the entire plastid genome (Table 3). The phylogenetic resolution of IR region was also shown to be the worst among the seven datasets (Supplementary Fig. S4) because of its small number of phylogenetic signals.
All the phylogenetic trees inferred by different datasets were largely congruent (Supplementary Fig. S4). Hydrastis and Coptis consistently appeared as the earliest diverged lineages in the family. Major clades, including subfamily Thalictroideae, tribe Adonideae, tribe Ranunculeae, tribe Anemoneae, tribe Cimicifugeae, and tribe Delphinieae were resolved in all the analyses. However, the systematic positions of Asteropyrum, Caltha, Helleborus, Callianthemum and Nigella were found to be unstable and often had low statistical support values using the six separate datasets. These genera may have undergone unusual evolutionary processes, such as ancient hybridization or rapid radiation, coupled with their origin and evolution process.
Because of the different rates and patterns of nucleotide substitutions among the cp genome sequences46, data partitioning methods are required for phylogenetic reconstruction to ensure the accuracy of the analysis2,47,48,49,50. Our results showed that Bayesian analyses with partitioned models always obtained the best resolved Ranunculaceae phylogeny for each dataset. Using Bayesian analysis, the complete plastome dataset obtained a fully resolved phylogeny for Ranunculaceae (Fig. 4), which was better resolved than all the previous molecular and phylogenomic studies15,33,34,35,39,40,41,42,43,44,45.
In general, the Bayesian phylogeny inferred from the complete cp genome sequences is largely congruent with the phylogeny obtained by Cossard et al.35 Using eight DNA fragments from the chloroplast, mitochondrial, and nuclear genomes, Cossard et al. identified the sister relationship of subfamilies Thalictroideae and Adonideae with insignificant statistical support (PP = 0.80)35. This relationship was not resolved by most previous molecular phylogenetic studies32,33,40,41,42,44,51,52. In the present study, our plastid phylogenomic analysis resolved the sister relationship between subfamilies Thalictroideae and Adonideae (Fig. 4), and supported the hypothesis of Cossard et al.
From the Bayesian analysis, we detected the sister relationship of Callianthemum and Helleborus (Fig. 4) which was not resolved by all the previous studies of Ranunculaceae phylogeny32,33,40,41,42,44,51,52. The clade of Callianthemum and Helleborus was found to be related to a well-supported clade of tribe Ranunculeae + tribe Anemoneae. The sister relationship of tribe Cimicifugaea and the tribe Delphinieae + tribe Nigellaea clade (Fig. 4) was resolved by our analysis. This relationship was previously reported by Hoot but without sufficient statistical support41.
The phylogenetic position of Asteropyrum has been disputed for a century and still remains to be resolved by now20,24,32,35,53. In this study, the sister relationship of Asteropyrum and Caltha was discovered using the complete cp genome datasets with ML and Bayesian methods. This clade was further found to be first diverged from subfamily Ranunuloideae (Fig. 4). Wang et al. also proposed sister relationship of Asteropyrum and Caltha by combining molecular and morphological data, but their results was not statistically supported32. It is noteworthy that Cossard et al. resolved a sister relationship of Asteropyrum and Callianthemum with a strong support value using a combination of chloroplast, mitochondrial, and nuclear genes35. This unexpected result was solely contributed by the nuclear gene RanaCYL1 dataset. The contradictory results by our cp genome analysis and nuclear RanaCYL1 data may be caused by either an ancient hybridization event or incomplete lineage sorting of nuclear RanaCYL1. Further studies focused on this issue may be conducted using additional markers from nuclear genome.
In the Bayesian phylogram, many major clades of Ranunculaceae had a very short branch length, including the Asteropyrum + Caltha clade, Cimicifugeae + Delphinieae + Nigelleae clade, and Ranunculeae to Nigelleae clade (Fig. 4). This indicated ancient lineage radiation of the subfamily Ranunculoideae, which was also proposed by Zhai et al.15 Thus, it is not surprising that previous molecular phylogenetic analyses using limited molecular markers failed to resolve the phylogenetic framework of Ranunculaceae. Further phylogenomic studies using additional evidence, such as mitochondrial and nuclear genomic data, may help to deeper our understanding of the evolution of Ranunculaceae.
Conclusion
The two subspecies of Asteropyrum carried quite different plastid genomes with different IR-SC borders and much sequence variation. The plastome sequence of Asteropyrum has the longest IR regions in the family. Unlike Adonis and species from tribe Anemoneae, no gene inversions and transpositions were detected in the Asteropyrum cp genome. The complete cp genome showed excellent suitability for drawing phylogenetic inferences within Ranunculaceae. The complete cp genome sequence, as well as its structural variation (gene inversions-transpositions and IR expansion-contraction), can provide abundant phylogenetic information for the family. In contrast, non-coding regions have excessive variations and a high level of noise, so they are not suitable for resolving the generic relationships within the family. As mentioned by Ma et al., analyses in plastid genomic studies should always be conducted using partitioned datasets, and parsimony analysis often obtains unsatisfactory results2. Our plastid phylogenomic inferences, which were obtained using the complete cp genome sequence and Bayesian analysis, provided a better resolved phylogenetic framework for Ranunculaceae in comparison with all the previous studies. Asteropyrum was detected to be closely related to Caltha, and unusual ancient evolutionary processes for Asteropyrum was also suggested by our findings.
Methods
Taxon sampling and sequencing
Fresh leaves were collected from plants from field by the authors (Table 4) and the samples of Nigella and Helleborus were from cultivated plants (seeds were obtained from Mr. Fothergill’s Seeds Limited Company). The leaf samples were dried with tell-tale silica-gel. Samples from both subspecies of Asteropyrum were collected from different populations. We also sequenced 25 other species from Ranunculaceae and mined 21 accessions of complete cp genomes (including five outgroups from Ranunculales) from GenBank for comparative analysis. The samples covered 31 genera and represented most tribes of Ranunculaceae, but did not include the basal clades of Trib. Glaucidieae and Trib. Xanthorhizeae33 (Table 4). For all leaf tissue samples, total genomic DNA was isolated using the CTAB method54 and assessed by agarose gel electrophoresis.
The total DNA samples of the 27 newly sequenced species were sent to Novogene (http://www.novogene.com, China) for library construction and next-generation sequencing. Short-insert (350 bp) paired-end read library preparation and 2 × 150 bp sequencing were performed on an Illumina (HiSeq4000) genome analyzer platform. Approximately 2–4 Gb of raw data for each species were first filtered using the FASTX-Toolkit to obtain high-quality clean data by removing adaptors and low-quality reads (http://hannonlab.cshl.edu/fastx_toolkit/download.html). The remaining clean reads (high-quality reads) were sent to the authors for further analysis.
Chloroplast genome assembly and annotation
For the clean reads, BLAT analysis was used to exclude nuclear and mitochondrial reads using published plastid genome sequences from Ranunculaceae as references55. Next, de novo assembly was performed using Geneious R11 with a medium-low sensitivity setting56 to assemble plastid genome sequences. For most of the tested samples, only one contig (approximately 130 kb) was obtained by de novo assembly. If more than one smaller contig was obtained, the whole-genome reads were mapped to those contigs using the Fine Tuning program in Geneious R11 (iterating up to 100 times) to fill gaps. Contigs were connected by overlapping their terminal sequences using the Repeat Finder program implemented in Geneious R11. Sanger sequencing was also used to bridge gaps when necessary. When a 130 kb contig (including SSC, IR, and LSC) was built for each sample, the IR region was determined using the Repeat Finder program, after which the IR region was inverted and copied manually to construct the complete cp genome sequence. Because great variation was found at the IR-SC boundaries in the two Asteropyrum plastomes, we subjected these regions to Sanger sequencing. Broader population sampling of the two Asteropyrum subspecies, as well as an intermediate individual, was applied to assess the stability of the variation at IR-SC boundaries using Sanger sequencing (Supplementary Tables S4 and S5).
Complete plastid genomes were annotated using the Unix program Plann 1.1.257 and manually verified using Geneious Annotate R11 and the online program Blast58. The cp genome sequences and annotations were uploaded to GenBank using Bankit (https://www.ncbi.nlm.nih.gov/books/NBK63590/). Accession numbers are shown in Table 4. Illustrations of all the newly sequenced plastomes were obtained using the Organellar Genome DRAW tool59.
Comparative chloroplast genomic analyses for two subspecies of Asteropyrum and their relatives
The IR/SC boundaries of Asteropyrum and the other tested species were illustrated and compared with other outgroups to address IR expansion-contraction. MAFFT v7.30960 was used to align the plastome sequences, whereas mVISTA61 was used to export visual results to allow evaluation of the structural similarity of plastomes. The alignment programs applied in mVISTA were LAGAN, which produces true multiple alignments regardless of whether they contain inversions or not, and Shuffle-LAGAN, which can detect rearrangements and inversions62,63. The sequence diversification of the two subspecies of Asteropyrum was assessed by comparing the two aligned sequences. The rate of identical sites was calculated using Geneious R11.
Plastid phylogenomic analysis
In this study, 48 cp genome sequences, including those of five outgroups from Ranunculales, were aligned for phylogenomic analysis. The gene orders for the cp genome sequences of Tribe Anemoneae and Adonis (in which inversions and/or transposition regions were present) were shuffled in the same order with other Ranunculaceae and outgroup species. The following data sets were applied for phylogenetic reconstruction: complete cp genome sequence (with only one IR region), LSC, SSC, IR, coding regions (CDs), intron, and intergenic spacer regions. All datasets were aligned using MAFFT v7.30960. Ambiguous alignments and sites with more than 80% missing data4 were deleted automatically using a Python script (https://github.com/HeJian151004/get_homology).
Maximum Parsimony (MP) analysis was conducted for all the seven datasets using PAUP v4.0b1064. Characters were treated as unordered and equally weighted, whereas gaps were treated as missing data. Branch-and-Bound or a 1000-replicate heuristic search was applied with simple addition, and tree bisection reconnection branch swapping with MUL-trees was utilized to search the MP tree(s). Statistical support was assessed by 1000 bootstrap replicates with 1000 random taxon addition replicates and 10 trees held at each step.
Maximum likelihood (ML) analyses were carried out with RAxML-HPC2 v8.2.1065 performed on the online server (https://www.phylo.org/). The GTR + G model was applied for all datasets as suggested by the software instructions (see RAxML manual). The statistical value was inferred using the combined rapid bootstrap method (1000 replicates).
Bayesian inference (BI) was performed with MrBayes v3.2.366 using partitioned substitution models tested by PartitionFinder v2.1.167 with a minimum subset size of 5000, because excessively parameter-rich models often cause calculation problems in Bayesian analysis and fail to converge2,50. The best substitution models and data partition schemes were selected by Akaike information criterion (AIC)68. Two parallel independent Markov chain Monte Carlo (MCMC) chains were run, each of which consisted of three hot chains and one cold chain for 5,000,000 generations. The trees were sampled and saved every 100 generations. The MCMC convergence was tested by calculating the standard deviation value of split frequencies (less than 0.01) and by assessing the convergence of the parameter values of the two runs. The first 25% of trees were discarded as burn-in, and the remaining trees were used to generate the consensus tree.
Data availability
The new sequenced plastome sequenced are all deposited at NCBI and seen in Table 4.
References
Moore, M. J., Soltis, P. S., Bell, C. D., Burleigh, J. G. & Soltis, D. E. Phylogenetic analysis of 83 plastid genes further resolves the early diversification of eudicots. Proc. Natl. Acad. Sci. USA 107, 4623–4628 (2010).
Ma, P. F., Zhang, Y. X., Zeng, C. X., Guo, Z. H. & Li, D. Z. Chloroplast phylogenomic analyses resolve deep-level relationships of an intractable bamboo tribe Arundinarieae (Poaceae). Syst. Biol. 63, 933–950 (2014).
Sun, Y. X. et al. Complete plastome sequencing of both living species of Circaeasteraceae (Ranunculales) reveals unusual rearrangements and the loss of the ndh gene family. BMC Genomics 18, 592, https://doi.org/10.1186/s12864-017-3956-3 (2017).
Sun, Y. X. et al. Plastome phylogenomics of the early-diverging eudicot family Berberidaceae. Mol. Phylogenet. Evol. 128, 203–211 (2018).
Ravi, V., Khurana, J. P., Tyagi, A. K. & Khurana, P. An update on chloroplast genomes. Pl. Syst. Evol. 271, 101–122 (2008).
Waters, D. L. E., Nock, C. J., Ishikawa, R., Rice, N. & Henry, R. J. Chloroplast genome sequence confirms distinctness of Australian and Asian wild rice. Ecol. Evol. 2, 211–217 (2012).
Davis, C. C., Xi, Z. & Mathews, S. Plastid phylogenomics and green plant phylogeny: almost full circle but not quite there. BMC Biol. 12, 11, https://doi.org/10.1186/1741-7007-12-11 (2014).
Saarela, J. M. et al. Plastid phylogenomics of the cool-season grass subfamily: clarification of relationships among early-diverging tribes. AoB Pl. 7, plv046, https://doi.org/10.1093/aobpla/plv046 (2015).
Ross, T. G. et al. Plastid phylogenomics and molecular evolution of Alismatales. Cladistics 32, 160–178 (2016).
Zhang, S. D. et al. Diversification of Rosaceae since the Late Cretaceous based on plastid phylogenomics. New Phytol. 214, 1355–1367 (2017).
Liu, H. J. et al. Complete chloroplast genomes of Archiclematis, Naravelia and Clematis (Ranunculaceae), and their phylogenetic implications. Phytotaxa 343, 214–226, https://doi.org/10.11646/phytotaxa.343.3.2 (2018).
Liu, H. J. et al. Comparative analysis of complete chloroplast genomes of Anemoclema, Anemone, Pulsatilla, and Hepatica revealing structural variations among genera in Tribe Anemoneae (Ranunculaceae). Front. Pl. Sci. 9, 1097, https://doi.org/10.3389/fpls.2018.01097 (2018).
Yan, M. H. et al. Plastid phylogenomics resolves infrafamilial relationships of the Styracaceae and sheds light on the backbone relationships of the Ericales. Mol. Phylogenet. Evol. 121, 198–211 (2018).
Wolfe, K. H., Li, W. H. & Sharp, P. M. Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs. Proc. Natl. Acad. Sci. USA 84, 9054–9058 (1987).
Zhai, W. et al. Chloroplast genomic data provide new and robust insights into the phylogeny and evolution of the Ranunculaceae. Mol. Phylogen. Evol. 135, 12–21 (2019).
Kim, K. J., Choi, K. S. & Jansen, R. K. Two chloroplast DNA inversions originated simultaneously during the early evolution of the sunflower family (Asteraceae). Mol. Biol. Evol. 22, 1783–1792 (2005).
Raubeson, L. A. & Jansen, R. K. Chloroplast genomes of plants in Diversity and Evolution of Plants-Genotypic Variation in Higher Plants (ed. Henry, R.) 45–68 (CABI Press, 2005).
Jansen, R. K. & Ruhlman, T. A. Plastid genomes of seed plants in Genomics of Chloroplast and Mitochondria (eds Bock, R. & Knoop, V.) 103–126 (Springer, 2012).
Walker, J. F., Zanis, M. J. & Emery, N. C. Comparative analysis of complete chloroplast genome sequence and inversion variation in Lasthenia burkei (Madieae, Asteraceae). Am. J. Bot. 101, 722–729 (2014).
Jiang, N., Zhou, Z., Yang, J. B. & Yu, W. B. Complete chloroplast genome of Anemoclema glaucifolium (Ranunculaceae), a vulnerable and threatened species endemic to the Hengduan Mountains. Conserv. Genet. Resour. 10, 601–604 (2018).
Hsiao, P. K. Asteropyrum Drumm. et Hutch. in Flora Reipublicae Popularis Sinicae Vol. 27 (ed. Wang, W. T.) 598–601 (Science Press, 1979).
Tamura, M. Asteropyrum Drumm. et Hutch. in Die Natürlichen Pflanzenfamilien, Band 17 a IV (ed. Heipko, P.) 443–444 (Duncker & Humblot, 1995).
Fu, D. Z. & Robinson, O. R. Asteropyrum Drumm. et Hutch. in Flora of China. Vol. 6 (eds Wu, Z. Y. & Raven, P.) 280–281 (Science Press & Missouri Botanical Garden Press, 2001).
Yuan, Q. & Yang, Q. E. Cytology, palynology, and taxonomy of Asteropyrum and four other genera of Ranunculaceae. Bot. J. Linn. Soc. 152, 15–26 (2006).
Drummond, J. R. & Hutchinson, J. A revision of Isopyrum (Ranunculaceae) and its nearer allies. Kew Bull. 1920, 145–169 (1920).
Janchen, E. Die Systematische Gliederung der Ranunculaceen und Berberidaceen. Denkschriften der Akademie der Wissenschaften in Wien Mathematisch-Naturwissenschaftliche Klasse. 108, 1–82 (1949).
Tamura, M. Morphology, ecology and phylogeny of the Ranunculaceae VIII. Sci. Rep. Osaka. Univ. 17, 41–56 (1968).
Fu, D. Z. Phylogenetic consideration on the subfamily Thalictroideae (Ranunculaceae). Cathaya 2, 181–190 (1990).
Yang, Q. E., Gong, X., Gu, Z. J. & Wu, Q. A. A karyomorphological study of five species in the Ranunculaceae from Yunnan, with a special consideration on systematic positions of Asteropyrum and Calathodes. Acta. Bot. Yunnan. 15, 179–190 (1993).
Yang, Q. E., Luo, Y. B. & Hong, D. Y. A karyotypic study of six species in the Ranunculaceae from Hunan, C. China. Guihaia 14, 27–36 (1994).
Loconte, H., Campbell, L. M. & Stevenson, D. W. Ordinal and familial relationships of Ranunculid genera. Pl. Syst. Evol. 9, 99–118 (1995).
Wang, W., Li, R. Q. & Chen, Z. D. Systematic position of Asteropyrum (Ranunculaceae) inferred from chloroplast and nuclear sequences. Pl. Syst. Evol. 255, 41–54 (2005).
Wang, W., Lu, A. M., Ren, Y., Endress, M. E. & Chen, Z. D. Phylogeny and classification of Ranunculales: evidence from four molecular loci and morphological data. Perspect. Pl. Ecol. Evol. Syst. 11, 81–110 (2009).
Wang, W. et al. The rise of angiosperm-dominated herbaceous floras: Insights from Ranunculaceae. Sci. Rep. 6, 27259, https://doi.org/10.1038/srep27259 (2016).
Cossard, G. et al. Subfamilial and tribal relationships of Ranunculaceae: evidence from eight molecular markers. Pl. Syst. Evol. 302, 419–431 (2016).
Goremykin, V. V., Hirschernst, K. I., Wolfl, S. & Hellwig, F. H. Analysis of the Amborella trichopoda chloroplast genome sequence suggests that Amborella is not a basal Angiosperm. Mol. Biol. Evol. 20, 1499–1505 (2003).
Hoot, S. B. & Palmer, J. D. Structural rearrangements, including parallel inversions, within the chloroplast genome of Anemone and related genera. Mol. Evol. 38, 274–281 (1994).
Johansson, J. T. There large inversions in the chloroplast genomes and one loss of the chloroplast gene rps16 suggest an early evolutionary split in the genus Adonis (Ranunculaceae). Pl. Syst. Evol. 218, 133–143 (1999).
Johansson, J. T. & Jansen, R. K. Chloroplast DNA variation among five species of Ranunculaceae: structure, sequence divergence, and phylogenetic relationships. Pl. Syst. Evol. 178, 9–25 (1991).
Johansson, J. T. & Jansen, R. K. Chloroplast DNA variation and phylogeny of the Ranunculaceae. Pl. Syst. Evol. 187, 29–49 (1993).
Hoot, S. B. Phylogeny of the Ranunculaceae based on preliminary atpB, rbcL and 18S nuclear ribosomal DNA sequence data. Pl. Syst. Evol. 9, 241–252 (1995).
Johansson, J. T. A revised chloroplast DNA phylogeny of the Ranunculaceae. Pl. Syst. Evol. 9, 253–262 (1995).
Kosuge, K., Sawada, K. & Denda, T. Phylogenetic relationships of some genera in the Ranunculaceae based on alcohol dehydrogenase genes. Pl. Syst. Evol. 9, 263–271 (1995).
Ro, K. E., Keener, C. S. & McPheron, B. A. Molecular phylogenetic study of the Ranunculaceae: Utility of the nuclear 26S ribosomal DNA in inferring intrafamilial relationships. Mol. Phylogenet. Evol. 8, 117–127 (1997).
Lane, A. K. et al. Phylogenomic analysis of Ranunculales resolves branching events across the order. Bot. J. Linn. Soc. 187, 157–166 (2018).
Palmer, J. D. Comparative organization of chloroplast genomes. Ann. Rev. Genet. 19, 325–354 (1985).
Brown, J. M. & Lemmon, A. R. The importance of data partitioning and the utility of Bayes factors in Bayesian phylogenetics. Syst. Biol. 56, 643–655 (2007).
Li, C. H., Lu, G. Q. & Orti, G. Optimal data partitioning and a test case for ray-finned fishes (Actinopterygii) based on ten nuclear loci. Syst. Biol. 57, 519–539 (2008).
Ward, P. S., Brady, S. G., Fisher, B. L. & Schultz, T. R. Phylogeny and biogeography of dolichoderine ants: Effects of data partitioning and relict taxa on historical inference. Syst. Biol. 59, 342–362 (2010).
Marshall, D. C. Cryptic failure of partitioned Bayesian phylogenetic analyses: Lost in the land of long trees. Syst. Biol. 59, 108–117 (2010).
Lehnebach, C. A. et al. Phylogenetic relationships of the monotypic Peruvian genus Laccopetalum (Ranunculaceae). Pl. Syst. Evol. 264, 109–116 (2007).
Hoot, S. B., Kramer, J. & Arroyo, M. T. K. Phylogenetic position of the South American dioecious genus Hamadryas and related Ranunculeae (Ranunculaceae). Int. J. Plant. Sci. 169, 433–443 (2008).
Zhao, L., Wang, W., Ren, Y. & Bachelier, J. B. Floral development in Asteropyrum (Ranunculaceae) implications for its systematic position. Ann. Bot. Fenn. 49, 31–42 (2012).
Doyle, J. J. & Doyle, J. L. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull. 19, 11–15 (1987).
Kent, W. J. BLAT–The BLAST-like alignment tool. Genome Res. 12, 656–664 (2002).
Kearse, M. et al. Geneious. Bioinformatics 2012, 1647–1649 (2012).
Huang, D. I. & Cronk, Q. C. B. Plann: A command-line application for annotating plastome sequences. Appl. Pl. Sci. 3, 1500026 (2015).
Gish, W. & States, D. J. Identification of protein coding regions by database similarity search. Nat. Genet. 3, 266–272 (1993).
Lohse, M., Drechsel, O., Kahlau, S. & Bock, R. OrganellarGenomeDRAW–a suite of tools for generating physical maps of plastid and mitochondrial genomes and visualizing expression data sets. Nucleic. Acids. Res. 41, W575–W581 (2013).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Frazer, K. A., Pachter, L., Poliakov, A., Rubin, E. M. & Dubchak, I. VISTA: computational tools for comparative genomics. Nucleic. Acids. Res. 32, W273–W279 (2004).
Brudno, M. et al. Comparative sequencing program. LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA. Genome Res. 13, 721–731 (2003).
Brudno, M. et al. Global alignment: finding rearrangements during alignment. Bioinformatics. 19, i54–i62 (2003).
Swofford, D. L. PAUP*: Phylogenetic analysis using parsimony, version 4.0 b10 (2003).
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 30, 1312–1313 (2014).
Ronquist, F. & Huelsenbeck, J. P. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 19, 1572–1574 (2003).
Lanfear, R., Frandsen, P. B., Wright, A. M., Senfeld, T. & Calcott, B. PartitionFinder 2: new methods for selecting partitioned models of evolution for molecular and morphological phylogenetic analyses. Mol. Biol. Evol. 34, 772–773 (2016).
Posada, D. & Buckley, T. R. Model selection and model averaging in phylogenetics: advantages of Akaike information criterion and Bayesian approaches over likelihood ratio tests. Syst. Biol. 53, 793–808 (2004).
Kong, H. H., Liu, W. Z., Yao, G. & Gong, W. Characterization of the whole chloroplast genome of a rare and endangered species Aconitum reclinatum (Ranunculaceae) in the United States. Conserv. Genet. Resour. 10, 165–168 (2018).
Park, K. T. & Park, S. Complete chloroplast genome of Clematis fusca var. coreana (Ranunculaceae). Mitochondrial DNA Part A. 27, 4056–4058, https://doi.org/10.3109/19401736.2014.1003841 (2016).
Li, M. et al. The complete chloroplast genome sequence of Clematis terniflora DC. (Ranunculaceae). Mitochondrial DNA Part A. 27, 2470–2472, https://doi.org/10.3109/19401736.2015.1033702 (2016).
Zhang, Y. Y., Dong, J. X., Feng, Y., Wang, Z. S. & Li, P. The first complete chloroplast genome of Coptis quinquesecta, a critically endangered medicinal plant in China. Mitochondrial DNA Part B-Resour. 3, 370–372, https://doi.org/10.1080/23802359.2018.1450684 (2018).
Kim, Y. K., Park, C. W. & Kim, K. J. Complete chloroplast DNA sequence from a Korean endemic genus, Megaleranthis saniculifolia, and its evolutionary implications. Mol. Cells. 27, 365–381 (2009).
Raubeson, L. A. et al. Comparative chloroplast genomics: analyses including new sequences from the angiosperms Nuphar advena and Ranunculus macranthus. BMC Genomics 8, 174, https://doi.org/10.1186/1471-2164-8-174 (2007).
Ye, W. Q., Yap, Z. Y., Li, P., Comes, H. P. & Qiu, Y. X. Plastome organization, genome-based phylogeny and evolution of plastid genes in Podophylloideae (Berberidaceae). Mol. Phylogen. Evol. 127, 978–987 (2018).
Sun, Y. X. et al. Phylogenomic and structural analyses of 18 complete plastomes across nearly all families of early-diverging eudicots, including an angiosperm-wide analysis of IR gene content evolution. Mol. Phylogen. Evol. 96, 93–101 (2016).
Park, S., Jansen, R. K. & Park, S. Complete plastome sequence of Thalictrum coreanum (Ranunculaceae) and transfer of the rpl32 gene to the nucleus in the ancestor of the subfamily Thalictroideae. BMC Pl. Biol. 15, 40, https://doi.org/10.1186/s12870-015-0432-6 (2015).
Acknowledgements
We thank Mr. Yang Zongzong, Mr. He Qian, Dr. Zhu Xinxin, and Dr. He Li for kindly providing valuable plant samples from Callianthemum, Oxygraphis, Delphinium, and Asteropyrum. This study was financially supported by the Medium- and Long-term Scientific Study Projects for Young Teachers of Beijing Forestry University (grant no. 2015ZCQ-BH-03 to Lei Xie), the National Natural Science Foundation of China (grant no. 31670207 to Lei Xie), and the Beijing Natural Science Foundation (grant no. 5182016 to Lei Xie). The funders are not responsible to the design and scientific point of this study.
Author information
Authors and Affiliations
Contributions
J.H., M.Y. and R.D.L. contributed equally to this paper with regard to second generation sequencing and data processing. L.L.L., H.J.L., S.X.Y. and L.Y.P. helped with plant material collection. J.C. and L.X., obtained funding conceived and wrote the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
He, J., Yao, M., Lyu, RD. et al. Structural variation of the complete chloroplast genome and plastid phylogenomics of the genus Asteropyrum (Ranunculaceae). Sci Rep 9, 15285 (2019). https://doi.org/10.1038/s41598-019-51601-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-019-51601-2
- Springer Nature Limited
This article is cited by
-
Sequence characteristics, genetic diversity and phylogenetic analysis of the Cucurbita ficifolia (Cucurbitaceae) chloroplasts genome
BMC Genomics (2024)
-
Complete chloroplast genomes of eight Delphinium taxa (Ranunculaceae) endemic to Xinjiang, China: insights into genome structure, comparative analysis, and phylogenetic relationships
BMC Plant Biology (2024)
-
Genetic characterization and phylogenetic analysis of the Nigella sativa (black seed) plastome
Scientific Reports (2024)
-
Insight into chloroplast genome structural variation of the Mongolian endemic species Adonis mongolica (Ranunculaceae) in the Adonideae tribe
Scientific Reports (2023)
-
Complete chloroplast genome and phylogenetic analysis of Anemone shikokiana
Molecular Biology Reports (2023)