Introduction

Polyploidization, or whole-genome duplication, has played a critical role in plant speciation and diversification by promoting adaptation to new ecological niches or conferring reproductive isolation (Grant 1971, 1981; Stebbins 1950, 1971; Otto and Whitton 2000; Levin 2002; Soltis et al. 2009; Otto 2007; Paun et al. 2007; Arrigo and Barker 2012). Phylogenetic reconstruction has provided systematists further insights into the assessment of phylogenetic relationships, genome origin and evolution of polyploid species (Soltis and Soltis 1999; Wendel and Doyle 2004; Slotte et al. 2006; Brassac and Blattner 2015). Maternally inherited chloroplast DNA (cpDNA) and biparentally inherited nuclear DNA (nrDNA) sequences have long been treated as a source of phylogenetic data to document origin or evolution of polyploid species (Sharbel and Mitchell-Olds 2001; Abbott and Lowe 2004; Aïnouche et al. 2004; Doyle et al. 2004; Jakobsson et al. 2006; Fortune et al. 2008). Comparison of gene trees derived from different genomes can provide unambiguous evidence of polyploidization and estimate of relative timing without requiring fossil calibration of a tree (Yuan and Olmstead 2008; Yost et al. 2013).

Ranunculaceae represents one of the earliest diverging lineages among the eudicots (APG IV 2016; Wang et al. 2009; Sun et al. 2011). The genus Aconitum L. is a member of the tribe Delphinieae in the Ranunculaceae family. Many species in this genus are proved to be essential components in the formulations of traditional Chinese herbal medicine. Even though some species are highly toxic, they possess variety of medicinal importance (Zhao et al. 2010; Semenov et al. 2016; Wada et al. 2016; Liang et al. 2017). Aconitum currently comprises two subgenera, A. subg. Lycoctonum and A. subg. Aconitum (Jabbour and Renner 2012; Wang et al. 2013). Subgenus Lycoctonum contains approximately 40 species distributed in Eurasia and eastern North America, with approximately 20 species reported in China (Tamura 1995; Gao 2009). Subgenus Lycoctonum differs from subg. Aconitum by its perennial rhizomes, whereas the latter has biennial paired tuberous roots (Tamura and Lauener 1979). Subgenus Lycoctonum has more polyploid members with in-depth investigations of chromosome number, karyotype and geographical distribution (Yang et al. 1994; Yang 1996, 1999, 2001; Gao 2009; Hong et al. 2016). In subg. Lycoctonum, A. angustius (W.T.Wang) W.T.Wang was first revealed to be tetraploid with 32 chromosomes, which are larger than other chromosomes of the tetraploid species in subg. Lycoctonum. The 32 chromosomes can be arranged into eight groups, but four chromosome quartets are heteromorphic with differences in size or morphology (Shang and Lee 1984; Gao et al. 2012; Hong et al. 2016). This tetraploid species was suggested to be widely distributed in southern and central China (Gao 2009; Gao et al. 2012; Hong et al. 2016). An additional four species, including A. brevicalcaratum (Finet and Gagnep.) Diels, A. chrysotrichum W. T. Wang, A. crassiflorum Hand.-Mazz., A. rilongense Kadota, and one variety of A. brevicalcaratum var. parviflorum Chen & Liu, were also determined to be tetraploid (2n = 4x = 32). These five tetraploids are all concentrated in the Hengduan Mountains (HDM) (Yang et al. 1994; Yuan and Yang 2006). The chromosomes of those five tetraploids are similar in size and karyotype constitutions, which are smaller and distinct from that of A. angustius. Eight groups of four homeologous can be fairly well arranged for the chromosomes of the five tetraploids (Hong et al. 2016). The only known hexaploid (2n = 6x = 48) is A. apetalum (Huth) B. Fedtsch with a restricted range in Tianshan Mountains in Xinjiang, northwestern China (Gao 2009). It represents the highest ploidy level currently known in subg. Lycoctonum and is the second polyploid species found to occur outside of the HDM. In summary, a total of six polyploid species and one variety in subg. Lycoctonum have been identified, which are all endemically distributed in China.

Various hypotheses have been proposed in an attempt to interpret polyploid relationships of subg. Lycoctonum based mostly on morphological or cytological evidence. As A. angustius shows heterologous chromosomes as well as intermediate morphological characteristics and overlapping geographical distribution with A. finetianum and A. sinomontanum, hybridization between A. finetianum and A. sinomontanum was suggested for A. angustius by Gao (2009). As for tetraploids endemic to the HDM, they are scarcely distinguishable from one another, particularly A. chrysotrichum, A. brevicalcaratum and A. crassiflorum, showing similar degrees of leaf division, growing habitats and geographical distributions. Yuan and Yang (2006) suggested local origins during the upheaval of the Qinghai-Tibetan Plateau (QTP) for the tetraploids in the HDM. Polyploidization seems to be an important speciation mechanism in the subg. Lycoctonum. However, in the last twenty years, the phylogeny of the subg. Lycoctonum has not yet been accomplished, not to mention the relationships among the polyploid species. Previously, Utelli et al. (2000) suggested early divergence between subg. Lycoctonum and subg. Aconnitum in Europe. High morphological variability within and among populations were detected, most probably due to recent speciation. However, the morphological characters showed no value as systematic characters. Recent phylogenetic reconstruction was conducted within Delphineae, among which nine species from the subg. Lycoctonum were included (Jabbour and Renner 2012). In this study, subg. Lycoctonum was suggested to be monophyletic clade and sister group of Delphinium based on trnL-F and ITS data set (Jabbour and Renner 2012). To the best of our knowledge, no phylogenetic inference has ever been made to infer the relationships between the polyploid species of the subg. Lycoctonum in China.

Uniparentally inherited cpDNA and biparentally inherited nrDNA are frequently combined to make phylogenetic inference, informing relationships among polyploids, origin of hybridization, and the mode of polyploidy in particular groups (Soltis et al. 2004, 2008; Fortune et al. 2008; Kim and Donoghue 2008; Kelly et al. 2013; Dufresne et al. 2014; Brassac and Blattner 2015; Mavrodiev et al. 2015). The nrDNA ITS region is considered concerted evolution of repeated gene families and has been widely analyzed in phylogenetic reconstruction and polyploidization speciation of plants species (Buckler et al. 1997; Álvarez and Wendel 2003; Fortune et al. 2008; Brassac and Blattner 2015). cpDNA is considered to be particularly suitable for investigating processes associated with origin of maternal lineage (Birky 1995; Song et al. 2002). As are motivated to understand phylogenetic relationship and evolutionary history in subg. Lycoctonum, we examined four cpDNA regions with nuclear ITS and ETS sequences from fresh and herbarium material covering a global geographical range, particularly for polyploids in China. Phylogenetic assessment in combination with diversification rate estimation, ancestral area reconstruction and molecular clock dating was used to interpret the phylogenetic relationships and understand the origin of the polyploid species within subg. Lycoctonum. The phylogenetic inference was also determined for the consistence with previous morphological or karyotypic evidence.

Materials and methods

Plant materials

For the present study, we used 48 samples of subg. Lycoctonum representing 26 species and three varieties (Online Resource 1). In particular, we sampled all 16 species and two varieties from China, four species from Europe, three species from Korea, three species (including one variety) from Japan, and the only one species from North America. Three species (A. delavayi, A. tanguticum and A. racemulosum) in A. subg. Aconitum were used as outgroup based on previous studies (Jabbour and Renner 2012; Wang et al. 2013; Luo and Yang 2005). From year 2009 to 2012, we collected individual samples in the field by harvesting young leaves and preserving them in silica gel. All of the voucher specimens of the collected individuals are deposited in the Herbarium of South China Botanical Garden (IBSC). Samples of A. vulparia, A. lycoctonum, A. monticola and A. gigas var. hondoense were obtained from herbarium specimens with the permission of the curators (the first three species from LE and the fourth from PE).

DNA extraction, PCR amplification, and DNA sequencing

Total genomic DNA from approximately 50 mg of fresh dried leaves was extracted using a DNeasy Plant Mini Kit (Qiagen, Hilden, Germany). Six molecular markers, including the cpDNA regions ndhF-trnL, psbA-trnH, psbD-trnT, and trnT-L and the two nrDNA regions, ITS, and ETS, were analyzed. The primer information is listed in Table 1. PCRs were performed in a total volume of 25 μl, containing 14.75 μl sterile purified water, 5.0 μl 5 × PrimeSTAR® Buffer (Mg2+ plus), 2.0 μl dNTP Mixture (2.5 mM each nucleotide), 0.5 μl of each primer (10 μM), 0.25 μl PrimeSTAR® HS DNA Polymerase (2.5 U/μl), and 2.0 μl DNA template (ca. 50 ng). The PCR protocol included an initial denaturation (3 min at 95 °C) followed by 32 cycles of 30 s denaturation (94 °C), 40 s annealing (52–56 °C), and 1 min extension at 72 °C, and concluded with a final extension of 8 min at 72 °C. All PCR products were purified using a TIANquick Midi Purification Kit (TIANGEN, Beijing, China) and sequenced using the corresponding PCR primers. We obtained sequences from an ABI PRISM 3100 Genetic Analyzer. We deposited all sequences in GenBank (accession numbers shown in Online Resource 1).

Table 1 Primer information for the four cpDNA regions ndhF-trnL, psbA-trnH, psbD-trnT, and trnT-L and the two nrDNA regions internal transcribed spacer (ITS) and external transcribed spacer (ETS)

Sequence alignment and phylogenetic analyses

All sequence alignments were conducted using MAFFT v.6.717 (Katoh and Frith 2012) components from SATéwin v.2.2.7-2013 Feb15 (Liu et al. 2012). Alignment improvements were conducted manually by eye using BioEdit v.7.0.9.0 (Hall 1999). The complete combined alignment has been deposited in TreeBASE (http://treebase.org/treebase-web/home.html) with accession number S20706. Gaps were treated as missing data in the alignments and in downstream analyses. To determine the potential of combining the cpDNA and nrDNA sequence datasets for the phylogenetic analyses, we used Incongruence Length Difference (ILD) tests (Farris et al. 1995) implemented in PAUP* v.4.0b10 (Swofford 2002). The ILD tests were conducted using 1000 replicates and random addition (10 additional sequences), maintaining a maximum of 100 trees in each repetition. We considered P < 0.001 as a more realistic criterion to reject combinability (Cunningham 1997a). In particular, congruence was tested among the molecular markers and between combined cpDNA and nrDNA data sets, which were used separately and in combination for further phylogenetic analyses.

We performed a Bayesian Inference (BI) of phylogenies using MrBayes v.3.2 (Huelsenbeck and Ronquist 2001; Ronquist and Huelsenbeck 2003). Modeltest v.3.7 (Posada and Crandall 1998) was used to select from 24 models of nucleotide substitution to determine the best model for each sequence partition. Four simultaneous independent runs were performed with four chains each (one cold and three heated) and initiated with random trees. We unlinked the data partitions during the analyses so each partition had independently estimated parameters, and we saved one tree every 1000 generations. All BI analyses reached an average standard deviation of split frequencies ≤0.01 after 20 million generations, and we excluded 25% of the generated trees as burn-in. To confirm the standard deviations and burn-in values, we assessed convergence and stationarity, respectively, by examining the split frequencies diagnostic and by comparing the mean and variance of the log likelihoods in the “.stat” output file from MrBayes. We used the remaining trees from the four runs to estimate the 50% majority-rule consensus tree and the Bayesian posterior probabilities. Maximum Parsimony (MP) was performed using PAUP* v.4.0b10 (Swofford 2002). All characters were treated as equally weighted. Heuristic searches included 1000 replicates with random sequence additions, tree bisection-reconnection (TBR) branch swapping, Multrees on, and maxtrees set to 10 for each step. We assessed clades supported by the MP analyses with bootstrap values (Felsenstein 1985). Bootstrap values were calculated in PAUP* with 1000 bootstrap replicates, each with 10 random sequence addition replicates holding maximally one tree per replicate, TBR branch swapping, and Multrees on. Maximum likelihood (ML) analyses used the identical partition scheme and were conducted using RAxML (Stamatakis 2006) and the RAxML graphical interface (raxmlGUI v.1.3 (Silvestro and Michalak 2012)). RAxML was conducted using Python v.2.7.6 (http://www.python.org/ftp/python/2.7.6/python-2.7.6.msi) with 1000 rapid bootstrap replicates. For the ML analyses, the general time-reversible (GTR) model was used with a gamma model of the rate of heterogeneity. Additionally, to visualize the conflicts among the gene trees, the FilteredSuperNetwork implemented in SplitsTree4 v4.13.1 (Huson and Bryant 2006) was used to generate a consensus tree of the separate MP trees (Four cpDNA and two nrDNA regions). The input trees were edited as multiple individuals in a monophyletic clade collapsed to one terminal. The default settings were used to construct the network.

cpDNA haplotype network analysis

To detect representative groups of polyploids, a genealogical degree of relatedness among the cpDNA haplotypes of subg. Lycoctonum was estimated using TCS v.3.11 (Clement et al. 2000) based on a maximum parsimony approach, performing best for non-recombinant sequences (Woolley et al. 2008). In this analysis, indels were treated as single mutation events regardless of their length. Different types of insertions occurring at one site were coded as one of four nucleotides. In both the parsimonious and network analyses, length variations in mononucleotide repeats (poly A or T) were excluded because of their tendency for homoplasy.

Reconstruction of the ancestral area

For ancestral area reconstruction, the distribution of subg. Lycoctonum was divided into eight areas: northwestern China (A), Qinghai–Tibetan Plateau (QTP; B), Hengduan Mountains (HDM; C), central China (D), eastern Asia (E), central Asia (F), Europe (G) and North America (H) (Fig. 1). Korea and Japan were included in Eastern Asia, as they were connected via the Eastern Sea Land Bridge during the Pleistocene glaciations (Harrison et al. 2001; Sun et al. 1999; Qiu et al. 2011). Statistical dispersal-vicariance analysis (S-DIVA) and Bayesian binary MCMC (BBM) programs implemented in RASP v.3.2 (Yu et al. 2015) were used to reconstruct the possible ancestral areas of subg. Lycoctonum on the combined consensus ML tree generated by RAxML (Stamatakis 2006), which was the dataset used for the divergence time analysis. In this method, the frequencies of an ancestral area at a node in the ancestral reconstructions are averaged over all trees. To account for uncertainties in the phylogeny, we used 3000 trees from the MCMC output and ran a BBM analysis on the condensed tree. The ten MCMC chains were run simultaneously for 500,000 cycles, and the state was sampled every 100 generations. A fixed Jukes-Cantor model with equal among-site rate variation was used for the BBM analysis with root distribution set to “outgroup.” The maximum number of areas for this analysis was six. The dispersal and vicariance events were inferred using a dispersal-vicariance analysis (Ronquist 1997) implemented in RASP v.3.2 (Yu et al. 2015).

Fig. 1
figure 1

Map of geographical distribution and sample localities of subg. Lycoctonum species both inside and outside China. Circles with numbers 1–29 represent the species corresponding with Online Resource 1. Diploid, tetraploid and hexaploid species are each indicated by circles filled in three different colors of gray, orange and blue. Circles outlined with eight different colors represent eight different geographical distribution regions of subg. Lycoctonum, corresponding to S-DIVA analysis. Gray area with black outline shows the geographical distribution range of Aconitum sinomontanum, orange area with orange outline for A. angustius, and gray areas with orange outline for A. finetianum. Gray straight lines and dotted lines with arrows indicate the vicariance and dispersal events, respectively, by S-DIVA analysis

Molecular clock estimation and diversification rate analyses

We analyzed the divergence time in BEAST v.1.8.0 (Drummond et al. 2006, 2012) for a combined dataset of 51 subg. Lycoctonum samples and the outgroup with 2915 aligned nucleotides. The divergence time analyses were conducted using a GTR+G model with four rate categories, a Yule tree prior, and relaxed clocks with uncorrelated and a log-normally distributed rate variation across the branches. We ran the MCMC simulations in BEAST for 20 million generations with parameters sampled every 2000th step. At the conclusion of the analysis, we applied a burn-in of 10%. We combined the results from individual runs and used Tracer v.1.6 (Rambaut et al. 2013) to check that the effective sample sizes for all relevant parameters were well above 200 and stationary. The final trees were edited using FigTree v.1.4.0 (http://tree.bio.ed.as.uk/software/figtree/).

Secondary calibration points are divergence time estimates that have been derived from one molecular dataset on the basis of a primary external calibration point and are used again independently of the original external calibration point on a second dataset. As Ranunculaceae fossils were not available to use as calibration points (Pigg and DeVore 2005; Jabbour and Renner 2011), Jabbour and Renner (2012) used the cross-validated calibration approaches to analyze the diversification of Ranunculaceae. They further used those result as secondary calibrations for the phylogenetic analyses of Delphinieae, which should offer more precise divergence time to serve as calibration points. In the same way, we used the result of Jabbour and Renner (2012) as calibration points, in which Aconitum was shown to evolve 12.70 Ma and the crown node of subg. Aconitum began diversifying 7.80 Ma. Therefore, we used those two ages as calibration points with normal prior distributions and standard deviations of 2.20 Ma, which covered the 95% HPD range. We attended to the secondary calibrations’ limitations, which tend to result in younger ages compared to those obtained with fossils (Sauquet et al. 2012).

For the diversification rate analyses, we excluded three species of the outgroup, five species outside of China or eastern Asia and two varieties, to assure a broad taxonomic sampling in China and eastern Asia (Deng et al. 2015). One random sample from each species was used for the following analyses. The temporal dynamics of diversification in subg. Lycoctonum was measured using LTT plots in the APE package of R (Paradis et al. 2004). To compare the diversification rates, the net diversification rates were estimated for different lineages and regions. Plots were produced based on the branching time information obtained from the BEAST analysis. The GEIGER package in R (Harmon et al. 2008) was used to calculate the net diversification rates (γ) of subg. Lycoctonum based on the equation (7) of Magallón and Sanderson (2001) (with no extinction ε = 0 and high extinction rate ε = 0.90). However, the BAMM v.2.2.0 program was also applied to explore the diversification rate heterogeneity among lineages in different geological periods (Rabosky 2014). The analysis was run with 10 million MCMC generations, which were sampled every 2000 generations. The convergence was tested based on the MCMC output using the CODA package in R (Plummer et al. 2006) with a burn-in of 10% and effective sample sizes exceeding 1000 for all estimated parameters. The BAMM output was used to calculate diversification rates with BAMMtools v.2.0.2 in R (Rabosky et al. 2014). The 95% credible shift configurations were filtered using a Bayes factor of 5, and the mean phylorate for net diversification, and marginal probabilities were plotted with the getEventData and plot.bammdata functions.

Results

Sequence characterization, ILD test and phylogenetic reconstruction

Forty-eight samples representing 26 species and three varieties of subg. Lycoctonum, and three samples from outgroup species were analyzed for both the cpDNA and nrDNA data sets. For the cpDNA dataset, samples collected from herbarium specimens were able to be amplified successfully. In Table 2, the aligned length of four cpDNA regions varied from 302 to 917 bp. The ambiguously aligned characters of the cpDNA dataset were excluded from all analyses. Therefore, 2071 aligned nucleotides were generated. A total of 174 bp (8.40%) were variable and 71 bp (3.43%) were parsimony-informative loci in the cpDNA dataset. The lengths of the ITS and ETS were 518 and 326 bp, respectively. The combined nrDNA sequences consisted of 844 aligned nucleotides, of which a total of 254 bp (30.09%) were variable and 162 bp (19.19%) were parsimony informative. The ILD test showed that the sequences within the cpDNA and nrDNA datasets, independently, were highly congruent (P > 0.01 for cpDNA; P = 0.42 for nrDNA). While the ILD test detected that sequences between the cpDNA and nrDNA datasets are incongruent (P = 0.001). The combination of the two incongruent matrixes produced largely congruent topologies with the filtered supernetwork, generated by the consensus tree of the separate MP trees (Fig. 2; Online Resource 2–4). Therefore, we used the combined two matrixes to improve phylogenetic accuracy (Cunningham 1997b; Yi et al. 2014). For the combined matrix of the cpDNA and nrDNA datasets, 2915 bp aligned nucleotides were detected containing 196 (7.30%) parsimony-informative sites. With the combined sequences, the MP analysis recovered 1000 most parsimonious trees of 625 steps with consistency index (CI) and retention index (RI) values of 0.77 and 0.89, respectively. The tree topologies obtained from the MP, BI and ML analyses, based on the cpDNA and nrDNA datasets, are largely congruent (Online Resource 2–3). Because the filtered supernetwork also showed largely congruent topologies with a combined ML tree in subg. Lycoctonum, the MP, BI, and ML analyses, based on combined datasets and the filtered supernetwork, will be discussed together.

Table 2 Sequence characteristics, statistics of polymorphic and informative sites, as well as best models of DNA substitution for Bayesian analysis
Fig. 2
figure 2

The best-scoring maximum likelihood (ML) tree inferred from the combined cpDNA and nrDNA data sets with the GTRGAMMA model implemented in RAxML. The numbers near the branches are Bayesian posterior probabilities and bootstrap percentages of maximum parsimony and maximum likelihood trees, respectively. Dashed lines indicate nodes with less than 50% support. Species with multiple individuals are represented by the abbreviations of voucher numbers. Two major clades were identified (clade 1 and clade 2), each with three subclades. The tetraploid species, hexaploid species and the potential diploid ancestor are all highlighted with shades in three different colors of orange, blue and gray. Species distributed in the Hengduan Mountains (HDM), Korea and Japan are all indicated

Two diploid species, A. novoluridum and A. fletcheranum, were positioned outside of the major clades at the base branches in the consensus tree compiled from ML with high support values (100/95/99 BI/MP/ML and 100/100/100 BI/MP/ML, respectively). Two major clades (clade 1 and clade 2) were detected, each of which was divided into three subclades (Fig. 2). The first clade (clade 1), with three subclades 1–1, 1–2 and 1–3, was composed of tetraploid A. angustius and all diploid species except A. scaposum. Aconitum angustius, A. finetianum and A. alboviolaceum in eastern and northeastern China, three Korean and two Japanese species together formed subclade 1–1, showing monophyly with strong support (100/100/99 BI/MP/ML). However, the inner support for the deeper nodes was weaker except for the three Korean species (99/68/91 BI/MP/ML). In subclade 1–1, A. angustius was more frequently present with A. finetianum (Fig. 2). Subclade 1–2 was composed of diploids, three from Europe and six from China with strong support (100/99/97 BI/MP/ML). Aconitum sinomontanum is placed in subclade 1–2. Subclade 1–3 consisted of diploid species each from Japan, North America and central Asia and three diploid species from China with strong support (99/82/89 BI/MP/ML). The second clade (clade 2) had three subclades, 2–1, 2–2 and 2–3, and was composed of all tetraploids in the HDM, the hexaploid A. apetalum and widespread diploid species A. scaposum. Moderate support values (65/69/79 BI/MP/ML) were detected for clade 2, but strong support values were detected for the subclades. Subclade 2–1 was formed by all tetraploids in the HDM and showed the high support values of 100/95/98 BI/MP/ML. Aconitum apetalum and A. scaposum were clustered into separate subclades 2–2 and 2–3, respectively, with full support in all analyses.

Three species, A. gigas var. hondoense, A. scaposum and A. apetalum, showed incongruent positions based on partial datasets (Online Resource 2–3). The position of A. gigas var. hondoense changed from subclade 1–1 based on cpDNA data to subclade 1–3 based on nrDNA data. However, no support values were detected in either partial dataset for the placement of A. gigas var. hondoense (Online Resource 2–3). The positions of A. scaposum and A. apetalum, which were clustered with tetraploids in the HDM by the cpDNA data (80/73/88 BI/MP/ML), had extremely low support (51/-/- BI/MP/ML) as a sister to all other species in clade 1 based on the nrDNA data (Online Resource 2–3).

cpDNA haplotype network

Forty haplotypes (H1–H40) were identified among the four cpDNA regions for 26 species and three varieties, of which 37 were species-specific (Online Resource 1; Fig. 3). Haplotype H2 was present in both A. angustius and A. finetianum, haplotype H5 shared between A. barbatum, A. barbatum var. puberulum, A. shennongjiaense and A. sinomontanum, and H39 occurring in two Korean species, A. longecassidatum and A. quelpaertense. Two cpDNA haplotype groups, Group 1 and Group 2, consisting of polyploidy species, were identified in the TCS network, which largely corresponded with subclade 1–1 and clade 2, respectively (Fig. 2). In haplotype Group 1, A. angustius (H1, H2, H28 and H31) was positioned at the end step and was closely connected with A. finetianum (H2, H10 and H11) and A. alboviolaceum (H29 and H30). Two Korean haplotypes (H39 and H40) were derived from haplotype H30 of A. alboviolaceum. The Japanese species were represented by three haplotypes H13, H18 and H38, of which H38 was six steps from A. alboviolaceum (H29). In haplotype Group 2, all tetraploids (H6–H9, H20, and H32–H37) in the HDM were connected with A. scaposum (H21, H22 and H23) and A. apetalum (H3 and H4). Ancestry missing haplotypes were detected for species in haplotype Group 2.

Fig. 3
figure 3

The 95% plausible cpDNA haplotypes network inferred by TCS. Forty cpDNA haplotypes are shown. The size of each circle corresponds to the frequency of each haplotype. Dots with black outlines indicate unsampled or extinct haplotypes. The diploid, tetraploid and hexaploid species are all highlighted with three different colors of gray, orange and blue. Two cpDNA haplotype groups (1–2) consisting polyploid species were identified. Species distributed in Hengduan Mountains (HDM), Korea and Japan are all indicated

Ancestral area reconstruction

S-DIVA indicated an obvious biogeographical history for polyploidy species in subg. Lycoctonum. Altogether, nine vicariance and seven dispersal events with Bayesian probabilities (P values) ≥ 0.70 were postulated (Figs. 1, 4). In subclade 1–1, three vicariance events (V1, V2 and V3; P = 1.00) were identified between central China (D) and eastern Asia (E). V1 (P = 1.00) was detected at the node where the Korean species diverged from A. angustius and A. finetianum. Two dispersal events (D1 and D2; P = 1.00) were detected showing a route from eastern Asia (E) to central China (D) and back to eastern Asia (E). Eastern Asia (E) was considered to be the ancestral area (AA1) for subclade 1–1. In clade 2, two vicariance events (V4 and V5; P = 1.00) were identified, of which V5 (P = 1.00) was between the HDM (C) and northwestern China (A). Two dispersal events (D3 and D4; P = 1.00) were detected showing a routes from the HDM (C) to northwestern China (A) and central China (D) with the HDM (C) being the center of origin (AA2).

Fig. 4
figure 4

Ancestral area reconstruction of subg. Lycoctonum species as inferred by S-DIVA. Color legends indicated eight geographical distribution regions (AH) and other ancestral area. The eight geographical distribution regions (AH) assigned for each sample, the numbers (129) representing the species and the ploidy levels of each sample are all given at the bottom. Vicariance events (V1V9) and dispersal events (D1D7), with Bayesian probabilities (P values) ≥0.70, are shown for nodes. Ancestral areas (AA1 in region E and AA2 in region C) are indicated for subclade 1–1 and subclade 2–1 consisting polyploidy species, respectively

Divergence time estimation and diversification rate analyses

According to Fig. 5, the calculated divergence time of subg. Lycoctonum was estimated as 10.63 (95% HPD = 6.70–15.00) Ma during the middle Miocene. The divergence time between clade 1 and clade 2 was inferred as 5.72 (95% HPD = 3.23–8.56) Ma in the late Miocene. Subclade 1–1 was estimated to diverge approximately 2.24 (95% HPD = 1.03–3.73) Ma in late Pliocene. The ancestor of clade 2 was estimated to arise approximately 5.13 (95% HPD = 2.74–7.74) Ma in the late Miocene. Tetraploids in the HDM, forming subclade 2–1, were estimated to have diversified approximately 3.00 (95% HPD = 1.44–4.90) Ma and diverged from A. apetalum approximately 4.13 (95% HPD = 2.19–6.48) Ma in mid-Pliocene. Compared to the results of Jabbour and Renner (2012), the estimated divergence ages were younger because of the limitations of the secondary calibrations (Sauquet et al. 2012).

Fig. 5
figure 5

Chronogram for the combined dataset obtained under a Bayesian relaxed clock with log-normally distributed rates. The 95% highest posterior density (HPD) for major lineages with a posterior probability >0.95 are indicated at the nodes. Time scale is shown on the bottom. The numbers corresponded to the code numbers in Table 1. Chromosome numbers are indicated following the species names. Two major clades were identified (clade 1 and clade 2), each with three subclades. The tetraploid species, hexaploid species and the potential diploid ancestor are all highlighted with shades in three different colors of orange, blue and gray. Species distributed in the Hengduan Mountains (HDM), Korea and Japan are all indicated

In particular, the LTT analysis suggested an increasement in the diversification rate of subg. Lycoctonum at approximately 6.35 Ma in the late Miocene (Fig. 6). The net diversification rates of subg. Lycoctonum were 0.24 sp/Myr (ε = 0) and 0.11 sp/Myr (ε = 0.90) (Table 3). The diversification rates of clade 1–1 were 0.8402 sp/Myr (ε = 0) and 0.34 sp/Myr (ε = 0.90) (Table 3). The diversification rates of clade 2–1 were 0.26 sp/Myr (ε = 0) and 0.07 sp/Myr (ε = 0.90), which were much lower than other subclades (Table 3). The BAMM analysis also indicated a high heterogeneity of diversification rates among different lineages during different geological periods. Subclade 2–1 showed a comparably lower diversification rate in the early Pliocene (Fig. 7). The species diversification rate in the HDM was obviously lower than other regions (Table 3; Fig. 7).

Fig. 6
figure 6

Log lineage-through-time plot for the subg. Lycoctonum

Table 3 Net diversification rates estimated for subg. Lycoctonum
Fig. 7
figure 7

A phylorate plots showing the heterogeneity of diversification rates in subg. Lycoctonum estimated from BAMM

Discussion

The phylogenetic relationship and diversification rate of subgenus Lycoctonum

The ILD test detected incongruence between the sequences of the cpDNA and nrDNA datasets. Phylogenetic incongruence between genes is widespread and expected because unlinked genes have different evolutionary histories (Huelsenbeck et al. 1996; Mansion et al. 2005; Lihova et al. 2006). According to Cunningham (1997a), P < 0.001 is a more realistic criterion to reject combinability. However, studies have previously shown that even when the ILD test indicates incongruence between datasets, the combined analysis can provide better resolved trees (Camacho-García et al. 2014; Oskars et al. 2015).

Major phylogenetic relationships within subg. Lycoctonum were resolved and generally well supported in the current study. Remarkably, relationships among the subclades were also strongly supported. Different methods (MP, BI, ML and a filtered supernetwork) and datasets (cpDNA and nrDNA) generated largely consistent phylogenetic tree. Aconitum fletcheranum and A. novoluridum, which are distributed in Qinghai-Tibetan Plateau (QTP), were revealed at the base branches and showed distant relationship with other species in subg. Lycoctonum. Aconitum fletcheranum is the only species displaying terminal inflorescence (Tamura 1995), and it represents a different base chromosome numbers in subg. Lycoctonum (Langlet 1927; Gregory 1941; Kurita 1957; Tamura 1995; Yuan and Yang 2006; Hong et al. 2016). The karyotype of A. novoluridum mainly consists of m (median centromeric) and sm (submedian centromeric) chromosomes, remarkable different from other diploid species mainly consisting of st (subterminal centromeric) chromosomes (Yuan and Yang 2006). Additionally, previous karyotypic analysis and taxonomic studies suggested the two species as two independent sections (Tamura 1995; Gao 2009). We inferred that the two phylogenetically distant species have suffered from a long-term geographical isolation, contributing to their unique chromosome characteristics and distant phylogenetic positions. The widespread A. scaposum was also supported to be independent Ser. Scaposa W. T. Wang (Gao 2009), which is consistent with the phylogenetic result. The position of the Japanese species A. gigas var. hondoense remains uncertain because it showed alternative and inconsistent clustering without support values based on both datasets (Online Resource 2–3).

Aconitum finetianum, A. aloviolaceum and other Korean and Japanese species formed monophyly with A. angustius (subclade 1–1). The diploid species in subclade 1–1 displayed extensive morphological variation with no relation to species or geographical distribution, often leading to misidentification among them. For example, A. finetianum and A. longecassidatum, with minor differences in sepal hairs, were previously treated as conspecific by Tamura and Lauener (1979). Aconitum pterocaule was previously thought to be synonymous with A. finetianum and A. longecassidatum (Tamura and Lauener 1979), which were present in Eastern Asia, including Korea Peninsula, Japan Island as well as northeastern and eastern China (Fig. 1). These areas were geographically connected via the Eastern Sea Land Bridge during the Pleistocene glaciations (Sun et al. 1999; Harrison et al. 2001; Qiu et al. 2011). Meanwhile, this geographical region exhibits a little bit flat topography and has experienced multi times of climatic oscillation. Therefore, gene introgression could be facilitated and resulted in extensive morphological similarities and close phylogenetic relationships among those diploid species. Aconitum angustius shows intermediate morphological characteristics between A. finetianum and A. sinomontanum. It is reasonable to understand that A. angustius was closely related with A. finetianum on the phylogenetic tree. However, it is not expected that A. angustius demonstrated distant positions from A. sinomontanum. This is most probably due to the limitation of the uniparentally inherited cpDNA regions, which might not be able to reveal the paternal lineages among species. Aconitum sinomontanum was arranged in Ser. Lycoctonia Tamura & Lauener with A. shennongjiaense, A. monticola, and A. umbrosum based on taxonomical analysis (Gao 2009). The taxonomical result was further supported by the current phylogenetic tree.

All the tetraploid species in the HDM formed one monophyly, supporting them to be classified into Ser. Crassiflora Tamura & Lauener (Gao 2009). Only minor divergences could be identified for the morphological characteristics, living habitats and geographical distribution among the tetraploid species in the HDM. This might suggest that all the tetraploid species in the HDM originated from the same ancestor with a rapid radiation of species diversity. The HDM is dominated by complicated topography with deep and parallel valleys allowing extreme elevation changes within relatively short distances, thus producing local habitat uniqueness and plant species richness. It was also suggested to be a long-term refugium for a few species with ancient origins. Various examples for rapid radiation and glacial refugium of the HDM could be found in Magacodon styrophorus (Ge et al. 2005), Ligularia tongolensis (Wang et al. 2011) and some cypress species (Xu et al. 2010). The hexaploid A. apetalum, showing the highest ploidy level in subg. Lycoctonum, is known to be geographically isolated and distributed in central Asia. Taxonomically, Gao (2009) set it as an independent Ser. Micrantha Steinb. ex Tamura & Lauener due to the different upper sepal characteristics from other tetraploid species in the HDM. Consistently, it was revealed to be a monophyly based on the current phylogenetic tree. In chromosome size and particularly chromosome morphology, A. apetalum showed somewhat similar characteristics to the tetraploid species in the HDM, but markedly different from A. angustius (Hong et al. 2016). Therefore, it was supported as the sister group of the tetraploid species in the HDM.

The overall diversification rates were 0.24 sp/Myr (ε = 0) and 0.11 sp/Myr (ε = 0.90) for subg. Lycoctonum, which were different from those (0.61 sp/Myr (ε = 0) and 0.33 sp/Myr (ε = 0.90)) detected by Jabbour and Renner (2012). The difference in diversification rates in the two studies resulted from different species sampling, as relatively fewer ancient species in subg. Lycoctonum were sampled in Jabbour and Renner (2012) than in the current study. Polyploidy drives speciation and diversification as is evident in many angiosperm families with origins of many major clades being preceded by whole-genome duplications (Stebbins 1971; Otto and Whitton 2000; Levin 2002; Otto 2007; Paun et al. 2007; Soltis et al. 2009; Arrigo and Barker 2012; Estep et al. 2014). However, no obvious correlation between polyploidization and the subsequent increase in net diversification rate were suggested based on theoretical arguments and empirical studies most recently (Wood et al. 2009; Mayrose et al. 2011; Estep et al. 2014). In the current study, the lineages containing A. angustius (subclade 1–1) demonstrated a lower diversification rate than subclade 1–2. In addition, the lineage containing tetraploids in the HDM (subclade 2–1) showed a much lower diversification rate than the other lineages (Table 3; Fig. 7). Studies have shown that a high diversification rate does not always occur in areas with noted biodiversity (Escudero et al. 2012; Christin et al. 2014; Schilling et al. 2015), such is the case in the current study. The unanticipated low diversification rate occurred in the HDM, where there is high species diversity. This might have been produced by the identically local origin in the HDM.

Phylogenetic relationships among the polyploid species

Interspecific hybridization and subsequent genome doubling were suggested to occur frequently as one of the mechanisms of polyploid speciation (Thompson and Lumaret 1992; Soltis and Soltis 1999; Ramsey and Schemske 1998, 2002). Usually hybridization is resulted from different diploid species and thus leading to polyploid speciation. Cytological investigations have shown that A. angustius possesses 32 chromosomes mainly with st types, which are larger and cannot be arranged into eight groups of four homologs as other tetraploid species in the HDM (Gao 2009; Gao et al. 2012; Hong et al. 2016). Thus, tetraploidy in A. angustius could not have arisen from a single progenitor and most likely resulted from hybridization. Morphologically, A. angustius displays intermediate morphologies between A. finetianum and A. sinomontanum. It shows morphological similarities in having leaves 3-parted, pedicels retrosely pubescent and the upper sepal cylindric with A. finetianum, often making them confused with each other (Gao et al. 2012). Meanwhile, A. angustius often displays overlapping geographical ranges with A. finetianum and A. sinomontanum. For example, it co-occurs with A. finetianum on several mountains in southeast China (Lushan Mountains, Hengshan Mountains). Especially, on Lushan Mountains, they even grow side by side within a community (Yuan and Yang 2006). Its northern margin of the geographical distribution range overlapped with A. sinomontanum in western Hubei and northeastern Chongqing (Fig. 1). However, the current study only indicated close phylogenetic relationship (Fig. 2) and common shared cpDNA haplotype H2 between A. finetianum and A. angustius (Table 1; Fig. 3). In combination with morphological, cytological and phylogenetic evidence, we therefore suggest A. finetianum as the potential maternal progenitor of A. angustius. Though A. sinomontanum was previously suggested to be another progenitor of A. angustius, no molecular evidence was available in the current study. This is most probably due to the limitation of the uniparentally inherited cpDNA regions and concerted evolution of nrDNA ITS region. The relationships of A. angustius and A. sinomontanum need further determination.

Heterogeneous habitats were suggested as geographical regions for rapid diversification and long-term preservation of plant species (Tuomisto et al. 2003; Kreft et al. 2006; Antonelli and Sanmartín 2011; Verboom et al. 2015). Tetraploids in the HDM displayed similar morphological characteristics and karyotypes (Yuan and Yang 2006; Hong et al. 2016) and formed monophyly in the phylogenetic tree. Those species can only be distinguished by minor differences of sepals and spurs. Their geographical distribution range and growing habitats were hard to be differentiated. Karyotypic constitutions showed mainly sm or m types of those species, which are smaller than that of A. angustius and can be arranged fairly well into eight groups of four homologs. All above evidence supported the same origin of the five tetraploid species in the HDM. The geographical center of the local origin was detected in the HDM (C) (AA2) (Fig. 4), where the tetraploids were estimated to originate approximately 3.00 (95% HPD = 1.44–4.90) Ma in the middle Pliocene (Fig. 5). This period was coincident with the orogeny of the HDM (approximately 3.4 Ma) (Li et al. 1979, 1996; Cui et al. 1996; Shi et al. 1998; Wang et al. 2010; Liu et al. 2013). We therefore suggest local origin in the HDM of the five tetraploids. Perennial growth habit is considered to be one of the factors promoting polyploidy (Grant 1971, 1981). In the HDM, the uplift of the QTP and climate change could have led to high levels of vegetative propagation (e.g., unreduced gametes), thus resulting in local species diversification. Meanwhile, the HDM is dominated by heterogeneous habitats, showing deep and parallel valleys with extreme elevation changes within relatively short distances. It is considered as an important geographical region for rapid radiation of plant species (Li and Li 1993; Wu 1988; Myers et al. 2000), which facilitated species diversification and long-term preservation. However, the progenitors of tetraploids in the HDM were unable to be detected due to the missing cpDNA haplotypes (Fig. 3).

Aconitum apetalum is the only hexaploid species in subg. Lycoctonum, which is distributed in the Tianshan Mountains, Xinjiang, northwestern China and Kazakhstan and Tajikistan in central Asia. It demonstrated similar karyotypic constitutions and close phylogenetic relationship with the tetraploid species in the HDM. In the present study, A. apetalum was estimated to originate approximately 4.13 (95% HPD = 2.19–6.48) Ma, which was earlier than the orogeny of the HDM (approximately 3.40 Ma) and the fourth period of the uplift of the QTP (approximately 1.60–3.50 Ma) (Liu et al. 2013; Harrison et al. 1992; Chen 1992; Spicer et al. 2003; Wen et al. 2014). A vicariance event (V5; P = 1.00) was also detected between the HDM (C) and northwestern China (A) according to the S-DIVA inference (Fig. 4). Therefore, it is reasonable to infer that A. apetalum has suffered strongly from long-term geographical isolation or forced migration northwards resulting from the formation of geographical barriers by the HDM and QTP.

Conclusions

Subgenus Lycoctonum has a widespread range and a well-known background of polyploidy. It is a diverse subgenus in one of the earliest diverging lineages of Ranunculaceae, composed of diploid, tetraploid as well as hexaploid species. However, phylogenetic assessment has never been well accomplished in particularly for the polyploid species. In the current study, we examined cpDNA regions and nrDNA sequences from subg. Lycoctonum species. Phylogenetic inference was conducted informing the relationship between the polyploid species. Major phylogenetic relationships within subg. Lycoctonum were resolved with generally high support values and consistence with previous taxonomic studies. No evidence of polyploidization leading to an increased diversification rate was observed. Ancestral area was reconstructed, and divergence time was estimated in particular for the polyploid lineages. Hybridization was suggested for A. angustius and A. finetianum was inferred to be the potential maternal progenitor. Local origin was suggested for tetraploids in the HDM. Heterogeneous habitats were considered to be important factors promoting rapid speciation and long-term preservation in the HDM. Evidently, A. apetalum has suffered from long-term geographical isolation by the QTP and HDM. Therefore, hybridization and heterogeneous habitats have played an important role in the polyploidization of subg. Lycoctonum. However, more molecular evidence is in need for further interpretation of the phylogenetic relationship and polyploids evolutionary mechanism of subg. Lycoctonum.