Introduction

The occurrence of complexes of cryptic species—groups of species that are not confidently distinguishable based only on morphology—has become widely recognised in biodiversity analyses (Knowlton, 1993; Bickford et al., 2007). The revolution brought by efficient DNA sequencing technologies has driven an explosion of studies on biodiversity, unmasking hidden morphological diversity and revealing that cryptic species are common and widespread across all animal phyla (Pfenninger & Schwenk, 2007; Trontelj & Fiser, 2009). While deciphering hidden diversity in species complexes remains a taxonomic challenge, it is crucial to address important questions in speciation research to understand patterns and processes in biodiversity (Butlin et al., 2009).

Phylum Rotifera is one of several phyla with a high level of cryptic diversity (Fontaneto et al., 2009; García-Morales & Elías-Gutiérrez, 2013; Gabaldón et al., 2016). Cryptic diversity is expected in rotifers, due to the small size of these animals, the paucity of taxonomically relevant morphological features, and the scarcity of rotifer taxonomists (Wallace et al., 2006). Moreover, the reliance of rotifers on chemical communication in species recognition (Snell, 1998) may contribute to the prevalence of morphological cryptic diversity. One clear example of cryptic diversity in the phylum is the species complex Brachionus plicatilis Müller, 1786, a cosmopolitan taxon with an affinity for saline environments. Here we report an extensive study undertaken to unravel the hidden diversity within this species complex.

Two morphotypes of B. plicatilis were reported as early as the 19th century when Ehrenberg ascribed the name Brachionus muelleri Ehrenberg, 1834, as distinct from the first record for the species complex, B. plicatilis (although the former name is now considered a junior synonym of the latter). A modern discussion of diversity in B. plicatilis began when two strains with differing morphological and ecological characteristics were recognised as the L (large) and S (small) types (Oogami, 1976). From the early 1980s, it became increasingly clear that the morphological and genetic differences between the L and S strains supported the hypothesis that the two morphotypes should be recognised as separate species. Serra & Miracle (1983) noted marked seasonal cyclomorphosis in individuals from Spanish water bodies commenting that, while B. plicatilis populations were thought to exhibit high levels of phenotypic plasticity in their natural habitat, laboratory clones founded from single individuals could be readily distinguished biometrically. They also noted a good correlation between biometric classification and spatial distribution of wild populations, hypothesising that some of their clones may constitute a “well-differentiated genetic race”.

The idea of discriminatory genetic structure within what was considered a single species was further supported by Snell & Carrillo (1984) who examined 13 strains of B. plicatilis sourced globally, concluding that strain identity was the most important deterministic factor of size. Serra & Miracle (1987) supported these observations, reporting that size in B. plicatilis populations seemed to be largely under genetic control. Furthermore, these authors noted that size could be defined to a narrow range of biometric deviations at different salinities and temperatures. In the same year, King & Zhao (1987) reported a substantial amount of genetic variation in three enzyme loci between clones established from individuals collected at different times from Soda Lake, Nevada (USA). Other phenotypic traits provided evidence for distinct species. For example, some members of the species complex retain their resting eggs within the body, while others employ a thin thread to hold them outside their body (Serrano et al., 1989).

The existence of cryptic species within B. plicatilis was reinforced by Fu et al. (1991a), who examined 67 isolates from around the globe and showed that they could be clearly classified into large (L) and small (S) morphotypes based upon morphometric analysis alone. In a second study, the same group clearly discriminated between L and S strains on a genetic basis and concluded that at least two species existed (Fu et al., 1991b). Additional evidence for the existence of at least two species within the taxon came from the examination of chromosomes: L and S morphotypes have karyotypes of 2n = 22 and 2n = 25, respectively (Rumengan et al., 1991, 1993). The size discontinuities between L and S morphotypes were shown to correspond to behavioural reproductive isolation between these groups (Snell & Hawkinson, 1983). Snell (1989) showed how male mate recognition could be used as a means of establishing species boundaries in monogonont rotifers in this case. Both Fu et al. (1993) and Gómez & Serra (1995) also identified reproductive isolation between the L and S types based on male mating behaviour. Thus, in reviewing morphological, behavioural, and genetic studies, Segers (1995) concluded that the L and S strains could be defined as two distinct species, namely B. plicatilis sensu stricto (s.s.) and Brachionus rotundiformis Tschugunoff, 1921, respectively.

Further investigations by Gómez & Serra (1995), Gómez et al. (1995), Gómez & Snell (1996), Serra et al. (1998), and Ortells et al. (2000) using molecular markers and reproductive isolation tests revealed that several cryptic species could be ascribed to both B. plicatilis and B. rotundiformis. This revelation culminated in a paper by Ciros-Pérez et al. (2001a) that used morphological, ecological, and genetic differences to support B. plicatilis s.s. and B. rotundiformis and to introduce a medium size type, designated SM, to the species complex with the description of Brachionus ibericus Ciros-Pérez, Gómez & Serra, 2001. At this stage, three groups were known: L with B. plicatilis s.s., SM with B. ibericus, and SS (here so called with two capital ‘s’ to be clearly differentiated from the S strains) with B. rotundiformis (Fig. 1).

Fig. 1
figure 1

Photomicrographs of three representative lineages of the Brachionus plicatilis species complex. A, B, C) dorsal view; D, E, F) lateral view; G, H, I) ventral view. A, D, G Large strain, clone BUSCL (clade L1 in Figs. 25); B, E, H Medium strain, clone MULCL (clade SM4); C, F, I) Small strain, clone TOWCL (clade SS1). Scale bar = 100 µm

A phylogenetic analysis of mitochondrial and nuclear gene sequences (namely COI and ITS1) on a worldwide dataset supported an ancient differentiation of this rotifer lineage into at least nine species, often sympatric, which were clustered into the morphologically recognised L, SM, and SS morphotypes (Gómez et al., 2002). Suatoni et al. (2006) suggested the existence of 14–16 species across the three clades, based on DNA sequence data and the high degree of concordance between genealogical and reproductive isolation (based on experimental trials). Supporting this diversity, genetic and phenotypic data were then used to describe two additional species: Brachionus manjavacas Fontaneto, Giordani, Melone & Serra, 2007, within the L type (Fontaneto et al., 2007) and Brachionus koreanus Hwang, Dahms, Park & Lee, 2013 within the SM type (Hwang et al., 2013). Finally, another species, already described as Brachionus asplanchnoidis Charin, 1947, were known to be a member of the group (Kutikova, 1970; Segers, 1995; Jersabek & Bolortsetseg, 2010); however, no DNA sequences could be unambiguously attributed to it.

Thus, a sizable amount of analyses using molecular, morphological, ecological, and reproductive isolation suggests that there are many putative species within the B. plicatilis complex. However, only six species have been formally described (in chronological order): B. plicatilis s.s., B. rotundiformis, B. asplanchnoidis, B. ibericus, B. manjavacas, and B. koreanus, respectively, by Müller (1786), Tschungunoff (1921), Charin (1947), Ciros-Pérez et al. (2001a), Fontaneto et al. (2007), and Hwang et al. (2013). Nevertheless, there are additional clades that may correspond to putative new species and that have been designated by the scientific community simply as “Brachionus sp. ‘Locality’”, where ‘Locality’ refers to the place where the samples were first collected. Examples of this designation include Brachionus sp. ‘Almenara’ (Ortells et al., 2000; Gómez et al., 2002), Brachionus sp. ‘Nevada’ (Gómez et al., 2002), and Brachionus sp. ‘Mexico’ (Alcántara-Rodríguez et al., 2012).

In an effort to clarify the systematics of the B. plicatilis species complex, we present an analysis of the most extensive dataset on genetic diversity in the species complex. The first aim of our contribution is to provide a clear phylogenetic structure to support identification and designation of species in the complex through the use of several approaches in DNA taxonomy. Our second aim is to present a study of the evolutionary relationships among the species in the complex for a comparative analysis exploring the phylogenetic signal of biological traits and correlations among species-specific traits of the different species. The B. plicatilis species complex is by far the most extensively studied group of rotifers, and these animals have been used to investigate a wide variety of phenomena including ecological interactions (Ciros-Pérez et al., 2001b, 2004, 2015; Montero-Pau et al., 2011; Gabaldon et al., 2015), toxicology (Serrano et al., 1986; Snell & Persoone, 1989; Dahms et al., 2011), osmoregulation (Lowe et al., 2005), local adaptation (Campillo et al., 2009; Alcántara-Rodríguez et al., 2012), the evolution of sex (Carmona et al., 2009), phylogeography (Gómez et al., 2000, 2007; Mills et al., 2007), ageing (Snell et al., 2015), and evolutionary processes (Stelzer et al., 2011; Fontaneto et al., 2012; Tang et al., 2014a). In addition, due to the ease and low cost of producing highly dense cultures of these rotifers, members of this species complex have been widely used in aquaculture as a source of live feed for larval crustaceans and fishes (Fukusho, 1983; Watanabe et al., 1983; Lubzens & Zmora, 2003). We make use of this information to provide a first assessment of the evolutionary trajectories of biological and ecological traits in the B. plicatilis species complex.

Methods

Data collection

We gathered all the DNA sequences for COI (Cytochrome Oxidase c subunit I) and ITS1 (Internal Transcribed Spacer 1) from members of the B. plicatilis species complex that were available in GenBank in March 2015. To ensure the quality of the data, we removed short sequences (4 sequences shorter than 300 bp were removed from the COI dataset), confirmed that the COI sequences lacked internal stop codons (given that NCBI did not do it automatically for the older sequences), investigated that the maximum uncorrected genetic difference among the sequences was less than 40%, and verified that the best BLAST hit for each sequence was from a rotifer of the genus Brachionus. This resulted in the retention of 811 COI and 184 ITS1 sequences. In addition, we sequenced COI and ITS1 from a total of 449 wild—caught individuals or existing lab strains, using DNA extraction and gene amplification protocols established for the species complex more than a decade ago (Gómez et al., 2002). The full list of 1273 isolates used for the study and the GenBank accession numbers of their COI and ITS1 sequences are provided in Supplementary File S1. All newly obtained sequences were deposited in GenBank with accession numbers from KU299052 to KU299752. We did not include sequences from clades 15 and 16 of Suatoni et al. (2006), as they seem to be outside the species complex, they have never been found again, no voucher or lab cultures exist, and no additional information is available for them.

In addition to DNA sequence data, we collected contextual data for all 1273 isolates, when available. These data included the name of the water body where they were found, the country and continent of collection (following the divisions of the Taxonomic Database Working Group, TDWG, by Brummitt, 2001), geographic coordinates, and habitat type (either coastal system or continental saltwater body). This was done by scanning the literature mentioning the isolates, and by searching through our personal records in the cases when the samples were originally collected by one of the authors. In addition to these ecological and geographical data, we included information on body length, genome size, either from the literature, or by measuring them specifically for this study.

Phylogenetic reconstructions

Analyses of the phylogenetic relationships among isolates of the B. plicatilis complex were performed on three datasets: COI, ITS1, and the concatenated COI + ITS1 dataset. For the three datasets, the analytical steps were the same and included alignment, selection of the best evolutionary model, and phylogenetic reconstructions through Maximum Likelihood (ML) and Bayesian Inference (BI). For the outgroup, we selected one isolate of the congeneric Brachionus calyciflorus Pallas, 1766 for which both COI and ITS1 existed (isolate XZ8: GU012801, GU232732, Xiang et al., 2011).

Alignments were straightforward for COI, whereas the most reliable alignment for ITS1 was obtained with MAFFT v6.814b using the Q-INS-I algorithm (regarded as the optimal strategy for ribosomal markers; Katoh et al., 2009). Alignments were trimmed at the ends for a total length of 661 positions for COI and 359 positions for ITS1. Alignments were reduced to unique sequences by collapsing all identical sequences into one single sequence. These unique sequences are similar to haplotypes, but may underestimate diversity because sequences of different lengths (and with gaps for ITS1) were collapsed into a single unique sequence if they were identical in the overlapping part. In those cases, we used the longest sequence for the purpose of phylogenetic reconstruction. In order to avoid ambiguities between COI and ITS1 unique sequences, we used different prefixes: we named unique sequences for COI as numbers with ‘H’ as a prefix, and unique sequences for ITS1 as numbers with ‘h’ as a prefix.

The most appropriate evolutionary model for the COI and the ITS1 datasets was determined using ModelGenerator v0.85 (Keane et al., 2006) independently for each marker. The best model was identified as GTR+G+I in both cases.

Maximum Likelihood reconstructions were performed with PhyML 3.0 (Guidon & Gascuel, 2003) for the COI and ITS1 datasets. GTR+G+I with 4 gamma categories was implemented as an evolutionary model; support values were estimated through approximate Likelihood Ratio Test, aLRT (Guidon & Gascuel, 2003). For the concatenated dataset, RAxML v8 (Stamatakis, 2014) was used with default settings; the alignment was partitioned by gene and all parameters were estimated independently for each of the two partitions.

Bayesian Inference reconstructions were performed in BEAST v1.6.1 (Drummond et al., 2012) using the default settings except for: GTR+G+I as the site model, an uncorrelated lognormal relaxed clock, a Yule speciation tree prior with lognormal distribution of birth rate, 100 million generations, and trees saved every 10,000 generations. Effective sample sizes (ESS) were checked in Tracer v1.5 (Rambaut et al., 2013), and the consensus tree was obtained in TreeAnnotator v1.6.1 with a 20% burnin. For the concatenated dataset, all parameters were estimated independently for each partition.

DNA taxonomy

Three methods of DNA taxonomy were used to identify putative species from DNA sequence data (Fontaneto et al., 2015). For all methods, the outgroup was excluded from the analyses. Consistency among methods and among the three datasets was considered as increased confidence in the identification of the species in the B. plicatilis complex. In case of discordance in the amount of splitting, we chose to keep the smallest number of entities, in order to avoid over-splitting the species complex; thus, if a mistake is made in the identification of taxa, it is made in the direction of being more conservative in the amount of cryptic diversity.

The Automatic Barcode Gap Discovery (ABGD) was applied independently to the COI and ITS1 alignments to test for the existence of a barcode gap in the genetic distances and then to identify groups of individuals united by shorter genetic distances than the gap. These groups were considered to be equivalent to species (Puillandre et al., 2012). ABGD was used through its online tool (http://wwwabi.snv.jussieu.fr/public/abgd/abgdweb.html) with default settings. For COI, we considered only results obtained with prior intraspecific divergence higher than 1.5%, given what is known in rotifers for this marker (Fontaneto, 2014); for ITS1, given that there is no previous knowledge of prior intraspecific divergence, we explored all the possible prior intraspecific divergences available in the default settings. The ABGD method, based on genetic distances calculated in one marker, was applied only to the alignments of the single markers and not to the concatenated alignment.

The Poisson Tree Process (PTP) was applied to the three ML trees (COI, ITS1, and CO1 + ITS) to search for evidence of independently evolving entities akin to species, optimising differences in branching patterns within and between species (Zhang et al., 2013). PTP was used through its online tool (http://species.h-its.org/) with default settings for all three analyses: the output is reported from its ML and BI optimisation algorithms.

The generalised mixed Yule coalescent (GMYC) model was applied to search for evidence of independently evolving entities akin to species, optimising the threshold between within-species coalescent processes and between-species Yule processes on the branching patterns (Fujisawa & Barraclough, 2013). GMYC models were run on (i) the BEAST trees for the three alignments (COI, ITS, and CO1 + ITS), (ii) the ML trees made ultrametric (i.e. with branching patterns proportional to the evolutionary model and to time) through r8s using penalised likelihood and cross-validation to choose the optimal smoothing parameter among 1, 10, and 100 (Sanderson, 2003), and (iii) ML trees made ultrametric through the chronoMLP and chronos functions in the R v 3.1.2 (R Core Team, 2014) package ape v 3.2 (Paradis et al., 2004). Parts (i) and (ii) were performed as recommended by Tang et al. (2014b). All GMYC models were run with the R package splits v 1.0–19 (Ezard et al., 2009).

Further hypothesis testing and validation

We used several approaches to support the hypothesis that the new taxa identified by DNA taxonomic methods represent species.

First, we made a direct comparison of our putative species with the species that are already described in the complex (i.e. B. asplanchnoidis, B. ibericus, B. koreanus, B. manjavacas, B. plicatilis s.s., and B. rotundiformis). Our expectation was that species identified by DNA taxonomy would correspond to known species in the complex.

Second, we calculated uncorrected genetic distances between each pair of sequences in the alignments, and compared the distances within and among species with what is known in other rotifers and in animals in general. The expectation, in comparison to what is known in other rotifer species complexes, is to have a barcoding threshold in COI that is higher than the commonly accepted 3% for other animals (Hebert et al., 2003; Fontaneto, 2014).

Third, we checked whether the maximum genetic distances found in pairwise comparisons within each species were related to sample size (defined both as number of individuals and as number of unique sequences for each marker) for the same species. Given the possibility of a phylogenetic signal (Münkemüller et al., 2012) in the comparisons between species in the complex, we tested whether our data were phylogenetically structured using Pagel’s lambda (Pagel, 1999) and Blomberg’s K (Blomberg et al., 2003). We then used phylogenetic generalised least square (PGLS) analyses to account for the confounding factor of phylogenetic relatedness (Garamszegi, 2014). Values of Pagel’s lambda and Blomberg’s K of zero indicate no phylogenetic signal, which occurs when closely related species are not more similar than distantly related ones; values of one or higher indicate that closely related species are significantly more similar than expected (Kamilar & Cooper, 2013). In PGLS, the phylogeny is used to account for phylogenetic pseudoreplication in the statistical models. As a phylogeny for the PGLS, we used the one obtained from RAxML + r8s on the combined alignment of COI + ITS1 dataset, randomly pruned to one single sequence per species, with branch length transformations (lambda, delta, and kappa) optimised by maximum likelihood given the data and the model. The combination RAxML + r8s was chosen because it gave the lowest number of species with the smallest confidence interval according to all of the DNA taxonomy methods (see Table 1). There is, of course, the possibility of methodological biases due to uncertainties in the phylogenetic reconstructions. Therefore, to provide further support for the results obtained from the combined dataset, we repeated the analyses also using the phylogenies obtained from the single markers (Supplementary File S2). Concordance in the results, despite differences in the tree topologies that were obtained from the different phylogenetic reconstructions, would enhance the reliability of the results. For the statistical models, we used all the variables expressing count data (e.g. number of individuals and number of unique sequences) with their log-transformed values. Pagel’s lambda and Blomberg’s K values were estimated with the R package phytools v 0.4–31 (Revell, 2012); PGLS models were performed in the R package caper v 0.5.2 (Orme et al., 2013).

Table 1 Results of the different methods of DNA taxonomy

Using the same methods, we also tested whether a phylogenetic signal was present in the species complex in (1) habitat type (coastal waters vs. continental saltwater bodies), (2) body length (from measurements available in the original descriptions of the species), (3) genome size (as reported in Stelzer et al., 2011), (4) geographic range (as number of continents where the species has been found), (5) genetic diversity (as number of unique sequences relative to the number of analysed individuals), and (6) number of occurrences.

Results

Out of the 1273 isolates used in this study for COI and ITS1: the alignment for COI included 1223 isolates, collapsed into 275 unique sequences; the alignment for ITS1 included 481 isolates, collapsed into 45 unique sequences; the concatenated alignment included 431 isolates, collapsed into 174 unique sequences.

Phylogenetic reconstructions for each marker were highly congruent for Maximum Likelihood and Bayesian Inference (Figs. 2, 3, Supplementary Figs. S1–S4). The three known major groups of L, SM, and SS clades were supported, but not always with maximum confidence (Figs. 2, 3, Supplementary Figs. S1–S4). For the combined dataset (Fig. 4, Supplementary Fig. S5), BEAST failed to converge, and values of ESS were not higher than 200 for all parameters. Thus, no reliable phylogenetic reconstruction was obtained with a Bayesian approach on the combined dataset, potentially due to the contrasting topologies of the two markers for the deeper nodes and to the mitonuclear discordance between different individuals within each species (see below), preventing convergence (Figs. 2, 3).

Fig. 2
figure 2

Phylogenetic relationships of the 45 ITS1 haplotypes from 481 individuals in the Brachionus plicatilis species complex, according to Bayesian inference reconstructions. The consensus of 8000 sampled trees from Bayesian analysis run in BEAST is shown, displaying all compatible groupings and with average branch lengths proportional to numbers of substitutions per site under a GTR+I+G substitution model. Posterior probabilities from BEAST/support values as approximate Likelihood Ratio Test from PhyML are shown above each branch, but not for within-species branches; the ‘−’ symbol indicates support <0.90 for posterior probabilities and <0.80 for HLR tests. The complete trees with all haplotypes names and all support values are available as Supplementary Figs. S1 and S2. The three grey circles on basal nodes indicate the three main groups known in the species complex, namely Large (L), Small-Medium (SM), and Small (SS). Clade names are according to Table 2. The number of potential independently evolving units is consistent across the different methods in DNA taxonomy (see Table 1). Pairwise uncorrected genetic distances within each species are reported as median values (range minimum–maximum)

Fig. 3
figure 3

Phylogenetic relationships of the 275 COI haplotypes from 1223 individuals in the Brachionus plicatilis species complex, according to Bayesian Inference reconstructions. The consensus of 8000 sampled trees from Bayesian analysis run in BEAST is shown, displaying all compatible groupings and with average branch lengths proportional to numbers of substitutions per site under a GTR+I+G substitution model. Posterior probabilities from BEAST/support values as approximate Likelihood Ratio Test from PhyML are shown above each branch, but not for within-species branches; the ‘−’ symbol indicates support <0.90 for posterior probabilities and <0.80 for aLRT tests. The complete trees with all haplotypes names and all support values are available as Supplementary Figs. S3 and S4. The three grey circles on basal nodes indicate the three main groups known in the species complex, namely Large (L), Small-Medium (SM), and Small (SS). Clade names are according to Table 2. The number of potential independently evolving units within each species according to the different methods in DNA taxonomy (ABGD and GMYC on different chronograms) is reported as circles, with numbers of slices representing number of units (see Table 1). Results for PTP are not reported as this method produced an overestimation of units from the COI phylogenies (more than 50: Table 1). Pairwise uncorrected genetic distances within each species are reported as median values (range minimum–maximum)

Fig. 4
figure 4

Phylogenetic relationships among the 14 species of the Brachionus plicatilis species complex for which both COI and ITS1 are available. The tree was obtained from a RAxML run on combined alignments, made ultrametric with r8s, and pruned to include only one random terminal per species; bootstrap supports are from 100 replicates. The name of the six described species in the complex are reported on the tree. The original tree is available as Supplementary Fig. S5. Additional information on sample size, genetic diversity, ecological, and biological traits is reported for each species; not all information is available for all sequenced individuals. Body length and genome size data come from published literature, except for those marked with an asterisk, which were measured in this study. Maps depict the known distribution of each species at continental level (continents defined according to TDWG Level 1). Pagel’s lambda and Blomberg’s K are reported for each variable to estimate the phylogenetic signal. The symbol + for phylogenetic signal for habitat denotes that zero values were transformed to 0.00001 to avoid dealing with infinite ratios. Lambda (and K) for other variables not in figure is maximum COI genetic distances = 2.19 (1.05) and maximum ITS1 genetic distances = 1.97 (1.13)

DNA taxonomy

DNA taxonomy tools based on the three datasets provided estimates of cryptic species ranging from 14 to 67 (Table 1). Estimates based on COI ranged from 17 to 55. The minimum estimate of 17 (provided by ABGD) was well below the range of the most conservative estimate within the potential solutions from PTP (52–55 species) and GMYC (27–53 species). Using ITS1, all the methods consistently indicated at least 14 species (Table 1; Fig. 2). The GMYC model on ITS1 gave optimal solutions of 15 or 17, but 14 was consistently the most conservative estimate among the equally likely solutions within the 95% confidence interval for all the GMYC models (Table 1). For the concatenated alignment, estimates of the number of species ranged from 19 to 67 (Table 1): these results are the most variable, and thus they will not be considered further.

The most conservative estimate of 17 species from ABGD using COI sequences included all 14 species identified from ITS1, plus one species for which no ITS1 sequence was available (species SM9; Fig. 3), and two species (SM3 and L4) with two entities each instead of one (Fig. 3). The other methods provided more splits within seven of the 15 species (Fig. 3). Therefore, the most consistent number of lineages appears to be the estimate of 14 species obtained from ITS1, plus one single COI lineage for which no ITS1 sequence is available (species SM9 from Lake Turkana in Kenya). These 14(+1) potential species are also the main well-supported lineages that can be easily seen on the phylogenetic trees (Figs. 2, 3, 4), and six of them match the six species that have already been described in the genus: B. asplanchnoidis (L3), B. ibericus (SM1), B. koreanus (SM2), B. manjavacas (L2), B. plicatilis s.s. (L1), and B. rotundiformis (SS1).

In the 14 species for which both COI and ITS1 were available, no evidence was found of phylogenetic discordance between mitochondrial and nuclear phylogenies, that is of individuals harbouring COI of one species and ITS1 of another one (Fig. 5).

Fig. 5
figure 5

Tanglegram for all individuals for which both COI (left) and ITS1 (right) were available. Each phylogeny was obtained from the complete BEAST reconstructions (Supplementary Figs. S1 and S3) pruned in order to have only unique sequences. Polytomies were enforced when the topology was not congruent with that of Fig. 4. Dashed lines connect individuals in which COI and ITS1 co-occurred. Thick dashed lines represent instances of mitonuclear discordance (individuals sharing the same COI sequence but with different ITS1). Alternating grey and white areas under the dashed lines separate the 14 species, marked on the trees with their names

Evidence of independent biological entities

For COI sequences, maximum uncorrected genetic distances within the 15 putative species ranged from 0.3 to 13.3% (median = 3.79%, mean = 3.90%) (Fig. 3); distances between species ranged from 11.9 to 23.2% (median = 18.9%, mean = 18.6%). Distances between the species of the L group ranged from 13.6 to 22.1%, between the species of the SM group from 11.9 to 22.4%, and between the species of the SS group from 14.3 to 17.3%. Thus, all species of the L and SS group had within-species distances up to 13.1 and 13.3%, respectively (Fig. 3); these values are lower than the between-species distances, meaning that a barcoding gap existed. On the other hand, two of the species in the SM group (SM4 and SM5) had within-species distances below 3.3%, but between-species distances ranging from 12.4 to 14.5%, partially overlapping with the maximum values of the within-species distances, up to 13.3%, in other species in other parts of the tree (i.e. B. koreanus (SM2), B. rotundiformis (SS1), and L4: Fig. 3).

For ITS1 sequences, maximum uncorrected genetic distances within the 14 putative species ranged from 0.3 to 1.9% (median = 0.95%, mean = 0.95%; Fig. 2); distances between species ranged from 2.5 to 22.0% (median = 15.6%, mean = 13.9%). Distances between the species of the L group ranged from 2.5 to 9.5%, between the species of the SM group from 3.7 to 10.6%, and between the species of the SS group from 6.4 to 7.0%.

The number of unique COI sequences and maximum genetic distances in COI within each species, both metrics of potential genetic diversity for each species, were significantly correlated to the number of analysed individuals (PGLS: t 12 = 5.71, P < 0.001; t 12 = 3.05, P = 0.010, respectively). The same pattern was found for ITS1 sequences, with both the number of unique sequences (PGLS: t 12 = 4.4, P = 0.001) and maximum genetic distances (PGLS: t 6 = 2.7, P = 0.033) related to the number of individuals. Among the analysed variables, the number of unique sequences for COI and for ITS1 and the number of individuals found in each species had a low phylogenetic signal (Fig. 4). On the other hand, the phylogenetic signal was strong for the maximum genetic distances both for COI (Pagel’s lambda = 2.19, Blomberg’s K = 1.05) and for ITS1 (Pagel’s lambda = 1.97, Blomberg’s = 1.13), with the species in the L group exhibiting, on average, higher diversity than the species in the SS and in the SM group.

The number of continents where each species was found had a strong phylogenetic signal (Fig. 4), with species of the SM group being present in a lower number of continents than species of the L or SS group. Moreover, geographic distribution, expressed as the number of continents where each species was found, was not related to the number of individuals for each species (PGLS: t 12 = 1.23, P = 0.242).

Body length had a strong phylogenetic signal (Fig. 4), with species of the L group effectively larger than those of the SM group, themselves larger than those of the SS group. Body length seems to be significantly correlated to genome size (PGLS: t 7 = 5.8, P < 0.001), whereas genome size does not have a strong phylogenetic signal (Fig. 4).

The results obtained on the phylogeny obtained from the combined datasets were qualitatively supported in the tests on comparative analyses using the topology of either only COI or ITS1 phylogenies (Supplementary File S2); the results on phylogenetic signals were qualitatively supported using the COI phylogeny, whereas they were not that clear when using the topology of the ITS1 phylogeny (Supplementary File S2).

Discussion

Despite the importance of the B. plicatilis species complex in basic research and aquaculture, the systematics and taxonomy of this group have remained unclear. Cryptic species complexes are, by definition, a set of closely related species that share very similar morphological traits, thus, deciphering the diversity of these complexes has been difficult because of morphological stasis (Campillo et al., 2005). The morphospecies criterion used in taxonomy—identifying groups of individuals with typical morphological characteristics distinguishable from other groups—is usually the first approach for diversity studies. However, use of morphological attributes alone to differentiate species has limitations, especially in rotifers and other microscopic animals with few morphological features (Tang et al., 2012) and phenotypic plasticity such as cyclomorphosis and inducible defences (Gilbert & Stemberger, 1984; Sarma et al., 2011). Thus, as in the case of the B. plicatilis species complex, the use of tools from DNA taxonomy on more than one marker may be informative, adding a genealogic and phylogenetic concept to the approaches used to define species in the complex.

Overall, our extensive analyses of the genetic diversity in COI and ITS1 sequences within the B. plicatilis complex revealed, as a conservative estimate, 15 species: four belonging to the L group (B. asplanchnoidis, B. manjavacas, B. plicatilis s.s., and clade L4), two belonging to the SS group (B. rotundiformis and clade SS2), seven belonging to the SM group (B. ibericus, B. koreanus, and clades SM3-7), and two (SM8 and SM9) for which the inclusion in the SM group is suggested but needs to be confirmed. Six of these species were already described before this study, and the correspondence with the previously used names of Brachionus sp. ‘Locality’ for all the species is reported in Table 2. The species identified by our DNA taxonomy approach are in complete agreement with the taxa already identified by Gómez et al. (2002) and Suatoni et al. (2006).

Table 2 List of the 14 + 1 clades with unambiguous evidence of cryptic species in the Brachionus plicatilis species complex, and correspondence with described species and unofficial names that are used in the literature

Moreover, our study offers a basis for further analyses on the species complex, providing a phylogenetic structure for comparative studies. The phylogeny shown in Fig. 4 can be downloaded in Supplementary File S3 and from FigShare (10.6084/m9.figshare.2077531), for further phylogenetic comparative analyses on other biological traits.

Support for species identity

We chose the most conservative estimates of species diversity in our DNA taxonomy approach to identify species. Our rationale was to avoid dividing the species complex into taxa that could not be well supported. Different approaches from DNA taxonomy provided different estimates of diversity in the complex. Previous comparisons between different methods (Tang et al., 2012; Dellicour & Flot, 2015) usually relied on smaller datasets for each species complex or on simulated data, whereas our study can be used also as a caveat for the uncertainties in phylogenetic-based approaches on DNA taxonomy from single markers. Apparently, ABGD seems to be more robust for large datasets than PTP or GMYC.

Six formally described species in the complex perfectly matched the species highlighted by ABGD, using either ITS1 or COI datasets. Two of the still unnamed species (SM3 and L4) could be unambiguously delimited as unique species with the ITS1 but not with the COI dataset, for which at least two species were found (Fig. 3). This is consistent with previous results showing that COI is more rapidly evolving and thus apparently showing more taxonomic diversity than other commonly used markers (Tang et al., 2012).

Uncorrected genetic distances within and between species for the two markers are rather high in comparison with what is known in other animals (Hebert et al., 2003; Pfenninger & Schwenk, 2007). Wide variability in the thresholds for the barcoding gap is known across phyla and even within phyla, and rotifers were already known to have a COI barcoding threshold much higher than the commonly accepted 3% (Fontaneto, 2014). The DNA taxonomy approach that we used was able to identify a clear and unambiguous barcoding gap in ITS1, with maximum genetic distances within species of 1.9% and minimum genetic distances between species of 2.5%. In contrast, the situation for COI was not that clear: the maximum within-species genetic distance of 13.3% was higher than the minimum between-species genetic distance of 11.9%. Thus, a strict barcoding approach in COI may be misleading if we assume the existence of 15 species in the complex. Overall, COI did not score coherently well as a marker for DNA taxonomy in this species complex, given that each approach provided different and often non-overlapping results (Table 1; Fig. 3). Previous analyses had shown that COI provided more than 15 species in the complex (e.g. Fontaneto et al., 2009; Malekzadeh-Viayeh et al., 2014). Yet, both COI and ITS1 provide congruent monophyletic lineages, at least for the 14 species with both markers available. To avoid the possibility of over-splitting the complex, we suggest use of ITS1 as a more reliable marker for DNA taxonomy in the B. plicatilis complex. Using only COI as a molecular marker will be fine to identify new individuals within the currently delimited 15 species; if COI is used to support additional species, this should always be done in addition to other approaches from morphology, physiology, ecology, or with cross-mating experiments. Given that COI is more variable than ITS1, the former is still the best marker to be used for exploration of population genetic structure within species and phylogeography. Overall, some species in the complex (e.g. B. plicatilis s.s. and SM4), which are well sampled with hundreds of sequenced individuals, exhibit rather shallow phylogenetic structure, with a relatively recent least common ancestor. However, others species (e.g. B. asplanchnoidis, B. koreanus, B. rotundiformis, and SM3) show deep within-species genetic divergences, regardless of sample size. The reason for such differences is still unknown and deserves further investigation.

Another approach that can be used to support the existence of species is to apply the biological species concept (Mayr, 1963), which defines a species as a population or group of populations that have the potential to interbreed and produce fertile offspring. Detection of cryptic species by means of direct tests on reproductive isolation is challenging because experimental cross-mating trials in the laboratory may result in mating that would not occur in nature, as observed during the tests of reproductive isolation carried out by Suatoni et al. (2006). Nevertheless, the 14 species for which we had both COI and ITS1 from several individuals revealed absolutely no evidence of potential hybrids. That is, despite extensive geographic overlap in distribution and habitat, and therefore potential opportunities for cross-fertilisation, we found no evidence of hybrid individual with phylogenetic discordance between mitochondrial and nuclear markers (Fig. 5). This observation provides strong, indirect support for the existence of reproductive barriers acting in the field among the 14 species.

In contrast, within each of the species, we observed phylogenetic discordance in COI and ITS1 sequences between individuals. For example, some individuals that share the same COI sequence have different ITS1 sequences in B. asplanchnoidis, B. plicatilis s.s., B. rotundiformis, and SM4 (tips connected with thick dashed lines in Fig. 5). Such free segregation of markers is exactly what should be expected when comparing individuals of the same species and supports the idea of the 14 (+1) species as actual arenas for recombination (Doyle, 1995; Flot et al., 2010).

The absence of hybrids in the B. plicatilis complex is in stark contrast with what is known in the B. calyciflorus complex, for which a high level of hybridization and mitonuclear discordance between cryptic species is present (Papakostas et al., 2016). The reasons for such differences in the level of hybridization in the two species complexes of the same genus are still unknown and deserve further investigation.

Ecology and geography

Brachionus plicatilis has traditionally been considered a cosmopolitan species found in almost any type of saline aquatic habitat. The identification of B. plicatilis as a species complex suggested the possibility that each cryptic species represented an independent lineage with a limited geographic distribution and a narrower ecological tolerance. This general concept has received recent support for other cryptic species groups in Rotifera (Obertegger et al., 2014; Gabaldón et al., 2016).

A detailed investigation into the geographic distribution of genetic lineages of the cosmopolitan cryptic species B. plicatilis s.s. revealed existence of four clades associated to four geographic regions, one in North America, two in Europe, and one in Australia, with a high amount of variability in genetic distance explained by geographic distance (R 2 = 0.91) (Mills et al., 2007). Such results reinforced the idea that each member of the complex may have a limited geographic distribution. Yet, our results indicate that most species within the complex are indeed cosmopolitan: all the species with at least 140 isolates sampled were found in five or more continents (Fig. 4). Three species were found in one continent only, but this could be due to their small sample sizes (<34 individuals). However, two species with very small sample sizes (SS2 with 8 and SM5 with 13 individuals) were found in two continents, and the most widespread species, B. rotundiformis found in 7 continents, had a relatively low sample size of 58 (Fig. 4). Being present in more than two continents cannot be used as an argument towards limited geographic distribution, even if some geographical structure may exist at the regional level; a pattern that was not specifically explored in this study. Yet, distributional patterns and processes in microscopic animals are known to act at different spatial scales than in macroscopic organisms (Fontaneto, 2011), with rotifers having a larger distribution at the global scale than macroscopic animals (Fontaneto et al., 2006; Segers & De Smet, 2008), together with strong spatial patterns in the structure of genetic diversity at the local and regional scale (De Meester et al., 2002; Mills et al., 2007).

Regarding ecological correlates of diversity in the B. plicatilis complex, our results did not clearly support the concept of niche conservatism (Wiens & Graham, 2005). In several species of the complex, the preference for either coastal or inland habitats seems to have a clear signal from the visual inspection of the tree (Fig. 4), but the explicit tests for phylogenetic signal did not show such evidence. The co-occurrence of three or more species of the B. plicatilis complex in the same pond (Ortells et al., 2003) seems to be in contrast with niche conservatism given that niche conservatism would prevent co-occurrence of closely related species. In support of a potential mechanism allowing co-occurrence even in case of strong niche conservatism, seasonal species replacement has been observed (Gómez et al., 1995). A detailed exploration of ecological correlates of diversity should be performed on samples collected with this idea in mind in order to minimise potential sampling bias, which was difficult to control for in our general analysis.

Body length and genome size

One of the first indications of phenotypic differences among strains, supporting existence of cryptic species, was due to differences in body length. Three main groups were identified based on this criterion: large (L), medium (SM), and small (SS), which have already received support from other phylogenetic studies (Gómez et al., 2002; Suatoni et al., 2006). Our phylogenetic reconstruction confirmed these groups to be monophyletic and provided evidence of a strong phylogenetic signal in body length, which is the trait with the highest signal among the ones we tested: closely related species are indeed similar in body length and, with Pagel’s lambda and Blomberg’s K higher than unity, they are even more similar than expected under a Brownian motion model of trait evolution (Kamilar & Cooper, 2013).

Body length seems to be related to genome size; yet, our approach did not include within-species variability in body length and genome size, which is known to be large for example in B. asplanchnoidis (Stelzer et al., 2011). Using only mean values for each species may be why our results conflict with the lack of correlation found by Stelzer et al. (2011). Thus, the relationship between genome size and phenotypic traits should be explored in more detail: e.g. including additional traits such as egg size (as was done by Stelzer et al., 2011) or trophi size, and expanding the dataset for the analyses using an approach that is able to disentangle the within-species and the between-species contribution to the variability. Such analyses will surely provide interesting inferences on the evolutionary trajectories of phenotypic differences in rotifers and in animals in general.

Conclusions

This study represents the first of its kind to employ a worldwide effort of researchers to unravel the phylogeny of a cryptic species complex. This achievement was possible due to several factors: years of studies on a species with commercial importance, its ease of culture, and its importance as a model system for other avenues of research. If other rotifer species possess a similarly high level of genetic diversity, our taxonomic knowledge of this phylum is minuscule.

We can also infer that the same situation could be found in most microscopic animals for which few resources or little effort has been invested in taxonomy and for which morphological features are not readily discernable. Thus, we suggest that diversity in microscopic animals is higher than currently estimated (Appeltans et al., 2012; Curini-Galletti et al., 2012). Such revolution may greatly affect estimates of species richness (Costello et al., 2012).