Introduction

Sympetrum Newman 1833 (Libellulidae) contains over 60 species and occurs on every continent except Australia (Needham et al. 2000; Walker and Corbet 1975). The genus has few tropical species (7 species endemic to Central and South America and 1 species in sub-Saharan Africa) and is most species-rich in the Holarctic (∼13 species in North America and nearly 40 species in Europe or Asia). Most species of Sympetrum are predominantly red and are relatively small for dragonflies (most <40 mm long). Sympetrum species are found most often in habitats with slow or stationary water such as bogs, fens, or marshes where they are most typically observed perched on reeds or other vegetation.

The taxonomic limits of Sympetrum are problematic because the description of the genus appears to include no synapomorphies. The original generic description (Newman 1833) is vague at best. More recent re-descriptions (Needham et al. 2000; Walker and Corbet 1975) of the genus outline the taxon as a unique combination of non-unique characters. The most striking characteristic of the genus is an expanded, bilobed prothorax fringed with long setae; this character, however, is shared with several other libellulid genera such as Celithemis Hagen, Erythemis Hagen, Erythrodiplax Brauer, Leucorrhinia Brittinger, Nesogonia Kirby, and Pachydiplax Brauer (Walker and Corbet 1975). Some other characteristics of Sympetrum (e.g., slender legs, rounded head, slightly compressed pterothorax; Walker and Corbet 1975) are descriptive, but also are present in many other libellulid genera.

The lack of synapomorphies for Sympetrum could explain the long-standing debate over which taxa should be included in the genus. In 1936, Needham and Fisher erected Tarnetrum to contain two species previously placed in Sympetrum, S. corruptum (Hagen) 1861 and S. illotum (Hagen) 1861. Tarnetrum was successively expanded to include other species such as S. nigrocreatum (Calvert) and S. gilvum (Hagen) of South America (Borror 1945), S. madidum (Hagen) of western North America (Walker and Corbet 1975), S. fonscolombii (Selys), a migratory species from Africa (Schmidt 1987), and S. roriamae DeMarmels and S. villosum Ris, also of South America (Carle 1993). The status of Tarnetrum as a valid genus is not without controversy, as some authors later treated it as a subgenus of Sympetrum (Cannings 1981; Dunkle 2000; Needham et al. 2000; Walker and Corbet 1975) or considered it invalid (Gloyd and Wright 1959; Kormondy 1958, 1960; Paulson 2009). Further questions concerning the limits of Sympetrum involve the monotypic Hawaiian endemic Nesogonia Kirby, which also has been suggested as a possible sister to Sympetrum or may even belong within the genus (Carle 1993; Kiauta 1969).

The relationships within Sympetrum are even more unsettled than the validity of Tarnetrum or Nesogonia, because no phylogenetic work on Sympetrum has been done. Recent taxonomic work has focused on alpha taxonomy, with several new species described in the last 20 years (Cannings and Garrison 1991; Carle 1993; DeMarmels 1994, 2001; Han and Zhu 1997), and others invalidated (Pilgrim and von Dohlen 2007). North American species have been arranged into three ‘sections’ based on similarity of their genitalia (Walker and Corbet 1975), and some have been placed into a subgenus Kalosympetrum (Carle 1993). These North American groups are likely artificial, however, because the morphological similarities between Palearctic species and Nearctic species have never been addressed with respect to relationships. The Nearctic species have been regarded as a single lineage even though some North American species have greater morphological similarities to some Palearctic species than to the other North American groups. To date, there has been no phylogenetic analysis to test the monophyly of Sympetrum or to delineate species-groups based on clades instead of geographic range.

Apart from the taxonomic issues surrounding the genus, Sympetrum constitutes an interesting subject from a biogeographic perspective. The nearly global distribution of the genus could suggest that it is an ancient lineage whose distribution could be due to global vicariance events. Putative fossils of Sympetrum, however, are known only from a few species and from European specimens, and only as far back as the Upper Miocene (Gentilini 1988) when the positions of continents were very similar to the present. The presence of multiple fossil species from the Miocene would suggest that the genus is older, possibly dating to earlier periods when the positions of landmasses, and connections between them, were very different. Regardless of the age of Sympetrum, the historical biogeography of the genus is likely complex. The strong flight capabilities of Sympetrum species imply that dispersal could also play a major role in the current distribution of the genus. Sympetrum currently contains two migratory species (S. corruptum and S. fonscolombii), but the possibility that these two species belong in another genus (see above) complicates the potential role of migratory ancestors as dispersal agents for Sympetrum. Previous biogeographic work in Calopteryx damselflies, which are predominately Holarctic like Sympetrum, found that the genus arose in the Palearctic with the Nearctic taxa resulting from a single dispersal event to North America (Misof et al. 2000). The center of diversity for Sympetrum is the Palearctic, but the center of diversity for a lineage may not necessarily be the center of origin for that group (Lomolino et al. 2005). The morphological affinities between different groups of North American species and Eurasian species hint at the possibility of a complex biogeographic history with multiple dispersal events.

The goal of this study was to develop a robust phylogeny of Sympetrum to test the monophyly of the genus and address the historical biogeography of the group. Our taxon sample included 40 species, including the monotypic genus Nesogonia, and outgroup taxa from several related Libellulidae genera. Data were collected from morphological characters and DNA sequences of the nuclear genes elongation factor-1α (EF1) and the second ribosomal internal transcribed spacer (ITS2), and the mitochondrial genes 16S/tRNA-val/12S, and cytochrome oxidase I (COI).

Materials and methods

Specimen collection, vouchers, and taxon selection

We sampled recently collected specimens from museum loan as well as freshly collected material. Fresh material was stored in acetone for approximately 24 h to remove moisture and lipids. Specimens were then stored dry in polypropylene envelopes. Loan material included both acetone-dried specimens as well as specimens collected into 95-100 % ethanol. Voucher specimens were deposited into the Department of Biology Insect Collection, Utah State University, Logan, UT, or returned to the loaning institution or individual.

The 35 Sympetrum species included in the study represented all previously designated species groups and the full geographic range of the genus (Table 1). Two populations of Sympetrum danae were included because this widely distributed species actually may represent two cryptic species. The Hawaiian, monotypic genus Nesogonia was included as a possible synonym of Sympetrum, as well as five of the eight species formerly included in Tarnetrum; we follow Garrison et al. (2006) in regarding Tarnetrum as a junior synonym of Sympetrum. Three outgroup genera (Celithemis, Leucorrhinia, and Rhyothemis) were also represented: Celithemis and Leucorrhinia are the closest relatives of Sympetrum, and Rhyothemis variegata (Linnaeus) is a more distant outgroup (Pilgrim and von Dohlen 2008).

Table 1 Specimen collection data

Morphological characters

Twenty-five morphological characters were coded for all Sympetrum taxa and outgroups Celithemis and Leucorrhinia (see Appendix for character descriptions and data matrix). All characters were treated as unordered and equally weighted. Although the entire morphology of these taxa was investigated, the most informative characters derived predominantly from male and female genitalia. Penises were dissected from ethanol-softened specimens for examination.

Molecular techniques

DNA was isolated from fresh specimens stored in 95 % ethanol or from relatively recent, dried museum specimens. To extract DNA, a middle and hind leg of each specimen was removed, leaving the remainder of the specimen as voucher. DNA was extracted with the High Pure PCR Template Preparation Kit (Roche Pharmaceuticals, Indianapolis, IN).

Sequence data were collected from the nuclear genes EF-1α and ITS2, and the mitochondrial genes 16S, tRNA-valine, 12S, and COI. We also obtained sequences for the ITS1 region, but this locus was too variable to be aligned with confidence in homology; thus, we elected not to include it. ITS1 should be informative, however, for more focused analyses within species groups or closely related species. For EF-1α, 16S, tRNA-valine, 12S, and COI, the PCR conditions, primers, and thermal cycler programs followed those described in Pilgrim and von Dohlen (2008). For ITS2, the PCR conditions, primers, and thermal cycler programs followed those described in Pilgrim and von Dohlen (2007). PCR products were visualized on agarose gels stained with ethidium bromide, and successful amplifications were cleaned using standard isopropanol purification. PCR amplifications of the 16S through 12S gene of Celithemis elisa consistently failed, as did amplifications of the EF-1α gene of C. eponina; therefore, the 16S/12S sequence of C. eponina and the EF-1α sequence of C. elisa were pooled for the combined genetic analysis and are hereafter referred to as Celithemis sp.

DNA sequencing reactions were performed using either the ABI Big Dye Terminator Cycle Sequencing Ready Reaction Kit (PE Applied Biosystems, Foster City, CA) or the DYEnamic ET Terminator Cycle Sequencing Kit (Amersham Biosciences, Piscataway, NJ) following the manufacturer’s protocols. Purified products were analyzed with either an ABI Prism 377 Genetic Analyzer or ABI 3730xl DNA Analyzer. All sequence data have been deposited in GenBank (see Table 1 for accession numbers).

All PCR products were sequenced in both directions and were assembled into complete contigs using Sequencher v4.1-4.8 (Gene Code Corp., Ann Arbor, MI). COI and EF-1α data sets were aligned in Sequencher. Introns in the EF-1α data set were removed prior to alignment. The 16S-tRNA-valine-12S and ITS2 data sets each were aligned in MUSCLE (Edgar 2004) with minor editing post-alignment. Most indels in the alignments were only 1-2 bp in length, and we did not remove any portions of the data sets because of unreliable alignment.

Phylogenetic analyses

For model-based inference, the appropriate models of evolution for individual genes were determined with MrModeltest v2.2 (Nylander 2004) (GTR + I + G in all cases). Model-based analyses were performed on the combined-gene data set. Bayesian inference (BI) was performed with MrBayes v3.1.2 (Ronquist and Huelsenbeck 2003). Data were structured into six partitions: 16S + tRNA-val + 12S, EF-1α codon positions 1 + 2, EF-1α codon position 3, COI positions 1 + 2, COI position 3, and ITS2 (six partitions). The partition containing EF-1α codon positions 1 + 2 was modeled as nst = 2 and nucleotide frequencies as equal; all other partitions were modeled as nst = 6 and base frequencies estimated. Analyses included two independent runs with three heated and one cold chain in each run. Trees were sampled every 1,000 generations for 10 million generations. Convergence, ESS values, and burn-in were assessed with Tracer v1.5 (Rambaut and Drummond 2007). Maximum-likelihood (ML) analyses were performed with Garli v2.0 at molecularevolution.org. The combined-gene data set was analyzed with partitions modeled as in strategy (ii) above, except with no invariant rate category; base frequencies and proportion of invariant sites were estimated. Two analyses of 500 bootstrap replicates were performed.

Analysis of combined molecular and morphological data sets was performed under BI. Morphological data were treated as a seventh, unlinked data partition with the model set to the standard Markov k model used for morphological characters in MrBayes.

Clock-constrained Bayesian inference and divergence time estimation

Clock-constrained phylogenetics and nodal age estimates on the combined-gene data set were performed with BEAST v1.6.2 (Drummond and Rambaut 2007), after preparing the input xml file with BEAUti v1.6.2 (in the BEAST package). Data were partitioned as in strategy (ii) above. Substitution models were set to GTR + G [following the recommendation of Stamatakis (2006) to omit the invariant category] for the mitochondrial RNA and nuclear ITS2 partitions. Models for protein-coding COI and EF-1α were set to SRD06 (Shapiro et al. 2006), which partitions the data into 1st + 2nd vs. 3rd codon positions and applies the HKY model with rate variation. All model parameters were unlinked across partitions, the clock model set to uncorrelated lognormal, and tree prior set to Yule process. Putative fossils of Sympetrum have been described from Late Miocene deposits (Gentilini 1988). However, there are two issues regarding their utility as calibrations for the dating analysis. First, in our opinion, these fossils (consisting solely of wings) are only dubiously assigned to Sympetrum. While they are definitely Libellulidae, we strongly suspect they are not Sympetrum. Second, they bear no affinity to any particular extant Sympetrum species, and thus cannot be assigned to an internal node in the phylogeny. At best, they place an absolute minimum date for the origin of the genus, which is almost certainly much older. Therefore, to calibrate the molecular clock, we used a divergence date estimated in Ware et al. (2008). These authors dated a phylogeny of Libelluloidea from mitochondrial and nuclear rRNA data, calibrated with multiple fossils applied to family-level and older nodes. The age of the node subtending Sympetrum, Leucorrhinia, and Celithemis was estimated at 65 Mya; thus, we set the treeModel.rootHeight in our analysis to a mean of 65.0 and standard deviation of 2.0. Priors were left at default values except for .mu priors and yule.birthRate, which were set to normal distributions based on results from preliminary runs. Two final runs of 50 million generations were performed, with auto-optimize operators enabled and trees and parameters sampled every 10,000 generations. In addition, the analysis was run twice from an empty alignment to assess the influence of the priors on the posterior distribution. Tree files were combined with LogCombiner 1.6.2, and the consensus tree was calculated with TreeAnnotator 1.6.2 (both programs included in the BEAST package).

Results

Phylogenetic analyses

As the 16S, tRNA-val, and 12S loci were all amplified and sequenced together, these loci were also aligned together to produce an alignment of 1,680 characters with 483 variable sites. The COI alignment consisted of 752 bases with 254 variable sites. For nuclear loci, the EF-1α alignment of 705 bp had 78 variable characters, and the ITS2 alignment of 383 bp had 153 variable characters.

BI and ML analyses of the genetic data set recovered similar topologies (Fig. 1) showing a monophyletic Sympetrum with six strongly supported clades (PP > 95 %), each containing four to eight species (Fig. 1). We have designated each of these clades as species groups named for the most senior species in each group: danae group, fonscolombii group, flaveolum group, infuscatum group, pedemontanum group, and vulgatum group (Table 2). Our designated fonscolombii group includes species sometimes placed in the genus Tarnetrum, but does not include all the Sympetrum species that have been attributed to Tarnetrum, such as S. illotum and S. madidum. Bayesian and ML analyses lacked either support or resolution for relationships among the danae, flaveolum, infuscatum, pedemontanum, and vulgatum species groups (Fig. 1). These model-based analyses nested Nesogonia within Sympetrum (between the fonscolombii group and the rest of Sympetrum) with high posterior probability. BI analysis of the combined molecular and morphological data set produced an identical topology and similar PPs as with genetic data alone.

Fig. 1
figure 1

Sample post-burnin tree from Bayesian analysis of the combined-gene (EF-1α, ITS2, 16S, tRNA-valine, 12S, COI) data set. Numbers preceding the nodes represent BI posterior probabilities and ML bootstraps, respectively. Unsupported or unresolved nodes are marked with a hyphen. Gray circle 1 denotes Sympetrum sensu lato. Gray circle 2 denotes Sympetrum sensu stricto

Table 2 Sympetrum species groups as supported by phylogenetic analysis

Divergence estimates and historical biogeography

Relaxed-clock BI produced a consensus topology with fully resolved relationships between species groups, although PP values for these relationships were below 0.9 (Fig. 2). This analysis also placed Nesogonia as sister to Sympetrum (but without support). Divergence time estimates suggest that Sympetrum arose nearly 50 million years ago (MYA), with the fonscolombii group being the oldest species group in the genus. These date estimates also indicate that the remaining five Sympetrum species groups arose ∼30 to 40 MYA, with the ancestor of the infuscatum group diverging ∼37.8 MYA, the flaveolum group diverging ∼34.9 MYA, the danae group diverging ∼32.9 MYA, and the pedemontanum and vulgatum groups diverging ∼31.8 MYA.

Fig. 2
figure 2

Chronogram from the Bayesian relaxed-clock analysis of the combined-gene data set in BEAST. Gray circles denote nodes with 0.90 – 1.0 PP. Gray boxes show estimated divergence dates (MYA) for ancestors of species groups defined in this study

The lack of well-supported relationships between species groups presents some difficulty for inferring the historical biogeography of Sympetrum, but the strongly supported species groups and the geographic ranges of their extant taxa provide some information. The fonscolombii group has extant taxa that are migratory (S. corruptum and S. fonscolombii) and that cover a broad geographic range (S. corruptum: most of North America through Central America into northern South America; S. villosum: much of South America; S. fonscolombii: most of Africa and Europe). The infuscatum and pedemontanum groups have Palearctic taxa only, but the danae, flaveolum, and vulgatum groups have Nearctic and Palearctic extant species. Within the danae group, S. costiferum and S. semicinctum are exclusively Nearctic, S. croceolum, S. depressiusculum, S. speciosum, and S. uniforme are exclusively Palearctic, and S. danae encompasses a Nearctic lineage and a Eurasian-Beringian lineage. The vulgatum group contains Palearctic-restricted taxa such as S. meridionale, S. sanguineum, S. striolatum, and S. vulgatum, but if the Nearctic species S. signiferum and S. vicinum are included in this species group (strongly supported in BI), then the species group would be considered to have a Holarctic distribution. In the flaveolum group, only S. flaveolum has a Palearctic distribution, while the rest of the taxa in the species group are entirely Nearctic. None of the analyses show the Nearctic taxa as a single clade or the Palearctic taxa as a single clade.

Discussion

Our analyses provide strong evidence for circumscription of Sympetrum as a monophyletic taxon, although delineating exactly which species should be placed within the genus could require more study. Conforming to the ideal of classification based on robust monophyletic groups, two possible options emerge from our study. Option one would encompass all species currently considered as Sympetrum, plus Nesogonia and all species previously placed in Tarnetrum (e.g., Needham and Fisher 1936; Schmidt 1987; Carle 1993) (Fig. 1). Option two would restrict Sympetrum to the well-supported lineage of S. illotum and its sister clade of five species groups (Fig. 1). This option would preserve Nesogonia and reinstate Tarnetrum as represented by the fonscolombii group. We think it is premature to choose between these options at this time, because we were unable to acquire specimens of the other, rare South American Sympetrum species that could belong to the fonscolombii group. Until such material can be included in a future analysis, we suggest referring to the clade in option one as Sympetrum sensu lato and the clade in option two as Sympetrum sensu stricto. Our results do show, however, that if Tarnetrum is considered a valid genus in the future, S. illotum and S. madidum should not be included because they did not cluster with the fonscolombii species group members formerly assigned to that genus. Our results also show that the subgenus Kalosympetrum (see Table 1) (Carle 1993) does constitute a clade, but is nested within the flaveolum species group, and therefore its utility as a taxon is reduced as it is now a subgenus nested within a species group. Future molecular studies should be combined with more extensive morphological studies to describe synapomorphies (if they exist) to support the choice of option one or two, above.

Lack of resolution and support along the backbone of the tree topologies suggest that that Sympetrum may have undergone a rapid radiation during the divergence of its species groups. All analyses produced topologies with short internode branches with low PP support between highly supported species group; such short internode branch lengths have been suggested as a hallmark of a rapid radiation (e.g., Whitfield and Kjer 2008). An argument could be made that the genetic loci chosen in this work simply lack the information necessary to resolve species group relationships; however, the nuclear and mitochondrial genetic loci used herein produced over 900 variable characters for determining relationships between ∼40 taxa, strongly suggesting that locus choice likely has not hindered generation of a resolved phylogeny. Our divergence time estimates corroborate a period of rapid radiation in Sympetrum, indicating that five of the six species groups arose within a narrow interval of approximately 6 million years, as compared to the 50 million year history of the entire group (see below).

The species groups recovered in our study are supported by various morphological characters. Taxa within each species group have similar penis morphology (Fig. 3), such as a lack of large cornua (horn-like projections on the apical penis segment) in the fonscolombii group, flattened, down-curved cornua in the flaveolum group, and tusk-like, up-curved cornua in the danae and vulgatum groups. The flaveolum group is also united by the shapes of the male hamules and the female vulvar laminae (both are external accessory structures used in species recognition for mating). Outlining morphological characters to define Sympetrum species groups is beyond the scope of this work, but further research could test the use of genitalia characters for placing Sympetrum species not included herein to the proper species group.

Fig. 3
figure 3

Comparison of penis morphology across the Sympetrum species groups. a Danae group, S. uniforme; b flaveolum group, S. rubicundulum; c fonscolombii group, S. corruptum; d infuscatum group, S. risi; e pedemontanum group, S. pedemontanum; f vulgatum group, S. striolatum

Given the unresolved nature of the relationships between most of the Sympetrum species groups, the divergence estimates of the species groups should be considered preliminary. Unfortunately, the fossil record for Sympetrum (Gentilini 1988) is questionable, and the use of a secondary calibration from another study was necessary (Ware et al. 2008). Those caveats being considered, however, the divergences between many of the Sympetrum species groups do coincide with known changes in global climate. Our results suggest five of the six species groups arose during the late Eocene to early Oligocene when the global climate became cooler and drier, and increases in the volume of polar ice caps led to drops in sea level (Sanmartin et al. 2001; Miller et al. 2008). These changes in climate could have fragmented ancestral Sympetrum populations leading to divergence and speciation.

Although this phylogenetic work did not recover highly supported relationships between many of the Sympetrum species groups, we may still draw several conclusions about the historical biogeography of this genus. Neither the Nearctic nor Palearctic Sympetrum species forms exclusive clades, and several of the species groups include taxa from both regions. This implies that both dispersal and vicariance have shaped the biogeographic history of the genus, including several dispersal events between the Nearctic and Palearctic. Our divergence date estimates and tree topology also imply the importance of dispersal, as it places most cladogenesis in the last ∼30 million years, which is well after major vicariance events induced by continental drift. This period in the late Tertiary, however, was marked by continued climate change, appearance, and submergence of Nearctic-Palaearctic land bridges, mid-continental orogeny, and erosion—all contributing to the possibility of multiple disruptions in gene flow, or facilitation of dispersal (Sanmartin et al. 2001; Denk et al. 2010). As the earliest branching lineages in the phylogeny, the fonscolombii group, Nesogonia, and S. illotum could provide insight into the geographic origins for Sympetrum. The current broad geographic ranges encompassed by these lineages (the fonscolombii group extends through North America, Central America, South America, Europe, and Africa; Nesogonia occurs in Hawaii; S. illotum occurs in North and Central America), however, confound our efforts to place a geographic origin for Sympetrum. Their distributions could even suggest a point of origin in the Southern Hemisphere, well outside the current center of diversity for Sympetrum (Asia). Future systematic studies of Sympetrum should attempt to include the other South American species to help address the geographic origin of this genus.