Background

Keel flowers sensu Westerkamp are bilaterally symmetrical (zygomorphic, monosymmetric), pentamerous flowers with three different petal types, with the reproductive organs enclosed by keel petals and generally with connation of floral parts such as stamens and keel petals [1,2,3,4,5,6,7,8,9,10]. Keel flowers are dominant in two species-rich lineages within Fabales, tribe Polygaleae Chodat (Polygalaceae Hoffmanns. & Link) and subfamily Papilionoideae L. (DC.) (Fabaceae Lindl.) [11, 12]. Indeed, this flower type is typical of subfamily Papilionoideae of Fabaceae, and the prevalence of keel flowers in Papilionoideae has prompted some to refer to them as papilionate flowers.

Fabaceae is the third largest angiosperm family with approximately 765 genera and 19,500 species [13,14,15,16]. The keel-flowered subfamily, Papilionoideae, includes almost 72% of Fabaceae species, ca. 14,000 species in 504 genera [8, 14, 15]. Similarly, within Polygalaceae (with approximately 1000 species in 20 genera), tribe Polygaleae holds 80% of the species richness of the family with ca. 800 spp. [1, 5, 17]. In the Fabaceae, keel flowers are also found outside of Papilionoideae, in Cercidoideae, Dialioideae and Caesalpinioideae: Cercis, Poeppigia procera C. Presl and Peltophorum Vogel (Benth.) [18, 19]. Other, unrelated families including species with keel flowers are the Ranunculaceae, Onagraceae, Sapindaceae, Trigoniaceae, Geraniaceae, Tropaeolaceae, Solanaceae, Acanthaceae and Commelinaceae, but they have fewer keel-flowered species than either Fabaceae or Polygalaceae [9].

In general, the evolution of keel flowers within Fabaceae, Polygalaceae and different clades of angiosperms has been attributed to bees [9, 20,21,22], but particularly to skilled and strong bees [9, 18, 20, 21, 23,24,25]. Less commonly, large and brightly coloured flowers may be bird-pollinated (e.g., Mucuna Adans., Erythrina L.), or butterfly-pollinated (e.g., Berlinia grandiflora Vahl Hutch. & Dalziel), and a few keel-flowered species with specific scents are beetle (e.g., Aotus lanigera A. Cunn. ex Benth.) or fly-pollinated (e.g., Apios americana Medik.) [18, 26, 27]. Specific adaptations of keel flowers are associated with bee pollination. For example, hiding pollen inside keel flowers protected by a tripping mechanism may limit pollen loss associated with pollen theft [9, 10, 18, 19, 21, 28]. Furthermore, many Papilionoideae flowers exhibit different primary and secondary pollination mechanisms such as valvular, pump, explosive and brush mechanisms, which also ensure accuracy and efficiency of pollen deposition and so limit the pollen waste ([29] and references therein). In these ways, pollen is hidden in the deepest part of the flower, and pollen is transmitted to, for example, a bee’s head, or above the insertions of the legs or wings, so pollen cannot be easily removed during grooming [9, 10, 30, 31]. This specific position of the pollen (i.e., location of the pollen on different bee species, such as back of the head, under the mandible or on inner side of mandible) is also another precaution against non-pollinator visitors [30]. Thus, the evolution of keel flowers has been referred to as an adaptive response to bees; the keel flowers evolved not just to attract bees but also to protect the pollen from bee robbery [9, 18, 22, 31, 32]. The independent evolution of the keel flower syndrome in the Fabales is likely to have secured pollination and promoted cross-pollination of keel-flowered species [33, 34].

Previously, the morphology and development of keel flowers of two species-rich clades of Fabales, Papilionoideae and Polygaleae, have been compared (e.g., [1, 17, 22, 35]). In both clades, keel flowers are at least superficially similar, they both are 5-cyclic and 5-merous, and consist of three parts, a standard for visual attraction, two wings as a landing platform and a keel to conceal the pollen from pollinators [8, 9, 18, 22, 35]. However, while the functions of these parts are the same in the two groups of Fabales, the developmental origins are different. The standard (flag) consists of a single median petal in Fabaceae but is composed of two lateral sepals in Polygalaceae. The wings are formed by two petals in Papilionoideae but consist of two petaloid lateral sepals in Polygalaceae [22]. One or two fused lower lateral petals serve as the keel in legumes, but the keel comprises one median petal in Polygalaceae, since Polygalaceae keel flowers consist of five petals, only three of which are fully developed, and the abaxial one forming an asymmetric keel [5, 22, 36,37,38]. Keel flowers of Papilionoideae have ten stamen filaments and a single carpel, but Polygalaceae keel flowers have eight filaments and a syncarpous gynoecium which consists of two carpels [22, 35]. Thus, keel flowers in the two families represent a superficial functional and morphological convergence, rather than a homologous similarity [1, 17, 22, 35].

Although the flowers of these two Fabales lineages are not homologous, their striking similarity has led some authors to propose that this shared similarity is more than convergance on a floral syndrome. Bello et al. [17] proposed that the rapid diversification of the tribe Polygaleae, previously documented by Forest et al. [39], may have been prompted “because pollinators of pre-existing papilionoid legume flowers were immediate and effective pollinators of the later-evolving papilionoid flowers of Polygaleae”. Indeed, Bello et al. [17] were not the first to propose such a scenario. Pseudo-papilionoid flowers of Cercis L. (sensu Polhill et al. [19]) have some similarities with Papilionoideae keel flowers such as a bilaterally symmetrical corolla, three different petal types, and enclosed stamens and gynoecium, prompting Tucker [7] to hypothesize a mimicry relationship between Cercis keel flowers and Papilionoideae flowers. Tucker [7] supposed that Cercis likely evolved in an environment where keel flowers were already present. However, whether the later-evolving keel flowers benefitted from pollinators familiar with the keel-flowers of earlier-evolving clades is an open question.

To meet the criteria which evidence floral mimicry, species should share a common pollinator which freely moves between two taxa, share display traits apparent to the pollinators (e.g., colour, UV patterns, nectar, scent and size), there should be a reproductive benefit to one or both species, the species should have areas of sympatry, overlapping flowering phenology, and maybe most importantly the mimic should be more successful in terms of reproduction because of its resemblance to the model, so the mimic receives more visits when it co-occurs with the model [40,41,42,43]. Aside from numerous examples of mimicry involving the Orchidaceae (e.g., [44]), floral mimicry is evidenced for angiosperm and gymnosperm beetle pollinated plants [45] and Turneraceae and Malvaceae flowers [40], with the study of oil-offering plants the first to scrutinise putative floral mimicry in deep evolutionary time [46]. Identifying older and younger oil-offering clades, Renner and Schaefer [46] suggested that ancestral reconstructions were consistent with a gradual niche broadening, but that attributing each later appearance to mimicry would depend on studies of pollinator behavior. For inferences based on phylogenetic reconstructions, in the absence of observations of pollinator behaviour, we therefore refer to findings consistent with a mimicry, but we cannot provide evidence of mimicry from evolutionary reconstructions alone. However, though this was not the case, we could have refuted mimicry if keel flowers of younger clades did not share pollination syndromes or were not sympatric with species in existing clades.

The aim of the current study is to determine whether evolution of the keel flowers of Polygaleae and Papilionoideae are consistent with a mutualistic relationship by characterising the early evolution of the keel flowers of these two species-rich clades. Specifically, we set out to confirm whether keel flowers appeared first in Papilionoideae, whether the keel-flowered Papilionoideae were likely to have been present in the geographical areas where the keel-flowered Polygaleae first appeared, and whether the functional morphology of the keeled Papilionoideae flowers was broadly similar to the morphology of the first evolving Polygaleae flowers. If these criteria are met, though not proving mutualism, this would be consistent with a mutualistic relationship, with Polygaleae flowers benefitting from existing Papilionoideae pollinators due to their resemblance to pre-existing Papilionoideae keel flowers.

Results

Molecular dating analysis

The divergence time analysis (Additional file 1: S1) generated a ((Fabaceae+Polygalaceae) (Surianaceae+Quillajaceae)) topology within monophyletic Fabales (1.00 PP). Within Polygalaceae, tribe Moutabeae was not monophyletic (Fig. 1). However, both tribe Polygaleae and tribe Carpolobieae were strongly monophyletic (0.96 and 1.00 PP, respectively). On the other hand, a (((Papilionoideae+Caesalpinioideae) Dialioideae) (Detarioideae (Duparquetioideae+Cercidoideae))) topology is estimated within Fabaceae. Duparquetia Baill. (Duparquetioideae) was sister to monophyletic Cercidoideae (1.00 PP) with posterior probability of only 0.65. Monophyletic Detarioideae (1.00 PP) was sister to this clade with moderate support (0.82 PP). Monophyletic Caesalpinioideae (1.00 PP) was sister to monophyletic Papilionoideae (1.00 PP) with posterior probability of 1.00, and monophyletic Dialioideae (1.00 PP) was sister to this clade (1.00 PP).

Fig. 1
figure 1

The origins of Papilionoideae clades (Clade 1–8) which evolved during 49–45.16 Ma and the origin of the evolution of keel flowers within Polygalaceae. Posterior probabilities for the key nodes are indicated. Four Fabales families, six Fabaceae subfamilies, Cercis and Xanthophyllum are indicated. Standard error bars were excluded from the figure for clearer presentation. Letters and numbers in red correspond to the calibration points. Scale bar in million years

The crown age of Fabales is estimated to be at least 74.97 Ma (95% HPD 69.3–76.7); the (Surianaceae+Quillajaceae) crown node as 68.62 Ma (95% HPD 50.2–73.9); Surianaceae crown node as 47.59 Ma (95% HPD 33.2–53.1); Fabaceae crown node as 71.89 Ma (95% HPD 67.9–69.3), subfamily Papilionoideae crown node as 67.19 Ma (95% HPD 62.5–64.9); Polygalaceae family crown node as 63.59 Ma (95% HPD 58.2–62.7); and 45.16 Ma (95% HPD 38.8–44.7) for the crown node of tribe Polygaleae. Additionally, the molecular dating analyses yielded a 54.80 Ma (95% HPD 55.1–48.1) crown age for the crown Cercidoideae node, 57.66 Ma (95% HPD 55.6–53.1) for the crown Detarioideae node, 38.28 Ma (95% HPD 44.7–27.1) for the crown Dialioideae node, 64.46 Ma (95% HPD 64.1–58.2) for the crown Caesalpinioideae node, and 69.14 Ma (95% HPD 68.5–63.3) for Duparquetioideae.

The first node with keel flowers in the Polygalaceae was 46.98–45.16 Ma (crown age of tribe Polygaleae). The evolution of keel flowers coincided with the evolution of the keel flower tribe, Polygaleae (99.6%) (Table 1).

Table 1 Results of ancestral area, age, ancestral flower type, ancestral floral symmetry, the presence or absence of a pentamerous corolla (petals+sepals in Polygalaceae), the presence or absence of three distinct petal types (petals+sepals in Polygalaceae), the presence or absence of enclosed reproductive organs, ancestral androecium type, ancestral inflorescence type, ancestral floral size, ancestral height and habit analyses for tribe Polygaleae, and clades 1–8 of Papilionoideae

Based on the reconstructions of the timing of keel flower origin in Polygalaceae (46, 98–45, 16 Ma), we selected the Papilionoideae nodes which were extant in South America at that time to test whether a mimicry between Polygalaceae and Papilionoideae was plausible. To be conservative, we expanded the time frame up to 49 Ma. In this logic, there were 16 clades in total. However, out of these 16 nodes, one originated between ~ 49 and 45 and could not be included here due to lack of information. We also excluded seven clades due to their geographic distribution (not included in Table 1). For the eight remaining relevant clades, the ancestral area and ancestral state for each of the 11 pollination syndrome characters were reconstructed. These clades are shown in Fig. 1, listed in Appendix 2, and the ages of these nodes are reported in Table 1.

Ancestral area analyses

The Lagrange analyses indicated a possible South American origin for the keel flowered Polygalaceae (67% G, 18% EG, 14%E) (Table 1, Additional file 2: S2). For the eight Papilionoideae clades hypothesized, ancestral areas are listed in Table 1.

Additionally, the Lagrange analyses support a South American origin for the subfamily Papilionoideae (100%) and Africa+South American origin for the Fabaceae (75% BG, 14% BEG, 11% G). A South American origin was also suggested by Lagrange analysis for the origin of Fabales with low support (30% G, 29% BEG, 20% BG).

Ancestral state analyses

The ancestral floral type for Polygalaceae was non-keeled (92%), whilst the ancestral flower type of tribe Polygaleae was keeled (99.6%) with three distinct petal+sepal types (89.6%), enclosed reproductive organs (99.9%), fused stamens (98%), probably pentamerous petals+sepals (63%) and bilateral symmetry (79%) (Table 1). The most recent common ancestor (MRCA) of tribe Polygaleae most possibly had vertical inflorescences (e.g., raceme, panicle, spike and thyrse) (81%), and a small habit (e.g., herb, 53%; however, the possibility of a medium-sized habit was also high at 41%). The ancestral floral size of tribe Polygaleae was between 2.9 and 10.5 mm, and ancestral plant height was between 76 cm and about 9.1 m.

There were eight clades of Papilionoideae with ancestral morphologies that might have allowed pollinators to move freely between the flowers of these and tribe Polygaleae flowers. These clades were clades 2–8 (Table 1), based on shared pollination syndrome characters. Where characters were not a perfect match, but differences were not likely to significantly impact pollinators, we considered shared pollinators a possibility. For example, ancestral reconstruction of clade 2 suggested these plants were of similar height to the ancestors of tribe Polygaleae, so despite the different habits we considered it possible that ancestors of this clade shared pollinators with tribe Polygaleae. Similarly, ancestral reconstructions of free stamens in clades 2 and 8 were not considered to have significant impact on pollinator behavior because other ancestral floral characteristics of these two clades match the ancestral flowers of tribe Polygaleae, and keel flowers with free stamens still exist in Papilonideae (e.g., Bolusanthus and Baptisia).

Discussion

To determine whether the first-evolving Polygalaceae keel flowers were functionally similar to existing Papilionoideae keel flowers, and might have evolved in an environment where functionally similar Papilionoideae keel flowers were present, we carried out temporal, spatial and trait analyses.

The divergence-time analysis was congruent with previous studies (e.g., [12, 39]), with short internal branches of Fabales reflecting the rapid radiation already highlighted for this clade [12, 48, 49]. The divergence-time analysis also showed that keel flowers in crown Papilionoideae 67.19 Ma (95% HPD 62.5–64.9) evolved before the origin of Polygalaceae (63.59 Ma, 95% HPD 62.7–58.2 Ma) and tribe Polygaleae (45.16, 95% HPD 44.7–38.8 Ma), even if we accept Xanthophyllum as polymorphic in terms of the presence or absence of keel-flowers [50]. Similarly, even accepting a keel flowered ancestor for Polygalaceae (Moutabea Aubl. excluded), the evolution of keel flowers within crown Papilionoideae was still earlier than in the crown Polygalaceae (56.87 Ma, 95% HPD 62.7–58.2 Ma). We are confident that keel flowers within Papilionoideae evolved many millions of years (22.03–10.32 Ma) before the evolution of keel flowers in Polygalaceae.

Our ancestral area reconstructions show that at the time keel flowers appeared in the Polygaleae, the Papilionoideae was distributed almost globally. There were eight clades in South America at the time that the keel-flowered Polygaleae originated in this continent. Trait analyses show that at least seven of these clades are consistent with a mutualistic relationship. In other words, at least for these seven nodes, the functional morphology of the keeled Papilionoideae flowers was broadly similar to the morphology of the first evolving Polygaleae flowers. Thus, Polygaleae flowers might have benefited from existing Papilionoideae pollinators that were already visitors to Papilionoideae keel flowers.

When pollinators are available, “a plant should specialize on the most effective and/or most abundant pollinator” [51, 52]. In this case, if Papilionoideae keel flowers’ pollinators are readily available, exploratory visits by these effective and/or abundant pollinators to Polygaleae flowers resembling Papilionoideae keel flowers might have been the first stage of this ancient mimicry [46]. In this case, if the Polygaleae keel flowers were rewardless, this could be a type of Batesian mimicry; however, if the Polygaeae keel flowers possessed abundant amount of reward, and this mutualism could be beneficial to both model and the mimic, this could be a Müllerian mimicry. Yet, whether Polygaleae keel flowers were more or less rewarding than the keel flowers of Papilionoideae is an open question. We suppose that Polygalaeae exploited existing Papilionoideae pollinators, but co-flowering might have increased the pollination rate of both Papilionoideae and Polygaleae keel flowers at that time [53]. However, since we are reconstructing a scenario that occured millions of years ago, it is almost impossible to determine the community interactions that occurred among existing plants and their pollinators. Without knowing these community interactions robust evidence for mimicry is elusive, because in a mimicry scenario the model is abundant in the environment, but the mimic is in low densities, while in a convergent evolution scenario both mimic and model are found in similar densities [40].

Pollinator shifts can cause diversification [54, 55], and a shift to long-tongued bees might have been one key innovation associated with the diversification rate shift already documented for the stem lineage of Polygaleae [39]. Indeed, pollinator shifts account for ca. 25% of documented divergence events in the Orchidaceae plus ca. 25% of other angiosperms [54, 56]. Subsequent to the origin of keel flowers, successive ecological opportunities to colonize new habitats [54], expanding the geographic and niche range [57, 58] has likely accompanied the diversification of the keel-flowered lineages. For the subfamily Papilionoideae, on the other hand, radiations in multiple lineages have driven increases in species richness [59]. In the current study, we did not perform any diversification rate analyses for Papilionoideae, and whether keel flowers are key innovations in this subfamily remains an open question.

Complex flowers, such as keel and bilabiate flowers (i.e., dorsiventral blossoms sensu [9, 10], are hypothesized to have evolved to hide protein rich pollen from bees by blocking the entrance of the flower, and at the same time, to ensure pollination by attaching pollen to either the dorsal (bilabiate or lip blossoms) or ventral (most keel blossoms) side of bees [9, 10, 20, 30, 51, 54]. Westerkamp [9] and Westerkamp & Claßen-Bockhoff [10] stated that, in terms of pollination, while the evolution of keel flowers is much more effective, keel and lip flowers are fundamentally the same. While lip flowers evolved in at least 38 angiosperm families (including Fabaceae), keel flowers evolved at least 16 times within 10 different orders independently within angiosperms. However, excepting Trigoniaceae and Fumarioideae, there are fewer species of keel flowers in angiosperm families other than in Fabales [9, 22]. Some genera or species of these other families have flowers with notable morphological similarities to legume keel flowers, such as Pelargonium rapaceum (L.) L’Hér. (Geraniaceae), Calceolaria L. (Lamiales) and Monopsis lutea (L.) Urb. (Campanulaceae) with enclosed reproductive organs and standard(s); Schizanthus Ruiz & Pav. (Solanaceae) with a standard, wings and keel petals [7, 9, 18, 19, 60]. Collinsia Nutt. (Lamiales) with a standard, wings, a keel and hidden reproductive organs, has greatest similarities with Papilionoideae and Polygaleae keel flowers [9, 60,61,62,63]. In Aconitum L., pollen is also deposited on the ventral side of the pollinator during pollination. In Corydalis cava (L.) Schweigg. & Körte on the other hand, lateral petals enclose the reproductive organs to hide pollen from bumblebees, as in Papilionoideae keel flowers [28]. Both Hyptis Jacq. (Lamiaceae) and Levenhookia R.Br. (Stylidiaceae) have spring-loaded keels which cause explosive release of fertile parts when triggered by pollinators.

The similarities of these keel flowers have been attributed to convergence due to bee specialization, and therefore, pollinator pressure [9, 54, 64, 65]. In this case, not only the morphological characters of keel flowers but also the choices of pollinators (e.g., exact location of pollen on bees’ body where it cannot be removed easily, enclosed reproductive organs which supports the minimum loss of pollen by keeping it away from non-functional visitors, visual clues such as bilateral symmetry, a standard for visual attraction, a landing platform and a standard for visual impact) might have canalized the convergent evolution of keel flowers within unrelated angiosperm clades. Here, for example, keel flowered Collinsia heterophylla Graham (Lamiales), Aconitum napellus subsp. lusitanicum Rouy (Ranunculales) and Corydalis cava (Ranunculales) are also pollinated by similar pollinators as the keel flowers of Papilionoideae [60,61,62, 66,67,68,69]. Keel flowers might have evolved to host these efficient pollinators (strong bees or long-tongued bees of Apidae and Megachilidae families) [63, 67, 70]. Indeed, Ruxton and Schaefer [71] suggested, instead of mimicry, that convergent evolution (i.e., pollination syndromes sensu Faegri andVan der Pijl [25]) driven by shared pollinators could be a more plausible explanation not only for the evolution of keel blossoms within unrelated lineages of angiosperms [9, 58, 62, 72,73,74] but also for most supposed plant mimics. Moreover, since the hymenopterans were already highly diversified at that time, a competition free space with similar morphologies would have helped Polygalaceae keel flowers to benefit this new area and radiate ([54, 75], Anonymous Reviewer 2, pers. comm.). For instance, for the orchid genus Ophrys L., it was suggested that diversification of the genus began after the well establishment of hymenopterans and because of the coevolution of plants and their pollinators, adaptive radiation has caused a species burst in the genus [75]. Furthermore, when two species share the same pollinator with the same flowering time and distribution, not only convergence of some floral traits (e.g., shape and color) to attract the same suit of available pollinators, but also divergence in others (i.e., different pollination niches) (e.g., odor) to prevent hybridization is common [75, 76]. Indeed, this floral adaptation under pollinator pressure (i.e., sharing the same suits of pollinators, 10) might have caused tribe Polygaleae to modulate its pollinator niche (Anonymous Reviewer 2, pers. comm.), and evolution of a similar floral bauplan with the Papilionoideae keel flowers. Therefore, in order to better understand keel flower evolution, it is necessary to conduct detailed comparative studies on keel flowers and their pollinators. For instance, while tripping mechanisms are reported for keel-flowered Polygalaceae [77], for other keel-flowered lineages among angiosperms the situation is unknown. Similarly, choice tests of keel flower pollinators, especially with naïve bees may reveal whether these pollinators move freely between different angiosperm keel flowers or not, and whether mimics (i.e., Polygalaceae keel flowers) receive more visits when they co-occur with the model (i.e., Fabaceae keel flowers), or mimics are more successful in terms of reproductive biology with the presence of legume keel flowers or not [40].

Conclusions

To determine whether the first-evolving Polygalaceae keel flowers were functionally similar to existing Papilionoideae keel flowers and pollinated by similar pollinators, we carried out molecular dating, ancestral area and ancestral state analyses. The current study is the first to use ancestral reconstructions of traits associated with pollination to demonstrate that the multiple evolutionary origins of the keel flower pollinator syndrome in Fabales are consistent with, though do not prove, mimicry. Our results have shown that Polygaleae flowers might have benefited from existing Papilionoideae pollinators that were already visitors to Papilionoideae keel flowers. However, further research is needed to establish a better understanding of the context of the pollinators of keel flowers of different angiosperm clades. Whether other angiosperm clades that have keel flowers might have benefited from existing pollinators of keel-flowered Fabales, or vice-versa, could be addressed using the approaches we present here and a more inclusive phylogeny. Better understanding of the pollination biology of keeled and non-keeled genera of both Fabaceae and Polygalaceae would also be informative. In the field, research to better understand the pollination biology could include investigations to determine whether (1) there is any interspecific facilitation or competition (or both) between co-existing keel flowers [53]; (2) there are phenological differences or pollinator choice differences among co-existing keel flowers [53, 55, 78, 79]; (3) detailed measurements of floral parts (e.g., keel/flag length, colour, minor floral shape differences) are suggestive of shared pollination niche [75]; (4) there are shared main and secondary pollinators; (5) there are different body positions of the pollinators during pollen removal and pollination; (6) there is any evidence of adaptation of mechanical parts of the flowers [75]; (7) population sizes and plant densities of the keel flowers effect pollination success [75]; (8) rewards are [54] offered by sympatric keel-flowered species and (9) whether keel-flowered species share odours. On the other hand, future studies should also focus on the nectar-free Fabales keel flowers, particularly whether these species have more colorful and showy flowers, different odor emissions, high specialization rates, lower levels of pollination and/or pollination with mostly naïve bees, smaller geographical ranges and whether these taxa might have undergone a more rapid and extensive radiation compared to others, as in the case of orchids ([76]; Anonymous Reviewer 2, pers. comm.). Ultimately however, even if the pollination biology of extant species was suggestive of mimicry and a phylogenetic study supported it, an ancient mimicry scenario cannot be as robustly tested as a contemporary mimicry.

Methods

Taxon sampling and sequence data

Our phylogeny was reconstructed using published matK, rbcL and trnL plastid gene regions for 678 taxa, with 43 Fabidae outgroups. We reconstructed our phylogeny from the most widely available sequence data for all families of the Fabales. Whilst recent studies have employed considerable more sequence data (e.g., [80,81,82]), these data are not presently available across the Fabales. The monophylly of Fabales has been strongly supported; yet, both the fossil record and molecular studies (e.g., [12, 17, 48, 49]) suggest a rapid radiation for the order, which causes unstable phylogenetic relationships among the four families, namely, change in the topology and the root of the order by the choice of genes, outgroups and phylogenetic methods. Similarly, the evolutionary origin of the six subfamilies of Fabaceae has been reported as near simultaneous [81]. While the current study exceeds the taxon sampling of Bello et al. [12, 17], Aygoren Uluer et al. [49] and Koenen et al. [81, 82], it is the same as Aygoren Uluer et al. [48]; however, none of these studies were able to support a robust topology for the order. Furthermore, Koenen et al.’s [81] recently published phylogeny of the early evolving Leguminosae based on 1103 nuclear orthologues included only two Polygalaceae and one Surianaceae taxa, and therefore, it would not be possible to explore our molecular dating and ancestral reconstructions on their trees. Yet, it is encouraging that the previous attempts, particularly Koenen et al.’s [81] reconstructions in earliest evolving Papillionoideae are congruent to the current one. Furthermore, while we acknowledge that more data might indeed yield a higher resolution and support in the future; however, we believe that by using sets of Bayesian trees, we did not introduce spurious accuracy where data are not decisive.

The dataset contained the matK, trnL and rbcL plastid gene regions for 678 taxa (615 Fabaceae, 14 Polygalaceae, five Surianaceae and the sole genus of Quillajaceae, Quillaja). The GenBank numbers for these taxa are provided in (Appendix 1). Our sampling strategy was designed to include one species from each Fabales genera and Fabidae families as outgroups (43 outgroup taxa from Celastrales, Cucurbitales, Fagales, Malpighiales, Oxalidales, Rosales, Zygophyllales). Our sampling corresponds to 80% of Fabaceae genera (3% of species number) and 70% of Polygalaceae genera (1.4% of species number).

Alignment, phylogenetic analyses

We assembled, trimmed, and aligned the sequences by using Geneious Pro 4.8.4 [83]. All indels were treated as missing data in all analyses.

The data matrix consisted of 3894 characters from 43 outgroup and 635 ingroup taxa, and 2445 (63%) characters were variable. Of these, 477 (12%) characters were parsimony uninformative, while 1968 (51%) characters were parsimony informative.

Maximum likelihood (ML) analyses were performed with RAxML [84], under a gamma model of heterogeneity with 1000 bootstrap replicates and with defined outgroup taxa and partitions for each data set and gene.

Model choice and molecular dating analyses

For divergence time estimates we used BEAST v.1.8.0 [85]. The alignment was imported in BEAUti v.1.8.0 to [86] generate BEAST input files. BEAST was allowed to perform 2 × 107.

MCMC generations, sampling every 1000th generation. We used a Yule process with a randomly generated starting tree and a lognormal relaxed model (uncorrelated) [87]. By using jModelTest2.1.10 [88, 89], for each of the genes, the most appropriate model was selected as GTR+G+I. Our study uses 30 fossil (24 ingroup and 6 outgroup) calibrations (Table 2). Other than the two relatively recent fossils, these fossil calibrations were taken from Lavin, Herendeen & Wojciechowski [90], Bruneau et al. [91] and Simon et al. [92]. Furthermore, six outgroup fossils were adopted for the first time in the context of a Fabales phylogeny.

Table 2 Fossils used to calibrate the Fabales tree

The two new fossils used in this study are: (1) fossil leaves and fruits of Cercis from Oregon, USA with ~ 36 Ma age [93] (calibration point C). We did not perefer to use the 34 Ma old Cercis fossil [106] which was used by Lavin, Herendeen & Wojciechowski [90], Bruneau et al. [91] and Simon et al. [92], instead we used this recently described fossil in the current study because it represents the oldest fossil record of Cercis (Herendeen, pers. comm.). The 34 Ma old Cercis fossil was attributed to C. herbmeyeri Jia & Manchester, based on the “presence of indehiscent pods with a wing like flange along one margin” [106]. The fossil specimen was collected by S. R. Manchester and students from Teater Road, Crook County, Oregon in the 1980s, and was dated radiometrically to ~ 36 Ma with the help of an age for a nearby location, namely White Ash of Post tuff [106]. (2) fossil seeds of Newtonia from Ethiopia [114]. This earliest fossil record for the genus dates to 22–21 Ma, and used in the current study for the calibration point C (Herendeen, pers. comm.). A fossil of Newtonia mushensis Pan, Currano, Jacobs, Feseha, Tabor et Herendeen is the earliest and only definitive Newtonia fossil, was collected from The Mush Valley deposits, and dated by U-Pb radioisotope dating Method [114]. This new taxon is distinguished by seed size and seed characteristics (e.g., “flat, elongate tapering seeds bearing a membranous wing the degree of curvature near the funiculus attachment”) [114]. We also preferred to include Lavin, Herendeen & Wojciechowski’s [90] 60–70 Ma legume stem node constraint to the molecular clock rooting analysis (i.e., uniform prior distribution) for two reasons: (1) Lyson et al. [121] indicated that the Fabaceae oldest fossil record corresponds to 65.35 Ma, and this age is within the range we used to constrain the 60–70 Ma legume stem node, and (2) any convincing Fabaceae fossils prior to ca. 58 Ma are lacking [59, 90, 92, 101, 122]. To accommodate for gaps in the fossil record and uncertainity in fossil age estimates [105, 123], other than the 60–70 Ma legume stem node constraint [90], we used lognormal prior distribution with minimum age constraints. No fossils from Polygalaceae (e.g., [124,125,126,127]) and Surianaceae (e.g., [128, 129]) were used due to their unconfirmed status [12, 39, 90].

All BEAST analyses were implemented online via the CIPRES Portal [130]. Two independent runs were combined using LogCombiner v.1.8.0 [131]. Tracer v.1.6 [132] was used to check for proper mixing and convergence. TreeAnnotator v1.8.0 [133] was used to elect the maximum clade credibility trees. To annotate the tree, Interactive Tree of Life (iTOL) [134] was used.

Ancestral area reconstruction

Eight biogeographic regions were defined according to Buerki et al. [47], with one addition. These areas were: A: Eurasia; B: Africa; C: Madagascar; D: Southeast Asia including Pacific Islands; E: Australia including New Guinea, New Caledonia and New Zealand; F: North America; G: South America including Central America and H: the Indian subcontinent. Here, area H includes India, Pakistan, Sri Lanka and Bangladesh, but not Nepal. The Indian subcontinent was treated as a separate area (H) in this study due to its recent (ca. 55 Ma) collision with the Eurasian plate [135]. The crown age of Fabales was estimated to be 84 Ma (maximum) by Bello et al. [12]. Therefore, taking the Indian subcontinent as a separate geographic area would be appropriate.

Geographic information for legume taxa was obtained from Legumes of the World [14]. For non-legume taxa, biogeographic information was obtained from published sources. Geography was scored at generic level for both legume and non-legume taxa, rather than for the species that were actually sampled. However, for the purpose of consistency, some clarifications are needed. First, the regions or countries covering more than one of the geographic regions were coded as: Mexico: F+G; Himalayas: A+H; Pakistan: A+H; pantropical, tropics or circumtropical: all areas from A to H; neotropical: F+G; paleotropical: A+B+C+D+E+H; subtropics: A+B+E+F+G; Southeast Asia: A+D; Asia: A+D+H. Second, when literature referred to a centre of diversity, this was taken as the distribution area. For instance, for a genus with a centre of diversity containing 30 spp. in North America, and one species with pantropical distribution, North America was accepted as the distribution area.

Biogeographical analyses were performed on our Bayesian tree with 678 taxa. The Lagrange (Dispersal-Extinction-Cladogenesis, DEC model; [136, 137]) option of RASP v.4.2 (Reconstruct Ancestral State in Phylogenies; [138]), was employed with default settings to calculate probabilities of the most likely ancestral areas for each clade (Additional file 3: S3), except the “maximum number of areas” option, which was set to 2 (the minimum), 4 and 6 to compare the results. A larger value (8, the maximum number of areas) was not tested; we specified the maximum number of areas as 4 since varying numbers of areas only slightly modified probabilities. The notable changes were for Fabaceae: the ancestral origin was 52% G or 49% BG if the maximum number of areas was set to 2, but 67–74% B+G or 33–26% G if the maximum number of areas was set to 4 and 6.

We applied a stratified biogeographical model by dividing our model into four time slices: before 80 Ma, between 80 and 65 Ma, between 65 and 30 Ma, and 30 Ma to the present day [47]. We also applied Buerki et al.’s [47] Q matrix in our ancestral area analysis, in which transition rates were dependent on the geographic location of areas.

Ancestral trait reconstructions

Aygoren Uluer’s [70] review shows that the actual pollinators (rather than visitors) are known for only 33 keel flowered species of Fabales. The present study uses trait data from literature review of hundreds of published papers, and all available Floras score the traits that can be used to infer pollination syndrome. Many floral traits contribute to pollinator attraction. These include flower type, floral (corolla) symmetry, fusion of floral parts, flower size, length of nectar tube, inflorescence size, number of flowers in an inflorescence, inflorescence type, flower colour, floral reflectance, habit, height, height of flowers from ground, phenology, and floral scent [22, 28, 56, 139,140,141,142,143,144,145]. Having a bilateral symmetry, a pentamerous corolla with three different petal types, with the reproductive organs enclosed by keel petals and generally with connation of floral parts such as stamens and keel petals are also the essential characters of keel flowers [1,2,3,4,5,6,7,8,9,10]. In the current study, eleven morphological traits were selected as potentially the most important from the point of view of a pollinator (explained in detail below) and traced: floral type (keeled or not), presence or absence of a pentamerous corolla (petals+sepals in Polygalaceae), presence or absence of three distinct petal types (petals+sepals in Polygalaceae), presence or absence of enclosed reproductive organs, floral symmetry, androecium type, inflorescence type, inflorescence size, flower size, height and habit (Table 3).

Table 3 Explanation of eight pollination syndrome characters coded as A, B and/or C

1.a. The first character scored flowers as keeled or not keeled (Table 3, Additional file 4: S4). For this ancestral floral type analyses, we coded two states: A = keeled and B = non-keeled. In this case, Bello et al. [1] did not accept Xanthophyllum Roxb. flowers as keeled, but Van der Meijden [50] reported that some Xanthophyllum species may have keel flowers. On the other hand, Breteler and Smissaert-Houwing [146] reported that both Carpolobia G. Don and Atroxima Stapf have a keel petal which encloses the style and the stamen sheath similar to the Papilionoideae keel flowers. Unlike Van der Meijden [50], these authors avoided using the term papilionate flowers. Therefore, following Bello et al. [1], we did not accept Carpolobia and Atroxima as keel-flowered. In contrast, we coded Xanthophyllum as polymorphic for the character of being keel flowered or not.

The keel flower trait is known a priori to have more than one origin, and the purpose of our analyses are to highlight the recurring origin of the trait in a much more transparent and explicit way. We note however that non-keel flowers within Papilionoideae are not homologous, and referring to all of them as non-keeled may mislead analyses seeking to understand the transitions to keel morphologies [4]. With this in mind, we also divided the keel-flower trait into five further sub-traits which are: presence or absence of a pentamerous corolla (pentamerous petals+sepals in Polygalaceae), presence or absence of three types of petals (or petals+sepals in Polygalaceae), presence or absence of enclosed reproductive organs, presence or absence of a bilateral symmetry and androecium type [7, 9, 147] (Table 3, Additional files 5: S5, 6: S6, 7: S7, 8: S8, 9: S9).

1.b. For the presence or absence of a pentamerous corolla (pentamerous petals+sepals in Polygalaceae) analyses, we coded two states: A = YES and B = NO (Table 3).

1.c. For the presence or absence of three types of petals (or petals+sepals in Polygalaceae), analyses, we coded three states: A = YES, B = NO and C = the presence or absence of only two types of petals (or petals+sepals in Polygalaceae) (Table 3).

1.d. For the presence or absence of enclosed reproductive organs analyses, we coded two states: A = YES and B = NO (Table 3).

1.e. For the floral symmetry analyses, we coded two states: A = radial symmetry (including slightly bilateral symmetry) and B = bilateral symmetry (Table 3).

1.f. For the androecium type analyses, we coded three states: A = free stamens, B = fused stamens and C = polymorphic (Table 3). In this case, if the stamens are united at the base, we accepted this as free, due to a possible visual impact for pollinators. Although we code fusion of the androecium, we think that fusion of the petals is particularly common among later-diverging Papilionoideae tribes [6, 7, 9, 147]. Therefore, we did not include this character to our ancestral state analyses.

2. Inflorescence architecture is another important factor that affects pollinator visitation [56, 140, 148]. For example, two-dimensional inflorescences receive more hummingbird visits than three-dimensional ones [149]. Likewise, for vertical inflorescences, such as racemes, pollinators generally move from the bottom upwards (i.e., from oldest to youngest flowers) and starts to forage at the next inflorescence in this exact way [148, 150, 151]. However, it is not possible to find the same pattern on horizontal inflorescences such as umbels or heads. For these reasons, it is possible that for a pollinator, the visual impact of the inflorescence may be more important than the type of inflorescence, with convergent evolution on function between different inflorescence types. In other words, there might not be much visual difference between a panicle and a raceme in terms of what a bee sees. For these reasons, inflorescence morphology was coded as: A = Sequenced inflorescences (i.e., vertical inflorescences): raceme, panicle, spike and thyrse; B = Cluster type inflorescences (i.e., horizontal inflorescences): umbel, cyme, corymb, head, spike and fascicle; C = Solitary flowers (Table 3, Additional file 10: S10). It might have been informative to break this character down to overall shape of inflorescence, perhaps as width to length ratio, but this information was not available for all taxa.

3. Similarly, by effecting the foraging time of bees, not only the number of flowers in an inflorescence [152], but also the inflorescence size [140, 153] are other important criteria for pollinator attraction. While we could not obtain sufficient information about the number of flowers per inflorescence, we traced ancestral inflorescence sizes of first keel-flowered lineages of both Fabaceae and Polygalaceae. Both floral size and inflorescence size were scored in millimetres, and we considered both the smallest and the largest reported sizes for all taxa (Additional files 11: S11 and 12: S12, respectively).

4. Flower size is frequently reported to be an important part of pollinator attraction and therefore pollinator visitation [40, 145, 154,155,156]. The correlation between flower size and pollinator size (e.g., [157]), flower size and pollinator visitation rate (e.g., [158]), flower size and searching time (e.g., [145]) were shown by several studies; however, the results are almost always case dependent. Furthermore, while Fabaceae contains great diversity in flower size, from species with tiny flowers only a few millimetres in length (e.g. Trifolium L.) to giants (e.g., Erythrina L.), in Polygalaceae small flowers predominate. Therefore, we find it necessary to investigate ancestral floral sizes of first keel-flowered lineages of both Fabaceae and Polygalaceae.

5. Since pollinators tend to forage at constant heights to decrease flight distances [159,160,161,162], another important criterion for the study was height of plants and inflorescences from ground. However, since it is not always possible to find information about the inflorescence height from ground, approximations were made based on habit and height of plants, since both are frequently reported in the literature. We coded height of plants in centimetres, and we considered both the smallest and the largest reported sizes for all taxa (Additional files 13: S13 and 14: S14, respectively).

6. For the habit analyses, we coded three states: A = Tall plants: Tree, climber, liana, vine, scrambler; B = Medium plants: shrub, subshrub; C = Small plants: herb (Table 3, Additional file 1: S15).

Unfortunately, some traits could not be included in the current study due to the scarcity of information. These included UV reflectance (e.g., FReD: the floral reflectance database, [166]), presence or absence of pollen, presence or absence of secondary pollen presentation, presence or absence of nectar, length of nectar tube, height of flowers from ground and number of flowers in an inflorescence. Tracing others, such as phenology and floral colour was not possible due since we scored traits at generic level. Mapping global phenological data of a legume genus resulted a year-long flowering season for that taxon. Similarly, tracing the colour trait resulted as “all colours” for a genus, due to the occurrence of many differently coloured flowers of a large genus around the world. The 11 characters were scored for 635 taxa (excluding outgroups) in the four Fabales families. Data for these morphological traits were gathered from hundreds of appropriate, previously published sources including floras, articles and online sources (Additional file 1: S16). Our data are presented in the same linear order as the phylogenetic classification of Lewis [14], Gagnon et al. [163] and LPWG [15]. For all analyses, missing data were coded as "-".

In our analyses, we did not score geography and morphology at the species level, because our aim was to reflect the diversity within each genus, not each species. For example, while the flower size ranges from 3 mm to 2.5 cm in genus Polygala (e.g., P. triflora vs. P. karensium), in terms of pollination, scoring Polygala flowers as 3 mm or 25 mm would not be meaningful. The same logic applies here for at least floral type, habit, height, inflorescence size and inflorescence type analyses. Therefore, we think that scoring both the geography and morphology at the genus level is more appropriate in our situation.

Excluding outgroups which were scored as missing data, the amount of missing data for the flower type and ancestral area analyses was 0%, 0.5% for the presence or absence of three distinct petal types (petals+sepals in Polygalaceae) analysis, 2.8% for the presence or absence of enclosed reproductive organs analysis, 1.1% for the floral symmetry analysis, 8.7% for the androecium type analysis, 3.78% for the presence or absence of a pentamerous corolla (petals+sepals in Polygalaceae) analysis, 2.99% for the inflorescence type analysis, 15.9% for both the smallest and the largest flower size analyses, 16.38% for the smallest height analysis, 7.87% for the largest height analysis and 1.4% for the habit analysis. Unfortunately, the amount of missing data for both the smallest (46.6%) and the largest (42.2%) inflorescence size analyses was very high, due to the scarcity of information. For this reason, we could not estimate ancestral inflorescence sizes for most of the clades and ultimately we excluded ancestral inflorescence size analyses from our study.

To account for phylogenetic uncertainty, ancestral state reconstructions were performed on a sample of bootstrap trees with branch lengths. Since the phylogenetic relationships of the early-branching Papilionoideae are better resolved in our ML tree(s), we preferred to use the population of ML trees over our Bayesian tree. The program BayesTraits v2.0 [164, 165] was used for Bayesian estimation of ancestral states. For the “MultiState” model, MCMC analyses were run for 2 × 106 generations, with default settings except the ratedev (rate deviation) and rjhp exp (RevJump) parameters and burn-in (the first 200,000 iterations). The “Continuous Random Walk” analyses were run with default settings, except the ratedev parameter. For the flower type analysis, we also conducted additional MCMC analyses for several Papilionoideae nodes in order to pinpoint the origin of keel flowers within the subfamily.