Abstract
Brassicales comprise 17 families, c. 400 genera and more than 4600 species. Despite the mustard family (crucifers, Brassicaceae) continuing to be the subject of intensive research, the remaining 16 families are largely under studied. Here I summarize the available data on chromosome number and genome size variation across Brassicales in the context of a robust phylogenetic framework. This analysis has revealed extensive knowledge gaps in karyological data for non-crucifer and species-rich families in particular (i.e., Capparaceae, Cleomaceae, Resedaceae and Tropaeolaceae). A parsimonious interpretation of the combined chromosomal and phylogenetic data set suggests that the ancestral pre-Brassicales genome had 9 or 14 chromosome pairs, later multiplied by the At-β (beta) whole-genome duplication (WGD) to n = 18 or 28. This WGD was followed by post-polyploid diploidization marked by diversification to 12 or 13 families and independent decreases in chromosome numbers. Family-specific WGDs are proposed to precede the diversification of Capparaceae, Resedaceae and Tropaeolaceae.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
The Brassicales order, despite its name referring to crucifer species (Brassicaceae) primarily dominating the northern Hemisphere, is an eco-geographically and morphologically diverse clade spread over all continents except Antarctica. Brassicales comprise 17 families and some 400 genera and c. 4700 species (Christenhusz and Byng 2016; Cardinal-McTeague et al. 2016; Edger et al. 2018). Due to the pivotal role of Arabidopsis thaliana and other Brassicaceae species as model organisms in various research disciplines, Brassicales made their way into contemporary textbooks. The diversity and importance of Brassicaceae, the largest Brassicales family comprising of 3997 species in 341 genera (BrassiBase accessed on January 18, 2018), over shadow the remaining sixteen families. The exceptions are the papaya tree (Carica papaya, Caricaceae), commonly grown for its fleshy berries in tropical and subtropical regions of the world; the caper bush (Capparis spinosa, Capparaceae), cultivated for edible flower buds (capers) and fruits (caper berries); and the lesser-known but multi-purpose drumstick tree or moringa (Moringa oleifera, Moringaceae).
The recently published phylogenetic analyses of Brassicales (Cardinal-McTeague et al. 2016; Rockinger et al. 2016; Edger et al. 2018) show a renewed interest in evolution of this group. The robust phylogenetic frameworks permit the reinterpretation of existing data sets, identification of knowledge gaps and formulation of new hypotheses. Here I am reassessing what is currently known about chromosome number and genome size variation across Brassicales and how these data can be used to reconstruct chromosomal and genome evolution patterns in this order. Furthermore, three family-specific whole-genome duplication (WGD) events are inferred, and chromosome numbers of an ancestral pre-Brassicales genome and the paleotetraploid genome shared by the core Brassicales are discussed.
Chromosome number and genome size data: knowledge gaps
Chromosome numbers for Brassicaceae species are continuously updated in BrassiBase (Kiefer et al. 2014; Koch et al. 2018), and several new counts for Caricaceae were published by Rockinger et al. (2016). The Chromosome Counts Database (CCDB, Rice et al. 2015) represents the most comprehensive chromosome number data resource for Brassicales species, and CCDB was explored to draw the conclusions detailed on the following lines (see also Online Resource 1 and Fig. 1).
Chromosome number data are lacking for three monospecific Brassicales families, namely Emblingiaceae, Pentadiplandraceae and Setchellanthaceae. Chromosome numbers are known for only one of the two species in four bispecific families—Akaniaceae, Bataceae, Koeberliniaceae and Tovariaceae. Chromosome number variation is relatively well characterized in smaller Brassicales families, such as Caricaceae (all 6 genera, c. 21 out of 35 species), Gyrostemonaceae (3 out of 4 genera, 3 out of 20 spp.), Limnanthaceae (all 8 spp.), Moringaceae (5 out of 13 spp.) and Salvadoraceae (2 out of 3 genera, 4 out of 11 spp.).
Among the species-rich families, Brassicaceae is the best researched Brassicales family. Warwick and Al-Shehbaz (2006) reported chromosome counts for 232 out of 338 genera (69%) and for 1558 out of 3709 species (42%). Since 2006, the number of known chromosome counts for Brassicaceae species has further increased (cf. BrassiBase and CCDB). On the other hand is Tropaeolaceae, with counts for only eight species (8% of species); Capparaceae, with records for 38 species (12% of species) in 13 out of 30 genera; Cleomaceae, with 49 (14%) species counted; and Resedaceae, with chromosome number counts for c. 39 species in five genera (36% of species).
At least one genome size record is available for ten out of 17 Brassicales families (Online Resource 1 and Fig. 1). Not surprisingly, Brassicaceae is the family with the most extensive genome size variation knowledge. Genome sizes in other species-rich families remain largely unexplored. Kew’s Plant DNA C-values database (Bennett and Leitch 2012, accessed on January 20, 2018) contains only a single C-value for Cleomaceae and Tropaeolaceae, and only two entries for Resedaceae. Not a single value is available for Capparaceae, harboring more than 300 species.
This statistics summary reveals that the least known are chromosome numbers for three monospecific families (i.e., Emblingiaceae, Pentadiplandraceae and Setchellanthaceae) and that four species-rich Brassicales families (i.e., Capparaceae, Cleomaceae, Resedaceae and Tropaeolaceae) remain largely understudied. The knowledge gap on genome size variation in Brassicales is even more evident, with genome sizes either unknown or reported only for c. 1% of species in all families, except Brassicaceae and Caricaceae. To expedite and navigate future comparative phylogenomic studies in Brassicales, a collaborative effort should be undertaken to compile basic karyological information for most Brassicales taxa.
Extreme chromosome numbers and genome sizes in Brassicales
The lowest haploid chromosome number known for angiosperms is n = 2, found so far only in about five species (Cremonini 2005). Across Brassicales, the lowest known chromosome number is n = 4, known only in a few Brassicaceae species (e.g., Physaria bellii, Stenopetalum nutans). The second lowest chromosome count, n = 5, has been detected in only several Brassicaceae species (e.g., Arabidopsis thaliana, Physaria spp. and Stenopetalum lineare), and in all eight Limnanthaceae species. All n = 4 and n = 5 genomes resulted from descending dysploidies, i.e., reductions in chromosome number, following one or more WGD events postdating the ancient whole-genome triplication (WGT) At-γ (gamma) shared by all eurosids and for the first time detected in grape (Jaillon et al. 2007). The highest chromosome counts in some North American Cardamine species (Brassicaceae) resulted from more recent polyploidization cycles (2n = ±240 in C. concatenata, 2n = c. 256 in C. diphylla).
Genome sizes across Brassicaceae and also Brassicales varies 32-fold. The lowest genome size was reliably reported for Arabidopsis thaliana (157 Mb), whereas the highest DNA content was estimated in the polyploid Crambe cordifolia (4630 Mb) with c. 60 chromosome pairs. Genome sizes of diploid-like species in non-crucifer Brassicales families tend to be small to medium, with the exception of Limnanthes douglasii (Limnanthaceae) having its 1360-Mb genome divided among only five chromosomes (Fig. 2).
Ancient WGDs in Brassicales
The first ancient WGD in Brassicales was detected during the Arabidopsis genome sequencing and assembly, when several genome regions were found to be duplicated on the same or different chromosomes (Arabidopsis Genome Initiative 2000). This paleotetraploid event was referred to as 3R or α (alpha), or alternatively as At-α duplication (e.g., Bowers et al. 2003; Maere et al. 2005; Barker et al. 2009). The At-α is shared by all Brassicaceae tribes including Aethionemeae (Schranz et al. 2012), the tribe sister to all other Brassicaceae tribes. Similarly, Cleomaceae, the second largest Brassicales family (Christenhusz and Byng 2016), experienced a WGT referred to as Cs-α or Th-α duplication (Schranz and Mitchell-Olds 2006; Cheng et al. 2013). Both Brassicaceae and Cleomaceae, but not Caricaceae, share an older WGD, called At-β (beta) (Ming et al. 2008; Barker et al. 2009). This pattern suggests that the At-β must have postdated the divergence of core Brassicales families from Caricaceae and the other three early diverging families (i.e., Akaniaceae, Moringaceae and Tropaeolaceae), which experienced only the At-γ WGT. However, precise phylogenetic placement of the At-β duplication remains unclear. Most recently, Edger et al. (2018) placed the WGD either before or after the split between Setchellanthaceae and core Brassicales (Fig. 1).
Based on the association between family-specific WGDs, post-polyploid diploidization and increased species and genus diversity observed in Brassicaceae and Cleomaceae, as well as in other angiosperm families (e.g., Schranz et al. 2012; Huang et al. 2016; Mandáková et al. 2017; Mandáková and Lysak 2018), I propose that the large Brassicales families—Capparaceae, Resedaceae and Tropaeolaceae—have each experienced their own WGD predating their diversification (Fig. 1). Future sequencing efforts will help to elucidate this assumption.
Evolution of ancestral genomes and chromosome numbers in Brassicales
From a phylogenetic perspective, a hypothetical ancestral Brassicales genome was placed between two modern species whose genomes have been sequenced—Theobroma cacao (cocoa, Malvales) and Carica papaya (papaya). As a gap of millions of years and many as yet unsequenced genomes lay between those two genomes, here we can only speculate on the number of linkage groups and the structure of the ancestral Brassicales genome.
Both Malvales and Brassicales are descendants of the paleohexaploid At-γ genome common to all eudicots and supposedly having 21 chromosome pairs (n = 21; resulting from a triplication of an ancestral n = 7 genome; Salse 2016). The extant chromosome number of cocoa (n = 10) most likely originated from descending dysploidy from the n = 21 genome (Argout et al. 2010). As papaya has a similarly low chromosome number (n = 9) to cocoa and n = 9 was also inferred as the most probable ancestral chromosome number for the entire Caricaceae (Rockinger et al. 2016), an ancestral genome with nine linkage groups can be proposed for the early diverging Brassicales clades. This is further supported by n = 9 reported for Akania and Bretschneidera, two genera of Akaniaceae, which together with Tropaeolaceae, are sister to all the remaining Brassicales families (Fig. 1).
However, the idea of an ancestral genome with n = 9 is weakened by the extant karyological variation in families sister to Akaniaceae and Caricaceae. Whereas the species richness and chromosome number variation across Tropaeolaceae (n = 12–14, up to n = 21, plus some higher counts) is presumably a consequence of post-polyploid diversification following a family-specific WGD, the origin of chromosome numbers in Moringaceae is more puzzling. All Moringaceae species chromosome numbers [n = 14 (13)] are higher than n = 9. Recently, Tian et al. (2015) analyzed a whole-genome sequence of Moringa oleifera, compared it with the papaya genome and showed that Moringaceae did not experience a family-specific genome duplication. This is supported by the low species diversity of this monogeneric family. The absence of a WGD could suggest that the extant chromosome numbers in Moringaceae (n = 14) were derived directly from the ancestral Brassicales genome, either through stasis (n = 14) or by chromosome fissions (n = 9 → n = 14).
To sum up, current phylogenomic data do not allow us to decide on the structure of the ancestral Brassicales genome(s). Two alternative scenarios should be tested: (1) the proto-Brassicales genome had n = 9, still preserved in genomes of Akaniaceae and Caricaceae, and n = 14 in Moringaceae originated by ascending dysploidy via chromosome fissions, or (2) the ancestral genome had more than nine chromosomes, most likely n = 14, still conserved in Moringaceae, and ancestral n = 9 genomes of Akaniaceae and Caricaceae originated through independent descending dysploidies—once in Africa (Caricaceae; Rockinger et al. 2016) and once in America (Akaniaceae; Gandolfo et al. 1988; Stevens 2001 onwards, accessed on January 20, 2018).
At-β paleotetraploid genome
The uncertainty about Brassicales ancestors chromosome number also hampers the inference of the At-β paleotratraploid genome’s chromosome number. A more parsimonious scenario favors a genome with nine chromosomes being duplicated to n = 18, rather than the origin of a tetraploid genome with n = 28 from its n = 14 precursor. However, neither n = 18 nor n = 28 is retained in modern Brassicales genomes, which all diversified after the At-β duplication (Fig. 1). With the exception of Koeberliniaceae (n = 22), chromosome numbers lower than n = 18 found in all post-At-β families without a younger WGD would be in accord with the idea of a tetraploid At-β genome with n = 18, later diploidized by independent dysploidies. Then, the most extensive descending dysploidy would be found in Limnanthaceae, presumably representing a 3.6-fold reduction in chromosome number from n = 18 to n = 5.
References
Arabidopsis Genome Initiative (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408:796–815. https://doi.org/10.1038/35048692
Argout X, Salse J, Aury JM, Guiltinan MJ, Droc G, Gouzy J, Allegre M, Chaparro C, Legavre T, Maximova SN, Abrouk M, Murat F, Fouet O, Poulain J, Ruiz M, Roguet Y, Rodier-Goud M, Barbosa-Neto JF, Sabot F, Kudrna D, Ammiraju JS, Schuster SC, Carlson JE, Sallet E, Schiex T, Dievart A, Kramer M, Gelley L, Shi Z, Bérard A, Viot C, Boccara M, Risterucci AM, Guignon V, Sabau X, Axtell MJ, Ma Z, Zhang Y, Brown S, Bourge M, Golser W, Song X, Clement D, Rivallan R, Tahi M, Akaza JM, Pitollat B, Gramacho K, D’Hont A, Brunel D, Infante D, Kebe I, Costet P, Wing R, McCombie WR, Guiderdoni E, Quetier F, Panaud O, Wincker P, Bocs S, Lanaud C (2010) The genome of Theobroma cacao. Nat Genet 43:101–108. https://doi.org/10.1038/ng.736
Barker MS, Vogel H, Schranz ME (2009) Paleopolyploidy in the Brassicales: analyses of the Cleome transcriptome elucidate the history of genome duplications in Arabidopsis and other Brassicales. Genome Biol Evol 1:391–399. https://doi.org/10.1093/gbe/evp040
Bennett MD, Leitch IJ (2012) Angiosperm DNA C-values database (release 8.0, Dec. 2012). Available at: http://www.kew.org/cvalues/
Bowers JE, Chapman BA, Rong J, Paterson AH (2003) Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature 422(6930):433–438. https://doi.org/10.1038/nature01521
Cardinal-McTeague WM, Sytsma KJ, Hall JC (2016) Biogeography and diversification of Brassicales: a 103 million year tale. Molec Phylogen Evol 99:204–224. https://doi.org/10.1016/j.ympev.2016.02.021
Cheng S, van den Bergh E, Zeng P, Zhong X, Xu J, Liu X, Hofberger J, de Bruijn S, Bhide AS, Kuelahoglu C, Bian C, Chen J, Fan G, Kaufmann K, Hall JC, Becker A, Bräutigam A, Weber AP, Shi C, Zheng Z, Li W, Lv M, Tao Y, Wang J, Zou H, Quan Z, Hibberd JM, Zhang G, Zhu XG, Xu X, Schranz ME (2013) The Tarenaya hassleriana genome provides insight into reproductive trait and genome evolution of crucifers. Pl Cell 25:2813–2830. https://doi.org/10.1105/tpc.113.113480
Christenhusz MJ, Byng JW (2016) The number of known plants species in the world and its annual increase. Phytotaxa 261:201–217. https://doi.org/10.11646/phytotaxa.261.3.1
Cremonini R (2005) Low chromosome number angiosperms. Caryologia 58:403–409. https://doi.org/10.1080/00087114.2005.10589480
Edger PP, Hall JC, Harkess A, Tang M, Coombs J, Mohammadin S, Schranz ME, Xiong Z, Leebens-Mack J, Meyers BC, Systma KJ, Koch M, Al-Shehbaz IA, Pires JC (2018) Brassicales phylogeny inferred from 72 plastid genes: a reanalysis of the phylogenetic localization of two paleopolyploid events and origin of novel chemical defenses. Amer J Bot 105. https://doi.org/10.1002/ajb2.1040
Gandolfo MA, Dibbern MC, Romero EJ (1988) Akania patagonica n. sp. and additional material on Akania americana Romero & Hickey (Akaniaceae), from Paleocene sediments of Patagonia. Bull Torrey Bot Club 115:83–88. https://doi.org/10.2307/2996138
Huang CH, Zhang C, Liu M, Hu Y, Gao T, Qi J, Ma H (2016) Multiple polyploidization events across Asteraceae with two nested events in the early history revealed by nuclear phylogenomics. Molec Biol Evol 33:2820–2835. https://doi.org/10.1093/molbev/msw157
Jaillon O, Aury JM, Noel B, Policriti A, Clepet C, Casagrande A, Choisne N, Aubourg S, Vitulo N, Jubin C, Vezzi A, Legeai F, Hugueney P, Dasilva C, Horner D, Mica E, Jublot D, Poulain J, Bruyère C, Billault A, Segurens B, Gouyvenoux M, Ugarte E, Cattonaro F, Anthouard V, Vico V, Del Fabbro C, Alaux M, Di Gaspero G, Dumas V, Felice N, Paillard S, Juman I, Moroldo M, Scalabrin S, Canaguier A, Le Clainche I, Malacrida G, Durand E, Pesole G, Laucou V, Chatelet P, Merdinoglu D, Delledonne M, Pezzotti M, Lecharny A, Scarpelli C, Artiguenave F, Pè ME, Valle G, Morgante M, Caboche M, Adam-Blondon AF, Weissenbach J, Quétier F, Wincker P, French-Italian Public Consortium for Grapevine Genome Characterization (2007) The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 449:463–467. https://doi.org/10.1038/nature06148
Kiefer M, Schmickl R, German DA, Lysak M, Al-Shehbaz IA, Franzke A, Mummenhoff K, Stamatakis A, Koch MA (2014) BrassiBase: introduction to a novel database on Brassicaceae evolution. Pl Cell Physiol 55:e3. https://doi.org/10.1093/pcp/pct158
Koch MA, German DA, Kiefer M, Franzke A (2018) Database taxonomics as key to modern plant biology. Trends Pl Sci 23:4–6. https://doi.org/10.1016/j.tplants.2017.10.005
Maere S, De Bodt S, Raes J, Casneuf T, Van Montagu M, Kuiper M, Van de Peer Y (2005) Modeling gene and genome duplications in eukaryotes. Proc Natl Acad Sci USA 102:5454–5459. https://doi.org/10.1073/pnas.0501102102
Mandáková T, Lysak MA (2018) Post-polyploid diploidization and diversification through dysploid changes. Curr Opin Pl Biol 42:55–65. https://doi.org/10.1016/j.pbi.2018.03.001
Mandáková T, Li Z, Barker MS, Lysak MA (2017) Diverse genome organization following 13 independent mesopolyploid events in Brassicaceae contrasts with convergent patterns of gene retention. Pl J 91:3–21. https://doi.org/10.1111/tpj.13553
Ming R, Hou S, Feng Y, Yu Q, Dionne-Laporte A, Saw JH, Senin P, Wang W, Ly BV, Lewis KL, Salzberg SL, Feng L, Jones MR, Skelton RL, Murray JE, Chen C, Qian W, Shen J, Du P, Eustice M, Tong E, Tang H, Lyons E, Paull RE, Michael TP, Wall K, Rice DW, Albert H, Wang ML, Zhu YJ, Schatz M, Nagarajan N, Acob RA, Guan P, Blas A, Wai CM, Ackerman CM, Ren Y, Liu C, Wang J, Wang J, Na JK, Shakirov EV, Haas B, Thimmapuram J, Nelson D, Wang X, Bowers JE, Gschwend AR, Delcher AL, Singh R, Suzuki JY, Tripathi S, Neupane K, Wei H, Irikura B, Paidi M, Jiang N, Zhang W, Presting G, Windsor A, Navajas-Pérez R, Torres MJ, Feltus FA, Porter B, Li Y, Burroughs AM, Luo MC, Liu L, Christopher DA, Mount SM, Moore PH, Sugimura T, Jiang J, Schuler MA, Friedman V, Mitchell-Olds T, Shippen DE, dePamphilis CW, Palmer JD, Freeling M, Paterson AH, Gonsalves D, Wang L, Alam M (2008) The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus). Nature 452:991–996. https://doi.org/10.1038/nature06856
Patchell MJ, Roalson EH, Hall JC (2014) Resolved phylogeny of Cleomaceae based on all three genomes. Taxon 63:315–328. https://doi.org/10.12705/632.17
Rice A, Glick L, Abadi S, Einhorn M, Kopelman NM, Salman-Minkov A, Mayzel J, Chay O, Mayrose I (2015) The Chromosome Counts Database (CCDB)—a community resource of plant chromosome numbers. New Phytol 206:19–26. https://doi.org/10.1111/nph.13191
Rockinger A, Sousa A, Carvalho FA, Renner SS (2016) Chromosome number reduction in the sister clade of Carica papaya with concomitant genome size doubling. Amer J Bot 103:1082–1088. https://doi.org/10.3732/ajb.1600134
Salse J (2016) Ancestors of modern plant crops. Curr Opin Pl Biol 30:134–142. https://doi.org/10.1016/j.pbi.2016.02.005
Schranz ME, Mitchell-Olds T (2006) Independent ancient polyploidy events in the sister families Brassicaceae and Cleomaceae. Pl Cell 18:1152–1165. https://doi.org/10.1105/tpc.106.041111
Schranz ME, Mohammadin S, Edger PP (2012) Ancient whole genome duplications, novelty and diversification: the WGD Radiation Lag-Time Model. Curr Opin Pl Biol 15:147–153. https://doi.org/10.1016/j.pbi.2012.03.011
Stevens PF (2001 onwards) Angiosperm phylogeny website. Version 14, July 2017 [and more or less continuously updated since]. Available at: http://www.mobot.org/MOBOT/research/APweb/
Tian Y, Zeng Y, Zhang J, Yang C, Yan L, Wang X, Shi C, Xie J, Dai T, Peng L, Zeng Huan Y, Xu A, Huang Y, Zhang J, Ma X, Dong Y, Hao S, Sheng J (2015) High quality reference genome of drumstick tree (Moringa oleifera Lam.), a potential perennial crop. Sci China Life Sci 58:627–638. https://doi.org/10.1007/s11427-015-4872-x
Warwick S, Al-Shehbaz IA (2006) Brassicaceae: chromosome number index and database on CD-Rom. Pl Syst Evol 259:237–248. https://doi.org/10.1007/s00606-006-0421-1
Acknowledgements
I thank Patrick P. Edger for discussing the phylogenetic placement of the At-β event and Terezie Mandáková for her help with the chromosome images presented in Fig. 2. This work was supported by a research grant from the Czech Science Foundation (Grant No. P501/12/G090) and the CEITEC 2020 Project (Grant No. LQ1601).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Human and animal rights
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Handling editor: Karol Marhold.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Information on Electronic Supplementary Material
Information on Electronic Supplementary Material
Online Resource 1. List of chromosome numbers and genome sizes of Brassicales taxa.
Rights and permissions
About this article
Cite this article
Lysak, M.A. Brassicales: an update on chromosomal evolution and ancient polyploidy. Plant Syst Evol 304, 757–762 (2018). https://doi.org/10.1007/s00606-018-1507-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00606-018-1507-2