Abstract
Many plants harbor complex mechanisms that promote outcrossing and efficient pollen transfer. These include floral adaptations as well as genetic mechanisms, such as molecular self-incompatibility (SI) systems. The maintenance of such systems over long evolutionary timescales suggests that outcrossing is favorable over a broad range of conditions. Conversely, SI has repeatedly been lost, often in association with transitions to self-fertilization (selfing). This transition is favored when the short-term advantages of selfing outweigh the costs, primarily inbreeding depression. The transition to selfing is expected to have major effects on population genetic variation and adaptive potential, as well as on genome evolution. In the Brassicaceae, many studies on the population genetic, gene regulatory, and genomic effects of selfing have centered on the model plant Arabidopsis thaliana and the crucifer genus Capsella. The accumulation of population genomics datasets have allowed detailed investigation of where, when and how the transition to selfing occurred. Future studies will take advantage of the development of population genetics theory on the impact of selfing, especially regarding positive selection. Furthermore, investigation of systems including recent transitions to selfing, mixed mating populations and/or multiple independent replicates of the same transition will facilitate dissecting the effects of mating system variation from processes driven by demography.
You have full access to this open access chapter, Download protocol PDF
Similar content being viewed by others
Key words
- Self-fertilization
- Arabidopsis
- Capsella
- Self-incompatibility
- Mating system evolution
- Heterozygosity
- Effective population size
- Recombination rate
- Transposable element
- Efficacy of selection
1 Introduction
Flowering plants harbor a great variety of mating systems and associated floral and reproductive adaptations [1], and there is a rich empirical and theoretical literature on the causes of this diversity [2,3,4,5,6]. About half of all flowering plants harbor genetic self-incompatibility (SI), a molecular recognition system that allows plants to recognize and reject self pollen, and that has arisen multiple times in the history of flowering plants [7]. Despite the fact that molecular SI systems are widespread, loss of SI, often accompanied by a shift to higher selfing rates, has occurred even more frequently, in many independent plant lineages [8]. This transition can be favored under conditions when the benefits of selfing, such as reproductive assurance [2] and the 3:2 inherent genetic transmission advantage of selfing [9], outweigh the costs of inbreeding depression and reduced opportunities for outcrossing through pollen (pollen discounting). As the favorability of the transition hinges on ecological factors including access to mates and pollinators that may vary greatly spatially or temporally, it is perhaps not surprising that the transition to selfing has occurred repeatedly [10]. Over a longer term, however, the loss of SI is associated with a reduction in the net diversification rate [11], a finding that provides tentative support for Stebbins’s suggestion that selfing is an evolutionary dead end [12]. While the underlying ecological and evolutionary mechanisms behind this observation remain unclear, it was suggested already by Stebbins [12] that decreased adaptive potential in selfers would lead to higher extinction rates, a suggestion that is supported by theoretical modeling [13]. However, selfing does not only affect adaptation but also the impact of purifying selection [14, 15], and the relative importance of accumulation of deleterious mutations vs. reduced potential for adaptation in selfing lineages currently remains unclear. To fully understand the impact of mating system shifts on evolutionary processes, it is necessary to combine theoretical and empirical investigations, and ideally to study several parallel transitions from outcrossing to selfing. Molecular population genetics has proven to be a powerful tool to shed light on the role of natural selection in shaping the patterns of variation in selfing species. Here we will give an outline of recent work in this area, with a focus on two main model systems in the Brassicaceae.
2 The Molecular Basis of the Loss of SI and Evolution of Self-Fertilization in Brassicaceae
The effects of the transition to self-fertilization on population genomic variation and molecular evolution have been extensively studied in two systems from the Brassicaceae family, Arabidopsis and Capsella. Both of these genera have outcrossing SI as well as SC species with high selfing rates, and thus serve as good models to study this evolutionary transition [16,17,18]. The most widely studied SC species are Arabidopsis thaliana and Capsella rubella, which have both been estimated to be highly selfing [19,20,21,22]. The patterns of variation and molecular evolution in these selfers are often contrasted with those in their diploid sister species Arabidopsis lyrata and Capsella grandiflora, which are both SI and outcrossing. Investigations of the other SC species from these genera such as allopolyploid Arabidopsis suecica [23] and Arabidopsis kamchatica [24,25,26], diploid Capsella orientalis and allopolyploid Capsella bursa-pastoris [18] have given further insight into the evolution of selfing. In Fig. 1, we provide an overview of the evolutionary relationships among the best-studied Arabidopsis and Capsella species.
Knowledge on the molecular basis of the breakdown of SI is at the center of studies investigating the early genetic causes of the transition to selfing in the Brassicaceae. In Brassicaceae, the SI recognition system includes two key genes at the nonrecombining self-incompatibility locus (S-locus) as well as modifier genes. The gene SRK encodes an S-locus receptor kinase that is located on the stigma surface and acts as the female specificity determinant, whereas the gene SCR encodes a pollen ligand that is deposited on the pollen surface and acts as the male specificity determinant [27]. This reaction is a key-lock protein interaction between the female determinant on the stigma and the male determinant on the pollen coat [27]. When SRK on the stigma binds to SCR from the same S-haplotype, a downstream reaction is triggered which culminates in the prevention of pollen tube growth and fertilization [28]. The evolution of selfing proceeds by disruptions of the SI reaction, for example due to loss-of-function mutations in key S-locus genes or in unlinked modifier genes (e.g., [29]), after which the selection at these loci is relaxed and the genes may be degraded further.
There has been intense interest in the role of parallel molecular changes underlying repeated shifts to selfing associated with the loss of SI (reviewed in [30]). In particular, theory predicts that mutations that disrupt the function of the male specificity determinant might spread more easily than those that disrupt the female specificity determinant [31, 32], and there is accumulating support for this prediction. For instance, in A. thaliana, Tsuchimatsu et al. [33] showed that an inversion in SCR underlies SC in many European accessions. In some accessions, SI can be restored by introduction of functional SRK-SCR allele from self-incompatible sister species A. lyrata but variability between accessions exists [34, 35]. In Capsella homologous machinery has been shown to underlie the SI reaction [36] and there is widespread transspecific shared polymorphism between C. grandiflora and SI Arabidopsis species at the S-locus [37]. The loss of SI in C. rubella is due to changes at the S-locus [36, 38], and experiments suggest that breakdown of the male specificity function is responsible for this loss [36]. However, the molecular basis of the breakdown of SI in C. rubella remains unclear [39] and this is also true for the other SC Capsella species. With recent progress in long-read sequencing, which facilitates assembly of the S-locus, this area is ripe for further investigation.
3 Population Genetics Consequences of Selfing
3.1 Theoretical Expectations
Self-fertilization has drastic consequences on the patterns and distribution of genetic variation, and for the impact of natural selection. The level of selfing is therefore an important factor to consider in population genetics. Here we summarize the expected population genetic consequences of selfing (Fig. 2) and then present empirical results from Arabidopsis and Capsella that illustrate the theoretical expectations.
Selfing has two major key effects; it results in reduced heterozygosity and a reduced effective population size (Ne). Under complete selfing, heterozygosity is halved every generation. Hence, the heterozygosity of a completely selfing population is almost fully eliminated already after six generations of complete selfing. A side effect is a rapid generation of isolated lines of different genotypes [40] which is expected to result in stronger population structure in selfing species compared to outcrossers [41]. Moreover, selfers are expected to exhibit a reduction in Ne for several reasons. First, selfing immediately results in a twofold reduction of the number of independently sampled gametes, and this is expected to reduce the Ne by a factor of two [42, 43]. Even greater reductions in Ne are expected if selfers undergo more frequent extinction and recolonization dynamics than outcrossers [44], or if the origin of selfing species is often associated with bottlenecks [14]. Furthermore, because selfing results in a rapid decrease in heterozygosity, recombination is less efficient at breaking up linkage disequilibrium in selfers than in outcrossers [45]. In this situation, background selection or recurrent hitchhiking (linked selection) will have a greater impact, reducing neutral genetic diversity beyond what would be expected in an outcrosser [46]. Together, these factors decrease the overall genetic diversity [41] and increase the linkage disequilibrium (LD) of selfing populations [43].
The combined effect of reduced Ne and effective recombination rate will also affect the efficacy of selection genome-wide. On the one hand, when Ne is reduced, a higher proportion of the genome behaves neutrally and alleles that were slightly deleterious in large populations become effectively neutral [47]. In addition, as an effect of the reduced effective recombination rate in selfers, Hill-Robertson interference [48] will increase and therefore limit selection efficacy further. As a consequence, one may expect selfing lineages to have an excess of nonsynonymous divergence compared to synonymous divergence (dN/dS or Ka/Ks) as well as polymorphisms (πN/πS), mainly due to weakened selection against slightly deleterious variants [41]. Further, reduced efficacy of selection and recombination rate may also decrease the level of codon usage bias [49, 50]. However, it should be noted that spurious signals of relaxed purifying selection can result as a result of recent demographic change [51], because the time to reach equilibrium after a bottleneck is longer for nonsynonymous than for synonymous polymorphism. Ideally, forward population genetic simulations incorporating selection and demography should therefore be undertaken to validate inference of relaxed purifying selection.
The dynamics of alleles with different levels of dominance will also be affected by the mating system [52], in ways that can sometimes counteract the effect of reduced Ne. For instance, in selfing species, increased homozygosity renders recessive alleles visible to selection, and as a result, fixation probabilities of recessive advantageous alleles are expected to be higher in selfers than in outcrossers [53]. Harmful recessive alleles can also be removed more efficiently, resulting in purging of recessive deleterious alleles, unless the reduction in Ne in selfers is severe enough that genetic drift overpowers the homozygosity effect [54].
Selfing and outcrossing populations have also been shown to differ in the dynamics of adaptive alleles. The reduced efficacy of selection will decrease the probability of slightly beneficial mutation fixation but also the fixation time of beneficial mutation in selfers is faster in comparison with the outcrossing species regardless of the dominance level [13, 55]. Further, in selfing populations, adaptation is more likely to result from new mutations (hard sweeps) while in outcrossers adaptation from standing variation (soft sweeps) is predicted to be more frequent [13, 15]. On the other hand, the effect of linked selection is expected to be stronger in selfing species due to reduced effective recombination rate which will increase the fixation probability of linked harmful mutations [56] and potentially limiting the adaptive potential of selfers.
Mating system has also been hypothesized to affect the transposable element (TE) content of the genome. TEs are mobile genetic elements that make up a large yet variable proportion of many plant genomes [57, 58]. Theory predicts that both mating system variation and differences in the effective population size (Ne) should affect TE content, because these factors affect possibilities for TEs to spread, the potential for evolution of self-regulation of transposition, and the efficacy of selection against deleterious TE insertions. For instance, outcrossing enhances opportunities for TE spread [59], and transposition rates should evolve to be highest in outcrossers, which should therefore be expected to have higher TE content than highly selfing species [60]. Likewise, if the harmful effects of TEs are mostly recessive or codominant, increased homozygosity in selfers leads to more efficient purifying selection against TEs in selfers [61]. On the other hand, natural selection against slightly deleterious TE insertions could be compromised in selfing species, because of their reduced effective population size [41]. Increased homozygosity in selfers might also decrease the deleterious effect of TEs, because this decreases the probability of ectopic recombination [62]. Under this scenario, a transition to selfing would be expected to lead to an increase in TE content. Different models therefore yield contrasting predictions regarding the expected effect of mating system variation on TE content.
3.2 Empirical Results
Several empirical results have confirmed the theoretical predictions regarding the population genetics effects of selfing. Figure 3 summarizes some empirical population genetics results from selfing and outcrossing Arabidopsis and Capsella species. First, while it is difficult to disentangle the effect of past demographic events from the effect of selfing, both C. rubella and A. thaliana present high levels of population structure [21, 22, 63]. Indeed, population structure is stronger in the selfer C. rubella than in the outcrossing C. grandiflora, consistent with theoretical expectations for selfers [22] (Fig. 3).
Furthermore, evidence for a reduction in Ne has been found in natural populations of C. rubella and A. thaliana. In C. rubella both synonymous diversity and population recombination rate are significantly lower than in the outcrossing sister species C. grandiflora [22, 64]. Similarly, the selfer A. thaliana shows lower synonymous polymorphism in comparison with the outcrossing A. lyrata [65, 66] (Fig. 3). An early study also found evidence for a faster decay of linkage disequilibrium (LD) in A. lyrata than in A. thaliana [66]. However, more recent work has shown that there is also high variation in nucleotide diversity and LD patterns in A. thaliana, both between different parts of the genome and across different geographic regions and habitats [67,68,69,70]. Patterns in A. lyrata are also complicated by strong population size decrease in several extensively studied A. lyrata populations [71], which also decreases the population recombination rate. Hence, it is worth noticing that both diversity and LD are both highly dependent on the past demographic history and the local variation in the recombination rate across the genome. The decay of LD also depends on whether estimates are based on local or global population samples [72]. In A. thaliana LD decays faster in a world-wide sample in comparison with local populations [68, 70] which may be due to local populations having a low number of founders [72].
Empirical evidence for the impact of selfing on the efficacy of selection started with early investigations of divergence and polymorphism in A. thaliana and its outcrossing sister species A. lyrata, using a limited number of loci. This study found very limited evidence for relaxed selection in A. thaliana [73]. However, using genome-wide polymorphism data, Slotte et al. [74] found evidence for weaker purifying selection on nonsynonymous sites in A. thaliana relative to the outcrosser C. grandiflora. Further studies confirmed decreased codon usage bias in both A. thaliana and C. rubella in comparison with the outcrossing A. lyrata and C. grandiflora [50] (Fig. 3e). Analyses of population genomic data from C. rubella and C. grandiflora further found evidence for a higher ratio of nonsynonymous to synonymous polymorphism in C. rubella [75, 76]. Forward population genomic simulations demonstrated that this was likely primarily a result of the reduced Ne in C. rubella, and not due to a major shift in the distribution of fitness effects (DFE) in association with the shift to selfing [76]. More recently, a study that directly estimated the DFE based on analyses of site frequency spectra found evidence for a higher proportion of nearly neutral nonsynonymous mutations in the selfers C. orientalis and C. bursa-pastoris relative to the outcrosser C. grandiflora [18]. These results are in general agreement with the theoretical expectation that purifying selection should be relaxed in selfers.
While some models predict that a shift to selfing should lead to a reduced prevalence of TEs, other models predict the opposite. So far, empirical evidence from Arabidopsis and Capsella do not unequivocally support either of these predictions. On one hand, the comparison of structure of the A. thaliana and A. lyrata genome sequences suggested that the A. thaliana genome contained large numbers of small deletions, especially in TEs [77]. This result is consistent with the hypothesis that in selfing species TEs are more efficiently removed [61]. One the other hand, Lockton and Gaut [78] found that many TE families are in higher frequency and are subjected to weaker selection in A. thaliana in comparison with A. lyrata. Furthermore, comparison of C. rubella, C. grandiflora, A. thaliana and A. lyrata revealed that the TE frequency and density in Capsella showed a stronger resemblance to A. thaliana than to A. lyrata [76]. This may indicate that the reason for the TE abundance difference between the Arabidopsis species is accumulation of TEs in the A. lyrata genome rather than decline in the selfing A. thaliana lineage.
Focusing on TE content in selfing and outcrossing Capsella species, Ågren et al. [79, 80] found an increase in TE number in C. rubella but a slight decrease in the selfer C. orientalis, in comparison with the outcrossing C. grandiflora. In the polyploid selfer C. bursa-pastoris, no evidence for a difference in TE dynamics was found in comparison with its parental species, C. grandiflora and C. orientalis [80]. Thus, while there is some evidence for a reduced prevalence of TEs in selfers, the results are not unequivocal, and further work is needed to clarify whether the contrasting findings might be related to the timing of the shift to selfing and demographic history. Indeed, there is evidence for an effect of demographic history on selection against TEs in A. lyrata, where large refugial populations exhibited a signature of purifying selection against TE insertions, whereas in bottlenecked populations, TEs were evolving neutrally [81]. In addition to broad comparative genomic studies contrasting species that differ in their mating system, studies of intraspecific variation can thus provide insight into population genomics and selection on TEs [82]. Because TEs are important contributors to variation in plant genome size and TE silencing can also affect gene regulation [83,84,85], it is of considerable interest to improve our understanding of the impact of mating system on variation in TE content.
4 Discovering the Geographic Origin and the Timing of the Mating System Shift
Understanding the timing, mode, and geographic location of the shift to selfing is of key importance for proper interpretation of population genomic data from selfing species [86]. For instance, improved understanding of the timing and geographical location of the shift can be key for interpretation of genetic structure, and can allow one to account for underlying neutral (demography induced) processes when investigating changes in the efficacy of selection.
In A. thaliana, several studies have estimated the timing of the emergence of selfing based on patterns of polymorphism and demographic modeling. A. thaliana is native to the Eurasia and Africa [87,88,89] and widely spread especially in Europe. It has also recently spread into North America in association with humans [21]. All the currently known accessions are SC, indicating that the evolution of this trait preceded the worldwide spread of the species. Linkage disequilibrium patterns suggested that the transition to selfing in A. thaliana occurred as early as 1,000,000 years ago [90] while coalescent modeling using S-locus diversity suggested a younger origin with an upper estimate of approximately 400,000 years ago [91].
The recent development of large-scale population genomics datasets from 1135 A. thaliana accessions [63], offers one of the best population genomics resource for studying plant population genetics and molecular evolution. A recent demographic modeling study exploited this resource and included additional accessions covering roughly the African distribution of the species to investigate the timing and geographic origin of the shift to selfing in A. thaliana [89]. Using a combination of the MSMC method that infers cross-coalescent times and fluctuations in the effective population size using a whole-genome data from multiple populations [92] and the site frequency spectrum based diffusion approximation method δaδi [93], Durvasula et al. [89] inferred the demographic history of the different groups from Africa and Europe. They suggest that partial loss of SI occurred 500,000–1,000,000 years ago subsequent to the migration of the ancestral founding A. thaliana population to Africa approximately 800,000–1,200,000 years ago. Although the existence of multiple nonfunctional S-haplotypes suggests that the final loss of SI occurred multiple times independently [94, 95], the new study including African accessions shows that all the currently known S-haplotypes are found coexisting in Morocco, suggesting that selfing originated in this geographic region [89]. This result was further supported by the higher estimated Ne in African populations, and they estimated that the species spread out-of-Africa some 90,000–140,000 years ago.
The postglacial colonization of A. thaliana within Europe has likely proceeded through acquirement of a weedy lifestyle [63]. The first European colonists likely occurred in the southern and eastern part of Europe [63, 96, 97] and the massive spread over central and northern parts of Europe occurred later, possibly associated with human action [63]. Population structure analysis revealed that there are two distinct groups in Central Europe and these two groups have admixed in Central Europe [98,99,100]. Two distinct lineages are also present in Scandinavia with accessions from Finland as well as from the northern parts of Sweden and Norway forming their own cluster while the southern Swedish accessions cluster with the southern accessions [99, 101]. Using a whole-genome population-genomics approach Lee et al. [102] found five different clusters within European A. thaliana accessions and they suggest that that these groups have given rise to the current distribution of the species within Europe.
Another selfing example from Arabidopsis, where the demographic and colonization history has been studied, is the polyploid species A. suecica, which is a selfing allopolyploid between A. thaliana and A. arenosa (Fig. 1). The species is spread in central Sweden and southern Finland. Early investigation explored 52 microsatellites and four nuclear sequences [103] and inferred a single and recent origin of the species approximately 12,000–300,000 YA followed by northward spread using a Bayesian coalescent population modeling. This single origin has also been supported by other studies based on the amount of variation in chloroplast sequence data [104, 105]. However, recent investigation of whole-genome resequencing data from 15 A. suecica accessions concluded that the multiple origins hypothesis cannot be ruled out [106]. Based on the S-locus haplotype dominance patterns in these accessions they suggest that A. suecica could have been SC, at least to some degree, immediately after the species emergence approximately 15,100–16,600 years ago (Fig. 1).
In Capsella it has been estimated that the timing of the loss of SI is much more recent. Isolation-migration analyses based on 39 gene fragments and assuming a mutation rate of 1.5 × 10−8 suggested that the shift to selfing was concomitant with speciation of C. rubella from an outcrossing ancestor similar to present-day C. grandiflora, and that this occurred some 20,000 years ago [64]. Likewise, an investigation of the diversity patterns at the S-locus across the European range of the species suggested that loss of SI took place in Greece approximately 40,000 years ago [37], since the presumably ancestral long form of SRK allele is present in this region while the other accessions harbor a shorter form. Divergence estimates based on analysis of genome-wide founding haplotypes suggested that selfing evolved later, approximately 50,000–100,000 years ago [75]. Assuming a different mutation rate of 7.1 × 10−9, Slotte et al. [76] analyzed genome-wide site frequency spectra using δaδi, estimated that the timing of the split between C. grandiflora and C. rubella occurred <200,000 years ago (Fig. 1). Later on, coalescent-based analyses of genome-wide joint site frequency spectra from C. grandiflora, C. orientalis and C. bursa-pastoris were used to infer the timing of allopolyploid speciation resulting in the selfing species C. bursa-pastoris ([18], Fig. 1). Together these analyses have provided an evolutionary framework for further genomic studies of the consequences of selfing and allopolyploidy in Capsella (Fig. 1).
5 Some Caveats
As many authors point out, estimates of population split times are usually based on assumptions that contain considerable uncertainty (see, e.g., ref. 103). For example, a fixed mutation rate or a distribution around a mean is often assumed. The direct mutation rate estimate from A. thaliana mutation accumulation lines of 7.1 × 10−9 [107] is commonly used for Arabidopsis, whereas, as pointed out above, early estimates in Capsella were based on a mutation rate of 1.5 × 10−8 [108] while later studies (e.g., Slotte et al. [76] and Douglas et al. [18]) have used the Ossowski et al. [107] mutation rate estimate. Other assumptions may include constant generation time, recombination rate and analyses may further be restricted to a limited number of demographic models. These factors may be variable between studies and it is important to pay attention to these details when evaluating the results of demographic modeling.
A more severe limitation concerns the effect of selection on linked sites on demographic inference. Simulations have shown that the increased intensity of background selection in selfers can rapidly lead to a strong reduction in neutral diversity [109]. Based on these findings, some authors have questioned whether it is possible to reliably infer demographic changes associated with the shift to selfing based on neutral polymorphism [109]. The impact of linked selection on patterns of neutral polymorphism is indeed a general problem for demographic inference [110,111,112]. To some degree, it may be possible to circumvent this problem by judiciously choosing which sites to use for demographic inference [46], and by using forward population genetic simulations in software such as SLiM [113, 114] to assess whether results are robust to the effect of selection on linked sites. As an example, a recent study on Arabis alpina used site frequency spectra for sites in genomic regions with high recombination rates and low gene density, which should be least affected by linked selection, to infer the demographic history of selfing Scandinavian populations [115]. The reliability of this inference was further checked with forward simulations incorporating background selection and a shift to selfing [115]. This study therefore demonstrated one way to assess the effect of linked selection on demographic inference in selfing populations.
6 Future Directions
In this chapter, we have given a brief overview of population genomic studies of the transition to selfing, focusing on the two model systems Arabidopsis and Capsella. One limitation of most of the studies presented here is that they have focused on comparing pairs of species with contrasting mating systems. Ideally, to be able to distinguish between idiosyncrasies of one particular contrast and effects of selfing per se, future comparative population genomic studies of the effect of selfing should include multiple phylogenetically independent contrasts.
An additional limitation is that most studies have focused on contrasting only highly selfing and obligate outcrossing species, although there is a lot more diversity to plant mating systems. For instance, a substantial proportion (approximately 42%) of flowering plants undergoes a mix of outcrossing and self-fertilization [116]. Despite this fact, and despite the existence of theoretical and simulation-based work on the expected population genomic consequences of partial selfing [15, 54, 56, 117], there is a dearth of empirical population genomic studies including mixed mating populations and species (but see, e.g., 115, 118). One exception is a recent study which tested for a difference in the impact of purifying selection among obligate outcrossing, mixed-mating, and highly selfing populations of Arabis alpina [115]. This study found no major detectable difference in purifying selection between mixed mating and outcrossing populations, whereas purifying selection efficacy was significantly lower in Scandinavian selfing populations, most likely as a result of a postglacial colonization bottleneck [115]. These results are consistent with the expectation that a low level of outcrossing may be sufficient to prevent accumulation of deleterious alleles [117]. However, further empirical studies in more mixed mating species and populations, ideally including larger sample sizes, are required to establish the generality of this pattern.
Another less empirically studied issue is how the prevalence of positive selection and especially how selective sweeps are impacted by mating system. In A. thaliana, the proportion of amino acid substitutions driven by positive selection has been estimated to be close to 0% [74, 119] while for example in the outcrossing relative C. grandiflora this proportion has been estimated to be as high 40% [74]. However, making inferences on the underlying cause of this difference is not straightforward. For example, positive selection has also been shown to be rare in the outcrossing A. lyrata [120] suggesting that other factors, such as for instance demographic history, may also result in low rates of adaptive substitutions. The theoretical work on the effect of dominance on fixation probabilities has yielded results that could be tested by taking advantage of genome-wide population genetics datasets and methodological developments regarding sweep detection (e.g., [121, 122]). For example, with such methodology, it is possible to test whether there is a difference in the relative occurrence of hard and soft sweeps in selfers and outcrossing populations as predicted by theoretical work [13]. Further advances in the classification of mutations into different dominance classes [123] will allow for testing hypotheses related to the behavior of recessive, additive, and dominant alleles in selfing vs. outcrossing species.
References
Barrett SC (2002) Evolution of sex: the evolution of plant sexual diversity. Nat Rev Genet 3:274
Darwin C (1876) The effects of cross and self fertilisation in the vegetable kingdom. J Murray, London
Darwin C (1877) The different forms of flowers on plants of the same species. J Murray, London
Lloyd DG (1984) Gender allocation in outcrossing cosexual plants. In: Dirzo R, Sarukhán J (eds) Perspectives on plant population ecology. Sinauer, Sunderland, MA, pp 277–300
Charlesworth D, Charlesworth B (1987) Inbreeding depression and its evolutionary consequences. Annu Rev Ecol Syst 18:237–268
Charnov EL (1982) The theory of sex allocation. Princeton University Press, Princeton, NJ
Raduski AR, Haney EB, Igić B (2012) The expression of self-incompatibility in angiosperms is bimodal. Evolution 66:1275–1283
Igic B, Lande R, Kohn J (2008) Loss of self-incompatibility and its evolutionary consequences. Int J Plant Sci 169:93–104
Fisher RA (1941) Average excess and average effect on an allelic substitution. Ann Eugenics 11:53–63
Charlesworth D (2006) Evolution of plant breeding systems. Curr Biol 16:R735
Goldberg EE, Kohn JR, Lande R et al (2010) Species selection maintains self-incompatibility. Science 330:493–495
Stebbins GL (1957) Self fertilization and population variability in the higher plants. Am Nat 91:337–354
Glémin S, Ronfort J (2013) Adaptation and maladaptation in selfing and outcrossing species: new mutations versus standing variation. Evolution 67:225–240
Wright SI, Kalisz S, Slotte T (2013) Evolutionary consequences of self-fertilization in plants. Proc R Soc B 280:20130133
Hartfield M (2016) Evolutionary genetic consequences of facultative sex and outcrossing. J Evol Biol 29:5–22
Clauss MJ, Koch MA (2006) Poorly known relatives of Arabidopsis thaliana. Trends Plant Sci 11:449–459
Hurka H, Friesen N, German DA et al (2012) ‘Missing link’ species Capsella orientalis and Capsella thracica elucidate evolution of model plant genus Capsella (Brassicaceae). Mol Ecol 21:1223–1238
Douglas GM, Gos G, Steige KA et al (2015) Hybrid origins and the earliest stages of diploidization in the highly successful recent polyploid Capsella bursa-pastoris. PNAS 112:2806–2811
Abbott RJ, Gomes MF (1989) Population genetic structure and outcrossing rate of Arabidopsis thaliana (L.) Heynh. Heredity 62:411
Bergelson J, Stahl E, Dudek S et al (1998) Genetic variation within and among populations of Arabidopsis thaliana. Genetics 148:1311–1323
Platt A, Horton M, Huang YS et al (2010) The scale of population structure in Arabidopsis thaliana. PLoS Genet 6:e1000843
St. Onge KR, Källman T, Slotte T et al (2011) Contrasting demographic history and population structure in Capsella rubella and Capsella grandiflora, two closely related species with different mating systems. Mol Ecol 20:3306–3320
Säll T, Lind-Halldén C, Jakobsson M et al (2004) Mode of reproduction in Arabidopsis suecica. Hereditas 141:313–317
Shimizu KK, Fujii S, Marhold K et al (2005) Arabidopsis kamchatica (Fisch. ex DC.) K. Shimizu & Kudoh and A. kamchatica subsp. kawasakiana (Makino) K. Shimizu & Kudoh, new combinations. Acta Phytotax Geobot 56:163–172
Tsuchimatsu T, Kaiser P, Yew C et al (2012) Recent loss of self-incompatibility by degradation of the male component in allotetraploid Arabidopsis kamchatica. PLoS Genet 8:e1002838
Shimizu-Inatsugi R, LihovÁ J, Iwanaga H et al (2009) The allopolyploid Arabidopsis kamchatica originated from multiple individuals of Arabidopsis lyrata and Arabidopsis halleri. Mol Ecol 18:4024–4048
Takayama S, Isogai A (2005) Self-incompatibility in plants. Annu Rev Plant Biol 56:467–489
Nasrallah JB (2017) Plant mating systems: self-incompatibility and evolutionary transitions to self-fertility in the mustard family. Curr Opin Genet Dev 47:54–60
Mable BK, Hagmann J, Kim ST et al (2017) What causes mating system shifts in plants? Arabidopsis lyrata as a case study. Heredity 118:52–63
Shimizu KK, Tsuchimatsu T (2015) Evolution of selfing: recurrent patterns in molecular adaptation. Annu Rev Ecol Evol Syst 46:593–622
Uyenoyama MK, Zhang Y, Newbigin E (2001) On the origin of self-incompatibility haplotypes: transition through self-compatible intermediates. Genetics 157:1805–1817
Tsuchimatsu T, Shimizu KK (2013) Effects of pollen availability and the mutation bias on the fixation of mutations disabling the male specificity of self-incompatibility. J Evol Biol 26:2221–2232
Tsuchimatsu T, Suwabe K, Shimizu-Inatsugi R et al (2010) Evolution of self-compatibility in Arabidopsis by a mutation in the male specificity gene. Nature 464:1342–1346
Nasrallah ME, Liu P, Nasrallah JB (2002) Generation of self-incompatible Arabidopsis thaliana by transfer of two S locus genes from A. lyrata. Science 297:247–249
Nasrallah ME, Liu P, Sherman-Broyles S et al (2004) Natural variation in expression of self-incompatibility in Arabidopsis thaliana: implications for the evolution of selfing. PNAS 101:16070–16074
Nasrallah JB, Liu P, Sherman-Broyles S et al (2007) Epigenetic mechanisms for breakdown of self-incompatibility in interspecific hybrids. Genetics 175:1965–1973
Guo YL, Bechsgaard JS, Slotte T et al (2009) Recent speciation of Capsella rubella from Capsella grandiflora, associated with loss of self-incompatibility and an extreme bottleneck. PNAS 106:5246–5251
Slotte T, Hazzouri KM, Stern D et al (2012) Genetic architecture and adaptive significance of the selfing syndrome in Capsella. Evolution 66:1360–1374
Vekemans X, Poux C, Goubet PM et al (2014) The evolution of selfing from outcrossing ancestors in Brassicaceae: what have we learned from variation at the S-locus? J Evol Biol 27:1372–1385
Hedrick PW (2005) Genetics of populations. Jones and Bartlett, Boston (MA)
Charlesworth D, Wright SI (2001) Breeding systems and genome evolution. Curr Opin Genet Dev 11:685–690
Pollak E (1987) On the theory of partially inbreeding finite populations. I. Partial selfing. Genetics 117:353–360
Nordborg M (2000) Linkage disequilibrium, gene trees and selfing: an ancestral recombination graph with partial self-fertilization. Genetics 154:923–929
Ingvarsson P (2002) A metapoulation perspective on genetic diversity and differentiation in partially self-fertilizing plants. Evolution 56:2368–2373
Roze D, Lenormand T (2005) Self-fertilization and the evolution of recombination. Genetics 170:841–857
Slotte T (2014) The impact of linked selection on plant genomic variation. Brief Funct Genomics 13:268–275
Ohta T (1973) Slightly deleterious mutant substitutions in evolution. Nature 246:96–98
Hill WG, Robertson A (1966) The effect of linkage on limits to artificial selection. Genet Res 8:269–294
Marais G, Charlesworth B, Wright SI (2004) Recombination and base composition: the case of the highly self-fertilizing plant Arabidopsis thaliana. Genome Biol 5:R45
Qiu S, Zeng K, Slotte T et al (2011) Reduced efficacy of natural selection on codon usage bias in selfing Arabidopsis and Capsella species. Genome Biol Evol 3:868–880
Brandvain Y, Wright SI (2016) The limits of natural selection in a nonequilibrium world. Trends Genet 32:201–210
Glémin S (2007) Mating systems and the efficacy of selection at the molecular level. Genetics 177:905–916
Charlesworth B (1992) Evolutionary rates in partially self-fertilizing species. Am Nat 140:126–148
Glémin S (2003) How are deleterious mutations purged? Drift versus nonrandom mating. Evolution 57:2678–2687
Caballero A, Hill WG (1992) Effects of partial inbreeding on fixation rates and variation of mutant genes. Genetics 131:493–507
Hartfield M, Glémin S (2014) Hitchhiking of deleterious alleles and the cost of adaptation in partially selfing species. Genetics 196:281–293
Tenaillon MI, Hollister JD, Gaut BS (2010) A triptych of the evolution of plant transposable elements. Trends Plant Sci 15:471–478
Ågren JA (2014) Evolutionary transitions in individuality: insights from transposable elements. Trends Evol Ecol 29:90–96
Boutin TS, Le Rouzic A, Capy P (2012) How does selfing affect the dynamics of selfish transposable elements? Mob DNA 3(1):5
Charlesworth B, Langley CH (1986) The evolution of self-regulated transposition of transposable elements. Genetics 112:359–383
Wright SI, Schoen DJ (1999) Transposon dynamics and the breeding system. Genetica 107:139–148
Charlesworth D, Charlesworth B (1995) Transposable elements in inbreeding and outbreeding populations. Genetics 140:415–417
1001 Genomes Consortium (2016) 1,135 genomes reveal the global pattern of polymorphism in Arabidopsis thaliana. Cell 166:481–491
Foxe JP, Slotte T, Stahl EA et al (2009) Recent speciation associated with the evolution of selfing in Capsella. PNAS 106:5241–5245
Savolainen O, Langley CH, Lazzaro BP et al (2000) Contrasting patterns of nucleotide polymorphism at the alcohol dehydrogenase locus in the outcrossing Arabidopsis lyrata and the selfing Arabidopsis thaliana. Mol Biol Evol 17:645–655
Wright SI, Foxe JP, DeRose-Wilson L et al (2006) Testing for effects of recombination rate on nucleotide diversity in natural populations of Arabidopsis lyrata. Genetics 174:1421–1430
Nordborg M, Hu TT, Ishino Y et al (2005) The pattern of polymorphism in Arabidopsis thaliana. PLoS Biol 3:e196
Kim S, Plagnol V, Hu TT et al (2007) Recombination and linkage disequilibrium in Arabidopsis thaliana. Nat Genet 39:1151–1155
Bomblies K, Yant L, Laitinen RA et al (2010) Local-scale patterns of genetic variability, outcrossing, and spatial structure in natural stands of Arabidopsis thaliana. PLoS Genet 6:e1000890
Cao J, Schneeberger K, Ossowski S et al (2011) Whole-genome sequencing of multiple Arabidopsis thaliana populations. Nat Genet 43:956–963
Mattila TM, Tyrmi J, Pyhäjärvi T et al (2017) Genome-wide analysis of colonization history and concomitant selection in Arabidopsis lyrata. Mol Biol Evol 34:2665–2677
Nordborg M, Borevitz JO, Bergelson J et al (2002) The extent of linkage disequilibrium in Arabidopsis thaliana. Nat Genet 30:190–193
Wright SI, Lauga B, Charlesworth D (2002) Rates and patterns of molecular evolution in inbred and outbred Arabidopsis. Mol Biol Evol 19:1407–1420
Slotte T, Foxe JP, Hazzouri KM et al (2010) Genome-wide evidence for efficient positive and purifying selection in Capsella grandiflora, a plant species with a large effective population size. Mol Biol Evol 27:1813–1821
Brandvain Y, Slotte T, Hazzouri KM et al (2013) Genomic identification of founding haplotypes reveals the history of the selfing species Capsella rubella. PLoS Genet 9:e1003754
Slotte T, Hazzouri KM, Ågren JA et al (2013) The Capsella rubella genome and the genomic consequences of rapid mating system evolution. Nat Genet 45:831–835
Hu TT, Pattyn P, Bakker EG et al (2011) The Arabidopsis lyrata genome sequence and the basis of rapid genome size change. Nat Genet 43:476–481
Lockton S, Gaut B (2010) The evolution of transposable elements in natural populations of self-fertilizing Arabidopsis thaliana and its outcrossing relative Arabidopsis lyrata. BMC Evol Biol 10:10
Ågren JA, Wang W, Koenig D et al (2014) Mating system shifts and transposable element evolution in the plant genus Capsella. BMC Genomics 15:602
Ågren JA, Huang H, Wright SI (2016) Transposable element evolution in the allotetraploid Capsella bursa-pastoris. Am J Bot 103:1197–1202
Lockton S, Ross-Ibarra J, Gaut BS (2008) Demography and weak selection drive patterns of transposable element diversity in natural populations of Arabidopsis lyrata. PNAS 105:13965–13970
Horvath R, Slotte T (2017) The role of small RNA-based epigenetic silencing for purifying selection on transposable elements in Capsella grandiflora. Genome Biol Evol 9:2911–2920
Steige KA, Laenen B, Reimegård J et al (2017) Genomic analysis reveals major determinants of cis-regulatory variation in Capsella grandiflora. PNAS 114:1087–1092
Wang X, Weigel D, Smith LM (2013) Transposon variants and their effects on gene expression in Arabidopsis. PLoS Genet 9:e1003255
Steige KA, Reimegård J, Koenig D et al (2015) Cis-regulatory changes associated with a recent mating system shift and floral adaptation in Capsella. Mol Biol Evol 32:2501–2514
Charlesworth D, Vekemans X (2005) How and when did Arabidopsis thaliana become highly self-fertilising. BioEssays 27:472–476
Hoffmann MH (2005) Evolution of the realized climatic niche in the genus Arabidopsis (Brassicaceae). Evolution 59:1425–1436
Brennan AC, Méndez-Vigo B, Haddioui A et al (2014) The genetic structure of Arabidopsis thaliana in the south-western Mediterranean range reveals a shared history between North Africa and southern Europe. BMC Plant Biol 14:17
Durvasula A, Fulgione A, Gutaker RM et al (2017) African genomes illuminate the early history and transition to selfing in Arabidopsis thaliana. PNAS 114(20):5213–5218
Tang C, Toomajian C, Sherman-Broyles S et al (2007) The evolution of selfing in Arabidopsis thaliana. Science 317:1070–1072
Bechsgaard JS, Castric V, Charlesworth D et al (2006) The transition to self-compatibility in Arabidopsis thaliana and evolution within S-haplotypes over 10 Myr. Mol Biol Evol 23:1741–1750
Schiffels S, Durbin R (2014) Inferring human population size and separation history from multiple genome sequences. Nat Genet 46:919–925
Gutenkunst RN, Hernandez RD, Williamson SH et al (2009) Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data. PLoS Genet 5:e1000695
Boggs NA, Nasrallah JB, Nasrallah ME (2009) Independent S-locus mutations caused self-fertility in Arabidopsis thaliana. PLoS Genet 5(3):e1000426
Shimizu KK, Shimizu-Inatsugi R, Tsuchimatsu T et al (2008) Independent origins of self-compatibility in Arabidopsis thaliana. Mol Ecol 17(2):704–714
Beck JB, Schmuths H, Schaal BA (2008) Native range genetic variation in Arabidopsis thaliana is strongly geographically structured and reflects Pleistocene glacial dynamics. Mol Ecol 17:902–915
Picó FX, Méndez-Vigo B, Martinez-Zapater J et al (2008) Natural genetic variation of Arabidopsis thaliana is geographically structured in the Iberian peninsula. Genetics 180:1009–1021
Sharbel TF, Haubold B, Mitchell-Olds T (2000) Genetic isolation by distance in Arabidopsis thaliana: biogeography and postglacial colonization of Europe. Mol Ecol 9:2109–2118
François O, Blum MGB, Jakobsson M et al (2008) Demographic history of European populations of Arabidopsis thaliana. PLoS Genet 4:e1000075
Long Q, Rabanal FA, Meng D et al (2013) Massive genomic variation and strong selection in Arabidopsis thaliana lines from Sweden. Nat Genet 45:884–890
Huber CD, Nordborg M, Hermisson J et al (2014) Keeping it local: evidence for positive selection in Swedish Arabidopsis thaliana. Mol Biol Evol 31:3026–3039
Lee C, Svardal H, Farlow A et al (2017) On the post-glacial spread of human commensal Arabidopsis thaliana. Nat Commun 8:14458
Jakobsson M, Hagenblad J, Tavare S et al (2006) A unique recent origin of the allotetraploid species Arabidopsis suecica: evidence from nuclear DNA markers. Mol Biol Evol 23:1217–1231
Säll T, Jakobsson M, Lind-Halldén C et al (2003) Chloroplast DNA indicates a single origin of the allotetraploid Arabidopsis suecica. J Evol Biol 16:1019–1029
Jakobsson M, Säll T, Lind-Halldén C et al (2007) The evolutionary history of the common chloroplast genome of Arabidopsis thaliana and A. suecica. J Evol Biol 20:104–121
Novikova PY, Tsuchimatsu T, Simon S et al (2017) Genome sequencing reveals the origin of the allotetraploid Arabidopsis suecica. Mol Biol Evol 34:957–968
Ossowski S, Schneeberger K, Lucas-Lledó JI et al (2010) The rate and molecular spectrum of spontaneous mutations in Arabidopsis thaliana. Science 327:92–94
Koch MA, Haubold B, Mitchell-Olds T (2000) Comparative evolutionary analysis of chalcone synthase and alcohol dehydrogenase loci in Arabidopsis, Arabis, and related genera (Brassicaceae). Mol Biol Evol 17:1483–1498
Barrett SC, Arunkumar R, Wright SI (2014) The demography and population genomics of evolutionary transitions to self-fertilization in plants. Philos Trans R Soc Lond B Biol Sci 369(1648):20130344
Schrider DR, Shanku AG, Kern AD (2016) Effects of linked selective sweeps on demographic inference and model selection. Genetics 204:1207–1223
Messer PW, Petrov DA (2013) Frequent adaptation and the McDonald-Kreitman test. PNAS 110:8615–8620
Ewing GB, Jensen JD (2016) The consequences of not accounting for background selection in demographic inference. Mol Ecol 25(1):135–141
Messer PW (2013) SLiM: Simulating evolution with selection and linkage. Genetics 194(4):1037–1039
Haller BC, Messer PW (2016) SLiM 2: flexible, interactive forward genetic simulations. Mol Biol Evol 34:230–240
Laenen B, Tedder A, Nowak MD et al (2018) Demography and mating system shape the genome-wide impact of purifying selection in Arabis alpina. Proc Natl Acad Sci U S A 115(4):816–821
Goodwillie C, Kalisz S, Eckert CG (2005) The evolutionary enigma of mixed mating systems in plants: occurrence, theoretical explanations, and empirical evidence. Annu Rev Ecol Evol Syst 36:47–79
Kamran-Disfani A, Agrawal AF (2014) Selfing, adaptation and background selection in finite populations. J Evol Biol 27:1360–1371
Salcedo A, Kalisz S, Wright SI (2014) Limited genomic consequences of mixed mating in the recently derived sister species pair, Collinsia concolor and Collinsia parryi. J Evol Biol 27:1400–1412
Bustamante CD, Nielsen R, Sawyer SA et al (2002) The cost of inbreeding in Arabidopsis. Nature 416:531–534
Foxe JP, Dar V, Zheng H et al (2008) Selection on amino acid substitutions in Arabidopsis. Mol Biol Evol 25:1375–1383
Garud NR, Messer PW, Buzbas EO et al (2015) Recent selective sweeps in North American Drosophila melanogaster show signatures of soft sweeps. PLoS Genet 11:1–32
Schrider DR, Kern AD (2016) S/HIC: robust identification of soft and hard sweeps using machine learning. PLoS Genet 12:e1005928
Huber CD, Durvasula A, Hancock AM et al (2017) Gene expression drives the evolution of dominance. Nature Communications 9:2750
Guo X, Liu J, Hao G et al (2017) Plastome phylogeny and early diversification of Brassicaceae. BMC Genomics 18:176
Beilstein MA, Nagalingum NS, Clements MD et al (2010) Dated molecular phylogenies indicate a Miocene origin for Arabidopsis thaliana. PNAS 107:18724–18728
Mattila TM (2017) Post-glacial colonization, demographic history, and selection in Arabidopsis lyrata: genome-wide and candidate gene based approach. University of Oulu. PhD thesis
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
Copyright information
© 2020 The Author(s)
About this protocol
Cite this protocol
Mattila, T.M., Laenen, B., Slotte, T. (2020). Population Genomics of Transitions to Selfing in Brassicaceae Model Systems. In: Dutheil, J.Y. (eds) Statistical Population Genomics. Methods in Molecular Biology, vol 2090. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-0199-0_11
Download citation
DOI: https://doi.org/10.1007/978-1-0716-0199-0_11
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-0198-3
Online ISBN: 978-1-0716-0199-0
eBook Packages: Springer Protocols