Abstract
Evolution of mating systems has become one of the most important research areas in evolutionary biology. Cyrtomium falcatum is a homosporous fern species native to eastern Asia. Two subspecies belonging to a sexual diploid race of C. falcatum are recognized: subsp. littorale and subsp. australe. Subspecies littorale shows intermediate selfing rates, while subsp. australe is an obligate outcrosser. We aimed to evaluate the process of mating system evolution and divergence for the two subspecies using restriction site associated DNA sequencing (RAD-seq). The results showed that subsp. littorale had lower genetic diversity and stronger genetic drift than subsp. australe. Fluctuations in the effective population size over time were evaluated by extended Bayesian skyline plot and Stairway plot analyses, both of which revealed a severe population bottleneck about 20,000 years ago in subsp. littorale. This bottleneck and the subsequent range expansion after the LGM appear to have played an important role in the divergence of the two subspecies and the evolution of selfing in subsp. littorale. These results shed new light on the relationship between mating system evolution and past demographic change in fern species.
Similar content being viewed by others
Introduction
The evolution of mating systems has long been one of the most important areas of study in evolutionary biology. Transition from outcrossing to selfing is a common evolutionary pathway in angiosperms1,2, and selfing is considered to be advantageous for gene transmission and reproductive assurance in the short term. If repeated selfing is effective at purging genetic load, predominant selfing may constitute a stable state when inbreeding depression levels are low3,4. Theoretical studies predict that a population bottleneck favors selfing because it can reduce inbreeding depression3. Empirical studies also suggest a close relationship between bottlenecks and selfing. For example, in Capsella (Brassicaceae), the self-compatible C. rubella separated from the self-incompatible C. grandiflora in the last glacial period, and this speciation was associated with a major population bottleneck in C. rubella5. The selfing species C. rubella then expanded its distribution into new northern habitats that may have emerged after the Last Glacial Maximum (LGM)5,6. Since selfing would confer an advantage when filling such new habitats, the environmental shifts that occurred during the last glacial period and as it ended are considered to have favored the evolution of selfing. However, the general patterns of mating system evolution and their relationship to the demographic history of species have not yet been closely examined.
To deepen our understanding of the relationship between selfing evolution and demographic history (e.g. the occurrence of population bottlenecks), in this study we focused on a homosporous fern species, Cyrtomium falcatum (L.f.) C.Presl. Since self-incompatibility systems such as those based on S-loci are not observed in ferns or other seedless land plants, their mating system would be expected to be determined simply by the balance between inbreeding depression and the advantages of selfing, both of which could be affected by demographic history. The life cycle of homosporous ferns differs from that of seed plants as they have free-living gametophytes that potentially bear both male and female gametangia. The gametophytes are capable of three types of mating: gametophytic selfing (selfing within a gametophyte), sporophytic selfing (crossing between two gametophytes of the same sporophyte) and sporophytic outcrossing (crossing between two gametophytes from separate sporophytes)7. Among land plants, gametophytic selfing is unique to homosporous ferns and lycophytes and monoecious bryophytes, and it is considered to be an extreme form of inbreeding because it results in a zygote that is homozygous at every gene locus. Sporophytic selfing is equivalent to selfing in seed plants. These two types of selfing are responsible for inbreeding in homosporous ferns. Elucidating the evolutionary patterns and process of selfing in homosporous ferns and comparing them to those in seed plants is crucial for understanding the fundamental processes of mating system evolution in vascular plants.
Cyrtomium falcatum is a homosporous fern in the family Dryopteridaceae, and is native to Japan, Korea, South China, Taiwan, and Indochina. Two cytotypes have been identified: sexual diploid and apogamous triploid8. Matsumoto8 recognized two forms of the sexual diploid cytotype from the apogamous triploid, C. falcatum subsp. falcatum. These diploid cytotypes were recently formally described as two subspecies: C. falcatum subsp. littorale S.Matsumoto ex S.Matsumoto et Ebihara and C. falcatum subsp. australe S.Matsumoto ex S.Matsumoto et Ebihara9. Cyrtomium falcatum subsp littorale is distinguished by its smaller plant size, fewer numbers of pinnae, and the indusium does not have a black spot, cf. C. falcatum subsp. australe. Subspecies littorale is distributed in the north-eastern part of Japan and laboratory experiments have demonstrated variation in sexual expression (gametangium formation) in gametophytes among its populations8. Sporophytes referred to as the separate type (S-type) produce gametophytes bearing antheridia and archegonia that are separated both spatially and temporally. Conversely, sporophytes of the mixed type (M-type) produce gametophytes that bear antheridia and archegonia simultaneously in a mixed manner8. Variation in the sexual expression of gametophytes can profoundly affect the rate of gametophyte selfing in natural populations10. Our recent study using microsatellites showed that levels of inbreeding (FIS) and gene diversity (h) differed significantly between S-type (FIS = 0.208, h = 0.367) and M-type (FIS = 0.626, h = 0.152) populations, suggesting higher selfing rates and lower genetic diversities in M-type populations11. Subspecies australe is distributed in the southern part of Japan and exhibited little selfing ability when its gametophytes were cultured in isolation8. Only S-type sexual expression has been observed to date in subsp. australe8. Subspecies littorale may be a lineage derived from subsp. australe because the former subspecies has specialized ecological and morphological characters, including saxicolous habitat, early maturation, progenesis, white indusium without a central black spot, and considerable levels of selfing. The variation in the mating systems of the derived subspecies and the predominance of outcrossing in its ancestral subspecies constitute an evolutionary pattern that makes C. falcatum a useful model for studying mating system evolution in homosporous ferns.
In this work, restriction-site associated DNA Sequencing (RAD-seq)12 was performed to acquire a large amount of genetic data for high resolution population genetic analysis in order to evaluate phylogenetic relationships among S-type and M-type populations of C. falcatum subsp. littorale and S-type populations of subsp. australe. RAD-seq data also enabled us to estimate the demographic history of subsp. littorale and subsp. australe. In non-model organisms that are lacking extensive molecular marker development, RAD-seq is a valuable tool for molecular population genetics. We addressed two questions by means of these analyses: (1) Are S-type populations in subsp. littorale more closely related to subsp. australe than M-type ones? (2) Are there any associations between demographic events, lineage divergence and the evolution of selfing? By answering these questions, we aimed to clarify the process of mating system evolution in the two subspecies of diploid sexual C. falcatum.
Materials and methods
Preparation of samples
Eighty-four samples of Cyrtomium falcatum were collected from 21 populations in Japan (Table 1, Fig. 1). Populations 1 to 11 (49 individuals) were located in the distribution range of C. falcatum subsp. littorale, while the remaining ten populations (34 individuals) were in that of C. falcatum subsp. australe8,9. Of the subsp. littorale populations, Pop6 and Pop8 are S-type and the rest are M-type8,11. Populations 2, 3, 6, 8 and 11 are the same as those examined in previous microsatellite analyses11; they were referred to as ESAN1, ESAN2, IZU2, SADO and SAND, respectively, in the previous study11. Most of the samples of these populations were collected during the previous work11, but new samples of Pop6 were collected during this study. The remaining samples were collected from natural populations or obtained from plants cultivated in Tsukuba Botanical Gardens, which were originally collected as described by Matsumoto (2003)8. In addition to the sporophyte samples, twelve haploid gametophyte samples were used. The gametophytes were grown from spores of a hybrid between subsp. littorale and australe (A1-55 × A2-2), which was made through artificial crossing by Matsumoto8. The sequence data from these haploid samples were used to check and remove contigs containing paralogous sequences. All of the 95 samples referred to above were subjected to RAD-seq. Details of the sources of the samples are shown in Table S1.
RAD-seq
Total DNA was extracted from silica-dried sporophyte leaves or from fresh gametophytes using the HEPES/CTAB method13. To detect genetic structure and population demographic events, we extracted genome wide SNP from samples using Double-digest RAD-seq (ddRAD-seq)14. The DNA library was prepared with two restriction enzymes, EcoRI and BglII as described by Sakaguchi et al.15. Fifty-one bp single-end sequencing was conducted in one lane of the Illumina HiSeq2500 (Illumina, San Diego, CA, USA) by Macrogen (Seoul, South Korea). Finally, 9.2 gigabases with 181 million reads were obtained.
SNP discovery and filtering of the data
The raw fastq files were ascribed to individuals based on barcode sequences. These individual fastq files were mapped onto the chloroplast genome sequence of C. falcatum (NC_028705, GenBank)16 to exclude chloroplast sequences from the data. For this purpose, we used the BWA17 ‘aln’ and ‘samse’ functions with default settings. We extracted chloroplast SNPs using the ‘mpileup’ function of samtools 1.3.118. After mapping, we extracted unmapped single end reads with samtools 1.3.1 ‘view’ function. These unmapped sequences were considered to contain no chloroplast sequences. After excluding chloroplast reads, we converted bam files to fastq files using bamtofastq in Hydra version 0.5.319. Next, we created contigs using pyRAD20. Low quality reads (quality value Q < 33) were discarded. At the within-sample clustering stage, the minimum coverage (number of reads per sample), maximum number of heterozygous sites, and allowed number of alleles were set to 5, 5, and 2 respectively. The maximum proportion of shared heterozygote sites allowed was less than 40%. Variant Call Format (VCF) files were used for the following analysis, which included only SNPs generated by pyRAD. As the first data filtering step, we removed loci with > 20% missing data using vcftools21 because missing data can bias data analysis. We also removed individuals with > 30% missing data to avoid the adverse effects of low-quality DNA data. Next, we removed loci with minor allele frequencies (< 3%) from the non-chloroplast vcf file. Roesti et al. 22 suggested that minor alleles can be uninformative and lead to bias in genome scanning (e.g. detection of outlier loci), which is important when evaluating both selection and neutral genetic structure23. A minor allele frequency threshold of 3% has been widely used in recent NGS-based studies24,25,26. We also filtered out chloroplast loci with > 20% missing data in the same way. To estimate genetic structure based on chloroplast data, we used Splits tree427 to construct a neighbor net with the chloroplast data.
Detection of outlier loci
To evaluate genetic structure and demography based on neutral genetic information and to avoid bias due to outlier loci21, we conducted outlier loci detection. Although population-based outlier loci detection methods have been used extensively, a recent study22 showed that historical or on-going demographic events (e.g. gene flow, admixture, population bottlenecks, and expansions) can lead to bias in these population-based approaches. Moreover, admixture and a historical bottleneck were observed in C. falcatum populations in our previous study11. We therefore used an individual-based principal component analysis (PCA) approach to detect outlier loci. This approach was implemented using the R package pcadapt28 which robustly detects outlier loci under several population demographic patterns. A principal component number of K = 7 was initially employed based on the trend of the scree plot. However, the number of outlier loci was too large under these conditions. Because the slope of the scree plot began leveling out at K = 3, we instead set K to 3. We also excluded non-neutral loci estimated by pcadapt using vcftools. The remaining loci, which were considered to be neutral, were used for population genetic analysis.
Genetic diversity within populations and population differentiation
The average gene diversity and FST values at each SNP locus were calculated using hierfstat29. To estimate the effect of differences in mating system, we also used hierfstat to compute the mean inbreeding coefficient, FIS, for populations of more than 4 individuals. According to Hedrick30, the FIS value reflects the relative contribution of selfing sensu-lato (both gametophytic and sporophytic selfing) to outcrossing in each population. To validate the geographical pattern of inbreeding coefficients, we also tested the correlation between FIS and latitude for subsp. littorale and australe populations. We estimated effective population size by linkage disequilibrium using NeEstimator 2.0131.
Individual-based genetic structure and admixture
To evaluate genetic structure among individuals, we performed principal coordinate analysis (PCoA) using GenAlEx 6.532. To carry out the PCoA, pairwise genetic distances33 between individuals were calculated with the “interpolating missing” option to correct for any bias due to missing data. We also conducted a model-based cluster analysis using STRUCTURE34. Numbers of clusters (K) between 1 and 25 were evaluated under the correlated allele frequencies model35 by running 10,000 burn-in Markov Chain Monte Carlo (MCMC) repetitions and 10,000 subsequent repetitions. We used the CLUMPAK server36 to evaluate multimodality37 among runs for each K and to generate barplots for the different values of K. The probability of the data (Ln P(D)) and ΔK38 were summarized using STRUCTUREHARVESTER39. We estimated a pattern of population bifurcation and events of mixing between populations by using TreeMix 1.1240. TreeMix infers the points at which admixtures happened in a population tree under the assumed number of admixture events. We assumed 1–5 admixture events, and the best tree was defined by likelihood41.
Demographic inference
Population demography in terms of temporal changes in population size was evaluated by generating an Extended Bayesian Skyline Plot (EBSP)42 and a Stairway plot43. We first generated an EBSP using BEAST 2.4.444 for the two subspecies, excluding the OGAS populations. We used a modified version of the method introduced by Trucchi et al.45, who successfully detected temporal changes in the size of king penguin populations and showed that highly polymorphic loci identified by RAD-seq can reflect general trends that also apply to other kinds of loci. To obtain all sequences with more than four polymorphic sites, following Trucchi et al.45, we sampled one sequence per individual randomly. We used a substitution rate of 0.75 × 10–8 per site per year based on estimates for Arabidopsis thaliana46. The substitution rate parameter for sequences with four polymorphic sites was set to 1.0, and was increased in proportion to the number of polymorphic sites (i.e. it was set to 1.25 for sequences with five polymorphic sites and 1.5 for those with six). We ran 250,000,000 MCMC chains and collected a trace and EBSP log after every 1000 chains. Tracer 1.647 was used to verify that the Estimated Sample Size was greater than 100 for all parameters. We calculated Tajima’s D48 for these loci using DnaSP 649 to evaluate the general trend in the demographic pattern.
Stairway plot v243 is a newly developed method for inferring temporal changes in population size by using site frequency spectrum (SFS) data to test demography. To estimate SFS, we used a sequence from the individual in Pop 17 as the ancestral sequence because the TreeMix results indicated that this population was at the base of the population tree. We mapped sequence data for all individuals onto this sequence using BWA17 and used samtools 1.818 to convert SAM files to BAM files and sort the BAM files. We estimated the site frequency spectrum with ANGSD50, using the following options: dosaf 1 (multi sample genotype likelihood estimation with assumed Hardy–Weinberg equilibrium), GL: 1 (genotype likelihood estimation of the samtools model), and RealSFS using the maximum number of iterations (100). We used the same mutation rate as for EBSP and assumed a generation time of three years in the demography estimation for the Stairway plot.
Results
Data filtering
The non-chloroplast dataset contained 9350 loci before filtering. After excluding low frequency minor alleles and missing data, 2636 loci remained. Pcadapt detected 355 loci estimated to be non-neutral among these loci. Thus, 2281 loci were used for population genetic analyses. For chloroplast DNA (cpDNA), 191 SNPs were extracted and 180 remained after excluding missing data. Although 10 haplotypes were detected, most of the individuals were fixed for haplotype 1 (Table S1) and the neighbor net (Fig. S1) showed no clear pattern. Thus, no clear genetic structure was detected at the cpDNA level.
Genetic diversity, structure and admixture
Gene diversity was low in subsp. littorale. Subspecies australe had higher gene diversity than all the subsp. littorale populations except for Pop8 (SADO). SADO had the highest gene diversity of the subsp. littorale populations, but subsequent analyses suggested that this population belongs to subsp. australe (see the STRUCTURE and TreeMix analysis below). Taking this result into account, all subsp. australe populations had higher gene diversity than subsp. littorale (Table 1). The FST value over all populations was 0.424. The FST value between the two subspecies was 0.357 when the SADO population was treated as subsp. australe. The inbreeding coefficient FIS was higher in subsp. littorale populations than subsp. australe populations, some of which showed negative FIS values.
PCA showed that subsp. littorale individuals generally clustered together in a narrow range, whereas SADO individuals were clustered with subsp. australe or located between subsp. littorale and australe (Fig. 2). Two individuals in the SAND population were also within the cluster of subsp. australe; one of these individuals had been detected as an admixed individual in our previous study11. Subspecies australe showed greater variance and several populations were divided into multiple clusters.
In the STRUCTURE analysis of all populations, the probability of the data (LnP(D)) increased progressively with each K until it reached a plateau at K = 6 (Fig. S2). The highest ΔK was detected when K = 2 (Fig S3). At K = 2, the clustering mostly supported taxonomical classification of subsp. littorale and australe; however, all of the SADO and two of the SAND individuals were included in the australe cluster or showed an admixture-like pattern. The OGAS and KASA populations also showed population admixture at K = 2, K = 3, and K = 4. However, at K = 5 the OGAS1—3 populations formed a new cluster. At K = 3, several subsp. australe populations appeared as admixtures of multiple clusters. At K = 4, NOMO and TSUN appeared as pure clusters; this outcome was observed up to K = 25.
The TreeMix analysis showed that the likelihood increased with the assumed number of migration events. We identified subsp. australe as the ancestral form on the basis of gene diversity, so we presumed that subsp. australe would have a lower drift parameter. This was the case when the migration number m was set to 3. All subsp. littorale populations other than the SADO population constituted one group that departed from subsp. australe (Fig. 3). TreeMix showed a pattern of divergence from south to north in subsp. littorale; three populations on the Kii Peninsula (SAND, SHIO, and KANT) near the western limit of the subspecies’ distribution (Fig. 1) were located at the base of the clade of subsp. littorale. S-type (low selfing rate) populations in subsp. littorale were placed in two different positions of the tree: SADO was in subsp. australe, and IZU was in a branch of subsp. littorale. The populations from the Ogasawara Islands were divided into two groups: OGAS1 had a low drift parameter, while OGAS2 and 3 formed a group sister to subsp. littorale (Fig. 3).
Demographic inference and temporal changes in effective population size
An EBSP analysis was performed using 82 loci for littorale and 77 loci for australe. This indicated that the population size of subspecies littorale fell between 100,000 and 22,000 years before the present (BP) and then increased from 22,000 years BP onwards (Fig. 4b). The effective population size at 22,000 years BP was estimated to be around 250,000 individuals based on the assumption of three years per generation. Although the confidence interval was wide, the general pattern still applied even when the width of this interval was taken into account. Conversely, subsp. australe showed an almost constant population size (Fig. 4a). The mean values of Tajima’s D of the loci examined for subsp. littorale and australe were 0.151 (SD: 1.109) and -0.025 (SD: 1.080), respectively.
The Stairway plot also revealed a population bottleneck in subsp. littorale; the effective population size fell from about 7000 years BP and reached its minimum at 40,000 years BP (Fig. 5b). The population size began to increase after the bottleneck but decreased again 600 years BP; the effective current population size was found to be about 40,000 individuals. The australe subspecies exhibited no population bottleneck (Fig. 5a); its estimated current effective population size was about 20,000 individuals. The effective population size of subsp. littorale before the population bottleneck was about 80,000; that of subsp. australe was 60,000 during the same period.
Discussion
Genetic structure of subspecies littorale and australe
PCA (Fig. 2) and STRUCTURE (Fig. 6) analyses showed that the SADO population and two individuals of the SAND population belong to subsp. australe despite its morphological characters with respect to indusium color and plant size. TreeMIX analysis (Fig. 3) also supported this conclusion. Unlike other populations of subsp. littorale, the SADO population had the same level of gene diversity as subsp. australe and showed no sign of selfing11 (Table 1). We therefore treated the SADO population as subsp. australe in this study and in the discussion below. Based on this classification, subsp. littorale was clearly divided from subsp. australe by our analyses, and was characterized by lower genetic diversity and higher selfing rates than subsp. australe.
Selfing reduces effective population size and results in increasing population differentiation caused by genetic drift. However, PCA revealed a low level of variance among subsp. littorale populations, which would not result from genetic drift by selfing. The low genetic variation of subsp. littorale compared to subsp. australe suggests that the ancestor of subsp. littorale experienced strong genetic drift. Neighbor net analysis based on chloroplast DNA showed no clear clustering for littorale and australe populations. This was presumably due to the small number of SNP sites in the cpDNA and the difference in substitution rate between the chloroplast and the highly polymorphic nuclear loci.
Both subsp. australe and subsp. littorale individuals were found in the SAND population (Fig. 6), which is close to the geographical boundary of the two subspecies (Fig. 1). A previous morphological study indicated that this boundary is located on the western side of Shikoku9. In contrast to previously reported findings8,9, it may be that these two subspecies do not show strictly parapatric distribution and that their distribution ranges overlap in Shikoku and on the western side of the Kii peninsula.
Demographic history of subspecies littorale and australe populations
The two subspecies showed different demographic patterns and levels of genetic diversity. Subspecies littorale showed a trend indicative of population shrinkage or a bottleneck, which would result in low genetic diversity in modern populations of subsp. littorale. Site frequency spectrum simulations conducted during a recent study by Lapierre et al. 51 indicated that Stairway plots are biased by noise intrinsically present in the data when inferring true demographic history, and that caution is therefore needed when interpreting them. However, since the demographic trends indicated for subsp. littorale and australe by the Stairway plot were consistent with those determined by EBSP, we consider the general patterns of temporal change in population size presented here to be reliable. While the two methods suggested slightly different timings for the identified changes in population demography, both of them indicated that the timing of the bottleneck was related to the last glacial period. A study of Asplenium species in Europe demonstrated the effects of the last glacial period on genetic diversity and species range52. Subspecies littorale appears to have undergone population shrinkage during the last glacial period, whereas subsp. australe showed no evidence of any population shrinkage. Indeed, Tajima’s D value for subsp. australe was almost zero, suggesting that its effective population size was constant, neither shrinking nor expanding. The relative geographic positions of the two subspecies may explain their different demographic histories: the area providing suitable habitats for the northern subsp. littorale is likely to have been reduced significantly during the last glacial period, whereas that for the southern subsp. australe would have been largely unaffected. Alternatively, it may be that a more northerly population of this species was isolated in a refugium (for example, in the southern edge of the Kii Peninsula or Shikoku Island) and simultaneously experienced population shrinkage in the last glacial period, and that this population became a founder of subsp. littorale. In other words, the postulated population bottleneck event may be closely associated with the divergence between subsp. littorale and australe. Both the EBSP and Stairway plot analyses suggest that the effective population sizes for the two species were similar before the bottleneck event, which supports the second scenario.
The mixed mating system of subsp. littorale would have helped population size recovery after the population bottleneck. Both the EBSP and Stairway plot methods indicated a rapid increase in population size of subsp. littorale after the population bottleneck, which could be explained by range expansion to the north. Gametophytic selfing has been considered advantageous for reproductive assurance during rapid range expansion because it enables single-spore colonization of distant regions53. After the last glacial period, habitats suitable for subsp. littorale would have been empty in the northern area. M-type sexual expression and relatively high levels of selfing (Table 1) in subsp. littorale would be advantageous in filling empty habitats because they facilitate dispersal and population establishment.
Mating system evolution in subspecies littorale
In our previous study using microsatellite markers, we calculated inbreeding coefficients (F) for the subsp. littorale populations to estimate their levels of selfing and obtained F values ranging from 0.22 to 0.7911. The IZU population was known to have S-type sexual expression8 and had the lowest F value (0.22). The present study using genome-wide SNP data yielded lower F values, ranging from 0.053 to 0.427 for the subsp. littorale populations (Table 1). The F values obtained here may be biased due to the small sample sizes of 4–15 and/or possible contamination by non-neutral loci despite multiple filtering steps. The bias appears to be towards negative F values, as most of the subsp. australe populations that seem to be outcrossers showed negative F values. Whereas, the IZU population, which had an adequate sample size (15) for estimating allele frequencies, had nearly the same F value (0.233) as that in our previous study11. TreeMix analysis (Fig. 3) showed a divergence pattern of populations from south to north in the clade of subsp. littorale, and the IZU population located in the middle of the distribution range (Fig. 1) was placed in a derived position in the tree of subsp. littorale. This branching order of populations is consistent with the scenario of northward distribution expansion of subsp. littorale after the last glacial period. However, it is not concordant with the distribution of sexual expression (S-type vs. M-type). There are two possible explanations for the non-concordance between sexual expression and tree topology. First, after the evolution of M-type sexual expression favoring selfing in the common ancestor of subsp. littorale populations, S-type sexual expression may have evolved newly in the IZU population or been acquired through introgression from subsp. australe. Second, M-type sexual expression may have emerged later on each of the littorale branches via parallel evolution or gene flow from the population in which M-type evolved first. In this view, IZU is a unique population that has maintained ancestral S-type sexual expression. Although IZU has S-type sexual expression like subsp. australe, the population shows mixed-mating (FIS = 0.233), unlike the obligate outcrossing mating system shown by subsp. australe populations (Table 1). This makes the former explanation seem more likely. In both scenarios, mating system evolution must have occurred within a short period, which implies that the mating system of ferns is less stable than that of seed plants. Despite this, mating system transition is likely to be a rare event. The S-type sexual expression of subsp. littorale was reported in two distant locations: Sadogashima Island and Izu peninsula8. Because the present study excluded the SADO population on Sadogashima Island from subsp. littorale, only the populations around the IZU population are known to have the S-type sexual expression. Although IZU shows the lowest level of selfing among subsp. littorale populations, it retains a mixed mating system. Therefore, the putative transition from M-type to S-type in IZU is not an unequivocal example of recovery of outcrossing from selfing, which has not been observed in seed plants54. However, it can safely be described as an interesting example in which an evolutionary change that enhanced outcrossing emerged in mixed-mating plants.
The mating system of ferns seems to be determined by the balance between inbreeding depression and the advantages of selfing. Additionally, it appears that changes in the sexual expression pattern, timing, and the position of the reproductive organs on gametophytes can alter the ease of gametophytic selfing. As discussed previously11, the evolution of selfing via transmission advantage operates only under a low genetic load and predominant selfing is expected3. Therefore, the existence of mixed-mating even in the M-type populations suggests that reproductive assurance, rather than transmission advantage, is the main factor affecting the evolution of selfing in this species. If this is the case, populations of subsp. littorale are expected to harbor sufficient genetic load to avoid the positive feedback process of selfing evolution driven by transmission advantage3. If genetic diversity and inbreeding depression increased in selfing populations that have been stable for a long time as a result of migration, it would be possible for them to change their sexual expression toward enhancing outcrossing, especially in populations that were not completely selfing.
Conclusion
This study successfully showed that C. falcatum subsp. littorale underwent a population bottleneck during the last glacial period, which appears to have induced the divergence of C. falcatum subsp. littorale and australe. The evolution of selfing may have been favored by reproductive assurance during rapid range expansion after the last glacial period. This is the first study to address the relationship between mating system evolution and range expansion after a glacial period based on demographic inferences, and it sheds new light on mating system evolution and ecological dynamics in both ferns and other plant species.
Change history
21 September 2021
A Correction to this paper has been published: https://doi.org/10.1038/s41598-021-98822-y
References
Barrett, S. C. H. Evolution of plant sexual diversity. Nat. Rev. Genet. 3, 274–284 (2002).
Shimizu, K. K. & Tsuchimatsu, T. Evolution of selfing: Recurrent patterns in molecular adaptation. Annu. Rev. Ecol. Evol. Syst. 46 (2015).
Lande, R. & Schemske, D. W. The evolution of self-fertilization and inbreeding depression in plants. 1. Genetic models. Evolution (N. Y). 39, 24–40 (1985).
Schemske, D. W. & Lande, R. The evolution of self-fertilization and inbreeding depression in plants. II. Empirical observations. Evolution (N. Y). 39, 41–52 (1985).
Guo, Y.-L. et al. Recent speciation of Capsella rubella from Capsella grandiflora, associated with loss of self-incompatibility and an extreme bottleneck. Proc. Natl. Acad. Sci. USA. 106, 5246–5251 (2009).
Foxe, J. P. et al. Recent speciation associated with the evolution of selfing in Capsella. Proc. Natl. Acad. Sci. USA. 106, 5241–5245 (2009).
Haufler, C. H., Pryer, K. M., Schuettpelz, E., Sessa, E. B. & Farrar, D. R. Sex and the single gametophyte: Revising the homosporous vascular plant life cycle in light of contemporary research. Bioscience 66, 928–937 (2016).
Matsumoto, S. Species ecological study on reproductive systems and speciation of Cyrtomium falcatum complex (Dryopteridaceae) in Japanese archipelago. Ann. Tsukuba Bot. Gard. 22, 1–141 (2003).
Ebihara, A., Matsumoto, S., Mazumdar, J. & Yamamoto, K. Updates of taxonomic treatments for ferns of Japan 2. Athyrium Cyrtomium. 43, 19–25 (2017).
Klecowski, E. J. Reproductive biology of the Pteridophyta. II. Theoretical considerations. Bot. J. Linn. Soc. 62, 347–359 (1969).
Imai, R., Tsuda, Y., Matsumoto, S., Ebihara, A. & Watano, Y. The relationship between mating system and genetic diversity in diploid sexual populations of Cyrtomium falcatum in Japan. PLoS ONE 11, e0163683 (2016).
Miller, M., Dunham, J., Amores, A., Cresko, W. & Johnson, E. Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers. Genome Res. 17, 240–248 (2007).
Shepherd, L. D. & McLay, T. G. B. Two micro-scale protocols for the isolation of DNA from polysaccharide-rich plant tissue. J. Plant Res. 124, 311–314 (2011).
Peterson, B. K., Weber, J. N., Kay, E. H., Fisher, H. S. & Hoekstra, H. E. Double digest RADseq: An inexpensive method for de novo SNP discovery and genotyping in model and non-model species. PLoS One 7 (2012).
Sakaguchi, S. et al. High-throughput linkage mapping of Australian white cypress pine (Callitris glaucophylla) and map transferability to related species. Tree Genet. Genomes 11 (2015).
Raman, G., Choi, K. S. & Park, S. Phylogenetic relationships of the fern Cyrtomium falcatum (dryopteridaceae) from Dokdo Island, sea of East Japan, based on chloroplast genome sequencing. Genes (Basel). 7 (2016).
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Quinlan, A. R. et al. Genome-wide mapping and assembly of structural variant breakpoints in the mouse genome. Genome Res. 20, 623–635 (2010).
Eaton, D. A. R. PyRAD: Assembly of de novo RADseq loci for phylogenetic analyses. Bioinformatics 30, 1844–1849 (2014).
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Roesti, M., Salzburger, W. & Berner, D. Uninformative polymorphisms bias genome scans for signatures of selection. BMC Evol. Biol. 12, 94 (2012).
Luikart, G., England, P. R., Tallmon, D., Jordan, S. & Taberlet, P. The power and promise of population genomics: From genotyping to genome typing. Nat. Rev. Genet. 4, 981–994 (2003).
Vandepitte, K. et al. SNP discovery using Paired-End RAD-tag sequencing on pooled genomic DNA of Sisymbrium austriacum (Brassicaceae). Mol. Ecol. Resour. 13, 269–275 (2013).
Larson, W. A. et al. Genotyping by sequencing resolves shallow population structure to inform conservation of Chinook salmon (Oncorhynchus tshawytscha). Evol. Appl. 7, 355–369 (2014).
Dou, J. et al. Evaluation of the 2b-RAD method for genomic selection in scallop breeding. Sci. Rep. 6, 19244 (2016).
Huson, D. H. & Bryant, D. Application of phylogenetic networks in evolutionary studies. Mol. Biol. Evol. 23, 254–267 (2006).
Luu, K., Bazin, E. & Blum, M. G. B. pcadapt: An R Package to Perform Genome Scans for Selection Based on Principal Component Analysis. https://doi.org/10.1101/056135 (2016).
Goudet, J. HIERFSTAT, a package for R to compute and test hierarchical F-statistics. Mol. Ecol. Notes 2, 184–186 (2005).
Hedrick, P. W. Population genetics of intragametophytic selfing. Evolution (N. Y). 41, 137–144 (1987).
Do, C. et al. NeEstimator v2: Re-implementation of software for the estimation of contemporary effective population size (Ne) from genetic data. Mol. Ecol. Resour. 14, 209–214 (2014).
PE, P. R. S. GenAlEx 6.5: Genetic analysis in Excel. Population genetic software for teaching and research? An update. Bioinformatics 28, 2537–2539 (2012).
Smouse, P. E. & Peakall, R. Spatial autocorrelation analysis of individual multiallele and multilocus genetic structure. Heredity (Edinb). 82, 561–573 (1999).
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
Falush, D., Stephens, M. & Pritchard, J. K. Inference of population structure using multilocus genotype data: Linked loci and correlated allele frequencies. Genetics 164, 1567–1587 (2003).
Kopelman, N. M., Mayzel, J., Jakobsson, M., Rosenberg, N. A. & Mayrose, I. Clumpak: A program for identifying clustering modes and packaging population structure inferences across K. Mol. Ecol. Resour. https://doi.org/10.1111/1755-0998.12387 (2015).
Jakobsson, M. & Rosenberg, N. A. CLUMPP: A cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23, 1801–1806 (2007).
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: A simulation study. Mol. Ecol. 14, 2611–2620 (2005).
Earl, D. A. & von Holdt, B. M. STRUCTURE HARVESTER: A website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 4, 359–361 (2012).
Pickrell, J. K. & Pritchard, J. K. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet. 8 (2012).
Raghavan, M. et al. Upper Palaeolithic Siberian genome reveals dual ancestry of Native Americans. Nature https://doi.org/10.1038/nature12736 (2014).
Heled, J. & Drummond, A. J. Bayesian inference of population size history from multiple loci. BMC Evol. Biol. 8, 289 (2008).
Liu, X. & Fu, Y.-X. Exploring population size changes using SNP frequency spectra. Nat. Genet. 47, 555–559 (2015).
Bouckaert, R. et al. BEAST 2: A software platform for Bayesian evolutionary analysis. PLoS Comput. Biol. 10, 1–6 (2014).
Trucchi, E. et al. King penguin demography since the last glaciation inferred from genome-wide data. Proc. Biol. Sci. 281, 20140528 (2014).
Ossowski, S. et al. The rate and molecular spectrum of spontaneous mutations in Arabidopsis thaliana. Science 327, 92–94 (2010).
Rambat, A. & Drummond, A. J. Tracer v 1.5. (2009).
Tajima, F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123, 585–595 (1989).
Rozas, J. et al. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol. Biol. Evol. 34, 3299–3302 (2017).
Korneliussen, T. S., Albrechtsen, A. & Nielsen, R. ANGSD: Analysis of next generation sequencing data. BMC Bioinform. 15, 1–13 (2014).
Lapierre, M., Lambert, A. & Achaz, G. Accuracy of demographic inferences from the site frequency spectrum: The case of the Yoruba population. Genetics 206, 439–449 (2017).
Bystriakova, N. et al. Present, past and future of the European rock fern Asplenium fontanum: combining distribution modelling and population genetics to study the effect of climate change on geographic range and genetic diversity. Ann. Bot. 113, 453–465 (2014).
Baker, H. G. Self-compatibility and establishment after ‘long-distance’ dispersal. Evolution (NY). 9, 347–349 (1955).
Takebayashi, N. & Morrell, P. L. Is self-fertilization an evolutionary dead end? Revisiting an old hypothesis with genetic theories and a macroevolutionary approach. Am. J. Bot. 88, 1143–1150 (2001).
Acknowledgements
We thank Mr. Nobuo Imai for assistance with sample collection and Dr. Leanne K Faulks for English editing. This work was supported by the Research Support Program Fund 2014 from Chiba University and a Grant-in-Aid (No. 15K07180 to Y. Watano) from the Ministry of Education, Science and Culture of Japan, and the Environment Research and Technology Development Fund (4-1605 to Y. Tsuda) from the Japanese Ministry of the Environment.
Author information
Authors and Affiliations
Contributions
R.I., Y.T. and Y.W. wrote the main manuscript text. R.I., A.E. and S.M. cultured and collected sample. R.I., A.T. and A.N. conducted molecular experiment. R.I., Y.T. and R.O. conducted data analysis. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original online version of this Article was revised: In the original version of this Article, Yasuyuki Watano was omitted as a corresponding author. Correspondence and request for materials should also be addressed to watano@faculty.chiba-u.jp. In addition, the authors Ryosuke Imai and Yoshiaki Tsuda were omitted as equally contributing authors.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Imai, R., Tsuda, Y., Ebihara, A. et al. Mating system evolution and genetic structure of diploid sexual populations of Cyrtomium falcatum in Japan. Sci Rep 11, 3124 (2021). https://doi.org/10.1038/s41598-021-82731-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-82731-1
- Springer Nature Limited