Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Introduction

Climatic and geological changes in the Tertiary and Quaternary dramatically influenced the distribution of flora and fauna in North America. In the Tertiary, North America became separated from Eurasia, resulting in vicariant populations of many species of the ancient flora and fauna (Budantsev 1992; Graham 1999). During glacial maxima, plants, fungi and animals were forced to unglaciated refugia, from where they expanded to newly exposed areas in warmer interglacial periods. Reconstruction of these historical events is of paramount importance because they had major influences on past speciation events and are reflected in present day population structures.

Beringia, including Alaska and North-Eastern Siberia, has long been a focal point for biogeographical research in a wide range of plant and animal taxa. This high level of interest arises for two principal reasons. First, due to its diverse landscape and climate and the fact that much of the region remained ice-free during glacial maxima, Beringia served as a refugium for Arctic and subarctic flora and fauna (Adams and Faure 1997; Brubaker et al. 2005; Edwards et al. 2000; Hultén 1968). Second, during much of the Tertiary and the Quaternary periods, Beringia was the major land connection between Asia and North America and provided migration routes to a wide variety of organisms (for example, see Elias et al. 2000; Qian 1999; Swanson 2003). High genetic diversity and evidence for glacial refugia in Beringia have been reported in several, predominantly arctic-alpine animal and plant taxa, such as the Tundra Vole (Microtus oeconomus) (Brunhoff et al. 2003), Bighorn Sheep (Ovis canadensis) and Dall Sheep (O. dalli) (Loehr et al. 2005), Columbian Ground Squirrel (Spermophilus columbianus) (MacNeil and Strobeck 1987), Old World Swallowtail (Papilio machaon) (Sperling and Harrison 1994), Amara alpina, an arctic-alpine ground beetle (Reiss et al. 1999), Paranoplocephala arctica, a parasitic cestode of collared lemmings (Wickström et al. 2003), Entire-leaved Avens (Dryas integrifolia) (Tremblay and Schoen 1999), Purple Saxifrage (Saxifraga oppositifolia) (Abbott and Comes 2003), Arctic Bell-heather (Cassiope tetragona) (Eidesen et al. 2007) and the Bog Blueberry (Vaccinium uliginosum) (Alsos et al. 2005).

Whether fragments of boreal forest existed in Beringia during the Last Glacial Maximum (LGM) is a major, but, as yet, unanswered question in Quaternary science. Although earlier pollen-based biome reconstructions suggest that all of Beringia was covered by arctic tundra with no conifers (Edwards et al. 2000; Kaufman et al. 2004; Swanson 2003), phylogeographic analyses of DNA sequences and the most recent pollen data support the theory that Picea glauca and P. mariana existed in small refugia in Alaska (Anderson et al. 2006; Brubaker et al. 2005).

Ectomycorrhizal (ECM) fungi are obligate root-associated mutualistic symbionts of trees in boreal, temperate and tropical forests (Smith and Read 1997; Bas 1969; Härkönen et al. 1994; Simmons et al. 2002) and their past and present populations likely have been shaped by the distribution of their host trees. The likely importance of host trees in the distribution of ECM fungi has been repeatedly noted. However, despite their ecological importance, very little is known about ECM phylogeography. Furthermore, learning about the phylogeographic history of ECM taxa not only provides information related to the organism in question, but may also help to resolve whether or not their hosts survived the LGM in Beringian refugia. To test the hypothesis of persistent LGM forest refugia in Alaska, we analysed molecular data from Amanita muscaria, an ECM fungus native to temperate and boreal forest regions of the Northern Hemisphere.

Materials and Methods

Multi-locus DNA sequence data for A. muscaria specimens were generated in previous studies (Geml et al. 2006, 2008, and Oda et al. 2004). Geml et al. (2006) reported that there are at least three phylogenetic species clades within the A. muscaria species complex that occur in Alaska (referred to as Clades I, II, and III). To our knowledge, Clade I is restricted to North America, while Clades II and III occur in Eurasia and in Alaska. For our tests in this study, 114 ITS rDNA sequences were chosen, which represent the three phylogenetic species. Because the phylogenetic species mentioned are non-interbreeding entities, population-level analyses were conducted separately for each species clade. Multiple sequence alignments were made using Clustal W (Thompson et al. 1997) and subsequently were corrected manually. Identical sequences were collapsed into haplotypes using SNAP Map (Aylor and Carbone 2003) after recoding insertion or deletions (indels) and excluding infinite-sites violations. The analyses presented here assume an infinite sites model, under which a polymorphic site is caused by exactly one mutation and there can be no more than two bases segregating. Base substitutions were categorised as phylogenetically uninformative or informative, and as transitions or transversions. Site compatibility matrices were generated from each haplotype dataset using SNAP Clade and SNAP Matrix (Markwordt et al. 2003) to examine compatibility/incompatibility among all variable sites, with four resultant incompatible sites removed from the data set. This was important as subsequent coalescent analyses assume that all variable sites are fully compatible. Two migration models were used. First, MDIV (Nielsen and Wakeley 2001) was used to determine whether there was any evidence of migration between Alaskan and non-Alaskan populations in each species clade, i.e. to test whether Alaskan populations could have survived the LGM in local forest refugia. For this purpose, specimens were assigned either to the ‘Alaskan’ or the ‘non-Alaskan’ group based on their localities. In Clades I and II, the ‘non-Alaskan’ groups consisted of specimens collected in the contiguous states of the US or in Eurasia, respectively. These represented populations that survived the LGM in Southern refugia. MDIV implements both likelihood and Bayesian methods using Markov chain Monte Carlo (MCMC) coalescent simulations to estimate the migration rate (M), population mean mutation rate (Theta), divergence time (T) and the time since the most recent common ancestor (TMRCA). This approach assumes that all populations descended from one panmictic population that may or may not have been followed by migration. For each dataset, the data was simulated assuming an infinite sites model with uniform prior. We used 2,000,000 steps in the chain for estimating the posterior probability distribution and an initial 500,000 steps to ensure that enough genealogies were simulated before approximating the posterior distribution. Second, if MDIV showed evidence of migration, MIGRATE was used to estimate migration rates assuming equilibrium migration rates (symmetrical or asymmetrical) in the history of the populations (Beerli and Felsenstein 2001). We applied the following specifications for the MIGRATE maximum-likelihood analyses: M and Theta generated from the FST calculation, migration model with variable Theta and constant mutation rate. Subsequently, we reconstructed the genealogy with the highest root probability, the ages of mutations and the TMRCA of the sample using coalescent simulations in Genetree v. 9.0 (Griffiths and Tavaré 1994). Ages were measured in coalescent units of 2N, where N is the population size.

Results

Two of the three species clades (Clades I and II) possessed sufficient variation in the ITS to test whether the Alaskan populations were genetically differentiated from non-Alaskan populations. The third, Clade III, was only represented by ten individuals forming three haplotypes with minimal variation. In this species, all Alaskan specimens carried the most frequent haplotype that was shared with Asian specimens as well. The datasets consisted of 653 and 640 characters for Clade I and Clade II, respectively, including gaps.

In Clade I, the ‘North American’ clade, 15 haplotypes were found, of which only one, haplotype D, was found in Alaska (Tables 1 and 2). MDIV showed no evidence of gene flow (M∼0) and estimated a divergence time of T∼0.9 between the Alaskan and non-Alaskan populations (Fig. 1a). Therefore, in simulations using Genetree, we assumed subdivided population structure and a very low level of migration (close to 0) for the starting backward migration matrix. The coalescent-based genealogy was informative with respect to the resolution of the splitting event between the Alaskan and non-Alaskan populations and inference of the mutational history with respect to variation between and within populations (Fig. 2a).

Table 1 Identity of A. muscaria haplotypes in our sample. Haplotype letters correspond to those in Table 2 and Fig. 2
Table 2 Polymorphic sites in haplotypes collapsed after recoding indels and excluding infinite-sites violations from the original dataset for coalescent analyses. Position refers to that in the original alignment, site number is the designation of the given mutation as shown on Fig. 1, site type refers to transition (t), transversion (v), or deletion (-) change with regard to the consensus sequence. Character type indicates either parsimony-informative (i) or non-informative (-) sites. Capital letters refer to haplotype designations on Fig. 2
Fig. 1
figure 1_9figure 1_9

Posterior probability distributions of migration (M), divergence time (T) and population mean mutation rate (Theta) between population pairs using MDIV. Estimates (axis X) with the highest distribution function (axis Y) values have the greatest probabilities. A and C show examples of zero migration and non-zero divergence time, while graphs in B indicate recent gene flow and no divergence between population pairs

Fig. 2
figure 2_9figure 2_9

Coalescent-based genealogies with the highest root probabilities showing the distribution of mutations. The inferred genealogies are based on 2 million simulations of the coalescent. The time scale is in coalescent units of 2N, where N is the population size. Mutations and bifurcations are time ordered from the top (past) to the bottom (present). Mutation designations correspond to the site numbers in Table 2. The numbers below the tree designate the distinct haplotypes, their observed frequencies in total and in the different geographic regions

In Clade II, the ‘Eurasian’ clade, 14 haplotypes were found, six of which occur in Alaska (Tables 1-2). Of these six, five were found only in Alaska (haplotypes C, D, G, H, I), while one (haplotype A) represented the most abundant haplotype in the sample, spanning Eurasia and Alaska. It must be noted, that haplotype D (or subclade II/A in Geml et al. 2006), a lineage endemic to the maritime rainforests of the Pacific Northwest, was considered ‘Alaskan’ as opposed to ‘Eurasian’, despite not being strictly restricted by the political boundaries and occurring as far South as Northwestern Washington state. Apart from this, no other Clade II haplotypes have been found in North America, outside Alaska. Despite the very similar values for Theta (Fig. 1), the population structure in Clade II proved to be very different from that of Clade I discussed above. MDIV showed evidence for intermediate levels of gene flow (M∼2.5) between Alaskan and Eurasian populations and estimated no population divergence (T∼0) (Fig. 1b). Our analyses using MIGRATE suggested that migration between Alaska and Eurasia has been asymmetrically bidirectional with Alaska receiving more migrants (M 12=9,482.98 and M 21=33.53; M 12 referring to the migration from Eurasia to Alaska, and M 21 vice versa). Therefore, in simulations using Genetree, we assumed panmictic population structure. As expected, the coalescent-based genealogy did not show any historical population division (Fig. 2b). However, based on the fact that the specimens from the maritime rainforest ecoregion of the Pacific Northwest of North America formed a well-supported sister group (II/A) to the rest of Clade II in our previous multi-locus phylogenetic analyses (Geml et al. 2006, 2008), we tested in a separate analysis whether or not there was any gene flow between this group inhabiting a special habitat with a restricted distribution and the rest of Clade II, this latter including boreal Alaskan and Eurasian specimens. In this case, MDIV showed no migration (M∼0) and estimated a divergence time of T∼1.5 between the two populations, i.e. subclade II/A vs. Clade II (Fig. 1c). Nonetheless, the coalescent-based genealogy estimated under an assumption of population subdivision did not differ significantly from the one generated assuming panmixia (Fig. 2c).

Discussion

The main goal of this project was to estimate migration between Alaskan and non-Alaskan populations within distinct phylogenetic species in the A. muscaria species complex in order to test whether or not these fungi could have survived the LGM in Alaska. Because non-Alaskan populations most likely survived the LGM in refugia South of the major ice shields, the lack of migration between these and the Alaskan ones would suggest local survival of the latter, implying forest refugia in Alaska.

Overall, we found support for at least two endemic regional populations (haplotype D in Clade I, and haplotype D in Clade II) in different parts of Alaska, both of which exhibited genetic isolation and differentiation from other conspecific populations. Unfortunately, we could not estimate the times of divergence of the clades with certainty due to the lack of fossils and the great variance in nucleotide substitution rates in fungi. Nonetheless, it seems very likely that the major lineages (Clades I-III) separated well before the Pleistocene glacial cycles (Geml et al. 2006). On the other hand, the endemic regional populations likely represent groups that became isolated from the rest of their species in the Pleistocene and survived one or more glacial maxima in local refugia. Our findings support the existence of at least two independent such glacial forest refugia: (1) boreal forest in Interior Alaska; and (2) maritime rainforest in Southeast Alaska and the Pacific Northwest (Fig. 3). Apart from this, many Alaskan haplotypes in Clade II were shared with specimens from Eurasia. Our results suggesting high migration, most likely both pre- and postglacial, across the Bering Strait and the similar observed numbers of haplotypes endemic to either Eurasia or Alaska leave open the question whether boreal members of Clade II survived the LGM in Alaska or colonised Alaska from Asia in multiple times in postglacial events.

Fig. 3
figure 3_9figure 3_9

Outline map of Alaska showing the collecting locations for the 43 Alaskan specimens included in this study. Dark grey squares and circles indicate sampling localities for endemic haplotypes, while light grey circles refer to haplotypes shared between Alaska and Eurasia. Black stylised spruce drawings mark putative forest refugia during the Last Glacial Maximum (LGM) suggested by this and other studies mentioned in the discussion. Grey tree figures indicate present forested area, without making a distinction between boreal and maritime forests of Interior and Southeast Alaska, respectively

The possible existence of forest refugia in Interior and Southeast Alaska is also supported by several other independent lines of evidence. For example, although previous palynological biome reconstructions suggest that all of Beringia was covered by arctic tundra (Edwards et al. 2000; Kaufman et al. 2004; Swanson 2003), recent pollen data by Brubaker et al. (2005) supports the theory that Picea forests were present in Eastern Beringia, at least in small fragments restricted to habitats with favourable microclimates. The Northernmost distribution of Picea, unlike that of Betula and Populus, is restricted, by definition, to the boreal region. Therefore, Brubaker’s findings indicate the existence of the boreal forest biome in Alaska during the LGM. Similarly, Anderson et al. (2006), surveying forest stands across Northwestern North America, found several chloroplast DNA haplotypes of Picea glauca that were unique to Alaska, suggesting local survival. Maroja et al. (2007) analysed mitochondrial DNA sequence data from the spruce beetle (Dendroctonus rufipennis) and suggested that the postglacial range expansion of these beetles occurred from three refugia, one of which was in Interior Alaska.

Forest refugia in Southeast Alaska are also supported by genetic studies in several different taxa. For example, this is a region with high mammalian endemism due to its fragmented landscape and complex glacial history. Even animals, as mobile as wolves, have been shown to exhibit a strong signal of independent histories for the coastal and continental populations (Weckworth et al. 2005). Also, phylogenetic studies of Fleming and Cook (2002) on ermine (Mustela erminea) showed that there was at least one lineage strictly endemic to Southeast Alaska that likely survived locally during the Wisconsin glaciation. Locations for such glacial refugia in Southeast Alaska have been proposed by Carrarra et al. (2003) based on geological evidence.

In this paper, we demonstrated the use of ECM fungal taxa to indirectly infer glacial vegetation history. Therefore, our results are not only relevant to the mycological community, but to a broader biological community in general, particularly to those with a biogeographic interest.