Introduction

Nuclear genome sizes vary tremendously between eukaryotic species (Whitney and Garland (2010) report a range spanning 8 orders of magnitude) and this variation is due to a multitude of processes, including polyploidization, gene duplication and amplification of transposable elements (see, e.g., Soltis and Soltis 1999; Cui et al. 2006; Kazazian 2004). Although these mutational processes can induce structural and functional changes that are assumed to promote further evolution and adaptation in plants, animals, and fungi, it is currently still under debate whether the variation in genome size between species is predominantly the result of adaptive or of neutral processes (see, e.g., Whitney and Garland 2010).

Besides variation in genome size between species, variation within species is also commonly observed. This within-species genome plasticity has been well studied in several fungal species where it has been found to be associated with life cycle or cellular differentiation and with changing environmental conditions (Albertin and Marullo 2012). For example, Candida albicans is a diploid yeast that frequently enters a cryptic mating or parasexual cycle in which somatic cells fuse and the resulting tetraploid cells undergo a random loss of chromosomes. As a result, a variety of ploidy levels, including aneuploidy, can be observed among C. albicans isolates (Ibrahim et al. 2005). Candida glabrata on the other hand is a haploid fungal species that displays frequent changes in its chromosome complement in relation with pathogenicity and a changing environment (Poláková et al. 2009). Specific environmental factors that have been shown to affect fungal genome size include salt stress (Dhar et al. 2011), fungicide treatments (Welker and Williams 1980), heat shock treatments (Hilton et al. 1985), and host–pathogen interactions (Raffaele and Kamoun 2012).

We aimed to investigate genome size in the ectomycorrhizal ascomycete Cenococcum geophilum Fr. and assess whether it is affected by serpentine edaphic conditions. Mycorrhizal fungi are a diverse group of soil microorganisms that form multifunctional symbiotic associations with the roots of vascular plants, in which the fungi aid nutrient and water uptake by the plant, improve plant tolerance to drought and pathogens, and aid plant performance in environments high in heavy metal content (Smith and Read 2008). C. geophilum, which associates with the roots of both gymnosperms and angiosperms in temperate to tropical regions (Trappe 1964), has become the focus of increased research effort, partly due to its broad geographic and ecological distribution (LoBuglio 1999). Although multilocus genotype assessments suggest the occurrence of recombination and a possible cryptic sexual cycle (Taylor et al. 1999, Lobuglio and Taylor 2002, Douhan et al. 2007), a sexual stage has so far been assumed to be absent in C. geophilum (although see Fernandez-Toiran and Agueda 2007, and Spatafora et al. 2012). While this absence would imply clonal reproduction and a relatively low genetic diversity, various studies have demonstrated an unexpectedly high level of genetic diversity within and between C. geophilum populations (e.g., Panaccione et al. 2001; Wu et al. 2005; Douhan et al. 2007; Gonçalves et al. 2007).

C. geophilum has been shown to be locally adapted to serpentine soil conditions (Gonçalves et al. 2007, 2009). Serpentine soils are widespread soils formed by the weathering of ultramafic rocks and typically have a low calcium-to-magnesium ratio, with calcium levels significantly lower compared to surrounding non-serpentine soils. In addition, they often contain relatively high concentrations of heavy metals such as chromium, cobalt, and nickel and low concentrations of essential nutrients such as nitrogen, potassium, and phosphorus, and are prone to drought. These edaphic conditions usually present a major challenge to exposed biota and may give rise to a unique flora and mycoflora (Proctor 1999; Panaccione et al. 2001), rendering serpentine soils into appealing systems for addressing mechanistic questions of adaptive evolution in natural environments (Brady et al. 2005). Although many studies have been performed on serpentine flora and systematics, natural history, ecology, and physiology, possible relationships between adaptation to serpentine soil conditions and genome size variation have not received much attention. Nevertheless, independent polyploid evolution in serpentine populations of Knautia arvensis (Dipsacaceae) that promoted further evolution of serpentine lineages has recently been reported (Kolář et al. 2012).

Here, we report on the genome size of C. geophilum and its naturally occurring variation as determined by flow cytometry analysis of 40 isolates obtained from two serpentine and two non-serpentine populations in the northeast of Portugal. We first present data describing variation in genome size occurring in this fungal species and discuss the implications of our findings in the context of adaptive evolution of serpentine tolerance. In addition, we show that our data provides support for a possible cryptic mating or parasexual cycle in C. geophilum.

Material and methods

Field sites and fungal material

Soil samples containing C. geophilum sclerotia were collected in two serpentine (S) and two non-serpentine (NS) Quercus ilex subsp. ballota woodlands, located in Trás-os-Montes, northeastern Portugal: Serra da Nogueira (serpentine, 41°47′58.98″ N, 06°54′15.55″ W), Espinhosela (serpentine, 41°51′22.13″ N, 06°50′424.00″ W), Rabal (non-serpentine, 41°52′14.88″ N, 06°44′40.00″ W) and Aveleda (non-serpentine, 41°52′08.41″ N, 06°41′58.97″ W).

To avoid multiple sampling of the same fungal genotype, soil samples were collected around oaks at least 5 m apart. After sieving and washing the soil samples, viable sclerotia (Trappe 1964) were manually picked, surface sterilized in 30 % hydrogen peroxide for 10–20 s (Douhan and Rizzo 2005), rinsed in sterile water and transferred to Potato Dextrose Agar (PDA; Difco, Sparks, MD, USA) with streptomycin (0.2 mg/L) to germinate. The plates were incubated in the dark at room temperature and regularly observed for up to 1 month. Growing C. geophilum isolates were then subcultured on PDA without antibiotics and maintained in the dark at room temperature until further use.

Genome size estimation

The nuclear DNA content of C. geophilum was estimated by flow cytometry using fresh mycelium from young colonies. Propidium iodide (PI)-stained nuclei were extracted from ten in vitro-cultured isolates per population and the reference isolate Cg 1.58, which was isolated from sclerotia found in a Scots pine forest in Pfynwald VS by Martina Peter in 2008 and which was recently used in a study by Spatafora et al. (2012) on the phylogenetic placement of the genus Cenococcum within the family Gloniaceae. For each isolate, three independent replicate measurements were performed. Mycelium was collected with a needle and added to a Petri dish with 1 mL of modified woody plant buffer (WPB; 0.2 M Tris–HCl, 4 mM MgCl2·6H2O, 2 % Triton X-100, 2 mM disodium EDTA, 86 mM NaCl, 20 mM metabisulfite, 4 % PVP-10, [pH 7.5]; Loureiro et al. 2007). Nuclei were released following the procedure of Galbraith et al. (1983) by chopping approximately 10 mg of mycelium with a razor blade together with 50 mg of fresh leaf tissue of Raphanus sativus (internal reference standard with 2C = 1.16 pg or 1,135 Mbp; Doležel et al. 1998) to ensure that nuclei of both species were exposed to identical chemical and mechanical conditions.

The nuclear suspension was filtered through a 30 μm nylon filter to remove large fragments, and 50 μg/mL of PI (Fluka, Buchs, Switzerland) together with 50 μg/mL of RNase (Fluka, Buchs, Switzerland) were subsequently added to respectively stain the nuclei and prevent staining of double-stranded RNA. After incubation for 5 min, the fluorescence intensity of at least 2,000 nuclei per sample was analyzed using a Partec CyFlow Space flow cytometer (Partec GmbH, Görlitz, Germany), equipped with a green solid state laser for PI excitation. The G0/G1 peak of the standard was set to channel 720, with the amplification system kept at a constant voltage and gain throughout the experiment.

ITS1 sequencing

DNA sequencing of the internal transcribed spacer 1 (ITS1) region was performed for selected C. geophilum isolates. Total DNA was extracted from mycelial cultures grown on PDA using the DNeasy Plant Mini Kit (Qiagen GmbH, Hilden, Germany) according to the manufacturer’s instructions except for an initial incubation in lysis buffer AB1 at 75 °C for 3 h with occasional vortexing. Polymerase chain reaction (PCR) amplification of the ITS1 region was carried out using the fungus-specific oligonucleotide primer ITS1F (5′-CTTGGTCATTTAGAGGAAGTAA-3′; Gardes and Bruns 1993) and the universal primer ITS4 (5′-TCCTCCGCTTATTGATATGC-3′; White et al. 1990) in 50 μL reactions containing approximately 20 ng of extracted DNA, 500 nM of each of the oligonucleotide primers ITS1F and ITS4, 250 μM of each dNTP, 2 mM MgCl2, and 2.5 U of Taq DNA polymerase (Fermentas GmbH, St. Leon-Rot, Germany) in 1× Taq buffer with KCl (Fermentas GmbH, St. Leon-Rot, Germany). The following PCR temperature profile was used: initial denaturation at 96 °C for 2 min; 20 cycles of 94 °C for 30 s, 56 °C for 1 min and 72 °C for 1 min; final extension at 72 °C for 10 min. PCR products were examined by agarose gel electrophoresis and sequenced using the ABI BigDye® Terminator reaction mixture and an ABI PRISM® 3730 XL DNA Analyzer (Applied Biosystems, Darmstadt, Germany) by STAB VIDA (Setúbal, Portugal).

Data analysis

Histograms of PI fluorescence intensity were obtained and evaluated using FloMax software v2.4d (Partec GmbH, Münster, Germany), and the genome size (in picogram) of each sample was determined according to the following formula:

$$ \mathrm{DNA}\kern0.5em \mathrm{content}\ \mathrm{of}\ \mathrm{sample}\ \mathrm{nuclei}\kern-11.8em =\frac{\mathrm{Mean}\kern0.5em {\mathrm{G}}_0/{\mathrm{G}}_1\kern0.5em \mathrm{fluorescence}\kern0.5em \mathrm{of}\ \mathrm{sample}\ \mathrm{nuclei}}{\mathrm{Mean}\kern0.5em {\mathrm{G}}_0/{\mathrm{G}}_1\kern0.5em \mathrm{fluorescence}\ \mathrm{of}\ \mathrm{reference}\ \mathrm{nuclei}}\times 1.16 $$

The reliability of the genome size measurements was verified by evaluating the quality of the flow cytometry histograms based on the coefficient of variation (CV) of the G0/G1 peaks and the background debris, and the CV of the genome size estimation of each isolate based on three independent measurements.

As ANOVA assumptions were not met, differences in genome size between populations were evaluated by a Kruskal–Wallis one-way analysis of variance by ranks (H test) with a significance level of P < 0.05 using SigmaPlot v12.0 (Systat Software Inc., San Jose, CA, USA). A contingency test was performed to assess the association between ploidy level and ecological background using the Fisher’s exact test for count data as implemented in R v2.15.1 (R Core Team 2012). Genome size data were downloaded from the Fungal Genome Size Database (FGSD; Kullman et al. 2005) and included all records available in October 2012, with a mean value calculated for each species upon the occurrence of multiple entries.

Forward and reverse DNA sequences of the ITS1 region were edited and aligned using Bioedit v7.1.11 (Hall 1999) and compared with all fungal sequences available through the BLAST Network Service (National Center for Biotechnology Service, NCBI; http://www.ncbi.nlm.nih.gov/BLAST) and the UNITE database (Kõljalg et al. 2005) using the BLASTn algorithm (Altschul et al. 1997). All generated DNA sequence data have been deposited with the EMBL/GenBank Data Libraries and are available under accession nos. KC894709–KC894711.

Results

Clear symmetrical G0/G1 fluorescence intensity peaks of both the fungal and the plant reference nuclei were obtained (see Fig. 1), with a mean CV value of the fungal G0/G1 peaks of 7.34 %, falling within the range of values reported in similar works (e.g., Sędzielewska et al. 2011). No mean CV measures of individual genome size estimations exceeded 10 % (see Table S1). The endopolyploid nature of R. sativus (Kudo and Kimura 2001) did not affect the analysis as only the peak corresponding to the 2C nuclei, which is used for genome size estimation, appeared on the scale set (see Fig. 1).

Fig. 1
figure 1

Fluorescence intensities in flow cytometry analysis. Histograms of relative fluorescence intensities obtained through simultaneous flow cytometry analysis of propidium iodide-stained nuclei of C. geophilum (peaks 1 and 3) and R. sativus cv. Saxa (peak 2; internal reference standard with 2C = 1.16 pg or 1,135 Mb). Illustrative histograms are shown for the following ploidy classes: a haploid (isolate Cg 1.58), b diploid (isolate B5.1), c aneuploid (isolate B57.3), and d mixoploid (isolate D70.1). Peaks are marked as: 1, C. geophilum nuclei at the G0/G1 phase; 2, R. sativus nuclei at the G0/G1 phase; 3, diploid C. geophilum nuclei at the G0/G1 phase and haploid nuclei at the G2 phase. The inset table provides information on the mean channel number (the mean fluorescence intensity of stained nuclei), the DNA index (DI = mean channel number of sample/mean channel number of reference standard) and the coefficient of variation (CV, %) of each peak

The mean DNA content per nucleus across the triplicate measurements for each C. geophilum isolate varied between 0.170 ± 0.010 pg (166 ± 10 Mbp) and 0.439 ± 0.040 pg (430 ± 39 Mbp), with an overall median and median absolute deviation (MAD) across isolates of 0.209 ± 0.012 pg (205 ± 12 Mbp) (see Table 1, Table S1 and Fig. 2). As the distribution of the average DNA content per isolate is skewed towards the lower end of its range and C. geophilum is assumed to be a haploid organism with n = 6 chromosomes (Portugal et al. 2002), the majority of the isolates included in this study (85 %; see Fig. 2 and Table S1) are presumed to be haploid with an average mean DNA content per nucleus of 0.208 ± 0.016 pg (203 ± 16 Mbp). An estimated DNA content of 0.246 ± 0.027 pg (241 ± 27 Mbp) was obtained for the reference isolate Cg 1.58 (see Table S1), in accordance with initial estimations based on preliminary genome sequencing results (M. Peters, personal communication). Six isolates had genome sizes suggesting variation in ploidy level within C. geophilum, with three isolates giving diploid and two isolates giving aneuploid estimates (see Fig. 1 and Table S1). Interestingly, one isolate (D70.1) was found to be mixoploid and displayed an approximately equal amount of haploid and diploid nuclei (see Fig. 1d and Table S1).

Table 1 Summary of genome sizes in C. geophilum
Fig. 2
figure 2

Genome size variation in C. geophilum. Histogram of nuclear DNA content (in picogram) estimated for 40 isolates of C. geophilum by propidium iodide flow cytometry

DNA sequences of ITS1 in isolates of each of the three ploidy classes (haploid: isolate C79.2, diploid: isolate A99.1, and aneuploid: isolate B57.3; see Table S1) were obtained to rule out misidentification of our fungal cultures and therefore taxonomic diversity having caused the observed variation in genome size. Average pairwise sequence similarity between the three ITS1 sequences was 97 %, indicating a close phylogenetic relatedness of isolates with different ploidy levels, and sequence comparisons with all available fungal DNA sequences in the NCBI and UNITE databases resulted in sequences from other cultured C. geophilum isolates to be identified as best hits (with 100 % query coverage, 99 % maximum identity and E values ≤ 2E-63).

Comparison of our estimates, of both presumed haploid and diploid individuals, with genome size estimates submitted to the FGSD reveal C. geophilum to have a genome size distribution in excess of the 95th percentile of the genome size distribution of all fungal species (see Fig. 3). The mean genome size of presumed haploid C. geophilum isolates within each population ranged between 0.199 pg (195 Mbp) and 0.218 pg (213 Mbp) (see Table 1), with no significant difference in genome size between the populations (Kruskal–Wallis test, H 3,31 = 4.438, P = 0.218). In addition, no significant association was found between ploidy level and edaphic background (Fisher’s exact test, P = 0.604).

Fig. 3
figure 3

Genome size variation among fungal species. Mean genome size in picogram of each species reported in the Fungal Genome Size Database (FGSD; open bars) with the distribution of Ascomycota highlighted (light gray) and the genome size estimates of 40 C. geophilum isolates (dark gray). Inset graph: detailed view of the size distribution for the upper range, including C. geophilum (colours as before). The 95th percentile of all species reported in the FGSD (0.0795 pg) is marked as a vertical dashed line

Discussion

Although the important role that fungi play in ecosystem diversity and functioning is under active research (e.g., Read and Perez-Moreno 2003), until recently, there has been relatively scarce information on genome structure and size for non-model fungal species despite its relevance for comparative biodiversity research (Gregory et al. 2007). Nevertheless, a growing body of evidence indicates large variation in genome size in the fungal kingdom. While about 90 % of estimations range from 10 to 60 Mbp (Gregory et al. 2007), extreme haploid genome sizes of up to 795 Mbp have been reported for Scutellospora castanea (Glomeromycota; Hosny et al. 1998, Hijri and Sanders 2005). The Ascomycota are well represented throughout this range, with estimates of 7.7 Mbp for Bulgaria inquinans (order Helotiales; Weber 1992) to 750 Mbp for Neottiella vivida (order Pezizales; Kullman 2002).

Considering the range of genome sizes observed in the Ascomycota, our estimates of an average haploid genome size of 203 Mbp in C. geophilum indicate that the genome size of this species is much larger than expected a priori and this holds even when extending our comparison to the whole fungal kingdom (see Fig. 3). Although large genome sizes are generally recorded for highly polyploid species (e.g., N. vivida has a ploidy level of 70; Kullman 2002), exceptionally large genome sizes have been observed in other haploid species (e.g., the haploid arbuscular mycorrhizal species Glomus etunicatum has a genome size of 750 Mbp; Hijri and Sanders 2005).

In a recent analysis of the underlying processes associated with genome size expansion within the Ascomycota, Kelkar and Ochman (2012) suggest that genetic drift, reflected through a decrease in gene density and a proliferation of introns, has played a significant role across many lineages on broad evolutionary time scales. The large genome size of C. geophilum might therefore result from ancestral genome size expansion driven by genetic drift, which would lead to a repeat-rich and relatively gene-poor genome. However, selective and neutral processes together are associated with genome size expansion within some lineages as observed by Kelkar and Ochman (2012) and others (see, e.g., Martin et al. 2010). Future analyses of the genome sequence of C. geophilum may provide more insights into whether its surprisingly large genome size is the consequence of genetic drift alone or in combination with selective processes.

Besides variation in genome size among the haploid C. geophilum isolates, variation in ploidy level was observed as four C. geophilum isolates were considered to be diploid (including the observed mixoploid) and two to be aneuploid based on our genome size estimations. Such intraspecific variation in genome size, including the variation among the isolates of a given ploidy level, is common among fungi. This variation often results from chromosome length polymorphisms and/or from gain/loss of complete chromosomes or sets thereof, as has been demonstrated in Cronartium quercuum (Anderson et al. 2010), Cryptococcus neformans (Hu et al. 2011), and Saccharomyces cerevisiae (Codón et al. 1998; Albertin et al. 2009).

Variation in genome size and the potential for genome plasticity can contribute to adaptation in heterogeneous environments. Differential selection across ploidy levels has been observed in different abiotic environments (e.g., Dhar et al. 2011) and increased ploidy has been suggested to provide a selective advantage in adaptation to new environmental conditions (Pawlowska and Taylor 2004; Ma et al. 2009). However, despite the strong selection pressure that is expected in serpentine soils and a report of possible influences of serpentine on the evolution of polyploids in K. arvensis (Kolář et al. 2012), we did not find significant differentiation in genome size between the C. geophilum populations included in this study, suggesting that genetic and demographic processes are more important in shaping the genome size variation of this species than environmental selection in the form of home soil chemistry. While these results suggest that variation in ploidy level is not related to serpentine tolerance in C. geophilum, the relatively large genome size—providing the potential for local adaptation to develop in vastly different environments—may still contribute to the wide ecological breadth, including the occurrence in serpentine soils, reported for this species.

Although unrelated to adaptation to serpentine soil conditions, it is noteworthy to state that the identification of diploid and mixed haploid/diploid C. geophilum isolates may indicate the occurrence of a cryptic mating or parasexual cycle in this presumed asexual fungal species. As first described for Aspergillus nidulans (Pontecorvo et al. 1953), the standard parasexual cycle starts with the formation of a heterokaryon (plasmogamy) followed by a fusion of the two haploid genomes (karyogamy). The resulting diploid nuclei divide mitotically along with remaining haploid nuclei and may form a discrete sector or even propagate as separate diploid strains. However, the diploid nuclei may be unstable and produce haploid or aneuploid segregants through mitotic recombination and chromosomal nondisjunction. Our observation of diploid and mixed haploid/diploid C. geophilum isolates thus supports an earlier suggestion of a cryptic mating or parasexual cycle in C. geophilum based on the indirect observation of genetic recombination using DNA sequence or genetic marker data (Lobuglio and Taylor 2002; Douhan et al. 2007). A sexual stage in C. geophilum was also recently proposed by Spatafora et al. (2012) based on the large genetic diversity detected in this species and a close phylogenetic relationship with Glonium, for which a sexual stage has been identified. As sexual reproduction in fungi can be geographically isolated and therefore difficult to detect (O’Gorman et al. 2009; Spatafora et al. 2012), it is not surprising that we found only 6 in 40 isolates to deviate from a presumed haploid condition. However, the existence of a parasexual cycle is not the only possible explanation for the observed variance in ploidy level. Alternatively, this variance may be the result of an endopolyploidization process that involves recurrent cycles of DNA replication without cellular division and produces an increase of the genome content in the somatic line or it may reflect the existence of phylogenetically diverged lineages within a C. geophilum sensu lato species complex that was proposed by Douhan et al. (2007).

In conclusion, this study revealed C. geophilum to have one of the largest genome sizes so far measured in the Ascomycota, with a mean haploid genome size estimate of 0.208 pg or 203 Mbp. Although within-species variation in genome size was detected, it was unrelated to the edaphic background of the isolates indicating that serpentine soil conditions do not affect genome sizes in C. geophilum. The observed variation in genome size, more specifically ploidy level, may however support the existence of a cryptic mating or parasexual cycle. As other explanations for this variation are possible, additional analyses are still required to confirm the presence and/or nature of a cryptic sexual stage in this widely distributed ectomycorrhizal species.