Introduction

Declining plant populations often face an increased short-term risk of extinction because of reduced reproduction due to pollen limitation and inbreeding depression, and an increased long-term risk due to decreased ability to respond to environmental changes (Kery et al. 2000; Frankham et al. 2002). Restoration activities, including introduction of individuals (Demauro 1993; Robichaux et al. 1997; Hogbin and Peakall 1999), can result in recovery of fitness or genetic rescue, but risk jeopardizing evolutionary distinctiveness and the ecological viability of populations owing to genetic swamping by introduced genotypes and outbreeding depression (Hufford and Mazer 2003; Hedrick 2005). Therefore, if introduction is planned, it is desirable to do that among populations representing similar ecotypic forms (Crandall et al. 2000) and belonging to the same lineage or management unit (Moritz 1994, 2002), which is a group of populations having a statistically significant differences of allelic composition at nuclear or organelle loci from others.

Several reviews of the relationship between molecular and quantitative variation suggest that neutral loci often do not match patterns in ecologically important traits (Reed and Frankham 2001; Mckay and Latta 2002). Hence, we should evaluate historical distinctiveness of populations and adaptive divergence by appropriate genetic markers (Fraser and Bernatchez 2001). Molecular marker assay enables us to estimate the degree of genetic connectivity among populations via gene flow and to detect endemic lineages that are the component of biodiversity (Newton et al. 1999; Soltis and Gitzendanner 1999; Gompert et al. 2006). Saltonstall (2002) revealed the extinction of local lineages of common reed (Phragmites australis) in North America by the cryptic invasion of lineages from Eurasia by using chloroplast DNA (cpDNA) variation. The investigation of heritable phenotypic variation enables the estimation of local adaptation and outbreeding depression (Montalvo and Ellstrand 2001; Hufford and Mazer 2003). Further, comparison of the degree of genetic differentiation among populations in quantitative traits (Q ST) to that in neutral markers (F ST) allows the evaluation of local adaptation caused by natural selection (Spitze 1993).

Primula sieboldii E. Morren (Primulaceae) is a perennial heterostylous herb that is distributed throughout Japan. In recent years, natural P. sieboldii populations have declined owing to loss or fragmentation of habitats, abandonment of traditional management of woodlands and grasslands, and commercial collection (Environment Agency of Japan 2000). In this context, restoration activities led by private citizens have been growing in a number of regions. Because of the failure of seed production due to the loss of either of the two heterostylous morphs (Matsumura and Washitani 2000) and to strong inbreeding depression (Kitamoto 2005; Nagai et al. 2006), the introduction of individuals may ensure population viability, as well as the abovementioned risk. Hence, it is urgent to characterize the genetic variation in P. sieboldii to ensure effective genetic conservation.

Honjo et al. (2004) examined the phylogeography of the species in Japan on the basis of the sequence variation in five noncoding regions of cpDNA. Twenty-two haplotypes belonging to three distinct clades were recognized, with some geographic structure. Furthermore, the use of nuclear genetic markers gives insight into genetic structure reflecting gene flow by pollen and seed among populations.

We consider that conservation units of a species should be determined to conserve both evolutionary distinctiveness and ecological viability of populations. The purpose of this study is to delineate the management units of P. sieboldii in Japan as the first step in determining conservation units. We examined microsatellite variation among and within 32 remnant wild populations throughout Japan. The geographical patterns are interpreted jointly with previously reported cpDNA data.

Materials and methods

Study species and sampling procedure

Primula sieboldii is a heterostylous clonal plant that occurs in moist habitats in northeastern Asia (Yamazaki 1993). Each genet is composed of physiologically independent ramets originating from short rhizomes. A main pollinator is the queen bumblebee, Bombus diversus (Washitani et al. 1995), and seed dispersal occurs by gravity. Secondary dispersal of seed and clonal propagules by water sometimes occur (Kitamoto et al. 2005a).

In Japan, P. sieboldii is distributed in five regions: southern Hokkaido, northern, central, and western Honshu, and central Kyushu. At flowering time (April–June) in 2000–2003, we collected fresh leaves from 32 populations (Table 1; Fig. 1), representative of the remnant populations in each region. In the Hidaka region of southern Hokkaido and the Karuizawa region of central Honshu, leaves were collected from eight and seven populations, respectively, within a radius of 10 km. For samples of Tajimagahara (population 16), we used plants grown from seedlings obtained in a previous study of P. sieboldii (Washitani and Kabaya 1988). To understand genetic diversity retained in remnant small populations precisely, we collected leaves from as many genets as possible in smaller populations consisting of less than 50 genets, while from randomly chosen genets in larger populations. The sample size was 920 genets in total, and ranged from 10 to 49 per population. Although we collected leaves by distinguishing genets in the field by visual observation of interclonal variations in floral morphology (i.e., size, color, shape of petal, and heterostyly; Washitani et al. 1991) and the spatial arrangement of ramets, we found several replicate multilocus genotypes within samples from the same population in the following genotyping experiment. Because those pairs of individuals were grown adjacently in each habitat and considered to belong to the same genet, we excluded the overlapped samples from the data analysis. The above-mentioned number of samples, 920, corresponds to the number of a distinct multilocus genotype.

Table 1 Locality, size, and genetic diversity measurements of each sampled population of Primula sieboldii in Japan
Fig. 1
figure 1

Sampling locations. Numbers correspond to population numbers in Table 1. The colors of plots represent the clades to which the genets within the population belong, based on chloroplast DNA sequences (Honjo et al. 2004): white, clade I; gray, clade II; black, clade III

DNA extraction and PCR amplification

Genomic DNA was extracted from leaves by a modified CTAB method (Murray and Thompson 1980). Genotypes of the samples were amplified by PCR with eight pairs of microsatellite primers (Table 2). PCR amplifications were performed in 10 μl reaction mixtures containing 10–15 ng template DNA, 10 mM Tris·HCl (pH 8.3), 50 mM KCl, 100 μM each dNTP, 0.02% Triton X-100, 0.01% gelatin, 1.5 mM MgCl2, 0.25 units Taq DNA polymerase (Promega, Madison, USA), and 0.2 μM each primer. Thermocycling conditions were as follows: 3 min at 94°C; 30 cycles of 30 s at 94°C, 30 s at a primer-specific annealing temperature (Table 2), and 30 s at 72°C; with a final extension step at 72°C for 7 min. The PCR was carried out with a GeneAmp PCR System Model 9700 (Applied Biosystems, Foster City, USA). The PCR products were run on a 3100 Genetic Analyzer with GeneScan (Applied Biosystems).

Table 2 Microsatellite markers used in the analysis of P. sieboldii in Japan, annealing temperature, and the total number of alleles observed

Data analysis

To reveal the microsatellite diversity within each population, we obtained unbiased estimates of gene diversity averaged over loci (H e; Nei and Roychoudhury 1974), average number of alleles per locus (n a), and summed number of rare alleles over loci (n ra), whose frequency was less than 1% among all populations. We also computed allelic richness (El Mousadik and Petit 1996), that is, the expected number of alleles (n 10a) and rare alleles (n 10ra) observed in 10 genets of each population (the smallest sample size). To infer the level of inbreeding in each population, we computed the mean inbreeding coefficient (F IS). The significance of F IS in terms of departure from random mating at the P < 0.05 level was tested by 5120 permutations of alleles among individuals within populations, with Bonferroni correction. This analysis also indicates the significance of the departure from Hardy–Weinberg equilibrium per locus and population. These calculations were performed with the aid of the software FSTAT 2.9.3 (Goudet 2001), except H e, which was calculated by the program Arlequin 2.001 (Schneider et al. 2000). We examined the relationship between genetic diversity within populations (H e, n a, n ra, n 10a, and n 10ra) and the logarithmic population size (total number of genets within a population) and latitude by multiple regression analyses. The latitude of each population was transformed into the plane orthogonal coordinate before analysis. The relationship between population size and latitude was examined by correlation analysis. We also examined the relationship between F IS and population size or latitude by simple regression analysis. In all these analyses, we omitted Mukawa 4, Tajimagahara, and Akagi (populations 1, 14, 16), where we were unable to count the number of genets owing to the partial destruction of habitat by road construction, restricted access, and sampling before sufficient blooming.

Levels of microsatellite variation among and within the five geographical regions were estimated by hierarchical F-statistics (Yang 1998) using the software HIERFSTAT (Goudet 2005). Genetic divergence between each population pair was estimated by calculating population pairwise F ST (Weir and Cockerham 1984) using FSTAT 2.9.3. To test the significance of correlation between geographic distance and pairwise F ST values among all pairs of populations, a Mantel test (Mantel 1967) was conducted using FSTAT 2.9.3. The genetic similarity (1 − F ST) matrix was used in principal coordinate analysis (PCO) to extract the association of populations by using the software PCO 1.0 (Iwata 2004).

We also examined genetic clustering within species by using a Bayesian clustering method implemented in the program STRUCTURE 2.1 (Pritchard et al. 2000; Pritchard and Wen 2003), which estimates the number of clusters (K) present in a sample without the geographical locations of individuals and then assigns them to these clusters. We first ran the program for the total data set (n = 920) assuming correlated allele frequencies and admixture. We calculated ten independent runs of K = 1–20 with 105 MCMC cycles each for burn-in and data collection. Then, to detect the true number of clusters, we calculated ΔK (Evanno et al. 2005), which is based on the rate of change in posterior probability values for K [log probability of data; L (K)] between successive K values. Evanno et al. (2005) showed that the real number of clusters is best detected by the modal value of ΔK rather than L (K) itself. They also showed that STRUCTURE tends to detect the uppermost level of a population hierarchy, and that within these clusters there can be sublevels of structuring. Therefore, we also explored whether substructure could be detected within each detected cluster in a similar manner to the first run (but K = 1–10).

Results

Genetic diversity within populations

All loci showed high polymorphism, with the number of alleles per locus ranging from 10 (ga0668) to 38 (ga0666; Table 2). In total, 160 alleles were detected in 920 distinct genets from 32 populations of P. sieboldii across eight loci. Genetic diversity measurements within each population, H e, n a, n ra, n 10a, and n 10ra ranged from 0.420 to 0.786, 2.88 to 9.13, 0 to 14, 2.72 to 6.47, and 0 to 10, respectively (Table 1). Sendai (population 11) in northern Honshu showed the lowest value of every measurement. Four populations in Hokkaido (populations 2, 3, 5, 8) also do not have any rare alleles as well as Sendai. On the other hand, Aso (population 32) in southwestern Japan showed the highest or second highest values for all measurements. The mean inbreeding coefficient (F IS) ranged from −0.157 of Kofu (population 28) to 0.129 of Nasu (population 13).

In the multiple regression analyses, five genetic diversity measurements (H e, n a, n ra, n 10a, and n 10ra) were significantly correlated with latitude (Table 3). That is, all measurements, especially the number of rare alleles, significantly decreased with increasing latitude. Also, H e, n a, and n ra were significantly correlated with population size. These measurements increased with increasing population size. On the other hand, n 10a and n 10ra were not correlated with population size (Table 3), indicating that allelic diversity in the same number of genets was not dependent on population size. Population size and latitude were not significantly correlated (P = 0.11). In the simple linear regression analysis, population size and F IS were positively correlated (P < 0.05; Fig. 2), indicating an excess of heterozygotes in a smaller population. Latitude and F IS was not related (P = 0.91). The tests for significance of F IS showed that individual populations did not deviate significantly from Hardy–Weinberg equilibrium after Bonferroni correction.

Table 3 Results of multiple regression analyses with genetic diversity measurements as the dependent variable and population size (pop. size) and latitude as the independent variables
Fig. 2
figure 2

Relationship between log population size and the inbreeding coefficient F IS in populations of P. sieboldii in Japan

Genetic differentiation among populations

According to hierarchical F-statistics, 10.0% of the total microsatellite variation occurred among regions, 12.1% among populations within regions, and 77.9% within populations. Population pairwise F ST ranged from 0.002 to 0.430 (Table 4). The Mantel test revealed a significant correlation (r = 0.691, P < 0.0001) between pairwise F ST values and the logarithm of geographic distances (Fig. 3).

Table 4 Population pairwise F ST among 32 populations of Primula Sieboldii
Fig. 3
figure 3

Relationship between log geographic distance and pairwise F ST between all pairs of populations of P. sieboldii in Japan

PCO analyses of all 32 populations detected four genetic groups, corresponding to the geographic regions (Fig. 4). The first axis (PCO1) accounted for 22.0% of the variation and separated the Hokkaido populations from the more southern populations. Along the PCO2 axis, which accounted for 12.4% of the variation, the populations from western Japan (Kyushu and western Honshu), central Honshu, and northern Honshu were arranged in geographic order.

Fig. 4
figure 4

Principal coordinate analysis (PCO) of 32 populations of Primula sieboldii analyzed at eight microsatellite markers. The first (PCO1) and second (PCO2) axes explain 22.0% and 12.4% of the total variation, respectively. Numbers refer to Table 1. Four genetic groups, corresponding to geographic regions, are encircled. The colors of plots indicate cpDNA clades according to Honjo et al. (2004): white, clade I; gray, clade II; black, clade III

In the STRUCTURE analysis of the total dataset, the ΔK statistic clearly indicated that the sample included at least two distinct groups (Fig. 5A, B), though the log probability of data L (K) still increased after K = 2. Nearly all individuals from Honshu and Kyushu were assigned to cluster 1 and individuals from Hokkaido to cluster 2 (Fig. 6A). We also explored whether substructure could be detected within the clusters. Cluster 1 was split into three subclusters (1a, 1b, 1c; Figs. 5C, D, 6B), consisting of individuals mainly from western Japan, central Honshu, and northern Honshu, respectively, corresponding well to the geographic and PCO patterns (Fig. 4) except for Akagi (population 14) and Takayama (population 26). Most individuals of Akagi, which is located most proximal to northern Honshu among populations in central Honshu and was plotted in central Honshu group in PCO analysis, were assigned to cluster 1c. In Takayama, contrary to geographical location, a high rate of assignment to cluster 1c was detected, although the population was plotted in the PCO analysis by geographical location (namely, plotted between western Japan populations and central Honshu populations).

Fig. 5
figure 5

Detection of the number of genetic clusters in the whole dataset (A and B) and in cluster 1, consisting of individuals of P. sieboldii mainly from Honshu and Kyushu in Japan (C and D; see text and Fig. 6). (A) and (C): Mean L (K) (±SD) over ten runs as a function of K. (B) and (D): ΔK (Evanno et al. 2005) as a function of K

Fig. 6
figure 6

Genetic clustering within P. sieboldii in Japan inferred by the STRUCTURE program. The relative lengths of each color in each bar represent average assignment probabilities of every population to each cluster. (A) In the analysis of the total dataset, two clusters were detected. Nearly all individuals from Honshu and Kyushu were assigned to cluster 1 and individuals from Hokkaido to cluster 2. (B) Cluster 1 was split into three clusters (1a, 1b, 1c), consisting of individuals mainly from western Japan, central Honshu, and northern Honshu, respectively

Discussion

Genetic diversity within populations

All microsatellite diversity measurements within a population were negatively correlated with latitude (Table 3). This result corresponds to the previous cpDNA analysis of P. sieboldii (Honjo et al. 2004), which revealed that only a few lineages might have expanded northward. Similar decrease of genetic diversity has been observed in some temperate plant species in Japan (Tomaru et al. 1997, 1998; Ohi et al. 2003). The Karuizawa region, where high haplotype diversity was observed (Honjo et al. 2004), also showed high microsatellite diversity (Table 1). These results suggest that Karuizawa was a glacial refugium or that it was a region of admixture of populations descended from distinct refugia (Walter and Epperson 2001).

Total allelic diversity within a population (n a, n ra) was higher in larger populations, but not so after rarefaction adjustment (n 10a, n 10ra; Table 3). Looking at the relationship between population size and F IS, an excess of heterozygotes was observed in smaller populations (Fig. 2). Although this might be the result of demographic stochasticity because the relationship was relatively weak, it may suggest that more heterozygous genets tend to survive the habitat contraction possibly because of the higher fitness associated with heterozygosity as observed in several long-lived perennials (Schaal and Levin 1976; Rajimann et al. 1994; Oostermeijer et al. 1995; Luijten et al. 2000). The latter case may stem genetic decline of populations. Although further investigation is needed to determine whether more heterozygous genets have higher fitness or not, the purge of homozygosity of recessive detrimental genes may be likely, as strong inbreeding depression was observed in seed production, in germination, and in growth of seedlings of P. sieboldii (Kitamoto 2005; Nagai et al. 2006).

Genetic differentiation among populations

The data from different markers provide valuable information for understanding the evolutionary relationships of populations and identifying the appropriate units of biodiversity for conservation purposes. Previous cpDNA analysis (Honjo et al. 2004) did not detect obvious genetic differentiation between Hokkaido and Honshu populations of P. sieboldii. All three cpDNA haplotypes found from Hokkaido were also distributed in Honshu. However, PCO and STRUCTURE analysis based on nuclear microsatellite variation revealed clear genetic differentiation between those populations (Figs. 4, 6A). Ohshima (1990) suggested that Hokkaido and Honshu have been separated for about 100,000 years, even during the last glacial period. The genetic differentiation may result from long-term isolation, even though cpDNA sequences have not yet diverged. Along the PCO2 axis in Fig. 4, populations from western Japan, central Honshu, and northern Honshu paralleled their geographical location. This pattern supports the hypothesis based on cpDNA analysis (Honjo et al. 2004) that some lineages have migrated from western to northeastern Japan. Overall, the microsatellite allelic composition was similar among geographically adjacent populations as shown in a significant isolation-by-distance pattern of Fig. 3, even when the cpDNA lineage of the populations differed. Hierarchical F-statistics revealed that 10.0% of the total microsatellite variation occurred among regions and 12.1% among populations within regions, while analysis of molecular variance (AMOVA; Excoffier et al. 1992) suggested that 59.9% of the total cpDNA variation existed among regions and 32.5% among populations within regions of P. sieboldii (Honjo et al. 2004). These relatively lower degrees of population differentiation estimated by nuclear microsatellites than cpDNA variation suggests that substantial pollen flow among populations might have historically occurred.

Conservation units of Primula sieboldii

To recognize conservation units is important in the conservation and restoration of endangered species (Moritz 1994, 2002; Crandall et al. 2000; Fraser and Bernatchez 2001). We think that historical distinctiveness and adaptive divergence of populations should be considered in determining conservation units. From PCO and STRUCTURE analysis (Figs. 4, 6), we detected four genetic groups well corresponding to geographic regions (Hokkaido, northern Honshu, central Honshu, and western Japan), which could be recognized as management units (Moritz 1994, 2002) of P. sieboldii. Although it is not known whether these genetic groupings differentiate in adaptive traits, intraspecific endemic lineages merit conservation attention as a component of biodiversity. If supplementation with plants from other populations were planned, plant materials should be obtained from within the same management unit. Because the genetic composition of Takayama (population 26) may be somewhat different from geographically adjacent populations, careful management of this population may be needed. Admixture of individuals of northern Honshu descent and native one may be occurred. Also, since the assignment of Akagi (population 14) has varied between the PCO and STRUCTURE analysis, care may be needed for the handling of this population. This population might have experienced gene flow from both central and northern Honshu.

The patterns of neutral molecular and quantitative variation often do not correspond (Reed and Frankham 2001; Mckay and Latta 2002). For example, Cai et al. (2004) revealed that quantitative traits of common wild rice, Oryza rufipogon, were differentiated into perennial and annual types in response to different habitat conditions, whereas molecular markers showed a genetic isolation-by-distance pattern. A study of a vulnerable tree endemic to South America has shown that neutral DNA markers failed to detect genetic divergence in an ecologically important trait relating to drought tolerance (Bekessy et al. 2003). Steinger et al. (2002) found a strong correlation between molecular and quantitative genetic differentiation between pairs of populations, but the degree of differentiation in several quantitative traits was much larger than the neutral expectation. Therefore, we need to further evaluate population differentiation in adaptive traits to avoid outbreeding depression. A common garden experiment is effective to examine the genetic variation of phenotypic traits (Mckay and Latta 2002). Because each genet of P. sieboldii is composed of several clonally propagated ramets, we will be able to transplant some ramets to the common garden experiment while leaving other ramets in the wild habitat. By comparing relative levels of among-population divergence in quantitative traits (as measured by Q ST) and neutral DNA markers (as measured by F ST), we can examine the extent of local adaptation of phenotypic traits (Spitze 1993; Steinger et al. 2002).

We detected endemic cpDNA haplotypes and significant allelic variation at microsatellite loci among populations of P. sieboldii. These regional features of genetic variation may allow us to infer the origins of individuals of P. sieboldii with statistical methods such as an assignment test (Manel et al. 2005), as actually demonstrated in Honjo et al. (in press). Because commercial collection is posing an ever greater threat to the species, tracing the origin of each individual will contribute to managing species conservation. It will be also available to confirm the origins of ex situ stocks, which could be useful resources for genetic restoration.