Introduction

Habitat fragmentation threatens the survival of many species due to the formation of smaller and discrete plant populations that eventually become more spatially isolated (Fahrig and Merriam 1994; Young et al. 1996; Frankham et al. 2002). If populations are separated over successive generations, their heterozygosity decreases because of greater random genetic drift and inbreeding (Rumball et al. 1994). This produces a series of negative effects, e.g., the accumulation of deleterious recessive alleles, lower fecundity in individual plants, higher seed/seedling mortality, and slower individual growth rates, thereby reducing fitness and perhaps leading to local extinction (Lande 1988; Young et al. 1996; Newman and Pilson 1997; Aguilar et al. 2008).

Also, certain life-history traits in plants may confer different levels of vulnerabilities to fragmentation effects. For example, the onset of genetic influences from fragmentation is frequently delayed in long-lived woody species (Torres-Díaz et al. 2007). For long-lived woody species, two alternative hypotheses might explain why they can counteract negative inputs (Van Geert et al. 2008). First, although the habitats for some species might become fragmented, their populations are still interconnected through pollen flow and/or seed dispersal, which can counteract genetic erosion (Ellstrand 1992; Kwak et al. 1998; Newman and Tallmon 2001). Second, for some species with relatively long-life cycles of up to a few hundred even over 1000 years, the long lifespan of individual plants reflects a historic situation in which habitat fragmentation is less expansive (Ellstrand and Elam 1993; Oostermeijer et al. 1994; Sork et al. 1999; Honnay et al. 2005). However, evidence from both theoretical (Hanski and Ovaskainen 2002) and empirical (Lindborg and Eriksson 2004) indicates that a woody species can become extinct after 50–100 years of habitat fragmentation. Thus, when making conservation decisions about endangered species, it is of the utmost importance to understand the genetic variation and structure of the species in order to elucidate the factors that determine the genetic architecture of plant populations (Milligan et al. 1994; Schaal et al. 1998; Avise 2000). Research on the influence of habitat fragmentation has largely concentrated on assessing the genetic consequences of recent events associated with human activity (Mona et al. 2014). For example, Fagus sylvatica L. is a wind-pollinated tree, which dominates forest over large regions of Europe. The habitat fragmentation induced by historic deforestation in this species has led to genetic bottlenecks and the disruption of the species’ breeding system (Alistair and Josep 2006). In Scotland, deforestation caused chronic habitat fragmentation on Sorbus aucuparia Poir., which led to the reduced pollen-mediated gene flow (Bacles et al. 2004). English yew (Taxus baccata L.) is a Tertiary relict, whose genetic variation was highly structured at the local scale, while only a low proportion of the observed variation was attributed to regional differences. The patterns probably reflect the combined effects of Quaternary climatic changes and recent impact of human activities (González-Martínez et al. 2010). Although the effects of long-term fragmentation on genetic structure have been reported, but the role that historical processes play in affecting population genetic structure is still unclear.

C. macrolepis (Chinese Incense-cedar) is a relictual, long-lived woody plant. It is native to southwestern China [southeast of the Yunnan-Guizhou Plateau (YGP)] and the adjacent regions of Myanmar and Vietnam (Fu and Yu 1999; Farjon 2005). The genus includes four species, which have a typical eastern Asia–western North American disjunct pattern of distribution and represent an ancient lineage (Averyanov et al. 2008; Chen et al. 2009). It is an endangered ancient relictual and fragmented conifer, and is a heliophile and subtropical plant. It is monoecious with unisexual flowers on the same branches. It belongs to wind-pollinated plant and contains light-weight seeds that have unequal wings (Liao et al. 2014; Wang et al. 2004). Based on fossil evidence from the European Tertiary, C. macrolepis was formerly widespread in temperate regions (Kracek 1999), and sediments also exist from the Asiatic Oligocene to Pliocene (WGCPC 1978; Liu et al. 1996). Because of its valuable wood, it has been threatened by over-harvesting in the past century and its habitat has also been severely degraded ecologically. Its wild populations are dramatically shrinking, and an overall decline continues throughout its entire area of distribution (Liao et al. 2014; Liu et al. 2010). According to criteria from the 2001 International Union for the Conservation of Nature and Natural Resources Red List (Farjon 2001), C. macrolepis is classified as a vulnerable species and is listed as an endangered species in China (Fu 1995) and Vietnam (WCMC 1997).

Although few microsatellite characterizations have been made of its genetic population structure and diversity, recently markers have been developed (Liao et al. 2010). In this study, we conducted genetic diversity and structure analyses of this species by microsatellite. Our objectives were to examine: (1) the degree of genetic diversity within and among remnant populations, (2) significant genetic differentiation among our study populations, (3) factors that determine their genetic patterns, and (4) optimum strategies for future conservation.

Materials and methods

Population descriptions and sample collection

In China, Calocedrus macrolepis is distributed on sites within Yunnan and southern Guizhou Provinces, at elevations ranging from 300 to 2000 m. This species also can occur sporadically in western Guangxi and Hainan Provinces. As a medium-sized, subtropical evergreen, trees can be 15–25 m tall, with trunks as large as 60–80 cm in diameter at breast height (dbh). Remnant populations are generally fragmented and highly disturbed because of heavy deforestation. We analyzed 14 isolated populations that represent those found throughout China. These included eight from Yunnan Province, three from Guizhou Province, two from Guangxi Province, and one from Hainan province (Fig. 1; Table 1). For each population, Needle samples from 17 to 34 mature individuals (dbh > 10 cm) were randomly collected. After collection, samples were dried in silica gel and stored at 4 °C prior to DNA extractions.

Fig. 1
figure 1

Geographic locations for 14 study populations of C. macrolepis

Table 1 Locations for the 14 populations of Calocedrus macrolepis sampled in this study

DNA extraction, microsatellite amplification, and scoring

Genomic DNA samples were extracted from dry leaf tissue, which was grounded in liquid nitrogen according to CTAB methodology (Doyle and Doyle 1987). Nuclear DNA was PCR-amplified by microsatellite primer. Nine microsatellite primers (Cm2Cm4, Cm6Cm8, and Cm10Cm12) (Liao et al. 2010) that showed good repetition and distinct polymorphism were selected for this study. Amplification conditions were as described by Liao et al. (2010). PCR products were separated on an 8 % denaturing polyacrylamide gel, using a 20 bp DNA ladder molecular size standard (TaKaRa, China) and silver staining.

Genetic diversity, Hardy–Weinberg equilibrium, and linkage disequilibrium

To estimate overall genetic diversity, we used GenAlEx, Version 6 (Peakall and Smouse 2006) to calculate the following measures: mean number of alleles per locus (Na), average number of effective alleles (Ne), observed heterozygosity (H O) and expected heterozygosity (H E) for each locus and population, as well as private alleles (Np), and the inbreeding coefficient (F IS) for each population. The presence of null allele was tested via MICRO-CHECKER 2.2 (Van Oosterhout et al. 2004). We applied the Markov chain method for estimating the probability of significant deviation from the Hardy–Weinberg equilibrium (HWE) and also used default parameters to perform pair-wise tests for linkage disequilibrium (LD), using the online version of GENEPOP (2013). Sequential Bonferroni corrections were made to determine significance levels for all tests at P < 0.05 (Rice 1989).

Genetic differentiation and assignment analyses

Overall and pair-wise values for genetic differentiation (F ST) were generated by the AMOVA routine in GenAlEx program, Version 6. Significance of the differentiation between pairs of populations was tested using permutation procedures (1000 replicates). The average effective number of migrants exchanged between populations in each generation, i.e., gene flow (Nm), was estimated from F ST.

Cluster analysis

Unbiased genetic distances (Nei 1978) were computed with PopGene software, Version 1.32 (Yeh et al. 1997). A dendrogram based on these resulting coefficients of genetic distance was constructed using the unweighted pair-group method with arithmetic mean (UPGMA). This distance matrix was analyzed with NTSYSpc, Version 2.11c (Rohlf 1997). Cluster analysis (UPGMA) and dendrogram construction were performed with the SHAN program in NTSYSpc to determine the genetic distance among the 14 populations. A simple Mantel test was conducted with TFPGA 1.3 (Miller 1997) to assess the correlation between the Nei’s unbiased genetic distance and the geographic distance that separated populations.

Genetic structure

Bayesian clustering was used for inferring the population structure by STRUCTURE v2.3.4 (Pritchard et al. 2000). The each level of (K = 1–20) was tested and determined by a no-admixture model with 106 iterations and 105 burn-in steps. Ten independent runs and tests for each K were performed to ensure consistent results. Different clustering scenarios were evaluated using the mean log probability [L(K)] and the change in log probability, ΔL(K) (Evanno et al. 2005).

Long-term migration and effective population size

The MIGRATE-N 3.6.5 (2013) was employed to estimate long-term migration rates and effective population sizes. Analyses assumed a Brownian motion model which represents a continuous approximation to the stepwise mutation model for microsatellite evolution (Kimura and Ohta 1978). Long-term migration rates, M, 4N e generations in the past based on a coalescent approach, were estimated using the maximum-likelihood mode. MIGRATE-N tests all possible combinations of symmetrical or no migration between each regional population group and between populations within a region with the likelihood ratio test. MIGRATE-N estimates the long-term effective population size (N e) of each population group as parameter θ (that is, 4N e μ, where μ is the mutation rate per site). MIGRATE-N also estimates the bi-directional M among population groups and among populations within a region (M = m/μ, where m is the immigration rate per generation) (Hsieh et al. 2013). The Markov Chain settings consisted of ten short and three long chains per run, with an increment of 20. For short and long chains, 500 and 1000 genealogies were respectively recorded, with a burn-in of 1000.

Results

Genetic diversity

A total of 70 alleles at nine microsatellite loci were revealed across 291 individuals of 14 natural populations for C. macrolepis. Allele numbers ranged from four at locus Cm7 to 13 at locus Cm12, with a mean value of 7.8 per locus. Mean gene diversity ranged from H E = 0.497 at locus Cm7 to H E = 0.748 at locus Cm10, H O values ranged from 0.534 at locus Cm8 to 0.869 at locus Cm3. Null alleles were detected at a negligible frequency in two loci (Cm11 and Cm12). The average variation for intrapopulation microsatellites was Na = 4.7 (3.6–5.9), Ne = 3.2 (2.5–4.0), H E = 0.636 (0.563–0.701), and H O = 0.712 (0.467–0.844). Four private alleles (Np) were detected in two populations—DZ (1; 2.5 %) and LY (3; 5–25 %) (Table 2). The level of inbreeding for each population was between −0.311 and 0.365, and significant inbreeding coefficients (F) were detected in the LY population (F = 0.365) (Table 2). When all loci were combined, nine populations deviated from the HWE at P < 0.05 (Table 2). Significant linkage disequilibrium between SSR loci within each population was detected for 31 out of 485 comparisons; only four remained significant after the Bonferroni correction (P ≤ 0.05), suggesting that the loci used in this study were independent of each other.

Table 2 Genetic diversity and fixation indices for the 14 populations of Calocedrus macrolepis

Population genetics differentiation and structure

The global genetic differentiation coefficient (F ST) was 0.163 (Table 3). Pair-wise F ST estimates ranged from 0.060 (LS vs. JH) to 0.245 (CN vs. LY) (Table 4). Historical gene flow (Nm) between pairs of populations averaged Nm = 1.562 and ranged from 0.770 (between CN and LY) to 3.895 (between JH and LS) (Table 3). No significant correlation was detected by the Mantel test between genetic and geographical distances among pair-wise comparisons (r = 0.197, P = 0.103). The relationships among these 14 populations were clustered based on Nei’s genetic distances (Fig. 2). Population CJ from Hainan province was clustered with YM and DZ from Yunnan and Guizhou provinces, respectively. The easternmost population, LS, was grouped with the westernmost population, TC. Guangxi populations HJ and LY were clustered into another group.

Table 3 Genetic differentiation (F ST), gene flow (Nm), and fixation index values within populations (F IS) and over all populations (F IT) for 14 populations of Calocedrus macrolepis
Table 4 Pair-wise F ST (below the diagonal) and Nm (above the diagonal) comparisons among 14 populations of Calocedrus macrolepis
Fig. 2
figure 2

Dendrogram showing genetic association among 14 populations of Calocedrus macrolepis based on genetic distances. The dendrogram based on coefficients of genetic distance was constructed using the unweighted pair-group method with arithmetic mean (UPGMA)

Based on nine loci, a global analysis of pooled data from all populations was implemented by Structure program. The results showed that individuals of the 14 C. macrolepis populations could be generally assigned to two separate groups (western and southern) (Fig. 3) according to changes in the log probability (ΔL(K); Fig. 4). Western group included populations Yimen (YM), Tengchong (TC), Changning (CN), Leishan (LS), Jianhe (JH), Lincang (LC), Mojiang (MJ), Puer (PE), Shiping (SP), and Yuanjiang (YJ). Southern group comprised populations Changjiang (CJ), Huanjiang (HJ), Danzhai (DZ), and Leyie (LY). According to historical migration rates (M) calculated by MIGRATE-N, there is no significant asymmetrical migration among the two regional groups. However, significant asymmetrical migration among populations within the western and southern regional groups according to non-overlapping 95 % CIs was revealed (Table 5; Fig. 5, respectively).

Fig. 3
figure 3

Analysis with STRUCTURE for K = 2. Each color represents a different cluster. Black lines separate populations. See Table 1 for population codes

Fig. 4
figure 4

Log likelihood and changes in the log likelihood for different scenarios of groupings were based on nine microsatellite loci of Calocedrus macrolepis

Table 5 The migrations estimated using MIGRATE-N among ten regional populations
Fig. 5
figure 5

Graphical representation of results of migrations estimated using MIGRATE-N among the four regional populations. Maximum-likehood estimates and extreme value in each locus (in parentheses) of the long-term migration rate (M) and mutation-scaled effective population size (θ) using the MIGRATE-N program. Arrows represent the direction of gene flow among sites. Dotted lines means no value

Discussion

Genetic diversity

Compared with previously reported genetic patterns for C. macrolepis that were based on inter simple sequence repeat (ISSR) analysis (mean H E = 0.1114, overall F ST = 0.042) (Wang et al. 2004), we found much greater variation and higher differentiation in our 14 remnant natural populations (mean H E = 0.636, overall F ST = 0.163). One potential explanation for this discrepancy is that only five populations were sampled in that earlier study, which may have led to somewhat biased estimates of genetic diversity. Another possibility is that techniques utilizing dominant molecular markers, such as ISSR, AFLP (amplified fragment length polymorphisms), or RAPD (rapid amplification of polymorphic DNA), provide limited resolution when examining interpopulation differentiation and gene flow among populations. By contrast, microsatellite markers that rely on specific primers for a certain species, and which are present in all species genomes, are highly polymorphic, codominant, and widespread across the genome (Rallo et al. 2000; Sefc et al. 2000). Therefore, they can be used to discriminate between heterozygotes and homozygotes, and can truly reveal the genetic diversity within a species.

Compared with other plants, most conifers show a relatively high level (H E = 0.151) of variation within populations and a low level (F ST = 0.073) of differentiation among populations, as measured by isozyme markers (Hamrick et al. 1992; Matusova 1995). However, values for genetic diversity vary among types of markers, making it necessary to compare diversities in different species based on the same molecular markers. Here, we found that diversity (H E = 0.636) was similar with the mean values (H E = 0.65) reported from studies of other regional species such as Larix occidentalis Nutt. (H E = 0.58) (Khasa et al. 2006) Fitzroya cupressoides I. M. Johnst. (H E = 0.55) (Allnutt et al. 1999) and long-lived perennial species (H E = 0.68) (Nybom 2004) such as Araucaria araucana (Molina) K. Koch (H E = 0.69) (Bekessy et al. 2002), Quercus lobata Née (H E = 0.78), Juniperus oxycedrus Hochst. (H E = 0.72) (Boratyński et al. 2014), and Quercus douglasii Hook. & Arn. (H E = 0.88) (Craft et al. 2002). However, the diversity was much lower than that estimated with SSR markers for five pinaceae species with broad distribution (H E = 0.734–0.815, mean = 0.79) (Mariette et al. 2001; Acheré et al. 2005; Maghuly et al. 2006; Mehes et al. 2009) and was higher than the cupressaceae species with narrow distribution, such as Pilgerodendron uviferum (D. Don) Florín (H E = 0.035) (Premoli et al. 2001), F. cupressoides (H E = 0.077) (Premoli et al. 2000), and Podocarpus nubigena Lindl. (H E = 0.218) (Quiroga and Premoli 2010).

Genetic structure and gene flow

Because of greater random genetic drift and inbreeding as well as reduced gene flow, habitat fragmentation is expected to erode genetic variability and increase the interpopulation genetic divergence of plant populations (Young et al. 1996; Sork et al. 1999; Lowe et al. 2005; Aguilar et al. 2008). However, many tree species are probably resilient to habitat fragmentation for one or two generations after it occurs because they already contain high diversity (Bacles et al. 2004, 2005; Yao et al. 2007). In this study, we calculated that the degree of population differentiation was 0.163, based on SSR loci. This indicated that 16 % of the total genetic diversity could be attributed to differences among populations. Our F ST value for C. macrolepis was relatively higher than the level of genetic differentiation observed in other conifer species when examined with microsatellite markers. These include Pinus monticola, F ST = 0.057 and P. strobus, F ST = 0.084 (Mehes et al. 2009); Picea abies (L.) H. Karst., F ST = 0.009 (Acheré et al. 2005); P. abies, F ST = 0.005 (Maghuly et al. 2006); Pinus pinaster, F ST = 0.111 (Mariette et al. 2001); J. oxycedrus, F ST = 0.160 (Boratyński et al. 2014); J. cedrus, F ST = 0.08 (Rumeu et al. 2014); P. uviferum, F ST = 0.159 (Premoli et al. 2002); F. cupressoides, F ST = 0.078 (Premoli et al. 2001) and F. cupressoides, F ST = 0.14 (Allnutt et al. 1999). However, our findings were similar to those for other relictual conifer species in China (Wang et al. 1996; Ge et al. 1998; Tang et al. 2008). The level of gene flow for C. macrolepis (Nm = 1.562) suggests that the genetic diversity in C. macrolepis is low, and the probability of population differentiation has increased due to the fragmentation of habitat. Similar results were found for populations with narrow and fragmented distributions (Larix lyallii Parl., Nm = 1.4, (Khasa et al. 2006) and P. pinceana, Nm = 1.39, (Ledig et al. 2001). C. macrolepis is wind-pollinated, monoecious, and long-lived, and its small, light-weight seeds have two sub-apical, unequal wings (Wang et al. 2004). These biological features generally play a significant role in enhancing gene flow and decreasing genetic differentiation among populations. The genetic differentiation coefficient (F ST) represents the extent of gene differentiation among populations. Generally, 0.05–0.15 of F ST indicates the relatively middle level of gene differentiation, and 0.15–0.25 of F ST indicates the relatively high level of gene differentiation (Balloux and Lugon-Moulin 2002). Nevertheless, we found relatively high population differentiation (F ST = 0.163) and low gene flow.

From the UPGMA, populations HJ and LY are clearly separated from the rest, also MJ, YJ, and PE form a clear cluster, marginal populations TC, LS, JH, CN, and LC form another cluster, which is supported by Structure results. A reasonable hypothesis for the modern distribution patterns is that because of their suitability as subtropical climates, regions that include the southeastern portion of the YGP on the southeastern edge of the Qinghai-Tibetan Plateau may have been a Pleistocene refugium for C. macrolepis after the Quaternary glaciations. Concurrently, because of neotectonic movement since earlier in the Quaternary, complex and various climates and topography have appeared, and various microclimates have been shaped in the YGP. C. macrolepis is a heliophile and subtropical plant and after the Quaternary glaciations, it was maintained only in areas with subtropical microclimates. The genetic patterns presented here possibly reflect a scenario of glacial refugees (may be in lowland areas) with posterior recolonization of highlands when the temperature raised. Populations LS, JH, and DZ show great genetic diversity, also population DZ and LY present private alleles. In addition these populations have the lowest elevations, therefore these populations could be considered may be as glacial refugees. These deductions accorded with the results of Migrate analysis. Because the evolutionary rate of conifers are very slow (Petit and Hampe 2006; Savolainen and Pyhäjärvi 2007), these results implied that the genetic patterns within C. macrolepis were caused mainly by a long history of population isolation.

Management implications and recommendations for future research

Genetic diversity plays an important role in determining the survival and adaptability of a species. The loss of such diversity is often associated with reduced fitness. Maintaining genetic variation is a major objective when developing most programs for species conservation (Avise and Hamrick 1996). In the present study, although the majority of genetic variation by C. macrolepis resided within (84 %) rather than between populations, its diversity was lower and its differentiation was much greater than is found with other conifers that become broadly distributed due to population fragmentation. Nine populations in 14 sampled populations deviated from HW equilibrium. All of the results indicate that populations of C. macrolepis are declining. Moreover, their fragmentation may have been shaped by long-term historical events as well as independent evolutionary processes, such as neotectonic movement, glaciation, and glacial retreat.

Hence, to capture the greatest within-species genetic variation, decision-makers must include as many populations as possible in their programs for ex situ conservation or breeding. Because all extant populations are small and isolated, in situ conservation strategies should be designed to preserve large areas in order to minimize the loss of diversity that may result from future genetic drift. From our example, special attention should be given to Populations LY and DZ because of their private alleles, as well as to Population CJ in Hainan province because of its isolation from the mainland. The negative consequences of habitat fragmentation, such as genetic erosion and increased genetic divergence among populations, may be delayed in long-lived perennial plant species. This is because mature, established individuals may still reflect a historic situation with less-severe habitat fragmentation (Sork et al. 1999; Tomimatsu and Ohara 2003; Galeuchet et al. 2005; Van Geert et al. 2008). To determine the extent to which fragmentation affects the genetic variation and structure of C. macrolepis among generations, we must compare diversity and differentiation between adult and seedling populations, and integrate simultaneous investigations of demography, reproductive biology, and genetics.