Introduction

Turnip (Brassica rapa ssp. rapa) has been cultivated over 2100 years since its first introduction into China during the Western Han Dynasty period and is probably the ancestor of Chinese cabbage (B. rapa ssp. pekinensis) (Cao 1996). China is also the center of origin of Chinese turnip rape (ssp. oleifera) (Li 1981), which is a unique turnip rape (oil type). Turnip is used to be grown as a popular vegetable in many regions in China with various Chinese names such as Manjing, Yuancaitou, Yuangen, Pancai, Buliuke, etc. However, with the introduction of new types and varieties of vegetables as well as the change in cultivation system, the planting area of turnip has been significantly reduced in recent years (Wang et al. 2015).

In Xinjiang Uygur Autonomous Region, the northwest of China, turnip known as “Qamgur” in Uygur is one of the popular and favorite foods of the Uygurs (Zhang et al. 2009; Ma et al. 2016). The Uygurs compare “Qamgur” to “renshen” (the dry root of Panax ginseng C.A. Mey.), a worldwide well-known traditional Chinese medicine popularly known as “ginseng”. But eating “Qamgur” does not cause inflammation while eating “renshen” could. Thus, most people in southern Xinjiang even consider “Qamgur” as an essential daily food (Ma et al. 2016). The roots and seeds of “Qamgur” are also commonly used in Uygur medicine (Aikebaierjiang et al. 2011; Xie et al. 2014). With the vigorous development of the primary industry in Xinjiang, the production area of turnip is actually increasing year by year (Ma et al. 2016). Since Xinjiang is located in the center of Eurasia, which is close to several centers where plant species originate, it has been reported that germplasm of several plant species in this region are somewhat genetically different from those in other parts of China (Wang et al. 2013, 2016). Although “Qamgur” belongs to the same species of turnip, its shape and flavor are different from other turnips (Xie et al. 2014; Ma et al. 2016). It remains unclear whether varieties of “Qamgur” in Xinjiang are genetically different from turnip varieties grown in other provinces in China.

Previous studies suggest that turnip has a higher genetic variation than other crops within B. rapa (McGrath and Quiros 1992; Persson et al. 2001). The genetic variation within 31 accessions including varieties, elite stocks, landraces and older turnip of slash-and-burn type collected from Nordic area was 18.7% based on 26 alleles of nine isozyme loci (Persson et al. 2001). According to 34 morphological and agronomical traits, a panel of 120 landraces consisting of turnips, turnip greens, and turnip tops collected from northwestern Spain could be classified into five clusters, suggesting that there were enough variability in these landraces to differentiate among appropriate populations for each one of the distinct crops (Padilla et al. 2005). Similar studies on 15 Chinese turnip accessions and 11 Iranian turnip accessions based on four and 16 morphological traits, respectively, also revealed high variability among accessions (Sun et al. 2007; Moghadam et al. 2014). Analysis of three European winter B. rapa (Turnip rapa) open-pollinated cultivars using 16 SSR markers indicated that genetic diversity within cultivars was also high (Ofori et al. 2008). However, most studies on genetic diversity of turnip were based on morphological traits or few molecular markers.

Various molecular markers including random amplified polymorphic DNA, restriction fragment length polymorphisms (RFLP), amplified fragment length polymorphisms (AFLP), simple sequence repeats (SSR), and single nucleotide polymorphism (SNP) have been developed and used for diverse purposes in Brassica (Ahmad and Khalil 2013). Although large-scale discovery combining high-throughput genotyping of SNPs have shown its power in genetic study in B. rapa (Cheng et al. 2016a, b; Tanhuanpää et al. 2016), high costs and technical or equipment demands is still a major obstacle to breeders particularly in the developing countries to adopt this approach. On the contrary, the genotyping of SSR is relatively easy and inexpensive with a simple PCR and electrophoresis. SSR can also be analyzed with high throughput technologies and in large-scale multiplexing (Tsonev et al. 2015; Li et al. 2017). Therefore, this marker type is still an attractive tool for genetic diversity analysis and marker-assisted selection in B. rapa. To date, a large number of SSR markers have been discovered in the expressed sequence tags (EST) and the genomic DNA sequences in B. rapa (Suwabe et al. 2002; Lowe et al. 2004; Choi et al. 2007; Hobson and Rahman 2016). In addition, efforts have also been put on discovering SSR markers in ESTs from turnip (Wu et al. 2012; Luo et al. 2012). Some SSR markers have been used in genetic diversity analysis, classification of species, and marker-assisted selection (Suwabe et al. 2002; Lowe et al. 2004; Choi et al. 2007; Hobson and Rahman 2016). However, most of these studies only used a limited number of turnip accessions as plant materials to test feasibility of markers without investigating the genetic diversity within turnip accessions.

This study was initiated to evaluate (i) the feasibility of applying SSR markers derived from other B. rapa crops to turnip and (ii) the genetic diversity in “Qamgur” and other turnip varieties in China. The experience gained in this study will provide molecular markers for the use in understanding genetic diversity and mapping genes controlling economically important traits, and provide breeders information on the genetic relationship between “Qamgur” and other turnip varieties for cultivar improvement in turnip.

Materials and methods

Plant materials

Thirty turnip varieties (Table 1) were used to compare genetic variation within and among populations. Of these, 18 were collected from six regions of Xinjiang Uyghur Region in northwest of China, while the remaining 12 were collected from six Provinces/Municipalities in the middle-east of China. All the 30 varieties belong to turnip root type. The seedlings were grown in 128 Square Plug Tray Deep filled with a mixture of peat soils and vermiculite (3:1) in the greenhouse for DNA isolation.

Table 1 Description of 30 turnip varieties used in this study

DNA isolation and marker analysis

Young leaves were collected from at least eight plants of each variety such that there was a greater than 99% probability of detecting both alleles at a given marker in a segregating variety (Huhn and Piepho 2003). Genomic DNA was isolated using the modified CTAB isolation method as described by Kabelka et al. (2002).

A total of 170 SSR markers (Table 2) were used to genotype all varieties. These markers were developed for B. rapa ssp. chinensis (Suwabe et al. 2002; Lowe et al. 2004) or B. rapa ssp. pekinensis (Choi et al. 2007) or EST sequences B. rapa ssp. rapa (Wu et al. 2012), and have been used to detect polymorphisms within the species of B. rapa. PCR reactions were conducted in a 10-μl reaction volume consisting of 5 μl 2 × Es Taq MasterMix (Beijing ComWin Biotech Company, Beijing, China), 0.5 μM each primer, 1 μl (5–10 ng) genomic DNA template, and 3 μl ddH2O. Reactions were heated at 95 °C for 5 min, followed by 36 cycles of 30 s denaturing at 95 °C, 30 s annealing at 52–58 °C depending on the Tm values of primer pairs, and 30 s extending at 72 °C, with a final extension of 5 min at 72 °C. The PCR products were subsequently separated in 7.5% polyacrylamide gel and visualized using the silver-staining approach (Chen et al. 2009).

Table 2 Primer sequences, number of alleles and polymorphic information content (PIC) of 52 markers in Xinjiang Qamgur and other turnip varieties

Genetic variation analysis

The presence or absence of each single fragment was coded by 1 or 0, respectively, and scored for a binary data matrix. Polymorphism information content (PIC) for each marker was calculated using the formula of PIC = 1 − Σp 2 i (Weir 1990), where p i is the frequency of i th allele for each marker locus. Nei’s genetic distance (Nei 1972) were calculated for each pair of varieties using the program in the software package NTSYSpc 2.11a (Rohlf 2002). Unweighted Pair Group Method with Arithmetic Mean (UPGMA) cluster analysis was performed to develop a dendrogram. Principal component analysis (PCA) was conducted using the Past 3.13 software (Hammer et al. 2001) to estimate relationships among varieties.

Population structure of the 30 turnip varieties was estimated by a model without prior population information using a free software package of STRUCTURE2.3.4 (Pritchard et al. 2000; Falush et al. 2003, 2007). Number of populations (K) and the best K were determined by the methods described in Wei et al. (2012).

Results

Success of PCR amplification and polymorphisms of SSR markers

The success rates of PCR amplification using genomic DNA of turnip as templates varied obviously among markers originated from different Brassica species. The markers based on EST sequences of turnip (B. rapa ssp. rapa) yielded the lowest PCR success rate (45.5%), while the markers originated from the genomic DNA sequences of B. rapa ssp. pekinensis had the highest of 97.4% (Table 3). However, all markers from B. rapa ssp. rapa with PCR success produced scorable bands, while only 34.2% markers from B. rapa ssp. pekinensis were scorable. Of the scorable markers from three Brassica species, the ability of detecting polymorphisms in turnip ranged from 60.0 to 83.3% (Table 3).

Table 3 PCR success rate and polymorphisms detected by SSR markers originated from various sources

Marker polymorphism and allelic variation in 30 turnip varieties

Among the 52 scorable markers, 75.0% detected polymorphisms in the whole collection of 30 turnip varieties. The proportion of polymorphic markers was lower in Xinjiang Qamgur (61.5%) than in other turnip varieties (73.1%). A total of 153 alleles were produced by the 52 markers with an average of 2.9 alleles and a range of 1–7 alleles per marker. Most alleles (82%) were shared by Xinjiang Qamgur and other varieties, while the remaining alleles were unique to either one (Fig. 1). The average PIC of 52 markers was also lower in Xinjiang Qamgur (0.254) than in other varieties (0.379) though the range was similar (0–0.783 in Xinjiang Qamgur and 0–0.800 in other varieties). Twenty-nine markers had lower PIC values in Xinjiang Qamgur than in other varieties, while 11 markers had higher PIC values in Xinjiang Qamgur than in other varieties (Table 2).

Fig. 1
figure 1

Pie diagram shows the proportion of common alleles among Xinjiang Qamgur and other turnip varieties

Genetic variation in 30 turnip varieties

The average Nei’s genetic distance for each of the 30 turnip varieties varied from 0.156 (Xinjiang Qamgur, W6 and W10) to 0.349 (Yuhuan Pancai, W23) with an average of 0.204 (Table 4). The mean genetic distance in Xinjiang Qamgur (0.127) was lower than in other varieties (0.235). Majority (77.0%) of the pairwise genetic distance in the whole collection of 30 varieties were between 0.101 and 0.300. However, the distribution of genetic distance for Xinjiang Qamgur concentrated on 0.101–0.200, while the genetic distance in other varieties had a wider distribution (Fig. 2).

Table 4 Statistics of Nei’s genetic distance for 30 turnip varieties
Fig. 2
figure 2

Distribution of Nei’s genetic distance values in 30 Xinjiang Qamgur and other turnip varieties

The genetic variation was narrower in Xinjiang Qamgur than other varieties. Xinjiang Qamgur varieties W4 and W7 had the largest genetic distance of 0.237, while W2 and W13 had the least genetic distance of 0.047. In other varieties, Yuhuan Pancai (W23) and Wenzhou Pancai (W24) had the least genetic distance of 0.089, while Yuhuan Pancai and Little turnip had the largest genetic distance of 0.499 (Table 4).

Genetic architecture of 30 turnip varieties

Phylogenetic tree based on the Nei’s genetic distance suggested that the 30 turnip varieties could be classed into four groups at the genetic coefficient of 0.20. All varieties of Xinjiang Qamgur (W1-W17, W20) were in group I, eight varieties (W18, W19, W21, W25, W27–W30) from North China were in group II, two varieties Little turnip (W22) and Imported turnip (W26) were in group III, and two varieties Yuhuan Pancai (W23) and Wenzhou Pancai (W24) were in group IV (Fig. 3). Groups I and II had a relatively close relationship, and were more distinct from groups III and IV.

Fig. 3
figure 3

UPGMA dendrogram of 30 Xinjiang Qamgur and other turnip varieties. Codes (W1–W30) for varieties can be found in Table 1

Principle component analysis was used to visualize and test relationships among the 30 turnip varieties. When Xinjiang Qamgur and other varieties were analyzed together, the first two principal components explained 30.5% of the total variation and clear clusters emerged (Fig. 4). The 30 varieties could be divided into three divergent clusters. Cluster I included all 18 varieties of Xinjiang Qamgur, cluster II contained all varieties from phylogenetic groups II and III, while cluster III consisted of two varieties Yuhuan Pancai (W23) and Wenzhou Pancai (W24).

Fig. 4
figure 4

The genetic relationships among 30 turnip varieties revealed by principle component analysis. Codes (W1–W30) for varieties can be found in Table 1

To assign the 30 varieties to an appropriate sub-population, a series of independent runs of the data were conducted at a range of values of K from one to 10 using the software STRUCTURE. Although the best number of sub-populations was two based on the plot of ΔK against K (Fig. 5a), the log likelihood L(K) still increased till K = 4 when the plateau occurred (Fig. 5b). From the summary plot of membership coefficients (Q), when K increased from two to 10, all 18 Xinjiang Qamgur always stood as one sub-population (Fig. 5c), while other 12 varieties could be assigned to one to seven sub-populations. When K = 4, the other 12 varieties could be divided into three sub-populations (Fig. 5c), which was exactly the same as the results of phylogenetic tree analysis.

Fig. 5
figure 5

Population structure of 30 Xinjiang Qamgur and other turnip varieties. Codes (W1–W30) for varieties can be found in Table 1. a Estimation of optimum number of groups (K). b The graph for the parameter L(K) and number of clusters (K). c Population structure

Discussion

The genetic relationship among 30 turnip varieties was investigated in this study. The data showed that the 18 varieties of “Qamgur” were always clustered together and distinct from the other turnip varieties. This could be due to the distinct shape and flavor of “Qamgur” from other turnips (Xie et al. 2014; Ma et al. 2016). Of the remaining 12 turnip varieties, eight from North China (Beijing Municipality, Tianjin Municipality and Hebei Province) formed one group, two Pancai from Eastern China (Zhejiang and Anhui Provinces) were in one group, while the remaining two clustered together. These results agree with previous findings that turnip varieties from certain regions were classified into the same cluster based on morphological, molecular or isozyme marker data (McGrath and Quiros 1992; Persson et al. 2001; Guo et al. 2014). This suggests that the long history of domestication and selection influenced the development of region-specific varieties. The genetic variation within the 18 varieties of “Qamgur” was low compared to the 12 varieties revealed by SSR marker data in the current study. This could be partly explained by the high morphological variation in fleshy roots in the 12 varieties than in the 18 “Qamgur” varieties. Of the 18 “Qamgur” varieties, 94.4% fleshy roots had the same color of green/white and 88.9% were round in shape, while the fleshy roots had four types of colors (white, red, yellowish white, and green) and three types of shapes (round, oblate, and long cylindrical) in the 12 varieties (Table 1).

Previous studies on genetic divergence of turnip suggest that both Asia and Europe are geographic origins of turnips (Takahashi et al. 2016; Bird et al. 2017). Phylogenetic tree analysis in the current study showed that the 30 accessions could be divided into four groups (Fig. 3), which could be supported by population structure analysis when K = 4 (Fig. 5c). However, principle component analysis suggested the existence of three clusters. Groups II and III in the dendrogram was in the same cluster in PCA (Fig. 4), which was consistent with the population structure when K = 3 (Fig. 5c). Whereas based on the best K estimation (Fig. 5a), the 30 accessions could be divided into two sub-populations (Fig. 5c, K = 2), one included 18 “Qamgur” accessions corresponding to group I in the dendrogram and the other contained 12 other accessions corresponding to groups II, III, and IV in the dendrogram (Fig. 3). Due to a lack of common accessions for direct comparisons between our study and others, the relationship between the 30 accessions from China and turnips from other countries cannot be determined. However, since “Qamgur” is uniquely from Xinjiang, the center of Eurasia, it is most likely that “Qamgur” is a native species there while other Chinese turnip varieties are introductions from other regions.

It has been reported that using markers developed for B. rapa have a higher chance of getting PCR products within subspecies of B. rapa than using markers from other Brassica species (Lowe et al. 2004). The PCR success rates varied from 59.0 to 83.1% and the polymorphic ratio ranged from 4.26 to 20.7% (Wu et al. 2012; Luo et al. 2012). Therefore, 170 SSR markers developed for B. rapa were adopted to analyze genetic variation in 30 turnip varieties in this study. Both PCR success rate (80.0%) and polymorphic ratio (75.0%) were much higher than previous reports (Wu et al. 2012; Luo et al. 2012). One possible interpretation could be that both previous studies used only two cultivars while this study included 30 varieties. The larger number of samples might increase the chance of detecting more polymorphisms. In addition, the data also showed that the success rate of PCR amplification using markers derived from B. rapa ssp. chinensis and B. rapa ssp. pekinensis was higher than using markers developed for B. rapa ssp. rapa (Table 3), which was out of our expectation. Previous study suggested that designing primers using EST sequences might increase the possibility of failure in getting PCR success when the PCR was conducted using genomic DNA as templates (Wang et al. 2010). In the present study, the SSR markers for B. rapa ssp. chinensis and B. rapa ssp. pekinensis were developed by mining the genomic DNA sequences (Suwabe et al. 2002; Lowe et al. 2004; Choi et al. 2007), while the SSR markers for B. rapa ssp. rapa were developed by mining the EST sequences (Wu et al. 2012). It was most likely that there were large introns between the coding sequences (EST) used for primer design resulting in large PCR products beyond the ability of Taq DNA polymerase to complete the reaction in a short time of 30 s.

Genetic distance in turnip accessions varied in different studies using diverse marker systems and germplasm. The genetic distance for five turnip accessions including three from Asia and two from Europe analyzed with five isozyme and four RFLP loci ranged from 0.089 to 0.184 with an average of 0.124 (McGrath and Quiros 1992), the genetic distance in 31 turnip accessions collected from Nordic area varied from 0.016 to 0.146 based on nine isozyme (Persson et al. 2001), while the genetic distances in 30 varieties analyzed with 52 SSR markers in this study were between 0.156 and 0.349 (Table 4). This suggested that the 30 turnip varieties cultivated in China had a higher genetic variation than those in other two studies. Although “Qamgur” varieties had a low genetic distance (0.098–0.160) within the group (Table 4), they were in a unique group that was different from other Chinese varieties. This result suggests that “Qamgur” might be a special source for developing new cultivars of turnip.

In conclusion, SSR markers developed for other B. rapa subspecies can be used for genetic study in turnip. The markers originated from the genomic DNA sequences of B. rapa ssp. pekinensis yielded the highest PCR success (97.4%) and polymorphic rate (83.3%) in turnip (Table 3). Therefore, they might be a good source for marker development in turnip before its genomic DNA sequence is available. In addition, “Qamgur” is a valuable source for developing new turnip varieties due to its genetic distinction from other turnip varieties in China.