Introduction

Dasypyrum villosum (L.) (Candargy, Dv; 2n = 14, VV) (L.) Schur [syn. Haynaldia villosa (L.) Schur], a wild relative of wheat, has been used to introgress genes that confer resistances to powdery mildew, rusts, take-all, eyespot, wheat spindle streak mosaic virus (WSSMV), tolerances to drought and frost, good tiller ability, and high grain protein content (Blanco et al. 1987; Murray et al. 1994; Chen et al. 1995; Grądzielewska 2006; De Pace et al. 2011). A large amount of common wheat-Dv cytogenetic materials, including amphiploid, disomic and ditelosomic addition lines, and disomic substitution lines, as well as translocation lines that display the expression of those useful genes have been developed (Grądzielewska 2006; De Pace et al. 2011). For example, the translocation line T6VS·6AL has both powdery mildew (Pm21 on 6VS) and stripe rust (Yr26 on 1B introduced from Triticum turgidum) resistance. The introduction of 6VS (and corresponding absence of 6AS) has no significant negative effect on yield or quality traits (Chen et al. 1995; Li et al. 2005b, 2007), and T6VS•6AL lines are widely used as parents in wheat breeding programs. More genes have been transferred into wheat by the development of alien translocation lines, including wheat yellow mosaic (WYM) resistance gene Wss1 (Zhang et al. 2005; Zhao et al. 2013), stem rust resistance gene Sr52 (Qi et al. 2011), grain softness genes (Zhang et al. 2010, 2012), seed storage protein genes (Zhao et al. 2010; Vaccino et al. 2010; Zhang et al. 2014), yield-related genes (Zhang et al. 2015), the powdery mildew resistance gene Pm55 (Zhang et al. 2016b) and the cereal cyst nematode resistance gene CreV (Zhang et al. 2016a). Such wheat lines represent excellent resources, and greatly enrich the genetic basis of wheat.

To improve the efficiency of gene transfer from Dv to wheat, it is necessary and important to develop molecular markers linked to useful genes for accelerating wheat plant breeding by marker-assisted selection (MAS). The development of a large number of molecular markers specific for Dv will help to accurately trace the introduced alien chromatin carrying useful alien genes. Although it is a diploid genome, the outcrossing breeding system of Dv makes its genome very complex. Several different types of Dv chromosome-specific molecular markers have been reported, including random amplified polymorphic DNAs (RAPD) (Qi et al. 1996; Li et al. 2005a), restriction fragment length polymorphism (RFLP) (Qi et al. 1996; Li et al. 2005a), simple sequence repeat (SSR) (Zhang et al. 2006), sequence characterized amplified region (SCAR) (Liu et al. 1999; Cao et al. 2006; Chen et al. 2006), conserved-intron scanning primers (CISPs) (He et al. 2013), and expressed sequence tags (EST-PCR) (Wang et al. 2007; Cao et al. 2009; Chen and Chen 2010; Zhao et al. 2012; Zhang et al. 2012; Chen et al. 2013; He et al. 2013; Zhao et al. 2014; Zhang et al. 2015; Bie et al. 2015; Zhang et al. 2016a). However, the number of molecular markers specific for Dv chromosomes is still very limited. Therefore, there is an urgent need to mine more molecular markers, especially those having a wider genomic distribution and amenable to high throughput genotyping, in order to identify alien introgression and translocation lines and also aid molecular genetics and breeding studies involving Dv.

Introns are an attractive source of polymorphism for marker development because insertions/deletions and base substitutions are more common within intron than within exon sequences (Kimura 1983). Intron length polymorphism has been considered as a convenient and reliable source of informative markers with high interspecies transferability (Poczai et al. 2013). Based on the orthologous gene conservation between rice and wheat (Ishikawa et al. 2007), a set of PCR-based landmark unique gene (PLUG) primer pairs were designed; subsequent analysis showed that PLUG markers are suitable not only for detecting the polymorphisms among wheat A, B and D genomes (Ishikawa et al. 2009), but also for identifying the homology between wheat and alien chromosomes. Thus, they are useful in MAS, comparative genomics, alien chromosome tracing, taxonomic studies and genotyping (Poczai et al. 2010; Li et al. 2013; Hu et al. 2012; Zhan et al. 2013).

Next-generation high-throughput DNA sequencing techniques (NGS) have opened opportunities for sequencing an entire plant genome at low cost, allowing fast characterization of genome structure at high resolution and detailed comparative study across species (Egan et al. 2012). All wheat chromosomes have been sequenced using NGS and their gene contents has been determined (International Wheat Genome Sequencing Consortium 2014). These new genomic resources have opened an avenue for efficient high-throughput marker development, high-density mapping, gene cloning, comparative genomics, functional studies and phylogenetic analyses.

In this paper, the Dv genome was sequenced and assembled to develop more specific gene markers. Based on blasting against the genome sequences of Triticum aestivum cv Chinese Spring (Poczai et al. 2013), intron length polymorphisms were identified and chromosome-specific molecular markers for Dv were successfully developed and allocated on specific Dv chromosome arms.

Materials and methods

Plant materials

Triticum durum-Dv amphiploid (AABBVV), T. aestivum-Dv disomic addition lines [DA1V, 3V–7V, in a Chinese Spring (CS) background] and T. aestivum -Dv disomic substitution line DS2V (in a Xiangmai1 background), T. aestivum -Dv translocation lines involving different Dv chromosome arms, including T1VL·1DS(CS), T1VS·1DL(CS), T2VL·2DS(CS), T2VS·2DL(CS), T4VL·4DS(CS), T4VS·4DL(CS), T5VS·5DL(CS), T6VL·6AS(CS) and T6VS·6AL(Yangmai 5) were developed by the Cytogenetics Institute, Nanjing Agricultural University (hereafter CINAU). The Dv (accession No. 91C43) was introduced from Cambridge Botanical Garden (Cambridge, UK).

NGS and assembling of Dv

About 24 μg genomic DNA from young leaves of Dv was used to create the eight shotgun DNA-seq libraries with 626–636 bp and 311–321 bp inserted sizes, respectively. The libraries were sequenced in a single lane of Illumina HiSeq 2500 platform. De novo assembly of the Illumina paired-end reads was performed using the software SOAPdenovo (version 2.0) using k-mer size of 31, which provided the assembly with the best sequence coverage and N50 size to generate Dv genome scaffolds.

Sequence resources for primer design

DNA sequences used in this work included the Dv assembly (SRP077844), assemblies and annotated genes of 21 chromosomes of CS (T. aestivum Ensembl plants, Release 30, http://plants.ensembl.org/index.html).

Homology and alignment analysis

The flowchart for designing the Dv chromosome-specific markers is shown in Fig. S1. In the first step, we chose a set of annotated genes on the subgenomic D genome of CS to calculate the exon–exon junction sizes in the genomic sequences of individual homoeologous groups of wheat chromosomes (WHG1–WHG7) and 1V–7V chromosomes of Dv, respectively. In the second step, all genes were compared with the genomic sequences of individual homoeologous groups of wheat chromosomes (WHG1–WHG7) and chromosomes 1V–7V of Dv using a local Blastn program, respectively. All genes matching individual homoeologous groups of wheat chromosomes (WHG1 to WHG7) and appropriate Dv 1V–7V assemblies and possessing at least one predicted exon–exon junction were selected. Intron sizes of the corresponding genes were then calculated and compared against each other. Genes whose intron sizes in Dv 1V–7V chromosomes differed by at least 10% from those in corresponding allele of individual wheat homoeologous groups (WHG1–WHG7) were chosen for marker design, respectively. The primer pairs were designed in the exons flanking the targeted introns.

Intron targeting primer design

Intron targeting (IT) primers were designed in the exon regions flanking a targeted intron using the online software Primer 3 (version 4.0, http://frodo.wi.mit.edu/primer3//primer3/). The designed primers were expected to amplify the genomic DNA both in common wheat and Dv. The PCR setting for these primers is: melting temperature 55–65 °C (optimum: 60 °C), primer length 18–25 bp (optimum: 20), the estimated sizes of the amplified fragments were approximately 50 bp more than the targeted introns. All primers were synthesized by Shanghai Invitrogen Biotechnology Company (Shanghai, China).

DNA extraction and PCR

Genomic DNA was extracted from 2 g fresh leaves of plants at three-leaf stage with the SDS-phenol-chloroform method according to Sharp et al. (1989) and Devos et al. (1992) and purified to eliminate RNA, amylase and other unwanted compounds. The DNA purity and concentration was assessed with microplate reader (M200, TECan, Männedorf, Switzerland). Each DNA sample was finally diluted to approximately 50 ng/μl and stored at −20 °C until use.

PCR amplification was carried out in a 10-μl reaction volume containing 40 ng genomic DNA, 2 μM each of the primer pairs, 2.5 mM each dNTPs, 2.5 mM MgCl2, 1 × PCR buffer (10 mM Tris-HCl, pH 8.5, 50 mM KCl), and 0.5 U Taq DNA polymerase with a PTC-200 thermal cycler (Bio-Rad, Hercules, CA). The amplification was conducted at 94 °C for 3 min, followed by 35 cycles at 94 °C for 30 s, annealing of different primers at 50, 55, or 60 °C for 50 s at a ramp rate of 0.5 °C/s, 72 °C for 1 min 10 s, and a final extension at 72 °C for 10 min. PCR products were resolved in 8% non-denaturing polyacrylamide gels (Acr: Bis = 19: 1 or 39: 1) and the band patterns were visualized by silver staining (Bassam and Gresshoff 2007).

Results

Amplification patterns of different IT markers

In our previous study, we developed IT markers specific for chromosome arms 4VS and 6VS of Dv by chromosome sorting and NGS. In the present paper, we focused on the remaining arms of Dv chromosomes. A total of 1624 primer pairs was designed, including 265 for 1V, 332 for 2V, 211 for 3V, 151 for 4VL, 328 for 5V, 138 for 6VL and 199 for 7V (Table 1). They were used for amplification in CS, Dv and T. durum-Dv amphiploid. The results indicated that the majority (97.84%) of these primer pairs were suitable for producing PCR amplicons in all the three materials. Only 841 primer pairs (51.79%) amplified specific polymorphic fragments that were common in Dv and T. durum-Dv amphiploid, indicating they were specific for the genome of Dv. We also found that 35 primer pairs had no amplicons in any of the three materials.

Table 1 Number of intron targeting (IT) markers obtained by PCR amplification of the nuclear DNA of CS, Dv, Triticum durum-Dv amphiploid, using primer pairs specific for introns of the Dv chromosome arms

Allocation of IT markers to specific Dv chromosomes

To allocate specific markers on Dv chromosomes, the 841 primer pairs were used for amplification in different wheat-Dv addition/substitution lines involving different Dv chromosomes. If a primer pair amplified a distinct PCR product in Dv, T. durum-Dv amphiploid and an alien chromosome line involving one of the Dv chromosomes, while not in T. durum and CS, it might serve as a specific marker for this alien chromosome. To summarize, we assigned 135, 175, 120, 89, 140, 71, and 111 IT specific markers to 1V, 2V, 3V, 4V, 5V, 6V and 7V, respectively (Tables S1, S2).

Some of the amplification results for the 21 primer pairs are shown in Fig. 1. The representative three types of IT markers corresponding to the sequence information in subgenome AA, BB, DD and VV, the sequences of forward and reverse primers, and the intron sizes are given in Tables 2 and 3, respectively. For example, as shown in Fig. 1a, primer CINAU954 produced three bands with different sizes in Chinese Spring (AABBDD), presumably on AA, DD and BB, respectively, according to the predicted amplified size of the flanking intron. The primers also produced a band in Dv (VV) that has different size (500 bp) from those in CS, indicating this band is V genome specific. When amplified in T. durum-Dv amphiploid (AABBVV), CS-Dv addition/substitution lines (DA1V, DA3V–7V and DS2V), the primer pair produced the same band in DA1V and T. durum-Dv amphiploid as that in Dv (VV); thus, we assigned this 500 bp band to chromosome 1V.

Fig. 1a–c
figure 1

Bands in non-denaturing 8% polyacrylamide gels electrophoresis evidencing PCR products of intron targeting (IT) markers specific for individual chromosome of Dv. Lanes: M Marker, 1 Chinese Spring (CS), 2 Dv, 3 T. aestivum-Dv amphiploid, 4 DA1V, 5 DS2V, 6 DA3V, 7 DA4V, 8 DA5V, 9 DA6V, 10 DA7V. a The amplification of representative Type I IT markers (CINAU954, CINAU1068, CINAU1205, CINAU1331, CINAU1454, CINAU1532, CINAU1566) specific for Dv chromosome of 1V, 2V, 3V, 4V, 5V, 6V and 7V from top to bottom, respectively. b The amplification of representative Type II IT markers (CINAU862, CINAU1010, CINAU1162, CINAU1325, CINAU1459, CINAU1542, CINAU1661) specific for Dv chromosome of 1V, 2V, 3V, 4V, 5V, 6V and 7V from top to bottom, respectively. c The amplification of representative Type III IT markers (CINAU893, CINAU1067, CINAU1161, CINAU1295, CINAU1474, CINAU1517, CINAU1608) specific for Dv chromosome of 1V, 2V, 3V, 4V, 5V, 6V and 7V from top to bottom, respectively. Arrows Amplified bands specific for Dv individual chromosomes

Table 2 Catalog of some of the Type I, II, and III markers displaying nucleotide sequence homoeology to sequence scaffolds in the chromosomes of the A, B, D, and V genomes. CS Chinese Spring
Table 3 Sequences of forward and reverse primers for identifying Type I, II, and III IT markers for syntenic introns in the homoeologous chromosomes of the A, B, D, and V genomes

We classified the developed IT markers into different types based on the observed two, three or four band patterns on the polyacrylamide gel. Type I markers evidenced four bands, three of which each contained an amplicon from one of the orthologous introns in the three chromosomes of the same homoeologous group of CS (WHG1 to WHG7), and the fourth band containing an amplicon from the orthologous intron in the Dv-chromosome syntenic to that wheat homoeologous group. For example, CINAU954 is 1V specific, and amplified fragments specific for 1V-, 1A- 1B- and 1D- simultaneously (Fig. 1). These markers are co-dominant, and are useful to simultaneously trace the alien Dv chromosome and its wheat homoeologous group chromosomes.

Type II markers evidenced three bands: one band containing amplicons of the same size from orthologous introns (Tables 3, S3, S4) in any of two chromosomes of the same wheat homoeologous group, the second band containing the amplicon from the third chromosome of that homoeologous group, and the third band containing an amplicon from the orthologous intron in the Dv-chromosome syntenic to that wheat homoeologous group. For example, CINAU1010 is specific for 2V and amplified fragments specific for 2V-, and any two of wheat chromosomes 2A- 2B- or 2D- (Fig. 1).

Type III markers evidenced two bands: one band containing amplicons of the same size from introns (Tables 3, S3, S4) on three homoeologous chromosomes of the same wheat homoeologous group, and the second band containing an amplicon from a Dv-chromosome corresponding to that wheat homoeologous group. For example, CINAU1161 is specific for 3V and amplified a specific fragments from 3V-, and another fragment from CS chromosomes (possibly one of the 3A or 3B or 3D), or two CS chromosomes of (3A and 3D, 3D and 3B, or 3A and 3D); or all three chromosomes (3A, 3D and 3B) (Fig. 1).

Allocation of the IT markers to specific arms of Dv chromosomes

To further map these specific IT markers on specific arms of individual Dv chromosomes, all the IT markers specific for Dv chromosomes were used for amplification in different alien translocation lines. Using chromosome 2V as an example, if the amplification of a primer pair specific for 2V generated a common polymorphic band in Dv, T. durum-Dv amphiploid, translocation line T2VS·2DL (or T2VL·2DS), but not in CS, it is considered as a specific marker for chromosome arm 2VS (or 2VL)(Fig. 2, Tables S3, S4). Using those criteria, we assigned 47, 57, 44, 3, 51 and 51 IT specific markers to chromosome arms 1VS, 2VS, 3VS, 4VS, 5VS and 7VS, respectively. Similarly, 88, 118, 76, 86, 89, 71, and 60 IT specific markers were assigned to chromosome arms 1VL, 2VL, 3VL, 4VL, 5VL, 6VL and 7VL, respectively (Tables S1, S2).

Fig. 2a–e
figure 2

Bands in non-denaturing 8% polyacrylamide gel electrophoresis evidencing PCR products of IT markers specific for individual chromosome arms of Dv. Lanes: M Marker; 1 CS; 2 Dv; 3 T.aestivum-Dv amphiploid; 4, 5 different T.aestivum-Dv translocations. a Lanes: 4 T1VL·1DL, 5 T1VS·1DS. The amplification of different representative IT markers (CINAU944, CINAU939, CINAU967 from left to right panels, respectively) specific for chromosome arm 1VS. b Lanes: 4 T2VL·2DS, 5 T2VS·2DL. The amplification of different representative IT markers (CINAU1011, CINAU1015, CINAU1013 from left to right panels, respectively) specific for chromosome arm 2VL. c Lanes: 4 T4VL·4DS, 5 T4VS·4DL. The amplification of different representative IT markers (CINAU1295, CINAU1325, CINAU1319 from left to right panels, respectively) specific for chromosome arm 4VL. d Lanes: 4 DA5V, 5 T5VS·5DL. The amplification of different representative IT markers (CINAU1404, CINAU1363, CINAU1454 from left to right panels, respectively) specific for chromosome arm 5VL. e Lanes: 4 T6VL·6AS, 5 T6VS·6AL. The amplification of different representative IT markers (CINAU1517, CINAU1541, CINAU1532 from left to right panels, respectively) specific for chromosome arm 6VL

Discussion

In recent years, the great accumulation of genetic markers and a large number of DNA sequences have made the study of comparative genomics in the grass family more feasible. It is found, in most cases, that the linear order (colinearity) of genetic markers is well conserved and is retained at the molecular level (microcolinearity) (Gale and Devos 1998). Sponteneous substitution usually occurs among chromosomes belonging to the same homoeologous groups; thus, the identified T. aestivum-Dv substitution lines combined with isozyme analysis have been used successfully for assigning the added V genome chromosomes to specific homoeologous groups (Liu et al. 1995). RFLP and microsatellite molecular markers further confirmed the homoeology relationship between Dv and wheat chromosomes (Qi et al. 1999; Zhang et al. 2006). Zhao et al. (2014) developed EST-based molecular markers and proved that chromosome 4V is highly homoeologous to wheat homoeologous group 4 chromosomes.

However, the above types of molecular markers developed for Dv chromosomes have some disadvantages. RFLPs are dependent on radioactive probes and are also time-consuming. SSRs have low transferability among wild relatives and their application in wheat breeding is limited by the lack of locus specificity (Mullan et al. 2005). Out of 276 SSR markers, 148 could amplify polymorphic amplicons between Dv and CS, but only 7 (wmc49 on 1BS, wmc25 on 2BS, gdm36 on 3DS, gdm145 on 4AL, wmc233 on 5DS, wmc256 on 6AL and gwm344 on 7BL) amplified specific polymorphic DNA fragments from chromosome 1V to 7V, respectively (Zhang et al. 2006). Wheat EST bin maps have been explored as a source for development of specific markers for alien chromosomes, but the polymorphic rate is very low (Qi et al. 2007). For example, Zhao et al. (2014) designed a total of 607 EST-based primer pairs, but only 58 (polymorphic rate being 9.23%) could amplify bands specific for chromosome 4V. A total of 240 EST-based STS primer pairs was designed, and only 13 (polymorphic rate being 5.42%) were specific for chromosomes of Dv (Cao et al. 2009). In the present study, we developed 841 chromosome-specific IT markers for Dv based on NGS technology, and the polymorphic rate was as high as 51.79%, indicating this is a highly efficient approach for developing chromosome-specific markers with higher success rate, specificity, stability and lower cost.

The IT markers discovered in this study are based on the sequence conservation of orthologous genes. All annotated genes from subgenomic D genome of CS were compared with genomic sequences of Dv using a local Blastn program. We found that most genes located on homoeologous group 1 to 7 wheat chromosomes could hit the genes from the corresponding homoeologous 1 to 7 Dv chromosomes, respectively. Using different alien translocation lines previously identified by FISH and molecular markers in our laboratory, these specific IT markers were mapped to specific arms of individual Dv chromosomes. The 1VL (88), 3VL (76), 5VL (89), 6VL (71) and 7VL (60) IT markers were derived from the long arm of the wheat homoeologous group 1, 3, 5, 6, and 7, respectively (Table S2). The IT markers for the 1VS (47), 3VS (44), 5VS (51) and 7VS (51) were derived from the wheat 1S, 3S, 5S and 7S arm, respectively.

However, we also observed some exceptions. For chromosome 2VL, all 118 IT markers were derived from the long arm of wheat group 2, while for 57 IT markers for 2VS, 56 were derived from short arm of wheat group 2, and one (CINAU975) was from the long arm of wheat group 2. For chromosome 4V, all 86 4VL IT markers were derived from the long arm of wheat group 4, and three 4VS IT markers (CINAU1275, CINAU1290 and CINAU1349) were also derived from the wheat 4L. This confirmed that Dv and T. aestivum had a close phylogenetic relationship, and suggested that their collinear regions were highly conserved, reinforcing the homoeologous relationship between Dv and wheat chromosomes (Table S5) (Qi et al. 1999; Zhang et al. 2006). The four exceptions indicated there present complex syntenic relationships between the wheat and Dv genomes, which may be the results of chromosomal rearrangements that most likely occurred during the karyotype evolution of Dv.

The IT markers developed in this study will dramatically increase the density of the Dv physical and cytological map to detect structural changes in Dv chromosomes or arms. Furthermore, some of the IT markers are co-dominant and hence will be very helpful in identifying chromosome changes both for the alien chromosomes and wheat chromosomes in a large population. In breeding program, they can be used to distinguish the homozygous and heterozygous individuals by MAS.