Introduction

Bread wheat (Triticum aestivum L.) is the most widely grown crop in the world and feeds 40% of the world’s population. It is a hexaploid species with a nuclear genome consisting of seven homoeologous groups of chromosomes. Each group includes three chromosomes from each of the three homoeologous subgenomes (A, B, and D). Using a recombinant inbred line population derived by crossing the synthetic hexaploid line W7984 with Opata 85, a reference genetic map (International Triticeae Mapping Initiative, ITMI) of the wheat genome containing more than 2,400 loci was constructed; the loci were mainly detected by restriction fragment length polymorphism (RFLP) markers (Nelson et al. 1995a, b, c; Van Deynze et al. 1995; Marino et al. 1996), genomic simple sequence repeat (gSSR) markers (Röder et al. 1998; Pestsova et al. 2000; Gupta et al. 2002; Song et al. 2005; Hayden et al. 2006), and expressed sequence tag (EST)-derived SSR markers (eSSR markers) (Gao et al. 2004; Nicot et al. 2004; Yu et al. 2004). Recently, a few wheat genome maps were constructed using populations derived from intervarietal crosses for mapping genes with agronomical and adaptive significance (Paillard et al. 2003; Sourdille et al. 2003; Quarrie et al. 2005; Suenaga et al. 2005; Torada et al. 2006); all of these maps span over 3,000 cM. Although some of these maps have as many as six gaps larger than 50 cM, they show good consistency with the ITMI map and are complementary to one another in regional coverage. These maps improve understanding of the wheat genome and permit the localization of genes related to target traits such as yield, quality, and resistance to biotic and abiotic stresses. However, because of the low level of polymorphism among wheat cultivars (Bryan et al. 1999), these intervarietal maps are usually low in marker density and incomplete in their genome coverage.

ESTs represent partial complementary DNA (cDNA) sequences from expressed genes. Determining the order of ESTs along the chromosomes through genetic mapping would increase the marker density of genome maps, complement the sequence-based EST bin maps (Qi et al. 2004), facilitate genomic information transfer from related species, and directly target parts of the genome that could aid the identification of genes of agronomical importance. In addition to eSSR markers, EST-derived sequence-tagged-site markers (eSTS) have been extensively explored in cereal crops such as rice (Harushima et al. 1998), maize (Davis et al. 1999), and barley (Stein et al. 2007). They are derived from various tissues at different growth stages or under different treatments. More than 1 million wheat ESTs (NCBI EST database, 2007, 159th release) are currently available. Thus, construction of an EST marker-rich genetic map of wheat is feasible.

In this study, we report a high-density intervarietal map of the wheat genome enriched with newly developed eSSR and eSTS markers using a recombinant inbred line (RIL) population developed by crossing two bread wheat varieties Nanda2419 and Wangshuibai.

Materials and methods

Plant materials

The mapping population used in this study is a set of 136 F7 recombinant inbred lines developed by single-seed descent from an intervarietal cross between bread wheat varieties, Nanda2419 and Wangshuibai. In addition, a set of Chinese Spring nulli-tetrasomic lines was used for chromosomal assignment of some polymorphic markers.

Marker development and analysis

The gSSR markers, eSSR markers, eSTS markers, STS-RFLP markers as well as some 10-oligo-mer randomly amplified polymorphic DNA (RAPD) markers were used in map construction (Table 1). SCAR marker S1021 was converted from a RAPD marker. The EST-derived markers included 43 markers from cnl, ksum (Yu et al. 2004), and sts series (Liu and Anderson 2003; Liu et al. 2006); 320 eSSR markers (MAG464-624 and MAG792-953, synthesized based on primer sequences derived from ESTs for putative transcription factors and signal transduction factors published by Triticeae EST-SSR Coordination, http://wheat.pw.usda.gov/ITMI/2002/EST-SSR/), and 2,293 new markers developed in the authors’ lab.

Table 1 Types of markers surveyed and the polymorphism detection rates

Among the new developed markers, 138 were eSSR marker type and 2,155 were eSTS marker type. They were designed based on 1,637 putatively Fusarium infection-inducible ESTs from seven cDNA libraries of Sumai No. 3 spikes (http://www.ncbi.nlm.nih.gov) and 656 other ESTs. With these ESTs, we used the Perl script MISA program (http://pgrc.ipk-gatersleben.de/misa/misa.html) to screen for SSRs containing 2–5 repeat motifs with a run of at least 16 nucleotides. All primers were designed with the program MacVector 8.1 (Accelrys, UK) or Primer 3 (http://www.basic.northwestern.edu/biotools/Primer3.html). Moreover, 68 STS-RFLP markers were converted from RFLP probes mapped on the ITMI map, and STS markers were developed for six known genes, including TaGST-zeta, TaLTP3, TaHO and TaPGAM from the authors’ lab, which encode for glutathione S-transferase, lipid transfer protein, hydroxylase and phosphoglycerate mutase, respectively, and Vrn-A1 (Yan et al. 2003) and Wx-A1b (Vrinten et al. 1999). The primer sequences, expected product sizes, chromosome locations and other related information for the new markers that detected polymorphism between the mapping parents is provided in Table S1 of the online Electronic Supplementary Materials.

DNA was extracted according to Ma and Sorrells (1995). Polymerase chain reaction (PCR) for RAPD was performed in a PE9600 thermal cycler (Perkin-Elmer, Norwalk, CT, USA) programmed for 3 min at 96°C, followed by five cycles of 40 s at 96°C, 50 s at 35°C, and 1 min at 72°C; 40 cycles of 40 s at 94°C, 50 s at 36°C, and 1 min at 72°C; and 5 min at 72°C. SSR and STS amplification was performed following the procedure of Ma et al. (1996). The PCR products were separated in 8% non-denaturing polyacrylamide gels (Acr:Bis = 19:1 or 39:1) at room temperature in vertical gel apparatus (170 × 150 × 1.0 mm or 105 × 185 × 1.0 mm) with 1 × TBE buffer and visualized by silver staining (Bassam et al. 1991).

Linkage mapping

The genetic linkage map was constructed with MAPMAKER/EXP Version 3.0 (Lincoln et al. 1992). Loci were grouped using the criteria LOD = 3.0 and recombination fraction = 0.15. Linkage groups belonging to the same chromosomes were integrated using the criteria LOD = 3.0 and recombination fraction = 0.40. Multipoint analysis was used to construct the framework map. The order of “Framework” loci was determined using the “order” command with a LOD threshold of 3.0 and checked using the “ripple” command. Other markers were placed on the framework map using the “try” and “ripple” commands. The maps were drawn with MAPMAKER Version 2.0 (Lander et al. 1987), with the recombination distances calculated with the Kosambi mapping function (Kosambi 1944). Double crossovers between closely linked markers were checked to avoid misscoring. The segregation fit of each locus to a 1:1 ratio was examined using the chi-square test. A region with at least three adjacent loci showing significant segregation distortion (P < 0.05) was defined as the segregation distorted region (SDR) (Paillard et al. 2003).

In order to evaluate the synteny relationship and regional coverage between the Nanda2419–Wangshuibai (NW) map and the ITMI map, data for 1,592 markers were downloaded from the GrainGenes website to reconstruct the ITMI map. The framework marker order was determined by referring to Song et al. (2005). Other markers were placed on the framework map using the “try” and “ripple” commands. Data for 154 markers were not included in the map because they considerably extended the map length when placed into the linkage groups.

Results

Polymorphic survey of the mapping parents

We surveyed the mapping parents Nanda2419 and Wangshuibai with 4,180 PCR-based markers, including RAPD, gSSR, eSSR, eSTS, and STS-RFLP markers, in addition to known genes (Table 1). Seven hundred eighty-five of these markers (18.8%) detected polymorphism. The efficiencies of gSSR, eSSR, eSTS, STS, and RAPD markers in detecting polymorphism between the parents varied from 9.9% with eSSR to 35.5% with gSSR. Most SSR markers detected codominant loci, while 66% of the eSTS markers detected dominant loci.

For the newly developed 2,293 EST markers, only 298 (13.0%) gave no amplification or produced smear products (Table 2). The reasons that led to the unsuccessful amplification could be mostly attributed to poor EST sequence quality or sequence polymorphism between different genotypes, since re-designing primers could often alleviate this kind of problem (Z Ma and H Yi, unpublished data). A little higher level of successful amplification was achieved for the eSSR markers than for the eSTS markers; however, both types of markers detected a similar level of polymorphism with an average polymprohism detection rate of 12.8% (Table 2), which was two and three times lower than those for the surveyed STS-RFLP markers and gSSR markers (Table 1). Interestingly, among markers producing clear bands, only 12.5% eSSR markers, but 33.4% eSTS markers produced a single strand. All eSSR primer pairs and 93.4% eSTS primer pairs that produced clear bands amplified products within the expected size ranges or larger than the expected sizes.

Table 2 Amplification of Nanda2419 and Wangshuibai genomic DNA with the newly developed EST markers

Among the gSSR marker loci surveyed, 243 had been previously mapped to the A subgenome, 272 to the B subgenome and 334 to the D subgenome, with 39.5, 40.4, and 30.5% of them detecting polymorphism, respectively. Consistent with other mapping studies of wheat, the D subgenome had the lowest level of polymorphism. Less than 25% of the surveyed markers previously mapped to chromosomes 4D, 6A, and 6D detected polymorphism; this was one of the lowest rates. For the published intervarietal maps, chromosomes 4D and 6D usually had the fewest mapped markers. It seems that these two chromosomes are best conserved in varieties. Chromosomes 4B and 5B were most polymorphic between Nanda2419 and Wangshuibai among all the chromosomes; 54.6 and 50.0% of the surveyed markers previously mapped to these two chromosomes, respectively, detected polymorphism. Over 40% of markers surveyed for each of the chromosomes 1A, 2A, 2D, 3A, 3B, 4B, 5A, 5B, 6B, and 7A was polymorphic.

Most of the SSR and STS markers detected a single locus, but 86 of them detected two or more loci. One marker, GWM497, identified five polymorphic loci. The multiple polymorphic loci detected by a marker were located, in approximately half of these cases, in the same homoeologous group as shown in the mapping construction.

Genetic mapping

We obtained population data of 908 marker loci detected by the polymorphic markers. The data in dominant loci were verified with internal control. After excluding 11 loci that could not be placed into linkage groups and 10 loci that considerably extended the map length when placed into the linkage groups, the chromosome-based marker linkage maps were established. Chromosome assignment of the linkage maps was determined using the known mapping positions of gSSR markers and nulli-tetrasomic mapping of some markers. To save space, the maps showing order and location of markers and other related information were provided in Fig. S1 as online Electronic Supplementary Materials.

This Nanda2419–Wangshuibai (NW) map comprises 887 marker loci, including 542 detected by anonymous markers and 345 detected by EST-derived markers (273 by the new EST markers), targeting at least 794 discrete positions in the wheat genome. It covered 4,223.1 cM of the wheat genome, excluding the seven >40 cM gaps distributed on chromosomes 2A, 2D, 3D, 4D, and 6D (these gaps were also excluded from later analysis, unless specifically indicated) (Fig. S1 and Table 3). The A, B, and D subgenome maps spanned 1,478.3, 1,510.4, and 1,234.4 cM, respectively. This coverage is proportional to the respective subgenome size estimated by Gill et al. (1991). However, the genetic markers did not evenly distribute across the subgenomes. Twice as many loci mapped to the B subgenome versus the D subgenome and the B subgenome had a much higher marker density compared with the A and D subgenomes (Table 3). There was also great variability in the number of loci mapping to individual chromosomes, which, however, was not proportional to the individual chromosome lengths (χ2 = 128.2, P ≤ 0.0001), suggesting that the polymorphism level varied between different chromosomes.

Table 3 Distribution of mapped markers on 21 wheat chromosomes in the NW map

Marker density

In the NW map, the marker density of individual chromosomes ranged from 2.2 markers/20 cM for 5D to 7.8 markers/20 cM for 6B (Table 3). The EST-derived markers significantly increased the marker density of chromosomes 4A, 6B, 7A, and 7B (Fig. S1), where the number of loci mapped by EST-derived markers exceeded that mapped by anonymous markers. The marker density also varied within the individual chromosomes. In 30 regions, distributed on all chromosomes but 1B, 1D, 4D, 5D, 6D, and 7D, more than eight loci were mapped per 20 cM (Fig. S1). Sixteen of these regions with a high marker density occurred in interstitial regions and four occurred near the telomeres. The EST-derived marker loci accounted for 46.2% of the loci in these high marker density regions, whereas they accounted for only 38.9% in the map as a whole. Particularly significant was the 17 cM Xgdm19.2Xmag4089 interstitial interval on chromosome 2D, where 17 EST-derived marker loci were mapped (Fig. S1).

Regions of low marker density or gaps occurred on almost every chromosome, most of which were present in subgenomes A and D. Among the seven large gaps, five were in the D subgenome. It appeared that these gap regions are conserved among the three subgenomes of common wheat, for example, the 47.4-cM gap on 2DS and the short arm of 2A, the 43.8-cM gap on 3DS and the 27.0-cM gap on 3AS, the 33.4-cM gap on 5DL and the 23.7-cM gap on 5AL, the 40.5- and 42.0-cM gaps on 6D and the 35.1- and 30.3-cM gaps on 6A, and the 35.6-cM gap on 7DL and the 26.4-cM gap on 7AL (Fig. S1). The one gap greater than 50 cM flanked by Xsts-cdo1312 and Xmag2944 on 4D was also homoeologous to the 24.5-cM gap region flanked by Xwmc413 and Xmag1682 on 4B.

Segregation distortion regions

The majority of mapped loci in the NW map segregated in the expected 1:1 ratio in the population. However, segregation of 162 mapped loci, over half of which are inherited in a codominant manner, significantly deviated from this ratio (P < 0.05). Approximately 70% of the dominant loci showing segregation distortion (SD) were mapped by eSTS markers. The SD loci clustered in 15 regions, distributed on chromosomes 1B, 2A, 2B, 2D, 3A, 4D, 5A, 6B, 6D, and 7B (Fig. S1), in ten of which the Wangshuibai alleles were favored. All seven homoeologous groups have been involved in SD, however, nearly half of the SDRs were identified in the B subgenome and the D subgenome ranked second in the number of SDRs. The two adjacent 4D SDRs skewed towards the opposite parents, and as a result, might have extended the distance between them (Fig. S1). Xmag1652, which flanks SDR-4D1 and is closest to the telomere of the short arm, exhibited the most skewed segregation among the loci mapped to this region, with 78% of the RILs having the Nanda2419 allele.

Discussion

We have presented a 4,223.1 cM intervarietal genetic marker map of wheat using the Nanda2419 × Wangshuibai RIL population. Of the two mapping parents, Nanda2419 is a selection from Italian wheat variety Mentana and has been widely grown in China. It could be found in the pedigrees of more than 110 Chinese cultivars and therefore was viewed as one of key breeding parents in China’s wheat breeding programs. Wangshuibai is an indigenous germplasm originating in Jiangsu, China, best known for its excellent scab resistance (Lin et al. 2004, 2006). The different geographical origins of both parents did not lead to a high level of polymorphism. The polymorphism level detected between them by gSSR markers was 9.5% lower than that reported by Suenaga et al. (2005) between cultivars Fukuho-komugi and Oligoculm, and 25.5% lower than that reported by Paillard et al. (2003) between winter cultivars Arina and Forno. Between ‘Opata 85’ and the synthetic wheat ‘W7984’, as high as 80% of the SSR primer pairs detected polymorphism (Röder et al. 1998). In our results, the newly developed eSTS markers and eSSR markers had similar polymorphism detection efficiency (Table 2). However, we were able to raise the efficiency for eSTS markers from 118 ESTs or EST contigs to 34.7% by designing primers near the 5′ or 3′ untranslated regions (unpublished data).

The NW map was constructed with only PCR-based markers. There are 30 intervals with a high marker density in the map, with 20 of which mapped to the interstitial or telomerical regions where genes often cluster (Qi et al. 2004). Since the EST-derived markers represented nearly half of the loci mapped to these regions, this map would be especially useful for mapping traits with functional markers. This map conforms well to other published intervarietal maps according to the positions and orders of commonly mapped markers. As far as we know, among the intervarietal maps, the NW map has the best genome coverage, a higher average marker density, less gaps, and is 275 cM larger than the next best map, the 3,948-cM FkO map (Suenaga et al. 2005), which includes 11 > 40 cM gaps. Efforts were made to close or minimize the gaps by surveying STS markers converted from RFLP markers mapped to these regions or ESTs mapped to the corresponding deletion bins or chromosomes besides the gSSR markers found in the published maps. However, we were unable to substantially close the gaps due to the lack of polymorphism. For instance, in the 73.9-cM region between Xwmc312 and Xgwm99 on chromosome 1A, only two of the 13 surveyed markers detected polymorphism; in the 30.3-cM gap flanked by Xbarc113 and Xgwm169 on chromosome 6A and in the 40.5-cM gap flanked by Xgwm469 and Xwmc773 on chromosome 6D, none of the surveyed markers (14 and 15, respectively) detected polymorphism (Fig. S1).

To compare the NW map and the ITMI map, a 5,758.7 cM ITMI map was constructed (see the online Electronic Supplementary Materials in Fig. S2) with all markers publicly available. By aligning 185 common markers mapped in both maps, we found only 20 intervals that were inconsistent between the two maps. These inconsistencies were often involved in small intervals, and 13 of them resided in regions surrounding the centromeres, for instance, those on chromosomes 2A and 5B. Thus, they might have resulted from inaccuracy in ordering loci due to lack of recombination. Another possible reason for these inconsistencies is structural change in the chromosomes. Xgwm18 and Xgwm246.1 are two tightly linked loci on chromosome 1B. They have the same orientation in both the NW map and the Kitamoe–Münstertaler (KM) map (Torada et al. 2006), with Xgwm18 located on 1BS and Xgwm246.1 located on 1BL, as shown by deletion mapping (Sourdille et al. 2004). However, in the ITMI map, these loci were mapped to the short arm in a reversed order and with a genetic distance of 60.3 cM. Similar inconsistencies were also observed between the ITMI map and other intervarietal maps (Cadalen et al. 1997; Paillard et al. 2003).

Even though the NW map was more than 1,500 cM shorter than the ITMI map in genome coverage, it only lacked coverage of the terminal regions of 2AS, 3AL, 6AL, 6BL, and 6DL that totaled 247.8 cM in the ITMI map. Furthermore, it covered approximately 158 cM that were absent in the ITMI map, including the terminal regions of 2BS, 4BL, 5AS, and 5BL (Fig. S2). Moreover, some regions in the NW map, such as the distal portions of chromosomes 4AL, 5BL, 7AL, and 7BL, had an even better marker coverage or a higher resolution. Thus, these two maps complement each other well. In most cases, the regions with a better coverage in the NW map were rich in EST-derived markers.

Interestingly, we found that regions of low marker density were largely conserved among cultivars and between homoeologous subgenomes, as shown in the NW map and other published intervarietal maps. All 13 > 30 cM gaps in the NW map were corresponding to the low marker density regions in the ArFo (Paillard et al. 2003), CSSQ1 (Quarrie et al. 2005), and KM (Torada et al. 2006) intervarietal maps, even though the corresponding gap size varied between them (Table 4). This kind of conservation could be due to the low level of polymorphism in the corresponding regions, since these regions in the ITMI map, whose mapping parents include a synthetic wheat line, had a much higher marker density (Table 4).

Table 4 Comparison of gaps greater than 30 cM in the NW map to corresponding intervals in four other genetic maps of wheat

In the NW map, 15 intervals showed SD, involving 10 of the 21 chromosomes. As in this study, Groos et al. (2002) reported SDRs on chromosomes 2A, 2B, and 2D, even though the correspondences in locations between both studies could not be determined. The corresponding regions of the two 2B SDRs showed SD in the CtCS and ArFo maps (Sourdille et al. 2003; Paillard et al. 2003). SDRs on the short arm and the pericentromeric region of chromosome 4D have been identified in the CSSQ1 (Quarrie et al. 2005) and FkO (Suenaga et al. 2005) double haploid populations, in a wheat × T. spelta RIL population (Messmer et al. 1999), and in an Aegilops tauschii F2 population (Faris et al. 1998). The SDR-5A corresponding region has been involved in SD in four different populations (Messmer et al. 1999; Peng et al. 2000; Groos et al. 2002; Liu et al. 2005). Campbell et al. (1999), Sourdille et al. (2003), and Liu et al. (2005) reported distorted segregation in the region corresponding to SDR-6B (Fig. S1). SD in the regions corresponding to SDR-6D1 and SDR-7B1 were reported, respectively, by Suenaga et al. (2005) and Messmer et al. (1999). Since over 80% of the loci segregated normally in the NW population, physiological conditions are unlikely the reason of SD. Genetically, both gametophytic genes and mutants unfavorable to growth and reproduction could cause skewed segregation of linked alleles. But deleterious mutants cannot be maintained in cultivars as breeders would select against them. In RIL populations, genes for sterility would also cause SD. Eleven gametophytic genes have been detected in rice (Kinoshita 1991) and five in maize (Coe and Polacco 1995); some of these genes are associated with SD (Harushima et al. 1996; Lu et al. 2002). To our knowledge, no gametophytic genes have been isolated or mapped in wheat, but the fact that gametocidal factors in heterozygous and hemizygous states from Aegilops relatives of wheat would affect the function of gametes without the gametocidal alleles was well documented (Endo 1990). We assume that at least some of the SDRs identified in this study involve gametophytic genes.

The NW map provides a supplement to the previously published genetic maps and could find wide application in structure and function analysis of the wheat genome, as well as in gene mapping, cloning, and wheat breeding (Lin et al. 2004, 2006; Ma et al. 2007). The use of PCR markers makes this map especially useful for information exchange between different labs. Even though EST-derived markers usually detect less polymorphism than do genomic markers, such as gSSR markers, their mapping could provide invaluable information for exploration of the expressed portion of the wheat genome and functional analysis of traits of interest. In this study, we identified some EST markers clustered in QTL intervals for resistance to wheat Fusarium head blight (FHB) such as QFhi.nau-2D1 (Lin et al. 2006) and QFhs.nau-6B (Lin et al. 2004), probably because the ESTs from F. graminearum-induced spike cDNA libraries were used in the marker development. Some of these ESTs displayed differential expression patterns between the FHB-resistant Wangshuibai and the FHB-susceptible Nanda2419 (unpublished data) and thus may serve as candidate genes. EST markers linked to genes of interest could also be used to demarcate orthologous gene regions in the genomes of rice and Brachypodium distachyon, two model species of cereal crops, which in turn would aid identification of candidate genes or development of new markers for fine mapping.