Introduction

The development of several types of molecular markers in the last 20 years (Peters et al. 2003; Gupta and Rustgi 2004) has greatly facilitated the construction of genetic linkage maps for a variety of crop species. These linkage maps have been used for marker-assisted breeding (reviewed in Dekkers and Hospital 2002), map-based cloning strategies (Mayerhofer et al. 2005; van Os et al. 2006), comparative genomics (reviewed in Paterson et al. 2000) and dissection of quantitative traits (reviewed in Salvi and Tuberosa 2005).

Genetic maps have been developed using intra- or inter-specific crosses, mostly predicated on whether sufficient polymorphism exists within the species. Maps generated from intra-specific crosses can be more readily used for breeding purposes because they contain markers that are informative within the closely-related gene pool of the cultivated species. However, in self-pollinated species like lettuce, the development of comprehensive maps from intra-specific crosses has been obstructed by limited polymorphism within the cultivated species. Inter-specific crosses have a higher degree of polymorphism but segregation distortion for some crosses may be frequent and hybrid sterility precludes the development of permanent populations such as recombinant inbred lines. Furthermore, reduced recombination frequency or segregation distortion towards the cultivated allele in certain regions may limit access to wild alleles in those regions of the genome.

Genetic maps developed using individual populations can be of limited use for other populations if markers are not frequently polymorphic among populations. Integrated maps that combine information from multiple populations increase the number of markers in a given genomic region and improve the chances of obtaining sufficient polymorphic markers in a specific population as well as allowing comparison of locations of genes of interest across maps. Integrated linkage maps have been developed for several crop species including pepper (Paran et al. 2004; Lefebvre et al. 2002), soybean (Song et al. 2004), melon (Périn et al. 2002), sunflower (Gedil et al. 2001), rapeseed (Lombard and Delourne 2001), tomato (Haanstra et al. 1999) and barley (Qi et al. 1996).

Lettuce (Lactuca sativa L.) is a diploid (2n = 18) species within the Lactucae tribe of the Asteraceae. There are four well-established species within the subsection Lactuca: the cultivated L. sativa and three wild species, Lactuca serriola, Lactuca saligna, and Lactuca virosa (in order of decreasing sexual compatibility with L. sativa). L. serriola is probably the progenitor of and con-specific with L. sativa (Kesseli et al. 1991; de Vries 1997). Lettuce is a major horticultural crop with a worldwide production of over 21 million tons in 2004 and accounts for $2.06 billion in farm value within the USA (Anonymous 2004). Lettuce has been the subject of classical and molecular studies. It is one of the species studied in the Compositae Genome Project (CGP; http://compgenomics.ucdavis.edu/). An objective of this project is the characterization of genes underlying agriculturally important and domestication traits. This includes genetic analyses to dissect qualitative and quantitative traits as well as the identification of candidate genes from extensive expressed sequence tag (EST) databases and the study of synteny among sunflower, lettuce and Arabidopsis (Timms et al. 2006; http://cgpdb.ucdavis.edu/).

Five genetic maps have been published previously for lettuce. Kesseli et al. (1994) and Waycott et al. (1999) reported the construction of two linkage maps assembled on populations derived from crosses within cultivated lettuce (L. sativa). These two maps are fragmented with multiple linkage groups exceeding the chromosomal number of Lactuca species. This is probably a consequence of the low intra-specific polymorphism and a high frequency of monomorphic regions between the parental lines. Two other maps are from crosses between cultivated lettuce and its closest wild relatives L. serriola and L. saligna. Johnson et al. (2000) described a framework map of AFLP® markers (a registered trademark of Keygene N.V.) for a QTL analysis of a population derived from L. sativa and L. serriola. Jeuken et al. (2001) used AFLP markers to generate a consensus map of nine chromosomes for two populations derived from crosses between L. saligna and L. sativa. Because these four maps have been constructed using different sets of markers, alignment of linkage groups across maps was not possible, excluding the possibility of integrating information between them. Recently, a map from a cross between L. serriola DH-M21 and L. sativa cv. Dynamite has been published (Syed et al. 2006). This map was developed using retrotransposon-based markers as well as some AFLP markers in common with the map of Jeuken et al. (2001).

Here we report on the construction of a consensus map in lettuce that integrates four previously published maps of lettuce with data from three new populations: two maps generated from crosses between L. sativa and L. serriola and an additional map derived from L. saligna × L. sativa. This map integrates data from all seven individual maps, thus providing for a better coverage of all genomic regions. It is also more representative of the Lactuca genus. L. serriola and L. saligna are the closest wild relatives of cultivated lettuce (L. sativa) and their inclusion allows for more universal utility of markers across different genetic backgrounds. Display of the data in the public Compositae Genome Database (http://cgpdb.ucdavis.edu/) provides a current resource for exploiting lettuce genetic and genomic data.

Materials and methods

Plant material

Seven different populations were used to develop an integrated map of lettuce (Table 1). Two were F2 populations of intra-specific crosses between cultivars or breeding lines of L. sativa: Population DB0D was a F2 of a cross between cv. Calmar (crisphead type) × cv. Kordaat (butterhead type); The RYDER population was a cross between F4 breeding lines 87-25-1M × 87-1090M (for more detailed pedigrees see Waycott et al. 1999). Three other populations were inter-specific crosses between L. sativa and L. serriola. Population DB0H was a F2 from a cross between L. sativa cv. Salinas (crisphead type) and L. serriola US96UC23; population DA0F was a RIL7 of the same cross derived by single seed descent. Population DB9X was a F2 from a cross between L. sativa cv. Salad Bowl (leaf type) and L. serriola CGN14263. The last two populations were inter-specific crosses between L. saligna and L. sativa. Population DB0F was a F2 from L. saligna UC82US1 × L. sativa cv. Vanguard 75 (crisphead type). Population DB0T was a F2 from L. saligna CGN5271 × L. sativa cv. Olof (butterhead type).

Table 1 Generation, number of individuals and markers of the seven populations integrated on the consensus map of lettuce

Construction of individual maps

Linkage maps for populations DB0D and RYDER utilizing restriction fragment length polymorphism (RFLP), random amplified polymorphic DNA (RAPD), resistant genes and morphological markers have been previously published (Kesseli et al. 1994; Waycott et al. 1999). Additional markers were run on these populations to assist in aligning and integrating the maps. AFLP markers were analyzed in all populations except RYDER using standard procedures (Vos et al. 1995). A total of 53 primer combinations were assayed in these populations. Of those, 13 were public primer combinations, the rest were proprietary (Keygene). Images of AFLP gels were electronically scanned and AFLP markers were scored co-dominantly by using the proprietary software developed for AFLP analysis at KeyGene. The KeyGene software analyses band intensity and accurately assigns genotypes using a mixture model of normal distributions as described by Jansen (2001). Therefore the majority of markers used for map integration were co-dominant. A set of proprietary simple sequence repeat (SSR) markers developed by Seminis Inc. (Woodland, CA, USA) were analyzed in all seven populations following standard procedures. Individuals were genotyped as “A” when they were homozygous for the parental female allele, “B” when they were homozygous for the parental male allele, “H” when they were heterozygous, “C” (not A) for dominant markers where the B allele was dominant and “D” (not B) for dominant markers where the A allele was dominant. Information for each marker and segregation data for each population is provided at http://cgpdb.ucdavis.edu/mjt_2006.

Individual linkage maps were constructed using Joinmap v 2.0 (Stam 1993; Stam 1995) and are displayed at the same website. Markers were assigned to linkage groups (LGs) by increasing the LOD score for grouping in steps of 0.5 LOD. This iterative process for each population selected a maximum LOD score for grouping on the basis of stability of groups over different LOD values and sufficiently large groups approaching the number of chromosomes for lettuce. In some cases, additional steps were required to split up some large groups at a higher LOD value. Mapping was carried out using the following thresholds for Joinmap; REC of 0.49, LOD of 0.10, RIPPLE of 1, JUMP of 4 and TRIPLET of 7. No order was forced during the linkage analysis. Recombination frequencies were converted to map distances in centimorgans (cM) using the Kosambi mapping function (Kosambi 1943). Using the proprietary Keygene software program “Genome Typer” graphical genotypes were generated for each population. These graphical genotypes provide an overview of the genotype configurations of all individuals. In this way, possible inconsistencies in the map, like the occurrence of frequent double crossovers in small cM intervals and “recombination hotspots”, are readily detected. Markers which resulted in frequent apparent double crossovers (in at least 15% of the individuals) were removed from the map and the map was re-calculated and checked in a similar way. On average, the number of markers removed did not exceed 3%. The quality of the final maps were visualized using the publicly available CheckMatrix program (see data visualization below).

Marker distortion

Segregation ratios were calculated for all markers. Goodness of fit χ2 tests were calculated with expected Mendelian ratios of 1:2:1 (A:H:B; co-dominant marker) or 3:1 (C:A or D:B; dominant marker) for F2 populations and of 1:1 (A:B) with a residual heterozygosity of 1.5625% for the F7 RIL population. Markers were considered distorted at a level of significance of P < 0.05. For co-dominant markers with individuals that were scored as “C” or “D” because they could not be unambiguously assigned a genotype, two χ2 tests were calculated. One assumed a ratio of 1:2:1 (A:H:B) while not considering “C” or “D” scores. The other assumed a 3:1 ratio (C:A or D:B), where “H” genotypes were added to the dominant homozygous allele. Both χ2 tests had to be significant for the marker to be considered skewed.

Recombination percentage

Recombination percentages were calculated for conserved marker intervals between individual maps. Markers defining conserved intervals were chosen from framework markers used for marker integration (see below). Conserved intervals among each pair of maps were identified and the number of recombinant individuals calculated as those individuals where there was a change on the genotype at the markers defining the interval; for example, if markers mk1 and mk2 defined a conserved interval in two populations, a recombination event was considered as present in those individuals where there was a change in the genotype at mk1 versus mk2 from A to H, B to H, A to C, B to D and vice versa and two recombination events were considered as present when there was a change in genotype from A to B and vice versa. Therefore, some recombination values may be slightly underestimated since in dominant markers changes between A to C (H or B) and B to D (H or A) can result from two recombination events; however, this situation is not frequent because most markers used were co-dominant rather than dominant.

Integrated map construction

Individual maps were integrated using INT_MAP (Peleman et al. 2000). This proprietary software integrates individual genetic linkage maps into a single consensus map by using common markers between populations as anchor markers. Map integration consisted of three separate steps. First, markers were identified that were common to at least two populations. These common markers were identified a priori as framework markers and then used to associate the corresponding linkage groups in the individual maps. Second, the consensus order of framework markers within a linkage group was calculated from the relative positions of the framework markers in each individual map. Starting sequentially with the most common marker, additional markers were added one at the time following the rule of most proximate, most common marker. Therefore within a set of equally common markers, markers are added in order depending on their distance to the already placed markers. Subsequently, the next step is made with less common markers. The consensus position of each framework marker was calculated as a weighted average of the individual map distances between new markers and the previously positioned markers. Two parameters controlled map integration: (1) The minimum fraction of overlap between groups (the minimum fraction of markers that are common to at least two groups, for example 3 out of 10 common markers) and (2) the maximum disagreement allowed in the distance between two adjacent framework markers on the individual maps. We used 0.30 and 10.0 cM for these two parameters respectively for the construction of the integrated lettuce map. Markers deviating from these criteria were not used as framework markers for map integration and were not included on the integrated map but remained on the individual maps. Lastly, markers unique to individual populations were positioned on the integrated map using interpolation and extrapolation to the framework markers. Markers that were unique to a map were placed in a “bin” between framework markers; the accuracy of their position depended on the size of the interval between the framework markers in the individual map.

Framework RFLP markers were identified by the use of the same probe name. Common RAPD markers were identified when the same size of polymorphic fragment was amplified with the same primer. Common AFLP markers were identified when, using the same enzyme/primer combination, bands co-migrated on reference gels including all the parents (Rouppe van der Voort et al. 1997). Framework SSRs were recognized by the use of the same set of primers.

Linkage groups in the integrated map were assigned numbers corresponding as close as possible to the linkage group designations in the previously published maps; priority was given to the first map published (Kesseli et al. 1994).

Nomenclature

Each of the populations and the maps derived from them were identified by an alpha-numeric code in upper case (see section “Plant Material” above). The same code in lower case followed by a number was used to identify the linkage groups within each of the maps. For example, the F2 population derived from L. sativa cv. Calmar × L. sativa cv. Kordaat was identified by the code DB0D and linkage group 3 in the map derived from this population was designated Db0d-3. The code for the integrated map was INT (Table 2).

Table 2 Number of markers and size in cM for linkage groups (LG) in all populations integrated on the lettuce map. For previously reported maps the correspondence among published linkage groups and the linkage groups in the integrated map is given in the PM (previous map) column. In populations DB0D and RYDER, linkage groups designated between quotations indicated those LG where numbers were not assigned on the published map. For the purpose of this table numbers have been assigned corresponding to the order of the linkage groups in the previously published map

Data curation and visualization

In order to curate and query all the genetic data, we developed several modules and utilities for the Compositae database (http://cgpdb.ucdavis.edu/). This website and database was created originally to display and query EST data for lettuce and sunflower generated by the Compositae Genome Project. Segregation data is available for all loci at http://cgpdb.ucdavis.edu/mjt_2006. Enabling information and images are also available for most public markers in a mySQL relational database.

Lettuce Genetic Map Viewer is a PHP program that allows the visualization of markers on the integrated lettuce map as well as the individual maps (http://cgpdb.ucdavis.edu/mjt_2006). It uses GD graphic libraries and the PHP scripts interact with the mySQL database dynamically to generate graphical images “on the fly” upon a user’s query via the web interface. Each map can be displayed separately allowing the search for specific markers within a specific map or linkage groups can be compared between maps to identify common markers and to align them to the integrated map. The types of markers displayed and the appearance of the map can be customized depending on the needs of the researcher.

A CheckMatrix plot is displayed for each linkage group. CheckMatrix (http://cgpdb.ucdavis.edu/XLinkage/) is a Python script designed to assess the quality of genetic maps by generating 2D plots (heat maps) of the degree of linkage between all markers. Markers are ordered according to their genetic position. The colored pattern of the linkage relationships indicates the quality of the map and helps identify problematic markers and regions. Checkmatrix also displays the segregation ratios for each marker, which facilitates the analysis of segregation distortion.

Results

Individual genetic maps

Individual linkage maps for populations DB0D, RYDER, DB0H and DB0T have been previously published (Table 1). Additional AFLP and SSR markers were analyzed on populations DB0D, RYDER and DB0H to provide markers common to multiple maps. AFLP markers were analyzed for six of the seven populations (Table 1). RFLP markers were analyzed only for populations DB0D and DB0F and RAPD markers were analyzed for populations DB0D, RYDER and DB0F. SSR markers were the only marker type assayed in all seven populations. Limited numbers of isozymes, morphological markers, resistant genes and SCAR markers were also assayed in populations DB0D, RYDER, DB0H and DB0F.

New individual maps were constructed for all populations, except DB0T, integrating the old and new marker data. Segregation data and individual maps for the DB0D, RYDER, DB0H, DA0F, DB9X and DB0F populations are available at http://cgpdb.ucdavis.edu/mjt_2006. The two maps constructed on crosses between L. saligna × L. sativa (DB0F and DB0T) and the RIL map (DA0F) comprised nine chromosomal linkage groups. Maps constructed on crosses within L. sativa had higher numbers of linkage groups (20 linkage groups in DB0D and 11 on RYDER). Population DB0T was used for construction of the consensus map without any modifications (Jeuken et al. 2001). The seven maps ranged from 618 to 1368 cM and comprised of 104 to 910 markers.

Different types of markers were distributed uniformly through the linkage groups in the individual maps. Clustering of AFLP markers was evident in most of the maps. RFLPs and RAPDs were also clustered in the map from population DB0D.

Marker order of the individual maps was generally consistent between the maps as compared using Lettuce Genetic Map Viewer. Markers showing inconsistent positions were not subsequently used for map integration. Comparisons of linkage groups among individual maps and with the integrated map revealed that some individual maps were missing large segments of the integrated map. For example, in map DB9X, linkage group Db9x-5 is only 53.2 cM and corresponds to a terminal third of Int-5. The other two thirds of Int-5 are missing from map DB9X (supplementary material S1). In map DB0D, linkage groups Db0d-6a and Db0d-6b correspond to three quarters of Int-6 (supplementary material S2); similarly, linkage groups Db0d-8a and Db0d-8b correspond to two thirds of Int-8 (supplementary material S3). However, linkage group Db0d-add1 could not be integrated in the consensus INT map for lack of markers in common with other maps and this linkage group may correspond to the missing portions of either Int-6 or Int-8.

Segregation distortion

All populations had some markers with segregation ratios that deviated from Mendelian expectations. Populations DB0F and DB0T had markers with distorted segregation in almost all the linkage groups (Table 3). On the individual maps most of these markers were associated in groups, indicating that the distortion was due to biased transmission rather than mis-scoring of individual markers. Segregation distortion was highest in the L. saligna × L. sativa crosses. Some regions showed segregation distortion in multiple populations with the same direction of the distortion (Table 3). Linkage group Int-7 was distorted in DB0D (linkage groups Db0d-7b and Db0d-7c), DB0F (linkage group Db0f-7) and in DB0T (linkage group Db0t-7) with a maximum number of distorted markers in Db0t-7 where the segregation of all but one of the markers was distorted (Table 3). In all these populations, the distortion of linkage group 7 markers was towards the L. sativa haplotype. Linkage groups Int-5, Int-6 and Int-9 were distorted only in populations DB0F and DB0T; in these groups, even though the region of distortion was similar, the distortion was towards different parental haplotypes (Table 3) except for one of the regions in linkage group 9 distorted towards the cultivated haplotype in both maps. Linkage group Int-3 had groups of distorted markers in populations DB0H, DA0F, DB9X, DB0F and DB0T. All the distorted regions were coincident except the one in DB0F map (linkage group Db0f-3) that did not overlap with the others (Table 3) Distortion was towards the L. sativa haplotype in DB0H, DA0F and DB9X and towards the heterozygous haplotype in DB0F and DB0T.

Table 3 Distribution of groups of distorted markers across linkage groups in the different populations considered in this study. A group of skewed markers is defined as a group of at least three linked distorted markers with gaps of non-distorted markers of three or less

Recombination percentage

We calculated recombination percentages in all the F2 populations except RYDER (that could not be included due to the small number of conserved intervals between this map and any of the others). Recombination percentages for conserved intervals between pairs of maps are presented in Table 6. In most of the comparisons when more than one interval was considered, populations where L. saligna was one of the parents had always a lower number of recombination events. For the four conserved intervals between DB9X and DB0F in LG2 the same rate of recombination was observed (Table 6).

When comparing the maps from the two inter-specific crosses with L. saligna (DB0F and DB0T, Table 6), DB0T had a more severe suppression of recombination. This tendency of suppressed recombination was also apparent in the smaller sizes of the linkage groups in these two maps when compared to the other individual maps or to the integrated map (Table 2; http://cgpdb.ucdavis.edu/mjt_2006).

Map integration

A total of 695 markers common to two or more maps were available for map integration. Of those, 62 (8.9%) were discarded as having inconsistent grouping (mapping in different linkage groups in the individual maps). A total of 47 markers were discarded for having contradictory genetic positions (exceeding the 10 cM maximum distance disagreement allowed between two adjacent framework markers). A total of 560 (80.6% of the original framework markers) were used for map integration. Of these 560 markers, 474 were AFLP, 24 RFLP and 25 SSR markers (Table 4; http://cgpdb.ucdavis.edu/database/genome_viewer/viewer). Only 16 RAPD markers were used as framework markers, mostly to allow the integration of the RYDER linkage groups. As expected, maps DB0H and DA0F had the most common framework markers between them (274) since these maps are from different generations derived from the same cross. Maps DB0F and DB0T derived from crosses between L. saligna and L. sativa had the second most common number of markers (136). The RYDER map had the fewest common markers (18) with any of the other maps (Table 5).

Table 4 Number of common framework markers among populations used for linkage group integration
Table 5 Distribution of common framework markers between populations

Most of the linkage groups from the individual maps were successfully integrated into the consensus map. The correspondence among linkage groups in the individual maps and the linkage groups in the integrated map is shown in Table 2. When possible, linkage groups were designated to be consistent across maps with priority given to the earliest report. Only linkage groups add1, add2 and add3 in the RYDER map and linkage group add1 in the DB0D map were not integrated on the consensus map for lack of common framework markers (Table 2).

Integrated map characteristics

The integrated lettuce map was composed of 2,744 markers assembled into nine linkage groups, corresponding to the nine chromosomes of lettuce, with a total map length of 1,505 cM (Table 2; Fig. 1). The integrated map was ∼150 cM larger than the largest individual maps from populations DB0H and DA0F (1,368 cM and 1,346 cM). This was the result of the expansion of linkage groups 3 and 9 by intercalating new markers from the other maps (Table 2). Average interval size between markers was greatly reduced on the integrated map (Fig. 2). Less than 1% of the intervals were larger than 5 cM.

Fig. 1
figure 1

Integrated map of lettuce. AFLP and SSR markers are displayed below the linkage group line. RFLP, RAPD and morphological markers are displayed above the linkage group line. For details, see http://cgpdb.ucdavis.edu/mjt_2006

Fig. 2
figure 2

Distribution of size intervals among populations. Comparison with INT map

Marker distribution in the integrated map was a reflection of the individual maps distributions (Fig. 1). AFLP markers were obviously clustered in multiple regions and few regions exhibited clustering of RFLP and RAPD markers. Morphological markers that only segregated in populations DB0D and RYDER (DM resistance genes etc.) were integrated into the linkage groups using anonymous markers.

Discussion

Our integrated map of lettuce integrates four previously published maps of lettuce with data from three previously unreported datasets. The seven maps were constructed from crosses within and between L. sativa (cultivated lettuce) and its closest wild relatives L. serriola and L. saligna. Consequently it provides markers for accessing and exploiting variability within the sexually-compatible gene pool of cultivated lettuce.

Segregation distortion

We found two different types of distortion: specific regions distorted in multiple populations and distortion of multiple linkage groups in a single population suggesting multiple reasons for segregation distortion. Markers with segregation distortion were found restricted to certain genomic positions in maps within L. sativa and between L. sativa and L. serriola. Factors that affect the segregation of alleles during meiosis (e.g. direct gamete competition, male killing) can produce distorted ratios at affected loci (Moyle and Graham 2006). A region in Int-3 was consistently distorted in populations DB9X, DB0H and DA0F towards the L. sativa maternal allele. In tomato and rice, distortion towards one of the parental alleles was the result of selective abortion of gametes carrying a specific allele at a gamete eliminator locus (Rick 1966; Sano 1990).

In crosses between L. saligna and L. sativa, distorted markers were widely distributed through the genome present in almost all the linkage groups. In a comparison among 58 intra and inter-specific crosses in different agricultural crop species, Jenczewski et al. (1997) reported an increase of distorted markers in inter-specific crosses versus intra-specific crosses in agreement with our results. The degree of distortion in a genetic mapping population is thought to be correlated with the level of genomic divergence between parental taxa (Taylor and Ingvarsson 2003). Hybrid inviability or sterility is known to occur by the accumulation of inter-locus incompatibilities between divergent populations (Sweigart et al. 2006). F1 hybrids between L. saligna and L. sativa are partially sterile (de Vries 1990; Koopman et al. 1998). Possibly disruptive genetic interactions among divergent loci between L. saligna and L. sativa are responsible for hybrid sterility and observed distorted ratios in the F2. Distorted segregation during the development of a set of 29 lettuce backcross inbred lines that introgressed single segments from L. saligna into L. sativa over more than five generations resulted in six regions in which the L. saligna segment could not be obtained as homozygous (Jeuken and Lindhout 2004). This extreme preference for L. sativa alleles was observed for segments that mapped to Int-6, Int-8, Int-7 (two regions) and Int-9 (two regions). One segment corresponding to a region on Int-7 showed complete distortion as only the L. sativa haplotype was obtained. These segments are similar to those that showed segregation distortion in the L. saligna × L. sativa F2 populations DBOT and DBOF (Table 3).

Wild ancestors of crop plants have been useful sources of genetic variation that has been lost during domestication of crop plants (Zamir 2001). Wild species in the Lactuca genus have been successfully used in the introgression of horticulturally important traits in breeding programs, particularly disease resistance (Crute and Dickinson 1976). Understanding the mechanisms responsible for segregation distortion in a given cross is important since such distortion can obstruct gene flow between species and hamper the introgression of horticulturally interesting alleles in certain regions of the genome. In contrast to the development of BILs (Jeuken and Lindhout 2004), the distortion observed in our interspecific populations was never complete (i.e. only one of the parental haplotypes was recovered); all the possible haplotypes were recovered in the progeny for all regions, albeit at different frequencies. The consequence for selection of genes in these skewed regions is the increased progeny sizes required to recover wild alleles in regions that become distorted towards the cultivated allele. Horticulturally, interesting genes can be located in regions exhibiting segregation distortion; quantitative trait locus (QTL) analysis of numerous horticulturally important traits segregating in population DA0F identified QTLs for leaf shape and heading in a distorted region of Da0f-3 (MJ Truco, unpublished data).

Recombination rates

Observed recombination frequencies are dependent on the accuracy of the dataset as well as the actual rate of recombination. Mis-scored markers are especially problematic on dense maps, where a few errors can greatly increase apparent marker distances. This type of error was reduced by searching for and removing individual markers flanked by recombination events. The maps from L. sativa × L. sativa and between L. sativa and L. serriola have similar rates of recombination supporting the idea that these taxa are conspecific (Kesseli et al. 1991) (Table 6).

Table 6 Comparison of recombination frequencies among DB0D, DB0H, DB9X, DB0F and DB0T populations for conserved intervals within linkage groups

Recombination was reduced in crosses between L. saligna and L. sativa resulting in more compressed linkage maps (Table 2; Supplementary material S5). This was probably the consequence of the greater sequence divergence between these parents. The higher levels of divergence were reflected by the mean numbers of AFLP markers per primer combination; populations DB0T and DB0F averaged 35 and 45 AFLP markers respectively, while DB0H and DB9X averaged 23 and DBOD only 12 AFLP markers per primer combination. This is similar to inter-specific hybrids in tomato with a high degree of marker polymorphism that showed severe suppression of recombination between homeologous chromosomal segments due to reduced chiasma frequency in F1 hybrids (Chetelat et al. 2000). Sequence divergence is known to repress homologous recombination. The recombination rate in Arabidopsis was reduced threefold by single nucleotide mismatches (Opperman et al. 2004). Extensive recombination studies in maize have documented reduced recombination associated with sequence heterogeneity (Schnable et al. 1998). Within the bronze gene, a hot spot for recombination in maize, recombination occurs randomly within the gene in the absence of sequence divergence but is reduced by point mutations or single base pair mismatches (Dooner and Martínez-Férrez 1997).

Integrated map

Of the 695 potential framework markers available for map integration, over 90% were readily assembled into nine chromosomal linkage groups. Only 62 (8.9%) had to be discarded because of inconsistent grouping. This proportion is consistent with map integration studies with other species (Keygene unpublished). The reason for inconsistent grouping of markers has been studied in maize by re-sequencing bands that gave inconsistent grouping in different maps; one third of these bands were the result of different sequences with very close mobility being called the same. The remaining two thirds of the inconsistent markers had the same or similar sequence and were presumed to be duplicated sequences. The inconsistent positions may reflect the segregation of one or more polymorphisms in different paralogs.

The integrated map of lettuce reported here is a more informative genetic resource than any of the individual maps previously available. It combines 2,744 markers into nine linkage groups corresponding to the nine chromosomes of Lactuca spp. with better marker coverage (1,505 cM) than any of the other individual maps with nine linkage groups (DA0F, 1,346 cM; DB0F, 1,073 cM; DB0T, 9,17 cM). Genomic regions that were either split into separated linkage groups or missing in the individual maps for lack of polymorphism are now combined into single linkage groups. For example, linkage group Int-3 is split into two linkage groups in DB0D (36.8 and 58.5 cM), DB0H (85.5 and 33.8 cM) and DB9X (96.6 and 64.5 cM) and has missing regions in DB0F (91.5 cM) and DB0T (77 cM). Int-3 has a total length of 193 cM and is the result of combining regions from all the populations (Int-3, supplementary material S4). Some of the maps, for example DB9X, contribute to this linkage group with unique regions not present in any of the other maps.

Even marker distributions through the linkage groups were observed for most of the marker types except for clustering of AFLP markers. The same situation has been reported previously in an ultradense map of potato (van Os et al. 2006). Clustering of AFLP markers may be due to a more even distribution of AFLPs over the physical map. Consequently, clusters of AFLPs may be indicative of heterochromatic regions near centromeres (Haanstra et al. 1999); confirmation of this for lettuce awaits cytological analysis.

The integrated map of lettuce combines diverse types of markers into a single map. Combining information from multiple crosses increases the chances of identifying putative polymorphic markers in a given region. Dm13, a disease resistance gene against lettuce downy mildew, was previously mapped in DB0D to a region of Db0d-3b with only dominant RAPD markers linked to it (Kesseli et al. 1994). In the integrated map this gene is linked to a suite of AFLP, SSR and RAPD markers. To map new traits that segregate in previously unmapped populations, bulked segregant analysis (BSA, Michelmore et al. 1991) could be used to identify a few markers linked to the gene of interest. These markers would then be aligned to the integrated map to identify other candidate markers in the region avoiding the time-consuming construction of a detailed map in the new population.

Comparisons among individual maps and the integrated map of lettuce were greatly facilitated by Lettuce Map Viewer. This program provided graphical displays of the comparisons and allowed easy access of the raw data behind the construction of the individual and integrated maps including genotypes and details of markers. These tools are publicly available for application to other species. These tools and the integrated map data will greatly facilitate analyses of synteny with Arabidopsis (Timms et al. 2006), and marker assisted selection as well as genetic dissection (QTL analysis) and map-based cloning of agriculturally important traits.