Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Introduction

The high-throughput and low-cost next-generation sequencing technologies make genome sequencing possible for non-model species. The typical approaches of genome sequencing include either “BAC-by-BAC” or “whole-genome shotgun sequencing” approaches. A high-quality physical map plays essential roles in both sequencing strategies (The Arabidopsis Genome Initiative 2000; Schnable et al. 2009; The Tomato Genome Consortium 2012; Yu et al. 2002). The key step in “BAC-by-BAC” sequencing projects is to identify BAC clones on the “minimal tiling path,” which heavily relies on an accurate BAC-based physical map. In whole-genome shotgun sequencing projects, a physical map, in combination with BAC end sequences (BES), provides anchor points to build scaffolds, especially for large repeat regions. Physical maps provide a valuable genomic resource not only for genome sequence assembly but also for studying genome structure and organization and positional cloning of genes associating with economically and agronomically important traits. In addition, physical maps can be used for comparative genomics studies of closely related species to enhance our understanding of the evolutionary process.

Genetic and physical maps are constructed using independent methods and are not directly related to each other. Various methods have been developed to integrate the genetic and physical maps, including (1) genetically mapping BAC-anchored markers (Yu et al. 2009; Troggio et al. 2007; Mun et al. 2006), (2) BAC pooling with PCR screening (Klein et al. 2000), and (3) hybridization of overgo probes (Yüksel et al. 2005) (Fig. 10.1a–c).

Fig. 10.1
figure 00101

The three major methods for integrating genetic and physical maps. (a) Genetically mapping BAC-anchored markers. (b) BAC pooling with PCR screening. (c) Hybridization of overgo probes

Among these three methods, genetically mapping BAC-anchored markers is the most commonly used. Genetic and physical maps are independent genomic resources and complementary to one another. The integrated map could minimize errors and enhance the quality of these resources. The integrated maps with BES allow assignment of genome sequence contigs on linkage groups. The integrated genomic resources provide a simplified and efficient interface for data mining and bioinformatics analysis.

Significant progress has been made in recent years in developing genomic resources to expedite genetic research for improvement of papaya. Three bacterial artificial chromosome (BAC) libraries, one for each sex type, were constructed, providing the foundation for studying papaya genome structure and organization (Ming et al. 2001; Gschwend et al. 2011). BAC ends of the entire hermaphrodite papaya BAC library were sequenced (Lai et al. 2006; Ming et al. 2008; Yu et al. 2009), providing the first glimpse of the sequence composition of the papaya genome. Three high-density genetic linkage maps have been constructed (Ma et al. 2004; Chen et al. 2007; Blas et al. 2009) (Table 10.1), providing essential tools for comparative genomic analysis, marker-assisted selection, and genomic dissection of complex traits. The first high-density genetic map of papaya was constructed with 1,501 markers, including 1,498 amplified fragment length polymorphism (AFLP) markers, 2 morphological markers, and one transgenic marker (Ma et al. 2004). Due to the anonymous nature of AFLP markers, this map cannot be used to integrate the genetic and physical maps and is not suitable for aligning papaya genome sequence to linkage groups. To overcome this limitation, highly informative sequence-based simple sequence repeat (SSR) markers were used to construct the second high-density genetic map. This sequence-tagged genetic map contains 707 markers including 706 sequence-based SSR markers and one morphological marker (Chen et al. 2007). The SSR markers were developed from either BES or whole-genome shotgun sequence reads (Chen et al. 2007); and this map has contributed to the integration of genetic and physical maps and the genome sequences (Yu et al. 2009). A BAC-based physical map of papaya was constructed using the high-information-content fingerprinting method (Yu et al. 2009). The draft genome sequence of papaya was published in 2008 and contains 271 Mb of contig sequence and 370 Mb of scaffolds with embedded gaps (Ming et al. 2008). The physical map was integrated with the genetic map and genome sequence through mapping microsatellite markers derived from BES and the whole-genome shotgun sequences (Yu et al. 2009). This integrated map provides essential information for studying the papaya genome structure and comparative genomics.

Table 10.1 Comparison of the three papaya genetic maps

Papaya BAC Libraries

Bacterial artificial chromosome (BAC) libraries, containing large genomic fragments with deep coverage, have served as valuable research tools for physical mapping, positional cloning, and genome sequencing. The first papaya BAC library was constructed from a hermaphrodite plant of inbred cultivar “SunUp” using HindIII partial digestion (Ming et al. 2001) (Table 10.2). This BAC library consists of 39,168 clones with an average insert size of 132 kb, providing 13.7× papaya genome equivalents (Ming et al. 2001). This BAC library has been used for cloning flower development (Yu et al. 2005, 2008; Ackerman et al. 2008) and fruit flesh-color genes (Skelton et al. 2006; Blas et al. 2010) and characterization of transgenic insertions in genetically modified papaya (Suzuki et al. 2008). The entire library was end sequenced (Lai et al. 2006; Yu et al. 2009). After trimming the vector sequences and removing the clones with cross-well contamination, a total of 67,179 BES remained with an average length of 666 bp. The total length of the cleaned BES is 44,725,370 bp, accounting for 12 % of the papaya genome. The BES played essential roles in integration of genetic and physical maps and in assembling the genome sequence.

Table 10.2 The characteristics of the three papaya BAC libraries

Two additional papaya BAC libraries, one for “SunUp” female and the other for “AU9” male plants (Table 10.2), were constructed to assist physical mapping of the male-specific Y chromosome (MSY) region and its corresponding region on the X chromosome (Gschwend et al. 2011). These two independent BAC libraries used two different restriction enzymes, EcoRI and BstYI, to increase representative genome coverage. The female BAC library consists of 36,864 clones with an average insert size of 104 kb, providing 10.3× genome equivalents (Gschwend et al. 2011). The male BAC library consists of 55,296 clones with an average insert size of 101 kb, providing 15.0× genome equivalents (Gschwend et al. 2011). The three papaya BAC libraries of three sex types provided essential genomic resources for physical mapping of the hermaphrodite-specific region of the Yh chromosome (HSY), the male-specific region of the male Y chromosome (MSY), and the corresponding region on the X chromosome (Na et al. 2012).

BAC Fingerprinting and Contig Assembly

The basic strategy for construction of a physical map is to identify overlapping clones based on shared restriction fragments or DNA markers. Various methods have been developed to create physical maps. Briefly, these methods can be grouped into either PCR/hybridization-based or DNA fingerprinting-based approaches. Compared with PCR/hybridization-based methods, fingerprinting-based methods are rapid and better suited to map at a whole-genome level. The high-information-content fingerprinting (HICF) method has proved to be the most effective method owing to its high resolution and high throughput (Luo et al. 2003).

The entire library of the hermaphrodite papaya BAC library was fingerprinted using the HICF method. After the 1st round of fingerprinting, the failed clones were repeated twice to obtain successful fingerprints for 38,522 BAC clones (98.4 % of the BAC library). After excluding the clones with no or small inserts (referred as less than 20 true fragments after editing), and cross-contamination (70 % or higher shared fragments for neighboring clones), the remaining 30,824 fingerprints (78.7 % of the BAC library) were subjected to contig assembly using the FPC (FingerPrinted Contigs) program (Soderlund et al. 1997).

A total of 26,466 BAC clones were assembled into 963 contigs, while 4,358 clones remained as singletons. The average number of fragments (bands) of each clone was 69.4. Considering the clones with no or small inserts were excluded from contig assembly and the average insert size of the BAC clones used for contig assembly should be higher than 132 kb, the BAC clones mapped on contigs (26,466 BAC clones) represented at least 9.4× genome equivalents of the papaya genome. On average, each contig contains 27.5 BAC clones. The longest contig contains 1,571 consensus bands, which is about 0.7 % of the total length of the FPC contigs.

Overgo and Single-Copy Probe Hybridization

Overgo probes consist of two complementary oligonucleotides that can anneal at the 3′-end to generate a double-stranded probe with labeled deoxynucleotides (Ross et al. 1999). Overgo probes are designed to bind to conserved regions between two genomes at regular intervals. The overgo hybridization system was developed for large-scale BAC library screening in a high-throughput format to anchor EST sequences, or genetic markers, on physical maps. Overgo hybridization has been successfully used for integration of genetic and physical maps in cotton, cacao and peanut (Lin et al. 2010; Saski et al. 2011; Yüksel et al. 2005).

Papaya and Arabidopsis belong to the order Brassicales and shared the last common ancestor approximately 72 MYA (Wikström et al. 2001). Unlike Arabidopsis, papaya has not undergone recent whole-genome duplication events (Ming et al. 2008; Lyons et al. 2008a, b; Tang et al. 2008). To facilitate comparative and evolutionary genomic study within Brassicales, a total of 2,277 overgo probes representing conserved sequences in Arabidopsis and genetically mapped Brassica loci were tested against 36,864 papaya BACs. A total of 1,329 overgos (58 % of overgos designed) detected positive BACs in papaya. After eliminating low-quality data, the remaining 1,181 overgos were anchored on the papaya FPC map. Among these overgos, 756 (64.0 %) hit single contigs; the average number of clones per overgo is 6.0.

In addition to overgo probes, 16 probes designed from single-copy loci of papaya were used to screen the papaya BAC library. A total of 153 positive BAC clones were obtained. These 16 single-copy probes were anchored on the FPC map. All the probes except papaya Pistillata (PI) gene hit single contigs. The probe of papaya Pistillata (PI) gene hit two contigs, ctg1350 and ctg577. Ctg577 contains only three clones and all of them are PI positive. Thus, ctg577 should be part of the ctg1350, but the high stringency setting failed to merge them into a single contig.

These anchored overgos and single-copy genes further improved the quality of the physical map and provide direct links among papaya, Arabidopsis, and Brassica genomes for comparative genomic research. These anchored overgos and single-copy genes could help reveal synteny and rearrangements in target regions of these genomes, particularly in these recently duplicated genomes of Arabidopsis and Brassica.

Integration of Physical Map, Genetic Map, and Genome Sequence

Physical map, genetic map, and genome sequence are independent genomic resources. Among these resources, physical map forms an intermediate between genetic map and genome sequence. Integration of physical map, genome sequence, and genetic map could help to correct errors and enhance the quality of each of these resources that are complementary to one another. Integration of these three sets of genomic information provides a valuable resource for comparative genomics, particularly for analyses of Brassicales (Freeling et al. 2008).

The papaya genetic and physical maps were integrated by mapping microsatellite markers derived from BES and whole-genome shotgun sequences (Yu et al. 2009). The sequence-tagged high-density genetic map of papaya was constructed by mapping 706 microsatellite markers (Chen et al. 2007). The resulting genetic map consists of nine major and three minor linkage groups. The three minor linkage groups, LGs 10–12, were merged with major linkage groups, LGs 8, 9, and 7, respectively, based on fluorescence in situ hybridization (FISH) (Wai et al. 2010).

Among the 706 mapped SSR markers, 153 were derived from BES, 466 were designed from shotgun sequence reads, and 87 were developed from assembled shotgun contigs. The BES-derived SSRs directly anchored the associated FPC contigs to the genetic map. Using the 153 SSRs derived from BESs, 122 FPC contigs containing 46,475 consensus bands and 97 shotgun scaffolds covering 132 Mb (35.5 % of the papaya genome) were anchored to the genetic map. The initial integration was extended by aligning BESs to the draft genome sequences. The remaining 553 microsatellites were positioned on shotgun scaffolds by blastn. The BAC clones with paired ends covering the mapped microsatellites were used as bait to search the FPC physical map to find the corresponding FPC contigs. The order and orientation of the FPC contigs were validated by aligning associated BESs on the shotgun scaffolds.

In the final integrated map, 55.6 % FPC contigs (535 FPC contigs) and 81.2 % BAC clones (21,371 BACs) were anchored on the genetic linkage map. A total of 255 shotgun scaffolds covering 233 Mb were anchored to the genetic map. Overall, 63 % of the papaya genome sequences were placed on the genetic map. The summary of the integrated map is shown in Table 10.3.

Table 10.3 Summary of the integrated genetic and physical map and genome sequence

Integration of Genetic and Cytogenetic Maps

Papaya chromosomes are relatively small and have similar size and morphology at metaphase stage (Storey 1953). Thus, it is difficult to identify individual papaya chromosomes using traditional karyotyping methods, such as banding pattern, arm ratio, morphology, and size. A set of chromosome-specific cytogenetic markers were developed to assist papaya chromosome identification and cytological analysis (Wai et al. 2010). The chromosome-specific cytogenetic markers were derived from the integrated genetic and physical maps. A total of 104 BAC clones containing SSR markers distributed on 12 linkage groups were tested for signals on papaya chromosomes using FISH. Fifty-four BACs producing repeatable and unambiguous FISH signals were selected as chromosome-specific markers (Wai et al. 2010). This set of chromosome-specific markers served as a foundation for papaya chromosome karyotyping and assigning linkage groups to chromosomes.

The papaya SSR genetic map consists of nine major linkage groups (LGs 1–9) and three minor linkage groups (LGs 10–12) (Chen et al. 2007). The nine major linkage groups were assigned to each individual chromosome by hybridizing LG-specific markers on papaya chromosomes (Wai et al. 2010; Zhang et al. 2010). The three minor linkage groups, LGs 10–12, were assigned to the major linkage groups by simultaneously labeling cytogenetic markers derived from the minor linkage groups with the LG-specific markers developed from the nine major linkage groups. Using this strategy, LGs 10–12 were assigned to the major LGs 8, 9, and 7, respectively (Wai et al. 2010).

A pachytene chromosome-based karyotype of papaya was developed with the assistance of chromosome-specific cytogenetic markers (Zhang et al. 2010). The chromosomes were numbered based on their length except X/Y chromosome pair (Table 10.4). In general, the papaya chromosomes have similar sizes as they were observed at metaphase stage (Table 10.4). The longest chromosome is about 58.08 μm and accounts for 12.8 % of the total length of papaya chromosomes, while the shortest chromosome is about 41.67 μm and accounts for 9.20 % of the total length of papaya chromosomes. The difference between the longest and shortest chromosomes is about 28 %.

Table 10.4 The physical size of papaya pachytene chromosomes and the length of their corresponding linkage groups

The papaya genome is composed largely of euchromatin distributed mainly on chromosome arms (Ming et al. 2008; Zhang et al. 2010). Approximately 83 % of the papaya genome is euchromatic, and 17 % is heterochromatic (Ming et al. 2008). The DAPI brightly stained heterochromatin is concentrated in the centromeric and pericentromeric regions (Zhang et al. 2010), which may explain the recombination suppression found in the middle of most linkage groups (Chen et al. 2007).

The nucleolus organizing regions (NOR), one of the major chromosome structure features, were anchored on the papaya genetic linkage map by FISH (Wai et al. 2010; Zhang et al. 2010). The 5S rDNA loci were located on chromosome 3, 5, 8, 9, and the Y chromosome (Zhang et al. 2010). The strongest signals of 5S rDNA were found at interstitial positions of chromosomes 5 and 8 (Wai et al. 2010; Zhang et al. 2010). The 25S rDNA was located at constriction sites of chromosome 4 (Wai et al. 2010; Zhang et al. 2010).

Genome Coverage

The genome coverage of the fingerprinted physical map was evaluated by estimating the average consensus band size and excluding the organelle genome contamination. BAC clones containing papaya chloroplast genome sequences were identified by hybridizing sorghum chloroplast ropB and trnK probes against papaya BAC grids (Yu et al. 2009). By searching the BES using the papaya chloroplast genome sequence as a query, additional BAC clones containing the papaya chloroplast genome were identified. The positive clones were scanned against the papaya FPC map, and FPC contigs 972 and 426 were identified as containing papaya chloroplast genome sequence.

The BAC clones containing papaya mitochondrial genome sequences were identified by searching the papaya BES database. A total of 356 BAC clones were identified with at least one end sharing over 95 % identity with the papaya mitochondrial genome sequence. Among them, 144 BACs were verified with both ends containing the papaya mitochondrial genome sequences. FPC contigs 867, 524, 1,172, 553, and 47 were identified as containing the papaya mitochondrial genome fragments.

Lengths of fingerprinted contigs are analyzed as consensus band (CB) units. To estimate the genome coverage of the fingerprinted physical map, the average band size was determined on 22 randomly selected, non-overlapping contigs (Yu et al. 2009). Based on the length of these 22 FPC contigs of 7,503 CB units and their physical size of 12,256,987 bp in the shotgun assembly, the average band size of the papaya FPC map was estimated at about 1.6 kb.

The total length of the papaya FPC map is 224,354 CB units. After excluding the FPC contigs containing papaya organelle genome fragments, the total length of the remaining contigs was 222,808 CB units and approximately equal to 356.5 Mb. Considering the genome size of papaya at 372 Mb (Arumuganathan and Earle 1991), the papaya physical map covers about 95.8 % of the papaya genome, and 72.4 % of papaya genome (269.15 Mb in total length) was anchored to the genetic map.

Genetic Recombination

Genetic maps are constructed by using recombination frequencies between markers. Thus, the distance between two loci on a genetic map reflects their recombination frequency, instead of an actual physical distance. On the other hand, a physical map is developed using restriction enzyme fingerprinting data which reflect the relative physical location of overlapped clones. The integrated genetic and physical map makes it possible to reveal the relationship between physical distance and genetic distance at the genome level.

The ratio between genetic and physical distance on the integrated map varies among chromosomes. Most recombination suppressed regions were observed near the center of linkage groups, while most recombination hot spots were found toward the ends of linkage groups. DAPI (4′,6-diamidino-2-phenylindole)-stained papaya chromosomes revealed highly condensed heterochromatin knobs located in the centromeric and pericentromeric regions of all nine pairs of chromosomes (Ming et al. 2008). Apparent lengths of the condensed heterochromatin knobs varied among the chromosomes (Ming et al. 2008). Consistent with papaya chromosome structure, recombination suppression regions were found near the center of all the major linkage groups, suggesting that those regions might be centromeric. Higher recombination rates observed toward the telomeric regions could reflect these regions favored by natural selection to contain beneficial alleles.

Extensively suppression of recombination was observed at the male-specific Y chromosome region (MSY) on LG1. The size of the suppressed region was estimated at about 8–9 Mb based on the integrated genetic and physical map. Based on pachytene FISH images, the physical location of the MSY is in the middle of the Y chromosome (Zhang et al. 2008), whereas the MSY was located on the upper half of LG1 on the genetic map (Chen et al. 2007). This discrepancy in locating the MSY is likely caused by the lower recombination rate on one arm of the Y chromosome (Chen et al. 2007; Wai et al. 2012). Also, pachytene FISH images consistently revealed denser heterochromatic regions on one of the Y chromosome arms (Ming et al. 2008). Suppression of recombination has spread to adjacent regions of the MSY, but recombination rates recovered gradually and then increased dramatically at about 10 Mb from the MSY.

Map-Based Cloning of the Major Gene Controlling Fruit Flesh Color in Papaya

Carotenoid antioxidants, which are also called provitamin A, are precursors of vitamin A that carries out a number of functions in the human body including serving as antioxidants, precursors of steroid hormones involved in growth and differentiation, and production of visual proteins (Paiva and Russell 1999; Krinsky 1998; Bartley and Scolnik 1995). Papaya is one of the top fruits that are rich in vitamin A and vitamin C (Liebman 1992). Promoting papaya consumption in underdeveloping and developing tropical and subtropical countries will significantly improve health of poor people as golden rice projected (Dawe et al. 2002). In papaya, the flesh color of the fruit is considered as a quality trait that is often targeted for papaya improvement because it correlates with nutritional value and is linked to shelf life of the fruit (Yamamoto 1964). Two fruit flesh colors, yellow and red, of papaya fruit are present in the papaya germplasm. Red-fleshed papaya is preferred by some consumers, but it is associated with a faster rate of softening thus a shorter shelf life.

The fruit flesh color of papaya is controlled by a single gene with yellow color as dominant (Hofmeyr 1938; Storey 1969). It has been previously documented that yellow-fleshed fruits have high amounts of the carotenoid β-cryptoxanthin and ζ-carotene but lack lycopene, while red-fleshed fruits have high amounts of lycopene along with β-cryptoxanthin, β-carotene, and ζ-carotene (Karrer and Jucker 1950; Yamamoto 1964; Chandrika et al. 2003). The red color of papaya fruit is due to the accumulation of lycopene; the yellow color is the result of converting lycopene to β-carotene and β-cryptoxanthin (Yamamoto 1964).

The high-density AFLP genetic map of papaya was constructed using 54F2 plants derived from cultivars Kapoho and SunUp (Ma et al. 2004). The map consists of 1,501 markers, including 1,498 amplified fragment length polymorphism (AFLP) markers, the papaya ringspot virus coat protein marker, morphological sex type, and fruit flesh color (Ma et al. 2004). The gene controlling the fruit flesh color was placed on linkage group 7 where it is flanked by two AFLP markers at recombination distances 3.4 and 3.7 cM (Ma et al. 2004). These closely linked AFLP markers were converted to sequence-characterized amplified region (SCAR) markers and used as probes to screen the papaya BAC library (Blas et al. 2010). Meanwhile, candidate gene approaches were carried out to clone the gene controlling the fruit flesh color. A tomato CYC-b cDNA was used as a probe to screen the papaya BAC library. Five BACs were identified by the SCAR marker and two by the tomato CYC-b cDNA probe. These seven positive BACs were used as seed BACs to search into papaya BAC-based physical map. Six of the positive BACs were mapped to a single fingerprint contig FPC-962 of the papaya physical map (Fig. 10.2). Co-localization of the fruit flesh-color-linked SCAR marker and the CYC-b cDNA probe on a single contig indicated that the papaya CYC-b orthologous gene was a strong candidate for controlling fruit flesh color. The papaya draft genome sequence integrated with FPC-962 was used to identify the papaya homologous gene of tomato CYC-b (named as cp CYC-b).

Fig. 10.2
figure 00102

FPC Ctg 962 of the papaya BAC-based physical map. The purple color box highlights the BAC clones identified by the cDNA probe of the tomato CYC-B gene. The green color box highlights the BAC clones identified by the genetic marker linked with flesh color

The two alleles, the dominant yellow-fleshed allele and recessive red-fleshed allele, were fully sequenced (Blas et al. 2010). A 2 bp insertion was identified within the red-fleshed CpCYC-b coding region, which produces a frameshift mutation resulting in a premature stop codon and a truncated coding region in red-fleshed CpCYC-b allele (Blas et al. 2010). The function of CpCYC-b alleles was tested by transformation into lycopene-accumulating E. coli strain (Cunningham et al. 1994). The gene product of the full-length yellow-fleshed CpCYC-b was able to mediate conversion of lycopene (red) to β-carotene (yellow) (Blas et al. 2010). While the red-fleshed CpCYC-b lost the lycopene β-cyclase activity as indicated by the lack of color change (Blas et al. 2010).

A SCAR marker located at 580 bp away from CpCYC-b gene was developed to support marker-assisted selection in papaya breeding programs. Using this marker, a simple PCR-based screening test using agarose gels can identify individuals in a segregating population with the desired fruit flesh color (Blas et al. 2010). Due to the extremely high recombination rate in the surrounding region, this simple PCR test will be able to identify fruit flesh-color genes with approximately 98 % certainty (Blas et al. 2010). A more precise test could be done at a higher cost and lower throughput using acrylamide sequencing gels by targeting the two nucleotides insertion of the red flesh CpCYC-b (Blas et al. 2010).

Perspective

The three major genomic resources, genetic map, physical map, and genome sequence, complement one another and correct errors from each individual source. The integrated information enhances the capacity for map-based cloning, identification of underlying genes controlling quantitative traits in papaya, and comparative genomics studies.

Although the draft genome sequence of papaya covers 92 % of the genes (Ming et al. 2008), most quantitative trait loci (QTLs) controlling economically and agronomically important traits in papaya have not yet been characterized, including those controlling sugar content, fruit size, shape, weight, and complex biotic and abiotic stress reactions. The integrated map in combination with genome sequence will expedite the mapping and cloning of target genes to facilitate papaya breeding through marker-assisted selection.

The current draft sequence of the papaya genome represents about 75 % of the papaya genome (Ming et al. 2008). With rapid progress in next-generation sequencing technology, finishing the papaya genome may be carried out in the foreseeable future. The papaya BAC-based physical map integrated with the genome sequences and genetic map will be an essential resource for closing gaps of any particular genomic region under investigation. The BAC clones located at the ends of the scaffolds can be used as baits to search the FPC physical map. For example, the gap between scaffolds 6 and 129 of the draft genome can be filled by FPC Ctg 355 (Fig. 10.3). The BAC clones on the minimal tilling path can be sequenced to fill the targeted gaps. Moreover, the integrated genetic and physical map could also provide a framework to guide the genome sequencing of related species.

Fig. 10.3
figure 00103

With assistance from BAC end sequences and BAC-based physical map, the gap between shotgun scaffolds 129 and 6 was filled by FPC Ctg 355, and the gap between shotgun scaffolds 6 and 29 was filled by FPC Ctg 662

The integrated genetic and physical map can serve as a reference to study gene and genome evolution and to reveal the genetic base of unique ecological adaptations. A total of 1,181 overgos representing conserved sequences of Arabidopsis and genetically mapped Brassica loci were anchored on the integrated genetic and physical map and the draft genome sequence of papaya. These overgos provide direct links among papaya, Arabidopsis, and Brassica genomes for comparative genomic research among species within the order Brassicales. The overgos were designed from single-copy genes and sequences of Arabidopsis and Brassica. Along with the FPC contigs and WGS sequences, overgo markers could help identify synteny and rearrangements in target regions of these genomes, particularly in these recently duplicated genomes of Arabidopsis and Brassica.