Introduction

Melon (Cucumis melo L.) is an economically important horticultural crop and is a diploid (n = 12) with a genome size of 450 Mb (Arumuganathan and Earle 1991). Melons belonging to the two subspecies melo and agrestis were previously grouped under seven morphotypes: cantaloupes, honeydew, Casaba, Persian, Santa Claus, Crenshaw and Juan Canari (Munger and Robinson 1991; Robinson and Decker-Walters 1997). According to the morphological observations of Jeffrey (1980) and Stepansky et al. (1999) vars. cantalupensis (cantaloupe) and inodorus (honeydew) should be placed in subspecies melo and vars. momordica, conomon, dudaim, and chito in subspecies agrestis (Decker-Walters et al. 2002). Pitrat (2008) grouped melons into 15 morphotypes (cantalupensis, reticulatus, adana, chandalak, ameri, inodorus, chate, flexuosus, dudaim and tibish (in subsp. melo), and momordica, conomon, chinensis, makuwa, and acidulous in subsp. agrestis. Many of the taxonomic groups are widely disputed as their accessions share genealogies of both melo and agrestis and are heterogenous, exhibiting mixed features (Blanca et al. 2012). However, molecular studies strongly supported the sub-specific division (Deleu et al. 2009; Stepansky et al. 1999) and Blanca et al. (2012) suggested reclassifying some of the morphotypes back to their respective subgroups based on molecular genetic relationships.

There is little information about melon germplasm from eastern Europe, particularly on how it is related to melons from Asia and the western hemisphere. Eastern European melons have a known cultivated history of about 2,500 years, as excavations from Chersonese, located on the outskirts of Sevastopol on the Crimean peninsula of Ukraine, have unearthed fossilized melon seeds (Pangalo 1958). Melons are known to have been cultivated in the territory of Ukraine and Russia for about 25 centuries, and the genealogy records of these melon types have been traced to eastern and central Asian lineages (Pyzhenkov and Malinina 1994). Currently the particular market group of melons that are broadly known as adana occupy about 50 thousand hectares throughout eastern Europe (Nimmakayala et al. 2009). Based on genome-wide transcript-based SNP profiles, eastern European germplasm was considered as basal to the taxonomic group of cantalupensis (Blanca et al. 2012).

Current research has been designed to take advantage of the abundance of melon genetic diversity to perform association mapping (syn. linkage disequilibrium mapping) to resolve complex trait variation and identify quantitative trait loci (QTL). This approach offers three advantages over traditional linkage analysis: (1) increased mapping resolution; (2) increased efficiency (reduced resources), and (3) the ability to identify diverse alleles (Yu and Buckler 2006; Risch and Merikangas 1996; Nordborg and Tavare 2002). Association mapping has been widely used in all the major crops including maize, rice, barley, tomato, wheat, sorghum, sugarcane, soybean, grape and several tree and grass species (Zhang et al. 2011). It has proved to be very effective for mining new genes if the population structure can be resolved accurately to reduce spurious associations due to confounding effects of subpopulations. For estimating population structure (Q) and relative kinship matrix (K), the multi-allelic and codominant microsatellites can be very useful as they are selectively neutral (Zhu et al. 2008) and recent when compared to single nucleotide polymorphisms (SNPs) (Remington et al. 2001). A unified mixed model approach for association mapping in combination with Q and K provides a dependable and robust system for identification of reliable QTL (Zhao et al. 2007). Most ideal background markers that are required to accurately estimate Q and K are multi-allelic microsatellites when compared to the biallelic SNPs in the onset of association mapping (Zhu et al. 2008; Yang et al. 2011). Zhu et al. (2008) further argued that a good starting point for simple sequence repeat (SSR) marker number can be four times the chromosome number of the species.

Melon is an important desert fruit with tremendous diversity that is a product of consumer preferences from different countries, ecologies and cultures. Understanding footprints of divergence and genomic footprints of adaptation underlying the formation of various morphotypes is very important to the development of disease-resistant and high-quality melons. A basic understanding of population structure and linkage disequilibrium (LD) is limited and has lags behind that of other crops. Characterization of population structure and LD are paramount for association mapping of the QTL that underlie various complex traits (Yu and Buckler 2006). The distribution pattern of LD across the genome directly depends on evolutionary forces such as genetic drift, population structure, levels of inbreeding across the genome and the map regions contributing genetic differentiation among the subpopulations. Emergence and maintenance of LD is based on such evolutionary forces and pattern of selection (Ersoz et al. 2007; Xiao et al. 2012).

We used mapped single locus SSRs to resolve population structure in diverse melon accessions, as this research is the first that has adapted association mapping in melons. As genomic technologies continue to evolve, high-density coverage of SNPs will be feasible in the near future. Currently genome-wide SNPs could be resolved only for a limited set of melon accessions. When compared to the other marker systems such as SNPs, SSRs are relatively new alleles and exhibit higher rates of mutation, increasing the opportunity for unique length variants to have arisen after domestication (Remington et al. 2001). Consequently, the SSR-derived datasets can resolve the most recent population structure when compared to SNPs (Flint-Garcia et al. 2005).

This current study aimed to resolve the genetic diversity and relatedness of eastern European melon germplasm with the melons of Asia and the western hemisphere using microsatellite markers that are mapped to various linkage groups. Our other objective was to compare LD among the three groups (Eastern Europe, Asia and Euro-North American) across the linkage groups and to perform association mapping for various fruit traits.

Materials and methods

Thirty-eight accessions of melons belonging to a wide geographical region of Eastern Europe were included for the molecular diversity analysis along with 24 accessions of cantalupensis, three inodorus, and 22 Asian collections, makuwa, dudaim and memordica (Table 1). Eighty-seven accessions that were previously selfed for the three seasons were grown in three replications during three seasons (2005–2007) using a row-to-plant spacing of 140 × 70 cm. Twenty plants per accession were grown per replication. Data were collected pertaining to fruit weight (kg), fruit length (cm), fruit diameter (cm), fruit shape (the ratio of fruit length and diameter), soluble solids concentration (measured as % Brix with a hand-held refractometer manufactured by Atago Co. Ltd.), flesh thickness (cm) and rind pressure in kg/mm2 (measured using a FT 011 penetrometer).

Table 1 List of accessions used in the current study

Marker resources

The 268 single-locus SSRs used in the study are from various linkage groups of a published consensus map (Diaz et al. 2011). SSRs that had several nonspecific bands and SSRs with multiple loci were carefully eliminated from the study. Thirty-two of the primer combinations were from the linkage group (LG) 1 that spanned a length of 99 cM, along with 11 from LG2 of length 94 cM, 19 from LG3 of length 95 cM, 40 from LG4 of length 119 cM, 14 from LG5 of length 11 0 cM, 27 from LG6 of length 98 cM, 24 from LG7 of length 99 cM, 28 from LG8 of length 123 cM, 11 from LG9 of length 84 cM, 16 from LG10 of length 73 cM, 20 from LG11 of length 80 cM and 26 from LG12 of length 76 cM.

DNA isolation and PCR conditions

The genomic DNA isolation was carried out using the procedure of Nimmakayala et al. (2009). The PCR reaction consisted of 200 ng of genomic DNA, 0.20 μM of mixed forward and reverse primers, 1× Buffer (10 mM Tris–HCl pH 8.2, 50 mM KCl, Triton 0.1 %, BSA 1 mg/ml), 1.5 mM MgCl2, 0.2 mM dNTPs and 1 U Taq polymerase in 10 μL reaction volumes. Amplification was performed in a GeneAmp PCR 9700 System thermal cycler (Applied Biosystems Inc.) programmed to 94 °C for 2 min followed by 35 cycles of 94 °C for 30 s, 50–65 °C for 30 s, 72 °C for 1 min, and a final extension step at 72 °C for 10 min. Amplified products were separated on 3.5 % SFR-grade agarose gels. In scoring, stutters were avoided and discernible bands were scored as alleles. Allele sizes were estimated for the survey panel based on comparison with a 50-bp molecular weight ladder that was loaded twice on each SFR-grade agarose gel.

Data analysis

Molecular genetic diversity, neutrality and gene flow

The genetic variance was partitioned between and among the melon groups using the analysis of molecular variance (AMOVA) in the program Arlequin 2.0 (Schneider et al. 2000). Molecular genetic diversity was estimated using Nei’s (1973) genetic diversity and heterozygosity (h). F IS and F ST were estimated based on Wright’s F statistics (Weir and Cockerham 1984) using the program PopGene version 1.31 (Yeh et al. 1999). For each locus, polymorphism information content (PIC) was calculated using the Excel Microsatellite Toolkit (Park 2001). Analysis of gene flow (Nm) based on the private allele method (Slatkin 1985) was estimated using the program Genepop version 4.1 (Rousset 2008). The Ewens–Watterson test (Manly 1985) uses the significantly different observed and expected Hardy–Weinberg homozygosities assuming that various loci are at mutation-drift equilibrium. Fluctuation in the distribution of observed F when compared to the mean is suggestive of balancing selection (if above mean) or selective sweep (if below mean) and even distribution if the populations are under equilibrium. The Ewens–Watterson neutrality test (Ewens 1972; Watterson 1978) was performed using Manly’s algorithm (1985) implemented in the software PopGene version 1.31 (Yeh et al. 1999) to investigate neutrality across the linkage groups. This is to test whether the observed homozygosity, calculated as the sum of squared allele frequency, is significantly higher or lower than the expected homozygosity based on simulation under neutrality expectations. This approach could suggest whether selection is in operation on a particular genomic region across the linkage groups (Ejsmond et al. 2010; Xiao et al. 2012).

Population structure analysis and validation

Structure version 2.2 (Pritchard et al. 2000) which calculates posterior probability of the data for a given K, Pr(X|K) was used to cluster individuals into K groups. The number of populations (K) was determined following the admixture model with correlated alleles, with a K of 2–20. Five independent runs of 50,000 Markov chain Monte Carlo (MCMC) generations and 50,000 generations of burn-in were used for each value of K. The optimal K value was determined by the posterior probability [ln P(D)] and an ad hoc statistic ΔK based on the rate of change in [ln P(D)] between successive K (Evanno et al. 2005). The number of Ks in each dataset was evaluated using ΔK values according to Evanno et al. (2005) using the software Structure Harvester.

Linkage disequilibrium

LD was estimated as the r 2 between all the pairs of SSRs with 1,000 permutations using the package TASSEL version 2.2 (http://www.maizegenetics.net) (Bradbury et al. 2007). Rare alleles with allele frequency of 0.5 and less were removed from the dataset before analysis. LD was estimated for global, linked and unlinked markers. The 99th percentile of r 2 distribution for unlinked markers was considered as the background level of LD, which determined whether LD is due to physical linkage (Mather et al. 2007). The decay of LD along with the map distance was estimated according to Mather et al. (2007). LD decay over genetic distance was investigated by plotting pair-wise r 2 values against genetic distance (cM) between the markers. To visualize LD throughout the genome, heatmaps were produced based on the P values of pair-wise r 2 estimates of all marker pairs. These heatmaps were used to identify LD blocks in various melon groups at specific map locations.

Association mapping

The marker set after removing minor allele frequencies was used to estimate a kinship (K) matrix using the software TASSEL 2.1 that uses the proportion of alleles shared between each pair of accessions in the study. The Q matrix was adapted from the K-3 cluster result of the analysis obtained from Structure version 2.2. A mixed linear model (MLM) can significantly reduce the spurious marker–trait associations (Type I error showing false positives) resulting from population structure, as Q and K are used as covariants in the analysis. A false discovery rate (FDR) of 0.05 was used as a threshold for significant association using highly stringent Bonferroni correction (Sidak 1967) as well as moderate Benjamini and Hochberg correction (1995).

Results

Phenotypic data and correlations of fruit traits

Phenotypic data recorded on 87 accessions for three different seasons is presented in Supplementary Table 1. The highest heritability (broad sense) was noted for fruit shape (0.82) and fruit diameter (0.81) when compared to the other traits. The lowest heritability (0.56) was recorded for the trait fruit weight. Significant positive correlation was noted between fruit weight and flesh thickness (0.79), fruit length (0.75) and fruit diameter (0.87). Flesh thickness was also noted to be in positive association with fruit diameter (0.82) and fruit length (0.66). Fruit shape and fruit length were significantly associated with a correlation coefficient of 0.66. Soluble solids content and rind pressure were not correlated with any of the other traits in the study (Supplementary Table 2).

Allele frequency, gene flow and molecular genetic diversity among the melon collections

Analysis of molecular variance of the three groups of eastern European (EE or adana), CI (cantalupensis and inodorus) and Asian (makuwa, dudaim and memordica) accessions indicated highly significant differences (P ≤ 0.0001) across (17.52 % of variance) and within (68.08 % of variance) groups. Allele numbers amplified ranged from 2 to 10. Nei’s expected heterozygosities are better indicators of divergence and are presented in Supplementary Fig. 1. PIC (polymorphic information content) values were in the range of 0.7–0.88 for 49 loci, 179 loci had PIC values in the range of 0.4–0.69 and the rest of them were below 0.4 (Supplementary Fig. 2). The distributions of F ST, which is population genetic differentiation, and F IS, an indicator of inbreeding depression or fixation index, are presented across the entire linkage map in Fig. 1. Twenty-four loci had F ST in the range of 0.3–0.64, indicating that these markers contributed the majority of the divergence among the three groups, 106 loci had F ST in the range from 0.1 to 0.29 and the rest of them were below 0.1. F IS was above 0.9 for 128 loci, indicating that these loci are fixed among the three groups (Fig. 1). Genetic divergence (Nei 1973) analysis indicated that EE and CI diverged by 20 %, EE and Asian 15 % and CI and Asian 23 %. The neighbor-joining (NJ) tree based on Nei’s genetic diversities resolved three robust clusters with some exceptions, of which adana is notable in the middle position in the tree and further split into two subclusters, and on the top and bottom of the tree were clusters of Asian and CI groups, respectively (Fig. 2). The CI cluster had two subclusters of which one is purely cantalupensis and the other is a mixed group containing accessions from cantalupensis, both the inodorus that are in the study, Pepsha of adana and K-5886 of makuwa. The middle adana group had the first subcluster, which is predominantly adana type containing cultivars that are early (known as europeus) and mid-maturity (aestivalis) types. The second subcluster of adana group was with rest of the aestivalis and winter accessions (late maturity and known as haemalis) along with a mixture of various other morphotypes, viz. Rocky Ford Green flesh (USA), Charentais (France) and Old original Israel (Israel) of cantalupensis, both the dudaim types in the study (Plum Granny and Queen Anne’s Pocket of USA) and K-5360 (China) and Raketa (a collection from far east Russia) of makuwa, clustered at the bottom of second subcluster of adana. The NJ tree had a third cluster predominantly with makuwa and momordica from China and India with the sole exception of Gorkovskaia310 of adana group. The population structure analysis revealed that K-3 is the most appropriate grouping that can be assumed as it had the highest ΔK (Supplementary Fig. 3). The Nm estimates (private allele method) is to identify private alleles and number of migrant alleles per population (Barton and Slatkin 1986). This test revealed that the mean frequency of private alleles p(1) = 0.107956 per group and number of migrant alleles (Nm indices) ranged from 0.13 to 19.1 with an average of 7.72 per locus. An overall Ewens–Watterson test for neutrality was carried out for all the linkage groups to understand the genomic regions which are under selection sweep (Supplementary Fig. 4). A comprehensive view of melon genetic diversity and its footprints can be appreciated after taking into consideration the collective distribution of F ST, F IS and selection sweep across various linkage groups. For example, linkage group 12 has undergone remarkable selection sweep (the curve of the observed mean is significantly below the curve of the mean in Supplementary Fig. 4) when compared to the other linkage groups, where observed F is often fluctuating above the mean. Intriguingly, F ST distribution for LG12 is the lowest of all linkage groups and was in the range of 0–0.2 throughout, which indicates that the markers on this LG contributed less to population genetic differentiation because of selection sweep. In contrary, F IS distribution across LG12 was the highest, implying fixation of alleles and inbreeding (Fig. 1).

Fig. 1
figure 1

Distribution pattern of F ST and F IS across the length of the melon genetic map

Fig. 2
figure 2

Molecular genetic relationships as depicted on neighbor-joining tree and population structure among melon accessions (numbers on top of tree clusters are bootstrap values)

Distribution of linkage disequilibrium across various groups

The total of significant LDs was 3,405 (based on r 2, P < 0.05) out of 35,778 pairs, which includes both inter- and intrachromosomal combinations. The extent of LD was significant among 425 marker pairs (based on r 2, P < 0.05) out of 1,342 intrachromosomal pairs of SSR loci, which is 31.7 %. LD decay in melon varies across the linkage groups when diverse melons of all the three groups are included. However, a general trend can be drawn using the LOESS line, which indicated a sharp fall of LD at 5 cM. To start with we defined a base line at which the significant LD values intercept the LOESS curve and defined the mapping distance at which LD decays in melon. LD decay for the entire collection is at 5 cM (Supplementary Fig. 5) and for individual groups 5, 5 and 4 cM for EE, CI and Asian, respectively. LD distribution patterns across various linkage groups as represented by P values of marker associations of adana group is presented as a heatmap (Fig. 3) and similar heatmaps for the CI and Asian groups are in Supplementary Figs. 7 and 8. Position and length of significant LD blocks present across the melon LGs among various groups are presented in Supplementary Table 3.

Fig. 3
figure 3

LD pattern in adana group of melons

The third block of LG1 was at 55 cM and spread across 3 cM between the markers TJ26 and ECM58. Linkage group 1 was noted to have three major LD blocks, of which the first block in Eastern European and Asian groups spans 1 cM and is between the markers CMAAGN207 and CMAAGN221 located at the 8 cM position of LG1. This corresponding block in the CI group is larger than the other groups spanning between 7 and 9 cM positions of LG1 for 2 cM and is located between the markers ESM163 and CMAAGN221. A second block of the EE group was detected at 36 cM of LG1 and further spread across 4 cM and located between the markers CMN53_36 and CMCTN86 and was larger than the corresponding block of the CI group. This block in CI was detected at 36 cM of LG1 spread across 2 cM between the markers CMN53_36 and CMN23_44. Intriguingly, this block is absent in the Asian group. A third block of LG1 in the EE group was at 55 cM and spread across 3 cM between the markers TJ26 and ECM58. In contrast, this block in the CI group was at 56 cM and spread across 1 cM between the markers TJ26 and CMMS4_3. The corresponding block in the Asian group is located around 57 cM and between CMMS4 and CMN22_22, which is clearly a weaker and smaller block with reduced significance of P < 0.01.

In the current study, no LD block was detected on LG2. The block on LG3 in the EE group was located at the 41 cM position between CMBR083 and CMBR056 spanning up to the 4 5 cM position, which is slightly smaller than the block of CI and Asian. Corresponding blocks in CI and Asian were located at the 41 cM position between CMBR083 and CMBR018 spanning up to the 46 cM position. Locus CMBR100 decayed at the middle LD block in the Asian group. LG4 has one major block at the 71 cM position with the size of 1 cM in EE and CI groups, located between CMBR094 and CMBR104. Its corresponding block in the Asian group is exactly the same size but decayed in the middle at CMBR072. LG5 has a block at 26 cM of LG5 spanning 3 cM distance and located between CMAGN61 and CMAGN52 in CI only. A block at 77 cM of LG6 with flanking markers CMBR143 and CMBR108 was located in CI and Asian groups, which is completely absent in the Eastern European accessions.

Current study detected two discernible blocks on LG7, of which one is located at 56 cM and spread up to 58 cM between CMATCN184 and CMAGN141. The second block of LG7 was located between the 65 and 66 cM positions with flanking markers CMBR021 and CMGAN21. The current study detected two blocks on LG7, of which one is located at 57 and 58 cM between CMBR053 and CMBR027 in the EE group. The corresponding blocks in CI and Asian were located between 56 and 57 cM between CMBR012 and CMBR027. The second block of LG7 in all the three groups was located at the 65 cM position with flanking markers CMBR021 and CMBR084. The LD block on LG8 is located at the 16 cM position between CMBR109 and CMBR098, which is same in all the three groups that were studied in the current research. LG9 had one block in the Asian group at 38 cM, which was spread up to 42 cM with flanking markers CMN04_19 and DM0030 at either side. This block was completely absent in the EE and CI groups. Two blocks were detected on LG10. The first block of EE and CI is of 1 cM size and located at the 27 cM position and found between CMBR055 and MU6549. The longest second block of LG10 was at 52 and 56 cM with flanking markers CMCT134B and CMTC134 and was present in all the three groups. Out of two blocks of LG12, one was at 38 cM between CMN62_03 and CMN09_76 and present only in the Asian group. The other block on LG12 was seen at 61 cM between the markers CMCTTN259 and CMBR051, in exactly similar orientation among all the three groups of melons.

Diversity of phenotypic traits

Association mapping to locate QTL for various fruit traits

A MLM was used in the current study to locate QTL for three consecutive years. Marker significance was adjusted using Bonferroni’s correction. For 268 tests at an alpha of 0.05, the adjusted P = 0.0002206, and for alpha of 0.01 the adjusted P = 4.41E−5. Lists of markers that are linked to various traits are given along with their corresponding R 2 and P values across all the years in Table 2. A majority of the QTL detected are repeated across years, indicating the robustness of the identified QTL. Among the major QTL, for the trait fruit shape, which is estimated as a ratio of fruit length and width, CMN-21_82 and CMBR061of LG4 located at 42 and 47 cM, respectively, could explain around 50 % of the variation in two consecutive years and CMBR27 of LG7 explained 14 % in one year. MU3594 of LG8 was linked with a R 2 value of 0.7441–0.893 in all 3 years. This marker is highly significant and could withstand Bonferroni cut-off. The same marker revealed highly significant association with fruit length (with R 2 values of 0.257, 0.3546 and 0.3856) for all three years. As the fruit length is one of the contributing traits to estimating fruit shape, the common association of MU3594 with both the traits reveals the cogency of the current research. Five QTL were detected for fruit diameter on chromosomes 1, 6, 7, and 11 with a cumulative contribution that amounts to 41 % of the total variation. Soluble solids is another important trait which was studied all the three years and in the current study; seven different markers cumulatively explained about 25 % of the variation. Two strong QTL were detected for rind pressure.

Table 2 List of markers linked to various traits and their R 2 and associated P value

Discussion

Our study provides useful information pertaining to morphological and molecular characterizations of melons specific to adana collections in relation to the other world collections. The eastern European melon group, which is an adana morphotype known for its sweetness and high flavor, contains diverse accessions and consists of the subgroups aestivalis, europeus and hiemalis which are heterogenous but can be distinguished. Desetnaia 5, Dachnitza, Lipneva, Pepsha and Samarskaia are predominantly aestivalis group and have furrowed fruits. The fruit color of aestivalis varies from green to dark green in color while immature, changing to a yellowish brown at maturity. The europeus group had accessions with netted fruit surfaces with white colored flesh and is predominantly early-type (Nimmakayala et al. 2009). The subgroup hiemalis is a non-climacteric fruit and hence possesses long shelf life. Another distinguishing feature is the diversity in sex expression within the adana morphotype. The sex expression in europeus and hiemalis groups is andromonecious or monecious, whereas in aestivalis, the majority of collections are andromonecious (Tomason 2002). Melon groups cantalupensis and inodorous have sweet flesh and are cultivated in Europe and the USA. Groups momordica, dudaim and conomon have low-sugar flesh with smooth skin, small seed type and are cultivated in South and East Asia (Akashi et al. 2002; Tanaka et al. 2007). Similar to the results of the current study, melon accessions were classified by several researchers using molecular markers into Euro-American melons (cantalupensis and inodorous) and Asian melons (Silberstein et al. 1999; Stepansky et al. 1999; Staub et al. 2000; Tanaka et al. 2007) with the comparable genetic divergence. The current study revealed that the eastern European types are genetically close to Asian types and the Asian and EE types are more diverse than the CI group. Sebastian et al. (2010), after studying various African and Asian species, concluded that Asia is the center of origin for melon and as such it contains enormous genetic diversity. Blanca et al. (2012) and Nimmakayala et al. (2009) considered adana type to be ancestral to cantalupensis group. The variability found in this group can be effectively used for introgression into the CI group as this is a relatively underexploited reservoir of genetic variability for improving disease resistance, yield and fruit quality.

This study is the first attempt at detecting LD in the melon genome. The common occurrence of several LD blocks across the three melon groups suggests that LD mapping is feasible for locating genes underlying complex traits in the diverse melon germplasm. Polymorphic SSRs used in the current study were found to be very valuable for evaluating population structure among the three groups of melons used in the current study. Similar to our study, SSRs proved to be very useful for accurately resolving population structure in many other crops (Maccaferri et al. 2010; Huang et al. 2002; Matsuoka et al. 2002; Garris et al. 2005; Liu et al. 2012; D’hoop et al. 2010; Jin et al. 2010; Malysheva-Otto et al. 2006; Zhang et al. 2011). Microsatellite markers generally have higher power than SNPs even if large numbers of SNPs are used (Ohashi and Tokunaga 2003) for initial genome-wide screening, and SNPs should be used for fine-scale mapping after the screen.

A MLM was used in the current study for three consecutive years to locate QTL with the subsequent correction using Bonferroni’s method. The most intriguing part of the current study is that many of the QTL previously published using the recombinant mapping approach could be validated. Validation of the QTL identified by the other researchers makes the current study robust and very reliable. We summarized all the QTL identified in the current study and validated them by comparing previous studies using the recombinant mapping approach in Supplementary Table 4. For example, Eduardo et al. (2007) characterized a near-isogenic line population developed from the cross Piel de Sapo × PI161375 to locate QTL for fruit shape on linkage groups 1 (CMCCA145 at 52 cM and CMCTN4 at 99 cM), LG3 (CSWCT10 at 29 cM and TJ10 at 84 cM), LG7 (CMAGN75 at 38 cM and CMGA15 at 96 cM), LG9 (CMCT47 at 14 cM and CMCTN7 at 62 cM) and LG11 (CMCT160a at 10 cM and CMGA104 at 61 cM) positions. In the current study, the fruit shape QTL identified linked to CMCTN4 at 99 cM of LG1, CMBR105 at 42 cM of LG3, between CMBR27 at 57 cM of LG7, DMO431 at 44 cM of LG9, CMATN121 at 53 cM of LG11 are the same markers or the markers that are located near the markers identified by Eduardo et al. (2007). For fruit diameter, QTL identified by Zalapa et al. (2007), Paris et al. (2008), Harel-Beja et al. (2010) and Xu (unpublished) are the same markers identified in the current study. For fruit length, the QTL positions identified by Eduardo et al. (2007) and Harel-Beja et al. (2010) are the same as in the current study. QTL positions for fruit weight identified by Eduardo et al. (2007) and Harel-Beja et al. (2010) are again comparable to positions identified in the current study. Four QTL linked to soluble solids content were in the same position identified by Eduardo et al. (2007) and one QTL that was previously located by Monforte et al. (2004) could be revalidated in the current study. Minor discrepancies in the position, strength and the variation explained by the QTL can be bridged if more melon groups and accessions are involved in future studies combined with high-throughput marker panels. Results pertaining to selection sweep and further resolution of LD can be more accurately predicted when high-throughput marker panels such as SNPs are used on a larger set of diverse melon accessions.