Introduction

The taste of most fruits is determined by a combination of sugar and organic acid levels, together with their characteristic volatile compounds. While most fruit have a strongly acidic pH, the sweet melons, Cucumis melo, are unique in that they have an unusual low acidity level in the mature fruit, and the pH values for cultivated sweet melons are in the near neutral range of >6. Therefore, melon fruit quality is determined primarily by sugar content alone (Yamaguchi et al. 1977). However, a tremendous range of genetic variability exists in this species and there are genotypes accumulating high levels of acid in the fruit, characterized by low pH of <5 (Burger et al. 2009; Kubicki 1962).

Genetic variability for both quantitative and qualitative levels of organic acids and fruit pH has been observed in other fruit species, as well. For example, genetic control for the level of fruit acid has been reported for peach (Etienne et al. 2002; Moing et al. 1998), citrus limes (Brune et al. 2002; Fang et al. 1997; Marsh et al. 2001) and tomato (Fulton et al. 2002; Stevens 1972). However, the genetic variability in melon is unique in terms of the extreme difference in acidity level between sour and non-sour genotypes (nearly two pH units, or [H+] differences of 102). The main organic acids that accumulate in most fruits, including melon (C. melo), are citric and malic acid (Ulrich 1970; Leach et al. 1989; Wang et al. 1996) and we have previously shown that the citric and malic acid levels of mature fruit from sour melon lines are indeed higher than the levels from near-isogenic non-sour counterparts (Burger et al. 2003).

The trait of fruit sourness in C. melo was initially described by Kubicki (1962), based on a cross between sour and sweet varieties and determined to be inherited as a single locus (So) with sour fruit dominant to non-sour. Danin-Poleg et al. (2002) mapped the trait of fruit pH as a single gene based on an F2 population derived from the inter-varietal cross of a sweet, non-acid cultivar ‘Dulce’ (reticulatus group) and a non-sweet, sour accession of the momordica group of melon, PI 414723.

We present here a detailed QTL analysis of fruit pH and organic acid contents based on a recently described F7–F8 RILs population (Harel-Beja et al. 2010) derived from the PI 414723 × ‘Dulce’ cross. Furthermore, we extend the significance of this major QTL to additional C. melo genotypes and show that the same locus determines fruit acidity in a broad spectrum of melon groups. In addition, we mapped nearly 60 genes encoding the complete metabolic pathway of organic acid metabolism, including the vacuolar proton transporters, and show that none co-localize with the major locus for melon fruit pH.

Materials and methods

Plant material

A population of recombinant inbred lines (RILs, designated 414xDul) was developed from a cross between PI 414723-S5 (C. melo var. momordica), an Indian cooking melon line with non-sweet acidic fruit, and ‘Dulce’ (C. melo var. reticulatus), an American muskmelon with non-sour, sweet fruit (Danin-Poleg et al. 2002). F2 individuals (n = 112) originating from two F1 plants were propagated by the single-seed descent method to create the population, consisting of 99 RI lines comprising a mix of F6, F7 and F8 generations, as described in Harel-Beja et al. (2010).

The two parents and their RI lines were grown in the open field in Newe Ya’ar, Israel, during the summer of 2006. Each line, in a completely randomized design, was represented by ten plants. The two parents were each grown in three replications of 10 to 12 plants. A single fruit per plant was harvested when the abscission layer developed.

Segregating F2 and F3 populations were also developed from the following two crosses: (1) ‘Noy Yizre’el’ (NY, C. melo var. cantaloupensis), a sweet, non-sour variety, crossed with ‘Faqqous’ (FAQ, C. melo var. chate), a non-sweet, sour variety; (2) ‘Sakata Sweet’ (SAS, C. melo var. conomon), a sweet, non-sour variety, crossed with ‘Doya’ (DOY, C. melo var. flexuosus), a non-sweet, sour variety.

Evaluation of fruit pH and organic acid analysis by HPLC

Harvested fruits were brought to the laboratory and sampled as follows. Approximately 1 g fresh weight of mesocarp tissue, taken from the center-equatorial portion of the fruit, was placed in 80% EtOH and stored at −20°C until extraction. The organic acids, together with the soluble sugars were extracted in 80% alcohol which was evaporated to dryness, and re-suspended in ddH2O. Following filtration through a 0.45 mm filter, analysis was performed by HPLC using an Aminex® Organic Acid Column (100 × 7.8 mm, BioRad Laboratories, Hercules, CA) with 0.008 N H2SO4 (pH 2.2) as running solvent at 1.0 ml min−1, according to manufacturer’s recommendations. The HPLC system consisted of an Agilent 1200 binary SL pump and PDA detector. Citric and malic acid were identified by retention time and quantified, compared to known standards.

An additional peak eluted on the Aminex® Organic Acid Column at the same retention time as a standard of succinic acid; however, an enzyme linked assay (Boehringer, succinic acid kit, UV method, cat # 176281) indicated that this compound was not succinic acid. To identify the compound it was sequentially purified from 40 g of cv. ‘Dulce’ mature fruit using the following three HPLC separations: (1) Bio-Rad Aminex® Organic Acid Column, described above and collecting the peak eluting at 14.6 min; (2) Restek Allure® Organic Acids Column (4.6 × 300 mm), 0.1 M phosphate buffer, pH 2.5, flow rate 0.5 ml min−1, and collecting the peak eluting at retention time of 28 min; (3) Dionex Acclaim® Mixed-Mode Wax-1 Column (4.6 × 150 mm), 50/50 v/v acetonitrile/50 mM phosphate buffer pH 6.0, flow rate 1 ml min−1 and collecting the peak eluting at 2.4 min. The UV/VIS spectrum was obtained by PDA of each of the collected peaks. The highly purified compound was identified as uridine, as follows.

Identification of uridine in ‘Dulce’ melon fruit

The identification of uridine in water extracts of melon flesh tissue (cv. Dulce), was performed by mass spectral analysis, carried out by the ultra-performance liquid chromatography coupled to a quadrupole time-of-flight (UPLC-qTOF) instrument [Waters, High Definition MS System (HDMS) Synapt], with the UPLC column connected on-line to a PDA detector (Waters, Acquity), and then to the MS detector, equipped with an electrospray probe. The settings of the apparatuses were as described by Itkin et al. (2011). Uridine was first putatively identified in the positive and negative modes, using the elemental composition, selected according to the accurate masses and the isotopic pattern using the MassLynx software, and MS/MS fragments were compared to those reported in the literature (Linden 2004; Kellner et al. 2011). Then the identification was verified by comparison of retention time, MS/MS fragments and UV spectral properties of the putative compound to those of the corresponding standard (Sigma, USA). The concentration of uridine in planta was quantified against a standard using the Bio-Rad Aminex® column, as above.

DNA preparation

DNA was extracted from young leaf tissue from ten plants of each of the parental lines and ten plants of each of the RI lines according to the preparation procedure described by Fulton et al. (1995), as recently described by Harel-Beja et al. (2010).

Organic acid metabolism genes: identification and mapping

Genes belonging to the organic acid metabolism pathway were selected based on the literature and the International Cucurbit Genomics Initiative database (Clepet et al. 2011; http://www.ICUGI.org). In addition, an exhaustive search for additional paralogues, based on BLAST analysis of a melon fruit EST database derived from a 454 transcriptome analysis (Portnoy et al. 2011) was also performed. Information related to these genes is summarized in Tables 1 and 2. Polymorphisms were identified based on comparative sequencing of the parental lines (Supplemental Table 1). In cases where no polymorphism was observed in the gene sequence, BAC clones harboring the gene were end-sequenced and polymorphisms identified in genomic regions along the BAC. To identify specific BACs that contained genes of interest, specific probes were designed for hybridization with the melon BAC library (MR1, CM_MBa, Clemson University, Clemson, SC). Standard hybridization procedures were used with a radioactively labeled probe on BAC library membranes (Luo et al. 2001). Positive BACs were re-checked by sequencing the BAC using the primers that had been used to prepare the probes. Representative PCR products were further sequenced to validate the results.

Table 1 List of organic acid metabolism genes in the melon genome referred to in this study
Table 2 List of transporter genes in the melon genome mapped in this study

Single nucleotide polymorphisms (SNP) genotyping was carried out by the two methods described in Harel-Beja et al. (2010): dCAP (dCAPS Finder 2.0, Neff et al. 2002) or Sequenom (San Diego, CA) MassARRAY Compact Analyzer, at the Cancer Research Center and Pediatric Hematology-Oncology, Sheba Medical Center, Tel Hashomer, Israel. Assays and primers were designed using the MassARRAY assay Design 2 software.

Map construction and QTL analysis

Mapping was performed using JoinMap® 3.0 software (Van Ooijen and Voorrips 2001). Markers were grouped at a minimum logarithm of odds (LOD) score of 4.0, and a recombination frequency value of 0.4. Linkage group (LG) designation was according to the recently merged melon map (Diaz et al. 2011), which includes the 414xDul map. QTL analyses were performed by the MapQTL® 5 software (Van Ooijen 2004), including: interval mapping (IM), multiple QTL model (MQM) and permutation analysis. QTL was significant when LOD score was higher than the LOD threshold calculated by permutation tests (1,000 permutation at p = 0.05; ≥2.3 LOD for all traits).

SSR markers

Simple sequence repeat (SSR) markers that were found to be adjacent to the pH trait (Harel-Beja et al. 2010) were screened in 14 F2 plants of each of the additional populations: NY × FAQ and SAS × DOYA. Marker CMCTTN181 which was polymorphic in the NY × FAQ was screed in the plants of this population while marker CMAT141 was polymorphic and was screened in the plants of SAS × DOYA.

Seven of the plants were of high pH and seven low pH. SSR genotyping was performed using the fragment analysis protocol of the AB3130xl Genetic Analyzer. In short, microsatellite loci were amplified by PCR using fluorescently labeled forward and unlabeled reverse primers. The PCR amplicons were separated by size using electrophoresis. The dyes labeled products were identified by fluorescence detection. GeneMapper® Software version 4 (Applied Biosystems) was used to size and genotype the alleles.

Primers used to amplify the SSR markers:

  • CMCTTN181F: CTCTCTGCAATTCTCGCC;

  • CMCTTN181R: CAACCATCCGCTTCACTC;

  • CMAT141F: AAGCACACCACCACCCGTAA;

  • CMAT141R: GTGAATGGTATGTTATCCTTG

Results

QTLs for pH and organic acids of ripe melon fruit

The 414xDul RILs population was analyzed for fruit pH, as well as content of the major organic acids, determined by HPLC (Table 3). Significant variability was observed for all of the traits between families as indicated by Welch ANOVA test. The distribution mode of the accumulation of the three metabolites among the RI population was normal according to the Shapiro–Wilk goodness-of-fit test. The acidic parent PI414723 was characterized by pH values of the extracted juice of <5.0 while the sweet ‘Dulce’ variety has fruit pH values >6.0. The two parents did not significantly differ in either their citric or malic acid concentrations in the present experiment. Fruit of the same lines grown in different seasons have, at times, shown higher levels of malic acid in the sour PI 414723 parent, compared to the non-sour ‘Dulce’ parent (e.g., spring 2000 season: PI 414723: citric, 2.78 + 0.44 mg/gfw, malic, 0.86 + 0.22 mg/gfw; ‘Dulce’: citric, 2.84 + 0.30 mg/gfw, malic, 0.28 + 0.10 mg/gfw). The non-sour ‘Dulce’ had significantly higher levels of a third metabolite visible under our chromatographic conditions. The non-sour ‘Dulce’ had significantly higher levels of a third metabolite visible under our chromatographic conditions.

Table 3 Organic acids and pH data of the parental accessions and the RI lines

In light of this observation we undertook to identify the third metabolite. MS, as well as UV analysis identified the peak component as uridine and chromatographic co-elution of an uridine standard under three chromatographic separation systems, further confirmed the identification (Fig. 1). The uridine contents of the RILs were calculated according to the Aminex chromatographic separation of an uridine standard. Absolute amounts of uridine were ~three orders of magnitude lower than the major organic acids but there were nevertheless significant differences in uridine content between the acidic and non-acidic genotypes.

Fig. 1
figure 1

MS spectrum of a uridine standard and b unidentified melon fraction. Insets are of UV spectrum (left) and extracted ion chromatogram (EIC) (m/z = 245) (right)

The characteristics of pH, citric, malic and uridine were analyzed in the RILs mapping population of nearly 100 F7 and F8 families and QTLs for the traits were mapped (Table 4). A major QTL, on LG8, and with a LOD score of >35 was observed for pH. Co-localizing with this QTL were major QTLs for each of the two organic acids (LOD of ~7–8 for both citric and malic) and a QTL with LOD of >10 for uridine. In addition, there were minor QTLs for citric acid on LG4, LG11 and LG12, and QTL for malic acid more proximal on LG 8. Interestingly, all the minor QTLs for organic acid levels were positively additive for the low acid ‘Dulce’ parent and only the major QTL on LG8 contributed acidity in correlation with the parental genotypes.

Table 4 Significant organic acids, uridine and pH QTLs identified by interval mapping analysis

Mapping and co-localization of genes involved in organic acid metabolism and transport

An exhaustive list of 58 melon genes encoding approximately 20 enzymatic reactions of organic acid metabolism, together with genes for vacuolar proton transport and organic acid transport, was compiled. Polymorphisms between the two parents of the RILs mapping population were identified for 53 of the genes (Supplemental Table 1). Polymorphisms included both indels and SNPs and determination of the genotypes of the ~100 RILs were performed by a mix of gel separations, standard sequencing reactions and Sequenom pyrosequencing technology. In many cases, polymorphisms were not observed in the coding region of the gene and were identified in flanking genomic sequences, either following identification of BACs harboring the gene of interest or searching for flanking regions of the genes in the genome of the highly syntenous cucumber. For five genes (see Supplemental Table 1) no polymorphisms were identified in any of these regions.

Each of the 53 genes clearly mapped to one of the 12 linkage groups of melon (Fig. 2). The genes were randomly distributed throughout the genome, showing no indication of clustering. With regard to co-localization with the pH and acid traits, none of the genes co-localized with the major QTL on LG8. A single instance of co-localization was observed, between a minor QTL for pH on LG11 and the gene for one of the ATP-citrate lyase paralogues, ACL2. None of the other minor QTLs showed evidence of co-localization with a structural gene for acid metabolism and transport. The identification of BAC clones harboring the genes allowed us also to map 15 BAC clones, contributing to the future physical map of melon (Supplemental Table 1).

Fig. 2
figure 2

Genetic map of the 414 × Dul RI population that includes QTLs for fruit acidity and candidate genes. Linkage group (LG) numbers are according to Diaz et al. (2011). Distances in centiMorgans from the top of each LG are marked on the left side and marker names are on the right side of each linkage group. Fruit acidity candidate genes are in red. Significant QTLs are presented by lines and for QTLs with LOD >5.0, LOD scores in the highest 10% are indicated by dark black lines within the broader QTL

Universality of the major QTL in C. melo

To determine whether the same locus determines fruit pH in other C. melo genetic backgrounds, we tested the closest linked SSR markers for the pH trait (CMCTTN181, CMAT141, located 2 and 3 cM from the trait, respectively; Harel-Beja et al. 2010) in two additional inter-varietal segregating populations. The F3 populations, derived from the low acid Noy Yizre’el and the acidic Faqqous, and from the low acid SAS and the acidic DOY, were analyzed for fruit pH and seven plants from each of the two phenotypes (high and low acidity) of each of the two populations were genotyped for the pH-linked markers. The CMAT141 marker was polymorphic and informative for the Noy Yizre’el and Faqqous population while the CMCTTN181 marker was polymorphic and informative for the SAS × DOY population. In both populations, there was strong linkage between the marker and mature fruit pH (Fig. 3) and the high pH segregants were all homozygous for the recessive allele derived from the high pH parent while the low pH segregants were either heterozygous or homozygous for the low pH allele.

Fig. 3
figure 3

Genotyping of SSR markers in 14 F2 plants and the parental lines using the fragment analysis option of 3130xl Genetic Analyzer. Each line is the amplicon of one F2 plant or parental line (indicated). On the left is an electrophoregram of the amplicons obtained using the primers of CMCTTN181 in the NY × FAQ population (red); on the right is an electrophoregram of the amplicons obtained using the primers of CMAT141 in the SAS × DOYA population (black). Peak location represents the size of the various alleles

Discussion

The results of this study clearly point to a single major gene controlling acid levels in melon fruit. The results of the analysis of three different segregating populations, whose parents derive from different subgroups of the C. melo species, further extends the significance of this locus to the species as a whole. The crosses between PI 414723 × Dul, SAS × DOY, NY × FAQ together comprise selections from both subspecies of C. melo, subspecies agrestis and subspecies melo. Together, the six parental varieties represent the melon groups reticulatus, momordica, conomon, flexuosus, cantalupensis, and chate, described by Pitrat et al. (2000). The presence of the same QTL in such diverse genetic backgrounds suggests that the trait is determined throughout the species by variability in the same gene and is indicative of a major role of the evolution of this gene in determining this important domestication trait of the sweet dessert melons.

Although there was a single major QTL on LG8 which showed very high LOD scores for all the traits involved in fruit pH, there were also QTLs for the traits at other map positions. However, these were minor QTLs with low LOD scores and their physiological and genetic significance is not clear. Interestingly, the minor QTLs on LG 4, 11 and 12 behave in the opposite direction as would be expected and in each case the ‘Dulce’ allele is correlated with an increase in acidity by increasing citric acid. This may indicate that there exists the possibility of transgressive modulation of fruit acidity by combining alleles from the two parents. This may be evidenced by the transgressive segregation in the RILs for pH values, as well as the three components assayed (Table 3). Nevertheless, the major QTL on LG8 is responsible for the overwhelming contribution to fruit acidity.

Previously a study of QTLs for pH and organic acids in melon based on near introgression lines (NILs) derived from the cross of a conomon variety, PI161375, and an inodorus cultivar, Piel de Sapo (Obando-Ulloa et al. 2009). Since these two parents are both low acid lines the QTL results did not indicate a major gene for this trait; nevertheless a minor QTL was reported (pHqf8.4, Diaz et al. 2011; Obando-Ulloa et al. 2009) in the region and may indicate genetic variation in the same locus even between lines of similar pH. However, since the report was based on NILs the introgression was relatively large and comparative co-localization cannot be performed.

The identification of well-defined QTLs for pH, accompanied by concomitant differences in organic acid content allowed for the performance of a co-localization test to determine whether any of the QTLs can be accounted for by enzymes of organic acid metabolism. In this study, we focused on a comprehensive list of enzymes involved in organic acid metabolism in melon, including all of the members of individual gene families of both enzymes and proton transporters (Tables 1, 2). The enzymes comprise 19 different reactions, many of which are composed of multiple protein subunits. Some of the enzyme families comprise multiple isozymes which are associated with distinct cellular compartmentalization (Schnarrenberger and Martin 2002). Finally, some of the enzyme families are also encoded for by gene paralogues with apparent tissue functionalization. The proton transporter genes comprise those encoding 13 subunits of the vacuolar ATP-dependent proton transporter, as well as two different PPi-dependent proton transporters.

We identified polymorphisms for practically all of these genes and mapped 53 of them, including the major genes which are significantly expressed in developing melon fruit, as indicated by the public melon EST database (http://www.ICuGI.org). To reduce the possibility that one of the five unmapped genes co-localizes with the pH trait we performed a synteny analysis between map positions of the corresponding cucumber homologs and the mapped melon genes (Supplemental Fig. 1, Supplemental Table 2). None of the five cucumber homologs map to a syntenous position of the melon pH locus on LG8 near the melon ICD1 and ICD2. Since there are numerous chromosomal rearrangements between melon and cucumber we cannot conclude that these genes do not co-localize with the pH locus; nevertheless, it would seem highly unlikely.

The results of this co-localization mapping study point to a striking absence of co-localization between the candidate genes coding for organic acid metabolism and the QTLs for the components of fruit acidity. Only a single instance of potential co-localization was observed, between the minor QTL for pH on LG11 and the gene for a ATP-citrate lyase paralogue. Further analysis of fine-mapping populations and sub-lines will be necessary to discern the contribution, if any, of this gene to fruit acidity. The closest instance of co-localization with the pH locus is with the citric acid cycle gene ICD1 (mitochondrial NAD-isocitrate dehydrogenase-α) but the QTL analysis based on the RILs population separates the two by nearly 10 cM.

The negative results with regard to the co-localization of candidate genes and phenotypic traits are not altogether surprising. In our previous study of melon fruit quality QTLs, which focused on QTLs for fruit carotenoid and sugar contents, a total of over 50 candidate genes encoding the enzymes of the sugar and carotenoid metabolic pathways were mapped and none co-localized with any of the significant QTLs for either carotenoid or sugar contents (Harel-Beja et al. 2010). This included 13 genes from the carotenoid biosynthetic pathway and over 30 genes for the sugar metabolism pathway (Cuevas et al. 2008, 2009; Harel-Beja et al. 2010). Similar results have been reported for tomato in which the quantitative component of carotenoid content variability was also not found to be associated with the structural carotenogenesis genes in tomato (Liu et al. 2003). However, genetic control of qualitative differences in carotenoid composition, such as in high β-carotene (Beta) or δ-carotene (Delta) lines, is determined by structural genes of the carotenoid biosynthetic pathway (Liu et al. 2003). Similarly, the strategy of co-localization of candidate genes was successful in identifying genes for the qualitative differences in anthocyanin metabolism and ascorbic acid accumulation in tomatoes (De Jong et al. 2004; Stevens et al. 2007). Thus, it appears more likely for genes encoding metabolic pathway enzymes to control qualitative, rather than quantitative, variation. Furthermore, many of the QTL-candidate gene co-localization studies, particularly in tomato (e.g., Baxter et al. 2005; Bermudez et al. 2008; Causse et al. 2004; Schauer et al. 2006) have been based on the analysis of NILs which allows for the co-localization at the level of bins, harboring many centimorgans of introgression. Our co-localization studies utilized a RILs population, allowing for a relatively fine-tuned co-localization analysis and the reduction of possible false-positives in candidate gene determination.

The identification of the nucleoside uridine as a significant component inversely associated with melon fruit acidity was unanticipated and, at present, we cannot explain its physiological significance, or the mechanism of the interaction with organic acid accumulation. Uridine has not previously been reported for melon, or for other fruits, to the best of our knowledge. There have been recent reports of uridine levels in ziziphus fruits (Guo et al. 2010), ginseng roots (Qian et al. 2008) and fritillaria tubers (Cao et al. 2010) and the absolute amounts are similar to those we observed, in the μg/gm range. In contrast, the levels of organic acids are in the mg/gm range, ~1,000-fold higher, and it is likely that the uridine content contributes little to the fruit pH but is rather an, as yet, unexplained effect of the pH gene. Alternatively, the possibility exists, based on the present data, that the uridine QTL is an independent QTL closely linked to the pH QTL. Further detailed comparative metabolic analyses, together with the functional identification of the pH gene should shed light on this surprising observation. Nevertheless, the results of this study suggest that the biological function of the pH gene is not at the level of organic acid metabolism. Together with the observation that the parental lines differ greatly in pH values but not necessarily in citric and malic acid levels, also supports this suggestion.

In conclusion, the mapping results indicate that an as yet unidentified gene, not one of the obvious candidates involved directly in acid metabolism or vacuolar proton transport, determines fruit acidity in the melon fruit in the broad range of C. melo germplasm. The identification of this gene will likely shed light on the elusive topic of fruit acidity and on the evolution under domestication of the sweet dessert melon.