Introduction

Cucumber (Cucumis sativus L.; 2n = 2x = 14)—belonging to the family Cucurbitaceae—is a very important vegetable crop worldwide. Owing to its diverse sex types and the extensive physiological and genetic studies conducted on it, cucumber is becoming a model plant for sex determination research (Atsmon 1968; Tsao 1988; Perl-Treves 1999; Tanurdzic and Banks 2004). Cucumber plants generally produce male and female flowers separately in the same individual; however, certain lines also produce bisexual flowers. In the early stage of development, all floral buds contain staminate and pistillate primordia. The selective arrest of either the staminate or pistillate part results in female or male flowers, respectively. Further, if the arrest does not happen, hermaphrodite flowers are formed. (Atsmon and Galun 1960; Goffinet 1990; Perl-Treves 1999).

Depending on the ratio of the three types of flowers produced in the plant, the cucumber lines are classified into five phenotypes: monoecious, gynoecious, andromonoecious, hermaphrodite, and androecious. The monoecious lines—the most common cucumbers—bear male and female flowers separately. The gynoecious lines produce only female flowers, while the androecious cucumber plants bear only male flowers. Male and bisexual flowers are produced in the andromonoecious cucumber plants, while the hermaphrodite lines produce only bisexual flowers (Galun 1961; Shifriss 1961; Kubicki 1969; Malepszy and Niemirowicz-Szczytt 1991). Early genetic studies by Galun (1961), Kubicki (1969), and Robinson et al. (1976) have indicated that three major genes are responsible for sex expression and segregation in the cucumber plant—F/f, M/m, and A/a. The F gene may promote femaleness, while the M gene is responsible for the unisexuality of the whole plant. Further, along with the homozygous recessive F/f gene, the recessive a gene can intensify the androecious nature.

In addition to the effects of the three major genes, sex expression can be easily regulated by many environmental factors such as photoperiod and temperature and plant hormones such as IAA, GA, and ethylene (Atsmon 1968; Flankel and Galun 1977; Flankel and Galun 1977, 1983; Perl-Treves 1999; Yamasaki et al. 2005). Base on the “One-hormone hypothesis” (Yin and Quinn 1995) and the study by Yamasaki et al. (2001), it can be proposed that the F locus regulates the ethylene levels promoting the development of the pistillate primordial state. Further, M locus production mediates the inhibition of stamen development by ethylene. The ethylene signal is not transmitted if the genotype is m homozygous. As a result of this, stamen development is not inhibited and the plant produces hermaphroditic flowers.

A series of studies (Kamachi et al. 1997; Trebitsh et al. 1997; Kamachi et al. 2000; Yamasaki et al. 2003; Mibus and Tatlioglu 2004; Knopf and Trebitsh 2006; Saito et al. 2007) have been conducted to investigate the F/f gene. These studies have shown that CsACS1G, which encodes a key enzyme of ethylene synthesis, is the candidate gene for the F/f locus. However, studies on the M/m gene are not as advanced as those on the F/f gene. Kater et al. (2001) have provided evidence proving that sex determination in cucumber is restricted to specific whorls, thereby suggesting that the active point of the M-allele product may be in the third whorl of the flowers. Hao et al. (2003) and Terefe and Tatlioglu (2005) identified an anther-specific DNase and a putative nucleotide sugar epimerase, respectively; both of these are considered to be the downstream products of the M-allele.

The genetic base of cucumber is rather narrow (Dijkhuizen et al. 1996), and polymorphism in cucumber varieties is lesser than that in other vegetables such as tomatoes. In this study, by combining the bulked segregant analysis [(BSA) Michelmore et al. 1991] and the sequence-related amplified polymorphism (SRAP) technology developed by Li and Quiros (2001), we identified eight markers linked to the M/m locus in our plant population. Among them, the two closely linked SRAP markers flanking the M/m locus were co-dominant marker ME1EM26 and dominant marker ME1EM23. Further, the co-dominant marker ME8SA7 co-segregated with the M/m locus. With the chromosome walking method using the cucumber genomic bacterial artificial chromosome (BAC) library, we successfully developed a co-dominant SCAR marker S_ME1EM23 from the ME1EM23 sequence. Along with the other two co-dominant SCAR markers S_ME1EM26 and S_ME8SA7 (developed from ME1EM26 and ME8SA7, respectively) in a larger segregating population (900 individuals), we located the M/m locus between S_ME1EM26 (5.4 cM) and S_ME1EM23 (0.7 cM) and S_ME8SA7 co-segregated with it.

Materials and methods

Plant materials

Cucumber monoecious, indeterminate line S52 (inbred line derived from a southern Chinese local cultivar, ffMM) was crossed with the hermaphrodite, indeterminate inbred line H34 (European greenhouse type, FFmm), and F1 was self-pollinated to obtain 167 individuals of F2 plants in 2005. The following year, we used the same parental lines to obtain a larger F2 population (670 plants) and a BC1 population (230 plants, F1 was backcrossed with H34). The parental plants, F1, F2, and BC1 were grown under natural sunlight in a greenhouse at Shanghai Jiaotong University. From the time of initiation of flowering, two parents (ten plants each), F1 (15 plants), and individuals of the F2 and BC1 segregating populations were scored for the sex expression of the flowers at every node up to 30 or 35 nodes on the main stem and the primary lateral branch. A plant with normal bisexual flowers was considered to be a bisexual plant (hermaphrodite or andromonoecious). If this was not the case, it was considered to be a unisexual plant (monoecious or gynoecious).

DNA extraction and BSA design

The young leaves of individual cucumber plants were harvested, rapidly frozen in liquid nitrogen, and stored at −80°C. For the BSA design, ten unisexual (MM or Mm) and ten bisexual (mm) plants were randomly selected from the F2 population (167 individuals) in 2005, and were mixed in equal fractions to construct two DNA bulks. The genomic DNA of all materials (S52, H34, F1, F2, BC1, and two bulks) was extracted by the CTAB method (Clark 1997), dissolved in water, visualized after electrophoresis in 0.8% agarose gel in 1× TAE, diluted to 30 ng/μl by comparing with DNA standards (Promega, Shanghai, China), and maintained at −20°C until use for the PCR reactions.

SRAP analysis

The SRAP primers were according to the studies by Li and Quiros (2001), Ferriol et al. (2003), Li et al. (2003), and Wang et al. (2005) and also by Vos et al. (1995), and Xu et al. (2000), where they were used as AFLP markers. These AFLP primers were used in the SRAP method in the present study, and the resulting polymorphic bands were considered to be SRAP markers. Table 1 indicates the polymorphic SRAP primers defining the molecular markers used in the study. The primers were synthesized by Sangon Biological Engineering Technology & Service Co. (Shanghai). The PCR reaction for SRAP was started at 94°C for 3 min, followed by eight cycles at 94°C for 30 s, 35°C for 30 s, and 72°C for 1 min; 35 cycles at 94°C for 30 s, 50°C for 30 s, and 72°C for 1 min; and ended by extension at 72°C for 5 min. The products were separated on 4% denatured polyacrylamide gel with 1× TBE buffer and 8 mol/l urea. Electrophoresis was performed at a constant power of 50 W for 1.5–2 h. After electrophoresis, the gel was stained with AgNO3 solution (Bassam et al. 1991).

Table 1 The primer sequences of polymorphic SRAP markers used in this study

Conversion of SRAP markers

The DNA bands corresponding to the SRAP markers linked closely to the M/m locus in the F2 population (167 individuals) were excised from the gel, washed three times with 20 μl ddH2O, and incubated at 95°C for 15 min. After slight centrifugation, the upper phase was used for re-amplification that was performed in a volume of 50 μl. The PCR reaction was started at 94°C for 3 min, followed by 30 cycles at 94°C for 30 s, 50°C for 30 s, and 72°C for 1 min and ended by extension at 72°C for 5 min. The resulting PCR products were purified using the EZ-10 Spin Column DNA Gel Extraction Kit (Bio Basic Inc.) according to the manufacturer’s instructions. The reclaimed DNA fragments were cloned in the T/A-cloning vector pUCm-T (Fermentas) and transformed into the Escherichia coli strain DH5α. Clones of the expected size were sequenced using the T7/M13 primer manufactured by Sangon Biological Engineering Technology & Service Co. (Shanghai). The sequences obtained from an ABI 3700 Sequencer (ABI, China) were used to design the SCAR primers. Based on the co-dominant features of the SRAP markers ME1EM26 and ME8SA7, the primer pairs were designed to amplify the region containing the polymorphism regions between the two parents, and the new SCAR markers derived from them were named S_ME1EM26 and S_ME8SA7, respectively. With regard to the dominant SRAP marker ME1EM23, we first designed two specific oligonucleotides—P1F and P1R—from both ends of the SRAP fragment. The amplifications for all the SCAR markers were same as those for the SRAP analysis, with the exception of the cycling condition: 94°C for 3 min, followed by 35 cycles at 94°C for 30 s, 55°C for 30 s, and 72°C for 30 s and ended by extension at 72°C for 5 min. The products were analyzed in 3% agarose gel in 1× TAE for S_ME8SA7 and 4% denatured polyacrylamide gel in 1× TBE for S_ME1EM26.

Chromosome walking

To develop a co-dominant SCAR marker from the ME1EM23 marker, a cucumber genomic BAC library was used to determine the DNA sequence flanking the known fragment. The library which containing approximately 19,200 Hind III clones was constructed with an inbred line derived from a northern Chinese monoecious cultivar (ffMM) by a previously reported procedure (Guan et al. 2008). After screening with the PCR method, one of the clones containing the specific amplification fragment derived from the primer pair P1F and P1R was sequenced by the Beijing Genomics Institute (Beijing). P1F and P1R were also used as the sequencing primers. Therefore, both the resulting sequences contained the same specific amplification fragment as the SRAP marker ME1EM23. From both ends of the enlarged region, another primer pair—P2F and P2R—was designed to amplify the corresponding fragment in the two parental plants. With the method described in the subsection “Conversion of SRAP markers”, we developed a co-dominant SCAR marker S_ME1EM23 from the SRAP marker ME1EM23. The detection was the same as that for S_ME1EM26. Table 2 indicates all the SCAR primers defining the molecular markers used in the study.

Table 2 SCAR markers linked to the M locus

Linkage analysis and marker nomenclature

MAPMAKER/EXP3.0 (Lander et al. 1987) was used for linkage analysis with a LOD threshold of 3.0 or more. The recombination percentages were converted to genetic distance by using Kosambi mapping function (Kosambi 1944). The polymorphic markers were named according to their corresponding primer or primer combination codes.

Results

Genetic analysis of sex expression in the cucumber plant

All the 15 plants of F1 obtained by crossing between S52 with H34 were gynoecious (unisexual plants), thus indicating that the development of unisexual flowers was dominant. Further, all the three (F2:2 and BC1:1) segregating populations were used to analyze the inheritance of the sex expression; the results are shown in Table 3. All these results confirmed the monogenic dominant inheritance of the unisexual (monoecious or gynoecious)/bisexual (hermaphrodite or andromonoecious) trait in the cucumber plant, which is generally defined as the effect of the M/m locus.

Table 3 The χ2 test for the three segregating populations

SRAP markers linked to the M/m locus

Of the 736 SRAP primer pairs tested, 360 primer combinations generated polymorphic bands (48.9%) between the two parental plants. On further testing with the DNA bulks, eight SRAP primer combinations—ME8SA7, ME1EM23, ME1EM24, ME1EM26, DC1EM9, ME42OD17, ME23SA4, and ME67 (with polymorphic band amplified from a single primer)—exhibited polymorphic bands between the unisexual and bisexual bulks. All the markers were mapped in the F2 population (167 individuals, 2005). The preliminary linkage analysis performed using MAPMAKER/EXP3.0 revealed that the two closely linked SRAP markers flanking the M/m locus were ME1EM26 (5.4 cM) and ME1EM23 (1.0 cM). The SRAP marker ME8SA7 co-segregated with the locus. Since their relatively loose linkage values, others markers’ pattern without any development showed in Fig. 3.

Development of SCAR markers

The primer pair ME1EM26 and ME8SA7 designed to amplify the region containing the polymorphism between the two parents behaved like co-dominant markers. Therefore, their conversion to SCAR markers was easy. The new markers were named S_ME1EM26 (GenBank Accession Number EU716639, EU716640) and S_ME8SA7 (GenBank Accession Number EU716641, EU716642), respectively. Both of them maintained their co-dominant pattern between the two parental plants. For the S_ME1EM26 marker, the sequence of the unisexual allele was 116 bp—3 bp longer than the bisexual allele (Supplementary material S1). Further, for the S_ME8SA7 marker, the unisexual allele had a 149 bp fragment—16 bp shorter than the bisexual allele (Supplementary material S2 and Fig. 1).

Fig. 1
figure 1

PCR amplification of the co-dominant SCAR marker S_ME8SA7 in the parental lines S52 and H34, F1 progeny, and 14 F2 plants (random samples)

The analysis of the S_ME1EM26 sequence from the unisexual allele against the translated database (tblastx, Altschul et al. 1997) revealed that it has 81% similarity with the putative acetyl-CoA synthetase in Arabidopsis thaliana (AK118300.1), and its nucleotide sequence has a homologous relationship (75%) with a Vitis vinifera sequence (AM455130.2) determined through the blastn analysis in NCBI (http://www.ncbi.nlm.nih.gov). These results indicated that the S_ME1EM26 fragment may be a part of a gene sequence, and it is conserved in planta. However, in case of the S_ME8SA7 marker, homology to a known sequence in the NCBI database was not found.

After the polymorphism test, the SRAP marker ME1EM23 showed a dominant pattern. Through the sequencing of the unisexual allele band, we designed the first pair of primer (P1F and P1R) from both ends of the fragment. However, the primary dominant SCAR marker derived from the first primer pair was not found to be very reliable during the linkage analysis for the validation of the marker with the same F2 population (167 individuals, 2005). Therefore, a chromosome walking approach was used to identify the flanking sequence adjacent to the marker. We successfully obtained more than 1.1 kb flanking sequence that showed a 1 bp and a 3 bp insertion/deletion between the two parents (GenBank Accession Number EU716643, EU716644), thereby enabling the development of the co-dominant SCAR marker S_ME1EM23. The S_ME1EM23 marker sequence of the unisexual allele was 227 bp—4 bp longer than the bisexual allele (Supplementary material S3 and Fig. 2). The homology analysis did not reveal any known sequence.

Fig. 2
figure 2

PCR amplification of the co-dominant SCAR marker S_ME1EM23 in the parental lines S52 and H34, F1 progeny, and 20 F2 plants (random samples)

Fine mapping of the M/m locus

Three co-dominant SCAR markers were used to analyze the enlarged segregating population (670 F2 individuals and 230 BC1 plants) for fine mapping the M/m locus. The linkage pattern of S_ME1EM26 did not change. Further, no recombinant events were detected between the S_ME8SA7 marker and the M/m locus, thus indicating that S_ME8SA7 was still co-segregated with it. Owing to its co-dominant characteristic, S_ME1EM23 could detect more information than the previous dominant SRAP marker, and the linkage value was modified to 0.7 cM. The final result of the markers linked to the M/m locus is shown in Fig. 3. After mapping in the segregating population, we tested the co-segregation feature between the S_ME8SA7 marker and the M/m locus among several different cucumber plant lines. The co-segregation pattern was found to be conservative in at least five other cultivar lines (Fig. 4).

Fig. 3
figure 3

Genetic linkage map of the M locus generated in recombinant population (900 individuals) from S52/H34. Genetic distance in centimorgans (cM) was calculated using the Kosambi function

Fig. 4
figure 4

SCAR analysis among 7 different cultivar cucumber lines with S_ME8SA7. Head indicates the polymorphic bands amplified by S_ME8SA7 between S52 (left) and H34 (right). Arrowhead represents the amplification from the cultivar hermaphrodite line “Lemon”. All the other lines were unisexual

Discussion

As a relatively new marker system, SRAP has been used in genetic studies on plant species such as Brassica oleracea (Li et al. 2003), Cucurbita pepo (Ferriol et al. 2003), and Brassica rapa (Rahman et al. 2007). In these tested plant species, in comparison with other randomly amplified markers, the SRAP markers were found to be distributed more evenly throughout the whole genomes. This is suitable for gene localization, genetic map construction, and transcription mapping (Li et al. 2003). In the present study, we modified the SRAP analysis (5–8 cycles) according to Li and Quiros (2001) and obtained a good result for the cucumber plant with the silver straining method. Among the eight SRAP markers linked to the M/m locus, the marker ME1EM24 was first mapped to the same location as the marker ME1EM26 (data not shown), and further sequencing (GenBank Accession Number EU716645) confirmed that the two markers were derived from the same location of the cucumber genome (Supplementary material S4). Their “ME1” ends were the same; however, the “EM24” end was located in the interior of the ME1EM26 sequence. Therefore, we used the marker ME1EM26 to represent this polymorphic locus. On aligning the marker ME1EM26 and primer EM24 sequences, we found the genomic sequence at 11 nt from the 3′ end of the primer EM24. This incidental finding may explain why SRAP technology is more reliable than RAPD for identifying polymorphism. During the first 5 or 8 “lower” annealing cycles—considered to be “non-special” steps—the condition enables a “shorter” sequence (approximately 10 nt from the 3′ end) in the SRAP primer for promoting amplification, which is similar to RAPD. However, there are two different primers in SRAP analysis; therefore, the polymorphism may be twice or more than in RAPD technology. Interestingly, based on the combinations with other primers, we obtained the SRAP marker M67 whose polymorphic band was amplified by a single primer. The next 35 “higher” annealing cycles may “specially” amplify the products of the previous “non-special” cycles. With a moderate addition of the “lower” annealing cycles (5–8 cycles), we achieved a better polymorphism identification (48.9%) than the previous design derived from Li and Quiros (2001) in the cucumber plant with the silver straining method.

Since a dominant marker may supply less genetic information, the conversion from a dominant marker into a co-dominant marker is very valuable. Several researchers (Barret et al. 1998; Bradeen and Simon 1998; Negi et al. 2000; Noguera et al. 2005; Liu et al. 2006; Rahman et al. 2007) have successfully converted their dominant markers into co-dominant markers, such as RAPD markers, AFLP markers, and SRAP markers. A dominant marker sequence cannot supply sufficient information for a co-dominant conversion, which is based on the insertion or deletion fragment located in any of the two lines. Therefore, the additional flanking region needs to be sequenced. It has been confirmed that the adaptor ligation-based PCR-mediated walking (Mibus and Tatlioglu 2004) and inverse PCR method (Knopf and Trebitsh 2006) are efficient in isolating the flanking sequence in the cucumber plant. At the initial stage, based on the sequence of SRAP marker ME1EM23 obtained from S52, we designed several primer combinations for detecting polymorphism between the two parents. After optimizing the annealing temperature, we obtained the special primer pair P1F and P1R. However, when tested in the F2 population, the primary dominant SCAR marker derived from this primer pair was not very reliable. Next, by using the BAC library-mediated walking strategy, approximately 1.1 kb sequence containing the SRAP marker ME1EM23 was obtained, thus indicating that the polymorphism of the primary dominant SCAR marker derived from a nucleotide variance was located in the 3′ end of the primer P1R (Supplementary material S3). Based on the enlarged allele region of the two parental plants, we found insertion/deletion polymorphic sequences and successfully developed a co-dominant SCAR marker. Interestingly, the final marker did not contain any nucleotide derived from the primary SRAP marker.

The characteristic of one gene locus controlling the unisexual expression is unique in the Cucurbitaceae plants, particularly cucumber (Cucumis sativus L.) and melon (Cucumis melo L.). With the potential evolutionary course, the unisexual phenotype may be a gain-of-function mutant during the long-term breeding selection, which is not rare in the cucumber plant. Knopf and Trebitsh (2006) reported that the gynoecious-specific CsACS1G—mapped to the F locus—is a result of a relatively recent gene duplication and recombination between CsACS1 and CsBCAT. Therefore, clarifying the mechanism and function of unisexual expression has remarkable evolutionary significance.

Engelke et al. (1999) have reported an approach for isolating the M/m gene with differential display reverse transcription-polymerase chain reaction (DDRT-PCR). Przybecki et al. (2003) and Terefe and Tatlioglu (2005) attempted to isolate the M/m gene transcript by using similar methods; however, the results were not satisfactory. More recently, Saito et al. (2007) have reported that CsACS2 mRNA begins to accumulate just beneath the pistil primordia of flower buds at the bisexual stage, but not prior to the formation of the pistil primordia. Similarly, based on the study by Kater et al. (2001), transcription of the M/m gene may be transient or may have spatio-temporal localization. Therefore, the transcript may not be easily detected. On the other hand, the inheritance of the M/m locus is beyond dispute; therefore, cloning the M/m gene with the positional method appears to be feasible. In this study, we identified certain markers closely linked to the M/m locus. No recombinant events were detected between the marker S_ME8SA7 and the M/m locus, and the linkage value of the marker S_ME1EM23 was 0.7 cM. Further, the co-segregation feature between the marker S_ME8SA7 and the M/m locus was conservative in certain cultivar lines. These markers will be particularly useful for fine-mapping studies and ultimately for the cloning of the M/m gene.