Introduction

Gray leaf spot is a warm weather foliar disease. The causal agent Stemphyllium spp. has a wide range of hosts including important crops such as tomato (Li et al. 2010; Nasehi et al. 2012a), cucumber (Vakalounakis and Markakis 2013), garlic (Zheng et al. 2008), sweet potato (Chai et al. 2015), pepper (Kim et al. 2004; Nasehi et al. 2012b), eggplant (Nasehi et al. 2013) and spinach (Everts and Armentrout 2001; Koike et al. 2001). This disease has been widely reported in tomato in many countries including America (Hendrix and Frazier 1949), Israel (Rotem et al. 1966), Korea (Min et al. 1995; Kim et al. 1999), Venezuela (Cedeño and Carrero 1997), Malaysia (Nasehi et al. 2012a, b) and China (Li et al. 2010). Tomato gray leaf spot usually occurs and develops under warm air temperature and high humidity (Kim et al. 1999). It mainly affects the leaves of tomatoes, starting with the oldest leaves. In the early stage, small and dark spots can be seen on both surfaces of the leaves, and then, it enlarges and turns a grayish brown. Eventually, the centers of the spots crack and fall out, and surrounding leaf areas will turn yellow and the leaves will dry and drop. Tomato gray leaf spot does not directly affect fruit, but damage to foliage reduces fruit quality and yield.

The pathogen causing tomato gray leaf spot mainly include four species: S. solani Weber, S. floridanum Hannon and Weber, S. botryosum Wallr., and S. vesicarum (Wallr.) Simmons (Bashi et al. 1973; Blancard et al. 1986). An outbreak of this disease in China was first reported in the early 2000s. More than 40% of the 30,000 greenhouses in Yutai County, Shandong Province were infected in 2002 and increased to 90% in 2003 (Liu et al. 2004). Since then, the prevalence of this disease has caused severe yield losses throughout the country (Li et al. 2010), especially in protected cultivation facilities tomatoes in spring season. A survey was performed in main tomato growing areas and two species, S. solani and S. lycopersici, were isolated from infected tissues of tomato plants growing in different seasons and localities in China. S. solani was identified as the epidemic species (Li et al. 2010). The fungal spores are spread by wind and water, and can survive for a long time in soil and on plant debris and weeds. Under favorable environmental conditions, S. solani can spread at an alarming pace, and the compatible host is devastated within several days (Li et al. 2010). It is difficult to control the disease using traditional agricultural practices and agrochemicals. Thus, the use of resistant cultivars is the most acceptable, environmentally-friendly and economically feasible method to control the disease.

Resistance to gray leaf spot was identified in Solanum pimpinellifolium accession PI 79532 (Andrus et al. 1942) and conferred by a single incompletely dominant gene, Sm (Hendrix and Frazier 1949), which is linked to the fusarium wilt race 1 resistance gene, I, with a crossover rate of 36% (Dennett 1950). In an incipient study, the Sm gene was mapped to chromosome 11 between restriction fragment length polymorphism markers TG110 and T10 at distances of 4.1 cM and 6.8 cM from the markers, respectively (Behare et al. 1991). In a recent study, Yang et al. (2017) mapped Sm gene in a interval of 0.26-Mb region on chromosome 11 based on bulked segregant analysis in combination with genome resequencing. However, the physical positions of the target gene from the two studies are not consistent. Until now, no further research has been reported on mapping or cloning the Sm gene. The most noteworthy is the Sm gene confers resistance to all four species of Stemphylium, and there are no reports of new virulent races overcoming the resistance since its introgression (Bashi et al. 1973; Blancard et al. 1986; Behare et al. 1991). Durable disease resistance is the ultimate goal of many breeding programs. There are some major R-genes, such as Sr2 and Lr34 in wheat, Mlo in barley, have proved durable over many years of agricultural use (Spielmeyer et al. 2003; Krattinger 2009; Jørgensen 1992). Therefore, cloning of Sm resistance gene will not only enable design of perfect markers for marker-assisted selection (MAS) in breeding, but also provide us a valuable tool to understand molecular mechanisms of host resistance against the pathogen.

Materials and methods

Plant materials

Two tomato inbred lines, ‘9706’ (KR2) and ‘Heinz 1706’, were used as parental lines to develop a segregating population for mapping of the Sm gene. The resistant line ‘9706’ is from the Institute of Vegetables and Flowers, and its genome has been resequenced (Lin et al. 2014). ‘Heinz 1706’ is the susceptible line for which a draft genome assembly is available (http://solgenomics.net). The resulting F1 plants were self-pollinated to generate the F2 population. A total of 441 F2 plants were used for inheritance analysis and fine mapping of the Sm gene. Subsequently, a larger F2 population, with 4730 individuals, was subjected to a recombinant screening in batches and the selected F2 recombinants were tested for gray leaf spot resistance. To further confirm the phenotype, 315 F3 plants derived from 15 F2 recombinants were evaluated for gray leaf spot resistance. Each F3 family contained 21 plants.

A collection of 289 diverse accessions previously described in detail by Lin et al. (2014) were used in a genome-wide association study (GWAS) for Sm. These included 145 S. lycopersicum accessions,112 S. lycopersicum var. cerasiforme accessions, 28 S. pimpinellifolium accessions and 4 other accessions (Table S1). Three plants of each accession were tested in two independent experiments. To validate molecular marker linked to the Sm resistance locus, the 289 accessions were also examined using a marker-trait analysis. All of the plant materials were grown in a greenhouse in Beijing, China. The day and night average temperatures of the greenhouse were controlled at 25 °C and 20 °C, respectively.

DNA extraction and marker development

Genomic DNA was extracted from young leaves of the first 441 F2 individuals and 289 accessions for the GWAS using the modified CTAB method (Fulton et al. 1995). For the other F2 individuals used in the recombinant screening, DNA was extracted according to the NaOH method, as described by Wang et al. (1993) with slight modifications. In brief, small leaf samples were ground for 2 min in 120 μl of 0.4 N NaOH using a tissue striker (TissueLyser II; QIAGEN, Germany), incubated at 80 °C for 8 min, and centrifuged for 1 min at 5000×g. Then, 10 μl supernatant was added to 200 μl of 0.1 M Tris (pH = 8) to form (crude) genomic DNA templates for PCR.

All of the molecular markers used in this study were Insertion and Deletion (InDel) markers which were developed based on whole genome re-sequencing data of 20 cultivated tomato lines with PCR product length 80–200 bp and InDel sizes 4–6 bp. During the initial mapping stage, the InDel markers on chromosome 11 were chosen for polymorphism screening and the development of a framework map. After flanking markers were identified, more InDel markers were developed for genotyping recombinants in this region by comparing genomic DNA sequences between ‘Heinz 1706’ and ‘9706’. The genome of ‘9706’ was sequenced with 5.1 × genome coverage in a previous study (Lin et al. 2014). Information on the primers for the identified markers is listed in Table S2.

Disease evaluation

The pathogenicity of two species of Stemphylium, 105 S. solani strains and 57 S. lycopersici strains isolated from different infected vegetables in different regions, was tested on susceptible tomato “zhongza 105” (Xie 2017). Based on the results, we choose two S. solani strains and two S. lycopersici strains in this study to test their pathogenicity on the resistant tomato line ‘9706’ and susceptible line ‘Heinz 1706’. Eventually, an isolate of S. solani, numbered FQ12031619, which was collected from Shandong Province, was used for inoculation. Pure isolates were grown on potato dextrose agar (PDA) plates for 7 days at 28 °C in the dark and incubated in liquid PD medium without agar in the dark at 28 °C at 160 rpm for 7 days before inoculation (Xie 2017). The whole-plant assay was performed as described by Chen (1996), with some slight modifications. In brief, plants with four fully expanded leaves were inoculated after being watered thoroughly using a sprayer to disperse the suspension (OD = 1.5–1.6) over the plants. The plants were covered with a plastic sheet for the first 2 days after inoculation to increase the humidity. Thereafter, plants were grown at 80–90% relative humidity and 24 ± 2 °C with natural light. After 7 days, the disease severity (DS) levels of plants were assessed using the severity of symptoms on a scale of 0–5: 0 = no symptoms; 1 = some older leaf area affected and showing small lesions; 2 = most leaf area affected and showing small lesions; 3 = most leaf area affected and defoliation of older leaves; 4 = most leaves wilted and/or with extensive stem damage; 5 = defoliation of the entire plant, resulting in death. For disease screening data in F2 plants, a χ2-test for goodness-of-fit was used to test for deviations between observed data and the expected segregation.

Molecular marker analysis and gene mapping

Initially, seven InDel polymorphic markers well-distributed on tomato chromosome 11 based on the physical position and genetic distance were selected to construct a genetic map using 441 F2 individuals. After the gene was located to a narrowed region, another 4730 F2 individuals derived from the same cross were screened for recombinants using the flanking markers. Subsequently, the recombinants were analyzed with nine new markers designed in the narrowed region and their F2:3 families were tested for gray leaf spot resistance. A linkage analysis of the Sm resistance locus was performed using JoinMap 4.0 (Van Ooijen 2006) with a minimum LOD threshold of 3.0.

The GWAS for Sm

We used the sequencing data of Lin et al. (2014) for the genome-wide association study of Sm gene. To eliminate sequencing and mapping errors, single nucleotide polymorphisms (SNPs) with minor allele frequencies > 5% and missing rates < 10% were used for the association analysis (Lin et al. 2014). The efficient mixed-model association expedited method was used to conduct the analysis (Kang et al. 2010).

Sequence annotation and gene expression analysis

The putative genes in the region encompassing the Sm locus were identified using the tomato gene model (ITAG release 3.1) in SGN (https://solgenomics.net/) and predicted with the online program FGENESH (http://linux1.softberry.com/), protein functions were predicted with BLASTP at the NCBI website (https://blast.ncbi.nlm.nih.gov/Blast.cgi) (Table S3).

Quantitative RT-PCR was performed to analyze the annotated genes expression levels in parents before and after inoculation. Total RNA was isolated from leaves of the two parental lines at 0, 2, 4 and 6 days after inoculation considering that the hyphae began infecting tomato via stomata at 2 days after inoculation (Xie 2017). Total RNAs for all of the samples were extracted using an EasyPure Plant RNA Kit (TransGen Biotech, Beijing, China). Reverse transcription was conducted using Transcript One-Step gDNA Removal and cDNA Synthesis Supermix (TransGen Biotech). The tomato actin gene (Solyc03g078400, F: GGGATGGAGAAGTTTGGTGGTGG and R: CTTCGACCAAGGGATGGTGTAGC) was used as an internal reference in the analyses. The qRT-PCR assay was conducted using the SYBR Green I Master Mix in a LightCycler 480 system (Roche, Switzerland). The qRT-PCRs were performed with three biological replications and three technical replicates. Gene expression levels were calculated on the basis of the 2−ΔΔCt method (Livak and Schmittgen 2001).

Virus-induced gene silencing (VIGS)

The VIGS experiment was performed with sprout vacuum-infiltration (SVI) method as described by Yan et al. (2012), an easy and effective method of agroinoculation for VIGS. Primers were designed based on predicted genes sequences to amplify 100–450 bp fragments from the cDNAs of 9706 using the Phusion high-fidelity DNA polymerase (Thermo Fisher Scientific, Waltham, MA, USA). PCR products were assembled into the tobacco rattle virus-based VIGS vector TRV2 and sequenced. The VIGS TRV1, TRV2 and its derivatives were transformed into Agrobacterium tumefaciens strain GV3101 by freeze–thaw method. A single colony was selected and inoculated in liquid culture of LB medium with antibiotics with shaking at 200 rpm at 28 °C for 12 h. The cells were re-suspended in infiltration medium (10 mmol/L MES, 10 mmol/L MgCl2, 200 μmol/L AS, pH = 5.6) till OD600 = 1.0 and left at room temperature for 3–4 h before agroinfiltration. Agrobacterium strains containing the TRV1 and TRV2 vector were mixed at a 1:1 ratio and vacuum-infiltrated into the germinating seeds of the Sm-resistant line 9706 using an infiltration system (a vacuum dryer connected to portable air compressor). In detail, the infiltration mixture and germinating seeds (about 21 seeds) were placed in each 10-mL beaker. And then, beakers were placed into vacuum dryer. Agrobacterium was infiltrated into sprouts using sprout vacuum-infiltration system set at relative vacuum degree of − 25 kPa. Vacuum pressure was maintained for about 1 min and then released rapidly to atmospheric pressure. Treated sprouts were sown in pots in a growth chamber (23 ± 2 °C, 60–80% RH). The tomato phytoene desaturase gene (PDS) was used as the reference gene to assess the VIGS system. The TRV2 empty vector (TRV2-ev) and water were used as negative controls. Forty days after sowing, the plants were inoculated with S. solani. The agroinfiltration experiments were performed again with cotyledon infiltration method to confirm the candidate gene for Sm.

Screening of BAC library

An existing bacterial artificial chromosome (BAC), constructed with Ty2-resistant line 12 g-60 which was identified as resistant to S. solani, was screened in this study. The BAC library consisted of 99,840 clones with an average insert size of 100 kb, corresponding to 10-fold coverage of the tomato genome. The library was stored in 260 384-well microtiter plates, and all 384 clones in one plate were mixed to form a BAC pool. The BAC pool DNA was isolated by alkaline lysis method and screened by PCR using 10 primer pairs within and flanking the Sm region (Table S4). Afterward, the single colony from the 384-well plates corresponding to the positive pool was identified using the same markers and DNA was isolated from the positive colony.

DNA sequencing and analysis

BAC ends were sequenced to confirm that they originated from the Sm region, and they were also analyzed by pulsed-field gel electrophoresis to estimate the insert size. Sequence of the single-positive BAC clone harboring the predicted ORF was obtained by constructing a library of subclones (1–3 kb). Both ends of the subclones were sequenced using the ABI 3730xl platform and then assembled (BGI, Beijing, China). Putative genes in the BAC sequence were predicted with the online program FGENESH, and protein functions were predicted with BLASTP at the NCBI website.

Validation of molecular marker linked to the Sm gene

To verify the associations between the genotypes and phenotypes, the 289 accessions were subject to genotyping with the developed marker Sm-InDel (F: 5′-CTACACTTTCTCGTTCCCAATG-3′, R: 5′-ATCGCCAAACCAATCAAATC-3′). The PCR was conducted with an initial denaturation step of 3 min at 94 °C, followed by 32 cycles of 30 s at 94 °C, 30 s at 55 °C and a 30 s extension at 72 °C. Final reactions were extended for 5 min at 72 °C. The PCR products were separated on 3% agarose gel, stained with ethidium bromide and photographed using a BIO-RAD Gel Documentation System (Bio-Rad Laboratories, Shanghai, China).

Results

Inheritance of gray leaf spot resistance in tomato

Previously, Sm has been described as an incompletely dominant gene for gray leaf spot resistance (Hendrix and Frazier 1949). To verify this conclusion, we performed the gray leaf spot disease assay on a segregating F2 population, 12 F1 plants and their parental lines. The resistant parent ‘9706’ exhibited a high level of gray leaf spot resistance to S. solani (P1, DS level = 0), while the other parent, ‘Heinz 1706’, showed lesions, wilted and died 7 days after inoculation (P2, DS level = 5) (Fig. 1a). The F1 plants showed small lesions on old leaves (DS level = 1 or 2). Of the 441 F2 individuals, 340 had DS levels from 0 to 3 and were considered as resistant. The remaining 101 plants were completely or extensively blighted (having DS levels between 4 and 5) and were regarded as susceptible (Fig. 1b, c). The resistant/susceptible segregation ratio among plants was 3:1 (χ2 = 1.03, P = 0.31), suggesting the presence of a single dominant resistance gene.

Fig. 1
figure 1

Symptoms of parental lines (a) and F2 population (b) after inoculation with S. solani. The number in each image indicates the symptom rating index (0–5). The frequency distribution of the disease scale in the F2 population (c). The numbers above the bars indicate the number of individuals for each scale. The DS level of resistant parent “9706” (P1) = 0, susceptible parent “Heinz 1706” (P2) = 5, and F1 = 1 or 2

Fine mapping of the Sm locus

Based on the phenotypic and genotypic data for the 441 F2 individuals with seven markers, the Sm genetic map was created by JoinMap4.0 (Fig. 2a). All seven markers showed a linear order between their genetic and physical locations. Subsequently, another 4730 F2 individuals were screened for recombinants using the flanking markers in batches. In brief, we first screened 2066 F2 individuals with the markers InDel335 and InDel240 and got 61 recombinants, analyzed these individuals with more markers between InDel335 and InDel240, narrowed down the mapping region. Then, we screened another F2 individuals with new flanking markers and narrowed down to smaller region, and then the third batch. Eventually, the 15 selected F2 recombinants were analyzed with nine new markers (Fig. 2b) and their F2:3 families were tested for gray leaf spot resistance. By combining the F2 genotypes and the phenotypes of their corresponding F2:3 families (Table 1), for example, the recombinant 7-81 was homozygous resistant for markers downstream of InDel343 and gray leaf spot resistance uniform in its F3 family; thus, it was deduced that the Sm gene is downstream of InDel343. Similarly, from the recombinant 7-77, it was concluded that the Sm gene is upstream of InDel-FT-32. Therefore, the Sm was narrowed to an interval between markers InDel343 (SL3.0ch11_9176276) and InDel-FT-32 (SL3.0ch11_9361705), a 185-kb region on the ‘Heinz 1706’ genome that carries five predicted genes according to the tomato genome annotation (Fig. 2c).

Fig. 2
figure 2

Genetic and physical maps of the Sm gene and candidate genes analysis. a A genetic map of the Sm locus. Positions of the markers are indicated in cM. The linkage map was generated using 441 F2 individuals using JionMap 4.0, b the physical map of the Sm locus. Positions of the markers are indicated in Mb. The Sm gene was located between markers InDel343 and InDel-FT-32 by recombinants and c the locations of the five candidate genes named as ORF-1, ORF-2, ORF-3, ORF-4 and ORF-5 at the Sm locus in the ‘Heinz 1706’ genome sequence

Table 1 Genotypes (F2) and phenotypes (F3) of recombinants

GWAS analysis for the Sm locus

To confirm the map-based cloning results, we analyzed 289 diverse accessions using a GWAS for which a genome resequencing data set was available (Lin et al. 2014). We identified a strong association signal, P = 1.12 × 10−20, SNP: SL2.50ch11_9317565 (Fig. 3), for tomato gray leaf spot resistance, which was coincident with the mapping analysis. Among the 289 accessions, most corresponded to the ordinary rule in which single-base “C” indicates susceptible and single-base “T” indicates resistance (Table S1), which represents a tight linkage between the SNP and the Sm gene.

Fig. 3
figure 3

Genome-wide association study of tomato gray leaf spot resistance. a Manhattan plot of the GWAS for tomato gray leaf spot resistance using the compressed mixed linear model (MLM). Negative log10-transformed P values from a genome-wide scan are plotted against position on each of 12 chromosomes. Black horizontal dashed line indicates the genome-wide significance threshold. b Quantile–quantile plot of compressed MLM for tomato gray leaf spot resistance

Gene identification and expression

Five putative genes which named ORF-1 to ORF-5 were identified in the 185-kb region that corresponded to the Sm locus by searching the tomato genome annotation database in SGN (Fig. 2c; Table 2). However, none of them encoded a typical disease resistance protein. We performed a qRT-PCR analysis of the annotated genes in the leaf tissues of the two parental lines at 0, 2, 4 and 6 days post-inoculation, and the results are presented in Fig. S1. Among these five annotated genes, Solyc11g018743 showed very low expression levels in both parents, which is consistent with the RNA sequencing coverage information in SGN. Among the other four genes, Solyc11g018740 and Solyc11g018747 showed lower expression levels in the susceptible genotype. In contrast, the expression of the Solyc11g018715 gene in the susceptible plants was greater than in the resistant plants. What is more, the Solyc11g018710 expression was not coincident on different time points; it is higher in susceptible plants than resistant before and 2 days after inoculation and antipodal at 4 days and 6 days post-inoculation. In conclusion, the putative genes showed small differences in expression levels in the two parents at some time points, but they were not coincident across all time points. Thus, further studies are required to ascertain a candidate gene for Sm.

Table 2 Annotated genes in the Sm region. List shows predicted genes in the 185-kb Sm region (ITAG3.1) in SGN (https://solgenomics.net/), and the primers used for the expression level analysis of the five genes

Gene prediction and functional analysis

The 185-kb genomic DNA region in the Heinz1706 draft genome delimited by InDel343 and InDel-FT-32 were manually annotated. Thirty-three genes named ORF1 to ORF33 were predicted in this region by the FGENESH program, which is more than the number of Heinz 1706 genome annotations. The details of these predicted genes are provided in supplemental Table S3.

To test the implication of the predicted genes on gray leaf spot resistance, specific VIGS constructs were designed to silence these predicted genes. Among the 33 predicted genes, 25 were amplified and assembled into TRV2 vector (Table S3). Subsequently, the infiltration mixture was vacuum-infiltrated into the germinating seeds of resistance line 9706 and the treated sprouts were sown. Photo-bleaching was observed on the true leaves of the PDS-silenced tomato seedling 20 days post-infection and sowing (Fig. 4a), indicating a successful silencing effect. Forty days later, plants with five fully expanded leaves were inoculated with S. solani. Those infiltrated with an empty TRV2 vector (Fig. 4b), water and most ORF-silenced plants showed no obvious symptoms except the ORF9-silenced plants. Among the 21 ORF9-silenced plants, eight showed leave lesions, wilt and even defoliation (Fig. 4c). The independent VIGS experiments in cotyledon infiltration method were performed, and the results were consistent. These results suggested that ORF9 was involved in Sm-mediated resistance and it seems to be good candidate of the Sm locus.

Fig. 4
figure 4

The phenotype of plants inoculated with S. solani in virus induced gene silencing experiment. a Silencing of tomato PDS causes visible phenotypic symptoms, b control plant infiltrated with empty TRV2 vector after inoculation with S. solani, c symptoms of ORF9-silenced tomato leaves after inoculation with S. solani

BAC library screening

According to the sequence and physical position of ORF9, we found unsequenced region with a large number of N in Heinz 1706 reference genome. In order to obtain the sequence covering the ORF9 region, an existing BAC was screened. Ultimately, a positive clone B80B15 was picked up and sequenced (Fig. S2). The whole BAC sequence of B80B15 carries an insert of 80,429 bp corresponding to an interval of 82,790 bp in the Heinz1706 reference genome starting from SL2.50ch11: 9198580 and ending at SL2.50ch11: 9281370. The ~ 80 kb BAC sequence was analyzed with online program FGENESH, and the result showed predicted gene harboring ORF9 consists of 14 exons of 5256 nucleotides, encoding a predicted polypeptide of 1751 amino acids. The predicted protein harbors Retrotrans_gag (Retrotransposon gag protein), zf-CCHC (Zinc knuckle), retropepsin_like (pepsin-like aspartate proteases), RVP_2 (Retroviral aspartyl protease), RT-LTR (reverse transcriptases), RVT_1(reverse transcriptase) and RNase_HI_RT_Ty3 (Ty3/Gypsy family of RNase HI in long-term repeat retroelements) (Fig. S3). The conserved domain and structure characteristics demonstrated that the Sm candidate gene was not the typical type R gene.

Confirmation of linkage between marker Sm-InDel and the Sm gene

A 18-bp deletion was found downstream 15 kb from the Sm candidate gene when aligning the BAC sequence to Heinz 1706 reference genome and a co-dominant marker Sm-InDel was developed which produced a 122-bp fragment for resistant and a 140-bp fragment for susceptible alleles (Fig. 5). It was also tested on the 289 diverse accessions. The Sm-InDel marker predicted the gray leaf spot resistance of the germplasm, and consistent results between phenotype and genotype were obtained for 279 of the 289 accessions (Table S1), indicating a 96.5% accuracy rate.

Fig. 5
figure 5

PCR fragments amplified using marker Sm-InDel from tomato gray leaf spot resistant and susceptible genotypes resolved in a 3% agarose gel. Lane 1: Heinz 1706, Lane 2: 9706, Lane 3: F1 (Heinz 1706 × 9706), Lane 4: LA2823, Lane 5: LA1589, Lane 6: Moneymaker, Lane 7: Ailsa Craig, Lane 8: M82, Lane 9: E-6203, Lane 10: Micro-Tom, Lane 11: 12 g-60, Lane 12: F1 (Heinz 1706 × LA1589). M, 50-bp DNA ladder; R, resistant; S, susceptible; H: heterozygote

Discussion

Sequencing technologies and applications of bioinformatics accelerate the genetic studies

Next-generation sequencing technologies are faster and more efficient and thus make it easier to establish associations between phenotypic traits and molecular markers or candidate genes, which are necessary for MAS in molecular breeding. With the rapid development of next-generation sequencing technologies and application of bioinformatics, many new strategies have been proposed to identify major quantitative trait loci or genes. GWAS has emerged a powerful approach for identifying genes underlying complex diseases at an unprecedented rate. Now, it has been applied to the dissection of complex traits in the crop plants rice (Huang et al. 2010, 2012; Wang et al. 2014), soybean (Zhou et al. 2015) and maize (Li et al. 2013; Tian et al. 2011). In this study, GWAS was carried out to identify the locus associated with gray leaf spot disease resistance in 289 tomato accessions, and a significantly associated locus was identified that was consistent with the map-based cloning analysis.

Gene prediction and functional analysis

The tomato genome is estimated to encode ~ 35,960 genes, majority of which are populated at distal euchromatic regions of the chromosomes (Khush and Rick 1968; Peterson et al. 1998; Van der Hoeven et al. 2002) with an approximate gene density of 6.7 kb/gene (Wang et al. 2005). Based on the ‘Heinz 1706’ genome annotations, there were five annotated genes in the defined region, which is less than expected. Moreover, these annotated genes expression levels were not significantly different in the two parents across all of the time points. By analyzing the target region in the ‘Heinz 1706’ reference genome, we found several regions that were not sequenced in the corresponding 185-kb region, which might be one reason for the scarcity of annotated genes. Thus, we predicted genes using FGENESH within our mapping region, and 33 putative open reading frames were found in the corresponding 185-kb region, which included the five annotated genes. Subsequently, the virus-induced gene silencing approach was performed for the 33 putative open reading frames to determine the candidate genes. Due to such reasons as low level of putative gene expression, mistakes on gene prediction, unsatisfactory primer design, eventually, 25 VIGS vectors were constructed and used for gene functional analysis. In this study, we performed the VIGS experiments using SVI method, which was a simple, effective and low-cost agroinoculation method to elicit VIGS in tomato. Ultimately, the ORF9-silenced plants showed susceptible to S. solani. This result indicated ORF9 was involved in Sm-mediated resistance, and it seems to be good candidate of the Sm locus.

VIGS is a transcript suppression technique used for gene function analysis. This technique offers a tremendous advantage for knocking down the target genes due to the needless for transformation in plant (Baulcombe 1999; Dinesh-Kumar et al. 2003). Agroinoculation is a widely used method to deliver VIGS vectors into plants for RNA silencing (Liu et al. 2002; Gossele et al. 2002). However, the current agroinfiltration methods have their limitations. Among the existing methods, leaf infiltration method is widely used to introduce the infiltration solution into seedling leaves using a needle-less syringe, vacuum machine, or high-pressure spray gun, but they are laborious for large-scale VIGS screens. However, the barrier has been overcome using the SVI method which proved to be applicable for high-throughput VIGS screens. With the improvement in the tomato genome sequence and annotation, a large number of candidate genes of interest could be examined at the levels of transcriptome in present. Therefore, SVI will be a better agroinoculation method to implement large-scale VIGS experiments with minimal manpower in a relatively short time.

Candidate gene cloning through BAC sequencing

Based on the analysis of the ORF9 sequence and its physical position in Heinz1706 reference genome, we found two GAPs before and after this target region and we tried to sequence the GAPs using the PCR products. Unfortunately, the full length of candidate gene could not be amplified from the Sm-containing tomato lines with the primers designed according to the Heinz1706 reference genome. In order to obtain the sequence covering the ORF9 region, a BAC library was screened and the positive clone was sequenced. The result demonstrated that the Sm candidate gene does not belong to the typical type R gene which contains domains such as NBS-LRR. In plant, there also exists the particular, such as Ty1, which was a completely new class gene code for an RNA-dependent RNA polymerase and increased cytosine methylation of viral genomes to confer resistance (Verlaan et al. 2013; Butterbach et al. 2014). The Mlo gene in barley, whose resistance is caused by rapid formation of large cell wall appositions at the encounter sites, prevents penetration by the fungus (Jørgensen 1992). Anyway, genetic transformation is required to identify the candidate gene in the further study.

Applicability of markers in MAS

Marker-assisted selection (MAS) is a powerful tool in plant breeding. The development of molecular markers tightly linked with the target gene is a prerequisite for efficient MAS. Until now, there have been few reports of molecular research concerning tomato resistance to gray leaf spot. An incipient study mapped Sm on chromosome 11 between RFLP markers TG110 and  T10, due to the distance to the target gene and not PCR-based, these two markers are not breeder friendly. In 2009, Ji and Scott reported a recessive CAPS marker CT55, which cannot distinguish between heterozygous and homozygous susceptible plants. Therefore, its use may be partly restricted in practice. In a recent research, Sm gene was mapped and screened based on bulked segregant analysis in combination with genome resequencing, accompanied with a SCAR marker D5 designed (Yang et al. 2017). However, this marker hardly distinguishes resistant and susceptible tomato germplasm in our verification experiment, which indicated that the marker showed poor applicability in tomato breeding. In the present study, a co-dominant marker Sm-InDel was developed that produced a 122-bp or a 140-bp fragment for resistant or susceptible alleles, respectively. It can detect gray leaf spot resistance in germplasm with 96.5% accuracy and easily resolved in a standard agarose gel. The inconsistency between phenotypes and genotypes in 289 diverse accessions might result from the following: (1) a deficiency in the inoculation tests; (2) crossing-over occurring. This result indicates that the co-segregating marker Sm-InDel is very close to the gene Sm and could be used in MAS breeding for gray leaf spot resistance.

Author contribution statement

XMS performed the experiments, analyzed the data, and wrote the manuscript; GTZ analyzed the data; ZJH, XXW and YMG revised the manuscript; BJL provided pathogen and inoculation method; YCD, WCY and JCG conceived and designed the experiments, and revised the manuscript.