Introduction

The Phytophthora genus contains some of the most destructive plant pathogens that attack a wide range of economically important plant species worldwide. Black shank, caused by the soil-borne pathogen Phytophthora nicotianae, is the most important disease affecting tobacco (Nicotiana tabacum L.) production in many countries. In contrast to chemical control strategies, genetic resistance is the most cost-effective protective method to reduce economic losses caused by this pathogen.

Monogenic-specific resistance to race 0 of P. nicotianae, which is conferred by the Php and Phl genes, has been introgressed into N. tabacum from Nicotiana plumbaginifolia (Apple 1962; Chaplin 1962) and Nicotiana longiflora (Valleau et al. 1960), respectively. However, the widespread planting of cultivars possessing the Php or Phl gene has resulted in race shifts from race 0 to race 1. Race 1 of P. nicotianae is able to overcome resistance, which is becoming prevalent with continuous planting (Sullivan et al. 2005). Additionally, race 2 of P. nicotianae has been reported in South Africa, and race 3 is virulent on varieties containing the Phl gene (van Jaarsveld et al. 2002; Gallup and Shew 2010; McIntyre and Taylor 1978). High levels of resistance to both race 0 and race 1 of P. nicotianae are associated with an introgressed Nicotiana rustica genomic region, designated as Wz (Drake et al. 2015; Drake and Lewis 2013). Subsequently, Wz-adapted isolates of P. nicotianae have also been found (McCorkle et al. 2018). Because of the low durability of monogenic resistance mechanisms, a rotation of cultivars with single-gene resistance and cultivars with a high level of polygenic resistance would reduce disease incidence and minimize pathogen race shifts (Sullivan et al. 2005).

Two primary polygenic black shank-resistance sources are from the cigar tobacco ‘Florida 301’ (Tisdale 1931) and ‘Beinhart-1000’ (Heggestad and Lautz 1957; Chaplin 1966; Wills 1971; Nielsen 1992). The ‘Beinhart-1000’ cultivar expresses a high level of partial resistance to both race 0 and race 1 of P. nicotianae (Chaplin 1966). Vontimitta and Lewis (2012a and b) detected two major quantitative trait loci (QTLs) associated with resistance using a doubled haploid mapping population generated from a ‘Beinhart 1000’ × ‘Hicks’ cross under field and growth chamber environmental conditions (Vontimitta and Lewis 2012a; Vontimitta and Lewis 2012b). The largest QTL on linkage group (LG) 8 and the second QTL on LG 4 explained 25.4-54.7% and 16.8-20.4% of the phenotypic variance, respectively. Because of the associations between resistance and undesirable cigar-type characteristics, the resistance of this line is less utilized in burley and flue-cured tobacco cultivars (Nielsen 1992).

Based on pedigree information, most varieties of flue-cured tobacco are believed to have derived polygenic black shank resistance from cigar tobacco ‘Florida 301’. ‘Florida 301’ exhibits partial resistance in nature and is non-race-specific (Tisdale 1931). However, this type of resistance appears to be negatively correlated with yield in flue-cured tobacco. DNA markers closely associated with major genes affecting resistance might increase the possibility of transferring this resistance to commercial cultivars without the negative influence on yield. Using microsatellite markers, Xiao et al. (2013) identified 11 QTLs associated with a high level of black shank resistance in the cigar tobacco ‘Florida 301’ (Xiao et al. 2013). Among those, the largest effect QTL explained 16.9-18.6% of the phenotypic variation. The QTL also had the greatest phenotypic effects in a ‘Beinhart 1000’ × ‘Hicks’ doubled haploid (DH) population (Vontimitta and Lewis 2012b) and a ‘K346’ × ‘TI1068’ recombinant inbred line (RIL) population (Drake-Stowe et al. 2017). Recently, the QTL has been localized to within a genetic interval of approximately 3 cM using SNP markers and near-isogenic lines (NILs) (Ma et al. 2019).

A high-density simple sequence repeat (SSR)-based linkage map was generated containing 2318 microsatellite markers in tobacco (Bindler et al. 2011), and the vast majority of QTL mapping studies in N. tabacum have used these microsatellite markers (Drake-Stowe et al. 2017; Xiao et al. 2013; Vontimitta and Lewis 2012a; Vontimitta and Lewis 2012b). Owing to the low polymorphism rates revealed among tobacco lines, especial flue-cured lines, the efficiency and accuracy of QTL mapping using microsatellite markers were limited. The rapid development of next-generation sequencing technologies has been used to discover single-nucleotide polymorphisms (SNPs) and complete tobacco genome sequencing including four allotetraploid Nicotiana (N. benthamiana and three N. tabacum cultivars, TN90, K326, and Basma xanthi) and three diploid Nicotiana (N. otophora, N. sylvestris, and N. tomentosiformis) (Bombarely et al. 2012; Edwards et al. 2017; Sierro et al. 2013; Sierro et al. 2014). The first SNP linkage map for N. tabacum was recently published based on an F2 population using specific length amplified fragment sequencing (SLAF-seq), and two major QTLs were identified for curing trait (Gong et al. 2016).

Although a number of QTLs have been identified for black shank resistance, few QTLs have been finely mapped across genetic backgrounds and/or environments. In the current study, we aimed to improve the efficiency and accuracy of black shank resistance-related QTL mapping in tobacco using a high-density genetic linkage map, comprising 7734 SNP markers, through restriction site-associated DNA tag sequencing (RAD-seq) technology on 177 F7:8-9 RILs. The RIL population was generated from a cross between highly resistant flue-cured tobacco ‘Yunyan 85’ and susceptible flue-cured tobacco ‘Dabaijin 599’. We finely mapped QTLs for black shank resistance under four field and one greenhouse conditions.

Materials and methods

Plant materials and DNA extraction

An RIL mapping population containing 177 F7:8-9 progenies was developed from a cross between two flue-cured tobacco varieties with distinctly different resistance levels to P. nicotianae. ‘Yunyan 85’ and ‘Dabaijin 599’ are considered to be the highly resistant and susceptible parents, respectively. ‘Yunyan 85’ was developed from the cross ‘Yunyan 2’ × ‘K326’, and ‘Dabaijin 599’ is a Chinese tobacco landrace (Tan et al. 1997). Young leaves were collected from parents and individual plants of the RIL population and frozen in liquid nitrogen for DNA extraction. Total genomic DNA was extracted using the cetyl trimethylammonium bromide method. DNA qualities were evaluated using NanoDrop 2000 spectrophotometer (Thermo Scientific, Milan, Italy) and electrophoresis in agarose gels.

Disease evaluation

Disease evaluation of all 177 RILs, and their parents was conducted in four fields and one greenhouse.

Black shank disease nurseries were going at the field station in Shandong, Zhucheng (N 35° 59′, E 119° 24′) in 2017 (ZC2017), 2018 (ZC2018), and 2019 (ZC2019), and in Shandong, Jimo (N 36° 38′, E 120° 45′) in 2018 (JM2018). Plants were transplant in single 12-plant row plots following a randomized complete block design. Two replications were completed in ZC2017 and JM2018, and with three replications in ZC2019 and with four replications in ZC2018. The susceptible cultivar ‘Dabaijin 599’ was planted in every third plot to check the uniformity of disease pressure. The inter-row spacing was 1.2 m and the within-row plant spacing was 0.5 m at both locations. Starting 45 days after transplanting, the severity of plants killed by black shank were evaluated at 15-day intervals using an empirical 6-point scale (GB/T23222-2008, China), where 0 = no symptoms; 1 = up to 33% stem lesions or chlorotic leaves; 3 = up to 50% stem lesions or chlorotic leaves; and 5 = up to 90% stem lesions or 66% chlorotic leaves; 7 = up to 100% stem lesions or chlorotic leaves; 9 = death. Disease index (DI) scores were calculated using the following formula: DI = 100 × ∑[(disease evaluation scale score × number of plants with each scale score)]/(total number of plants observed × the highest disease evaluation scale score)].

The RIL population and parents were also evaluated for black shank resistance in the greenhouse in Shandong, Jimo, in 2019 (GH2019) according to Vontimitta et al. (Vontimitta and Lewis 2012a). Isolates of P. nicotianae race 0 were provided by Tobacco Research Institute of Chinese Academy of Agricultural Sciences Pathology Department. The experimental design was a randomized complete block design with three replications. Experimental units consisted of 8 plants contained within an 8-compartment segment of a nursery tray. Approximately 50 days after germination, plants were inoculated by wounding the stem base with a knife and inserting four P. nicotianae-infested oat grains into the soil. The temperature was maintained at 30 °C and soil moisture was uniformly maintained by subirrigation. Approximately 7 days after inoculation, the disease severity of plants was recorded at 7-day intervals. Five data from four fields and one greenhouse environments were used for analysis of variance (ANOVA) using SPSS 23.0 software.

RAD library preparation and sequencing

The RAD library was constructed following the protocol described by Baird et al. (2008), with minor modifications. Briefly, genomic DNA from the parents and each of the 177 progenies were digested for 15 min at 37 °C in a 50-μL reaction containing 20 units (U) of EcoRI (New England Biolabs, USA). A modified Illumina P1 adapter (USA) was added to the samples. Sample were pooled and randomly sheared to an average size of 500 bp. DNA fragments of 300-500 bp were isolated using the QIAquick Gel Extraction Kit (Qiagen, Germany). The ends of the DNA were repaired using the Quick Blunting kit Enzyme Mix (New England Biolabs). Then, a modified Solexa P2 adapter was ligated to the DNA fragments (Illumina). Finally, purified and eluted DNA products were PCR-amplified using the Phusion Master Mix (New England Biolabs). The prepared DNA libraries were sequenced using Illumina HiSeq X Ten instrumentation.

SNP identification and genotyping

To ensure high-quality genotype calling, raw RAD sequence reads of low quality (quality score ≤ Q20) and without unique barcodes were filtered. After the barcodes and the terminal bases were trimmed, the clean reads from each individual were mapped onto the tobacco reference genome (Edwards et al. 2017) using BWA software (Li and Durbin 2009). Input data was prepared by SAMtools, and SNP calling was performed by GATK (Li et al. 2009; DePristo et al. 2011). SNP loci with more than three continuous SNPs in 300 bp were filtered out first. Because one SNP could contain at most four genotypes for diploid species, SNP loci with more than four alleles were discarded. All SNP loci were genotyped for consistency at the parental and offspring SNP loci. Before genetic map construction, all of the markers were filtered using the following criteria: First, sequence depths were > 10-fold in the parents. Second, markers with more than 30% missing data in the RIL population were filtered. Third, the chi-squared test was performed to examine the segregation distortion. Markers with significant segregation distortions (P < 0.01) were initially excluded from the map construction and were then added as accessory markers. Finally, markers having parental homozygous genotype (aa × bb) were used to construct the genetic map.

Linkage map construction and analysis

The HighMap strategy was utilized to order the SNP markers and correct genotyping errors (Liu et al. 2014). The enhanced algorithm of Gibbs sampling, spatial sampling, and simulated annealing was used to order markers (Van Ooijen 2011; Jansen et al. 2001). The SMOOTH algorithm was used to correct genotyping errors (van Os et al. 2005). The missing genotypes in progenies were imputed by the k-nearest neighbor algorithm (Huang et al. 2011). The skewed markers were added to the linkage map using a multipoint method of maximum likelihood. The map distances were calculated using the Kosambi mapping function (Kosambi 1944). Heat and haplotype maps were constructed to evaluate the map quality (West et al. 2006). All of the sequences of the SNP markers in the linkage map were aligned back to the physical sequences of the reference genome and two ancestral parents, N. sylvestris and N. tomentosiformis, using a Basic Local Alignment Search Tool.

QTL detection and candidate gene identification

QTL IciMapping 4.1 (Meng et al. 2015) was used to identify QTLs by the inclusive composite interval mapping method (Li et al. 2007). The threshold of the log-likelihood (LOD) scores was more than 2 and determined using 1000 permutations for P = 0.05. Additive and phenotypic contribution rates were estimated. When the positions of the QTLs overlapped within a range of 10 cM, they were interpreted to be the same QTL. The major QTLs, detected in five environments, were selected to identify the candidate genes. Based on the positions of the flanking markers, all of the genes within the confidence interval were identified as candidate genes.

Results

Genetic analysis of resistance to P. nicotianae

‘Yunyan 85’ exhibited a very high level of resistance in five environments, and with disease indices (DIs) less than 20. ‘Dabaijin 599’ exhibited a high level of susceptible reactions, with DIs of above 90. The RIL population of 177 lines displayed a wide range of variation, with DIs ranging from 0 to 100 and a mean DI value of 55 for the four field environments (Fig. 1) and one greenhouse environments (Fig. 2). Few lines exhibited DIs that were greater than those of ‘Dabaijin 599’ and lower than those of ‘Yunyan 85’. The RIL population displayed significantly skewed distributions in the resistance direction, indicating that polygenes, including major genes, control resistance to black shank. The DI data for RILs in five environments were subjected to an ANOVA, which detected highly significant differences among lines (P < 0.0001) and significant environment × line interactions (P < 0.0001) (Table 1).

Fig. 1
figure 1

Frequency distribution for the RIL population exhibiting disease indices averaged over four field environments. The disease indices for parental lines are indicated by arrows

Fig. 2
figure 2

Frequency distribution for the RIL population exhibiting disease indices in greenhouse environment. The disease indices for parental lines are indicated by arrows

Table 1 ANOVA analysis of variance for disease indices for RIL population in five environments

SNP genotyping

A total of 179 RAD-seq libraries, two parental and 177 progenies, were constructed and sequenced in Illumina HiSeq X Ten. After removing low quality sequences, 4,618,576,318 high-quality paired-end reads were obtained, with each read being ~ 151 bp in length, representing ~ 1394.81 GB of sequences. The clean reads were deposited at NCBI under the bioproject. Of the clean reads, 34,944,329 were from ‘Yunyan 85’ and 39,685,656 were from ‘Dabaijin 599’, with the remaining being from the 177 RILs (Fig. 3). Read numbers for the individuals ranged from 10,648,452 to 68,786,148, with an average of 25,672,013. The filtered reads were then mapped to the reference tobacco genome. The average depths of sequencing were 7.95-fold in ‘Yunyan 85’, 8.40-fold in ‘Dabaijin 599’, and 5.94-fold in the progenies (Fig. 3).

Fig. 3
figure 3

Sequencing data and depth statistics of parental lines and each individual of the RIL population. The x-axis represents the parental lines and each individual of the RIL population, and the y-axis represents the total numbers of bases (Gb; black) and average sequencing depth (X; gray)

In total, 2,404,474 SNPs were detected using GATK software. For the SNP loci, two to four alleles were identified as polymorphic and considered potential markers. Across the whole RIL population, 460,210 polymorphic markers were successfully identified. The SNP markers were classified into four genotypes: aa × bb, hk × hk, lm × ll, and nn × np, where aa × bb indicates that both the parents were homozygous, with the genotype of one parent being aa and the other being bb; lm × ll and nn × np indicate that one parent was homozygous and the other was heterozygosis, and the hk × hk indicates that both the parents were heterozygosis. The numbers of SNPs classified into these four marker types were 150,489; 120,264; 87,089; and 102,368, respectively. Only 150,489 SNPs with parental homozygous genotype (aa × bb) were used for further analyses. The markers with low sequence depths, missing in the parents or RIL population, or having segregation distortion, were filtered. The remaining 7878 markers were used to construct the genetic map (Additional file 1: Table S1).

Construction of a high-density genetic map

In total, 7734 SNP markers were mapped to 24 LGs (Fig. 4, Additional file 2: Table S2). The number of LGs corresponded to the number of chromosomes in tobacco (x = 24). The genetic map contained 2800 recombination loci with 63.79% co-segregation markers. The map spanned a total distance of 2689.06 cM, with an average marker interval of 0.35 cM (Table 2). LG3 was the largest group, containing 1230 markers that spanned 190.83 cM, with an average marker interval of 0.16 cM. The shortest linkage group was LG4, which harbored 184 markers and had a genetic length of 53.82 cM with an average inter-marker distance of 0.29 cM. The degree of linkage between markers was reflected by “Gaps ≤ 5” indicating the percentage of gaps in which the distance between adjacent markers was smaller than 5 cM. The value ranged from 95.28 to 100.00%, with an average value of 98.61%. The largest gap on this map was 16.74 cM located on LG12. Tobacco is an allotetraploid (2n = 4x = 48) and is most likely the result of a tetraploidization inter-specific hybrid between N. sylvestris (S-genome) and N. tomentosiformis (T-genome). According to the alignment with the two diploid ancestral genomes of common tobacco (Sierro et al. 2013), 61.09% markers were mapped to the S-genome and 35.75% to the T-genome. Several LGs contained both S- and T-genome-specific markers. The haplotype and heat maps reflect the map quality (Additional files 3 and 4: Figures S1 and S2, respectively).

Fig. 4
figure 4

High-density genetic map of tobacco constructed by SNP markers. The markers position and name in Table S2. Marker names of the QTL identified on each chromosome are given on the right. The vertical colored bars represent QTLs for four field and greenhouse under the disease progress. Length of vertical colored bars indicates confidence interval calculated with a support interval of 1 LOD score

Table 2 Distribution of mapped markers on the 24 linkage groups in tobacco

QTL analysis

Ten QTLs with additive effects were significantly associated with the DI of black shank of tobacco, having LOD scores > 2.0 (Table 3) over five different environments. These QTLs were mapped to 10 link groups. Among those QTL, two stable QTL named qBS7 and qBS14 were repeatedly detected to resistance black shank under all five environments. The QTL qBS7 located in LG7 was a major QTL and explained the greatest observed phenotypic variations ranging from 16.48 to 62.20%. High LOD score values of 28.33, 13.75, 43.40, 7.54, and 38.72, respectively, were observed for qBS7 in the five environments. QTL qBS7 in ZC2017, ZC2018, and ZC2019 was flanked by markers M376697 and M336593 at ~ 1.5 cM, by markers M403163 and M376697 at 4 cM in GH2019, and by M336593 and M336601 at 2 cM in JM2018. QTL qBS14 was identified in LG14, explaining 3.94-11.29% of the phenotypic variation in the five different environments. QTL qBS14 was flanked by two markers, M356486 and M136059, at 0.75 cM. In addition, the qBS16 was a minor QTL that explained 2.95-5.31% of phenotypic variation in four environments, with 2.70-3.39 LOD score of values. Minor QTL qBS1, qBS21, and qBS24 can be found in three environments. QTL qBS1 was detected not only in the field including ZC2017 and ZC2019 but also in the greenhouse in GH2019, explaining 2.46%, 2.33%, and 3.33% of the phenotypic variance while qBS21 and qBS24 were just found in field surroundings. In additional, minor QTL qBS6 with lower PVE and LOD value were found in environments of GH2019 and ZC2019. The remaining three minor QTL qBS2, qBS3, and qBS18 were just detected in one environment. ‘Yunyan 85’ contributed the favorable allele at each of the five genomic regions while the favorable alleles at five of the 10 QTL positions were from the Dabaijin 599 parent.

Table 3 QTLs of tobacco black shank resistance detected in the mapping population

Identification of the candidate genes

Four markers (marker376697, marker336593, marker336601, and marker403163) and two markers (marker356486 and marker136059) were located in QTLs qBS7 and qBS14, respectively. Unfortunately, these six markers were not mapped to the physical map of the tobacco genome but were merely located on five unanchored scaffolds. By overlap extension, combined with four genome sequences (‘TN90’, ‘K326’, N. sylvestris, and N. tomentosiformis) (Edwards et al. 2017; Sierro et al. 2013; Sierro et al. 2014), we constructed 3.9 MB and 2.8 MB of scaffold sequences near the five markers for qBS7 and qBS14, respectively (Table S3). According to the gene annotation information for ‘K326’, 72 and 66 candidate genes were found within the confidence intervals of qBS7 and qBS14 respectively (Additional file 5: Table S3). Among these 138 predicted genes, fourteen candidate genes encoded nucleotide-binding site-leucine-rich repeat (NBS-LRR) proteins associated with plant defense responses to pathogens. Each of the two QTLs has one candidate gene that encodes a protein with high homology to downy mildew disease resistance protein recognition of Peronospora parasitica 13 (RPP13) in Arabidopsis thaliana (Bittner-Eddy et al. 2000). In total, eleven candidate genes encode proteins with high homology to the late blight resistance protein R1 in potato (Ballvora et al. 2002). Five candidate genes encode glycine-rich protein (GRP) and two candidate genes encode cytochrome P450 protein (CYP). Besides, Nitab4.5_0002781g0070.1 encodes serine/threonine-protein kinase (STK) protein with LRR receptor.

Discussion

A high-density linkage map is an important tool for genomic analyses, QTL mapping, and marker-assisted breeding. The first high-density linkage map in tobacco was constructed using 2318 microsatellite markers and covered 3270 cM (Bindler et al. 2011) based on an F2 mapping population derived from the intervarietal cross of ‘Hicks Broadleaf’ × ‘Red Russian’. However, few SSR markers can be used for QTL mapping for resistance to black shank in the following population. In total, 156, 373, 204, and 258 SSR markers were used in ‘Beinhart 1000-1’ × ‘Xiaohuangjin 1025’ (Zhang et al. 2018), ‘Florida 301’ × ‘Hicks’ (Xiao et al. 2013), ‘Beinhart 1000’ × ‘Hicks’ (Vontimitta and Lewis 2012a; Vontimitta and Lewis 2012b), and ‘K346’ × ‘TI 1068’ (Drake-Stowe et al. 2017), respectively. These maps have low density levels and even the gaps divide the chromosomes into two or more LGs. The insufficient SSR marker numbers do not allow the construction of SSR marker-based maps to finely map QTLs.

With the development of high-throughput sequencing technologies, SNP markers have been applied to genetic map construction, improving the coverage and resolution of genetic maps even from populations of flue-cured tobacco crosses. Gong et al. (2016) used SNP markers and SLAF-seq to construct a map that harbored 4215 markers, with a total distance of 2662.43 cM and an average inter-marker distance of 0.60 cM (Gong et al. 2016). Here, we constructed a high-density genetic map using the SNP markers developed through a RAD-seq method. In total, 7734 SNPs were assigned to 24 LGs corresponding to the 24 chromosome pairs of the tobacco genome. The map contained a greater number of markers and was more saturated than the previous two maps, with an average inter-marker distance of 0.35 cM. The total covered distance was approximately the same as that of the SLAF-based SNP map and slightly less than that of the SSR map (Bindler et al. 2011). Nevertheless, the saturated SNP genetic map had 63.79% co-segregated markers and represented 2800 recombinant loci. Several gaps > 10 cM can be seen on the genetic map (Fig. 4). Because of a large number of repetitive or homologous sequences in allotetraploid tobacco and low sequencing depths, many true SNPs are being eliminated, which might cause the gaps.

The inheritance of the high level of partial resistance to P. nicotianae in ‘Florida 301’ and ‘Beinhart 1000’ is complex and controlled by polygene (Vontimitta and Lewis 2012b; Xiao et al. 2013). The current QTL study suggests that the resistance of ‘Yunyan 85’ is controlled by one major and several minor gene loci. The favorable alleles contributed by ‘Yunyan 85’ were at five of the 10 QTL. Meanwhile, the black shank-susceptible cultivar ‘Dabaijing 599’ may contribute five minor resistance QTLs. ‘Yunyan 85’ × ‘Dabaijin 599’ RIL exhibited significantly greater resistance of black shank than ‘Yunyan 85’.

Using the high-density SNP genetic map, the accuracy of QTL mapping for resistance to black shank in tobacco improved. Our fine mapping results in five distinct environments revealed that three stable QTLs, qBS7, qBS14, and qBS16, were within a tight interval. QTL qBS7, a stable major QTL, had the greatest effects and controlled the largest percentage of the phenotypic variation for black shank resistance (16.48-62.20%) in ‘Yunyan 85’. This QTL also had the greatest effect and controlled the largest percentages of the phenotypic variation for black shank resistance in ‘Florida 301’ (16.9%) (Xiao et al. 2013), ‘K346’ (38.0%) (Drake-Stowe et al. 2017), and ‘Beinhart 1000’ (25.4%) (Vontimitta and Lewis 2012a; Vontimitta and Lewis 2012b). In the present study, qBS7 was confirmed and the confidence interval was narrowed to 1-4 cM, which is a significant improvement compared with previous reports (Vontimitta and Lewis 2012a; Vontimitta and Lewis 2012b; Xiao et al. 2013; Drake-Stowe et al. 2017). Recently, the QTL was localized to within a genetic interval of approximately 3.51 cM between marker Nt1AF8640 and marker Nt1AD1494 using sub-NILs (Ma et al. 2019). In our result, the interval of M336601-M403163 was the same as Nt1AF8640-Nt1AD1494 based upon mapping the marker sequences with physical sequence of K326 reference genome. In the interval, our M376697 and four reported markers Nt1AF0086, Nt1AG1690, NtAB1658, and Nt1AA1707 were most highly associated with resistance in nearby position. These markers can be useful for marker-assisted selection in breeding programs at the qBS7 locus. In addition, the QTL was identified as being associated with resistance to bacterial wilt (Drake-Stowe et al. 2017). ‘Yunyan 85’ exhibited a low-to-moderate level of resistance to bacterial wilt in the field. It is valuable for fine mapping and cloning the gene(s) underlying the qBS7 QTL. The second favorable allele contributed by ‘Yunyan 85’ was in QTL qBS1 in three environments with smaller effect. The other three minor QTLs with positive additive effects were only identified in one environment.

The second-effect QTL qBS14 was also found to have the same location as the third QTL identified from ‘Florida 301’ and ‘K346’ based on SSR primer sequences and SNP sequence mapping to the ‘K326’ physical map (Xiao et al. 2013; Drake-Stowe et al. 2017). The QTL explained 6.82% of the average phenotypic variation of black shank resistance in the five environmental conditions in ‘Yunyan 85’ × ‘Dabaijin 599’ with 0.75-cM confidence interval, while it controlled 6.0% and 4.4% of the variation in a ‘Florida 301’ × ‘Hicks’ and ‘K346’ × ‘TI1068’ RIL mapping populations with more than 6-cM confidence interval, respectively. The QTL had a minor effect on black shank resistance and no association with resistance to bacterial wilt. The favorable allele was contributed by the black shank-susceptible parent ‘Dabaijing 599’. The pedigree information suggests that the flue-cured variety ‘Yunyan 85’ possesses polygenic black shank resistance derived from cigar tobacco ‘Florida 301’. It is possible that ‘Yunyan 85’ lost many minor QTLs from ‘Florida 301’, such as qBS14. The other four QTLs qBS1, qBS6, qBS21, and qBS24 with negative additive effects have lower LOD values and PVEs. Nevertheless, these QTLs were identified in multiple environments. Markers associated with these QTLs could be useful for transferring this favorable allele to increase the resistance of ‘Yunyan 85’ or other varieties.

In total, 138 candidate genes for black shank resistance were predicted in the target area. Of these, 14 were typical NBS-LRR resistance (R) genes and several were defense-related genes. R genes confer very high levels of narrow spectrum race-specific resistance to plant disease, but major R genes are lack of durability in some systems (McDonald and Linde 2002). Few R genes to date confer partial pathogen resistance. However, Jiang et al. reveal for the first time that NB-LRR gene R8 conferred broad spectrum and durable field resistance against P. infestans (Jiang et al. 2018). Apart from the typical NBS-LRR resistance protein, others such as serine/threonine-protein kinase (STK), glycine-rich protein (GRP), and cytochrome P450 protein (CYP) also involve in defense-responsive or defense-related against pathogen attack in crops. Further fine mapping will be required to reduce the number of candidate genes. This will be necessary to develop additional markers and to use larger populations to construct a fine physical map. The candidate gene expression data based on RNA-seq could also be useful for identifying differentially expressed gene. Moreover, the CRISPR/Cas9 gene editing technology can be used to inactivate candidate genes to validate functions.

Conclusions

This research reported a high-density SNP genetic linkage map of flue-cured tobacco using a RIL population and SNP markers developed by RAD-seq. The genetic map had 7734 markers and 2689 cM, with an average marker interval of 0.35 cM. We also identified 10 QTLs associated with resistance to black shank across five environments. Furthermore, 138 candidate genes of two major QTLs were identified in the confidence intervals. The result of this research will be helpful in elucidating the mechanism of disease resistance and in marker-assisted selection.