Introduction

Phytophthora sojae (P. sojae), a soil-borne oomycete pathogen, causes Phytophthora root or stem-rot (PRR) of soybean (Glycine max (L.) Merr) at different growth stages of soybean (Schmitthenner 1985). As the second most destructive disease in soybean, the yield losses caused by PRR are 10–40% or complete yield loss (Li and Ma 1999; Zhang et al. 2013a), and the economic loss is $10–20 billion per year (Tyler 2007). In China, PRR was first found in Heilongjiang Province in 1991 (Shen and Su 1991) and has been continuously found in other soybean production areas. Although new races and virulence biotypes have continuously emerged in China (Huang et al. 2016), P. sojae race 1 is still the most prevalent strain in China, especially in Northeast China.

Soybean resistance to P. sojae is controlled by two mechanisms, partial (or tolerance) and complete resistance (Sugimoto et al. 2012). Presently, the analysis of P. sojae resistance mechanisms heavily relies on linkage analysis derived from population mapping of parents with significant phenotypic differences. Partial resistance to P. sojae is regulated by a complex QTL-mediated resistance network. More than 70 QTLs have been found to lead to partial resistance to P. sojae through linkage analysis. Of these QTLs, 15 have been identified in eight separate populations of the PRR-resistant cultivar ‘Conrad’ (Burnham et al. 2003a, b; Han et al. 2008; Li et al. 2010; Tucker et al. 2010; Wang et al. 2010, 2012a, b; Weng et al. 2007; Wu et al. 2011a, b). These QTLs are mainly distributed on chromosome 2 (Chr.02), Chr.13, Chr.18 and Chr.19. For complete resistance, the first single-dominant Rps gene (Rps1k) was identified in 1957. Currently, nearly 30 Rps genes or alleles on nine different soybean chromosomes (Chr.) have been identified from various resistance sources based on linkage analysis and genetic mapping. Of them, Chr.03 contained the most Rps genes/alleles, including Rps1 (five alleles Rps1a, Rps1b, Rps1c, Rps1d, and Rps1k), Rps7, Rps9, RpsYD25, RpsYD29, RpsUN1, RpsQ, RpsHC18, RpsWY, RpsHN and RpsX (Demirbas et al. 2001; Weng et al. 2001; Gao et al. 2005; Fan et al. 2009; Sugimoto et al. 2011; Sun et al. 2011; Wu et al. 2011a; Lin et al. 2013; Zhang et al. 2013b, 2014; Li et al. 2017; Zhong et al. 2017; Cheng et al. 2017; Niu et al. 2017; Zhong et al. 2019). Additionally, Rps2 and RpsUN2 were located on Chr.16 (Demirbas et al. 2001; Lin et al. 2013). Rps3 (three alleles Rps3a, Rps3b, and Rps3c) and RpsSN10, which was either linked to or allelic with Rps8, were identified on Chr.13 (Demirbas et al. 2001; Sandhu et al. 2005; Gordon et al. 2006; Yu et al. 2010; Burnham et al. 2003a). Rps4, Rps5, Rps6 and RpsJS were found on Chr.18 (Demirbas et al. 2001; Sandhu et al. 2004; Sun et al. 2014a, b). RpsYB30, RpsZS18, RpsSu, Rps10 and Rps11 were located on chromosome 19, 2, 10, 17 and 7, respectively (Zhu et al. 2007; Yao et al. 2010; Wu et al. 2011b; Zhang et al. 2013a; Ping et al. 2016). In recent years, the P. sojae strain has become increasingly complicated due to mutations and outcrossings among different P. sojae strains (Grau et al. 2004). The emergence of new virulent pathotypes of P. sojae often overcomes Rps gene-mediated resistance in commercial cultivars (Grau et al. 2004). Normally, Rps genes provide P. sojae resistance for only 8–15 years. Thus, researchers must constantly seek for and identify some new Rps genes. Recently, linkage analysis has been used in discovering QTLs for the resistance to P. sojae; however, due to the lack of the repeatedly reported loci, most of them were still difficult to directly utilise in molecular-assisted selection (MAS).

Genome-wide association analysis (GWAS), an alternative to linkage analysis, has begun to be used to effectively analyse the genetic basis of resistance to the disease (Sun et al. 2014a; Schneider et al. 2016; Huang et al. 2016). Sun et al. (2014b) detected 4 association signals that led to partial resistance via 214 accessions and 495 SSR markers. Schneider et al. (2016) found 16 QTNs that led to partial resistance based on 1395 Korean samples and 495 SSR markers. High-throughput sequencing technologies have further promoted the application of GWAS for studying P. sojae resistance. Huang et al. (2016) identified 14 association signals from 11 P. sojae isolates through genome-wide association analysis (GWAS) based on 224 germplasm accessions and 1645 single-nucleotide polymorphisms (SNPs) and reverified Rps1b, Rps1c, Rps4, Rps7 and novel resistance genes. This study only found a few novel QTNs that were different from the QTLs identified by the bi-parental mapping strategy due to lower marker density. By early 2019, no studies had been conducted to identify the QTNs underlying P. sojae race 1 resistance based on high-throughput technology.

In the present study, GWAS and linkage mapping of resistance to P. sojae race 1 based on high-resolution SNP maps generated from genome resequencing were performed. The aim of this study was to identify loci associated with P. sojae race 1 resistance and to screen candidate genes located in QTN regions by a qRT-PCR approach and Rps loci-based association analysis.

Materials and methods

Plant materials

Two mapping populations were used in this study. One was a natural population consisting of 225 diverse soybean accessions to represent the genetic diversity inside and outside China. These germplasms covered the regions of China (from 19 degree to 48 degree of north latitude and from 74 degree to 128 degree of east longitude, Table S1). Additionally, 24 soybean cultivars from outside of China were also included. Among the 225 germplasms, soybean cultivars with good and poor resistance to P. sojae race 1 were involved, including ‘DongongL-28’ (a highly resistant cultivar) and ‘Hefeng 25’ (a highly susceptible cultivar). The cross population, consisting of 109 F5:15 RILs, was derived from crossing ‘Hefeng 25’ and ‘DongongL-28’.

Phenotypic evaluation of soybean resistance to P. sojae race 1

P. sojae race 1 resistance of all the tested plant materials was evaluated through the hypocotyl wound infestation technique described by Burnham et al. (2003a). P. sojae race 1 was isolated from infected soybean plants in 2017. A total of 30 seedlings of each tested material were incubated in a mist chamber at 25 + 2 °C, with 100% relative humidity and a 14-h photoperiod at a light intensity of 350 μmol m−2 s−1. All tested materials were evaluated 5 days after inoculation. A line was classified as resistant if 70% of the seedlings were alive and susceptible if 30% of the seedlings were alive. A line was classified as resistant if more than 70% of the seedlings were alive and susceptible if less than 30% of the seedlings were alive. Evaluation of all lines was repeated three times.

SNP genotyping data collection

Leaf genomic DNA of the association panel was isolated via the CTAB method and then reduced-sequencing via the SLAF-seq methodology (Sun et al. 2013). The reference genome Williams 82 (www. phetozome.net) was pre-analysed, and two restriction digestion enzymes, MseI (EC 3.1.21.4) and HaeIII (EC 3.1.21.4), were selected to produce evenly distributed sequencing tags in unique genomic regions of soybean. For each tested sample, more than 50,000 sequencing tags (300–500 bp in length) were obtained, which were further used to define sequencing libraries. The combination of the barcode method and Illumina Genome Analyser II system (Illumina Inc., San Diego, CA, USA) was utilised to obtain 45-bp sequence reads at both ends of the sequencing tags from each sample library. Alignments between raw paired-end reads and the reference genome Williams 82 (www.phetozome.net) via the Short Oligonucleotide Alignment Program 2 (SOAP2) software were conducted. Raw reads in the same genomic position were used to define SLAF tags. A total of 60 thousand high-quality SLAF tags were obtained from each of the 225 genotypes. In SNP calling, the minor allele frequency (MAF) threshold was set at 0.05. If the depth of the minor allele was larger than 1/3 of the total sample depth, the genotype was regarded as heterozygous.

For linkage analysis of a population, raw sequencing data were generated from the Illumina HiSeq2500 system. The soybean reference genome (Glycine max Wm82.a2. v1) was used for read mapping of two parents using SOAPaligner. The sequencing depth and coverage relative to the reference genome were calculated based on the mapping results. A consistent sequence of each parent was obtained using SOAPsnp software. The reads of the RIL population were aligned with the parental genome sequences using SOAPaligner software. For each sample, only the reads of unique mapping positions were considered. If a mismatch happened between a certain single read and one of the two parents, the read was considered to be from the genome of the other parent; a similar process was used for all the unique mapping reads. Chi-square tests were conducted for all SNPs to detect segregation distortion. Markers with a segregation distortion test p < 0.001 or that contained an abnormal base were filtered out. Markers were deleted if more than 10% of individuals had missing genotypes. For construction of the linkage map, the genetic distance between bin markers was calculated using the Kosambi mapping function implemented in the QTL IciMaping v3.0 software.

Population structure evaluation and linkage disequilibrium (LD) analysis

The population structure of the tested natural population was defined by the principle component analysis (PCA) approach using GAPIT software (Lipka et al. 2012). The SNP threshold value (MAF > 0.1 and missing data < 10%) and r2 (squared allele frequency correlations) were used to define LD between pairs of SNPs based on the TASSEL version 3.0 software (Bradbury et al. 2007). In contrast to the GWAS, missing SNP genotypes were not imputed with the major allele before LD analysis. The parameters of the programme included the MAF (> 0.05) and the integrity of each SNP (> 80%).

Genome-wide association analysis

Association signals of resistance to P. sojae race 1 were identified based on 28,722 SNPs from 225 tested accessions using the Compressed Mixed Linear Model (CMLM) in GAPIT (Lipka et al. 2012) and the FARMCPU method (Liu et al. 2016). p values were calculated using the Bonferroni method at 0.05 (≤ 1.74 × 10−6) (Holm 1979). The p values were set as the threshold to determine whether significant association signals existed.

QTL mapping based on linkage analysis

QTL analysis was performed using the CIM method implemented with QTL IciMaping v3.0 using stepwise regression for cofactor selection. The LOD score threshold was determined by the results of 1000 permutations for each trait. The percentage of phenotypic variance and additive effect explained by a QTL for a trait were also estimated.

Prediction of candidate genes controlling resistance to P. sojae race 1

Candidate genes located in the 200-kb genomic region of each peak SNP were classified and annotated underlying the soybean reference genome (http://www.soybase.org).

Quantitative RT-PCR assay

Two lines, ‘Williams 82’ (resistant to P. sojae race 1) and ‘Hefeng25’ (susceptible to P. sojae race 1), from the association panel were selected to investigate their expression pattern. The same method described in the resistance evaluation section was used to inoculate the above two lines of P. sojae race 1. The control groups of each line were exposed to the same cultural conditions except for inoculation. Stems of each line from the inoculation group and the control group were collected at 0 h, 4 h, 8 h, 12 h, 24 h, 36 h and 48 h after inoculation. Total mRNA of each sampled root was isolated using the GenElute™ Total RNA Purification Kit. The expression level of candidate genes was determined by quantitative real-time reverse transcription polymerase chain reaction (RT-PCR) analysis. Real-time RT-PCR amplifications were performed using a real-time RT-PCR kit according to the manufacturer’s instructions (Takara, Japan) on a CFX96 Touch™ Real-Time PCR Detection System (Bio-Rad, USA). Approximately 1 μg of total RNA was used for each reverse transcription according to the protocol of the ReverTra Ace qPCR RT Kit (TOYOBO Co., Ltd., Japan). Each amplification reaction was performed with 1 μL of the resulting cDNA first-strand cDNA synthesis solution, 1.5 μL of each primer (10 μM), 10 μL of SYBR Green PCR Master Mix and 7.5 μL RNase free ddH2O in a total reaction of 20 μL. The PCR cycling conditions were as follows: 95 °C for 5 s, 60 °C for 20 s, 72 °C for 20 s for 40 cycles and 60 °C for 1 min. The relative mRNA level of each candidate gene was evaluated against the soybean actin4 (GmACTIN) (GenBank Accession Number AF049106) reference gene. Three biological duplications and three technical repetitions of each gene were conducted by qRT-PCR. The sequences of the primer pairs used to amplify the candidate genes are presented in Table S2.

Association analysis for Rps loci on Chr.03

For eleven resistant lines and twenty-two susceptible lines, genome resequencing with a 10-fold depth was conducted on an Illumina HiSeq 2500 sequencer. Paired-end resequencing reads were mapped to the soybean reference genome with BWA (Version: 0.6.1-r104) (Zhou et al. 2015) using the default parameters. SAMtools4 (Version: 0.1.18) software was used to convert the mapping results into the BAM format and to filter unmapped and non-unique reads. Duplicated reads were filtered with the Picard package (picard.sourceforge.net, Version: 1.87) (Zhou et al. 2015). The BEDtools (Version: 2.17.0) (Zhou et al. 2015) coverage Bed software was used to compute the coverage of the sequence alignments. A sequence was defined as absent if its coverage was lower than 90% and present if its coverage was greater than 90% (Zhou et al. 2015).

SNP detection was performed using the Genome Analysis Toolkit (GATK, version 2.4–7-g5e89f01) (Zhou et al. 2015) and SAMtools. Only SNPs detected by both methods were analysed further. SNPs with allele frequencies lower than 20% in the population were discarded. SNP annotation was performed according to the soybean genome (Williams 82 assembly v2.1) using the package ANNOVAR (Version: 2013-08-23) (Zhou et al. 2015). The sequence variations of the genomic region of Rps loci on Chr.03 was collected based on the SNP genotyping data. Association analysis was conducted with the same method to GWAS as described above.

Results

Phenotype characteristic of P. sojae race 1 resistance in RILs and the natural population

The mean, coefficient of variation, skewness and kurtosis for resistance to P. sojae race 1 of the association and linkage analysis panels are shown (Table S3 and Fig. 1). The association panel consisted of 225 accessions with resistance to P. sojae race 1 (ranging from 0 to 100%), and an average disease loss percentage of 51.74% was calculated (Table S3). The linkage panel comprised 109 RIL lines, which varied from 0 to 100 (with an average disease loss percentage of 38.71%). The coefficients of variation in the association and linkage panels were 0.61 and 0.91, respectively. Normal distributions without any significant skewness or kurtosis for the association and linkage panels were observed (Table S3).

Fig. 1
figure 1

Phenotypic distribution of the seedling death rate of the tested soybean lines after inoculation with P. sojae race 1

Genotyping and characteristic analysis of the association and linkage panels

For the association panel, a total of 28,722 SNPs was identified through SLAF sequencing (minor allele frequency ≥ 0.05 and missing data ≤3%), which were distributed across all 20 chromosomes of the soybean genome. These SNP markers covered 947 Mbp, which accounted for approximately 99.79% of the entire soybean genome (Fig. S1A). The number of SNPs was different among the 20 chromosomes, and the mean SNP per chromosome was 1436 (ranging from 747 in Chr.11 to 2335 in Chr.18). The average marker density was approximately one SNP per 33.04 kb (Fig. S2A).

For the linkage mapping population, a total of 5221 bins markers along the 20 chromosomes were identified based on resequencing (Fig. S1B), with their physical coordinates referenced to the soybean reference genome. Chr.18 and Chr.12 also had the most and fewest bin markers, respectively. The total genetic length of this bin map was approximately 3152.3 cM, with a mean interval between markers of 0.60 cM (an average physical bin length of 181.41 Kb). The longest and shortest distances in this map were 193.67.24 cM (Chr18) and 115.17 cM (Chr.11), respectively, which included 381 and 165 bin markers, respectively.

Genome-wide association and linkage mapping of P. sojae race 1 resistance

Principal component and kinship analyses of the association panel were conducted using 28,722 SNPs, and the results showed that the first three principle components (PCs) could explain 16.19% of the total genetic variation (Fig. S2C) and that the relatedness among the tested accessions was low (Fig. S2B). In this study, the LD decay distance was 212 Kb (Fig. S2D). For association analysis, the compressed Mixed Linear Model (CMLM) and Fixed and random model Circulating Probability Unification (FarmCPU) were utilised to identify association signals for P. sojae race 1. A total of eight QTNs were associated with P. sojae race 1 resistance (Fig. 2A, B and Table 1), which were distributed on 5 chromosomes: Chr.03, Chr.07, Chr.14, Chr.15 and Chr.17. Among them, four QTNs were overlapped or located near the genomic region of the known QTL underlying P. sojae race 1 resistance, and the four another QTNs were novel for P. sojae race 1 resistance (Table 1). Rprr-3-2 and Rprr-17-2 were detected by two mapping methods, CMLM and FarmCPU. The LD pattern among the SNPs from the genomic region of association signals on Chr.03 and Chr.17 was evaluated. Rprr-3-1 and Rprr-3-2 were not in the same LD block. Also, Rprr-17-1, Rprr-17-2 and Rprr-17-3 were not in the same LD block (Fig. S3), indicating that these loci might be independent with each other. The allele effects of all the identified resistance QTNs were analysed, and the results indicated that different alleles for each identified QTN significantly affected P. sojae race 1 resistance of tested samples (Table 1). These identified QTNs and their alleles were valuable for MAS of cultivars with excellent resistance to P. sojae race 1.

Fig. 2
figure 2

GWAS and QTL mapping results. (A) Manhattan plot of association mapping of soybean resistance to P. sojae race 1 based on the CMLM method. (B) Manhattan plot of association mapping of soybean resistance to P. sojae race 1 based on the FARMCPU method. (C) QTL mapping of soybean resistance to P. sojae race 1 by the CIM method, and the candidate genes in the colocation region of the Chr03 locus by GWAS and linkage mapping

Table 1 Peak SNP and beneficial alleles associated with soybean resistance to P. sojae race 1 identified by GWAS

For linkage analysis, two QTL (Qprr3-1 and Qprr3-2) intervals underlying P. sojae race 1 resistance were detected that both mapped on Chr.03 (Fig. 2C). These two QTLs explained 5.87% and 56.7% of the phenotypic variation (Table 2), which was derived from ‘DongongL-28’. QTL Qprr3-2 and QTN Rprr-3-2 colocated by linkage mapping and GWAS.

Table 2 QTL mapping to identify loci related to soybean resistance to P. sojae race 1

Analysis of the candidate genes providing resistance to P. sojae race 1 on Chr.03

For association analysis, 84 genes were found in the 200-kbp flanking region of eight QTNs (Table S4). Of these identified genes, four had no functional annotations/known domains or were derived from unknown functional families. Another 80 genes were classified into leucine-rich repeat protein kinase family proteins, LRR and NB-ARC domain-containing disease resistance proteins, hormone metabolism, secondary metabolism and signalling (Table S4). Many of these identified genes have been shown to be involved in plant disease resistance (Jiang et al. 2018; Marie-Cécile et al. 2014). Jiang et al. (2018) reported a nucleotide-binding site leucine-rich repeat gene from tomato that conferred resistance to Phytophthora sojae in tomato. This gene has the same domain as Glyma.03G034200, located near Rprr-3-1 of Chr.03. Marie-Cécile et al. (2014) found a downy mildew resistance gene, PDLP1, and Glyma.07G225800, which derived from the same gene family. Other genes were novel for Phytophthora sojae resistance.

The resistance loci on Chr.03 were identified by both linkage mapping and GWAS. Hence, the eighteen genes in the 200-Kb flanking region of the QTLs Qprr3-2 and QTNs Rprr-3-1 and Rprr-3-2 on Chr.03 were considered candidate genes for further analysis. The 1.5-kb upstream sequences of the eighteen genes were predicted for cis-acting elements. Cis-acting regulatory elements involved in methyl jasmonate responsiveness, salicylic acid responsiveness and/or defence and stress responsiveness were found in the upstream region of the sixteen genes (Table S5). Quantitative RT-PCR of the sixteen candidate genes was performed based on the results of the cis-acting element prediction. Of the sixteen genes, Glyma.03G033700, which encodes a C2H2-like zinc finger protein, was upregulated by P. sojae race 1 in the resistant line ‘Williams 82’ at 4 h post inoculation. Glyma.03G033800, which encodes a cell wall β-expansin protein, was downregulated by P. sojae race 1 in the resistant line ‘Williams 82’ at 4–12 h post inoculation (Fig. 3).

Fig. 3
figure 3

Expression patterns of the candidate genes. The mRNA levels of each candidate gene were analysed in resistant soybean accession ‘Williams 82’ and susceptible soybean accession ‘Hefeng25’. Infected and uninfected samples were collected at 0, 4, 8, 12, 24, 36 and 48 h post inoculation from isolated P. sojae race 1 and empty carrot agar medium. The GmACTIN (GenBank Accession Number AF049106) gene was used as an internal reference. The values are the mean ± SD of three biological replicates

A total of 379 SNPs were identified in the genomic region of Rps loci on Chr.03 (Fig. 4A). One SNP, Chr.03:3930474, was significantly associated with soybean resistance to P. sojae race 1. Two genes closest to Chr.03:3930474 are Glyma.03G033800 and Glyma.03G033700. (Fig. 4B). Combined with the result of quantitative RT-PCR assay, two genes, Glyma.03G033700 and Glyma.03G033800, were identified as the possible causal genes at the loci on Chr.03 of P. sojae race 1 resistance in soybean. These genes should be further studied to determine the mechanism of P. sojae resistance and their application in MAS.

Fig. 4
figure 4

Physical map and association analysis underlying P. sojae resistance of Rps loci on Chr.03. (A) Physical map of Rps loci on Chr.03 reported by the present study and previous studies. The marks with red colour indicate loci detected by the present study. The marks with blue colour indicate loci detected by previous studies. (B) Association analysis based on 379 SNP within the Rps loci (Chr.03: 2.9 Mb–9.9 Mb) generated from 33 soybean germplasms through genome resequencing. The X-axis is the physical position of SNP, and the Y-axis indicates the significance level of each SNP. The blue dotted line indicates the threshold of -Log10(p) value for the association signals

Discussion

Breeding new soybean cultivars with resistance is the most effective method in controlling P. sojae in the field. Resistance to P. sojae is controlled by two mechanisms, partial (or tolerance) and Rps-mediated complete resistance (Sugimoto et al. 2012). No single strategy can completely control P. sojae. Partial resistance offers a high degree of protection by delaying or limiting the damage caused by P. sojae to the soybean plant, even under low or moderate disease pressure (Dorrance et al. 2003; Tooley and Grau 1982). However, partial resistance cannot provide a level of control adequate to maintain profitable yields. Rps-mediated complete resistance is not durable due to the selection pressure that is exerted by the diverse virulence pathotypes of P. sojae. Although Rps-mediated complete resistance does not control all strains of P. sojae, Rps genes or alleles are still helpful in managing specific P. sojae strains. P. sojae race 1 is the most prevalent strain found in Northeast China, but other complicated P. sojae populations have been found in this region as well. Thus, long-term management of P. sojae in Northeast China may require the integration of multiple disease management strategies, such as breeding cultivars with both partial resistance and appropriate single gene resistance combined with tillage, rotation, soil drainage and seed treatments (Dorrance et al. 2003; Han et al. 2017). In this study, 225 accessions, including landraces or elite cultivars, were examined for resistance to P. sojae race 1, most of which were a key parent or the main cultivated cultivars in Northeast China. The results indicated that only a few of the tested samples had a high level of partial resistance to P. sojae race 1 (Table S1 and Fig. 1), some of which also were partially resistant to P. sojae isolated from Northeast China (data not shown). These accessions with resistance or partial resistance may be useful as sources of resistance for breeding new varieties adapted to Northeast China.

To date, few resistance QTLs have been identified based on different mapping populations using limited specific materials (www.soybase.org); most of these identified QTLs are specific to the genetic background. Therefore, the novel QTL/gene of the cultivar from Northeast China still needs to be identified for P. sojae MAS. In this study, 8 QTNs and two QTLs were identified to be associated with P. sojae race 1 through association and linkage analyses. Qprr3-2, a QTN identified from GWAS, has a similar genomic region to that of Rprr-3-2 from the resistance source ‘DongnongL-28’. Among these identified QTNs, four QTNs overlapped with or were near the known resistance QTL (Tables 1 and 2). Another four QTNs were novel for P. sojae resistance. Presently, only a few P. sojae race 1 resistance genes have been identified and cloned. GWAS could offer valuable information to identify and validate candidate genes of P. sojae resistance, especially in the relatively smaller LD block (average length 150–200 kbp) of soybean. Among the identified candidate genes, Glyma.03G033700 and Glyma.03G033800 were determined being the potential candidates response to P. sojae race 1 in soybean through Rps loci-based association analysis and a quantitative RT-PCR assay. Although Rprr-3-2 and Qprr3-2 have similar genomic regions as a cluster of Rps loci such as RpsYD29 (Zhang et al. 2013b), RpsHN (Niu et al. 2017) and RpsUN1 (Lin et al. 2013), etc. (Fig. 4A), the resistance donor of the soybean line and the candidate genes were different from those found in our study. However, the specific mechanism of the two candidate genes identified in this study still needs to be further analysed.

The soybean germplasm has been shown to be a valuable reservoir of disease resistance (Athow et al. 1986; Dorrance and Schmitthenner 2000; Kyle et al. 1998), which can be used to improve P. sojae resistance in soybean. In this study, two resistance lines (Ken14 and DN1068) exhibited high and partial resistance to P. sojae race 1, and these two lines had high yields and were adapted to Northeast China. Hence, they are of great value to transfer high resistance levels to other cultivars in Northeastern China through MAS.