Introduction

Blast disease, caused by the filamentous ascomycete fungus Pyricularia oryzae (Couch and Kohn 2002) is one of the most important biotic constraints to rice production in Benin (Vodouhe et al. 1981; Afouda et al. 2007). It is the most widespread disease in African rice production causing yield losses of up to 100% (Séré et al. 2013). In Benin, blast disease hotspots have recently been mapped by the Africa Rice Center (AfricaRice) and partners, revealing the presence of isolates belonging to varying pathotypes in the rice growing areas. Extensive and uncontrolled use of fungicides is injurious to human health and poses environmental safety concerns. The deployment of host resistance in rice cultivation has been recognized to be the most economical and effective way of controlling the disease and reducing the need for chemical applications. In this regard, more than 100 single resistance genes (R genes) and around 100 quantitative trait loci (QTL) have been identified and some were mapped on rice chromosomes (Ballini et al. 2008; Sharma et al. 2012). Unfortunately, country-specific field evaluation of a set of differential lines, carrying most of identified genes showed, the capacity of new aggressive races of P. oryzae to overcome a number of dominant genes (R genes) in African (Pi1, Pi7, Pi5, Pikp, Pia, Pita2, Piks, Pi3, Pik, Pita, Piz, Pikh and Pikm) (Baboy et al. 1995; WARDA 1999; Odjo et al. 2011). Currently, blast disease poses a significant threat to rice food security in Benin (Odjo et al. 2011). In fact, P. oryzae is highly variable, and loss of resistance in varieties is quite common (Zeigler et al. 1994; Jia et al. 2003). This often appears when resistance is based on a single resistance (R) gene (Mackill and Bonman 1992; Dai et al. 2010). Scientists argued that each of the R genes known so far only acts against a subset of P. oryzae isolates. Over time, varieties may thus become susceptible when new more virulent isolates will occur. However, a continued search for new R genes and their pyramiding into genotypes remains necessary for efficient disease management (Wang et al. 2010; RoyChowdhury et al. 2012). The approach by genes pyramiding will confer overlapping resistance spectra to multiple isolates/races of P. oryzae (Hittalmani et al. 2000). It was also shown that quantitative resistance that provides broad spectrum resistance and is controlled either by several minor or major genes, durably protects the host plant against P. oryzae (Wisser et al. 2005; RoyChowdhury et al. 2012; Wang et al. 2013).

There are only two types of cultivated rice species in the world: Oryza sativa L. and O. glaberrima Steud. Both are commonly grown in Africa. O. glaberrima or African rice was domesticated in West Africa approximately 3500 years ago (Jones et al. 1997). This indigenous rice possesses useful traits such as weed competitiveness, tolerance to various abiotic stressors (drought, iron, acidity, salinity) as well as pest and disease resistance (Pham 1992; Jones et al. 1997; Futakuchi and Sié 2009; Sié et al. 2010; Thiémélé et al. 2010). However, O. glaberrima is becoming rare in African cropping systems where O. glaberrima is mostly replaced by O. sativa cultivars due to the former rice’s negative traits such as poor yield, grain shattering, poor resistance to lodging (Linares 2002; National Research Council 1996). O. sativa or Asian rice comprises two major groups: (1) the Japonica Group with sticky, short-grained genotypes that are mainly cultivated upland; and (2) the Indica Group with non-sticky, long-grained genotypes that are mostly grown in lowland conditions. Asian rice was introduced some 500 years ago to Africa by Portuguese sailors. Today, most varieties grown in Africa belong to O. sativa. Successful artificial interspecific hybridizations of both O. glaberrima and O. sativa have led to a series of popular hybrids known as NERICA (“New Rice for Africa”), which combines the resilience of the African species with the productivity of the Asian ones (Pham 1992; Jones et al. 1997; Futakuchi and Sié 2009; Sié et al. 2010; Thiémélé et al. 2010).

To date, only limited information about agricultural characteristics including pest and disease resistance of African rice genepool is available. Germplasm without consistent data on diseases and pest tolerance are of little value for breeding programs and sustainable paddy production. The present work studied genetic variation a set of cultivated rice accessions (O. sativa and O. glaberrima) of diverse African origins (Benin, Mali, Nigeria, Ivory Coast etc.) and examined their reactions to blast disease. Linkages between the genetic structure/markers and disease resistance was be investigated. Finally, a sub-set germplasm was proposed for a thorough evaluated of agronomic traits.

Materials and methods

Source of the germplasm collection

The 350 rice (5 O. sativa and 345 O. glaberrima) accessions used in this study (Suppl. Table 1), were obtained from the germplasm collection of AfricaRice, a Consultative Group for International Agricultural Research (CGIAR) centre located in Cotonou, Benin. Three reference accessions (WAB0006871 and WAB0007880 described as O. glaberrima rice and WAB0006881 as O. sativa species) were also included in the study as taxonomic and blast resistance/susceptibility references.

DNA extraction and AFLP analysis of rice accessions

Genomic DNA extraction from each accession was done according to the CetylTrimethyl Ammonium Bromide (CTAB) method described by Saghai-Maroof et al. (1984). Five seeds from each of the 350 rice samples were grown in a growth chamber at 27 °C. Approximately 200 mg of leaf samples were harvested on 21 days-old seedlings and ground in liquid nitrogen in 2 mL of CTAB using a pestle and mortar. Then, 500 μL of each of the obtained suspensions were transferred to centrifuge tubes and incubated in a water bath for 30 min at 60 °C. An equal volume of chloroform:isoamyl alcohol (24:1, v/v) was added to each tube. The suspension was gently mixed by inverting the tubes. Tubes were centrifuged at 12,000 rpm for 10 min and the supernatant recovered and mixed with an equal volume of ice-cold isopropanol. DNA was recovered as a pellet by centrifugation at 12,000 rpm for 5 min, washed with 100 μL of 70% ethanol, dried under vacuum and dissolved in 30 μL of TE buffer. Two μL of RNase were added to the DNA and incubated for 1 h. DNA concentrations were measured using the Quantus™ Fluorometer (Promega) and adjusted to approximatively 250 ng/μL.

AFLP polymorphic markers were used to analyse the genomic DNA using the modified protocol described by Vos et al. (1995). To this end, DNA (250 ng/μL) was completely digested with 5 U of each MseI and EcoRI restriction enzymes and ligated to 5 pmol EcoRI and 25 pmol MseI adapters. Pre-amplification of ligation products was done with specific EcoRI and MseI primers without additive nucleotides and over 20 cycles of 94 °C during 30 s, 56 °C during 60 s, and 72 °C during 60 s were run in a thermal cycler. Concentration of the amplified DNA was checked on a 1.5% agarose gel. Next, DNA was diluted 25 times in TE buffer. For selective amplification, three primer combinations with three selective bases at the end (EcoRI + ACC/MseI + CTC, EcoRI + ACC/MseI + CTA, EcoRI + ACC/MseI + CAT) were used to reveal a maximum number of polymorphic markers in the rice samples. EcoRI primer was end-labeled with NED fluorescent dye. Amplified products were mixed with GeneScan-500 ROX size standard (Applied Biosystems Inc., USA) and deionized formamide. Finally, they were loaded on an Applied Biosystems electrophoresis instrument 3130 for electrophoretic size separation with performance optimized polymer (POP 4).

Field screening for resistance to blast in Benin

Benin republic has two rainy seasons during which blast disease can naturally occur. The AfricaRice Cotonou experimental site was identified as a disease hotspot (Odjo et al. 2011). Two field experiments were successively conducted under upland rainfed ecology in 2015 during the two rainy seasons, i.e. from April to July and from August to October, respectively. The first experiment was implemented using an augmented randomized complete block design with 17 control varieties replicated in the design whereas an Alpha Lattice design with four replicates was used during the second experiment. For each of the 350 accessions, 18 seeds were sown in a plot of three 50 cm long rows with a spacing of 20 cm between rows and 10 cm between plants. Fourteen days prior to sowing of the latter accession seeds, three infesting rows, consisting of three blast-susceptible reference varieties (Maratelli, CO39 and IRBLTA2-PI) were sown perpendicularly to the test plots in order to spread the disease. Thinning of the accessions was done 14 days after sowing to maintain only one seedling grown in each hill. A pre-drilling base application of 300 kg/ha of NPK fertilizer (15-15-15) was applied at sowing of the test accessions. Next, 300 kg/ha of urea was applied in two equal splits at 21 and 42 days after sowing. Hand-weeding and watering were done as per need. No treatment against diseases and insects was done. Disease symptoms were scored on the inner plants of each row of each plot leaving aside the first and last plant in each row. Scoring was done weekly starting from the appearance of the first symptoms until harvest. The IRRI 1–9 scale (2013) was used and four to five scoring events were performed during the two experiments. Mean Disease Index (DI) was obtained by calculating the average disease scores for each accession. Four disease resistance classes were considered: resistant (R): DI < 3, moderately resistant (MR): 3 < DI < 4, moderately susceptible (MS): 4 < DI < 5 and susceptible (S): DI > 5. A representative sub-set of the total germplasm approximate genetic diversity was then selected according to their blast response and variability for AFLP markers for efficient use in crop improvement programmes. Actions were taken to avoid duplicate the most similarly similar accessions.

Statistical data analysis

AFLP electropherograms were analyzed by GeneMapper Software Version 4.0 using the marker GeneScan-500 ROX in the range size of 30–500. Genotype marker results were automatically compiled in a standard binary Excel format (1; present, 0: absent). Polymorphism Information Content (PIC) values were calculated according to Botstein et al. (1980). Genetic dissimilarity coefficients using the Jaccard distance were calculated to determine the genetic relationship between accessions and the hierarchical clustering tree was constructed using Unweighted Neighbor Joining (UWNJ). Analysis of molecular variance (AMOVA) was carried out to estimate molecular diversity at each hierarchical level among and within the groups identified and genetic diversity parameters calculated using GenAlEx 6.502. DI values were subjected to a Linear Discriminant Analysis (LDA) with identified polymorphic AFLP markers based on continuous and categorical approaches using the R software (R Core Team 2017) to define the linear combination of AFLP markers which best separate accessions according to blast resistance. In addition, stepwise regression models were constructed for selecting an appropriate subset of markers to predict the rice blast resistance.

Results

Genetic diversity detected by AFLP markers among assembled rice germplasm

The three primer combinations generated a total of 10,384 bands out of which 72% were polymorphic at 77 different loci (putative loci/genome landmarks). Average Polymorphic Information Content (PIC) detected was 24 ± 4%. The primer combination EcorI-ACC:Mse-CAT appeared to be most informative with a PIC of 29% (Table 1). Jaccard’s genetic dissimilarity coefficients ranged from 0.03 to 0.98 across all the germplasm.

Table 1 Results of polymorphism detection for each primer combination

The UWNJ tree based on Jaccard’s dissimilarity coefficients derived from the 77 polymorphic AFLP markers evidenced three main genetic clusters: one with 184 accessions including the O. glaberrima control WAB0006871; a second with 83 accessions including the only O. sativa control (WAB0006881, Moroberekan), and a third with 86 accessions including the O. glaberrima control WAB0007880 (Fig. 1). Dissimilarity coefficients varied from 0.05 to 0.84 in cluster 1 and from 0.03 to 0.59 in cluster 3, whereas a larger range was observed in cluster 2 (0.03–0.98). The highest Shannon’s diversity index (I) was found within cluster 2 with I = 0.4 followed by cluster 1 with 0.32, whereas a much smaller I-value was observed in cluster 3 (0.26). Expected heterozygosity averaged 0.26, 0.21 and 0.17 in cluster 2, cluster 1 and cluster 3, respectively (Table 2). A high proportion of genetic diversity was most apparent in cluster 2, which comprised all O. sativa accessions, including control variety WAB0006881. Cluster 1 and cluster 3 contained exclusively O. glaberrima accessions. AMOVA used to determine the percentage of genetic variation within and among the three identified clusters (Table 3), revealed that the within-cluster diversity explained most of the genetic variation (77%), whereas only 23% of total variance was due to differences between the three clusters (P < 0.001).

Fig. 1
figure 1

UWNJ tree cluster analysis showing accessions reaction to blast disease. Colours indicate the disease status: resistant accessions (Severity < 3) are written in red; moderate resistant (3 < Severity < 4) in blue; moderate susceptible (4 < Severity < 5) in green and susceptible (Severity > 5) are indicated in black. (Color figure online)

Table 2 Level of genetic variability estimated among accessions of the three identified clusters
Table 3 AMOVA analysis for cluster differentiation by AFLP markers

Frequency of blast disease among 350 cultivated rice accessions

DI ranged from 0 to 9 in the first experiment, while for the second experiment, DI ranged from 0.33 to 7 (Fig. 2). A significant positive association (R2 = 0.64, P < 0.001) was found for blast disease index between the two successively conducted field experiments from April to October 2015. In other words, blast disease score distributions were similar in the two experiments. However, we observed that disease pressure was lower during the second experiment. Nevertheless, resistance/susceptibility status of accessions did not change. Average disease scores recorded on the susceptible control varieties WAB0007142 (CO 39) and GSOR301253 (Maratelli) were 8 and 7.6, respectively. Out of the total 350 tested accessions, 12% were highly resistant, 24% moderately resistant, 2 9% moderately susceptible and 35% susceptible. The highest number of resistant accessions was found in cluster 2 (20), whereas cluster 1 contained the highest number of moderately resistant rice accessions (50).

Fig. 2
figure 2

Correlation of disease severity means of two field experiments for blast resistance screening carried out in Benin from April to October 2015

Relationship between AFLP markers and blast disease resistance/susceptibility

Among the major genetic clusters identified based on 77 polymorphic AFLP markers, we observed a sub-cluster A within cluster 2 that contained the highest number of resistant accessions compared to cluster 1 and 3. This affinity in dispersal of accessions reaction spectrum, gives an indication that AFLP markers may help to detect genetic differences in the resistance levels of rice accessions by distances correlation (Fig. 1). LDA using the total of 77 AFLP markers was performed (Fig. 3) revealing a significant association of AFLPs to accessions’ reactions in the field (R2 = 0.37). A series of models were performed to better examine the relation of genetic diversity with blast disease. Stepwise regression analysis revealed that only 13 out of 77 polymorphic markers were necessary to explain accurately accessions reactions pattern (Table 4). Based on 13 most significant AFLP markers, we attempted to predict three disease classes (Resistant: DI < 3, moderate resistant: 3 < DI < 4 and susceptible with DI > 4) by a General Linear Model (GLM). Thi s GLM showed significant coefficients (P < 0.05) for a total of seven markers with disease status (Table 5). The probability of accessions assignment to the right disease class was 68%. Better results were obtained when we considered only two disease classes (DI < 3 and DI > 3). In case a binary classification was used, the algorithm was able to explain 90% of the responses and the coefficients of a number of eight markers were found to be significant (Suppl. Table 2).

Fig. 3
figure 3

Genetic differentiation for response of 350 rice accessions to blast disease by Linear Discriminant Analysis using AFLP markers colours indicate the disease status: resistant accessions (Severity < 3) are written in red; moderate resistant (3 < Severity < 4) in blue; moderate susceptible (4 < Severity < 5) in green and susceptible (Severity > 5) are indicated in black. (Color figure online)

Table 4 Stepwise regression revealed 13 associated AFLP markers to the blast disease
Table 5 Association (R2) of AFLP loci with the blast severity split into three main disease clusters: resistant (R), moderate resistant (MR) and sensible (S)

Among the significant markers, CTA22-366 with a size of 366 base pairs was particularly present in highly resistant accessions. Markers utility was such that at molecular level, when CTA22-366 was present, the observed status of the accessions was always resistant. But when this marker was absent, we found that accessions were either resistant or susceptible. A representative sub-set of 42 rice accessions (12% of the total germplasm) of the total 350 rice accessions studied was proposed (based on their molecular representativeness and diversified blast resistance responsiveness) for breeding purposes. This selected rice sub-set consisted of 26 resistant accessions, 9 moderately resistant and 7 susceptible accessions.

Discussion

The PIC value obtained for the studied germplasm collection (mean = 24%) was lower than those reported by Semon et al. (2005) and Dramé et al. (2011) who assessed genetic diversity of 198 and 82 O. glaberrima accessions, respectively using Simple Sequence Repeats (SSR) markers. They reported PIC values of 34 and 52%, respectively. According to Vieira et al. (2016), there are some advantages of SSR markers in polymorphism detection due to multi-allelic and codominant nature. Furthermore, a higher accessions number (350) was examined in the present study and might contain more genetically similar samples.

The partitioning of molecular variance in the total germplasm indicated significant genetic variation both among clusters (23%) and within clusters (77%). Genetic diversity was found to be more within than between clusters. High genetic variance estimates within-groups ranging from 71 to 95% have been reported in earlier studies of African cultivated rice (Ming et al. 2010; Salem and Sallam 2016; Ndjiondjop et al. 2017). Our results provide a global insight into how the germplasm stratification manifests, which revealed an asymmetric division of the collection. Cluster 2, including all O. sativa accessions from Benin as well as the control WAB0006881 showed the highest level of genetic diversity. This result is in close agreement with many other studies, suggesting a significant reduction in genetic diversity in O. glaberrima (Li et al. 2011; Wang et al. 2014). Within cluster 2, the sub-cluster A grouped 20 O. glaberrima accessions together with the six O. sativa accessions. As it sometimes happens that farmers grow both species in the same field, it is possible there was a gene flow between O. sativa and O. glaberrima. Spontaneous interspecific hybridizations is not uncommon in rice fields (Second 1982; Barry et al. 2007; Nuijten et al. 2009). Thereby, the pollen flow is usually from O. sativa to O. glaberrima (Sano 1989). According to Jusu (1999) and Semon et al. (2005), interspecific hybridizations result in new varieties.

Among all screened rice accessions, 12% were highly resistant and 24% were moderately resistant to blast disease. The other accessions were moderately susceptible or susceptible (29 and 35%, respectively). Results suggest a large genetic variability in blast resistance studied within the germplasm. Accessions showing high blast disease resistance might possess a single resistance gene. Quantitative variability in blast resistance resulting from multiple gene control could be detected within the category of moderately resistant accessions (Vasudevan et al. 2014). Out of 353 tested rice accessions (including the three controls), 50 were moderately resistant in cluster 1, whereas only 15 and 23 moderately resistant were found in cluster 2 and cluster 3, respectively. This suggests a higher level of quantitative resistance within cluster 1 than within the other clusters. It would be interesting to compare field resistance levels with artificial inoculation results using specific Beninese blast isolates. There is also a need to evaluate the yield of the identified resistant material for breeding purposes. Genebanks and breeders will more likely conserve O. glaberrima germplasm when it is high-yielding (Wambugu et al. 2013). The remaining, lower-yielding resistant germplasm can be used by breeders as resistance gene donors.

Diversity in cultivated rice accessions was found to correlate with varying phenotypic blast disease reactions. Based on their AFLP pattern, susceptible and resistant accessions could be distinguished. A number of AFLP markers were significantly associated with blast resistance. These associations were strongest when using CTA22-366 marker. Marker utility for blast resistance screening will be examined through future genetic studies by developing hybrid individuals and backcross populations. This study suggested a total of 13 AFLP markers which have the potential to identify a highly variable germplasm core set that is easier to evaluate and would facilitate the conservation and use in future breeding programs. Particularly, this will help formulate future strategies for improving rice blast resistance. Clustering identified accession groups that are genetically distinct from and that included different levels of blast resistance. Plant genetic diversity information is important pre-breeding information for selecting parents that can be used in intra and inter-crossing to broaden the genetic base of modern rice cultivars (Thakur et al. 2015). The choice of donor parents could be based on the knowledge on the accession’s genetic group. AFLPs was confirmed to be a very useful molecular tool for exploration of relationship of genetic diversity with blast resistance. This new resistant germplasm is an additive value for broadening horizon towards the identification of new blast R genes.