Introduction

Cassava (Manihot esculenta Crantz) is an important tuberous root crop in Africa, South America, and Asia. It is the sixth most important staple crop worldwide and is consumed by 600 million people. Cassava offers the advantages of high photosynthetic efficiency, high root/starch yield, and tolerance of drought and poor soils. Furthermore, cassava is cultivated to be a cash crop for the bioenergy and starch industries in China and other countries. The annual production of cassava in 2010 was 229 million tons and production increased by approximately 5 million tons/year from 2000 to 2010 (FAOSTAT). However, current production is insufficient to meet the rapidly increasing market demand.

The application of molecular marker techniques for quantitative trait locus (QTL) mapping is an effective approach to dissect complex quantitative traits. Cassava is monoecious and protogynous, and thus predominantly outcrossing, which leads to a high degree of heterozygosity that considerably complicates breeding (Hershey and Jennings 1992). QTL analysis of complex and quantitative traits in cassava is helpful to identify favorable alleles and to obtain recombinants with desirable traits through marker-assisted selective breeding.

The QTL analyzes so far has focused on disease resistance and root yield/quality. Diseases such as cassava bacterial blight (CBB), cassava mosaic disease (CMD), and cassava in cassava, anthracnose disease (CAD) are important biotic constraints that greatly decrease cassava yield worldwide, particularly in Africa and South America. Jorge et al. (2000, 2001) detected 14 and eight QTLs for tolerance to CBB by a single-marker regression method in the progeny of the cross TMS30572 × CM2177-2 in successive years. Wydra et al. (2004) and Lopez et al. (2007) identified additional QTL associated with resistance to CBB. Several QTLs associated with resistance to CMD and CAD have been identified (Akano et al. 2002; Akinbo et al. 2007; Lokko et al. 2005). These studies have provided marker resources that are useful for breeding disease resistance in cassava, and the identified QTsL/markers could be utilized and further validated by molecular marker-assisted selection (MAS). In addition, Cortés et al. (2002) published ten QTLs associated with resistance to post-harvest physiological deterioration (PPD), and these QTLs could explain 5–13 % of phenotypic variation in the same mapping population by Jorge et al. (2000, 2001) formerly.

Increased root yield and improvement of eating quality are also important objectives in cassava breeding. Okogbenin and Fregene (2002) and Okogbenin et al. (2008) successively identified a series of QTLs that control traits related to cassava root yield in the progeny of the cross TMS30572 × CM2177-2, and a portion of these QTLs were detected repeatedly. Furthermore, Balyejusa Kizito et al. (2007) identified two and six QTLs that control cyanogenic glucoside potential and root dry matter content (DMC), respectively.

In the present study, a cassava F1 population that comprised 208 progeny was raised from the cross SC6 × Mianbao. Amplified fragment length polymorphism (AFLP; Vos et al. 1995), sequence-related amplified polymorphism (SRAP; Li and Quiros 2001), simple sequence repeat (SSR), and expressed sequence tag (EST)–SSR markers were applied to generate a genetic linkage map, which was used to identify QTL for DMC, fresh root yield (FRY), and root starch content (SC) with the F1 population.

Materials and methods

Plant material and field experiments

A cassava F1 population that comprised 208 progeny was raised from a cross between two non-inbred lines, SC6 (the female parent) and Mianbao (the male parent), at the Tropical Crops Genetic Resources Institute, Chinese Academy of Tropical Agricultural Sciences, Danzhou City, Hainan Province, China. SC6 is a predominant cultivar that is grown in south China and Mianbao is a landrace grown on Hainan Province that shows tolerance to drought and barren stresses. The F1 population was designated the population SM. Stem cuttings from the 208 F1 progeny and the parents were planted in the field, with a randomized block design and one replication in 2003, 2004, 2005, and 2008 in Wenchang, Hainan Province, and each genotype had six individuals. The field management essentially followed normal agricultural practices. About 10 months after planting, three uniform individuals were harvested from each line for the collection of phenotypic data.

Trait evaluation

Fresh root yield was determined as the average weight of the roots of the three selected individuals. Dry matter root content was determined by drying 200 g slices of storage root (the mixed sample of three individuals) at 60 °C in a forced air dryer to a constant weight. The dried root slices were ground in a household grinder, filtered through a 0.5 mm mesh sieve, and placed in a desiccator until subsequent analysis. Root SC was measured by a polarimetric method (Mitchell 1990).

AFLP analysis

Total DNA (200 ng) of each line was digested with two restriction enzymes, EcoRI and MseI. Adapter ligation and two successive PCR amplifications for AFLP analysis were performed in accordance with the method described by Vos et al. (1995) with slight modification. The 5′ end of the MseI primer was labeled with the fluorescent FAM marker. The PCR products were mixed with loading buffer and denatured. Then, a 1.0 μl aliquot was electrophoresed on an ABI PRISM™ 377 DNA sequence analyzer for 2 h, and the amplified fragments were visualized by FAM fluorescence. The AFLP primers used in the study are listed in Supplementary Table 1.

Table 1 Summary of measurements of fresh root yield per individual (FRY), dry matter root content (DMC), and starch content (SC) for SC6, Mianbao, and the population SM in 2003, 2004, 2005, and 2008 in Wenchang, Hainan Province, China

SRAP analysis

Sequence-related amplified polymorphism analysis is a PCR-based marker system that uses two primers, a forward primer of 17 bases and a reverse primer of 18 bases. The forward primer consists of a core sequence of 14 bases. The first ten bases at the 5′ end comprise a ‘filler’ sequence of no specific constitution, which is followed by the sequence CCGG and then by three selective nucleotides at the 3′ end. The reverse primer of 18 bases is similar to the forward primer, but its ‘filler’ sequence is followed by AATT instead of CCGG and there are four selective nucleotides. The first five cycles of the PCR amplification comprised denaturation at 94 °C for 60 s, annealing at 35 °C for 60 s, and extension at 72 °C for 60 s, and then the annealing temperature was raised to 50 °C for an additional 35 cycles (Li and Quiros 2001). The PCR products were electrophoresed on a 2 % agarose gel for 30 min at 90 V, and visualized under ultraviolet light. The SRAP primers used in the study are listed in Supplementary Table 1.

SSR and EST–SSR analysis

Twenty-four SSR primers and 30 EST–SSR primers were first screened against the parents and ten gene pools. Each gene pool consisted of ten individual progeny’s DNA. The SSR markers that were polymorphic were used to evaluate the entire population. The PCR products were separated on 6 % denaturing polyacrylamide gels for 1.5 h using a BIO-RAD Sequi-Gen sequencing system (Bio-Rad, USA). The separated products were visualized by silver staining.

Construction of the genetic linkage map

JoinMap3.0 was used to construct a linkage map (Stam 1993) with the population SM. The threshold of fit was set to ≤5.0 with LOD scores >1.0 and a recombination frequency <0.4. The population SM was considered to be a CP-type (cross pollination between two heterozygosity lines) population because the genetic background of the two parents was heterozygous. Hence, each AFLP, SRAP, and SSR marker was assumed to be located at a single locus. The parent-specific markers that showed 1:1 segregation in the population SM were scored and recorded as lm × ll (marker in female parent) and nn × np (marker in male parent). Only the markers that showed 1:1 segregation were used to generate the genetic map; all other markers were excluded.

Mapping of QTL

Inclusive composite interval mapping (ICIM), as implemented in the IciMapping 3.0 software (Li et al. 2007), was used to detect QTL for FRY, DMC, and SC in four trials. Model 7 of the procedure was employed, and the likelihood of a QTL and its corresponding effect was estimated at every 1 cM. A LOD score of 2.5 was set as the threshold for declaring the presence of a QTL. The dominance (d) and additive (a) values for each QTL that were given by the program were used to calculate the ratio of d/a. The QTLs based on the average data of 4 years/single year data were list in Table 3/Supplementary Table 1.

Results

Phenotypic evaluation

Measurements for the FRY, DMC, and SC traits in the population SM and the two parents are listed in Table 1. The FRY of SC6 was higher than that of Mianbao in all 4 years (t-test, P = 0.029), whereas the DMC of SC6 was lower than that of Mianbao in all 4 years (t-test, P = 0.107). The SC of SC6 was lower than that of Mianbao in 3 years (2003, 2004, and 2005), but slightly higher in 2008.

Construction of a genetic linkage map

Out of a total of 367 polymorphic markers, 200 markers showed 1:1 segregation and were assigned to 20 linkage groups (LGs) with JoinMap. The genetic map consisted of 139 AFLP markers, 21 SRAP markers, 11 SSR markers, and 29 EST–SSR markers, and was designated the SM11 genetic map because the mapped markers all conformed to a 1:1 segregation ratio (Table 2; Fig. 1). The markers were not distributed evenly among different LGs; the largest LG contained 33 markers, and the smallest LG only contained three markers. The length of the LGs varied from 43.5 cM (LG19) to 125.1 cM (LG17). The SM11 genetic map spanned a total of 1645.1 cM (Kosambi units). The marker intervals ranged from 0.1 to 39.4 cM and the average marker interval was 8.2 cM.

Table 2 Linkage group size, number of markers, and average marker interval per linkage group in the cassava genetic linkage map
Fig. 1
figure 1

SM11 genetic map and the positions of repeatedly detected QTL with the phenotypic data of four trials in the population SM. E5M1, the AFLP primer combination E5 and M1; srM4-E10, the SRAP primer combination me4 and em10

QTL mapping

The results of QTL mapping for the three traits that were analyzed in the four trials are listed in Supplementary Table 1. For each trait, the analysis was carried out for the phenotypic data obtained in each individual trial as well as the average phenotypic data from all four trials. Although the interaction effects for QTL × environment were significant for some QTL, but most of the QTL that were detected repeatedly across environments were also detected when the average values of the four trials (see Table 3; Fig. 1).

Table 3 Identified QTL for three phenotypic traits detected in the population SM using the average phenotypic data of the four trials

For FRY, 60 QTLs were detected, for which the phenotypic variation explained (PVE) ranged from 1.60 to 48.38 % with an average value of 26.75 %. Among these 60 QTLs, 13 QTLs were detected repeatedly in two trials, eight QTLs were detected repeatedly in three trials, and four QTLs (qfry3b, qfry11a, qfry14a, and qfry20b) were detected repeatedly in all four trials. Eleven repeatedly detected QTLs (rd-QTLs) were linked to SSR markers and their average PVE was 17.42 %.

For DMC, 46 QTLs were detected, for which the PVE ranged from 1.09 to 26.37 % with an average value of 10.37 %. Among these 46 QTsL, six QTLs were detected in two trials, eight QTLs were detected in three trials, and one QTL (qdmc6a) was detected in all four trials. Seven rd-QTLs were linked to SSR markers. The average PVE for these rd-QTLs was 6.55 % with a maximum value of 23.92 %, and five of them were detected in three trials.

For SC, 38 QTLs were detected, for which the PVE ranged from 1.11 to 30.04 % with an average value of 12.21 %. Eight QTLs were detected in two trials, seven QTLs were detected in three trials, and two QTLs (qsc11 and qsc18b) were detected in all four trials. Seven rd-QTLs were linked to SSR markers, and had a maximum PVE of 4.06 %. One of the rd-QTL was detected in four trials, whereas the other rd-QTLs were detected in two or three trials.

For the three traits, in addition to the 25 SSR-linked rd-QTLs, 31 AFLP-linked rd-QTLs and one SRAP-linked rd-QTL were detected. Among the latter 32 rd-QTLs, the average PVE was 19.51 % for FRY, 8.09 % for DMC, and 10.57 % SC. Although these 32 QTLs had high PVE values, it is difficult to use AFLP and SRAP markers for MAS; hence, they would need to be converted first into sequence-characterized amplified region (SCAR) or other simple PCR markers.

In total, 57 rd-QTLs were identified for the three traits in the population SM. Of these rd-QTLs, 27 rd-QTLs were detected in two trials, 23 rd-QTLs were detected in three trials, and seven rd-QTLs were detected in all four trials. The rd-QTLs were not distributed evenly among LGs. Ten LGs contained most of the rd-QTLs, and 15 marker intervals contained QTLs that corresponded to two or three traits (Fig. 1; Table 4), which indicated that these genome regions have pleiotropic effects.

Table 4 QTL regions that associated with two or three traits

Discussion

Cassava ranks as the sixth most important food crop in the world, but basic research on this crop has lagged behind that on other crops and relatively few researchers actively study cassava, especially in the field of quantitative genetics research. The first segregated cassava population to be analyzed was the progeny of the cross TMS30572 × CM2177-2, from which two genetic maps were constructed; one map consisted of 394 markers and the other consisted of 100 SSR markers (Fregene et al. 1997; Okogbenin et al. 2006). An additional five segregated populations were developed in Colombia (Akano et al. 2002), Nigeria (Lokko et al. 2005; Akinbo et al. 2007), Thailand (Kunkeaw et al. 2010), and China (Chen et al. 2010). These populations were used to construct genetic maps and/or identify QTL for resistance to CMD and CAD, for example. However, most of these QTL were identified in a single-marker analysis and in only one case by the composite interval mapping method. This shortcoming may explain why many false positive QTL were identified and some genuine QTL were missed. Hence, in the present study, the ICIM method (Li et al. 2007) was applied to minimize the false detection rate and increase the detection power.

The 57 rd-QTLs that were recognized for the three traits in the population SM were identified by the ICIM method using the phenotypic data from the four trials. Among these 57 rd-QTLs, 25 rd-QTLs were linked to SSR markers, and 15 marker intervals on ten LGs showed pleiotropic effects. With reference to the genetic map of Balyejusa Kizito et al. (2007) identified a QTL for DMC that was located in the shared marker interval NS80–NS178 on LG18, which contained QTL for all three traits in the present study. Fregene et al. (1997) constructed a genetic map that consisted mainly of RFLP markers using the F1 progeny of the cross TMS30572 × CM2177-2, and mapped a series of QTLs on this map, but in practice it is difficult to use RFLP markers in MAS. Subsequently, Okogbenin et al. (2006) constructed a genetic map based on SSR markers using the F2 progeny (K150) of the cross TMS30572 × CM2177-2, and mapped several QTLs on this map (Okogbenin et al. 2008). None of the published maps contained any shared markers; hence it was impossible to identify common QTLs among different reports, which in turn was an obstacle to the application of published identified QTL in relation to MAS of cassava.

The SM11 genetic map incorporated only 11 SSR and 29 EST–SSR markers., and most of the AFLP or SRAP markers clustered into one LG alone (LG1–LG11, LG15, and LG16). On the basis of these results, the strategy for construction of a cassava genetic map should be modified. SSR and EST–SSR markers should form the major components of a cassava genetic map, and there were 472 SSRs and 431 EST–SSRs that could be used currently. AFLP or SRAP markers should be considered as supplementary components to fill in gaps. In addition, the use of highly inbred lines as parents is needed to develop segregated populations for the construction of genetic maps and QTL analysis in the future.