Introduction

The common bean (Phaseolus vulgaris L.) is a major staple food crop in Africa and Latin America, and is valued as one of the primary protein sources in the diet of low-income populations in developing countries. Moreover, it is a legume that is favorable to genetic studies because it has certain model plant characteristics, such as a small genome (637–675 Mb per haploid genome; Arumuganathan and Earle 1991; Guo et al. 2007), diploidy (2n = 2x = 22), and a highly self-pollinated breeding system.

The cultivated bean is morphologically diverse and has variations in many traits, such as growth habit, phenology, pod pigmentation, and seed pigmentation (Singh et al. 1991). Determinacy is an important agronomic trait in the common bean. Indeterminate growth habit was first reported to be controlled by a single dominant gene by Lamprecht (1935). It is considered a simple trait, and there is no consensus about its genomic location or the number of loci involved (McClean et al. 2002).

Genetic maps provide useful information for many studies, such as the localization of genomic regions that control phenotypic traits, synteny studies, and gene cloning. Moreover, they can be used to study the genetic architecture of quantitative traits. Morphological traits can be used as markers when qualitative variation and discrete phenotypic segregation classes are present.

Microsatellites or simple sequence repeats (SSRs) (Tautz and Renz 1984) are one of the most popular markers used to study polymorphisms between DNA sequences. Variations in these repetitive sequences within loci are detected using polymerase chain reaction (PCR). Microsatellites present a high level of polymorphism, co-dominance, multi-allelism, and a Mendelian pattern. They have been used in genetic analyses in mammals (Moore et al. 1998), insects (Harper et al. 2003), birds (Primmer et al. 1997), fishes (Rico et al. 1993) and plants (Condit and Hubbell 1991; Yang et al. 1994; Yu et al. 2000).

As a result of research efforts from many groups aiming to increase knowledge of the common bean, microsatellites have recently been described (Yu et al. 1999; Gaitán-Solís et al. 2002; Métais et al. 2002; Blair et al. 2003, 2008; Buso et al. 2006; Hanai et al. 2007; Benchimol et al. 2007; Campos et al. 2007; Cardoso et al. 2008). When microsatellites are derived from ESTs (expressed sequence tags), they can be associated with genes of known function and can be used as functional markers, tagging genes of interest in a more efficient manner (Hackuf and Wehling 2002; Oliveira et al. 2009).

Genetic mapping of the common bean began with the use of morphological markers and isozymes (Basset 1991). The maps developed by Vallejos et al. (1992), Gepts et al. (1993), Nodari et al. (1993) and Adam-Blondom et al. (1994) were first created using restriction fragment length polymorphism (RFLP)-based markers. A core map was established in the common bean that was based on a recombinant inbred population BAT 93 × Jalo EEP558 (Freyre et al. 1998). Random amplified polymorphic DNA (RAPD) (Adam-Blondom et al. 1994; Bai et al. 1997; Rodríguez-Suárez et al. 2006) and amplified fragment length polymorphism (AFLP) (Tar’an et al. 2002) were also used to construct linkage maps for the crop.

Microsatellites have been used in diversity analyses (Blair et al. 2006a, 2009; Zhang et al. 2008) and genetic maps (Yu et al. 2000; Blair et al. 2003, 2006b; Grisi et al. 2007), but some linkage groups that exist have low saturation and gaps. A central effort of the research on the common bean genome is to integrate physical and genetic maps. Schlueter et al. (2008) recently published a draft physical map for the common bean genome. Our primary objective was to characterize new microsatellites and develop a linkage map based on genic and genomic microsatellite markers. Another major objective was to map phenotypic markers and to study the genetic control of the growth habit trait.

Materials and methods

Mapping population

A recombinant inbred line (RIL) population was obtained from a cross between the CAL143 and IAC-UNA inbred lines. CAL143 is an Andean line with cream coat color, red stringed seeds, and calima type. IAC-UNA is a black seeded variety developed by the Agronomic Institute of Campinas (IAC) and is from the Mesoamerican gene pool. The two parents are divergent for many phenotypic traits and have contrasting responses to many common bean pathogens. This mapping population was advanced by inbreeding to the F10 generation and 380 lines were obtained.

Microsatellite amplification and analysis

Total genomic DNA for all recombinant inbred lines was isolated from bulked young leaves of ten plants per line, using the CTAB extraction method as described in Hoisington et al. (1994).

A total of 871 microsatellites were tested for polymorphisms among the CAL143 and IAC-UNA lines. Of these, 567 have been published previously. There were 44 markers from the PV series (Yu et al. 2000; Caixeta et al. 2005); 46 markers from the BM, AG, and GAT series (Gaitán-Solís et al. 2002); 2 markers from the PVbng series (Murray et al. 2002); 49 markers from the Bmd series (Blair et al. 2003); 15 markers from the M series (Yaish and Pérez de la Vega 2003); 18 markers from Guerra-Sanz (2004); 71 markers from the PVBR series (Buso et al. 2006; Grisi et al. 2007); 80 markers from the PvM and FJ series (Hanai et al. 2007); 222 markers from SSR-IAC series (Benchimol et al. 2007; Cardoso et al. 2008); and 20 markers from the FJ series (Campos et al. 2007). The gene-based microsatellites used were from the PvM series that was developed from ESTs, Bmd, and PV from Yu et al. (2000) and the markers from Guerra-Sanz (2004) that were developed from GenBank gene searches. The remaining 304 tested markers were developed in this work according to Campos et al. (2007). The polymorphic markers developed are further described in Table 1.

Table 1 Description of the polymorphic microsatellites for CAL143 and IAC-UNA that were developed, including primer sequences, repeat motifs, allele sizes and annealing temperatures (T a)

Microsatellite fragments were amplified using PCR, with each reaction containing 40 ng of template DNA, 0.8 μM each of forward and reverse primer, 150 μM of each dNTP, 1.5 mM MgCl2, 10 mM Tris–HCl, 50 mM KCl, and 0.5 U Taq DNA polymerase. Reactions were performed using the following cycling conditions: 1 min at 94°C; followed by 30 cycles of 94°C for 1 min, the specific annealing temperature for 1 min and 72°C for 1 min; and a final step of 72°C for 5 min. Some microsatellites were mapped using cycles of touchdown (TD) PCR (Table 1), as described by Hanai et al. (2007). PCR products were separated using electrophoresis in denaturing polyacrylamide gels (6% w/v) and visualized using silver staining (Creste et al. 2001). The molecular size of the fragments was estimated using a 10-bp ladder (Invitrogen). Fluorescent primers were used for 39 loci as described by Schuelke (2000), and these loci were genotyped using an ABI 3730 automatic sequencer.

Morphological markers

Three morphological markers were evaluated in the mapping population: flower color, growth habit, and pod tip shape. Flower color is controlled by the V locus (Lamprecht 1939) and segregates as purple (IAC-UNA) and pink (CAL143) in the mapping population. Growth habit segregates as determinate type I, present in the Andean parent, and indeterminate type III, present in the Mesoamerican parent. Determinacy has been described as a qualitative trait controlled by the fin locus (Lamprecht 1935; Rudorf 1958). The pod tip shape can be curved (Ct; Al-Mukhtar and Coyne 1981), present in IAC-UNA, or straight (ct), present in CAL143.

Segregation and linkage analyses

The chi-square (χ2) test for 1:1 segregation ratios was performed for all polymorphic markers. The expected segregation ratios were tested based on P-values after performing Bonferroni corrections (Lynch and Walsh 1998). The linkage map was constructed using the software MAPMAKER 3.0b (Lander et al. 1987). The threshold for considering markers to be linked was a LOD score of 3.0 and a maximum genetic distance of 37.5 cM using the Kosambi (1944) map function.

To order the markers within groups, markers with known positions in other published maps were anchored in the corresponding group. Markers whose position could not be anchored with high precision were verified based on their LOD score.

Ordering analyses of markers within groups were performed following the guidelines suggested by Mollinari et al. (2009). First, a subset of nine informative markers was chosen based on missing data and segregation distortion. For each group, all possible orders of the nine markers were compared based on their log-likelihoods. Additional markers from each group were added using the ‘TRY’ command, starting with more informative markers. The final order was verified using the ‘RIPPLE’ command, with a window with six markers. Finally, multipoint estimates of distance were obtained using the ‘MAP’ command.

Results

Microsatellite polymorphisms tests

The conditions for amplification of the 871 microsatellites were optimized and the polymorphisms were tested for the parents of the IAC-UNA × CAL143 population. A total of 248 (28.5%) microsatellites were polymorphic and presented adequate visualization patterns with no stutter or non-specific bands.

There were eight (38.1%) polymorphic markers from the PV series described by Caixeta et al. (2005), 19 (41.3%) from the BM, AG, and GAT series, one (50.0%) from the PVbng series, 38 (53.5%) from the PVBR series, 11 (26.3%) from the FJ series described by Hanai et al. (2007), 69 (31.1%) from the SSR-IAC series, and five (25.0%) from the FJ series described by Campos et al. (2007). All the 15 M series markers were monomorphic. Among the gene-based microsatellites, there were five (21.8%) described by Yu et al. (2000), 15 (32.7%) from the Bmd series, three (16.7%) described by Guerra-Sanz (2004), and 10 (25.0%) from the PvM series that were polymorphic. In total, there were 33 (25.4%) polymorphic markers from 130 gene-based markers tested and 232 (31.3%) polymorphic markers from 741 genomic-based markers. The 64 new IAC series microsatellites, described in Table 1, were also polymorphic.

Genetic map

A total of 198 microsatellites and three phenotypic markers were mapped, with distributions in all 11 linkage groups (Fig. 1). The total map length was 1865.9 cM with an average linkage group length of 170.5 cM. The average distance between markers was 9.4 cM, with an average of 18 markers in each group. The number of markers in each group varied from five (B11 group) to 37 (B2 group).

Fig. 1
figure 1

Genetic map for IAC-UNA × CAL143 (UC). B1 to B11 refer to linkage groups, and I to XI are the corresponding chromosomes (Pedrosa et al. 2008). The underlined microsatellites were anchored to form the linkage groups. (*) Markers with segregation distortion to CAL143 and to (**) IAC-UNA. (***) Phenotypic marker V linked to group B6. Arrows indicate possible position of the fin and Ct loci, and dotted lines indicate distances larger than 37.5 cM

The B2, B3, B4, B5, B7, B8, and B9 linkage groups had gaps that require more saturation. The B2, B4, B8, and B9 groups have markers with genetic distances greater than 37.5 cM; in these cases, sub-groups were discriminated by segmented lines (Fig. 1). The B8 group was split into A and B sub-groups, as it was not possible to determine the correct sense order between them. The 8A subgroup was formed by ten markers with previous mapping position unknown in the literature, not allowing further comparison. While there was statistical evidence for the PVBR251 marker being linked to the B1 group, its distance from the adjacent marker (IAC235) was greater than 37.5 cM. The same phenomenon occurred in the B9 group with marker IAC242.

The Bonferroni correction was used, but even markers with segregation distortion were included in the map. Distortion was observed for 92 (37.3%) microsatellite loci, and 76 of these were positioned on the map (Fig. 1). As expected, these markers clustered in certain chromosomal regions.

Phenotypic markers

All three phenotypic markers were mapped. The V locus was detected in the B6 group and was closely linked to the microsatellites IAC268 (0.2 cM) and IAC183 (0.6 cM) at the edge of the B6 group. The fin and Ct loci were mapped, but their position in the linkage groups could not be estimated with sufficient accuracy; therefore they are indicated by arrows in Fig. 1. The pod tip shape trait, determined by the Ct locus, was linked to the B9 group and was close to the molecular markers FJ17 and PVBR60.

Phenotypic segregation (Table 2) was tested. The expected Mendelian segregation ratio (1:1) was observed for flower color and pod tip shape. However, the growth habit trait showed significant deviation from this value (P < 10−5), which could be the result of two loci controlling this phenotype. A ratio of 259:106, which is a good fit to the ratio 3:1 (χ2 = 3.62, P = 0.06), was observed for growth habit, confirming this hypothesis. In addition, the F1 phenotype was 100% determinate growth habit. We propose a novel genetic model wherein two recessive loci control the phenotype of the indeterminate growth habit. For example, consider the cross finfinfin′fin′ (indeterminate growth) × FinFinFin′Fin′ (determinate growth). Four homozygous genotypes with equal proportions are expected in a RIL population: FinFinFin′Fin′, FinFinfin′fin′, finfinFin′Fin′ and finfinfin′fin′. If only the homozygous genotype with two recessive alleles (finfinfin′fin′) expressed indeterminate growth, a segregation of 3:1 would be expected. The growth habit trait (fin) was mapped in the B1 (LOD = 3.76) and B9 (LOD = 3.95) groups, supporting the hypothesis that two loci could be involved in the genetic control of this trait.

Table 2 Segregation of three qualitative traits from an F10 population of the common bean cross IAC-UNA × CAL143

Discussion

Mapping population and polymorphism tests

The new mapping population IAC-UNA × CAL143 (UC) established in this study represents the largest population currently developed with regard to the number of recombinant inbred lines (n = 380) and generations of self-breeding. The UC population is able to detect recombinant events because of the high number of recombinant lines. For this reason, a well-saturated map with precise estimates of distance and marker order is expected.

The parents were chosen from different gene pools (Mesoamerican and Andean) and have contrasting features for several agronomical traits. Segregation for disease resistance, such as resistance against anthracnose (caused by Colletotrichum lindemuthianum), angular leaf spot (caused by Pseudocercospora griseola) and rust (caused by Uromyces phaseoli), was observed. These parents define an interesting mapping population, as several traits can be mapped.

A moderate polymorphism rate (28.5%) was found between microsatellite markers. Low diversity is expected for a cross between two cultivated genotypes. The observed polymorphism rate is similar to results reported in other studies, such as 45.7% (Blair et al. 2006b) and 42.8% (Grisi et al. 2007). If the parents were from the same gene pool, they would most likely have a narrower genetic base and less genetic variability.

One probable reason for the moderate polymorphism rate found is the extensive number of microsatellites developed with small motif repeats, considered from five units in enriched library screening (Benchimol et al. 2007; Campos et al. 2007; Hanai et al. 2007). For example, the PVBR series contains larger repeated motifs and was the most polymorphic microsatellite series (53.5%) in polymorphism tests. In this work, we described 64 new microsatellite markers that presented parental UC polymorphisms, which will be useful for other genetic studies in the common bean.

Genetic map

The final UC genetic map covered all 11 linkage groups of the common bean. The total length (1865.9 cM) was consistent with other genetic mapping studies (Blair et al. 2003; Grisi et al. 2007). Some linkage groups presented a highly dense coverage distribution of markers (e.g., B5 and B7), while others contained gaps (e.g., B2 and B8) or presented few markers (e.g., B11, with only five markers). The microsatellite-based genetic map presented by Grisi et al. (2007) also found a small number of markers in the B11 group (only four markers were detected). The Co-2 cluster, containing one of the R genes involved in anthracnose disease resistance in the common bean, was found by molecular analysis in the subtelomeric region of the B11 group (Creusot et al. 1999). R genes contain leucine-rich regions called coiled-coil nucleotide-binding site leucine-rich repeat (CNL), and can contain heterochromatic knobs. There is evidence that the Co-2 cluster, through an ectopic recombination, gives rise to the Pv B4-CNL sequences in the B4 group (David et al. 2009). The difficulty in mapping the B4 linkage group could be due to the presence of two knobs detected in the CNL subtelomeric region (Geffroy et al. 2009).

A total of 201 markers have been positioned in the UC map to date, including 131 microsatellite markers that were not mapped previously. The use of anchor markers was efficient to form the linkage groups and to order the markers within them. Among the gene-based microsatellites, in the B4 linkage group the loci BMd9, Bmd15 and PVatgc002 were mapped very close together and were derived from different phytohemagglutinin gene sequences, suggesting a single location of a gene family cluster in this region. It confirms the results of Blair et al. (2003), who found other markers (Bmd16 and PVatgc004) related to phytohemagglutinin gene sequences also in the B4 group.

In the UC map, a concentration of markers with distortion to Andean alleles was detected in the B1 group, and a concentration of markers with distortion to Mesoamerican alleles in the B2 and B5 groups. In the B2 linkage group only a small region presented markers with segregation distortion. For the B5 group, the region with distortion is larger, and in the B1 group, Andean alleles predominated (Fig. 1). Blair et al. (2003) found the same segregation distortion to Mesoamerican alleles in regions of the B5 group. Freyre et al. (1998) also found the same distortion for the B1 and B2 groups for the same parental gene pools.

The phenomenon of markers with segregation distortion in the common bean has been extensively described (Freyre et al. 1998; Blair et al. 2003; Grisi et al. 2007; Hougaard et al. 2008). This phenomenon was observed not only for microsatellite markers, but also for legume anchor markers (Hougaard et al. 2008) and RFLP markers (Freyre et al. 1998). The reasons for this segregation distortion could be associated with genomic regions related to adaptive advantages that were selected during each self-breeding cycle of the population mapping development. These distortion genomic regions did not present mixture between the Andean and Mesoamerican pools, supporting an incompatible genomic region hypothesis. The distortion of Mendelian ratios associated with these markers clustering inside the linkage groups represents a trend towards retaining the ancestral pool origin.

The segregation distortion observed for some markers did not exclude them from being potentially informative for recombinant detection. For this reason, they were incorporated into the linkage analysis. The selection effect cannot be ignored, however, even in a controlled cross in a population mapping experiment. According to Shizhong (2008), the use of markers with segregation distortion could actually increase the mapping saturation and statistical power of quantitative trait locus mapping.

The phenotypic mapping of the V locus in the B6 group is consistent with other genetic maps, as it was also obtained by Nodari et al. (1993) and McClean et al. (2002). The Ct locus was previously identified by Al-Mukhtar and Coyne (1981), but it was never mapped. Flower color is related to seed color, an important trait selected by the consumer market. The markers that are tightly linked to the V and Ct loci can be used for future cloning experiments. The use of markers for simply inherited traits in marker-assisted backcrossing and introgression across Andean and Mesoamerican gene pools is also suggested.

The fin and fin′ loci were mapped in two distinct groups (B1 and B9), with almost the same LOD score for each. It is necessary to obtain more markers for these groups to saturate the regions in order to estimate the positions of these loci. Koinange et al. (1996) and Blair et al. (2006b) also mapped the fin locus to the B1 group. Kwak et al. (2008) found a candidate gene for the fin locus, PvTFL1y, and they also mapped it to the B1 linkage group. Ta’an et al. (2001) and Tar’an et al. (2002) mapped the growth habit gene to the B9 linkage group.

The existence of only one locus to define growth habit in the common bean has been previously questioned, and it is probable that multiple genes controlling growth habit exist in the common bean (McClean et al. 2002). We propose a genetic model based on two loci (fin and fin′), according to Mendelian segregation ratios and linkage mapping results. This genetic model approach related the phenotypic and genotypic analyses of segregating progeny. The single cross UC confirmed that the two growth habit loci probably resulted from the huge effective size of the mapping population, with the segregation and linkage data analyzed together.

The source of determinacy of CAL143 may be an example of a dominant determinate genotype derived from the Andean region that has been domesticated without maize as a physical support for the climbing bean in traditional agriculture, as proposed by Koinange et al. (1996). The F1 cross UC also revealed the dominant inheritance of the determinate growth habit trait, which was usually inherited recessively. Determinant genotypes must represent different sources of genetic control of determinacy, and the frequent recessive state cannot be an exclusive state for the common bean growth habit. This is in agreement with Kornegay et al. (1992), whose results also suggested a genotype (G13624) with two dominant determinate growth habit genes.

The use of qualitative traits such as morphological markers allows the study of genetic models and represents basic genetic knowledge about any species. We propose a new genetic model regarding growth habit, a characteristic of simple inheritance that is important in an agronomic context. Further studies, such as fine-structure mapping in these regions, other crosses and segregating progeny could be performed to study both loci. It is evident that advances can be achieved in genetic breeding of the common bean with the development of genetic mapping. The presented map gives an overview of the genomic organization and provides anchor points for future maps.