Introduction

Common bean (Phaseolus vulgaris L.) is the most important food legume for direct human consumption (Broughton et al. 2003). Nutritionists characterize this crop as the “poor man’s meat”, a nearly perfect food because of its high protein content that complements other foods like maize and rice. In Latin America and Africa, common bean is produced mostly by resource-poor farmers on small-scale, marginal farms. An environmentally friendly and low-cost strategy for improving productivity in common bean cultivation is the exploitation of symbiotic nitrogen fixation (SNF) which involves a symbiotic interaction between leguminous plants and Rhizobium bacteria. However, present-day commercial bean varieties are poor nitrogen fixers (Martínez-Romero 2003). A number of studies comparing several bean genotypes for SNF capacity pointed out that, in general, climbing beans tend to have higher SNF potential as well as higher yield potential compared to bush beans (Graham and Halliday 1976; Graham and Rosas 1977; Kumarasinghe et al. 1992). Climbing common bean genotypes might therefore be promising genetic sources for improving SNF capacity in common bean cultivation.

Only a few genetic studies to detect natural variation and quantitative trait loci (QTL) for SNF and related traits in legumes have been performed so far. Most of these studies focused on QTL analyses of nodulation traits and shoot dry weight under nitrogen-fixing conditions. For soybean, Nicolas et al. (2006) used an F2 population from a cross between two Brazilian soybean cultivars and revealed two genomic regions associated with nodule number (NN) and nodule dry weight (NDW) and six epistatic interactions among non-linked QTL for shoot dry weight (SDW), NN and NDW explaining up to 15 % of the trait variation. In an F2-derived recombinant inbred line (RIL) population, Tanya et al. (2005) found various QTL for NN, NDW, SDW as well as for acetylene reduction activity, which is a direct measure of the amount of nitrogen fixed per plant. Recently, a study in pea identified nine QTL for NN, four for NDW and eight for nodule area in a RIL population derived from parents which are contrasting for root and nodule traits (Bourion et al. 2010).

In common bean, Nodari et al. (1993) were the first to report on the identification of four QTL for NN in the RIL population of a cross between BAT-93 and Jalo EEP558, with the QTL together accounting for 50 % of the phenotypic variation. This same RIL population was used to detect QTL for NN under low and high mineral nitrogen (N) fertilization (Tsai et al. 1998; Souza et al. 2000). Under low N conditions, Tsai et al. (1998) identified the same four QTL and two additional QTL. Under high N conditions, three additional regions were linked to NN. Souza et al. (2000) reported the identification of seven QTL under low N conditions and only five QTL under high N conditions, accounting for 34 and 28 % of the total phenotypic variation in NN respectively. However, it is difficult to know if the same QTL for NN were identified in the latter two studies as the genetic maps used for QTL analysis appeared to be different, with varying numbers of linkage groups. In conclusion, genetic studies on SNF in common bean and even in legumes are sparse, despite the need for genetic information on SNF in breeding. Moreover, the reported studies do not make an effort to associate detected QTL with actual SNF or yield.

The objective of this study was to identify QTL controlling SNF capacity and related traits in a bush (G19839) × climbing (G2333) bean cross. This population has previously been used in other genetic studies. A genetic map of this population has been published (Ochoa et al. 2006). Prior to QTL analysis, a set of 108 markers based on genes known to be involved in the nodulation process in model legumes (Galeano et al. 2012) were evaluated in the population. Phenotypic characterization of the parents and RIL population for SNF and related growth traits was performed under greenhouse conditions and in field conditions in tropical soils. Using these phenotypic data and the improved genetic map of the population, QTL analyses were performed for the SNF- and growth-related traits, and candidate genes were tentatively identified among the nodulation markers.

Materials and methods

Plant material and bacterial strains

A total of 83 F5:8 RIL were developed from the cross G2333 × G19839 by single-seed descent as described by Checa et al. (2006), where G2333 (climber) is a landrace from Mexico, known as ‘Colorado de Teopisca’, which belongs to the Mesoamerican gene pool and G19839 (bush) is a Peruvian germplasm accession from the Andean gene pool (Singh 1991).

As Rhizobium inoculant, R. tropici CIAT899 was used. The inoculum preparation was carried out following the methodology reported by Remans et al. (2007). Briefly, the strain was grown overnight in liquid tryptone-yeast extract (TY) medium at 30 °C. These liquid cultures were washed twice with 10 mM MgSO4 and resuspended in 10 mM MgSO4 at a density of 107 colony forming units (cfu) per ml. In the greenhouse experiment, this bacterial inoculum was directly applied at planting. For the field experiment, 30 ml of this inoculum was injected into sterilized humus in air-closed plastic bags (50 g humus per bag). The humus was thoroughly shaken and left at room temperature for 7 days (shaken once a day).

Molecular marker and linkage analysis

DNA was extracted from the parental lines and all RIL using proteinase K extraction as described by Mahuku (2004). DNA concentrations were measured in a Hoefer fluorometer (Dynaquant 200, Hoefer Scientific Instruments, San Francisco, CA, USA). The DNA was diluted to 5 ng/μl for further use in PCR reactions. A total of 108 gene-based markers based on nodulation genes were evaluated by single strand conformation polymorphism (SSCP) and 67 allele-specific primers were evaluated by Sequenom as reported by Galeano et al. (2009, 2012).

The polymorphic markers plus 29 new simple sequence repeat (SSR) markers (Garzon, unpublished data) were placed into the earlier genetic map for G2333 × G19839 by Ochoa et al. (2006). This was done using Mapdisto software version 1.7.5 beta (Lorieux 2012).

Experimental design, growth conditions and trait measurements in the greenhouse

The greenhouses were located at the KU Leuven (Heverlee, Belgium). Due to space limitations, a subset comprising 73 RILs with good germination capacity and both parental lines were used in the greenhouse experiments. Two experiments were carried out: the first one in the period October–November (fall) 2009 and the second one in the period June–July (summer) 2010, referred to as Oct09 and June10, respectively. Seeds of every RIL and of the parental lines were surface-sterilized according to Vlassak et al. (1998) and pre-germinated for 3 days on agar plates (10 % agar in distilled water) in the dark at 30 °C. Three seedlings were planted per one-liter pot. The seedlings were inoculated at time of planting with 100 μl of inoculum (106 Rhizobium cells) per seedling.

One week after planting, two out of the three seedlings were randomly removed by cutting the stalk, so that only one seedling per pot remained. For each bean genotype, eight plants were used in a randomized complete block design (eight blocks) resulting in a total of 600 plants per experiment. Plants were growing in sand:vermiculite mixture (proportion of 2:1 by volume) in the absence of mineral N. One-liter pots were filled with the sand:vermiculite mixture and moistened with 250 ml N-free Snoeck medium (Snoeck 2001). During growth, a drip irrigation system was used. In terms of phenotyping, leaf chlorophyll values were measured weekly with a soil and plant analysis development (SPAD) chlorophyll meter model 502 (Soil and Plant Analysis Development) (Monje and Bugbee 1992). Sampling of plants was performed at 42 days after planting (DAP) at the onset of flowering (R6 stage). Harvested plants were oven-dried at 60 °C for at least 2 days. Shoot and root dry weights were recorded. Shoot and root samples were ground and N concentration (%) was measured by ignition with a Variomax CN analyser (Dumas method).

Field trials and traits measured

The field trial was conducted at the CIAT research farm in Darien, Colombia (03°56′N; 76°28′W, elevation 1,537 masl) during the rainy season from September 2010 until February 2011. The average yearly temperature for this site is 20.8 °C and annual rainfall totals 1288 mm. The soil at the field site is an Udand with pH 5.4 (soil:water 1:2), 13.3 % organic matter (dicromate–sulfuric acid oxidation), 7,274.9 ppm total N, 90.6 ppm plant-available N (NH4 + and NO3−) and 2.3 ppm native soil phosphorus as determined by Bray II extraction (Olsen and Sommers 1982).

A total of 88 bean genotypes were used including the two parental lines G2333 and G19839, 83 F5:8 RIL and three non-nodulating lines, BAT477-NN, DOR364-NN and Gloriabamba-NN. Non-nodulating isolines were used as non-fixing controls to measure the amount of SNF by the δ15N natural abundance method according to Shearer and Kohl (1986).

Before planting, 90 kg/ha phosphorus (as Triple Super Phosphate) and 60 kg/ha potassium (as KCl) were applied. The inoculation was carried out following the methodology reported by Remans et al. (2008). Briefly, seeds of the 88 bean lines were mixed with inoculum (30 g humus/kg seed) and distilled water (1 ml per 30 g seed). This mix was left to dry in the shade for 15 min and seeds were then manually sown at a density of 20 cm between seeds. Sowing at this density corresponds to approximately 50,000 plants/ha. Later, at 12, 20 and 28 DAP, micro-elements boron and zinc were applied by foliar spray over the developing plants (300 g/ha).

The field experiment was organized as a randomized complete block design with three replications. Each experimental unit was a double 3-m-row plot sown with 15 seeds per row (30 plants/plot and thus 90 plants/genotype). One meter spacing between rows was used to accommodate both climbing and semi-climbing RILs.

During the experiment, a total of four samplings was taken for phenotypic trait analysis. The first two samplings were taken at 21 and 42 DAP, corresponding to the primary trifoliate V3-growth stage and early flowering R6-growth stage of the bean plants, respectively (Singh 1982). For these two samplings, three plants per plot (nine plants per bean genotype) were separately harvested by cutting the plant at the first node (starting from the soil surface) and afterwards the root was dug up, taking care that the whole root was collected.

Subsequently, nodules were separated from the root. Shoot, root and nodules of every plant were oven-dried separately at 60 °C for at least 2 days. Shoot, root and nodule dry weights were recorded. For the first sampling, nodule number was also recorded. For the second sampling, both shoot and root samples were ground and δ15N and total N concentration (%) were measured at the Stable Isotope Facility (UC Davis, California) using a PDZ Europa ANCA-GSL elemental analyzer interfaced to a PDZ Europa 20–20 isotope ratio mass spectrometer (Sercon Ltd., Cheshire, UK; Sharp 2005). For the third sampling at late pod-filling stage (R7/R8-growth stage; Singh 1982), shoots of three plants per plot were harvested together. The first three samples were taken from one row of the experimental unit while the second row was left intact for harvest. Shoot dry weight, δ15N and total N was measured for these pooled samples as explained earlier.

Leaf chlorophyll values of ten plants per plot were measured with a SPAD chlorophyll meter model 502 for the first three samplings. At harvest, a sample of three plants from the second row was taken, and afterwards the entire row was harvested. Plant counts per row were recorded for yield calculation. For the three-plant sample taken at harvest, the following traits were recorded: seed and pod number; seed, pod and plant dry weight; δ15N and total N of seed, pod and plant, as explained earlier.

The amount of nitrogen fixed (%Ndfa) was calculated using the following formula (Shearer and Kohl 1986):

$$ \% {\text{Ndfa}} = \frac{{\delta ^{{15}} {\text{N}}\,{\text{of}}\,{\text{non}} {\text{-}} {\text{fixing}}\,{\text{reference}}\,{\text{plant}} - \delta ^{{15}} {\text{N}}\,{\text{of}}\,{\text{N}}_{{\text{2}}} {\text{fixing}}\,{\text{legume}}}}{{\delta ^{{15}} {\text{N}}\,{\text{of}}\,{\text{non}}{\text{-}} {\text{fixing}}\,{\text{reference}}\,{\text{plant}} - {\text{B}}}} \times 100 $$

where B is the δ15N value from the nitrogen-fixing bean plant grown in N-free medium. This B-value was obtained by taking the average of δ15N measurements of a total of 20 plants grown in the greenhouse experiment.

Data analysis

First, normal distribution of the data was verified per trait using the Shapiro–Wilk distribution test. Data analysis was done with the ANOVA Mixed Model procedure using the statistical analysis system (SAS) 9.2 combined with the statistical interface SAS Entreprise Guide 4 (SAS Institute, Cary, NC, USA).

In the analysis of variance (ANOVA), the main factor of interest was ‘genotype’ [73 or 83 RIL (greenhouse and field respectively) and two parental lines]. Significant differences were determined by the Tukey–Kramer post hoc test at significance level P = 0.05. The Kenward and Roger calculation was selected as the degrees of freedom method. Correlations for all experiments were calculated using Pearson correlation coefficients. For the greenhouse experiment, %N Root, %N Shoot, %N Plant, Total N Shoot, Total N Root and Total N Plant were all correlated. Therefore, only Total N Plant was considered.

QTL analysis was conducted with the resulting genetic map of the population and the phenotypic means for each RIL using the computer software program QTL Cartographer version 2.5 for Windows (Wang et al. 2011). QTL were identified with composite interval mapping (CIM) features. Traits for CIM analysis included a forward/backward regression with a window size of 10 cM, a walk speed of 1 cM and probability thresholds of 0.05 each for the partial F-test for both marker inclusion and exclusion. The empirical thresholds for QTL detection with the CIM method were estimated using 1,000 permutation tests as suggested by Churchill and Doerge (1994). Additive genetic effect estimates for each trait were computed by the QTL Cartographer program at the peak of the LOD profile. Identified QTL were added on the genetic map using Mapchart version 2.1 (Voorrips 2002).

Sequence comparison for identification of putative genes underlying QTL

For some QTL identified in greenhouse and/or field experiments, sequences of markers that were found to be closely linked to the QTL were available. These sequences were compared to (1) the most recent expressed sequence tag (EST) assembly of common bean (TC-gene index available at http://compbio.dfci.harvard.edu/cgi-bin/tgi/tc_ann.pl?gudb=p_vulgaris) and (2) to a collection of gene sequences in soybean (available at http://www.phytozome.org). Additionally, the most similar common bean EST, or alternatively the gene sequence of soybean, was BLASTed (http://blast.ncbi.nlm.nih.gov). These three types of comparisons made it possible to annotate the marker sequences linked to identified QTL and, in this way, to suggest putative genes underlying the identified QTL.

Results

Genetic mapping of nodulation gene-based markers

Based on SSCP and Sequenom techniques, a total of 42 markers could be mapped, comprising 16 markers (15 %) by SSCP and 26 markers (39 %) by Sequenom (Fig. 1). The final linkage map consisted of a total of 207 markers comprising 57 rapid amplification of polymorphic DNA, 106 SSRs, 1 sequence characterized amplified region, 1 isoenzyme and 42 single nucleotide polymorphisms (SNPs). In addition, the linkage map had a total map length of 1,601 cM and the average length of the linkage groups was 145 cM (Supplementary Table 1).

Fig. 1
figure 1

Genetic map of the G2333 × G19839 common bean RIL population showing location of quantitative trait loci (QTL) for traits evaluated in both greenhouse experiments and in the field experiment. The QTL identified in the first greenhouse experiment (Oct09) are indicated in black, those from the second greenhouse experiment (June10) in red and the QTL of the field experiment in green. Nodulation gene-based markers that were added to the existing map are shown in bold and underlined (Sequenom MassARRAY iPLEX platform), and the SSCP markers are differentiated with italics

SNF capacity and related growth traits in the G2333 × G19839 population in the greenhouse

For the results of the first experiment (Oct09), comparison of parental means for SPAD values measured weekly during a time course of 6 weeks showed superior SPAD values of the bush bean G19839 in the first 3 weeks as compared to G2333 (Table 1). However, during the last 3 weeks of growth, which is the main period of symbiotic nitrogen fixation, the climber G2333 showed superior SPAD values. Other growth traits including shoot and root dry weight, and N content of shoot, root and plant, showed that the climbing bean parent G2333 had somewhat a superior growth rate and N fixation compared to the bush bean parent G19839.

Table 1 Parental and RIL population (G2333 × G19839) means of the greenhouse experiments (Oct09, June10)

The parameter nitrogen use efficiency (NUE), which is a measure of the amount of tissue growth per unit of N fixed by the plant, was similar for both parental bean lines. The second experiment also showed superior SPAD values of the bush bean parent G19839 in the second week (SPAD values in the first week were not determined for this experiment) as compared to G2333, while in the third week parental SPAD values were similar (Table 1). During the last 3 weeks, SPAD values of both parents increased, with the bush bean parent G19839 increasing more rapidly than the climbing bean parent G2333. In the last week, both parents showed a sudden decrease in SPAD values with a higher decrease for G2333 than G19839. Only SPAD values at 2 and 6 weeks were significantly different.

Phenotypic growth traits in this second experiment, including shoot and root dry weight, showed that G19839 was also superior for these traits and significantly different from G2333 (Table 1). Similarly, N% in shoot and total plant and total N contents of shoot, root and plant were also significantly higher for G19839. An exception was the N% in the root which was higher, though not significantly so, for G2333. NUE, which was similar for both parental varieties in the previous experiment, was significantly higher for G19839 in this experiment.

The analysis of variance showed significant differences between RIL genotypes for almost all traits measured for both experiments (Table 1). The average trend of the SPAD values was similar to the trend observed for the parents, with an initial decrease in SPAD values followed by a gradual increase in SPAD values.

Population histograms for all traits in the G2333 × G19839 RIL population are shown in Supplementary Figure 1. All the traits and all the histograms showed normal distributions. There was no evidence of extreme kurtosis or skewing in any of the histograms which suggested that all traits measured were inherited in a quantitative manner. In almost all cases, parental means tended to be less distinct than the lowest and highest RIL, suggesting that transgressive segregation was important in the inheritance of the traits with both parents contributing positive as well as negative alleles for each trait.

Correlations among all measured traits were analyzed. Only significant correlations were listed in Supplementary Table 2. For both experiments, SPAD values from 3 weeks onwards were significantly correlated with biomass accumulation (shoot, root and total dry weight) measured at 6 weeks. This showed that from 3 weeks onwards, SPAD values were able to predict the biomass of the plant at 6 weeks. For both experiments, shoot and root dry weight strongly correlated, suggesting that plants invest biomass in both tissues simultaneously without favoring root over shoot tissue or vice versa.

In addition, the total nitrogen content of the shoot was correlated with SPAD values from 4 weeks onwards in both experiments. In the second experiment the N% in the shoot was significantly correlated with the SPAD values at 6 weeks. These last two correlations indicated the utility of the SPAD equipment as a cheap and time-saving alternative for N analysis.

SNF capacity and related growth traits in the G2333 × G19839 population in the field

SPAD values were measured at each sampling (corresponding to plant growth stages V3, R6 and R7/8). Comparisons of parental mean for SPAD values showed superior SPAD values of the climber G2333 at all samplings as compared to G19839 (Table 2), suggesting greater N uptake (through SNF and soil absorption) by the climber G2333. Average shoot dry weight of G2333 was initially (V3 stage) somewhat lower than G19839 but was higher at the R6 and R7/8 sampling stages, although the difference was only significant at the second date. For the root dry weight, the bush bean parent G19839 had higher root dry weights with a significant difference only at the V3 stage.

Table 2 Parental and RIL population (G2333 × G19839) means of the field experiment at the CIAT research farm in Darien, Colombia

Generally, at the V3 stage, plants had very few nodules, while at the R6 stage the amount of nodules was abundant, suggesting that formation of nodules mainly took place between 3 and 6 weeks after planting. At the early V3 stage, G19839 had a significantly higher average nodule dry weight compared to G2333, while at flowering stage (R6) the opposite was true. When looking at nodule dry weight and thereby accounting for differences in root dry weight (NDW/RDW), the difference between G2333 and G19839 is even clearer.

Percentage of N2 fixed (%N derived from air or %Ndfa) and total amounts of N fixed (total Ndfa) were calculated at R6 and R7/8 growth stages as well as at harvest. At R6 growth stage, the climbing bean parent G2333 had fixed about 50 % of its total N content while G19839 had only fixed approximately 30 %. However, at harvest, the opposite was observed, with 58 % fixed by the bush bean genotype G19839 and 43 % fixed by G2333. The latter difference was significant, and the same trends could be observed when comparing total N contents fixed at V3 and R6 growth stages and at harvest. The main period of SNF for the climbing bean parent G2333 appeared to be the period between flowering and pod-filling (313.6 mg/plant), while for G19839 this peak of SNF activity appeared to be between pod-filling and harvest (180 mg/plant).

Percentage and total N absorbed from soil (%N derived from soil or %Ndfs and total N derived from soil or Ndfs) were also calculated. In the beginning (up to R6), the main part (about 70 %) of the plant N was taken up from the soil rather than fixed from atmospheric N. From R6 to R7/8 growth stages, the climbing bean parent G2333 started fixing considerable amounts of N (up to 50 % at R7/8) while G19839 kept relying mainly on N absorption from the soil (70 %).

When looking at total N contents (sum of total Ndfs and total Ndfa), the main period of N accumulation by G2333 was between flowering and late pod-filling (292.9 mg/plant). From late pod-filling till harvest, this genotype had a net loss of N (−501.6 mg/plant) possibly due to decreased N accumulation during the later stages of plant development. G19839 also had a significant N accumulation from flowering till late pod-filling (353.2 mg/plant) but continued to accumulate N from late pod-filling up to harvest (very small net N loss of 4.7 mg/plant despite leaf fall).

Many additional traits were measured at harvest. Only parental differences that were significant will be discussed further. The seed number per pod was significantly higher for G2333 with approximately six seeds per pod as compared to G19839 with an average of three seeds per pod. An important trait which partly determines the nutritional value (in protein content) of bean seeds was the proportion of N that is translocated to the seed by the plant. For this trait, G2333 was able to translocate about 93 % of its total N to the seed while G19839 only translocated 80 %. Finally, the average yield of G2333 was significantly higher than G19839, at more than 2 t/ha for the climbing bean parent compared to 600 kg/ha for the bush bean parent.

Population averages for NUE at the growth stage R7/8 and at harvest were also higher for many of the RILs than for both parental means, indicating that most RILs performed better than both parents for this trait. The population mean for %N fixed (%Ndfa) was somewhat lower than for both parents at all sampling stages, suggesting that many RILs were fixing less N than either parent. Similar to the greenhouse experiments, all histograms showed normal distributions without extreme kurtosis or skewing, suggesting that all traits measured were inherited in a quantitative manner (data not shown). For each trait, parental means tended to be less distinct than the lowest and highest RIL (also observable when looking at the population ranges), suggesting that transgressive segregation was important in the inheritance of the traits with both parents contributing positive as well as negative alleles for each trait.

In correlation tests (Supplementary Table 3), the parameter for yield was inversely related to the climbing capacity scale (Checa and Blair 2008). Given this, there was an inverse relation between yield and climbing capacity which indicated that yield increased with increasing climbing ability. Yield was also correlated with total N fixed at the R7/8 growth stage and at harvest, suggesting that SNF actually contributed significantly to yield. Yield was likewise positively correlated with total N absorbed from the soil at the earlier R6 stage. The correlation of yield with SPAD values at R6 and R7/8 growth stages confirmed the important role of the plants’ N status in the final yield.

Climbing capacity also showed a negative correlation with total plant N content and NUE at harvest. In other words, climbing beans tended to have higher total N contents and higher NUE than bush beans. Another intriguing correlation was that between SPAD values measured at flowering (R6) and NUE at harvest, indicating that higher chlorophyll contents at earlier growth stages coincided with a more efficient use of N. Total N contents of seeds, pods and plants at harvest were all positively correlated with total N absorbed from the soil at flowering (R6) and late pod-filling stage (R8), but not with total N fixed. These correlations indicated that N absorption from the soil, as opposed to N fixation from the air, was the most important factor determining final N contents.

QTL for SNF capacity and related growth traits

In the greenhouse experiments, 20 and 23 QTL were identified for the first and second experiments, respectively. QTL were found for all measured traits with some QTL overlapping (Table 3). For leaf chlorophyll content (SPAD values), a QTL was found five times on linkage group b01 (at 7, 14, 21 and 28 days). This QTL was linked to two microsatellite markers and can be considered to be a major QTL as it is able to explain up to 33 and 35.5 % of the total phenotypic variance (R 2) of SPAD values at 14 and 28 days, respectively, in the second experiment. Another important QTL was found on linkage group b06 which was linked to the nodulation gene-based marker BSn38.

Table 3 Quantitative trait loci (QTL) analysis for G2333 × G19839 RIL population in the greenhouse experiments (Oct09, June10). QTL showing overlap are highlighted in the same colour

Additionally, overlapping QTL were found on linkage group b10 during the last 3 weeks of the experiment, which was the main period of SNF, indicating that this QTL could have a function in chlorophyll accumulation through the process of nitrogen fixation. This QTL was linked to microsatellite markers and explained from 12 to 23 % of the total phenotypic variance (R 2). G19839 provided the positive allele for this QTL. For the other QTL identified for SPAD values, both G19839 and G2333 provided an equal amount of positive alleles for higher SPAD values. QTL for shoot dry weight were found only in the second experiment, with three QTL identified. One of these QTL on linkage group b02 overlapped with a QTL found for SPAD values at 35 DAP in the second experiment and explained over 23 % of the total phenotypic variance.

Several QTL were found for root dry weight in both experiments. One of these QTL on linkage group b08 (RDW_3) overlapped with a QTL for SPAD values at 42 DAP (SPAD42D_1) which was found in both experiments. The bush bean parent G19839 provided the positive allele for this QTL. Another three and two QTL were found for N% in shoot and plant, respectively, but only in the first experiment. The QTL found on linkage group b01 for these traits overlapped (%NSHOOT_1 and %NPLANT_1) and the climbing bean parent G2333 contributed the positive allele which explained about 20 % of the total phenotypic variance. No QTL were detected for total N shoot content. Another group of QTL were identified for total N content of root and plants, but only in the second experiment. The QTL found for total N content of the plant explained 17–18 % of the total phenotypic variance, in both cases with G19839 providing the positive alleles for high total N fixed by the plant.

In the field experiment, a larger number of QTL were identified (Table 4). At flowering, a QTL for SPAD overlapped with a QTL for SPAD in the first greenhouse experiment. Another QTL found on linkage group b07 for SPAD values at late pod-filling stage coincided with QTL for SPAD values at 28 days and 35 days in the first greenhouse experiment. In the field experiment, two QTL for SPAD values were found twice: (1) the QTL on linkage group b06 was found for SPAD values at both V3 and R6 stages and (2) the QTL on linkage group b03 was found for SPAD values at both R6 and R7/8 growth stages.

Table 4 Quantitative trait loci (QTL) analysis for G2333 × G19839 RIL population grown in a field trial at the CIAT research farm in Darien, Colombia

Finally, for root biomass at flowering stage (R6), a QTL was detected on linkage group b10 and overlapped with a QTL for root dry weight at the R6 growth stage in the second greenhouse experiment. For shoot dry weight at the early V3 stage, two QTL were found on linkage groups b07 and b10 with G2333 providing the positive allele for high shoot dry weight. The QTL on linkage group b07 was found to be linked to the nodulation gene-based marker BSn69.

Nodulation parameters (NDW, NDW/RDW and NN) were also associated with some QTL including one QTL for NDW/RDW at the V3 growth stage which was most closely linked to the nodulation gene-based marker BSn33_SNP1. A QTL found for nodule and root characteristics, NDW/RDW, at the R6 growth stage overlapped with another QTL for %N fixed at harvest. The climbing bean parent G2333 provided the positive allele in both cases. For total N amounts fixed at late pod-filling stage (R7/8), a QTL on linkage group b01 was found to be closely linked to another nodulation gene-based BSn73.

QTL found for the seed number per pod and for the total N accumulated at harvest overlapped with QTL for SPAD values at V3 stage on linkage group b01. For the seed, plant and total %N accumulation, the same QTL was detected on linkage group b04. The climbing bean parent G2333 provided the positive allele for this QTL. For the proportion of N translocated to the seed (%), one major QTL was detected on linkage group b02 that accounted for 20 % of the total observed variance.

For some QTL identified in greenhouse and/or field experiments, sequences of markers that were found to be closely linked to the QTL were available. These sequences were compared to the most recent EST assembly of common bean and to a collection of gene sequences in soybean. Additionally, the most similar common bean EST, or alternatively the gene sequence of soybean, was subjected to a BLASTn analysis. This made it possible to annotate the marker sequences linked to some QTL, and the results are shown in Supplementary Table 4.

Discussion

The results of the greenhouse experiments varied for parental and RIL genotypes, with contrasting results in terms of parental N fixation in the first and second experiments. This could have been due to photoperiod differences in the greenhouse trials (fall vs. summer). In terms of N2 fixation, G2333 performed better in the field experiment with higher yields but G19839 accumulated more N both from soil and through SNF, but only by the end of the experiment.

This variability in SNF responses can be considered to be quite typical for SNF and related traits and is mainly caused by a large environmental impact on SNF performance (Bliss 1993; Peoples et al. 1995). Environmental factors that affect SNF performance have been observed to include abiotic and biotic stresses such as P deficiency, drought, pests and diseases (Herridge and Rose 2000) as well as physiological parameters such as growth habit. High mineral N availability and endogenous non-fixing rhizobial populations can be contributing factors to environmental variability. These environmental factors can obscure the true SNF capacity of a legume crop and make it very difficult to obtain stable results over time and location.

Despite the variation in phenotypic results between greenhouse and field, some consistent QTL were identified in all three experiments or in at least two out of the three experiments. A first interesting QTL for leaf chlorophyll content (SPAD) at early plant growth stages was detected in both greenhouse experiments on linkage group b01 (see Fig. 1) with an overlap of QTL underlying SPAD values at the early V3 stage in the field experiment. There was also an overlap with another QTL for the total N fixed in the shoots at late pod-filling stage (R7/8). A second interesting QTL for SPAD values at later growth stages was found in both greenhouse experiments on linkage group b10. However, this QTL was not identified under field conditions. On the same linkage group, a QTL for root dry weight at flowering was found in the second greenhouse experiment as well as in the field experiment. On linkage group b07, a QTL for SPAD values at 4–6 weeks in the greenhouse coincided with a QTL for SPAD values at late pod-filling stage in the field.

Various QTL for %N fixed and total N fixed were identified. In the first greenhouse experiment, two QTL were identified for N% fixed on linkage groups b01 and b04. The QTL on linkage group b01 overlapped with a QTL detected for total N content at harvest in the field experiment. In the second greenhouse experiment, two QTL for total plant N content were identified on the same linkage groups but these did not show any overlap with other QTL for amounts of fixed N. Finally, the field experiment also yielded some QTL for SNF-related traits, including one on linkage group b04 for the N% fixed at harvest and one on linkage group b01 for the total N fixed in the shoot at late pod-filling stage. These QTL, however, did not show any overlap with previously identified QTL from the greenhouse experiments. Variability between greenhouse and field results have also been observed for studies of phosphorus uptake in common bean (Miklas et al. 2006) and therefore was not surprising for the more complex biological interactions of Rhizobia and common bean.

In terms of QTL detection accuracy, the present study was developed with a small mapping population (<100 individuals). Therefore, these results have to be interpreted with special caution because of the “Beavis effect” (Xu 2003). For this reason, further analysis, based on our preliminary results, must be done. Specifically, using the linked markers to QTL reported here, QTL confirmation and population size expansion (Lander and Kruglyak 1995) approaches will be useful to validate the loci involved in SNF capacity.

Previous genetic studies have been performed using the G2333 × G19839 common bean population. The first one was an inheritance study and QTL analysis of climbing ability (Checa et al. 2006; Checa and Blair 2008). In two other studies, QTL for adventitious rooting (Ochoa et al. 2006) and phytate and phosphorus content of seed (Blair et al. 2009) were described. QTL for SPAD values at V3 and R6 growth stages identified in the field experiment overlapped with QTL identified in the field for adventitious root dry weight on linkage group b06 (Ochoa et al. 2006). Another QTL for RDW at flowering (R6) in the field coincided with one of the QTL underlying specific root length measured in the greenhouse (Ochoa et al. 2006). Several QTL identified in our study overlapped with QTL identified for climbing bean growth traits in the study by Checa and Blair (2008). Of the overlap, the most interesting would be on linkage group b07, where a QTL for SPAD values measured at 28 days overlapped with one of the QTL found for climbing ability by Checa and Blair (2008). These results might strengthen the hypothesis that climbing ability is genetically linked to nitrogen fixation capacity, which would explain why many climbing beans are good at N2 fixation as reported by previous authors (Graham and Halliday 1976; Graham and Rosas 1977; Kumarasinghe et al. 1992).

A number of studies to detect natural variation and QTL for SNF and related traits in legumes have been published (Nodari et al. 1993; Tsai et al. 1998; Souza et al. 2000; Tanya et al. 2005; Nicolas et al. 2006; Santos et al. 2006; Bourion et al. 2010). Only two of these studies used common bean for identifying QTL for nodule number. In the greenhouse study by Tsai et al. (1998), a total of four QTL for high and low level of mineral N fertilization were identified on linkage groups b01, b03, b04 and b07. In the present study, two QTL for nodule number were found on linkage group b03 and b05, explaining each about 17 % of the total observed phenotypic variance and with G2333 providing the positive allele for high nodule number.

When comparing these QTL for nodule dry weight, an overlap of QTL was evident on linkage group b03 with one QTL from Tsai et al. (1998). The QTL for NDW at R6 stage appeared to coincide with a QTL for NN under both high and low mineral N fertilization levels. Another QTL for NN at the V3 growth stage on linkage group b03 overlapped with a QTL for NN under low mineral N fertilization from Tsai et al. (1998). Finally, a third QTL for NDW at R6 stage overlapped with a QTL for NN under high mineral N fertilization on linkage group b04 from that previous study.

A second study by the same research group (Souza et al. 2000) using the same experimental population identified a total of eight and five QTL for NN under low and high mineral N fertilization, respectively. When comparing the present study with the results obtained by Souza et al. (2000) it seems that the QTL identified in this study for NN at V3 stage on linkage group b03 again overlapped with a QTL for NN under low mineral N fertilization. However, it was difficult to exactly compare the results of this study with results from Souza et al. (2000) as different populations were used.

Analysis of N% and total N contents of all plant parts at harvest, including seed, pod and plant parts, allowed us to calculate the proportion of N that was translocated to the seed. This trait, together with the absolute N% in the seed, is important for the seed protein content. Absolute seed N% was somewhat lower for the seed of the climbing bean parent G2333 than for the bush bean parent G19839. For both seed N% and translocation of N to the seed, QTL were found including one on linkage group b04 for seed N% and one for proportion of N translocated to the seed on linkage group b02. These QTL were both linked to microsatellite markers which could be used for marker-assisted selection of these traits.

Some QTL that were identified in this study were linked to the nodulation gene-based markers that were mapped on the existing map of the G2333 × G19839 population. For these QTL and for some other QTL linked to markers with known sequences, it was possible to compare sequences with the most recent common bean EST assembly and with a soybean gene collection in order to annotate tentatively the sequences linked to the QTL identified in this study.

First, a QTL was identified for early SPAD values on linkage group b06 for the greenhouse experiments, more specifically at 7 days in the first greenhouse experiment and for SPAD at 21 days for the second greenhouse experiment, which was closely associated with the gene ASTRAY (BSn37_SNP2) which is a known bZIP transcription factor with a function in the regulation of nodulation (Nishimura et al. 2002). Legumes control nodulation by sensing environmental cues where ASTRAY can play a roll.

For SPAD at 7 days, another QTL on linkage group b01 was also linked to a bZIP transcription factor. We expected transcriptional control to be important in determining SNF-related traits as a whole, since these are responsive to complex signal perception and a cascade of gene responses (Yokota and Hayashi 2011).

Another interesting candidate gene—QTL association was identified for a QTL for shoot dry weight in the second greenhouse experiment, on linkage group b02, which was linked to the gene-based marker BSn67. This sequence contains a conserved region of an auxin-responsive transcription factor ARF4 of Arabidopsis thaliana. The parents of the population G2333 and G19839 might have different auxin levels, resulting in differing biomass accumulation. As the positive allele for this QTL originated from the climbing bean parent G2333, this variety might be the one with higher expression levels of the transcription factor or higher actual levels of auxin under conditions of symbiotic nitrogen fixation.

Both shoot and root dry weight at V3 growth stage in the field experiment showed overlapping QTL on linkage group b07 that in turn were linked to the marker BSn69. This sequence contains a WRKY-DNA binding domain and an associated plant zinc cluster domain. The WRKY proteins are a superfamily of transcription factors with up to 100 representatives in Arabidopsis (Eulgem et al. 2000). Family members appear to be involved in the regulation of various plant physiological programs. At this stage, however, there is no detailed information available yet on the exact function of each transcription factor in this process or possible associations with N2 fixation.

Another candidate gene of interest was based on the marker BSn33_SNP1 underlying the QTL for NDW/RDW at the V3 growth stage. This sequence is homologous to LYK3 originally identified in Medicago truncatula and is known to be one of the LysM domain receptor kinases in a two-receptor model that regulates rhizobial infection in this model legume together with LYK4 (Limpens et al. 2003). The positive allele for high NDW/RDW was provided by the climbing bean parent G2333 for this QTL.

In addition, the QTL for total N fixed at late pod-filling stage (R7/8) on linkage group b01 was shown to be linked to the marker BSn73 (Supplementary Table 4) which showed high homology to an AP2/ERF-domain transcription factor. This family of transcription factors integrates jasmonic acid and ethylene signals in expression regulation of several plant defense or abiotic stress-related genes (Nakano et al. 2006). Additionally, in M. truncatula, these transcription factors have found to be involved in both activation and repression of nodulation factor-elicited gene transcription in root hairs at the beginning of the infection process by Rhizobium (Andriankaja et al. 2007). These findings make BSn73 a promising putative gene for marker-assisted selection or evaluation.

Finally, the marker BMd017, related to a DNA-J protein, was associated with a QTL for SPAD at flowering in the field experiment on linkage group b02. This gene, known as a Heat Shock Protein 40 (Hsp40), is from a family of heat-shock proteins that contain a 70 amino-acid consensus sequence known as the J domain (Schlesinger 1990). Hsp40 plays a role in regulating ATPase activity with the expression of heat-shock proteins increasing upon exposure to elevated temperatures or other stress. Heat-shock proteins also function as molecular chaperones binding substrate proteins that are in unstable, non-native structural states during protein folding (Boston et al. 1996). Parental bean varieties G2333 and G19839 might have a difference in expression of a certain heat shock protein which functions as molecular chaperone in the correct folding of Rubisco leading to a difference in chloroplast accumulation in the leaves.

In order to confirm candidate gene—QTL associations, one of the strategies could be to look at variation in expression levels of these genes (e.g. by qRT-PCR) in the population. Another strategy that can be used to further confirm and fine map some of the most interesting QTL identified in our study is through the approach of heterogeneous inbred-family analysis which is a rapid method for developing near-isogenic lines that differ only at the QTL of interest (Tuinstra et al. 1997). This method is particularly suitable in the G2333 × G19839 RIL population, as the degree of heterozygosity is substantial. For some QTL, including the QTL for N % in the seed and the proportion of N translocated to the seed (determining the nutritional value of the seed), it would also be worthwhile to further study their consistency through repeated QTL analysis and/or through association studies in natural populations. If consistent, the markers linked to these QTL could be used for marker-assisted selection in breeding efforts to improve the nutritional value of bean seeds.