Introduction

From the adaptation perspective, plants with higher yield quantity and quality are better adapted to certain environments (Cooper and Byth 1996). As a result, one of the most important traits to be improved by plant breeders is seed yield with increased resistance to biotic and abiotic stresses. The variation observed in yield could be due to the variation in different agronomic traits that are correlated with yield.

To combat the potential problem of limited genetic gains in soybean breeding programs, the introgression of new sources of germplasm appears to be a solution. This may increase genetic variability allowing better genetic gains from selection (Ude et al. 2003). Several studies demonstrated that plant introductions (PI) or exotic germplasm, such as modern Asian cultivars, which are genetically different from North American germplasm, represent an important reservoir of favorable alleles that can be used in breeding programs (Orf et al. 1999b; Li et al. 2001a, b; Concibido et al. 2003).

The use of molecular markers has facilitated the identification and localization of quantitative traits loci from exotic germplasm. Reports by Thompson and Nelson (1998), Orf et al. (1999a), Concibido et al. (2003), Ude et al. (2003), Kabelka et al. (2004), Smalley et al. (2004), and Wang et al. (2004) demonstrated that exotic genes can be successfully introduced from wild species for yield enhancement through genetic mapping and molecular markers in a number of crop species.

As an example of the use of exotic germplasm and molecular tools to improve yield and yield components, Orf et al. (1999a, b) compared two recombinant inbred line (RIL) population derived from ‘Minsoy’ and ‘Archer’, and ‘Noir 1’ and Archer. Archer is an elite cultivar from North America (Cianzio et al. 1991) while Minsoy and Noir 1 are PIs (Mansur et al. 1996; Orf et al. 1999a, b). QTL were identified for plant height, lodging, days to flowering and maturity, seed yield, seed weight, and seed oil and protein content. Many of the QTL identified in this study have the beneficial alleles provided by the exotic parents.

In another study, Kabelka et al. (2004) found yield QTL that were associated with significant changes in days to maturity, in plant height and in seed protein concentration in a population derived from a PI and an adapted cultivar. Using a population derived from Canadian × Chinese elite lines, Palomeque et al. (2009a, b) found four yield QTL (tagged with markers Satt100, Satt277, Satt162 and Sat_126) that were detected across the Canadian and Chinese mega-environments. Those QTL were co-localized with QTL associated with different agronomic traits such as 100 seed weight, plant height, days to maturity and oil content. However, none of these seed yield QTL were confirmed in a second population derived from a cross between a different Canadian and Chinese elite soybean line (Pioneer 9071 × #8902) (Palomeque et al. 2010).

Quantitative traits, such as yield, are strongly influenced by environmental factors. As a result, when adaptation is studied, the performance of different genotypes is evaluated across different environments. Mega-environments have been defined by the International Maize and Wheat Improvement Center (CIMMYT) as “a broad, not necessarily contiguous geographic area, occurring in more than one country and frequently transcontinental, defined by similar biotic and abiotic stresses, cropping system requirements, consumer preferences and volume of production” (Braun et al. 1996).

Therefore, the objectives of this study were: (1) to identify QTL for high yield in two populations derived from crosses between Canadian and Chinese elite soybean lines (OAC Millennium × Heinong 38 and Pioneer 9071 × #8902) tested at three hypothetical mega-environments: Ontario, Canada; Minnesota, United States and northeast China, (2) to evaluate the relationship between yield and yield components and seed quality parameters and (3) to determine if any seed yield QTL present in two or more mega-environments is co-localized with any agronomic trait QTL across the same three mega-environments.

Materials and methods

Field experiments

Two populations developed from crosses between Canadian and Chinese soybean lines were used in this experiment. Population 1 consisted of 92 F4:7 RILs derived from the cross OAC Millennium × Heinong #38. Population 2 was comprised of 131 lines at F4:7 stage originated from the cross between Pioneer 9071 and #8902. OAC Millennium and Pioneer 9071 are high-yielding cultivars well adapted to northern US and Canada (MG = 0 and 0.7, respectively), which were developed by the University of Guelph and Pioneer Hi-Bred, a DuPont Company, respectively. Heinong #38 and #8902 are well-established elite cultivars developed and widely used in the Heilongjiang province of the People’s Republic of China. Both parents of each cross as well as commercial varieties from either environment were used as controls in field trials. Because of their elite nature, neither parent in both populations was considered a completed non-adapted PI.

Both populations were grown in three hypothetical mega-environments: Ontario, Canada; Minnesota, United States (US), and northeast China at different locations (Table 1). The parental line Pioneer 9071 was not evaluated in Woodstock in 2009. In Canada and United States, the experimental design was an RCBD Nearest Neighbor with two replications while in China the experiment was arranged as a rectangular lattice design with two replications. The plot size was 1.5 × 5.5 m in the Canadian sites and 1.5 × 2.4 m in the US sites, with four rows and 35 cm between-row spacing in all sites. For China, the plot size was 2.8 × 3 m with four elevated ridges per plot planted at 70 cm between ridges in Harbin, while in Changchun plots were 2.4 × 3 m with four rows separated by 60 cm between rows. Number of seeds per plot was adjusted based on germination rate. Seed density was the same at all locations within each mega-environment being 18 seeds per meter in Canada and the US, whereas in China it was 28 seeds per meter. Similar plant stands were achieved at all locations. The number and diversity of the environments used in this study along with different resources used by different collaborators added complexity that we could not completely control. Plots were machine harvested in the Canadian and US environments while they were hand-harvested and machine-threshed in the Chinese environments.

Table 1 Location of field experiments in three hypothetically different mega-environments during 2009 and 2010 using two Canadian by Chinese soybean populations

Phenotypic scoring

Seed yield (kg/ha) was recorded per plot at each location and year and adjusted to 13 % moisture. Other yield-related traits were recorded per plot at each location and year: days to maturity, as the number of days from planting to the stage when 95 % of the pods show maturity color (R8) (Fehr et al. 1971); lodging score (1 for no lodging to 5 for completely prostrate); plant height (in centimeters) and 100 seed weight (in grams). Each measurement was performed following the protocol detailed by Palomeque (2007). Percentages of oil and protein were analyzed per plot using a portable grain analyzer ZX50 (Zeltex Inc., Hagerstown, MD, USA).

DNA extraction and PCR reaction

Genotypic data from population 1, OAC Millennium × Heinong #38, generated by Palomeque et al. (2009a) was used in this experiment. For population 2, Pioneer 9071 × #8902, tissue samples were collected from parents and RILs in Elora, ON, Canada during 2009, with the aid of a leaf puncher. Leaf disks were collected in pre-labeled 2.0 ml centrifuge tubes, transported on ice and freeze dried with a Labonco FreeZone® freeze dry system (Savant Modulyo, Kansas City, MO, USA) to remove moisture and inactivate endogenous nucleases and then stored at −80 °C.

Table 2 Least square means and standard error for seed yield of OAC Millennium (Canadian parent) and Heinong 38 (Chinese parent), the checks, the recombinant inbred lines (RIL) derived from them and the highest-yielding RIL
Table 3 Least square means and standard errors for seed yield of Pioneer 9071 (Canadian parent) and #8902 (Chinese parent), the checks, the recombinant inbred lines (RIL) derived from them and the highest-yielding RIL

Genomic DNA was obtained from 10 to 15 stored leaf disks per line previously ground by FastPrep FP 120 (Savant Instruments Inc., Holbrook, NY, USA). GenElute™ Plant Genomic DNA Kit (SIGMA®, Saint Louis, MO, USA) was used for DNA extraction. A Nanodrop ND1000 Spectrophotometer (Nanodrop Technologies Inc., Wilmington, DE, USA) was utilized to check the concentration and quality of each extraction.

For PCR reactions, the extracted DNA was diluted in a 1/100 proportion and stored as template DNA in a 96-well polypropylene plate at 4 °C. Reagents were combined in the following amounts in a 1.5 ml centrifuge tube: 1.5 μl 10× PCR buffer, 1 μl 3 mM dNTPs, 1.5 μl 50 mM MgCl Sigma2 , 2 μl primer forward and 2 μl primer reverse, 0.4 μl 2.5 U/μl TaqSigma, 3 μl 10 ng/μl gDNA and 3.6 μl of deionized and distilled water. In a 96-well PCR plate, 12 μl of master mix (combination of reagents previously described) and 3 μl of template DNA were placed in each wall and spun in a centrifuge (Hermle Z180M, Labnet, Edison, NJ, USA). Finally, 12 μl of mineral oil was added on each plate to prevent evaporation.

PCR reactions were performed in the 96-well RoboCycler® (Stratagene, La Jolla, CA, USA) with an amplification program consisting of 2 min at 95 °C, followed by 35 cycles of denaturation at 95 °C for 1 min, annealing at 47 °C for 1.5 min, and extension at 68 °C for 1.5 min. A final extension step of 72 °C for 5 min was set and the finished reaction mixture was held at 4 °C. Amplification products were separated by electrophoresis on a 4.5 % Metaphor® high-resolution agarose gel (Bio Whittaker Molecular Applications, Rockland, ME, USA) using a Sunrise TM 96 Horizontal Electrophoresis Apparatus (Gibco BRL, Life technologies, Carlsbad, CA, USA) with 115–130 MA/V of current supplied by an EC105 Electrophoresis Power Supply (ThermoEC, Holbrook, NY, USA). DNA bands were visualized under UV light after staining with ethidium bromide.

The University of Guelph SSR library available consisted of 450 SSR primer pairs selected from the integrated soybean genetic map (Cregan et al. 1999), which covered approximately equidistantly the 20 soybean genetic linkage groups. Both parental lines of the Pioneer 9071 × #8902 population (population 2) were screened with the available SSR markers. One hundred-and-two of the available 450 markers that were polymorphic between the parents were used to genotype the entire RIL population. The polymorphic markers were used to screen the entire RIL population.

Linkage mapping and QTL analysis

The linkage map from OAC Millennium × Heinong 38 population created by Palomeque et al. (2009a) was used in this study for QTL analysis. In that map 26 linkage groups were obtained covering 17 chromosomes in total (Palomeque et al. 2009a). A new map was obtained for population 2, using JoinMap 4® program (van Ooijen 2004a). Segregation distortion was calculated to determine the departure from the expected 1:1 radio. Since segregation distortion can be considered as a normal phenomenon in wide crosses such as the ones used in this study, only the markers that showed extreme segregation distortion (10:1 or greater ration between alleles in the RILs) were not used in the analysis. The parameter that was used to group markers into LGs was recombination frequency. The mapping algorithm was regression and the mapping function selected was Kosambi’s (van Ooijen 2004a). Nineteen linkage groups were generated for population 2. The mapping order and distances among markers were similar to the Soybean Composite Map 2003 (Soybase 2011). Marker distance ranged between 1.2 (Ch. 17) and 38.1 (Ch. 7) cM.

Yield QTL were detected using a single-factor ANOVA and interval mapping. The single factor ANOVA analysis was performed using a macro program from SAS ver. 9.1.3 (SAS Institute Inc. 2003) obtained from Dr. Elizabeth Lee of the University of Guelph (Guelph, Ontario, Canada). QTL were also determined using interval mapping analysis (IM) of MapQTL5® (van Ooijen 2004b) with a LOD score of 2.3 and 2.4, for population 1 and 2, respectively. LOD score was calculated by performing a permutation test using MapQTL5® (van Ooijen 2004b) with a set of 10,000 interactions.

Only the SSRs markers that were significantly associated with yield in two or more potential mega-environment were used for QTL analysis of yield-related traits of each population. Only single-factor ANOVA analysis was performed to detect agronomic traits QTL. The type I error (α) was set at 0.01 for each analysis.

Statistical analysis

The variation in seed yield and yield-related traits for the Canadian and American data within a test location and year was analyzed using the protocol for nearest-neighbor RCBD of Agrobase Software (Agronomix Software Inc., Winnipeg, MB, Canada). The variation in yield and other traits for the Chinese experiments was partitioned into effects of genotype, replications and incomplete blocks within replications using PROC MIXED procedure from SAS ver. 9.1.3 (SAS Institute Inc. 2003) for a rectangular lattice design (Bowley 1999). Replications and blocks within replications were considered to be random variables. Test of residuals was evaluated using PROC UNIVARIATE and PROC PLOT procedure from SAS ver. 9.1.3 (SAS Institute Inc. 2003). LSMEANS and PDIFF statements were used to calculate differences in RIL’s means. Tukey’s multiple range comparison test was utilized for mean comparisons. Linear correlations between yield and yield traits were calculated using PROC CORR and PROC PLOT procedures. Seed yield and agronomic traits means were not averaged over years and locations since significant interactions were between genotypes and years and between genotypes and locations (Reinprecht et al. 2006). The type I error (α) was set at 0.01.

Results

Phenotypic evaluation

The mean yield of the population derived from the cross between OAC Millennium × Heinong #38 (population 1) was lower than the adapted parent, the non-adapted parent and the checks across Canadian, US and Chinese environments in 2009 and 2010 (Table 2). Only at Elora in 2009 and Rosemount in 2010, the population yielded higher than the non-adapted and the adapted parent, respectively. However, the mean of the highest yielding line had an increase in seed yield when it was compared with the respective adapted parent across environments, except at one location. In Changchun in 2010, no significant differences were found between the highest yielding line and the adapted parent, Heinong #38. On average, the highest yielding lines yielded 33, 50 and 17 % more than the adapted parent in Canada, US and China, respectively. In addition, the highest yielding lines exceeded in 40 and 30 % the mean yield of the checks across some of the Canadian and US environments, respectively. No differences in seed yield were found between the highest yielding lines and the checks at Elora and St. Pauls in 2010 and at the Chinese environments, Harbin and Changchun.

Similar results were found for population 2, Pioneer 9071 × #8902, where the mean yield of the population was lower than the mean yield of the adapted parent, the non-adapted parent and the checks across the three mega-environments, Canada, US and China (Table 3). The population mean was observed to be higher than the adapted parent’s mean in Woodstock 2010 and Changchun 2010. Several RILs significantly exceeded the yield of the adapted parent except at two locations. At Elora and St. Pauls in 2010, no significant differences in yield were found between the adapted parent (Pioneer 9071) and the highest yielding lines. On average, the highest yielding lines had an increase in seed yield of 40 and 36 % compared with the adapted parent, Pioneer 9071, in Canada and US environments. In China, the highest yielding lines yielded on average 48 % more than the adapted parent, #8902. In addition, the highest-yielding lines have an increase in seed yield of 30–40 % when compared with the yield of the commercial lines across Canada, US and China, except at one location. At Elora in 2009, no significant differences in yield were found between the highest yielding lines and the mean of the checks.

Correlations between seed yield and agronomic traits

Most of the agronomic traits were correlated with seed yield in population 1 (Table 4) and 2 (Table 5). Maturity (R8; Fehr et al. 1971) was positively correlated with seed yield in 7 out of 9, and in 6 out of 11 environments in populations 1 and 2, respectively. In the case of lodging, it was positively correlated with seed yield in four out of nine of the environments for population 1. However, the correlation was negative for most of the environments of population 2 where the interactions between lodging and seed yield were significant. Disease data were not collected across locations and years as no significant diseases were observed in the trials. No significant abiotic stress was observed in these trials either.

Table 4 Correlation coefficients between yield and other agronomic traits found in a population derived from a cross between Heinong 38 and OAC Millennium across different environments in Canada, United States and China during 2009 and 2010
Table 5 Correlation coefficients between yield and other agronomic traits found in a population derived from a cross between Pioneer 9071 and #8902 across different environments in Canada, United States and China during 2009 and 2010

Plant height was also positively correlated with seed yield in five out of seven and in six out of nine environments in population 1 and 2, respectively. A negative correlation between this trait and seed yield was observed in one of the Chinese locations in population 2. Correlation between 100 seed weight and seed yield was significant for population 1 in three of the nine environments being positive in one environment but negative in the other two. In the case of population 2, the correlation was positive for all environments where it was significant.

Oil content was significantly associated with seed yield in just two US environments in population 1. This association was inconsistent since it was positive in one but negative in the other one. In population 2, the correlation was negative at one location in the Canadian mega-environment, one location of the US mega-environment and one location in China. On the contrary, protein content was negatively correlated with seed yield at one, and at three locations in populations 1 and 2, respectively.

QTL analysis

OAC Millennium × Heinong 38 RIL population

Two markers, the interval Satt364–Satt591 and Satt277, were significantly associated with seed yield across the three mega-environment Canada, US and China when single-factor ANOVA and interval mapping were performed (Table 6). The interval Satt364–Satt591 was significantly associated with seed yield at two of the Canadian environments, at one US environment and at one Chinese environment. It is located on chromosome 5 (LG A1) based on the Composite Map 2003 (Cregan et al. 1999; Soybase 2011) and explained between 10 and 23 % of the variation observed in seed yield. The adapted parent, OAC Millennium, contributed the high-yielding allele to this QTL (Table 6) These markers were also linked with a QTL associated with oil content that was identified at one of the US locations explaining approximately 11 % of the variation observed for this trait (Table 7).

Table 6 Seed yield QTL identified by single-factor ANOVA and interval mapping (IM) in a minimum of one macro-environment in at least two mega-environments and higher yielding allele in a recombinant inbred population derived from the cross between a Canadian and a Chinese elite soybean line, OAC Millennium × Heinong 38, respectively
Table 7 Maturity, lodging, plant height, 100 seed weight, protein and oil content QTL identified by single-factor ANOVA in a recombinant inbred population derived from the cross of OAC Millennium and Heinong 38 across different mega-environments (ME), Canada, United States and China in 2009 and 2010

Satt277 was also significantly associated with seed yield across three Canadian environments, at two US environments and at one Chinese environment. It was located on chromosome 6 (LG C2) and explained between 9.0 and 14 % the yield variation observed in this study. However, in this case the Chinese parent, Heinong #38, contributed the favorable allele. The QTL tagged by Satt100, located in the same chromosome 6 (LG C2), at approximately 7 cM from Satt277, was the one that explained most of the variation observed in seed yield with an R 2 that varied between 13 and 32.3 % depending on the environment. It was identified as a QTL not only in all the Canadian environments but also in two of the US ones (Table 6). QTL Satt277 and Satt100 were also linked to maturity and height QTL across Canada, US and China. These two yield QTL also overlapped other agronomic trait QTL associated with lodging, 100 seed weight, oil and protein content across Canada and US (Table 7).

Seven markers were associated with seed yield in just two mega-environments, Canada and US. All of them also co-localized with other agronomic trait QTL associated with maturity, lodging, plant height, 100 seed weight, oil and protein content that were detected not only in Canada and US but also in China (Tables 6, 7).

In addition, 11 markers were associated with seed yield in at least one macro-environment within the mega-environment. Six of those yield QTL were identified in Canadian environments, whereas 2 and 3 out of 11 QTL were associated with yield in the US and Chinese environments, respectively (data not shown). Palomeque et al. (2009a) defined those QTL as mega-environment-specific QTL (QTLsp).

In this RIL population, both parents contributed favorable alleles to the progeny. Among the QTL that were significant in two or more mega-environment, four of them carried the high-yielding allele from Heinong #38, which was the exotic parent in the Canadian and United States mega-environments while being the adapted parent in the Chinese one. In particular, the QTL linked to marker Satt273 inherited the high-yielding allele from the Chinese parent during 2009 and from the Canadian parent during 2010 in Canada and United States. This result could be attributed to more favorable environmental conditions in 2010 that resulted in a longer growing season than in 2009. There was more rain, higher temperature and humidity that could affect some physiological mechanisms that were reflected in differential expression of alleles at loci yield.

The most frequent top ten high-yielding lines across all mega-environments carried most of the beneficial alleles of the QTL that were present in two and three mega-environments. Line 26 was the most frequent line being among the top ten high-yielding lines in six of nine environments. It inherited all the favorable alleles of the QTL that were presented in two and three mega-environments (Table 8).

Table 8 Ten most frequent RILs that rank among the top ten yielding lines, derived from the cross OAC Millennium × Heinong 38 and their contributing alleles at nine marker loci linked to seed yield QTL detected in a minimum of one macro-environment of at least two mega-environments

Pioneer 9071 × #8902 RIL population

In this population, Sat_341 was significantly associated with yield across two mega-environments, Canada and China. A possible significant yield QTL was detected associated with this marker in US when the significance level was set at 0.05. Sat_341 was located on chromosome 10 (LG O) based on the Composite Map 2003 (Cregan et al. 1999; Soybase 2011) and explained between 7.5 and 8.2 % of the variation observed in seed yield. In this case, the high-yielding allele was contributed by the Chinese parent #8902 (Table 9). No agronomic traits that were measured in this study co-localized with this seed yield QTL (Table 10).

Table 9 Seed yield QTL identified by single-factor ANOVA and interval mapping (IM) in at least one macro-environment in a minimum of two mega-environments and the high-yielding allele in a recombinant inbred population derived from the cross between a Canadian and a Chinese elite soybean line, Pioneer 9071 × #8902, respectively
Table 10 Maturity, lodging, plant height, 100 seed weight, protein and oil content QTL identified by single-factor ANOVA in a recombinant inbred population derived from the cross Pioneer 9071 × #8902 across different megs-environments (ME), Canada, United States and China during 2009 and 2010

Markers Satt005 and Satt600 were mapped very close with an estimated average distance between them of 0.12 cM in the published composite map 2003 (Cregan et al. 1999; Soybase 2011). The QTL that was flanked by those markers was the one that explained most of the variation observed for seed yield in this population, with an R 2 that varied, depending on the location, between 8 and 13.2 %. This yield QTL was identified in Woodstock, Canada, in 2009 and Changchun, China, in 2010. When the significance level was set at 0.05, this QTL was also detected at Lamberton and Rosemount, US, in 2010 explaining approximately 6 % of the variation observed for this trait. The Chinese parent, #8902, contributed the high-yielding allele for this QTL (Table 5). Significant associations were found between these markers and maturity, lodging, plant height and oil content across different environments in this population (Table 10).

In addition, 18 QTL were associated with seed yield in only 1 mega-environment, being significant in at least one macro-environment within the mega-environment. Nine of those QTL were found in the Canadian environments, four in the US environments and only one marker was found significant in China. Both parents contributed favorable alleles to the progeny. In this population, line 132 ranked first among the most frequent top 10 high-yielding lines in 6 of 11 environments and contained all the expected favorable alleles for the QTL that were presented in 2 and 3 mega-environment (Table 11).

Table 11 Ten-most frequent RILs that rank among the top ten yielding lines, derived from the cross Pioneer 9071 × #8902 and their contributing alleles at 3 marker loci linked to seed yield QTL detected in a minimum of one macro-environment of at least two mega-environments

Genetic similarities between populations

Both populations were screened with the same set of SSR markers and the markers that were polymorphic in both were used to compare the QTL between them. Satt600 tagged a seed yield QTL in both genetic backgrounds at one location in Canada in the population 1 (data not shown). For population 2, this marker was located close to Satt005 and together explained the greatest amount of the variation observed in seed yield in this population. The Chinese parents, Heinong #38 and #8902, contributed the high-yielding allele for the first and second populations, respectively. However, when this marker was compared with other agronomic traits, the results varied between populations. No agronomic traits were detected as co-localizing with this seed yield QTL in population 1 while significant associations were found between this marker and maturity, lodging, plant height and oil content across different environments in population 2 (Table 10).

Satt263, located on chromosome 15 (LG E), was also detected as seed yield QTL in both populations. It was identified in Canadian and US environments in population 1 while it was detected only in US for population 2. This marker was also linked to other QTL that appeared to be significantly associated with 100 seed weight across all mega-environments in population 2 (data not shown). No agronomic traits were associated with this marker in population 1 (Table 7).

Even though Satt277 and Satt100, located on chromosome 6 (LG C2), tagged two important seed yield QTL only population 1, a seed yield QTL linked to marker Satt365, which mapped in the same region of the same linkage group, was identified in population 2 in Elora in 2010. The distance between Satt277 and Satt100 was estimated at approximately 7 cM in the published composite map 2003 (Cregan et al. 1999; Soybase 2011). Satt365 was located on the same linkage group, between those two markers, with an approximately distance of 4 cM from Satt277 and 2 cM to Satt100, based on the composite map (Cregan et al. 1999; Soybase 2011) which indicated the importance of this region of chromosome 6 (LG C2) for breeding purposes. The Chinese parent contributed the favorable allele for these markers. Maturity, plant height, 100 seed weight, oil and protein were associated with this region in both populations.

Discussion

From an agricultural standpoint, plants are considered better adapted to a particular region than others when they have higher seed yield and seed quality than other individuals in certain environments (Cooper and Byth 1996). For decades, yield improvements have been achieved through conventional breeding using elite lines as parents, which could lead to diminishing gains due to genetic similarities among cultivars (Manjarrez-Sandoval et al. 1997; Sneller 1994; Ude et al. 2003). However, the introgression of genes from PI and exotic germplasm into the current soybean genetic base may increase genetic variability and lead to greater gains from selection (Thompson and Nelson 1998).

With the idea of understanding how relevant high-yield QTL behave under different ME, two Canadian × Chinese soybean populations were tested in this study at different locations in Ontario, Canada; in Heilongjiang and Jilin, provinces of China and in a new mega-environment, Minnesota, US. The objectives were to find additional seed yield QTL across one, two and three mega-environments and in both populations. From the reported seed yield QTL that were detected in two or three mega-environments, we attempted to analyze the correlations between yield and yield components and to determine the co-localization with other agronomic trait QTL.

In both crosses across most of the environments, the RIL population yield means were lower than that of the adapted parents. Similar results were reported by Orf et al. (1999a) and Palomeque et al. (2009a). However, several lines exceeded the yield of the adapted parents. Among them, the most frequent top ten high-yielding lines across all mega-environments in both populations carried most of the beneficial alleles for seed yield QTL that were significant in two or three mega-environments. This suggests that PI alleles could be introduced from different germplasm sources without a yield penalty and used as potential sources to increase genetic variability leading to higher and potentially more sustainable yields.

Although some inconsistencies were found, significant correlations existed between seed yield and maturity, lodging, height, seed weight, protein and oil. In addition, most of the seed yield QTL co-localized with agronomic trait QTL. Results confirmed that, in general, yield variation observed in the progenies of both crosses grown in the Canadian, United States and Chinese mega-environments was the consequence of the variation observed in different agronomic traits. Those agronomic traits were correlated with yield and associated with QTL detected in one, two or three mega-environments. In particular, maturity was considered one of the most important traits that influenced seed yield (Egli et al. 1981). Thus, higher yield could be mainly explained by the variation in traits associated either with late maturity or with optimum flowering, which could increase the seed filling period. Similar results were found for population 1 by Palomeque et al. (2009b) and by Orf et al. (1999a), Burton (1987), and Wehrmann et al. (1987) for another population for these agronomic traits.

Brummer et al. (1997) suggested that QTL needed to be stable across environments to be useful in breeding programs and that breeders should be careful about incorporating environmentally sensitive QTL into their cultivars unless they work in specific environments. We assessed stability by evaluating both populations across different mega-environments. The interval Satt364–Satt591 was significantly associated with seed yield in at least one location of each of the three mega-environments in population 1. It co-localized with oil content in US. This association was previously reported in a bi-parental cross between Essex and Williams (Hyten et al. 2004) and also with maturity in another cross (Panthee et al. 2004). The yield QTL associated with marker Satt277 was also identified in all mega-environments in population 1 and co-localized with other agronomic traits QTL such as maturity, lodging, plant height and protein content. It was previously reported in the same population associated with seed yield, 100 seed weight, pods per plant, pods per node, height, R1, R5 and R8 in the Canadian and Chinese mega-environments by Palomeque et al. (2009a, b). It was also identified as a QTL associated with lodging, seed weight, plant height, yield per plant height (Orf et al. 1999a), seed yield (Orf et al. 1999a, b; Reinprecht et al. 2006; Smalley et al. 2004; Specht et al. 2001), oil and protein concentration (Reinprecht et al. 2006) in other populations. In population 2, derived from the cross between Pioneer 9071 and #8902, the QTL tagged by Sat_341 was the only one that was significantly associated with yield across two and three mega-environments. None of the other agronomic traits that were measured in this study were associated with this marker. This result implied that some physiological process might be involved in the variation observed for yield. There is no other previous report of QTL linked to Sat_341 (Soybase 2011). However, unless breeding programs work for specific environments, environmentally stable QTL are usually more useful for marker-assisted selection.

Seven and two QTL were associated with seed yield in two of three mega-environments in populations 1 and 2, respectively. Four of them were previously reported by Palomeque et al. (2009a) as universal QTL in the population 1 in the Chinese and in the Canadian environments. However, in this study, those QTL were only identified in the Canadian and in the US mega-environments. In particular, Satt259 was detected by the same authors (Palomeque et al. 2009a) as a specific seed yield QTL in the Canadian mega-environment while in this study it was found also in the US. Among the markers that were detected in two of the three mega-environments, the QTL tagged by Satt100 and by Satt005–Satt600 were the ones that explained most of the variation observed in seed yield in the population 1 and 2, respectively. Yield QTL Satt100 also co-localized with other agronomic traits. Similar results were reported for the same population by (Palomeque et al. 2009a, b). In the second RIL group, Satt005–Satt600 was previously associated with partial resistance to Phytophthora sojae (Burnham et al. 2003). In both cases, the high-yielding allele came from the PI-derived parent, which demonstrates once again that high-yielding alleles can be successfully introduced without penalty.

In general, different QTL were found in particular populations due to differences in their genetic backgrounds (Brummer et al. 1997). However, QTL are generally more useful for breeders if they are identified across different populations. Fasoula et al. (2004) supported the necessity of validating QTL in different populations to increase the spectrum of application for such markers in breeding programs. The universal and specific QTL that Palomeque et al. (2009a, b) found in the population OAC Millennium × Heinong, were not validated in population 2, Pioneer 9071 and #8902, even though the parental lines in both groups were Canadian and Chinese elite lines. Since in the current study both populations were evaluated with the same markers, we identified two seed yield QTL, Satt600 and Satt263 in both genetic backgrounds. QTL tagged by marker Satt263 was previously associated with nitrogen accumulation at growth stage R5 and R6 (Panthee et al. 2004) and with seed weight (Soybase 2011). In addition, it is important for breeders to give special attention to the region tagged by markers Satt277, Satt365 and Satt100 on chromosome 6 (LG C2), as in both populations this genomic region carried alleles with important yield effects. Furthermore, seed yield and other agronomic traits QTL were previously reported in this region in different genetic backgrounds (Orf et al. 1999a; Smalley et al. 2004; Reinprecht et al. 2006; Palomeque et al. 2009a). QTL tested at different genetic backgrounds provide reliable estimation of marker limitations and strengths for use in marker-assisted selection.

Our results provide a better understanding of how QTL behave under specific environments relative to the adaptation area of each parent. We attempted to determine the value of these markers/QTL in breeding programs for further studies based on their genetic and environmental stability. Both the environmental and genetic stability of QTL play an important role in breeding as they could affect the success of introgression of exotic alleles into adapted germplasm. In addition, most of these seed yield QTL were also linked with some agronomic trait QTL. Results suggested that most of the variation observed in yield could be explained by the variation observed in different agronomic traits. For all these markers, both parents contributed their respective alleles to the progeny and the ratio varied between populations. The importance of both gene pools, Canadian and Chinese, is denoted in this study. They represent a special source of beneficial alleles that could contribute directly or indirectly to increase variability and lead to further increments in soybean yields both in North America and in China. Further studies should focus on inter-mating different lines from these major sources with the objective of discovering new recombination events that could benefit soybean breeding programs.