Introduction

Flowering represents the transition from the vegetative to reproductive phase in plants and is influenced by many factors (Levy and Dean 1998). One of the important cue is the photoperiod. Soybean [Glycine max (L.) Merr.] is sensitive to photoperiod, which makes each cultivar is restricted to a very narrow range of latitudes (Pooprompan et al. 2006). Widely adaptable soybean cultivars have been created by natural variation in the major genes and quantitative trait loci (QTLs) controlling flowering. By classic methods, ten major genes (E1-E9, and J) controlling flowering and maturity time have been characterized in soybean (Bernard 1971; Buzzell 1971; Buzzell and Voldeng 1980; McBlain and Bernard 1987; Ray et al. 1995; Bonato and Vello 1999; Cober and Voldeng 2001a, b; Cober and Morrison 2010; Kong et al. 2014). Among these genes, E1 has been cloned by a map-based approach and identified as a legume-specific transcription factor with a putative nuclear localization signal and a domain distantly related to the B3 domain (Xia et al. 2012), and E2 has been identified as a soybean ortholog of the Arabidopsis GIGANTEA gene (Watanabe et al. 2011). E3 has been confirmed as a phyA homolog by fine-mapping around a QTL for flowering time (qFT3) (Watanabe et al. 2009). Liu et al. (2008) have concluded that the E4 gene also encodes a soybean phyA protein and that the recessive e4 allele is a loss-of-function allele caused by the insertion of a Ty1/copia-like retrotransposon. In cultivated soybean, there are at least three mutated alleles in the E1 gene (Xia et al. 2012), four in the E3 gene (Xu et al. 2013) and six in the E4 gene (Tsubokura et al. 2013). The diversity of the allelic variations and the different allelic combinations of the E1, E3 and E4 genes condition soybean flowering time, post-flowering responses and photoperiod insensitivity and greatly contribute to the wide adaptation of soybean (Xu et al. 2013; Jiang et al. 2014). In addition to these cloned maturity genes, among the more than ten copies of the FLOWERING LOCUS T (FT) homolog in the soybean genome, two homologs, GmFT2a and GmFT5a, have been found to encode components of ‘‘florigen’’, the mobile flowering promotion signal that is involved in the transition to flowering, and these two FT homologs coordinately control flowering in soybean (Kong et al. 2010). GmFT2a and GmFT5a redundantly and differentially regulate flowering through interactions with the bZIP transcription factor, GmFDL19, for the subsequent up-regulation of this protein in soybean (Nan et al. 2014). The E1, E2, E3 and E4 maturity genes have been shown to down-regulate GmFT2a and GmFT5a expression to delay flowering and maturation under LD conditions in soybean, suggesting that GmFT2a and GmFT5a are the soybean flowering integrators and major flowering regulation targets (Kong et al. 2010; Thakare et al. 2011; Watanabe et al. 2011).

In previous research, the genes mentioned above (E1, E2, E3, E4, GmFT2a and GmFT5a) were shown to play an important role only in LD photoperiod. It is known that soybean is a short-day (SD) plant, and most cultivars have a SD requirement for floral induction. When soybean cultivars are grown under SD conditions, cultivars with sensitivity to photoperiod flower early, result in low grain yield, and consequently limit the growing area. It is therefore important to research the genetic control on delaying flowering time under SD environments. This trait was termed the “long-juvenile” (LJ) trait (Parvez and Gardner 1987; Sinclair and Hinson 1992; Ray et al. 1995). The LJ trait plays a pivotal role in extending the range of adaptation of soybean cultivars to lower latitudes and to new management schemes with shifted sowing dates in tropical countries. It has been reported that the northward expansion of soybean production in South America, where more extensive research has been performed, is dependent on the LJ trait (Spehar 1995). However, the genetic control mechanism for this trait remains elusive. Two genes, J and E6, had been reported to play important role in LJ trait (Ray et al. 1995; Bonato and Vello 1999). The single locus J has been identified in a number of crosses with PI 159925 (Ray et al. 1995). The single locus E6 is created by natural variation in ‘Paraná’, and finally produces the long-juvenile ‘Paranagoiana’ (Bonato and Vello 1999). Recently, an F2 population resulting from a cross between conventional juvenile (CJ) lines OT94-47 and the LJ line Paranagoiana exhibited a 15:1 early:late flowering ratio in 12 h photoperiods. A similar 15:1 ratio was observed in offspring of a cross between CJ line OT94-47 and the LJ line PI 159925. These results suggest that the LJ trait is conditioned by at least two recessive alleles in PI 159925 and Paranagoiana (Cober 2011). Further studies of LJ parents have shown that recessive alleles at two or three loci control the long-juvenile trait (Carpentieri-Pípolo et al. 2000, 2002). Though so many researched had been conducted on LJ trait, but only one gene, J, has been mapped to the soybean linkage group Gm 04 between the SSR markers Sat_337 and Satt396, where the genetic distance between the J allele and the closet marker Sat_337 is 0.7 cM (Cairo et al. 2002, 2009).

In addition to these major genes, many QTLs controlling flowering time have been reported (Keim et al. 1990; Lee et al. 1996; Tasma et al. 2001; Chapman et al. 2003; Funatsuki et al. 2005; Liu et al. 2007; Khan et al. 2008; Liu and Abe 2010; Cheng et al. 2011). Some of these QTLs most likely correspond with one of the known major genes, such as E1, E2, E3, E4, or E8 (Watanabe et al. 2004; Funatsuki et al. 2005; Githiri et al. 2007; Khan et al. 2008; Liu and Abe 2010; Cheng et al. 2011), while the others are described in the SoyBase database (http://soybase.org/). In addition to affecting flowering and maturity, the major genes and QTLs for flowering often influence agronomic traits, including plant height and yield (Lee et al. 1996; Chapman et al. 2003; Cober and Morrison 2010), degree of cleistogamy (Khan et al. 2008), seed coat pigmentation, and cracking caused by chilling stress (Takahashi and Abe 1999; Githiri et al. 2007). Therefore, the understanding of QTLs at the molecular level and their interactions with environmental factors will help to optimize the genotypic combinations that lead to higher or more stable yields during the cropping season in a particular region.

The objectives of the present study were as follows: (1) to identify QTLs associated with soybean flowering time using a recombinant inbred line (RIL) population exposed to different environments; (2) to identify QTLs associated with the LJ trait under different SD environments; and (3) to analyze the interactions between QTLs and the environments.

Materials and methods

Plant materials

A population of 91 F9 soybean RILs obtained by single seed descent (SSD) from a cross between AGS292 and K3 was used. The vegetable soybean cultivar AGS292 was a pure line selected from the Japanese cultivar ‘Taishoshiroge’ by the AVRDC (the World Vegetable Center, Taiwan). K3 was a grain soybean that delayed flowering than AGS292. It was a pure line derived by pedigree selection from a cross between ‘G8891’ and ‘G7945’ (both were obtained from the AVRDC collection) by the soybean breeding project of Kasetsart University, Thailand.

Field observation

Seeds from each RIL and the parents were planted at Kasetsart University, Kamphaeng Saen Campus, Nakhon Pathom Province, Thailand (13°82′N, 100°04′E). Field trials were carried out over two seasons (rainy and dry) and two years (August 2004–February 2005 and August 2010–February 2011). The plot was located between Equator and the Tropic of Cancer, where belonged to low latitudes, so is considered a SD environment.

The RILs were also grown under LD conditions in Japan and China. In Japan, seeds were sown in June of 2010 and 2011 in the research field of the National Institute of Agrobiological Sciences at Tsukuba (36°02′N, 140°11′E) and in May of 2010 in the field of Hokkaido University, Sapporo (43°07′N, 141°39′E). In China, the seeds were sown in May of 2010 in the field of the Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Harbin (45°44′N, 126°36′E) and in June of 2011 in the field of Shandong Normal University at Jinan (36°40′N, 117°00′E). These plots were located north of the Tropic of Cancer, where belonged to mid-latitude regions, so were considered LD environments.

In total, the QTLs were analyzed in nine different environments. On each of the nine experimental occasions, all 91 lines, together with their parents AGS292 and K3, were grown in three fully randomized block replications. Every block contained all 91 lines and parents. Each individual was sampled for analysis of the phenotypic parameter flowering time (R1), which was defined as the time from emergence to the opening of the first flower (Fehr et al. 1971). Flowering times were tested for deviations from normality using the parameters of kurtosis and skewness by SPSS 16.0 software (SPSS Inc., Chicago, IL, USA).

DNA isolation and molecular marker analysis

DNA was extracted from the young leaves of each RIL and the parents following a previously described method (Doyle et al. 1990). SSR analysis was built on using primers selected from an integrated soybean genetic linkage map (Cregan et al. 1999; Song et al. 2004; Hyten et al. 2010). The SSR primer sequences were obtained from the SoyBase web site of the USDA, ARS Soybean Genome Database (http://soybase.agron.iastate.edu/). In addition, we developed 52 promoter-specific indel (PSI) and 9 exon-specific indel (ESI) markers (Table S1). Five allele-specific markers for E2 (Watanabe et al. 2011), E3 (Xu et al. 2013), E4 (Liu et al. 2008), FT2a and FT3a (Kong et al. 2010) were also used. The polymerase chain reaction (PCR) mixture contained 30 ng of total genomic DNA, 0.25 µM of 5′ and 3′ primers, 200 µM of each dNTP, 0.5 U of Taq polymerase (TaKaRa, Otsu, Japan) and 1 × PCR buffer (10 mM Tris–HCl, pH 8.3, 50 mM KCl, and 1.5 mM MgCl2) in a total volume of 20 µL. PCR was performed with a GeneAmp PCR System 9700 (Perkin Elmer/Applied Biosystems, Foster City, CA, USA) using the following program: 94 °C for 5 min, followed by 35 cycles of 30 s at 94 °C, 30 s at 48 °C, and 30 s at 72 °C, and a final step of 5 min at 72 °C. PCR products were separated on a 6 % denatured polyacrylamide gel (PAGE) by electrophoresis.

Genetic linkage map construction

In total, 338 polymorphic and informative markers, including 52 PSI, 9 ESI, 5 allele-specific and 272 SSR markers, were chosen as anchors to construct the linkage map covering all 20 linkage groups. Marker order and distance were determined by Map Manager program QTXb20 ( http://mapmgr.roswellpark.org/mapmgr.html) using the Kosambi function and a criterion of 0.001 probability (d.f. = 1). Most of the markers were assigned to the 20 linkage groups as expected from the integrated map (Cregan et al. 1999; Song et al. 2004). Finally, we used Mapchart 2.1 to draw the linkage groups (Voorrips 2002).

Statistical analysis and QTL identification

Two models were used to detect QTLs and analyze the interactions between the QTLs and the environments: the multiple QTL model (MQM), implemented by MapQTL 5.0 (Van Ooijen 2004), and mixed linear-based composite interval mapping (MCIM), implemented by QTLNetwork 2.1 (Yang et al. 2008).

For the MQM, a LOD score of 3.0 was used as a minimum to declare the significance of a QTL in a particular genomic region. 1000 permutations at a 0.05 probability were also conducted to identify the genome-wide LOD (Churchill and Doerge 1994). QTLs with a LOD score exceeding the genome-wide LOD were declared as significant QTLs, whereas the other QTLs with LOD less than the genome-wide LOD but more than 3.0 were identified as suggestive QTLs.

MCIM was used to map QTLs with additive and epistatic effects as well as their interactions with the environments (additive by environment and epistatic by environment). This analysis was performed using a 2D genome scan, with a 1-cM walking speed and 10-cM window size. Significant thresholds (critical F-values) for QTL detection were calculated with 1000 permutations and a genome-wide error rate of 0.05.

Results

Phenotypic analysis

For different environments, the average flowering time for each RIL was used to analyze the segregation pattern (Table 1). We found that the skewness and kurtosis values of different environments deviated slightly from zero, except for the 2010 dry season in Thailand. These results show that the segregation pattern of this trait under different environments fits the normal distribution model and the RILs can be used for genetic map construction and QTL identification. The RILs under LD conditions flowered significantly later than those under SD conditions. The flowering time of the RILs grown at Harbin was the longest of the nine environments, which may be attributed to its high latitude (45°N).

Table 1 Statistical analysis of the flowering times of recombinant inbred lines (RILs) in multiple environments

Construction of genetic linkage map

Using polymorphic 338 markers, a genetic linkage map covering 2546.7 cM was constructed using the Kosambi function (Figure S1). The main marker type contributing to this linkage map was the SSR markers, while the linkage gaps between the SSR markers were bridged by indel PSI and ESI markers. However, the Gm 18 chromosome still lacked polymorphic markers and was divided into Gm 18-1 and Gm 18-2. The map length is approximately consistent with the currently known recombination distance of 2524 cM in the integrated soybean linkage map (Cregan et al. 1999; Song et al. 2004). The marker order of our map was in good accordance with that of the integrated map with only slight differences. However, all of the discordant marker orders occurred within 5 cM of their respective orders on the integrated map.

QTL identification of LJ trait under SD conditions

Under the four SD environments, a total of six QTLs was detected by the MQM (Table 2). They were distributed over four linkage groups and explained 15.2–35.4 % of the phenotypic variation. The additive effect for qFT-F was positive, which indicated that the positive allele for this QTL originated from AGS292. The other five QTLs originated from K3, i.e. they delayed flowering time under SD and were associated with the LJ trait. Among the six QTLs, only qFT-J-2 significantly (p < 0.05) affected flowering time as shown by genome-wide analyses with permutation tests for two rainy environments. It accounted for 34.4 and 35.4 % of the total variances observed for the two environments (Table 2). When we used MCIM to detect QTLs for single SD environment at a 0.001 significant probability level, only qFT-J-2 and qFT-O were detected, and the other four were missed (Table S2). qFT-J-2 was consistently detected under different SD environments by both MCIM and MQM approaches, suggesting that it is the major QTL conditioning LJ trait in this RIL population.

Table 2 Identification of main-effect QTLs for single environment by multiple QTL mapping (MQM), implemented by MapQTL 5.0

QTL identification of flowering time under LD conditions

Under the five LD environments, two QTLs, qFT-O and qFT-L, were identified by the MQM (Table 2). They were located near the allele-specific markers for E2 and E3, respectively, and explained 18.1–50.5 % of the phenotypic variance, with additive effects ranging from 4.32 to 12.15, suggesting that these two QTLs may be attributed to the E2 and E3 loci. Either the qFT-L or the E3 locus affected flowering time as shown by genome-wide analyses of all five LD environments with permutation tests. When we detected QTLs by MCIM at a 0.001 significant probability level, in addition to qFT-O and q-FT-L, qFT-I was also identified. qFT-I was located near the allele-specific markers for E4 suggesting that qFT-I may be conferred by the E4 locus; it was found to exist in four LD environments except at Jinan in 2011 (Table S2). Using allele-specific markers of E1, E2, E3 and E4 genes, the genotypes at these four loci of the two parents AGS292 and K3 were identified as E1e2e3e4 and E1E2E3E4, respectively. The genotyping results confirmed that flowering QTLs qFT-O, qFT-L and qFT-I were conditioned by E2, E3 and E4 genes, respectively. Our results also suggest that the two approaches for detecting QTL, MCIM and MQM, can complement each other to pyramid QTLs in RIL population. qFT-I had an epistatic effect with qFT-L in four LD environments (Table S3). This epistasis contributed 2.45–8.58 days to the flowering time and accounted for 5.63–12.09 % of the phenotypic variance.

QTLs with additive and additive-by-environment interaction effects under nine environments

In order to analyze the interactions between QTLs and environments, we performed a conjoint analysis. Compared with the single environment analysis, we detected four additional minor QTLs: qFT-B2-1, qFT-C1-1, qFT-D2 and qFT-J-4 (Table 3). These four QTLs demonstrated weak interactions with the environment. The other five QTLs, which were also detected in the single environment analysis, displayed additive-by-environment interaction effects with multiple environments. These additive-by-environment interaction effects were opposite between the LD and SD environments, which indicated that the environments had different roles on the genes for these QTLs (LD and SD). Of the nine QTLs, the qFT-L or E3 locus was responsible for the largest phenotypic variation due to both additive and additive-by-environment effects, and the heritability of the additive effect was higher than that of the additive-by-environment effect, which showed that genotypic background had a greater effect on this QTL than the environment.

Table 3 QTLs with additive effects and additive-by-environment interaction effects detected in nine environments

QTLs with epistasis and epistasis-by-environment interaction effects for nine environments

Nine pairs of QTLs with epistatic effects were detected (Table 4). Among these effects, the epistasis occurring between qFT-I and qFT-L was the largest, contributing 2.26 days to the delayed flowering time and accounting for 1.96 % of the phenotypic variance by epistasis in multiple environments. This epistasis also had significant interaction effects with five environments (p < 0.001) (Table 4). We detected three other QTLs by epistatic effects only: qFT-B2-2, qFT-K and qFT-C1-2. These results indicate that analysis of the interactions between the environment and the QTLs allowed for the detection of more minor QTLs.

Table 4 QTLs with epistatic effects and epistasis-by-environment interaction effects detected in multiple environments

Discussion

QTLs for LJ trait

Delayed flowering and maturity time under SD conditions in soybean was termed the LJ trait (Hartwig and Kiihl 1979; Ray et al. 1995; Spehar 1995). This trait is especially important for extending the range of adaptation of soybean to lower latitudes and to new management schemes with shifted sowing dates to increase soybean productivity in such regions (Hartwig and Kiihl 1979; Ray et al. 1995; Spehar 1995). To date, there are few reports of the detection of LJ QTLs through multiple environments using RILs (Liu et al. 2011). In our study, we grew RILs under four SD environments and identified five QTLs for the LJ trait, including qFT-O, qFT-J-1, qFT-J-2, qFT-J-3 and qFT-L, in which all the alleles originating from K3 delayed flowering time and were considered to condition the LJ trait. Among the five LJ QTLs, qFT-O, qFT-J-1, qFT-J-2 and qFT-L were localized to the regions near the allele–specific DNA markers for E2, GmFT5a, GmFT2a and E3, respectively (Watanabe et al. 2009; Kong et al. 2010; Watanabe et al. 2011). Previous research suggests that maturity genes E2, E3 and E4 do not have any effect on flowering time and maturity under SD conditions (Cober et al. 1996). Surprisingly, we found that qFT-O (E2 gene) and qFT-L (E3 gene) can be detected under a SD environment and in association with the LJ trait (Table 2). To our knowledge, this is the first report that the E2 and E3 genes condition flowering time (or the LJ trait) under SD conditions. In addition, while in other genetic models the recessive allele conditioned the LJ trait (Carpentieri-Pípolo et al. 2000, 2002), the dominant allele from the E2 and E3 loci conditioned the LJ trait in our study. Further study is needed to confirm this new finding. qFT-J-1 and qFT-J-2 mapped very tightly to allele-specific markers of GmFT5a and GmFT2a, the two florigens of soybean (Kong et al. 2010), suggesting that GmFT5a and GmFT2a may condition the LJ trait in soybean.

To minimum the influence of environmental factors affecting flowering time of the LJ trait, the 91 RILs and the parental lines were grown at 25 °C under SD conditions (12L/12D) with three replications in growth chambers. Any three of the seeds were grown in one plant pot and all of the plant pots were randomly placed. The flowering time for every seed was detected. The phenotypic data for them can find in table S4. The four QTLs qFT-J-1, qFT-J-2, qFT-J-3 and qFT-L could also be detected by the MQM (Table S5). These results confirm that these four QTLs, particularly those located in association with the E3 locus, were truly present under a SD environment in both indoor and outdoor conditions. The interval for qFT-J-2 and qFT-J-3 had already been found to be associated with flowering time in previous studies (Tasma et al. 2001; Pooprompan et al. 2006). It will be of great interest to perform fine mapping to further elucidate the underlying genetic mechanisms of these QTLs.

Relationships between QTLs and the environments

The results of single environment analysis do not always provide valid predictions of the effects of QTLs controlling a target trait. Analysis by MCIM has been proven to be effective for detecting minor-effect QTLs in a variety of crops (Wang et al. 1999; Gutierrez-Gonzalez et al. 2009, 2010; Xu et al. 2014). In the present study, we used multiple environments to perform an integrated analysis by MCIM, identifying nine additive QTLs. Compared with single environment analysis, four additional QTLs (qFT-B2-1, qFT-C1-1, qFT-D2 and qFT-J-4) were detected and had little interactions with the environments. qFT-B2-1 was located near the marker Satt467. In the SoyBase database (http://soybase.org/), there was only one QTL for flowering time near the marker Satt534 on Gm 14. Compared with the integrated soybean linkage map, Satt467 is located at 19.17 cM and Satt534 is located at 75.73 cM (Hyten et al. 2010). They were far from each other, therefore, qFT-B2-1 may be a new QTL for flowering time (Reinprecht et al. 2006).

Using multiple environments to perform integrated analysis by MCIM not only greatly facilitated the detection of QTLs but also allowed for the identification of epistatic and epistasis-by-environment interaction effects. These results further elucidate the mechanisms underlying the genetic control of flowering time. In this study, three QTLs (qFT-B2-2, qFT-K and qFT-C1-2) were detected with only epistatic effects (Table 4). qFT-C1-2 was located between Sat_140 and GMES0780. This interval was very close to the marker Sat_337, which harbored the J allele that is associated with the LJ trait (Cairo et al. 2009). Thus, it was clear that analysis of interactions between the QTLs and the environments facilitated the detection of QTLs. None of these QTLs had major effects, but they were able to influence flowering time through interactions with other loci, an observation in accordance with those reported by Jannink (2007). Furthermore, for qFT-K, no QTL associated with flowering has been previously identified (Li et al. 2010; Ha et al. 2012).

Conclusions

The objective of this study was to identify QTLs associated with flowering time, especially for the LJ trait. Under SD environments, we identified a total of six QTLs. Of these, qFT-F has not been previously reported, suggesting that it is a novel QTL for flowering time. The other five QTLs originated form K3 and were associated with the LJ trait. Among the five LJ QTLs, four QTLs, qFT-J-1, qFT-J-2, qFT-O and qFT-L, may control the known genes GmFT5a, GmFT2a, E2 and E3, and this is the first report that these genes may be associated with the LJ trait. Additional studies are necessary to confirm these new findings. In addition, we also identified five QTLs (qFT-B2-1, qFT-C1-1, qFT-K, qFT-D2 and qFT-F) by the integrated analysis, which may represent novel flowering time genes.

In conclusion, our research provides insights into the mechanisms of flowering time, especially with regard to the LJ trait. The information obtained from our findings will facilitate gene cloning and functional elucidation for soybean molecular breeding under different environmental conditions.