Introduction

A narrow genetic base for domesticated Upland cotton (Gossypium hirsutum L.) exists in the major cotton producing countries such as U.S. (Van Esbroeck et al. 1998, Bowman, 2000), China (Liu et al. 2003), Australia (Multani and Lyon 1995), India (Rana and Bhat 2005), and Pakistan (Rahman et al. 2002). The narrow genetic base of U.S. domesticated cotton is due to a high degree of similarity among ancestral cultivars and limited sources of wild germplasm (Van Esbroeck et al. 1999) and may increase the crop’s vulnerability to pests, diseases, and environmental stresses (Holley and Goodman 1989).

The narrow genetic base has been considered as one of the major obstacles in cotton improvement (Meredith 2000). The utilization of primitive G. hirsutum germplasm is an important approach to acquire new alleles and broaden cotton’s genetic base. Perennial, primitive accessions of cotton have been described as an important source of useful genetic variability for cotton genetic improvement and may limit its vulnerability to pests, diseases, and environmental stresses (Percival 1987; Meredith 1991; McCarty and Jenkins 1993, 2004; McCarty et al. 1995, 2004a, b). Primitive G. hirsutum accessions have been reported to carry valuable alleles for resistance to cotton pests (Jenkins 1986) and pathogens (Haley and Wilhelm 1975; Percival 1987), variability for seed-oil content (Kohel 1978), and high fiber quality (McCarty et al. 2006). However, most primitive accessions are photoperiod sensitive; they do not flower under the long days of the U.S. cotton belt. Thus the photoperiod sensitivity is a major obstacle in the use of primitive germplasm in cotton breeding programs.

A phenotypic backcross breeding program has been undertaking by USDA-ARS to incorporate day-neutral genes into the primitive accessions for their utilization in breeding (McCarty et al. 1979). However, some day neutral converted accessions showed lower genetic diversity than their wild photoperiodic counterparts by molecular markers assay (Liu et al. 2000; Zhong et al. 2002). Linkage drags during backcrossing in the day neutral conversion program hampered the full recovery of alleles from the photoperiodic parent.

Few systematic genetic studies have been conducted for flower initiation in cotton. Although conventional genetic analysis concluded the inheritance of flowering response in G. hirsutum followed a multigenic pattern, the inheritance mode and the number of genes involved depended on the materials used (Lewis and Richmond 1957, 1960; Waddle et al. 1961; Kohel et al. 1974). Given the complex character of flower initiation in plants studied in detail (Koester et al. 1993; Lin et al. 1995; Yano et al. 1997; Börner et al. 2002; Komeda 2004), it is reasonable to hypothesize that multiple QTLs may jointly determine cotton flower initiation. Since the knowledge of the flowering mechanisms of cotton at the molecular level is limited and no genes/QTLs for flowering in Upland cotton have been localized on specific chromosomes in previous studies, such a study should provide insight to this vital function.

Node of first fruiting branch (NFB) is an indication of flowering time and a measure of relative photoperiodism. In cotton, it reflects the flowering character of stocks. Hutchinson (1959) concluded that fruiting branches are not formed until a certain number of nodes develop on the main stem. Ray and Richmond (1966) studied various morphological measures of earliness in cotton and considered NFB as the most reliable and practical measurement. Moreover, Low et al. (1969) suggested using the main-stem NFB as a criterion to measure earliness in consideration of its good heritability and correlation with final picking.

The objective of this study was to determine cotton flowering time QTLs reflected by node of first fruiting branch (NFB) in a primitive accession Texas 701(T701). This research provides the first investigation using QTL mapping for flowering time (NFB) in cotton.

Materials and methods

Plant materials and DNA isolation

Upland cotton cultivar ‘Deltapine 61’ (DPL61) (PI No. 607174) was crossed as female parent with short-day accession ‘Texas 701’ (T701) (PI No. 165329), at a cotton winter nursery in Tecoman, Mexico. T701 is a photoperiod sensitive accession, which does not produce flowers in Mississippi under the normal photoperiod of the cotton growing season, but flowers in Central Mexico. DPL61 is a day-neutral cultivar released by Delta and Pine Land Company. The F1 generation was self-pollinated at the winter nursery in Tecoman, Mexico. The F2 generation was planted in the second week of May at Mississippi State, MS (33.4 N, 88.8 W) in year 2000 following standard agronomic practices. The population contains 251 individuals and segregated for flowering response. The segregation of leaf (L 2 S) shape (abbreviated as LS), petal spot (R 2) (abbreviated as PS), and petal color (Y 1) (abbreviated as PC) of the F2 population were scored on Aug 21st, 2000 as Mendelian traits. Node of first fruiting branch (NFB) was classified for each plant on October 17th, 2000. NFB is the first node above the cotyledon node on the mainstem that a fruiting branch develops and produces fruiting buds that develop into flowers and fruit. Mauney (1986) reviewed the development of cotton fruiting structures. The first fruiting branch (sympodial) usually develops at mainstem node 6 to 7 and a fruiting branch will continue to arise at each vertical mainstem node for the remainder of the growing season for commercial Upland cotton. Genomic DNA was isolated from frozen young leaf samples using DNeasy Mini Kit (QIAGEN, Chatsworth, CA, USA) following the manufacture’s protocol.

Markers assay

A total of 1165 SSR primer pairs were used for parental lines polymorphism screening. These SSR primers included BNL (Research Genetics Co., Huntsville, AL, USA), JESPR (Reddy et al. 2001), MGHES (Qureshi et al. 2004), TMB (provided by USDA-ARS, Crops Germplasm Research Unit, College Station, TX), CIR (Nguyen et al. 2004), and NAU (Han et al. 2006). The marker acronym followed Nguyen et al. (2004). Capillary electrophoresis analysis was conducted with an ABI 3100 genetic analyzer (Applied Biosystems, Foster City, CA, USA) using fluorescently labeled SSR markers according to Gutierrez et al. (2002).

Selective determination of marker-trait association

The purpose of selective determination was to identify NFB tightly related markers among all the polymorphic ones. Two sub-groups each consisting of 15 (6%) F2 plants were selected to represent the two extreme groups, photoperiod sensitive and day neutral. All the 222 polymorphic primer pairs were screened against individuals within both groups. Pearson Correlation coefficients between genotyping and day neutrality trait were calculated by SAS 9.0 (SAS institute Inc., 1999) with the command ‘PROC CORR’. Markers with a P value of less than 0.05 were first chosen to screen the whole F2 population, and then the markers with P value of less than 0.1, less than 0.2, were used orderly to screen the full F2 population. After the construction of a tentative linkage map and the estimation of putative QTL positions and effects, more markers on the specific putative QTLs located chromosomes were added for genotyping.

Linkage and QTL analysis

Genetic linkage map was constructed by JOINMAP3.0 (Van Ooijen and Voorrips 2001). A minimum LOD score (log10 of the likelihood odds ratio) of 5 was set as a threshold to allocate marker locus into linkage groups, and a maximum recombination fraction of 0.40 was employed as general linkage criteria to establish linkage groups. The Kosambi function was used to order markers and to estimate map unit distances (Kosambi 1944). Segregation distortion at each marker locus was tested against the expected segregation ratios (1:2:1 for co-dominant markers and 3:1 for dominant markers) using a chi-square goodness of fit test. Linkage groups chromosomal assignment were achieved by CMD (Cotton Microsatellite Database, http://www.cottonmarker.org) inquiry, comparison to the published integrated molecular maps (Nguyen et al. 2004; Lacape et al. 2005; Han et al. 2006; Wang et al. 2006b; Guo et al. 2007), and our unpublished data.

Since the NFB distribution in the segregating population deviated from normality, several methodologies were used for trait and marker association analyses. Interval mapping, multiple QTL mapping (MQM), and non-parametric genome scan based on the Kruskal–Wallis (KW) analysis were performed with MAPQTL5.0 (Van Ooijen 2004). For interval mapping, genome wide LOD significance threshold (3.8) were calculated by 1,000 × permutation test (Van Ooijen, 1999), which restricted the occurrence of Type I statistical errors (false positives) to less than 5%. A suggestive QTL had LR value above 9.2 (equal to LOD score of 2.0) according to Lander and Kruglyak (1995). Markers with the highest LOD values were then used in various combinations as co-factors in multiple QTL models in MQM mapping. QTL graphs were made by MAPCHART 2.2 (Voorrips 2002). We followed the typical QTL nomenclature (McCouch et al. 1997) which is a designation of ‘q’ followed by an abbreviation of a trait name (NFB), then the chromosome on which it was located, and the number of the detected QTLs affecting the trait on the chromosome. All the markers on chromosome 15, 16, 21 and 25 were further tested for their association with NFB and epistasis using regression analyses by ‘PROC GLM’ command of SAS 9.0 (SAS Institute Inc., 1999).

Results

Performance of NFB

NFB for DPL61 ranged from 6 to 8 while T701 did not set any fruiting branches by harvest. The F1 plants had NFB ranging from 13 to 16 (setting flowers at main stem node 13–16) indicating NFB was controlled by partially dominant genes. In the F2 population, NFB ranged from 7 to 24 with a mean of 21 and standard deviation of 6.6. There were 15 (6%) plants with NFB from 7 to 9, 25 (10%) plants from 10 to 12, 63 (25%) from 13 to 24, and 148 (59%) plants that did not set any flowers by harvest. The distribution of NFB in the F2 population was skewed toward the photoperiod sensitive parent (T701). The skewness was −0.64, and the kurtosis was −1.29. Transgressive segregation was not observed. Plants with NFB ≤ 12 or 13 are generally considered as day neutral (the flower will set bolls which could be harvested by frost). Either NFB ≤ 12 or NFB ≤ 13, the segregation ratio carried by χ2 test indicated that at least two genes with epistasis controlling the NFB.

Selective determination of marker-trait association

Selective 10–20% of the population for genotyping is an effective and efficient way for QTL detection (Darvasi and Soller 1992; Barrett 2002). This procedure was employed to seek the most NFB related markers, increase power of QTL detection, and reduce cost. The phenotypic extreme plants used for selective determination consisted of about 12% of the population. Markers detected at significant level P < 0.01, 0.01 < P < 0.05, 0.05 < P < 0.1, and 0.1 < P < 0.2 were 7, 19, 21, and 32, respectively. All 47 markers significant at P < 0.1 level detected by selective determination were significant at P < 0.05 level in the full set population with nonparametric KW test. In addition, nine markers detected by selective determination by 0.1 < P < 0.2 significant level were also significant at P < 0.05 level by using the full population with nonparametric KW test (data not shown). This step ensured most of the NFB related markers were used for constructing the linkage map.

Linkage map construction

Ninety four SSR markers and three morphological markers were used to construct the genetic linkage map. Among them, 15 markers had distorted segregation (9 at α = 0.05 level, 1 at 0.01 level, 5 at 0.001 level). Seventy SSR makers and three morphological makers coalesced in 17 linkage groups covering 650.8 cM (centimorgans), approximately 14.5% of the cotton genome (Rong et al. 2004). The 17 linkage groups were assigned to 14 chromosomes/linkage groups. The marker order of our linkage map agreed with previous published maps (Jiang et al. 2000; Nguyen et al. 2004; Lacape et al. 2005; Han et al. 2006; Guo et al. 2007).

QTL analysis for NFB

The location of significant QTLs and their flanking markers, LOD-1 and LOD-2 confidence intervals are presented in Table 1 and Fig. 1. By interval mapping, three significant QTLs for NFB were found on chromosome 16, 21, and 25, explaining 9.1, 19.3, and 10.2% of the phenotypic variation, with additive effect of −2.73, −3.39, and −2.37, respectively. Another two suggestive QTLs with small effects were detected on chromosome 15 and 16. Their LOD values were between 2.5 and 3.8, explained 8.1 and 4.9% of the phenotypic variation, with additive effects of 2.28 and −1.89, respectively. No QTLs were found within the A subgenome (chromosomes 1–13). MQM mapping results confirmed the three QTLs detected by interval mapping, except the interval for each QTL became much narrower (Table 1 and Fig 1.). The location of qNFB-c21-1 remained the same as interval mapping, but explained 17.3% of the phenotypic variation. The location of qNFB-c16-1 and qNFB-c25-1 estimated by MQM mapping shifted 2 and 6 cM from that in interval mapping. The phenotypic variation explained change to 7.8 and 15.2%, respectively. Results from both interval mapping and MQM mapping indicated that most alleles originated from T701 appeared to increase NFB. In addition to QTL mapping, non-parametric KW analysis was used for detecting markers associated with NFB. Results showed that chromosome 15, 16, 21, and 25 contained far more NFB associated markers than the other chromosomes (Fig. 1).

Table 1 QTLs detected for node of first fruiting branch (NFB) by interval mapping and MQM mapping by MAPQTL 5.0
Fig. 1
figure 1

QTLs detected on chromosome15, 16, 21 and 25 for node of first fruiting branch (NFB) by interval mapping and Multiple QTL Mapping. LS2 represents super okra leaf shape (L 2 S); Stars indicate significant markers detected from KW non-parametric test at different significant level with *, **, and *** representing significant at P = 0.05, P = 0.01, and P = 0.001, respectively

Regression analysis

Of the markers used for single marker analysis, eight (Table 2) showed significant additive or dominant effects, and individually explained 4–11% of the phenotypic variation for NFB. Multiple marker regression analysis showed that four markers near the significant QTLs peak collectively explain 33% of the phenotypic variance for NFB (Table 2). They were JESPR102-110, JESPR251-72, BNL2812-146, and TMB0436-157. These markers could be tentatively used as the major indicator of flowering time in marker assistant selection (MAS). If we include the nearest markers to the two suggestive QTLs for multiple marker regression analysis, the total explained phenotypic variance would increase to 37%. Thus the two suggestive QTLs on chromosome 15 and 16 only contributed 4% of the phenotypic variance. Regression analysis also detected the epistasis interaction between two pairs of markers. One is JESPR102-110 (chromosome 16) and TMB0436-157 (chromosome 25); the other is JESPR102-110 (chromosome 16) and BNL2812-146 (chromosome 21). This suggested epistasis interactions might be present between qNFB-c16-1 and qNFB-c25-1 or between qNFB-c16-1 and qNFB-c21-1.

Table 2 Genetic effects and coefficients associated with single and multiple markers

Discussion

Genes controlling flowering time in cotton

Limited research had been conducted on cotton flower initiation. In G. barbadense L., the day neutral trait was controlled by one gene pair and the short-day non-flowering response gene was dominant to the flowering gene (Lewis and Richmond 1960). In G. hirsutum, flower initiation was controlled by a complex of genes (Lewis and Richmond 1957, 1960; Waddle et al. 1961), controlled by partially dominant (Lewis and Richmond 1957; Waddle et al. 1961), or recessive genes (Lewis and Richmond 1960). Segregation patterns varied within different accessions. The partially dominant genetic mechanism detected in our F2 population agreed with the conclusion from Lewis and Richmond (1960). Chi-square test suggested day neutrality may be primarily controlled by two genes, but QTL mapping results indicated at least three significant QTLs and two suggestive QTLs involved in main stem fruiting branch initiation. The differences may be due to some QTLs started taking effect on NFB initiation after node 12/13. This might indicate that flowering time is a process not simply controlled by several genes, but by multiple genes that may start their function at different cotton development stages. Also, these genes probably form a complex network to interact with each other, which is supported by the potential epistasis QTLs found in this study.

QTL scattering pattern may be responsible for linkage drag

Maintaining the highest recovery rate of accession parent allele is crucial to broaden the genetic base of Upland cotton in a day neutral conversion program. Zhong et al. (2002) used AFLP markers to study five populations (F6, BC1F6, BC2F6, BC3F6, BC4F6) of day neutral plants derived from crossing and backcrossing of four accessions. The results showed the recovery of markers from the exotic parent did not increase significantly during additional backcross generations. Many AFLP markers tended to stay together as linked blocks when selecting the day neutral phenotype, which leads to linkage drag. Liu et al. (2000) assessed ninety-six day neutral converted accessions by SSR markers. The graphic genotype indicated poor recovery for some accessions. All these indicated linkage drag might locate near genes controlling day neutrality in the donor parents.

Our significant QTLs for NFB are on chromosomes 16, 21, and 25. These chromosomes also host QTLs for fiber quality related traits. Compared with previously published papers for QTLs responsible for cotton fiber quality (Paterson et al. 2003; Mei et al. 2004; He et al. 2005, 2007; Lacape et al. 2005; Shen et al. 2005, 2006, 2007; Wang et al. 2006a), most of the QTLs for NFB (photoperiod sensitivity) were linked with some QTLs related with fiber quality. Lacape et al. (2005) proposed that in cotton, there might be QTL-rich regions along a chromosome congruent between mapping populations, generations and locations. Wang et al. (2006a) suggested that QTLs influencing the quality of cotton fiber might exist in clusters. If genes responsible for NFB and genes related to fiber quality are linked, the selection of day neutral plants during conversion of accessions to day neutral lines would also select fiber and yield genes from the day neutral parents, instead of from the photoperiod sensitive accessions. This may be the reason for linkage drag observed by Zhong et al. (2002) and the relatively stable character of fiber quality traits across backcrossing generations (McCarty et al. 1998). Simultaneous selection of day neutral plants with desired fiber quality by molecular markers assistance may facilitate achieving the aim to maintain the maximum genetic diversity in a day neutral conversion program. This study firstly provided the information on the inheritance of flowering time in cotton by QTL mapping. It may have the potential to accelerate the simultaneous selection of day neutral plants with desired fiber quality and yield.