Mapping-by-sequencing of Ligon-lintless-1 (Li 1 ) reveals a cluster of neighboring genes with correlated expression in developing fibers of Upland cotton (Gossypium hirsutum L.)

Thyssen, Gregory N.; Fang, David D.; Turley, Rickie B.; Florane, Christopher; Li, Ping; Naoumkina, Marina

doi:10.1007/s00122-015-2539-4

Mapping-by-sequencing of Ligon-lintless-1 (Li ₁) reveals a cluster of neighboring genes with correlated expression in developing fibers of Upland cotton (Gossypium hirsutum L.)

Original Paper
Published: 29 May 2015

Volume 128, pages 1703–1712, (2015)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Theoretical and Applied Genetics Aims and scope Submit manuscript

Mapping-by-sequencing of Ligon-lintless-1 (Li ₁) reveals a cluster of neighboring genes with correlated expression in developing fibers of Upland cotton (Gossypium hirsutum L.)

Download PDF

Gregory N. Thyssen¹,
David D. Fang¹,
Rickie B. Turley²,
Christopher Florane¹,
Ping Li¹ &
…
Marina Naoumkina¹

974 Accesses
16 Citations
Explore all metrics

Abstract

Key message

Mapping-by-sequencing and SNP marker analysis were used to fine map the Ligon-lintless-1 ( Li ₁ ) short fiber mutation in tetraploid cotton to a 255-kb region that contains 16 annotated proteins.

Abstract

The Ligon-lintless-1 (Li ₁) mutant of cotton (Gossypium hirsutum L.) has been studied as a model for cotton fiber development since its identification in 1929; however, the causative mutation has not been identified yet. Here we report the fine genetic mapping of the mutation to a 255-kb region that contains only 16 annotated genes in the reference Gossypium raimondii genome. We took advantage of the incompletely dominant dwarf vegetative phenotype to identify 100 mutants (Li ₁ /Li ₁) and 100 wild-type (li ₁ /li ₁) homozygotes from a mapping population of 2567 F₂ plants, which we bulked and deep sequenced. Since only homozygotes were sequenced, we were able to use a high stringency in SNP calling to rapidly narrow down the region harboring the Li ₁ locus, and designed subgenome-specific SNP markers to test the population. We characterized the expression of all sixteen genes in the region by RNA sequencing of elongating fibers and by RT-qPCR at seven time points spanning fiber development. One of the most highly expressed genes found in this interval in wild-type fiber cells is 40-fold under-expressed at the day of anthesis (DOA) in the mutant fiber cells. This gene is a major facilitator superfamily protein, part of the large family of proteins that includes auxin and sugar transporters. Interestingly, nearly all genes in this region were most highly expressed at DOA and showed a high degree of co-expression. Further characterization is required to determine if transport of hormones or carbohydrates is involved in both the dwarf and lintless phenotypes of Li ₁ plants.

Mapping-by-sequencing the locus of EMS-induced mutation responsible for tufted-fuzzless seed phenotype in cotton

Article 10 June 2021

Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement

Article Open access 20 April 2015

Identification of candidate genes for fiber length quantitative trait loci through RNA-Seq and linkage and physical mapping in cotton

Article Open access 31 May 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The Ligon-lintless-1 (Li ₁) short fiber mutant of cultivated Upland cotton (Gossypium hirsutum L.) was originally identified in 1929 and was among the phenotypic markers used to establish the first linkage groups in this important fiber crop (Kohel 1972; Narbuth and Kohel 1990). Beyond its role in the foundation of cotton genetics and genomics, Li ₁ has long been used as a model for understanding fiber cell elongation, a trait of great interest to both cotton breeders and cell biologists (Bolton et al. 2010; Kim and Triplett 2001; Triplett et al. 1989; Wang et al. 2013). While wild-type cotton fiber cells of G. hirsutum cv. DP5690 impressively elongate up to 30 mm in 4 weeks of development with fastest elongation around 8 days post anthesis (DPA), the Li ₁ fiber cells only extend about 3 mm when fully mature (Gilbert et al. 2013). The single dominant gene that confers this qualitative short fiber phenotype has previously been mapped to a region of chromosome 22 that has also been repeatedly identified in studies of quantitative cotton fiber traits, including quantitative trail loci (QTLs) for fiber length, fiber uniformity and yield of seed cotton (Chee et al. 2005; Fang et al. 2014; Jiang et al. 1998; Karaca et al. 2002; Rong et al. 2005; Yu et al. 2013). Despite many approaches including identification of differentially expressed proteins and transcripts, the number of candidate genes for the Li ₁ mutation remains very high (Ding et al. 2014; Gilbert et al. 2013; Liu et al. 2012; Zhao et al. 2009).

Although a reference genome sequence for the cultivated tetraploid cotton has not yet been published, reference genomes for two related diploids, Gossypium raimondii Ulbr. and Gossypium arboreum L. are available (Li et al. 2014; Paterson et al. 2012). Allotetraploid cotton contains A and D subgenomes which are closely related to the genome of the extant diploid G. arboreum (A₂ genome) and the genome of G. raimondii (D₅ genome), respectively (Wendel and Cronn 2003). We were previously able to significantly narrow the list of candidate genes for a different fiber mutant, Ligon-lintless-2 (Li ₂), by using the G. raimondii reference sequence, super bulked segregant sequencing (sBSAseq), bioinformatics and traditional fine mapping approaches (Thyssen et al. 2014a). Our current objective was to make similar progress towards the identification of the causative Li ₁ mutation using a pseudo-tetraploid reference genome consisting of the reference sequences of the extant diploids.

We took advantage of the long known incomplete dominance of the pleiotropic vegetative phenotypes of Li ₁ plants to select and sequence bulks of Li ₁ mutant and wild-type homozygotes from a large segregating population (Kohel 1972). This enabled us to confidently identify linked single nucleotide polymorphisms (SNPs) at a high allele frequency, which defined the genomic region containing the Li ₁ locus and provided us with markers to test on the F₂ mapping population. Ultimately, we were able to define a 255-kb region with only 16 candidate genes, which include a cluster of co-expressed genes, including several mitochondria-targeted genes, and a dramatically under-expressed small molecule transport protein from the major facilitator superfamily.

Materials and methods

Plant materials

The plants and populations used in this study were all described previously (Gilbert et al. 2013). Briefly, near isogenic lines (NIL) of Li ₁ mutant and its wild-type G. hirsutum cv. DP5690 were generated by five generations of backcrossing and nine generations of self-pollination with single seed descent to introgress the Li ₁ mutation into the DP5690 background. These parental plant lines were grown for mRNA isolation in New Orleans, LA in 2013. A segregating population of 2567 F₂ progeny was grown in Stoneville, MS in 2012. Standard conventional field practices were followed at both locations.

RNA isolation, Illumina sequencing and RT-qPCR

Three biological replicates of fibers from different developmental time points were collected from the field in New Orleans, LA. Total RNA from 8-DPA fiber cells were Illumina sequenced by Data2Bio LLC. (Ames, IA) with paired 101-bp reads which are available in the SRA database at NCBI with BioProject accession PRJNA273732. Total RNA from 0, 3, 5, 8, 12, 16, and 20-DPA fiber cells were converted to cDNA and subjected to reverse transcription quantitative polymerase chain reaction (RT-qPCR) as described elsewhere (Naoumkina et al. 2014). Primer sequences are included as Table S1.

Super bulked segregant analysis sequencing (sBSAseq)

The incomplete dominance of the dwarf phenotype of Li ₁ plants (Fig. 1 and Fig. S1) allowed us to score homozygosity of the segregating F₂ progeny at the Li ₁ locus. Based on this phenotype, 100 Li ₁/Li ₁ and 100 li ₁ /li ₁ (wild-type) plants were randomly selected from the F₂ population of 2567 individuals to be bulked and sequenced according to a sBSAseq approach (Michelmore et al. 1991; Takagi et al. 2013). Total genomic DNA from each bulk was Illumina sequenced by Data2Bio LLC with paired 101-bp reads.

Identification of diverse genomic regions

We aligned the sBSAseq total genomic reads, using GSNAP software, to a pseudo-reference genome for G. hirsutum that consisted of the 13 reference chromosomes of G. arboreum and 13 chromosomes of the G. raimondii genome which we present as the A and D subgenomes of G. hirsutum, respectively (Jiang et al. 1998; Paterson et al. 2012; Wu and Nacu 2010). We used InterSNP software at three different minor allele frequency (MAF) thresholds, 0.1, 0.2 and 0.3, to call SNPs between the wild-type and mutant bulk sequences (Page et al. 2014). We generated histograms by counting the number of SNPs in 1-Mb and 10-kb intervals.

Differential gene expression

We carried out differential gene expression of RNAseq reads as described elsewhere (Naoumkina et al. 2014, 2015). All reads were aligned to the reference G. raimondii genome following the PolyCat pipeline (Page et al. 2013). PolyCat software uses a database of homeoSNPs to categorize tetraploid reads into subgenomes. We made two adjustments to the PolyCat pipeline, as reported previously: (1) we only counted exonic reads; (2) we used the ratio of A-assigned to D-assigned reads to proportionately divide the total number of mapped reads for each gene to ensure that unassigned reads contribute to the total expression of genes (Thyssen et al. 2014a). The data normalization and ANOVA process were conducted as previously described (Naoumkina et al. 2014). In this paper, we only present RNAseq expression for candidate genes. All the significantly differentially expressed genes are reported elsewhere (Naoumkina et al. 2015).

Subgenome specific primer design

Manual inspection of read alignments in sBSAseq data was used to identify true SNPs and nearby homeoSNPs. We designed subgenome-specific SNP primers pairs essentially as described previously (Thyssen et al. 2014a). Each forward primer ends with the mutant allele SNP, while each reverse primer ends with the D-subgenome homeoSNP. Both primers contain an additional mismatch at the third base from the 3’ end, which increases annealing temperature stringency (Drenkard et al. 2000). Primer sequences are included as Table S2.

Mapping population

The 2567 F₂ plants used in this study were previously scored for phenotypes and by simple sequence repeat (SSR) markers (Gilbert et al. 2013). The newly developed SNP markers were validated by running qPCR reactions on parental NILs and F₁ plants as described previously (Thyssen et al. 2014a). The flanking SSR markers, C2-034C and DPL0489, were used to identify 85 informative plants with recombinational break points between the markers (Fig. 3). Since Li ₁ is a dominant mutation, only plants where one flanking marker was wt/wt and the other was Li ₁/wt were considered informative. These plants were tested with the new SNP markers and scored as either Li ₁/wt or wt/wt based on the presence or absence of a qPCR product with a C_t value that matched the control parental and F₁ plants. SNP marker genotypes for the non-informative plants were not imputed but were simply treated as missing data. The SNP marker CFB5857 was tested on the entire population and scored as a dominant marker. A genetic linkage map was constructed in JoinMap with default parameters and a LOD score of 10 (Van Ooijen 2006).

Results

Identification of the genomic region containing the Li ₁ locus from sBSAseq

Since the dwarf, twisted stem and wrinkled leaf vegetative phenotypes of the Li ₁ mutation are incompletely dominant in the DP5690 background in both the greenhouse (Fig. 1) and the field (Fig. S1), we were able to select only homozygous plants for sBSAseq. This enabled us to set a high allele frequency threshold for SNP calling. If heterozygotes were present in the dominant pool, we would expect reads from both alleles in that pool (Fig. 2). After aligning the reads to a reference pseudo-G. hirsutum genome composed of the A genome species G. arboreum and the D-genome diploid G. raimondii, we called SNPs at increasing minor allele frequencies (MAF). At a MAF of 0.1, there is a striking single peak of 422 SNPs per Mb in the 14th Mb of Chromosome 12 of the D-genome, which corresponds to Chr. 22 of the tetraploid, the expected location of Li ₁ based on earlier reports (Gilbert et al. 2013; Karaca et al. 2002; Rong et al. 2005). We took this density of SNPs to indicate a diverse region closely linked to the Li ₁ mutation that accompanied Li ₁ during the process of NIL development. We developed qPCR based SNP markers to interrogate these polymorphisms in the segregating F₂ population.

Segregation of markers in 2567 F₂ progeny

Analysis of SNP markers on the segregating population significantly narrowed the genetic interval that contains the Li ₁ locus (Fig. 3). Our new genetic map shows good correspondence with the physical map of the homologous chromosome from G. raimondii, and confines the Li ₁ locus to an interval of 255-kb.

Differential expression of genes on the interval of the Li ₁ locus

The region of reference G. raimondii sequence that is bound by SNP markers CFB5856 and CFB5857 contains only 16 annotated genes, of which only 8 were detected by RNAseq in 8-DPA fibers (Table 1). Three of these were significantly over-expressed in Li ₁ (Gorai.012G086600 “PPR”, Gorai.012G086800 “TOM”, and Gorai.012G086900 “DCD”) and two genes were significantly under-expressed (Gorai.012G086000 “DUF” and Gorai.012G086100 “MFS”).

Table 1 RNAseq differential expression of annotated genes at the Li ₁ locus in 8-DPA fiber cells

Full size table

RT-qPCR of candidate genes during fiber development

We tested all sixteen annotated genes in the Li ₁ interval by RT-qPCR across the development of cotton fiber cells (Fig. S2 and Fig. S3). In wild-type fiber cells, most of the 8 highly expressed genes were most highly expressed during lint fiber initiation at DOA. The expression of these genes was low at 3- and 5-DPA, with increased expression at the peak of elongation, 8-DPA (Fig. S2). The 8 genes which were not detected by RNAseq in 8-DPA fiber also mostly had their highest expression at DOA, with somewhat increasing expression later in development, at 16 or 20-DPA, though only to levels approximately ten-fold less than the highly expressed genes (Fig. S3). We computed the simple Pearson correlation coefficients for the expression of the 16 genes in mutant and wild-type fibers across the developmental stages as measured by RT-qPCR (Fig. 4). It is clear that two clusters of highly correlated genes are present in the wild-type fibers, with three of the very low expressed genes correlating well with each other, and the remaining 13 genes forming a second cluster of co-expression (Fig. 4). In mutant fibers, four genes lose their correlation with each other and with their neighboring genes (Fig. 4). Namely, Gorai.012G086100 “MFS”, Gorai.012G086400 “SEC”, Gorai.012G086800 “TOM”, and Gorai.012G086900 “DCD” have aberrant expression, most strikingly a dramatically reduced expression in lint fiber initials at DOA (Fig. S2).

Discussion

Mapping the genetic locus by sequencing of two homozygote pools

As genetic resources for cotton continue to improve and costs for sequencing decrease, mapping-by-sequencing will increasingly become the technique of choice for the identification of genetic loci that control phenotypes. We took advantage of the well annotated G. raimondii reference genome and recently released G. arboreum genome to construct a pseudo-reference tetraploid genome for G. hirsutum (Li et al. 2014; Paterson et al. 2012). We also benefitted from the incomplete dominance of the pleiotropic dwarf, twisted stem and wrinkled leaf phenotypes of the Li ₁ short fiber mutation to limit our sequencing to two pools of homozygotes, which significantly simplified the SNP-calling bioinformatic analysis. In the future, when mapping genes with complete dominance, we will consider testing the segregation of the progeny of several dominant F₂ plants before selecting plants for total genomic sBSAseq to ensure that only homozygotes are included in the bulks. While progeny testing adds time, it increases the expected allele frequency of the causative polymorphism from 0.67 to 1.0 (Mendel 1941). In the present study, at a MAF of 0.3, the SNP-calling software generates considerable background, while a MAF of 0.1 revealed an unambiguous region (Fig. 2). Even with this advantage, we nevertheless failed to discover any polymorphisms on the interval between our two closest flanking SNP markers, CFB5856 and CFB5857 in our short read sequencing data. This may mean that the causative Li ₁ mutation is a structural rearrangement, transposon insertion, copy number variation or resides in a region of low sequence complexity or in part of the region that is missing from the reference sequence. Indeed, this region of the G. raimondii assembly contains several long (up to 2-kb) runs of Ns (Paterson et al. 2012). Although there is good colinearity for most of the tetraploid D-subgenome with the G. raimondii genome, we cannot exclude the possibility of significant differences between genomes in the proposed Li ₁ interval, including differences in gene content. We look forward to a true G. hirsutum reference genome and longer sequencing read lengths to help resolve these issues.

Traditional fine genetic mapping

Mapping-by-sequencing allowed us to rapidly identify a region of a few Mb that contained a cluster of diversity that differentiates the progenitor of Li ₁ from the DP5690 cultivar that we used to generate the NILs. Since diversity between cultivars is not uniformly distributed, there was no warranty that the causative Li ₁ mutation would be within the greatest density of polymorphism, although it should be nearby. For this reason we designed markers in the vicinity, but not exclusively within the peak of diversity (Fig. 2). We tested segregation of our new SNP markers and the Li ₁ phenotype in the large F₂ population of 2567 individuals to narrow the region to 255-kb and 16 candidate genes. However, once flanking markers are confidently established, only a small subset of plants needs to be tested with the new markers to refine the interval (Blair et al. 2003).

Li ₁ candidate gene expression

We tested the expression of all 16 annotated G. raimondii genes in the Li ₁ locus during fiber development expecting to pick a candidate based on differences in elongation stage (~8-DPA) expression. However, we were surprised to observe that nearly all the candidate genes are most highly expressed at DOA, a time point that is traditionally associated with lint fiber initiation rather than elongation (Basra and Malik 1984). Classically, it is though that the shorter fuzz fibers initiate later than lint fibers, around 5-DPA, and the Li ₁ fiber phenotype was originally described as lintless but fuzzy (Kohel 1972; Lang 1938). Differential gene activity at DOA might therefore explain the lack of lint but presence of fuzz on Li ₁ seeds. However, it is not clear whether the fibers on Li ₁ are lint fibers, fuzz fibers or a combination of both lint and fuzz fibers (Triplett et al. 1989).

Correlated expression of neighboring genes

Because of the highly correlated expression of many genes in the Li ₁ locus interval, and potential unknown differences between the G. raimondii reference genome and the D-subgenome of G. hirsutum, it is so far impossible to pick a single candidate gene. By identifying orthologous genes in Arabidopsis, it is clear that the Li ₁ locus preserves some syntenic relationships of gene order (Table 1). Operon-like clusters of eukaryotic genes have been observed for triterpene synthesis in Arabidopsis and oat but were not found in Medicago (Field and Osbourn 2008; Naoumkina et al. 2010). Co-regulated genes in developing Arabidopsis root tissues show chromosomal clustering and recently a global analysis of Arabidopsis gene expression confirmed that neighboring genes are co-expressed (Birnbaum et al. 2003; Williams and Bowles 2004). The most significant contribution to neighboring gene co-expression was found to be orientation of gene pairs. Parallel or divergent orientations resulted in higher levels of co-expression, while the convergent orientation of genes reduced their co-expression, suggesting the effect of shared regulatory elements (Williams and Bowles 2004). However, the orientation of genes in the Li ₁ locus does not obviously explain their coordinated expression since DCD and ENY are highly correlated and are in a convergent orientation (Fig. 4; Table 1).

Most intriguing for the present study is the report that AT3G27080, the ortholog of “TOM” Gorai.012G086800, a subunit of the mitochondrial outer membrane translocase, is part of a chromosomal region in Arabidopsis that is unusually rich in mitochondrial genes (Elo et al. 2003). This region is also present in rice, and we have observed that the genes adjacent to TOM: Gorai.012G086900 “DCD” and Gorai.012G087000 “ENY” are orthologous to AT3G27090 and AT3G27100 which neighbor AT3G27080 (Elo et al. 2003). Additionally, Gorai.012G086600 “PPR” is predicted to be a mitochondria-targeted RNA binding protein that may affect organellar transcript editing or stability (Barkan and Small 2014). We have recently shown that a different short fiber mutant, Ligon-lintless-2 (Li ₂), has altered mitochondrial gene activity during fiber elongation, so it is interesting to observe a cluster of co-expressed mitochondrial genes in the Li ₁ candidate interval (Thyssen et al. 2014b).

Major facilitator superfamily

We also consider the gene Gorai.012G086100 “MFS” as an attractive candidate for Li ₁. This gene is a member of the major facilitator superfamily of transport proteins, a large family that includes sugar, amino acid and hormone transporters (Pao et al. 1998; Remy et al. 2013). A good match (98 % nucleotide identity) for MFS is NCBI EST DW231799 which has previously been listed as an Li ₁ candidate gene based on RNAseq of leaves (Ding et al. 2014). DW231799 was shown to have elevated transcript abundance in leaf and reduced expression in fibers of Li ₁ plants (Ding et al. 2014). We observed that MFS is highly expressed at DOA in wild-type fibers but is 40-fold under-expressed in Li ₁ fiber initials (Fig. S2). At 8-DPA, wild-type expression of MFS rebounds to about a quarter of the expression level at DOA, but expression in Li ₁ fibers remains very low (Fig. S2). The Li ₁ vegetative phenotypes mimic the exposure of cotton to low dosages of 2,4-D, a synthetic auxin herbicide, which has long been used to identify mutants of polar auxin transport (Bennett et al. 1996; Sciumbato et al. 2004). Injury to cotton plants by low dosages of auxin-type herbicides is characterized by bending and twisting of stems and wrinkling of leaves, although fiber length is not affected (Sciumbato et al. 2004; Smith and Wiese 1972). However, transgenic over-expression of auxin synthesis during lint fiber initiation resulted in an increase in lint yield (Zhang et al. 2011). Additionally, several important auxin transporters that also belong to the major facilitator superfamily were shown to be under-expressed in developing fibers of Li ₁ plants, suggesting a role for defective polar auxin transport in the Li ₁ fiber phenotype (Wang et al. 2013). Taken together, it is intriguing to speculate that over-expression of MFS in the leaves of Li ₁ plants contributes to the vegetative phenotypes, while under-expression of MFS in the fiber initials results in the short fiber phenotype. It is not possible to determine the substrate specificity of MFS by sequence analysis alone (Pao et al. 1998). However, the action of sugar transporters of the major facilitator superfamily can also produce dwarf plants in Arabidopsis and affect cotton fiber cell elongation (Gottwald et al. 2000; Ruan et al. 2001). Further work is required to determine if the aberrant transport of hormones or metabolites in fact underlies the phenotypes of the Ligon-lintless-1 short fiber mutant.

Author contribution statement

GNT, DDF, and MN conceived and designed the experiment. GT analyzed the sequencing data, designed the SNP markers and wrote the paper. DDF oversaw the project. RT developed the NILs and grew the F₂ population. CF and PL conducted the SNP and SSR marker analysis. MN performed and analyzed the RT-qPCR experiments. All authors read and approved the manuscript.

References

Barkan A, Small I (2014) Pentatricopeptide repeat proteins in plants. Annu Rev Plant Biol 65:415–442
Article CAS PubMed Google Scholar
Basra AS, Malik C (1984) Development of the cotton fiber. Int Rev Cytol 89:65–113
Article CAS Google Scholar
Bennett MJ, Marchant A, Green HG, May ST, Ward SP, Millner PA, Walker AR, Schulz B, Feldmann KA (1996) Arabidopsis AUX1 gene: a permease-like regulator of root gravitropism. Science 273:948–950
Article CAS PubMed Google Scholar
Birnbaum K, Shasha DE, Wang JY, Jung JW, Lambert GM, Galbraith DW, Benfey PN (2003) A gene expression map of the Arabidopsis root. Science 302:1956–1960
Article CAS PubMed Google Scholar
Blair MW, Garris AJ, Iyer AS, Chapman B, Kresovich S, McCouch SR (2003) High resolution genetic mapping and candidate gene identification at the xa5 locus for bacterial blight resistance in rice (Oryza sativa L.). Theor Appl Genet 107:62–73
Article CAS PubMed Google Scholar
Bolton JJ, Soliman KM, Wilkins TA, Jenkins JN (2010) Aberrant expression of critical genes during secondary cell wall biogenesis in a cotton mutant, Ligon lintless-1 (Li-1). Comp Funct Genomics 2009:659301
PubMed Central Google Scholar
Chee PW, Draye X, Jiang C-X, Decanini L, Delmonte TA, Bredhauer R, Smith CW, Paterson AH (2005) Molecular dissection of phenotypic variation between Gossypium hirsutum and Gossypium barbadense (cotton) by a backcross-self approach: III. Fiber length. Theor Appl Genet 111:772–781
Article CAS PubMed Google Scholar
Ding M, Jiang Y, Cao Y, Lin L, He S, Zhou W, Rong J (2014) Gene expression profile analysis of Ligon lintless-1 (Li ₁) mutant reveals important genes and pathways in cotton leaf and fiber development. Gene 535:273–285
Article CAS PubMed Google Scholar
Drenkard E, Richter BG, Rozen S, Stutius LM, Angell NA, Mindrinos M, Cho RJ, Oefner PJ, Davis RW, Ausubel FM (2000) A simple procedure for the analysis of single nucleotide polymorphisms facilitates map-based cloning in Arabidopsis. Plant Physiol 124:1483–1492
Article PubMed Central CAS PubMed Google Scholar
Elo A, Lyznik A, Gonzalez DO, Kachman SD, Mackenzie SA (2003) Nuclear genes that encode mitochondrial proteins for DNA and RNA metabolism are clustered in the Arabidopsis genome. Plant Cell 15:1619–1631
Article PubMed Central CAS PubMed Google Scholar
Fang DD, Jenkins JN, Deng DD, McCarty JC, Li P, Wu J (2014) Quantitative trait loci analysis of fiber quality traits using a random-mated recombinant inbred population in Upland cotton (Gossypium hirsutum L.). BMC Genom 15:397
Article Google Scholar
Field B, Osbourn AE (2008) Metabolic diversification—independent assembly of operon-like gene clusters in different plants. Science 320:543–547
Article CAS PubMed Google Scholar
Gilbert MK, Turley RB, Kim HJ, Li P, Thyssen G, Tang Y, Delhom CD, Naoumkina M, Fang DD (2013) Transcript profiling by microarray and marker analysis of the short cotton (Gossypium hirsutum L.) fiber mutant Ligon lintless-1 (Li ₁). BMC Genom 14:403
Article CAS Google Scholar
Gottwald JR, Krysan PJ, Young JC, Evert RF, Sussman MR (2000) Genetic evidence for the in planta role of phloem-specific plasma membrane sucrose transporters. Proc Natl Acad Sci 97:13979–13984
Article PubMed Central CAS PubMed Google Scholar
Jiang C-X, Wright RJ, El-Zik KM, Paterson AH (1998) Polyploid formation created unique avenues for response to selection in Gossypium (cotton). Proc Natl Acad Sci 95:4419–4424
Article PubMed Central CAS PubMed Google Scholar
Karaca M, Saha S, Jenkins J, Zipf A, Kohel R, Stelly D (2002) Simple sequence repeat (SSR) markers linked to the Ligon Lintless (Li ₁) mutant in cotton. J Hered 93:221–224
Article CAS PubMed Google Scholar
Kim HJ, Triplett BA (2001) Cotton fiber growth in planta and in vitro. Models for plant cell elongation and cell wall biogenesis. Plant Physiol 127:1361–1366
Article PubMed Central CAS PubMed Google Scholar
Kohel R (1972) Linkage tests in Upland cotton, Gossypium hirsutum L. II. Crop Sci 12:66–69
Article Google Scholar
Lang A (1938) The origin of lint and fuzz hairs of cotton. J Agric Res 56:507–521
Google Scholar
Li F, Fan G, Wang K, Sun F, Yuan Y, Song G, Li Q, Ma Z, Lu C, Zou C (2014) Genome sequence of the cultivated cotton Gossypium arboreum. Nat Genet 46:567–572
Article CAS PubMed Google Scholar
Liu K, Sun J, Yao L, Yuan Y (2012) Transcriptome analysis reveals critical genes and key pathways for early cotton fiber elongation in Ligon lintless-1 mutant. Genomics 100:42–50
Article CAS PubMed Google Scholar
Mendel G (1941) Versuche über Pflanzen-Hybriden. Theor Appl Genet 13:221–268
Article Google Scholar
Michelmore RW, Paran I, Kesseli R (1991) Identification of markers linked to disease-resistance genes by bulked segregant analysis: a rapid method to detect markers in specific genomic regions by using segregating populations. Proc Natl Acad Sci 88:9828–9832
Article PubMed Central CAS PubMed Google Scholar
Naoumkina MA, Modolo LV, Huhman DV, Urbanczyk-Wochniak E, Tang Y, Sumner LW, Dixon RA (2010) Genomic and coexpression analyses predict multiple genes involved in triterpene saponin biosynthesis in Medicago truncatula. Plant Cell 22:850–866
Article PubMed Central CAS PubMed Google Scholar
Naoumkina M, Thyssen G, Fang DD, Hinchliffe DJ, Florane C, Yeater KM, Page JT, Udall JA (2014) The Li ₂ mutation results in reduced subgenome expression bias in elongating fibers of allotetraploid cotton (Gossypium hirsutum L.). PLoS One 9:e90830
Article PubMed Central PubMed Google Scholar
Naoumkina M, Thyssen GN, Fang DD (2015) RNA-seq analysis of short fiber mutants Ligon-lintless -1 (Li ₁) and – 2 (Li ₂) revealed important role of aquaporins in cotton (Gossypium hirsutum L.) fiber elongation. BMC Plant Biol 15:65
Article PubMed Central PubMed Google Scholar
Narbuth E, Kohel R (1990) Inheritance and linkage analysis of a new fiber mutant in cotton. J Hered 81:131–133
Google Scholar
Page JT, Gingle AR, Udall JA (2013) PolyCat: a resource for genome categorization of sequencing reads from allopolyploid organisms. G3 Genes Genom Genet 3:517–525
CAS Google Scholar
Page JT, Liechty ZS, Huynh MD, Udall JA (2014) BamBam: genome sequence analysis tools for biologists. BMC Res Notes 7:829
Article PubMed Central PubMed Google Scholar
Pao SS, Paulsen IT, Saier MH (1998) Major facilitator superfamily. Microbiol Mol Biol Rev 62:1–34
PubMed Central CAS PubMed Google Scholar
Paterson AH, Wendel JF, Gundlach H, Guo H, Jenkins J, Jin D, Llewellyn D, Showmaker KC, Shu S, Udall J et al (2012) Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres. Nature 492:423–427
Article CAS PubMed Google Scholar
Remy E, Cabrito TR, Baster P, Batista RA, Teixeira MC, Friml J, Sá-Correia I, Duque P (2013) A major facilitator superfamily transporter plays a dual role in polar auxin transport and drought stress tolerance in Arabidopsis. Plant Cell 25:901–926
Article PubMed Central CAS PubMed Google Scholar
Rong J, Pierce GJ, Waghmare VN, Rogers CJ, Desai A, Chee PW, May OL, Gannaway JR, Wendel JF, Wilkins TA (2005) Genetic mapping and comparative analysis of seven mutants related to seed fiber development in cotton. Theor Appl Genet 111:1137–1146
Article CAS PubMed Google Scholar
Ruan Y-L, Llewellyn DJ, Furbank RT (2001) The control of single-celled cotton fiber elongation by developmentally reversible gating of plasmodesmata and coordinated expression of sucrose and K+ transporters and expansin. Plant Cell 13:47–60
PubMed Central CAS PubMed Google Scholar
Sciumbato AS, Chandler JM, Senseman SA, Bovey RW, Smith KL (2004) Determining exposure to auxin-like herbicides. I. Quantifying injury to cotton and soybean 1. Weed Technol 18:1125–1134
Article CAS Google Scholar
Smith DT, Wiese AF (1972) Cotton response to low rates of 2, 4-D and other herbicides. Tex Agric Exp Stn Bull B-1120
Takagi H, Abe A, Yoshida K, Kosugi S, Natsume S, Mitsuoka C, Uemura A, Utsushi H, Tamiru M, Takuno S (2013) QTL-seq: rapid mapping of quantitative trait loci in rice by whole genome resequencing of DNA from two bulked populations. Plant J 74:174–183
Article CAS PubMed Google Scholar
Thyssen GN, Fang DD, Turley RB, Florane C, Li P, Naoumkina M (2014a) Next generation genetic mapping of the Ligon-lintless-2 (Li ₂) locus in upland cotton (Gossypium hirsutum L.). Theor Appl Genet 127:2183–2192
Article CAS PubMed Google Scholar
Thyssen GN, Song X, Naoumkina M, Kim H-J, Fang DD (2014b) Independent replication of mitochondrial genes supports the transcriptional program in developing fiber cells of cotton (Gossypium hirsutum L.). Gene 544:41–48
Article CAS PubMed Google Scholar
Triplett BA, Busch WH, Goynes WR Jr (1989) Ovule and suspension culture of a cotton fiber development mutant. In Vitro Cell Dev Biol 25:197–200
Article Google Scholar
Van Ooijen J (2006) JoinMap 4 Software for the calculation of genetic linkage maps in experimental populations. Kyazma BV, Wageningen
Google Scholar
Wang M-Y, Zhao P-M, Cheng H-Q, Han L-B, Wu X-M, Gao P, Wang H-Y, Yang C-L, Zhong N-Q, Zuo J-R (2013) The cotton transcription factor TCP14 functions in auxin-mediated epidermal cell differentiation and elongation. Plant Physiol 162:1669–1680
Article PubMed Central CAS PubMed Google Scholar
Wendel JF, Cronn RC (2003) Polyploidy and the evolutionary history of cotton. Adv Agron 78:139–186
Article Google Scholar
Williams EJ, Bowles DJ (2004) Coexpression of neighboring genes in the genome of Arabidopsis thaliana. Genome Res 14:1060–1067
Article PubMed Central CAS PubMed Google Scholar
Wu TD, Nacu S (2010) Fast and SNP-tolerant detection of complex variants and splicing in short reads. Bioinformatics 26:873–881
Article PubMed Central CAS PubMed Google Scholar
Yu J, Zhang K, Li S, Yu S, Zhai H, Wu M, Li X, Fan S, Song M, Yang D (2013) Mapping quantitative trait loci for lint yield and fiber quality across environments in a Gossypium hirsutum × Gossypium barbadense backcross inbred line population. Theor Appl Genet 126:275–287
Article PubMed Google Scholar
Zhang M, Zheng X, Song S, Zeng Q, Hou L, Li D, Zhao J, Wei Y, Li X, Luo M (2011) Spatiotemporal manipulation of auxin biosynthesis in cotton ovule epidermal cells enhances fiber yield and quality. Nature Biotech 29:453–458
Article CAS Google Scholar
Zhao P-M, Wang L-L, Han L-B, Wang J, Yao Y, Wang H-Y, Du X-M, Luo Y-M, Xia G-X (2009) Proteomic identification of differentially expressed proteins in the Ligon lintless mutant of upland cotton (Gossypium hirsutum L.). J Proteome Res 9:1076–1087
Article Google Scholar

Download references

Acknowledgments

This project was financially supported by the USDA-ARS CRIS project #6435-21000-017-0DD and Cotton Incorporated project #12-210. Mention of trade names or commercial products in this article is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U. S. Department of Agriculture that is an equal opportunity provider and employer.

Conflict of interest

The authors declare that they have no conflict of interest.

Author information

Authors and Affiliations

Cotton Fiber Bioscience Research Unit, USDA-ARS-SRRC, 1100 Robert E. Lee Blvd, New Orleans, LA, 70124, USA
Gregory N. Thyssen, David D. Fang, Christopher Florane, Ping Li & Marina Naoumkina
Crop Genetics Research Unit, USDA-ARS, 141 Experiment Station Road, Stoneville, MS, 38776, USA
Rickie B. Turley

Authors

Gregory N. Thyssen
View author publications
You can also search for this author in PubMed Google Scholar
David D. Fang
View author publications
You can also search for this author in PubMed Google Scholar
Rickie B. Turley
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Florane
View author publications
You can also search for this author in PubMed Google Scholar
Ping Li
View author publications
You can also search for this author in PubMed Google Scholar
Marina Naoumkina
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marina Naoumkina.

Additional information

Communicated by M. Gore.

Electronic supplementary material

Below is the link to the electronic supplementary material.

122_2015_2539_MOESM1_ESM.tif

Supplementary material 1 (TIFF 1362 kb) Fig. S1 Field grown plants show incomplete dominance of the Li ₁ vegetative phenotypes. Homozygous wild-type, heterozygous and homozygous Li ₁ plants are labeled

122_2015_2539_MOESM2_ESM.jpg

Supplementary material 2 (JPEG 2153 kb) Fig. S2 RT-qPCR expression of highly expressed Li ₁ candidate genes during development of Li ₁ and wild-type DP5690 G. hirsutum fiber cells. Error bars indicate standard deviation from three biological replicates. Along the x-axis, DPA indicates the number of days post anthesis, a measure of developmental time for cotton fiber cells

122_2015_2539_MOESM3_ESM.jpg

Supplementary material 3 (JPEG 2119 kb) Fig. S3 RT-qPCR expression of Li ₁ candidate genes with no detectable expression at 8-DPA based on RNAseq. Error bars indicate standard deviation from three biological replicates. Along the x-axis, DPA indicated the number of days post anthesis

122_2015_2539_MOESM4_ESM.xlsx

Supplementary material 4 (XLSX 12 kb) Table S1 Primers for RT-qPCR of genes at the Li ₁ locus. F—forward, R—reverse, DF—D-subgenome specific forward primer, Table S2 Primers for SNP loci near the Li ₁ mutation. mF—mutant-specific forward, DR—D-subgenome-specific reverse primer

Rights and permissions

Reprints and permissions

About this article

Cite this article

Thyssen, G.N., Fang, D.D., Turley, R.B. et al. Mapping-by-sequencing of Ligon-lintless-1 (Li ₁) reveals a cluster of neighboring genes with correlated expression in developing fibers of Upland cotton (Gossypium hirsutum L.). Theor Appl Genet 128, 1703–1712 (2015). https://doi.org/10.1007/s00122-015-2539-4

Download citation

Received: 10 February 2015
Accepted: 11 April 2015
Published: 29 May 2015
Issue Date: September 2015
DOI: https://doi.org/10.1007/s00122-015-2539-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Mapping-by-sequencing of Ligon-lintless-1 (Li 1 ) reveals a cluster of neighboring genes with correlated expression in developing fibers of Upland cotton (Gossypium hirsutum L.)

Abstract

Key message

Abstract

Similar content being viewed by others

Mapping-by-sequencing the locus of EMS-induced mutation responsible for tufted-fuzzless seed phenotype in cotton

Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement

Identification of candidate genes for fiber length quantitative trait loci through RNA-Seq and linkage and physical mapping in cotton

Introduction

Materials and methods

Plant materials

RNA isolation, Illumina sequencing and RT-qPCR

Super bulked segregant analysis sequencing (sBSAseq)

Identification of diverse genomic regions

Differential gene expression

Subgenome specific primer design

Mapping population

Results

Identification of the genomic region containing the Li 1 locus from sBSAseq

Segregation of markers in 2567 F2 progeny

Differential expression of genes on the interval of the Li 1 locus

RT-qPCR of candidate genes during fiber development

Discussion

Mapping the genetic locus by sequencing of two homozygote pools

Traditional fine genetic mapping

Li 1 candidate gene expression

Correlated expression of neighboring genes

Major facilitator superfamily

Author contribution statement

References

Acknowledgments

Conflict of interest

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

122_2015_2539_MOESM1_ESM.tif

122_2015_2539_MOESM2_ESM.jpg

122_2015_2539_MOESM3_ESM.jpg

122_2015_2539_MOESM4_ESM.xlsx

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

Mapping-by-sequencing of Ligon-lintless-1 (Li ₁) reveals a cluster of neighboring genes with correlated expression in developing fibers of Upland cotton (Gossypium hirsutum L.)

Identification of the genomic region containing the Li ₁ locus from sBSAseq

Segregation of markers in 2567 F₂ progeny

Differential expression of genes on the interval of the Li ₁ locus

Li ₁ candidate gene expression