Abstract
Main conclusion
A novel tobacco mutant library was constructed, screened, and characterized as a crucial genetic resource for functional genomics and applied research.
A comprehensive mutant library is a fundamental resource for investigating gene functions, especially after the completion of genome sequencing. A new tobacco mutant population induced by ethyl methane sulfonate mutagenesis was developed for functional genomics applications. We isolated 1607 mutant lines and 8610 mutant plants with altered morphological phenotypes from 5513 independent M2 families that consisted of 69,531 M2 plants. The 2196 mutations of abnormal phenotypes in the M2 putative mutants were classified into four groups with 17 major categories and 51 subcategories. More than 60% of the abnormal phenotypes observed fell within the five major categories including plant height, leaf shape, leaf surface, leaf color, and flowering time. The 465 M2 mutants exhibited multiple phenotypes, and 1054 of the 2196 mutations were pleiotropic. Verification of the phenotypes in advanced generations indicated that 70.63% of the M3 lines, 84.87% of the M4 lines, and 95.75% of the M5 lines could transmit original mutant phenotypes of the corresponding M2, M3, and M4 mutant plants. Along with the increased generation of mutants, the ratios of lines inheriting OMPs increased and lines with emerging novel mutant phenotypes decreased. Genetic analyses of 18 stably heritable mutants showed that two mutants were double recessive, five were monogenic recessive, eight presented monogenic dominant inheritance, and three presented semi-dominant inheritance. The pleiotropy pattern, saturability evaluation, research prospects of genome, and phenome of the mutant populations were also discussed. Simultaneously, this novel mutant library provided a fundamental resource for investigating gene functions in tobacco.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Rapid progress in the field of high-throughput sequencing technology has led to the accomplishment of more than one hundred plant genome sequencing projects to date, including tobacco (Bombarely et al. 2012; Sierro et al. 2013, 2014), and five other Solanaceae members (Potato Genome Consortium 2011; Tomato Genome Consortium 2012; Bolger et al. 2014; Hirakawa et al. 2014; Kim et al. 2014; Bombarely et al. 2016). To subsequently decipher the functions of predicted genes, identify novel genes, and understand the molecular basis of important agronomic traits in these plants, large libraries of mutant lines have been generated by random mutagenesis via chemical [e.g., ethyl methane sulfonate (EMS)], physical (e.g., fast neutrons and X-rays) or biological (e.g., T-DNA and transposon) agents based on the requirement of functional genomics (Alonso and Ecker 2006).
The most direct and effective method of investigating gene functions is to characterize the phenotypic variations of mutants. These phenotypic variations constitute the raw material for many fields of plant biology, including research on gene functions, mutation breeding for desirable characteristics, mutant plant tolerance to other species (e.g., virus, bacteria, and fungus), etc. (Oellrich et al. 2015). Ultimately, large-scale phenotyping of saturated mutant libraries can determine the relationship between genotypes and phenotypes and build a highly informative bridge between genomes and phenomes.
T-DNA and transposon insertions have been the most successful approaches for linking a genotype to a phenotype via the generation of sequence-indexed mutant collections suitable for systematic reverse genetics in Arabidopsis (Kuromori et al. 2009; O’Malley and Ecker 2010; Bolle et al. 2011; Wilson-Sánchez et al. 2014). A large number of insertion mutant libraries have been generated in rice, and these libraries are widely used for phenotypic analyses and gene function studies (Kuromori et al. 2009; Chang et al. 2012; Wang et al. 2013). In these diploid insertion mutants, the relative merits between loss-of-function and gain-of-function mutagenesis can complement one another. However, 12.2% of the genes in Arabidopsis still lack inserts even the amount of existing insertion mutants have been increased to 385,000 (O’Malley and Ecker 2010); and one of the probable reasons is that T-DNA insertions have a large integration site bias, such as favering intergenic regions (Sessions et al. 2002; Alonso et al. 2003; Rosso et al. 2003). Insertional mutagenesis also has a much lower mutagenic efficiency relative to EMS mutagenesis (Alonso and Ecker 2006; Warnasooriya and Montgomery 2014). The EMS mutagenesis generates multiple types of DNA alterations, although it predominantly generates not only single base-pair substitutions but also small insertions and deletions as demonstrated in a study of more than 1900 EMS-induced mutations in Arabidopsis (Greene et al. 2003; Till et al. 2003). More importantly, approximately 400 mutations per genome can be generated by EMS treatment, compared with an average of 1.5 insertions per T-DNA mutant by insertion mutagen (Alonso et al. 2003; Till et al. 2003). Owing to its high efficiency and unbiased mutagenic distribution, EMS mutagenesis is the preferred choice in exploratory mutagenesis studies to identify specific mutant phenotypes (Alonso and Ecker 2006), especially for morphological phenotypes of allopolyploid, such as tobacco (Nicotiana tabacum, allotetraploid, 2n = 48).
Tobacco is an economic crop and an important model plant for studying fundamental biological processes and plant disease susceptibility (Sierro et al. 2014). Moreover, tobacco can manufacture patient-specific vaccines as a bioreactor system (Arntzen 2008), and its ability to produce biodiesels makes it a potential bioenergy crop because of its large biomass and easy plant-transformation (Andrianov et al. 2010; Fuchs et al. 2013; Vanhercke et al. 2014). To accelerate tobacco research, the genomes of four tobacco species, N. benthamiana (Bombarely et al. 2012), N. sylvestris, N. tomentosiformis (Sierro et al. 2013), and N. tabacum (Sierro et al. 2014), have been sequenced. N. sylvestris (2n = 24) and N. tomentosiformis (2n = 24) are considered to be two ancestors of N. tabacum (Leitch et al. 2008; Sierro et al. 2013). Many databases related to tobacco that are useful for reverse genetics research have been developed, and the genome and transcriptome sequences are stored in the Sol Genomics Network (SGN), and the expressed sequence tags (ESTs) are stored in the National Center for Biotechnology Information (NCBI). One large-scale phenotyping study was also performed in RNA-interference transgenic tobacco plants to identify phenotypic changes (Lein et al. 2008). However, collections of tobacco mutants as a powerful tool for genetic research are not reported, except that one activation-tagging mutant library of tobacco containing approximately 100,000 transgenic tobacco plants was recently generated. In 4105 T1 family lines, three hundred and eleven lines displayed abnormal phenotypes in leaf and flower morphology, plant height, flowering time, branching, and fertility (Liu et al. 2015).
To compensate for the shortcomings of activation-tagging mutagenesis and take the first step to link phenotypes to genotypes in tobacco, EMS mutagenesis was performed on N. sylvestris, N. glutinosa, N. suaveolens, N. tabacum cv. Zhongyan100 (ZY100), HonghuaDajinyuan (HD), and CuiBi1 (CB1) in our mutant library. Mutant collections of ZY100 are presented and summarized in emphasis because of the high diversity and proportion of the populations, and a total of 94,997 M2 family seeds of ZY100 have been collected. To date, 1607 mutant family lines (containing 2196 mutations of abnormal phenotypes) and 8610 mutant plants have been isolated from 5513 M2 family lines and 69,531 M2 plants, respectively. All of the mutants with visible altered morphological phenotypes were collected and classified into different phenotypic categories. The pleiotropy, hereditability, and saturability of the mutant populations are analyzed and discussed. This novel mutant library can be widely utilized as a crucial and fundamental resource for functional genomics and applied research in tobacco.
Materials and methods
Plant materials and mutagenic treatment
Tobacco seeds were obtained from the National Infrastructure for Crop Germplasm Resources (Tobacco, Qingdao). Five cultivars containing N. tabacum cv. ZY100, Yuyan97 (Y97), Yunyan87 (Y87), HD, and K326 were used for the selection of donor parents. EMS of Sigma (M0880-5G, CAS: 62-50-0) were used as the chemical mutagen. The seeds were soaked in distilled water for 4 h and then treated with three different EMS concentrations (0.4, 0.6, and 0.8%) for 16 h at room temperature under gentle shaking. After removing the EMS solutions, the treated seeds were washed three times with distilled water for 2 h by gentle shaking. The resulting M1 seeds were then sown in the greenhouse or in field.
Procedure for isolating putative mutants
The detailed workflows for constructing mutagenized populations, isolating and characterizing putative mutants are indicated in Fig. 1. The resulting M1 seeds under EMS mutagenesis were sown in the greenhouse or in field to harvest M2 seeds from each of the single M1 plants. Next, 50–100 M2 seeds were sown in a pot, and 15 seedlings were selected randomly for transplantation as a M2 family line. The M2 plants were screened to detect alterations in their visible morphological phenotypes during the whole field stage. M3 seeds from part or all of the M2 plants in each line were harvested and mixed, and the M3 seeds from each individual putative mutant were also harvested. Individual mutant images, their phenotypic categories, and all M3 seeds were registered in the tobacco mutant database (http://www.tobaccomdb.com/english/). Subsequently, each M3 mutant family line with 30 plants was sown for characterization and verification of putative mutant phenotypes. The M3 homozygous mutants were crossed with wild-type cultivars to produce the F1 generation and the F2 populations for genetic analysis and genome mapping. If new mutant phenotypes were identified in an individual M3 or M4 mutant plant, then the M4 or M5 seeds of this plant were added to the seed library and then sown as a mutant family line to verify the new mutant phenotypes. M4 and M5 seeds from confirmed homozygous mutants were also collected in the seed library.
Plant phenotyping
Phenotyping was performed from seed germination to seed harvesting at different developmental stages, including the nursery stage, rosette stage, vigorous growing stage, flower budding stage, flowering period, and fructicative period (Fig. S1). Large-scale and limited phenotyping was conducted for the M2 generation, in which the plants with altered morphological phenotypes were regarded as ‘putative M2 mutants’. Systematic evaluations of mutant phenotypes were performed in the M3 to M5 generations. Distinguishable phenotypes were recorded and photographed using a Canon EOS 5D MarkII digital camera.
Each putative mutant was characterized according to four groups, 17 major categories, and 51 subcategories, which are reported in Table 1 and Table S1. The phenotypic categories of the screened mutants describe the morphological changes at all the developmental stages (Table S1).
Evaluation criterion of the heritability of mutant phenotypes
To test the reproducibility and heritability of the mutant phenotypes, six types of family lines were defined based on the reproducibility of the original mutant phenotypes (OMPs) and the emergence of novel mutant phenotypes (NMPs). Taking the M3 family line for example, the phenotypes of its mutant plants in the previous M2 generation were named ‘OMPs’, and new phenotypes in this M3 line that differed from OMPs were named ‘NMPs’. When all M3 plants displayed phenotypes that were indistinguishable from their OMPs, the line was recorded as a ‘stable line with OMPs’. When part of the M3 plants in the line showed the same phenotypes for OMPs and no NMPs, the line was noted as a ‘segregating line with OMPs’. When all or part of the M3 plants in the line completely recovered OMPs simultaneously manifesting NMPs, the line was designated as a ‘segregating line with OMPs and NMPs’. When OMPs were not observed in all M3 plants, and NMPs emerged in part of the M3 plants, the line was recorded as a ‘segregating line with NMPs’. When all M3 plants exhibited NMPs and no OMPs, the line was recorded as a ‘stable line with NMPs’. When all M3 plants displayed a morphology that was indistinguishable from that of wild type, without OMPs and NMPs, the line was recorded as a ‘line without abnormal phenotypes’ (Table 2).
Results
Selection of donor parents and EMS concentrations for treatment
Common tobacco is a large-sized plant (more than 1.5 m in height after the flowering stage) that has a long life cycle (160–180 days from seed germination to leaf maturity), consisting of several developmental stages (Fig. S1). A large number of cultivated tobacco varieties are available worldwide, and polymorphisms occur in the different genetic backgrounds. To optimize the conditions for mutagenesis and to obtain eligible cultivars, the seeds of five cultivars with different ecotypes were chosen to test the concentration and duration of EMS treatment (Table S2) (Wang et al. 2011). The donor cultivar ZY100 is widely cultivated in the north producing area of China, and Y97, Y87, and HD are cultivated in the southwest. K326 is adaptable to grow in most of the producing areas. In preliminary tests, 16 h appeared to be an effective duration for EMS treatment (data not shown). Seven concentrations of EMS treatments (0.1, 0.3, 0.5, 0.7, 0.9, 1.1 or 1.3%) and one control treatment using a water solution without EMS were applied for 16 h. The results indicated that the lethal dose (LD) for all five varieties was 1.3%, and the appropriate concentrations of EMS for treating tobacco seeds ranged from 0.35 to 0.52%. The sensitivity of the five tobacco varieties to EMS treatment in descending order was as follows: with significant difference, Y97/Y87, HD, ZY100, and K326. Ultimately, 0.4% (LD30, lethal dose causing a 30% reduction in seed germination), 0.6% (LD50), and 0.8% (LD65) were selected as the treatment concentrations, and HD and ZY100 served as the donor parents for EMS mutagenesis. The rate of observable phenotypic changes in HD (19.26% of 862 M2 lines) is far lower than that in ZY100 (39.83% of 5513 M2 lines); therefore, the phenotypes of mutants from ZY100 were prioritized for analysis.
Phenotypic classification of putative mutants
Mutant libraries obtained via EMS-mutagenesis have been constructed from 2008 to 2015, during which a 3-year preliminary experiment, and a 5-year large-scale screening and verifying experiment were also performed. 94,997 M2 family seeds were collected in total. Statistical analyses of the data indicated that 84,531 of 101,512 mutagenic M1 plants planted in the field survived, for a survival rate of 83.27% (Table S3). The strategy for constructing mutagenized populations and isolating and characterizing putative mutants is summarized in detail (Fig. 1).
From 2012 to 2013, a total of 5513 M2 family lines (69,531 M2 plants in total) had been sown and screened (Table 1, Table S1), and they showed a sterility rate of 2.85% for family lines and 0.48% for plants. In total, the screen for mutants with visible altered morphological phenotypes yielded 1607 mutant family lines and 8610 mutant plants, which demonstrated mutation frequencies of 29.15% in the family lines. There were 2196 different mutations of abnormal phenotypes in 1607 mutant lines (Table 1, Table S1). All of the visible mutations of abnormal phenotypes were classified into four groups with 17 major categories and 51 subcategories, and there were one to six subcategories for each of the 17 major categories in the catalog. The major categories and subcategories are described and defined in more specific detail in Table S1.
To evaluate the distributions of different mutation phenotypes in the tissues and organs of the tobacco plant, the number and proportion of the groups, major categories, and subcategories of defined visible phenotypes were summarized. The most abundant group was leaf morphology (38.89% in the lines), followed by plant habit (25.68%), flower (19.44%), and leaf color and senescence (15.98%) in M2 mutant populations, with a mutation rate (the percentage of putative M2 mutant lines in all M2 family lines) of 15.49, 10.23, 7.75, and 6.37%, respectively (Fig. S2; Table 1). The five most abundant major categories were plant height, leaf shape, leaf surface, leaf color, and flowering time, which covered more than 60% of the abnormal phenotypes observed. Leaf shape and surface were the two most prominent categories (16.53 and 15.03% in the lines) in the leaf morphology group, and they had mutation rates of 6.58% and 5.99%. In the plant habit group, plant height was the most prominent category (14.71%), and it had a mutation rate of 5.86%. Leaf color and flowering time were the most abundant categories in the leaf color and senescence and flower groups, which exhibited mutation rates of 5.21 and 3.50%, respectively (Fig. 2; Table 1, Table S1).
Rough and rugose leaf, dark green leaf, dwarf plant, high plant, and narrow leaf were the five top subcategories and accounted for more than 40% of all of the categories. Rough and rugose leaf was the largest subcategory and presented mutation rates of 4.84% (Fig. 3e-1, e-2; Table S1). The mutations of this subcategory has several types that are usually accompanied by the mutations of other categories, such as leaf shape (Fig. 3d4-1, d5), leaf color (Fig. 3f3), and leaf vein (Fig. S3e1-2, e2). Many other subcategories had multiple types of mutant phenotypes. For example, mutants with spirally twisted leaves consisted of several types and included all leaves spiraling clockwise (Fig. 3d4-1), all spiraling anticlockwise (Fig. 3d4-2), part of leaves spiraling clockwise, and others spiraling anticlockwise (Table S1). The white (albino) phenotypes appeared in the form of spotted (Fig. S3i1-1), small sheet, large sheet (Fig. 3f1), and spotted confused sheet (Fig. S3i1-2). Mutants with yellow (etiolated) leaves were also indicated of several mutant types, such as entirely yellow leaf (Fig. S3i2-1, i2-3) and mostly yellow leaf (Fig. 3f2, Fig. S3i2-2).
Pleiotropy between mutants and phenotypes
Because mutants occasionally present multiple phenotypes, the number of mutations of abnormal phenotypes exceeded the total number of mutant lines, thereby demonstrating as ‘pleiotropy’. 1607 mutant family lines and 2196 mutations of abnormal phenotypes were screened from the M2 mutant populations for a mutation frequency of 29.15% in the family lines and 39.83% in the morphological phenotypes (Table 1, Table S1). Of the 1607 M2 mutant lines, 1142 mutant lines showed single phenotypes and the others showed multiple phenotypic categories in each line. Each of the 357 mutant lines represented two kinds of phenotypic categories, and each of the 87 lines represented three kinds. Sixteen and three lines revealed four and five kinds of phenotypic categories in each line, respectively. Two mutant lines indicated six to seven categories. In summary, the pleiotropic mutant lines accounted for 28.94% of the total mutant family lines, and non-pleiotropic mutant lines accounted for the remainder (Fig. 4a).
Of the 2196 mutations of abnormal phenotypes, 1054 (48%) were pleiotropic (Fig. 4b; Table S4). The rates of pleiotropy in the 17 major categories ranged from 17.19% (leaf senescence) to 78.57% (leaf number), with the rates in most major categories varying around 50%. The rates of pleiotropy in the top five major categories ranged from 45.82 to 51.79%, and their total number accounted for 68.88% of the pleiotropic phenotypes. The category containing the fewest pleiotropic phenotypes was stem color, which only had one mutant line (0.09%). Leaf shape constituted the maximum numbers of pleiotropic phenotypes with 188 mutant lines (17.84%). The remaining top pleiotropic categories included leaf surface at 15.65% (165), plant habit at 14.04% (148), and leaf color at 12.81% (135).
Reproducibility and heritability of the mutant phenotypes
To test the reproducibility and heritability of the mutant phenotypes, we randomly selected and examined the family lines with mutant phenotypes in the M3, M4, and M5 plants. Six types of family lines were defined as the evaluation criterion of the heritability of mutant phenotypes based on the reproducibility of OMPs and the emergence of NMPs (Table 2).
The heritability of the mutant phenotypes indicated that as the mutant generation increased, the ratios of lines inheriting OMPs increased and lines with emerging NMPs decreased. Thirty inbred seeds in each line were sown from all 143 M3 family lines to evaluate the reproducibility and heritability of OMPs in M2 putative mutants. In 101 (70.63%) M3 lines, the phenotypes were reproducible from OMPs of each corresponding M2 mutant and included 31 (21.68%) stable lines with OMPs, 51 (35.66%) segregating lines with OMPs, and 19 (13.29%) segregating lines with OMPs and NMPs. Eighty-two (57.34%) lines could completely transmit OMPs and showed stable inheritance. However, 41 (28.67%) lines produced NMPs, including 19 (13.29%) segregating lines with OMPs and NMPs, 16 (11.19%) segregating lines with NMPs, and six (4.20%) stable lines with NMPs. Twenty (13.99%) lines did not display any of the described phenotypes. One hundred fifty-two M4 family lines from part of the M3 mutant plants were selected and planted. One hundred twenty-nine (84.87%) M4 lines transmitted the M3 OMPs and included 44 (28.95%) stable lines with OMPs, 46 (30.26%) segregating lines with OMPs, and 39 (25.66%) segregating lines with OMPs and NMPs. Ninety (59.21%) lines indicated stable inheritable of OMPs without NMPs. Twelve (7.89%) lines showed the same morphology as the wild-type donor. NMPs appeared in 50 (32.89%) M4 lines, which contained 39 (25.66%) segregating lines with OMPs and NMPs, five (3.29%) segregating lines with NMPs, and six (3.95%) stable lines with NMPs. In 47 M5 family lines, 45 (95.75%) lines completely reproduced their corresponding M4 OMPs and included 40 (85.11%) stable lines with OMPs and five (10.64%) segregating lines with OMPs. NMPs and no mutant phenotype were observed in one line each (2.13%).
Determination of the inheritance patterns of stably heritable mutant phenotypes
To determine the inheritance patterns of stably heritable mutants, 18 individual plants from homozygous mutant lines in the M3 and M4 generations were crossed with wild-type or mutant plants, and their genetic segregations in the F1, F2 progenies and BC1F1 populations were examined (Table 3). This genetic analysis revealed that two mutants were double recessive and 16 mutants were monogenic. Two white stem mutants (ws1 and white stem 2) displayed double recessive patterns of inheritance, with a segregation ratio of 15:1 (WT:others) in the F2 generation and 3:1 in the BC1F1 generation produced by test-crossing F1 plants with mutants. The ws1 mutant phenotype has been confirmed to be controlled by two recessive nuclear genes, ws1a and ws1b, which were preliminarily mapped using the BC1F2 populations. The allelism test has revealed that the burley character in burley tobacco is controlled by the same two genes (Wu et al. 2014).
Five mutants appeared to carry monogenic recessive mutations, such as spirally twisted and rough leaf 1, revolute margins 1, white (albino) leaf 1, and two white flower mutants (white flower 1 and 2), and their segregation ratios were the same at 3:1 in the F2 generation and 1:1 in the BC1F1 generation produced by test-crossing F1 plants with mutants. Eight mutants displayed monogenic dominant patterns of inheritance, including four plant height mutants (high plant 1–4) and four leaf petiole mutants (short petiole 1–2, and extremely long petiole 1–2). Monogenic and semi-dominant patterns were validated in three mutants, wide and large leaf 1 and deep red flower 1 and 2. The segregation ratios in the monogenic dominant and semi-dominant mutants were 1:3 in the F2 generation and 1:1 in the BC1F1 generation produced by crossing F1 plants with wild type.
Discussion
Optimization of EMS mutagenesis on multiple donor cultivars
It is important to assess parameters related to the efficient generation of mutant collections. The EMS-treated seeds were subjected to multiple concentrations and the incubation times of 12, 16, 17, 18, 24, and 48 h as observed for different species, such as Arabidopsis (Berná et al. 1999; Till et al. 2003, 2007), rice (Wu et al. 2005), barley (Caldwell et al. 2004), sorghum (Xin et al. 2008), and tomato (Menda et al. 2004; Saito et al. 2011). Saito et al. (2011) also suggested that the incubation duration was not the critical parameter. Therefore, the commonly used incubation time of 16 h was selected after a primary test. A concentration gradient was generated to examine the effects of EMS concentrations on the tobacco seeds (Table S2), and LD50 and LD65 were selected to obtain a relatively high mutant efficiency. In the two donor cultivars with different ecotype, ZY100 was cultivated in the north growing area, and HD was cultivated in the southwest. In 2012, the ecotype CB1, which covers a large area in the southeast growing area, was also mutagenized. Although the mutagenesis rates of HD and CB1 according to the visible morphological phenotypes were far lower than that of ZY100, the integrated mutagenesis based on multiple ecotypes can further enhance the number of polymorphisms in the mutant populations.
Pleiotropy: compound mutant phenotypes based on one gene, multiple genes, or the causality of different phenotypes
Many cases of pleiotropy were found in the tobacco mutants that included one line listed under two or more phenotypic categories. This event occurred extensively in almost all of the EMS-mutagenized mutant libraries. In our studies, we considered these cases as compound mutant phenotypes that are present in three forms. One form was the real pleiotropism, which included two or more mutations of abnormal phenotypes that were always presented together in one mutant plant. This observation likely indicated that these phenotypes were controlled by one gene. For example, rough leaf and dark green leaf were always present simultaneously, similar to spirally twisted leaf and dwarf plant. The second form was the compound mutant phenotype based on the random distribution of multiple phenotypes in one mutant line. Most of the pleiotropic events were of this type, which also caused the appearance of NMPs in advanced generations of mutants, such as M3 and M4. The last form was represented by one phenotype that led to another phenotype. For example, disordered veins produced a rugose leaf surface. Short internodes in the mutants could cause dwarf plants. Many early flowering mutants exhibited fewer leaves. In summary, the real cause and mechanism underlying these events will be illuminated via multifaceted genetic analyses.
Genetic analysis of the heritability and inheritance patterns of the mutant phenotypes
The reproducibility and heritability of the mutant phenotypes were confirmed by visible verification of the phenotyping of the M3, M4, and M5 mutant plants. We found that 70.63% of the M3 lines, 84.87% of the M4 lines, and 95.75% of the M5 lines were reproducible from OMPs of the corresponding M2, M3, and M4 mutant plants. Therefore, selfing in advanced generations may be a highly efficient method of achieving rapid isozygoty and mutant stabilization. Moreover, NMPs continued to emerge in the M3 and M4 generations at high percentages of 28.67 and 32.89%, respectively, which indicates that the M3 and M4 generations were important generations for screening the phenotypic mutants. Three aspects may explain the appearance of NMPs: NMPs are inherited recessively; the OMPs mask the NMPs; and interactions, such as epistasis, occur between OMPs and NMPs. The emergence of NMPs suggested that the ratio of mutants displaying pleiotropism might be relatively underestimated in the M2 generation.
Inheritance patterns in 18 mutants were demonstrated according to their F1, F2, and BC1F1 progenies. Seven members exhibited monogenic or digenic recessive inheritance, eight members demonstrated monogenic dominant inheritance, and two members demonstrated semi-dominant inheritance. These data confirmed that the phenotypes of the 18 mutants were reproducible in subsequent progenies, and their heritability was sufficiently stable for further functional analyses. However, the ratio of every type of inheritance pattern in these mutants does not represent the inheritance patterns of the whole population.
Preliminary evaluation of the saturability of the mutant populations
The population size necessary for conducting saturation mutagenesis in tobacco was estimated by referring to the genome sizes and saturability of other plants. Tomato is closely related to tobacco and has a genome size of approximately 900 Mb. A population of 13,000 M2 families was considered to represent a saturated collection (Menda et al. 2004). A total of 15,020 M2 mutagenized lines were produced and considered nearly saturated (Saito et al. 2011). Barley is a type of Triticeae crop, and it has a genome size of approximately 5 Gb and >80% repetitive sequences; thus, its features are similar to the tobacco genome. Caldwell et al. (2004) inferred that a saturated EMS mutant population in barley would require more than 100, 000 plants for comprehensive coverage because the mutation rate of EMS mutagenesis appears to be relatively independent of the genome size. Tobacco (N. tabacum) has a large genome size of 4.5 Gb and over 70% repetitive sequences. In general, a population with 100,000 M2 mutant lines is considered to be saturated with mutations, although the presence of multiple alleles per locus has not been determined. Therefore, allelism tests and other approaches to assess the functional genetics of tobacco will be conducted to evaluate the saturation of the ZY100 EMS-mutagenized populations.
Improvement of the mutant library
Ethyl methane sulfonate mutagenesis is an efficient way to identify mutant morphological phenotypes of tobacco, owing to its high efficiency and unbiased mutagenic distribution. All the data and images can be searched online (http://www.tobaccomdb.com/english/). In combination with the whole genome sequence of tobacco, these mutant resources provide preliminary data for associating the genome with the phenome. However, collection and storage of phenotypic mutants is not yet sufficient to facilitate the identification and characterization of gene functions. And the mutant library must be further improved to link specific phenotypes and associated genomic sequence information.
From phenotype to phenome
Phenotyping in classical genetics is largely based on large-scale phenotype screenings of mutant collections based on visual observations and scoring systems or simple instrumental inspections of organisms to understand the functional plant traits. However, the process of performing a manual evaluation of tobacco mutant phenotypes is subjective, inefficient, and error prone. Additionally, visible phenotypes are only part of the total phenome. From phenotype to phenome, multifunctional, high-throughput, and high-resolution phenotyping platforms should be developed, and the lack of these resources has become a new bottleneck in plant biology, functional genomics, and crop breeding, which are based on traditional methods of phenotyping large mutant collections (Finkel 2009; Furbank and Tester 2011; Yang et al. 2013). Certain shortcomings still limit the generalization of the high-throughput phenomic platforms, such as their high cost, sophisticated data analysis infrastructures, and inconvenient high-throughput field phenotyping (Yang et al. 2013; Grosskinsky et al. 2015). However, phenomics based on the high-throughput phenotyping has been performed to assess abiotic stress tolerance in Arabidopsis (Jansen et al. 2009; Berger et al. 2010), cereals (Sirault et al. 2009; Munns et al. 2010), tobacco, and cotton (Furbank and Tester 2011).
Genetic technologies in genome-wide phenomics
Functional genomics investigations of the mutant library were performed using forward and reverse genetic approaches. Forward genetics approaches mainly depend on positional cloning to map and clone the gene responsible for the mutant phenotype. Large numbers of genetic markers for map-based cloning have been developed, such as microsatellite markers (SSRs) (Bindler et al. 2007, 2011; Tong et al. 2012b), and amplified fragment length polymorphisms (AFLPs) (Marché et al. 2001; Dadras et al. 2014). SSR markers have been widely used for gene mapping (Wu et al. 2014), linkage disequilibrium analyses (Fricano et al. 2012), quantitative trait locus (QTL) mapping (Tong et al. 2012a), and evolutionary analyses (Wu et al. 2010). However, because of the poor genetic diversity among different tobacco cultivars, the density of SSR markers ultimately cannot meet the demand of positional cloning. Therefore, single nucleotide polymorphism (SNP) markers may be the most appropriate markers for mapping and should be exploited to construct a genetic linkage map (Xiao et al. 2015). The availability of tobacco genome sequence also makes possible to address mapping-by-sequencing approaches in this crop, which considerably accelerate forward genetic and mutant gene identification.
TILLING is a reverse genetic approach that was developed to identify mutations in EMS-mutagenized populations. This strategy has been validated by its successful application in many plants through multiple improvements (Gilchrist and Haughn 2005; Minoia et al. 2010; Tsai et al. 2011, 2013). The construction of a high-quality tobacco mutant collection also required the technological support of a systematic TILLING platform. Therefore, the mixed pools of DNA samples in the mutant populations were used to develop a high-efficiency TILLING platform and screen large numbers of various mutants in the LS, RAX, ANS, and QPT genes. In the future, this platform will facilitate functional genomics applications in tobacco.
Author contribution statement
LG and WY conceived and designed the research. WD, WS, CJ, WX, SY, LF, LJ, GX and LG performed technical work for investigating mutants. WD, WS, CJ, WX, and LG analyzed and summarized the data. WD wrote the manuscript. All authors read and approved the manuscript.
Abbreviations
- EMS:
-
Ethyl methane sulfonate
- LD:
-
Lethal dose
- NMP:
-
Novel mutant phenotype
- OMP:
-
Original mutant phenotype
References
Alonso JM, Ecker JR (2006) Moving forward in reverse: genetic technologies to enable genome-wide phenomic screens in Arabidopsis. Nat Rev Genet 7(7):524–536. doi:10.1038/nrg1893
Alonso JM, Stepanova AN, Leisse TJ et al (2003) Genome-wide insertional mutagenesis of Arabidopsis thaliana. Science 301(5633):653–657. doi:10.1126/science.1086391
Andrianov V, Borisjuk N, Pogrebnyak N, Brinker A, Dixon J, Spitsin S, Flynn J, Matyszczuk P, Andryszak K, Laurelli M, Golovkin M, Koprowski H (2010) Tobacco as a production platform for biofuel: overexpression of Arabidopsis DGAT and LEC2 genes increases accumulation and shifts the composition of lipids in green biomass. Plant Biotechnol J 8(3):277–287. doi:10.1111/j.1467-7652.2009.00458.x
Arntzen CJ (2008) Using tobacco to treat cancer. Science 321(5892):1052–1053. doi:10.1126/science.1163420
Berger B, Parent B, Tester M (2010) High-throughput shoot imaging to study drought responses. J Exp Bot 61(13):3519–3528. doi:10.1093/jxb/erq201
Berná G, Robles P, Micol JL (1999) A mutational analysis of leaf morphogenesis in Arabidopsis thaliana. Genetics 152(2):729–742
Bindler G, van der Hoeven R, Gunduz I, Plieske J, Ganal M, Rossi L, Gadani F, Donini P (2007) A microsatellite marker based linkage map of tobacco. Theor Appl Genet 114(2):341–349. doi:10.1007/s00122-006-0437-5
Bindler G, Plieske J, Bakaher N, Gunduz I, Ivanov N, Van der Hoeven R, Ganal M, Donini P (2011) A high density genetic map of tobacco (Nicotiana tabacum L.) obtained from large scale microsatellite marker development. Theor Appl Genet 123(2):219–230. doi:10.1007/s00122-011-1578-8
Bolger A, Scossa F, Bolger ME et al (2014) The genome of the stress-tolerant wild tomato species Solanum pennellii. Nat Genet 46(9):1034–1038. doi:10.1038/ng.3046
Bolle C, Schneider A, Leister D (2011) Perspectives on systematic analyses of gene function in Arabidopsis thaliana: new tools, topics and trends. Curr Genom 12(1):1–14. doi:10.2174/138920211794520187
Bombarely A, Rosli HG, Vrebalov J, Moffett P, Mueller LA, Martin GB (2012) A draft genome sequence of Nicotiana benthamiana to enhance molecular plant-microbe biology research. Mol Plant Microbe Interact 25(12):1523–1530. doi:10.1094/MPMI-06-12-0148-TA
Bombarely A, Moser M, Amrad A et al (2016) Insight into the evolution of the Solanaceae from the parental genomes of Petunia hybrida. Nat Plants. doi:10.1038/Nplants.2016.74
Caldwell DG, McCallum N, Shaw P, Muehlbauer GJ, Marshall DF, Waugh R (2004) A structured mutant population for forward and reverse genetics in Barley (Hordeum vulgare L.). Plant J 40(1):143–150. doi:10.1111/j.1365-313X.2004.02190.x
Chang Y, Long T, Wu C (2012) Effort and contribution of T-DNA Insertion mutant library for rice functional genomics research in China: review and perspective. J Integr Plant Biol 54(12):953–966. doi:10.1111/j.1744-7909.2012.01171.x
Dadras AR, Sabouri H, Nejad GM, Sabouri A, Shoai-Deylami M (2014) Association analysis, genetic diversity and structure analysis of tobacco based on AFLP markers. Mol Biol Rep 41(5):3317–3329. doi:10.1007/s11033-014-3194-6
Finkel E (2009) With ‘phenomics’, plant scientists hope to shift breeding into overdrive. Science 325(5939):380–381. doi:10.1126/science.325_380
Fricano A, Bakaher N, Del Corvo M, Piffanelli P, Donini P, Stella A, Ivanov NV, Pozzi C (2012) Molecular diversity, population structure, and linkage disequilibrium in a worldwide collection of tobacco (Nicotiana tabacum L.) germplasm. BMC Genet 13:18. doi:10.1186/1471-2156-13-18
Fuchs J, Neuberger T, Rolletschek H, Schiebold S, Nguyen TH, Borisjuk N, Börner A, Melkus G, Jakob P, Borisjuk L (2013) A noninvasive platform for imaging and quantifying oil storage in submillimeter tobacco seed. Plant Physiol 161(2):583–593. doi:10.1104/pp.112.210062
Furbank RT, Tester M (2011) Phenomics-technologies to relieve the phenotyping bottleneck. Trends Plant Sci 16(12):635–644. doi:10.1016/j.tplants.2011.09.005
Gilchrist EJ, Haughn GW (2005) TILLING without a plough: a new method with applications for reverse genetics. Curr Opin Plant Biol 8(2):211–215. doi:10.1016/j.pbi.2005.01.004
Greene EA, Codomo CA, Taylor NE, Henikoff JG, Till BJ, Reynolds SH, Enns LC, Burtner C, Johnson JE, Odden AR, Comai L, Henikoff S (2003) Spectrum of chemically induced mutations from a large-scale reverse-genetic screen in Arabidopsis. Genetics 164(2):731–740
Grosskinsky DK, Pieruschka R, Svensgaard J, Rascher U, Christensen S, Schurr U, Roitsch T (2015) Phenotyping in the fields: dissecting the genetics of quantitative traits and digital farming. New Phytol 207(4):950–952. doi:10.1111/nph.13529
Hirakawa H, Shirasawa K, Miyatake K, Nunome T, Negoro S, Ohyama A, Yamaguchi H, Sato S, Isobe S, Tabata S, Fukuoka H (2014) Draft genome sequence of eggplant (Solanum melongena L.): the representative solanum species indigenous to the old world. DNA Res 21(6):649–660. doi:10.1093/dnares/dsu027
Jansen M, Gilmer F, Biskup B, Nagel KA, Rascher U, Fischbach A, Briem S, Dreissen G, Tittmann S, Braun S, De Jaeger I, Metzlaff M, Schurr U, Scharr H, Walter A (2009) Simultaneous phenotyping of leaf growth and chlorophyll fluorescence via GROWSCREEN FLUORO allows detection of stress tolerance in Arabidopsis thaliana and other rosette plants. Funct Plant Biol 36(10–11):902–914. doi:10.1071/FP09095
Kim S, Park M, Yeom SI et al (2014) Genome sequence of the hot pepper provides insights into the evolution of pungency in Capsicum species. Nat Genet 46(3):270–278. doi:10.1038/ng.2877
Kuromori T, Takahashi S, Kondou Y, Shinozaki K, Matsui M (2009) Phenome analysis in plant species using loss-of-function and gain-of-function mutants. Plant Cell Physiol 50(7):1215–1231. doi:10.1093/pcp/pcp078
Lein W, Usadel B, Stitt M, Reindl A, Ehrhardt T, Sonnewald U, Börnke F (2008) Large-scale phenotyping of transgenic tobacco plants (Nicotiana tabacum) to identify essential leaf functions. Plant Biotechnol J 6(3):246–263. doi:10.1111/j.1467-7652.2007.00313.x
Leitch IJ, Hanson L, Lim KY, Kovarik A, Chase MW, Clarkson JJ, Leitch AR (2008) The ups and downs of genome size evolution in polyploid species of Nicotiana (Solanaceae). Ann Bot 101(6):805–814. doi:10.1093/aob/mcm326
Liu F, Gong D, Zhang Q, Wang D, Cui M, Zhang Z, Liu G, Wu J, Wang Y (2015) High-throughput generation of an activation-tagged mutant library for functional genomic analyses in tobacco. Planta 241(3):629–640. doi:10.1007/s00425-014-2186-z
Marché L, Valette S, Grenier E, Mugniéry D (2001) Intra-species DNA polymorphism in the tobacco cyst-nematode complex (Globodera tabacum) using AFLP. Genome 44(6):941–946. doi:10.1139/gen-44-6-941
Menda N, Semel Y, Peled D, Eshed Y, Zamir D (2004) In silico screening of a saturated mutation library of tomato. Plant J 38(5):861–872. doi:10.1111/j.1365-313X.2004.02088.x
Minoia S, Petrozza A, D’Onofrio O, Piron F, Mosca G, Sozio G, Cellini F, Bendahmane A, Carriero F (2010) A new mutant genetic resource for tomato crop improvement by TILLING technology. BMC Res Notes 3:69. doi:10.1186/1756-0500-3-69
Munns R, James RA, Sirault XR, Furbank RT, Jones HG (2010) New phenotyping methods for screening wheat and barley for beneficial responses to water deficit. J Exp Bot 61(13):3499–3507. doi:10.1093/jxb/erq199
O’Malley RC, Ecker JR (2010) Linking genotype to phenotype using the Arabidopsis unimutant collection. Plant J 61(6):928–940. doi:10.1111/j.1365-313X.2010.04119.x
Oellrich A, Walls RL, Cannon EK, Cannon SB, Cooper L, Gardiner J, Gkoutos GV, Harper L, He M, Hoehndorf R, Jaiswal P, Kalberer SR, Lloyd JP, Meinke D, Menda N, Moore L, Nelson RT, Pujar A, Lawrence CJ, Huala E (2015) An ontology approach to comparative phenomics in plants. Plant Methods 11:10. doi:10.1186/s13007-015-0053-y
Potato Genome Consortium (2011) Genome sequence and analysis of the tuber crop potato. Nature 475(7355):189–195. doi:10.1038/nature10158
Rosso MG, Li Y, Strizhov N, Reiss B, Dekker K, Weisshaar B (2003) An Arabidopsis thaliana T-DNA mutagenized population (GABI-Kat) for flanking sequence tag-based reverse genetics. Plant Mol Biol 53(1–2):247–259. doi:10.1023/B:PLAN.0000009297.37235.4a
Saito T, Ariizumi T, Okabe Y, Asamizu E, Hiwasa-Tanase K, Fukuda N, Mizoguchi T, Yamazaki Y, Aoki K, Ezura H (2011) TOMATOMA: a novel tomato mutant database distributing Micro-Tom mutant collections. Plant Cell Physiol 52(2):283–296. doi:10.1093/pcp/pcr004
Sessions A, Burke E, Presting G, Aux G, McElver J, Patton D, Dietrich B, Ho P, Bacwaden J, Ko C, Clarke JD, Cotton D, Bullis D, Snell J, Miguel T, Hutchison D, Kimmerly B, Mitzel T, Katagiri F, Glazebrook J, Law M, Goff SA (2002) A high-throughput Arabidopsis reverse genetics system. Plant Cell 14(12):2985–2994. doi:10.1105/tpc.004630
Sierro N, Battey JN, Ouadi S, Bovet L, Goepfert S, Bakaher N, Peitsch MC, Ivanov NV (2013) Reference genomes and transcriptomes of Nicotiana sylvestris and Nicotiana tomentosiformis. Genome Biol 14(6):R60. doi:10.1186/gb-2013-14-6-r60
Sierro N, Battey JN, Ouadi S, Bakaher N, Bovet L, Willig A, Goepfert S, Peitsch MC, Ivanov NV (2014) The tobacco genome sequence and its comparison with those of tomato and potato. Nat Commun 5:3833. doi:10.1038/ncomms4833
Sirault XRR, James RA, Furbank RT (2009) A new screening method for osmotic component of salinity tolerance in cereals using infrared thermography. Funct Plant Biol 36(10–11):970–977. doi:10.1071/FP09182
Till BJ, Reynolds SH, Greene EA, Codomo CA, Enns LC, Johnson JE, Burtner C, Odden AR, Young K, Taylor NE, Henikoff JG, Comai L, Henikoff S (2003) Large-scale discovery of induced point mutations with high-throughput TILLING. Genome Res 13(3):524–530. doi:10.1101/gr.977903
Till BJ, Cooper J, Tai TH, Colowit P, Greene EA, Henikoff S, Comai L (2007) Discovery of chemically induced mutations in rice by TILLING. BMC Plant Biol 7:19. doi:10.1186/1471-2229-7-19
Tomato Genome Consortium (2012) The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485(7400):635–641. doi:10.1038/nature11119
Tong ZJ, Jiao TL, Wang FQ, Li MY, Leng XD, Gao YL, Li YP, Xiao BG, Wu WR (2012a) Mapping of quantitative trait loci conferring resistance to brown spot in flue-cured tobacco (Nicotiana tabacum L.). Plant Breed 131(2):335–339. doi:10.1111/j.1439-0523.2011.01940.x
Tong ZJ, Yang ZM, Chen XJ, Jiao FC, Li XY, Wu XF, Gao YL, Xiao BG, Wu WR (2012b) Large-scale development of microsatellite markers in Nicotiana tabacum and construction of a genetic map of flue-cured tobacco. Plant Breed 131(5):674–680. doi:10.1111/j.1439-0523.2012.01984.x
Tsai H, Howell T, Nitcher R, Missirian V, Watson B, Ngo KJ, Lieberman M, Fass J, Uauy C, Tran RK, Khan AA, Filkov V, Tai TH, Dubcovsky J, Comai L (2011) Discovery of rare mutations in populations: TILLING by sequencing. Plant Physiol 156(3):1257–1268. doi:10.1104/pp.110.169748
Tsai H, Missirian V, Ngo KJ, Tran RK, Chan SR, Sundaresan V, Comai L (2013) Production of a high-efficiency TILLING population through polyploidization. Plant Physiol 161(4):1604–1614. doi:10.1104/pp.112.213256
Vanhercke T, El Tahchy A, Liu Q, Zhou XR, Shrestha P, Divi UK, Ral JP, Mansour MP, Nichols PD, James CN, Horn PJ, Chapman KD, Beaudoin F, Ruiz-López N, Larkin PJ, de Feyter RC, Singh SP, Petrie JR (2014) Metabolic engineering of biomass for high energy density: oilseed-like triacylglycerol yields from plant leaves. Plant Biotechnol J 12(2):231–239. doi:10.1111/pbi.12131
Wang JW, Jiang CH, Song ZM, Zhang L, Wang L, Wang YY, Liu GS (2011) Mutagenic effects of ethyl methane sulfonate on flue-cured tobacco seeds. Chin Tob Sci 32(3):5. doi:10.3969/j.issn.1007-5119.2011.03.004
Wang N, Long T, Yao W, Xiong L, Zhang Q, Wu C (2013) Mutant resources for the functional analysis of the rice genome. Mol Plant 6(3):596–604. doi:10.1093/mp/sss142
Warnasooriya SN, Montgomery BL (2014) Using transgenic modulation of protein synthesis and accumulation to probe protein signaling networks in Arabidopsis thaliana. Plant Signal Behav 6(9):1312–1321. doi:10.4161/psb.6.9.16437
Wilson-Sánchez D, Rubio-Díaz S, Muñoz-Viana R, Pérez-Pérez JM, Jover-Gil S, Ponce MR, Micol JL (2014) Leaf phenomics: a systematic reverse genetic screen for Arabidopsis leaf mutants. Plant J 79(5):878–891. doi:10.1111/tpj.12595
Wu JL, Wu C, Lei C, Baraoidan M, Bordeos A, Madamba MR, Ramos-Pamplona M, Mauleon R, Portugal A, Ulat VJ, Bruskiewich R, Wang G, Leach J, Khush G, Leung H (2005) Chemical- and irradiation-induced mutants of indica rice IR64 for forward and reverse genetics. Plant Mol Biol 59(1):85–97. doi:10.1007/s11103-004-5112-0
Wu F, Eannetta NT, Xu Y, Plieske J, Ganal M, Pozzi C, Bakaher N, Tanksley SD (2010) COSII genetic maps of two diploid Nicotiana species provide a detailed picture of synteny with tomato and insights into chromosome evolution in tetraploid N. tabacum. Theor Appl Genet 120(4):809–827. doi:10.1007/s00122-009-1206-z
Wu Q, Wu X, Zhang X, Jiang C, Xiao B, Zhang Y, Wang Y, Liu G (2014) Mapping of two white stem genes in tetraploid common tobacco (Nicotiana tabacum L.). Mol Breed 34(3):1065–1074. doi:10.1007/s11032-014-0097-0
Xiao B, Tan Y, Long N, Chen X, Tong Z, Dong Y, Li Y (2015) SNP-based genetic linkage map of tobacco (Nicotiana tabacum L.) using next-generation RAD sequencing. J Biol Res (Thessalon) 22:11. doi:10.1186/s40709-015-0034-3
Xin Z, Wang ML, Barkley NA, Burow G, Franks C, Pederson G, Burke J (2008) Applying genotyping (TILLING) and phenotyping analyses to elucidate gene function in a chemically induced sorghum mutant population. BMC Plant Biol 8:103. doi:10.1186/1471-2229-8-103
Yang W, Duan L, Chen G, Xiong L, Liu Q (2013) Plant phenomics and high-throughput phenotyping: accelerating rice functional genomics using multidisciplinary technologies. Curr Opin Plant Biol 16(2):180–187. doi:10.1016/j.pbi.2013.03.005
Acknowledgements
We thank the members of The Agricultural Science and Technology Innovation Programs (ASTIP-TRIC01, ASTIP-TRIC02) who assisted with the mutant collection and characterization. This work was financially supported by the China Tobacco Genome Project [Grant Nos. 110201101009 (JY-03), 110201201004 (JY-04), and 110201301005 (JY-05)] and The Agricultural Science and Technology Innovation Program (ASTIP-TRIC02).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no conflicts of interest.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Wang, D., Wang, S., Chao, J. et al. Morphological phenotyping and genetic analyses of a new chemical-mutagenized population of tobacco (Nicotiana tabacum L.). Planta 246, 149–163 (2017). https://doi.org/10.1007/s00425-017-2690-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00425-017-2690-z