Introduction

Tocochromanols represent eight distinct compounds that are commonly referred to as vitamin E. These eight compounds are grouped into two classes, tocotrienols (α-, β-, γ-, and δ-tocotrienol) and tocopherols (α-, β-, γ-, and δ-tocopherol) (Kamal-Eldin and Appelqvist 1996). The role of tocochromanols in plants as potent antioxidants has been well-documented (Falk and Munné-Bosch 2010; Fryer 1992; Munné-Bosch and Alegre 2002). Studies in Arabidopsis demonstrated that the primary role of tocochromanols in plants is to protect the seed from lipid peroxidation, which can result in a gradual loss of seed viability (Sattler et al. 2004). There is also evidence that tocochromanols also have a role in quenching singlet oxygen in adult plants thereby reducing photo-oxidative damage to photosystem II and increasing overall photosynthetic efficiency (Havaux et al. 2005).

Tocochromanols are an essential nutrient in the human diet. Vitamin E deficiency in humans is rare but does occur when an individual is severely malnourished or has a genetic condition involving the dysfunction of α-tocopherol transferase. In these instances, a neurodegenerative disease known as ataxia with vitamin E deficiency (AVED) can develop (Ouahchi et al. 1995). Some tocochromanols have been shown to prevent oxidative damage to cells and are also associated with a reduction in deaths from cancer (Borek 2004; Kline et al. 2007; Wright et al. 2006) and coronary heart disease (Emmert and Kirchner 1999). Tocochromanol structure influences vitamin E activity, with α-tocopherol having the highest activity (Eitenmiller 1997) because of its preferential retention during digestion (DellaPenna 2005).

Maize is a staple food crop of global importance. Current maize varieties that produce grain for human consumption do not provide sufficient levels of vitamin E to meet recommended dietary requirements (Fitzpatrick et al. 2012). However, maize germplasm exhibits a substantial amount of tocochromanol variation (Chander et al. 2008; Li et al. 2012; Wong et al. 2003; Lipka et al. 2013) that can be exploited to improve the vitamin E content of this staple crop through biofortification (Bouis and Welch 2010). Genetic analysis of tocochromanol variability in maize grain will facilitate the identification of chromosomal targets for marker-assisted selection, which could ultimately expedite the production of maize grain with improved vitamin E status.

The first quantitative trait loci (QTL) study to analyze the natural variation of tocopherols in maize utilized composite interval mapping (CIM) in a F2:4 biparental mapping population with a genetic map containing 163 markers (Wong et al. 2003). Numerous QTL were identified, most notably those with the largest effects residing on chromosome 1 and 5, where maize γ-tocopherol methyl transferase (ZmVTE4) and p-hydroxyphenylpyruvic dioxygenase (ZmHPPD) were later located. A subsequent study (Chander et al. 2008) used a biparental recombinant inbred line (RIL) population and a genetic map with 203 markers, including tocochromanol biosynthetic gene-targeted markers. Of the 31 QTL identified in this study, ten contained ZmVTE4, ZmVTE5 (phytol kinase), or ZmHPPD within their support intervals (Chander et al. 2008). More recently, a QTL study using two F2:3 biparental mapping populations that shared a common parent and a high-density linkage map constructed from 1536 single-nucleotide polymorphism (SNP) markers located 30 QTL associated with various tocopherols, some of which contained ZmVTE4 and ZmHPPD in the support intervals (Shutu et al. 2012).

Genome-wide association studies have been conducted to analyze tocochromanol content in maize. The first reported tocochromanol genome-wide association study (GWAS) in maize analyzed tocopherols in 500 diverse maize inbred lines and identified three polymorphisms associated with α-tocopherol variation, two indels within ZmVTE4, and another SNP that is approximately 85 kb upstream of ZmVTE4 (Li et al. 2012). A more recent GWAS (Lipka et al. 2013) provided a comprehensive analysis of tocochromanol genetic architecture in maize by analyzing six tocochromanol compounds and 14 derived sums, ratios, and proportions in the Goodman diversity panel, which consists 281 lines (Flint-Garcia et al. 2005). This study reported the first association for tocotrienols in maize and strengthened the evidence for the association between ZmVTE4 and α-tocopherol.

The objective of this research was to locate QTL responsible for tocochromanol variation in maize. Two biparental mapping populations were created specifically for this purpose, with parents of the respective populations representing contrasting phenotypes for two different tocochromanol traits. Both populations were phenotyped using liquid chromatography for tocopherol and tocotrienol content of the maize grain and a de novo genetic map was constructed for each population from genotyping-by-sequencing (GBS)-produced sequence data (Elshire et al. 2011). To our knowledge, this research represents the first QTL mapping study in maize where all tocochromanols were included in the analysis, rather than just tocopherols. We hypothesized that these methods would allow (1) associations with additional loci to be detected through the inclusion of measurement of additional tocochromanol compounds and calculated sums and ratios of those compounds, and (2) the use of GBS-derived higher marker density genotypic data would allow for narrower QTL intervals than was previously possible thus allowing for finer resolution of the genetic architecture.

Materials and methods

Germplasm

The tocochromanol phenotypic data (Lipka et al. 2013) collected on the Goodman diversity panel and the founder parents of the maize nested association mapping population (NAM) (McMullen et al. 2009) were used to identify four lines that exhibited variation for tocochromanol compounds of interest. These lines (N6, NC296, E2558W, and Co126) were selected for unique tocochromanol variation that was not present in the NAM founders and was used to create two biparental mapping populations. The parents of the N6xNC296 population exhibited extremely low and high levels of α-tocopherol and α-tocotrienol in the grain, whereas the parents of the E2558WxCo125 population exhibited extremely high and low ratios of total tocopherols to total tocotrienols in the grain, relative to the other lines measured.

For each population, F2:3 families were produced by crossing the two respective parents to generate F1 seed. The F1 seed was planted and self-pollinated to produce F2 seed which was planted and self-pollinated to produce F2:3 progeny. The planted population size was 231 F2:3 families for N6xNC296 and E2558WxCo125. After filtering for missing genetic and phenotypic measurements, the final dataset of the N6xNC296 population consisted of 213 F2:3 families and for E2558WxCo125 consisted of 197 F2:3 families.

Field experimental design

One replicate of the N6xNC296 population and two replicates of the E2558WxCo125 population were grown during the summer of 2012 at the Purdue Agronomy Center for Research Education (ACRE) in West Lafayette, Indiana, USA. The second replicate of the N6xNC296 F2:3 was grown during the summer of 2013 at ACRE in West Lafayette, Indiana, USA. The experiment was planted in an augmented incomplete block design (Federer and Raghavarao 1975) with each block containing both of the respective population parents. Each replicate of the population consisted of 11 blocks. Within each block, there were 21 individuals and four checks, representing two checks for each of the respective parents. This design resulted in 231 individuals and 44 checks per replicate.

Tocochromanol analysis

The two mapping populations, including experimental checks, were analyzed by liquid chromatography (LC) for α-tocopherol (αT), δ-tocopherol (δT), γ-tocopherol (γT), α-tocotrienol (αT3), δ-tocotrienol (δT3), and γ-tocotrienol (γT3). The high-throughput extraction method and subsequent LC analysis were conducted as previously described (Lipka et al. 2013) with modifications detailed here. Approximately 20 g of maize grain samples was ground from a bulk of each F2:3 family using a commercial Foss Cyclotec 1093 sample mill (ThermoFisher Scientific, Waltham, MA). The ground samples were stored in cryogenic Poly-Con containers for long-term storage at − 80 °C until ready for LC analysis (US Plastic, Lima, Ohio). Extraction and LC solvents (Sigma-Aldrich, St. Louis, MO) used were certified HPLC grade. For extraction of the tocochromonal compounds, 15–20 mg of ground maize seed was transferred into a 1.4-ml U-bottom bar-coded extraction tube (Micronic USA, Aston, PA) containing two glass beads (5 mm size). Four hundred microliters of extraction solution was added to each tube. The extraction solution is composed of 60:40 v:v acetone:ethyl acetate containing 1 mg/mL of butylated hydroxytoluene (Sigma-Aldrich, St Louis, MO). In addition, 150 μL of HPLC grade water (Sigma-Aldrich, MO) was added to each tube. The 96-well plate of samples was capped with strip caps and shaken on a TissueLyser at 20 Hz for 10 min. Samples were centrifuged for 10 min at 250g in a Sorvall Legend RT centrifuge (Kendro Laboratory Products, Newtown, CT). After centrifugation, 200 μL of the upper organic phase was transferred into a 750-μL tube and dried in a Speedvac (ThermoFisher Scientific, Waltham, MA) at room temperature for approximately 30 min. The dried samples were re-suspended by shaking on a microplate shaker for 15 min at 2000 rpm in 100 μL of re-suspension solution which consisted of 3:1 v:v methanol:methyl tert-butyl ether. The plates were then centrifuged for 5 min at 2500g and the supernatant transferred to a microtiter plate.

For LC analysis, 10 μL of the re-suspended extraction was injected into a Shimadzu HPLC LC-20AD (Shimadzu, Kyoto, Japan). The tocochromonal compounds were separated on a YMC 3.0 × 100 mm C30 reverse-phase column with a 3-μm particle size (YMC, Kyoto, Japan). The column oven temperature during separation was 30 °C. Mobile phase A consisted of methyl tert-buthyl ether. Mobile phase B consisted of 90:10 (v:v) of methanol:ammonium acetate (1 M, pH 4.6). The mobile phases were pumped at a rate of 0.8 mL/min using the following gradient: 0 to 12 min at 0% B to 60% B, 12 to 17.5 min at 60% B to 22.5% B, 17.5 to 19.5 min at 22.5% B to 100% B, and 19.5 to 21 min at 100% B, followed by re-equilibration of mobile phase A. The tocochromanol compounds were detected using a Shimadzu (Kyoto, Japan) RF-535 fluorescence detector with 290-nm excitation and 325-nm emission wavelengths. The tocochromanols were quantified using external standard curves constructed with authentic standards of αT, δT, γT, αT3, δT3 (Sigma-Aldrich, St Louis, MO), and γT3 (Cayman Chemical, Ann Arbor MI). The reverse-phase LC conditions applied in this study could not completely resolve β-tocochromanols and γ-tocochromanols, and therefore, the β-tocochromanols were not measured. β-Tocochromanols are considered a minor component in maize, and the γ-tocochromanols present as the primary components in the maize tocochromonal profile and the exclusion of β-tocochromanols in our analysis is consistent with previous studies on maize tocochromanols (Lipka et al. 2013; Shutu et al. 2012; Li et al. 2012; Chander et al. 2008). This procedure was used to analyze one sample from each field plot of the two replicates of each population.

Phenotypic data analysis

A total of 20 phenotypic traits were analyzed, including the six primary phenotypes (αT, δT, γT, αT3, δT3, and γT3) and 14 derived sums, ratios, and proportions. The sums included total tocopherols (TT), total tocotrienols (TT3), and total tocochromanols (TT + TT3). The ratios of biological relevance for tocochromanol analysis are αT/γT, αT3/γT3, γT/(γT + αT), γT3/(γT3 + αT3), δT/(γT + αT), δT/αT, δT/γT, δT3/(γT3 + αT3), δT3/αT3, δT3/γT3, and TT/TT3 (Lipka et al. 2013). In individuals where the δT and δT3 levels were below detection limits, a uniformly distributed random number was assigned that was between zero and the minimum level of detection (Lubin et al. 2004). Studentized deleted residuals (Kutner et al. 1996) were obtained by fitting a mixed linear model using SAS version 9.3 (SAS Institute 2012) with the genotypes and blocks set as random effects, and the checks set as fixed effects. The studentized deleted residuals were used to identify outliers in the final dataset. The heritability (h2) of each trait was calculated on a line-mean basis using the Gibbs sampling algorithm (Casella and George 1992; Sorensen et al. 1994; Yang et al. 2007).

Best linear unbiased predictors (BLUPs) of each trait for each line were predicted from a random effects model analysis across environments in ASReml version 3.0 (Gilmour et al. 2009) using the following model:

$$ {\mathrm{Y}}_{ij k}=\upmu +{\mathrm{gen}}_i+{\mathrm{env}}_j+{\left(\mathrm{gen}\times \mathrm{env}\right)}_{ij}+{\upvarepsilon}_{ij k}, $$

where Y ijk is an individual observation, μ is the overall mean, gen i is the main effect of the ith genotype, env is the main effect of the jth environment, (gen × env) ij is the two-way interaction effect between the ith genotype and jth environment, and ε ijk denotes the random error term. All terms in the model except for the intercept are random.

Genotyping and genetic map construction

The genotypes representing the F2 generation of the N6xNC296 and E2558WxCo125 populations were derived from bulking tissue samples from ten F2:3 plants. The genotypes were determined using the genotyping-by-sequencing (GBS) procedure (Elshire et al. 2011) performed at The Institute for Genomic Diversity at Cornell University, in addition to the bioinformatics analysis for calling single-nucleotide polymorphism (SNP) markers. The original genotypic dataset for N6xNC296 and E2558WxCo125 consisted of 955,690 SNPs for each population.

The Full-Sib Family Haplotype (FSFHap) imputation method was used to correct for the high degree of missing data and undercalled heterozygotes that are prevalent in GBS data (Swarts et al. 2014). The imputation and subsequent filtering of SNPs were conducted in Trait Analysis by Association, Evolution and Linkage (TASSEL) version 5.0 (Bradbury et al. 2007). Before imputation was conducted, any SNP or individual that had more than 20% missing data was filtered out. Imputation and subsequent filtering for minor allele frequencies (MAF) less than 0.30 resulted in 89,125 SNPs for the N6xNC296 population and 76,081 SNPs for the E2558WxCO125 population that was polymorphic between the parents of the respective mapping population.

A subset of SNPs was selected that gave high-density coverage of the genome for each population. Redundant SNP markers were identified and removed using a customized R script (R Development Core Team 2011). OneMap was used to calculate recombination fractions, form linkage groups, test for segregation distortion, order the markers, and estimate genetic maps (Margarido et al. 2007). The recombination fractions were converted to map distance using the Kosambi mapping function (Vinod 2011). Markers that tested significant for segregation distortion using a Bonferroni correction were removed. The COMPARE and TRY algorithms (Lander and Kruglyak 1995) were used in OneMap to order the markers in their respective linkage groups.

Quantitative trait loci analysis

Quantitative trait loci mapping was performed to identify chromosomal regions that contribute to tocochromanol variation in maize. The QTL analyses were conducted using the standard model (6) in QTL-Cartographer (Basten et al. 1994) to conduct composite interval mapping (CIM) with a walking speed of one centimorgan (cM) (Zeng 1993, 1994). The model included up to five marker covariates, which entered and exited the model based on forward and backward regression model fitting, to account for variation within a three cM window. The significance thresholds were determined for each trait individually using 1000 permutation tests in each of the populations (Churchill and Doerge 1994). The QTL analysis was conducted on the BLUPs that represent a combined data point of the two replications of each population. The QTL that exceeded the permutation thresholds were reported, and for each QTL, the one logarithm of odds (LOD) support interval, additive effect, dominance effect, and cM position were estimated. The markers immediately flanking the support interval were manually identified. The physical position of each SNP marker, according to the B73 reference genome v2 (Lai et al. 2010), was used to identify a physical position interval. The physical positions of the tocochromanol biosynthetic genes were determined from the Maize Genomic Database (Maize GDB) (Andorf et al. 2010). The presence of putative tocochromanol biosynthetic genes in the one LOD support intervals was reported.

Results

Phenotypic variability

The N6xNC296 population was created to analyze variation for α-T and α-T3. The α-T content for the parental checks (n = 22) was 5.97 and 51.89 μg/g for N6 and NC296, respectively. The α-T content for the entire N6xNC296 F2:3 population was 33.08 μg/g, and ranged from 9.4 to 90.91 μg/g for individual families. The α-T3 content was 23.54 and 66.55 μg/g for N6 and NC296, respectively. The α-T3 content for this population was 46.34 μg/g and ranged from 28.86 to 76.27 μg/g. Thus, α-T and α-T3 both showed transgressive segregation to the high end of their phenotypic distributions, but not to the low end. This pattern of transgressive segregation was also observed for αT, γT, and αT3 in the N6xNC296 population (Table 1).

Table 1 Descriptive statistics for the N6xNC296 parents and population BLUPs

The E2558WxCo125 population was created to study variation in the ratio of total tocotrienols to total tocopherols (TT/TT3). The observed TT/TT3 was 0.92 and 8.29 for E2558W and Co125 parental checks (n = 22), respectively. The TT/TT3 for the population was 5.51 and ranged from 5.39 to 9.26, which indicates transgressive segregation at the high end of the distribution (Table 2).

Table 2 Descriptive statistics for the E2558WxCo125 parents and population BLUPs

Genetic linkage map

A de novo high-density genetic map was constructed for each population. The original genotypic dataset for the N6xNC296 population had 57% missing data and only 1% heterozygous SNPs. After filtration and imputation with FSFHap, there were 0% missing data and 58% heterozygous SNPs. The genetic map for the N6xNC296 population (Fig. 1) consists of 1280 SNP markers, with an average marker spacing of 1.08 cM. Similarly, the original genotypic dataset for the E2558WxCo125 population had 55% missing data and 1% heterozygous SNPs. After filtration and imputation with FSFHap, there was 0% missing data and 56% heterozygous SNPs. The genetic map for the E2558WxCo125 population (Fig. 2) consists of 1249 markers, with an average spacing of 1.1 cM between markers (Table 3).

Fig. 1
figure 1

The high-density genetic map for the N6xNC296 population. The left side of the chromosome is the centimorgan (cM) position of the marker and the right side is the marker name

Fig. 2
figure 2

High-density genetic map for the E2558WxCo125 population. The left side of the chromosome is the centimorgan (cM) position of the marker and the right side is the marker name

Table 3 The total marker number and length in centimorgans (cM) of the N6xNC296 and E2558WxCo125 genetic maps

Quantitative trait loci analysis

For the N6xNC296 population (Table 4), a total of 31 QTL were detected that contained a tocochromanol biosynthetic gene or explained substantial phenotypic variation for the primary traits. Two QTL were detected for the ratio trait, δT3/(γT3 + αT3). One QTL explained 25% of the phenotypic variation for δT3/(γT3 + αT3) and contained homogentisic acid geranylgeranyl transferase (ZmHGGT1) in the 5.1 cM support interval on chromosome 9. For TT3, a QTL on chromosome 9 was mapped that explained 22% of the variation and contained homogentisate phytyltransferase (ZmVTE2) in the 2.8 cM support interval. For γT/(γT + α)T, a QTL on chromosome 5 explained 53% of the phenotypic variation and contained γ-T methyl transferase (ZmVTE4) within the support interval. A QTL was mapped on chromosome 5 that explained 5% of variation for αT but did not contain a known tocochromanol biosynthetic gene in the 7.7 cM support interval. Another QTL was detected on chromosome 5 that explained 14% of the αT3 variation and also did not contain a known tocochromanol biosynthetic gene in the 4.1 cM support interval.

Table 4 The QTL detected using BLUPs calculated from replicates 1 and 2 of the N6xNC296 population

For the E2558WxCo125 population (Table 5), a total of 58 QTL were detected, with those that contained a tocochromanol biosynthetic gene or explained substantial phenotypic variation for primary traits presented here. A QTL for TT3 on chromosome 5 contained hydroxyphenylpyruvate dioxygenase (ZmHPPD1) in the 12 cM support interval. A QTL for TT/TT3 on chromosome 5 explained 10% of the phenotypic variation but did not contain a known tocochromanol biosynthetic gene. One QTL for TT was mapped on chromosome 2 and explained 4% of the phenotypic variation. Two QTL for TT3 were mapped on chromosomes 5 and 8 and explained 7 and 5% of the variation, respectively.

Table 5 The QTL detected using BLUPs calculated from replicates 1 and 2 of the E2558WxCo125 population

Discussion

The approach for this study originated from review of tocochromanol content data on the Goodman diversity panel and founders of the maize NAM population. These data revealed maize lines in the Goodman diversity panel that exhibited unique variation for tocochromanol compounds of interest. A GWAS was conducted on the Goodman diversity panel for tocochromanol compounds and the respective sums, ratios, and proportions to explore the genetic architecture of these traits (Lipka et al. 2013). A limitation of this GWAS was that it used an underpowered population of 252 inbreds, which limits ability to detect rare causative alleles. Therefore, to complement the GWAS and gain a better understanding of the genetic architecture of tocochromonal accumulation in maize, the N6xNC296 and E2558WxCo125 populations were developed and analyzed.

Genetic architecture of tocochromanols in maize

The populations analyzed both displayed transgressive segregation for traits of interest. This transgressive segregation facilitated mapping of the genetic architecture of tocochromanols in maize. The analysis of the N6xNC296 mapping populations revealed QTL for T3 and TT/TT3 that contained a putative homogentisate phytyltransferase (ZmVTE2) gene in the support interval that previously has not been associated with these traits in maize. The ZmVTE2 and ZmHGGT1 genes lie within 15 megabases from each other on chromosome 9, and previously, it may not have been possible to resolve the positions of two distinct QTL without the use of a high-density linkage map. In the tocochromanol biosynthetic pathway, ZmVTE2 condenses homogentisic acid with phytyl diphosphate to yield 2-methyl-6-phytylbenzoquinol (MPBG), which is the precursor for tocopherols. Given that the ZmVTE2 was associated with a QTL for total tocotrienols and the ratio of total tocotrienols to total tocopherols, the gene mapped here may indicate that one parent has a weaker allele that sends more flux into the tocotrienol branch of the pathway. Notably, the associated SNP positions relative to the reference genome indicate ZmHGGT1 is outside of the T3 and TT/TT3 QTL intervals, providing some support for ZmVTE2 as a candidate gene. However, further work needs to be performed to be able to conclude that polymorphic regions in ZmVTE2 segregate with the phenotype.

Results from the two mapping populations support some associations previously reported in GWAS and QTL studies of tocochromanol variation in maize grain. Results from the N6xNC296 population further strengthened previous associations of tocotrienol traits with ZmVTE1, ZMVTE4, and ZmHGGT1, and from the E2558WxCo125 population further strengthened previous associations of tocotrienol traits with ZmVTE1, ZmVTE4, and ZmHPPD (Wong et al. 2003; Chander et al. 2008; Shutu et al. 2012; Lipka et al. 2013).

High-density linkage mapping with GBS produced SNPs

The F2:3 generation of each population was sequenced using GBS that was used to produce two high-density genetic maps. The GBS procedure has commonly been used in maize to genotype individuals for GWAS, and due to the low cost, it is gaining popularity for genotyping biparental mapping populations for QTL analysis. A limitation of the low coverage multiplexing GBS procedure is that it results in a high degree of missing genotypic information and heterozygous loci in individuals that are undercalled (Nielsen et al. 2012). This occurs because in order to call heterozygotes correctly, the sample must be covered by at least two reads which should come from sister chromatids. With next-generation sequencing methods like GBS, there is a greater chance that only one of the two sister chromatids of a diploid individual is sampled at a specific base pair position (Nielsen et al. 2011). Specifically, when using GBS in maize, it is expected that only 12% of the genome will be sampled two or more times and this introduces error when identifying heterozygous individuals (Swarts et al. 2014). To overcome these limitations of GBS, imputation was used. The research presented here used an imputation method called FSFHap that utilizes the hidden Markov model and the Viterbi algorithm to impute missing data and improve heterozygote calling (Swarts et al. 2014). The low-cost sequencing information available from GBS was improved with this imputation approach so that accurate high-density genetic maps could be constructed.

Increased QTL resolution with high-density linkage mapping

High-density genetic maps with an average interval spacing of one cM or less have been reported to provide increased QTL resolution, improved effect estimates for QTL, and increased power to resolve closely linked QTL (Stange et al. 2013). In the N6xNC296 population, using 213 F2:3 families and 1280 markers, we detected a QTL associated with γT/(γT + αT) that contained ZmVTE4 in a support interval of 1.4 cM. An earlier study on a biparental population of 233 recombinant inbred lines with 201 markers reported a QTL associated with α-T that contained ZmVTE4 within a support interval of 10 cM (Chander et al. 2008). Another linkage mapping study detected QTL associated with αT, γT, and αT /γT that contained ZmVTE4 in a 14.6-cM interval (Shutu et al. 2012). This study used two populations of 237 and 218 F2:3 families with a common parent and 468 and 357 markers on each population, respectively. Our study provides results consistent with these earlier reports, and our higher-resolution mapping provides more precise support of ZmVTE4 as a candidate gene for tocochromanol traits involving αT and γT.

The complementary nature of GWAS and linkage mapping

Previous GWAS in maize has provided useful information on genetic architecture of tocochromanols (Li et al. 2012; Lipka et al. 2013). Yet notably, there are QTL detected in experimental biparental crosses that are not detected in diverse association panels used in GWAS (Gibson 2012). A main limitation of GWAS is that in the case of small sample size, there is not enough statistical power to detect certain QTL. This is especially true when causal variants have a low MAF (Yang et al. 2010). This limitation may be overcome by utilizing traditional composite interval mapping in biparental populations. Biparental populations by design increase the probability of detecting rare alleles (Gibson 2012). The two parents of biparental mapping populations are usually chosen because they show considerable differences in the trait of interest. If one of the parents carries a rare causative allele, that allele will be at a much higher frequency in the biparental mapping population in comparison to an association panel of the same number of individuals where the MAF for a rare causative allele would be very low. If one parent carries the rare allele, the frequency of that allele is approximately 0.5 in a biparental population, which should increase the ability to detect the QTL containing that allele. The parental lines in this study were previously part of a panel used in a GWAS for tocochromanol variation in maize, where an association with ZmVTE2 was not detected. This illustrates the complementary nature of using high-density QTL mapping as a complementary analysis to GWAS to better characterize and refine the genetic architecture of a trait.

Parental lines with contrasting phenotypes were chosen to create the biparental mapping populations used in this study. In addition to the detection of a novel association that was made with ZmVTE2, QTL intervals containing the genes for all of the major branch points in the tocochromanol pathway were detected. The allelic variation in these five genes, and some families in this population with extreme transgressive segregation for tocol traits, could be used in a breeding program to improve tocochromanol levels in maize grain resulting in improved grain products that could directly impact human and animal nutrition.