Introduction

Alternative sources of fuel to replace petroleum are urgently needed because worldwide reserves are decreasing while global demand is exponentially increasing. Bioethanol production from lignocellulosic biomass represents a sustainable alternative to fossil fuels because of its low overall emission of greenhouse gases [1], its abundance and its high content of energy-rich polysaccharides [2]. On the other hand, cell wall (CW) resistance to enzymatic degradation is currently the principal barrier to ethanol production from lignocellulosic feedstock [2].

With its high biomass production and its capacity to symbiotically fix atmospheric nitrogen, perennial alfalfa (Medicago sativa L.) has considerable potential as a lignocellulosic feedstock for the sustainable production of bioethanol [3]. Additionally, alfalfa cultivation, harvesting and storage methods are well established. These established processing procedures convey a strong advantage to using alfalfa as a bioenergy feedstock if we consider that the cost for biomass collection, storage, preprocessing and transport to the biorefinery gate can reach between 35 % and 65 % of the total production costs of cellulosic ethanol [4]. The value of alfalfa as a lignocellulosic feedstock depends largely on its chemical composition. More specifically, the fermentable CW sugar concentration, mainly glucose from cellulose, should be larger than that of compounds that restrict enzymatic access to fermentable carbohydrates, such as lignin and hemicellulose, during the saccharification process [3]. The relative proportion of CW fermentable polysaccharides and interfering compounds also varies with stem maturity. While the cellulose content increases, the lignin and xylose concentrations also increase, resulting in a reduction of CW degradability in mature alfalfa stems [5]. Delaying harvest from late flowering to green pod maturity stages has been shown to increase the overall availability of fermentable carbohydrates because the increased yield surpasses the loss of degradability [6]. This production system also has the advantage of minimizing crop management and harvest costs. Another advantage of alfalfa as bio-ethanol production feedstock is its relatively high degradability. For instance, we found that up to 40 % of highly lignified stems are available for ethanol production after enzymatic saccharification without any pre-treatments (unpublished results).

Alfalfa is an open-pollinated autotetraploid species (2n = 4x = 32) with extensive genetic variability that can be exploited for genetic improvement [7, 8]. Significant differences in CW composition among alfalfa cultivars and individual genotypes have been reported [5, 9], but this potential has not been fully exploited, as shown by the lack of systematic selection for this trait [3]. In a recent study, we showed that a large genetic variability of CW degradability in response to enzymatic hydrolysis exists within and among multiple alfalfa genetic backgrounds [10]. Glucose released from cell wall following enzymatic hydrolysis is a function of both glucose concentration and the efficiency with which that glucose is released. Although this previous study included only a limited number of genotypes, we found that enzyme-released glucose, a direct measure of CW degradability, varied from 70 to 120 mg g−1 CW. In addition, the biomass-type cultivars of Flemish origin (Orca and Europe) were less degradable, on average, than the winterhardy-type cultivars (54V54, Amerigraze and Perfect) found in North America. This previous determination of CW degradability was based on the evaluation of a limited number of genotypes because the enzymatic screening test developed was time-consuming and labour intensive. In order to use the natural genetic diversity of alfalfa to develop new biofuel-type cultivars, high-throughput CW degradability screening methods are needed.

Near-infrared reflectance spectroscopy (NIRS) technology is based on the correlation of near-infrared reflectance spectra of scanned samples with actual analytical measurements of the constituents by other analytical methods [11, 12]. NIRS is a high-throughput method requiring minimal sample preparation. It has been successfully applied in alfalfa stem samples to predict the concentration of many structural carbohydrates [6, 13], soluble carbohydrates [14] and, with less accuracy, total lignin [13, 15]. Furthermore, NIRS has been shown to successfully predict glucose released by enzymatic saccharification following pretreatment in corn stover [16] as well as the ethanol yield in switchgrass [17]. Thus, NIRS allows the screening of a large number of genotypes, increasing the likelihood of identifying genotypes with extreme phenotypes, which are highly valuable for genetic improvement.

Characterization of CW composition of genotypes varying in degradability is essential for unravelling the underlying biochemical factors associated with the variability of stem degradability. Acid hydrolysis of CWs is a procedure commonly used to separate the complex CW matrix into simple, analysable fractions. For instance, polymeric carbohydrates are turned into monomers and total lignin is separated into acid-soluble and acid-insoluble fractions. Combined with analytical methods such as liquid or gas chromatography, spectroscopy and thermogravimetry, chemical composition and relative abundance of CW constituents can accurately be determined [18]. More recently, “glycome profiling”, a powerful tool to study plant CW structure and function was developed [19]. This technique consists of determining and comparing glycan epitope compositions amongst sequential CW extracts by ELISA using a comprehensive toolkit of 147 glycan-directed monoclonal antibodies [20].

The objectives of this study were: (a) to validate NIRS prediction of enzyme-released glucose for the high-throughput assessment of genetic variability in alfalfa, (b) to identify genotypes with high (D+) and low (D−) stem degradability, (c) to identify underlying factors associated with the differences in their degradability using glycome profiling and acid hydrolysis analysis of CWs of pooled D+ and D− genotypes and (d) to intercross genotypes with superior (D+) or inferior (D−) enzyme-released glucose and assess the selection potential for this trait.

Materials and Methods

Plant Material

Field Nursery

Alfalfa was seeded in a greenhouse on March 27, 2008 and transplanted to a field in Lévis QC, Canada (Kamouraska clay loam; 46° 9′ 15″ N, 71° 12′ 00″ W; altitude ≈45 m) on June 4, 2008. Individual plants were transplanted at 90 cm intervals. Weeds were controlled by a combination of pre-emergence herbicides and hand weeding. A number of genotypes from each of three winterhardy-type cultivars (136 genotypes of the cultivar Perfect, 119 genotypes of the cultivar Amerigraze and 136 genotypes of the cultivar 54V54) and one biomass-type cultivar (228 genotypes of the cultivar Orca) were individually harvested on September 22, 2009 at the green pod maturity stage. Plants were harvested at this late stage of maturity to ensure that stems were lignified. At that stage, internode elongation and radial growth have ceased, and cellulose, hemicellulose and pectin concentrations are stabilized [5]. Only the bottom parts (25 cm) of the plants were harvested in order to further minimize plant maturity differences [21].

Samples were dried for 48 h at 55°C in a forced-air oven and leaves were removed by shaking. Dried stems were ground a first time using a Wiley mill (model Digital ED-5 midsized mill; Arthur H. Thomas Co., Philadelphia PA, USA) fitted with a 2-mm screen, and a second time with a Cyclone Lab Sample mill (UDY Corporation, Fort Collins, CO, USA) fitted with a 1-mm screen.

Plant Crosses

For two cultivars, Orca and 54V54, 20 genotypes with high (D+) and low (D−) degradability, as assessed by enzyme-released glucose, were identified based on NIRS predictions and a subsequent validation by enzymatic saccharification. These genotypes were transplanted in 15-cm pots and transferred to a greenhouse at the end of the 2009 growing season. Plants within each group (D+ and D−) were intercrossed to generate four populations: Orca D+, Orca D−, 54V54 D+ and 54V54 D−. Crosses were carried out by hand with toothpicks. Mature seeds were harvested from individual plants and identical amounts from each genotype within each D+ and D− group were pooled. All seed lots used and produced in this study were kept under optimal storage conditions (4°C, 10 % relative humidity).

Progeny Assessment

Approximately 110 plants of each progeny population, Orca D−, Orca D+, 54V54 D− and 54V54 D+, were sown in a greenhouse under a 25/15°C (day/night) temperature regime and a 16-h photoperiod provided by a combination of natural and artificial irradiance using mercury vapour lamps (H36GW-1000/DX; Osram Sylvania, Missisauga, ON, Canada). Plants were transplanted to the field in spring 2010. At the end of October 2010, plants were cut, transplanted into 15-cm pots and transferred to a greenhouse. At the green pod stage, the lower 25 cm of the stems was assessed for enzyme-released glucose by NIRS [21].

Analysis of Alfalfa Stem Composition

Non-Structural Carbohydrate Quantification and CW Preparation

Removal of sugar and starch from the alfalfa biomass was done based on the method of Theander et al. [22]. Approximately 200 mg of ground stem material was incubated in 7 mL of deionised H2O at 100°C for 90 min. Tubes were subsequently centrifuged 10 min at 1,500×g and 500 μL of the supernatant was collected for quantification of soluble carbohydrates (sucrose, glucose, pinitol and fructose) by high-performance liquid chromatography (HPLC) using a Model 515 pump, a Model 717plus autosampler and a Model 2410 refractive index detector, controlled by Empower 2 software (Waters, Milford, MA, USA). Carbohydrates were separated on an Aminex HPX-87P column (300 mm × 7.8 mm × 9 μm) preceded by a Carbo-P pre-column (Bio-Rad, Mississauga, ON, Canada) and eluted isocratically at 80°C, at a flow rate of 0.5 mL min−1, with deionised H2O.

Tubes containing the remaining supernatant and pellet were subsequently vortexed and starch was hydrolyzed by adding 3 mL of digestion buffer (200 mM sodium acetate, pH 4.5) containing amyloglucosidase (15 U mL−1; Sigma-Aldrich, Oakville, ON, Canada). Tubes were incubated 60 min at 55°C and then centrifuged 10 min at 1,500×g. The supernatant was collected for quantification of glucose by HPLC, as described above. Starch was quantified by subtracting the amount of soluble glucose from the total amount of glucose measured following digestion with amyloglucosidase. Non-structural carbohydrates (NSC) were estimated by the sum of soluble carbohydrates and starch. Pellets remaining after extraction of soluble carbohydrates and starch were washed three times with methanol at 60°C and air dried to obtain CW preparations. Approximately 115 mg of CW was obtained from 200 mg starting material.

Structural Carbohydrate and Lignin Quantification

Structural carbohydrates (glucose, xylose, galactose, arabinose, mannose, rhamnose, fucose and galacturonic acid) were quantified by HPLC following enzymatic saccharification or acidic hydrolysis.

Enzymatic saccharification was carried out according to the modified method of Selig et al. [23]. Briefly, CW preparations were incubated for 48 h at 50°C with shaking in 10 mL of buffer containing 50 mM sodium citrate, 0.02 % (w/v) sodium azide and 25 μL of each of the following enzyme preparations: Accellerase 1500 (2638 CMC U/g cellulase activity and 674 p-NPG U/g β-glucosidase activity, Genencor, Rochester, NY, USA); Accellerase XY (22454 ABXU/g xylanase activity, Genencor), Accellerase XC (2500–3800 ABXU/g xylanase activity and 1207 CMC U/g cellulase activity, Genencor) and Pectinex 3XL (3000 FD U/g pectinase activity, Sigma-Aldrich, Oakville, ON, Canada).

Acid hydrolysis was done with H2SO4 according to the National Renewable Energy Laboratory (NREL) analytical protocols [18]. A 20-mL aliquot of the hydrolysate was neutralized with CaCO3 prior to the quantification of sugars. Glucose, xylose, galactose, arabinose and mannose were quantified by HPLC using an Aminex HPX-87P column (Bio-Rad, Hercules, CA, USA) preceded by a Deashing pre-column (Bio-Rad) and eluted isocratically at 80°C, at a flow rate of 0.5 mL min−1 with H2O. Galacturonic acid, rhamnose and fucose were separated using a HPX-87H column preceded by a Cation H pre-column (Bio-Rad) and eluted at 55°C with 0.01 N H2SO4 at a flow rate of 0.6 mL min−1. Peak identity and quantity were determined by comparison to standards. Carbohydrate concentrations were expressed either on a 105°C dry matter (DM) basis or on a 105°C cell wall (CW) basis.

Total lignin was determined as the sum of Klason lignin and acid-soluble lignin. Klason lignin was estimated as the ash-free, acid-insoluble residue remaining after acidic hydrolysis, using a thermogravimetric analyzer (Model TGA 701, Leco, St-Joseph, MI, USA). Acid soluble lignin was quantified by spectroscopy (Lambda 35; Perkin Elmer, Waltham, MA, USA) with an absorptivity constant of 25 L g−1 cm−1 at a wavelength of 240 nm.

Near Infrared Reflectance Spectroscopy

Dried and ground alfalfa stem samples were scanned over 400 to 2498 nm, at 2 nm intervals, using a NIR-Systems 6500 monochromator (Foss, Silver Spring, MD, USA), as described in Nie et al. [14]. Seventy-five samples were selected based on the scores to form a calibration set (n = 60) and a validation set (n = 15). The 75 samples were chemically analysed for carbohydrate fractions (non structural and structural carbohydrates, lignin and enzyme-released glucose) as previously described.

Glycome Profiling

Sequential Extraction of CW Samples

The milled dried biomass samples were first suspended in 80 % (v/v) ethanol (100 mL g−1) and stirred for 1 h followed by centrifugation at 3,000×g for 15 min at room temperature. The supernatants were discarded and the pellets were sequentially washed with absolute ethanol and acetone. The resultant biomass residues were dried overnight under a hood to make the CW preparations [alcohol-insoluble residues (AIR)]. Sequential extraction of the AIR was carried out with reagents of increasing harshness on 0.3 g of four samples of alfalfa stems. Each sample was a pool of 20 genotypes of contrasting degradability representing: (a) Orca, highly degradable (D+); (b) Orca, poorly degradable (D−); (c) 54V54, highly degradable (D+); (d) 54V54, poorly degradable (D−). All extractions were done in 10 mg mL−1 suspensions, based on the starting AIR weight used. Each sample was first suspended in 50 mM ammonium oxalate (pH 5) with constant agitation for 24 h at room temperature. The suspension was then centrifuged at 3,400×g for 15 min at room temperature to pellet the insoluble residue, and the clear supernatant was decanted and saved. The pellet was re-suspended with the same volume of deionized H2O, centrifuged again and the supernatant was discarded. Subsequent extractions were carried out in a similar manner using sequentially the following reagents, 50 mM Na2CO3 containing 0.5 % (w/v) NaBH4 (pH 10), 1 M KOH and 4 M KOH each containing 1.0 % (w/v) NaBH4. Following the 4 M KOH step, each sample was extracted using sodium chlorite (100 mM) [24] to break down the lignin polymers into small fragments. The insoluble residue after the chlorite extraction was once again subjected to a 4 M KOH [containing 1.0 % (w/v) NaBH4] extraction as explained above. All 1 M KOH and 4 M KOH extracts were neutralized using glacial acetic acid. All extracts were dialyzed against deionized H2O four times (with sample to water ratio of 1:60) at room temperature and freeze-dried individually.

Total Sugar Estimation and ELISA

All extracts were dissolved in deionised H2O at a concentration of 0.2 mg mL−1. The total sugar estimation was done using a phenol-sulphuric acid colorimetric assay [25, 26]. The ELISA plates (Costar 3598) were coated with 50 μL well−1 of CW extracts (60 μg sugar mL−1) and allowed to evaporate to dryness overnight at 37°C. ELISAs were performed as described in Pattathil et al. [19]. Briefly, the glycome profiling analyses involves ELISA-based screening of CW materials following the sequential extractions, with a toolkit of 147 monoclonal antibodies (See Supplemental Table 1) directed against diverse plant CW glycan epitopes. The collection of 147 monoclonal antibodies used comprised 19 defined groups that recognize most major classes of plant CW glycans including non-fucosylated xyloglucans, fucosylated xyloglucans, pectic backbones, arabinogalactans and xylans [19]. ELISA results are presented as heat maps of binding intensities of antibodies to the extracts [20].

Monoclonal Antibodies

Monoclonal antibodies were obtained as hybridoma cell culture supernatants from laboratory stocks at the Complex Carbohydrate Research Center (CCRC, JIM and MAC series antibodies are available from CarboSource Services, www.carbosource.net).

The assessment of CW components of the four pools of samples was also done by chemical analysis following acid hydrolysis as described above. Furthermore, the degradability of the four pools of samples was measured by enzymatic saccharification and was predicted by NIRS.

Statistical Analysis

Concentrations of structural carbohydrates of alfalfa genotypes were assessed individually and results for each genetic background and levels of degradability were compared statistically. The P < 0.05 probability level was used to determine significance. When model parameters had a significant F test, the least-significant difference method was used to compare means. All biochemical analyses were done in duplicate.

Results

NIRS Prediction of Carbohydrate Fractions

The validation statistics of the NIRS calibration (Table 1) showed that most non structural carbohydrates could be successfully predicted by NIRS. For instance, sucrose, glucose and fructose had R 2 values of 0.90, 0.93 and 0.96, respectively. Total non-structural carbohydrates were the most successfully predicted, with a R 2 value of 0.97, while starch alone was less predictable (R 2 = 0.78). The concentrations of the CW structural carbohydrates rhamnose, mannose, xylose and galactose, measured by HPLC after acid hydrolysis, were not well predicted, with R 2 varying from 0.09 for galactose to 0.57 for mannose. The R 2 for the prediction of soluble and insoluble lignins were of 0.78 and 0.68, respectively. CW degradability, measured as glucose released by enzymatic saccharification, was successfully predicted by NIRS with a R 2 of 0.94 (Table 1).

Table 1 Statistics on carbohydrate and lignin concentrations (mg g−1 DM) in samples of the calibration sets used to develop near-infrared reflectance spectroscopy (NIRS) equations and statistics on the performance of each NIRS prediction

Assessment of CW Degradability

NIRS prediction of enzyme-released glucose of 619 highly lignified alfalfa stem samples revealed a large genetic variability for CW degradability among and within the four cultivars studied (Fig. 1). On average, degradability of winterhardy-type cultivars (54V54, Amerigraze and Perfect) tended to be higher than the biomass-type cultivar Orca (P = 0.1). On average, enzyme-released glucose amounts were 130.0, 136.5, 140.5 and 140.6 mg g−1 CW for Orca, 54V54, Amerigraze and Perfect, respectively. Within each cultivar, enzyme-released glucose amounts varied extensively: from 89.8 to 169.8 mg g−1 CW for Orca, from 98.0 to 168.8 mg g−1 CW for 54V54, from 106.3 to 177.1 mg g−1 CW for Amerigraze and from 104.1 to 176.8 mg g−1 CW for Perfect. Enzyme-released glucose amounts showed a strong negative correlation (R = −0.83) with total lignin concentrations in mature alfalfa stems (Fig. 2).

Fig. 1
figure 1

NIRS prediction of glucose released from alfalfa stems by enzymatic saccharification for the biomass-type cultivar Orca (228 samples), and the three winterhardy-type cultivars 54V54 (136 samples), Amerigraze (119 samples) and Perfect (136 samples). The full line in boxes represents the 50th percentile (the median) and the dotted line represents the mean. A box represents the interquartile range (IQR) in which 50 % of the ranked data are found (the bottom and top of the box are the 25th and 75th percentile, respectively). Dots represent the outliers which values exceed ± 1.5 IQR

Fig. 2
figure 2

Relationship between the amounts of enzyme-released glucose and total lignin in alfalfa stems of four cultivars, as predicted by NIRS (y = −0.7077x + 308.88, R 2 = 0.6914)

The 10 most (D+) and the 10 least (D−) degradable Orca and 54V54 genotypes, established by NIRS, were individually analysed to determine their CW composition as determined by acid hydrolysis (Table 2). CW degradability (enzyme-released glucose) was also confirmed by submitting the 40 genotypes to the enzymatic saccharification assay (Pearson rank correlation value of 0.84). The assay showed that for both cultivars the D+ genotypes had significantly higher amounts of enzyme-released glucose (>35 %) than the D− genotypes (P < 0.001). Comparing the results of enzymatic and acidic digestions showed that 36.6 % of total CW glucose was released during the enzymatic assay for the Orca D+ genotypes, as compared to only 28.9 % for the Orca D− genotypes. Similar results were obtained for the 54V54 cultivar, with 34.6 % of available CW glucose released for the D+ genotypes and only 26.4 % for the D− genotypes.

Table 2 Average CW composition (±SD) of 10 genotypes of Orca D+, Orca D−, 54V54 D+ and 54V54 D−a

Acid hydrolysis also showed that xylose concentrations from the hemicellulose fraction of CW, as well as the galactose concentrations from the pectin fraction of CW, differed significantly between Orca and 54V54 cultivars: concentrations of both components were higher in 54V54 (Table 2; P = 0.004). Within each cultivar, lignin and enzyme-released glucose differed significantly between the two classes of degradability, D+ and D− (P < 0.0001). Average lignin concentrations were higher in D− (222 mg g−1 CW) than in D+ genotypes (198 mg g−1 CW). Conversely, amounts of enzyme-released glucose were higher in D+ (169 mg g−1 CW) than in D− genotypes (124 mg g−1 CW).

Cellulose concentrations, assessed as acid hydrolysis-released glucose, differed significantly between both cultivars (P = 0.002) and degradability groups (P = 0.02), with higher concentrations in 54V54 than in Orca and in D+ than in D− (Table 2).

Glycome Profiling Studies

Ground stem samples of the 20 most (D+) and 20 least (D−) degradable Orca and 54V54 genotypes were pooled and submitted to glycome profiling. Overall, the glycome profiles of Orca D+, Orca D−, 54V54 D+ and 54V54 D− pools were similar, indicating that there were no major differences in CW carbohydrate extractability or CW epitope composition between the samples tested (Fig. 3). Small differences between the two genetic backgrounds (Orca vs 54V54) were nevertheless present. For instance, the binding patterns of the xylan-3 and the xylan-4 groups of antibodies showed slight differences in the carbonate and 1 M KOH extracts when comparing the two genetic backgrounds.

Fig. 3
figure 3

Glycome profiling of sequential cell wall extracts of four alfalfa biomass samples pooled from 20 plants: (a) high degradability (D+) biomass-type cultivar Orca, (b) low degradability (D−) cultivar Orca, (c) high degradability (D+) winterhardy-type cultivar 54V54, (d) low degradability (D−) cultivar 54V54. The 18 groups of antibodies against CW glycan epitopes (See Supplemental Table 1) are identified on the right panel of the figure, while the cell wall extractants used are indicated at the top of the figure in order of increasing harshness (left to right). Binding intensities of antibodies to the extracted CW glycans, as measured in ELISAs, are depicted as heatmaps with black to bright yellow colors representing no binding to strongest binding, respectively. The boxes outlined in white dots highlight the largest differences observed in the glycome profiles between the four alfalfa samples

The winterhardy type 54V54 pools exhibited slightly more oxalate extractable pectin (as indicated by the higher binding of HG-backbone and RG-I backbone groups of antibodies) and arabinogalactan and pectic arabinogalactan epitopes (as indicated by the higher binding of RG-I/AG, AG-1, AG-2, AG-3 and AG-4 groups of antibodies) than do the Orca pools. These differences are marked in Fig. 3 as dotted white blocks. There are other subtle differences in the glycome profiles between the different plant pools. The significance of these differences, if any, in terms of altered recalcitrance, will require additional studies.

Selection for Stem Degradability

The 20 most (D+) and the 20 least (D−) degradable Orca and 54V54 genotypes identified by NIRS were pooled into four groups (Orca D−, Orca D+, 54V54 D− and 54V54 D+) and submitted to the enzyme-released glucose assay to confirm the predicted values and to ensure we had truly contrasting material to proceed with the crosses. These four samples were the same ones used for glycome profiling studies. Predicted and measured enzyme-released glucose amounts obtained from the pools were very similar (Fig. 4).

Fig. 4
figure 4

Amounts of enzyme-released glucose as assessed by NIRS (black) and by HPLC (gray). Pools of the 20 most degradable and the 20 least degradable genotypes of the biomass-type cultivar Orca and the winterhardy-type 54V54 were used. Each bar represents the mean ± SE (n = 3)

For each cultivar, there was a significant difference of degradability between the D+ and the D− pools (P < 0.001, Figs. 4 and 5, Parents), with average enzyme-released glucose amounts of 156 mg g−1 CW for the D+ pools and 116 mg g−1 CW for the D− pools. This contrasting material was then suitable to start a first cycle of recurrent selection. No significant difference was observed between the two cultivars.

Fig. 5
figure 5

NIRS prediction of glucose amounts released by enzymatic saccharification of alfalfa stems in parent plants (20 plants for each group of parents) and in progenies obtained after one cycle of recurrent selection (110 plants for each group of progenies). Parent plants were field-grown and the 20 most (D+) and 20 least (D−) degradable genotypes of both Orca and 54V54 cultivars were selected. Progenies were derived from crosses within the 20 genotypes of each 54V54 D−, 54V54 D+, Orca D− and Orca D+ and were greenhouse-grown. See Fig. 1 for box plot explanations

Assessment of enzyme-released glucose of 110 progenies issued from each of the four crosses (Orca D−, Orca D+, 54V54 D− and 54V54 D+) revealed significant differences in enzyme-released glucose amounts between the classes of degradability and the cultivars (P < 0.001; Fig. 5, Progenies). The average enzyme-released glucose amounts of 54V54 D+ and D− progenies were of 196.5 mg g−1 CW and 185.6 mg g−1 CW, respectively. The difference between D+ and D− Orca progenies was even larger, with average amounts of 193.3 mg g−1 CW for the D+ and of 163.3 mg g−1 CW for the D−.

Discussion

NIRS Predictions of CW Degradability

High-throughput screening methods for CW degradability are needed to identify genetic material prone to enzymatic degradation and, consequently, to develop a biofuel-type alfalfa germplasm. In the current report, we document that NIRS can successfully predict glucose released by enzymatic saccharification of lignified alfalfa stems, the core step in bio-ethanol production from cellulosic feedstock. We gathered evidence that this screening approach is sensitive, robust and reliable and that it can readily predict enzyme-released glucose of hundreds of alfalfa genotypes of various cultivars. NIRS has been shown to successfully predict enzyme-released glucose in maize stovers [16] as well as ethanol yield in switchgrass [17]. In alfalfa, in vitro neutral detergent fibre digestibility (dNDF) has been successfully predicted [27]. dNDF reflects the availability of cell wall carbohydrates to rumen enzymes, a trait closely associated to the convertibility of biomass into ethanol. Using NIRS screening, we found that an extensive genetic variability for CW degradability exists within and between Orca and winterhardy-type alfalfa cultivars. All cultivars considered, enzyme-released glucose amounts varied from 90 to 180 mg g−1 CW. We also found that enzyme-released glucose was generally greater in winterhardy-type cultivars than in the cultivar Orca. Finally, we observed some morphological differences at harvest between the cultivars under study: Orca had larger stem diameter with hollow pith and longer internodes than the winter-hardy type cultivars.

Assessment of CW Composition and Degradability

The strong negative correlation between enzyme-released glucose and lignin content support the idea that cross-linking of lignin with CW polysaccharides interferes with enzymatic degradation of cellulose and limits the conversion of herbaceous plant material into ethanol, as reported previously [3, 28]. Similar results have been observed with transgenic alfalfa featuring low lignin content [29]. Suppression of early lignin biosynthesis genes led to a 50 % reduction in lignin content in comparison to the wild-type phenotype. In comparison, using our NIRS-based screening approach, with all cultivars considered, we found that the D+ genotypes had on average 20 % less lignin than the D− genotypes, with total lignin concentrations of 185 mg g−1 CW and 231 mg g−1 CW, respectively. Low-lignin alfalfa plants, such as the D+ lines identified here, could reduce the reliance on high-cost pre-treatments that are commonly used with lignocellulosic feedstock prior to enzymatic saccharification, the products of which can inhibit fermentation [30]. On the other hand, a lower lignin content could also reduce biomass yield [31]. For instance, Chen and Dixon [29] observed a 40 % yield reduction in low-lignin alfalfa transgenic lines compared to wild-type alfalfa. However, this yield loss was offset by a large increase in fermentable sugar content. Taken together, our results suggest that NIRS screening of broad-based populations is a promising tool for the development of alfalfa having low lignin content while maintaining high biomass yield.

Alfalfa is a good potential bioethanol feedstock [6], but much of the effort to develop new biomass-type crops has focussed on grasses because they produce high yields of cellulose-rich biomass [31]. With concentrations of glucose in the cell wall, determined by acid hydrolysis, exceeding 430 mg g−1 CW (Table 2), mature alfalfa stems can yield higher cellulose concentrations than grasses [31]. After making a conversion from mg g−1 DW to mg g−1 of CW and considering an average of 70 % CW, the glucose concentration of alfalfa stems that we measured is comparable to values around 420 mg g−1 CW reported by others for alfalfa [15, 32]. The slightly higher cellulose concentrations observed here are likely due to a more advanced stage of maturity (green pod) at harvest as compared to the full flower stage used in other reports, as well as to the fact that we only used the highly lignified lower part of the stems (25 cm) for our assessments. Our rationale for using more mature plants was based on the fact that, while the recovery of enzyme-released glucose decreases with maturity due to higher lignin concentrations, glucose yield may actually increase because of higher cellulose content [32]. Larger cellulose contents are advantageous for ethanol production because glucose is efficiently converted to ethanol by industrial yeast strains [33].

Glycome Profiling Studies

Glycome profiling was previously done on a number of different plant biomass samples. The profiles obtained for alfalfa pools in our study are distinct from other plant species such as poplar [34], nicotiana [35] and sorghum [36]. The glycome profiling approach showed that CW composition of alfalfa stems from 54V54 and Orca as well as of D+ and D− groups share many similarities. Acidic hydrolysis revealed that D+ and D− genotypes had significantly different concentrations of glucose, galacturonic acid and total lignin. Also, Orca and 54V54 had different amounts of glucose, xylose, galactose and galacturonic acid. Interestingly, those significant differences in CW composition did not result in major shifts in CW carbohydrate extractability, as shown by glycome profiling. However, glycome profiling revealed a few differences between the samples analysed, with more differences observed between the two genetic backgrounds (Orca versus 54V54) than between the two levels of degradability (D+ versus D−). The glycome profile differences observed between the two cultivars suggest marginally higher polysaccharide extractability for the winterhardy-type than for the biomass-type cultivar, which is in accordance with the results obtained from acid hydrolysis. On the other hand, although chemical analyses showed that lignin concentrations differed markedly between D+ and D−, CW polysaccharide extractability was not significantly affected. Altogether, these results suggest that the >35 % difference in CW degradability (enzyme-released glucose) between D+ and D− genotypes, as predicted by NIRS and validated by biochemical assays, could be associated with subtle differences in CW extractable polysaccharides.

Selection for CW Degradability

One of the major roadblocks in the improvement of lignocellulosic biomass conversion to ethanol relates to the difficulty in exploiting a large heterogeneity of stem degradability to develop improved genetic material. Here, we showed that NIRS can be fully integrated in a breeding program for identifying germplasm with highly contrasting CW degradability as assessed by enzyme-released glucose. This method was applied to identify 20 high-degradable D+ genotypes that were > 35 % more degradable (154 mg g−1 CW of enzyme released glucose) than the 20 low-degradable D− genotypes (112 mg g−1 CW). The assessment of enzyme-released glucose from progenies issued from these crosses confirmed that this trait is genetically inherited. Around 110 progenies from each cross were assessed for enzyme-released glucose by NIRS prediction, confirming significant differences between both cultivars and groups of genotypes. On average, enzyme-released glucose differed by 6 % between the D+ and the D− progenies for 54V54 while it differed by 18 % between these groups in Orca, showing a strong response to divergent selection for both cultivars. Interestingly, when assessing enzyme-released glucose of the 25 most degradable progenies at late flowering, we obtained values greater than 220 mg g−1 CW for both Orca and 54V54 without pre-treatment (data not shown). Yield of enzymatically released glucose is the result of both glucose concentration in the biomass and the efficiency with which that glucose is released from polysaccharides. Although glucose concentration is higher in D+ selections due to lower lignin content, our results clearly show that D+ lines had significantly higher glucose released yield than D− lines. Whether the two germplasms achieved improved glucose released in D+ lines by the same mechanism remains to be investigated.

Conclusions

NIRS can accurately predict CW degradability (enzyme-released glucose) in mature alfalfa stems of various cultivars. The use of this high-throughput screening method on a large number of genotypes revealed a large genetic variability for this trait in alfalfa. Through recurrent selection, we were able to exploit this heterogeneity of stem degradability to produce a new germplasm more prone to enzymatic saccharification. Assessment of CW degradability (enzyme-released glucose) of progenies, after one cycle of divergent selection, clearly showed the possibility to select for this trait. Progenies with contrasting CW degradability characteristics are currently being used to develop molecular markers for this trait.