Introduction

Cotton (Gossypium sp.) is the most important natural fiber in the world (Wakelyn et al. 2010). Its fibers are mainly composed of secondary cell wall (SCW) cellulose that determines fiber properties and quality (Hsieh 1999; Kim et al. 2014, 2017; Meinert and Delmer 1977). For phenotyping cotton fibers and interpreting cotton genomics information, the cotton research community has used two automated instruments, high-volume instrument (HVI) and advanced fiber information system (AFIS). These systems were originally designed for determining the market value of cotton fibers or for predicting spinning performance and yarn quality. Micronaire (MIC) and maturity ratio (MR) values indirectly measured by the automated instruments are used for phenotyping SCW of cotton fibers and interpreting genetic and genomic data. However, they are not sensitive enough to detect minor differences in SCW levels from cotton fibers (Kim et al. 2017; Paudel et al. 2013; Rodgers et al. 2013).

To identify cotton genes regulating fiber properties, cotton scientists have been using comparative genomic analyses that compare the transcriptomic profiles or gene expression patterns of two developing cotton fibers differing in fiber properties or quality (Kim 2015). When these genomic data were interpreted with the fiber phenotypic data that were mostly measured by the HVI and AFIS instruments, common genes were identified for regulating two different fiber development processes, SCW development (Wang et al. 2014) and fiber elongation (Fang et al. 2014). Thus, methods for phenotyping SCW cellulose more reliable and sensitive than HVI and AFIS are needed to improve gene discovery and fiber quality through cotton genetic and genomic studies that are rapidly advancing with emerging techniques.

Several methods including Updegraff cellulose content analysis (Updegraff 1969) and X-ray diffraction (XRD) (Nam et al. 2016), and image analysis microscopy of cross-sectioned fibers (Gordon and Rodgers 2017) can be used for monitoring SCW cellulose from cotton fibers. However, those methods are unfavorable to cotton geneticists and breeders due to their lengthy and complicated processes (Rodgers et al. 2013). An indirect method, Cottonscope is used as a supplement to the direct image analysis (Brims and Hwang 2010). It rapidly estimates the degree of SCW development by measuring birefringence intensity from developing cotton fibers (Long et al. 2010) despite the results can be affected by the levels of physiological sugars causing stickiness in developing fibers (Kim et al. 2014).

In contrast, Fourier transform infrared (FT-IR) spectroscopy has been extensively used by plant biologists to identify SCW in a rapid and non-invasive way from model plants like Arabidopsis and corn (McCann et al. 1992). To analyze large data sets of FT-IR spectra effectively, a mathematical technique known as principal component analysis (PCA) is frequently used (Abidi et al. 2010a, 2014; Kemsley 1998). The first principal component (PC1) score was observed to increase with cotton fiber DPA consistently and significantly (Abidi et al. 2010a, 2014; Liu and Kim 2015), while the second PC (PC2) score could provide complementary information to the PC1 score. Thus, the FT-IR spectroscopy with the PCA method was successfully used for screening for mutants that were SCW deficient, studying SCW formation, identifying novel genes involved in SCW cellulose biosynthesis, and determining roles of plant polysaccharide compositions in cell walls (Brown 2005; Chen et al. 1998; McCann et al. 2001; Mouille et al. 2003; Vorwerk et al. 2004).

FT-IR spectroscopy and PCA method have been introduced to study cotton fiber. Abidi and his colleagues used the entire FT-IR spectra (600–3800 cm−1) for PCA analysis to monitor cotton fiber development by evaluating fiber wall composition (Abidi et al. 2008), SCW (Abidi et al. 2010a), and sugar composition (Abidi et al. 2010b). Later, there were several attempts to extract cotton fiber SCW information from a few specific FT-IR bands with new algorithms. The intensity ratios of three FT-IR bands such as 956, 1032, and 1500 cm−1 (Liu et al. 2011), 708, 730, and 800 cm−1 (Liu et al. 2012a), and 1236, 1315, or 1800 cm−1 (Liu and Kim 2015) were proposed to correlate with fiber maturity, crystallinity, and fiber development, respectively. The integrated intensities of two FT-IR bands, 667 and 897 cm−1 were also proposed to correlate with the percentage crystallinity (Abidi et al. 2014; Abidi and Manike 2017). The FT-IR methods using either entire spectra or a few specific bands were tested with limited numbers of upland cotton varieties, but were not validated with various cotton species and lines grown in different conditions that cotton biologists frequently use for their studies. Therefore, the FT-IR methods have been rarely utilized by cotton geneticists and breeders. Interestingly, the FT-IR spectral regions corresponding to cellulosic components have not been tested unlike the entire FT-IR spectra and specific bands.

In this present work, we determined which FT-IR methods can quantitatively determine SCW cellulose development for various cotton fibers including developing and developed fibers from different cotton species grown in planta or in vitro. In addition to the two known FT-IR methods analyzing entire spectra (600–3800 cm−1) or specific bands (708, 730, and 800 cm−1), we analyzed the various cotton fibers with the FT-IR spectral regions corresponding to cellulosic components (700–760, 950–1020, 1100–1200, 1260–1340, and 3200–3400 cm−1). Comparisons of the SCW cellulose levels determined by the FT-IR methods with the reference methods including Updegraff, XRD, and gene expression assays showed strengths and weakness of each FT-IR method. Among the three FT-IR methods, a FT-IR method with the specific FT-IR bands allowed quantifying cotton fiber SCW cellulose, whereas the two other methods could only distinguish SCW from PCW qualitatively.

Experimental

Materials and sample preparation

Developing cotton fibers were harvested from Gossypium hirsutum, Texas marker-1 (TM-1, PI 607172) that is a standard polyploid upland cotton variety for genetic and genomic research (Kohel et al. 1970; Zhang et al. 2015) and G. arboreum Shixiya1 (SXY1) that is a standard diploid cotton used for genome sequencing (Li et al. 2014). Cotton flowers grown in the field of USDA-ARS in New Orleans, LA were tagged at the day of anthesis (DOA) and developing fibers from 10 to 30 cotton bolls were harvested from multiple cotton plants for two biological replications at various developmental stages in 2011 (10, 17, 24, 28, 33, and 37 days post anthesis, DPA) and 2015 (12, 23, and 30 DPA). The developing fibers were manually cut from the seeds and dried at 40 °C. Cotton bolls were fully developed and opened at 42–44 DPA. The fully dried and developed fibers were harvested and ginned with a laboratory roller gin. The soil type in New Orleans is Aquent dredged over alluvium and is elevated to provide adequate drainage.

Cotton ovule culture

Unfertilized ovules at flowering day (DOA) from the upland variety TM-1 were used for cotton ovule culture. The ovules were incubated on Beasley and Ting (BT) medium with two different phytohormone conditions (Beasley and Ting 1974). One contained a natural auxin, indole-3-acetic acid (IAA, 5.0 µM) with 0.5 μM gibberellic acid (GA3) and the other had a synthetic auxin, 1-naphthaleneacetic acid (NAA, 5.0 µM) with 0.5 μM GA3. Cotton ovules were cultured in the darkness at 30 °C in a 5% CO2 atmosphere. The cultured ovules at different developmental stages (13, 17, 24, and 27 DPA) were harvested and manually cut from the seeds. The collected fibers were frozen immediately with liquid nitrogen for RNA extraction or dried in a 40 °C incubator for physical property analyses.

RNA extraction and gene expression assay by RT-qPCR

Total RNA was extracted from developing fibers at 13, 17, 24 and 27 DPA using the Sigma Spectrum™ Plant Total RNA Kit (Sigma-Aldrich, St. Louis, MO) with DNase I digestion according to the manufacturer’s protocol. The quality and quantity of total RNA were determined using a NanoDrop 2000 spectrophotometer (NanoDrop Technologies Inc., Wilmington, DE). Two biological replications were used for RNA extractions. The experimental procedures and data analysis related to RT-qPCR were performed according to the Minimum Information for Publication of Quantitative Real-Time PCR Experiments guidelines. Abundances of GhCesA2 transcripts encoding cellulose synthase catalytic subunit were compared by RT-qPCR analyses. The detailed description of cDNA preparation, qPCR, and calculations were previously reported (Kim et al. 2013a). Specific primer pairs were designed from the GhCesA2 encoding cellulose synthase catalytic subunit (Table 1). The endogenous reference genes, 18S rRNA and α-tubulin 4, were used for normalizing the transcript levels. Three biological replications and three technical replications for each time-point were used for RT-qPCR.

Table 1 Primer sequences for quantitative RT-PCR

Cellulose content measurement

Cellulose contents of developing fibers at various DPAs and fully developed fibers were measured by the method described by Updegraff (1969) with minor modifications. Dried fiber samples were cut into small pieces. Ten milligrams of the fibers were placed to 5 mL of reacti-vials. Non-cellulosic materials in fibers were hydrolyzed with the Updegraff reagent (73% acetic acid, 9% nitric acid, and 18% water). The remaining cellulose was hydrolyzed with 67% sulfuric acid (v/v) and measured by a colorimetric assay with anthrone. Avicel PH-101 (FMC, Rockland, ME) was used as a cellulose standard. A650 from the samples and standards was measured by a plate reader (ThermoMax, Molecular Devices, Sunnyvale, CA). The average cellulose content for fibers was obtained from two biological and three technical replications.

ATR FT-IR spectral collection and data analysis

All spectra were collected with an FTS 3000MX FTIR spectrometer (Varian Instruments, Randolph, MA) equipped with a ceramic source, KBr beam splitter, and deuterated triglycine sulfate (DTGS) detector as previously described in Liu and Kim (2015). The ATR sampling device utilized a DuraSamplIR single-pass diamond-coated internal reflection accessory (Smiths Detection, Danbury, CT) and consistent contact pressure was applied by way of a stainless steel rod and an electronic load display. At least six measurements at different locations for individual samples were collected over the range of 600–4000 cm−1 at 4 cm−1 and 16 co-added scans. All spectra were given in absorbance units and no ATR correction was applied. Following the import to GRAMS IQ application in Grams/AI (Version 9.1, Thermo Fisher Scientific, Waltham, MA), the spectra were smoothed with a Savitzky–Golay function (polynomial = 2 and points = 11). Then, the spectral set was loaded into Microsoft Excel 2000 to assess CI IR values from IR measurement by using a previously proposed algorithm analysis (Liu et al. 2011, 2012a). Principal component analysis (PCA) calculations for IR spectra were performed in the 600–3800 cm−1 IR region with mean centering (MC) and Savitzky–Golay first-derivative (2 degrees and 13 points) spectral pretreatment and also with leave-one-out cross-validation method, after these spectra were normalized by dividing the intensity of individual band in the 600–3800 cm−1 region with the average intensity in this 600–3800 cm−1 region.

X-ray diffraction (XRD)

XRD experiments were performed with a PANalytical Empyrean diffractometer (PANalytical, Netherlands), equipped with a Cu X-ray source (λ = 1.5404 Å), and operated at 45 kV and 40 mA. The instrumental profile was calibrated with LaB6 (NIST 660a). Fully dried cotton samples were ground in a Wiley mill to 20 mesh, hand pressed into a pellet, and placed on a quartz zero-background holder before analysis. Scans were measured in the 2θ range of 8°–45° in 0.05° steps. The fitting procedures using a peak deconvolution method for cellulose crystallinity index (CIXRD) in developing cotton fibers were described earlier (Lee et al. 2015b). The diffraction data were also analyzed by Rietveld method that is incorporated in the MAUD software (Lutterotti 2010). The refinements typically included 11 variables, including scale factor, quadratic background, unit cell a and gamma, crystallite size, amorphous particle size, preferred orientation along (00l) axis for both phases, and amount of phase 1, amount of phase 2. The detailed Rietveld refinement information of the developing fibers was previously described in Kim et al. (2017).

Statistical analyses

Statistical analyses and construction of graphs were performed using t test, two-way ANOVA and Prism version 5 software (Graph-Pad Software, Inc., San Diego, CA). The p-value cutoff for significance was 0.05.

Results and discussion

Determination of cotton fiber SCW cellulose development by a chemical reference assay

Cotton fibers elongated during PCW biosynthesis stage after they were differentiated from ovule epidermis of upland cotton variety TM-1 around the day of anthesis (Fig. 1a). Cellulose contents of developing fibers were determined by the Updegraff method (Updegraff 1969), which is a destructive chemical analysis requiring lengthy and laborious processes, but also generally considered as a reference method for determining SCW development in upland cotton fibers (Meinert and Delmer 1977). Developing fibers at PCW stage (10 DPA) consisted of low cellulose content (6.7%) while they were actively elongated (Fig. 1b and Table 2). At transition (17 DPA) from PCW to SCW biosynthesis stage, the cellulose content of developing cotton fibers increased dramatically to 32.8%. During the active SCW biosynthesis stage (20–37 DPA), the cellulose contents continued increasing and reached 90.4% at 37 DPA. Cotton bolls became fully developed and opened approximately 42–44 DPA. The fully developed fibers were harvested at 44 DPA after they were dehydrated in the field. They were composed of nearly pure cellulose. The TM-1 fiber development pattern was consistent with the developmental pattern that was previously reported by our group and others (Haigler 2010; Kim 2015; Kim and Triplett 2001; Meinert and Delmer 1977).

Fig. 1
figure 1

Cotton fiber development and cellulose biosynthesis. a Development of an upland cotton TM-1 variety. Four major fiber developmental stages are classified based on the physical properties described in Kim (2015). The ovary walls were removed from developing cotton bolls to show the cotton fiber growth inside the bolls. Fully developed bolls were naturally opened at 42–44 DPA and developed fibers were harvested at 44 DPA. PCW, primary cell wall stage for fiber elongation; SCW, secondary cell wall stage for cellulose biosynthesis; DPA, days post anthesis. b Cellulose content was determined from developing TM-1 fibers (10–44 DPA) by the method described by Updegraff (1969)

Table 2 Fiber properties of developing G. hirsutum TM-1 fibers measured by Updegraff assay and ATR FT-IR spectroscopy

Determination of FT-IR spectral regions corresponding to cellulosic components during cotton fiber development

ATR FT-IR spectra (600–3800 cm−1) were obtained from developing fibers at 10, 17, 24, 28, 33, and 37 DPA and developed fibers at 44 DPA (Fig. 2a). The characteristic bands of these developing fibers were assigned in Supplementary Table S1 according to the previously published reports (Abidi et al. 2010a, 2014; Kim et al. 2013b; Liu and Kim 2015). As DPA increased, the intensities of the bands at 1740, 1620, 1545, 1455, 1405, and 1236 cm−1 as well as those below 850 cm−1 decrease, whereas those at 1425, 1365, 1335, 1315, 1200, 1158, 1104, 1055, and 1028 cm−1 as well as those in the 895 cm−1 region increase. These intensity distinctions suggest chemical, compositional, and structural changes during cotton fiber cellulose development, and subsequently these unique bands have proved to be effective in monitoring the increasing dominance of SCW.

Fig. 2
figure 2

FT-IR analyses of cotton fibers. a Comparative ATR FT-IR spectra of developing fibers from 10 to 37 DPA and fully developed fibers at 44 DPA from an upland cotton variety TM-1. During fiber development, the wavenumbers with increasing and decreasing intensities were marked with red and black arrows, respectively. b PCA of entire data set and loading plot of principal component 1, PC-1 (93% explained variance) representing the predominant spectral elements. The letters C, H, Pc, Pr, and W indicate cellulose, hydrocarbon, pectin, protein, and water components, respectively. The dotted line represents zero PC1 loading. In the regions shaded with gray, PCl scores were mostly contributed by non-cellulosic components. Five regions (I–V) contributed by cellulosic components were identified and highlighted with blue

The PCAs were performed with entire spectra including all vibration modes from non-cellulosic components as well as cellulose. The principal component (PC1) loadings feature negative and positive peaks, with large positive (or negative) loadings contributing largely to positive (or negative) PC1 scores. In the PC1 loading plot, the peaks pointing downwards are the ones dominant in PCW (water, hemicellulose, cuticles, proteins, and pectin), whereas the peaks pointing upwards are the component dominant in SCW that are mostly composed of cellulose in cotton fibers. Several negative peaks were found from water peaks at 1640 and 3100–3600 cm−1, protein peaks at 1520–1650 cm−1, pectin peaks at 1740 cm−1, and long-chain alkyl hydrocarbon peaks at 2850 and 2926 cm−1 (Fig. 2b and Table S1). Based on the previous FT-IR work by Abidi et al. (2014) and Lee et al. (2015), the PC scores in the gray shaded regions in Fig. 2b have strong contributions from non-cellulosic components. Thus, the five blue regions [I (700–760 cm−1), II (950–1020 cm−1), III (1100–1200 cm−1), IV (1260–1340 cm−1), and V (3200–3400 cm−1)] with positive PC scores in Fig. 2b can be attributed to cellulose.

Estimation of SCW development with entire FT-IR spectra and PCA approach

All ATR FT-IR spectra (600–3800 cm−1) obtained from developing TM-1 fibers were analyzed by the conventional PCA method. The PC1 score dramatically increased from 10 to 17 DPA due to the transition from PCW to SCW stage, reached the maximal value at 24 DPA, and plateaued from 24 to 44 DPA (Fig. 3a and Table 2).

Fig. 3
figure 3

Comparisons between cellulose contents (%) and PC1 scores determined from FT-IR full spectra (600–3800 cm−1) of developing cotton fibers. a PC 1 scores were calculated from FT-IR spectra of developing fibers at various DPA. b Correlation between cellulose content and PC1 scores from developing fibers. The correlation coefficient values (r) were obtained by Pearson’s method, and R 2 values were determined by GraphPad Prisim software. Statistical significance was shown at the probability (p) levels value under 0.05*, 0.01**, and 0.001***

Large negative PC1 score at 10 DPA fibers indicates the existence of non-cellulosic components (water, proteins, pectins, and hydrocarbon in the cuticle) in PCW. Positive PC1 scores result from the existence of the SCW cellulose in developing and developed fibers. The pattern of increasing PC1 scores from 10 to 24 DPA was compared with that of cellulose content (Fig. 3b). There was a positive correlation (r = 0.896; R 2 = 0.803) between the cellulose content and PC1 scores from 10 to 44 DPA. However, the PC1 scores from 24 to 44 DPA showed no significant correlations (r = − 0.022, R 2 = 0.0005, p = 0.972) with the cellulose contents during the SCW stage. The results confirmed that the PC1 scores determined by the PCA method from the entire FT-IR spectra (600–3800 cm−1) can distinguish the transition from PCW to SCW biosynthesis during cotton fiber development. However, the PCA scores were not sensitive enough to show the progress of fiber development during SCW biosynthesis stage (24–44 DPA), unlike the Updegraff cellulose assay showing the progress of SCW biosynthesis (Figs. 1, 3).

Estimation of SCW development with FT-IR spectral regions corresponding to cellulosic components and PCA approach

The PCA with entire spectra from 10 to 44 DPA fibers proved the difference between PCW and SCW but failed to quantitatively measure the degree of SCW cellulose from 24 to 44 DPA fibers (Fig. 3b). Thus, we re-ran PCA for the four IR regions [I (700–760 cm−1), III (1100–1200 cm−1), IV (1260–1340 cm−1), and V (3200–3400 cm−1)] that specifically corresponded to cellulose (Fig. 2b). The patterns (Fig. 4a) of PC1 scores performed with the cellulose IR regions showed striking similarities to those (Fig. 3a) performed with entire IR spectra.

Fig. 4
figure 4

Comparisons between cellulose contents (%) and PC1 scores determined from the cellulose FT-IR spectra of developing cotton fibers. a PC 1 scores were calculated from four cellulose FT-IR spectral regions [I (700–760 cm−1), III (1100–1200 cm−1), IV (1260–1340 cm−1), and V (3200–3400 cm−1)] of developing fibers. b Correlation between cellulose content and PC1 scores corresponding to cellulose from developing fibers. The correlation coefficient values (r) were obtained by Pearson’s method, and R 2 values were determined by GraphPad Prisim software. Statistical significance was shown at the probability (p) levels value under 0.05*, 0.01**, and 0.001***

The PC1 score increased significantly from 10 to 17 DPA due to the transition from PCW development to SCW development, reached the maximum value at 24 DPA, and plateaued from 24 to 44 DPA (Fig. 4a). The correlation (r = 0.899, R 2 = 0.807) between PC1 scores and cellulose content obtained from the cellulose IR regions (Fig. 4b) was also almost identical to that obtained from the entire IR spectra (Fig. 3b). The PC1 scores from 24 to 44 DPA showed no significant correlations (r = 0.160, R 2 = 0.026, p = 0.797) with cellulose content during the SCW stage.

Since it was originally expected the method with the FT-IR spectral regions specific for cellulose to be much better for monitoring SCW cellulose development than that with the entire FT-IR spectra of cotton fibers, it was surprising that the PC1 scores determined with the cellulose-characteristic regions provided only marginally better correlations (p and R 2 values) than those based on the entire IR spectra. Neither the combined cellulose regions nor individual cellulose IR regions [I (700–760 cm−1), III (1100–1200 cm−1), or IV (1260–1340 cm−1)] were able to quantify the SCW biosynthesis (Supplementary Fig. S1). Based on the PCA analyses in Figs. 3, 4 and S1, we concluded that the PCA method is useful to distinguish between PCW and SCW but fails to show the progress of fiber development during SCW biosynthesis stage.

Estimation of SCW development with FT-IR bands and a three band ratio algorithm

We analyzed the FT-IR bands at 708, 730, and 800 cm−1 using a simple three band ratio algorithm to estimate crystallinity index (CI IR) values from cotton fiber FT-IR spectra (Liu et al. 2012a). This method compares the relative intensities of a cellulose-characteristic band and an internal reference band:

$$ R_{2} = \left( {I_{708} - I_{800} } \right)/\left( {I_{730} - I_{800} } \right) $$

where I is the absorbance at the specific wavenumbers (Liu et al. 2011). The 708 cm−1 band is characteristic to crystalline cellulose Iβ and the 730 cm−1 band is attributed to cellulose Iα or amorphous cellulose (Imai and Sugiyama 1998; Liu et al. 2012a; Sassi et al. 2000; Yamamoto et al. 1996). The intensity at 800 cm−1 is used as a common background for normalization. For a data set of 402 fibers, Liu et al. (2012a) assigned the most immature and mature fibers the CI IR values of 0.0 and 100.0%, respectively, then the CI IR values are determined by converting the relative R 2 value for individual fiber sample.

The increasing trend of the CI IR values during entire fiber development from 10 to 44 DPA (Fig. 5a) was similar to that of the cellulose contents (Table 2). There was a significant correlation (r = 0.969; R 2 = 0.938) between the CI IR values and cellulose contents during entire fiber development (Fig. 5b). During the SCW stage (24–44 DPA), the CI IR values consistently showed significant correlation with the cellulose contents, unlike the PC1 scores that had no correlation in Figs. 3b and 4b. The CI IR standard deviation values in the Table 2 were similar to the variations (1.10–19.49%) that were previously determined from developmental cotton fibers at medium and tip positions of one locule (Liu et al. 2012b).

Fig. 5
figure 5

Correlations of cellulose content with crystallinity index (CI IR). a CI IR values were calculated from FT-IR spectra of developing fibers by the TBR algorithm. b Correlation between cellulose content and CI IR values from developing fibers (10–44 DPA). The correlation coefficient values (r) were obtained by Pearson’s method, and R 2 values were determined by GraphPad Prisim software. Statistical significance was shown at the probability (p) levels value under 0.05*, 0.01**, and 0.001***

Thus, we concluded that CI IR values determined from the three FT-IR bands are good indicators of quantitatively measuring the progress of cotton fiber development and good estimators of determining the cellulose quantities during the SCW biosynthesis stage of developing fibers, in addition to those in the PCW and transition stages (10–24 DPA).

Comparison of crystalline cellulose content of cotton fibers measured between XRD and FT-IR methods

The crystallinity of cotton fibers has been widely studied with XRD analysis, and XRD crystallinity index (CIXRD) are often rapidly determined as the relative peak height between crystalline and amorphous materials according to the method that Segal proposed (Segal et al. 1959). However, recent studies showed that there were limits of the Segal crystallinity index (French and Santiago Cintrón 2013; Nam et al. 2016). Driemeier and Calligaris (2011) found that the peak area is more representative of crystalline fraction than the peak height. Thus, we first determined the crystallinity index (CIXRD) values representing the relative amount of crystalline portion in cellulose according to the peak deconvolution method (Kim et al. 2017; Park et al. 2010). Individual crystalline peaks were extracted with curve-fitting process, and CIXRD values of cotton fibers at 17, 24, 37, and 44 DPA were successfully calculated from relative intensities of the fitted diffraction peak and the background component (Table 3 and Fig. 6a). We also analyzed XRD data with another analysis, Rietveld method that is incorporated in the MAUD software (Lutterotti 2010). In the analysis, the cellulose Iβ crystal structure (Nishiyama et al. 2002) was used as a model for the crystalline cellulose I phase, and the cellulose II crystal structure (Langan et al. 2001) was used for the amorphous phase. The 24, 37, and 44 DPA cotton fibers containing high level of SCW cellulose showed excellent fit with the Rietveld model as we previously showed in Kim et al. (2017), whereas the 17 DPA cotton fibers containing high level of PCW component did not fit with the model (Table 3). Therefore, the CIXRD values were only calculated from cotton fibers at 24, 37, and 44 DPA based on the Rietveld method (Table 3). Between the two different XRD analyzing methods, the CIXRD values determined by the Rietveld method were higher than those determined by the deconvolution method.

Table 3 Comparison of crystalline cellulose content of cotton fibers measured between XRD and ATR FT-IR methods
Fig. 6
figure 6

Correlation of crystallinity index values of developing cotton fibers measured between FT-IR and XRD method. a X-ray diffractograms of G. hirsutum TM-1 fiber powders at 17, 24, 37, and 44 DPA. Linear backgrounds were subtracted from raw data. b The crystalline portions of total cellulose were calculated based on the XRD CIXRD values determined by both deconvolution and Rietveld methods, and FT-IR CI IR values determined by a simple algorithm from developing fibers

The crystalline portions of total cellulose were calculated by multiplying either CIXRD or CI IR values with the total cellulose contents determined by the Updegraff method in Table 3. The crystalline portions determined from the XRD by both peak deconvolution and Rietveld methods showed similar increasing patterns with those determined from the FT-IR by the three band algorithm (Table 3 and Fig. 6b). There were significant correlations (R 2 = 0.968; p = 0.016*) of the crystalline portions determined by the peak deconvolution method when they were calculated from the total cellulose contents of developing fibers. The results showed that the CI IR values determined from FT-IR spectra could be used to calculate the crystalline portions that successfully trace the progress of SCW development during cotton fiber development, although they were not quantitatively matched to the CIXRD values determined from XRD profiles.

Here, it should be noted that calculation of the CIXRD values from XRD profiles is subject to the base line corrections and the XRD peaks can be affected in a complex manner with the crystalline component concentration and crystallite size (Lee et al. 2015a). For example, the (200) peak position of the 17 DPA XRD peak position is slightly lower in 2θ than those of the higher DPA samples (Fig. 6a), indicting the spacing between (200) crystallographic planes is slightly larger. This could be due to low degree of cellulose chain packing in the PCW development stage. For that reason, the Rietveld refining did not work well for the 17 DPA XRD data; thus, the CIXRD of the 17 DPA sample could not be determined from the Rietveld method. However, the peak deconvolution method is not affected by any differences in the peak position because the peak deconvolution method simply ignores subtle deviations in the crystalline order. It is important to note that this does not mean the deconvolution method is better than the Rietveld method. Thus, the CIXRD values must be considered as a qualitative or semi-quantitative evaluation of the crystalline cellulose amounts of fiber samples (Lee et al. 2015a; Liu et al. 2012a).

It is also important to remember the different definition of crystallinity among Updegraff, IR, and XRD methods. The IR method is based on the absorption peaks sensitive to certain structural constraints imposed on the normal vibrational modes due to the crystalline lattice. Thus, it is the “crystallinity based on the vibrational mode difference”. The Updegraff method is based on the difference in the hydrolysis rate between the crystalline and amorphous phases. Hence, it is the “crystallinity based on hydrolysis kinetics”. The XRD method is based on scattering coherence of x-ray from planes with the same electron density. Therefore, it is the “crystallinity based on scattering coherence”. The effects of amorphous phase in these three physical processes would not be the same. If the kinetics control is done properly in the Updegraff method and the peak selections as well as baseline corrections are done properly in the IR method, then these two methods can give “similar” values since both kinetics and IR are the linear sums. However, the XRD method involves many scattering processes including Bragg, thermal, Compton, etc., and the effects of the crystalline versus amorphous phases on the total signal in XRD vary among different scattering processes. Because of these differences in three methods, it may not be physically meaningful to come up with one common definition of “crystallinity”. However, it would be still meaningful to compare the “crystallinity values” calculated from different methods and find which ones correlate better to one another since this allows researchers to use a simpler method to find the value equivalent to the more laborious method. For that reason, the crystallinity values estimated by the FT-IR spectroscopy from various plant tissues and textiles have been frequently compared with those determined by the XRD and other chemical methods (Abidi and Manike 2017; Åkerholm et al. 2004; Liu et al. 2012a; Nelson and O’Connor 1964; O’Connor et al. 1958; Oh et al. 2005; Schultz et al. 1985). Although we knew these limitations in CIXRD, we compared the CI IR values from the three band ratio algorithm with the CIXRD values determined from the Rietveld and peak deconvolution methods because the CIXRD values have been used so frequently and widely in the previous literature.

Comparison of SCW development in cultured fibers analyzed by FT-IR methods as well as chemical and genomic assays

We tested if and which FT-IR methods are sensitive enough to detect subtle differences of SCW cellulose biosynthesis that is regulated by phytohormones in culture conditions. For differentiating cotton fibers from unfertilized cotton ovules at DOA in vitro, a phytohormon, auxin, is required (Beasley and Ting 1974; Meinert and Delmer 1977). The SCW cellulose biosynthesis in cotton fibers cultured with a synthetic auxin, NAA, is delayed as compared with that in cotton fibers cultured with a natural auxin, IAA (Singh et al. 2009). Thus, we cultured DOA ovules with 5 µM IAA or 5 µM NAA for 13, 17, 24, and 27 DPA (Fig. 7a, b), and compared transcript abundance of GhCesA2 responsible for SCW cellulose biosynthesis in cotton fibers. Comparisons of GhCesA2 transcript levels by RT-qPCR showed a recognizable interval of GhCesA2 expression responsible for SCW biosynthesis of cotton fibers when cotton ovules were cultured with IAA or NAA (Fig. 7c). IAA up-regulated GhCesA2 transcript levels between 13 and 17 DPA, whereas NAA up-regulated GhCesA2 transcript levels between 17 and 24 DPA. We previously showed that the onset of the GhCesA2 up-regulation indicates a transition from PCW to SCW stage during cotton fiber development (Kim et al. 2012).

Fig. 7
figure 7

Differential expressions of GhCesA2 transcript responsible for SCW cellulose biosynthesis by two different phytohormones. a Unfertilized ovules at day of anthesis (DOA) in petri dish. b Cultured cotton ovules with developing fibers at 24 DPA. Unfertilized ovules at DOA were cultured on basal medium with a natural auxin and GA3 for 24 days. c Comparisons of GhCesA2 transcript abundance from developing fibers cultured with different phytohormones. Unfertilized cotton ovules at DOA were cultured with 0.5 μM GA3 containing either a natural auxin (5 μM IAA) or a synthetic auxin (5 μM NAA) for 13, 17, 20, 24, and 27 DPA. GhCesA2 transcript levels in developing fibers were determined RT-qPCR and normalized with 18S rRNA (U42827) and α-tubulin 4 (AF106570) as references genes

We further analyzed the developing fibers cultured with IAA or NAA by comparing cellulose contents (Fig. 8a), CI IR values from the three FT-IR bands (Fig. 8b), and two different PCA methods with entire IR spectra (Fig. 8c) or cellulose IR spectral regions (Fig. 8d). Consistent with the GhCesA2 transcript levels in Fig. 7c, the CI IR and PC1 values of developing fibers cultured with IAA were higher than those cultured with NAA at three developmental time points. However, the patterns of increasing cellulose levels and CI IR values from developing fibers cultured with IAA and NAA were substantially different from those of the PC1 scores from the same developing fibers.

Fig. 8
figure 8

Comparisons of methods of detecting cellulose biosynthesis that is differentially regulated by various phytohomones during cotton fiber development. Unfertilized cotton ovules at DOA were cultured in vitro with 0.5 μM GA3 containing either a natural auxin (5 μM IAA) or a synthetic auxin (5 μM NAA) for 13, 17, 20, 24, and 27 DPA. Comparisons of cellulose contents (a), crystallinity index, CI IR % (b), PC1 scores of full IR spectra (c), and PC1 scores of cellulose IR spectra (d) in developing G. hirsutum TM-1 fibers at various DPA cultured with IAA or NAA

The differences between cellulose levels and CI IR values cultured between IAA and NAA significantly increased from 17 to 27 DPA and the greatest difference was found at 27 DPA (Fig. 8a, b). In contrast, the PC1 scores calculated from entire IR spectra (Fig. 8c) and the cellulose IR spectra (Fig. 8d) similarly increased from 17 to 24 DPA, but not from 24 to 27 DPA. Unlike the cellulose and CI IR results (Fig. 8a, b), a very little difference was detected at the 27 DPA fibers cultured between IAA and NAA (Fig. 8c, d). With the FT-IR spectra obtained from the developing fibers at the active SCW stage (27 DPA) cultured with IAA or NAA, the two PCA methods failed to distinguish the different progress of SCW cellulose biosynthesis, although it successfully detected the differences during PCW and transition stage (Fig. 8c, d). These results were very similar to those in Figs. 3b and 4b, confirming that the PCA method was not sensitive enough to show the progress of SCW development from both field grown and cultured fibers. Therefore, we concluded the CI IR values determined from the FT-IR enabled the progressive quantitative measurement in cellulose content, whereas the PCA methods with the entire spectra or cellulose spectral regions were only able to differentiate SCW from PCW.

Comparison of SCW development in different cotton species between FT-IR and chemical assays

We also tested whether any FT-IR methods enabled the monitoring of differences in fiber SCW development between two cotton species. G. arboreum is the closest diploid cotton species (A2 genome) of the maternal parent for the allotetraploid cotton species, G. hirsutum (AD1 genome) that is the agriculturally important cotton species (Applequist et al. 2001; Wendel and Cronn 2003). The diploid A2 cotton genome was sequenced from G. arboretum SXY1 (Li et al. 2014), and the allotetraploid AD1 genome was sequenced from G. hirsutum TM-1 (Li et al. 2015; Zhang et al. 2015). Different fiber lengths between the two species were visually detected (Fig. 9a). G. arboreum SXY1 undergoes an earlier transition to SCW stage as compared to G. hirsutum TM-1 (Li et al. 2015).

Fig. 9
figure 9

Comparisons of methods of monitoring cellulose biosynthesis between two different cotton species. a Cottonseed phenotype between G. arboreum SXY1 and G. hirsutum TM-1. be Comparisons of fiber properties measured by different methods. cellulose contents (b), crystallinity index CI IR % (c), PC1 scores of full IR spectra (d), and PC1 scores of cellulose IR spectra (e) were determined from developing fibers at 12, 23, and 30 DPA between G. arboreum SXY1 and G. hirsutum TM-1

Cellulose levels (Fig. 9b) of both cotton species were low at the PCW stage (12 DPA). At the early SCW stage (23 DPA), cellulose content of G. arboreum SXY1 fibers (62.8%) was significantly higher than that of G. hirsutum TM-1 fibers (44.3%) as predicted by Li et al. (2015). During the SCW stage from 23 to 30 DPA, the cellulose contents of both G. arboreum (89.9%) and G. hirsutum (87.2%) significantly increased and became similar to each other. We also measured CI IR values from the three IR bands (Fig. 9c), PC1 scores from entire IR spectra (Fig. 9d), and PC1 scores from cellulose IR regions (Fig. 9e) of the two cotton species. Both species showed low PC1 scores and CI IR % commonly at PCW stage (12 DPA). At the early SCW stage (23 DPA), the CI IR values (Fig. 9c) showed significant differences between G. arboreum and G. hirsutum that were consistently monitored by the cellulose assay (Fig. 9b), whereas both PC1 scores commonly failed to show differences (Fig. 9d, e).

During the active SCW stage from 23 to 30 DPA, the CI IR values continued to increase as the cellulose contents increased, whereas the both PC1 scores had not changed. As results, we concluded that the algorithm with three bands was able to trace the SCW developments from two different cotton species as the reference chemical method did, whereas the PCA method with entire spectral and cellulose regions proved insufficient when attempting to determine the degree of SCW biosynthesis.

Conclusion

We compared three different FT-IR methods to identify which method can estimate SCW cellulose level from cotton fibers in a quantitative way. The two methods analyzing the entire spectra and cellulose regions by PCA showed a transition from the PCW to the SCW biosynthesis stage, but they cannot quantitatively determine SCW levels. In contrast, the CI IR values determined from three FT-IR bands quantitatively monitor cotton fiber development including both PCW and SCW stage. Thus, the FT-IR method that can measure SCW cellulose in a quantitative, quick, and non-invasive way from a very small amount of cotton fibers may improve the way of phenotyping cotton fibers, interpreting genomic data, and identifying the molecular mechanisms of SCW cellulose depositions that affect cotton fiber properties and quality.