Introduction

Just over a decade ago, Nishiyama et al. (2002) combined X-ray and neutron diffraction data to develop a high-resolution crystallographic structure of cellulose Iβ, describing it in terms of both heavy and hydrogen atom positions. This structure, based on samples of tunicin, is characterized by an ordered array of perfectly parallel glucosyl chains associated into layers by a well-defined hydrogen bond network (Fig. 1a). In recent years, molecular dynamics (MD) simulation studies of finite or “mini-crystal” microfibril models based on these coordinates have demonstrated a preference to distort from the linearly oriented crystal structure and adopt a twisted conformation (Fig. 1b) (Bu et al. 2009; Glass et al. 2012; Hadden et al. 2013; Matthews et al. 2012; Matthews et al. 2011; Matthews et al. 2006; Paavilainen et al. 2011; Yui et al. 2006). While some experimental data supports the existence of twisted microfibrils (Hanley et al. 1997; Hirai et al. 1998), the apparent contradiction with high-resolution crystallographic data has been the source of considerable controversy in the cellulose structure community.

Fig. 1
figure 1

View down the microfibril axis for a the crystallographic structure of cellulose Iβ, b a finite (twisted) structure, and c an infinite (linearly constrained) structure with chain termini bonded across the simulation box periodic boundary

More recently, Nishiyama et al. (2012) published work comparing theoretical diffraction patterns for the crystallographic coordinates with those calculated from twisted structures produced by MD simulation. They observed significant changes in peak positions and intensities and attributed these distortions both to microfibril twisting and to changes in unit cell dimensions induced by the empirical force field. To separate the individual contributions of these two effects and provide a better understanding of how twisting in crystalline cellulose manifests in diffraction data, the present study demonstrates a method for generating twisted and linear cellulose structures that can be compared without the bias of dimensional changes. While the twisted structure represents a typical finitely modeled microfibril (Fig. 1b), the linear structure is constrained from twisting by bonding chain termini across the simulation box periodic boundary, making it essentially infinite (Fig. 1c). As the infinite model is nevertheless permitted to adapt to dimensional changes induced by the force field, it represents a linearly oriented form of cellulose Iβ, similar to the crystallographic structure, with unit cell dimensions that are comparable to those of the twisted form. Comparison of powder diffraction patterns calculated for these two structures thus allows an unbiased assessment of the impact of twisting alone.

Computational methods

Microfibril starting structures for MD simulations were prepared with Mercury 2.0 (Macrae et al. 2008), based on the cellulose Iβ crystallographic coordinates reported by Nishiyama et al. (2002). Models consisted of 81 total chains (9 per face), and were constructed such that the \(1\bar{1}0\) and \(110\) planes formed the exposed surfaces (Fig. 1a). Finite microfibrils were generated with 20 glucosyl units per chain [degree of polymerization (DP) 20], while infinite microfibrils were generated with DP 18, with terminal residues bonded across the simulation box periodic boundary. Models were surrounded with a 12 Å solvent buffer.

Microfibrils were parameterized with the GLYCAM06 (Kirschner et al. 2008) (version h) force field for carbohydrates, and solvent was modeled as TIP3P water (Jorgensen et al. 1983). Simulation files were generated using the tleap module of AmberTools12 (Case et al. 2012), and subsequently converted to GROMACS format with the glycam2gmx.pl script from Wehle et al. (2012). The double precision parallel implementation of the mdrun module from GROMACS 4.5.5 (Berendsen et al. 1995; Hess et al. 2008; van der Spoel et al. 2005) was employed for all simulation work.

An energy minimization protocol consisting of 12,500 cycles of steepest descent, followed by 12,500 cycles of conjugate gradient, was applied to the solvent and subsequently to the entire system prior to the start of dynamics. MD simulations were performed under isothermal, isobaric (NPT) conditions utilizing Berendsen thermo- and barostats (time constants of 1 ps) to maintain a reference temperature of 300 K and pressure of 1 bar. A nonbonded cutoff of 8 Å was applied, beyond which van der Waals interactions were truncated and long-range electrostatics were handled with particle mesh Ewald (Darden et al. 1993). Covalent bonds to hydrogen were constrained using the LINCS algorithm (Hess et al. 1997) to allow a simulation time step of 2 fs. Systems were equilibrated for 1 ns, followed by 10 ns of production dynamics. The periodic-molecules option was set for the infinite microfibril model to allow coupling of chain termini across the periodic boundary.

GROMACS trajectories were converted to AMBER format for analysis using VMD (Humphrey et al. 1996). Water molecules were removed, and microfibril models were cropped to contain only the center 16 glucose repeats (DP 16). This ensured comparison between models of equivalent dimension, as well eliminated noise resulting from terminal fraying during simulation of the finite model. Unit cell dimensions, hydrogen bond percentages, glycosidic torsion angles, and hydroxymethyl group orientations were determined using the ptraj module of AmberTools12 (Case et al. 2012). Average microfibril volume was calculated with Mol_Volume (Balaeff 2013), based on 100 evenly spaced frames extracted from simulation trajectories. Theoretical powder diffraction patterns were calculated with Debyer (Wojdyr 2012) for this ensemble of 100 trajectory frames, employing a value of λ = 1.5418 Å to denote CuKα radiation. Reference patterns for the crystallographic coordinates were calculated with Mercury 2.0 (Macrae et al. 2008), which assumes an infinite crystallite size and eliminates artifacts associated with low angle scattering seen in patterns calculated with Debyer.

Twisting was quantified using the metric for calculating \(\mathop \theta \nolimits_{Twist}\) defined by Hadden et al. (2013). This method designates two vectors (\(v,u\)) across the microfibril \(1\bar{1}0\) face, perpendicular to the axis. While the vectors are parallel in the linear starting structure, their dot product describes the angle by which they diverge as the microfibril twists (Eq. 1).

$$\mathop \theta \nolimits_{Twist} = \frac{180}{\pi }\mathop {cos}\nolimits^{ - 1} \frac{v \cdot u}{|v||u|}$$
(1)

Values of \(\mathop \theta \nolimits_{Twist}\) were normalized by the number of cellobiose repeats between the vectors and averaged over time to give \(\left\langle {\mathop \theta \nolimits_{Twist} } \right\rangle\).

Results and discussion

MD simulations were performed on the 10 ns timescale for finite (twisted, Fig. 1b) and infinite (linearly constrained, Fig. 1c) cellulose microfibrils. The twisted structure displayed a \(\left\langle {\mathop \theta \nolimits_{Twist} } \right\rangle\) value of 1.17° per cellobiose, with root-mean-squared fluctuation of 0.14° and standard deviation of the mean of 0.02°. Although the unit cell showed some deviation from the crystallographic coordinates in all three dimensions (−1.3 % in a, +0.5 % in b, and +4.0 % in c), as has previously been observed during simulations of cellulose microfibrils (Matthews et al. 2006), there were no significant differences in dimensions between the twisted and linear structures. Values for microfibril volume showed an increase of approximately 5 % from the crystallographic coordinates, yet corresponded exactly for the two MD models. Additionally, the twisted and linear structures contained equivalent, as well as experimentally consistent, internal hydrogen bond patterns, glycosidic torsion angles, and hydroxymethyl group orientations. These data are summarized in Tables 1 and 2, and together indicate that with the exception of twisting, the twisted and linear structures produced with this method are essentially identical, allowing for straightforward, unbiased comparison of theoretical diffraction data.

Table 1 Values used for quantitative comparison of twisted and linear microfibril models
Table 2 Values used for quantitative comparison of twisted and linear microfibril models

Powder diffraction patterns calculated based on the final frames of simulation trajectories for both the twisted and linear microfibrils are presented in Fig. 2, along with three reference patterns corresponding to the original crystallographic coordinates. The lower two reference patterns assume infinite crystallite size, and demonstrate the absence of artifacts associated with low angle scattering present in the remaining patterns, which were calculated based on non-periodic 81-chain, DP 16 mini-crystals. This is especially obvious in the 10°–13° region of 2-θ. The upper reference pattern was calculated in an identical manner to that of the twisted and linear models and provides a direct comparison to assess distortions arising from MD simulation.

Fig. 2
figure 2

Theoretical powder diffraction patterns for twisted and linear microfibril models and three reference patterns corresponding to the original crystallographic coordinates for cellulose Iβ. The upper three patterns were calculated with Debyer (Wojdyr 2012) as non-periodic 81-chain, DP 16 mini-crystals, while the lower two patterns were calculated with Mercury 2.0 (Macrae et al. 2008) with the assumption of infinite crystallite size (PWHM of 0.1° and 1.5°) to demonstrate the absence of artifacts associated with low angle scattering present in the Debyer patterns. Peak heights were each normalized to 100

While the patterns for the twisted and linear structures indeed display distortions relative to that of the crystallographic structure, as previously discussed by Nishiyama et al. (2012), they are effectively indistinguishable from each other. The PWHM for the \(200\) peak (averaged over patterns for 100 simulation frames) was equivalent for the two MD models, but was broader (by less than 10 %) than that of the crystal coordinates, with the value for 2-θ shifted by less than 2 % (Table 1). Because the twisted and linear structures give rise to equivalent theoretical diffraction patterns, the distortions relative to the pattern for the crystallographic structure must arise from dimensional changes induced by the force field during simulation, as suggested by Nishiyama et al. (2012), and not from microfibril twisting. This is expected to hold for all microfibrils, provided that the model is large enough that twisting does not excessively disorder the unit cell of the crystalline assembly.

As discussed by Nishiyama et al. (2012), low-angle scattering can influence both peak position and intensity, in particular for the \(1\bar{1}0\) reflection at 14.5° 2-θ. While some low-angle scattering can manifest in experimental patterns as a result of discontinuity between crystallites or inhomogeneity in the sample, the extent of it observed in the present theoretical patterns represents the worst-case scenario for non-periodic models of this size. Along with issues of preferred orientation of crystals in experimental patterns, low-angle scattering may present a greater concern for making comparisons between theory and experiment than modest twisting.

While this study has demonstrated that powder patterns are unaffected by a limited degree of microfibril twisting, it may be that other experimental techniques are capable of detecting this subtle structural difference. For example, Nishiyama et al. (2012) presented theoretical fiber diffraction patterns that displayed wedge-shaped layer lines for twisted models that were not subjected to convolution to account for crystallite tilt distribution. These wedges were shown to have larger angles for models with fewer numbers of chains, which display a greater degree of twisting. It should be noted, however, that the models used in that study were relatively small compared to the tunicate samples (10–20 nm) used to solve the cellulose Iβ crystal structure. Hadden et al. (2013) estimated that twisted microfibrils observed experimentally by Hanley et al. (1997), which ranged in size from 20 to 50 nm, likely would have had a uniform twist of only 0.26° per cellobiose when suspended in aqueous solution. This observation suggests that the degree of twisting present in large samples, such as those used for X-ray diffraction, is sufficiently subtle that it may be difficult to detect experimentally. Even with the use of comparatively small microfibril models, both the present work and that of Nishiyama et al. (2012) imply that a modest degree of twisting is not necessarily inconsistent with crystallographic data.

Conclusions

The method presented here for generating internally consistent twisted and linear structures provides a previously unexploited strategy for probing the effect of microfibril twisting on experimental cellulose diffraction data. While this study has demonstrated that theoretical powder patterns for models in the 81-chain size regime are unaffected by twisting, suggesting that a modest degree of twist is not inconsistent with available crystallographic data, it may be that other diffraction techniques are capable of detecting this subtle structural difference. Further analysis is thus required to assess the effect of twisting on other back-calculated diffraction properties, as well as to confirm these observations experimentally. Until such time as definitive experimental evidence comes to light, the results of this study suggest that both twisted and linear microfibrils may represent an appropriate model for cellulose Iβ.