Introduction

Cinnamoyl CoA reductase (CCR), one of the key enzyme in lignin synthesis, catalyzes the NADPH dependent reduction of p-hydroxycinnamoyl CoA, the phenyl propanoid pathway metabolites, to p-hydroxycinnamaldehydes. These are further catalyzed by enzyme cinnamoyl alcohol dehydrogenase (CAD) to monolignols [16]. The later are involved in oxidative polymerization process, catalyzed by NADPH oxidase, resulting into the synthesis of cross linked plant polymer lignin. It plays an important role in plant defense responses which act as a mechanical barrier to various plant pathogens [710]. Reduction in lignin content is always desirable in pulp and paper industry as lignin is one of the obstacles for good quality product synthesis and makes the process costlier [5]. Being the first committed enzyme of monolignol biosynthesis and directing the metabolite flux of phenyl propanoid pathway towards lignin synthesis, CCR shows an association with lignin biosynthesis. It has been demonstrated that the down regulation of CCR in transgenic tobacco, arabidopsis and tomato, leads to the significant reduction of lignin content and distinct pH requirement for its catalysis as compared to the neighboring reactions catalyzed by other enzymes. These reasons indicate CCR as a strong control point for regulating lignin content [13, 7, 8]. On the basis of mutant analysis in Arabidopsis thaliana, it was examined that the alternative splicing in CCR region is responsible for affecting the physical properties of cell wall such as stiffness and strength [2, 11]. CCR is also believed to be involved in regulating stem rigidity in wheat, reported according to the parallel correlation existing between Triticum aestivum CCR1 (TaCCR1) mRNA level and high CCR activity present in wheat stem [8]. CCR acts at optimum pH of 6.25 for its catalytic activities, as reported in Arabidopsis thaliana, Eucalyptus gunii (crude protein extract and purified one), poplar stems, spruce and soya bean extract [5, 1215]. As already reported in case of wheat, TaCCR1 and TaCCR2 are known to exhibit differences in their primary amino acid sequences, but they have identical motifs for NADPH binding and active site [7]. CCR exhibits the substrate specificity with feruloyl CoA, caffeoyl CoA, 5-hydroxyferuloyl CoA and sinapoyl CoA, converting them into their respective products coniferaldehyde, caffealdehyde, 5-hydroxyconiferaldehyde and sinapaldehyde [1, 7, 16]. On the basis of kinetic studies where Kcat and Kcat/Km values for feruloyl CoA are highest in mixed substrate reaction having all the substrates in equal concentration, where coniferaldehyde is obtained as the major product along with a trivial amount of 5-hydroxyconiferaldehyde [1]. As calculated on the basis of Ki values in cocatalyzed reactions, feruloyl CoA is known to exert a strong competitive inhibition with other substrates of CCR, while the later exert a very weak inhibition with feruloyl CoA, revealing feruloyl as the most competitive inhibitor in CCR reaction [1, 7].

The present study addresses the homology model and docking analysis of CCR in the Leucaena leucocephala. The information from docking conformations will provide new insight to study the mechanism of CCR catalyzed enzymatic reactions. The approach is applicable in engineering 3D structures of enzymatic models, and studying interactions of active site residues with substrates [17].

Materials and methods

Sequence alignment and homology modeling of CCR

CCR protein sequence of Leucaena leucocephala (NCBI protein accession number: CAK22319) was selected. BLAST algorithm against Protein Data Bank (PDB) was used to carry out the sequence homology searches. The sequence and 3D structure of template proteins were extracted from the PDB database [18]. Multiple sequence alignments of the target and template sequences were carried out using ClustalW 2.0.10 program with default parameters [19]. High-resolution crystal structure of homologous protein as a template were considered for homology modeling. The coordinates of templates were used to build the initial models of CCR. The homology models of CCR were constructed using MODELLER 9v7 [20, 21]. The MODELLER generated models optimally satisfies spatial restrains, which includes homology-derived restraints on the distances and dihedral angles, stereo chemical restraints. Coordinates from the template proteins to the structurally conserved regions (SCRs), structurally variable regions (SVRs), N-termini and C-termini were assigned to the target sequence based on the satisfaction of spatial restraints. All side chains of the model protein were set by rotamers.

Energy minimization and simulations

The homology model obtained from the MODELLER was improved further by molecular dynamics and equilibration methods using Nano Molecular Dynamics (NAMD 2.5) [22] and Chemistry of Harvard Molecular Modelling (CHARMM27) force field for lipids and proteins [23, 24]. A 50,000-step minimization was employed in the simulation to remove all bad contacts of side chains. Minimum switching distance of 8.0 Å and a cut off of 12.0 Å for Vander Walls interactions was used. The pair list of the non-bonded interactions was recalculated every 20 steps with a pair list distance of 13.5 Å. All hydrogen atoms were included during the calculation.

Validation of predicted homology model

Structural and packing architecture of the modeled protein were calculated, i.e., by calculating mean hydrogen bonds distances, mean dihedral angles, accessible surface area and packing volume, by using VADAR 1.4 program [25]. PROCHECK analysis [26] of the model was done to check that the residues were falling in the most favored region in the Ramachandran’s plot. The resultant energy minimized protein models were used for the active site identification and for the docking analysis with substrate.

Active site identification

The substrate accessible pockets and active sites of CCR were identified by computed atlas of surface topography of proteins (CASTp) calculation [27] and CCDC GOLD [2830]. The identified active sites were analyzed for amino acid cluster groups based on the solvent exposed active site atoms and bonding capacity of the polar groups. The substrate molecule feruloyl CoA, caffeoyl CoA, 5-hydroxyferuloyl CoA and sinapoyl CoA were downloaded from Pubchem database of NCBI [31], and converted to 3D structure with VEGA ZZ software [32].

Docking studies

Substrate molecules were docked to the binding sites by CCDC’s GOLD (genetic optimization for ligand docking) [2830]. One-hundred genetic algorithm (GA) runs were performed for each compound, and 10 ligand bumps were allowed in an attempt to account for mutual ligand/target fit. The binding region for the docking study was defined as a 10 Å radius sphere centered on the active site. For each of the GA run a maximum number of 100,000 operations were performed on a population of 100 individuals with a selection pressure of 1.1. The number of islands was set to 5 with a niche size of 2. The weights of crossover, mutation, and migration were set to 95, 95, and 10 respectively. The scoring function Gold Score implemented in GOLD was used to rank the docking positions of the molecules, which were clustered together when differing by more than 2 Å rmsd. The best ranking clusters for each of the molecules were selected. Hydrogen bonds, bond lengths and close contacts between enzyme active site and substrate atoms were analyzed.

Results and discussion

Construction of model

Potential templates of target proteins, were obtained from the BLAST search for CCR. Template selection was performed on the basis of sequence similarity, residues completeness, crystal resolution and functional similarity. Table 1 shows the basic information on the selected template used in the study. Among the available candidate templates the sequence identities between target and templates are about ≤43%. In general, sequence identities of 30% are enough to construct the 3D model of target proteins through the homology modeling. 2C29.pdb was chosen as the template for the modeling of CCR. Its structure was solved as a ternary complex obtained with the oxidized form of nicotinamide adenine dinucleotide phosphate and dihydroquercetin. The model with the lowest value of the MODELLER objective function was selected as the best model for CCR. The final alignment of CCR and template (PDB ID: 2C29.pdb) with conservation was showed in Fig. 1.

Table 1 Target and template proteins information
Fig. 1
figure 1

Sequence alignment of CCR with template. The asterisk indicates an identical or conserved residue; a colon indicates a conserved substitution; a dot indicates a semi-conserved substitution

Structural analysis of predicted models

Assessment of stereochemical properties of main-chain and side-chain residues was performed using Ramachandran plot. Figure 2 shows 91.9% of CCR residues are located in the most favoured regions, 7.1% residues in additional allowed regions and 1% residues in generously allowed regions in the Ramachandran plot of the selected model. The model which has above 90% of the residues falling in the most favorable region of Ramachandran plot was considered as a good model. In addition to mean helix phy, psi parameters and mean Chi Gauche values, the mean hydrogen bond distances and energy in the models were similar to the known protein crystal structures data. Mean residue volume in the model is 139.8 Å3 and total packing volume of the model is 46835.0 Å3, which shows very good packing architecture. All the structural parameters are shown in Table 2. The final model structure of CCR and superimposition of CCR and template were displayed in Fig. 3a and b. The central core region of this model mainly consists of β sheets surrounded by helices.

Fig. 2
figure 2

Residue profile of CCR in Ramachandran’s plot

Table 2 Structural characteristics and features of the models
Fig. 3
figure 3

(a) Homology model of Cinnamoyl CoA reductase (CCR) represented in ribbon diagram, (b) Superimposition of CCR and 3D structure of Dihydroflavonol-4-reductase (PDB entry 2C29.pdb), CCR represented in yellow and Dihydroflavonol-4-reductase represented in red color

Catalytic active site prediction and docking studies

The sequences of CCR were observed conserved across number of plant species, in Fig. 4 multiple sequence alignments of CCR were presented. Figure 5 shows the predicted solvent exposed active site residues of the CCR. The solvent exposed active site residues are shaded in red color in spacefill model. Substrate binding site pocket is made up from 15 residues, i.e., residues 52–60, 62–63, 73, and 206–208. Among these, seven residues are polar uncharged, six residues are polar charged and two residues are nonpolar hydrophobic. The active site residue conservation was marked in multiple alignments by boxes (Fig. 4). The residues 52, 53, 55, 58, 60, 73, 206, and 208 are conserved in plant CCR sequences; in remaining active site residues 54 and 59 are variable. NADPH binding site pocket composed with residues ASP77, LEU78, LEU79, SER136, TYR180, PRO197, VAL198, LEU199 and SER212. In the template, active site for substrate binding comprises SER14, GLY15, PHE16, ILE17, VAL36, ARG37, LYS44, ASP64, LEU65, VAL84, ALA85, THR86, MET88, THR126, SER127, THR163, LYS167, PRO190, THR191, VAL193, PRO204, and SER205. However in case of CCR the substrate binding site residues are different, this is because of a different substrate. Perhaps the NADPH binding residues SER136, TYR180, PRO197, LEU199 and SER212 of CCR are conserved. Enzyme active site-feruloyl CoA docking was performed in 100 genetic algorithm runs which resulted in 10 highest score enzyme-substrate conformations. Among these, the best enzyme-substrate conformation was selected. The best enzyme-substrate interacting complex gives Gold fitness function score of 88.3428. Further, eight hydrogen bonds, involving residues ARG51, ASN52, ASP54 and ASN58, were observed to interact with the substrate atoms. Residue ARG51 forms four hydrogen bonds with substrate atoms, which is assumed as most active residue in the catalytic active site of CCR followed by ASN58 with only two hydrogen bonds. Table 3 describes the hydrogen bonding profile of docking conformations of CCR-feroloyl CoA. Figure 6 shows the active site cavity and feruloyl CoA binding conformation at active site. Figure 7 shows the hydrogen bond interactions between CCR and feruloyl CoA. All the hydrogen bond distances were observed within the range of 1.373 Å to 2.669 Å. The docking of CCR-feruloyl CoA interactions reveals that the residues ARG51, ASN52, ASP54 and ASN58 are involved in substrate binding. Interestingly, residue ARG51 is present just out side of the active site mouth pocket, enabling CCR-feluloyl CoA interaction more strong among the other substrates. Therefore in addition to the high number of hydrogen bond interactions, it is supposed to be most active residue in interacting complex.

Fig. 4
figure 4

Multiple sequence alignments of CCR homologues showing conserved regions and active site residues are marked in boxes

Fig. 5
figure 5

Solvent exposed active site residues (red) of CCR, in space-fill representation

Table 3 Hydrogen bonding profile of docking conformations of CCR-feroloyl CoA
Fig. 6
figure 6

Surface representation of catalytic active site in the CCR model (a) and close up view (b). Feruloyl CoA Conformation in the active site of CCR represented in both stick (c) and solid surface diagram (d)

Fig. 7
figure 7

Interaction of feruloyl CoA (stick form) with CCR catalytic active site residues ARG51, ASN52, ASP54, ASN58 (ball and stick form)

Comparative analysis of CCR-substrates conformations

Docking conformations of all the substrates with CCR reveals that, CCR exhibits the highest Gold fitness function score of 88.3428 with feruloyl CoA. CCR Docking conformation with 5-hydroxyferuloyl CoA, caffeoyl CoA and sinapoyl CoA were shown with gold score of 57.0898, 46.6212 and 40.1429 respectively. 5-hydroxyferuloyl CoA forming 8 hydrogen bonds with residue GLY29, ALA32, ARG51, ASN52, ASN58, SER99 and SER207. The hydrogen bonds are in the range between 1.700 Å to 2.471 Å. The interacting residues are distantly placed in the sequence but due to usual protein folding they are closer. These residues are present at the active site cavity and produce productive binding with 5-hydroxyferuloyl CoA. The CCR-5-hydroxyferuloyl CoA docking conformation is shown in Fig. 8. Residues GLY28, ARG51, ASN52, ASP55 and ASN58 interact with Caffeoyl CoA by forming seven hydrogen bonds. Residue ARG51 forms three hydrogen bonds with caffeoyl CoA, and each of the remaining interacting residues GLY28, ASN52, ASP55 and ASN58 form only one hydrogen bond. The CCR-caffeoyl CoA conformation showing that the substrate is not fully accessing all the active site residues. The hydrogen bonds are in the range between 1.379 Å to 2.706 Å. The CCR- caffeoyl CoA docking conformation is presented in Fig. 9. Sinnapoyl CoA binds to residues ASP55, SER56, LYS57 and SER207 with seven hydrogen bonds. Residue LYS57 alone makes three hydrogen bonds, SER56 makes two hydrogen bonds and the remaining residues ASP55 and SER207 make one hydrogen bond each. The hydrogen bonds are in the range between 1.369 Å to 2.612 Å. The CCR- sinnapoyl CoA docking conformation is presented Fig. 10. The hydrogen bond interactions and gold scores of docking conformations are depicted in Table 4. The substrate binding residue profile shows that, only feruloyl CoA is interacting with charged polar aminoacids, where as other substrates are interacting with polar-uncharged and non-polar amino acids, due to this reason the CCR-feruloyl CoA interaction is stronger than other substrates.

Fig. 8
figure 8

5-hydroxyferuloyl CoA Conformation in the active site of CCR represented in both stick (a) and solid surface diagram (b)

Fig. 9
figure 9

Caffeoyl CoA Conformation in the active site of CCR represented in both stick (a) and solid surface diagram (b)

Fig. 10
figure 10

Sinnapoyl CoA Conformation in the active site of CCR represented in both stick (a) and solid surface diagram (b)

Table 4 Docking statistics of CCR and substrates

Conclusions

CCR plays a prime role in the lignin biosynthesis pathway as it provides cinnamaldehyde intermediates for the formation of guaiacyl lignin and syringyl lignin. Homology model of CCR of Leucaena leucocephala showed 91.9% of residues fall in the most favorable region and 7.1% of residues are in the allowed region of Ramchandran plot, suggesting the modeled CCR structure was reliable for the docking studies. Top ten ranked CCR-feruloyl CoA docking conformation analysis reveals that, the feruloyl CoA binds at the mouth of the active site pocket. The best docking conformation showed residues ARG51, ASN52, ASP54 and ASN58 are involved in feruloyl CoA binding with eight hydrogen bonds. The residue ARG51 is present just outside the active site mouth and sharing half of the total hydrogen bonds with feruloyl CoA, this result concludes the active participation of ARG51 in the enzyme-substrate interaction. CCR-feruloyl CoA conformations showed, feruloyl CoA binds at the gateway to the active site. These docking studies deduce that the CCR-feruloyl CoA complex is stable to produce its product coniferaldehyde and will be involved in competitive inhibition of other substrates.