Abstract
The helicase gene of Autographa californica multiple nucleopolyhedrovirus (AcMNPV) is not only involved in viral DNA replication, but also plays a role in viral host range. To identify the codon usage bias of helicase of AcMNPV, the codon usage bias of helicase was especially studies in AcMNPV and 41 reference strains of baculoviruses by calculating the codon adaptation index (CAI), effective number of codon (ENc), relative synonymous codon usage (RSCU), and other indices. The helicase of baculovirus is less biased (mean ENc = 50.539 > 40; mean CAI = 0.246). AcMNPV helicase has a strong bias toward the synonymous codons with G and C at the third codon position (GC3s = 53.6%). The plot of GC3s against ENc values revealed that GC compositional constraints are the main factor that determines the codon usage bias of major of helicase. Several indicators supported that the codon usage pattern of helicase is mainly subject to mutation pressure. Analysis of variation in codon usage and amino acid composition indicated AcMNPV helicase shows the significant preference for one or more postulated codons for each amino acid. A cluster analysis based on RSCU values suggested that AcMNPV is evolutionarily closer to members of group I alphabaculovirus. Comparison of the codon usage pattern among E. coli, yeast, mouse, human and AcMNPV showed that yeast is a suitable expression system for AcMNPV helicase. AcMNPV helicase shows weak codon usage bias. This study may help in elucidating the functional mechanism of AcMNPV helicase and the evolution of baculovirus helicases.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
The alternative synonymous codons in the genes do not occur with equal frequencies but follow the particular codon usage pattern in most organisms (Babbitt et al. 2014). It seems that each amino acid prefers one synonymous codon to the other. This phenomenon is termed codon usage bias. The majority of genes in a genome have similar selections among the synonymous codons (Grantham et al. 1980). The same protein encoded by synonymous codons may differ in their protein structures and functions (Buhr et al. 2016). According to existing literatures, the restriction factors contributing to the codon usage preference include the mutational bias, translational and transcriptional selection, tRNA abundance, RNA stability, GC content, gene length and protein structure (Hunt et al. 2014; Pop et al. 2014). Analysis of codon usage bias can provide the information on the molecular evolution of species and the mechanism of action of protein, and help to optimize the protein expression system for improving the production of heterologous proteins (Plotkin and Kudla 2010; Zhou et al. 2016).
Members of Baculoviridae family contain rod-shaped enveloped virions carrying double-stranded circular DNA genome in size ranging from 80 to 180 kbp. Traditionally, baculoviruses can be morphologically classified into two distinct subgroups: nuclear polyhedrosis viruses (NPVs) and granulosis viruses (GVs). The new classification has divided baculoviruses into four genera: alphabaculovirus (Lepidoptera NPV), betabaculovirus (Lepidoptera GV), gammabaculovirus (Hymenoptera NPV) and deltabaculovirus (Lepidoptera NPV) (Miele et al. 2011; Miller 1997). The alphabaculoviruses could be subdivided into two groups known as group I and group II based on the phylogenetic studies (Wang et al. 2014; Zhu et al. 2014). According to the effective number of codon (ENc), previous study found that most of sequenced baculovirus species do not have the strong codon usage bias (Jiang et al. 2008). Autographa californica multiple nucleopolyhedrovirus (AcMNPV), exhibiting a wide host range from 33 Lepidopteran species in ten families, has been most comprehensively studied in the molecular biology level because of its use as a foreign protein expression vector, gene therapy tool and arthropod-specific non-chemical biopesticide.
The helicase gene (also termed p143) of AcMNPV encodes a protein with the predicted size of 143 kDa with the motifs of Helicase, which could unwind the double strands DNA to promote the replication of virus genome (McDougal and Guarino 2001). Recent study showed that there is a replication origin (ori) in the encoding region of AcMNPV helicase, which could replicate in the insect cells and mammalian cells (Wu et al. 2014). The ori also could promote the gene expression in the mammalian cells. Moreover, AcMNPV helicase is also related to host range and shows the species specificity (Croizier et al. 1994; Hamajima et al. 2015). Although the functions of helicase in AcMNPV have been well studied, the study on the synonymous codon usage bias of helicase has not yet been comprehensively reported.
In this study, synonymous codon usage bias of helicase of AcMNPV C6 strain was analyzed and compared with those of 41 other baculoviruses. Then, we explored the possible restriction factors contributing to the codon usage bias. Moreover, the codon usage preference of helicase of AcMNPV was compared with those of E. coli, yeast, mouse and human. All these analysis results may provide some clues to the features of genetic evolution of baculovirus and the possible functional mechanism of helicase.
Materials and methods
Sequence data
The nucleotide sequences of helicase of AcMNPV clone C6 (Accession number: L22858.1) and other 41 other baculoviruses (Table 1) were downloaded from the NCBI GenBank database (http://www.ncbi.nlm.nih.gov/).
Codon usage analysis of helicase genes of AcMNPV C6 strain and 41 reference baculoviruses
To analyze the codon usage bias in helicase genes of AcMNPV and other baculoviruses, some indicators were calculated.
CAI means codon adaptation index. It is a useful and universal measure of synonymous codon usage of genes of a number of organisms (Carbone et al. 2003). In addition, this index can be utilized to predict the expression level of a gene, and assess the adaptation of viral genes to their hosts (Sharp and Li 1987). The values of CAI range from 0 to1, and higher values of CAI denote higher levels of codon usage bias.
ENc means effective number of codons. ENc values range from 20 to 61, and small values of ENc mean high levels of codon usage bias. ENc = 20 means only one codon is used for each amino acid while ENc = 61 means alternative synonymous codons are used in equal frequency. ENc is a meaningful indicator of the codon bias.
GC means the frequency of the nucleotides G + C in codon, and GC3s means the frequency of the nucleotides G + C at the third position of codon except Met, Trp and termination codons. They are the effective indicators of the content of the base composition bias.
RSCU means relative synonymous codon usage. It is employed to investigate the overall synonymous codon usage bias among the genes, and this value is demonstrated by the ratio of the observed codon usage to the expected value (Sharp and Li 1986). RSCU value > 1.0 means that the corresponding codon is more frequently used than expected. RSCU value would be 1.0 in the absence of codon usage bias. RSCU value < 1.0 means that the codon is less frequently used.
The fraction means the proportion of the synonymous codons encoding the same amino acid. The frequency of each codon means the appearance ratio of each codon in the 1000 bp of coding sequence.
The relative abundance of dinucleotides (RAD) in the helicase genes of baculoviruses was calculated by counting the odds ratio as previously described (Burge et al. 1992). The odds ratio
where f x refers the frequency of the nucleotide X, f y refers the frequency of the nucleotide Y, and f xy the frequency of the dinucleotide XY. Compared with a random association of mononucleotides, if ρ xy > 1.23, the XY dinucleotide is considered to be over-represented; if ρ xy < 0.78, the XY dinucleotide is considered to be under-represented.
Neutrality plot (GC12 vs. GC3). The correlation analysis between the GC contents at the first and second codon positions (GC12) and that at the third position (GC3) is used to judge the influence of mutation pressure and translational selection on the base composition (Wang et al. 2016).
Parity rule 2 (PR2) plot. PR2 plot can be constructed to examine the effect of mutation pressure and translational selection on the codon usage pattern (Sueoka 1999). The plot is displayed by the value of AU-bias as the ordinate and GC-bias as the abscissa at the third position of the four-codon amino acids (alanine, arginine, glycine, leucine, proline, serine, threonine and valine). The centre of the plot is the position where both coordinates are 0.5, with no bias between influence of mutation and translational selection rates.
CAI, ENc, GC, GC3s, A3s, T3s, C3s, G3s, hydrophobicity (GRAVY), aromaticity (AROMO) and RSCU were analyzed by CodonW 1.4.2 program (Peden 2005; http://codonw.sourceforge.net/). Correspondence analysis (CA) based on RSCU values was proceeded using CodonW 1.4.2 program. The GC12, GC3, frequency and fraction were analyzed by EMBOSS CUSP program (Rice et al. 2000). The heatmap based on RSCU values was produced by HemI software (Deng et al. 2014). The mutation analysis of baculovirus Helicases was carried out by Multalin (Corpet 1988).
Comparison between the codon preference of helicase of AcMNPV and those of E. coli, yeast, mouse and human
To examine whether different species have the similar codon usage pattern and further select suitable expression system, the comparison of the codon usage bias of AcMNPV helicase with those of E. coli, yeast, mouse and human was performed. The codon usage frequency data in E. coli, yeast, mouse and human is obtained from the web site http://www.kazusa.or.jp/codon.
Statistical analysis
Correlation analysis was performed using Spearman’s rank correlation test. All statistical analyses were conducted using SPSS 13.0 software package (SPSS Inc., 2003) (SPSS 2003). The comparison of values of CAI, ENc, GC content and GC3s content between NPVs and GVs was carried out by the one-way ANOVA test using SPSS 13.0 software package (SPSS Inc. 2003) (SPSS 2003). The P values of < 0.05 were considered statistically significant.
Results
Analysis of the codon usage of the helicase genes in 42 baculoviruses
The values of CAI, ENc, GC content, GC3s content and mononucleotide frequency were shown in Table1. The results indicated that the codon usage of helicases is generally random.
The CAI values of helicases in 42 baculoviruses range from 0.190 to 0.288, with an average value of 0.246 and a standard deviation (SD) of 0.022 (Table1). The average CAI value of helicases in NPVs is 0.248 (0.190–0.288; SD, 0.024) whereas that in GVs is 0.240 (0.214–0.267; SD, 0.016). The mean CAI value of helicases of NPVs and GVs is almost equal (one-way ANOVA, P > 0.05).
The values of ENc of helicases in 42 baculoviruses vary from 36.030 to 56.560, with a mean value of 50.539 and a SD of 3.766, indicating that the codon usage bias of helicases is small. The ENc values of helicases of NPVs range from 36.030 to 54.360, with an average value of 49.739 and a SD of 3.700 whereas those of GVs range from 47.750 to 56.560, with an average value of 52.792 and a SD of 3.082. The ENc values of NPVs are significantly higher than that of GVs (one-way ANOVA, P < 0.05). Particularly, compared with other members of baculoviruses, the value of ENc of helicase of LdMNPV is 36.03, indicating slightly higher codon usage bias.
The mean value of GC content in 42 baculoviruses is 40.112% (31.80–57.30%; SD, 5.264), whereas the average value of GC3s content is 49.319 (29.00–85.20%; SD, 11.334). The mean value of GC content in NPVs is 40.80% (31.800–57.300%; SD, 5.672), whereas the average value of GC3s content is 50.80% (29.000–50.800%; SD, 11.869). The mean value of GC content in GVs is 38.18% (32.20–44.80%; SD, 3.402), whereas the average value of GC3s content is 45.14 (29.00–63.00%; SD, 8.835). The GC and GC3s contents of NPVs are similar to those of GVs (one-way ANOVA, P > 0.05).
Correlation analysis among ENc, GC3s, CAI and gene length
The ENc value has been demonstrated to correlate with the GC3s content. A plot of ENc against GC3s was used to detect the heterogeneity of codon usage (Wright 1990). If GC3s is the only limiting factor for shaping the codon usage pattern, the values of ENc would fall on a continuous curve, representing random codon usage (Jiang et al. 2007; Wright 1990). Here, ENc value of each helicase against GC3s was plotted (Fig. 1a). It could be seen that although only a few points lie on the theoretical fitting curve, the bulk of points appear to lie near the solid curve, indicating that these helicases are subject to GC compositional constraints (r = −0.437, P < 0.01).
In addition, plots of gene length against ENc (Fig. 1b) and GC3s (Fig. 1c) were used to demonstrate the distribution of each helicase. The gene length of helicases of NPVs is significantly higher than that of GVs (one-way ANOVA, P < 0.01). From Fig. 1b, c, it appeared that the longer genes have relatively higher variance of ENc and GC3s values than the shorter genes. Correlation analysis showed that gene length has a significant correlation with GC3s values (r = 0.376, P < 0.05). However, there was no significant correlation between gene length and ENc values (r = −0.114, P > 0.05).
The effect of mutation pressure on the codon usage pattern of helicases of baculoviruses
The codon usage of base composition of genes of a species with A/T rich genome may behave differentially from these with G/C rich genomes. Correlation between CAI and ENc has been utilized to show the effect of mutation pressure and translational selection on the codon usage bias (Vicario et al. 2007). If the correlation (r) between the two indices approaches − 1, the translational selection is preferred. Otherwise, if r approaches 0 (no correlation), the mutation pressure may be influential. The results showed that CAI value of helicases of baculoviruses has no significant correlation with ENc value (r = − 0.243, P > 0.05). This result reflected the influence of mutation pressure on the codon usage pattern of baculovirus helicases.
To further determine the effect of mutation pressure and translational selection on the formation of codon usage bias of helicases of baculoviruses, the neutrality plot analysis was performed. The result showed that there is a significant correlation between GC12 and GC3 (r = 0.615, P < 0.01), indicating that mutation pressure affects the codon usage bias of helicases of baculviruses.
To determine the relationship between pyrimidines (C and T) and purines (A and G) contents in fourfold degenerate codon families, PR2 plot analysis was carried out. The result indicated that A and G are more frequently used than T and C in fourfold degenerate codon families (Fig. 2). This result indicated that mutation pressure and other factors may predominately shape the codon usage bias of helicases of baculoviruses.
Variation in gene codon usage and amino acid composition of AcMNPV helicase
The overall codon preference of the helicase gene in AcMNPV C6 strain was shown in Table 2. RSCU values were calculated for 59 codons (excepting Met, Trp, and termination codons) in the helicases. The RSCU values of 25 codons are more than 1, indicating that these codons are more frequently used. This result was consistent with the corresponding fractions and frequencies. Among 25 codons, 16 have an end base of G- or C, while nine have an end base of A- or T. The amino acids Ala, Gly, Leu, Pro, Arg, Ser, Thr and Val show a higher level of diversity of codon usage due to their six and fourfold coding degeneracy. In addition, the amino acids Cys, Asp, Glu, Phe, Ile, His, Lys, Asn, and Gln also have a high level of diversity of codon usage bias, although they only have two or threefold coding degeneracy. Thus, it is concluded that the most and least frequencies of the used codons of each amino acid are different in AcMNPV helicase.
Relationship between dinucleotide biases and codon usage in helicases of baculoviruses
Dinucleotide bias analysis indicated that the occurrences of dinucleotides in helicases are not randomly distributed (Table 3). In particular, the dinucleotides CG, TG and TT are over-represented (ρxy > 1.23) while AG, CT and TA are under-represented (ρxy < 0.78). Moreover, the analysis of RSCU values of AcMNPV helicase showed that the most of codons containing AG, CT and TA are not preferentially used, including GCT, GAG, ATA, AAG, CTA, CTC, CTG, CTT, TTA, CCT, CAG, AGG, AGT, TCT, ACT, and TAT. Totally, the compositions of dinucleotides play a role in shaping the synonymous codon usage pattern of helicases of baculoviruses.
Effect of other factors on codon usage of helicases of baculoviruses
GRAVY and AROMO also affect the codon usage bias of viruses (Wang et al. 2016). Correlation analysis indicated that AROMO is positively correlated with Axis2 and ENc while GRAVY is negatively correlated with Axis2 and ENc. However, both AROMO and GRAVY do not have any correlation with Axis1, GC3s and GC (Table 4). This result indicated that aromaticity and degree of protein hydrophobicity do not play an essential role on the codon usage pattern of helicases of baculoviruses.
Phylogenetic persistence in codon usage bias of the AcMNPV helicase gene and amino acid mutation analysis
To generate a visual representation of the variation in the codon usage bias, a cluster analysis was performed based on the RSCU values of the helicase genes of AcMNPV C6 strain and other 41 baculoviruses (Table 2). From heatmap (Fig. 3), AcMNPV, PlxyMNPV, BomaNPV, BmNPV, RoMNPV and MaviMNPV in the group I alphabaculovirus firstly cluster together and form a separate branch, then cluster with the other members of group I alphabaculovirus, such as ThorNPV, AngeNPV, ChfuNPV and HycuNPV, and subsequently cluster with the some strains of group II alphabaculovirus. These results wholly showed the internal relationship of the codon usage feature between AcMNPV and other alphabaculoviruses, indicating that the more distant the genetic relationship, the wider the expected variation in the codon usage bias. Overall, the codon usage preference of AcMNPV is fairly close to that of the strains of group I alphabaculovirus.
Three amino acids at positions 556, 564, 577 in AcMNPV helicase were found to be involved in host range extension of AcMNPV (Croizier et al. 1994). Therefore, the amino acid alignment of Helicase in 42 baculoviruses was carried out. The result showed that, at position 556, Val (n = 23, n means the number of baculovirus species that contains this amino acid in Helicase) is universal; at position 564, Ser (n = 16) is universal; at position 577, Phe (n = 17) is universal (Table 5).
Correspondence analysis and correlation analysis: compositional characteristics of helicases of baculoviruses
The A, T, C, G, GC contents were compared with A3s, T3s, C3s, G3s, and GC3s contents (Table 6). It can be seen that both the A and T contents have a significant positive correlation with the contents of A3s or T3s as well as a significant negative correlation with the contents of C3s, G3s, and GC3s. In addition, the C, G, GC contents have a significant negative correlation with the contents of A3s, T3s as well as a significant positive correlation with that of C3s, G3s, and GC3s. These data indicated that nucleotide composition constraint can influence the codon usage pattern of helicases of baculoviruses.
Correspondence analysis was performed to identify the main trends in the codon usage variation and distribution of each helicase gene along the continuous axes. The positions of helicase genes defined by the first (Axis1) and second (Axis2) axes were shown in Fig. 4. The first axis accounts for 43.79% of total variation, and the second axis accounts for 8.49% of total variation. Hence, the first axis explains a substantial amount of the total variation of synonymous codon usage of helicase. A correlation analysis indicated that Axis1 has a significant positive correlation with ENc and the contents of A, T, A3s, and T3s as well as a significant negative correlation with the contents of C, G, C3s, G3s, GC, and GC3s. However, Axis2 is only positively correlated with ENc (Table 7). Totally, the results indicated that mutation pressure from base composition makes an effect in shaping the codon usage of helicases of baculoviruses.
Comparison of codon usage of AcMNPV helicase gene with those of E. coli, yeast, mouse and human
The codon usage bias of the helicase of AcMNPV C6 strain was compared with that of E. coli, yeast, mouse and human in order to select the optimal host for the expression of helicase. From the Table 8, there are 29, 27, 27 and 30 codons showing an AcMNPV-to-E. coli, AcMNPV-to-yeast, AcMNPV-to-mouse and AcMNPV-to- human ratio higher than two or lower than 0.5, respectively, demonstrating that the distinct diversity of codon preference exists in these hosts. It seemed that the codon usage pattern of AcMNPV helicase approximates to that of yeast and mouse. Yeast is a familiar eukaryotic expressing system, which may be more suitable to express AcMNPV helicase.
Discussion
This study analyzed the synonymous codon usage bias of helicases of AcMNPV and other 41 baculoviruses. We found that baculovirus helicases show weak codon usage bias.
Analysis of gene sequences reveals that the synonymous codon usage bias exists widely in different species. Among the mammals and birds, the codon usage preference may arise from the genomic compositional constraints. For some unicellular organisms, the codon usage patterns of the highly expressed genes are subject to translational selection and largely tended to match the more abundant tRNA species whereas that of the lower expressed genes are subject to mutational bias without the translational selection (Gouy and Gautier 1982; Moriyama and Powell 1997). Codon usage bias was regarded as a consequence of the balance between the mutation bias and translational selection. The mutation bias seems to make more effects than translational selection on the codon usage bias of some viruses (Auewarakul 2005; Nasrullah et al. 2015). In this study, correlation analysis between ENc and GC3s, GC12 and GC3, CAI and ENc, as well as the first two axes and the nucleotide contents indicated that baculovirus helicases are subject to GC compositional constraints and mutation pressure plays an essential role in shaping the codon usage pattern of helicases. PR2 plot analysis suggested that mutation pressure and other factors may shape the codon usage bias of helicases of baculoviruses. Although GRAVY and AROMO are related to codon usage bias of some viruses, the results presented in this study showed that they do not play a critical role in shaping the codon usage bias of helicases.
To assess the synonymous codon usage pattern independent of amino acid composition of different genes, RSCU was calculated to examine the codon usage bias of a variety of codons in the helicase of AcMNPV. The obvious bias toward G- or C-ended codon at the third position was shown and a higher level of diversity of codon usage bias was observed for coding amino acids, Ala, Gly, Leu, Pro, Arg, Ser, Thr and Val. These results indicated that AcMNPV helicase shows the significant preference for one or more postulated codons for each amino acid. The usage of optional codons may contribute to the presence of mutational pressure. Recent study found that codon usage regulates protein structure and activity as well as mRNA stability by affecting translation elongation speed. Accordingly, optimal codons promote the translation elongation rate whereas non-optimal codons reduce it (Zhao et al. 2017). Therefore, it is possible that these preferred codons may be required for high translation efficiency and activity in host cells, which might be beneficial to baculovirus DNA replication.
Increasing evidence suggests that the relative abundance of dinucleotides can affect the codon usage bias in some RNA viruses (Greenbaum et al. 2008; Wang et al. 2016). Previous study showed that TA and CG are under-represented in many sequence sets (Burge et al. 1992). However, for baculovirus helicases, CG is over-represented while TA is under-represented. The low abundances of CpG and UpA were thought to be required for viral immune escape and efficient transcription, respectively (Kumar et al. 2016). As the immune system of insect is different from that of mammalian in some aspects, it would be possible that the high abundance of CG in baculovirus genome may not be recognized by host cells and lead to the activation of the immune system.
The NPVs could infect the larvae in the orders Lepidoptera, Hymenoptera and Diptera whereas GVs only infect the larvae in the order Lepidoptera. In particular, AcMNPV can infect a wide range of Lepidopteran larval hosts. For example, AcMNPV can infect Plutella xylostella that is more permissible to PlxyMNPV (Harrison and Lynn 2007; Kadir et al. 1999). The underlying mechanism that determines the wide host range of AcMNPV is still unclear. The Helicase has turned out to be a candidate related to the host range of AcMNPV. When AcMNPV helicase is replaced by the counterpart of BmNPV, a member of group I alphabaculovirus, AcMNPV could kill Bm cells and larvae (Croizier et al. 1994). However, the helicase gene of TnGV, which belongs to betabaculovirus, could not substitute the AcMNPV helicase (Bideshi and Federici 2000). We hypothesized that the codon usage pattern of helicases among different baculovirus species may be highly variable, contributing to the difference of their host range. However, the codon usage analysis indicated that baculovirus helicase is less biased. Therefore, it is possible that, in addition to helicase, other genes or factors may also play a role in the determinant of extension of host range of AcMNPV. This proposal can be supported by some experimental evidence. For example, ChfuMNPV p143 solely cannot substitute AcMNPV p143. However, ChfuMNPV p143 and lef3 together can substitute the corresponding homologous genes of AcMNPV and initiate the virus replication (Yu and Carstens 2012). Xu et al. found that GP64 plays an important role in host range determination of BomaNPV S2 which can cover the host ranges of AcMNPV and BmNPV (Xu et al. 2012). Previous study also suggested the co-evolutionary adaptations of baculoviruses to their insect hosts, which may play a role in specialization of baculoviruses for their hosts (Herniou et al. 2004).
Previous study found that the replacement of three amino acids at positions 556, 564, 577 in AcMNPV Helicase by the corresponding amino acids of BmNPV can extend AcMNPV host range to B. mori (Croizier et al. 1994). We attempted to use the counterpart of CpGV to replace the helicase of AcMNPV. However, the recombinant AcMNPV cannot produce progeny virus (unpublished data). The amino acid at position 564 in AcMNPV is different from that in CpGV, which may be one reason why the CpGV helicase cannot replace the homologue of AcMNPV. Interestingly, the three amino acids in some alphabaculoviruses, including AcMNPV, PlxyMNPV, RoMNPV, MaviMNPV, HearNPV, HzSNPV, LeseNPV, and SpliNPV, are identical. It is unknown whether the helicases in these alphabaculoviruses can substitute the homologue of AcMNPV.
This study showed that the CAI value, GC and GC3s contents of helicases of NPVs are similar to that of GVs (one-way ANOVA, P > 0.05) while the ENc values of helicases of NPVs are significantly higher than that of GVs (one-way ANOVA, P < 0.05), suggesting their co-evolution. Previous study proposed that evolution enables the alphabaculovirus and betabaculovirus hosted by the same host species to share more similar patterns (Shi et al. 2016). In fact, some members of both NPVs and GVs evolve in same hosts, even at species levels, such as Plutella xylostella, Helicoverpa armigera and Agrotis segetum.
To compare the codon usage difference of genes from diverse species, the codon usage bias of AcMNPV was compared with those of E. coli, yeast, mouse and human. It appears that the codon usage bias of helicase of AcMNPV resembles that of yeast and mouse. Hence, the yeast expression system is more suitable for heterologous expression of AcMNPV helicase. These information would be beneficial for the optimization of AcMNPV helicase expression in vitro.
Taken together, the data presented here showed the random codon usage pattern of helicases from AcMNPV and other 41 baculovirus species. The main factor shaping codon usage bias is GC compositional constraints. This study may provide some clues for understanding the functional mechanism of AcMNPV Helicase and the evolution of baculoviruses.
References
Auewarakul P (2005) Composition bias and genome polarity of RNA viruses. Virus Res 109:33–37
Babbitt GA, Alawad MA, Schulze KV, Hudson AO (2014) Synonymous codon bias and functional constraint on GC3-related DNA backbone dynamics in the prokaryotic nucleoid. Nucleic Acids Res 42:10915–10926
Bideshi DK, Federici BA (2000) The Trichoplusia ni granulovirus helicase is unable to support replication of Autographa californica multicapsid nucleopolyhedrovirus in cells and larvae of T. ni. J Gen Virol 81:1593–1599
Buhr F, Jha S, Thommen M, Mittelstaet J, Kutz F, Schwalbe H, Rodnina MV, Komar AA (2016) Synonymous codons direct cotranslational folding toward different protein conformations. Mol Cell 61:341–351
Burge C, Campbell AM, Karlin S (1992) Over- and under-representation of short oligonucleotides in DNA sequences. Proc Natl Acad Sci USA 89:1358–1362
Carbone A, Zinovyev A, Képes F (2003) Codon adaptation index as a measure of dominating codon bias. Bioinformatics 19:2005–2015
Corpet F (1988) Multiple sequence alignment with hierarchical clustering. Nucleic Acids Res 16:10881–10890
Croizier G, Croizier L, Argaud O, Poudevigne D (1994) Extension of Autographa californica nuclear polyhedrosis virus host range by interspecific replacement of a short DNA sequence in the p143 helicase gene. Proc Natl Acad Sci USA 91:48–52
Deng W, Wang Y, Liu Z, Cheng H, Xue Y (2014) HemI: a toolkit for illustrating heatmaps. PLoS ONE 9:e111988
Gouy M, Gautier C (1982) Codon usage in bacteria: correlation with gene expressivity. Nucleic Acids Res 10:7055–7074
Grantham R, Gautier C, Gouy M (1980) Codon frequencies in 119 individual genes confirm corsistent choices of degenerate bases according to genome type. Nucleic Acids Res 8:1893–1912
Greenbaum BD, Levine AJ, Bhanot G, Rabadan R (2008) Patterns of evolution and host gene mimicry in influenza and other RNA viruses. PLoS Pathog 4:e1000079
Hamajima R, Kobayashi M, Ikeda M (2015) Identification of amino acid residues of AcMNPV P143 protein involved in rRNA degradation and restricted viral replication in BM-N cells from the silkworm Bombyx mori. Virology 485:244–251
Harrison RL, Lynn DE (2007) Genomic sequence analysis of a nucleopolyhedrovirus isolated from the diamondback moth, Plutella xylostella. Virus Genes 35:857–873
Herniou EA, Olszewski JA, O’Reilly DR, Cory JS (2004) Ancient coevolution of baculoviruses and their insect hosts. J Virol 78:3244–3251
Hunt RC, Simhadri VL, Iandoli M, Sauna ZE, Kimchi-Sarfaty C (2014) Exposing synonymous mutations. Trends Genet 30:308–321
Jiang P, Sun X, Lu Z (2007) Analysis of synonymous codon usage in Aeropyrum pernix K1 and other Crenarchaeota microorganisms. J Genet Genom 34:275–284
Jiang Y, Deng F, Wang H, Hu Z (2008) An extensive analysis on the global codon usage pattern of baculoviruses. Arch Virol 153:2273
Kadir HBA, Payne CC, Crook NE, Fenlon JS, Winstanley D (1999) The comparative susceptibility of the diamondback moth Plutella xylostella and some other major lepidopteran pests of brassica crops to a range of baculoviruses. Biocontrol Sci Technol 9:421–433
Kumar N, Bera BC, Greenbaum BD, Bhatia S, Sood R, Selvaraj P, Anand T, Tripathi BN, Virmani N (2016) Revelation of influencing factors in overall codon usage bias of equine influenza viruses. PLoS ONE 11:e0154376
McDougal VV, Guarino LA (2001) DNA and ATP binding activities of the baculovirus DNA helicase P143. J Virol 75:7206–7209
Miele SA, Garavaglia MJ, Belaich MN, Ghiringhelli PD (2011) Baculovirus: molecular insights on their diversity and conservation. Int J Evol Biol 2011:379424
Miller LK (1997) The Baculoviruses. Plenum Press, New York, p p35
Moriyama EN, Powell JR (1997) Codon usage bias and tRNA abundance in Drosophila. J Mol Evol 45:514–523
Nasrullah I, Butt AM, Tahir S, Idrees M, Tong Y (2015) Genomic analysis of codon usage shows influence of mutation pressure, natural selection, and host features on Marburg virus evolution. BMC Evol Biol 15:174
Peden J (2005) CodonW, Version 1.4.2. Available at http://codonw.sourceforge.net/
Plotkin JB, Kudla G (2010) Synonymous but not the same: the causes and consequences of codon bias. Nat Rev Genet 12:nrg2899
Pop C, Rouskin S, Ingolia NT, Han L, Phizicky EM, Weissman JS, Koller D (2014) Causal signals between codon bias, mRNA structure, and the efficiency of translation and elongation. Mol Syst Biol 10:770
Rice P, Longden I, Bleasby A (2000) EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet 16:276
Sharp PM, Li W-H (1986) Codon usage in regulatory genes in Escherichia coli does not reflect selection for ‘rare’codons. Nucleic Acids Res 14:7737–7749
Sharp PM, Li W-H (1987) The codon adaptation index-a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res 15:1281–1295
Shi SL, Jiang YR, Yang RS, Wang Y, Qin L (2016) Codon usage in Alphabaculovirus and Betabaculovirus hosted by the same insect species is weak, selection dominated and exhibits no more similar patterns than expected. Infect Genet Evol 44:412
SPSS S (2003) Base 12.0 for windows user’s guide. SPSS Inc., Chicago
Sueoka N (1999) Translation-coupled violation of Parity Rule 2 in human genes is not the cause of heterogeneity of the DNA G + C content of third codon position. Gene 238:53–58
Vicario S, Moriyama EN, Powell JR (2007) Codon usage in twelve species of Drosophila. BMC Evol Biol 7:226
Wang M, Wang J, Yin F, Tan Y, Deng F, Chen X, Jehle JA, Vlak JM, Hu Z, Wang H (2014) Unraveling the entry mechanism of baculoviruses and its evolutionary implications. J Virol 88:2301–2311
Wang H, Liu S, Zhang B, Wei W (2016) Analysis of synonymous codon usage bias of zika virus and its adaption to the hosts. PLoS ONE 11:e0166260
Wright F (1990) The ‘effective number of codons’ used in a gene. Gene 87:23–29
Wu Y-L, Wu C-P, Huang Y-H, Huang S-P, Lo H-R, Chang H-S, Lin P-H, Wu M-C, Chang C-J, Chao Y-C (2014) Identification of a high-efficiency baculovirus DNA replication origin that functions in insect and mammalian cells. J Virol 88:13073–13085
Xu YP, Gu LZ, Lou YH, Cheng RL, Xu HJ, Wang WB, Zhang CX (2012) A baculovirus isolated from wild silkworm encompasses the host ranges of Bombyx mori nucleopolyhedrosis virus and Autographa californica multiple nucleopolyhedrovirus in cultured cells. J Gen Virol 93:2480–2489
Yu M, Carstens EB (2012) Choristoneura fumiferana multiple nucleopolyhedrovirus LEF-3–P143 complex can complement DNA replication and budded virus in an AcMNPV LEF-3–P143 double knockout bacmid. J Gen Virol 93:383–388
Zhao F, Yu C-h, Liu Y (2017) Codon usage regulates protein structure and function by affecting translation elongation speed in Drosophila cells. Nucleic Acids Res 45:8484–8492
Zhou Z, Dang Y, Zhou M, Li L, Yu C-h, Fu J, Chen S, Liu Y (2016) Codon usage is an important determinant of gene expression levels largely through its effects on transcription. Proc Natl Acad Sci USA 113:E6117–E6125
Zhu Z, Yin F, Liu X, Hou D, Wang J, Zhang L, Arif B, Wang H, Deng F, Hu Z (2014) Genome sequence and analysis of Buzura suppressaria nucleopolyhedrovirus: a group II Alphabaculovirus. PLoS ONE 9:e86450
Acknowledgements
This work was supported by grants from the National Natural Science Foundation of China (31501701) and the Plant Foundation for Young Scientists of Henan University (CX0000A40557).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Author Wenqiang Wei declares that he has no conflict of interest. Author Hongju Wang declares that she has no conflict of interest. Author Tao Meng declares that he has no conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Rights and permissions
About this article
Cite this article
Wang, H., Meng, T. & Wei, W. Analysis of synonymous codon usage bias in helicase gene from Autographa californica multiple nucleopolyhedrovirus. Genes Genom 40, 767–780 (2018). https://doi.org/10.1007/s13258-018-0689-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13258-018-0689-x