Abstract
The sequence of amino acid monomers in the primary structure of a protein is decided by the corresponding sequence of codons (triplets of nucleic acid monomers) on the template messenger RNA (mRNA). The polymerization of a protein, by incorporation of the successive amino acid monomers, is carried out by a molecular machine called ribosome. We develop a stochastic kinetic model that captures the possibilities of mis-reading of mRNA codon and prior mis-charging of a tRNA. By a combination of analytical and numerical methods, we obtain the distribution of the times taken for incorporation of the successive amino acids in the growing protein in this mathematical model. The corresponding exact analytical expression for the average rate of elongation of a nascent protein is a ‘biologically motivated’ generalization of the Michaelis–Menten formula for the average rate of enzymatic reactions. This generalized Michaelis–Menten-like formula (and the exact analytical expressions for a few other quantities) that we report here display the interplay of four different branched pathways corresponding to selection of four different types of tRNA.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Enzymes are known to play crucial roles in almost all kinds of intracellular processes (Rittie and Perbal 2008). For the simplest enzymatic reaction studied theoretically by Michaelis and Menten more than a century ago (Johnson and Goody 2011; Michaelis and Menten 2013), the rate of the formation of the product in bulk is given by the so-called Michaelis–Menten (MM) equation (Johnson 2013; Michel and Ruelle 2013). However, at extremely low population of an enzyme, the time taken for each enzymatic cycle fluctuates from one cycle to another; the time taken in each round is often referred to as the turnover time. The distribution of the turnover time is the key statistical characteristics of reactions studied by single-molecule enzymology (Grima et al. 2014). Interestingly, in spite of the fluctuations in the turnover time, the mean turnover time for a large class of enzymatic reactions still follows the MM equation (Xie 2013b). Over the last century, various generalizations of the MM equation have emerged in several different contexts (Schnell and Maini 2003). In this paper, we present a generalization that emerges naturally in the context of biophysical chemistry of protein synthesis.
Proteins are polymers whose monomeric subunits are amino acids. The specific sequence of the amino acid species in the primary linear structure of a given protein is directed by the corresponding sequence of codons (triplets of nucleotide monomers) on the corresponding template messenger RNA (mRNA). The template-directed polymerization of a protein, called translation, is carried out by a molecular machine called ribosome (Spirin 2002; Rodnina et al. 2011; Frank 2011; Frank and Gonzalez 2010; Chowdhury 2013a, b) that consists of two loosely connected subunits designated as large and small. Transfer RNA (tRNA) molecules play crucial roles in translation (Kim 2014). When “charged” (amino-acylated) by a specific enzyme, called amino-acyl tRNA synthetase (aa-tRNA synthetase) (Ling et al. 2009; Reynolds et al. 2010; Yadavalli and Ibba 2012), one end of each species of these “adapter” molecules carries a specific amino acid. The amino acid brought in by a correctly charged cognate tRNA is also the correct one, as directed by the corresponding template; the other end of the same cognate tRNA molecule, referred to as anticodon, matches perfectly, by complementary base pairing, with the codon on the template mRNA. In contrast, increasing degree of mismatch makes the tRNA near-cognate or non-cognate.
Most aa-tRNA synthetases employ editing mechanisms to ensure correct charging of the corresponding tRNA molecules. However, because of the intrinsic stochasticity of aminoacylation, and occasional failure of the editing mechanism of those aa-tRNA synthetase that are capable of correcting erroneous aminoacylation, a mis-charged tRNA may be produced (Ling et al. 2009; Yadavalli and Ibba 2012). Therefore, even when it turns out to be a cognate tRNA for a given codon, such a mis-charged tRNA compromises the translational fidelity by contributing an amino acid which is different from that dictated by the mRNA template. Erroneous substitution of one amino acid by another is called mis-sense error. Pre-translational mis-charging of tRNA is not the only possible cause of mis-sense error. Erroneous selection of a correctly charged near-cognate or non-cognate tRNA, i.e., a co-translational mis-reading of a codon, also contributes to mis-sense error (Parker 1989; Cochella and Green 2005; Zaher and Green 2009; Johansson et al. 2008; Rodnina 2012).
Thus, at the beginning of each elongation cycle the macromolecular complex consisting of the ribosome and accessory molecules select one of the four possible pathways indicated by the tRNA selected: (i) correctly charged cognate tRNA, (ii) incorrectly charged cognate tRNA, (iii) correctly charged near-cognate tRNA, and (iv) correctly charged non-cognate tRNA. Along each of these pathways, the sequence of intermediate steps is identical although the molecular identities of the complexes are different. In other words, the network of mechano-chemical states consists of four distinct cycles that shares the same initial state.
The time taken by a ribosome to incorporate a single amino acid in the growing protein is also the duration of the ribosome’s dwell at the corresponding codon on the mRNA template. The distribution of the dwell times (DDT) characterizes the intrinsic stochastic nature of the process of elongation of the nascent protein by the ribosome. Here, we develop a stochastic kinetic model for the elongation phase of translation capturing, within a single mathematical framework, all the four cycles that branch out from the initial state. Our model also distinguishes between the concentrations of four distinct types of tRNA molecules, namely correctly charged cognate tRNA, incorrectly charged cognate tRNA, correctly charged near-cognate tRNA and correctly charged non-cognate tRNA. Solving the corresponding master equations (a set of coupled ordinary differential equations), for an appropriate initial condition that captures the beginning of translation of a codon, we obtain the DDT of the ribosome. Moreover, using the steady-state solutions of these master equations, we derive the exact analytical expression for the average velocity of the ribosome which is also the average rate of amino acid incorporation (i.e., average rate of elongation of the nascent protein) catalyzed by the ribosomal machinery. This expression is a generalization of the MM equation and, as we demonstrate explicitly, it reduces to the standard form of MM equation in the appropriate special limits of our model. Few graphical plots of the average rate of elongation, corresponding to some typical values of the rate constants, are presented to provide an intuitive understanding of the relative contributions of the four competing cycles.
Three of the four pathways originating from the initial state lead to translational error if the cycle is completed by adding an amino acid to the growing protein. Therefore, as a by-product of our calculation, we also get exact expressions for the translation error. The more stringent is the mechanism of selection of incoming tRNA the lower is the mis-reading error. But, increasing the probability of rejecting near-cognate and non-cognate tRNAs would increase the likelihood of accepting not only correctly charged cognate tRNA but also that of a mis-charged cognate tRNA. A few illustrative plots display the interplay of the effects of micharging of tRNA and mis-reading of mRNA in the overall mis-sense error in translation.
2 Model
Sharma and Chowdhury (2010) developed a five-state stochastic kinetic model (from now onwards referred to as SC model) for the elongation cycle of translation (see Fig. 1). The arrival of a aa-tRNA molecule, bound to GTP and EF-Tu, and its rejection because of the codon-anticodon mismatch is captured by the forward and reverse transitions \( 1 \rightleftharpoons 2\). The second stage of quality control (kinetic proofreading) involves hydrolysis of GTP by EF-Tu (\(2 \rightarrow 3\)) followed by either rejection (\(3 \rightarrow 1\)) or incorporation (\(3 \rightarrow 4\)) in the growing protein by formation of a peptide bond [see Blomberg (2007) for a pedagogical introduction to kinetic proofreading]. The first of the two-step translocation process consists of the Brownian rotation of large subunit of the ribosome with respect to the small subunit and simultaneous reversible transitions of the tRNAs between the so-called classical and hybrid states. The second, and the final, step of translocation, driven by hydrolysis of another molecule of GTP by EF-G completes the cycle irreversibly. More detailed stochastic models of mechano-chemical kinetics of each elongation cycle have been developed [see, for example, Xie (2013a), Kinz-Thompson et al. (2015)]. However, in order to capture some other aspects of translational kinetics, we describe the kinetics of elongation cycle by the simpler SC model. Nevertheless, the strategy of modeling followed here can be implemented also using the more detailed descriptions as the basic models of elongation cycle.
The SC model was used further to account for the stochastic alternating pause-and-translocation kinetics of a single ribosome (Sharma and Chowdhury 2011a) as well as for analyzing collective spatio-temporal organization of ribosomes in a polysome (Sharma and Chowdhury 2011b). Because of the extreme simplicity of the SC model model, no clear distinction could be made, in terms of different rate constants, between processes involving near-cognate and non-cognate tRNAs. More importantly, the SC model captured the possibility of mis-sense error arising from only mis-reading of the codons; it was not possible to incorporate the contributions from both mis-reading and mis-charging errors explicitly. The non-trivial extension of the SC model that we present here does not suffer from any of the above-mentioned limitations of the original SC model.
We begin formulation of the model with the four alternative elongation cycles shown in Fig. 2 which correspond to the four different mutually exclusive pathways that open up with the arrival of (a) correctly charged cognate tRNA, (b) incorrectly charged cognate tRNA, (c) correctly charged near-cognate tRNA, and (d) correctly charged non-cognate tRNA. Note that each of these cycles is formally identical to the only cycle that appeared in the original SC model. However, by opening up the possibility of four distinct pathways, each associated with a distinct identity of aa-tRNA, this model not only allows for a clear distinction between non-cognate, near-cognate and cognate tRNAs but also that between correctly and incorrectly charged cognate tRNAs.
Next, we simplify the model by exploiting some well known facts from the existing literature (Spirin 2002; Rodnina et al. 2011; Frank 2011). First, we note that \(\omega _{a}, \omega _{a}', \omega _{a}''\) and \(\omega _{a}'''\) are proportional to the concentrations of the corresponding aa-tRNA species; therefore, we assume:
where the symbol [.] denotes the concentration of the corresponding tRNA species and the prefactors are measures of the intrinsic rates of the reactions for unit concentration of the tRNA species. Thus, as stated in the introduction, concentrations of all the four types of tRNA molecules are incorporated explicitly.
The assumption (1) is valid under the “abundant substrate” condition, i.e., all four species of tRNA molecules are much more abundant than the ribosomes. This condition is commonly used in the stochastic models of enzyme kinetics although strong deviation from this condition can lead to drastically different results (Grima and Leier 2017).
We do not distinguish \(4'\) from 4 and \(5'\) from 5 because both the pathways \(3 \rightarrow 4 \rightarrow 5\) and \(3' \rightarrow 4' \rightarrow 5'\) involve movement of cognate tRNAs (see Fig. 3). Similarly, assuming the rates of translocation of near-cognate and non-cognate tRNA molecules to be comparable, but discriminating these from the corresponding cognate tRNAs, we assume \(4'' \equiv 4'''=4^{*} \not \equiv 4\) and \(5'' \equiv 5'''=5^{*} \not \equiv 5\) (see Fig. 3). These assumptions help in combining the four pathways shown in Fig. 2 within the single and simpler kinetic scheme depicted in Fig. 3 thereby also reducing the number of parameters (rate constants). From now onwards, unless stated otherwise, all our discussions will be based on the model kinetic scheme shown in Fig. 3.
We use the symbol \(P_{\mu }(j,t)\) to denote the probability at time t that the ribosome is in the “chemical” state \(\mu \) and is decoding the \(j\mathrm{th}\) codon. In the steady state, all the probabilities \(P_{\mu }(j,t)\) become independent of time. We define translational fidelity by the fraction
where we have used the relation \(\Omega _{p} P^{\prime \prime }_{3}+\Omega ^{\prime }_{p} P^{\prime \prime \prime }_{3}=\Omega _{h2} P^{*}_{5}\).
The total mis-sense error \(E=1-\phi \) is defined by the relation
which is the sum of the total mis-charged mis-sense error (i.e., mis-sense error arising solely from mis-charged tRNAs)
and the total mis-reading mis-sense error (i.e., mis-sense error arising only from mis-reading of codons)
Similarly, the fraction
is the fraction of mis-sense error caused by mis-charged cognate tRNAs, while the corresponding fraction of mis-sense error caused by mis-reading is defined by
Obviously, the average velocity of a ribosome in the steady-state can be obtained by substituting the expressions of \({\mathcal {P}}_{5}\) and \({\mathcal {P}}_5^*\) into the defining relation
where \({\ell }_\mathrm{c}\) is the length of a codon. We also note that the average velocity V of a ribosome is same as the average rate of elongation of the protein that it polymerizes.
We define the rejection factors
The four rejection factors characterize the frequencies of rejection of the incoming charged tRNA molecules in the four alternative pathways depicted in Fig. 3. The higher the value of a rejection factor the more frequent is the corresponding futile cycles.
The analytical results for this model that we report here are exact, i.e., these are derived without making any mathematical approximations. The derivations of these analytical expressions do not require imposition of any condition on the numerical values of the rate constants. However, we now list some biologically motivated constraints on the relative magnitudes of the rate constants that we’ll use later in this paper only for presenting the results graphically for biologically relevant situations. Based on the levels of base-pair complementarity between the codon and the anticodon of the incoming tRNA, we expect that under normal physiological conditions the following conditions would be satisfied: \(\omega _{r1}'''> \omega _{r1}'' > \omega _{r1}' = \omega _{r1}\). Motivated by similar considerations, for graphical plots, we also assume \(\omega _{r2}'''> \omega _{r2}'' > \omega _{r2}' = \omega _{r2}\). Continuing similar justification for the reduction in the number of model parameters, we assume \(\Omega _{p} \simeq \Omega _{p}' \simeq \omega _{p}' < \omega _{p}\).
3 Results
We begin our theoretical analysis by first solving the master equations (26) under steady-state conditions to get the corresponding expressions for \(P_{\mu }\); the full analytical expressions are given in Appendix A. Then, using those expressions for \(P_{\mu }\), we calculate the quantities of our interest namely, \(\phi \), \(E_\mathrm{mc}\), \(E_\mathrm{mr}\), \(\epsilon _\mathrm{mc}\), \(\epsilon _\mathrm{mr}\) and V. The results are listed below.
and, hence,
which is sum of the two contributions
and
Similarly, we get
and
In all the expressions (10)–(15) A, B, C and D are given by
The expressions (10)–(15) can be easily justified by intuitive physical arguments. Let us first consider the special case where \(\omega _{r1} = \omega _{r1}' = \omega _{r1}'' = \omega _{r1}''' = 0 =\omega _{r2} = \omega _{r2}' = \omega _{r2}'' = \omega _{r2}'''\). In this case, the expressions for A, B, C and D reduce to \(A=\omega _{a}, B=\omega _{a}', C=\omega _{a}''\) and \(D=\omega _{a}'''\), respectively. Consequently, \(\phi = \omega _{a}/(\omega _{a}+\omega _{a}'+\omega _{a}''+\omega _{a}''')\) is the probability of following the path \(1 \rightarrow 2\), instead of the other three alternatives, namely \(1 \rightarrow 2'\), \(1 \rightarrow 2''\) and \(1 \rightarrow 2'''\). Similarly, in this special case, the expression \(\epsilon _\mathrm{mc} = \omega _{a}'/(\omega _{a}'+\omega _{a}''+\omega _{a}''')\) is expected because \(\epsilon _\mathrm{mc}\) is the probability of following the path \(1 \rightarrow 2'\), instead of the two alternatives \(1 \rightarrow 2''\) and \(1 \rightarrow 2'''\).
In the general case, the rate of transition \(1 \rightarrow 2 \rightarrow 3 \rightarrow 4\) is given by
The quantities B, C and D have similar interpretations as rates for the transitions \(1 \rightarrow 2' \rightarrow 3' \rightarrow 4\), \(1 \rightarrow 2'' \rightarrow 3'' \rightarrow 4^{*}\) and \(1 \rightarrow 2''' \rightarrow 3''' \rightarrow 4^{*}\), respectively. Once the system reaches the state 4 it cannot return to state 1 without completing the full cycle. Therefore, fidelity \(\phi \) is the ratio \(A/(A+B+C+D)\). The expressions (12)–(15) for \(E_\mathrm{mc}, E_\mathrm{mr}\) and \(\epsilon _\mathrm{mc}, \epsilon _\mathrm{mr}\) also follow from the same interpretations of A, B, C and D.
Ribosome is an enzyme; interestingly, at any given instant of time its substrate-specificity depends on the codon that it is engaged in translating. In recent years, the average rate of translation has been shown to be a generalization of the rate of enzymatic reactions. Recall that for the Michaelis–Menten (MM) enzymatic reaction
the rate of the reaction under steady-state condition is given by the MM equation
where the Michaelis constant \(K_{M} = (k_{-1}+k_{2})/k_{+1}\) and \(V_\mathrm{max}=k_{2} [E]_{0}\), with \([E]_{0}\) being the initial (total) concentration of the enzyme. In the past, the average rate of translation by a ribosome has been shown to follow a generalized MM-like equation where the concentration of aa-tRNA is interpreted as the substrate concentration. For simpler models of translation reported earlier, the average rate of translation has been expressed as generalized MM equation (Garai et al. 2009; Chowdhury 2014).
For the full kinetic model shown in Fig. 3, the average rate of translation (i.e., the average velocity V of a ribosome) is given by
where
Equation (20) is a generalized version of the MM equation (19) for our model. An intuitive derivation of the expression (20), that provides a deeper physical interpretation of this formula, is given in Appendix B.
The connection between the two can be elucidated by considering a special case of our model. In the limit \([\hbox {tRNA}]_{c2} = [\hbox {tRNA}]_{n} = [\hbox {tRNA}]_{N} = \omega _{r2} = \omega _{br} = 0\) and \(\omega _{p} \rightarrow \infty \), \(\omega _{bf} \rightarrow \infty \), \(\omega _{h2} \rightarrow \infty \), the model reduces to
which is, formally, identical to the MM reaction (18). In this limit, the expression (20) reduces to
which is identical to MM equation (19) because of the correspondence \(\omega _{h1} \longleftrightarrow k_{2} = V_\mathrm{max}\) and \((\omega _{h1}+\omega _{r1})/\omega ^{0}_{a,c1} \longleftrightarrow (k_{-1}+k_{2})/k_{1} = K_{M}\). Thus, on the right-hand side of the equation (20), the sum of the last four terms is the generalized counterpart of \(1/V_\mathrm{max}\), while the first term is the generalized analog of \(K_{M}/V_\mathrm{max}[S]\).
Note that, from the perspective of enzymatic reactions, in each elongation cycle four distinct species of substrates (tRNA molecules) compete for the same enzyme. In order to graphically demonstrate the effects of this competition among the substrates, in Fig. 4 we plot 1 / V as a function of \(1/[\hbox {tRNA}]_{c1}\) under four different conditions.
Figure 4a corresponds to the simplest scenario where only correctly charged tRNA molecules are present. In this case, because of the absence competition among substrates, the plot is linear. This linear plot in Fig. 4a is the characteristic of MM equation displayed in what is known as the Lineweaver–Burk plot. By fitting the data of Fig. 4a to the MM equation (19), where the substrate concentration [S] is identified as \([\hbox {tRNA}]_{c1}\), we find \(K_{m}/V_\mathrm{max}=1.44\) and \(1/V_\mathrm{max}=0.208\); hence \(K_{m} \simeq 6.9\) and \(V_\mathrm{max} \simeq 4.8\) amino acids per second (or, equivalently, codons per second). The deviation from the linearity, as shown in Fig. 4b, arises from the presence of the competing second species of the tRNA molecules. The corresponding plots in Fig. 4c, d exhibit the increasing deviations from the single-substrate MM equation as the number of competing substrates increases. Interestingly, all the curves plotted in Fig. 4a–d have the same slope 1.44 in the limit \(1/[\hbox {tRNA}]_{c1} \rightarrow 0\), i.e., \([\hbox {tRNA}]_{c1} \rightarrow \infty \). One way of characterizing the extent of the deviation of the curves in Fig. 4b–d from linearity, with the increasing number of competing substrates, is by computing the saturation values of these curve in the limit \(1/[\hbox {tRNA}]_{c1} \rightarrow \infty \); these values are approximately 0.27, 0.24 and 0.23, respectively.
The numerical values assigned to the rate constants for the plots in Fig. 4 are not realistic in the sense that these do not reflect the intuitively expected relative strengths of the corresponding rates for the four different species of tRNA molecules. The sole purpose of treating all the four species of tRNA molecules on equal footing is to demonstrate the trend of increasing deviation from linearity on the Lineweaver–Burk plot with the increasing number of competing substrates. For plotting all the remaining graphs, we have used the parameter values as given in the Table 1.
The two-dimensional plots of the error fractions \(\epsilon _\mathrm{mc}\) and \(\epsilon _\mathrm{mr}\) against the rejection factors \({\mathcal {R}}''\) and \({\mathcal {R}}'''\) are shown in Figs. 5 and 6, respectively. Both show how the error fraction \(\epsilon _\mathrm{mr}\) decreases, while the fraction \(\epsilon _\mathrm{mc}\) increases with increasing \({\mathcal {R}}''\) and \({\mathcal {R}}'''\). The total mis-sense error can also decrease because, under favorable conditions, the increase of \(E_\mathrm{mc}\) with \({\mathcal {R}}''\) is more than compensated by the simultaneous decrease of \(E_\mathrm{mr}\), as shown in Fig. 7.
One of the main results reported above is the analytical expression for the average rate of translation, as given by Eq. 20, in the steady state. This average rate is the inverse of the mean time of dwell of a ribosome at successive codons (Sharma and Chowdhury 2011a) in the stochastic model reported in this paper. Ideally, for any such stochastic model of translational kinetics it is desirable to derive the full probability density for the dwell times of a ribosome. Therefore, we now give an outline of our derivation of the probability density f(t) of the dwell times of a ribosome in our model.
In order to simplify our calculations, following the trick used in Sharma and Chowdhury (2011a), we assume that the ribosome makes a transition to a hypothetical state \(\tilde{1}\) at the \((j+1)\)-th codon, after reaching the chemical state 5 or \(5^{*}\) at the j-th codon. It then relaxes to the chemical state 1 at the same codon \(j+1\) at a rate \(\delta \). We can recover our original model by taking \(\delta \rightarrow \infty \). The probability of finding the ribosome in this hypothetical state is denoted by \(\widetilde{P_1}(j+1,t)\). Now, we define the dwell time of the ribosome at a particular codon, say the j-th, by the time taken by the ribosome to reach state \(\tilde{1}\) at the \((j+1)\)-th codon, starting from the state 1 at the j-th codon. The master equations governing the time evolution of the probabilities, the normalization condition as well as the initial conditions for the set of master equations are given in Appendix C. The probability of incorporation of one amino acid to the growing polypeptide in the time interval between t and \(t+\Delta t\) is \(f(t)\Delta t\) where f(t) is given by Sharma and Chowdhury (2011a)
Since \(P_{\mu }(t)\) are time-dependent solutions under the specific initial condition mentioned above, we cannot use the steady-state (i.e., time-independent) solutions derived in Appendix A. Instead, we adopted two alternative approaches for finding the probability density f(t). In the first, which is essentially an analytical approach, we found f(t) by the use of a standard technique (Sharma and Chowdhury 2011a) based on Laplace transform. However, the analytical expressions of \(P_{5}(s)\) and \(P_{5}^{*}(s)\) in the Laplace space are too long (covering several pages) to be reproduced here. Therefore, instead, we have substituted the numerical values of the parameters listed in Table 1 and, then, carried out the inverse Laplace transform that involved finding the roots of a 5th degree polynomial which were calculated numerically. The four curves plotted graphically (by lines) in Fig. 8 have been obtained by repeating this procedure for four different values of \(\omega _{a}\). In the second approach, for the given initial condition, we numerically solved the master equations given in Appendix C (which are essentially a set of coupled ordinary differential equations) by a standard ODE solver in MATLAB and hence obtained f(t) by substituting \(P_{5}(t)\) and \(P^{*}_{5}(t)\) into (24). These numerical results for f(t) are plotted in Fig. 8 by discrete symbols. Results obtained by the two methods are in excellent agreement with each other.
As \(\omega _{a}\) increases the probability density f(t) becomes sharper. This trend of variation of the width of the distribution is consistent with the intuitive expectation that fluctuations in the dwell time, caused by the low concentration of tRNA, would become stronger with the decrease of \(\omega _{a}\) which is directly proportional to \([\hbox {tRNA}]_{c1}\). To quantify the relative strength of the fluctuation and mean of the dwell times, we have computed the numerical values of the randomness parameter, defined as
for the four curves plotted in Fig. 8; these data are presented in Table 2.
Naturally occurring mRNA templates in living cells consist of a heterogeneous sequence of nucleotides and, consequently, not all the codons are identical, in general. This feature of real mRNA templates can be captured in our theoretical model by assigning to each of rate constant different numerical values for translating different codons. However, only numerical results can be obtained in such cases. But, in order to derive the results analytically (in terms of closed form mathematical formulae), in this paper, we have considered the special case where the numerical values of each rate constant is independent of the type of the codon.
We now propose an in vitro experimental setup for testing our theoretical predictions reported in this paper. A sequence homogeneous mRNA template (for example, a poly-U, as shown schematically in Fig. 9) would be ideally suited for this purpose. Such templates are used routinely for in vitro experiments (Uemura et al. 2010). The coding sequence in such a mRNA template that is actually translated consists of \(N_\mathrm{c}\) number of identical codons UUU; in the poly-U template of Fig. 9, \(N_\mathrm{c} = 6\). The coding sequence is preceded by a normal start codon (AUG) and is followed by a stop codon (UAA). The untranslated region (UTR) upstream from the start codon is required not only for assembling the ribosome from its subunits but also for stabilizing the pre-initiation complex. At the \(3^{\prime }\)-end, the stop codon is followed by a sequence of noncoding codons UUU; this region merely ensures the absence of any ‘edge effect’, i.e., the translation is not affected when the ribosome approaches the \(3^{\prime }\)-end of the codon sequence.
The optical method proposed here exploits labeling of the four species of tRNA molecules by four different fluorescent dyes of four distinct colors. Each UUU codon codes for the amino acid phenylalanine (abbreviated Phe or F). Targeted (site specific) mutation at the editing site of the aa-tRNA synthetase would produce a defective variant of the enzyme whose editing mechanism has been disabled. The cognate tRNA molecules labeled by red fluorescent dye should be charged with amino acid Phe by the wild-type aa-tRNA synthetase. In contrast, the cognate tRNA molecules labeled by green fluorescent dye should be separately charged with some amino acid other than Phe by the mutated aa-tRNA synthetase. Although the latter charging reaction is expected to proceed at a much slower rate than the former, yield can still be significant if the concentration of the amino acid substrate is sufficiently high. Thus, red and green fluorescence would signal correctly charged and mis-charged cognate tRNA species, respectively.
A good choice for the corresponding near-cognate tRNA would be tRNALeu because it is cognate for the codon CUU which codes for leucine (abbreviated L). The correctly charged near-cognate tRNA can be labeled by orange dye while the non-cognate tRNA can be labeled by yellow dye. The color of the fluorescence pulse identifies the monomer species that elongates the polypeptide by one unit; monitoring the colors of the fluorescence pulses, one would get an estimate of mis-sense error arising separately from mis-charging of tRNA and mis-reading of mRNA. Moreover, the time interval between the arrival of the successive aa-tRNA molecules provides an estimate of the dwell times of the ribosome. Usually, the coding sequence of such poly-U mRNA strands is quite short. Therefore, for collecting enough data to extract the DDT, the experiment has to be repeated sufficiently large number of times.
4 Summary and Conclusion
In this paper, we have developed a theoretical model that includes both the effects of mis-charging of tRNA and mis-reading of mRNA during the elongation cycle of gene translation. It also allows explicit distinction between (i) correctly charged cognate tRNA, (ii) incorrectly charged cognate tRNA, (iii) correctly charged near-cognate tRNA, and (iv) correctly charged non-cognate tRNA. For each of the four species, the master equations capture only the essential steps of the elongation cycle. From these equations, we obtain the distribution of the dwell times of the ribosome at successive codons which is identical to the distribution of the times taken to incorporate the successive amino acids in the growing protein. From the steady-state solutions of the master equations, we also derive exact analytical formulae (10)–(15) that characterize various aspects of the erroneous protein polymerization process, particularly the average speed and fidelity of polymerization. The average speed of the ribosome, i.e., the average rate (20) of elongation of the protein, is an interesting generalization of the Michaelis–Menten equation that governs the average rate of a very simple enzymatic reaction. Some important implications of the analytical results reported here have been emphasized by plotting the results graphically. In particular, the plots show how with increasing rates of rejection of the near-cognate and non-cognate tRNAs the relative contribution of the mis-charged cognate tRNAs to the overall mis-sense error increases.
Most of the experimental works on mis-reading error have been carried out for bacteria. So far as the eukaryotes are concerned, a comprehensive analysis of translational mis-reading error in budding yeast has been reported by Kramer et al. (2010). Mis-charging of tRNA and the failure of the editing mechanisms have been investigated separately for a long time (Ling et al. 2009; Yadavalli and Ibba 2012). However, to our knowledge, the relative contribution of mis-charging error to the overall mis-sense error has not been measured quantitatively in any experiment on translational kinetics. It is worth pointing out that a mis-charging error is not always detrimental for biological function of a cell and are believed to play some regulatory roles under special conditions (Pan 2013; de Pouplana et al. 2014; Ruan et al. 2008; Fan et al. 2015).
The dependence of the frequencies of mis-reading error on the codon usage and tRNA concentration have been investigated extensively in the past (Garai et al. 2009; Fluitt et al. 2007; Zouridis and Hatzimanikatis 2008; Basu and Chowdhury 2007; Shah and Gilchrist 2010; Rudorf and Lipowsky 2015); typical frequencies of mis-reading error can be as high as 1 in \(10^{3}\) (Kramer and Farabaugh 2007). But, to our knowledge, mis-charging has not been incorporated so far in any mathematical model of kinetics of translation because under normal circumstances mis-charging error is as low as 1 in \(10^5\) (or even lower). But, when subjected to various types of stress, at least ten fold increase in mis-charging has been observed (Netzer 2010). The model and the analytical formulae derived here will be useful in analyzing the data collected in future experiments that might be performed for investigation of the same phenomenon.
Next, we point out some features of the model that should be reflected in the experimental set up which may, in near future, be analyzed with the analytical formulae reported in this paper. For a natural mRNA molecule, because of its sequence inhomogeneity, the identity of the cognate tRNA keeps changing from one codon to another. On the other hand, the rate constants in our mathematical derivation is based on the assumption that the rates are independent of the position of the ribosome, i.e., independent of the identity of the codons. Thus, the sequence heterogeneity of natural mRNA molecules make those unsuitable for direct comparison with the analytical formulae reported here. Nevertheless, the model can be simply extended by assigning codon-dependent rate constants; but, in that case the results cannot be derived analytically (with closed form mathematical expressions) although all the quantities can be evaluated numerically. Since no experimental data for direct comparison is available at present, we have not carried out numerical study of the sequence-dependent model.
As an alternative to sequence inhomogeneous real mRNA, a synthetic mRNA with homogeneous sequence can be used to directly test the validity of our analytical formulae reported here. For example, poly-U, along with the necessary start-, stop codons and untranslated region (UTR) upstream from the start codon (Sharma and Chowdhury 2011a) could be a good candidate for this purpose. For any study of mis-sense error, the tRNA species which contribute the successive amino acids of the growing protein have to be identified. Fluorescence-based optical techniques (Uemura et al. 2010; Chen and Tsai 2012) seem to be ideally suited for this purpose. The model will have to be extended in future also to incorporate the effects of microenvironment, cell cycle phase, etc. on translation. We hope that the relative quantitative contributions of mis-charging of tRNA and mis-reading of mRNA will be measured experimentally in near future and the analytical formulae derived here will find use in analyzing the experimental data. The experimental data will also guide extension of the model to make it more realistic by capturing features that are missing from the simple version reported in this paper.
References
Basu A, Chowdhury D (2007) Traffic of interacting ribosomes: effects of single-machine mechanochemistry on protein synthesis. Phys Rev E 75:021902
Blomberg C (2007) Physics of life: the physicist’s road to biology. Elsevier, Amsterdam
Chen J, Tsai A, OLeary SE, Petrov A, Puglisi JD (2012) Unraveling the dynamics of ribosome translocation. Curr Opin Struct Biol 22:804
Chowdhury D (2013a) Stochastic mechano-chemical kinetics of molecular motors: a multidisciplinary enterprise from a physicist’s perspective. Phys Rep 529:1
Chowdhury D (2013b) Modeling stochastic kinetics of molecular machines at multiple levels: from molecules to modules. Biophys J 104:2331
Chowdhury D (2014) Michaelis–Menten at 100 and allosterism at 50: driving molecular motors in a hailstorm with noisy ATPase engines and allosteric transmission. FEBS J 281:601
Cleland WW (1975) Partition analysis and the concept of net rate constant as tools in enzyme kinetics. Biochemistry 14:3220
Cochella L, Green R (2005) Fidelity in protein synthesis. Curr Biol 136:R536–R540
de Pouplana LR, Santos MAS, Zhu JH, Farabaugh PJ, Javid B (2014) Protein mistranslation: friend or foe? Trends Biochem Sci 39:355–362
Fan Y, Wu J, Ung MH, De Lay N, Cheng C, Ling J (2015) Protein mistranslation protects bacteria against oxidative stress. Nucl Acids Res 43:1740
Fluitt A, Pienaar E, Viljoen H (2007) Ribosome kinetics and aa-tRNA competition determine rate and fidelity of peptide synthesis. Comput Biol Chem 31:335
Frank J (ed) (2011) Molecular machines in biology. Cambridge University Press, Cambridge
Frank J, Gonzalez RL (2010) Structure and dynamics of a processive Brownian motor: the translating ribosome. Annu Rev Biochem 79:381–412
Garai A, Chowdhury D, Chowdhury D, Ramakrishnan TV (2009) Stochastic kinetics of ribosomes: single motor properties and collective behavior. Phys Rev E 80:011908
Grima R, Leier A (2017) Exact product formation rates for stochastic enzyme kinetics. J Phys Chem B 121:13–23
Grima R, Walter NG, Schnell S (2014) Single-molecule enzymology à la Michaelis–Menten. FEBS J 281:518
Johansson M, Lovmar M, Ehrenberg M (2008) Rate and accuracy of bacterial protein synthesis. Curr Opin Microbiol 11:141–147
Johnson KA (2013) A century of enzyme kinetic analysis, 1913 to 2013. FEBS Lett 587:2753
Johnson KA, Goody RS (2011) The original Michaelis constant: translation of the 1913 Michaelis-Menten paper. Biochemistry 50:8264–8269. doi:10.1021/bi201284u
Kim S (2014) Aminoacyl-tRNA synthetases in biology and medicine. Springer, Berlin
Kinz-Thompson C, Sharma AK, Frank J, Gonzalez RL Jr, Chowdhury D (2015) Quantitative connection between ensemble thermodynamics and single-molecule kinetics: a case study using cryogenic electron microscopy and single-molecule Fluorescence Resonance Energy Transfer investigations of the ribosome. J Phys Chem B 119:10888–10901
Kramer EB, Farabaugh PJ (2007) The frequency of translational misreading errors in E. coli is largely determined by tRNA competition. RNA 13:87
Kramer EB, Vallabhaneni H, Mayer LM, Farabaugh PJ (2010) A comprehensive analysis of translational missense errors in the yeast Saccharomyces cerevisiae. RNA 16:1797
Ling J, Reynolds N, Ibba M (2009) Aminoacyl-tRNA synthesis and translational quality control. Annu Rev Micorobiol 63:61–78
Michaelis L, Menten MML (2013) The kinetics of invertin action: translated by T.R.C. Boyde. FEBS Lett 587:2712–2720. doi:10.1016/j.febslet.2013.07.015
Michel D, Ruelle P (2013) Seven competing ways to recover the Michaelis–Menten equation reveal the alternative approaches to steady state modeling. J Math Chem 51:2271
Netzer N et al (2010) Innate immune and chemically triggered oxidative stress modifies translational fidelity. Nature 462:522
Pan T (2013) Adaptive translation as a mechanism of stress response and adaptation. Annu Rev Genet 47:121–137
Parker J (1989) Errors and alternatives in reading the universal genetic code. Microbiol Rev 53:273–298
Reynolds NM, Lazazzera BA, Ibba M (2010) Cellular mechanisms that control mistranslation. Nat Rev Microbiol 8:849–856
Rittie L, Perbal B (2008) Enzymes used in molecular biology: a useful guide. J Cell Commun Signal 2:25
Rodnina MV (2012) Quality control of mRNA decoding on the bacterial ribosome. Adv Protein Chem Struct Biol 86:95–128
Rodnina MV, Wintermeyer W, Green R (eds) (2011) Ribosomes: structure, function, and dynamics. Springer, Berlin
Ruan B, Palioura S, Sabina J, Marvin-Guy L, Kochhar S, LaRossa RA, Söll D (2008) Quality control despite mistranslation caused by an ambiguous genetic code. Proc Natl Acad Sci USA 105:16502–16507
Rudorf S, Lipowsky R (2015) Protein synthesis in E. coli: dependence of codon-specific elongation on tRNA concentration and codon usage. PLoS One 10:e0134994
Schnell S, Maini PK (2003) A century of enzyme kinetics. Should we believe in the Km and vmax estimates? Comments Theor Biol 8:169
Shah P, Gilchrist MA (2010) Effect of correlated tRNA abundances on translation errors and evolution of codon usage bias. PLoS Genet 6(9):e1001128
Sharma AK, Chowdhury D (2010) Quality control by a mobile molecular workshop: quality versus quantity. Phys Rev E 82:031912
Sharma AK, Chowdhury D (2011a) Distribution of dwell times of a ribosome: effects of infidelity, kinetic proofreading and ribosome crowding. Phys Biol 8:026005
Sharma AK, Chowdhury D (2011b) Stochastic theory of protein synthesis and polysome: ribosome profile on a single mRNA transcript. J Theor Biol 289:36–46
Spirin AS (2002) Ribosomes. Springer, Berlin
Uemura S, Aitken CE, Korlach J, Flusberg BA, Turner SW, Puglisi JD (2010) Real-time tRNA transit on single translating ribosomes at codon resolution. Nature 464:1012
Xie P (2013a) Dynamics of tRNA occupancy and dissociation during translation by the ribosome. J Theor Biol 316:49–60
Xie XS (2013b) Enzyme kinetics, past and present. Science 342:1457
Yadavalli SS, Ibba M (2012) Quality control in aminoacyl-tRNA synthesis. Adv Protein Chem Struct Biol 86:1–43
Zaher HS, Green R (2009) Fidelity at the molecular level: lessons from protein synthesis. Cell 136:746–762
Zouridis H, Hatzimanikatis V (2008) Effects of codon distributions and tRNA competition on protein translation. Biophys J 95:1018
Acknowledgements
DC thanks Joachim Frank, Ruben Gonzalez Jr. and Michael Ibba for useful correspondence. We also thank Joachim Frank, Adil Moghal and Alex Mogilner for valuable comments on a draft of this manuscript. We thank the anonymous referees for useful suggestions and for drawing our attention to a very recent paper. This work is supported by “Prof. S. Sampath Chair” professorship (DC) and a J.C. Bose National Fellowship (DC). DC also acknowledges hospitality of the Biological Physics Group of the Max-Planck Institute for the Physics of Complex Systems at Dresden, under the Visitors Program, during the initial stages of this work.
Funding was provided by Science and Engineering Research Board.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A: Master Equations and Steady-State Probabilities
The master equations governing the time evolution of the probabilities can be written as:
with the normalization condition
The steady-state solutions \(P_{\mu }\) of equations (26) are given by
where
Appendix B: Intuitive Derivation of the Expression for Average Rate of Translation
Following Cleland’s approach Cleland (1975) for replacing complex network of biochemical pathways by a simpler equivalent network and deriving the effective rates of the transitions of that network, we derive the expression for the average velocity of the ribosome in our model. To illustrate the method, we consider a simpler reaction
The effective rate constant, \(k_{1}'\), for \(X \xrightarrow {k_{1}'} Y,\) is given by
The same treatment can be applied to our model.
Let us first assume that only correctly charged cognate tRNAs are present (i.e., assuming that mis-charged cognate, near-cognate and non-cognate tRNAs are absent in the surrounding). For the five consecutive steps of the cycle, we denote the transit times by \(T_{1}\), \(T_{2}\), \(T_{3}\), \(T_{4}\) and \(T_{5}\), respectively. It is straightforward to see that
Therefore, for the above cycle, i.e., when only correctly charged cognate tRNA molecules are present in the surrounding, the average velocity of the ribosome would be
Similarly, for the other three cycles we can specify the transit times in a similar manner. For mis-charged cognate tRNA, the corresponding symbols are \(T'_{1},T'_{2},T'_{3},T'_{4},T'_{5}\), respectively, while for near-cognate tRNA, ths symbols are \(T''_{1},T''_{2},T''_{3},T''_{4},T''_{5}\), respectively. For non-cognate tRNA, we have \(T'''_{1},T'''_{2},T'''_{3},T'''_{4},T'''_{5}\), respectively.
Next, let us consider the general case when all the four different substrates are present in the surrounding; the kinetics of the system is described by full model shown in Fig. 3. The transit times are analogous to resistances in electrical circuits, which means that for a series of reaction, the transit times are additive and for parallel reaction pathways, the reciprocals of the transit times are additive. Hence, the average velocity for the complete model is
where
Appendix C: Master Equations and Derivation of Dwell Time Distribution
The master equations governing the time evolution of the probabilities are identical to those given in Appendix A, except for the following:
and the normalization condition becomes
Here, we take the initial conditions to be \(P_{1}(0)=1\), and \(P_{2}(0)=P_{3}(0)=P_{4}(0)=P_{5}(0)=P'_{2}(0) =P'_{3}(0)=P''_{2}(0)=P''_{3}(0)=P'''_{2}(0) =P'''_{3}(0)=P^{*}_{4}(0)=P^{*}_{5}(0)= \widetilde{P_{1}}(0)=0\).
Rights and permissions
About this article
Cite this article
Dutta, A., Chowdhury, D. A Generalized Michaelis–Menten Equation in Protein Synthesis: Effects of Mis-Charged Cognate tRNA and Mis-Reading of Codon. Bull Math Biol 79, 1005–1027 (2017). https://doi.org/10.1007/s11538-017-0266-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11538-017-0266-5