A Stochastic Model of Nonenzymatic Nucleic Acid Replication: “Elongators” Sequester Replicators

Fernando, Chrisantha; Von Kiedrowski, Günter; Szathmáry, Eörs

doi:10.1007/s00239-006-0218-4

A Stochastic Model of Nonenzymatic Nucleic Acid Replication: “Elongators” Sequester Replicators

Published: 13 April 2007

Volume 64, pages 572–585, (2007)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Journal of Molecular Evolution Aims and scope Submit manuscript

A Stochastic Model of Nonenzymatic Nucleic Acid Replication: “Elongators” Sequester Replicators

Download PDF

Chrisantha Fernando^1,2,3,
Günter Von Kiedrowski⁴ &
Eörs Szathmáry²

462 Accesses
43 Citations
1 Altmetric
Explore all metrics

Abstract

The origin of nucleic acid template replication is a major unsolved problem in science. A novel stochastic model of nucleic acid chemistry was developed to allow rapid prototyping of chemical experiments designed to discover sufficient conditions for template replication. Experiments using the model brought to attention a robust property of nucleic acid template populations, the tendency for elongation to outcompete replication. Externally imposed denaturation-renaturation cycles did not reverse this tendency. For example, it has been proposed that fast tidal cycling could establish a TCR (tidal chain reaction) analogous to a PCR (polymerase chain reaction) acting on nucleic acid polymers, allowing their self-replication. However, elongating side-reactions that would have been prevented by the polymerase in the PCR still occurred in the simulation of the TCR. The same finding was found with temperature and monomer cycles. We propose that if cycling reactors are to allow template replication, oligonucleotide phenotypes that are capable of favorably altering the flux ratio between replication and elongation, for example, by facilitating sequence-specific cleavage within templates, are necessary; accordingly the minimal replicase ribozyme may have possessed restriction functionality.

Template Directed Replication Supports the Maintenance of the Metabolically Coupled Replicator System

Article 11 March 2015

Implementing Arbitrary CRNs Using Strand Displacing Polymerase

Models of Replicator Proliferation Involving Differential Replicator Subunit Stability

Article 10 September 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Although in vitro selection of ribozymes (Cech 2002) lends credence to the idea of an RNA world (Gilbert 1986), we still have no replicase ribozyme (Johnston et al. 2001). Nonenzymatic synthesis of templates up to 55 nucleotides has been achieved on mineral surfaces (Ferris et al. 1996), but there is no replication, because templates do not recycle by unzipping (Kovac et al. 2003). Short oligonucleotide analogues can self-replicate (Von Kiedrowski 1986; Sievers and Von Kiedrowski 1994), but longer ones cannot because self-inhibition by strand association becomes prohibitive. Because long template replication would facilitate unlimited heredity (Szathmáry and Maynard Smith 1997), determining the conditions that could allow long nucleic acid replication to arise and be maintained would explain the details behind an early major transition in evolution (Maynard Smith and Szathmáry 1995).

Previous models of the origin of nucleic acid replication have consisted of ordinary differential equation models that incorporate high-level chemical assumptions (Kanavarioti and Bemasconi 1990; Kanavarioti 1994; Fernando and Di Paolo 2004) such as direct chain growth (Wattis and Coveney 1999), subexponential or parabolic template growth rates in a closed system (Von Kiedrowski 1993), ribozyme effects, or postulate idealized replication mechanisms (Wills et al. 1998). Macroscopic physical models of template replication, although ingeniously relaxing these assumptions, do not embody the order of magnitude differences in rates between phosphoamidate-bond (p-bond) and Watson-Crick base-pair (h-bond) events observed in nucleic acids, and have not yet demonstrated long template replication (Breivik 2001; Griffith et al. 2005). In order to integrate these diverse approaches, a low-level stochastic model of the underlying chemical kinetics of nucleotide and polymer dynamics was designed to explore the sufficient functional conditions for nonenzymatic long template replication.

The details of the model and experiments to validate the model are described under Materials and Methods. The Results section presents experiments using the model to test the claim that cyclic reactor conditions, for example, those allowing denaturation-renaturation cycles, can facilitate nonenzymatic template replication. Lathe (2003, 2005) proposes that fast tidal cycling allows nucleic acids to associate and undergo ligation at high template and precursor concentration and high salinity when the tide goes out, and dissociate at low template and precursor concentration and low salinity when the tide comes in. Others have proposed that temperature oscillation in hydrothermal vent systems may serve a similar function (Kuhn 1972; Stein and Anderson 1984; Braun and Libchaber 2004). Alternatively, Gánti (1979, 2003) proposes that oscillation of monomer concentration in a metabolizing protocell can allow nonenzymatic template replication. All the experiments using the stochastic model show that this general class of solution is susceptible to an “elongation catastrophe” as shown in Fig. 1, in which Lathe’s proposed mechanism is compared to the behavior observed in the stochastic model. The principal aim of this paper is to give insight into the reaction mechanisms that take place in cycling reactors containing nucleic acid templates in the absence of enzymes.

Irrespective of the period of tidal cycling (Verga et al. 2006), the tidal chain reaction (TCR) differs from the polymerase chain reaction (PCR) in that there is no mechanism (e.g., a DNA/RNA polymerase complex) for preventing elongation side-reactions. Temperature cycling to obtain denaturation-renaturation cycles is functionally equivalent to tidal cycling with respect to elongation rate but is also more likely to cause strand breakage. Monomer concentration cycling does not facilitate template replication because at high monomer concentrations, there is elongation of strands at staggered ends and production of incompletely replicated dimers and trimers, and at low monomer concentrations, oligomers are ligated together. In all cases, the continued nonenzymatic formation of new nucleic acid polymers requires high monomer and template concentrations (in the millimolar range). These conclusions support the notion that protocells may have preceded template replication (Szathmáry et al. 2005). To conclude, we propose a possible solution to avoid the elongation catastrophe.

Materials and Methods

An informal description of the algorithm is provided, followed by a formal specification. Figure 2 shows the overall organization of the simulation. The simulation used a two-timescale technique to deal with the order of magnitude difference in characteristic times between (Watson-Crick base-pair) h-bond events and (phosphoramidate) p-bond events. An “event” means the formation or breakage of a bond or a bond type. Constrained by high computational costs, a very small volume^{Footnote 1} was modeled as a flow reactor containing initially 20 random sequence complementary dimer or 4-mer double strands, giving an effective oligomer double strand concentration of 2 mM. Monomer concentration was held constant at either 0.8 or 2 mM. Using a variant of the Gillespie algorithm, the hydrogen-bond dynamics of the system were simulated until a quasi-steady state was reached, e.g., for a period of 0.0001–0.05 s.^{Footnote 2} Figure 4 describes the reaction rules used to calculate propensities. Once a steady state in the fast h-bond dynamics was reached, microstates from this steady state were sampled at fixed time intervals of 0.05 s. Each microstate had an associated propensity of p-bond formation, given by the sum of the number of potential p-bond forming configurations contained in that microstate multiplied by the rate of p-bond formation associated with each p-bond forming configuration. Also, each microstate had an associated p-bond breakage propensity, which was assumed to be simply proportional to the number of p-bonds in that microstate, therefore this value was equal across all microstates. A time was generated at which the next p-bond formation event was expected to occur, by entering the p-bond formation propensity into Eq. (1):

$$ t_{i} = {{\rm {ln}} (rand()) \over P_{i}} $$

(1)

Similarly, an expected time was generated for the next p-bond breakage event. The event occurring first was executed. Roulette wheel selection was used to randomly choose a microstate in which to execute the event, with microstate weighting determined by the p-bond event propensity in that microstate. Once a microstate had been chosen, roulette wheel selection was applied over all p-bond events of the appropriate type in that microstate. In general, a time to the next event was always obtained by applying Eq. (1) to the propensity of that event. The event occurring earliest was the one chosen for execution. The particular bond that was chosen to undergo an event of the chosen class was obtained by percolation through a chain of roulette wheel selections, biased by propensities at each level in the hierarchy.

Figure 3 illustrates the representation of nucleic acids used in the simulation. A polymer is defined as a contiguous molecule joined by h-bonds and p-bonds. Each polymer was represented on a separate unbounded grid. Each grid represented individual A, C, G, and T nucleotides on its vertices, hydrogen (h-bonds) on the vertical edges, and covalent bonds (p-bonds) on the horizontal edges. Thus, a large set of polymer secondary structures could be represented on the grid. Hairpins were excluded because p-bonds were confined to the horizontal edges. First, using this representation, a composite propensity of reaction was calculated for each polymer: the intrapolymer reaction propensity (IPRP). The IPRP was defined as the rate of reactions of type i, multiplied by the number of configurations that could undergo reaction type i, within that polymer at time t (excluding p-bond reactions). Figure 4 shows the reaction rules used to calculate the IPRP. Second, a propensity of collision was calculated for all possible reactions between polymers: polymer-polymer propensity (PPP). This was the propensity that the next event would be a collision event rather than an intrapolymer reaction. We assumed a well-stirred reactor, i.e., the probability of collision of all polymer pairs was equal. Third, the propensity that the next event would be a reaction between a monomer and a polymer was determined, so defining the monomer-polymer propensity (MPP). Having determined all composite event propensities, times were generated for each composite event type. The event type that occurred first was chosen for execution. Roulette wheel selection was used to percolate the event type decision to a particular location. If the earliest event was an intrapolymer event, then roulette wheel selection was used to choose which particular polymer would undergo the event. Roulette wheel selection was again applied to all possible events within that polymer, and an event was executed at time t+x, where x is the time generated using IPRP as the propensity. If the earliest event was a collision event, then the collision algorithm was executed, whereby two polymers where randomly chosen to undergo a collision, at two randomly chosen sites. A collision was deemed legitimate if it did not result in overlapping nucleotides or bonds; and if legitimate, an h-bond was formed between these two sites at time t + x, where x is given by applying PPP to Eq. (1), and the two polymers were joined to form one polymer. If the collision was not successful, time was updated by x, as above, but no change was made to the polymers. Similarly if the earliest event was a monomer-polymer association, roulette wheel selection was again used to choose a polymer onto which a monomer would attach, and again, roulette wheel selection was used to determine a site on that polymer onto which a monomer would attach. A monomer was attached by an h-bond at time t + x, where x is given by applying MPP to Eq. (1).

The algorithms described above are defined formally below. Algorithm 1 describes how fast h-bond dynamics were run to a quasi-steady state between each p-bond event, and algorithm 2 describes the process shown in Fig. 2. A full annotation of the algorithms is available in the supplementary material.

Algorithm 1 Running the fast dynamics to quasi steady state

Full size table

Algorithm 2 Two Time Scale Dynamics

Full size table

Figure 4 describes how the IRPR, PPP, and MPP values were calculated using realistic reaction kinetics and free energies obtained from empirical studies (Cantor and Schimmel 1980; Turner 2000; Reynaldo et al. 2000; Rohatgi et al. 1996; Schoneborn et al. 2001; SantaLucia 1998; Xia et al. 1998). The p-bond formation and breakage rates were modeled on phosphoramidate bonds, although due to the timescale separation between h-bond and p-bond dynamics, the same overall behavior is expected irrespective of the absolute values of p-bond formation and breakage rates (see supplementary material). The probabilities of bond breakage and formation are functions of the local neighborhood configuration of bonds, monomer concentration, and temperature. The rules shown in Fig. 4 were applied to each bond or potential bond site on each polymer to calculate the rate of that bond breaking or forming and then used to calculate the composite propensities, IPRP, PPP, and MPP.

Rule 1: h-bond breakage rate, $ d_{m} = A_{m} e^{(E_{n} - Kn)/ RT} $. GC single h-bond breakage activation energy E _a = 31.0 kJ. AT single h-bond breakage activation energy E _a = 29.5kJ. Mean E _a for noncomplementary h-bonds = 18.4 kJ. This resulted in approximately 10% mispaired nucleotides. Modifications applied due to stacking effects are described in Fig. 5. T is the temperature in Kelvin (either 275 or 300 K in these experiments). R = 8.314 × 10⁻³ kJ K⁻¹ mol⁻¹. The central h-bond in configuration B1 contains no h-bonds adjacent to it, and so it has the highest probability of breakage. The central h-bond in the BN configuration has four adjacent h-bonds, and so has the lowest probability of breakage. The ordering of stabilities of the central h-bond from least to most stable is B1 > B12 > B2 > B23 > BN1 > BN.^{Footnote 3} The Arrhenius constants for each neighborhood configuration are approximated by A_B1 = 3.6 × 10¹³, A_B12 = 1.8 × 10¹³, A_B2 = 7.3 × 10¹², A_B23 = 1.8 × 10¹², A_B3 = 3.6 × 10¹¹, A_BN1 = 3.6 × 10¹⁰, and A_BN = 1.8 × 10¹⁰. The entropy term K _n was required to obtain the proper melting temperature curve for long strands (see supplementary material). K was empirically set to 0.01, n is the number of nucleotides not attached by h-bonds to another nucleotide, e.g., those at the dangling ends.

Rule 2: Catalyzed h-bond formation rate = 10⁶ s⁻¹ adjacent to other h-bonds.

Rule 3: Zipper h-bond formation rate = 10⁶ s⁻¹ irrespective of the distance from the potential h-bond to the closest nucleation site (a nucleation site is a pattern of three adjacent h-bonds). Note that the simultaneous application of rules 2 and 3 results in h-bonds being twice as likely to form next to another h-bond than anywhere else along a duplex.

Rule 4: Template directed p-bond formation rate = $ p\_form\_rate = 0.6e^{{-25} / RT} $.

Rule 5: p-bond breakage rate = $ rate\_p\_break = 1.32 \times 10^{12}\,e^{{-110}/ RT} $, n.b p-bond breakage rate exceeds p-bond formation rate, above 350 K (see supplementary material)l. Note that rules 4 and 5 are only applied in the outer loop of algorithm 2, not in algorithm 1.

Rule 6: Monomer attachment. Complementary rate = 10⁶ s⁻¹. Noncomplementary rate = 10³ s⁻¹. Stacked monomer attachment (i.e., attachment of a monomer next to an h-bond) is 100 times these values.

Rule 7: Collision rate between any two polymers = 5 × 10⁹ s⁻¹ M⁻¹. Note that this is a bimolecular rate constant because it is multiplied by two concentrations. The total collision propensity is given by (5 × 10⁹) × (no. of polymers/N _A V), where N _A = Avogadro’s number, and V is the volume. We set N _A V = 10⁴ when the tide was out and N _A V = 10⁵ when the tide was in. When a collision occurred, the two random nucleotides between which a new h-bond (or transient “stacking” s-bond; see Fig. 6) would form were chosen, and the polymers were combined at that site. If the combination resulted in no overlap of nucleotides on the two-dimensional (2D) grid, then the collision was successful and polymers were combined; otherwise, the polymers remained separate. “Rule” X: Spontaneous p-bond formation by association of monomers was forbidden because the rate was too low to be significant at the timescales considered. However, spontaneous p-bond formation between oligomers by single-stranded and double-stranded (nontemplated) end ligation was allowed; see Fig. 6.

Figure 5 describes how some more elaborate calculations of interstrand and intrastrand stacking forces were implemented (SantaLucia 1998; Cruz et al. 1982) because this was known to influence the tendency for elongation and contributes to “phenotypic” diversity (Sinclair et al. 1984; Zielinski and Orgel 1987a, b, 1989), for example, allowing GCGC to replicate but tending to cause CGCG to elongate.

Finally, transient “stacking” bonds were introduced, capable of mediating blunt end ligation of templates; see Fig. 6. This was necessary to match recent results observed experimentally in real chemical systems of pure dimers capable of elongation despite the fact that templated p-bond formation could not have occurred due to their extremely low melting temperatures.

To check that the above parameters produced the correct behavior, control experiments were conducted to calculate the melting temperature curve for A_nT_n and G_nC_n oligomers; see Fig. 7. To check h-bond kinetics, dissociation behavior from double-stranded to single-stranded states was measured as a function of strand length and temperature (Fig. 8). To check p-bond kinetics, an experiment was conducted in a reactor closed to mass and initialized with only GC dimers, at 275 K (Fig. 9). See the supplementary material for a control showing the different behaviors of GCGC and CGCG oligomers.

In order to check whether the model was capable of template replication in the presence of Q-beta-like sequence-independent RNA replicase ribozyme,^{Footnote 4} a model of this molecule was implemented as another “monomer” type present at a low copy number and capable of executing additional local bond formation and breakage rules. Figure 10 shows the intended mechanism of the particle, the local neighborhood rules that carry out this mechanism using the grid representation, and the replication and elongation behaviors observed from direct screenshots of the simulation.

Tidal cycling was modeled as regular cyclic dilution and low salinity “spikes” sufficient to completely denature effectively all double strands. High tide (spike) dilution was assumed to reduce the concentration of strands by 10 times, and the effect of low tide salinity was to reduce the effective h-bond breakage activation energy by a constant value of 3 kJ per h-bond, so making h-bonds more unstable. The tide was applied at various periods as a square threshold function on a sine wave, i.e., high tide occurred when sin(t/period) > T, typically T = 0.9.

Two temperature cycling experiments were conducted. The first used spikes of 330 K over a baseline of 280 K, with very short high-temperature spikes producing complete denaturation. Spike length was defined as that sufficient to break p-bonds. The interspike interval was defined as the time required to make 100 p-bonds; s-bonds were not allowed in this experiment. In the second experiment, temperature spikes were administered in replacement of high tides to a value of 300 K from a baseline of 275 K; s-bonds were allowed. Experiments with fixed monomer concentrations and with monomer cycling were carried out with monomer concentration varying from 0.6mM to 60mM, with high monomer concentration replacing high tide events.

In summary, the major obstacles encountered in producing the model were (i) the vast number of distinct configurations that nucleic acid polymers could adopt transiently, resulting in many possible types of interactions; (ii) the order-of-magnitude difference in characteristic timescales between Watson-Crick base pairs (h-bonds) and phosphoramidate bonds (p-bonds); and (iii) the high interdependency of reaction rates, due to hydrogen bond stacking interactions, and interpolymer reactions. These problems were solved, respectively, by (i) limiting possible secondary structures to those without hairpins, and simulating very small volumes containing a few templates; (ii) using a relaxation method to allow h-bond dynamics to run approximately to equilibrium before each p-bond event; and (iii) using a very efficient stochastic algorithm (Gibson and Bruck 2000; Elf and Ehrenburg 2004).

Results

A reactor was initialized at 280 K with 20 double-stranded complementary 10-mers, each consisting of a random nucleotide sequence. This means that there is an initial concentration of 2 mM double stranded 10-mers or 4 mM single-stranded 10-mers in the reactor. In addition, identical and constant concentrations of A, C, G, and T monomers were provided at a total concentration of 2 mM. Visual inspection revealed the rapid template-mediated production of dimers (by ligation between monomers stacked on 10-mer single-stranded templates or on the exposed single-stranded parts of splint junctions). Simultaneously, strand elongation occurred by (i) ligation at splint junctions between staggered oligomers and (ii) ligation between monomers and staggered ends. Trimers began to form once the dimer concentration approached that of monomers. This is trivially triplet replication, because 3-mer motifs embedded in longer oligomers are serving as templates for ligation between monomers and dimers. However, there was an inexorable process of elongation. As the templates became longer, dissociation became less likely, and embedded motifs were therefore no longer capable of replication. Although there were rare instances of 4-mer replication and 5-mer replication, due to between-oligomer ligation on templates, these newly formed copies typically become embedded into the elongating strands and so were unable to undergo further rounds of replication. Thus, elongation side-reactions destroyed the capacity for motifs to continue to act as templates. Novel motifs were created at a rate higher than the rate of replication of old motifs, thus information in the form of long sequences was not heritable. The distribution of template lengths over the course of the experiment is shown in Fig. 11 (top left).

Can tidal cycling help to remedy this problem? The claim by Lathe is that strand dissociation at high tide can free embedded motifs, allowing them to enter another round of replication; see Fig. 1. Tidal cycling was introduced with period 18,000 s, with dilution and reduced salinity lasting for 10% of each cycle, sufficient to allow complete dissociation of strands at high tide. The length distribution of templates under the influence of tidal cycling is shown in Fig. 11 (bottom). Under both high (left) and low (right) monomer concentrations, elongation was still observed with tidal cycling. Indeed the rate of elongation was approximately an order of magnitude faster with tidal cycling than without. This finding was not dependent on the period of cycling, salt concentration, or monomer concentration (see supplementary material), nor did it depend on the presence of s-bonds.^{Footnote 5} Direct visual inspection reveals that tidal cycling promotes elongation by splint junction ligation that occurs after reannealing of templates as the tide goes out. Experiments at higher temperatures (e.g., 320 K) revealed neither elongation nor replication; see Fig. 12. Temperature oscillation did not increase the capacity for replication, for reasons described in the legend to Fig. 13. Oscillation of monomer concentration did not increase the capacity for replication; see Fig. 14a. This is because at high monomer concentrations, oligomers were produced (Fig. 14b, bottom right) and later used to elongate templates at low monomer concentrations (Fig. 14b, top left).

Further checks for the presence of replication were carried out for the tidal cycling case. A brute force motif finding method was used to examine the distribution of all motifs in the reactor at the end of an experiment. How different is this distribution from a distribution that would be expected by random synthesis of sequences with the same length distribution and nucleotide composition as in the final reactor? A signature of replication is the clustering of sequences in sequence space. The number of sequences separated by a Hamming distance of <2 from another sequence would be expected to be greater in a reactor in which template replication was occurring, compared to a randomly generated set of sequences. One important caveat to this approach is that we cannot assume that elongation is purely random assembly. This is because stacking effects will favor the elongation of some sequences in preference to others and the nonrandom trajectory of self-assembly may also be sensitive to initial conditions. Template elongation can also be expected to generate nonrandom motif distributions. The measurements listed in Table 1 were made as follows. All possible motifs of a given length were enumerated. The frequency of each motif and its nearest neighbors (at Hamming distance 1; in the reactor at the end of each run) was counted and stored. The motif defining the largest of the clusters was then determined, and its frequency listed in Table 1 as a percentage of the total frequency of motifs in all clusters. This was then compared against a random model in which the final sequence state of the reactor was constructed as follows. Nucleotide pairs were chosen at random and exchanged between templates, a total of 20,000 exchanges being made. One hundred such random models were generated, and the figure on the right in each entry in Table 1 shows the mean and standard deviation of the largest motif cluster obtained from the random model. Entries in boldface show where the experimental model contains a largest motif cluster that is >3 standard deviations from the mean expected from a random model.

Table 1 Percentage occupancy of 1 Hamming distance cluster space by the largest cluster, compared to the mean and standard deviations derived from a random model generated by shuffling nucleotides in the test set

Full size table

In the nontidal case at high monomer concentrations, for all oligomers up to 5-mers, the most common motif cluster was always present at a frequency at least 3 standard deviations greater than the mean expected from the random models. This suggests that some nonrandom sequence elongation or replication is likely to have been occurring. However, no significant clustering was observed in the nontidal model at lower monomer concentrations. Tides at high monomer concentrations resulted in a loss of significant clustering, whereas tidal effects at low monomer concentrations increased the extent of clustering compared to the nontidal, low monomer concentration case.

Conclusion

In conclusion, replication of oligomers is unreliable compared with de novo sequence generation; see Fig. 15. This is due to a combinatorial explosion of elongation events that occurs with increasing template length. Denaturation-renaturation cycles, irrespective of their cause, promote elongation at splint junctions at both high and low monomer concentrations. At high monomer concentrations, tidal cycling decreases the extent of clustering in motif space, whereas at low monomer concentrations it has the opposite effect, i.e., to increase clustering in motif space. This may be due to either replication effects or nonrandom self-assembly effects, but other methods are required to explore the reaction mechanisms responsible for such clustering. For example, it may be possible in a software model to directly tag sequence segments that have acted as templates, to measure the number of replication cycles each segment takes part in before becoming sequestered. Such techniques are not possible in real chemistry. The same elongation problem arises with temperature oscillation and with monomer concentration oscillation. Experiments have been conducted with a wide range of parameter settings, and the finding that elongation outcompetes replication is extremely robust.

The stochastic model is limited in some important ways. It considers only simple secondary structures. It cannot be guaranteed that the h-bond dynamics have been run to a complete steady state between p-bond events, however, test runs with longer times allowed for h-bond dynamics to reach steady state revealed no differences in outcome. Rare p-bond forming configurations that have a high p-bond forming propensity will be underrepresented in the microstate sampling, but this bias is expected to vanish in the limit of high sample size.

The model demonstrates that tidal cycling cannot work without acknowledging the need for more complex RNA secondary structures, e.g., hairpins that may be capable of altering the rate of flux between the elongation and the replication pathways. This is because of the ubiquitous presence of elongating side-reactions. One obvious means by which flux could be channeled from the elongation to the replication pathway would be if certain sequences were able to cut themselves out of elongating templates, i.e., by acting as restriction ribozymes.^{Footnote 6} We hypothesize that a self-splicing replicase ribozyme in an elongation-favoring environment can be shorter and therefore easier to evolve than a replicase ribozyme with ligase activity (Jeffries et al. 1989). Whatever the precise mechanism of the minimal ribozyme, and whenever reaction conditions were such that nonenzymatic ligation could take place, there would have been strong selective pressure for a minimal replicase ribozyme to avoid entrapment in elongating sequences. We have demonstrated that even if these conditions involved tidal cycling or another kind of denaturation-renaturation cycle, the mechanisms of replication hitherto proposed (see Fig. 1) are foiled by the elongation problem. One proposed solution to the elongation problem is the early evolution of a minimal replicase ribozyme with restriction activity.

Notes

Avogadro’s number × Volume is defined as 10,000. Dividing the number of molecules by this value gives the concentration as moles per liter.
The rules are applied to the system using a variant of the SSA algorithm (Elf and Ehrenburg 2004), based on the next reaction method of the Gillespie algorithm (Gillespie 1977; Gibson and Bruck 2000).
These are codes for the classes of equivalent h-bond neighborhood states that contribute the same stacking stability to the central h-bond. They label the 16 configurations shown in Fig. 4 rule 1.
We model a replicase ribozyme that behaves similarly to the sequence-nonspecific RNA-dependent RNA polymerase protein enzyme from Q beta.
Elongation is observed even in the absence of spontaneous ligation if the system is initialized with 10-mers.
Sequence-independent degradation reactions are insufficient because they would result in stochastic loss of sequence information.

References

Braun D, Libchaber A (2004) Thermal force approach to molecular evolution. Phys Biol 1:1–8
Article CAS Google Scholar
Breivik J (2001) Self-organization of template-replicating polymers and the spontaneous rise of genetic information. Entropy 3:273–279
Article CAS Google Scholar
Cantor CR, Schimmel PR (1980) Statistical mechanics and kinetics of nucleic acid interactions. In: Biophysical chemistry. W. H Freeman, San Fransisco, pp 1183–1264
Cech TR (2002) Ribozyme, the first 20 years. Biochem Soc Trans 30:1162–1166
Article PubMed CAS Google Scholar
Cruz P, Bubienko E, Borer P (1982) A model for base overlap in RNA. Nature 298:198–200
Article PubMed CAS Google Scholar
Elf J, Ehrenberg M (2004) Spontaneous seperation of bi-stable biochemical systems into spatial domains of opposite phases. IEE Syst Biol 1(2):230–236
CAS Google Scholar
Ferris JP, Hill AR Jr, Liu R, Orgel LE (1996) Synthesis of long prebiotic oligomers on mineral surfaces. Nature 381:59–61
Article PubMed CAS Google Scholar
Fernando CT, Di Paolo E (2004) A model for the origin of long RNA templates. In: Proceedings of the Ninth International Conference of Artificial Life, Boston, MA, pp 1–9
Gánti T (1979) A theory of biochemical supersystems and its application to problems of natural and artifical biogenesis. Akadémiai Kiadó, Budapest/University Park Press, Baltimore
Google Scholar
Gánti T (2003) The principles of life. Oxford University Press, Oxford
Google Scholar
Gibson A, Bruck G (2000) Efficient exact stochastic simulation of chemical systems with many species and many channels. J Phys Chem A 104:1876–1889
Article CAS Google Scholar
Gilbert W (1986) The RNA world. Nature 319:618
Article Google Scholar
Gillespie D (1977) Exact stochastic simulation of coupled chemical reactions. J Phys Chem 8:2340–2381
Article Google Scholar
Griffith S, Goldwater D, Jacobson JM (2005) Robotics: self-replication from random parts. Nature 437:636
Article PubMed CAS Google Scholar
Jeffries AC, Symons RH (1989) A catalytic 13-mer ribozyme. Nucleic Acids Res 17:1371–1377
Article PubMed CAS Google Scholar
Johnston WK, Unrau PJ, Lawrence MS, Glasner ME, Bartel DP (2001) RNA-catalyzed RNA polymerization: accurate and general RNA-templated primer extension. Science 292:1319–1325
Article PubMed CAS Google Scholar
Kanavarioti A (1994) Template-directed chemistry and the origins of the RNA world. Origins Life Evol Biosph 24:479–495
Article CAS Google Scholar
Kanavarioti A, Bemasconi C (1990) Computer simulation in template-directed oligonucleotide synthesis. J Mol Evol 31:470–477
Article PubMed CAS Google Scholar
Kovac L, Nosek J, Tomaska L (2003) An overlooked riddle of life’s origins: energy-dependent nucleic acid unzipping. J Mol Evol 57:S182—S189
Article PubMed CAS Google Scholar
Kuhn H (1972) Selbstorganisation molekularer Systeme und die Evolution des genetischen Apparats. Angew Chem 84:838–862
Google Scholar
Lathe R (2003) Fast tidal cycling and the origin of life. Icarus 168:18–22
Article CAS Google Scholar
Lathe R (2005) Tidal chain reaction and the origin of replicating biopolymers. Int J Astrobiol 4(1):19–31
Article CAS Google Scholar
Maynard Smith J, Szathmáry E (1995) The major transitions in evolution. Oxford University Press, Oxford
Google Scholar
Reynaldo LP, Vologodskii V, Neri BP, Lyamichev VI (2000) The kinetics of oligonucleotide replacements. J Mol Biol 297:511–520
Article PubMed CAS Google Scholar
Rohatgi R, Bartel DP, Szostak JK (1996) Nonenzymatic, template-directed ligation of oligoribonucleotides is highly regioselective for the formation of 3′–5′ phosphodiester bonds. J Am Chem Soc 118:3340–3344
Article PubMed CAS Google Scholar
SantaLucia J (1998) A unified view of polymer, dumbell, and oligonucleotide DNA nearest-neighbor thermodynamics. Proc Natl Acad Sci USA 95:1460–1465
Article PubMed CAS Google Scholar
Schoneborn H, Bulle J, Von Kiedrowski G (2001) Kinetic monitoring of self-replicating systems through measurement of flourescence resonance energy transfer. Chembiochem 12:922–927
Article Google Scholar
Sievers D, Von Kiedrowski G (1994) Self-replication of complementary nucleotide-based oligomers. Nature 369:221–224
Article PubMed CAS Google Scholar
Sinclair A, Alkema D, Bell RA, Coddington JM, Hughes DW, Neilson T, Romaniuk PJ (1984) Relative stability of guanosine-cytidine diribonucleotide cores: a h-NMR assessment. Biochemistry 23:2656–2662
Article PubMed CAS Google Scholar
Stein DL, Anderson PW (1984) A model for the origin of biological catalysis. Proc Natl Acad Sci USA 81(6):1751–1753
Article PubMed CAS Google Scholar
Szathmáry E, Maynard Smith J (1997) From replicators to reproducers: the first major transitions leading to life. J Theor Biol 187:555–571
Article PubMed Google Scholar
Szathmáry E, Santos M, Fernando C (2005) Evolutionary potential and requirements for minimal protocells. Topics Curr Chem 259:167–211
Article CAS Google Scholar
Turner DH (2000) Conformational changes. In: Bloomfield VA, Crothers DM, Tinoco I Jr (eds) Nucleic acids: structures, properties and functions. University Science Press
Verga P, Rybicki C, Davis KR (2006) Comment on the paper “Fast Tidal Cycling and the Origin of Life” by Richard Lathe. Icarus 180(1):274–276
Article CAS Google Scholar
Von Kiedrowski G (1986) A self-replicating hexadeoxynucleotide. Angew Chem Int Ed Engl 25:932–934
Article Google Scholar
Von Kiedrowski G (1993) Minimal replicator theory i: parabolic versus exponential growth. Bioorg Chem Front 3:113–146
Google Scholar
Wattis J, Coveney P (1999) The origin of the RNA world: a kinetic model. J Phys Chem B 103:4231–4250
Article CAS Google Scholar
Wills P, Kauffman S, Stadler B, Stadler P (1998) Selection dynamics in autocatalytic systems: templates replicating through binary ligation. Bull Math Biol 1:1–26
Google Scholar
Xia T, SantaLucia J, Kierzek R, Schroeder R, Cox C, Turner D (1998) Thermodynamic parameters for an expanded nearest-neighbor model for formation of RNA duplexes with Watson-Crick base pairs. Biochemistry 37:14719–14735
Article PubMed CAS Google Scholar
Zielinski W, Orgel L (1987a) Autocatalytic synthesis of a tetranucleotide analogue. Nature 327:346–437
Article CAS Google Scholar
Zielinski W, Orgel L (1987b) Oligoaminonucleoside phosphoramidates. Oligomerization of dimers of 3′-amino-3-deoxynucleotides (GC and CG) in aqueous solution. Nucleic Acids Res 15:1699–1715
Article CAS Google Scholar
Zielinski W, Orgel L (1989) The template properties of triphosphraamidates having CG residues. J Mol Evol 29:281–283
Article PubMed CAS Google Scholar

Download references

Acknowledgments

This work was partly supported by the Hungarian National Research Fund (OTKA T047245), the National Office for Research and Technology (NAP 2005/ KCKHA005) of Hungary, and the ESIGNET European 6th Framework Grant for Cell Signalling Networks. Thanks go to Johan Elf, Mans Ehrenberg, and Simon McGregor for help with the writing of the code for the stochastic algorithm and design of the two-timescale method. Thanks are due to Richard Lathe for helpful comments during the preparation of the manuscript.

Author information

Authors and Affiliations

School of Computer Science, University of Birmingham, Edgbaston, B15 2TT, UK
Chrisantha Fernando
Collegium Budapest (Institute for Advanced Study), Szentháromság u. 2, H-1014, Budapest, Hungary
Chrisantha Fernando & Eörs Szathmáry
Center for Computational Neuroscience and Robotics, University of Sussex, Brighton, BN1 9RH, UK
Chrisantha Fernando
Bioorganische Chemie, Lehrstuhl für Organische Chemie I, Faculty of Chemistry, Ruhr Universität Bochum, Universitätsstr.150, 44780, Bochum, Germany
Günter Von Kiedrowski

Authors

Chrisantha Fernando
View author publications
You can also search for this author in PubMed Google Scholar
Günter Von Kiedrowski
View author publications
You can also search for this author in PubMed Google Scholar
Eörs Szathmáry
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chrisantha Fernando.

Additional information

Reviewing Editor: Dr. Niles Lehman

Electronic Supplementary Material

239_2006_218_Supp.pdf

Rights and permissions

Reprints and permissions

About this article

Cite this article

Fernando, C., Von Kiedrowski, G. & Szathmáry, E. A Stochastic Model of Nonenzymatic Nucleic Acid Replication: “Elongators” Sequester Replicators. J Mol Evol 64, 572–585 (2007). https://doi.org/10.1007/s00239-006-0218-4

Download citation

Received: 02 October 2006
Accepted: 22 January 2007
Published: 13 April 2007
Issue Date: May 2007
DOI: https://doi.org/10.1007/s00239-006-0218-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A Stochastic Model of Nonenzymatic Nucleic Acid Replication: “Elongators” Sequester Replicators

Abstract

Similar content being viewed by others

Template Directed Replication Supports the Maintenance of the Metabolically Coupled Replicator System

Implementing Arbitrary CRNs Using Strand Displacing Polymerase

Models of Replicator Proliferation Involving Differential Replicator Subunit Stability

Introduction

Materials and Methods

Results

Conclusion

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic Supplementary Material

239_2006_218_Supp.pdf

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A Stochastic Model of Nonenzymatic Nucleic Acid Replication: “Elongators” Sequester Replicators

Abstract

Similar content being viewed by others

Template Directed Replication Supports the Maintenance of the Metabolically Coupled Replicator System

Implementing Arbitrary CRNs Using Strand Displacing Polymerase

Models of Replicator Proliferation Involving Differential Replicator Subunit Stability

Introduction

Materials and Methods

Results

Conclusion

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic Supplementary Material

239_2006_218_Supp.pdf

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation