Keywords

1 Introduction

1.1 Synthetic Biology

A major goal of synthetic biology is to produce synthetic biochemical systems which have functions similar to the biochemical functions of living organisms. Considerable related work has been done in the fields of artificial and synthetic life, see [3, 10, 13] for an overview of these fields. Two predominant approaches by researchers in this area have been to design protein-based or RNA-based biochemical systems. However, both proteins and RNA are very difficult to predictively design for given functions, and behavior of the resulting protein-based or RNA-based biochemical systems can be very complex, making their engineering highly challenging. An alternative approach we propose here is to produce synthetic biochemical systems based on a very well understood, relatively non-reactive molecule, such as DNA (DNA strands and DNA nanostructures). DNA–DNA interactions like hybridization and strand displacementStrand displacement are reasonably well understood and a vast literature exists that studies, models, predicts and even controls such interactions (see [8, 29]). This literature enables us to program simple DNA systems by controlling experimental conditions like pH, salt concentrations, temperature and DNA concentration.

1.2 Overview of Our Work on Meta-DNA

In this work, we engineer synthetic biochemical systems, called meta-DNA, consisting only of DNA that capture the properties and structure of DNA in biological systems. Our work is reductive: we use simple DNA chemistry to emulate more complex enzyme Enzymebased DNA chemistry through ingenious use of DNA hybridization, strand displacementStrand displacement and hairpin systems. From a computer science perspective, our work can be thought of as using a lower level programming language to simulate programs encoded in a higher level programming language. This kind of simulation often leads to a better abstract understanding of the phenomena being simulated. From a synthetic biology perspective, we are building novel biochemical systems to emulate useful, well known natural biological systems and providing alternatives to protein enzymesEnzyme. From an engineering perspective, our work is a minimalist approach to designing biochemical systems from simple, predictable yet powerful modules. Our systems are largely isothermalIsothermal and autonomous which suggest that they may have applications for in vitro biochemical systems like transport devices, molecular motors, detection, signalling and computing systems.

1.3 Prior Work on Synthetic Replicators

Driven by the important role that replication plays in Biology, many self-replicating systems have been proposed, starting with Von Neumann. Early self-replicating systems were designed by von Kiedrowski [24], Tjivikua et al. [22]. For a review of various artificial replicators see [4, 23, 25]. A DNA-based artificial replicator was proposed by Zhang and Yurke [30]. Schulman and Winfree [17] study growth and evolution of simple crystals using DNA. Smith et al. [20] have independently developed abstractions for self-replication systems that can be thought of as tile-based and also rely on the idea of activation.

The major departure from this prior work on self-replicating systems and our meta-DNA approach, is that meta-DNA allows for a wide variety of key operations far beyond merely the operation of replication, enabling a much more extensive and complex set of synthetic biochemical systems.

1.4 Organization of this Chapter

In Sect. 7.2 we review the key properties of DNA, including its structure and enzymatic reactions. In Sect. 7.3 we list the desirable properties of meta-DNA and give a design for a meta-nucleotide, single and doubleDouble strand meta-DNA (dsmDNA) stranded meta-DNA, along with its secondary structure. Section 7.4 states the assumptions we make while designing protocols for meta-DNA manipulation. In Sect. 7.5 we discuss various meta-DNA reactions such as meta-hybridization, meta-denaturation, meta-strand displacement,Strand displacement meta-polymerization, meta-restriction, meta-helicase denaturation and exponential amplification using a meta-DNA replicator. Section 7.6 discusses open problems and future work.

2 Review of DNA Structure and Reactions

DNA is considered to be an ideal material to construct nanoscale structures and devices and has been used as scaffolding material for complex shapes, fuel for molecular motors and aptamers for various organic and inorganic molecules. The key properties of DNA that enable these functionalities are programmability, predictable chemical interaction and secondary structure and simple laboratory protocols for its manipulation. Synthetic DNA is also cheaply and readily available from a variety of commercial sources. At the most abstract level, fabricating structures and devices with DNA is akin to working with smart bricks that fit together in a specific predefined way and then putting them in a bag, shaking it and waiting for the bricks to self-assemble. However, the process is probabilistic and correcting errors is a fundamental challenge for the field.

Most readers will be familiar with the basic structure and reactions of DNA. The following discussion will be useful to even these readers as it frames DNA and its reactions in the context of our work and also specifies the granularity at which we consider DNA, with particular focus on the biochemical properties that we wish to simulate using our synthetic mDNA systems.

2.1 Key Properties of DNA

Here we review the key properties of natural DNA that we wish to mimic in mDNA. Single stranded DNA is a linear polymer made from repeating units called nucleotides. The nucleotide repeats contain both a segment of the backbone of the molecule, which holds the chain together, and a base. Each nucleotide has directionality and hence a DNA strand is asymmetric. The asymmetric ends of DNA strands are called the \(5^\prime \) (five prime) and \(3^\prime \) (three prime) ends. Single stranded DNA is flexible and has a small persistence length and is generally modeled as a freely-jointed chain. In living organisms, DNA does not usually exist as a single molecule, but instead as a pair of molecules entwined like vines, in the shape of a double helix. In a double helix the direction of the nucleotides in one strand is opposite to their direction in the other strand: the strands are antiparallel. Double stranded DNADouble strand meta-DNA (dsmDNA) has much greater persistence length than single stranded DNA and is generally modeled as a worm-like chain. The DNA double helix is stabilized by hydrogen bonds between the bases attached to the two strands. The four bases are classified into two types, purines (A and G) and pyrimidines (T and C). Each type of base on one strand overwhelmingly prefers a bond with just one type of base on the other strand. This is called complementary base pairing. Here, purines form hydrogen bonds to pyrimidines, with A bonding preferentially to T, and C bonding preferentially to G. This arrangement of two nucleotides binding together across the double helix is called a base pair. As hydrogen bonds are not covalent, they can be broken and rejoined relatively easily. The two strands of DNA in a double helix can therefore be pulled apart like a zipper, either by a mechanical force or kinetic energy due to high temperature. Given a pH value and salt (\(\text {Mg}^{++}\), \(\text {Na}^+\)) concentrations, the melting temperature is defined as the temperature at which half the DNA strands are in a double helical conformation while the rest are denatured. Melting temperature depends on the length of the DNA strand as well as its sequence, with longer strands having a higher melting temperature.

2.2 Key Reactions Involving DNA

We view DNA reactions as biochemical programs and attempt to emulate them. Hence, it is important to understand and abstract out these DNA reaction properties. With this goal in mind, we review a few key DNA reactions.

2.2.1 DNA–DNA Reactions

  • Hybridization is the attachment of a pair of single strands of DNA via hydrogen bonds along their complementary regions to form a double helix. Note that the two strands only attach in an anti-parallel manner and hence are reverse complements of each other.

  • The reverse process, separation of a double helix into its constituent single strands by the breaking of hydrogen bonds is DNA denaturation. This can be achieved via mechanical shear forces, high temperature or presence of denaturing agents like urea or formamide.

  • DNA strand displacement Strand displacement is the displacement of a single strand of DNA from a double helix by an incoming strand with a longer complementary region to the template strand. The incoming strand has a toehold, an empty single stranded region on the template strand complementary to a subsequence of the incoming strand, to which it binds initially. It eventually displaces the outgoing strand via a kinetic process modeled as a one dimensional random walk.

2.2.2 Enzymatic Reactions on DNA

  • DNA restriction is the cleaving of the backbone at a sequence specific recognition site by a restriction enzymeEnzyme.

  • DNA polymerases are a class of enzymesEnzyme that catalyze the polymerization of nucleotides into a DNA strand. The polymerase “reads” an intact DNA strand as a template and uses it to synthesize the new strand. The newly polymerized molecule is complementary to the template strand. DNA polymerases lack the ability to do de novo polymerization and can only extend a DNA strand already attached to the template strand, called a primer.

  • Isothermal Isothermaldenaturation can be achieved by helicases which are motor proteins that move directionally along a DNA backbone, denaturing the double helix.

  • DNA ligation is the rejoining of nicked double Double strand meta-DNA (dsmDNA) stranded DNA by repairing the backbone break. EnzymesEnzyme belonging to this class are known as ligases.

3 Desired Properties of Meta-DNA

We wish to abstract the structure and reactions of DNA described in Sect. 7.2.2 and emulate them using only DNA–DNA interactions. In doing so, we would have circumvented DNA-enzyme Enzymechemistry with a synthetic biochemical system that uses only DNA hybridization. First we state the desirable properties of Meta-DNA (mDNA)meta-DNA (mDNA).

3.1 List of Desirable Properties for mDNAMeta-DNA (mDNA)

  • We desire a set of \(2k\) meta-nucleotides (where \(k\) is an integer \({\ge }2\)). Each meta-nucleotide must be directional and have a \(3^\prime \) and \(5^\prime \) end. There must be two types of meta-nucleotides: meta-purines and meta-pyrimidines, each \(k\) in number. There must exist a pairing between meta-purines and meta-pyrimidines where each edge indicates an overwhelming complementary preference of meta-base linking between the meta-purine and meta-pyrimidine it connects.

  • The active \(5^\prime \) end of any meta-nucleotide should be able to bind to the active \(3^\prime \) end of any other meta-nucleotide and vice versa via a meta-backbone link.

  • The meta-backbone link must be a strong bond (implemented as a long sequence of hybridized DNA) while the meta-base link must be a weak bond (implemented as a short sequence of hybridized DNA).

  • ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)should be a linear directional polymer chain of meta-nucleotides bound by meta-backbone linkages and has \(3^\prime \) and \(5^\prime \) ends.

  • Two ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)that are reverse complementary to each other must have the ability to meta-hybridize to form a dsmDNA.Double strand meta-DNA (dsmDNA) Meta-DNA (mDNA) The strands of any dsmDNA Double strand meta-DNA (dsmDNA) Meta-DNA (mDNA) Double strand meta-DNA (dsmDNA)must be anti-parallel.

  • When an ensemble of identical dsmDNA Double strand meta-DNA (dsmDNA) Double strand meta-DNA (dsmDNA) Meta-DNA (mDNA)is heated to a temperature known as its melting temperature, half of the ensemble must denature into its constituent ssmDNA.Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA) The melting temperature must depend on the sequence of meta-nucleotides of the mDNA Meta-DNA (mDNA)and also on its length, with longer strands having a higher melting temperature.

  • To support strand displacement Strand displacementprotocols, the weak meta-base bonds must be continuously broken and remade in a kinetic process called breathing. The rate of breathing should be positively correlated with temperature.

  • ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)must be flexible and have a small persistence length when compared to dsmDNA Double strand meta-DNA (dsmDNA) Double strand meta-DNA (dsmDNA) Meta-DNA (mDNA)of comparable length. ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)must be like a freely-jointed chain while dsmDNA Double strand meta-DNA (dsmDNA) Meta-DNA (mDNA) must be like a worm-like chain.

  • Meta-nucleotides should not spontaneously form meta-backbone bonds in the absence of catalyst.

3.2 Abstract Description of mDNA

We model a meta-nucleotide as an Meta-DNA (mDNA)activatable tile [12] having three activatable pads: a \(5^\prime \) pad, a \(3^\prime \) pad and a base pad and represent it by a square tile as illustrated in Fig. 7.1. The tile has directionality as indicated by an arrow from \(5^\prime \) to \(3^\prime \) which is imposed by the sequence in which the pads are activated, with \(5^\prime \) always activated before \(3^\prime \). Tiles bind to each other via symmetric pad interactions called binding or linking. Each binding has a strength associated with it (\(1, 2\) or \(3\)) that depends only on the type of pads involved in the binding. The strength of a binding models the energy required to break the bond. Base pads can only interact with other base pads through strength \(1\) bindings, and are called meta-base bindings. \(5^\prime \) pads can only interact with \(3^\prime \) pads through strength \(3\) bindings and these are called meta-backbone bindings. The pads exist in one of four states: inactive, active unbound, bound and capped. Inactive pads do not bind with other pads. Any active unbound \(5^\prime \) pad of a meta-nucleotide can bind to any active unbound \(3^\prime \) pad of another meta-nucleotide via a bond of strength \(3\) after which these pads go to the bound state. There are \(2k\) different base pads, each corresponding to a meta-nucleotide type. These are split into two complementary perfectly matched sets. Let \(\mathfrak {P}=\{\beta _1, \beta _2, \ldots , \beta _{k}, \bar{\beta _1}, \bar{\beta _2}, \ldots , \bar{\beta _k} \}\) be the set of pads. For all \(i\), pads \(\beta _i\) and \(\bar{\beta _i}\) bind with each other. The face label on any tile indicates the base type of the meta-nucleotide. Any active unbound base pad of a meta-nucleotide can bind in an antiparallel manner to an active unbound complementary base pad of another meta-nucleotide by a strength \(1\) bond after which these pads go to the bound state.

Fig. 7.1
figure 1

Abstraction of the nucelotide

Fig. 7.2
figure 2

Single and double Double strand meta-DNA (dsmDNA) Meta-DNA (mDNA)stranded mDNA:Meta-DNA (mDNA) a single stranded mDNA, b double stranded mDNA, c double stranded mDNA has a longer persistence length

A linear chain of these tiles held together by \(5^\prime \)\(3^\prime \) pad bindings forms a directional polymer of meta-nucleotides and hence is ssmDNA.Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA) The \(3^\prime \) base pad of the tile at the \(3^\prime \) end of the ssmDNA Single strand meta-DNA (ssmDNA)and the \(5^\prime \) base pad of the tile at the \(5^\prime \) end are always in the capped state. A double stranded mDNA (dsmDNA)Double strand meta-DNA (dsmDNA) Double strand meta-DNA (dsmDNA) is a dimer of two ssmDNA Single strand meta-DNA (ssmDNA)held together by base pad bindings. ssmDNA Single strand meta-DNA (ssmDNA)is flexible with a low persistence length while dsmDNA Double strand meta-DNA (dsmDNA)is stiff and has a higher persistence length. These properties arise out of the geometric constraints imposed by the tile base structure of mDNA Meta-DNA (mDNA)as illustrated in Fig. 7.2.

Fig. 7.3
figure 3

Design of the meta-nucelotide

Fig. 7.4
figure 4

Internals of mDNA:Meta-DNA (mDNA) a internals of a single stranded mDNA,Meta-DNA (mDNA) b internals of a double stranded mDNADouble strand meta-DNA (dsmDNA)

3.3 Strand Design for mDNA

Figure 7.3 shows an Meta-DNA (mDNA)implementation of a meta-nucleotide tile as a DNA nanostructure. In Fig. 7.3a we have the tile with no protection and in Fig. 7.3b we have the tile with protection strands. The purpose of the protecting strands is to prevent spontaneous aggregation of tiles into mDNA. The protection mechanism is designed to impose the requirement that the \(5^\prime \) pad is activated before the \(3^\prime \) pad (this property is used in the polymerization and replicator protocols). Each letter denotes a DNA sequence and a bar atop a letter indicates reverse complement of the sequence that the letter denotes. The red strand (\(fgdcba\)) contains the \(5^\prime \) pad \(fg\) and also the base pad \(ba\) and the light green strand (\(\bar{g}\bar{f}\bar{e}\)) contains the \(3^\prime \) pad \(\bar{g}\bar{f}\). The blue strand (\(\bar{c}\bar{q}\bar{d}\)) and the dark green strand (\(qe\)) are bridging strands that hold the nanostructure together and give it the required geometry. We will have a detailed discussion of the secondary structure of the meta-nucleotide tile and mDNA Meta-DNA (mDNA)in Sect. 7.3.4 (Fig. 7.4). The sequence of reactions that occur when a tile is deprotected are as follows. The pink strand \(\bar{b}x\bar{f}\) protects (renders inactive) the \(5^\prime \) pad. When \(ba\) binds to its complement on another tile, \(\bar{b}x\bar{f}\) is ripped away from the \(5^\prime \) pad by the invasion of the strand \(f\bar{x}b\) thus activating the \(5^\prime \) pad. The strand \(f y \bar{g}\) protects (renders inactive) the \(3^\prime \) pad. When the \(5^\prime \) pad binds to its complement on another tile, \(f y \bar{g}\) is ripped away from the \(3^\prime \) pad by the invasion of the strand \(g \bar{y} \bar{f}\) thus activating the \(3^\prime \) pad. Figure 7.5 illustrates all possible states a meta-nucleotide can exist in. We model weak base bonds and strong backbone bonds by making the sequences corresponding to the \(5^\prime \) and \(3^\prime \) pads much longer than the sequences corresponding to the base pads. The internal structure of both single and double Double strand meta-DNA (dsmDNA) stranded mDNA are shown in Fig. 7.4a and b. Note the black protection strands on the tiles at the ends of the mDNA Meta-DNA (mDNA)to implement capping.

Fig. 7.5
figure 5

States of the nucleotide

3.4 Secondary Structure of mDNAMeta-DNA (mDNA)

Fig. 7.6
figure 6

a Design of rigid T-junction self-assembled from DNA. Figure from [7]. b(i) Secondary structure of dsmDNA Double strand meta-DNA (dsmDNA)forming a double helix (ratio of bases per axial rotation is programmable via strand design). Not to scale. b(ii) Axial view of dsmDNA.Double strand meta-DNA (dsmDNA) Meta-DNA (mDNA) Only the rungs of the twisted ladder structure are shown

Apart from simulating the reaction properties of DNA, we also wish to simulate its secondary structure, in particular:

  • The flexible nature of ssDNA characterized by shorter persistence length

  • The rigid nature of dsDNA characterized by a longer persistence length

  • The double helical Double strand meta-DNA (dsmDNA)structure of dsmDNADouble strand meta-DNA (dsmDNA).

Recently, Hamada and Murata [7] reported a novel self-assembled rigid T-shaped interconnected junction where each arm is a DNA double helix (Fig. 7.6a). They synthesized tile based structures like \(1D\) linear ladders, \(1D\) ringed structures and \(2D\) lattices using the T-junction geometry. This suggests that their T-junction motif is a useful widget for designing stable, rigid, well behaved self-assembled objects. We use the T-junction as the key motif in achieving the required structural properties for mDNA.Meta-DNA (mDNA) Each meta-nucleotide is a T-junction (Fig. 7.3a). ssmDNA Single strand meta-DNA (ssmDNA)(Meta-DNA (mDNA)Fig. 7.4a) is a linear polymer of these T-junctions with consecutive base pad sections (the double helical structure \(c \equiv \bar{c}\)) not in the same plane, but slightly rotated so that they stick out of the plane of the paper. This rotation is controlled by carefully choosing the number of bases that make up the horizontal double helical section between consecutive vertical helical sections (Fig. 7.4a). This secondary structure for ssmDNA Single strand meta-DNA (ssmDNA)induces a helical twist for ds Double strand meta-DNA (dsmDNA) (imagined in Fig. 7.6b). We can think of the secondary structure of dsmDNA Double strand meta-DNA (dsmDNA)as a twisted 1D ladder. The 1D ladder design in [7] can be easily modified by adding or deleting a single base pair from the side rungs to induce a twist to get a double helix structure with approximately \(10.5\) meta-bases per turn of the double helix, mimicking the twist of dsmDNA.Double strand meta-DNA (dsmDNA) In particular, we choose the length of the repeating DNA double helical unit \(efgdq \equiv \bar{q}\bar{d}\bar{g}\bar{f}\bar{e}\) to be either \(41\) (one less than the number of bases in four full turns of a DNA double helix) or \(43\) (one greater than the number of bases in four full turns of a DNA double helix). The following choice of lengths for the subsequences would potentially give us the required geometries and at the same time preserve the thermodynamic and kinetic properties that would allow our subsequent mDNA Meta-DNA (mDNA)protocols to succeed (\(|x|\) is the length of the DNA sequence represented by \(x\)): \(|a_i|=|b_i|=|\bar{a_i}|=|\bar{b_i}|=4, |c|=|\bar{c}|=|d|=|\bar{d}|=|q|=|\bar{q}|=6, |f|=|\bar{f}|=|g|=|\bar{g}|=12\) and \(|e|=|\bar{e}|=5 \mathrm or 7\).

4 Assumptions for mDNA Reactions

Before describing our protocols involving Meta-DNA (mDNA)mDNA we give here our assumptions under which these reactions proceed. We also describe some simple rules of DNA chemistry which are repeatedly used like subroutines in our mDNA Meta-DNA (mDNA)protocols.

  • Our systems are maintained at only three different temperatures characterized by a parameter \(\tau \). At room temperature or \(\tau =1\), both the meta-backbone bonds and the meta-base bonds are stable but breathing still occurs. Recall that breathing is the phenomenon of meta-base bonds spontaneously breaking and forming. At the melting temperature, \(\tau =2\), the meta-backbone bonds are stable but the base bonds are broken. At freezing temperature or \(\tau =0\), breathing does not occur.

  • The pH, salt concentrations and other factors that affect hybridization are set to levels such that spontaneous hybridization between a DNA sequence and its complement can occur.

  • Strand displacement,Strand displacement as defined earlier, always occurs and proceeds to completion. The strand that gets displaced out remains in the solution.

  • We assume that each subsequence, denoted by a letter in the figures, only interacts with its perfect and full reverse complements and no other spurious interactions occur.

5 Reactions in mDNA

We set out protocols for mDNA Meta-DNA (mDNA)that mimic DNA–DNA and DNA-enzyme Enzymeinteractions. We have two kinds of figures in the discussions that follow. The abstraction diagrams illustrate the protocols in the abstract activatable tile model while the internal structure diagrams illustrate the protocols in greater detail.

5.1 Meta-hybridization and Meta-denaturation in mDNAMeta-DNA (mDNA)

The simplest mDNA Meta-DNA (mDNA)reactions are meta-hybridization and meta-denaturation, which are reverse reactions of each other. In meta-hybridization, at temperature \(\tau =1\) two complementary ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)strands bind via complementary base pad bindings to give dsmDNA Double strand meta-DNA (dsmDNA)(Fig. 7.7a). Heating dsmDNA Double strand meta-DNA (dsmDNA)to temperature \(\tau =2\) meta-denatures the structure into its two component ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)(Fig. 7.7b).

Fig. 7.7
figure 7

Meta-hybridization and meta-denaturation by heating: a meta-hybridization reaction, b meta-denaturation at \(\tau =2\)

5.2 Meta-strand Displacement Strand displacementProcess in mDNA

Meta-strand displacement Strand displacementfor mDNA Meta-DNA (mDNA)is defined as displacement of a ssmDNA Single strand meta-DNA (ssmDNA)from a meta-double helix by an incoming ssmDNA Single strand meta-DNA (ssmDNA)with a longer complementary region to the template strand. This reaction occurs at temperature \(\tau =1\). Figure 7.8 gives a high level view of meta-strand displacement Strand displacementusing the activatable tile model. Two meta-strands compete to hybridize with a single ssmDNA.Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA) The shorter of the two is completely meta-hybridized to the template while the longer one comes in by gaining a toehold. Now, breathing of the meta-bases of the short strand gives an opportunity to the meta-bases of the competing incoming strand to meta-hybridize with the template. Note that the intermediate steps are reversible. However, once the incoming meta-strand completely displaces the outgoing meta-strand, the reaction stops as the outgoing meta-strand is extremely unlikely to come back in as it lacks a toehold.

Note that this reaction is made possible because the weak meta-base bonds can breathe at \(\tau =1\). Breathing in mDNA Meta-DNA (mDNA)is expected to occur at a slower rate than in DNA because we require multiple bases to spontaneously denature for a single meta-base to denature. Also, contiguous bases in DNA are more immediately local than contiguous meta-bases in mDNA Meta-DNA (mDNA)and hence the rate at which a meta-base occupies an empty spot on a complementary meta-base is also expected to be slower than for the corresponding process in DNA. Due to these reasons, we would expect meta-strand displacement Strand displacementin mDNA Meta-DNA (mDNA)to proceed slower than strand displacement Strand displacementin DNA.

Fig. 7.8
figure 8

Meta-strand displacement Strand displacementdue to breathing at \(\tau =1\)

5.3 Meta-polymerization in mDNAMeta-DNA (mDNA)

Meta-polymerization in mDNAMeta-DNA (mDNA) occurs by the extension of a ssmDNA,Single strand meta-DNA (ssmDNA) called a primer, by additions of free meta-nucleotides in the solution to the \(3^\prime \) end via meta-backbone bonds. The extension cannot occur de novo, it requires a primer meta-hybridized to a template ssmDNA.Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA) Neither can meta-nucleotides spontaneously aggregate, because of the protection strands (see Fig. 7.1). The meta-nucleotides added have base pads complementary to the corresponding base pads of the template strand.

Figure 7.9 gives an activatable tile model view of a single step in mDNA Meta-DNA (mDNA)meta-polymerization. Each base pad of the template strand that is not bound to its complement is in the active unbound state. The \(3^\prime \) pad of the tile at the \(3^\prime \) end of the primer is in the capped state (step 1). It transitions to the active unbound state when its black capping strand is displaced by an initiator strand. The incoming meta-nucleotide (step 2) has its \(5^\prime \) and \(3^\prime \) pads in the inactive state, while its base pad is in the active unbound state. The complementary base pads bind (step 3), activating the \(5^\prime \) pad of the incoming nucleotide (step 4). The \(5^\prime \) pad then binds to the \(3^\prime \) pad of the previous meta-nucleotide (step 5) causing the \(3^\prime \) pad of the incoming nucleotide to transition to the active unbound state (step 6). This process occurs repeatedly till either no further free meta-nucleotides are available or the end of the template strand is reached. There is an alternate mechanism to stop the polymerization, which involves adding to the solution a black capper strand to transition the \(3^\prime \) pad of growing \(3^\prime \) end to a capped state, thus stopping further additions. Note that in a solution with an ensemble of these nanostructures, in general the stopping point of meta-polymerization cannot be carefully controlled and such an attempt would probably lead to various length subsequences of the fully complementary ssmDNA.Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA) However, we can exclude certain meta-base types from the meta-nucleotide mix in the solution ensuring that the polymerization halts when the complementary meta-base is encountered on the template strand. Since we have access to a larger alphabet of bases in our mDNA Meta-DNA (mDNA)systems we can set aside a set of bases for such purposes, analogous to stop codons in translation.

Fig. 7.9
figure 9

Meta-polymerization reaction

Fig. 7.10
figure 10

Internals of meta-polymerization reaction-I

Fig. 7.11
figure 11

Internals of meta-polymerization reaction-II

We will now examine this protocol in greater detail by looking at the internal strand structure (Figs. 7.10 and 7.11). The \(3^\prime \) pad of the tile at the \(3^\prime \) end of the primer is capped by \(fgt_{3^\prime }\) (step 1). The initiator strand \(\bar{t_{3^\prime }} \bar{g} \bar{f}\) binds to the capper strand and activates the \(3^\prime \) pad \(\bar{g}\bar{f}\) (step 2). Now, the protected meta-nucleotide comes in (step 3) and binds to the complementary base sequence \(\bar{a_2}\bar{b_2}\), displacing the strand \(\bar{b_2}x\bar{f}\) (step 4). The toehold \(\bar{b_2}\) is exposed on \(\bar{b_2}x\bar{f}\) allowing \(f\bar{x}b_2\) to bind (step 5). This strand displaces \(\bar{b_2}x\bar{f}\) exposing \(f\) on the \(5^\prime \) end of the incoming meta-nucleotide (step 6). Now, the \(5^\prime \) end of the incoming meta-nucleotide binds to the \(3^\prime \) end of the previous nucleotide, displacing \(fy\bar{g}\) (step 7). The strand \(fy\bar{g}\) is stripped away by its complement through the toehold \(\bar{g}\), activating the \(3^\prime \) end (\(\bar{g}\bar{f}\)) of the incoming meta-nucleotide (step 8). This brings the \(3^\prime \) end of the growing ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)back to the same state as in step 2 and thus the reaction can repeat till the end of the template strand is reached (step 9). At this point, we introduce the \(3^\prime \) capper sequence \(fgt_{3^\prime }\) which binds to the \(3^\prime \) pad of the last meta-nucleotide added. This terminates the polymerization reaction. Note that in a solution with an ensemble of these nanostructures, we must wait for each copy of the reaction to proceed to step 9 before introducing the capper sequence or we must program a stopper sequence, analogous to stop codons in translation. DNA polymerases like \(\Phi 29\) possess exceptional strand displacing capability which aids in isothermal Isothermalamplification of DNA. However, our meta-polymerization protocol lacks this ability. Thus, if in the course of extending the primer an already bound ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)is encountered then the meta-polymerization process halts. In Sect. 7.5.6, we show how to achieve isothermal Isothermalamplification Meta-DNA (mDNA)of mDNA.

5.4 Meta-restriction Cuts in mDNAMeta-DNA (mDNA)

We can achieve site specific meta-restriction in both ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)and dsmDNA Double strand meta-DNA (dsmDNA) Meta-DNA (mDNA)by slightly modifying the internal structure of a nucleotide. This modification does not significantly affect the secondary structure of the mDNA and is compatible with all the other mDNA protocols described in this chapter. We prefix a sequence \(h_i\) to the \(5^\prime \) pad of the meta-nucleotide with base pad \(\beta _i\). This is the sequence that will be recognized by cleaving strands that break the meta-backbone bonds. We illustrate restriction for dsmDNA.Double strand meta-DNA (dsmDNA) The protocol for ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)is very similar and can be thought of as a special case of the meta-restriction of Double strand meta-DNA (dsmDNA) dsmDNA.

The abstract activatable tile model of meta-restriction is illustrated in Fig. 7.12. The \(5^\prime \) end of the meta-nucleotide \(\beta _1\) is recognized, cut and sent to the capped state. In Fig. 7.13 the site \(h_2\) associated with the meta-base \(b_2 a_2\) is recognized by the cleaver strand \(\bar{g}\bar{f}\bar{h_2}r_2\). There is a single recognition site \(h_2\) on each of the strands of the dsmDNA Double strand meta-DNA (dsmDNA)(Meta-DNA (mDNA)step 1). The cleaver strands bind to the toehold \(h_2\) on each strand (step 2) and break the meta-backbone bond by strand displacing the \(3^\prime \) pad \(\bar{g}\bar{f}\) (step 3). We now introduce the \(3^\prime \) capper sequence \(fgt_{3^\prime }\) to cap the exposed \(3^\prime \) pads (step 4). We prevent interaction between the cleaver strand \(\bar{g}\bar{f}\bar{h_2}r_2\) and the \(3^\prime \) capper sequence \(fgt_{3^\prime }\) by executing step 4 only after we are reasonably certain that step 3 is complete. This means that meta-restriction is not autonomous. The strand \(\bar{r_2}h_2 f g\) is introduced to strip away the cleaver strand \(\bar{g}\bar{f}\bar{h_2}r_2\) from the \(5^\prime \) pad by exploiting the toehold \(r_2\) (step 5). This exposes the \(5^\prime \) pads (step 6) which are then capped by introducing the \(5^\prime \) capper sequence \(\bar{g}\bar{f}t_{5^\prime }\) (step 7), completing the process of meta-restriction. Again, we prevent interaction between the strand \(\bar{r_2}h_2 f g\) and the \(5^\prime \) capper sequence \(\bar{g}\bar{f}t_{5^\prime }\) by executing step 7 only after we are reasonably certain that step 6 is complete. We do not require that these extraneous strands do not interact at all. Rather, it is sufficient that even after interacting among themselves there are a sufficient concentration of them to perform the tasks described in Figs. 7.13 and 7.14.

Fig. 7.12
figure 12

Abstract description of meta-restriction at site \(\beta _1\)

Fig. 7.13
figure 13

Internals of meta-restriction at \(h_2\) associated with meta-base \(b_2 a_2\)-I

Fig. 7.14
figure 14

Internals of meta-restriction at \(h_2\) associated with meta-base \(b_2 a_2\)-II

5.5 Meta-helicase Denaturation Reaction in mDNAMeta-DNA (mDNA)

We have previously described meta-denaturation for mDNA Meta-DNA (mDNA)using temperature. The same result can be achieved isothermally, which we call meta-helicase denaturation. Consider the abstract view of meta-helicase denaturation given in Fig. 7.15. Denaturation is the breaking of meta-base bonds of a dsmDNA.Double strand meta-DNA (dsmDNA) Meta-DNA (mDNA) When all the meta-base bonds are broken, the meta-strands float apart. Initially all the meta-base pads are in the bound state (step 1). Meta-helicase activity breaks the bonds and sends the base pads to the inactive state. This meta-helicase reaction doesn’t necessarily act contiguously. Some meta-base bonds are broken before others (steps 2 and 3). When all the meta-base pads are broken the meta-strands float apart (step 4). We can reactivate the meta-base pads by transitioning them to the active unbound state (step 5) at which point the ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA) Double strand meta-DNA (dsmDNA)can recombine to form a Meta-DNA (mDNA) dsmDNA.

Meta-helicase activity in mDNA Meta-DNA (mDNA)is performed by a host of strands (colored pink in Fig. 7.16), two for each type of meta-base bond that must be broken. In Fig. 7.16 there are two types of meta-base bonds and hence we have four strands to perform helicase activity (step 1). We introduce a slight modification in the internal strand structure by appending a sequence \(p\) to the base pad of each meta-nucleotide. This modification does not significantly affect the secondary structure of mDNA Meta-DNA (mDNA)and neither does it interfere with any other mDNA protocol described in this chapter. This sequence (\(p\)) will act as a toehold for the strands involved in the meta-helicase process. A pair of strands invade the meta-base pad bond and break them (step 2). Note that half of the meta-base bond is broken by one of these strands and half by the other, ensuring symmetry. Once each meta-base bond is broken the meta-strands drift apart (step 3) and meta-helicase activity is complete. We can strip off the protecting strand, for example \(\bar{m}\bar{p}b_1\), by adding in its complement, say \(\bar{b_1}pm\) (step 4). The ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)can now recombine into dsmDNA.Double strand meta-DNA (dsmDNA) Meta-DNA (mDNA) It is possible to repeat this process of meta-denaturation and meta-renaturation, however the protocol would not be autonomous.

Fig. 7.15
figure 15

Abstract description of meta-denaturation by helicase action

Fig. 7.16
figure 16

Internal strand structure of denaturation by helicase action

Fig. 7.17
figure 17

Replicator: new states

Fig. 7.18
figure 18

Replicator abstraction

5.6 Exponential Amplification in mDNA Using a ReplicatorMeta-DNA (mDNA)

Our protocol for meta-polymerization in mDNA Meta-DNA (mDNA)lacked meta-strand displacement Strand displacementcapabilities and hence could not be used for isothermal IsothermalPCR-like amplification. In this section we describe a method to get isothermal Isothermalexponential amplification using a replicator mechanism. The protocol is similar to meta-polymerization, it involves linear contiguous extension of a primer by addition of meta-nucleotides, with the newly polymerized ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)having the complementary sequence to that of the template. The key difference in the two protocols is a mechanism to isothermally dissociate the newly synthesized ssmDNA Single strand meta-DNA (ssmDNA)from the template. We achieve this using a new idea, not present in any previous protocol described in this work, of active DNA sequences sequestered in hairpins that are released by strand displacement Strand displacementreactions. The release of the “hidden” DNA sequence inside the hairpin structure can be thought of as an activation step, setting off another strand displacement Strand displacementreaction. This idea of sequestering sequences within hairpins has been demonstrated previously by Zhang et al. [28] and Dirks and Pierce [6].

For the purposes of this protocol, we define a new state of the base pad in the activatable model of mDNA,Meta-DNA (mDNA) which we call semi active unbound (shaded purple in Fig. 7.17c). If two base pads are in the semi active unbound state, they cannot bind to each other. However, a semi active unbound base pad can bind with an active unbound base pad, provided their sequences are complementary. There are also modifications (Figs. 7.17a, b) to how the other pad states are implemented, however the properties of the state do not change. Note the introduction of the hairpin structure \(g_2\bar{p}\bar{a}l\bar{g_2}\) at the \(3^\prime \) end of a meta-nucleotide (Fig. 7.17a) which will be used to cleave the growing strand from the template (Fig. 7.17c), sending the base pad to the semi active unbound state.

Figure 7.18 illustrates the replicator in the abstract activatable tile model of mDNA.Meta-DNA (mDNA) The top strand is the template and the bottom strand grows from a primer. For initiating and terminating replication, we require the use of a pair of special tiles with complementary base pads, labeled \(\beta _{3^\prime }\) and \(\bar{\beta _{3^\prime }}\) in Fig. 7.18. These are capped at one end and occur at the terminal ends of the template and hence in each replicated mDNA.Meta-DNA (mDNA) \(\bar{\beta _{3^\prime }}\) acts as the primer, initiating the replication. Consider the replication process after a few meta-nucleotides have been added (step 1). The growing strand is attached via a single base pad (at the \(3^\prime \) terminal meta-nucleotide) to the template. The base pads of the rest of the meta-nucleotides are in the semi active unbound state and hence cannot bind with each other. A new meta-nucleotide comes in (step 1) with its base pad in the active unbound state and binds to the template (step 2). This activates its \(5^\prime \) end (step 3) and allows the \(3^\prime \) end of the growing strand to bind (step 4). This activates the \(3^\prime \) end of the meta-nucleotide added and also cleaves the meta-base pad binding between the template and the previously attached meta-nucleotide of the growing strand, sending the cleaved meta-base pads to the semi active unbound state (step 5). The process (step 1–5) repeats till the last meta-nucleotide \(\beta _{3^\prime }\) attaches. At this point, we want the two strands to separate. We achieve this by making the meta-base pad bond between \(\beta _{3^\prime }\) and \(\bar{\beta _{3^\prime }}\) relatively weak, allowing the strands to separate spontaneously due to breathing of the base pad bond. This is implemented by choosing a very short length sequence for the base pads of \(\beta _{3^\prime }\) and \(\bar{\beta _{3^\prime }}\). Both the template and the newly synthesized ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)can now act as templates for further replication and hence we can achieve exponential amplification. Note that we can easily introduce mutations in the replicator mechanism by designing DNA sequences for certain base pads that do not have any exact complementary base pads but rather several partial complements. This allows us to probabilistically evolve a diverse sequence population ofMeta-DNA (mDNA) ssmDNASingle strand meta-DNA (ssmDNA).

Let us examine the replicator protocol in greater detail, paying attention to the strand diagram (Figs. 7.19 and 7.20). We have a four meta-base mDNA Meta-DNA (mDNA)sequence, with the first and last bases being the special terminator bases (step 1). Note that the sequence \(g=g_1g_2\) and \(\bar{g}=\bar{g_2}\bar{g_1}\). A meta-nucleotide with an active unbound base pad (step 1) comes in and binds to the template via the sequence \(b_2a_2\) (step 2), strand displacing out the sequences \(\bar{p}b_2\) (part of the light green strand) and \(\bar{b_2}\) (part of the pink strand). The strand \(f\bar{x}b_2\) now strips away \(\bar{b_2}x\bar{f}\), activating the \(5^\prime \) pad of the incoming meta-nucleotide (step 3). This allows the \(5^\prime \) pad \(fg_1g_2\) to bind with the \(3^\prime \) pad \(\bar{g_2}\bar{g_1}\bar{f}\), displacing \(\bar{g_2}\bar{g_1}\) and opening up the hairpin structure \(\bar{p}\bar{a_1}l\) (step 4). This allows the strand \(fy\bar{g_2}\bar{g_1}\) to be stripped away through the toehold \(\bar{g_2}\bar{g_1}\), activating the \(3^\prime \) end (step 5). The released hairpin from step 4 can now cleave \(a_1\) from \(\bar{a_1}\) using the toehold \(p\), which is half of the base pad binding, while the other half \(\bar{b_1}\) of the base pad binding is cleaved from \(b_1\) via the toehold \(p\) by the sequence \(\bar{p}b_1\) (step 6). Note that cleaving the bond between \(b_1\) and \(\bar{b_1}\) is actually a reversible process, and could have occurred after step 1 itself. Only when it is combined with the cleaving of the bond between \(a_1\) and \(\bar{a_1}\) does it get biased towards the configuration depicted in step 6. This completes the addition of a single meta-nucleotide accompanied by cleaving the previous meta-nucleotide. The process repeats till the last meta-nucleotide is added at which point the short base sequence of \({\bar{a}_{3^\prime }{\bar{b}_{3^\prime }}}\) allows the ssmDNA Single strand meta-DNA (ssmDNA) Meta-DNA (mDNA)to separate due to breathing. We note that both the separate ssmDNA Single strand meta-DNA (ssmDNA)are in a configuration that allows them to act as template strands for further replication, allowing exponential amplification ofMeta-DNA (mDNA) mDNA.

Fig. 7.19
figure 19

Internals of exponential amplification by replication-I

Fig. 7.20
figure 20

Internals of exponential amplification by replication-II

6 Lessons Learnt

In this work we have outlined a synthetic biochemical system made purely from DNA strands that simulates the behavior of various protein enzymesEnzyme acting on DNA. We believe mDNA Meta-DNA (mDNA)systems are a powerful programming paradigm for designing complex biochemical systems. In Sect. 7.5.6 we provided a protocol for exponential self-replication of a meta-DNA sequence and pointed out how to introduce mutations during replication and thus achieving sequence diversity. Incorporating environmental selection pressure in mDNA to evolve functional biosystems is a major open challenge. The next step in our investigations of mDNA systems is development of a strand level kinetic simulation of mDNA protocols. We hope to use data from such simulations to enhance and fine tune our protocols and also perform in vitro experiments to validate them. A major facilitating factor in the field of DNA nanotechnology has been technology to efficiently synthesize synthetic DNA strands of defined sequence de novo from individual nucleotides. Analogously, we require methods to efficiently (high throughput with low error rates) synthesize mDNA Meta-DNA (mDNA)strands of defined sequence from synthetic DNA. The current protection-deprotection DNA synthesis protocols can be adapted for the use of synthesizing mDNA from pre-formed meta-nucleotides. Note that meta-nucleotides can be synthesized using standard hierarchical assembly techniques from DNA self-assembly developed by Park et al. [14]. Other important protocols for mDNA Meta-DNA (mDNA)systems are meta-polymerization with meta-strand displacing capability and meta-ligation of mDNA strands.