Keywords

1 Introduction

One of the key posttranscriptional gene regulatory mechanisms in eukaryotes is mediated by small regulatory RNAs such as microRNAs (miRNAs). miRNAs are ~22 nucleotides (nt) long, small noncoding RNAs that induce translational repression and degradation of mRNAs that are complementary to seed sequences of the miRNA (reviewed in [1, 2]). A summary of miRNA biogenesis process is presented in Fig. 8.1. Briefly, miRNA biogenesis begins with the transcription of the miRNA gene by RNA polymerase II [3,4,5,6]. A cluster of miRNAs is transcribed together as a long polycistronic transcript known as primary miRNA (pri-miRNA), which folds back on itself to form multiple hairpin structures in a single transcript (Fig. 8.1). These hairpins undergo endonucleolytic cleavage by RNase III-type enzyme Drosha in a complex with DiGeorge syndrome critical region gene 8 (DGCR8) [7,8,9,10,11]. The complex, known as the microprocessor, recognizes the junction between the hairpin structure and the single-stranded RNA and cleaves the RNA ~11 bases away from the junction [9, 12, 13]. More recently, structural and biochemical investigations have identified the molar composition of the microprocessor (one molecule of Drosha and two molecules of DGCR8), Drosha-binding motif in the basal segment of the pri-miRNA, as well as DGCR8-binding motif in the hairpin region of the RNA [13,14,15].

Fig. 8.1
figure 1

A schematic depicting biogenesis of miRNA from transcription by RNA polymerase II to Ago loading in the cytosol

Microprocessor cleaves pri-miRNAs and releases ~65–70 nt long stem-loop structured RNAs known as precursor miRNAs (pre-miRNAs). Pre-miRNAs are then exported to the cytoplasm by Exportins including Exportin-5 where they are recognized by another RNase III-type enzyme Dicer [16]. Dicer recognizes both the phosphate group at the 5′ end and the 2 nt overhang structure of the pre-miRNA and cleaves the RNA ~22 nt from the ends [17,18,19]. The resulting miRNA duplex is then loaded onto Argonaute (Ago) family of proteins which discards one of the strands (known as the passenger strand) and retains the other strand (known as the guide strand). Ago–miRNA complex constitutes the core of the RNA-induced silencing complex (RISC) and uses the miRNA seed sequences as the guide to search for target mRNAs to induce posttranscriptional gene silencing [20, 21].

Numerous studies analyzed the Ago–miRNA and RISC–mRNA interactions using biochemical and biophysical single-molecule approaches. Their experimental findings were further complemented by the structural knowledge of Ago and RISC. Together, these studies have significantly advanced our understanding of the mechanism of gene regulation mediated by miRNAs. In this chapter, we present these studies.

2 Functional Domains of Ago

The overall structure of Ago family of proteins is a bilobate architecture that consists of four distinct domains: the N-terminal, PAZ, MID, and Piwi domains (Fig. 8.2) [22, 23]. Biological functions of these domains are summarized in Table 8.1. The N-terminal region forms one lobe with the PAZ domain. The function of the N-terminal region is unclear, but it may assist in the release of the target mRNA by disrupting its base pairing with the miRNA [24]. The PAZ domain can be subdivided into two subdomains separated by threonine 667; one domain consists mostly of aromatic residues, while the other subdomain folds into a structure similar to oligonucleotide/oligosaccharide binding (or OB-fold) structure that is capable of binding to single-stranded nucleic acids [22, 25,26,27]. The possibility of the interaction between the PAZ domain and the single-stranded nucleic acids is confirmed via crystallographic studies and biochemical experiments where the PAZ domain binds to single-stranded RNAs, although with low affinity [28,29,30].

Fig. 8.2
figure 2

The figure is adapted from [22] with American Association for the Advancement of Science, Copyright 2012

A schematic of different functional domains (top) and the ternary structure of human Ago2 (bottom).

Table 8.1 Summary of Ago domains and their functions

The PAZ domain can interact and anchor the 3′ end of the miRNA [31]. The anchoring incurs steric hindrance and prevents the interaction between the last few nucleotides of the miRNA with its target mRNA. This reduces the degree of interaction between the miRNA and the target mRNA, facilitating the target release and allowing RISC to act as a multi-turnover complex [31, 32]. Furthermore, anchoring of the 3′ end of the miRNA is important for the loading of miRNA duplex onto Ago. Dicer cleavage product (miRNA duplex) contains two nucleotide 3′ overhangs which is a common characteristic of RNase type-III enzyme products [33]. This recognition of the 3′ overhang allows Ago to distinguish miRNA duplex from other small RNAs such as degradation by-products or small duplex RNAs that are derived from non-related pathways [34,35,36].

The human genome encodes four paralogs of Ago proteins (hAgo1–4). While all four proteins share the characteristic domains of the Ago family, only hAgo2 shows target cleavage activity, which is mediated by the Piwi domain [21, 37]. This domain has an RNaseH-like fold and is responsible for the endonucleolytic activity of the protein. RNaseH is an endonuclease that recognizes DNA–RNA hybrid and cleaves RNA using DNA as the template. The catalytic activity of RNaseH requires a conserved Asp-Asp-Glu/Asp motif in the catalytic center and two divalent metal ions [38]. The Piwi domain of cleavage competent Agos including hAgo2 has a very similar motif (Asp-Asp-Asp/Glu/His/Lys) [23]. Mutagenesis of this region resulted in the loss of the catalytic activity [23]. In addition, these Agos require divalent metal ions to induce RNA cleavage [21, 39, 40]. Moreover, the products of Agos and RNaseH both show 3′-OH and 5′-phosphate groups, suggesting that the two proteins induce RNA cleavage in a similar manner [21, 40, 41].

Unlike hAgo2, other three paralogs of human Agos (hAgo1, hAgo3, and hAgo4) do not show slicing activity. Examination of their Piwi domains reveals that the RNaseH-like motif in hAgo1 and hAgo4 does not match the consensus sequence and hence accounts for their inability to cleave target mRNAs. Human Ago3 shows Asp-Asp-His consensus sequence which matches the one from hAgo2, yet studies reported that hAgo3 does not show RNA cleavage activity [42]. Therefore, simple RNaseH fold structure may not account for the action mechanism of Agos.

One possible explanation is the difference in the target cleavage efficiency. In Drosophila, two Agos (Ago1 and Ago2) have the identical consensus motif, but Ago1 shows much higher cleavage efficiency than that of Ago2 due to faster target release kinetics [43]. Applying similar logic to the human Agos, hAgo3 may have much slower dissociation kinetics with the target compared to that of hAgo2, which can make hAgo3 effectively a single turnover enzyme and show much lower cleavage efficiency. Through a series of biochemical experiments using recombinant hAgo2 and hAgo3, Park et al. showed that hAgo3 loaded with miR-20a can cleave target mRNAs [44]. However, when incubated with other miRNAs such as let-7a, miR-19b, or miR-16, recombinant hAgo3 failed to induce target cleavage [44]. The authors attributed this phenomenon to the differences in the miRNA-target interaction channel between hAgo2 and hAgo3, indicating that hAgo3 has more strict substrate requirement in addition to simple sequence complementarity in order to induce target cleavage [44]. As Ago protein structure plays a key role in the miRNA-target interaction as well as during target dissociation from RISC (see below for details), the difference in action mechanism of hAgo2 and hAgo3 may arise from the differences in their interaction with the target rather than in the RNaseH-like motif in their Piwi domains. Together, these evidences call for more detailed investigation of miRNA-target interaction in combination with information on Ago protein structures to better understand the action mechanism of miRNAs.

For both cleavage competent and incompetent Agos, the Piwi domain plays an essential role in substrate recognition. While the 3′ end of the miRNA is recognized by the PAZ domain, the 5′-phosphate of the miRNA is anchored at the interface between the Piwi and the MID domains [42, 45]. Biochemical studies further showed that a divalent cation binds to this interface and interacts with the 5′-phosphate of the miRNA [28]. Furthermore, the preferential nucleotide is shown to be uridine although the effect of the nucleotide identity and the efficiency of Ago loading in cell need further investigation [46].

The main silencing effect by RISC is mediated not by target cleavage, but mostly through translational repression and subsequent RNA degradation via deadenylation and decapping [42]. The MID domain holds the key to explain the latter function of Agos as hAgo2 contains an MC motif which shows high homology to the cap-binding motif of the eukaryotic initiation factor 4E (eIF4E) [47]. Biochemical studies further showed that the MID domain can bind to the cap of the mRNA, and this interaction is required for efficient translational repression [48]. As hAgo1, hAgo3, and hAgo4 are not capable of inducing target cleavage, the cap-binding ability provided by the MID domain may be critical for these Agos to induce repressive effects. However, the MID domains of these cleavage incompetent Agos do not show the motif homologous to that of eIF4E and the exact mechanism of the cap-binding ability of these Agos remains to be investigated. Perhaps, this process may be mediated by Ago-interacting proteins that are components of RISC [49,50,51]. In addition, it remains unclear how the MID domain and Ago induce deadenylation and decapping of the target mRNA. Recently, it has been shown that targeting by miRNAs induces uridylation of the mRNA at the end of its poly(A) tail which facilitates RNA turnover [52]. Considering that mRNA turnover is responsible for most of the gene silencing effect by RISC, the role of MID domain and its cap-binding ability to induce deadenylation and decapping needs further investigation in the future.

3 Assembly of Ago–MiRNA Complex

During the posttranscriptional regulation by miRNAs, miRNAs use their seed sequences to guide the RISC to find the target mRNAs while Ago functions as the effector protein of the complex [53,54,55]. The interaction between miRNAs and Agos begins with the assembly of the RISC. Since the mechanism of RISC assembly has been key aspects in understanding miRNA-mediated gene silencing, it was under extensive investigation over the past few decades. RISC assembly begins with the loading of duplex Dicer cleavage product onto Ago protein. This process is most thoroughly studied using Drosophila Ago2, but in this chapter, we will focus on the mammalian system (Fig. 8.3).

Fig. 8.3
figure 3

The figure is modified from [66] with Elsevier, Copyright 2012

RISC assembly in mammals. Pre-miRNAs are first loaded onto miRNA RISC loading complex (miRLC). However, the transfer mechanism of Dicer cleavage products to Ago and the function of the direct interaction between pre-miRNAs and Ago remain to be investigated.

One of the key questions in RISC assembly is how Dicer releases its cleavage product and delivers it to Ago. Numerous studies have suggested that Dicer together with Ago and TAR RNA-binding protein (TRBP) forms miRNA RISC loading complex (miRLC) which plays a key role in loading of the miRNA duplex (Fig. 8.3). In this conventional model, Dicer product is oriented by Dicer–TRBP heterodimer and is handed over to Ago in the miRLC [56,57,58]. This is further supported by the EM data where Ago is bound to miRNA duplex in complex with Dicer and TRBP [59].

However, accumulating in vitro biochemical evidences suggests that Dicer and miRLC may not be required for miRNA loading [60,61,62,63]. Using Dicer knockout embryonic stem cells, it has been shown that the loading of small RNA duplexes to Ago can occur in the absence of Dicer [64]. In addition, Ago can directly bind to pre-miRNAs and form a complex called miRNA deposit complex (miPDC; Fig. 8.3). For specific miRNAs such as miR-451 whose pre-miRNA is too short for Dicer processing, miPDC formation is responsible for mature miRNA biogenesis [65]. In other cases, miPDC can incorporate Dicer and TRBP to form miRLC and deliver pre-miRNA to Dicer for RNA processing [66].

Interestingly, it has been shown that dissociation of Ago from miRLC to form RISC with mature miRNA requires catalytically active Dicer [57]. This supports the conventional model where Ago is present as a complex with Dicer, and Dicer physically hands over its cleavage product, the miRNA duplex, to Ago [57]. Furthermore, the loading of miRNA duplex may trigger conformational change on Ago such that it can now dissociate from miRLC to form mature RISC [60, 67]. However, since small duplex RNAs can be loaded on to Ago without Dicer, it still remains unclear how the miRNA RISC loading occurs. In the end, there exist two models: Dicer product is released to the bulk solution and then loaded onto Ago in the vicinity, and Dicer is physically handing over the cleaved product to the Ago protein [66]. In either of the two cases, the interaction between Ago and Dicer and the formation of miRLC are required in order to minimize the searching process of Ago to find duplex miRNAs [58, 68]. Further investigation is required to elucidate the in-depth mechanism of Ago loading during miRNA biogenesis in mammals.

4 Target Recognition by Minimal RISC

Once the RISC assembly has been completed, Ago uses the guide strand of the miRNA embedded within the complex to search for its target mRNAs to induce posttranscriptional regulation [69, 70]. During the process, Ago searches for mRNAs whose 3′ UTR sequences are complementary to the miRNAs’ seed sequences, 2–7 or 2–8 nt from the 5′ end of the miRNA [42]. The importance of the miRNA seed sequences has been demonstrated by numerous high-throughput sequencing studies. They showed that when expression of a given miRNA is perturbed, levels of mRNAs that contain sequences complementary to seed sequence of the miRNA are significantly affected [71,72,73,74]. Detailed examination of the interaction between miRNA and its target mRNA revealed that the two RNAs hybridize in a stepwise process that is accompanied by structural changes in Ago [75].

miRNA loading and target mRNA recognition by RISC can be subdivided into five steps. The first step is the loading of the miRNA onto Ago and its effect on the accessibility of the individual nucleotides. The 5′-monophosphate and the first base of the miRNA are anchored at the interface between the MID and the Piwi domains, and the 3′ end of the miRNA is recognized by the PAZ domain of Ago [25, 28, 29, 31, 45, 76]. This anchoring of miRNA guide strand makes the first nucleotide inaccessible and opens up the seed region to the media [28, 45]. Therefore, the seed region becomes the first nucleotides that can interact with mRNAs and play an essential role in target selection by the miRNA.

Although the seed region is crucial for target recognition, not all seven bases hybridize with their complementary bases on the target mRNA simultaneously. The latter part of the seed region is inaccessible to the media due to steric hindrance imposed by the Ago protein (Fig. 8.4) [75]. Initially, only the second–fifth positions of the guide RNA are exposed and are able to interact with the target mRNAs [75]. These sequences are known as the sub-seed sequences and are responsible for the weak recognition of the target by RISC. When RISC finds an mRNA with complementary match to the sub-seed sequences, the hybridization between miRNA and mRNA is stabilized by Ago protein which assists the two to form into an A-form helix [75, 77]. This pre-organization of miRNAs to base pair only the sub-seed region greatly accelerates the target finding speed by increasing the on-rate as much as 250-folds [78].

Fig. 8.4
figure 4

The figure is adapted from [77] with Elsevier, Copyright 2017

In RISC, the access of the latter half of the seed region (5th–7th nt) of the miRNA is blocked by the helix-7 motif of Ago.

The initial interaction between the sub-seed sequences and the mRNAs has been supported by a number of single-molecule studies. For example, Salomon et al. designed an experiment where they introduced a series of di-nucleotide changes on mRNAs and measured the dissociation rate of the RISC–mRNA complex [78]. They found that mismatches in the first two sequences of the seed region had the greatest effects on the target binding rates compared to mismatches in other regions [78]. Interestingly, mismatches in the last two sequences of the seed had significantly weaker effects on target binding rate [78]. Similarly, Chandradoss et al. compared the binding rate of mRNA with full seed complementarity and one with partial seed complementarity. They found that the first three nucleotides of the seed region are critical for the target recognition and the latter part of the seed region did not have significant effects on the binding rate between the RISC and the target mRNA [79]. Of note, while the structural study subclassified 2–5 nt as sub-seed sequences, these follow-up single-molecule studies showed that only the 2–4 nt may act as the “mini-seed” region [75, 78, 79]. These studies support the notion that RISC uses the first three or maybe four nucleotides of the seed region for the initial target search.

The hypothesis of the existence and the significance of the sub-seed match are further supported by structural analysis of Ago protein in complex with guide miRNA. Without interaction with the target, only the second–fifth positions rather than the entire seed region are exposed to the media [22, 45, 80, 81]. This is because Ago protein induces structural constraint and makes the guide kink away from the A-form helix, in particular at the position 7. With this conformation, mRNA will not be able to hybridize and form duplex RNA beyond the fifth position of the miRNA (Fig. 8.4) [77].

This structural constraint at the position 7 is relieved by the initial interaction between the mRNA and the sub-seed sequences of the guide miRNA. The hybridization triggers structural change on Ago such that it undergoes 4 Å displacement at the region around the position 7, allowing the seventh nucleotide to adapt A-helix configuration (Fig. 8.5) [77]. This allows the sixth–eighth positions of the guide RNA to base pair with the target mRNA [77].

Fig. 8.5
figure 5

The figure is adapted from [75] with American Association for the Advancement of Science, Copyright 2014

Base pairing in the sub-seed region induces conformational change such that the helix-7 is shifted by 4 Å which allows further base pairing with the target.

The conformational change of Ago and the subsequent extension of hybridization between the miRNA and the target mRNA are supported by a number of single-molecule studies. They showed that there is a sharp increase in the binding affinity when the number of seed matches is increased from six to seven [78, 79]. This result is consistent with the idea that the sub-seed interaction induces changes in Ago structure such that the sixth–eighth position of the guide RNA has become accessible to hybridize with the target mRNA [82]. Without such change, these positions of the seed region will not be able to hybridize with the mRNA and thus complementarity in these bases will not affect the binding affinity with the target.

The combination of structural, biochemical, and biophysical single-molecule experiments provides a powerful approach in understanding RISC–mRNA interaction. Together, these studies converge on the idea that the target recognition by RISC is a multi-step process (Fig. 8.6). First, RISC anchors the two ends of the miRNA to orient and stabilize the miRNA into proper A-helix conformation. Second, the sub-seed bases, in particular positions 2–4, provide the initial searching platform that mediates the interaction with the target mRNA. Lastly, the hybridization between the mRNA and the sub-seed bases of the miRNA induces conformational change on Ago to allow further seed match for extended hybridization between the two RNAs. This increased complementarity significantly lengthens the residence time of Ago bound on the target mRNA, which may be necessary for sufficient gene silencing [78, 79].

Fig. 8.6
figure 6

The figure is adapted from [77] with Elsevier, Copyright 2017

Model summarizing the conformational changes of Ago during miRNA-target interaction.

5 Implications of the Sub-seed Region: 1-D Target Search

One of the unresolved questions in RISC-target interaction is how the miRNA embedded in RISC can effectively find its target mRNAs in a complex media like inside the cell. The sequential target recognition process by RISC suggests one clue: By using the sub-seed sequences, Ago may find an mRNA with the partial seed match first and then slide along the RNA to search for better binding sites, i.e., sites with extended seed match sequences if such sites do exist. By doing so, the target search in the three-dimensional space has effectively become one-dimensional sliding problem [79]. In fact, this kind of one-dimensional search algorithm is employed by transcription factors in search for their optimal binding sites on the DNA. Previous studies on the mechanism of the recognition of the lac operon by LacI repressor in Escherichia coli (E. coli) showed that the protein first finds the DNA in the three-dimensional space and slides along the DNA to find the optimal binding site located within the lac operon [83,84,85]. Therefore, the three-dimensional diffusion has essentially become one-dimensional sliding which significantly facilitates the target search process.

Using the combination of three- and one-dimensional search mechanisms can be advantageous in multiple ways. First, once the RISC finds an mRNA with sub-seed match, it can undergo fast lateral diffusion along the RNA to search for the optimal binding site with extended seed match (Fig. 8.7) [79]. This lateral searching process may require hopping and sliding along the mRNA rather than trying to find the optimal site through searching in three-dimensional space of the cytosol [79]. The facilitated search mechanism may allow RISC to act as a multi-turnover type of regulator as it can quickly move from one target to another. In addition, the ability to search in three-dimensional space is necessary as RISC, unlike transcription factors, targets mRNAs. As the sub-seed region only contains three or four nucleotides, it is likely that mRNAs with the sub-seed match may not have the sequences that are complementary to the rest of the seed region. In this case, the RISC should detach from the mRNA and diffuse through the cytosol, searching for an unprobed new target mRNA. Of note, the partial seed match assists this step as it has lower binding affinities to the mRNA compared to that of the full seed match [78, 79]. The search for the new target strictly depends on the diffusion in the three-dimensional space. Once the RISC finds another mRNA with sub-seed match, it will again undergo one-dimensional sliding and hopping along the RNA to find the optimal binding site (Fig. 8.7). Therefore, the sub-seed match allows the RISC to scan through many different mRNAs to find the true target, and the optimal target search will depend on the proper distribution of three-dimensional search and one-dimensional scan mode of the RISC [79].

Fig. 8.7
figure 7

The figure is adapted from [79] with Elsevier, Copyright 2015

Model summarizing dynamic target search by RISC using sub-seed match and lateral diffusion along the target mRNA.

The structure of Ago and its conformational changes during RISC-target recognition play a role in optimizing the balance between three-dimensional search and one-dimensional scan processes. Previous investigation of the one-dimensional scanning of transcription factor argued that it is not possible for the transcription factor to have both fast searching and stable binding [86,87,88]. This is because fast searching requires weak interaction between the protein and the DNA and consequently transcription factor with fast searching speed is likely to miss many of its true binding sites. Similarly, stable protein–DNA interaction implies slow dissociation which results in slow lateral diffusion as the protein gets trapped at nontarget sites. This results in speed–stability paradox in DNA scanning where fast and specific target search is difficult to achieve simultaneously [87, 89]. However, this problem can be resolved if the protein can adopt multiple configurations each with different DNA binding affinities. During the initial search mode, the protein may show weak interaction with the DNA and only when it recognizes sequences similar to its binding site, then the protein may change its configuration and slowly scan the vicinity to find the optimal binding site [87, 89] (Fig. 8.8).

Fig. 8.8
figure 8

The figure is modified from [89] with Elsevier, Copyright 2016

Speed–stability paradox of DNA–protein interaction. In order to overcome the speed–stability paradox, transcription factors often use two state DNA interactions. The search mode is characterized by weak DNA–protein interaction which allows fast search with relatively smooth energy landscape. The recognition mode shows increased interaction with the DNA which results in decreased speed with increased specificity. The recognition mode also shows a large energy variation.

The structure of Ago provides an ideal example for the configuration changes required for optimal target search process. When miRNA is loaded onto Ago, the protein arranges the miRNA such that only the mini-seed bases can adapt A-form helical structure [22]. In other words, the kink at the position 7 imposed by Ago prevents the target interaction beyond the fifth position of the guide RNA, restricting the interaction and lowering binding affinity between miRNA and mRNA pairs [75]. Therefore, during the scan mode, Ago effectively limits miRNA–mRNA interactions to reduce the binding affinity such that it can quickly scan through the 3′ UTR to find the sequences that match the sub-seed region of the miRNA [79].

In addition, once the sub-seed match region is found, the hybridization between miRNA and mRNA induces the conformational change in Ago such that it now allows full seed interaction [79]. This change in Ago configuration allows base pairing beyond the sub-seed region and can significantly increase the RISC–mRNA-binding affinity. As a result, the scanning process has slowed down sufficiently to find the full or nearly full seed match sites on the mRNA [79]. Therefore, the steric hindrance imposed by Ago and conformational change of Ago by miRNA–mRNA interaction provide the necessary conditions for the optimal target search by RISC as suggested by Slutsky and Mirny: facilitated diffusion and one-dimensional target search along the mRNA and slowed diffusion near the target site in order to converge on the site with full seed match nucleotides [86,87,88,89]. Overall, structural understanding of Ago and the change induced by sub-seed base pairing strongly suggest that the target search mechanism by the RISC is a multi-step process with at least one scan mode and one recognition mode.

6 Toward Target Cleavage

Gene silencing by RISC is mediated through at least three mechanisms: (1) RISC components interact with the cap-binding proteins and suppress translation at the initiation step; (2) RISC induces RNA decay by triggering deadenylation and decapping of the target mRNA; (3) RISC directly cleaves mRNA at the binding site. The direct cleavage requires hAgo2, the only cleavage competent Ago in the human genome [21, 37]. While the miRNA-mediated gene silencing mostly occurs through the first two mechanisms, small interfering RNAs (siRNAs) can also induce target cleavage when loaded onto hAgo2. The difference between the two pathways (miRNA vs. siRNA) lies in the extent of the target complementarity. Target cleavage can occur when the 10th and 11th nucleotides of the guide siRNA/miRNA pair with the mRNA [90,91,92]. However, while siRNAs are designed to have extended base pairing with the target, most of miRNAs do not base pair at these positions, resulting in only the siRNA being able to induce target cleavage.

Consistent with this idea, bioinformatics studies have shown that most of the targets of miRNAs in human do not show complementarity beyond the seed sequences and thus are not cleaved by hAgo2 [71,72,73,74]. Moreover, numerous studies have shown that seed sequences of the miRNA are the key determinant of target selection and the rest of the sequences do not affect RISC-target interactions [32, 72, 82, 93,94,95,96]. Yet, structural and single-molecule studies have shown that seed pairing triggers an additional conformational change in Ago, which provides a key understanding of the mechanism and efficiency of gene silencing by RISC [75, 81, 97, 98].

First, the sub-seed match relieves the kink at the position 7 and allows extended seed match with the mRNA [77]. However, the pairing beyond the eighth position is still restricted and requires the widening of the channel between the PAZ and the N-terminal domains [75]. Recently, Jo et al. observed that many of the targets of the miRNA are not cleaved despite the perfect complementarities [99, 100]. One possibility is that Ago imposes structural hindrance such that the 10th and 11th positions of the miRNA cannot base pair with the corresponding complementary nucleotides on the mRNA. This result suggests that the identity of miRNA may be important in predicting its target cleavage capability. Therefore, the simple identity and complementarity are not enough to predict the target cleavage and further investigation on the conformational change on Ago due to miRNA–mRNA interaction is required [77].

Although the seed sequence match may not induce conformational change in Ago to allow target cleavage, it does rearrange the protein such that the 13th–16th nucleotides of the miRNA (also known as the supplementary region) are now configured into an A-helical form and may base pair with the target RNA (Fig. 8.9) [75]. The extended target complementarity in this region of the miRNA further enhances the binding affinity and increases the residence time of Ago on the target mRNA [75, 78, 79].

Fig. 8.9
figure 9

The figure is adapted from [75] with American Association for the Advancement of Science, Copyright 2014

Seed match with the target mRNA triggers conformational change of Ago such that the supplementary region of the guide miRNA arranges into A-form helical structure and may base pair with the target mRNA.

The sequences beyond the 16th position of the miRNA cannot interact with mRNA due to structural constraint imposed by Ago protein [77]. Similar to the first sequence of the miRNA, the 3′ end of the miRNA is anchored at the PAZ domain [22]. Ago does not release the 3′ end of the miRNA even after the pairing at the supplementary region and prevents the access of the sequences beyond the 16th position [31, 45, 101, 102]. This tight association between the PAZ domain and the 3′ end of the miRNA is necessary to ensure efficient target release (Fig. 8.10). It has been shown that RNA duplex longer than 12 base pairs has half-life of approximately one year, indicating that the two will form an extremely stable complex and are not likely to dissociate [103]. However, miRNA loaded onto Ago dissociates with the target mRNA quickly and allows the protein to act as a multi-turnover enzyme [100]. This reversible interaction between Ago and its target might be possible because the 3′ end of the miRNA is anchored at the PAZ domain, which lowers the binding affinity between the Ago and the target mRNA [31, 45, 67, 101, 102, 104].

Fig. 8.10
figure 10

The figure is adapted from [31] with American Chemical Society, Copyright 2013

During miRNA-target interaction, the 3′ end of the miRNA is anchored at the PAZ domain and is prohibited from base pairing with the target, which may play a role in efficient target release.

Lastly, evidences suggest that Ago may directly interact with mRNAs and contribute to the target recognition process [79]. The first sequence of the miRNA is anchored and cannot interact with the target [21, 90, 105, 106]. However, when the first sequence of the miRNA is uridine, it may interact with adenine nucleotide of the mRNA and anchor the mRNA onto the MID domain of Ago [107]. This provides an additional sequence pairing between miRNA and mRNA and can increase the efficiency of gene silencing effect [108]. Moreover, this interaction may account for the phenomenon where uridine is the preferred sequence at the 5′ end of the guide strand [21, 90, 105, 106]. Interestingly, using a single-molecule approach, Schirle et al. showed that the adenine anchoring in Ago does not influence the initial target recognition process, but does increase the residence time of Ago on the mRNA [108]. Therefore, RISC can still search for its target using the sub-seed sequences, and the base pairing at the first position only affects the gene silencing efficiency.

7 Concluding Remarks

In this chapter, we present structural, biochemical, and biophysical single-molecule studies on the Ago–miRNA-target interactions. As a key posttranscriptional regulatory molecule, miRNA has received increasing attention over the recent decades [109]. In particular, miRNAs are recognized as key potential biomarkers for diagnosis of human disease and prognosis during clinical treatments as well as indicators of cellular status [109]. The dynamic changes in miRNA expressions are associated with a variety of human diseases including heart disease, neurological diseases, immune function disorders, and age-related diseases. Furthermore, dramatic changes in miRNA profiles have been observed during disease progression, drug treatment, and differentiation of stem cells.

Despite its significance, our understanding of miRNA-mediated gene silencing is limited due to lack of knowledge of the in-depth mechanism of RISC assembly and RISC-target interactions. Numerous structural studies have significantly expanded our understanding of how miRNAs are loaded onto Ago. Loading of the miRNA to the MID and PAZ domains of Ago positions the RNA such that it can efficiently search for targets. In addition, miRNA–Ago complex constantly undergoes changes in its configuration as it interacts with the target mRNA to effectively find and associate with the true targets. Lastly, anchoring of the 3′ end of the miRNA to the PAZ domain ensures the efficient target release, which may be important for the effective gene silencing by RISC.

These structural studies inspired the development of new models of target search and regulation that are further confirmed by single-molecule experiments. Particularly, these studies investigated the binding dynamics of miRNA-target interactions. Like transcription factors, RISC also utilizes one-dimensional scanning in order to quickly search for the optimal binding site on an mRNA. This mode of quick scanning is possible as Ago protein induces steric hindrance and prevents the complementary pairing beyond the sub-seed region. More importantly, RISC also can readily dissociate from the mRNA that lacks full seed match sequences and undergo diffusion in the three-dimensional space in the cytosol to find a new potential target. Ago structure also plays a key role in this switch between the two target search modes. First, complementary pairing at the sub-seed region triggers structural change such that the helix-7 motif can no longer block the miRNA–mRNA interaction beyond the sub-seed region. Furthermore, seed pairing also can switch the protein to the recognition mode and allow base pairing even at the supplementary region. Therefore, Ago subdivides miRNA into multiple functional domains and changes its configuration to determine which regions to expose to the media in order to optimize its target search and release processes (Fig. 8.11).

Fig. 8.11
figure 11

The figure is adapted from [77] with Elsevier, Copyright 2017

Schematic of miRNA-target interaction depicting subdivision of the miRNA by Ago.

The one-dimensional scanning mechanism as well as the dynamic conformational change induced by target interaction is also observed in various other systems. As discussed above, LacI repressor scans through the DNA to find the optimal binding site on the lac operon. In addition, the interaction between RecA and DNA is restricted to 7–8 nt due to the steric hindrance imposed by RecA protein [110]. Qi and colleagues termed these sequences as “microhomology motif” which serves as an initial platform for target recognition by RecA [111]. Such restriction may allow RecA to quickly scan the DNA to find the optimal binding site. Lastly, CRISPR/Cas protein also utilizes multi-step target recognition mechanism. It first scans the DNA to find the PAM motif and subsequently scans the vicinity to find the optimal binding site. Furthermore, binding to the extended complementary target induces conformational change to bring the nuclease domain to the target DNA [112,113,114]. These series of target recognition steps may account for the remarkable efficiency and specificity of the CRISPR system.

Given the generality of the target search mechanism, we expect that one-dimensional scanning as well as the conformational change induced by the target interaction is an important regulatory strategy in RNA/DNA binding proteins. Quantitative and single-molecule approaches in combination with the structural crystallographic information will provide valuable insights into a comprehensive understanding of RNA–protein interactions.