Keywords

1 Introduction

The world of RNA and its species has widened the scope of scientific research after the two decades of the discovery of various subtypes of nonprotein-coding RNAs (ncRNAs) [1, 2]; long nonprotein-coding RNAs (lncRNAs) [3, 4], small interfering RNAs (siRNAs) [5, 6], piwiRNA [7], circular RNAs (circRNAs) [8], or microRNAs (miRNAs) [9, 10]. miRNA was discovered in 1993 in a nematode worm, Caenorhabditis elegans [11]. Since then, it has been reported in viruses, single-celled eukaryotes, plants, and animals. miRNAs are 22–25-nucleotide-long endogenous noncoding RNAs. It regulates several biological processes through controlling the expression of target genes [12, 13]. Recently, the exploration of miRNAs has become pivotal for the detailed understanding of the alteration of genes in many human diseases [13]. miRNAs have been studied in context with several diseases including infectious as well as noninfectious [14,15,16].

A lot of research has been done in recent years to infer the miRNA targets and their functions which resulted in a huge amount of mRNA and miRNA data; therefore, computational methods provide experimental validation with statistically significant outcomes.

2 Formation of miRNA

miRNAs are synthesized by RNA polymerase class II enzyme in the cell nucleus. Initially, these are formed as a long primary transcript called pri-miRNA. These are formed either from the intronic regions of coding genes or from their noncoding regions of the genome. After its formation, pri-miRNA folds into a hairpin structure. miRNAs are categorized by precise formation processes demarcated as the canonical and noncanonical biogenic pathway (Fig. 14.1). Mostly pri-miRNA follows canonical biogenic pathway distinguished by double processing through the enzymes, namely, Drosha and Dicer, of the ribonuclease III (RNase III) family.

Fig. 14.1
figure 1

Diagrammatic representation of noncanonical and canonical biogenic pathway of miRNA

Drosha forms a microprocessor complex with DGCR8 in the nucleus [17,18,19]. This complex cuts the pri-miRNA and forms pre-miRNA. Exportin-5 helps in the transportation of pre-miRNA to the cytoplasm. Further, Dicer carries out the second cleavage of the pre-miRNA in the cytoplasm and forms ~21-nucleotide-long miRNA duplex with 2 nucleotide 3′ overhang [19]. One of the mature strands from the duplex is laden onto an Argonaute (AGO) protein-associated RNA-induced silencing complex (RISC). This multiprotein complex leads to the formation of the effector complex in the cytoplasm. RISC uses miRNA as an antisense strand to recognize and regulate the specific targets. It is also reported that transposable elements are also involved in the miRNA formation [20]. Unlike animals, in plants, miRNA becomes fully mature in the nucleus [21, 22]. Recently, other processes of miRNA biogenesis have been categorized as noncanonical biogenic pathways consisting of Dicer-independent and Drosha-independent mechanism as explained in Fig. 14.1 [23,24,25,26,27,28,29,30,31].

3 Targets of miRNAs

miRNA regulates the expression of mRNA through binding with the 3′ untranslated regions (UTRs) of mRNA through any of the two binding patterns as mentioned below [32]:

  1. (a)

    In first class, two to seven nucleotides from the 5′ end of the miRNAs have complete Watson-Crick pairing to the 3′ UTR of the mRNA. These nucleotide sequences are known as the seed region. Depending on the number of nucleotides involved in the base pairing, the seed regions are known as 6mer, 7mer, and 8mer accordingly.

  2. (b)

    While in another class of mRNA expression regulation by miRNA, there is partial base pairing formed in the seed region, but to overcome this insufficient binding, there are some base pairings found in the 3′ side of the miRNA too.

A single mRNA may be regulated by multiple miRNAs and vice versa. Even one miRNA may influence the expression of other miRNAs, and due to this reason, regulatory mechanism becomes more complex.

4 Functions of miRNAs

miRNAs have a significant role in the modulation of gene expression in animals and in plants. In animals, miRNAs play important functions at various physiological and evolutionarily conserved developmental stages. miRNAs mostly show partial complementarity with the specific mRNAs in animals; however, it is adequate to regulate diverse physiological processes at the initiation step accompanied by the degeneration of mRNA [33]. In animals, miRNAs bind partially with the 3′ UTR regulatory elements of the transcript and do not affect the functions of 5′ UTR and target sites [34]. miRNAs affect the stability of the specific mRNAs at the transcriptional stage [35]. According to few studies, miRNAs can enhance translation in some specific cell types under certain conditions [36]. mRNAs seem to be inhibiting miRNAs from regulating the RNAs also. Like animals, plant miRNAs have shown key roles in the facilitation of organ maintenance at several developmental stages. miRNAs in plants have complementary regions to the specific mRNAs. In Arabidopsis thaliana, it has been shown that these regulates negatively also and may be due to their binding to the nonspecific targets in other noncoding RNAs [37].

5 miRNAs in Infectious Diseases

Different miRNAs are involved in a wide array of diseases, and the levels of miRNAs are either upregulated or downregulated based upon the disease. However, our focus centers around the miRNAs involved in the infectious diseases and their expression profiles. miR-93-5p belongs to the miR-106-25 cluster which plays a crucial role in cancer development. HCV-1b core protein enhances the expression of miR-93-5p in Huh7 cells. IFNAR1 is a direct target of miR-93-5p, and upregulation of miR-93-5p inhibits IFN signaling pathway by reducing the phosphorylation status of STAT1, a transcription factor involved in IFN signaling [38]. During HBV, infection level of miR-29 is upregulated in HCC cell line HepG2.2.15. SMARCE1 is a target of miR-29a; an inhibition of SMARCE1 increased the HBV replication and expression [39]. The hepatic miR-122 level is reduced in HCV-infected cultured hepatocytes, and expression of NF-κB-inducing kinase (NIK) is induced [40].

miR-146a plays an important role in immunomodulation of the host response during Vibrio cholerae infection. V. cholerae bacteria releases outer membrane vesicles which upregulate miR-146a. Upregulation of miR-146a allows the pathogen to colonize inside the host due to the reduced strength of an epithelial innate immune defense reaction [41]. miR-155 is another important miRNA that is induced by the V. cholerae to limit the host immune response, thereby reducing the probability of being eliminated. The miR-155 is upregulated in V. cholerae infection [42].

miR-155 acts as a negative regulator of endothelial and blood-brain barrier integrity during severe malaria. Upregulation of miR-155 is associated with increased endothelial activation and blood-brain barrier breakdown [43]. Severe malaria with multiorgan failure includes different miRNAs. Mouse with cerebral malaria shows upregulation of miR-27a, miR-150, and let7i levels in brain tissue compared to a mouse with no cerebral malaria [44]. miR-451 is a negative regulator of the host immune responses to Plasmodium infection. Downregulation of miR-451 induces pathogen clearance of Plasmodium vivax by CD4+ T cells [45]. Levels of plasma miR-451 and miR-16 were significantly downregulated in Plasmodium vivax infection, and these miRNAs can be exploited as biomarkers for malaria infection [46].

Autophagy happens to be one of the key events in the survival mechanism of Leishmania parasite under a stressed condition. The autophagy-related gene BECN1 is a key autophagy-promoting gene. Upon L. donovani infection, the miR-30 family member, miR-30A-3p, modulates the autophagy by targeting BECN1. miR-30A-3p targets BECN1 and decreases its level of expression resulting in decreased autophagic activity and finally parasite elimination [47]. miR-122 is expressed abundantly in the liver and modulates a wide range of liver functions. The L. donovani metalloprotease gp63 targets the DICER1 in human hepatocytes to reduce miR-122 expression [48]. Dendritic cells and macrophages infected with L. donovani show upregulation of let-7a and let-7b miRNAs, while these miRNAs are downregulated in L. major-infected cells. Suppressor of cytokine signaling 4 (SOCS4), a negative regulator of JAK-STAT signaling, plays the role in the regulation of miRNA expression [49]. L. donovani significantly downregulates the expression of miR-494 in THP-1-differentiated human macrophages. L. donovani metalloprotease gp63 degrades c-Jun and specifically upregulates the expression of Rab5a to downregulate miR-494 [50].

6 Fundamental Features Considered for miRNA Target Prediction

miRNAs target the complementary sequences generally present in the 3′ UTR of mRNA. miRNA reduces the translation of mRNA to prevent protein synthesis. There are several computational methods which provide support to predict the possibilities and specificity of a miRNA binding to an mRNA. miRNA target prediction tools are based on a few common features [51], which are the basic features that are often implemented to develop algorithms for the miRNA tools. These features are described as follows.

6.1 Seedmatch

The initial two to eight nucleotides from 5′ end to 3′ end are called as seed sequence. Generally, miRNA target prediction tools are based on the Watson-Crick base pairing between the seed region of miRNA and mRNA. An ideal seed match has no gap in alignment of this region. There are different types of seed sequences that are used in the target prediction tool of miRNA [52]. Commonly used seed matching in miRNA tools are mentioned below:

  1. 1.

    6-mer: It includes the complementary pairing between the second nucleotide of the 5′ side of the miRNA to the sixth nucleotide with the 3′ UTR of the miRNA.

  2. 2.

    7-mer-m8: This is an ideal seed match between the second nucleotide from the 5′ side of the miRNA to the eighth nucleotide with the 3′ UTR of the miRNA

  3. 3.

    7mer-A1: This also is a perfect seed match between the second nucleotide from the 5′ side of the miRNA to the seventh nucleotide with the 3′ UTR of the miRNA in addition to an A across the miRNA first nucleotide.

  4. 4.

    8-mer: This type of seed match is also a perfect seed match between the second nucleotide from the 5′ side of the miRNA to the eighth nucleotide with the 3′ UTR of the miRNA in addition to an A across the miRNA first nucleotide.

6.2 Conservation

Conservation is defined as the measure of the sequence conserved throughout the different species. The functionality of an identified miRNA target is predicted by conservation analysis [51]. This feature is considered for the prediction of miRNA, 3′ UTR, and 5′ UTR and the combination of these three. The seed region in miRNA is considered to be more conserved than any other regions. The 3′-compensatory sites are a small portion of miRNA which has conserved pairing and networks with the target mRNA to compensate the mismatched seed [53].

6.3 Free Energy

The stability of a particular interaction-complex is defined by Gibb’s free energy and is calculated as the variation in free energy change (ΔG). The value of ΔG suggests the stability of a reaction. A larger value of negative ΔG signifies greater stability of the reaction. miRNA binding with a target mRNA leads to the formation of either a stable structure or an unstable one. Formation of most stable interaction-complex is the root base of prediction of the likely target of miRNA.

6.4 Site Accessibility

Site accessibility is the degree of easiness through which a miRNA may work out its target mRNA and interacts with it. Interaction between miRNA and its target mRNA takes place in two steps. First, miRNA interacts with a short accessible part of mRNA, and then unfolding of mRNA takes place. After unfolding of mRNA, binding of miRNA occurs with it [54]. The amount of energy required for making a site in the mRNA accessible for binding of miRNA is required for the determination of site accessibility. Therefore, site accessibility is one of the most common features to find the appropriate target of a miRNA.

6.5 Other Features

There are some other features including target-site abundance, local AU content, GU wobble seed match, 3′-compensatory pairing, seed pairing stability, and position contribution that are often considered in miRNA target prediction algorithms.

7 Common Tools for miRNA Target Prediction

There are many prediction tools reported for the prediction of the miRNA targets. These all are based on the above-discussed features. Some of the tools are discussed in this section (Fig. 14.2) [55].

Fig. 14.2
figure 2

An overview of the prediction tools

7.1 miRanda

It is a target prediction tool of miRNAs in a genome. The algorithm of this tool had been developed in C language. It was developed by the Computational Biology Center, at Memorial Sloan Kettering Cancer Center, USA, and is freely available online. Its algorithm works in the following three steps:

  1. 1.

    A miRNA sequence is entered as an input and is searched for the WC matches against the 3′ UTR provided by the user.

  2. 2.

    The free energy is determined for every miRNA-mRNA target pair, only when the pair surpasses a threshold that matches the score.

  3. 3.

    It applies the conservation analysis and is based on either the score or by having many sites.

7.2 miRanda-mirSVR

It is an online tool that recognizes candidate target sites and then scores them. SVR stands for super vector regression that is much similar to SVM (support vector machine). It comprises the study of AU flanking, site accessibility, and target-site position in the 3′ UTR region. It also offers the miRNA expression analysis and links to the miRBase and miRO [56].

7.3 TargetScan

TargetScan prediction tool permits the operator to search the target by the name of the gene. The targets are predicted by calculating the predicted effectiveness of targeting, i.e., context + score, or by calculating the likelihood of conserved targeting, known as PCT. PCT is the probability of a target being targeted efficiently. No sequence is required to be input for the prediction. For conservation analysis, 3′ UTR conservation is evaluated, and then specific k-mer is analyzed [57].

7.4 DIANA-microT-CDS

It is the latest version of DlANA-microT, first target prediction tool in human [58]. It incorporates data in a way to identify the most related features retrieved from photoactivatable ribonucleoside-enhanced cross-linking and immunoprecipitation (PAR-CLIP) that allows this tool to get an idea about the position of miRNA interaction in the coding sequences and in 3′ UTR region. There are numerous sites in a target that are recognized using microarray expression data. It gives the predicted target location, conservation, score, and external links to miRBase, Ensembl, and PubMed [58].

7.5 MirTarget2

MirTarget2 uses SVM and incorporates a large microarray training dataset in the target search. Its algorithm finds out the seed conservation, base content in the flanking regions, seed match in positions of 2–8 nucleotides, the region of target site in 3′ UTR, and secondary structure [59, 60]. It has a disadvantage that it includes 3′ UTR sequences as a training dataset with only one seed pairing site rather than many sites. It is comprehensible, and miRNA-mRNA interactions can be investigated by either mRNA or miRNA. Since there may be multiple target sites of miRNA in the target mRNA, therefore, there is a target running option that helps to select some specific target sites [59, 60].

7.6 RNA22-GUI

RNA22-GUI is the latest version of RNA22 [61, 62]. It identifies the target islands using pattern discovery and calculation of free energy of these islands and then finally predicts the candidate miRNA [63].

7.7 TargetMiner

TargetMiner identifies the seed sites between the user-provided miRNA and mRNA using an SVM-based classifier. This tool incorporates negative and positive training data to predict more precise seed matches in miRNA and the mRNA [64].

7.8 SVMicrO

SVMicrO is also a machine learning tool that includes a large positive dataset of different species [65]. It identifies the miRNA-miRNA interactions involving features of the miRNA target prediction including seed match, site accessibility, conservation, target-site abundance, and free energy [65].

7.9 Probability of Interaction by Target Accessibility (PITA)

PITA predicts the miRNA target using the target accessibility as a major feature [66]. It is dependent on the key observation that highly accessible regions of the 3′ UTR of mRNAs have privileged and conserved positioning of target sites. This tool first recognizes a probable site through seed match measures. Then it considers site availability by calculating the free energy score depending on the difference between the gain of free energy linked with miRNA-mRNA hybrid target structure and the free energy requirement of dissociating the target to render it reachable. Further, to determine the interaction score for the microRNA and 3′ UTR, the target-site abundance is considered which is computed through accessibility scores of the same miRNA [66].

7.10 RNAhybrid

RNAhybrid has a user-defined seed region and calculates the free energy between miRNA and mRNA. It predicts the target sites by assigning a p-value to the miRNA-mRNA interaction sites [67].

Among all the above-discussed tools miRanda is the most widely accepted tool for miRNA target prediction and also according to the ease of use [51, 68, 69]. But other tools such as RNA22 and GUI offer a graphical representation of the interactions between the miRNA and the mRNA.