Keywords

CAGE:

Cap analysis of gene expression

EST:

Expressed sequence tag

lncRNA:

Long noncoding RNA

MPSS:

Massively parallel signature sequencing

SAGE:

Serial analysis of gene expression

seq:

Sequencing

sncRNA:

Small noncoding RNA

1 Introduction

Viral infections often lead to the modulation of various regulatory RNAs like miRNA, siRNA, tRNA-derived RNA fragments, Piwi RNA, etc. that are called noncoding RNA. These ncRNAs do not code for any protein. Modulation of small ncRNAs (sncRNAs) during viral infection is primarily associated with the regulation of endogenous host gene expression, and expression of some endogenous virus generated noncoding RNA in host cells which may potentially contribute to disease progression. vsRNA and sfRNA are virus-derived small noncoding RNAs which are found to be associated with the evasion of host antiviral response. Table 12.1 depicts various host- and virus-originated noncoding RNAs and their roles during viral infections.

Table 12.1 Types of noncoding RNAs with functions and examples in virus infections

Delayed diagnosis of infectious disease often aggravates the disease condition. Differential expression of sncRNA can prove to be dispensable biomarkers to identify the disease severity during various stages of infection. RNA-based therapeutics represents next-generation sequencing approaches for prospective antiviral and other disease-related therapeutic development. Many RNA therapeutics have already been approved by the FDA such as miravirsen to treat HCV infection and patisiran, givosiran, and MRX34 to treat various tumors. Thus unraveling the transcriptional landscape of various infectious disease is important to build an articulated nexus of regulatory roles of ncRNA during virus-host interaction. Recent advancements in the field of deep sequencing and high-throughput screening have enabled to establish a deeper understanding about RNAome and thus explore the capacities of these sncRNAs. Here we are summarizing various transcriptomic techniques used to study dengue virus and other viruses to study small noncoding RNAs.

2 Tag-Based Approaches

The tag-based approaches have been used for the direct determination of the cDNA sequences. These include expressed sequence tag (EST), serial analysis of gene expression (SAGE), cap analysis of gene expression (CAGE), and massively parallel signature sequencing (MPSS). The expressed sequence tag has proved to be a rapid and efficient means of characterizing the huge sets of gene sequences consisting of 300–1000 bp of DNA, and it is often accumulated in a database as a “single-pass read” that is sufficient in establishing the identity of particular expressed gene. Serial analysis of gene expression (SAGE) is another method used to obtain the qualitative and quantitative level of gene expression profiles in different conditions. The SAGE methodology is considered as an “open architecture” technique, and unlike another array method, it does not need prior information about the genes which needs to be analyzed, and it also reflects the absolute mRNA levels. Further, several SAGE-like methods have also been developed. They can be employed for the genome-wide analysis of DNA copy-number changes, transcription factor targets, and analyzing epigenetic signatures such as methylation patterns and chromatin structure. CAGE is another high-throughput sequencing-based technique that enables us to quantify and identify the expression of 5′ capped RNAs [21]. Likewise, MPSS is an open-ended platform for quantifying in-depth gene expression based on individual mRNA levels in the cell. In MPSS, the identification and characterization of the gene prior to conducting the experiment are not required. It has the routine sensitivity at the level of mRNA molecules, and the datasets are in the digital format that facilitates the management and analysis of data.

3 Chip-Based Approaches

Microarray plays a significant role in transcriptome profiling studies. Microarray is used for DNA mapping, sequencing, and transcript-level analysis [22]. It’s a crucial genomic technology and often used synonymously with DNA microarray and high-throughput gene expression analysis. Gene expression microarray is a nucleic-acid-hybridization-based technique, and a complex mechanism is required to understand the global and parallel analysis of different cellular processes. It’s been more than a decade that microarray has been in use as a gold standard for transcriptome studies in a wide range of settings [23]. Recently, despite the emergence of the next-generation sequencing [24], DNA microarrays still appear an extremely compelling approach due to its quick, precise, and inexpensive detection of the pathogens compared to culture or immunoassay techniques [25, 26]. It is a technique of choice for the detection of altered gene expression upon virus infection and is often used in clinical diagnostic to detect for the presence of existing viruses or new viruses [27, 28].

Tiling arrays are subtypes of microarray which are used to investigate thousands of gene expression in a coordinated fashion, transcriptome mapping, and to identify the transcription factor binding sites [29, 30]. It is a useful tool for the investigation of whole genome or chromosome expression as well as to uncover the various novel RNA expression patterns [31, 32]. Like traditional oligonucleotide microarrays, the probes are designed to the matched parts of the genomic region of interest. The probes get hybridized to the labeled DNA or RNA target molecules which are fixed onto a chip. The experimental technique to identify the site for protein DNA interaction involves the hybridization of immunoprecipitated DNA on a tiling microarray (ChIP-chip experiments) [33]. The genome-tiling microarrays have facilitated the analysis of global expression patterns with or without completely annotated genome in organisms such as prokaryotes, mouse, human, and yeast [34, 35]. In conventional gene-probing microarrays, probes targeting a particular gene give independent measures of the same RNA expression, while when tiling is the strategy applied to the entire genome, the analysis becomes restricted to the annotated genes, and it becomes difficult for the unannotated genes to be analyzed. Therefore, major challenges for tiling array studies are (1) determining transcriptional start and stop sites and (2) predicting whether transcripts are either from long continuous stretches such as genes or from short noncontinuous strands of RNA, i.e., typically ncRNAs [36, 37].

Although microarray-based transcriptional profiling was very much prevalent during the last decade and have made numerous remarkable contributions, with the emergence of new sequencing technologies, its use has become limited. Although there is an advanced microarray application in regard to RNA profiling, however, it has certain limitations in terms of reproducibility, sensitivity, and specificity [38].

4 RNA Sequencing Technologies (Platforms)

4.1 RNA Sequencing

RNA seq is a recently established high-throughput deep sequencing technique which has been used extensively for mapping, quantifying, and identification of novel genes. Small noncoding RNAs has been widely studied these days since these have been found to play a critical role in regulation of gene expression. After microarray, RNA sequencing is slowly procuring its space in the profiling studies and now has become the gold standard for a novel small RNA discovery as well as for small RNA profiling with high throughput, high sensitivity, and reproducibility. Deep RNA sequencing does not need any prerequisite information about the sequence which is a clear advantage over existing approaches like microarrays and qPCR.

4.1.1 Workflow

RNA seq workflow typically comprises of small RNA enrichment, RNA assessment and quantitation, small RNA library construction, deep sequencing, and bioinformatics analysis. Figure 12.1 is showing a typical RNA seq workflow.

Fig. 12.1
figure 1

Workflow of a typical RNA sequencing experiment and analysis

4.1.2 Small RNA Enrichment Methods

Small noncoding RNAs play a significant role in gene expression. In order to analyze them, additional purification steps are required as they are present in very low concentration and thus enrichment methods are employed for their isolation and purification. Two types of enrichment methods can be used, i.e., manual methods and kit-based approaches.

4.1.2.1 Manual Methods for Isolation of sncRNAs
4.1.2.1.1 miRICH Method

miRICH method is variant of TRIzol method used for total RNA isolation. In this method, total RNA is isolated by using TRIzol reagent, and in order to concentrate small RNAs, washing using ethanol is skipped, after the precipitation step and the pellet is overdried. Overdrying the pellet causes aggregation of larger RNA molecules due to high salt concentration; hence larger molecules are barely soluble, while small RNAs can be easily eluted [39].

4.1.2.1.2 Fractionation Approach

Fractionation method makes use of flashPAGE fractionator that works on the principle of gel electrophoresis. sncRNAs are isolated based on their size on a miniature PAGE. A precasted PAGE is inserted into the buffer chamber of the fractionator after addition of lower running buffer; this is followed by addition of upper running buffer. The small RNA fraction will be present in lower running buffer after electrophoresis which further enriched by using flashPAGE cleanup kit [40].

4.1.2.1.3 Multidimensional HPLC for Purification of Small Noncoding RNA

mRNA isolation can be eased by exploiting poly(A) tail, while isolation of sncRNA is difficult because of limited knowledge regarding biochemistry, posttranscriptional processing, ribonucleotide modification, and biological function, hence limiting the ability to obtain RNA in pure form. HPLC can be used to purify all classes of ncRNA from single samples. To achieve complete separation of two ranges, size exclusion chromatography with ion-pair reverse phase chromatography can be used [40].

4.1.2.1.4 p19 Protein-Mediated Isolation of siRNA and Detection of miRNA

Another approach for enrichment of sncRNA is to make use of the property of p19 protein to strongly bind to siRNA derived from plant virus, Carnation Italian ringspot virus (CIRV). The p19 fusion protein designed can be used for isolation of siRNA and detection of miRNA with the help of magnetic beads [41].

4.1.2.1.5 Trans-Kingdom Rapid Affordable Purification of RISC (TRAP) for Isolation of all Classes of Silencing Small Noncoding RNA

Isolation of small noncoding RNAs from total RNA using size separation methods is tedious and may give poor yields. An alternative to total RNA isolation method is the AGO purification technique in which the AGO-associated sncRNA complex is purified in order to isolate small RNA by anion exchange chromatography. The issue of contamination by other RNA species will be overcome by employing this technique, but it requires highly specific antibody against AGO proteins. Anion exchange chromatography makes use of Q Sepharose a positively charged anion exchange matrix to adsorb the RNPs complex, and then the complex is eluted by increasing mild salt concentration. Further analysis is based on the gel electrophoresis technique [42].

4.1.2.2 Kit-Based Methods

Silica-based columns are used for recovery of sncRNAs widely. Different commercial kits are available for isolation of small RNAs like mirVana, miRNeasy mini kit, mirPremier, MasterPure RNA purification kit, and miRCURY as described in Table 12.2. Selection of method depends upon the initial volume of sample, type of sncRNA to be analyzed, ease to use, and price per sample [42].

Table 12.2 Various kits available for small RNA enrichment

4.1.3 RNA Quality and Quantity Assessment

For assessment of RNA quality, two methods are used, i.e., denaturing/non-denaturing agarose gel and capillary electrophoresis, which determine intact RNA by identifying 28S and 18S rRNA bands [43]. Quantity of RNA can also be determined by various methods. It includes UV absorbance at wavelength 260 nm, fluorometry which employs fluorescently labeled probe specific to sncRNAs which have to be quantified, and splinted ligation assay where a small RNA-specific bridge oligonucleotide is used to form base pairs with the 5′-end-radiolabeled oligonucleotide and sncRNA [44]. For real-time quantification of small RNA, stem loop PCR is used in order to increase the length of template, and quantification is done using TaqMan probe [45].

Recent development in microfluidic technology has led to development of different instruments which can check RNA integrity as well as quantified RNAs. For example, microfluidic instruments like Agilent 2100 Bioanalyzer, 2200 TapeStation, and Experion can check RNA integrity by calculating 28S /18S ratio, and quantification is done by fluorescent labeling.

4.1.4 Small RNA Library Construction

Small RNA library preparation typically starts with the ligation of DNA adapters. It includes ligation of pre-adenylated adapter oligo at the 3′ end of the sncRNA using a truncated variant of T4 RNA ligase2. It is followed by ligation of adapter oligo at the 5′ end using T4 RNA ligase 1. The sequence is then reverse transcribed into cDNA following ligation. Although direct RNA sequencing is possible, but many instruments are based upon DNA sequencing due to which cDNA library preparation is done and is the crucial step of sequencing. After cDNA preparation, PCR amplification is carried out. For the longer sequences, fragmentation is done using DNase I treatment or sonication before adaptor ligation. But small RNAs can be sequenced directly after adapter ligation since these already exists as a shorter sequence. The adaptor ligation results in loss of strand specificity which is restored by dUTP incorporation in the second strand of cDNA which is then further degraded by uracil-DNA glycosylase (UDG) before the amplification step. Hence, only a single strand is amplified by PCR up to 8–12 cycles. Finally, single-end sequencing (from one end) or paired-end sequencing (from both ends) can be carried out in a high-throughput manner. The read sequences obtained are then aligned with the reference genome present in the database; else de novo assembly is carried out for the reads.

4.1.5 Bioinformatics Analysis

Once the raw sequencing data is generated, it is then preprocessed and normalized which involve trimming of adaptors, alignment, outlier removal, RNA class filtering, and generation of unique reads. Normalization involves comparison of expression levels across libraries. There are two commonly used small RNA database: (1) miRBase comprises information of all miRNA sequences and annotations; (2) Rfam is an open-access database which includes information about tRNA, rRNA, snoRNA, etc.

4.1.6 Challenges

Library Preparation Challenges

RNA seq contains various manipulation stages during cDNA library preparation, which can complicate the profiling procedure of the transcripts. Sometimes, sequencing data is incomparable with the qPCR, Northern blot, and microarray results which predominantly can happen due to the bias that occurs during different library preparation steps. Many identical short reads are amplified from cDNA libraries which give a reflection of abundant RNA species and thus culminate in false results.

Small RNA ligation bias is another attribute for library preparation which is particularly dependent upon T4 RNA ligase during adaptor ligation. It has been suggested that randomization of the adapter sequences near ligation junction might result in bias reduction. Illumina sequencing identifies antisense as well as overlapping transcripts which are unwanted. To avoid this, dUTP is incorporated during the generation of second cDNA synthesis prior to adaption ligation. The strand containing dUTP then finally gets digested leaving only a single strand for amplification.

Bioinformatics challenges include storage, retrieval, and processing of large amount of data which are very much essential for error reduction during result analysis. Background noise also occurs due to incomplete processing of RNAs.

4.2 Second-Generation Sequencing Platforms.

4.2.1 Pyrosequencing

This technique is based on the principle of sequence by synthesis approach which detects released pyrophosphate (PPi) during DNA synthesis. To detect the nucleic acid synthesis, four enzymatic reactions take place in a sequential manner. The primer is first hybridized with the biotin-labeled single-stranded DNA template. The hybridized primer template is then mixed with the enzymes: DNA polymerase, ATP sulfurylase, luciferase, and apyrase, and the substrates adenosine 5′-phosphosulfate (APS) and luciferin. Each dNTP is added separately to the reaction mixture. With the incorporation of a single nucleotide, one PPi is released which is then quantified in an equimolar concentration to the incorporated nucleotide. ATP sulfurylase forms ATP from PPi in the presence of APS. ATP and luciferase catalyze the conversion of luciferin to oxyluciferin which produces visible light which is accountable for the total amount of ATP generated. The produced light is detected by a photon detection device, with a maximum wavelength of 560 nm. ATP and unincorporated dNTPs are continuously degraded by apyrase [46].

Currently, pyrosequencing has been widely used in single nucleotide polymorphism (SNP) genotyping identification of bacteria, fungal, and viral typing. Moreover, this method can be used for the determination of difficult secondary structures and for the identification of mutations. Other possible applications are DNA methylation analysis and whole genome sequencing [47].

4.2.2 Ion Torrent Sequencing

Ion Torrent also requires sequence amplification, but it is the first technique that uses electrochemical detection, not camera scanning and fluorescence detection. Ion Torrent is done with an Ion Personal Genome Machine (PGM). Firstly, the DNA sample is prepared from fragmented RNA and then linked together. The library is then clonally amplified by emulsion PCR onto beads known as Ion Sphere particles. The beads are placed into proton-sensing wells on a semiconductor sequencing chip so that one bead can roughly fit into hundreds of millions of wells. The chip is then submerged into a nucleotide solution and results in the release of protons and a corresponding change in pH. This change in pH is documented by the PGM to determine whether the right nucleotide was used in the process or not, let alone if a nucleotide was added at all. As sequencing occurs, each of the four bases is introduced sequentially. A clear indication of the correct nucleotide being added is the presence of voltage. There will be no voltage found if the wrong nucleotide is added, and there will be double the voltage if two nucleotides are added. The more nucleotides present, the greater the increase in voltage and pH [48].

This technique can be used for targeted DNA and RNA sequencing, exome sequencing, viral typing, bacterial typing, aneuploidy and CNV analysis, and small RNA and miRNA sequencing [49].

4.2.3 Illumina Sequencing

It is often known as Solexa sequencing based on the name of the founder Solexa. It is based upon sequencing by synthesis approach and reversible dye terminators that are fluorescently labeled. These terminators are incorporated to the new DNA strand and stop DNA synthesis. These are then detected by the camera. Each type of dNTP is differently fluorophore-labeled, and therefore each base addition gives off different light which is detected by laser [50].

It is the next-generation sequencing that is used for the small RNA discovery, metagenomics, methylation profiling, transcriptome analysis, genome-wide profiling, and protein-RNA/DNA interaction analysis [51].

4.2.4 ABI SOLiD Sequencing

This principle is based upon the principle of sequencing by ligation. PCR-amplified target sequence is anchored by agarose beads onto a glass surface. Fluorescently labeled oligonucleotides are then added for sequential ligation mediated by enzyme DNA ligase. Once labeled oligonucleotides are annealed; fluorescent gets removed from the fragment due to the formation of phosphonothioate bond between the bases. The removal of fluorophore from 5′ site makes that site vacant for upcoming nucleotide ligation. This allows the formation of different fluorescent peaks corresponding to different nucleotides [52].

It is used for targeted sequencing, identification of small RNAs, epigenome analysis, and chromatin- immunoprecipitation etc. [53].

4.3 Third-Generation Sequencing Platforms

4.3.1 Oxford Nanopore Sequencing

Oxford sequencing is based upon the principle of small variations in electrical conductance that is generated when a DNA strand or a nucleotide passes through a pore of 1.5 nm size. The nanopore is immersed in a conducting fluid along with tiny wells layered with DNA polymerase which catches the hold of the exposed DNA and makes it pass through the nanopore. Each nucleotide passage through the pore creates varying degrees of obstruction that leads to the generation of varying electric current. This small variation in the electric current defines the characteristic of each nucleotide. This change of current is recorded to detect the sequence [54].

This technique confers precision and sensitivity and is cost-effective, and also, RNA can be directly sequenced with this technique. Very long 10–50-kb-sized fragments can also be sequenced. It can be used for metagenomics analysis, de novo assembly, sequencing of long transcripts, etc. [55].

4.3.2 PacBio Sequencing

PacBio is again based upon the principle of sequencing by synthesis approach which makes use of specialized SMRT chips for sequencing. All the four bases are fluorescently labeled with different fluorophores. Upon binding of the polymerase to the template on the SMRT chip, each incorporating base will be irradiated with different fluorescent colors. The peak and wavelength of the fluorescent emission are then recorded for the sequencing [11, 12].

Unlike first- and second-generation sequencing, this technique requires no PCR amplification and can sequence even longer reads. This technique is used for de novo sequencing of genomes and transcriptomes, detecting alternative splicing isoforms, epigenetic modifications, analysis of different mutations, etc. [56] (Table 12.3).

Table 12.3 Advantages and disadvantages of different transcriptomic approaches

5 Data Processing and Analysis

Next-generation sequencing (NGS) technologies have been emerging with the potential to explore small RNA (sRNA) transcriptomes and their associated roles. It offers a reliable and high-throughput approach for the identification and quantification of various sRNA classes. The steps required for RNA sequence analysis of sncRNAs has been summarized in Fig 12.2. Various tools are available online for the analysis of a particular small RNA from sequencing data, shown in Table 12.4. Various tools are also available for the integrated analysis of more than type of RNA analysis which are depicted in Table 12.5.

Fig. 12.2
figure 2

Various steps and tools used in RNA sequencing data analysis. (The figure is adopted from [57])

Table 12.4 Web-based tools to analyze RNA sequencing data
Table 12.5 Various integrated tools to analyze RNA sequencing data

6 Available Databases for Studying Small Noncoding RNAs

  1. 1.

    miRBase: It consists of the miRNA database with information about sequence, genomic location, and predicted targets of miRNA [78].

  2. 2.

    MirZ: It consists of the miRNA database with information about sequence-based miRNA profiles and predicted targets [79].

  3. 3.

    IsomiR database: It mainly has information about miRNAs and isomiRs with respect to the reads; isomiRs assigned to miRNAs belong to human 293 T cells, with miRNA annotation from miRBase [80].

  4. 4.

    siRNAdb: It is a database about siRNA (small interfering RNA) that contains information about experimentally verified and predicted siRNAs, sequence, and the literature links [81].

  5. 5.

    piRNABank: This database is useful for information on piRNA about their sequence, cluster, and homology searching [82].

  6. 6.

    snoRNA-LBME-db: It is a database in which information about snoRNAs and scaRNAs are available with respect to their sequence, expression information, as well as predicted targets including base pairing information [83].

  7. 7.

    Rfam: It consists of the data about snRNAs, snoRNAs, and miRNAs and about the sequence families of other structural RNAs [84].

  8. 8.

    NONCODE: This database contains data about miRNAs, piRNAs, snoRNAs, and scaRNAs. NONCODE is linked with GenBank [85].

  9. 9.

    RNAdb: There is information about miRNAs, piRNAs, snoRNAs, and other ncRNAs that has their sequence with links to literature and other databases[86].

  10. 10.

    deepBase: It is a miRNA, piRNA, endo-siRNA, nasRNA, pasRNA, easRNA, and rasRNA database with the data information about their sequence from different tissues and for computationally predicted snRNAs [87].

  11. 11.

    fRNAdb: It consists of data about various annotated and predicted ncRNAs of various lengths from different sources [88].

7 Applications of Various Transcriptomic Approaches in Context of Dengue and Other Viruses

Expressed sequence tags (ESTs) emerged as the first high-throughput technique to gene expression and annotation of genome. To investigate host gene expression response upon viral infection, EST approach was used as molecular tool. Barón et al. in 2010 compared the differential gene expression profile in the midgut of refractory and susceptible lines of Aedes aegypti mosquitoes by infecting with dengue-2 virus [89]. The annotation of EST-identified differentially expressed genes belongs to immune response and metabolism. Another EST-based study by Guo et al. in 2009 in response dengue-2 virus infection in Aedes albopictus was analyzed. They identified seven ESTs; among them five were overexpressed in susceptibility, and two were overexpressed in refractory lines [90].

Microarray analysis is regularly used for the analysis of altered gene expression upon virus infections. It is often used in clinical diagnosis to detect the presence of known viruses or to find new viruses [27, 28]. In addition, microarray has been used in the detection and identification of seven pathogenic viruses from the Flaviviridae family; these viruses are the yellow fever (YF), West Nile (WN), Japanese encephalitis (JE), and the dengue strain 1–4 viruses [91]. DNA microarray was used for the detection of dual infection with different DENV serotypes [92]. Likewise, high-density resequencing microarrays (RMAs) are another variant of microarray which has been used for rapid identification and molecular analysis of bacteria and viruses. It also enables to predict for the biomarkers during different clinical outcomes of dengue infection [93, 94]. RMA, microarray technique, has also been proven useful for the rapid and accurate identification of pathogens, specifically for the Rhabdoviridae family [95]. However, this technology often has certain limitations in the context of sensitivity, specificity, and reproducibility [38]. Tiling array which is a variant of microarray has a wide range of applications and had been used widely in the detection of multiple foodborne RNA viruses including multiple coxsackievirus serotype (A and B) strains and multiple hepatitis A virus genotype strains. It has also been used in differentiating the virus serotypes [96]. Further, the customized tiling array technique has been utilized into accurate identification of non-polyadenylated RNA in the case of human cytomegalovirus (HCMV) [97].

Transcriptome approach is also used to investigate transcriptional patterns associated with the dengue progression by analyzing the RNA-Seq of peripheral blood mononuclear cells. Patients with varying severity of dengue infection were compared with the patients with other febrile illnesses (OFIs) and the healthy controls. Researchers have collected the sample from individuals; RNA sequencing was performed by constructing a library using Illumina HiSeq 2500. Further, analysis of their data reveals the direct molecular mechanism of bleeding due to decreased platelet count in dengue patients [98]. RNA seq has been used in various studies to explore the potential of noncoding RNA as biomarkers and antiviral targets by analyzing the expression profiles of various noncoding RNAs during different viral infections [99,100,101,102,103,104,105]. An RNA seq study was carried out by Castillo et al. [2], where miRNA profiling in DENV-infected primary macrophages was done which identifies miR-3614-5p as an antiviral target [2]. piRNA profiling was done in DENV-infected Asian tiger mosquito and midgut [101]. On the other hand, differential expression of lncRNA during DENV infection was also carried out in which lncRNA was proven to be the potential biomarkers for disease progression [106].

Next-generation sequencing has been widely used nowadays for profiling and discovery of small noncoding RNAs in various infectious diseases. It has been used for profiling of small noncoding RNAs in dengue virus-infected Aedes mosquito cells to discover specific viral small noncoding RNAs that facilitate viral replication [107].

Illumina has been widely used to determine an active small interfering RNA-based antiviral response in case of West Nile virus-infected mosquitos [108] and to discover microRNA-like small RNA which autoregulates replication of dengue-2 virus in mosquito cells [109]. A study by Samuel et al. in 2018 carried out multiplex sequencing of DENV serotypes using Illumina and nanopore sequencing simultaneously. They found that multiplex sequencing was robust and generated full genome coverage for all DENV isolates in first attempt unlike single plex approach which failed to produce several amplicons [55]. In a meta-analysis study, microarray, Affymetrix, and Illumina datasets were compared which revealed the modulation of extracellular matrix and cell junctions during DENV infection which might prove to be the gene signatures of dengue infection [110]. Illumina has been also used to sequence tRNA-derived fragments during infection of respiratory syncytial viral infection [111]. In HIV-infected cells, novel miRNAs and piRNAs have been identified by using next-generation sequencing platform illumine [112].

With the advancements in omics technologies, third-generation sequencing platforms like nanopore can be used for direct sequencing of RNA. Analysis of viral transcriptome using conventional RNA sequencing methods is more complicated because of complexity in splicing patterns, overlapping reading frames, and high gene density. To overcome these problems, Depledge et al. in 2019 used nanopore sequencing to directly sequence RNA from host and viral RNA to study host-pathogen interaction during herpes simplex virus infection [113].

8 Conclusion and Future Perspectives

Before 2005, noncoding RNA field was very much scarcely studied. But with the emerging next-generation sequencing technologies, the number of studies increased in the last decade. Various tools and techniques have been introduced to make the ncRNA study to be more comprehensive. This chapter has reviewed the current and emerging approaches for decoding the spectrum of ncRNA functions with the primary focus on virus-related tools and techniques. The field of miRNA is the extensively studied field followed by siRNA and piRNA. While various bioinformatics tools and databases are available for miRNA, siRNA, piRNA, snoRNA, etc., for many ncRNAs, there is a lack of tools and databases for their study which primarily includes lncRNA, circRNA, virus-associated RNAs, etc. This could be one reason for impeding the associated research. Moreover, several tools, techniques, and databases have recently been introduced which help to dig deeper into the possible functions of ncRNAs. Due to extensive expansion of various transcriptome analysis tools, it is expected that various roles of noncoding RNAs are to be discovered in the future which will lead to better understanding of various virus pathophysiologies and development of noncoding RNA-based diagnosis and therapeutics.