Abstract
Sequencing is a process used for determining the array of biopolymers applicable in identifying microbial gene arrangements, phenotypes, evolutionary biology, metagenomics, potential drug targets, gene cloning, etc. Commercial sequencers are emerging all around the globe due to rapid development of recombinant DNA (rDNA) technology. For sequencing DNA, next-generation sequencing (NGS) methods provide faster, inexpensive, accurate sequencing of polymers than traditional approaches. The concept of NGS is not quite new; it started in the mid-late 1990s with the successful introduction of methods thereafter, namely, Roche 454 pyrosequencing, Illumina sequencing, SOLiD sequencing, etc. In this book chapter, we briefly elaborate on the above-stated methods and its advantages and disadvantages. Furthermore, we will be discussing the sequencing methods that are under development in the biological research.
Access provided by CONRICYT-eBooks. Download chapter PDF
Similar content being viewed by others
Keywords
- Next-generation sequencing (NGS)
- Molecular diversity
- Bioinformatics
- Roche 454 pyrosequencing
- SOLiD sequencing
- True single-molecule sequencing (tSMS)
- Single-molecule real-time (SMRT) sequencing
- Ion torrent semiconductor sequencing
- Nanopore sequencing
1 Introduction
Deoxyribonucleic acid (DNA), a fundamental part of the central dogma of molecular biology, defines the complete organism. After the discovery of DNA in 1953 by two eminent scientists, sequencing its genes for studying the genotype and phenotype of cells was necessary. Frederick Sanger and Coulson in 1975 came up with the method of sequencing DNA (Sanger and Coulson 1975), and due to its drawbacks 2 years later, they developed another method based on chain termination of nucleotide strand (Sanger et al. 1977). It is considered to be the first-generation sequencing technology to be commercialized with a wide range of applications in molecular cloning, breeding, and evolution. In spite of its accuracy and applicability, traditional Sanger sequencer did not find scope because of its high cost and low throughput per run (Yadav et al. 2014). Molecular biologists from every nook of the world wanted to develop a faster, high-throughput, cheaper, and accurate method to sequence DNA.
Next-generation sequencing (NGS) technologies fulfilled the needs of emerging molecular biology laboratories. NGS technologies sequence thousands to millions of strands at the same time (Hall 2007). Roche 454 sequencer emerged as the first NGS platform working under the principle of pyrosequencing (Ronaghi et al. 1996). Following Roche, Illumina and ABI SOLiD commercialized their genome analyzer. HeliScope and PacBio RS sequencer developed by Helicos BioSciences and Pacific BioSciences respectively eliminated the PCR amplification step and consumed the total run time (Harris et al. 2007). Although these platforms utilized fluorophores and optical imaging to detect the sequence reads, Ion Personal Genome Machine of Life Technologies use pH detection system to accurately predict the sequences (Niedringhaus et al. 2011). Read length of the NGS technologies increased remarkably as a cause of nanopore detection system with greater accuracy and short run time. Due to high read length, sequencing larger fragments or even the whole genome is now possible with some required modifications. In spite of the remarkable success with biological nanopores, its inherent drawback such as nanopore instability at high temperatures was noted (Kang et al. 2005). To ameliorate the nanopore technology sustainable for upcoming generation, synthetic solid-state nanopores are under progress for commercialization, hoping that these nanopores will devote effective genetic analysis.
In this chapter, we will be discussing in brief on individual sequencing topics commencing with traditional Sanger sequencing, followed by an interconnection with current NGS technologies, and finally concluding with emerging NGS method. Merits and demerits of different DNA sequencing approaches along with their necessary bioinformatics tools are also included.
2 Traditional Method
2.1 Sanger Sequencing
Sanger and Coulson in 1975 described their first method known as “plus and minus” for sequencing DNA (Sanger and Coulson 1975), but due to its ineffectiveness Sanger and his co-workers in 1977 launched another sequencing method known as the “chain termination method” that uses chain-terminating inhibitors (Sanger et al. 1977). Since then, DNA sequencing has been carried out in the Sanger sequencer using the chain termination approach as a key principle. Prior to sequencing, the fragmented DNA strands are cloned into a suitable bacterial vector (having high copy number) for in vivo amplification. Then, sequencing starts with amplified template denaturation, annealing of primer to the resulting ssDNA, and extension of annealed primer. After incorporation of fluorescently labeled dideoxynucleotides (ddNTPs), each round of primer extension is randomly terminated. In any given fragment, each fluorescent label on the terminating ddNTP corresponds to a particular nucleotide (either A/T/G/C). Fragments are then separated based on their molecular weight by high-resolution electrophoresis separation method carried out in a capillary polymer gel. A detector coupled to the capillary chamber excites the fluorescent labels of the separated fragments by laser (Fig. 1). The resulting four-channel emission spectra provide the readout also known as “trace,” and these traces are then translated by base-calling software into DNA sequences (Ewing and Green 1998; Shendure and Ji 2008).
3 Current Methods
3.1 Roche 454 Pyrosequencing
Roche 454 sequencing system (http://www.454.com) was the first NGS technology to be commercialized. In 2005, Jonathan Rothberg (the founder of 454 Life Sciences) launched his first commercial NGS platform in the name of GS 20, and later in 2007 Roche Applied Science acquired 454 Life Sciences and marketed the second version of the 454 instrument as GS FLX (Voelkerding et al. 2009). The working principle of this high-throughput sequencer lies in the collaboration of two approaches, namely, single-molecule emulsion PCR (Tawfik and Griffiths 1998) with pyrosequencing technology (developed by Nyren et al. 1993 and refined by Ronaghi et al. 1996, 1998). Pyrosequencing is a DNA “sequencing-by-synthesis” method where synthesis/release of a pyrophosphate takes place during nucleotide incorporation (Ronaghi et al. 1996). A picotiter plate (PTP) is a microfabricated fiber-optic slide in which the pyrosequencing reaction takes place. One PTP measures about 44 μm in diameter and consists of more than a million reaction wells per plate with each well having 3.4 × 106 picoliter scale holding capacity. The walls of the wells are metal coated to ameliorate the signal-to-noise discrimination, and a charged couple device (CCD) camera is placed opposite to the PTP that records the light emitted from each bead (Margulies et al. 2005; Voelkerding et al. 2009).
A set of reactions occurring in the 454 pyrosequencing system includes fragmentation of the template DNA to prepare a library of several hundred base pairs in length and ligation of the library to 454-specific adaptor oligonucleotides. Next, the library is diluted to single-molecule concentration, followed by their denaturation into single strands and hybridization to Sepharose or Styrofoam beads carrying oligonucleotides complementary to the sequences of the adaptor. The fragment-containing beads (micelles) are emulsified in a “water-in-oil” emulsion mixture with PCR amplification reagents to create individual micelles. Then these isolated micelles are amplified using emulsion PCR (emPCR) to clonally expand individual DNA fragments into 1 million copies on the surface of each bead. Once beads are recovered after amplification, they are individually arrayed by centrifugation into a PTP with sequencing enzymes, and the plate is loaded into GS FLX sequencer. Enzyme ATP sulfurylase, DNA polymerase, luciferase, luciferin, adenosine 5′ phosphosulfate (APS), and apyrase are involved in catalyzing the pyrosequencing reaction (Mardis 2008a, b; Froehlich et al. 2010). During sequencing, the PTP acts as a flow cell that allows the flow of dinucleotide triphosphates (dNTPs) into the wells.
The enzyme DNA polymerase incorporate these nucleotides as a complementary base to the template strand with the release of inorganic pyrophosphate (PPi). The unmatched bases are degraded by the apyrase enzyme. ATP sulfurylase converts the released PPi into ATP, which then drives the luciferase reporter enzyme to signal luciferin to use the ATP to generate light (Fig. 2). This localized luminescence is transmitted through the PTP and gets recorded on a CCD camera. The amount of light generated is directly proportional to the number of nucleotide incorporated. With the repetitive cycles of pyrosequencing, wells of PTP are subsequently imaged, the signal-to-noise ratio is analyzed and filtered, and algorithmically (Newbler software) a translated output in the form of a linear sequence is given (Ronaghi et al. 1996, 1998; Voelkerding et al. 2009).
3.2 Illumina Sequencing
A highly parallelized adapted version of traditional Sanger sequencing is the Illumina sequencing that works on the principle of sequencing-by-synthesis (Illumina 2010). Initially, this platform originated from the work of Turcatti and colleagues from Manteia Predictive Medicine, Switzerland, and later it merged with four companies Solexa (Essex, UK), Lynx Therapeutics (Hayward, CA, USA), Manteia Predictive Medicine, and Illumina, respectively (Shendure and Ji 2008). Library is constructed by fragmentation of DNA template into 200–300 bp in length, and two different adaptors are ligated to both ends of the strands using PCR. The HiSeq 2500 sequencer uses a solid surface (known as single-molecule array or flow cell) which is an optically transparent slide with eight individual lanes and sequences complement to the adaptor attached on it (Bentley et al. 2008). Fragments are denatured into single strands, and with the help of flanked adaptor, they are annealed to the flow cell complementary sites. The free end adaptor of the array-bound DNA bends down and hybridizes to an opposite adaptor thereby creating a bridge. The bridged DNA strand that acts as a template for the synthesis of its complementary strand is cloned by using fluorophore-unlabeled nucleotides, and hence this process is known as bridge amplification. After amplification, strands are denatured and cleaved. The flow cell will now contain more than 40 million clusters wherein each cluster is composed of approximately 1000 clonal copies of individual template strands (Morozova and Marra 2008). During sequencing reaction, all four dNTPs along with DNA polymerase are added simultaneously to the channels of the flow cell. Each dNTP is bound to a unique fluorescent label reversible dye terminators. Upon nucleotide incorporation, the reversible terminator is cleaved, and the process is repeated until the template strand is made (Fig. 3). A colored fluorescent signal is generated once after the incorporation of each nucleotide, which is then recorded by an imaging device to determine the sequence (Sucher et al. 2012).
3.3 SOLiD Sequencing
Supported oligonucleotide ligation and detection (SOLiD) platform was originated from George Church laboratory in 2005 and later commercialized by Applied Biosystems (Life Technologies) (Shendure et al. 2005). The technology lies in the principle of sequencing by ligation of oligonucleotides to the DNA template. A mixture of short DNA fragments from the library are ligated to oligonucleotide adapters, immobilized on the surface of 1 μm paramagnetic beads, and clonally amplified by emulsion PCR (Dressman et al. 2003). Emulsion is then broken, and beads are recovered and are attached covalently to the surface of specially treated glass slide that is placed into a fluidics cassette within the sequencer to generate dense, disordered array. In this platform, two slides are processed per run, one slide receiving sequencing reactants, and second slide is being imaged (Mardis 2008b). The ligation-based sequencing begins with the annealing of universal primer (n) complementary to the array of amplicon-bearing beads that provides a 5′ phosphate substrate for DNA ligase. After annealing, DNA ligase is added along with four semi-degenerate octamer/8-mer fluorescent oligonucleotides in an automated manner within the instrument. The 8-mer fluorescent oligonucleotide is a probe that contains 2 probe-specific bases consisting of one of 16 possible 2-base combinations (TT, GT, and so on), 6 degenerate bases (nnnzzz), a ligation site (on first base), a cleavage site (on fifth base), and 4 different fluorescent dyes (linked to the last base) (Liu et al. 2012). Adjacent to the universal primer (n), DNA ligase seals the phosphate backbone when a matching 8-mer probe hybridizes to the DNA template sequence. A fluorescent readout is followed identifying the ligated 8-mer probe, which corresponds to one of the 16 possible combinations. Subsequently, linkage between the fifth and sixth base of the 8-mer is chemically cleaved removing the fluorescent group allowing further ligation. Seven ligation cycles (rounds) are performed to extend the first primer (n) and then the synthesized strand is denatured from the adapter/template. The second round of sequencing is initiated with hybridization of a new n − 1-positioned universal primer (that is offset by 1 base in the adapter sequence) to the synthesized strand and continues the rounds of ligation-mediated sequencing (Fig. 4). Process is repeated each time with a new primer with a successive offset (n − 2, n − 3, and so on). In this approach, all the templates are sequenced twice (Voelkerding et al. 2009; Shokralla et al. 2012). Finally, 2-base-calling processing software decodes the fluorescence generated from the universal primers into sequence reads.
3.4 True Single-Molecule Sequencing (tSMS)
True single-molecule DNA sequencing method was first developed by Stephen Quake and colleagues in 2003 and commercialized in 2007 by Helicos BioSciences (Cambridge, MA) as HeliScope sequencer (Braslavsky et al. 2003). It is a unique platform utilizing sequencing-by-synthesis approach without any clonal amplification of the DNA template by PCR (Harris et al. 2007; Pushkarev et al. 2009). Library is prepared by fragmenting template DNA into small, 100–200 bp fragments and polyadenylation of generated DNA fragments with poly-(A) tail at 3′ end, with the final adenosine fluorescently labeled with Cy3 (Shendure and Ji 2008). Denatured polyadenylated template strands hybridized to the poly-(T) oligonucleotides are immobilized on the surface of flow cell at a capture density of up to 100 × 106 template strands/cm2. The instrument records the position of each fluorescently labeled template on the array prior to sequencing and expects the sequence read in that position. After the positional coordinates are recorded by highly sensitive CCD camera, label is cleaved and washed before sequencing. Sequencing cycle begins by adding DNA polymerase and one of four Cy5-labeled (Cyanine-5) dNTPs to the flow cell resulting in template-dependent extension of DNA strands (Fig. 5). Here sequencing is asynchronous, i.e., during a particular round of sequencing, all the templates will not incorporate a nucleotide (Voelkerding et al. 2009; Anderson and Schrijver 2010). The emitted fluorescence signal is captured, and the images are recorded using CCD camera to determine incorporated nucleotide. After imaging, the label is chemically cleaved and removed by washing, and the cycle is repeated with the next Cy5-labeled dNTP. Therefore this sequencing platform utilizes single DNA molecules as templates rather than clonally amplified clusters and results in a higher sequencing output per run (Deschamps and Campbell 2010).
3.5 Single-Molecule Real-Time (SMRT) Sequencing
Pacific BioSciences (Menlo Park, CA) commercially introduced PacBio RS sequencer in 2010 for single-molecule real-time (SMRT) sequencing. It is a real-time, fluorescent single-molecule sequencing platform that relies on sequencing-by-synthesis approach (Korlach et al. 2010). A dense array nanostructure called as zero-mode waveguide (ZMW) manufactured on chip surface is the chief component of the sequencer. The chip is fabricated by perforating a thin metal film supported by a transparent substrate and contains more than 10–1000 ZMW well, with each well measuring about 10–50 nm in diameter. ZMW is a light-focusing structure that allows for real-time observation of DNA polymerization. Adapters are ligated onto the ends of the template DNA fragments, and a primer complementary to the adapter sequence is annealed. A calculated ratio of primer-annealed DNA templates and DNA polymerase molecules mixture is supplied to the instrument by a diffusion-mediated process. DNA polymerase is engineered to have a decreased rate of polymerization and a unique ability to incorporate fluorescently modified nucleotides (Mardis 2013). Next, fluorescently labeled nucleotides (linked to the phosphate) are added sequentially to the wells, and DNA polymerase incorporates these nucleotides to create DNA strand complementary to the template. The optical system is finely tuned to measure extremely small detection volume (20 × 10−21 L) of fluorescence emitted during reaction (Anderson and Schrijver 2010). Once phosphate diester bond is formed, a fluorescent signal is produced corresponding to the each incorporated nucleotide which is detected by the optical system (Eid et al. 2009). After each incorporation and detection event, fluorophore is cleaved from the growing strand and diffuses out of the wells (Fig. 6). The sequencer incorporates nucleotides at the rate of 10 bases/s, giving rise to a chain of a thousand nucleotides within minutes (Korlach et al. 2008). The instrument monitors all the wells constantly during the run, performs numerous calculations, condenses the data, and analyzes it to produce sequence reads.
3.6 Ion Torrent Semiconductor Sequencing
Ion Personal Genome Machine (PGM) is a fast, simple, massively scalable, versatile, and less costly sequencing technology commercialized in 2010 by Ion Torrent technology, a company that was later acquired by Life Technologies during the same year (Pareek et al. 2011). The sequencer relies on the principle detection of hydrogen ion concentration that is released as a by-product during nucleotide incorporation into the template by DNA polymerase (Rothberg et al. 2011). Library fragments constructed by DNA fragmentation and adapter ligation are hybridized to the bead containing adapter complementary sequences and then amplified by emulsion PCR (emPCR). After successful amplification, emulsion is broken, and the resulting beads are enriched. Enriched beads are annealed to the sequencing primer and are deposited into the highly dense microwells of an Ion Chip along with DNA polymerase by mild centrifugation. Ion Chip is a specialized silicon chip designed to detect pH changes of individual microwells (Mardis 2013). Upper surface of the chip delivers the required sequencing reactants. Beneath the microwells, an ion-sensitive layer followed by a field-effect transistor (FET)/ion sensor sub-layer is aligned to measure changes in pH of the solution (Niedringhaus et al. 2011). Sequentially nucleotides flow into the wells, wherein each well acts an individual DNA polymerization reaction chamber. Nucleotide complements with the template DNA and gets incorporated by polymerase action releasing hydrogen ions (H+) (Fig. 7). As a result, FET detects the change in pH of the solution (ΔpH), and a potential change (ΔV) is recorded as a direct measure of nucleotide incorporation events (Hui 2014). For instance, if two identical bases are incorporated, the voltage is double, and two identical bases are recorded. Next, if an unmatched nucleotide enters the well, no voltage change is recorded, and no base is called. The sequencer directly converts the received chemical information into digital information by processing the signals and by using base-calling algorithms to produce the DNA sequence associated with individual reads (http://www.lifetechnologies.com/).
4 Emerging Method
4.1 Nanopore Sequencing
Nanopore sequencing technology emerged from the independent work of David Deamer and his colleagues at the University of California (Santa Cruz, CA, USA) in 1996 (Branton et al. 2008). It is a fast-growing real-time sequencing technology, in that sequencing is carried out in a very thin porous membrane with pores ranging in nanoscale size (nanopores). Current is applied across the nanopore, and the negatively charged ssDNA traverses through it toward the positive terminal causing a change in electrical conductivity across the nanopore. The potential difference generated in picoamperes (pA) is measured using a circuit (Gupta 2008). This technology obviates the need to amplify the template fragments by PCR, synchronous washing of the reagent, and synthesis of strand complementary to the template and to reduce time (Ku and Roukos 2013). Based on the kind of nanopore, the technology is categorized into two types, namely, biological nanopore sequencing and solid-state nanopore sequencing.
4.1.1 Biological Nanopore Sequencing
In this type of sequencing, α-hemolysin (αHL), a protein nanopore, is held between a phospholipid bilayer separating two chambers filled with KCl solution. Two electrodes are placed on the opposite sides of the bilayer, and an electrical potential is applied during sequencing. The amount of current passing through the nanopore in a given moment varies depending on the shape, size, and length of the nucleotide that blocks the ion flow through the pore (Hui 2014). An exonuclease enzyme is attached to the nanopore surface to cleave individual nucleotide molecules from the DNA as it enters. Negatively charged ssDNA translocates through the membrane as positive potential is applied to the electrode. Translocation velocity varies with different parameters such as electric potential, type of nanopore, and strands of DNA (whether ss/ds) (Steinbock and Radenovic 2015). Ionic current is partially blocked during translocation producing different magnitudes of current disruption that in turn aids in differentiation among the four nucleotides (Fig. 8a).
Specificity of nucleotide detection across the membrane is tenfold higher in current MspA (Mycobacterium smegmatis porin A protein) nanopore than classical αHL (Manrao et al. 2011) (Fig. 8b). Translocation velocity of DNA can be slowed down for improving detection accuracies by combining MspA with DNA polymerase φ29. Oxford Nanopore Technologies (ONT) is only the leading company that develops, commercializes, and updates these types of bio-nanopore-based sequencing platforms. ONT in collaboration with Professor Daniel Branton, George Church, and Jene Golovchenko of Harvard, David Deamer and Mark Akeson of UCSC, and John Kasianowicz of NIST commercialized their first developer version of the nanopore sequencer “MinION” on 2014 as an early access program (MAP or MinION Access Program) to the researchers and investigators. MinION is a portable device commercialized for the analysis of DNA and RNA in real time. Recently, in May 2015, ONT introduced the second version of the device “MinION” as “MinION MkI” featuring its improvements in performance and easy access. Improvements in library preparation kits and temperature controlling system, an increase in total number of nanopores per flow cell, etc., can be found in MinION MkI. PromethION, a small benchtop platform developed by ONT for high-throughput real-time analysis of the DNA and RNA samples, is currently under the scheme of commercialization (Oxford Nanopore Technologies, UK).
4.1.2 Solid-State Nanopore Sequencing
This type of sequencing uses nanopores fabricated mechanically on a solid-state material such as silicon nitrides, silicon oxide or metal oxides, and graphene (Fig. 8c). These artificial nanopores are considered to be more stable than biological nanopores as it eliminates membrane instability and protein positioning drawbacks. IBM in collaboration with 454 Life Sciences is developing artificial nanopores by drilling a 10 nm titanium nitride membrane, which are separated by insulating layers of silica. When DNA strand translocates through the nanopore, the electric field across the layer is flipped resulting in immobilization of the strand and significant measuring of ionic current. Graphene is another new single-atom membrane that is 1–5 nm thick and has pores with a diameter of 5–10 nm (Niedringhaus et al. 2011). Graphene membrane improves sequencing and detection accuracies by allowing the passage of one base at a time (Pennisi 2012). Therefore, these kinds of nanopores are robust in their unique ability and leads researchers toward the next-generation sequencing.
5 Bioinformatics Tools
Sequence reads generated from various NGS platforms are analyzed using commercially available bioinformatics tools (Table 1). Each tool has a unique function, categorized as (Lee et al. 2013):
-
1.
Read alignment based on the reference genomes.
-
2.
De novo assembly involves grouping of generated short reads into contigs which are further assembled into scaffolds to reconstruct original genomic DNA of a new species.
-
3.
Identification of single nucleotide genetic variant (i.e., mutation) in the assembled sequence reads.
-
4.
Detection of functional variants altering the protein coding regions of individual genomes.
-
5.
Detection of large alterations such as indels, inversions, and translocation in the DNA (~1 kb–3 Mb).
-
6.
Transcriptome assembly.
6 Comparison of Various NGS Platforms
See Table 2.
7 Conclusion and Perspective
Next-generation sequencing (NGS) technologies contribute rapid, high-throughput, cost-effective probing of genomes of individual organisms. Geneticists have seen a prompt inclination in the sequencing technologies over the past decade. Despite the merits and limitations of each platform, these platforms can be used in tandem for decoding the entire genome of a distinct organism. Modern NGS technologies serve prominent roles in scrutinizing genome-level mutation and gene expression to divulge phenotype of diverse strains. Therefore, these technologies have augmented our minds toward the immense knowledge about the organism and their active systems at molecular level. In the successive years of NGS technologies, progress in state-of-the-art bioinformatics software will be of critical importance to elicit sophisticated amount of generated sequenced reads.
References
Anderson MW, Schrijver I (2010) Next generation DNA sequencing and the future of genomic medicine. Genes 1:38–69. doi:10.3390/genes1010038
Bentley DR, Balasubramanian S, Swerdlow HP et al (2008) Accurate whole human genome sequencing using reversible terminator chemistry. Nature 456:53–59. doi:10.1038/nature07517
Branton D, Deamer DW, Marziali A et al (2008) The potential and challenges of nanopore sequencing. Nat Biotechnol 26:1146–1153. doi:10.1038/nbt.1495
Braslavsky I, Hebert B, Kartalov E, Quake SR (2003) Sequence information can be obtained from single DNA molecules. Proc Natl Acad Sci U S A 100:3960–3964. doi:10.1073/pnas.0230489100
Butler J, MacCallum I, Kleber M et al (2008) ALLPATHS: de novo assembly of whole-genome shotgun microreads. Genome Res 18:810–820. doi:10.1101/gr.7337908
Chaisson MJ, Pevzner PA (2008) Short read fragment assembly of bacterial genomes. Genome Res 18:324–330. doi:10.1101/gr.7088808
Chen K, Wallis JW, McLellan MD et al (2009) BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat Methods 6:677–681. doi:10.1038/nmeth
Deschamps S, Campbell MA (2010) Utilization of next-generation sequencing platforms in plant genomics and genetic variant discovery. Mol Breed 25:553–570. doi:10.1007/s11032-009-9357-9
Dohm JC, Lottaz C, Borodina T, Himmelbauer H (2007) SHARCGS, a fast and highly accurate shortread assembly algorithm for de novo genomic sequencing. Genome Res 17:1697–1706. doi:10.1101/gr.6435207
Dressman D, Yan H, Traverso G et al (2003) Transforming single DNA molecules into fluorescent magnetic particles for detection and enumeration of genetic variations. Proc Natl Acad Sci 100:8817–8822. doi:10.1073/pnas.1133470100
Eid J, Fehr A, Gray J et al (2009) Real-time DNA sequencing from single polymerase molecules. Science 323:133–138. doi:10.1126/science.1162986
Ewing B, Green P (1998) Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res 8:186–194. doi:10.1101/gr.8.3.186
Froehlich T, Heindl D, Roesler A (2010) Miniaturized, high-throughput nucleic acid analysis. U.S. Patent 20100248237 A1. doi:10.1002/jctb.1421
Grabherr MG, Haas BJ, Yassour M et al (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29:644–652. doi:10.1038/nbt.1883
Gupta PK (2008) Single-molecule DNA sequencing technologies for future genomics research. Trends Biotechnol 26:602–611. doi:10.1016/j.tibtech.2008.07.003
Hall N (2007) Advanced sequencing technologies and their wider impact in microbiology. J Exp Biol 210:1518–1525. doi:10.1242/jeb.001370
Harris T, Buzby P, Jarosz M et al (2007) Optical train and method for tirf single molecule detection and analysis. U.S. Patent 20070070349
Hernandez D, Francois P, Farinelli L et al (2008) De novo bacterial genome sequencing: millions of very short reads assembled on a desktop computer. Genome Res 18:802–809. doi:10.1101/gr.072033
Hui P (2014) Next generation sequencing: chemistry, technology and applications. Top Curr Chem 336:1–18. doi:10.1007/128_2012_329
Illumina (2010) Technology spotlight: Illumina® sequencing
Jeck WR, Reinhardt JA, Baltrus DA et al (2007) Extending assembly of short DNA sequences to handle error. Bioinformatics 23:2942–2944. doi:10.1093/bioinformatics/btm451
Kang XF, Gu LQ, Cheley S, Bayley H (2005) Single protein pores containing molecular adapters at high temperatures. Angew Chem Int Ed Engl 44(10):1495–1499. doi:10.1002/anie.200461885
Korbel JO, Abyzov A, Mu XJ et al (2009) PEMer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data. Genome Biol 10:R23. doi:10.1186/gb-2009-10-2-r23
Korlach J, Marks PJ, Cicero RL et al (2008) Selective aluminum passivation for targeted immobilization of single DNA polymerase molecules in zero-mode waveguide nanostructures. Proc Natl Acad Sci U S A 105(4):1176–1181. doi:10.1073/pnas.0710982105
Korlach J, Bjornson KP, Chaudhuri BP et al (2010) Real-time DNA sequencing from single polymerase molecules. Methods Enzymol 472:431–455. doi:10.1016/S0076-6879(10)72001-2
Ku CS, Roukos DH (2013) From next-generation sequencing to nanopore sequencing technology: paving the way to personalized genomic medicine. Expert Rev Med Devices 10:1–6. doi:10.1586/erd.12.63
Lee W, Zhang Y, Mukhyala K et al (2009) Bi-directional SIFT predicts a subset of activating mutations. PLoS One 4:e8311. doi:10.1371/journal.pone.0008311
Lee CY, Chiu YC, Wang LB et al (2013) Common applications of next-generation sequencing technologies in genomic research. Transl Cancer Res 2:33–45. doi:10.3978/j.issn.2218-676X.2013.02.09
Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler Transform. Bioinformatics 25:1754–1760. doi:10.1093/bioinformatics/btp324
Li H, Ruan J, Durbin R (2008a) Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res 18:1851–1858. doi:10.1101/gr.078212
Li R et al (2008b) SOAP: short oligonucleotide alignment program. Bioinformatics 24(5):713–714. doi:10.1093/bioinformatics/btn025
Liu L, Li Y, Li S et al (2012) Comparison of next-generation sequencing systems. J Biomed Biotechnol 2012:251364. doi:10.1155/2012/251364
Manrao E, Derrington I, Pavlenok M et al (2011) Nucleotide discrimination with DNA immobilized in the MspA nanopore. PLoS One 6:e25723. doi:10.1371/journal.pone.0025723
Mardis ER (2008a) Next-generation DNA sequencing methods. Annu Rev Genomics Hum Genet 9:387–402. doi:10.1146/annurev.genom.9.081307.164359
Mardis ER (2008b) The impact of next-generation sequencing technology on genetics. Trends Genet 24:133–141. doi:10.1016/j.tig.2007.12.007
Mardis ER (2013) Next-generation sequencing platforms. Annu Rev Anal Chem 6:287–303. doi:10.1146/annurev-anchem-062012-092628
Margulies M, Egholm M, Altman WE et al (2005) Genome sequencing in open microfabricated high-density picoliter reactors. Nature 437:376–380. doi:10.1038/nature03959
Metzker ML (2010) Sequencing technologies—the next generation. Nat Rev Genet 11:31–46. doi:10.1038/nrg2626
Morozova O, Marra MA (2008) Applications of next-generation sequencing technologies in functional genomics. Genomics 92:255–264. doi:10.1016/j.ygeno.2008.07.001
Mortazavi A, Williams BA, McCue K et al (2008) Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods 5:621–628. doi:10.1038/nmeth.1226
Niedringhaus TP, Milanova D, Kerby MB et al (2011) Landscape of next-generation sequencing technologies. Anal Chem 83:4327–4341. doi:10.1021/ac2010857
Ning Z, Cox AJ, Mullikin JC (2001) SSAHA: a fast search method for large DNA databases. Genome Res 11(10):1725–1729. doi:10.1101/gr.194201
Nyren P, Pettersson B, Uhlen M (1993) Solid phase DNA mini sequencing by an enzymatic luminometric inorganic pyrophosphate detection assay. Anal Biochem 208:171–175. doi:10.1006/abio.1993.1024
Pareek CS, Smoczynski R, Tretyn A (2011) Sequencing technologies and genome sequencing. J Appl Genet 52:413–435. doi:10.1007/s13353-011-0057-x
Pennisi E (2012) Genome sequencing. Search for pore-fection. Science 336(6081):534–537. doi:10.1126/science.336.6081.534
Pushkarev D, Neff NF, Quake SR (2009) Single-molecule sequencing of an individual human genome. Nat Biotechnol 27:847–850. doi:10.1038/nbt.1561
Robertson G, Schein J, Chiu R et al (2010) De novo assembly and analysis of RNA-seq data. Nat Methods 7:909–912. doi:10.1038/nmeth.1517
Ronaghi M, Karamohamed S, Pettersson B et al (1996) Real-time DNA sequencing using detection of pyrophosphate release. Anal Biochem 242:84–89. doi:10.1006/abio.1996.0432
Ronaghi M, Uhlen M, Nyren P (1998) A sequencing method based on real-time pyrophosphate. Science 281(5375):363–365. doi:10.1126/science.281.5375.363
Rothberg JM, Hinz W, Rearick TM et al (2011) An integrated semiconductor device enabling non-optical genome sequencing. Nature 475:348–352. doi:10.1038/nature10242
Rumble SM, Lacroute P, Dalca AV et al (2009) SHRiMP: accurate mapping of short color-space reads. PLoS Comput Biol 5:e1000386. doi:10.1371/journal.pcbi.1000386
Sanger F, Coulson AR (1975) A rapid method for determining sequences in DNA by primed synthesis with DNA polymerase. J Mol Biol 94:441–448. doi:10.1016/0022-2836(75)90213-2
Sanger F, Nicklen S, Coulson AR (1977) DNA sequencing with chain-terminating inhibitors. Proc Nat Acad Sci U S A 74:5463–5467. doi:10.1073/pnas.74.12.5463
Shendure J, Ji H (2008) Next-generation DNA sequencing. Nat Biotechnol 26:1135–1145. doi:10.1038/nbt1486
Shendure J, Porreca GJ, Reppas NB et al (2005) Accurate multiplex polony sequencing of an evolved bacterial genome. Science 309:1728–1732. doi:10.1126/science.1117389
Shokralla S, Spall JL, Gibson JF, Hajibabaei M (2012) Next-generation sequencing technologies for environmental DNA research. Mol Ecol 21:1794–1805. doi:10.1111/j.1365-294X.2012.05538.x
Smith DR, Quinlan AR, Peckham HE et al (2008a) Rapid whole-genome mutational profiling using next-generation sequencing technologies. Genome Res 18:1638–1642. doi:10.1101/gr.077776
Smith AD, Xuan Z, Zhang MQ (2008b) Using quality scores and longer reads improves accuracy of Solexa read mapping. BMC Bioinf 9:128. doi:10.1186/1471-2105-9-128
Steinbock LJ, Radenovic A (2015) The emergence of nanopores in next generation sequencing. Nanotechnology 26:074003. doi:10.1088/0957-4484/26/7/074003
Stone EA, Sidow A (2005) Physico-chemical constraint violation by missense substitutions mediates impairment of protein function and disease severity. Genome Res 15:978–986. doi:10.1101/gr.3804205
Sucher NJ, Hennell JR, Carles MC (2012) DNA fingerprinting, DNA barcoding, and next generation sequencing technology in plants. Methods Mol Biol 862:13–22. doi:10.1007/978-1-61779-609-8_2
Tawfik DS, Griffiths AD (1998) Man-made cell-like compartments for molecular evolution. Nat Biotechnol 16:652–656. doi:10.1038/nbt0798-652
Voelkerding KV, Dames SA, Durtschi JD (2009) Next-generation sequencing: from basic research to diagnostics. Clin Chem 55:641–658. doi:10.1373/clinchem.2008.112789
Warren RL, Sutton GG, Jones SJ, Holt RA (2007) Assembling millions of short DNA sequences using SSAKE. Bioinformatics 23:500–501. doi:10.1093/bioinformatics/btl629
Wu TD, Watanabe CK (2005) GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics 21:1859–1875. doi:10.1093/bioinformatics/bti310
Yadav NK, Shukla P, Omer A et al (2014) Next generation sequencing: potential and application in drug discovery. Sci World J 2014:802437. doi:10.1155/2014/802437
Zerbino DR, Birney E (2008) Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18:821–829. doi:10.1101/gr.074492.107
Zerbino R, Tobes JM, Bossio ME et al (2012) On the orientation of fibres in structural members fabricated with self-compacting fibre reinforced concrete. Cement Concrete Comp 34(2): 191–200
Acknowledgements
Authors thank DST-SERB for the Grant No. SB/YS/LS- 79/2013, “Development of endophytic bacterial consortium from selected medicinal plants of Western Ghats of India,” and VELS University for their support in this work.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this chapter
Cite this chapter
Mohinudeen, C., Joe, M.M., Benson, A., Tongmin, S. (2017). An Overview of Next-Generation Sequencing (NGS) Technologies to Study the Molecular Diversity of Genome. In: Kalia, V., Kumar, P. (eds) Microbial Applications Vol.1. Springer, Cham. https://doi.org/10.1007/978-3-319-52666-9_14
Download citation
DOI: https://doi.org/10.1007/978-3-319-52666-9_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-52665-2
Online ISBN: 978-3-319-52666-9
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)