Abstract
In this study, we examine approaches to assembling large, contiguous sections of genetic code from the short reads produced by laboratory sequencing techniques. We explore the Eulerian path approach in detail, which uses a de Bruijn graph, and demonstrate current software and algorithms on a sample genome. We also investigate the input parameters of the Velvet assembler and discuss their implications.
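To make the Eulerian path idea concrete, the following is a minimal Python sketch, not the method used in the paper or in Velvet. It assumes error-free reads, a genome in which no k-mer repeats, and a graph that contains an Eulerian path. The function names (`de_bruijn`, `eulerian_path`, `assemble`) and the toy reads are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def de_bruijn(reads, k):
    """Build a de Bruijn graph whose nodes are (k-1)-mers.

    Assumption: each distinct k-mer contributes exactly one edge, so
    overlapping reads do not create duplicate edges.
    """
    kmers = set()
    for read in reads:
        for i in range(len(read) - k + 1):
            kmers.add(read[i:i + k])
    graph = defaultdict(list)
    for kmer in sorted(kmers):  # sorted only for a deterministic order
        graph[kmer[:-1]].append(kmer[1:])
    return graph


def eulerian_path(graph):
    """Walk every edge once with Hierholzer's algorithm.

    Assumption: an Eulerian path exists in the graph.
    """
    in_deg = defaultdict(int)
    for successors in graph.values():
        for node in successors:
            in_deg[node] += 1
    # Start at the node with one more outgoing than incoming edge,
    # or at an arbitrary node if the graph is a cycle.
    start = next(
        (u for u in graph if len(graph[u]) - in_deg[u] == 1),
        next(iter(graph)),
    )
    remaining = {u: list(vs) for u, vs in graph.items()}
    stack, path = [start], []
    while stack:
        u = stack[-1]
        if remaining.get(u):
            stack.append(remaining[u].pop())
        else:
            path.append(stack.pop())
    return path[::-1]


def assemble(reads, k):
    """Spell the sequence along the Eulerian path."""
    path = eulerian_path(de_bruijn(reads, k))
    return path[0] + "".join(node[-1] for node in path[1:])


# Toy reads (hypothetical) that overlap to cover a 10-base sequence.
reads = ["ATGGCG", "GCGTGC", "GTGCA"]
print(assemble(reads, 4))
```

In this sketch the k-mer length `k` plays the role of the main tuning parameter: a smaller value yields more overlaps between reads but makes repeated k-mers more likely, while a larger value does the reverse.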
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Elliot, A.C., Perkins, A.L., Yenduri, S. (2012). A Parameterization Study of Short Read Assembly Using the Velvet Assembler. In: Satapathy, S.C., Avadhani, P.S., Abraham, A. (eds) Proceedings of the International Conference on Information Systems Design and Intelligent Applications 2012 (INDIA 2012) held in Visakhapatnam, India, January 2012. Advances in Intelligent and Soft Computing, vol 132. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27443-5_21
DOI: https://doi.org/10.1007/978-3-642-27443-5_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27442-8
Online ISBN: 978-3-642-27443-5
eBook Packages: Engineering (R0)