Abstract
Due to the increasing availability of public bacterial genome data and cost efficiency of novel bacterial strain sequencing, phylogenetic analyses based on more than a single or few marker genes have become feasible. In this method protocol, we describe the complete bioinformatic workflow from raw genomic data to final phylogenetic analyses based on 107 conserved single copy genes. This approach can be used to perform phylogenetic reconstructions with high resolution on strain level or across taxa spanning different clades of the bacterial tree of life.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Chun J, Oren A, Ventosa A et al (2018) Proposed minimal standards for the use of genome data for the taxonomy of prokaryotes. Int J Syst Evol Microbiol 68:461–466
Yoon S-H, Ha S-M, Kwon S et al (2017) Introducing EzBioCloud: a taxonomically united database of 16S rRNA gene sequences and whole-genome assemblies. Int J Syst Evol Microbiol 67:1613–1617
Metzker ML (2010) Sequencing technologies—the next generation. Nat Rev Genet 11:31–46
Janda JM, Abbott SL (2007) 16S rRNA gene sequencing for bacterial identification in the diagnostic laboratory: pluses, perils, and pitfalls. J Clin Microbiol 45:2761–2764
Bernard G, Chan CX, Ragan MA (2016) Alignment-free microbial phylogenomics under scenarios of sequence divergence, genome rearrangement and lateral genetic transfer. Sci Reports 6:28970
Ankenbrand MJ, Keller A (2016) bcgTree: automatized phylogenetic tree building from bacterial core genomes. Genome 59:783–791
Na S-I, Kim YO, Yoon S-H et al (2018) UBCG: up-to-date bacterial core gene set and pipeline for phylogenomic tree reconstruction. J Microbiol 56:280–285
Bankevich A, Nurk S, Antipov D et al (2012) SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477
Seemann T (2014) Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069
Eddy SR (2011) Accelerated profile HMM searches. PLOS Comput Biol 7:e1002195
Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32:1792–1797
Castresana J (2000) Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol 17:540–552
Stamatakis A (2014) RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30:1312–1313
Letunic I, Bork P (2019) Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res 47:W256–W259
R Core Team (2013) R: a language and environment for statistical computing.
Revell LJ (2012) phytools: an R package for phylogenetic comparative biology (and other things). Meth Ecol Evol 3:217–223
Keller A, Brandel A, Becker MC et al (2018) Wild bees and their nests host Paenibacillus bacteria with functional potential of avail. Microbiome 6:229
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Keller, A., Ankenbrand, M.J. (2021). Inferring Core Genome Phylogenies for Bacteria. In: Mengoni, A., Bacci, G., Fondi, M. (eds) Bacterial Pangenomics. Methods in Molecular Biology, vol 2242. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-1099-2_4
Download citation
DOI: https://doi.org/10.1007/978-1-0716-1099-2_4
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-1098-5
Online ISBN: 978-1-0716-1099-2
eBook Packages: Springer Protocols