Multiple Protein Sequence Alignment with MSAProbs

Liu, Yongchao; Schmidt, Bertil

doi:10.1007/978-1-62703-646-7_14

Yongchao Liu³ &
Bertil Schmidt³

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1079))

5159 Accesses
11 Citations

Abstract

Multiple sequence alignment (MSA) generally constitutes the foundation of many bioinformatics studies involving functional, structural, and evolutionary relationship analysis between sequences. As a result of the exponential computational complexity of the exact approach to producing optimal multiple alignments, the majority of state-of-the-art MSA algorithms are designed based on the progressive alignment heuristic. In this chapter, we outline MSAProbs, a parallelized MSA algorithm for protein sequences based on progressive alignment. To achieve high alignment accuracy, this algorithm employs a hybrid combination of a pair hidden Markov model and a partition function to calculate posterior probabilities. Furthermore, we provide some practical advice on the usage of the algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

QuickProbs 2: Towards rapid construction of high-quality alignments of large protein families

Article Open access 31 January 2017

Multiple Sequence Alignment Algorithms in Bioinformatics

Multiple Alignment of Structures Using Center Of ProTeins

References

Feng DF, Doolittle RF (1987) Progressive sequence alignment as a prerequisite to correct phylogenetic trees. J Mol Evol 25:351–361
Article PubMed CAS Google Scholar
Liu Y, Schmidt B, Maskell DL (2010) MSAProbs: multiple sequence alignment based on pair hidden Markov models and partition function posterior probabilities. Bioinformatics 26:1958–1964
Article PubMed CAS Google Scholar
Durbin R, Eddy SR, Krogh A, Mitchison G (1998) Biological sequence analysis: probabilistic models of proteins and nucleic acids. Cambridge University Press, Cambridge
Book Google Scholar
Miyazawa S (1995) A reliable sequence alignment method based on probabilities of residue correspondences. Protein Eng 8:999–1009
Article PubMed CAS Google Scholar
Thompson JD, Koehl P, Ripp R, Poch O (2005) BAliBASE 3.0: latest developments of the multiple sequence alignment benchmark. Proteins 61:127–136
Article PubMed CAS Google Scholar
Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32:1792–1797
Article PubMed CAS Google Scholar
Sievers F, Wilm A, Dineen D et al (2011) Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol 7:539
Article PubMed Google Scholar
Chang JM, Di Tommaso P, Taly JF et al (2012) Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee. BMC Bioinformatics 13:S1
Article PubMed CAS Google Scholar
Deng X, Cheng J (2011) MSACompro: protein multiple sequence alignment using predicted secondary structure, solvent accessibility, and residue–residue contacts. BMC Bioinformatics 12:472
Article PubMed CAS Google Scholar
Vingron M, Argos P (1989) A fast and sensitive multiple sequence alignment algorithm. Comput Appl Biosci 5:115–121
PubMed CAS Google Scholar
Gotoh O (1990) Consistency of optimal sequence alignments. Bull Math Biol 52:509–525
PubMed CAS Google Scholar
Notredame C, Holm L, Higgins DG (1998) COFFEE: an objective function for multiple sequence alignments. Bioinformatics 14:407–422
Article PubMed CAS Google Scholar
Notredame C, Higgins DG, Heringa J (2000) T-coffee: a novel method for fast and accurate multiple sequence alignment. J Mol Biol 302:205–217
Article PubMed CAS Google Scholar
Do CB, Mahabhashyam MS, Brudno M et al (2005) ProbCons: probabilistic consistency-based multiple sequence alignment. Genome Res 15:330–340
Article PubMed CAS Google Scholar
Liu Y, Schmidt B, Maskell DL (2009) MSA-CUDA: multiple sequence alignment on graphics processing units with CUDA. 20th IEEE international conference on application-specific systems, architectures and processors, pp 121–128
Google Scholar
Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22:4673–4680
Article PubMed CAS Google Scholar

Download references

Author information

Authors and Affiliations

Institut für Informatik, Johannes Gutenberg Universitat Mainz, Mainz, Germany
Yongchao Liu & Bertil Schmidt

Authors

Yongchao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Bertil Schmidt
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Electrical Engineering, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
David J Russell

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

Liu, Y., Schmidt, B. (2014). Multiple Protein Sequence Alignment with MSAProbs. In: Russell, D. (eds) Multiple Sequence Alignment Methods. Methods in Molecular Biology, vol 1079. Humana Press, Totowa, NJ. https://doi.org/10.1007/978-1-62703-646-7_14

Download citation

DOI: https://doi.org/10.1007/978-1-62703-646-7_14
Published: 23 August 2013
Publisher Name: Humana Press, Totowa, NJ
Print ISBN: 978-1-62703-645-0
Online ISBN: 978-1-62703-646-7
eBook Packages: Springer Protocols

Publish with us

Policies and ethics

Multiple Protein Sequence Alignment with MSAProbs

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

QuickProbs 2: Towards rapid construction of high-quality alignments of large protein families

Multiple Sequence Alignment Algorithms in Bioinformatics

Multiple Alignment of Structures Using Center Of ProTeins

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this protocol

Cite this protocol

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Multiple Protein Sequence Alignment with MSAProbs

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

QuickProbs 2: Towards rapid construction of high-quality alignments of large protein families

Multiple Sequence Alignment Algorithms in Bioinformatics

Multiple Alignment of Structures Using Center Of ProTeins

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this protocol

Cite this protocol

Download citation

Publish with us

Search

Navigation