Abstract
It is shown how to sample alignments of two strings from the posterior probability distribution of alignments given those strings. The method is extended to sampling alignments of more than two strings. The result is applied first to estimating the edges of a given evolutionary tree over several strings. Second, used in conjunction with simulated annealing, it gives a stochastic search method for an optimal multiple alignment.
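The two-string case can be illustrated with a minimal sketch. It is not the model used in the article: it assumes a single-state model with fixed, hypothetical step weights (`w_match`, `w_mismatch`, `w_indel`), sums the weights of all alignments with a forward dynamic-programming table, and then traces back stochastically so that each alignment is drawn with probability proportional to the product of its step weights. The function names `forward_table` and `sample_alignment` are chosen here for illustration only.

```python
import random


def forward_table(a, b, w_match=0.8, w_mismatch=0.05, w_indel=0.075):
    """F[i][j] = total weight of all alignments of a[:i] with b[:j].

    The weights are illustrative assumptions, not values from the article.
    """
    n, m = len(a), len(b)
    F = [[0.0] * (m + 1) for _ in range(n + 1)]
    F[0][0] = 1.0
    for i in range(n + 1):
        for j in range(m + 1):
            if i == 0 and j == 0:
                continue
            total = 0.0
            if i > 0 and j > 0:  # align a[i-1] with b[j-1]
                w = w_match if a[i - 1] == b[j - 1] else w_mismatch
                total += F[i - 1][j - 1] * w
            if i > 0:  # a[i-1] against a gap
                total += F[i - 1][j] * w_indel
            if j > 0:  # gap against b[j-1]
                total += F[i][j - 1] * w_indel
            F[i][j] = total
    return F


def sample_alignment(a, b, rng=None, w_match=0.8, w_mismatch=0.05, w_indel=0.075):
    """Draw one alignment with probability proportional to its total weight."""
    rng = rng or random.Random()
    F = forward_table(a, b, w_match, w_mismatch, w_indel)
    i, j = len(a), len(b)
    columns = []
    while i > 0 or j > 0:
        # Each candidate: (weight of reaching (i, j) this way, column, new i, new j).
        moves = []
        if i > 0 and j > 0:
            w = w_match if a[i - 1] == b[j - 1] else w_mismatch
            moves.append((F[i - 1][j - 1] * w, (a[i - 1], b[j - 1]), i - 1, j - 1))
        if i > 0:
            moves.append((F[i - 1][j] * w_indel, (a[i - 1], "-"), i - 1, j))
        if j > 0:
            moves.append((F[i][j - 1] * w_indel, ("-", b[j - 1]), i, j - 1))
        # Pick a predecessor in proportion to its share of F[i][j].
        _, column, i, j = rng.choices(moves, weights=[mv[0] for mv in moves], k=1)[0]
        columns.append(column)
    columns.reverse()
    top = "".join(c[0] for c in columns)
    bottom = "".join(c[1] for c in columns)
    return top, bottom
```

Calling `sample_alignment("ACGT", "AGT", random.Random(0))` returns one pair of gapped strings; repeated calls give a sample whose spread reflects how uncertain the alignment is under the assumed weights. For long strings the table entries would underflow, so a practical version would work with scaled values or logarithms.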
Additional information
Correspondence to: L. Allison
Cite this article
Allison, L., Wallace, C.S. The posterior probability distribution of alignments and its application to parameter estimation of evolutionary trees and to optimization of multiple alignments. J Mol Evol 39, 418–430 (1994). https://doi.org/10.1007/BF00160274
DOI: https://doi.org/10.1007/BF00160274