Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
J. Adachi and M. Hasegawa. Model of amino acid substitution in proteins encoded by mitochondrial DNA. J Mol Evol, 42(4):459–468, Apr 1996.
J. Adachi, P.J. Waddell, W. Martin, and M. Hasegawa. Plastid genome phylogeny and a model of amino acid substitution for proteins encoded by chloroplast DNA. J Mol Evol, 50(4):348–358, Apr 2000.
D. A. Afonnikov, D. Y. Oshchepkov, and N. A. Kolchanov. Detection of conserved physico-chemical characteristics of proteins by analyzing clusters of positions with co-ordinated substitutions. Bioinformatics, 17(11):1035–1046, Nov 2001.
C. B. Anfinsen. Principles that govern the folding of protein chains.Science, 181(96):223–230, Jul 1973.
M. Anisimova, J. P. Bielawski, and Z. Yang. Accuracy and power of Bayes prediction of amino acid sites under positive selection. Mol Biol Evol, 19(6):950–958, Jun 2002.
W. R. Atchley, K. R. Wollenberg, W. M. Fitch, W. Terhalle, and A. W. Dress. Correlations among amino acid sites in bHLH protein domains: An information theoretic analysis. Mol Biol Evol, 17(1):164–178, Jan 2000.
E. Azarya-Sprinzak, D. Naor, H. J. Wolfson, and R. Nussinov. Interchanges of spatially neighbouring residues in structurally conserved environments. Protein Eng, 10(10):1109–1122, Oct 1997.
A. Bateman, E. Birney, L. Cerruti, R. Durbin, L. Etwiller, S. R. Eddy, S. Griffiths-Jones, K. L. Howe, M. Marshall, and E. L. L. Sonnhammer. The Pfam protein families database. Nucleic Acids Res, 30(1):276–280, Jan 2002.
S. A. Benner, M. A. Cohen, and G. H. Gonnet. Amino acid substitution during functionally constrained divergent evolution of protein sequences. Protein Eng, 7(11):1323–1332, Nov 1994.
C. Branden and J. Tooze. Introduction to Protein Structure. Garland Publishing, New York, 1999.
W. J. Bruno. Modeling residue usage in aligned protein sequences via maximum likelihood. Mol Biol Evol, 13(10):1368–1374, Dec 1996.
T. R. Buckley. Model misspecification and probabilistic tests of topology: Evidence from empirical data sets. Syst Biol, 51(3):509–523, Jun 2002.
T. R. Buckley, C. Simon, and G. K. Chambers. Exploring among-site rate variation models in a maximum likelihood framework using empirical data: Effects of model assumptions on estimates of topology, branch lengths, and bootstrap support. Syst Biol, 50(1):67–86, Feb 2001.
G. Chelvanayagam, A. Eggenschwiler, L. Knecht, G. H. Gonnet, and S. A. Benner. An analysis of simultaneous variation in protein structures. Protein Eng, 10(4):307–316, Apr 1997.
C. Chothia, J. Gough, C. Vogel, and S. A. Teichmann. Evolution of the protein repertoire. Science, 300(5626):1701–1703, Jun 2003.
T. M. Collins, P. H. Wimberger, and G. J. P. Naylor. Compositional bias, character-state bias, and character-state reconstruction using parsimony. Sys Biol, 43:482–496, 1994.
M. O. Dayhoff, R. M. Schwartz, and B. C. Orcutt. A model of evolutionary change in proteins. In M. O. Dayhoff, editor, Atlas of Protein Sequence and Structure, volume 5, chapter 22, pages 345–352. National Biomedical Research Foundation, Washington, DC, 1978.
M. W. Dimmic, D. P. Mindell, and R. A. Goldstein. Modeling evolution at the protein level using an adjustable amino acid fitness model. In Pacific Symposium on Biocomputing, pages 18–29. World Scientific, Singapore, 2000.
M. W. Dimmic, J. S. Rest, D. P. Mindell, and R. A. Goldstein. rtREV: an amino acid substitution matrix for inference of retrovirus and reverse transcriptase phylogeny. J Mol Evol, 55(1):65–73, Jul 2002.
A. K. Dunker, C. J. Brown, J. D. Lawson, L. M. Iakoucheva, and Z. Obradovic. Intrinsic disorder and protein function. Biochemistry,41(21):6573–6582, May 2002.
P. Fariselli, O. Olmea, A. Valencia, and R. Casadio. Progress in predicting inter-residue contacts of proteins with neural networks and correlated mutations. Proteins, Suppl 5:157–162, 2001. Evaluation Studies.
J. Felsenstein. Evolutionary trees from DNA sequences: a maximum likelihood approach. J Mol Evol, 17(6):368–376, 1981.
W. M. Fitch and E. Markowitz. An improved method for determining codon variability in a gene and its application to the rate of fixation of mutations in evolution. Biochem Genet, 4(5):579–593, Oct 1970.
K. M. Flaherty, D. B. McKay, W. Kabsch, and K. C. Holmes. Similarity of the three-dimensional structures of actin and the ATPase fragment of a 70-kDa heat shock cognate protein. Proc Natl Acad Sci USA, 88(11):5041–5045, Jun 1991.
M. S. Fornasari, G. Parisi, and J. Echave. Site-specific amino acid replacement matrices from structurally constrained protein evolution simulations. Mol Biol Evol, 19(3):352–356, Mar 2002, letter.
K. Fukami-Kobayashi, D. R. Schreiber, and S. A. Benner. Detecting compensatory covariation signals in protein evolution using reconstructed ancestral sequences. J Mol Biol, 319(3):729–743, Jun 2002.
N. Galtier. Maximum-likelihood phylogenetic analysis under a covarionlike model. Mol Biol Evol, 18(5):866–873, May 2001.
E. A. Gaucher, X. Gu, M. M. Miyamoto, and S. A. Benner. Predicting functional divergence in protein evolution by site-specific rate shifts. Trends Biochem Sci, 27(6):315–321, Jun 2002.
N. Goldman, J. L. Thorne, and D. T. Jones. Assessing the impact of secondary structure and solvent accessibility on protein evolution. Genetics, 149(1):445–458, May 1998.
N. Goldman and S. Whelan. Statistical tests of gamma-distributed rate heterogeneity in models of sequence evolution in phylogenetics. Mol Biol Evol, 17(6):975–978, Jun 2000, letter.
G. H. Gonnet, M. A. Cohen, and S. A. Benner. Analysis of amino acid substitution during divergent evolution: The 400 by 400 dipeptide substitution matrix. Biochem Biophys Res Commun, 199(2):489–496, Mar 1994.
S. Govindarajan, J. E. Ness, S. Kim, E. C. Mundorff, J. Minshull, and C. Gustafsson. Systematic variation of amino acid substitutions for stringent assessment of pairwise covariation. J Mol Biol, 328(5):1061–1069, May 2003.
J. Gu, Y. Wang, and X. Gu. Evolutionary analysis for functional divergence of Jak protein kinase domains and tissue-specific genes. J Mol Evol, 54(6):725–733, Jun 2002.
X. Gu. Statistical methods for testing functional divergence after gene duplication. Mol Biol Evol, 16(12):1664–1674, Dec 1999.
X. Gu. Mathematical modeling for functional divergence after gene duplication. J Comput Biol, 8(3):221–234, 2001.
J. P. Huelsenbeck. Testing a covariotide model of DNA substitution. Mol Biol Evol, 19(5):698–707, May 2002.
D. T. Jones, W. R. Taylor, and J. M. Thornton. The rapid generation of mutation data matrices from protein sequences. Comput Appl Biosci, 8(3):275–282, Jun 1992.
D. T. Jones, W. R. Taylor, and J. M. Thornton. A mutation data matrix for transmembrane proteins. FEBS Lett, 339(3):269–275, Feb 1994.
S. Kawashima and M. Kanehisa. AAindex: Amino acid index database. Nucleic Acids Res, 28(1):374, Jan 2000.
M. Kimura. Population Genetics, Molecular Evolution, and the Neutral Theory: Selected Papers. University of Chicago Press, Chicago, 1994.
H. Kishino, T. Miyata, and M. Hasegawa. Maximum likelihood inference of protein phylogeny and the origin of chloroplasts. J Mol Evol, 30:151–160, 1990.
B. Knudsen and M. M. Miyamoto. A likelihood ratio test for evolutionary rate shifts and functional divergence among proteins. Proc Natl Acad Sci USA, 98(25):14512–14517, Dec 2001.
R. Koradi, M. Billeter, and K. Wuthrich. MOLMOL: A program for display and analysis of macromolecular structures. J Mol Graph, 14(1):51–55, Feb 1996.
J. M. Koshi and R. A. Goldstein. Context-dependent optimal substitution matrices. Protein Eng, 8(7):641–645, Jul 1995.
J. M. Koshi and R. A. Goldstein. Models of natural mutations including site heterogeneity. Proteins, 32(3):289–295, Aug 1998.
S. M. Larson, A. A. Di Nardo, and A. R. Davidson. Analysis of covariation in an SH3 domain sequence alignment: Applications in tertiary contact prediction and the design of compensating hydrophobic core substitutions. J Mol Biol, 303(3):433–446, Oct 2000.
P. Lió and N. Goldman. Using protein structural information in evolutionary inference: Transmembrane proteins. Mol Biol Evol, 16(12):1696–1710, Dec 1999.
P. Lió and N. Goldman. Modeling mitochondrial protein evolution using structural information. J Mol Evol, 54(4):519–529, Apr 2002.
L. Lo Conte, B. Ailey, T. J. Hubbard, S. E. Brenner, A. G. Murzin, and C. Chothia. SCOP: A structural classification of proteins database. Nucleic Acids Res, 28(1):257–259, Jan 2000.
P. Lopez, D. Casane, and H. Philippe. Heterotachy, an important process of protein evolution. Mol Biol Evol, 19(1):1–7, Jan 2002.
Y. Mandel-Gutfreund, S. M. Zaremba, and L. M. Gregoret. Contributions of residue pairing to beta-sheet formation: Conservation and covariation of amino acid residue pairs on antiparallel beta-strands. J Mol Biol, 305(5):1145–1159, Feb 2001.
T. Miyata, S. Miyazawa, and T. Yasunaga. Two types of amino acid substitutions in protein evolution. J Mol Evol, 12(3):219–236, Mar 1979.
K. Mizuguchi and T. Blundell. Analysis of conservation and substitutions of secondary structure elements within protein superfamilies. Bioinformatics, 16(12):1111–1119, Dec 2000.
T. Muller, R. Spang, and M. Vingron. Estimating amino acid substitution models: A comparison of Dayhoff’s estimator, the resolvent approach and a maximum likelihood method. Mol Biol Evol, 19(1):8–13, Jan 2002.
S. V. Muse. Evolutionary analyses of DNA sequences subject to constraints of secondary structure. Genetics, 139(3):1429–1439, Mar 1995.
G. J. Naylor and W. M. Brown. Structural biology and phylogenetic estimation. Nature, 388(6642):527–528, Aug 1997, letter.
E. Neher. How frequent are correlated changes in families of protein sequences? Proc Natl Acad Sci USA, 91(1):98–102, Jan 1994.
R. Nielsen and Z. Yang. Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. Genetics, 148(3):929–936, Mar 1998.
J. Overington, D. Donnelly, M. S. Johnson, A. Sali, and T. L. Blundell. Environment-specific amino acid substitution tables: Tertiary templates and prediction of protein folds. Protein Sci, 1(2):216–226, Feb 1992.
L. Patthy. Protein Evolution. Blackwell Science, London, 1999.
D. D. Pollock and W. R. Taylor. Effectiveness of correlation analysis in identifying protein residues undergoing correlated evolution. Protein Eng, 10(6):647–657, Jun 1997.
D. D. Pollock, W. R. Taylor, and N. Goldman. Coevolving protein residues: Maximum likelihood identification and relationship to structure. J Mol Biol, 287(1):187–198, Mar 1999.
T. Pupko, R. E. Bell, I. Mayrose, F. Glaser, and N. Ben-Tal. Rate4Site: An algorithmic tool for the identification of functional regions in proteins by surface mapping of evolutionary determinants within their homologues. Bioinformatics, 18(Suppl 1):71–77, Jul 2002.
D. M. Robinson, D. T. Jones, H. Kishino, N. Goldman, and J. L. Thorne. Protein evolution with dependence among codons due to tertiary structure. Mol Biol Evol, 20(10):1692–1704, Oct 2003.
M. Schoeniger and A. von Haeseler. Toward assigning helical regions in alignments of ribosomal RNA and testing the appropriateness of evolutionary models. J Mol Evol, 49(5):691–698, Nov 1999.
O. Schueler and H. Margalit. Conservation of salt bridges in protein families. J Mol Biol, 248(1):125–135, Apr 1995.
I. N. Shindyalov, N. A. Kolchanov, and C. Sander. Can three-dimensional contacts in protein structures be predicted by analysis of correlated mutations. Protein Eng, 7(3):349–358, Mar 1994.
O. Soyer, M. W. Dimmic, R. R. Neubig, and R. A. Goldstein. Using evolutionary methods to study G-protein coupled receptors. In Pacific Symposium on Biocomputing, pages 625–636. World Scientific, Singapore, 2002.
O. Soyer, M. W. Dimmic, R. R. Neubig, and R. A. Goldstein. Dimerization in aminergic G-protein coupled receptors: Application of a hidden site-class model of evolution. Biochemistry, 42(49):14522–14531, Dec 2003.
K. Tomii and M. Kanehisa. Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins. Protein Eng, 9(1):27–36, Jan 1996.
P. Tufféry and P. Darlu. Exploring a phylogenetic approach for the detection of correlated substitutions in proteins. Mol Biol Evol, 17(11):1753–1759, Nov 2000.
C. Tuffley and M. Steel. Modeling the covarion hypothesis of nucleotide substitution. Math Biosci, 147(1):63–91, Jan 1998.
H. Wako and T. L. Blundell. Use of amino acid environment-dependent substitution tables and conformational propensities in structure prediction from aligned sequences of homologous proteins. I. Solvent accessibility classes. J Mol Biol, 238(5):682–692, May 1994.
S. Whelan and N. Goldman. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol, 18(5):691–699, May 2001.
K. R. Wollenberg and W. R. Atchley. Separation of phylogenetic and functional associations in biological sequences by using the parametric bootstrap. Proc Natl Acad Sci USA, 97(7):3288–3291, Mar 2000.
Z. Yang. Estimating the pattern of nucleotide substitution. J Mol Evol, 39(1):105–111, Jul 1994.
Z. Yang. Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: Approximate methods. J Mol Evol, 39(3):306–314, Sep 1994.
Z. Yang. Relating physicochemical properties of amino acids to variable nucleotide substitution patterns among sites. In Pacific Symposium on Biocomputing, pages 81–92. World Scientific, Singapore, 2000.
Z. Yang, R. Nielsen, and M. Hasegawa. Models of amino acid substitution and applications to mitochondrial protein evolution. Mol Biol Evol, 15(12):1600–1611, Dec 1998.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer Science+Business Media, Inc.
About this chapter
Cite this chapter
Dimmic, M.W. (2005). Markov Models of Protein Sequence Evolution. In: Statistical Methods in Molecular Evolution. Statistics for Biology and Health. Springer, New York, NY. https://doi.org/10.1007/0-387-27733-1_9
Download citation
DOI: https://doi.org/10.1007/0-387-27733-1_9
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-22333-9
Online ISBN: 978-0-387-27733-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)