Summary
We outline a method for estimating quantitatively the influence of point mutations and selection on the frequencies of codons and amino acids. We show how the mutation rate, i.e., the rate of amino acid replacement due to point mutation, can be affected by the codon usage as well as by the rates of the involved base exchanges. A comparison of the mutation rates calculated from reliable values of codon usage and base exchange probabilities with those that would be expected on the basis of chance reveals a notable suppression of replacements leading to tryptophan, glutamate, lysine, and methionine, and particularly of those leading to the termination codons.
If selection constraints are neglected and only mutations are taken into account, the best agreement between expected and observed frequencies of both codons and amino acids is obtained for α=1.13–1.15, where
The “selection values” of codons and amino acids derived by our method show a pattern that partially deviates from others in the literature. For example, the selection pressure on methionine and cysteine turns out to be much more pronounced than expected if only the discrepancies between their observed and expected occurrences in proteins are considered. To estimate to what extent randomly occurring amino acid replacements are accepted by selection, we constructed an “acceptability matrix” from the well-established matrix of accepted point mutations. On the basis of this matrix “acceptability values” of the amino acids can be defined that correlate with their selection values.
We also examine the significance of mutations and selection of amino acids with respect to their physicochemical properties and functions in proteins. The conservatism of amino acid replacements with respect to certain properties such as polarity can be brought about by the mutational process alone, whereas the conservatism with respect to other relevant properties-among them all measures of bulkiness-obviously is the result of additional selectional constraints on the evolution of protein structures.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Aboderin AA (1971) An empirical hydrophobic scale for α-amino-acids and some of its applications. Int J Biochem 2:537–544
Alff-Steinberger C (1969) The genetic code and error transmission. Proc Natl Acad Sci USA 64:584–591
Amelunxen R, Murdock AL (1978) Mechanisms of thermophily. CRC Crit Rev Microbiol xx:343–393
Argos P, Rossman MG, Grau UM, Zuber H, Frank G, Tratschin JD (1979) Thermal stability and protein structure. Biochemistry 18:5698–5703
Argyle E (1980) A similarity ring for amino acids based on their evolutionary substitution rate. Orig Life 10:357–360
Batchinsky AG (1976) Structure and noise immunity of the genetic code. J Gen Biol 37:163–174
Berger EM (1978) Pattern and chance in the use of the genetic code. J. Mol Evol 10:319–323
Brown AL (1982) Evolution and molecular biology. Nature 298:793–794
Bull HB, Breese K (1974) Surface tension of amino acid solutions: a hydrophobicity scale of the amino acid residues. Arch Biochem Biophys 161:665–670
Cercignany C (1975) Theory and applications of the Boltzmann equation. Scottish Academic Press, Edinburgh London
Charton M (1981) Protein folding and the genetic code: an alternative quantitative model. J Theor Biol 91:115–123
Chothia C (1975) Structural invariants in protein folding. Nature 254:304–308
Chothia C (1976) The nature of the accessible and buried surfaces in proteins. J Mol Biol 105:1–14
Chou PY, Fasman GD (1978) Prediction of secondary-structure of proteins from their amino-acid sequence. Adv Enzymol 47:45–148
Conrad M, Friedlander C, Goodman M (1983) Evidence that natural selection acts on silent mutation. Biosystems 16:101–111
Cornish-Bowden A, Marson A (1977) Evaluation of the nonrandomness of protein composition. J Mol Evol 10:231–240
Coulondre C, Miller JH (1977) Genetic studies of the lac-repressor. Part IV: Mutagenic specifity in the LacI gene ofEscherichia coli. J Mol Biol 117:577–606
Coutelle R, Hofacker GLC (1982) Influence of selective processes on the amino acid composition of proteins: collagen, cytochrome c, ferredoxin and α-crystallin. J Theor Biol 95: 615–639
Cox EC (1976) Bacterial mutator genes and the control of spontaneous mutation. Annu Rev Genet 10:135–156
Davies J, Jones DS, Khorano HG (1966) A further study of misreading of codons induced by streptomycin and neomycin using ribopolynucleotides containing two nucleotides in alternating sequence as templates. J Mol Biol 18:48–57
Dayhoff MO, Eck RV, Park CM (1972) A model of evolutionary change in proteins. In: Dayhoff MO (ed) Atlas of protein sequence and structure. National Biomedical Research Foundation Washington, DC, pp. 89–99
Doolittle RF (1979) Protein evolution. In: Neurath H, Hill RL (eds) The proteins, 3rd edn, vol IV. Academic Press, New York, pp 1–118
Dunill P (1968) The use of helical net-diagrams to represent protein structures. Biophys J 8:865–875
Eigen M, Schuster P (1979) The hypercycle: a principle of natural self-organization. Springer-Verlag, Berlin Heidelberg New York
Epstein CJ (1967) Non-randomness of amino acid changes in the evolution of homologous proteins. Nature 215:355–359
Fendler JH, Nome F, Nagyvary J (1975) Compartmentalization of amino acids in surfactant aggregates. Partitioning between water and aqueous micellar sodium dodecanoate and between hexane and dodecylammonium propeonate trapped in hexane. J Mol Evol 6:215–232
Fersht AR (1979) Fidelity of replication of phage ΦX174 DNA by DNA-polymerase III holoenzyme: spontaneous mutation by misincorporation. Proc Natl Acad Sci USA 76:4946–4950
Fersht AR, Knill-Jones JW (1981) DNA polymerase accuracy and spontaneous mutation rates: frequencies of purine-purine, purine-pyrimidine and pyrimidine-pyrimidine mismatches during DNA replication. Proc Natl Acad Sci USA 78:4251–4255
Fitch WM (1967) Evidence suggesting a non-random character to nucleotide replacements in naturally occurring mutations. J Mol Biol 26:499–507
Fitch WM (1980) Estimating the total number of nucleotide substitutions since the common ancestor of a pair of homologous genes: Comparison of several methods and three beta hemoglobin messenger RNAs. J Mol Evol 16:153–209
Gamov G (1954) Possible relation between deoxyribonucleic acid and protein structure. Nature 173:318–320
Golding GB, Strobeck C (1982) Expected frequencies of codon use as a function of mutation rates and codon fitness. J Mol Evol 18:379–386
Goldsack DE, Chalifoux RC (1973) Contributions of the free energy of mixing of hydrophobic side chains to the stability of the tertiary structure of proteins. J Theor Biol 39:645–651
Grantham R (1974) Amino acid difference formula to help explain protein evolution. Science 185:862–864
Grantham R, Gautier C, Gouvy M (1980) Codon frequencies in 119 individual genes confirm consistent choices of degenerate bases according the genome type. Nucleic Acids Res 8: 1893–1912
Grantham R, Gautier C, Gouvy M, Jacobsone M, Mercier R (1981) Codon catalog usage is a genome strategy modulated for gene expressivity. Nucleic Acids Res 9:r43-r74
Grosjean H, Sankoff D, Jou WM, Fiers W, Cedergren RJ (1978) Bacteriophage MS2 RNA: a correlation between the stability of the codon anticodon interaction and the choice of code words. J Mol Evol 12:113–119
Hendry LB, Bransome ED, Petersheim M (1981a) Are there structural analogies between amino acids and nucleic acids? Orig Life 11:203–221
Hendry LB, Bransome ED, Hutson MS, Campbell LK (1981b) First approximation of a stereochemical rationale for the genetic code based on the topography and physicochemical properties of cavities constructed from models of DNA. Proc Natl Acad Sci USA 78:7440–7444 (see also refs 3–47 within)
Holland JP, Holland MJ (1980) Structural comparison of two non-tandemly repeated yeast glyceraldehyde-3-phosphate dehydrogenase genes. J Biol Chem 255:2596–2605
Holmquist R (1978) Evaluation of compositional nonrandomness in proteins. J Mol Evol 11:349–360
Holmquist R, Pearl D (1980) Theoretical foundations for quantitative paleogenetics. Part III: The molecular divergence of nucleic acids and proteins for the case of genetic events of unequal probability. J Mol Evol 16:211–267
Ikemura T (1981) Correlation between the abundance ofEscherichia coli transfer RNAs and the occurrence of the respective codons in the protein genes. J Mol Biol 146:1–21
Jones DD (1975) Amino acid properties and side chain orientations in proteins. J Theor Biol 50:167–183
Jukes TH (1975) Mutation in proteins and base changes in codons. Biochem Biophys Res Commun 66:1–18
Jukes TH, Holmquist R, Moise H (1975) Amino acid composition of proteins: selection against the genetic code. Science 189:50–51
Kafatos FC, Efstratiadis A, Forget BE, Weissmann SM (1977) Molecular evolution of human and rabbit β-globin mRNAs. Proc Natl Acad Sci USA 74:5618–5622
Krigbaum WR, Komoriya A (1979) Local interactions as a structure determinant for protein molecules. Biochim Biophys Acta 576:204–228
Laird M, Holmquist R (1975) Tables of critical values for examining compositional non-randomness in proteins and nucleic acids. J Mol Evol 4:261–276
Lehmann AR, Karran P (1981) DNA repair. Int Rev Cytol 72:101–146
Li, W-H (1981) Simple method for constructing phylogenetic trees from distance matrices. Proc Natl Acad Sci USA 78: 1085–1089
Manavalan P, Ponnuswamy PK (1978) Hydrophobic character of amino acid residues in globular proteins. Nature 275:673–674
McLachlan AD (1971) Tests for comparing related amino acid sequences: cytochrome c and cytochrome c551. J Mol Biol 61: 409–424
Modiani G, Battistuzzi G, Motulsky AG (1981) Nonrandom patterns of codon usage and of nucleotide substitutions in human and β-globin genes: An evolutionary strategy reducing the rate of mutations with drastic effects? Proc Natl Acad Sci USA 78:1110–1114
Ohta T, Kimura M (1971) Amino acid composition of proteins as a product of molecular evolution. Science 174:150–153
Olsen KW (1980) Internal residue criteria for predicting three dimensional protein structure. Biochim Biophys Acta 622: 259–267
Oobatake H, Ooi T (1977) An analysis of nonbonded energy of proteins. J Theor Biol 67:567–587
Ozoline ON, Oganesjan HG, Kamzolova SG (1980) On the fidelity of transcription byEscherichia coli RNA polymerase. FEBS Lett 110:123–125
Papentin F (1973) A Darwinian evolutionary system. Part II: Experiments on protein evolution and evolutionary aspects of the genetic code. J Theor Biol 39:417–430
Pieczenik G (1980) Predicting coding function from nucleotide sequence or survival of “fitness” of tRNA. Proc Natl Acad Sci USA 77:3539–3543
Ponnuswamy PK, Muthusany R, Manavalan P (1982) Amino acid composition and thermal stability of proteins. Int J Biol Macromol 4:186–190
Richard AB, Barry NJ, Divulet FE, Garner WH, Lehmann LD, Gurd FRN (1980) Evolution of the amino acid substitution in the mammalian myoglobin gene. J Mol Evol 15:197–218
Robson B, Suzuki E (1976) Conformational properties of amino acid residues in globular proteins. J Mol Biol 107:327–356
Salemme FR, Miller MD, Jordan SR (1977) Structural convergence during protein evolution. Proc Natl Acad Sci USA 74:2820–2824
Sander C, Schulz GE (1979) Degeneracy of the information contained in amino acid sequence: evidence from overlaid genes. J Mol Evol 13:245–252
Singleton R, Middaugh CR, McElroy RD (1977) Comparison of proteins from thermophilic and nonthermophilic sources in terms of structural parameters inferred from amino acid composition. Int J Pept Protein Res 10:39–50
Sneath PHA (1966) Relations between chemical structure and biological activity in peptides. J Theor Biol 12:157–195
Sternberg HJE, Thornten YM (1977) On the conformation of proteins: hydrophobic ordering of strands in β-pleated sheets. J Mol Biol 115:1–17
Topal MD, Fresco JR (1976) Complementary base pairing and the origin of substitution mutations. Nature 263:285–289
Vogel F, Kopun M (1977) Higher frequencies of transitions among point mutations. J Mol Evol 9:159–180
Vogel F, Röhrborn G (1966) Amino-acid substitution in hemoglobins and the mutation process. Nature 210:116–117
Wain-Hobson S, Nussinov R, Brown RJ, Sussman JL (1981) Preferential codon usage in genes. Gene 13:355–364
Weinberg G, Ullman B, Martin DW (1981) Mutator phenotypes in mammalian cell mutants with distinct biochemical defects and abnormal deoxyribonucleoside triphosphate pools. Proc Natl Acad Sci USA 78:2447–2451
Woese CR (1973) Evolution of the genetic code. Naturwissenschaften 60:447–459
Wolfenden RV, Cullis PM, Southgate CCF (1979) Water, protein folding and the genetic code. Science 206:575–577
Wolfenden RV, Andersson L, Cullis PM, Southgate CCF (1981) Affinities of amino acid side chains for solvent water. Biochemistry 20:849–855
Wolkenstein MV (1979) Mutation and the value of information. J Theor Biol 80:155–169
Wong JT (1975) A co-evolution theory of genetic code. Proc Natl Acad Sci USA 72:1909–1912
Yano T, Hasegawa M (1974) Entropy increase of amino acid sequence in proteins. J. Mol Evol 4:179–187
Yarus M (1979) The accuracy of translation. Prog Nucleic Acid Res Mol Biol 23:195–225
Zamyatnin AA (1972) Protein volume in solution. Prog Biophys Mol Biol 24:107–123
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Frömmel, C., Holzhütter, H.G. An estimate on the effect of point mutation and natural selection on the rate of amino acid replacement in proteins. J Mol Evol 21, 233–257 (1985). https://doi.org/10.1007/BF02102357
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF02102357