Summary
The common but generally overlooked problem of how best to construct phylogenies from orthologous amino acid sequences, when their alignment requires the placement therein of gaps denoting insertions/deletions in the evolutionary history of their genes since their common ancestor, has been studied. Three diverse methods were examined: 1. each missing residue in a gap is weighted as equivalent to the average number of minimum nucleotide replacements in known conjugate amino acid pairs of those same two sequences, which weight necessarily differs for each pair of sequences; 2. each missing residue in a gap is weighted as equivalent to a fixed number of nucleotide replacements; and 3. each gap, regardless of length, is weighted as equivalent to a fixed number of nucleotide replacements. For the flavodoxins, each method yielded a different best tree and suggests that the choice of method may be crucial. For the plant ferredoxins, all methods give results inconsistent with botanical classification and suggests the sequences may not all be orthologous. For the bacterial ferredoxins, the method was less germane than the actual weight used, five different best trees being obtained depending upon the weight. The best tree for all ferredoxins (prokaryotic plus eukaryotic) combined proved to be greatly dependent upon the gap locations with several reasonable alignments yielding different best trees. They also suggest that functional equivalence may well prove to be a poor guide to which residues have a common ancestral codon. The rubredoxin sequences show that a partial internal gene duplication occurred in thePseudomonas line, probably very soon after its divergence from the other genera. Together, the results clearly indicate that the phylogenetic answer one gets may greatly depend upon how one treats the gaps but they fail to indicate what treatment may be best. This results partly from the fact that the phylogenies of the taxa represented are not known with sufficient confidence to be sure when the procedures are performing best.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Adman, E.T., Sieker, L.C., Jensen, L.H. (1973). J.Biol.Chem. 248, 3987–3996
Aggarwal, S.J., Rao, K.K., Matsubara, H. (1971). J.Biochem. (Tokyo), 69, 601–603
Arnheim, N., Steller, R. (1970). Arch.Biochem.Biophys. 141, 656–661
Azari, P., Tsunoda, J., Glantz, M., Mayhew, S., Yasunoba, K.T. (1970). unpublished
Bachmayer, H., Benson, A.M., Garrard, W.T., Yasunobu, K.T., Whiteley, H.R. (1968a). Biochem. 7, 986–996
Bachmayer, H., Mayhew, S., Peel, J., Yasunobu, K.T. (1968b). J.Biol. Chem. 243, 1022–1030
Benson, A.M., Mower, H.F., Yasunobu, K.T. (1967). Arch.Biochem.Biophys. 121, 563–575
Benson, A.M., Tomoda, K., Chang, J., Matsueda, G., Lode, E.T., Coon, M.J., Yasunobu, K.T. (1971). Biochem.Biophys.Res.Comm. 42, 640–646
Benson, A.M., Yasunobu, K.T. (1969). J.Biol.Chem. 244, 955–963
Dayhoff, M.O. (1972). Atlas of protein sequence and structure, Vol.V, p. D-39. Washington D.C.: National Biomedical Res. Center
Dubourdieu, M., Legall, J., Fox, J.L. (1973). Biochem.Biophys.Res.Comm. 52, 1418–1425
Eck, R.V., Dayhoff, M.O. (1966). Sci. 152, 363–366
Fitch, W.M. (1966a). J.Mol.Biol. 16, 9–16
Fitch, W.M. (1966b). J.Mol.Biol. 16, 17–27
Fitch, W.M. (1969). Biochem.Genet. 3, 99–108
Fitch, W.M. (1970a). Biochem.Genet. 49, 1–14
Fitch, W.M. (1970b). Biochem.Genet. 49, 15–21
Fitch, W.M. (1971). Syst.Zool. 20, 406–416
Fitch, W.M. (1973). Ann.Rev.Genet. 7, 343–381
Fitch, W.M., Margoliash, E. (1967). Sci. 155, 279–284
Fitch, W.M., Margoliash, E. (1970). Evol.Biol. 4, 67–109
Fox, J.L., Smith, S.S., Brown, J.R. (1972). Z.Naturforsch. 27b, 1096–1100
Jukes, T.H. (1966). Molecules and evolution. N.Y.: Columbia Univ. Press 285 pp.
Keresztes-Nagy, S., Perini, F., Margoliash, E. (1969). J.Biol.Chem. 244, 981–995
Matsubara, H., Sasaki, R.M., Chain, R.K. (1967). Proc.Nat.Acad.Sci. 57, 439–445
Matsubara, H., Jukes, T.H., Cantor, C.R. (1968). Brookhaven Symp.Biol. 21, 201–216
Matsubara, H., Sasaki, R.M., Tsuchiya, D.K., Evans, M.C.W. (1970). J. Biol.Chem. 245, 2121–2131
McCarthy, K.T. (1972). Ph.D.Thesis, George Washington University
Moore, G.W., Barnabas, J., Goodman, J. (1973). J.Theoret.Biol. 38, 459–485
Needleman, S., Wunsch, C.D. (1970). J.Mol.Biol. 48, 443–453
Rall, S.C., Bolinger, R.E., Cole, R.D. (1969). Biochem. 8, 2486–2496
Rao, K.K., Matsubara, H. (1970). Biochem.Biophys.Res.Comm. 38, 500–506
Reichert, T.A., Cohen, D.N., Wong, A.K.C. (1974). J.Theoret.Biol. 42, 245–262
Rossman, M.G., Liljas, A., Bränden, C.-I., Banaszak, L.J. (1975). The enzymes, 3rd edition (in press)
Sneath, P.H.A., Sokal, R.R. (1973). Numerical taxonomy. San Francisco: W.H.Freeman and Co. 573 pp.
Sugeno, K., Matsubara, H. (1969). J.Biol.Chem. 244, 2979–2989
Tanaka, M., Nakashima, T., Benson, A.M., Mower, H., Yasunobu, K.T. (1966). Biochem. 5, 1666–1680
Tanaka, M., Haniu, M., Matsueda, G., Yasunobu, K.T., Himes, R.H., Akagi, J.M., Barnes, E.M., Devanathan, T. (1971a). J.Biol.Chem. 246, 3953–3960
Tanaka, M., Haniu, M., Matsueda, G., Yasunobu, K.T., Mayhew, S., Massey, V. (1971b). Biochem. 10, 3041–3046
Tanaka, M., Haniu, M., Yasunobu, K.T., Himes, R., Akagi, J. (1973a). J.Biol.Chem. 248, 5215–5217
Tanaka, M., Haniu, M., Yasunobu, K.T., Mayhew, S.G., Massey, V. (1973b). J.Biol.Chem. 248, 4354–4366
Tanaka, M., Haniu, M., Yasunobu, K.T., Mayhew, S.G. (1974a). J.Biol.Chem. 249, 4393–4396
Tanaka, M., Haniu, M., Yasunobu, K.T., Jones, J.B., Stadtman, T. (1974b). Biochem. 13, 5284–5289
Tanaka, M., Haniu, M., Yasunobu, K.T., Evans, M.C.W., Rao, K.K. (1974c). Biochem. 13, 2953–2959
Travis, J., Newman, D.J., Le Gall, J., Peck, H.D.Jr. (1971). Biochem. Biophys.Res.Comm. 45, 452–458 (1971)
Tsunoda, J.N., Yasunobu, K.T., Whiteley, H.R. (1968). J.Biol.Chem. 243, 6262–6272
Yasunobu, K.T., Tanaka, M. (1973a). In: Iron-sulfur proteins, Vol. II, W.Lovenberg, ed., p. 27–130. N.Y. Acad. Press
Yasunobu, K.T., Tanaka, M. (1973b). Syst.Zool. 22, 570–587
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Fitch, W.M., Yasunobu, K.T. Phylogenies from amino acid sequences aligned with gaps: The problem of gap weighting. J Mol Evol 5, 1–24 (1975). https://doi.org/10.1007/BF01732010
Received:
Issue Date:
DOI: https://doi.org/10.1007/BF01732010