Summary
Chaos game representation (CGR) is a novel holistic approach that provides a visual image of a DNA sequence quite different from the traditional linear arrangement of nucleotides. Although it is known that CGR patterns depict base composition and sequentiality, the biological significance of the specific features of each pattern is not understood. To systematically examine these features, we have examined the coding sequences of 7 human globin genes and 29 relatively conserved alcohol dehydrogenase (Adh) genes from phylogenetically divergent species. The CGRs of human globin cDNAs were similar to one another and to the entire human globin gene complex. Interestingly, human globin CGRs were also strikingly similar to human Adh CGRs. Adh CGRs were similar for genes of the same or closely related species but were different for relatively conserved Adh genes from distantly related species. Dinucleotide frequencies may account for the self-similar pattern that is characteristic of vertebrate CGRs and the genome-specific features of CGR patterns. Mutational frequencies of dinucleotides may vary among genome types. The special features of CG dinucleotides of vertebrates represent such an example. The CGR patterns examined thus far suggest that the evolution of a gene and its coding sequence should not be examined in isolation. Consideration should be given to genome-specific differential mutation rates for different dinucleotides or specific oligonucleotides.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Bilofsky HS, Burks C (1988) The GenBank genetic sequence data bank. Nucleic Acids Res 16:1861–1863
Bird AP (1980) DNA methylation and the frequency of CpG in animal DNA. Nucleic Acids Res 8:1499–1504
Coulondre C, Miller JH, Farabaugh PJ, Gilbert W (1978) Molecular basis of base substitution hot spots in Escherichia coli. Nature 274:775–780
Devereux J, Haeberli P, Smithies O (1981) A comprehensive set of sequence analysis programs for the VAX. Nucleic Acids Res 12:387–395
Ehrlich M, Wang RY-H (1981) 5 Methylcytosine in eukaryotic DNA. Science 212:1350–1357
Gross RH (1986) A DNA sequence analysis program for the Apple Macintosh. Nucleic Acids Res 14:591–596
Jeffrey HJ (1990) Chaos game representation of gene structure. Nucleic Acids Res 18:2163–2170
May R (1976) Simple mathematical models with very complicated dynamics. Nature 261:459–467
Needleman SB, Wunsch CD (1970) A general method applicable to search for similarities in the amino acid sequence of two proteins. J Mol Biol 48:443–453
Nei M (1987) Molecular evolutionary genetics. Columbia University Press, New York
Russell GJ, Walker PMB, Elton RA, Subad-Sharpe JH (1976) Doublet frequency analysis of fractionated vertebrate nuclear DNA. J Mol Biol 108:1–23
Wilkinson L (1991) Systat: the system for statistics. Systat Inc., Evanston, IL
Yokoyama S, Yokoyama R, Kinlaw CS, Harry DE (1990) Molecular evolution of zinc-containing long-chain alcohol dehydrogenase genes. Mol Biol Evol 7:143–154
Author information
Authors and Affiliations
Additional information
Offprint requests to: S. M. Singh
Rights and permissions
About this article
Cite this article
Hill, K.A., Schisler, N.J. & Singh, S.M. Chaos game representation of coding regions of human globin genes and alcohol dehydrogenase genes of phylogenetically divergent species. J Mol Evol 35, 261–269 (1992). https://doi.org/10.1007/BF00178602
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/BF00178602