Summary
One hundred twelve human DNA sequences were analyzed with respect to dinucleotide frequency and amino acid composition. Variation in guanine plus cytosine (G+C) content revealed that (1) at the 2–3 and 3–1 doublet positions, discrimination against CG is attenuated at high G+C, whereas the disfavoring of TA is enhanced, and (2) the frequencies of several amino acids change with G+C content. These findings have been reported in part for collections of sequences from various species. The present study confirms that the G+C effects exist within a single organism, the human. Aspects of the argument that connects G+C with protein thermal stability are also discussed.
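The kind of measurement summarized above can be illustrated with a minimal sketch. It is not the authors' procedure, which the summary does not specify; it assumes an in-frame coding sequence and uses a simple observed/expected ratio under independence of the two positions. The function names `gc_content` and `doublet_ratios` are hypothetical.

```python
from collections import Counter


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty sequence)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def doublet_ratios(cds: str, first_pos: int) -> dict:
    """Observed/expected doublet ratios at one codon-position pair.

    first_pos=2 gives the 2-3 doublets (within a codon);
    first_pos=3 gives the 3-1 doublets (spanning adjacent codons).
    Expected counts assume the two positions are independent
    (an assumption of this sketch, not a claim about the paper).
    """
    cds = cds.upper()
    start = first_pos - 1  # 0-based index of the first base of each doublet
    pairs = [cds[i:i + 2] for i in range(start, len(cds) - 1, 3)]
    pairs = [p for p in pairs if set(p) <= set("ACGT")]  # skip ambiguous bases
    n = len(pairs)
    if n == 0:
        return {}
    pair_counts = Counter(pairs)
    left = Counter(p[0] for p in pairs)
    right = Counter(p[1] for p in pairs)
    return {
        p: count * n / (left[p[0]] * right[p[1]])
        for p, count in pair_counts.items()
    }
```

A ratio below 1 for CG or TA indicates that the doublet occurs less often than its constituent bases would predict. Computing `gc_content` and these ratios per sequence, then comparing the CG and TA ratios across sequences grouped by G+C, would be one way to look for the trends described.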
Cite this article
Hanai, R., Wada, A. The effects of guanine and cytosine variation on dinucleotide frequency and amino acid composition in the human genome. J Mol Evol 27, 321–325 (1988). https://doi.org/10.1007/BF02101194
DOI: https://doi.org/10.1007/BF02101194