Summary
On the average in the coding sequences of 30 eucaryotic structural genes the weak hydrogen bonding, W, (A or T) or strong hydrogen bonding, S, (C or G) base in codon site 3 was chosen to be unlike its neighbors on both sides up to two sites away. This preference produced the nonrandom excess of runs W and S of length one and two and the defict of long runs observed earlier (Blaisdell 1982). The neighbors in the different codon, 3′ to codon site 3, were as important in determining the choice as were the neighbors 5′ in the same codon. Every amino acid except methionine and tryptophan, of least frequent occurrence, permits choice of W or S. The persistence of this preference could explain the observation that the rate of substitution of codon site 3 in fuctional genes is considerably less than in synonymous pseudo genes.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Altenburger W, Neumaier PS, Steinmetz M, Zachau HG (1981) DNA sequence of the constant region of the mouse immunoglobulin kappa chain. Nucleic Acids Res 9:971–981
Baralle FE, Shoulders CG, Proudfoot NJ (1980a) The primary structure of the human epsilon-globin gene. Cell 21:621–626
Baralle FE, Shoulders CC, Goodbourn S, Jeffreys A, Proudfoot NJ (1980b) The 5′ flanking region of human epsilon globin gene. Nucleic Acids Res 8:4393–4404
Becker RA, Chambers JM (1981) S, a language and system for data analysis. Bell Laboratories, Murray Hill
Bell GI, Pictet RL, Rutter WJ, Cordell B, Tischer E, Goodman HM (1980a) Sequence of the human insulin gene. Nature 284:26–32
Bell GI, Pictet R, Rutter WJ (1980b) Analysis of the regions flanking the human insulin gene and sequence of an Alu family member. Nucleic Acids Res 8:4091–4109
Blaisdell BE (1982) A prevalent persistent global nonrandomness that distinguishes coding and noncoding eucaryotic nuclear DNA sequences. J Mol Evol (in press)
Breathnach R, Benoist C, O'Hare K, Gannon F, Chambon P (1978) Ovalbumin gene: evidence for a leader sequence in mRNA and DNA sequences at the exon-intron boundaries. Proc Natl Acad Sci USA 75:4853–4857
Bullock E, Elton RA (1972) Dipeptide frequencies in proteins and the CpG deficiency in vertebrate DNA. J Mol Evol 1:315–325
Chang ACY, Cochet M, Cohen SN (1980) Structural organization of human genomic DNA encoding the proopiomelanocortin peptide. Proc Natl Acad Sci USA 77:4890–4894
Duncan DB (1955) Multiple range and multiple F tests. Biometrics 11:1–42
Efstratiadis A, Posakony JW, Maniatis T, Lawn RM, O'Connell C, Spritz RA, De Riel JK, Forget BG, Weissman SM, Slightom JL, Blechl AE, Smithies O, Baralle FE, Shoulders CC, Proudfoot NJ (1980) The structure and evolution of the human beta-globin gene family. Cell 21:653–668
Feller W (1967) An introduction to probability theory and its applications, 3rd edition, John Wiley & Sons, New York
Goeddel DV, Yelverlon E, Ullrich A, Heyneker HL, Miozzari G, Holmes W, Seeburg PH, Dull T, May L, Stebbins N, Crea R, Maeda S, McCandliss R, Sloma A, Tabor JM, Gross M, Familetti PC, Pestka S (1980) Human leukocyte interferon produced by E. coli is biologically active. Nature 287:411–416
Grantham R, Gautier C, Gouy M, Mercier R, Pave A (1980) Codon catalog usage and the genome hypothesis. Nucleic Acids Res 8:r49-r62
Grantham R, Gautier C, Gouy M, Jacobzone M, Mercier R (1981) Codon catalog usage in a genome strategy modulated for gene expressivity. Nucleic Acids Res 9:r43-r74
Gubbins EJ, Maurer RA, Lagrimini M, Erwin CR, Donelson JE (1980) Structure of the rat prolactin gene. J Biol Chem 225:8655–8662
Hardison RC, Butler ET, Lacy E, Maniatis T, Rosenthal N, Efstratiadis A (1979) The structure and transcription of four linked rabbit beta-like globin genes. Cell 18:1285–1297
Heindell HC, Liu A, Paddock GV, Studnicka GM, Salser WA (1978) The primary sequence of rabbit alpha globin in mRNA. Cell 15:43–54
Hieter PA, Max EE, Seidman JG, Meizel JV, Leder P (1980) Cloned human and mouse kappa immunoglobulin constant and J region genes conserve homology in functional segments. Cell 22:197–207
Holland JP, Holland MJ (1979) The primary structure of a glyceraldehyde-3-phosphate dehydrogenase gene from Saccharomyces cerevisiae. J Biol Chem 254:9839–9845
Kafatos FC, Efstratiadis A, Forget BG, Weissman SM (1977) Molecular evolution of human and rabbit beta globin mRNAs. Proc Natl Acad Sci USA 74:5618–5622
Kataoka T, Kawakami T, Takahashi N, Honjo T (1980) Rearrangement of immunoglobulin gamma-1 chain gene and mechanism for heavy-chain class switch. Proc Natl Acad Sci USA 77:919–923
King FL, Jukes TH (1969) Non-Darwinian evolution. Science 164:788–798
Konkel DA, Maizel JV, Leder P (1979) The evolution and sequence comparison of two recently diverged mouse chromosome beta-globin genes. Cell 18:865–873
Lawn RM, Efstratiadis A, O'Connell C, Maniatis T (1980) The nucleotide sequence of the human beta-globin gene. Cell 21:647, 651
Lawn RM, Adelman J, Franke AE, Houck M, Cross M, Najarian R, Coeddel OV (1981) Human fibroblast interferon gene lacks introns. Nucleic Acids Res 9:1045–1052
Lehmann EL (1975) Nonparametrics. Holden-Day, San Fransisco, p 239
Li W, Gojobori T, Nei M (1981) Pseudo genes as a paradigm of neutral evolution. Nature 292:237–239
Lomedico P, Rosenthal N, Efstratiadis A, Gilbert W, Kolodner R, Tizard R (1979) The structure and evolution of the two nonallelic rat preproinsulin genes. Cell 18:545–558
Miller RG (1981) Simultaneous Statistical Inference. 2nd Edition, Springer, New York, p. 157
Newell N, Richards JE, Tucker PW, Blattner FR (1980) J genes for heavy chain immunoglobulins of mouse. Science 209:1128–1132
Ng R, Abelson J (1980) Isolation and sequence of the gene for actin in Saccharomyces cerevisiae. Proc Natl Acad Sci USA 77:3912–3916
Nishioka Y, Leder P (1979) The complete sequence of a chromosomal mouse alpha-globin gene reveals elements conserved throughout vertebrate evolution. Cell 18:875–882
Nishioka Y, Leder PJ (1980) Organization and complete sequence of identical embryonic and plasmacytoma kappa V-region genes. Biol Chem 255:3691–3694
Pan J, Elder JT, Duncan CH, Weissman SM (1981) Structural analysis of interspersed repetitive polymerase III transcription units in human DNA. Nucleic Acids Res 9:1151–1170
Peck LF, Wang JC (1981) Sequence dependence of the helical repeat of DNA in solution. Nature 292:375–378
Perler F, Efstratiadis A, Lomedico P, Gilbert W, Kolodner R, Dodgson J (1980) The evolution of genes: the chicken preproinsulin gene. Cell 20:555–566
Proudfoot NJ, Brownlee CG (1976) Noncodong region sequences in eucaryotic messenger RNA. Nature 263:211–214
Proudfoot NJ, Maniatis T (1980) The structure of a human alpha globin pseudogene and its relationship to alpha globin gene duplication. Cell 21:537–544
Rhodes D, Klug A (1981) Sequence dependent helical periodicity of DNA. Nature 292:378–380
Robertson MA, Staden R, Tanaka Y, Catterall JF, O'Malley Brownlee CG (1979) Sequence of three introns of the chick ovalbumin gene. Nature 278:370–372
Sakano H, Huppi K, Heinrich G, Tonegawa S (1979) Sequences at the somatic recombination sites of immunoglobulin light chain genes. Nature 280:288–294
Sakano H, Maki R, Kurosawa Y, Roeder W, Tonegawa S (1980) Two types of somatic recombination are necessary for the generation of complete immunoglobulin heavy chain genes. Nature 286:676–683
Slightom JL, Blechl AE, Smithies O (1980) Human fetal G-gamma and A-gamma globin genes: complete nucleotide sequences suggest that DNA can be exchanged between these duplicated genes. Cell 21:627–638
Spritz RA, De Riel JK, Forget BG, Weissman SM (1980) Complete nucleotide sequence of the human delta-globin gene. Cell 21:639–646
Sun SM, Slightom JL, Hall TC (1981) Intervening sequences in a plant gene: comparison of the partial sequence of cDNA and genomic DNA of French bean phaseolin. Nature 289:37–41
Sures I, Lowry J, Kedes LH (1978) The DNA sequence of sea urchin (S. purpuratus) H2A, H2B and H3 histone coding and spacer regions. Cell 15:1033–1044
Takahashi N, Kataoka T, Honjo T (1980) Nucleotide sequences of class-switch recombination region of the mouse immunoglobulin gamma2b-chain gene. Gene 11:117–127
Tilghman SM, Tiemeier DC, Seidman JG, Peterlin BM, Sullivan M, Maizel JV, Leder P (1978) Intervening sequence of DNA identified in the structural portion of a mouse beta globin gene. Proc Natl Acad Sci USA 75:725–729
Tschumper G, Carbon J (1980) Sequence of a yeast fragment containing a chromosomal replicator and the TRPI gene. Gene 10:157–166
Tsujimoto Y, Suzuki Y (1979) The DNA sequence of B bombyx mori fibroin gene including the 5′ flanking, mRNA coding, entire intervening and fibroin protein coding regions. Cell 18:591–600
Ullrich A, Dull TJ, Gray A, Brosius J, Sures I (1980) Genetic variation in the human insulin gene. Science 209:612–615
van Ooyen A, van den Berg J, Mantei N, Weissmann C (1979) Comparison of total sequence of a cloned rabbit beta-globin gene and its flanking regions with a homologous mouse sequence. Science 206:337–344
Young RA, Hagenbuchle O, Schibler U (1981) A single mouse alpha-amylase gene specifies two different tissue-specific mRNAs. Cell 23:451–458
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Blaisdell, B.E. Choice of base at silent codon site 3 is not selectively neutral in eucaryotic structural genes: It maintains excess short runs of weak and strong hydrogen bonding bases. J Mol Evol 19, 226–236 (1983). https://doi.org/10.1007/BF02099970
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF02099970