Abstract
The frequency of two-base tracts is surveyed in a wide range of eukaryotic genomes using the special program TRACTS. All three two-base families are surveyed: R.Y (A,G.C,T), K.M (A,C.G,T), and S;W (A.T and G.C). Data for the human β-globin complex, for the tobacco chloroplast, and for 247 nt mammalian promoter regions are presented. All two-base tracts longer than three or four bases are overrepresented to an extent surpassing by far their occurrence in a randomized DNA population in the majority of the genomic regions analyzed; 20–30 long tracts are quite frequent, against the statistical odds. R.Y tracts are found at the largest excess, K.M tract to a slightly lesser extent, while S.W tracts are found at a moderate yet significant excess. The majority of the tracts manifest only a limited extent of tandem repeat structures. The idea that the two base tracts serve as unwinding elements is considered.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Beckmann JS, Weber JL (1992) Survey of human and rat microsattelites. Genomics 12:627–631
Behe MJ (1987) The DNA sequence of the human β-globin region is strongly biased in favor of long strings of contiguous purine or pyrimidine residues. Biochemistry 26:7870–7875
Bernardi G, Mouchiroud D, Gautier C, Bernardi G (1988) Compositional patterns in vertebrate genomes: conservation and change in evolution. J Mol Evol 28:7–18
Bernardi G, Olofsson B, Filipski J, Zerial M, Salinas J, Cuny J, Meunier-Rotival M, Rodier F (1985) The mosaic genome of warmblooded vertebrates. Science 228:953–958
Birnboim HC, Sederoff RR, Paterson MC (1979) Distribution of polypyrimidine. Polypurine segments in DNA from diverse organisms. Eur J Biochem 98:301–307
Birnboim HC, Holford RM, Seligy VL (1976) Random phasing of polypyrimidine/polypurine segments and nucleosome monomers in chromatin from mouse L cells. Cold Spring Harbor Symp 39:1161–1165
Bramhill D, Kornberg A (1988) A model for initiation at origins of DNA initiation. Cell 54:915–917
Britten RJ, Davidson EH (1969) Gene regulation for higher celss: a theory. Science 165:349–357
Bucher P (1991) Eukaryotic promoter database. NETSERVE@EMBL-Heidelberg.DE
Bucher P, Yagil G (1991) The occurrence of oligopurine.oligopyrimidine tracts in eukaryotic and prokaryotic genes. DNA Sequence 1:27–43
Bucher P, Trifonov EN (1986) Compilation and analysis of eukaryotic POL 2 promoter sequences. Nucleic Acids Res 14:10009–10026
Campbell A, Botstein D (1983) Evolution of lamboid phages. In: Lambda II. Cold Spring Harbor Lab Press, New York, pp 365–380
Case ST, Baker RF (1975) Detection of long eukaryote-specific pyrimidine runs in repetitive DNA sequences and their relation to single-stranded regions in DNA isolated from sea urchin embryos. J Mol Biol 98:69–92
Chargaff E (1963) Essays in nucleic acids. Elsevier, Amsterdam, 1:126ff
Christophe D, Cabrera B, Bacolla A, Targovnik H, Pohl V, Vassart G (1985) An unusually long poly(purine)-poly(pyrimidine) sequence is located upstream from the human thyroglobulin gene. Nucleic Acid Res 13:5127–5144
Epplen JT (1988) On simple repeated GATA sequence in animal genomes: a critical reappraisal. Heredity 79:409–417
Gall JG, Atherton DD (1974) Satellite DNA sequences invirilis. J Mol Biol 85:633–664
Greaves DR, Patient RK (1985) (AT)n is an interspersed repeat in the Xenopus genome. EMBO J 4:2617–2626
Hayes TE, Dixon JE (1985) Z-DNA in the rat somatostatin gene. J Biol Chem 260:8145–8156
Karlin S, Ghandour G (1985) The use of multiple alphabets in kappa-gene immunoglobulin DNA sequence comparisons. EMBO J 4:1217–1223
Kowalski D, Eddy MJ (1989) The DNA unwinding element: A novel, cis acting component that facilitates the opening of theE. coli replication origin. EMBO J 8:4335–4339
Kozhukhin GC, Pevzner PA (1991) Genome inhomogeneity is determined mainly by WW and SS dinucleotides. CABIOS 7:39–49
Maher LJ, Dervan PB, Wold B (1992) Analysis of promoter-specific repression by triple-helical DNA complex in a eukaryotic cell-free transcription system. Biochemistry 31:70–81
O'Neill DO, Bornschlegel K, Flamm M, Castle M, Bank A (1991) A DNA binding factor in adult hematopoeitic cells interacts with a pyrimidine rich domain upstream from the human delta globin gene. Proc Natl Acad Sci USA 88:8953–8957
Palecek E (1991) Local supercoil-stabilized DNA structures. Crit Rev Biochem Mol Biol 26:151–226
Rippe K, Fritsch V, Westhof E, Jovin TM (1992) Alternating d(G-A) sequences form a parallel stranded DNA homoduplex. EMBO J 11:3777–3786
Ruskin B, Green MR (1985) The role of the 3′ splice sites concensus sequence in mammalian pre-mRNA splicing. Cell 52:207–219
Schlotterer C, Tautz D (1992) Slippage synthesis of simple sequence DNA. Nucleic Acid Res 20:211–215
Shapiro HS, Rudner R, Miura KI, Chargaff E (1965) Inferences from the distribution of pyrimidine isostichs in deoxribonucleic acids. Nature 205:1068–1070
Siegfried E, Thomas GH, Bond UM, Elgin SCR (1986) Characterization of a supercoil dependent S1 sensitive site 5′ to theD. melanogaster hsp26 gene. Nucleic Acids Res 14:9425–9441
Sprizhitski Yu A, Nechipurenko Yu D, Alexandrov AA, Volkenstein MV (1988) Statistical analysis of nucleotide runs in coding and noncoding DNA sequences. J Biomol Struc Dyn 6:345–358
Tamm C, Shapiro HS, Lipshitz R, Chargaff E (1953) Distribution density of nucleotides within a desoxyribonucleic acid chain. J Biol Chem 203:673–698
Tautz D (1989) Hypervariability of simple sequences as a general source for polymorphic DNA markers. Nucleic Acids Res 17:6463–6471
Tautz D, Rentz M (1984) Simple DNA sequences of D Virilis is detected by screening with RNA. J Mol Biol 172:229–235
Tautz D, Trick M, Dover GA (1986) Cryptic simplicity in DNA is a major source of genetic variation. Nature 322:652–656
Umek RM, Kowalski D (1990) The DNA unwinding element in a yeast replication origin functions independently of easily unwound sequences present elsewhere in the plasmid. Nucleic Acids Res 18:6601–6617
Weber JL (1990) Human DNA polymorphisms and methods of analysis. Curr Opin Biotechnol 1:166–171
Wells RD, Collier DA, Hanvey JC, Shimizu M, Wohlrab F (1988) The chemistry and biology adopted by oligopurine.oligopyrimidine sequences. FASEB J 2:2939–2949
Yagil G (1991) Paranemic structures of DNA and their role in DNA unwinding. Crit Rev Biochem Mol Biol 26:475–559
Yu Y-T, Manley JL (1986) Structure and function of the S1 nuclease-sensitive site in the adenovirus late promoter. Cell 45:743–751
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Yagil, G. The frequency of two-base tracts in eukaryotic genomes. J Mol Evol 37, 123–130 (1993). https://doi.org/10.1007/BF02407347
Issue Date:
DOI: https://doi.org/10.1007/BF02407347