Summary
The Escherichia coli K-12 genetic map was divided into intervals of equal length, and the number of genes in each interval was counted. Plots of genes per interval at four interval lengths revealed large-scale clustering of genes, with the major clusters regularly spaced along the map. Properties of the major gene clusters were analyzed at a scale of 100 intervals, each corresponding to 1 min of the genetic map. In each major gene cluster, the highest gene concentration was observed at or near the midpoint interval, and the number of genes per interval declined exponentially with linear distance from the midpoint, or from the interval of peak gene concentration, of that cluster. An autocorrelation analysis of gene content in first-neighbor intervals throughout the chromosome revealed an ordered first-neighbor relationship when compared with 2,000 interval-randomized versions of the chromosome. Attempts to simulate gene placement with a Gaussian model did not produce large-scale gene clustering comparable to that observed on the chromosome. We propose that major gene clusters formed from smaller gene clusters, and that the contemporary chromosome formed by fusion of homologous or heterologous major gene clusters.
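The interval counting and first-neighbor comparison described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' procedure: the function names, the choice of a lag-1 autocorrelation coefficient as the first-neighbor statistic, the circular treatment of the map, and the shuffle-based comparison are all hypothetical choices made here for clarity. The summary states only that an autocorrelation analysis was compared against 2,000 randomized versions.

```python
import random


def genes_per_interval(positions, n_intervals=100, map_length=100.0):
    """Count genes in equal-length intervals of a circular map.

    positions: gene map positions in minutes (hypothetical input format).
    """
    counts = [0] * n_intervals
    width = map_length / n_intervals
    for p in positions:
        # Wrap positions onto the circular map, then find the interval index.
        index = int((p % map_length) / width) % n_intervals
        counts[index] += 1
    return counts


def lag1_autocorrelation(counts):
    """Lag-1 autocorrelation of interval counts on a circular map.

    Assumed statistic: covariance of each interval with its next neighbor,
    divided by the variance of all intervals.
    """
    n = len(counts)
    mean = sum(counts) / n
    var = sum((c - mean) ** 2 for c in counts)
    if var == 0:
        return 0.0  # A flat profile carries no neighbor structure.
    cov = sum(
        (counts[i] - mean) * (counts[(i + 1) % n] - mean) for i in range(n)
    )
    return cov / var


def compare_to_shuffled(counts, n_shuffles=2000, seed=0):
    """Compare the observed statistic with shuffled interval orders.

    Returns the observed autocorrelation and the fraction of shuffles
    (with the usual +1 correction) that reach or exceed it.
    """
    rng = random.Random(seed)
    observed = lag1_autocorrelation(counts)
    shuffled = list(counts)
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(shuffled)
        if lag1_autocorrelation(shuffled) >= observed:
            hits += 1
    return observed, (hits + 1) / (n_shuffles + 1)
```

With real data, `positions` would hold the map position of each gene in minutes. A small fraction returned by `compare_to_shuffled` would indicate that neighboring intervals resemble each other more than expected when interval order is random.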
Williamson, R.M., Hetherington, J. & Jackson, J.H. Detection of fundamental principles and a level of order for large-scale gene clustering on the Escherichia coli chromosome. J Mol Evol 36, 347–360 (1993). https://doi.org/10.1007/BF00182182
DOI: https://doi.org/10.1007/BF00182182