Abstract
Extant genomes are the result of repeated duplications and subsequent divergence of primordial genes that assembled the genomes of the first living beings. Increased information on genome maps of different species is revealing conserved syntenies among different vertebrate taxa, which allow to trace back the history of current chromosomes. However, inferring neighboring relationships between genes of more primitive genomes has proven to be very difficult. Most often, the ancestral arrangements of genes have been lost by multiple histories of internal duplications, chromosomal breaks, and large-scale genomic rearrangements. Here we describe a gene arrangement of nonrelated genes that seems to have endured evolution, at least from the separation of the two major clades of bilateria: deuterostomia and protostomia, approximately 1 billion years ago. In its simplest conception, this gene cluster, named EVG, groups the genes for a glucose transporter, an enolase, and a vesicle-associated membrane protein (VAMP). EVG might represent the evolutionary remnants of the gene organization of an ancient bilaterian genome.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
How primitive genomes were organized is an intriguing question, the answer to which can only be inferred from survived features in the current genomes (Birnbaum et al. 2000). Most of the gene associations that have been conserved are of genes either structurally (indicative of a common origin) or functionally (indicative of selective pressures) related. Here we describe a gene association of apparently nonrelated genes that appears to have survived evolution from the Urbilateria (De Robertis and Sasai 1996), the distant ancestor of current bilateria. We found that genes for glucose transporter-3 (SLC2A3), synaptobrevin-1 (VAMP1), and γ-enolase (ENO2) map within a region of approximately 3 Mb of human chromosome 12 (12p13) (Fig. 1A). This region has been shown to be syntenic with the 17p12 region of human chromosome 17 (Lundin 1993). Consistently, the genes for β-enolase (ENO3), the synaptobrevin-2 (VAMP2), and Glut-4 (SLC2A4) map within 2.5 Mb of this region of chromosome 17. A third syntenic group was localized in human chromosome 1, at 1p35-p31. This region included the genes for α-enolase (ENO1), synaptobrevin-3 (VAMP3), and Glut-1 (SLC2A1). These gene groupings of human chromosomes 1, 12, and 17 represent a conserved gene association that we named EVG (from eno-vamp-glut).
EVG seems to be also present in the genomes of Drosophila melanogaster and Caenorhabditis elegans. These species possess homologues for SLC2A, VAMP, and Eno in an arrangement reminiscent of EVG (Fig. 1b). CeEVG spans 512 kb of C. elegans chromosome II (1998), however, DmEVG is split into two regions: that of DmGlut1 (Escher and Rasmuson-Lestander 1999) and Dmn-syb (a VAMP gene (DiAntonio et al. 1993)), which maps in 684 kbp near the telomere of chromosome 3R of the fly, and that of the enolase gene (Eno), located in the 22A band of chromosome 2L (Bishop and Corces 1990).
The statistical significance of these groupings of genes was estimated according to Trachtulec and Forejt (2001), modified for paralogue substitution by multiplying the probability of by-chance association by the number of all possible paralogues of each gene. These estimates gave low probabilities that EVGs have occurred by chance (7.3 × 10−3, 2.1 × 10−5, and 1.4 × 10−5 for human EVG1, EVG12, and EVG17 and 7.5 × 10−4 and 3 × 10−2 for C. elegans and Drosophila, EVGs, respectively). Overall, the cumulative probability that human, Drosophila, and C. elegans EVGs could have occurred by chance and in an unrelated manner is very low (4.8 x 10−17), reinforcing the idea that the EVGs of the three species are evolutionary related and not the result of genetic convergence.
In the mammalian lineage, EVG has experienced at least two rounds of duplications, which gave rise to the current three human EVGs. Since teleost fish possess, at least, two Glut, two Eno, and two VAMP genes, EVG could have already duplicated at least once when teleostei appeared. The existence of three EVGs in humans supports the hypothesis of Susumo Ohno (1999) on vertebrate genome evolution, which portends that two genome-wide duplications (tetraplodization) of an ancestral genome have occurred after the separation of vertebrates and invertebrates. This hypothesis has been recently revived by several reports on the existence of en bloc duplications in vertebrate genomes (Abi-Rached et al. 2002; Gu et al. 2002; McLysaght et al. 2002; reviewed by Spring 2002).
The existence of EVG in D. melanogaster and C. elegans suggests that this gene arrangement appeared in evolution before the separation of these phyla from vertebrates. These three taxa separated between 670 (Ayala and Rzhetsky 1998) and 993–1177 (Wang et al. 1999) Myr (million years) ago, which suggests an estimated age for EVG that could be more than 1 billion years, far before the Cambrian explosion. EVG was not found in Saccharomyces cerevisiae or Schizosaccharomyces pombe genomes. Arabidopsis thaliana has putative genes for a sugar transporter, a synaptobrevin (VAMP), and an enolase, which map within 3.6 Mb of chromosome II. Although the statistical significance of this gene grouping is comparable to that of EVG, we should be cautious in considering it as a plant version of EVG, because of the rather low sequence similarities of A. thaliana VAMP and Glut-like genes to their animal counterparts.
During evolution, gene rearrangements have occurred among EVG’s genes, although they did not change the general topology of the syntenies. For example, the VAMP-like gene within Ce- EVG (Vamp[II]) is not the authentic orthologue of the VAMP gene present in any of the three human EVGs. The most probable orthologue, Ce-Snb-1 (1998), is located on chromosome V. The evolution of EVG has also permitted the inclusion of other genes in the group, besides Glut, VAMP, and Eno, that define the association. However, these extra genes are not conserved among all EVGs, not are they among species, and thus they were probably not integral to the primitive EVG, but later evolutionary additions to the association.
Primitive genomes can be regarded as collections of genes in linkage groups or primordial chromosomes. Different evolutionary histories have led to the current chromosomal situation, with most of the original gene vicinity relationships being lost. The evolutionary persistence of EVG is quite striking, since Ecdysozoa and Vertebrata, the two major clades where EVG has been found so far, proceeded quite differently in evolution. It should be expected in some way that different gene organizations should have evolved, due to the enormous cytogenetic changes that genomes have experienced during this long-duration evolution. Exceptions to these are gene clusters that have been maintained due to selective pressures or that have originated by successive internal duplications of one or several ancestral genes. Long-lasting gene associations usually group genes with some kind of relationship, either structural or functional. Although several examples of conserved syntenies exist among vertebrate genomes, these are much more scarce between invertebrates and vertebrates and even more difficult to find when the search is extended to distantly related phyla like Ecdysozoa and Vertebrata.
Table 1 summarizes the known common syntenies found in current Ecdysozoa (D. melanogaster and C. elegans are representatives) and vertebrates. The best-known examples are the clusters of homeobox genes, which include the Hox, ParaHox, NKL, and EHGbox gene clusters (Pollard and Holland 2000). These clusters group genes structurally and functionally related and a common evolutionary origin is obvious. Their conservation is thought to be due to the selective pressures imposed by their coordinated expression during embryonic development. The Wnt genes provide another example of this type of preserved gene associations. The Wnt-related genes form a cluster in Drosophila melanogaster and two paralogous groups have been found in the human genome, which appear to have originated by genomic duplication (Nusse 2001). Again, the Wnt cluster groups genes structurally related (derived from a common ancestor by gene endoduplication) and are involved in functionally related roles, which could have been instrumental for their evolutionary conservation.
However, syntenies of genes structurally or functionally unrelated that have been maintained from the separation of Ecdysozoa and Vertebrata are much more rare. Table 1 includes the three known examples of syntenies of nonrelated genes, which are found in the genomes of D. melanogaster, C. elegans, and humans: the FGFR/VMAT (Pebusque et al. 1998), the TBP/C5 (Trachtulec and Forejt 2001; Trachtulec et al. 1997), and EVG (this article). These syntenies could represent the vestiges of the gene organization of an ancient urbilaterian genome. Of all three, EVG has the smaller size, in base pairs, in the D. melanogaster (EVG, 684 kbp, vs FGFR/VMAT, 18 Mbp, and TBP/C5, 2.5 Mbp) and C. elegans (EVG, 512 kbp, vs FGFR/VMAT, 6–8 Mbp, and TBP/C5, 4.7 Mbp) genomes.
What did determine the evolutionary conservation of EVG? The final answer to this question may be difficult to find. However, the evolutionary persistence of this synteny may shed light on the reasons why it originated. It is tempting to speculate that the evolutionary appearance of EVG might be related to the original functions of these genes and the selective advantage that their linkage could represent for the organism, in the rapidly changing environment of earth during those primeval times. Genes clustered in each human EVG seem to share a similar pattern of expression. Thus, SLC2A4, VAMP2, and ENO3, which define human EVG-17, are expressed predominantly in muscle. Human EVG-12 genes are expressed in neuronal tissue, whereas genes of human EVG-1 are broadly expressed. This hints at mechanisms of concerted gene expression as a possible explanation for the evolutionary appearance of EVG. Enolase and glucose transporters (SLC2As) are both involved in glucose metabolism, and the vesicular transport of Glut-4 (SLC2A4) is dependent on VAMP2 (Ramm et al. 2000; Randhawa et al. 2000). It seems reasonable that, for the primitive organism where the EVG association appeared, the linkage of these genes could confer a selective advantage, probably based on mechanisms of concerted gene expression of the three genes. Thus, although the concomitant gene expression of the current EVG genes may not necessarily be relevant for their present functions, it could be so at the origin of the EVG association, and what remains today are just the relics of that relevance.
The coordinated expression of the ancestral EVG genes could be attained by way of a peculiar genomic structure of the chromosomal region where these genes localized. This could rely on large-scale genomic structures that define chromatin loops important for the expression of the genes within the loop. Recent reports on large-scale gene expression mapping have shown the clustering of housekeeping and highly expressed genes in chromosomal domains (Boutanaev et al. 2002; Caron et al. 2001; Lercher et al. 2002, 2003; reviewed by Oliver et al. 2002; Spellman and Rubin 2002). These observations demonstrated that genes localized near chromosomal locations often showed similar expression patterns. The underlying molecular mechanisms are still a matter of discussion, although regulatory elements, such as insulators (West et al. 2002), nuclear compartmentalization (Carmo-Fonseca 2002), and genes of the Polycomb and Trithorax groups (Simon and Tamkun 2002), have been involved in the establishment and maintenance of large-scale expression domains. This type of mechanism could be relevant for the evolutionary conservation of large chromosomal regions and help to explain the evolutionary persistence of syntenic groups like EVG, even though the genes that define the current syntenies do not seem to share clear functional and/or structural relationships.
References
L Abi-Rached A Gilles T Shiina P Pontarotti H Inoko (2002) ArticleTitleEvidence of en bloc duplication in vertebrate genomes. Nature Genet 31 100–105 Occurrence Handle10.1038/ng855 Occurrence Handle1:CAS:528:DC%2BD38Xjt1Kgu7k%3D Occurrence Handle11967531
FJ Ayala A Rzhetsky (1998) ArticleTitleOrigin of the metazoan phyla: Molecular clocks confirm paleontological estimates. Proc Natl Acad Sci USA 95 606–611 Occurrence Handle10.1073/pnas.95.2.606 Occurrence Handle1:CAS:528:DyaK1cXmtFajsg%3D%3D Occurrence Handle9435239
D Birnbaum F Coulier MJ Pebusque P Pontarotti (2000) ArticleTitle“Paleogenomics”: Looking in the past to the future. J Exp Zool 288 21–22 Occurrence Handle10.1002/(SICI)1097-010X(20000415)288:1<21::AID-JEZ2>3.0.CO;2-Q Occurrence Handle1:STN:280:DC%2BD3c3hsFSjtg%3D%3D Occurrence Handle10750049
JG Bishop VG Corces (1990) ArticleTitleThe nucleotide sequence of a Drosophila melanogaster enolase gene. Nucleic Acids Res 18 191 Occurrence Handle1:CAS:528:DyaK3cXhslegtbc%3D Occurrence Handle2106662
AM Boutanaev AI Kalmykova YY Shevelyov DI Nurminsky (2002) ArticleTitleLarge clusters of co-expressed genes in the Drosophila genome. Nature 420 666–669 Occurrence Handle10.1038/nature01216 Occurrence Handle1:CAS:528:DC%2BD38XpsVSitbk%3D Occurrence Handle12478293
M Carmo-Fonseca (2002) ArticleTitleThe contribution of nuclear compartmentalization to gene regulation. Cell 108 513–521 Occurrence Handle1:CAS:528:DC%2BD38XhvF2jsrY%3D Occurrence Handle11909522
H Caron B van Schaik M van der Mee F Baas G Riggins P van Sluis MC Hermus R van Asperen K Boon PA Voute S Heisterkamp A van Kampen R Versteeg (2001) ArticleTitleThe human transcriptome map: Clustering of highly expressed genes in chromosomal domains. Science 291 1289–1292 Occurrence Handle1:CAS:528:DC%2BD3MXhtlSgs70%3D Occurrence Handle11181992
InstitutionalAuthorNameThe C. elegans Sequencing Consortium (1998) ArticleTitleGenome sequence of the nematode C. elegans: A platform for investigating biology. Science 282 2012–2018 Occurrence Handle9851916
EM De Robertis Y Sasai (1996) ArticleTitleA common plan for dorsoventral patterning in Bilateria. Nature 380 37–40 Occurrence Handle1:CAS:528:DyaK28XhsFeis7c%3D Occurrence Handle8598900
A DiAntonio RW Burgess AC Chin DL Deitcher RH Scheller TL Schwarz (1993) ArticleTitleIdentification and characterization of Drosophila genes for synaptic vesicle proteins. J Neurosci 13 4924–4935 Occurrence Handle1:CAS:528:DyaK2cXlt1Oh Occurrence Handle8229205
SA Escher A Rasmuson-Lestander (1999) ArticleTitleThe Drosophila glucose transporter gene: cDNA sequence, phylogenetic comparisons, analysis of functional sites and secondary structures. Hereditas 130 95–103 Occurrence Handle1:CAS:528:DyaK1MXmtVSmsr8%3D Occurrence Handle10479996
X Gu Y Wang J Gu (2002) ArticleTitleAge distribution of human gene families shows significant roles of both large- and small-scale duplications in vertebrate evolution. Nature Genet 31 205–209 Occurrence Handle10.1038/ng902 Occurrence Handle1:CAS:528:DC%2BD38XktVKgt7o%3D Occurrence Handle12032571
MJ Lercher AO Urrutia LD Hurst (2002) ArticleTitleClustering of housekeeping genes provides a unified model of gene order in the human genome. Nature Genet 31 180–183 Occurrence Handle10.1038/ng887 Occurrence Handle1:CAS:528:DC%2BD38XktVKgtrw%3D Occurrence Handle11992122
MJ Lercher T Blumenthal LD Hurst (2003) ArticleTitleCoexpression of neighboring genes in Caenorhabditis elegans is mostly due to operons and duplicate genes. Genome Res 13 238–243 Occurrence Handle10.1101/gr.553803 Occurrence Handle1:CAS:528:DC%2BD3sXhtlCrsL0%3D Occurrence Handle12566401
LG Lundin (1993) ArticleTitleEvolution of the vertebrate genome as reflected in paralogous chromosomal regions in man and the house mouse. Genomics 16 1–19 Occurrence Handle10.1006/geno.1993.1133 Occurrence Handle1:CAS:528:DyaK3sXkt1Wnsr4%3D Occurrence Handle8486346
A McLysaght K Hokamp KH Wolfe (2002) ArticleTitleExtensive genomic duplication during early chordate evolution. Nature Genet 31 200–204 Occurrence Handle10.1038/ng884 Occurrence Handle1:CAS:528:DC%2BD38XktVKgsbc%3D Occurrence Handle12032567
EW Myers GG Sutton AL Delcher IM Dew DP Fasulo MJ Flanigan SA Kravitz CM Mobarry KH Reinert KA Remington EL Anson RA Bolanos HH Chou CM Jordan AL Halpern S Lonardi EM Beasley RC Brandon L Chen PJ Dunn Z Lai Y Liang DR Nusskern M Zhan Q Zhang X Zheng GM Rubin MD Adams JC Venter (2000) ArticleTitleA whole-genome assembly of Drosophila. Science 287 2196–2204 Occurrence Handle10.1126/science.287.5461.2196 Occurrence Handle1:CAS:528:DC%2BD3cXitlSnsrg%3D Occurrence Handle10731133
R Nusse (2001) ArticleTitleAn ancient cluster of Wnt paralogues. Trends Genet 17 443 Occurrence Handle10.1016/S0168-9525(01)02349-6 Occurrence Handle1:STN:280:DC%2BD3MvksF2nsQ%3D%3D
S Ohno (1999) ArticleTitleGene duplication and the uniqueness of vertebrate genomes circa 1970–1999. Semin Cell Dev Biol 10 517–522 Occurrence Handle1:CAS:528:DyaK1MXnvFOmsb4%3D Occurrence Handle10597635
B Oliver M Parisi D Clark (2002) ArticleTitleGene expression neighborhoods. J Biol 1 4 Occurrence Handle10.1186/1475-4924-1-4 Occurrence Handle12144705
MJ Pebusque F Coulier D Birnbaum P Pontarotti (1998) ArticleTitleAncient large-scale genome duplications: phylogenetic and linkage analyses shed light on chordate genome evolution. Mol Biol Evol 15 1145–1159 Occurrence Handle1:CAS:528:DyaK1cXls1Kis7g%3D Occurrence Handle9729879
SL Pollard PW Holland (2000) ArticleTitleEvidence for 14 homeobox gene clusters in human genome ancestry. Curr Biol 10 1059–1062
G Ramm JW Slot DE James W Stoorvogel (2000) ArticleTitleInsulin recruits GLUT4 from specialized VAMP2-carrying vesicles as well as from the dynamic endosomal/transGolgi network in rat adipocytes. Mol Biol Cell 11 4079–4091 Occurrence Handle1:CAS:528:DC%2BD3cXovFansL4%3D Occurrence Handle11102509
VK Randhawa PJ Bilan ZA Khayat N Daneman Z Liu T Ramlal A Volchuk XR Peng T Coppola R Regazzi WS Trimble A Klip (2000) ArticleTitle VAMP2, but not VAMP3/cellubrevin, mediates insulin-dependent incorporation of GLUT4 into the plasma membrane of L6 myoblasts. Mol Biol Cell 11 2403–2417 Occurrence Handle1:CAS:528:DC%2BD3cXkvFCiur0%3D Occurrence Handle10888677
JA Simon JW Tamkun (2002) ArticleTitleProgramming off and on states in chromatin: mechanisms of Polycomb and trithorax group complexes. Curr Opin Genet Dev 12 210–218 Occurrence Handle10.1016/S0959-437X(02)00288-5 Occurrence Handle1:CAS:528:DC%2BD38XhvVCnsbc%3D Occurrence Handle11893495
PT Spellman GM Rubin (2002) ArticleTitleEvidence for large domains of similarly expressed genes in the Drosophila genome. J Biol 1 5 Occurrence Handle10.1186/1475-4924-1-5 Occurrence Handle12144710
J Spring (2002) ArticleTitleGenome duplication strikes back. Nature Genet 31 128–129 Occurrence Handle1:CAS:528:DC%2BD38XktVKgsbs%3D Occurrence Handle12040374
Z Trachtulec J Forejt (2001) ArticleTitleSynteny of orthologous genes conserved in mammals, snake, fly, nematode, and fission yeast. Mammal Genome 12 227–231 Occurrence Handle10.1007/s003350010259 Occurrence Handle1:CAS:528:DC%2BD3MXhsFSgtLo%3D
Z Trachtulec RM Hamvas J Forejt HR Lehrach V Vincek J Klein (1997) ArticleTitleLinkage of TATA-binding protein and proteasome subunit C5 genes in mice and humans reveals synteny conserved between mammals and invertebrates. Genomics 44 1–7 Occurrence Handle10.1006/geno.1997.4839 Occurrence Handle1:CAS:528:DyaK2sXlsF2itb4%3D Occurrence Handle9286694
DY Wang S Kumar SB Hedges (1999) ArticleTitleDivergence time estimates for the early history of animal phyla and the origin of plants, animals and fungi. Proc R Soc Lond B Biol Sci 266 163–171 Occurrence Handle10.1098/rspb.1999.0617 Occurrence Handle1:CAS:528:DyaK1MXht1Gns7g%3D Occurrence Handle10097391
AG West M Gaszner G Felsenfeld (2002) ArticleTitleInsulators: Many functions, many mechanisms. Genes Dev 16 271–288 Occurrence Handle10.1101/gad.954702 Occurrence Handle11825869
DL Wheeler DM Church AE Lash DD Leipe TL Madden JU Pontius GD Schuler LM Schriml TA Tatusova L Wagner BA Rapp (2001) ArticleTitleDatabase resources of the National Center for Biotechnology Information. Nucleic Acids Res 29 11–16 Occurrence Handle10.1093/nar/29.1.11 Occurrence Handle1:CAS:528:DC%2BD3MXjtlWns7c%3D Occurrence Handle11125038
Acknowledgements
We thank S. Lamas, C. Zaragoza, and V. Ley for critical reading of the manuscript and helpful suggestions. This work was supported in part by grants to B.G. (Ayudas de Investigación Serono de la Fundación Salud-2000 [Spain]) and J.R.-C. (PM1999-0103; Spanish Ministerio de Ciencia y Tecnología).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Granadino, B., Rey-Campos, J. EVG, the Remnants of a Primordial Bilaterian’s Synteny of Functionally Unrelated Genes . J Mol Evol 57, 515–519 (2003). https://doi.org/10.1007/s00239-003-2503-9
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/s00239-003-2503-9