Abstract
The sequence data produced by the International Human Genome Sequencing Consortium and other systematic sequencing projects hold tremendous potential. It is therefore increasingly important that all biologists be able to navigate key publicly available databases and cull important information from them. The continued rapid rise in available sequence information, particularly as model organism data are generated at breakneck speed, further underscores the need for all biologists to learn how to make their way effectively through the expanding “sequence information space.” This review discusses some of the more commonly used tools that have been developed for the effective and efficient mining of sequence information. These include LocusLink, which provides a gene-centric view of sequence-based information, as well as the 3 major genome browsers: the National Center for Biotechnology Information Map Viewer, the University of California Santa Cruz Genome Browser, and the European Bioinformatics Institute’s Ensembl system. An overview of the types of information available through each of these front-ends is given, along with pointers to tutorials and other documentation intended to increase the reader’s familiarity with these tools.
Cite this article
Baxevanis, A.D. Using Genomic Databases for Sequence-Based Biological Discovery. Mol Med 9, 185–192 (2003). https://doi.org/10.1007/BF03402130