Abstract
Hypothetical Products from Noncoding Frames (i.e., HyPNoFs) are hypothetical, not-coded proteins, translated from alternate reading frames (i.e., coding+1 and coding+2) of cDNAs. HyPNoFs of CD4, PKC, oncostatin, bcl-2 proto-oncogene, tumor suppressor p53, cystic fibrosis transmembrane regulator (CFTR), and tumor necrosis factors a and β were searched as query sequences vs the SWISS-PROT data bank. Homology searches carried out revealed that hypothetical products (i.e., HyPNoFs) may share high similarity with real protein products actually coded. Sequence similarity of hypothetical products to real proteins is sometimes very high, suggesting common conformational features, according to the Sander and Schneider cutoff value. This finding supports the hypothesis that eukaryotic DNA, currently considered to be monocistronic, might occasionally have polycistronic regions, carrying different protein messages on overlapping frames. As yet, polycistronic genes have been observed in viral genomes only. The presence of polycistronic regions in eukaryotic genes is likely reminiscent of an ancient strategy, rather than a present feature of the genome in eukaryotes.
These data suggest that thorough investigation of HyPNoFs is likely to improve our ability to trace genes' evolution and to investigate structure-function relationships of protein and DNA sequences.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Brendel V, Bucher P, Nourbakhsh IR, Blaisdell BE, Karlin S (1992) Methods and algorithms for statistical analysis of protein sequences. Proc Natl Acad Sci USA 89:2002–2006
Dalgleish AG, Beverley PCL, Clapham PR, Crawford DH, Greaves MF, Weiss RA (1984) The CD4 (T4) antigen is an essential component of the receptor for the AIDS retrovirus. Nature 312:763–767
Dayhoff MO (1978) Atlas of protein sequences and structure. National Biomedical Research Foundation, Washington, DC
Dinesh-Kumar SP, Brault V, Miller WA (1992) Precise mapping and in vitro translation of a trifunctional subgenomic RNA of barley yellow dwarf virus. Virology 187(2):711–722
Doolittle RF (1981) Similar amino acid sequences: chance or common ancestry? Science 214:149–159
Facchiano A, Facchiano F, van Renswoude J (1993) Divergent evolution may link human immunodeficiency virus gp411 to human CD4. J Mol Evol 36:448–457
Feng DF, Johnson MS, Doolittle RF (1984) Aligning amino acid sequences: comparison of commonly used methods. J Mol Evol 21: 112–125
Gilbert TL, Haldeman BA, Mulvihill E, O'Hara PJ (1992) A mammalian homologue of a transcript from the Drosophila pecanex locus. J Neurogenet 8(3):181–187
Gonnet GH, Cohen MA, Benner SA (1992) Exaustive matching of the entire protein sequence database. Science 256:1443–1445
Gotoh O (1982) An improved algorithm for matching biological sequences. J Mol Biol 162:70–705
Green P, Lipman D, Hillier L, Waterston R, States D, Claverie JM (1993) Ancient conserved regions in new gene sequences and the protein databases. Science 259:1711–1716
Hockenbery D, Nunez G, Milliman C, Schreiber RD, Korsmeyer SJ (1990) BCL-2 is an inner mitochondrial membrane protein that blocks programmed cell death. Nature 348:334–336
Jacks T, Madhani HD, Masiarz FR, Varmus HE (1988) Signals for ribosomal frameshifting in the Rous Sarcoma Virus gag-pol region. Cell 55:447–458
Lamb RA, Choppin PW (1979) Segment 8 of the influenza virus genome is unique in coding for two polipeptides. Proc Natl Acad Sci USA 76:4908–4912
Lamb RA, Horvath CM (1991) Diversity of coding strategies in influenza viruses. Trends Genet 7(8):261–267
Lipman DJ, Pearson WR (1985) Rapid and sensitive protein similarity searches. Science 227:1435–1441
Karlin S, Brendel V (1992) Chance and statistical significance in protein and DNA sequence analysis. Science257:39–49
Kirchner J, Sandmeyer SB, Forrest DB (1992) Transposition of a TY3 GAG3-POL3 fusion mutant is limited by availability of capsid protein. J Virol 66(10):6081–6092
Merelli F, Stojilkovic SS, Iida T, Krsmanovic LZ, Zheng L, Mellon PL, Catt KJ (1992) Gonadotropin-releasing hormone-induced calcium signaling in clonal pituitary gonadotrophs. Endocrinology 131;2: 925–932
Montell C, Fisher EF, Caruthers MH, Berk AJ (1982) Resolving the function of overlapping viral genes by site-specific mutagenesis at a mRNA splice site. Nature 295:380–384
Myers EW, Miller W (1988) Optimal alignments in linear space. Comput Appl Biosci 4:11–17
Needleman SB, Wunsch CD (1970) A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol 48:443–453
Ray R, Jameel S, Manivel V, Ray R (1992) Indian hepatits E virus shows a major deletion in the small open reading frame. Virology 189(1):359–362
Rich DP, Anderson MP, Gregory RJ, Cheng SH, Paul S, Jefferson DM, et al. (1990) Expression of cystic fibrosis transmembrane conductance regulator corrects defective chloride channel regulation in cystic fibrosis airway epithelial cells. Nature 347:358–363
Riordan JR, Rommens JM, Kerem B, Alon N, Rozmahel R, et al. (1989) Identification of the cystic fibrosis gene: cloning and characterization of complementary DNA. Science 245:1066–1073
Rose TM, Bruce AG (1991) Oncostatin M is a member of cytokine family that includes leukemia-inhibitory factor, granulocyte colony stimulating factor, and interleukin 6. Proc Natl Acad Sci USA 88(19):8641–8645
Sali A, Overington JP, Johnson MS, Blundell TL (1990) From comparison of protein sequences and structures to protein modelling and design. In: Bradshow RA, Purton M (eds) Proteins: form and function. Elsevier Trends Journals, Cambridge, pp 163–171
Sander C, Scheneider R (1991) Database of homology derived protein structure and the structural meaning of sequence alignment. Proteins Struct Funct Genet 9:56–68
Suzuki N, Sugawara M, Kusano T (1992) Rice dwarf phytoreovirus segment S12 transcript is tricistronic in vitro. Virology 191(2):992–995
Taga T, Narazaki M, Yasukawa K, Saito T, Miki D, Hamaguchi M, Davis S, Shoyab M, Yancopoulos GD, Kishimoto T (1992) Functional inhibition of hematopoietic and neurotrophic cytokines by blocking the interleukin-6 signal transducer gp130. Proc Natl Acad Sci USA 89(22):10998–11001
Wegenka UM, Buschmann J, Luttichen C, Heinrich PC, Horn F (1993) Acute-phase response factor, a nuclear factor binding to acutephase response elements, is rapidly activated by interleukin-6 at the posttranslational level. Mot Cell Biol 13(1):276–288
Wilbur WJ, Lipman DJ (1983) Rapid similarity searches of nucleic acid and protein data banks. Proc Natl Acad Sci USA 80:726–730
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Facchiano, A. Investigating hypothetical products from noncoding frames (HyPNoFs). J Mol Evol 40, 570–577 (1995). https://doi.org/10.1007/BF00160503
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/BF00160503