Abstract
The phenomenon of overlapping of various sequence messages in genomes is a puzzle for evolutionary theoreticians, geneticists, and sequence researchers. The overlapping is possible due to degeneracy of the messages, in particular, degeneracy of codons. It is often observed in organisms with a limited size of genome, possessing polymerases of low fidelity. The most accepted view considers the overlapping as a mechanism to increase the amount of information per unit length. Here we present a model that suggests direct evolutionary advantage of the message overlapping. Two opposing drives are considered: (a) reduction in the amount of vulnerable points when the overlapping of two messages involves common critical points and (b) cumulative compromising cost of coexistence of messages at the same site. Over a broad range of conditions the reduction of the target size prevails, thus making the overlapping of messages advantageous.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
DNA sequences are often compared with the texts written in a language, either a natural human language or an artificial one. The fundamental difference is that all usual human texts are organized linearly so that if there is more than one message, the messages do not overlap but follow one another. This does not apply to natural genetic DNA sequences. The same fragment of a sequence typically carries more than one functional message, and these messages overlap. Holliday (1968) predicted this phenomenon considering signals responsible for recombination, which may well reside within the protein-coding sequences. This would be allowed, perhaps, due to degeneracy of the triplet code. The idea was developed further in works by Schaap (1971), Zuckerkandl (1976), Eigen and Schuster (1979), and Trifonov (1981) and confirmed by numerous experimental data, as reviewed by Normark et al. (1983). In a generalized form (Trifonov 1989) any sequence pattern that corresponds to a certain function is considered a code. Due to the degeneracy of the codes, “corresponding messages are not only interspersed, but actually overlapped, so that some nucleotides belong to several messages simultaneously.” The idea of the multiplicity of the codes of nucleotide sequences and their overlapping also appeared in studies by Caporale (1984), Staden (1984), Kypr (1986), and Konings et al. (1987) (reviewed by Trifonov 1996).
One example of such overlapping is when a biologically significant RNA secondary structure overlaps sequencewise with the protein-coding region. Several well-known RNA secondary structures along the HIV-1 genome play various functional roles during the virus life cycles, while residing in protein-coding regions. One of the best known is the RNA structure of the Rev Responsive Element (RRE), which interacts with the Rev Trans-Activator protein (Dayton et al. 1989; Kjems et al. 1991; Malim et al. 1989, 1990). Biologically functional RNA secondary structure motif may stimulate ribosomal frameshifting in the gag–pol overlapping region in retroviruses. The exact nature of this signal is still controversial. Various researchers define it as either a pseudoknot (Le et al. 1991; Morikawa and Bishop 1992), a stem–loop structure (Parkin et al. 1992; Vickers and Ecker 1992), or a two-stem structure (Dulude et al. 2002). The translational frameshifting message is also suggested to reside in (G–non-G–N) n mRNA periodicity (Trifonov 1987, 1992; Lagunez-Otero and Trifonov 1992), which may be responsible for monitoring the correct reading frame during translation by forming transient complementary complexes with the C-periodical structure of the ribosomal RNA. Staple and Butcher (2003) fortified the frameshifting theory by NMR analysis of the RNA of the gag–pol region. This analysis indicated a frameshift-inducing stem–loop element made of an A form helix capped by a structured ACAA tetraloop.
Superimposed messages can be revealed even without knowing the biological meaning of the hidden message. A simple statistical approach based on processing data from a combination of multiple alignments and RNA secondary structure predictions, allowed to deduce distinct RNA folds in the first conserved (C1) protein-coding region of the env gene (Peleg et al. 2002) and in the β3/β4 region of the nef gene of HIV-1 (Peleg et al. 2003). It is important to distinguish between overlapping messages that play a role in the expression of the gene they overlap with and those that are functionally unrelated to the gene. The first group includes ribosomal frameshift signals, attenuators, antiterminators, and nucleosome positioning codes. An example of a message from the second group is the RRE RNA secondary structure encoded by the sequence of the env gene in retroviruses, which does not play any role in expression of the protein gp160 encoded by the same sequence.
Viruses often have very compact genomes, and overlapping of messages in this case occurs frequently. Konings (1992) studied a specific type of multiple-coding problem in lentiviruses, taking the RRE structure as a unique biological example. TAR is another well-known RNA secondary structure of retroviruses, which is not located inside any coding sequence. Comparison of the mutation rate between the TAR and the RRE regions revealed that in the case of RRE overlapping with the protein-coding message could be viewed as a factor constraining the evolutionary divergence of the element by increasing its selective value. Wagner and Stadler (1999) studied the genomes of single-stranded RNA viruses (dengue virus, hepatitis C virus, and HIV-1), focusing on the mutational stability of conserved and nonconserved viral secondary structure elements. Using this comparison, they concluded that the mutation robustness and monomorphism are associated with the RNA secondary structure message that overlaps with the protein coding.
The overlapping messages occur frequently in prokaryotic genomes and prokaryote-derived organelles such as mitochondria (Normark et al. 1983). Fukuda et al. (1999) identified 160 overlapping gene pairs in the genome of M. pneumonia and 155 overlapping gene pairs in the genome of M. genitalium.
A number of reports indicate the existence of overlapping messages in nuclear genomes of eukaryotes. Coelho et al. (2002) found that coding information for protein (TAR1) and structural RNAs (rRNA) in Saccharomyces cerevisiae can overlap, raising issues regarding the coevolution of such complex genes. Shintani et al. (1999) reported the overlapping of genes ACAT2 and TCP1 in their 3′ untranslated regions (UTRs). They appear in a tail-to-tail orientation, while their coding sequences are located on the opposite strands. Zhou and Blumberg (2003) found that the genes VLCAD and DLG4 are arranged in a head-to-head orientation on human chromosome 17p13 and share a 245-bp overlapping region that contains part of DLG4 exon 1 and the entire exon 1 of VLCAD including 62 bp of protein-coding sequence. Edgar (2003) found that the genes ABHD1 and Sec12 overlap. These genes, located on human chromosome 2p23.3, share 42 bp of the 3′-UTR in an antisense manner.
Species with a low quality of replication can maintain only a short genome (Eigen and Schuster 1979), which may be too small to store all the necessary information in a sequential manner. To increase the amount of information that can be stored, “the quantity of information per length unit has to be increased; i.e., part of the genome has to code for multiple functions” (Huynen et al. 1993). Hogeweg and Hesper (1992) studied the evolutionary dynamics leading to multiple coding. They showed that a high mutation rate and crossing-over lead to “multiple coding.” However, they concluded that “multiple coding often does not increase the fitness of the population; nevertheless, it is selected.” Huynen et al. (1993) evaluated the transition from RNA primary sequences to RNA secondary structure in the RRE region in the env gene of lentiviruses and Visna virus. The results of this study indicated a variation in the initial ruggedness of fitness landscapes that plays an essential role in the evolution and optimization of RNA secondary structure encoded in the translated region. On the other hand, simulation of an evolutionary search process for a specific secondary structure shows a reduction of allowable point mutations and a reduction of the possibility for small-scale adaptation. Apparently, this decreases the final fitness of the region of overlapping. High fitness as a prerequisite for multiple coding is discussed by Pavesti et al. (1997). Studying the informational content of overlapping genes in prokaryotic and eukaryotic viruses, the authors revealed an increased frequency of amino acid residues with high levels of degeneracy in proteins encoded by overlapping genes. Krakauer (2000) proposed a mathematical model to estimate the stability and evolution of overlapping genes in various orientations in terms of information cost. Krakauer assumed that the superposition increases coupling between functionally related genes and concluded that overlapping at the 3′ end decays more slowly than that at the 5′ end.
If the cost of the superposition is so high, is it at all advantageous? In this article, we propose two versions of a model in which multiple coding directly enhances the fitness under conditions of a high mutation rate.
Models of Advantageous Overlapping of Biological Messages
A Simple Model
Let us consider an example with two adjacent messages in a given genome. Each message contains several crucial residues (uppercase) along its base sequence.
Seq 1:ActggtGttaTCtttaCcgATAggaTGgccttActC
Seq 2:CaGggaaggAaaCagtTAgCcaGtcaAtcgGtagT
Here the total number of crucial residues in both messages is 23 (N = 23). For simplicity, other residues are here considered neutral, replaceable by any other residue. Consider now the overlapping as below, such that the matching residues (boldface letters) are identical in both sequences:
The superposition (Seqoverlap) contains both messages, but the total number of crucial residues (uppercase) including common ones is now smaller (18). A reduction in the number of crucial residues (target size) by 5, thus, would increase the fitness of the organism. In other words, for fitness λ and number of crucial positions N we have λ↓N. In the case of overlapping messages, Noverlap<Nnon. As a result, λoverlap>λnon.
The Model Based on Probabilities of Lethal Mutations per Nucleotide in A, B, and A/B
The above model can be described in detail by considering two sequence messages sliding toward their superposition forming a common region. Let us have two messages, A and B. Message A has length n1; message B has length n2. m is the length of the overlapping region a/b in the A/B merge.
Let μ1 be the probability of a (lethal) mutation per one nucleotide for message A. (Further on, we consider only lethal mutations.) Let μ2 be the probability of a mutation per one nucleotide of message B. Therefore, the probability that there will be no mutation in the interval n1–m is equal to , and by the same token the probability of no mutation in the interval n2–m is . The probability that there will be no mutation in the overlapping interval m is equal to (1−ν)m. Consequently, the probability P m of the total absence of any mutation in the whole interval is equal to the product of these probabilities,
Obviously, 0 ≤ μ1, μ2, ν ≤ 1. Let us rewrite P m
where the constant C
From (2), it follows that the extreme value of P m is reached at one of the limits of the parameter range. Namely, if (1−ν) < (1−μ1)(1−μ2), then the value P m reaches a maximum when m = 0 (P0 > P i for 0<i≤m). If (1−ν) > (1−μ1)(1−μ2), then the value P m reaches the maximum when m is maximal, that is, m = min (n1, n2) (P m > P i for 0 ≤ i < m). Note that parameter ν, in principle, may take any value between zero and one in a species-dependent or environment-dependent manner. One reasonable possibility is that ν is equal to the larger of the probabilities μ1 and μ2: ν = max (μ1, μ2). In this case, the inequality (1−ν) > (l−μ1)(1−μ2) is always true; consequently, the optimum corresponds to the maximal possible overlap. In the case of ν = μ1μ2, the probability that there would not be a mutation in the interval m is equal to (1−μ1μ2)m, and again, (1−ν) > (l−μ1)(1−μ2).
To illustrate this equation graphically we assumed that the probability of a lethal mutation per one nucleotide μ1 is equal to mutation rate α multiplied by “index of lethality” λ and ν = μ1 = μ2. Then the survival index σ for estimating the difference between the nonoverlapping situation and overlapping of m bases (σ = P0−P m ) is
Figure 1A–C illustrate how the survival index is influenced by the index of lethality, mutation rate, and length of the superposition. All figures correspond to mutation rate values between 0 and 0.0001 and superposition lengths between 0 and 100 bases. Figure 1A corresponds to lethality index λ = 0.3 (30% of critical points). Figure 1B and C correspond to lethality index λ = 0.6 and λ = 0.9, respectively. We would like to emphasize that the actual values of the survival index are not important. For the purposes of this analysis what matters is its behavior as a function of the parameters.
Figure 1A–C demonstrate that the survival index increases with degree of overlapping, in particular, under conditions of a high mutation rate. This is a simple formal illustration of the general idea of the advantage of overlapping. Formally the survival index as defined and parameterized in this model would increase indefinitely and cause the whole genome to overlap. This situation is clearly unrealistic. The penalty should be introduced to take into account a consequence of the compromise, which decreases the ability of the overlapping (degenerate) messages to vary. The compromise of the overlapping may lead, for example, to a nonoptimal choice of an amino acid at a particular location in the protein. It might cause a mismatch of a specific nucleotide in an RNA secondary structure to its counterpart in the opposite strand of the stem. Both would cause a reduction of the phenotypic repertoire. This justifies the assumption that the fitness also depends on the length of the overlap. Let us introduce the penalty for the overlapping as a function f(m) that decreases with an increase in m. Then a general fitness λ m becomes
Examples with different possible types of penalty function f(m) are considered below.
(1) f(m) = (1−β)m. In this case a penalty for each overlapping nucleotide (letter) equals a constant value β. Then the probability that no letter would be penalized is (1−β)m.
Obviously, the maximum of λ m is reached either when m = 0 or when m = min (n1, n2). As in the previous case, if (1−ν)(1−β) < (1−μ1)(1−μ2), then P m is maximal when m = 0. If (1−ν)(1−β) > (1−μ1)(1−μ2), then P m is maximal when m = min (n1, n2).
(2) f(m) = 1/(1+m). In this case
If (1−ν) < (1−μ1)(1−μ2) then λ m decreases with an increase in m. This means that m = 0 (no overlap) has the best fitness. If (1−ν) > (1−μ1)(1−μ2) then λ m is not a monotonous function of m. An extreme value would be when
However, it is easy to show that λm0 is the minimum of the function. Thus, as before, the maximum of λ m is reached either when m = 0 or when m = min (n1, n2).
(3) , where ω is a relatively small value. This penalty function has a weaker length dependence than in case 2. Consequently,
An extreme value would be reached when
0 ≤ m0 ≤ min (n1,n2) and, in principle, may be any value in this range. If ν = max (μ1,μ2), assuming μ1>μ2, then
That is, approximately
Thus, the optimal m, which is the optimal length of the overlapping region, depends on the ratio between the mutation rate μ and the coefficient ω of the cumulative influence of an overlapping length m in the penalty function f(m).
From formulae (10) and (11) it is apparent that the optimal m does not depend upon the lengths of the messages. It should obey the inequality 0≤m≤min (n1,n2), and it may happen that m would be less than required by (10) and equal to min (n1,n2), which is the case of the maximal possible overlap.
Figure 2A–C correspond, respectively, to Figs. 1A–C modified by the addition of the penalty function. The penalty function is . The survival index σ = λ0−λ m is
The survival index indicates an evolutionary advantage for overlapping as long as it is higher than zero. Figure 2 shows that since the penalty depends only on the extent of the overlapping, its increase leads to accumulation of the compromises. The information contained in at least one of the messages decreases, which leads to the loss of the evolutionary advantage of the overlapping. Values of the parameters in the formulae above (overlapping length m, mutation rate μ, lethality index λ, parameter ω) are only of illustrative significance. Moreover, each sequence position has, actually, its own penalty for the message overlapping. However, the plots above illustrate in a semiquantitative way how the accumulation of compromises that accompanies the overlapping restricts the length of the overlapping region. This occurs even under favorable conditions for overlapping such as a high mutation rate and reduction of the number of crucial points.
The Model Based on Probabilities of Lethal Mutations per Interval
In this model we assume that a probability of one (or more) lethal mutation within the interval is proportional to the length of the interval. The definitions follow.
Q m is the probability that a lethal mutation happens inside the overlapping interval,
and the probability P m ,
to compare with (1).
From (14) it follows that the optimal m corresponds to the maximal overlap m = min (n1, n2). Let us introduce a penalty for each overlapping nucleotide (letter) equal to a constant value β. Then the probability that no letter would be penalized is (l−β)m. Accordingly
and the extreme value of a general fitness λ m would be reached when
A very simple and meaningful conclusion follows from this formula: The higher the penalty for the overlapping, the lower the advantage of the overlapping.
Discussion
A genome region with two overlapping messages is considered, such that each message contains several crucial residues, while other residues are assumed to be neutral. The models are proposed to demonstrate how the superposition of critical points and mutation rate may influence the survival index. The interplay between an advantage of the superposition and a penalty for impairment of an advantage of the variability of the (degenerate) message is analyzed. The conclusion is that under reasonable assumptions the degree of the optimal overlapping depends on the ratio between the mutation rate and the cumulative penalty function. A low mutation rate and incompatibility of the messages (too high a cost for the compromise in the overlapping region) make overlapping disadvantageous. Importantly, a high mutation rate and high tolerance of the juxtaposed sequence messages make the superposition beneficial.
Usually, the phenomenon of overlapping is explained by the need to store a large quantity of information in a small genome (Eigen and Schuster 1979). The model that we present here is in general accord with previous models associating high mutation rate with message overlapping. However, unlike the previous models (Hogeweg and Hesper 1992; Huynen et al. 1993), our analysis suggests a direct evolutionary advantage for message overlapping under conditions of a high mutation rate. Moreover, our model proposes a particular mechanism to realize this evolutionary advantage through superposition of critical points, thus reducing their amount in the genome. This simple model involves two opposing forces that balance the degree of overlapping. The first force is the reduction of the number of vulnerable points and the second, opposing factor is a penalty for deterioration of the messages, which gradually reach the point of zero gain. This penalty is quite similar to the information cost described by Krakauer (2000). He described the information cost in terms of the tendency of a population to become monomorphic, which restricts the ability of polypeptides to be fine-tuned. Although our model describes the overlapping messages as sliding toward each other, we do not mean that this is the only possible mechanism. The sliding presentation is only a simplification that helps to illustrate the idea of two opposing forces optimizing the effect of overlapping.
References
LH Caporale (1984) ArticleTitleIs there a higher level genetic code that directs evolution? Mol Cell Biochem 64 5–13 Occurrence Handle10.1007/BF00420923 Occurrence Handle1:CAS:528:DyaL2cXmt1emtLs%3D Occurrence Handle6493222
PS Coelho AC Bryan A Kumar GS Shadel MA Snyder (2002) ArticleTitleNovel mitochondrial protein, Tar1p, is encoded on the antisense strand of the nuclear 25S rDNA Genes Dev 16 2755–2760 Occurrence Handle10.1101/gad.1035002 Occurrence Handle1:CAS:528:DC%2BD38Xos1Sju7g%3D Occurrence Handle12414727
AJ Edgar (2003) ArticleTitleThe gene structure and expression of human ABHD1: Overlapping polyadenylation signal sequence with Sec12 BMC Genomics 4 18 Occurrence Handle10.1186/1471-2164-4-18 Occurrence Handle12735795
E Dayton D Powell A Dayton (1989) ArticleTitleFunctional analysis of CAR, the target sequence for the Rev protein of HIV-1 Science 246 1625–1629 Occurrence Handle1:CAS:528:DyaK3cXlt1Ohsbo%3D Occurrence Handle2688093
D Dulude M Baril L Brakier-Gingras (2002) ArticleTitleCharacterization of the frameshift stimulatory signal controlling a programmed -1 ribosomal frameshift in the human immunodeficiency virus type 1 Nucleic Acids Res 30 5094–5102 Occurrence Handle10.1093/nar/gkf657 Occurrence Handle1:CAS:528:DC%2BD38XptlOiu7s%3D Occurrence Handle12466532
M Eigen P Schuster (1979) The hypercycle: A principle of natural self-organization Springer-Verlag Berlin
Y Fukuda T Washio M Tomita (1999) ArticleTitleComparative study of overlapping genes in the genomes of Mycoplasma genitalium, Mycoplasma pneumoniae Nucleic Acids Res 27 1847–1853 Occurrence Handle10.1093/nar/27.8.1847 Occurrence Handle1:CAS:528:DyaK1MXivVertr0%3D Occurrence Handle10101192
P Hogeweg B Hesper (1992) ArticleTitleEvolutionary dynamics and the coding structure of sequences: Multiple coding as a consequence of crossover and high mutation rates Computers Chem 4 300–314
R Holliday (1968) Genetic recombination in fungi WJ Peacock RD Brock (Eds) Replication and recombination of genetic material Australian Academy of Science Canberra 157–174
MA Huynen DA Konings P Hogeweg (1993) ArticleTitleMultiple coding and the evolutionary properties of RNA secondary structure J Theor Biol 165 251–267 Occurrence Handle10.1006/jtbi.1993.1188 Occurrence Handle1:CAS:528:DyaK2cXntFOrtQ%3D%3D Occurrence Handle7507189
J Kjems M Brown D Chang S Sharp (1991) ArticleTitleStructural analysis of the interaction between the human immunodeficiency virus Rev protein and the Rev response element Proc Natl Acad Sci USA 88 683–687 Occurrence Handle1:CAS:528:DyaK3MXhsl2itr8%3D Occurrence Handle1992459
DA Konings (1992) ArticleTitleCoexistence of multiple codes in messenger RNA molecules Comput Chem 16 153–163 Occurrence Handle10.1016/0097-8485(92)80042-X Occurrence Handle1:CAS:528:DyaK38XlsVGms7g%3D
DA Konings P Hogeweg B Hesper (1987) ArticleTitleEvolution of the primary and secondary structures of the E1a mRNAs of the adenovirus Mol Biol Evol 4 300–314 Occurrence Handle1:CAS:528:DyaL2sXktlGhtbY%3D Occurrence Handle2965782
DC Krakauer (2000) ArticleTitleStability and evolution of overlapping genes Int J Org Evol 54 731–739 Occurrence Handle1:STN:280:DC%2BD3cvhvFyjtA%3D%3D
J Kypr (1986) ArticleTitleA part of codon bias in genes protects protein spatial structures from destabilization by random single point mutations Biochem Biophys Res 139 1094–1097 Occurrence Handle1:CAS:528:DyaL28XmtVCitb4%3D
J Lagunez-Otero EN Trifonov (1992) ArticleTitlemRNA periodical infrastructure complementary to the proof-reading site in the ribosome J Biomol Struct Dyn 10 455–464 Occurrence Handle1:CAS:528:DyaK3sXhs1egtLY%3D Occurrence Handle1492920
S-Y Le BA Shapiro JH Chen R Nussinov JV Maizel (1991) ArticleTitleRNA pseudoknots downstream of the frameshift sites of retroviruses Genet Anal Tech Appl 8 191–205 Occurrence Handle1:CAS:528:DyaK38Xhs1ynurY%3D Occurrence Handle1663382
M Malim J Hauber S-Y Le J Maizel B Cullen (1989) ArticleTitleThe HIV-1 rev trans-activator acts through a structured target sequence to activate nuclear export of unspliced viral mRNA Nature 338 254–257 Occurrence Handle10.1038/338254a0 Occurrence Handle1:CAS:528:DyaL1MXhslSjtLs%3D Occurrence Handle2784194
M Malim L Tiley D McCarn J Rusche J Hauber B Cullen (1990) ArticleTitleHIV-1 structural gene expression requires binding of the Rev trans-activator to its RNA target sequence Cell 60 675–683 Occurrence Handle10.1016/0092-8674(90)90670-A Occurrence Handle1:CAS:528:DyaK3cXhsFaksLY%3D Occurrence Handle2406030
S Morikawa DH Bishop (1992) ArticleTitleIdentification and analysis of the gag-pol ribosomal frameshift site of feline immunodeficiency virus Virology 186 389–397 Occurrence Handle10.1016/0042-6822(92)90004-9 Occurrence Handle1:CAS:528:DyaK38XhsVGhsbw%3D Occurrence Handle1310175
S Normark S Bergstrom T Edlund T Grundstrom B Jurin FP Lindberg O Olsson (1983) ArticleTitleOverlapping genes Annu Rev Genet 17 499–525 Occurrence Handle10.1146/annurev.ge.17.120183.002435 Occurrence Handle1:CAS:528:DyaL2cXmt1Gmtw%3D%3D Occurrence Handle6198955
NT Parkin M Chamorro HE Varmus (1992) ArticleTitleHuman immunodeficiency virus type 1 gag-pol frameshifting is dependent on downstream mRNA secondary structure: Demonstration by expression in vivo J Virol 66 5147–5115 Occurrence Handle1:CAS:528:DyaK38Xlt1yjsrY%3D Occurrence Handle1321294
A Pavesi B De Iaco MI Granero A Porati (1997) ArticleTitleOn the informational content of overlapping genes in prokaryotic and eukaryotic viruses J Mol Evol 44 625–631 Occurrence Handle1:CAS:528:DyaK2sXjslGrtbs%3D Occurrence Handle9169554
O Peleg S Brunak EN Trifonov E Nevo A Bolshoy (2002) ArticleTitleRNA secondary structure and sequence conservation in C1 region of human immunodeficiency virus type 1 env gene AIDS. Res Hum Retro 18 867–878 Occurrence Handle10.1089/08892220260190353 Occurrence Handle1:CAS:528:DC%2BD38XmvVSkur4%3D
O Peleg EN Trifonov A Bolshoy (2003) ArticleTitleHidden messages in the nef gene of human immunodeficiency virus type 1 suggest a novel RNA secondary structure Nucleic Acids Res 31 4192–4200 Occurrence Handle10.1093/nar/gkg454 Occurrence Handle1:CAS:528:DC%2BD3sXltl2ktr4%3D Occurrence Handle12853637
T Schaap (1971) ArticleTitleDual information in DNA and the evolution of the genetic code J Theor Biol 32 293–298 Occurrence Handle1:CAS:528:DyaE3MXltVGis7w%3D Occurrence Handle5566788
S Shintani C O’hUigin S Toyosawa V Michalova J Kelin (1999) ArticleTitleOrigin of gene overlap: The case of TCP1 and ACAT2 Genetics 152 743–754 Occurrence Handle1:CAS:528:DyaK1MXktFGguro%3D Occurrence Handle10353914
R Staden (1984) ArticleTitleComputer methods to locate signals in nucleic acid sequences Nucleic Acids Res 12 IssueID1,Pt 2 505–519 Occurrence Handle1:CAS:528:DyaL2cXpvFCjug%3D%3D Occurrence Handle6364039
DW Staple SE Butcher (2003) ArticleTitleSolution structure of the HIV-1 frameshift inducing stem-loop RNA Nucleic Acids Res 31 4326–4331 Occurrence Handle10.1093/nar/gkg654 Occurrence Handle1:CAS:528:DC%2BD3sXlvF2qsbc%3D Occurrence Handle12888491
EN Trifonov (1981) Structure of DNA in chromatin H Schweiger (Eds) International cell biology 1980–1981 Springer-Verlag Berlin 128–138
EN Trifonov (1987) ArticleTitleTranslation framing code and frame-monitoring mechanism as suggested by the analysis of mRNA and 16 S rRNA nucleotide sequences J Mol Biol 194 643–652 Occurrence Handle1:CAS:528:DyaL2sXktF2mtbc%3D Occurrence Handle2443708
EN Trifonov (1989) ArticleTitleThe multiple codes of nucleotide sequences Bull Math Biol 51 417–432 Occurrence Handle1:CAS:528:DyaL1MXlsVyjsr0%3D Occurrence Handle2673451
EN Trifonov (1992) ArticleTitleRecognition of correct reading frame by the ribosome Biochimie 74 357–362 Occurrence Handle10.1016/0300-9084(92)90113-S Occurrence Handle1:CAS:528:DyaK38XkvVGgtrk%3D Occurrence Handle1637861
EN Trifonov (1996) ArticleTitleInterfering contexts of regulatory sequence elements Comput Appl Biosci 12 423–429 Occurrence Handle1:CAS:528:DyaK2sXoslGlsA%3D%3D Occurrence Handle8996791
TA Vickers DJ Ecker (1992) ArticleTitleEnhancement of ribosomal frameshifting by oligonucleotides targeted to the HIV gag-pol region Nucleic Acids Res 20 3945–3953 Occurrence Handle1:CAS:528:DyaK38XlvVyisL4%3D Occurrence Handle1508680
A Wagner PF Stadler (1999) ArticleTitleViral RNA and evolved mutational robustness J Exp Zool 285 119–127 Occurrence Handle1:CAS:528:DyaK1MXlsFGhtrs%3D Occurrence Handle10440723
C Zhou B Blumberg (2003) ArticleTitleOverlapping gene structure of human VLCAD and DLG4 Gene 305 161–166 Occurrence Handle10.1016/S0378-1119(02)01235-0 Occurrence Handle1:CAS:528:DC%2BD3sXhtlKju7o%3D Occurrence Handle12609736
E Zuckerkandl (1976) ArticleTitleEvolutionary processes and evolutionary noise at the molecular level. II. A selectionist model for random fixations in proteins J Mol Evol 7 269–311 Occurrence Handle1:CAS:528:DyaE28XksFygtLs%3D Occurrence Handle180297
Acknowledgments
We thank all members of the Genome Diversity Center for fruitful discussions and critical comments on the paper. We thank Ms. Naava Rubinstein for the editing. A.B. is grateful to Professors T. Ratiu and J.H. Maddocks, Bernoulli Institute, Swiss Federal Institute of Technology, for the kind invitation to visit Centre Bernoulli for two months.
Author information
Authors and Affiliations
Corresponding author
Additional information
[Reviewing Editor: Dr. Martin Kreitman]
Rights and permissions
About this article
Cite this article
Peleg, O., Kirzhner, V., Trifonov, E. et al. Overlapping Messages and Survivability. J Mol Evol 59, 520–527 (2004). https://doi.org/10.1007/s00239-004-2644-5
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/s00239-004-2644-5