Introduction

The RNA World hypothesis is a leading explanation for the origin of life. It proposes that RNA or RNA-like molecules emerged through Earth’s chemical processes, leading to an RNA sequence with the capacity to catalyse its own synthesis (Rich 1962; Visser 1984; Gilbert 1986; Bernhardt 2012; Goldman and Kacar 2021). RNA holds significant appeal as the precursor to complex life due to its dual characteristics as both a catalyst and a carrier of genetic information (Atkins et al. 2011; Bernhardt 2012).

The RNA World hypothesis has numerous unsolved challenges, including even its viability (Szostak 2012; Bregestovski 2015). Apart from these questions, the transition from the RNA world to a cellular state also requires elucidation. A model of this transition needs to provide an explanation for the acquisition of membranes and the emergence of the ribosome.

The concept of a self-assembling fatty acid vesicles enclosing an RNA replicator is a prevalent paradigm for the origin of the membrane (Bachmann et al. 1992; Segré et al. 2001; Szostak et al. 2001; Monnard and Deamer 2002; Chen et al. 2005; Albertsen et al. 2014; Monnard et al. 2015; Kurihara et al. 2015). The general thesis is that initially both the RNA and lipid components are self-assembling; however, with time, the RNA molecules developed catalytic functions that facilitated the replication of both the vesicle and RNA (Szostak et al. 2001). Contrary to being a simple lipid enclosure, the modern cell membrane functions as an active and dynamic barrier, playing a crucial role in regulating the movement of nutrients and waste within the cell. This regulation is essential for establishing and maintaining the complex network of enzyme-driven chemical processes of life. Therefore, for a lipid enclosure to sustain life, the first vesicles have been proposed to process a rather unusual set of properties, including spontaneous growth, permeability to nucleotide substrates (Szostak et al. 2001; Joyce and Szostak 2018), and a proto metabolism (Nunes Palmeira et al. 2022; Babajanyan et al. 2023).

Translation is a remarkably intricate and interdependent biological subsystem that has been conserved across all cellular life forms (Fox 2010). Although the evolutionary development of the ribosome has been modelled with high confidence (Fox and Naik 2004; Bokov and Steinberg 2009; Hsiao et al. 2009; Petrov et al. 2015), the reason for its emergence remains elusive (Wolf and Koonin 2007). The first proto-ribosome has been predicted to contain the peptidyl transferase centre (PTC) but lacks the ability to decode mRNA (Agmon et al. 2005; Bokov and Steinberg 2009). This proto-ribosome was estimated to be approximately 165 bases in length (Bokov and Steinberg 2009) and capable of synthesising short peptides comprised of either fixed or random amino acids. Whilst such peptides might not appear useful, the emergence of the complete translational apparatus implies that for a substantial period in the RNA world’s existence, the production of non-coded peptides served a significant purpose. It is generally proposed that these primitive peptides enhanced RNA’s functionality by serving as co-factors or structural influencers for RNA complexes (Szathmáry and Smith 1997; Noller 2012).

A lesser-known solution is the “inside-out” or obcell hypothesis. This posits that the proto-ribosome emerged in a pre-cellular RNA world in order to enable RNA life to colonise the outer surface of lipid membranes. This resolve concerns regarding lipid permeability and enables for the emergence of a relatively sophisticated lifeform, prior to its transition to a protocellular structure. In addition, by positing a high demand for peptides that all have a simply attained chemical property, the hypothesis provides a viable explanation for the early development of the ribosome. This hypothesis was originally introduced by Blobel (1980) and further elaborated by Cavalier-Smith (1987, 2001), with a further contribution from Griffiths (2007). Cavalier-Smith (1987) introduced the term obcell using the Latin prefix “ob-” meaning “towards”, “on”, and “against”.

The Emergence of a Predation-Based Ecosystem Offers an Explanation for Further Development of RNA World

With the RNA world scenario, i.e. a population of RNA replicators, one may expect that the population will proliferate until further growth is limited by the availability of a resource (Malthus 2018; Tang and Riley 2021). Reaching this limit could be extremely harmful to the RNA population. For example, consider an RNA-based world where the majority of replication efforts fail due to a lack of resources. In this scenario, the partial replication of the RNA replicators is an unproductive consumption of resources and would therefore predict a significant decrease in the carrying capacity of the environment. Regardless of the specific details, an ecosystem of competition, either direct or indirect, is likely to occur in an RNA world.

The availability of nucleotides is a possible candidate to be the limiting factor in the RNA world. Under such an environment one may imagine a significant evolutionary fitness gain for an RNA replicator that can acquire its nucleotides by the degradation of foreign RNA. Such an environment might lead to the emergence of a predation or decomposition-based ecosystem (Cavalier-Smith 2001).

Predation creates arms race dynamic and as such can have a major influence on the evolutionary landscape. For instance, it is cited as a potential driver of major transitions in evolution, including multicellularity and origin of eukaryotes (Bengtson 2002; de Nooijer et al. 2009; Ispolatov et al. 2023). Its emergence in the RNA world may be just as significant as it would favour the emergence of more sophisticated RNA lifeforms over time. It also offers a potential solution to the emergence of parasites; RNAs that selfishly increase their own replication at a cost of the fitness of the RNA lifeform (Smith 1979; Takeuchi and Hogeweg 2008; Bansho et al. 2012).

Evolutionary Selection for Increased RNA Cooperation May Result in the Emergence of Lipid-Based Compartmentation

Predation can help create conditions favouring the emergence of cooperative groups (Rubenstein 1978). This may occur if the reproductive success of the individuals is greater when acting in a cooperative manner and if the environment has capacity to sustain the groups (Rubenstein 1978). Cooperation also enables for functional specialisation. However, the evolution of cooperative behaviour in RNA necessitates a mechanism for RNA compartmentalization, a physical process that enables the production of inheritable groups of RNA.

Multiple means of compartmentation have been proposed, including encapsulation in fatty acid vesicles (Blain and Szostak 2014), coacervates (Koga et al. 2011), and aqueous two-phase systems (Andes-Koback and Keating 2011). Perhaps the simplest form of RNA compartmentation may be the complementary base pairing of RNA genes to produce multi-subunit RNA entities. This system of compartmentation is compatible as a predator; however, it is clearly limited in the number of RNA genes to which it could contain.

Life clearly eventually adopts a lipid membrane, and this would enable the increased sophistication of life. However, an immediate adoption of a lipid enclosed RNA lifeform appears to be incompatible with a predation-based RNA world. The replicating RNA within vesicles would be essentially incapable of interacting with foreign RNA.

The Obcell is a Lipid-Bound RNA Lifeform

The obcell offers a superior means compartmentation that is compatible as an RNA predator and that provides a means of adaption of life towards lipids. The tethering is proposed to be achieved by the production of peptidyl-RNAs polymers consisting of a hydrophobic peptide region and a hydrophilic RNA gene. Owing to hydrophobic interactions, the latter which would spontaneously enmesh itself into a lipid whilst the former would remain at the aqueous medium.

This RNA lifeform might be capable of colonising other lipids through environmental processes that facilitate the merging and dividing of lipid vesicles or obcells. This method of reproduction relies on the random segregation RNA genes, which may be viable once the parent obcell achieves a sufficiently high copy-number of each of its genes (Goldman and Landweber 2012). An independent modern analogue of this “statistical segregation” replication has been found to occur in Oxytricha, a genus of single-celled ciliated protists (Goldman and Landweber 2012). This segregation occurs during amitotic division of macronucleus of Oxytricha which contain approximately 1000 copies of each of 20,000 DNA nanochromosomes (Goldman and Landweber 2012; Swart et al. 2013). Unlike the obcell model, these nanochromosomes are not bound to a membrane.

An intuitive critique of the obcell model pertains to the potential dissociation of metabolites/nucleotides due to the non-enclosed structure. However, confinement of this life form could have been achieved within certain environments, such as rock cavities. Additionally, it may be that the cellular morphology is only viable for rather sophisticated lifeforms (Hutchison et al. 2016).

The hypothesis may also be received unfavourably because of its failure to provide a plausible path for the transition from the obcell to a cellular morphology. The subsequent section of this manuscript offers a new suggestion regarding this transition. A new model of the means by which obcells are formed is also presented.

On the Transition of Obcells to a Cellular Morphology

Positing a Gradual Transition to the Lipid Enclosed State

The transition from obcells to a cellular morphology has been previously suggested to occur through the inversion of the membrane, where the outer surface of an obcell becomes the inner surface of an enclosed protocell. A few different mechanisms have been proposed, such as the merger of curved obcells (Blobel 1980; Cavalier-Smith 2001) or an invagination of the membrane facilitated by an early cytoskeletal system, ultimately resulting in complete encapsulation (Griffiths 2007). These models however are not particularly satisfactory. The evolutionary advantage granted by the curved obcell or invagination of membrane is not apparent yet to maintain such a conformation would require a significant energetic expenditure. Furthermore, the model demands that the RNA that is specialised to colonise the outer membrane to immediately adapt to lipid-based encapsulation.

I propose an alternative scenario in which obcells transition to cellular morphology through the emergence of a permeable membrane that enabled RNA life to spread into the internal lumen. In this model, the emergence of membrane pores enables free diffusion of RNAs and ions across the membrane. This enables RNA to colonise the vesicle’s lumen and for this to become an organelle of the obcell. The lumen could confer evolutionary advantages by compartmentalising activities, promoting reactions by molecular crowding (Ichihashi and Yomo 2014; Saha et al. 2014) and reducing the loss of metabolites to the environment. Over time, as life becomes more biochemically capable and less reliant on the external environment, the lumen or cytosol becomes the principal compartment of life.

Emergence of Both Template Decoding Proto-Ribosome and a Porous Membrane Owing to Selection for Improved Anchors

It is reasonable to expect that early development of the proto-ribosome including of template decoding occurred as it enabled for improvements of its original function. In other words, the early development of the proto-ribosome occurred because of the benefits conferred by enhanced membrane anchors. I propose that the evolutionary pressure that favoured this development eventually led to the incidental production of pores with the membrane anchors.

Several hypotheses can explain this evolutionary path. For example, it is plausible that the advancements resulted in more hydrophobically stable membrane anchors. This stability could permit a more concentrated placement of these anchors on the obcell membrane without undermining its structural integrity. It also might have supported the obcells’ adaptation to varied environments, like vesicles with distinct lipid compositions or areas with elevated temperatures. Furthermore, the advancements in proto-ribosome functionality, including template decoding, could have allowed for a more effective utilisation of the limited, highly sought-after hydrophobic residues and it could have facilitated the incorporation of residues that were previously unsuitable for anchor formation.

Regardless of the exact selection process, obcell membranes become sufficiently permeable that there is sufficiently free diffusion of RNA across the membranes to support colonisation. This may have required that this surface of obcells to be saturated with pores. The colonisation of the inside lumen would likely incentivize the further development of membrane pores (Pohorille et al. 2003). This model aligns with prior predictions that the membranes of early life were leaky and gradually became less permeable over time due to advantages related to maintaining low sodium levels initially and later evolving membrane bioenergetics (Mulkidjanian et al. 2007, 2009).

Model of Obcell Structure and Synthesis

Production of Obcell by a Proto-Ribosome and a tRNA-Derived Primer

A new model by which the obcell creates a membrane-bound RNA is illustrated in Fig. 1. This model posits that (proto-)tRNAs, which here I assume are essentially the same as contemporary tRNAs, are charged with a hydrophobic residue by an RNA gene. Next, in a fashion similar to modern ribosomes, the proto-ribosome utilises these charged tRNAs to form a hydrophobic peptidyl-tRNA. Once the peptidyl group becomes of sufficient length the peptidyl-tRNA separates from the proto-ribosome and the peptidyl group rapidly integrates into a lipid vesicle, whilst the tRNA group remains in the aqueous medium. A segment of the tRNA then hybridises at the 5′ end to the template/reverse strand of an RNA gene. Following this, the hybridised tRNA acts as an RNA primer, initiating template-based replication of the catalytic/forward stand. Once the synthesis of the complementary strand is complete, the strands separate.

Fig. 1
figure 1

Model of an obcell. Catalytic/forward strand RNAs genes are tethered to lipid, whilst template strand RNA and (proto-)tRNAs are free floating. The steps to add a new gene are (1) tRNA are charged with hydrophobic amino acids. (2) A proto-ribosome uses the aminoacylated tRNAs to make a peptidyl charged tRNAs. (3) These peptidyl-tRNAs dissociate from the proto-ribosome and become bound to the membrane. The tRNA is then cleaved, which enables remaining part (the minihelix) to become accessible to base-pair to RNA genes. (4) This tethered minihelix hybridises to a complementary sequence at the 5′ end of a RNA gene and it promotes the synthesis of RNA strand (either enzymatically or otherwise). A different primer is used to produce the reverse strand non-membrane-bound strands. (6) One of the non-membrane-bound (reverse strand) RNAs includes a primer source gene, which is spliced to excise tRNAs and reverse primers

Whilst a ~ 76-nucleotide long tRNA might seem an unlikely candidate to be an RNA primer, it has previously been proposed the tRNA evolved from a simpler stem-loop structure approximately half the size of a full tRNA (Di Giulio 1992; Tamura 2015; Burton 2020). I suggest that this 3′ half of a tRNA, often referred to as the “minihelix”, served as a primer.

As has previously been suggested by Robertson (1994) this hypothesis proposes template-based RNA replication in a manner opposite to that of modern cellular life; RNA replication in extant life consists of single-nucleotide extension in the 3′ direction, where a phosphate group attached carbon 5 of an activated nucleotide bonds to hydroxyl group in carbon 3 of the growing RNA polymer. This difference should not diminish its plausibility given that the process of RNA replication in the RNA world is predicted to be very different to current mechanisms (Szostak 2012). In addition, template directed replication of oligonucleotides in 5′ direction has been previously demonstrated using a non-enzymatic process (Kaiser et al. 2012). Ribozyme-catalysed template-based extension in the 5′ direction have also been reported although it was capable only of an extension by 3 successive nucleotides (McGinness et al. 2002).

The model proposed here differs from (Cavalier-Smith 2001) model which proposed the transfer of the hydrophobic peptidyl group from the tRNA to RNA genes. I argue that the model presented here is more plausible in part as it avoids this extra processing step. A second, more significant, advantage however is that it proposes a function for tRNAs prior to the emergence of the proto-ribosome thereby simplifying the emergence of the early translational apparatus.

The placement of the tRNA has some similarity with the genomic tag hypothesis which proposed that a minihelix was present at the 3′ end of each RNAs genome (Weiner and Maizels 1987; Maizels and Weiner 1993). Weiner and Maizels however did not propose the minihelixes to be primers.

Obcells Requires a Source of RNA Primers

To ensure a steady supply of primers, both for catalytic and template strands, the obcell requires an RNA gene that contains multiple primer sequences. These sequences could be excised by a ribozyme similar to RNAse P (Guerrier-Takada et al. 1983) in order to generate the primers. As these primer source genes are likely to be quickly degraded after their synthesis it appears unnecessary for them to be membrane bound. In addition, given that a single template strand can be used to create multiple catalytic strands, it stands to reason that using different primers for each strand could optimise the utilisation of available nucleotides.

Typically, one would not expect a sequence that acts as a primer to contain an RNA structure. However, the requirement for a ribozyme to recognise the sequences to excise it may explain why the minihelix (RNA primer) has a stem-loop structure. The large ribosome subunit RNA has been proposed to emerge from a gene containing a multiple tRNAs sequences (Tamura 2011; Farias et al. 2014; de Farias et al. 2016). This has previously been explained by a fusion of tRNAs; however, a remarkable possibility is that the proto-ribosome first emerged from such a primer source gene.

Discussion

The evolutionary transition from the RNA world to the cellular world, as outlined here, can be condensed into six key stages, as illustrated in Fig. 2. These stages encompass the following: (1) the emergence of the replicator, (2) the development of predation, (3) the emergence of a non-template decoding proto-ribosome, (4) its development to a template decoding proto-ribosome, (5) the introduction of membranes pores, and (6) the colonisation of the internal lumen.

Fig. 2
figure 2

Proposed stages of evolutionary transition from the RNA world to the cellular one. RNA genes are shown in orange. Peptidyl groups are represented with square boxes with hydrophobic domains in cyan and hydrophilic domains in red. Nucleotides are shown in light green circles, whilst tRNA-derived primer are shown in dark green. For clarity, the RNA genes are not always shown

The first stage is the emergence of an RNA (or RNA network) that catalyses its own replication. Given the utility of primers for RNA replication, it is intriguing to consider that this RNA replicator was a nuclease that excised RNA motifs, available in a randomly synthesised pool, to serve as primers for its own replication. This mechanism could have been effective in catalysing its own replication if the template-based extension were a non-enzymatic reaction (Szostak 2012). Although an RNA polymerases may also be viable as a RNA replicator (Tjhung et al. 2020), this alternative would require cooperation of a two polymerases to achieve replication.

The emergence of a predation-based ecosystem is proposed to be the major driver of selection in this model and a reason for increase of complexity with time. The emergence of predation would seem necessary if the RNA replication rate is limited by the availability of nucleotides. I argue that its emergence creates an ecosystem that prevents the RNA world from undergoing decay (Boltzmann 1974) and instead drives increasing RNA cooperation and compartmentation. It is likely that the importance of nucleotide availability would only wane after the emergence of nucleotide biosynthesis in protocells. Owing to its importance to the model proposed here, the examination of a ribozymes capability to degrade RNA is a significant means to evaluate it.

The predation of RNA-by-RNA lifeforms was unlikely to be rapid process as such an event would necessitate the existence of extremely effective, indiscriminate RNA nucleases which likely would result in deleterious exchanges between predators. Instead, it may be a decomposition like process, i.e. a slow consumption process that may be principally focused on RNAs transcripts that were already partly hydrolysed.

The latter stages consist of various stages in the development of the ribosome and essentially posits that this occurred to achieved superior compartmentation. The proposal that the template decoding proto-ribosome emerged to better fulfil the role of the non-template decoding proto-ribosome, as opposed to production true proteins is reasonable. Although the evolutionary selection behind the emergence of membrane pores is not immediately obvious, if offers a plausible pathway towards cellularisation.

It is apparent that DNA emerged relatively late in the development of the protocell, for instance, it is a genuine possibility the genome of the Last Universal Common Ancestors (LUCA) was composed of RNA instead of DNA (Mushegian and Koonin 1996; Forterre 2006). Owing to the relative instability of RNA (Koonin 2014) argued that this would require that the genome of protocells to consist of numerous RNA segments and that cell division is achieved by the statistical segregation approach. In order to reduce the stochasticity with the segregation of genes during cell division it is possible that some form tethering of RNA to membranes persisted after the adoption of the cell-like structure perhaps until the emergence of DNA genomes (Koonin 2014). For this to be achieved with peptidyl-tRNAs however would require two distinct modes of translational termination during this protocellular stage: one that maintained the peptidyl-tRNA bond to create membrane anchors and another that broke this bond to produce the other proteins. The potential existence of distinct translation termination mechanisms is corroborated by the appearance of release factors (the proteins that break the peptidyl-tRNA bond) only after LUCA (Maxwell Burroughs and Aravind 2019).

Support for the obcell hypothesis comes from the structure of the oldest (core) ribosome proteins. Based in part on its structure and characteristics, (Smith et al. 2008; Hartman and Smith 2010) have proposed that the original function of the core ribosome proteins was to anchor the primitive ribosomal RNA to the membrane.

The obcell hypothesis also aligns well with the “adaptive” model of genetic code. This proposes that the code is optimised to reduce the impact of translational errors (Knight et al. 1999). Haig and Hurst (1991) argued that the code maximises the correct placement of hydrophobic and hydrophilic residues (Haig and Hurst 1991; Chiusano et al. 2000). The authors even speculated that translated peptides produced during the optimisation of the genetic code had a role as membrane anchors (Haig and Hurst 1991). This optimisation is possible with the obcell hypothesis given that it proposes a stage of development where the only proteins produced were membrane anchors and as such the code would not be frozen (Crick 1968). This does not imply, however, that the obcell had developed the universal genetic code. For example, it suggests that whilst the codon GUU coded for a hydrophobic amino acid, but it was not necessarily valine. The universal genetic code specification may have emerged at a later stage by substitution of residues with those of similar properties. The evidential value of this aspect of the genetic code for the obcell hypothesis is also tempered however by other hypothesis that have been proposed to explain its nature (Trifonov 2000; Crick 1968; Szathmáry 1999; Baranov et al. 2009).

The obcell hypothesis offers a framework for understanding multiple facets of cell origin. Notably, it avoids proposing singular transformational events or requiring major environmental demands. Instead, it emphasises gradual transitions and the emergence of an ecosystem that continually selects for greater cooperation.