Introduction

In present organisms, cellular macromolecular content is highly concentrated and remains between tightly bounded values (Fulton 1982; Zimmerman and Trach 1991). In Escherichia coli, this constancy of cell content holds across many physiological conditions and through experimental evolution (Schaechter et al. 1958; Bremer and Dennis 1996; Lenski et al. 1998). However, to our knowledge, no previous work has been much concerned with the driving forces and evolutionary mechanisms leading to highly crowded cellular states.

The idea that high total macromolecular concentration and the associated little free water available could condition cytoplasm organization and function is by no means new (e.g., Minton 1981). Effects of macromolecular crowding have been reported in phenomena spanning several levels of cellular organization: enzyme kinetics, protein folding, biochemical pathways, signal transduction processes, cell volume regulation, active transport, DNA conformation and function, among others (Minton 1983; Zimmerman and Minton 1993; Garner and Burg 1994; Walter and Brooks 1995; Zimmerman and Murphy 1996; Rohwer et al. 1998; Bray 1998; van den Berg et al. 2000; Kornberg 2000; Al-Habori 2001; Ellis 2001; Goobes et al. 2003; Ellis and Minton 2003). All these effects originate at the physiochemical level of nonspecific macromolecular interactions and have been studied in well-defined experimental systems (Zimmerman and Minton 1993; Hall and Minton 2003).

Regarding the potential advantages of a crowded environment, several considerations and findings have been made previously. Cellular crowding has been invoked to explain stabilizing mechanisms for DNA-DNA and protein-DNA interactions (Goobes et al. 2003; Record et al. 1998). For example, exposing E. coli to a wide range of osmolarities did not affect the kinetics and equilibria of protein-DNA interactions, suggesting that this was the result of compensation mechanisms underpinned by macromolecular crowding (Record et al. 1998). In another context, a study by Westerhoff and colleagues showed how crowding reduced the dissociation constants of enzyme complexes, leading to longer-lived functional structures (Rohwer et al. 1998). Furthermore, macromolecular crowding—along with diffusion—was described as a fundamental constraint in a discussion of the evolution of cell signal transduction pathways (Bray 1998). In the context of enzyme complexes sustaining sequential metabolic pathways in the cell, Srere (1987; Srere and Ovadi 1990) linked high macromolecular concentrations to tendencies toward organizing cytoplasm into discrete microenvironments—“metabolons”—where substrate channeling could take place.

The physiochemical considerations described above provide evidence that some of the changes in the properties of proteins, in the evolution from a diluted to a crowded cytoplasm, could confer the organism an evolutionary advantage. However, the changes in other proteins, for example, the change in the catalytic activity of certain enzymes, could contribute negatively to fitness (see, e.g., Cayley et al. 1991).

To explain the origins of a crowded cytoplasm, in this work we shall move from the molecular level to the cellular level of description, that is, from macromolecular interactions to systemic physiological processes. We shall show that the sole increase in the concentrations of proteins, without having to invoke modification of their properties due to nonspecific interactions, suffices to elaborate a plausible physiological explanation about the evolutionary advantage of having a highly concentrated intracellular milieu. First, we shall test the effect of increasing overall protein concentration in a model of E. coli that describes its cell cycle dynamics and long-term experimental evolution of cell volume and growth rate. Next we show that the results obtained with this particular model can be generalized using model free arguments from control analysis, macromolecular crowding, conservation constraints, and scaling, among others. Finally, we sketch a plausible mechanism for the evolution toward a highly crowded cytoplasm and discuss a way in which the existence of the newly proposed driving force, acting at the physiological level, could be tested experimentally.

Crowding and Growth Rate

A Bacterial Model

In order to test the effect that increasing overall cellular protein concentration has on growth rate, we used a mathematical model of E. coli previously developed to explain the parallel increase in cell volume and fitness in laboratory evolution experiments (Graña and Acerenza 2001).

In Richard Lenski’s (2004, 2005) laboratory, populations of Escherichia coli have evolved, under controlled conditions, during thousands of generations. In these experiments, the bacteria showed a parallel increase in mean cell volume and relative fitness (Lenski and Travisano 1994). From a purely scaling perspective the result was rather unexpected, since it implied that a larger organism—with a lower surface-to-volume ratio and a smaller input of nutrient per unit volume—would grow faster. We found that explaining this parallel increase required considering processes operating at two levels: the organism and the population (Graña and Acerenza 2001). First, the physiology relevant to the evolutionary question was implemented in an organismic model. Our aim was to describe how cell volume, ν, and relative fitness (i.e., growth rate of the evolved populations relative to the ancestor, μ/μ0) change through evolution. Therefore, the relevant physiology to answer the evolutionary question was the relationship between cell volume and growth rate, ν vs μ (Donachie and Robinson 1987), and the response of the growth rate to changes in external nutrient concentration, μ vs X, represented by the so-called Monod (1949) equation. These physiological responses were implemented in a modular model consisting of only three variables: cell volume, ν, number of molecules of nutrient internal to the cell, n, and number of molecules of signal for initiation of DNA replication, s (see Appendix). The temporal behavior of these variables during the cell cycle was described by a system of three differential equations. Finally, we considered the processes operating at the population level. It has been experimentally shown that many catabolic functions, e.g., the capability to degrade substrates not present in the medium, decay during evolution (Cooper and Lenski 2000). This genetic decay was incorporated in the model via a simplified population genetics formalism. The resulting model, combining the physiology and population genetics, succeeds in reproducing, quantitatively, the parallel increase in cell volume and fitness, and several other properties of the physiology and evolution of E. coli, determined under laboratory conditions (Graña and Acerenza 2001).

The essence of the explanation given by our model is as follows. The decay of unused functions during evolution decreases the consumption of nutrient for physiological adaptation purposes, leaving more nutrient available for growth. Since the organism maintains its composition approximately constant during the cell cycle and through evolution, the surplus of nutrient results in an increase in volume. The rate at which the signal is produced also increases with the amount of nutrient available for growth and the doubling time is reached in a shorter time, resulting in an enhanced growth rate.

In the present paper, the organismic model described above is used to test the effects of changing total protein concentration on growth rate. In the model, the total protein concentration remains approximately constant during the time scale of several cell cycles, in which short-term physiological responses take place. Therefore, it is considered a parameter. On an evolutionary time scale, total protein concentration could change, which may be represented by a slow change in time of the parameter. If this parameter change is advantageous, i.e., it increases the relative fitness of the population, the new value of total protein concentration and its phenotypic consequences would be fixed by selection.

It is important to note that, in the model, all the rates of the processes are proportional to the concentrations of the proteins that catalyze them. Therefore, the formulation of this model represents the situation that would occur in a putative diluted ancestral scenario, where the changes in protein concentrations maintain their proportions and specific properties unchanged. This additive effect on their functions results in macromolecular activities proportional to protein concentrations (this is described in more detail in the next section).

We shall now use the organismic model to simulate changes that could have occurred during the course of evolution. In Fig. 1, we represent the growth rate as a function of the factor by which protein concentrations and activities have been changed. The result obtained is an increasing function. Moreover, the growth rate divided by the surface-to-volume ratio is approximately constant (Fig. 2), indicating a close correlation between them. In conclusion, the analysis of the model suggests that an increase in the concentrations of proteins under additive conditions, i.e., maintaining their proportions and specific properties unchanged, produces an increase in the fitness of the organism.

Fig. 1.
figure 1

Growth rate versus relative total protein concentration. The model used to generate this plot is briefly described in the Appendix (see also Graña and Acerenza 2001).

Fig. 2.
figure 2

(Growth rate)/(surface/volume) versus relative total protein concentration. The model used to generate this plot is briefly described in the Appendix (see also Graña and Acerenza 2001).

In the model, fitness augments and cell volume diminishes indefinitely with higher total protein concentrations. In reality, a limit exists for these tendencies, set by physical and physiological constraints of cellular processes. For example, a higher limit exists to the number of macromolecules that can be accommodated in a certain volume, and protein concentration may not be indefinitely increased. Similarly, there must be a lower limit to the volume of the cell capable of containing the minimal set of macromolecular species able to sustain life (Koch 1996).

In the next section, using model independent physiological considerations, we obtain results on the relationship between growth rate and total protein concentration similar to the ones that we have determined in this section, with the particular model of E. coli.

General Physiological Considerations

Let us consider a diluted intracellular milieu, where concentrations of molecules and macromolecules are very low. This is an additive milieu, because in a highly diluted medium protein association-dissociation processes are displaced to the dissociated forms, causing the macromolecules to function independently, i.e., they only affect each other through metabolite effects (Kacser et al. 1990). In this scenario, when the protein concentrations are increased their proportions and specific properties remain unchanged, resulting in a proportionality relationship between concentration and activity. The alterations occurring in the system are therefore only due to the increase in the number of molecules per unit volume. Under these conditions, the increase in the concentration of all proteins by a certain factor produces two simultaneous effects—an increase in protein activity per unit volume and an increase in protein mass per unit volume—both by the same factor. Next we shall analyze the consequences of these two effects on the growth rate of a unicellular organism.

In an in vitro biochemical system, the protein concentrations are externally adjusted, i.e., they are parameters. Owing to the additivity property described above, if the experimenter increased them by a factor k, the processes would be performed, and their products would be produced, k times faster. In addition, in vitro systems operate at constant volume. Therefore, in the time courses of the metabolite concentrations, the same values would be reached in a period k times shorter (Acerenza 1990; Acerenza and Kacser 1990). In contrast to what normally happens in in vitro systems, in living systems proteins and volume are internally adjusted, i.e., they are variables. As mentioned above, the increase in protein activity per unit volume k times comes together with an increase in the protein mass per unit volume by the same factor. Mass increase takes time, contributing to decrease the growth rate. But since all the processes occur k times faster, the production of the signal that starts DNA replication would be faster and the time required to reach its threshold value smaller. As a consequence, the cell would divide in a shorter time, resulting in cells with smaller volumes. Smaller volumes give higher surface-to-volume ratios, and more nutrient per unit time is incorporated to the cell per unit volume. Therefore, these higher surface-to-volume ratios produce faster increases in protein mass per unit volume.

In summary, when total protein concentration is increased, there is a simultaneous increase in protein activity per unit volume and in protein mass per unit volume. The activity effect tends to increase growth rate and the mass effect to decrease it. We have argued that the activity effect is quantitatively more important than the mass effect, the increase in total protein concentration resulting in a net increase in growth rate. Note that, in these arguments, a key property leading to a faster growth rate is the increase in the mean surface-to-volume ratio, which is in agreement with what we found for the specific model of E. coli described under A Bacterial Model, above (see Fig. 2).

The general physiological reasoning that we have developed shows that even in an ideal additive scenario, where macromolecular nonspecific interactions are still not very significant, it would be expected that unicellular organisms showing a higher total protein concentration grow faster, being more fit. It is noticeable that this conclusion arises within a systems perspective, without postulating detailed mechanisms.

Final Remarks

We have argued that the highly concentrated cytoplasm found in contemporary unicellular organisms could be explained by an increase in fitness, produced by the effect that increasing protein concentrations, under additive conditions, has on cellular physiology. Superimposed on this effect are the contributions of crowding originated in nonspecific macromolecular interactions, which may increase or decrease fitness.

Another major component of fitness is the way in which the total protein concentration is distributed among different proteins performing the whole set of cellular functions. For a fixed total protein concentration, increasing the concentration of an enzyme, for example, would cause a fall in the concentration of others, resulting in perturbations of metabolic variables. This type of perturbation, called protein burden, has been measured experimentally (Koch 1983; Snoep et al. 1995). Under constant environmental conditions, one could hypothesize that in cellular organisms there is a distribution (or set of distributions) of enzyme concentrations maximizing fitness, which would be most difficult to determine with our present level of knowledge. However, in simple metabolic systems, the distributions of enzyme concentrations that maximize or minimize a variable, e.g., that maximize the flux of a metabolic pathway, can be calculated using standard optimization procedures (see, e.g., Heinrich et al. 1991; Heinrich and Schuster 1996; Kacser and Beeby 1984; Ortega and Acerenza 1997). In the present work, we did not treat the effects of enzyme distribution on fitness. On the contrary, we studied the effect of changing total protein concentration, while leaving the proportion of protein allocated to each enzyme or process unchanged.

Under our physiological arguments, a plausible mechanism of evolution of total cellular protein concentration could be as follows. At the early stages of prebiotic evolution, it is more likely that rather diluted environmental conditions prevailed (Lazcano and Miller 1999). If the first cells originated at this stage, the protein concentration increase was driven solely by its effect on cellular physiology, in the absence of nonspecific interactions. At some point, protein concentrations sufficiently high for macromolecular nonspecific interactions to enter into operation in the cytoplasm were reached. In contrast, if the first cells appeared only after the prebiotic conditions attained a rather concentrated state, then their protein concentration increase would be driven by the effect on cell physiology and the effect of nonspecific interactions acting simultaneously. Nonspecific interactions could have positive or negative effects on fitness. Advantageous interactions reinforce the positive effect on physiology and are retained. In contrast, disadvantageous interactions may prevent additional increases in protein concentration by neutralizing the advantageous effect on physiology. These would be erased by mutation/selection, and only protein variants not promoting such negative interactions would be kept. As a consequence, further increases in protein concentration driven by the effects at the cellular level, or by newly evolved favorable nonspecific molecular interactions, could be obtained.

This process would continue until a highly crowded cytoplasm was obtained. At this stage, further small increases in total macromolecular concentration are expected to have major simultaneous effects on the thermodynamic and kinetic properties of many intracellular reactions. Therefore, a slowdown in the rate at which evolution to higher concentrations takes place would occur. In addition, concomitant to total macromolecular concentration increase during evolution, physiological mechanisms to maintain and restore this concentration, similar to those exhibited by present organisms, would have to have evolved to prevent harmful effects of external perturbations (Minton 2001).

The experimental test of the effect of increasing total protein concentration on growth rate would require measuring these physiological effects in the absence of nonspecific interactions. This poses a problem because, in the highly crowded cytoplasm of present living cells, the thermodynamic and kinetic effects of nonspecific interactions dominate. Note, for example, that with an increase in the external osmolarity, the cytoplasmic protein concentration of E. coli K-12 could be increased by more than 50%, resulting in a linear decrease in the growth rate (Cayley et al. 1991). This behavior can be explained by the disadvantageous effect of the newly added nonspecific interactions. The physiological effect of increasing the total protein concentration proposed in the present work, could well find a test bed in experimental chemical/biochemical systems similar to those that have been built to mimic essential properties of life, such as self-reproduction and homeostasis (Zepik et al. 2001; Luisi et al. 2004). It appears that more elaborated systems than the ones that have been constructed up to date would be required, but adequate synthetic systems would probably be obtained during the accomplishment of the challenging project of synthesizing life (Szostak et al. 2001).