Keywords

Introduction

In his book “The origin of species by means of natural selection” (Darwin 1859), Charles Darwin manifested a deep frustration justified by the realization that natural selection is too slow to be observed in real time. He admittedly based all his conclusions on observations or indirect measurements of the action of natural selection and reported many evidence supporting his conclusion: “That natural selection will always act with extreme slowness I fully admit.”

Darwin, if lived today, would be enthralled by the fact that the process of natural selection and the mechanisms underlying them could be directly tested in a reasonable short time using microbes. Microbes offer a unique opportunity to observe and test the mechanism of natural selection and the general principles of evolution. This is mainly due to the short generation times, small genome sizes, and deep microbes genetic and physiological characterization. These features and the feasibility of evolving microbes in the laboratory with the current technology under controlled conditions and at high “speeds” make them ideal systems to put the main principles of evolution to test and unearth the dynamics underlying the evolution of biological complexity (Kawecki et al. 2012). In addition to the possibility of conducting laboratory-supervised evolution experiments, the next generation sequencing technology (NGS) has enabled sequencing hundreds of microbial genomes at once, linking particular genome dynamics to microbes’ lifestyles.

In this chapter, I will discuss the many different scenarios under which microbes have been evolved in the laboratory, how did NGS contribute to the understanding of the genomes dynamics behind specific adaptive processes, and the main conceptual breakthroughs derived from these studies.

What Makes Microbes Attractive to Test Evolutionary Processes?

Eighty-five years ago, August Krogh articulated a principle (Krogh principle) after which experimentalists should choose the model organism that can best foster a clear and direct experimental design and a rigorous and unambiguous result and interpretation (Krogh 1929). Krogh principle is particularly useful when testing evolutionary processes, as these are often dominated by very complex patterns that are intermingled and many times shaped by the environment.

The general principles of Evolutionary biology has been historically built based on indirect theoretical and comparative studies (Futuyma 1998), lacking rigorous experimentation proof. There are several reasons for the lack of experimental studies probing principles of evolution. Mainly, it remains difficult identifying the dynamics of natural selection leading to the fixation of advantageous mutations at specific episode of organisms’ evolution. Some of the reasons for this difficulty are the impracticality of replicating the complex mix of environmental conditions under which populations grew at some stage during their evolution and the slow pace at which natural selection acts. In this sense, microorganisms offer a unique opportunity for studying evolution as they present large populations sizes, short generation times, small genome sizes, and enormous physiological plasticity. Noticeably, microorganisms are not equipped with complex homeostasis systems, and thus their phenotype is largely the result of their genetic composition interacting with the environment. In addition to this convenient feature, microbes present a puzzling diversification whether measured in terms of the number of species (Dykhuizen 1998; Gans et al. 2005), habitat range (Pikuta et al. 2007), or the breadth of energy sources and biochemical pathways they can exploit in order to survive (Pace 1997).

The hallmarks of experimentation of any kind are control and replication. In evolutionary biology, controlling environmental conditions, especially when conducting experiments out of the laboratory, is difficult if not impossible. However, the fact that enormous population size of microbes could grow in tiny spaces (for example, a drop of culture medium) makes it feasible growing hundreds of microbial populations in a standard laboratory space. Moreover, microbiologists have successfully harnessed bacterial evolution and domesticated them to grow under laboratory-controlled conditions. Hundreds of microbial populations can be then propagated and analyzed simultaneously. If maintained evolving separately, with no cross-contamination, such populations can be used to test the repeatability of evolutionary processes (Lenski et al. 2000), to understand the physiological plasticity of bacteria growing under different carbon sources, and reproduce ecological scenarios of more complex organisms. In summary, experimental evolution allows determining the selective forces operating, and by virtue of replicating the experiment, researchers can distinguish between deterministic and stochastic effects.

Environmental control is one of the most important advantages of using microbial populations because we can grow homogenously distributed populations in an environment in which single factors can be modified. In this new single-factor modified environments, that reproduces ancestral environments, many hypotheses can be tested, including how novel physiologies emerge to adapt to a new environment, the population dynamics of generalists and specialists, and the role of contingency in the adaptation to novel conditions and the trade-offs that such adaptations involve (Bennett and Lenski 2007; Bronikowski et al. 2001; Lee et al. 2009).

The large population sizes of microbes offers an analytical advantage, which is concerned with the higher likelihood of originating novel adaptations through mutations. The rationale is simple: in a small space of culture liquid billions of microbial cells can be kept and propagated, thereby avoiding the effect of genetic drift and directly testing the role of natural selection. During DNA replication, or even protein translation, there is a low but finite probability of an error in replication. The probability of occurrence of such a mutation is the product of the population size and the mutation rate. Therefore, the larger the population size the greater is the number of mutations originating in the population and the higher is the probability of a mutational novelty emerging. Because selection is strong when population sizes are large, the probability of fixation of beneficial mutations is very high. It follows then that the rate at which evolution occurs is high in microbial populations, making it possible reproducing adaptive evolution in real time. Indeed, in long-term evolutionary studies on microbial populations, every single nucleotide base pair should have experienced at least one mutation, and thus have undergone selection filtering (Lenski et al. 2003).

Finally, unlike multicellular organisms that require at least days or weeks to produce a new generation, microbes require minutes or hours. This allows beneficial mutations to become quickly fixed in the populations. For example, thermo-resistant mutations can become fixed in the microbial population within 15–20 days after initiating an evolution experiment (Bennett and Lenski 2007; Elena and Lenski 2003). Moreover, the enormous linkage disequilibrium of microbes ensures their clonal transmission for thousands of generations preserving the ancestral genetic background. This, in addition to the possibility of freezing evolved cells that can be thawed again, permits building a microbial fossil record and perform genome archeology at any time of the evolution experiment (Lenski et al. 2003; Ostrowski et al. 2008).

Experimental Evolution and Mutation Accumulation Dynamics

Experimental evolution combined with whole-genome re-sequencing is a promising strategy for investigating the dynamics of evolutionary change. One of the questions that have motivated efforts in reproducing an evolutionary scenario is how repeatable is evolution. The fragmentary nature of the fossil record cannot provide a full picture that would allow answering this question, and even if it did we are not certain what kinds of environments or adaptations have not been explored by nature. Instead, reproducing fine-tuned scenarios in a test tube containing billions of bacterial cells can shed light on the complexity of evolutionary patterns.

Evolution experiments start with an initial population of microbes genetically identical and adapted to an ancestral environment (Fig. 7.1). Adaptation is determined by the Malthusian growth parameter of the population and is considered to be proportional to the relative fitness of the population. Fitness in experimentally evolved populations is measured as the capacity of such descendent populations to compete head-to-head with their ancestors. These two populations, the evolved one and its parental ancestral population, can be compared because they can be brought together in the same place at the same time. We can compare the performance of the descendant and ancestral populations by quantifying the number of offspring that each leaves in the next generations in an environment in which the carbon source is common for the two differentiated populations. Populations are propagated between generations by diluting 0.1 ml of the grown culture in 9.9 ml of a new culture. To determine how repeatable is evolution, many different independently evolving lineages are generated from the same ancestor, and thus originally presenting the same genetic background and evolved in parallel (Fig. 7.1). The many different evolutionary paths followed by each of the independent evolving lines can be then compared and their differences quantified.

Fig. 7.1
figure 1

Experimental evolution of microbes in the laboratory. From a single cell (ancestral cell) many replicate populations are generated, all being genetically identical and evolving for many generations independently. To assay the biological fitness of the evolved population at any time point, equal proportions of this population and of its ancestral one are mixed in the same medium. Both cells, the evolved and ancestral ones, should be distinguishable, for example through a metabolic marker that yields distinctly colored cells, to determine the relative frequency of each population at the start and end of the competition experiment. Improved fitness of the evolved population is reflected in a higher proportion of evolved cells than ancestors in the competition experiment

As I explained earlier, microbes are genetically represented by one chromosome. The gamma-proteobacterium Escherichia coli strain K12 MG1655 is the one most used in experimental evolution of microbes. Most bacteria, including E. coli, present highly dense genomes, with the genome size reflecting the number of genes (Giovannoni et al. 2005; Mira et al. 2001). The high gene density of these genomes and large linkage disequilibrium means that the mutational load is expected to increase as generations pass by without disrupting previous genetic backgrounds and that most changes will be affecting coding genes or regulatory regions. This means that we can directly associate particular nucleotide mutations to specific phenotypes and follow the history of interesting mutations since the last common ancestor of all the founded bacterial populations. Likewise, the yeast Saccharomyces cerevisiae has been used in its haploid or diploid genetic structure as a model to test specific evolutionary processes through experimental evolution. Here I provide examples of how NGS performs a powerful tool when combined with experimental evolution to unearth the rules governing fundamental evolutionary processes.

The Evolutionary Trajectories of Adaptive Mutations

NGS has been developed reaching a stage in which single minority mutations can be identified at low frequencies and their origin traced through reviving evolved cells at different time points of an evolution experiment. For example, the final stages of the fixation of an adaptive mutation can be identified by mixing equal proportions of bacterial cells labeled with two different tags (Hegreness et al. 2006). Combining cost-effective Illumina re-sequencing with experimental evolution makes it possible to sequence several hundreds of individuals from an evolved population, generating estimates of allele frequencies at millions of single-nucleotide polymorphisms (SNPs) genome-wide (Burke 2012; Burke et al. 2010; Burke and Long 2012; Futschik and Schlotterer 2010). This is important not only to identify rare variants but also to determine with unprecedented accuracy the evolutionary trajectories of adaptive mutations.

Evolution experiments seeking to identify adaptive evolution derive populations from a single ancestral genotypes, and thus genetically identical, in a constant environment or an environment with constant fluctuations. This is achieved by a continuous culture of populations in which the input of resources and the removal of individuals occur at a constant and controlled way. Alternatively, a fraction of the grown population is passaged to a new culture medium. When an adaptive mutation emerges in such an environment, this drives the evolutionary dynamic of the population, so that the average fitness of the population increases gradually. When several adaptive mutations emerge, synergistic epistasis among them, that is interactions between mutations that increases the effects of single mutants on fitness in a non-linear fashion, leads to diminishing-returns epistasis: each mutation has lower beneficial effect for the individuals in the presence of another beneficial mutations than if it appeared alone in the ancestral genetic background (Chou et al. 2011; Khan et al. 2011; Kvitek and Sherlock 2011). Regardless of whether or not diminishing returns take place, beneficial mutations will lead populations to climb peaks in a fitness landscape (Fig. 7.2) (Orr 2009a, b). In the absence of interfering mutations, beneficial mutations will undergo refinement and selective sweep in the population (Atwood et al. 1951; Barrick and Lenski 2013). However, in asexual populations it is more frequent to observe cases in which the beneficial mutation needs to displace other beneficial mutations emerging during its fixation, thereby slowing down the fixation rate of adaptive mutations. This effect, known as clonal interference (Fogle et al. 2008; Miralles et al. 1999), has been shown to be frequent in asexual populations of influenza (Strelkowa and Lassig 2012), the bacteriophage phiX174 (Pepin and Wichman 2008), bacteria (de Visser and Rozen 2006), and yeast (Kao and Sherlock 2008; Lang et al. 2013) but has only been characterized in yeast by deep sequencing yeast populations at frequent intervals (Lang et al. 2013).

Fig. 7.2
figure 2

Fitness landscape of an evolving population. Peaks represent regions of maximum relative biological fitness while valleys are regions of low fitness. In a smooth landscape (a) populations (spheres) can cross the valleys of low fitness without yielding lethal phenotypes (e.g., these populations go from high mean fitness to intermediate mean fitness). In rugged and complex landscape (b), crossing the valleys of low fitness is lethal and precludes populations from reaching new local fitness maxima through gradual evolution. This figure is taken from (Henderson et al. 2013), with author’s permission

Adaptive mutations need to be distinguished from those that are innovative, leading to new phenotypes adaptable to novel environments. Many research studies in this area have shown that such innovative mutations are often sudden and involve only one-to-few mutations. The identification of these mutations has been possible through the use of NGS, which has also enabled disentangling beneficial mutations from innovative ones. For example, in a recent study, Marchetti and colleagues showed that an experimentally evolved chimeric Ralstonia solanacearum strain, derived from a plant pathogen, could establish a symbiotic mutualistic association once evolved experimentally. This change in lifestyle occurred upon colonizing root nodules and was due to a single non-synonymous (amino acid replacing) mutation in the gene hrpG that encodes a protein regulating the expression of several virulent factors (Marchetti et al. 2010). In another study in which authors conducted a long-term evolution experiment with E. coli (LTEE), E. coli adapted to a glucose-limited medium, which also contained the bacterium-unusable citrate, evolved the ability to metabolize citrate after 30,000 generations in one of the 12 original replicate populations with which the experiment commenced (Blount et al. 2008). The emergence of this innovation required a single genome event in earlier generations (an enabling mutation), consistent on a chromosomal duplication that placed a transcription promoter upstream of a Citrate transporter-encoding gene (Blount et al. 2012).

The concept of genetic background and enabling mutations is very important to understand the term “evolvability”—the capacity of individuals or genotypes to evolve and adapt to a wide set of different conditions. Indeed, the combination of alleles existing in the population may well condition and constrain the evolutionary trajectories of new alleles, through either altering mutation rates or conditioning the nature and strength of epistatic interactions with new mutations (Meyer et al. 2012). The actual dynamics underlying the enabling effect of neutral mutation networks has been investigated in very simple systems, such as RNA folding (Wagner 2008), however, the role of enabling mutations versus compensatory mutations—those compensating the effects of destabilizing innovative mutations—remains the ground of intense investigation and debate.

As discussed earlier, populations with high mutation rates increase the per-capita chance of acquiring a beneficial mutation. In LTEE, the frequency of hyper-mutators is high, rising mutation rates 100-fold compared to that of the ancestral population (Mao et al. 1997). However, in recent studies it has been shown that hyper-mutators in experimental populations are generally followed by phenotypes with slow mutation rates, probably because such phenotypes prevent the loss of adaptive mutations in the populations and lower genetic load (Sniegowski et al. 2000; Wielgoss et al. 2013).

Convergent Evolution in Bacterial Experimental Populations

One of the most important questions yet unanswered is how repeatable is evolution. In particular, what is the role of contingency in the fixation of adaptive mutations? In a recent study (Tenaillon et al. 2012), authors evolved 115 E. coli populations for 2,000 generations of the bacterium to adapt to 42.2 °C, a complex environmental factor to which many pathways of the organism respond. To determine the diversity of adaptation of E. coli to high temperatures, they started the experiment from a single ancestral cell adapted to 37 °C. After 2,000 generations, the genome of one clone from each of the 115 experimentally evolving populations at 42.2 °C was sequenced. In addition to genome sequencing, the relative fitness of the evolved clones was assessed, observing a significant increase of fitness of the evolved strain at 42.2 °C in comparison with their ancestor. Interestingly, in 18 of the 115 lines, authors found a shared mutation in codon 966 of the RNA polymerase β-subunit (rpoB), and 17 lines contained an amino acid replacing mutation in codon 15 of the rho gene. In general, 20.2 % of genes mutated convergently in their experiment and 24.5 % of operons were convergently affected by mutations. This significant convergence was strongly driven by the epistatic interactions between new alleles. These experiments demonstrate that while the range of adaptive pathways may be bewildering, epistasis and genetic background can constrain the set of possible solutions to adapt to an environment, making evolution somewhat predictable.

Experimental Evolution Under Inefficient Natural Selection

To study the spectrum of mutations, researchers have evolved microbes, such as E. coli and S. cerevisiae, under controlled laboratory experiments and re-sequenced their genomes at different time points of the evolution experiment. Because the main objective of these experiments is to identify the breadth of mutations occurring in the genome, and calculate the rates of mutations, such populations have been evolved under very inefficient natural selection: replicates of evolving lines were single-colony transferred to new plates and this was repeated for hundreds or even thousands of generations (Fig. 7.3). These experiments have been useful to determine the spectrum and rate of mutations in E. coli (Lee et al. 2012) and S. cerevisiae (Lynch et al. 2008).

Fig. 7.3
figure 3

Experimental evolution of populations of yeast under inefficient natural selection. Many replicate populations derive from a single yeast cell. To impose population bottlenecks and genetic drift a single colony is transferred to the new environment (plate). In the figure example, five independent lines of evolution started and evolved for many generations. At specific points of the evolution experiment, whole-genome sequencing and growth curves are conducted and mutations mapped in the reference genome

Purifying selection generally precludes the fixation of innovative mutations because they are generally destabilizing owing to the trade-off between current and novel adaptations (DePristo et al. 2005; Wilke et al. 2005; Zeldovich et al. 2007). There are a number of scenarios in which innovative mutations can be fixed under inefficient natural selection, including gene duplication (Ohno 1999), and systems with over-active mechanisms of mutational robustness, such as over-expressed molecular chaperones (Moran 1996).

How does gene duplication enable the fixation of innovative mutations? After the duplication of a gene, the two daughter copies are virtually identical, hence functionally redundant, with some exceptions that include non-duplicated regulatory elements, moving of one gene copy to a differently transcribed genome region or allele ancestral polymorphism (Lynch and Katju 2004). Such exceptions may well determine the spectrum of subsequent mutations of each gene copy, and consequently the functional fates of duplicates. The asymmetry between gene copies is avoided in many biological systems such as yeast through whole-genome duplication (WGD) but not through small-scale duplications (SSD). Accordingly, a number of studies have shown that the mechanism of duplication can determine the persistence of genes in duplicate, with WGDs being more prevalent among central genes in the network (although with some exceptions depending on the organism (Alvarez-Ponce and Fares 2012)), they are refractory to subsequent SSD events and dosage sensitive (Carretero-Paulet and Fares 2012; Conant and Wolfe 2006; Fares et al. 2013; Hakes et al. 2007; Makino and McLysaght 2010). These studies have shown that SSDs are more likely to present redundancy, hence mutational robustness and evolvability (Draghi et al. 2010), than WGDs. In particular, Fares and colleagues conducted a simple mutation accumulation experiment in which five lines of S. cerevisiae haploid strains derived from a single ancestor deficient in a mismatch repair gene (msh2) were evolved independently under strong genetic drift. They passaged these lines periodically by single colony transfers from one generation to the next for 2,200 generations. The whole genome of one colony was sequenced from each line and the distribution of non-synonymous SNPs in duplicates and singletons identified. As predicted by theory, SSDs showed significantly larger number of non-synonymous SNPs than singletons and WGDs, supporting larger redundancy for SSDs than WGDs (Fares et al. 2013).

Experimental evolution has also been used to determine the role of a molecular chaperone in ameliorating the effects of deleterious non-lethal mutations. In an experiment in which several independent E. coli lines were subjected to single-colony passages, authors assessed the fitness of evolved population by competing them head-to-head to their ancestral population. After 3,200 generations of experimental bottlenecked evolution, cells presented half as much fitness as their ancestors owing to the increase in the deleterious mutational load owing to strong genetic drift effects. Over-expression of GroEL, a molecular chaperone essential in E. coli and which folds other proteins in the cell (Fayet et al. 1989; Lin and Rye 2006), allowed the recovery of about 88 % of the fitness of evolved cells (Fares et al. 2002). Interestingly, groESL, the operon encoding the chaperonin GroEL and its cofactor GroES, is abundantly synthesized in endosymbiotic mutualistic bacteria (Ahn et al. 1994; Aksoy 1995) that undergo strong genetic drift during their clonal transmission from mother host to the offspring (Buchner 1965). Experimental evolution of E. coli under inefficient natural selection reproduced therefore the transmission of endosymbiotic bacteria and identified GroEL as a mechanism of robustness against deleterious non-lethal mutations.

Concluding Remarks

Experimental evolution is a powerful tool to reproduce particular evolutionary processes with high repeatability and under tightly controlled environmental conditions. When combined with whole-genome sequencing, experimental evolution can inform on the dynamics underlying adaptations, speed of evolution, role of environment, and evolvability. Current studies have unveiled unprecedented and unexpected outcomes and have revealed complex dynamics to adaptation. While the general principles of evolution by natural selection clearly follow Darwinian laws, the evolutionary trajectories, contingency, constraints, and evolvability of organisms remain largely obscure. Future research in population genomics combined with NGS will be the key for understanding how do adaptations come about, how they interact, and where they lead.