Keywords

Chlamydomonas Is a Powerful Model Organism for Answering Key Questions in Microalgal Lipid Metabolism

Chlamydomonas is a unicellular green alga that can be used to study a broad spectrum of biological processes, including photosynthesis, the algal carbon concentrating mechanism, cell motility, light reception, phototaxis, nutrient stress responses and the cell cycle. Chlamydomonas has a number of useful features that make it convenient for genetic characterization of these processes. It is haploid during vegetative growth, so phenotypes of mutants can be observed without the need to generate homozygous lines (Harris et al. 2009). The nuclear (Merchant et al. 2007), chloroplast (Maul et al. 2002) and mitochondrial (GenBank accession number U03843) genomes have all been sequenced and can be transformed (Boynton et al. 1988; Kindle et al. 1989; Randolph-Anderson et al. 1993). A collection of selection markers, expression cassettes and fluorescent tags has been developed (Harris et al. 2009; Rasala et al. 2014). Large numbers of nuclear mutants can be generated by chemical or random insertional mutagenesis (Tam and Lefebvre 1993). Chlamydomonas has a well-characterized and rapid mating cycle, allowing combination of mutations and the analysis of segregation patterns by tetrad and random spore analyses (Harris et al. 2009).

Chlamydomonas is also a powerful model system for studies of lipid metabolism. Four qualities make it particularly useful for this application: (1) Chlamydomonas can be induced to accumulate TAGs in lipid droplets under nutrient-deprived conditions, for example under nitrogen (N) deprivation (Moellering and Benning 2010; Wang et al. 2009). Thus, Chlamydomonas can be used to understand how TAG is synthesized and assembled into lipid droplets and how algal lipid metabolism is regulated. (2) Its lipid metabolism is primarily regulated by environmental conditions, unlike in land plants, where lipid synthesis and turnover are also tissue-dependent (To et al. 2012). This simplifies the design, analysis and interpretation of experiments. (3) Chlamydomonas is unicellular, which enables higher throughput mutant screens. (4) Chlamydomonas is a member of the green lineage, with thylakoids and a chloroplast similar to those of land plants in structure and composition. Thus, Chlamydomonas can be used to understand processes that are conserved between green algae and land plants, such as the export of fatty acids from the chloroplast (Riekhof et al. 2005).

A significant fraction of current Chlamydomonas lipid studies is focused on the discovery of genes involved in TAG metabolism. Candidate genes with possible functions in TAG metabolism have been found by homology search (Riekhof et al. 2005), transcript profiling under normal growth conditions and N-deprived conditions (Blaby et al. 2013; Boyle et al. 2012; Goodenough et al. 2014; Miller et al. 2010; Schmollinger et al. 2014), and proteomic characterization of lipid droplets (Moellering and Benning 2010; Nguyen et al. 2011). However, the functions of only a handful of these candidate genes have been confirmed experimentally (Boyle et al. 2012; Li et al. 2012c; Yoon et al. 2012). Additionally, mutants with defects in lipids have been identified in screens, but the genes responsible for the lipid phenotypes have only been identified in a small fraction of cases (Cagnon et al. 2013; Xie et al. 2014; Yan et al. 2013; Li et al. 2012c). A comprehensive understanding of Chlamydomonas TAG metabolism will require additional unbiased screens for TAG mutants, combined with efficient methods for identifying the genes disrupted in the hits. Below, we review existing screening and genotyping tools, and present some tools that are in development.

High-Throughput Genetic Analyses of TAG Metabolism Traditionally Include Three Parts

A traditional forward genetic study consists of three major stages: mutagenesis, phenotypic screen and identification of mutated genes. Typically, a screen starts by randomly mutagenizing wild-type cells by radiation, chemicals or DNA insertions (Fig. 10.1). The TAG content in mutants is then analyzed by direct or indirect methods, in single cells or in a clonal population of each mutant. As there is usually a tradeoff between the throughput and the accuracy of phenotypic analyses, phenotyping is often performed in two stages: a high-throughput screen that rapidly identifies candidate mutants (called “primary screen” here) followed by a lower-throughput but higher accuracy analysis of the hits (“secondary screen”). Once mutants with abnormal TAG content are identified, mutated genes can be identified. We will first discuss screen growth conditions and methods of measuring TAG content. We will then discuss available approaches for generating mutants and for identifying mutated genes.

Fig. 10.1
figure 1

A forward genetic study for mutants with abnormal TAG content consists of three steps. Mutants are generated from wild-type cells using random mutagenesis, followed by high-throughput phenotypic analysis (TAG content analysis in this case). Mutants with higher or lower amounts of TAG than wild-type cells, referred as “hits” in the figure and depicted as two cells with large and small lipid droplets, are isolated. The causative mutation in these mutants is then identified using map-based cloning, whole-genome sequencing or PCR-based methods. An example is given for an insertional mutant where a piece of exogenous DNA inserted into an exon of a gene results in perturbed TAG levels

Growth Conditions Can Affect TAG Content and the Classes of Mutants That Can Be Isolated

The most common environmental perturbation used for inducing TAG accumulation is N deprivation. Many other stress conditions such as high light and high salinity also induce TAG accumulation, but to a lesser extent (Fan et al. 2011). When N is replete, TAG is detectable but minimal. Under N deprivation, growth stops and TAG is gradually accumulated (Moellering and Benning 2010; Wang et al. 2009). Upon the resupply of N to N-deprived cells, growth resumes and TAG is gradually degraded or diluted by cell division until it reaches a level comparable with the level before N deprivation (Fig. 10.2a) (Cagnon et al. 2013; Li et al. 2012b, 2014; Tsai et al. 2014).

Fig. 10.2
figure 2

Mutants can be screened for aberrations in the content of TAG or other lipids under several conditions with a variety of approaches. (a) Three examples of mutants with abnormal TAG contents in a certain stage within an N deprivation/resupply cycle are shown. One mutant accumulates a significant amount of TAG without nitrogen deprivation. Another mutant accumulates more TAG than wild-type under nitrogen deprivation. A third mutant has a defect in degrading the TAG accumulated when nitrogen is re-supplied. Note that this is not an exhaustive list of TAG phenotypes, and some mutants exhibit phenotypes under more than one condition. (bf) Several methods are available for identifying mutants with perturbed lipid content at any timeFig. 10.2 (continued) point. (b) In dot-thinFig. 10.2 (continued) layer chromatography (Dot-TLC), lipids are extracted and resolved radially. Each circle corresponds to a lipid species. A scheme is shown for eight mutants with two mutants depicted in black. Note that one of the two mutants over-accumulates one lipid species. (c) In gas chromatography-flame ionization detector (GC-FID) analysis, whole cell transesterification is performed, followed by gas chromatography separation of fatty acid methyl esters (FAMEs). Each FAME is depicted as one peak. FAME profiles are presented for two strains (the solid line depicts one strain; the dashed line depicts the other). Ratios between different FAMEs can be used to estimate the relative abundance of lipid species. The abundance of total FAMEs can also be obtained. (d) After staining with a lipid sensitive dye, cells of each mutant can be analyzed in bulk with the fluorescence quantified in 96-well format. (e) Flow cytometry allows the analysis of the lipid content of each cell of a mutant. Each dot in the graph represents a single cell. For each cell, the fluorescence emitted from a lipid sensitive dye is typically compared to chlorophyll fluorescence to offset the noise caused by extrinsic factors. Median fluorescence from the lipid sensitive dye across the whole population of cells is used as a metric of the TAG content of each mutant. (f) A mixed population of cells of different mutants is subjected to staining and fluorescence-activated cell sorting. Cells with high-TAG or low-TAG content (in the regions indicated by the red triangles) are isolated

Broadly speaking, a N starvation and re-feeding experiment can reveal mutants that differ from wild-type in up to three stages: (1) Under N-replete conditions, some mutants may already accumulate excess or insufficient TAG (e.g. mutant with high steady state TAG, Fig. 10.2a); such mutants could have mis-regulated signal transduction or metabolism during vegetative growth. Considering the low levels of TAG in wild-type Chlamydomonas cells under N-replete conditions, it would be challenging to screen for mutants with lower steady-state TAG levels. However, this may be possible, as one mutant with such a phenotype was isolated with reverse genetic approaches (Yoon et al. 2012). (2) When N is deprived, mutants with defective or enhanced TAG synthesis or signaling pathways will accumulate less or more TAG, respectively, than wild-type cells (e.g. mutant with high TAG induction, Fig. 10.2a). (3) Finally, one could also identify mutants that degrade TAG faster or slower than wild-type cells (e.g. mutants defective in TAG turnover, Fig. 10.2a). In fact, mutants in all three categories have been identified through forward or reverse genetic studies, demonstrating the power of this N deprivation and re-feeding process for revealing processes affecting TAG metabolism (Li et al. 2012c; Terashima et al. 2014; Tsai et al. 2014; Wang et al. 2009; Work et al. 2010; Boyle et al. 2012; Xie et al. 2014; Yan et al. 2013; Cagnon et al. 2013).

Primary Screens Have Assessed TAG Content by Lipid Stains, Lipid Chromatography or a Protein Bio-marker

The major technical goal in a primary screen is maximizing throughput while enriching for mutants of interest. Biochemical methods can quantify absolute TAG amount but are relatively time-consuming. Staining with lipid-sensitive dyes enables high-throughput analysis with a microtiter plate reader or flow cytometer, although one must be cautious, as artifacts can be caused by factors such as inefficient dye penetration.

Screens Based on Lipid Stains Enable Very High Throughput

Lipid-Binding Dyes Differ in Their Spectral Overlap with Chlorophyll, and Propensity for Photobleaching

Several fluorescent dyes have been used to stain lipid droplets in algae and other organisms. These include Nile Red, BODIPY493/503, BODIPY505/515 and LD540 (Moellering and Benning 2010; Spandl et al. 2009; Wang et al. 2009; Govender et al. 2012). For fluorometric estimation of TAG content in Chlamydomonas, the first concern is the spectral overlap between the dye and chlorophylls. Nile Red is the most frequently used dye for Chlamydomonas because Nile Red fluorescence and chlorophyll fluorescence minimally interfere with each other when certain excitation and emission wavelengths are selected (Terashima et al. 2014).

Another concern for a fluorescent dye is its propensity for photobleaching, which compromises the robustness of the measurement. In an in vitro study, photobleaching of LD540 was determined to be threefold slower than Nile Red and 15-fold slower than BODIPY493/503 (Thiele 2011). BODIPY505/515 was reported to photobleach slower than Nile Red (Govender et al. 2012).

To accurately reflect the amount of TAG in cells, the fluorescence from the dye needs to be linearly correlated with the amount of TAG present. In Chlamydomonas, one staining protocol has been shown to yield a decent correlation (R-squared value of 0.7) between the intensity of Nile Red fluorescence and TAG content (Cagnon et al. 2013).

Staining Can Be Improved by Permeabilizing Cells to Facilitate Dye Entry

One major challenge for quantitative staining of lipid droplets is ensuring that the dye can efficiently enter cells. Several treatments, such as heating by microwaving (Chen et al. 2011), a low amount of detergents (Terashima et al. 2014), or presence of organic solvents (Doan and Obbard 2011) have been reported to enhance the staining efficiency in Chlamydomonas or other algae. The concentration of the dye also needs to be optimized, as a very high concentration can cause fluorescence from dye precipitates in addition to lipid droplets (Cirulis et al. 2012).

Lipid Content Can Be Quantified with a Plate Reader or by Flow Cytometry

Three general formats have been used for screening mutants stained with fluorescent lipid-sensitive dyes: a microtiter plate reader, flow cytometry of individual mutants, and flow cytometry of pools of mutants (Fig. 10.2b). Li et al. and Yan et al. stained mutants in 96-well plates and measured fluorescence emitted by lipid-sensitive dyes with a plate reader (Li et al. 2012c; Yan et al. 2013). Cagnon et al. also stained mutants in 96-well plates, but individually analyzed each mutant by flow cytometry (Cagnon et al. 2013). Deviation of the mutant population from the wild-type population in the lipid-sensitive dye fluorescence dimension was considered a phenotype. As flow cytometry measures the fluorescence of single cells, Xie et al. and Terashima et al. isolated high-TAG or low-TAG mutants from a mixed pool of mutants using fluorescence-activated cell sorting (FACS) (Terashima et al. 2014; Xie et al. 2014). Combined with high-throughput sequencing, FACS of pools of mutants has the potential to reveal all the genes involved in TAG metabolism in a single experiment. The throughput and other features of the above methods are summarized in Table 10.1.

Table 10.1 Properties of different TAG primary screen strategies in Chlamydomonas

Lipid Species Can Be Separated Physically in Medium-Throughput Screens

In contrast to fluorophore staining, analysis of extracted lipids can reveal differences in the abundances of individual lipid species. Transesterification of lipids into fatty acid methyl esters (FAMEs) followed by gas chromatography-flame ionization detector (GC-FID ) analysis provides the composition as well as the amount of fatty acids contained in the lipid substrates used (Fig. 10.2b). For the purpose of high-throughput screens, direct transesterification with unprocessed algal cells has been developed. Fatty acid profile analysis has enabled the identification of the sole ω-3 fatty acid desaturase in Chlamydomonas (Nguyen et al. 2013; Pflaster et al. 2014). If the starting number of cells is known, this analysis yields the total amount of fatty acids per cell, which is often affected when TAG content is changed (Terashima et al. 2014). Additionally, certain fatty acid species are enriched in one lipid species compared to the others. The ratio of the abundance of a fatty acid species to the total abundance of all fatty acids serves as an indicator of the amount of the corresponding lipid. Because TAG is generally more saturated than membrane lipids, the ratio of C16:0 to C16:4 fatty acid was shown to correlate with TAG content (Liu et al. 2013).

Chlamydomonas studies could benefit from the adaptation of other high-throughput biochemical lipid screening techniques developed for other organisms. In Arabidopsis, Xu et al. screened for suppressors of a mutant deficient in synthesis of digalactosyldiacylglycerol (DGDG) by performing lipid extraction and TLC robotically (Xu et al. 2003). While lipids are resolved vertically in a traditional TLC, Xu et al. spotted lipid extracts on a TLC plate and applied a solvent mixture to each spot to separate lipid species radially outwards from the spot (Fig. 10.2b). This approach allows parallel analysis of 384 samples, and could be adapted to TAG mutant screens in algae, which are even more amenable to robotic operations.

Screens Based on Lipid Bio-markers Have Also Been Performed

High-throughput screens for lipid aberrations are also facilitated when a biomarker is available for the lipid of interest. The most abundant constituent of the Chlamydomonas lipid droplet proteome, Major Lipid Droplet Protein (MLDP), is highly correlated with TAG in abundance (Moellering and Benning 2010; Tsai et al. 2014). Tsai et al. screened for mutants with defects in TAG degradation by performing immuno-dot blots against MLDP in mutants arrayed in 96-well format. This allowed the identification of a mutant that exhibits a delay in TAG degradation (Tsai et al. 2014).

Raman Spectroscopy Has Potential for Single Cell-Based Screens

Since staining with fluorescent dyes can be biased by the cell penetration efficiency and provides little composition information on lipids, some groups have sought alternative approaches for rapid, single-cell lipid analysis. Single-cell Raman spectroscopy (SCRS ) is a label-free technique for analyzing the in vivo chemical profiles of single cells (Li et al. 2012a). It detects the emitted light from light-excited molecules, with each wave number shift representing a distinct mode of vibration from a specific molecular structure. In contrast to infrared spectroscopy, Raman spectrometry is not compromised by water molecules thus is suitable for analyzing live biological samples. Recently, several groups have developed techniques based on Raman spectroscopy for analysis of lipids in single algal cells and obtained information on lipid amount and unsaturation levels (Wang et al. 2014; Wu et al. 2011). In addition to lipids, SCRS is also capable of quantitatively measuring the concentration of starch in individual microalgal cells such as Chlamydomonas spp. and Chlorella spp. (Ji et al. 2014). Furthermore, new technologies that isolate and sort single cells based on their Raman spectrum have recently been demonstrated (Wang et al. 2013; Zhang et al. 2014a, 2015). The SCRS approach is label-free, non-invasive, culture-independent, rapid, low-cost, and potentially able to simultaneously track multiple metabolites in individual live cells. As throughput, ease of deployment, cost, and multiplexing are all important considerations in design of assays for detecting and measuring intracellular compounds, such advantages can be exploited for development of many novel applications such as screening of microbial cell factories and optimization of bioprocesses.

Quantitative Analysis and Microscopy Are Used to Confirm TAG Phenotypes

GC-FID Can Be Used to Analyze the Fatty Acid Composition of TAG

The profile of the fatty acids contained in TAG is very useful for determining which pathway is responsible for the extra or missing fatty acids. For example, the pgd1 mutant showed a major deficiency in the 18:1Δ9 fatty acid in TAG and other extraplastidic lipids, indicating a defect in exporting de novo synthesized fatty acids from the chloroplast rather than recycling of membrane lipid species (Li et al. 2012c). To analyze the fatty acid content of TAG specifically, instead of that of whole cells, lipids need to be extracted from cells and resolved, e.g. by TLC. The TAG fraction can then be transesterified and analyzed by GC-FID. In this way, the molar amount of TAG as well as the fatty acid profile of TAG can be determined. A major limitation of GC-FID is that it cannot identify the molecular species of TAG, because the origins of FAMEs are not traceable.

Mass Spectroscopy (MS) Can Be Used to Identify and Quantify TAG Species

MS has been increasingly utilized to quantify TAG content in Chlamydomonas (Liu et al. 2013; Terashima et al. 2014; Wang et al. 2009; Yoon et al. 2012; Vieler et al. 2007). Direct infusion MS minimizes oxidation of polyunsaturated fatty acid species (Danielewicz et al. 2011) while liquid chromatography (LC) prior to MS enables separation and identification of positional isomers of TAG (Kind et al. 2012; MacDougall et al. 2011). With ultrahigh performance liquid chromatography–tandem mass spectrometry (UHPLC–MS/MS), 140 molecular species of TAG were identified for Chlamydomonas. Quantification with this method yielded a consistently lower amount for TAG compared to GC-FID performed on the same samples, possibly due to the difference in ionization efficiency between different TAG species (Liu et al. 2013).

Microscopy Can Be Used to Characterize the Size and Quantity of Lipid Droplets

Hyper or hypo-accumulation of TAG is expected to affect the number or size of lipid droplets. Disruption of different pathways may affect one metric over the other, depending on whether the pathway mainly affects the initial generation or the enlargement of lipid droplets (Wilfling et al. 2013). These parameters can be measured by microscopy on cells stained with lipid sensitive dyes. In a lipid droplet morphology mutant screen performed on a Drosophila cell line, five categories of abnormal lipid droplet morphology were observed (Guo et al. 2008). Microscopy has not yet been used for large-scale primary screens in Chlamydomonas, but it has been employed to visualize the phenotypes of specific mutants. For example, the over-accumulation of TAG in the sta6 starchless mutant was initially observed as an increase in lipid droplet area by microscopy after Nile Red staining (Wang et al. 2009). Electron microscopy revealed a previously unknown association between lipid droplets and the chloroplast and allowed the discovery of large lipid droplets within the chloroplast in the sta6 mutant (Fan et al. 2011; Goodson et al. 2011).

Point Mutations and Insertional Mutations Can Be Used to Disrupt Genes

Point Mutations Can Be Generated by Irradiation or Chemical Mutagens

Point mutations can produce gain-of-function and temperature-sensitive mutants. In Chlamydomonas, point mutations are commonly induced using ultraviolet (UV) radiation or chemical mutagens such as ethyl methane sulfonate (EMS). A general concern with these two approaches is the extent of mutation, which is correlated with the percentage of cells killed during mutagenesis (the “kill rate”). A low kill rate generally means that a larger number of cells need to be mutagenized to cover the genome. A very high kill rate indicates a higher density of point mutations, which may complicate the identification of the mutation causing the phenotype, as many mutations will be genetically linked to the phenotype. EMS mutagenesis has been employed by two groups for the purpose of obtaining high-TAG strains (Xie et al. 2014; Lee et al. 2014).

Traditionally, point mutations are identified through map-based cloning, which requires multiple rounds of crossing and a high-density molecular marker system, which is under-developed in Chlamydomonas compared with the plant model A. thaliana (Jander et al. 2002; Rymarquis et al. 2005). In recent years, next-generation sequencing technology has greatly facilitated the mapping of point mutations. Tulin and Cross were able to map 52 mutants by crossing each mutant to wild-type and performing whole-genome sequencing on pools of 10–12 progeny that showed the mutant phenotype. Candidate mutations were identified based on the expectation that only the causative point mutation or tightly linked ones would be uniform in the mutant progeny population (Tulin and Cross 2014).

Insertional Mutagenesis Is Broadly Used in Chlamydomonas

Transformation of an insertion cassette containing a resistance marker into Chlamydomonas cells disrupts the genome at random sites (Tam and Lefebvre 1993; Zhang et al. 2014b). Thus, insertional mutagenesis has been extensively employed to understand many processes including TAG metabolism (Cagnon et al. 2013; Li et al. 2012c; Terashima et al. 2014; Tsai et al. 2014; Yan et al. 2013). General considerations regarding the insertion cassette sequence and transformation method are summarized below.

Selection Is Facilitated with Antibiotic Resistance Markers or Auxotrophy Markers

Commonly used antibiotic resistance markers for Chlamydomonas nuclear genome transformation include ble for resistance to bleomycin, aph7” for resistance to hygromycin B and aphVIII for resistance to paromomycin (Berthold et al. 2002; Sizova et al. 2001; Stevens et al. 1996). aph7” and aphVIII each encode an aminoglycoside phosphotransferase but both can be used in the same strain without interfering with each other (Berthold et al. 2002). Recently, the chloroplast marker AadA, which confers resistance to spectinomycin, has been recoded for the nuclear genome and can be used as a shuttle marker between E. coli and Chlamydomonas (Meslet-Cladiere and Vallon 2012). Other drug-selectable markers include mutant ALS, encoding a mutated form of acetolactate synthase that confers resistance to sulfometuron methyl (Kovar et al. 2002), and CRY1-1, encoding a mutant ribosomal protein that confers resistance to emetine (Nelson et al. 1994).

Besides drug resistance markers, auxotrophy markers are also available for auxotrophic strains. Introduction of the NIT1 gene (encoding the nitrate reductase) into a nit1 background strain enables it to grow on nitrate as the nitrogen source (Kindle et al. 1989); the ARG7 gene (encoding the argininosuccinate lyase) rescues the arginine auxotrophy of arg7 mutants (Debuchy et al. 1989); NIC7 (encoding a quinolinate synthetase) rescues the nicotinamide auxotrophy of nic7 mutants (Ferris 1995); and THI10 (encoding a hydroxyethylthiazole kinase) rescues thiamine auxotrophy of thi10 mutants (Ferris 1995).

Most drug resistance markers are several-fold smaller than NIT1 and ARG7 in size. This makes drug resistance markers easier to manipulate and potentially less vulnerable to endonuclease cleavage during transformation (as discussed later).

Expression of Selection Marker Genes Can Be Enhanced by Strong Promoters or Introns

A strong promoter is desired for abundant production of resistance proteins and optimal transformation efficiency. The Chlamydomonas PSAD promoter (Fischer and Rochaix 2001), RBCS2 promoter (Ribulose Bisphosphate Carboxylase/Oxygenase Small Subunit 2), β-2 tubulin promoter (Brunke et al. 1984; Kozminski et al. 1993) and a hybrid HSP70A-RBCS2 promoter (Schroda et al. 2000) drive high expression of nuclear genes and are extensively used to drive selection markers.

Besides a strong promoter , the presence of the first intron of RBCS2 upstream or downstream of the start codon was demonstrated to enhance the abundance of the marker protein and the transformation efficiency (Berthold et al. 2002; Lumbreras and Purton 1998).

The Extent of Gene Disruption Is Affected by the Gene Feature of the Insertion Site

Insertions into exons typically disrupt gene functions. However, in the absence of a transcriptional terminator, insertions into introns can be spliced out. Commonly, a gene with an insertion in an intron will produce a normal protein but at a lower abundance (Wang 2008).

DNA Can Be Delivered into the Chlamydomonas Genome by Biolistics, Glass Bead Transformation, Electroporation or Agrobacterium-Mediated Gene Transfer

Early transformations of the Chlamydomonas nuclear genome were mostly achieved using biolistic particle delivery . Auxotrophic mutants were successfully complemented with native Chlamydomonas genes using this method (Debuchy et al. 1989; Kindle et al. 1989). However, the transformation efficiency of the biolistic method is typically much lower than the methods developed later. Moreover, it requires a sophisticated particle bombardment system. Soon afterwards, it was observed that exogenous DNA could be delivered into Chlamydomonas genome by agitating the cell/DNA mixture with glass beads . This method does not require specialized equipment but only yielded ~1,000 colonies with 1 μg DNA used in a cell wall-deficient strain (Kindle 1990). A protocol based on electroporation was later developed, enabling transformation efficiencies of up to 100,000 colonies per μg DNA (Shimogawara et al. 1998). More recently, Agrobacterium tumefaciens , which has been primarily used for transferring DNA into dicotyledonous plants, was also shown to introduce DNA into the Chlamydomonas genome (Kumar et al. 2004).

The Number of Insertions Needed to Obtain the Desired Genome Coverage Depends on the Number of Insertions per Mutant

Before mutagenesis, it is important to decide how many mutants are needed to cover a certain fraction of the genome. Results from a simulation suggest that 120,000 insertions are required to cover 80 % of Chlamydomonas genes with one or more allele for each gene (Zhang et al. 2014b). The number of mutants required to cover a certain fraction of the genome can be reduced by increasing the number of insertions per mutant , which can be achieved using a higher amount of transforming DNA per cell (Shimogawara et al. 1998).

Insertion Sites Can Be Identified Using Plasmid Rescue or Various PCR-Based Methods

Several methods have been applied in Chlamydomonas for identification of insertion loci (Table 10.2). If the insertion cassette contains E. coli plasmid sequences, it can be retrieved by restriction digestion of the mutant genome, circularization and transformation back into E. coli (Tam and Lefebvre 1993). Sequencing of the Chlamydomonas genomic regions in the resulting plasmid can reveal the insertion site. This method does not involve PCR and can be used when flanking regions are hard to amplify. However, it requires transformation of an entire plasmid, including bacterial replication origin and bacterial selection marker in addition to the Chlamydomonas selection marker.

Table 10.2 Comparison of different strategies for insertion site identification in Chlamydomonas

Multiple methods developed later use nested PCR to amplify the junction between the insertion and the genome flanking sequence followed by sequencing of the PCR products. Examples include thermal asymmetric interlaced PCR (TAIL-PCR) (Dent et al. 2005), restriction enzyme site-directed amplification PCR (RESDA-PCR) (Gonzalez-Ballester et al. 2005a), and SiteFinding PCR (Li et al. 2012c).

In cases where the insertion cassette lacks a transcriptional terminator, a chimeric mRNA will be produced, containing both the sequence of the selection marker gene and the genomic sequence between the marker gene and the first native transcriptional terminator encountered in the genome. 3′ rapid amplification of cDNA ends (3′ RACE) can be applied to recover the sequence downstream of the marker gene (Meslet-Cladiere and Vallon 2012). Both 3′ RACE and nested PCR methods only require thermal cycling that can be easily performed on multiple microtiter tubes in parallel.

One challenge with 3′ RACE and nested PCR approaches is that the distance between the annealing location of the degenerate primer and the insertion junction varies between mutants. In some mutants, the distance may be too large for robust amplification. Additionally, differences in PCR efficiency between different sizes of DNA makes it difficult to adapt these methods to pools of mutants.

Recently, we developed a method for mapping insertion cassettes in pools of tens of thousands of Chlamydomonas mutants (Zhang et al. 2014b). This method, named Chlamydomonas MmeI-based insertion site sequencing (ChlaMmeSeq), relies on the presence of recognition sites for the type IIS restriction enzyme MmeI at the two ends of the cassette. To extract flanking sequences, the genome is digested with MmeI, which binds in the cassette and cleaves 20–21 base pairs into the flanking DNA. This produces a fragment containing 20–21 bases of flanking sequence attached to the cassette, onto which an adapter can be ligated to allow PCR amplification of the flanking sequence. When coupled with high-throughput sequencing, ChlaMmeSeq enables parallel retrieval of 20–21 bp of flanking sequences from more than 10,000 mutants. Approximately 70 % of short flanking sequences obtained can be aligned to a unique locus in the Chlamydomonas genome.

Insertion Sites Are Frequently Messy

The complex nature of many insertions can make it difficult to identify the insertion site and the gene whose disruption causes the observed phenotype. Insertions can contain large deletions, truncated cassettes, and insertions of genomic sequence between the cassette(s) and flanking genomic DNA (Fig. 10.3). In an ideal insertion site, the insertion cassette is inserted without any deletion in the cassette sequence or the genomic sequence at the insertion site (Fig. 10.3a). In this scenario, flanking sequences can be obtained from both sides of the inserted cassette, and a single gene will be directly disrupted by the insertion. However, insertion sites are rarely this clean.

Fig. 10.3
figure 3

Insertion sites are often complex in Chlamydomonas. Models of cassette insertions in genomic sequence are presented. (a) The cassette is inserted cleanly into the genome. (b) An insertion can be associated with a deletion of genomic DNA. (c) A cassette can be truncated on one or both ends. (d) Genomic DNA from a distant locus (“junk DNA”, shown in yellow) can be inserted next to the cassette. (e) In addition to the mutation caused by cassette insertion, mutations can be present at one or more other loci. Note that a mutant may harbor a combination of these scenarios

Some insertions include a large deletion of genomic DNA that eliminates multiple genes. In such cases, one common strategy to identify the gene whose disruption causes the observed phenotype is to attempt complementation of the mutant with each of the disrupted genes (Fig. 10.3b) (Matsuo et al. 2008; Tsai et al. 2014).

Regardless of the presence of a deletion of genomic sequence, one or both sides of the insertion cassette can be truncated (Fig. 10.3c) (Zhang et al. 2014b). Such truncations can lead to the failure of flanking sequence extraction protocols, due to the loss of the necessary sequences, e.g. primer binding sites or sequences required for plasmid propagation in bacteria.

In addition to possible genomic deletions and cassette truncations, genomic DNA from a distant locus (referred as “junk DNA” here; Fig. 10.3d) can sometimes be found inserted between the cassette and the genomic sequence from the disrupted locus (Zhang et al. 2014b). The presence of junk DNA can make it challenging to identify the insertion site, as the DNA sequence flanking the cassette maps to the genome but does not indicate the true insertion site.

Last but not least, even when the insertion site can be correctly identified, the phenotype of a mutant could be caused by unmarked mutations, such as point mutations that occur during the transformation process (Fig. 10.3e). Thus, the causative relationship between the mutation and the phenotype should always be verified as suggested in the next part.

A Mapped Insertional Mutant Library Will Be a Transformative Tool

Our lab is developing an indexed Chlamydomonas insertional mutant library in which the insertion site is known for each mutant. Mutants are maintained as colony arrays on solid medium, and also preserved cryogenically. This resource will provide mutants in many genes of interest for reverse genetic studies. Additionally, this library will also be usable for various forward genetic screens, eliminating the time-consuming steps of mutant generation and mapping.

One scheme for a genome-saturating TAG mutant screen using this library takes advantage of a feature that enables tracking of the abundance of individual mutants in pools. The mutants in this library are generated with a cassette containing MmeI recognition sites at both ends, allowing the retrieval of a 20–21 bp flanking sequence from each end. If thousands of mutants are pooled together, the flanking sequence of each mutant can serve as a molecular barcode that indicates the abundance of that mutant in the pool (Fig. 10.4). To identify genes with a role in TAG metabolism, mutants can be combined into a single pool, and mutants with unusually high or low TAG content can be enriched by staining with a TAG-binding dye and Fluorescence-Activated Cell Sorting (FACS) (Terashima et al. 2014). Mutants enriched in a high-TAG or low-TAG pool can be identified because their barcodes will be enriched in those pools. Since the physical coordinates of each mutant in the library are already known prior to the screen, mutants in hit genes of interest can then be individually cherry-picked for further characterization.

Fig. 10.4
figure 4

An indexed library of insertional mutants will facilitate genetic screens for TAG phenotypes. Application of ChlaMmeSeq to each mutant produces a unique 20–21 bp genomic flanking sequence. This unique flanking sequence can serve as a barcode for detecting the presence or quantifying the abundance of the mutant in a pool. For a TAG mutant screen, the colonies from the library can be combined, grown and nitrogen deprived as a pooled sample. Fluorescence activated cell sorting (FACS) can be employed to sort out high-TAG mutants and low-TAG mutants as two pools of hits (depicted as cells with large and small lipid droplets, respectively). ChlaMmeSeq can then be applied to the two pooled samples to obtain insertion flanking sequences/barcodes. Individual mutants for genes of interest can be cherry-picked from the library for further studies

The Causative Relationship between the Mutation and the Phenotype Needs to Be Confirmed by Mutant Complementation

In the prior part, we mentioned the possibility that some mutations can be challenging to locate, and the mutation identified may not always be the one causing the phenotype. Thus, experiments should always be performed to confirm the causative relationship between the mutation and the mutant phenotype.

The Mutation Locus in Insertional Mutants Should Be Characterized by PCR

For insertional mutants, as shown in Fig. 10.3d, the flanking sequence obtained may not reveal the actual insertion site. To gain more confidence on the flanking sequence, one can attempt to retrieve longer flanking sequence beyond a possible junk DNA region. The challenge here is that this method can never tell conclusively that one has read into the actual flanking genomic sequence. A more robust approach is to obtain flanking sequences for both sides of the insertion and see whether they align to the same locus. If they do, it is extremely likely that the insertion is truly at that site. Finally, the most conclusive evidence that a region is disrupted is the absence of a wild-type product when the region spanning the insertion site is amplified by PCR.

Multiple Alleles Increase Confidence in a Genotype-Phenotype Link

When multiple independent mutants in the same gene exhibit the same phenotype, the likelihood that mutation of this gene causes the phenotype is greatly increased. Indeed, this principle is used extensively in Arabidopsis research (Krysan et al. 1999).

Genetic Linkage Analysis Can Be Used to Rule out Non-causative Mutations

In genetic linkage analysis, a mutant is crossed to the wild-type and the phenotypes and genotypes of progeny are determined. A non-causative mutation will typically be found in both progeny with and without the phenotype. However, co-segregation of the phenotype with a mutation of interest does not conclusively establish that the mutation of interest causes the phenotype, as the mutation of interest could merely be at a chromosomal location near the causative mutation.

Genetic Complementation Is Critical for Establishing the Causative Relationship between the Mutation and the Phenotype

The strongest evidence that a recessive mutation in a specific gene causes the observed phenotype is the observation that expression of the wild-type gene complements the mutant phenotype. To establish complementation robustly, multiple independent transformants of the wild-type gene into the mutant should be analyzed.

Investigators should be cautious as second site mutations can give the false appearance of complementation. During transformation, suppressor or reversion mutations can arise and revert the phenotype, giving the impression that the mutant has been complemented. To safeguard against this possibility, complementation transformations should include a parallel control transformation with a construct that is not expected to complement. If this latter construct produces colonies that revert the phenotype, reversion mutations are arising, and the investigator should perform multiple independent transformations and test whether the construct of interest yields a significantly greater number of wild-type-like transformants than the control construct.

Concluding Remarks

Algae are a promising platform for producing TAG for biodiesel. Many challenges remain to be overcome before algal biofuels are profitable, including our limited understanding of genes involved in algal TAG metabolism (Chisti 2007). The green alga Chlamydomonas is a powerful model organism for this process. However, due to difficulties in identifying the mutation sites, only a handful of genes have been identified with a role in TAG metabolism. Future efforts should focus on improving the throughput and robustness of screens and the success rate of mutation mapping.

More broadly, TAG serves as a universal storage lipid in eukaryotic organisms (Farese and Walther 2009), while glycerolipids are essential membrane components of all cells. Research on plant and animal lipid metabolism provides important insights into human nutrition and health. Yet, the multicellular nature of land plants and animals limits the throughput of genetic screens. For Chlamydomonas, in contrast, emerging tools including the genome-wide mutant library will enable high-throughput genome-saturating screens. We anticipate that future screens will reveal many conserved aspects of lipid metabolism.