Synthetic biology of polyketide synthases

Yuzawa, Satoshi; Backman, Tyler W. H.; Keasling, Jay D.; Katz, Leonard

doi:10.1007/s10295-018-2021-9

Synthetic biology of polyketide synthases

Metabolic Engineering and Synthetic Biology - Original Paper
Published: 09 February 2018

Volume 45, pages 621–633, (2018)
Cite this article

Download PDF

Access provided by CONRICYT – Journals CONACYT

Journal of Industrial Microbiology & Biotechnology

Synthetic biology of polyketide synthases

Download PDF

Satoshi Yuzawa^1,2^na1,
Tyler W. H. Backman^1,2,3^na1,
Jay D. Keasling^1,2,3,4,5,6 &
…
Leonard Katz^2,3

2075 Accesses
26 Citations
18 Altmetric
2 Mentions
Explore all metrics

Abstract

Complex reduced polyketides represent the largest class of natural products that have applications in medicine, agriculture, and animal health. This structurally diverse class of compounds shares a common methodology of biosynthesis employing modular enzyme systems called polyketide synthases (PKSs). The modules are composed of enzymatic domains that share sequence and functional similarity across all known PKSs. We have used the nomenclature of synthetic biology to classify the enzymatic domains and modules as parts and devices, respectively, and have generated detailed lists of both. In addition, we describe the chassis (hosts) that are used to assemble, express, and engineer the parts and devices to produce polyketides. We describe a recently developed software tool to design PKS system and provide an example of its use. Finally, we provide perspectives of what needs to be accomplished to fully realize the potential that synthetic biology approaches bring to this class of molecules.

The Assembly Line Enzymology of Polyketide Biosynthesis

Recent Advances in Re-engineering Modular PKS and NRPS Assembly Lines

Article 23 December 2020

Insights into polyketide biosynthesis gained from repurposing antibiotic-producing polyketide synthases to produce fuels and chemicals

Article 01 June 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Synthetic biology is an emerging discipline that combines the fields of biology, biotechnology and engineering for the design and construction of new biological parts, devices, and systems, as well as the re-design of existing natural biological systems, all for useful purposes. To make the engineering of biology faster, more reliable, and more reproducible, synthetic biologists have developed a framework for engineering biology that is similar to other engineering disciplines, including standardized, off the shelf parts (e.g., genes) with standardized connections that could be used and reused in the construction of devices and larger systems. For example, a computer can be assembled from standardized parts and devices (sound cards, processors, operating systems, hard drives, etc.) available from many vendors; devices such as monitors, scanners, printers, etc., are connected to the computer through a standard port.

Synthetic biology has numerous applications: it can be used for the construction of proteins that can serve as new vaccines, or new biochemical pathways synthesizing small molecules that can serve as pharmaceuticals, chemicals (solvents, adhesives, food flavorings, cosmetics, etc.), or biofuels, plant and animal growth stimulators or insecticides. Additional applications include the generation of new biosensors for early detection of medical conditions, or to detect environmental pollution. The hosts used for production of new proteins, small molecules, or biosensors were designated as chassis. Synthetic biology also includes the direct engineering of chassis, as well as the technologies such as CRISPR/Cas9 to make chassis easier to engineer.

Reduced polyketides are a sub-class of natural products that have a variety of medical, veterinary, and agricultural applications. Examples, shown in Fig. 1, include the US Food and Drug Administration (FDA)-approved antibacterial agents erythromycin, fidaxomicin, rifamycin and tylosin; the antifungal agent amphotericin; the immunosuppressive agents rapamycin and tacrolimus; the cholesterol lowering agent lovastatin; the anthelmintic agent avermectin; the insecticide spinsosad; and monensin, a growth promotant for ruminants and coccidiostat in poultry. Curacin, discodermolide, epothilone, geldanamycin, and salinosporamide have anti-cancer properties and are in clinical trials. With the exception of lovastatin, which is produced by the fungus Aspergillus terreus, as well as a number of higher plants, all of the compounds shown in Fig. 1 are produced by bacteria, in particular members of the mycelia-forming family called actinomycetes that includes the most common genus Streptomyces, as well as the genera Dactylosporangium, and Saccharopolyspora, among others. Many members of the myxobacteria family also produce polyketides (e.g., epothilone, disorazole). Curacin is made in the cyanobacterium Lyngbya majuscula. Discodermolide was isolated from a sponge, but it is believed that it is produced by a symbiotic bacterium. Although these compounds do not resemble each other, and have different biological targets, they are all made by a class of enzymes known as modular polyketide synthases (PKSs), which produce the polyketide backbones. An example of a modular PKS is 6-deoxyerythronolide B synthase (DEBS), which produces 6-deoxyerythronolide B (6-dEB), the polyketide backbone of erythromycin (Fig. 2), which undergoes post-polyketide processing (glycosidation, hydroxylation, O-methylation) to generate the final product erythromycin. Some of the other compounds shown in Fig. 1 (e.g., chalcomycin, fidaxomicin, methymycin, and tylosin) undergo similar post-polyketide processing, whereas others, such as splenocin and geldanamycin undergo acylation or carbamoylation, respectively. Discodermolide and epothilone undergo little or no further elaboration. Some polyketides are connected to an amino acid or amino acid derivative (e.g., epothilone, rapamycin, tacrolimus, and lipomycin) via a linked non-ribosomal peptide synthetase (NRPS). The polyketide backbone of monensin reforms into a polyether, whereas part of the backbone of the reveromycin assumes a spiroketal configuration.

Although great structural diversity is observed for polyketides, they share, as outlined below, an overarching similarity in their biosynthesis. This has drawn the attention of a variety of scientists interested in understanding the fundamental basis of how these molecules are made and the common and differentiating features of their biosynthesis. In addition, their medical and industrial importance has drawn the interest of pharmaceutical and industrial companies who desire to apply genetic approaches to change the structures of polyketides to improve their properties or generate chemical diversity.

This paper is intended for people interested in synthetic biology who may not be familiar with polyketides and PKSs. It is, therefore, not intended to be comprehensive with respect to the detailed enzymology of PKS or present a complete review of the genetic engineering of PKS genes that has been reported over the past 27 years. For that, the reader is directed to a number of recent reviews [3, 4, 21, 38, 44]. Rather, we attempt to show here that nature has devised a near perfect synthetic biology system of composable parts and devices in PKSs which can be assigned specific nomenclature according to function. We believe that this nomenclature will be interesting to synthetic biologists, as well as people who work with polyketides and PKSs, and useful for those interested in recomposing parts and devices to create new polyketides. We present an overview of work done in this area and discuss what needs to be understood that will make PKSs easier to engineer.

Overview of modular PKS enzymology

As can be seen in Fig. 2, 6-dEB is built by the stepwise condensation of propionyl-CoA and 6 molecules of methylmalonyl-CoA in a manner similar to the biosynthesis of fatty acids, which are produced by fatty acid synthases (FASs). However, unlike FASs [5], which use a single set of enzymes or enzymatic domains for all elongation steps, each polyketide chain elongation step is carried out by a discrete set of catalytic domains, which is defined as an extension module. Other modular PKSs also use acyl-CoAs or carboxyacyl-CoAs as substrates. The number of modules in a given PKS determines the size of the resultant polyketide produced, and the specific carboxyacyl-CoAs employed, which are selected by the modules themselves, determine the overall structure. DEBS has 6 extension modules that use methylmalonyl-CoA, in addition to the N-terminal loading module and the C-terminal off-loading domain, which accounts for the 14-membered macrolactone ring and a methyl group at every other position in the ring. In contrast, the fidaxomicin PKS, which, like erythromycin, also starts its synthesis with propionyl-CoA, has 8 extension modules, and uses malonyl-CoA in the first, sixth, and seventh extensions, methylmalonyl-CoA, in the second, third, fifth, and eighth extensions, and ethylmalonyl-CoA in the fourth extension, yielding an 18-membered macrolactone (Fig. 1). Each module also controls the reduction state of the corresponding elongated polyketide intermediate by having a specific set of reductase domains, whereas FASs exclusively provide only the fully reduced forms of their products. In addition, modular PKSs can control the stereochemistry at each chiral center as described below. Hence, the modular nature of the PKSs not only accounts for the thousands of diverse natural polyketide structures, but also offers a wealth of engineering opportunities for the production of novel compounds.

Polyketide chain elongation is catalyzed by three different catalytic domains: a ketosynthase (KS) domain, an acyltransferase (AT) domain, and an acyl carrier protein (ACP) domain where the AT domain determines the specific carboxyacyl-CoA incorporated into the growing polyketide chain, and then transfers it to the ACP in the same module. The KS domain then catalyzes decarboxylative C–C bond formation between the growing polyketide chain and the extender unit on the ACP. Modular PKSs are categorized into cis- and trans-AT (or AT-less) modular PKS systems. In the trans-AT type, each module lacks an AT domain and instead receives its substrate, by one or more free-standing ATs. For example, disorazole (Fig. 1) is produced by a trans-AT PKS. Although most of the free-standing ATs thus far identified are specific for malonyl-, methylmalonyl-, or ethylmalonyl-CoA [17], over 20 different malonyl-CoA analogs have been found to be incorporated into naturally occurring polyketides by cis-AT PKSs [46]. These analogs include short to medium-chain alkylmalonyl-CoAs, halogenated malonyl-CoAs, and benzylmalonyl-CoA (Fig. 3). Examples of compounds that contain side-chains in the polyketide backbone that are larger than a simple methyl group in Fig. 1 include tylosin and monensin (ethyl), rapamycin (methoxy), tacrolimus (methoxy and allyl), reveromycin (butyl, pentyl, isopentyl, or hexyl), splenomycin (benzyl), and salinosporamide (chloro-, fluoro-, or bromoethyl), all of which are introduced as malonyl-CoA or malonyl-ACP analogs by their corresponding cis-AT domains in their respective PKSs. In addition, there are a number of AT domains that can employ more than a single derivative of malonyl-CoA as substrate for incorporation, such as the AT domain of module 4 (AT4) of the epothilone PKS, which can use either malonyl- or methylmalonyl-CoA, the AT5 domain of the monensin PKS, which incorporates either methylmalonyl- or ethylmalonyl-CoA, and the AT5 domain of the reveromycin PKS which can utilize four different alkylated malonyl-CoAs as substrates. The DEBS AT domains are known to incorporate only the (2S)-enantiomer of methylmalonyl-CoA [27] and the subsequent KS-catalyzed condensation reactions generate (2R)-2-methyl-3-ketoacyl-ACP intermediates, the opposite stereochemical orientation [45]. It is also thought, but not yet proven, that all AT domains that use alkylated malonyl-CoAs as substrates only use the corresponding 2S-enantiomers.

After each condensation step, the newly generated β-keto group is reduced by a ketoreductase (KR) domain if present in the module. DEBS contains a KR domain in modules 1,2,4,5, and 6, and an inactive (reductase incompetent) KR domain in module 3 (Fig. 2). Importantly, the stereochemistries of both β-hydroxyl- and α-alkyl side-chains in the nascent polyketide generated are controlled primarily by the individual KR domains of each PKS module [20]. KR domains are characterized based on their stereochemical outcomes. A-type and B-type KRs generate the l-β-hydroxyl and d-β-hydroxyl groups, respectively. Each KR type can also control stereochemical orientations of α-substituents. This is important in all modules any acyl-CoA other than malonyl-CoA is introduced. A1- or B1-type KRs produce the d-α-substituent, and A2- or B2-type KRs produce l-α-substituent. Finally, C1- or C2-type KRs are reductase-incompetent but retain epimerase activity. The d/l system is not perfect but preferred over the R/S system because the assignment of R or S depends on the substituents present.

The trans or cis-double bonds within polyketides are generated by a dehydratase (DH) domain after KR-catalyzed reductions. DEBS has a DH domain in module 4. The syn-coplanar water elimination reaction from a substrate possessing d-β-hydroxyl group generates a trans-double bond [43]. Thus, a trans-double bond typically results from a module that contains B-type KR (B type KRs but not B2). In a module that contains an A-type KR, a cis-double bond can be generated. If a module further contains an enoyl reductase (ER) domain (e.g., DEBS module 4), stereoselective reduction of a trans-double bond possessing a α-substituent may occur [25]. ER domains are also characterized based on their stereochemical outcomes. L-type ERs generate l-α-substituents, and D-type ERs generate d-α-substituents. Recently, cryptic methyl group epimerase function of DH domains has also been reported [49].

Both cis- and trans-AT PKS modules are known to harbor an integrated, S-adenosylmethionine-dependent C-methyltransferase (MT) domain, which installs a methyl group to the α-carbon of β-keto polyketide intermediates. For example, module 8 of the epothilone PKS (EPOS M8) carries a C-MT domain. Since the AT domain specifies methylmalonyl-CoA, the gem-dimethyl group seen in epothilone is the result of a single methylation [37]. On the other hand, the C-MT domain of the PKS module for yersiniabactin biosynthesis specifies dimethylation to produce the corresponding gem-dimethyl group [37]. Recently, C-MTs from trans-AT PKS systems have been shown to be stereospecific [50].

After the above-mentioned reactions, the elongated and reduced (if reduction domains are present) polyketide intermediate is eventually translocated to the KS domain of the next module and similar reactions are repeated until the chain is fully elongated. Finally, thioesterase (TE) domains release products by cyclization or hydrolysis, thus generating lactones, lactams, and carboxylic acids [9]. TE domains are usually embedded into the last module of the PKSs but polyether-producing PKSs such as the nanchangmycin and monensin PKSs employ free-standing TEs to cleave the final intermediates [34, 42]. A few other less common termination mechanisms are also known. A tandem sulfotransferase (ST)-TE didomain yield terminal olefins, as found in the curacin PKS [7]. A reductive (R) domain produces aldehydes or alcohols, which are found in the coelimycin and myxalamid PKSs [14, 40].

The polyketide chain initiation event is catalyzed by domains, commonly called the loading module, preceding the first extension module. The most common loading module consists of a condensation-incompetent KS domain (e.g., KS^Q, a Cys to Glu variant of a KS domain), an AT domain, and an ACP domain [31]. Here, the AT domain is specific for malonyl-CoA or methylmalonyl-CoA and generates acetyl or propionyl starter units, respectively, after a decarboxylation reaction by the KS^Q domain. On the other hand, loading modules consisting of AT-ACP didomains tends to show broader substrate specificity (the AT lacks the conserved arginine residue, which stabilizes the acid moiety of malonyl- and methylmalonyl-CoA substrates). For example, as can been seen in the avermectin compounds shown in Fig. 1, the avermectin PKS can initiate polyketide biosynthesis from either isobutyryl-CoA or 2-methylbutyryl-CoA [18]. This PKS is also known to incorporate a huge variety of synthetic starters that can be exogenously fed to a variant of the producing organism that lacks the ability to make the natural starter units [10]. Similarly, the lipomycin PKS has also been demonstrated to have broad substrate specificity [52, 53]. On the other hand, the borrelidin PKS loading module was shown to be stereoselective and incorporate a carboxyacyl-CoA [15]. Another type of loading didomain is composed of a CoA-ligase (CoL) and an ACP domain, which usually activates a shikimic acid-derived cyclocarboxylic acid in an ATP-dependent manner and loads it onto the ACP. Some PKSs contain an additional domain between CoL and ACP domains, as found in the rapamycin PKS [39].

Synthetic biology of PKS: parts, devices, chassis

In synthetic biology jargon, the simplest unit of function is a part. For modular PKSs, the catalytic domains ACP, AT, DH, ER, KR, KS, C-MT, and TE, could be considered parts. A representative list of common PKS parts is shown in Table 1. AT parts are distinguishable on the basis of the substrate(s) that they utilize, e.g., malonyl-CoA, methylmalonyl-CoA, propionyl-CoA, etc. KR parts are distinguished on the basis of the stereochemistry generated at both the alkyl side chain (methyl, ethyl, etc.) at the α position, and the OH group at the β position. KR parts with stereochemistry unknown to the authors at the time of writing are represented by the code KRu. Other parts are represented in Table 1 by a single code including ER (the current version of our software cannot distinguish different ER types), but it is possible that different KS domains will be found to have different preferences for various acyl-ACPs. Similarly, subtypes of the other domains may also emerge.

Table 1 List of parts (domains) found in natural type I PKS systems and codes employed

Full size table

A device consists of two or more parts, that has a function beyond that of the part. A simple device would be a gene, its promoter and a terminator, which can produce the corresponding protein. A more complicated device would be an operon consisting of several genes, along with a promoter and a terminator. In the context of PKS synthetic biology, PKS modules are considered devices. The simplest device in modular PKSs would consist of three parts: KS, AT and ACP; a complex device would contain KS, AT, DH, ER, KR, and ACP. A full PKS, which produces a functional polyketide and composed of an assemblage of devices would be considered a system. Without reference to PKS systems, the term module has also been used to describe a reusable device, built from simpler parts. In the discussion of PKSs, we will use the terms module and device interchangeably. Table 2 shows a list of 56 devices that are components of known PKSs, along with their cognate parts. Devices are grouped by presence of reductive domains with DH-ER-KR devices designated δ, DH-KR devices designated γ, KR devices designated β, and other devices designated α. The large number of devices is based on the variability of the AT and KR parts that comprise them. Figure 4 shows the structures of the growing or starting acyl chains attached to their cognate ACPs imparted by 25 devices selected from Table 2. It can be seen that devices α4 and α14 yield the structure, propionyl-ACP. In device α4, the ATp part (domain) directly loads propionyl-CoA to the ACP, whereas in device α14, the ATmm domain loads methylmalonyl-CoA onto the ACP and the KSQ domain subsequently decarboxylates it to leave propionyl-CoA.

Table 2 Representative list of devices (modules) found in natural PKS systems and codes employed

Full size table

Table 3 lists the PKS devices used for the production of 15 of the polyketides shown in Fig. 1, excluding terminal domains. The smallest and largest PKSs shown encode the backbones for methymycin and amphotericin B and contain 6 and 19 devices, respectively. Many of the devices are common to two or more PKSs. Device γ3 is used in nine of the 12 PKSs shown, and is tandemly repeated in several PKSs including four times in lipomycin and six times in amphotericin. It is important to note that the PKS devices do not each have a single, unique sequence either at the protein or DNA level, although there is a high degree of similarity among the various representatives of each device. PKS devices are determined purely by function, namely the structure imparted onto the α and β carbons in the growing acyl chain during polyketide synthesis. The repetitive use of functionally identical modules to generate the large variety of polyketides produced in nature highlights them as natural reusable devices.

Table 3 List of devices employed in order in natural PKS systems that construct the polyketide backbone of compounds indicated

Full size table

As mentioned above, many polyketides are generated in members of the genus Streptomyces. Streptomyces strains are likely the most promising chassis for forward engineering because they generally encode a substrate promiscuous phosphopantetheinyl transferase (PPTase) that is necessary to functionalize heterologous PKS genes expressed in the host. PPTases are absent in the model microbial hosts Escherichia coli and Saccharomyces cerevisiae widely used in synthetic biology. Streptomyces hosts are also known to produce diverse acyl-CoAs and carboxyacyl-CoAs such as isobutyryl-CoA and methylmalonyl-CoA that are not naturally produced in E. coli and S. cerevisiae. Recently, approximately 200 native or synthetic promoters and ~ 200 ribosomal binding sites were characterized in Streptomyces venezuelae, and some were also validated in two other Streptomyces strains; their promoter strengths were highly correlated in the three chassis [1]. CRISPR/Cas9 systems have been developed for genome editing of several different Streptomyces strains [26]. These systems enabled direct selection (via survival) for double homologous recombination, and were shown to shorten the time and effort required for genome engineering over the methodologies formerly used for gene replacement. For large gene insertions (> 5 kb), a number of actinophage integrase-based site-specific gene integration systems have been developed and used for heterologous polyketide production in Streptomyces [2, 36]. Many replicating plasmids, as well as a temperature-sensitive plasmid have been developed into vectors, and methods to move DNA from E. coli to a variety of Streptomyces strains have also been developed [23].

Engineering PKSs

Synthetic biologists use the term refactoring to describe the engineering of genes or operons. The modularity of PKSs and the similarities of the devices that compose them render them as excellent targets for refactoring (at the DNA level) to produce novel compounds with predicted structures. Over the past 27 years, much work in this area has been undertaken, mainly employing cis-AT PKS systems. Most efforts have been directed at changing the parts within modules (AT or reduction domain swapping), deleting modules or combining heterologous PKS modules to form novel polyketides. It should be noted that, beyond the development of the technology, or the seeking of greater understanding of a specific PKS-mediated biochemical step, most of the reported efforts have been directed at producing novel antibiotics or other polyketides of medical interest. In these cases, it generally would not have been possible to predict a priori which novel molecules would have the desired properties, hence a combinatorial approach, where many compounds could be generated simultaneously, would be preferred. Because of simple, as well as sophisticated assays for activities that have been developed for polyketides of interest to various pharmaceutical companies, the emphasis has been on generating large numbers of novel compounds of diverse structure, where low-level production of is usually sufficient for assessment of potential value. Examples include the production of derivatives of erythromycin, rapamycin, and geldanamycin, among others [28, 35, 47]. Unfortunately, the molecular bases of why some engineered PKSs worked better than others in these in vivo experiments were generally not explored.

A less common approach to refactoring modular PKSs, and one in which our laboratory is heavily engaged, is to produce biofuels and other industrially important chemicals as drop-in replacements for currently used petroleum-based compounds. In these cases, only a single, or small number of novel compounds are targeted for production. Examples include adipic acid [16] and short-chain ketones such as MEK (2-butanone), and MIBK (4-methyl-2-pentanone) [51], which are used as solvents. In general, the monetary values of the molecules targeted are low, hence to be competitive with the same compounds resourced from petroleum, considerations of product titers, rates of production, and yields (relative molar amounts of product to carbon used in fermentation) are very important. Here, the emphasis is on production levels. Success heavily depends upon precise PKS engineering, employing a chassis capable of relatively fast growth rates and reaching high densities on inexpensive media, as well as possessing all the biochemical precursor pathways to support functional expression of heterologous PKS systems.

ClusterCAD

We have developed the online database ClusterCAD which facilitates the selection of natural cis-AT PKS parts to design a novel chimeric PKS that produces a desired small molecule [11]. ClusterCAD will identify a natural truncated PKS with a chemical intermediate most similar to a desired target, and can then identify potential donor parts for subsequent domain swaps, which are necessary to produce a desired target compound. Currently, ClusterCAD currently contains 72 PKS clusters. Notably, among those absent are the tacrolimus and salinosporamide PKS clusters.

Here we demonstrate the utility of ClusterCAD to guide the selection of PKS parts to produce the novel polyketide target shown in Fig. 5a. First, we use the ClusterCAD chemical structure search tool to identify a natural PKS as a starting point for engineering. One of the top hits, or most chemically similar natural PKSs in ClusterCAD to this target molecule is narbonolide, the pikromycin precursor produced by the PKS. The ClusterCAD chemical structure search tool produces a visual MCS plot (Maximum Common Substructure) which highlights the shared substructure between the query and each hit, as shown in Fig. 5b. As this plot visually highlights the similar regions between both structures, it is straightforward to identify the following sequence of engineering modifications to the pikromycin PKS, which would result in the desired product:

(1) swap the module 0 (load module) acyltransferase (AT) domain for an AT which naturally incorporates an isobutyryl-CoA substrate; (2) swap the module 1 acyltransferase (AT) domain for an AT which naturally incorporates an ethylmalonyl-CoA substrate; (3) disable the module 5 ketoreductase (KR); (4) insert a type A1 ketoreductase (KR) in module 6; (5) swap the module 6 acyltransferase (AT) domain for an AT which naturally incorporates an allylmalonyl-CoA substrate.

After decomposing the difference between the natural PKS and the target compound into a sequence of engineering modifications as shown above, the ClusterCAD sequence search tool can identify suggested donor parts for each domain swap, based on amino acid sequence similarity. For domain swaps, this is done by searching the sequence of each domain for the most similar domain with a desired annotation, whereas for domain insertions you can search the entire ‘acceptor’ polypeptide for the most sequence similar polypeptide which includes the desired new domain annotations. This produces the following list of top match donor parts for the above modifications, in order: (1) Divergolide module 6 acyltransferase (AT); (2) Angolamycin module 5 acyltransferase (AT); (3) Tirandamycin module 1 ketoreductase (KR); (4) pikromycin module 5 ketoreductase (KR); (5) None, ClusterCAD does not yet contain any PKS clusters which incorporate allylmalonyl-CoA (or ACP). The AT domain of module 4 of the tacrolimus PKS specifies allylmalonyl-CoA or allylmalonyl-ACP [19, 32].

In summary, this example demonstrates the overall process of using ClusterCAD to design a chimeric PKS for a novel chemical target, by first identifying a chemically similar natural product, and then identifying a set of engineering modifications and donor parts which could, in principle yield the desired product. In practice, the rules which govern domain swap compatibility, and chimeric junction loci remain poorly understood, so it is necessary to test a large number of donor parts and junctions to yield an active enzyme which produces the desired product at detectable levels.

Conclusions

Theoretically, it should be possible to design a polyketide of desired structure, write out the list of parts and devices required from Tables 1 and 2, respectively (to resemble the strategies shown in Table 3 for known PKS systems), employ ClusterCAD or similar software to design the devices, synthesize the DNA of the required sequence, introduce the DNA into a desired chassis, and express it under a selected means of regulation. Except for the production of small molecules that are generated from three or fewer modules, or the limited engineering of one or two modules within the framework of a large PKS, this has not yet been reported. Whereas significant progress in our ability to productively alter the substrate specificities and reductive outcomes generated from within a module has taken place over the last few years [4], the ability to link modules in sequence to generate completely new polyketide structures without losing biochemical activity is still extremely challenging.

The first issue is the strategy to be employed to covalently link the modules so that there is efficient chain transfer. Small polyketides can be made by covalently assembling two PKS modules into a single polypeptide. It may be possible to assemble a larger set of modules into a single polypeptide (e.g., nysC has 6 modules in a single polypeptide [6]). This approach would obviate the need for docking domains to enable correct module–module interactions and a few successful examples have been reported in two module systems [13, 48]. Since the polypeptide chains of modular PKS systems assemble into homodimers [41], it is likely that the modules assembled in the long polypeptide will require placement of dimerization elements in appropriate positions if they are not already present [12, 54]. An empirical approach appears to be the only way to develop this strategy and should be undertaken.

Second, if construction of new PKSs requires the assembly of modules into three or more polypeptides, non-covalent intermodular interactions through docking domains may be necessary. However, previous efforts showed that more than 90% of the hybrid PKSs connected by docking domains showed < 10% activities compared to the wild-type counterparts in this approach [24, 29, 30]. It is likely that success here will require greater knowledge of the detailed structure of not only the docking domains but also of the modules themselves, and perhaps entire PKS systems. Controlling non-covalent intermodular interactions gives the greatest opportunity for rapidly generating large numbers of novel polyketides, and remains the only means for accomplishing this goal combinatorially, which would be a major achievement in the application of synthetic biology.

Finally, several other issues also need to be addressed to generate a working strategy for de novo production of novel polyketides. The first is chain passage of the growing polyketide chain from the ACP of one module to the correct KS domain immediately downstream. The accepting KS must make proper contact with the donating upstream ACP, and must be able to hold the incoming substrate in its active site to allow thiotransfer to its active site cysteine. The KS must catalyze chain elongation reactions at rates faster than its overall kinetics (or lose biochemical activity). Many KS domains are known to be somewhat substrate selective for both chain length and stereochemistry. It may be possible to replace substrate-selective KS domains in each module with those that are more substrate flexible (e.g., mycolactone KSs [33]), but that means non-cognate intramodular KS–ACP interactions for the required chain extensions. Because decreased k_cat/K_m values were previously observed in chain elongation with heterologous KS–ACP interfaces [8] and chain extension reaction is most likely the rate-limiting step in DEBS [22], it is very challenging to solve this issue.

References

Bai C, Zhang Y, Zhao X, Hu Y, Xiang S, Miao J, Lou C, Zhang L (2015) Exploiting a precise design of universal synthetic modular regulatory elements to unlock the microbial natural products in Streptomyces. Proc Natl Acad Sci USA 112:12181–12186. https://doi.org/10.1073/pnas.1511027112
Article PubMed CAS Google Scholar
Baltz RH (2010) Streptomyces and Saccharopolyspora hosts for heterologous expression of secondary metabolite gene clusters. J Ind Microbiol Biotechnol 37:759–772. https://doi.org/10.1007/s10295-010-0730-9
Article PubMed CAS Google Scholar
Barajas JF, Blake-Hedges JM, Bailey CB, Curran S, Keasling JD (2017) Engineered polyketides: synergy between protein and host level engineering. Synth Syst Biotechnol 2:147–166. https://doi.org/10.1016/j.synbio.2017.08.005
Article PubMed PubMed Central Google Scholar
Bayly CL, Yadav VG (2017) Towards precision engineering of canonical polyketide synthase domains: recent advances and future prospects. Molecules. https://doi.org/10.3390/molecules22020235
Article PubMed Google Scholar
Beld J, Lee DJ, Burkart MD (2015) Fatty acid biosynthesis revisited: structure elucidation and metabolic engineering. Mol BioSyst 11:38–59. https://doi.org/10.1039/c4mb00443d
Article PubMed CAS Google Scholar
Brautaset T, Sekurova ON, Sletta H, Ellingsen TE, StrLm AR, Valla S, Zotchev SB (2000) Biosynthesis of the polyene antifungal antibiotic nystatin in Streptomyces noursei ATCC 11455: analysis of the gene cluster and deduction of the biosynthetic pathway. Chem Biol 7:395–403
Article PubMed CAS Google Scholar
Chang Z, Sitachitta N, Rossi JV, Roberts MA, Flatt PM, Jia J, Sherman DH, Gerwick WH (2004) Biosynthetic pathway and gene cluster analysis of curacin A, an antitubulin natural product from the tropical marine cyanobacterium Lyngbya majuscula. J Nat Prod 67:1356–1367. https://doi.org/10.1021/np0499261
Article PubMed CAS Google Scholar
Chen AY, Schnarr NA, Kim CY, Cane DE, Khosla C (2006) Extender unit and acyl carrier protein specificity of ketosynthase domains of the 6-deoxyerythronolide B synthase. J Am Chem Soc 128:3067–3074. https://doi.org/10.1021/ja058093d
Article PubMed PubMed Central CAS Google Scholar
Du L, Lou L (2010) PKS and NRPS release mechanisms. Nat Prod Rep 27:255–278. https://doi.org/10.1039/b912037h
Article PubMed CAS Google Scholar
Dutton CJ, Gibson SP, Goudie AC, Holdom KS, Pacey MS, Ruddock JC, Bu’Lock JD, Richards MK (1991) Novel avermectins produced by mutational biosynthesis. J Antibiot (Tokyo) 44:357–365
Article CAS Google Scholar
Eng CH, Backman TWH, Bailey CB, Magnan C, Garcia Martin H, Katz L, Baldi P, Keasling JD (2017) ClusterCAD: a computational platform for type I modular polyketide synthase design. Nucleic Acids Res. https://doi.org/10.1093/nar/gkx893
Article PubMed Central Google Scholar
Eng CH, Yuzawa S, Wang G, Baidoo EE, Katz L, Keasling JD (2016) Alteration of polyketide stereochemistry from anti to syn by a ketoreductase domain exchange in a Type I modular polyketide synthase subunit. Biochemistry 55:1677–1680. https://doi.org/10.1021/acs.biochem.6b00129
Article PubMed CAS Google Scholar
Gokhale RS (1999) Dissecting and exploiting intermodular communication in polyketide synthases. Science 284:482–485. https://doi.org/10.1126/science.284.5413.482
Article PubMed CAS Google Scholar
Gomez-Escribano JP, Song L, Fox DJ, Yeo V, Bibb MJ, Challis GL (2012) Structure and biosynthesis of the unusual polyketide alkaloid coelimycin P1, a metabolic product of the cpk gene cluster of Streptomyces coelicolor M145. Chem Sci 3:2716. https://doi.org/10.1039/c2sc20410j
Article CAS Google Scholar
Hagen A, Poust S, de Rond T, Yuzawa S, Katz L, Adams PD, Petzold CJ, Keasling JD (2014) In vitro analysis of carboxyacyl substrate tolerance in the loading and first extension modules of borrelidin polyketide synthase. Biochemistry 53:5975–5977. https://doi.org/10.1021/bi500951c
Article PubMed CAS Google Scholar
Hagen A, Poust S, Rond T, Fortman JL, Katz L, Petzold CJ, Keasling JD (2016) Engineering a polyketide synthase for in vitro production of adipic acid. ACS Synth Biol 5:21–27. https://doi.org/10.1021/acssynbio.5b00153
Article PubMed CAS Google Scholar
Helfrich EJ, Piel J (2016) Biosynthesis of polyketides by trans-AT polyketide synthases. Nat Prod Rep 33:231–316. https://doi.org/10.1039/c5np00125k
Article PubMed CAS Google Scholar
Ikeda H, Nonomiya T, Usami M, Ohta T, Omura S (1999) Organization of the biosynthetic gene cluster for the polyketide anthelmintic macrolide avermectin in Streptomyces avermitilis. Proc Natl Acad Sci USA 96:9509–9514
Article PubMed CAS Google Scholar
Jiang H, Wang YY, Guo YY, Shen JJ, Zhang XS, Luo HD, Ren NN, Jiang XH, Li YQ (2015) An acyltransferase domain of FK506 polyketide synthase recognizing both an acyl carrier protein and coenzyme A as acyl donors to transfer allylmalonyl and ethylmalonyl units. FEBS J 282:2527–2539. https://doi.org/10.1111/febs.13296
Article PubMed CAS Google Scholar
Keatinge-Clay AT (2016) Stereocontrol within polyketide assembly lines. Nat Prod Rep 33:141–149. https://doi.org/10.1039/c5np00092k
Article PubMed PubMed Central CAS Google Scholar
Keatinge-Clay AT (2017) The uncommon enzymology of cis-acyltransferase assembly lines. Chem Rev 117:5334–5366. https://doi.org/10.1021/acs.chemrev.6b00683
Article PubMed CAS Google Scholar
Khosla C, Tang Y, Chen AY, Schnarr NA, Cane DE (2007) Structure and mechanism of the 6-deoxyerythronolide B synthase. Annu Rev Biochem 76:195–221. https://doi.org/10.1146/annurev.biochem.76.053105.093515
Article PubMed CAS Google Scholar
Kieser T, Bibb MJ, Buttner MJ, Chater KF, Hopwood DA (2000) Practical Streptomyces genetics. The John Innes Foundation, Norwich, UK
Google Scholar
Klaus M, Ostrowski MP, Austerjost J, Robbins T, Lowry B, Cane DE, Khosla C (2016) Protein–protein interactions, not substrate recognition, dominate the turnover of chimeric assembly line polyketide synthases. J Biol Chem 291:16404–16415. https://doi.org/10.1074/jbc.M116.730531
Article PubMed PubMed Central CAS Google Scholar
Kwan DH, Leadlay PF (2010) Mutagenesis of a modular polyketide synthase enoylreductase domain reveals insights into catalysis and stereospecificity. ACS Chem Biol 5:829–838. https://doi.org/10.1021/cb100175a
Article PubMed CAS Google Scholar
Liu Z, Liang Y, Ang EL, Zhao H (2017) A new era of genome integration-simply cut and paste! ACS Synth Biol 6:601–609. https://doi.org/10.1021/acssynbio.6b00331
Article PubMed CAS Google Scholar
Marsden AFA, Caffrey P, Aparicio JF, Loughran MS, Staunton J, Leadlay PF (1994) Stereospecific acyl transfers on the erythromycin-producing polyketide synthase. Science 263:378–380. https://doi.org/10.1126/science.8278811
Article PubMed CAS Google Scholar
McDaniel R, Thamchaipenet A, Gustafsson C, Fu H, Betlach M, Betlach M, Ashley G (1999) Multiple genetic modifications of the erythromycin polyketide synthase to produce a library of novel ‘‘unnatural’’ natural products. Proc Natl Acad Sci USA 96:1846–1851
Article PubMed CAS Google Scholar
Menzella HG, Carney JR, Santi DV (2007) Rational design and assembly of synthetic trimodular polyketide synthases. Chem Biol 14:143–151. https://doi.org/10.1016/j.chembiol.2006.12.002
Article PubMed CAS Google Scholar
Menzella HG, Reid R, Carney JR, Chandran SS, Reisinger SJ, Patel KG, Hopwood DA, Santi DV (2005) Combinatorial polyketide biosynthesis by de novo design and rearrangement of modular polyketide synthase genes. Nat Biotechnol 23:1171–1176. https://doi.org/10.1038/nbt1128
Article PubMed CAS Google Scholar
Moore BS, Hertweck C (2002) Biosynthesis and attachment of novel bacterial polyketide synthase starter units. Nat Prod Rep 19:70–99. https://doi.org/10.1039/b003939j
Article PubMed CAS Google Scholar
Motamedi H, Shafiee A (1998) The biosynthetic gene cluster for the macrolactone ring of the immunosuppressant FK506. Eur J Biochem 256:528–534
Article PubMed CAS Google Scholar
Murphy AC, Hong H, Vance S, Broadhurst RW, Leadlay PF (2016) Broadening substrate specificity of a chain-extending ketosynthase through a single active-site mutation. Chem Commun 52:8373–8376. https://doi.org/10.1039/c6cc03501a
Article CAS Google Scholar
Oliynyk M, Stark CBW, Bhatt A, Jones MA, Hughes-Thomas ZA, Wilkinson C, Oliynyk Z, Demydchuk Y, Staunton J, Leadlay PF (2003) Analysis of the biosynthetic gene cluster for the polyether antibiotic monensin in Streptomyces cinnamonensis and evidence for the role of monB and monC genes in oxidative cyclization. Mol Microbiol 49:1179–1190. https://doi.org/10.1046/j.1365-2958.2003.03571.x
Article PubMed CAS Google Scholar
Patel K, Piagentini M, Rascher A, Tian ZQ, Buchanan GO, Regentin R, Hu Z, Hutchinson CR, McDaniel R (2004) Engineered biosynthesis of geldanamycin analogs for Hsp90 inhibition. Chem Biol 11:1625–1633. https://doi.org/10.1016/j.chembiol.2004.09.012
Article PubMed CAS Google Scholar
Phelan RM, Sachs D, Petkiewicz SJ, Barajas JF, Blake-Hedges JM, Thompson MG, Reider Apel A, Rasor BJ, Katz L, Keasling JD (2017) Development of next generation synthetic biology tools for use in Streptomyces venezuelae. ACS Synth Biol 6:159–166. https://doi.org/10.1021/acssynbio.6b00202
Article PubMed CAS Google Scholar
Poust S, Phelan RM, Deng K, Katz L, Petzold CJ, Keasling JD (2015) Divergent mechanistic routes for the formation of gem-dimethyl groups in the biosynthesis of complex polyketides. Angew Chem Int Ed Engl 54:2370–2373. https://doi.org/10.1002/anie.201410124
Article PubMed CAS Google Scholar
Robbins T, Liu YC, Cane DE, Khosla C (2016) Structure and mechanism of assembly line polyketide synthases. Curr Opin Struct Biol 41:10–18. https://doi.org/10.1016/j.sbi.2016.05.009
Article PubMed PubMed Central CAS Google Scholar
Schwecke T, Aparicio JF, Molnar I, Konig A, Khaw LE, Haydock SF, Oliynyk M, Caffrey P, Cortes J, Lester JB et al (1995) The biosynthetic gene cluster for the polyketide immunosuppressant rapamycin. Proc Natl Acad Sci USA 92:7839–7843
Article PubMed CAS Google Scholar
Silakowski B, Nordsiek G, Kunze B, Blocker H, Muller R (2001) Novel features in a combined polyketide synthase/non-ribosomal peptide synthetase: the myxalamid biosynthetic gene cluster of the myxobacterium Stigmatella aurantiaca Sga15. Chem Biol 8:59–69. https://doi.org/10.1016/S1074-5521(00)00056-9
Article PubMed CAS Google Scholar
Staunton J, Caffrey P, Aparicio JF, Roberts GA, Bethell SS, Leadlay PF (1996) Evidence for a double-helical structure for modular polyketide synthases. Nat Struct Biol 3:188–192
Article PubMed CAS Google Scholar
Sun Y, Zhou X, Dong H, Tu G, Wang M, Wang B, Deng Z (2003) A complete gene cluster from Streptomyces nanchangensis NS3226 encoding biosynthesis of the polyether ionophore nanchangmycin. Chem Biol 10:431–441. https://doi.org/10.1016/s1074-5521(03)00092-9
Article PubMed CAS Google Scholar
Valenzano CR, You YO, Garg A, Keatinge-Clay A, Khosla C, Cane DE (2010) Stereospecificity of the dehydratase domain of the erythromycin polyketide synthase. J Am Chem Soc 132:14697–14699. https://doi.org/10.1021/ja107344h
Article PubMed PubMed Central CAS Google Scholar
Weissman KJ (2016) Genetic engineering of modular PKSs: from combinatorial biosynthesis to synthetic biology. Nat Prod Rep 33:203–230. https://doi.org/10.1039/c5np00109a
Article PubMed CAS Google Scholar
Weissman KJ, Timoney M, Bycroft M, Grice P, Hanefeld U, Staunton J, Leadlay PF (1997) The molecular basis of Celmer’s rules: the stereochemistry of the condensation step in chain extension on the erythromycin polyketide synthase. Biochemistry 36:13849–13855. https://doi.org/10.1021/bi971566b
Article PubMed CAS Google Scholar
Wilson MC, Moore BS (2012) Beyond ethylmalonyl-CoA: the functional role of crotonyl-CoA carboxylase/reductase homologs in expanding polyketide diversity. Nat Prod Rep 29:72–86. https://doi.org/10.1039/c1np00082a
Article PubMed CAS Google Scholar
Wlodek A, Kendrew SG, Coates NJ, Hold A, Pogwizd J, Rudder S, Sheehan LS, Higginbotham SJ, Stanley-Smith AE, Warneck T, Nur EAM, Radzom M, Martin CJ, Overvoorde L, Samborskyy M, Alt S, Heine D, Carter GT, Graziani EI, Koehn FE, McDonald L, Alanine A, Rodriguez Sarmiento RM, Chao SK, Ratni H, Steward L, Norville IH, Sarkar-Tyson M, Moss SJ, Leadlay PF, Wilkinson B, Gregory MA (2017) Diversity oriented biosynthesis via accelerated evolution of modular gene clusters. Nat Commun 8:1206. https://doi.org/10.1038/s41467-017-01344-3
Article PubMed PubMed Central CAS Google Scholar
Wu N, Tsuji SY, Cane DE, Khosla C (2001) Assessing the balance between protein–protein interactions and enzyme-substrate interactions in the channeling of intermediates between polyketide synthase modules. J Am Chem Soc 123:6465–6474
Article PubMed CAS Google Scholar
Xie X, Garg A, Khosla C, Cane DE (2017) Elucidation of the cryptic methyl group epimerase activity of dehydratase domains from modular polyketide synthases using a tandem modules epimerase assay. J Am Chem Soc 139:9507–9510. https://doi.org/10.1021/jacs.7b05502
Article PubMed CAS PubMed Central Google Scholar
Xie X, Khosla C, Cane DE (2017) Elucidation of the stereospecificity of C-methyltransferases from trans-AT polyketide synthases. J Am Chem Soc 139:6102–6105. https://doi.org/10.1021/jacs.7b02911
Article PubMed PubMed Central CAS Google Scholar
Yuzawa S, Deng K, Wang G, Baidoo EE, Northen TR, Adams PD, Katz L, Keasling JD (2017) Comprehensive in vitro analysis of acyltransferase domain exchanges in modular polyketide synthases and its application for short-chain ketone production. ACS Synth Biol 6:139–147. https://doi.org/10.1021/acssynbio.6b00176
Article PubMed CAS Google Scholar
Yuzawa S, Eng CH, Katz L, Keasling JD (2013) Broad substrate specificity of the loading didomain of the lipomycin polyketide synthase. Biochemistry 52:3791–3793. https://doi.org/10.1021/bi400520t
Article PubMed CAS Google Scholar
Yuzawa S, Eng CH, Katz L, Keasling JD (2014) Enzyme analysis of the polyketide synthase leads to the discovery of a novel analog of the antibiotic alpha-lipomycin. J Antibiot (Tokyo) 67:199–201. https://doi.org/10.1038/ja.2013.110
Article CAS Google Scholar
Zheng J, Fage CD, Demeler B, Hoffman DW, Keatinge-Clay AT (2013) The missing linker: a dimerization motif located within polyketide synthase modules. ACS Chem Biol 8:1263–1270. https://doi.org/10.1021/cb400047s
Article PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

This work was supported by the Joint BioEnergy Institute, which is funded by the US Department of Energy (DOE), the Office of Science, Office of Biological and Environmental Research under Contract No. DE-AC02-05CH11231 between DOE and Lawrence Berkeley National Laboratory. The publisher, by accepting the article for publication, acknowledges that the US government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of the manuscript, or allow others to do so, for US government purpose. The authors declare no competing financial interests.

Author information

Satoshi Yuzawa and Tyler W. H. Backman contributed equally.

Authors and Affiliations

Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
Satoshi Yuzawa, Tyler W. H. Backman & Jay D. Keasling
Joint BioEnergy Institute, Emeryville, CA, 94608, USA
Satoshi Yuzawa, Tyler W. H. Backman, Jay D. Keasling & Leonard Katz
QB3 Institute, University of California, Berkeley, CA, 94720, USA
Tyler W. H. Backman, Jay D. Keasling & Leonard Katz
Department of Bioengineering, University of California, Berkeley, CA, 94720, USA
Jay D. Keasling
Department of Chemical and Biomolecular Engineering, University of California, Berkeley, CA, 94720, USA
Jay D. Keasling
Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Hørsholm, Denmark
Jay D. Keasling

Authors

Satoshi Yuzawa
View author publications
You can also search for this author in PubMed Google Scholar
Tyler W. H. Backman
View author publications
You can also search for this author in PubMed Google Scholar
Jay D. Keasling
View author publications
You can also search for this author in PubMed Google Scholar
Leonard Katz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Leonard Katz.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yuzawa, S., Backman, T.W.H., Keasling, J.D. et al. Synthetic biology of polyketide synthases. J Ind Microbiol Biotechnol 45, 621–633 (2018). https://doi.org/10.1007/s10295-018-2021-9

Download citation

Received: 08 December 2017
Accepted: 03 February 2018
Published: 09 February 2018
Issue Date: July 2018
DOI: https://doi.org/10.1007/s10295-018-2021-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Synthetic biology of polyketide synthases

Abstract

Similar content being viewed by others

The Assembly Line Enzymology of Polyketide Biosynthesis

Recent Advances in Re-engineering Modular PKS and NRPS Assembly Lines

Insights into polyketide biosynthesis gained from repurposing antibiotic-producing polyketide synthases to produce fuels and chemicals

Introduction

Overview of modular PKS enzymology

Synthetic biology of PKS: parts, devices, chassis

Engineering PKSs

ClusterCAD

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Synthetic biology of polyketide synthases

Abstract

Similar content being viewed by others

The Assembly Line Enzymology of Polyketide Biosynthesis

Recent Advances in Re-engineering Modular PKS and NRPS Assembly Lines

Insights into polyketide biosynthesis gained from repurposing antibiotic-producing polyketide synthases to produce fuels and chemicals

Introduction

Overview of modular PKS enzymology

Synthetic biology of PKS: parts, devices, chassis

Engineering PKSs

ClusterCAD

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation