Introduction

Secondary metabolites of microbial, plant, animal and human origin play essential roles in the maintenance of human health and treatment of human disease (Newman et al. 2000). Many drugs being used in modern medicine, such as vinblastine (Madagascar periwinkle), digitalis (purple foxglove) and codeine (opium poppy), are derived from herbal remedies and used for diverse purposes in the treatment of cancer, heart diseases and pain. Natural products play a pivotal role as sources of drug lead compounds. Their chemical diversity and complexity provide structural scaffolds for small molecule drugs, serving as an inspiration for medicinal design. But on the other hand, chemical complexity of natural products also represents a major obstacle in the production of these pharmaceuticals on an industrial scale. Chemical synthesis of natural products is often difficult and expensive, and isolation from their natural resources is also typically low yielding.

The advent of molecular biology has offered a new platform for the research and development of natural products. Genetic manipulation of host strains such as E. coli or S. cerevisiae (Khosla and Keasling 2003) and heterologous expression allows for economical production of natural products produced in trace quantities in source organisms. Protein engineering and directed evolution can also be utilized to improve natural product yields. In addition, these techniques make it possible to manipulate biosynthetic pathways completely, and genes responsible for biosynthetic steps from a number of source organisms can be combined and modified to produce desired natural products or natural products with modified chemical structures that are rarely or not found in nature. These novel products with unique chemical properties may have improved efficacy or can be used for completely new purposes. In addition, emerging technologies such as metagenomics provide new approaches to accessing genes responsible for desired biosynthetic steps and potentialities for improving production of known natural products or isolation of useful novel compounds.

At present, microorganisms like bacteria, yeasts (Porro et al. 2005), especially Pichia (Gasser et al. 2006) and Kluyveromyces (Rodriguez et al. 2006), as well as cultured cells from higher organisms (such as mammalians, insects, or plants) represent the most frequently used hosts for the production of heterologous and homologous proteins. Microorganisms (prokaryotic as well as eukaryotic) are advantageous hosts because of high growth rates and easy of genetic manipulation.

Good examples of this are: (1) the use of a few selected microorganisms to produce a wide range of different enzymes (the Danish company Novozymes has expressed a large number of different enzymes in the filamentous fungus Aspergillus oryzae); (2) the use of the penicillin-producing fungus Penicillium chrysogenum by the Dutch company, DSM, for the production of adipoyl-7-aminodeacetoxycephalosporanic acid (adipoyl-7-ADCA) (Robin et al. 2001), a precursor for the production of semi-synthetic cephalosporins, and (3) the production of chemical 1,3-propanediol by the American company, Dupont, by a recombinant Escherichia coli, an organism that is already used for the production of many other chemicals, such as phenylalanine (Maury et al. 2005).

Among the microbial and eukaryotic host systems, yeasts combine the advantages of unicellular organisms (i.e., easy for genetic manipulation and growth) with the capability of protein processing typical for eukaryotic organisms (i.e., protein folding, assembly and post-translational modifications), together with the absence of endotoxins as well as oncogenic or viral DNA. The choice of the yeast host is of paramount importance for the success of the whole process. On this topic, some key reviews have been published (Gellissen and Hollenberg 1997; Sudbery 1996; Gellissen 2000; Dominguez et al. 1998; Cereghino and Cregg 2000; Giga-Hama and Kumagai 1999). Usually yeast hosts are divided in two main categories: (1) conventional and nonconventional or (2) Crabtree-positive and Crabtree-negative yeast hosts. In this respect, S. cerevisiae is the only conventional yeast and one of the few Crabtree positive (i.e. producing ethanol under aerobic conditions), together with Zygosaccharomyces rouxii and Z. bailii. As for other yeasts, S. cerevisiae is a reflection of the familiarity of molecular biologists with this yeast, combined with the deep knowledge about its genetics, biochemistry, physiology, and fermentation technologies. Furthermore, S. cerevisiae is recognized by the US Food and Drug Administration (FDA) as an organism generally regarded as safe (GRAS) (Chemler et al. 2006). Another important reason for the applicability of S. cerevisiae within the field of biotechnology is its susceptibility to genetic modifications by recombinant DNA technology, which has been even further facilitated by the availability of the complete genome sequence of S. cerevisiae, published in 1996 (Goffeau et al. 1996). The whole-genomes of S. cerevisiae have been public and annotated in the Saccharomyces Genome Database (http://www.yeastgenome.org/VL-yeast.html). A remarkable aspect of this wealth of information is that the number of genes that are expected to be involved in secondary metabolite production dramatically outnumbers the number of known secondary metabolites. So we can apply mutagenesis and recombinant DNA technology to increase the production of microbial products and generate new compounds. A genome-scale metabolic model serves such a purpose, providing a general description of the current state of knowledge regarding the genetics and biochemistry of metabolism in an organism (Murray 1991). Therefore, this wealth of information can be expected to make this species serve as a useful model for studying secondary metabolism.

In this perspective, we discuss ongoing research aimed toward the production of specific molecules that are used as pharmaceuticals in genetically engineered Saccharomyces cerevisiae.

Flavonoids and other phenylpropanoid-derived natural products

According to Metcalf (1987), between 50,000 and 100,000 plant secondary metabolite products may exist, and approximately 20% of the carbon fixed by photosynthesis is believed to be channeled into the phenylpropanoid pathway, generating the majority of the phenolic compounds found in nature, including flavonoids and stilbenoids (Ralston et al. 2005). Amongst these compounds, flavonoids are of exceptional interest and represent one of the most diverse and largest groups of plant natural products. Much of the increasing interest in flavonoids is owed to the beneficial properties they possess with respect to human health, as reviewed recently by Dixon (2004). The benefits of specific flavonoids and other phenylpropanoid-derived compounds to human health and their potential benefits for long-term health have only been recognized more recently. However, the productivity of flavonoids and stilbenoids is limited by the low concentrations and the low growth rates of plants. An obvious approach toward this end is to exploit the structural genes encoding enzymes of the general phenylpropanoid and particularly flavonoid and stilbenoid metabolism, whose coordinate expression leads to flavonoid or stilbenoid production as is listed in Table 1.

Table 1 Some flavonoids and other phenylpropanoid-derived natural products produced in yeast

Ro and Douglas (2004) were the first to connect the two initial enzymes involved in the phenylpropanoid pathway, namely phenylalanine ammonia lyase (PAL) and cinnamate 4-hydroxylase (C4H) in S. cerevisiae together with a cytochrome P450 reductase (CPR). They evaluated metabolic flux into the phenylpropanoid pathway from phenylalanine to p-coumaric acid (Fig. 1). Later, Jiang et al. (2005) and Yan et al. (2007) reported biosynthesis of flavanones, the common precursors of the vast majority of flavonoids, in S. cerevisiae. Jiang et al. (2005) demonstrated the production of monohydroxylated naringenin and unhydroxylated pinocembrin at levels of 7 and 0.8 mg/l, respectively (Fig. 2). Yan et al. (2007) reported a four-step metabolic circuit that was constructed in S. cerevisiae with plant genes from heterologous origins: 4-coumarate:coenzyme A ligase (4CL) from Petroselinum crispum, chalcone synthases (CHS) from Medicago sativa and Petunia × hybrida and chalcone reductase (CHR) and chalcone isomerase (CHI) from M. sativa. Expression of the recombinant pathway in S. cerevisiae resulted in the accumulation of both 5-hydroxyflavanone and 5-deoxyflavanone. Other phenylpropanoid acid precursors, such as cinnamic acid and caffeic acid, could also be metabolized through the recombinant pathway, yielding corresponding 5-deoxyflavanone compounds.

Fig. 1
figure 1

Phenylalanine metabolism in yeast triple-expressing strains (transformed by genes coding for PAL, C4H, and CPR), showing reconstruction of the entry point into phenylpropanoid metabolism in this host. Endogenous yeast reactions involved in Phe (phenylalanine) catabolism are shaded in gray. The new pathway generated by introduction of PAL, C4H, and CPR genes is boxed in black, and endogenous pathways that metabolize cinnamate and p-coumarate are circled. ADH, alcohol dehydrogenase; DC, decarboxylase; PAD, phenylacrylic acid decarboxylase; TA, transaminase. Width of arrows is proportional to carbon flux, and potentially reversible reactions are indicated by double arrows. Question marks indicate potential but uncharacterized reactions or products

Fig. 2
figure 2

Proposed reactions catalyzed by S. cerevisiae AH22 coexpressing genes coding for PAL, 4CL, and CHS. PAL (TAL), phenylalanine (tyrosine) ammonialyase; 4CL, 4-coumarate: CoA ligase; CHS, chalcone synthase; CHI, chalcone isomerase; Padp1, phenylacrylic acid decarboxylase. Overexpressed proteins are shown in bold. The biosynthesis of naringenin in plants is shown in the box

Recently, Ralston et al. (2005) reported the partial reconstruction of isoflavonoid biosynthesis in S. cerevisiae by using different types of CHI and an isoflavone synthase (IFS) from soybean (Glycine max).

Becker et al. (2003) succeeded in enhancing the production of stilbene resveratrol by making use of specifically transformed yeast strains. In this work, grape must was fermented with a yeast strain harboring two plasmids, one carrying a 4CL gene from poplar and the other carrying resveratrol synthase (RS) gene from grape. Transformed yeast strains were capable of converting p-coumaric acid from the medium into resveratrol glycosides. However, resveratrol glycosides were not synthesized at high concentrations (0.5–1.5 μg/l culture), where glycosides were synthesized instead of aglycons, and resveratrol was not released into the medium. Although this strategy needs to be optimized, it may provide a method for enhancing resveratrol content in fermented beverages (Pretorius and Bauer 2002; Vivier and Pretorius 2002).

Isoprenoids (terpenoid, terpene)

Isoprenoids with more than 40,000 described compounds (Withers and Keasling 2007) are the largest and most structurally diverse group of plant metabolites. Isoprenoids play various biological roles in plants. Depending on the number of isoprene units, isoprenoids can be classified into several groups, such as monoterpenes, sesquiterpenes and diterpenes (respectively two, three and four C5 units). It is among these secondary metabolites that many effective and promising pharmaceuticals, such as Taxol, vinblastine, artemisinin and prostratin, have been discovered and isolated.

Much work has focused on the introduction of exogenous isoprenoid metabolism into yeast listed in Table 2, such as the precursors to artemisinin (Ro et al. 2006) and Taxol (DeJong et al. 2006).

Table 2 Some terpenoid products produced in S. cerevisiae

Geranyl pyrophosphate (GPP) is the universal precursor of monoterpenoids. In yeast it occurs exclusively as an intermediate of farnesyl pyrophosphate (FPP) synthesis. Oswald et al. (2007) overexpressed linalool synthase (LIS) from Clarkia breweri and geraniol synthase (GES) from Ocimum basilicum (sweet basil) in the wild-type (WT) and mutant strains to evaluate monoterpenol production levels in strains with normal or defective FPP biosynthesis for the objective of specific monoterpenol production–geraniol and linalool in S. cerevisiae (Fig. 3).

Fig. 3
figure 3

Biosynthesis pathway of geraniol and linalool from primary precursors dimethylallyl pyrophosphate (DMAPP), isopentenyl pyrophosphate (IPP) and farnesyl pyrophosphate (FPP). The indicated enzymes are: (1) farnesyl pyrophosphate synthase (FPPS); (2) geraniol synthase (GES); (3) linalool synthase (LIS); (4) geranyl pyrophosphate synthase (GPPS)

Takahashi et al. (2007) engineered yeast for sesquiterpene accumulation by introducing genetic modifications that enable the yeast to accumulate high levels of the key intermediate farnesyl pyrophosphate (FPP). Coexpression of terpene synthase genes diverted the enlarged FPP pool to greater than 80 mg of sesquiterpene/l; and finally the functional hydroxylation of a sesquiterpene scaffold by coexpression of a cognate cytochrome P450 hydroxylase and cytochrome P450 reductase, yielding 50 mg/l each of hydrocarbon and hydroxylated products (Fig. 4).

Fig. 4
figure 4

The mevalonate (MVA) pathway and those steps engineered in the current study for novel sesquiterpene production in yeast are boxed. The MVA pathway is localized to the cytoplasm in eukaryotic cells and supports the biosynthesis of numerous terpene macromolecules. In Saccharomyces cerevisiae, ergosterol is the dominant terpene derived from FPP, a highly regulated 15-carbon intermediate in the MVA pathway (Gardner and Hampton 1999). Various combinations of mutations in genes coding for a phosphatase (DPP1) and squalene synthase (ERG9), along with selection for aerobic uptake of exogenous ergosterol (sue) and engineering of a soluble, unregulated form of 3-hydroxy-3-methylglutaryl CoA reductase (HMGR) should yield yeast strains capable of accumulating excess levels of FPP. The potential to divert the excess FPP to novel sesquiterpene hydrocarbon biosynthesis was assessed by introducing key branch point enzymes (terpene synthases), and the potential to further modify these hydrocarbon skeletons was determined by cotransformation with a gene encoding for a cytochrome P450 hydroxylating enzyme (terpene hydroxylase)

Meanwhile, Asadollahi et al. (2007) chose S. cerevisiae as a microbial host for heterologous biosynthesis of three different plant sesquiterpenes: valencene, cubebol and patchoulol.

More interestingly, Ro et al. (2006) cloned a gene from A. annua that codes for a cytochrome P450. When this gene was expressed in yeast, amorpha-4, 11-diene was converted to artemisinic acid, which can be converted to artemisinin in a simple synthetic process (Fig. 5). Later, for high-level production of amorphadiene, a sesquiterpene precursor to the anti-malarial drug artemisinin was synthesized by the cyclization of FPP, Shiba et al. (2007) engineered the pyruvate dehydrogenase by passage in S. cerevisiae. Overproduction of acetaldehyde dehydrogenase and introduction of a Salmonella enterica acetyl-CoA synthetase variant increased the carbon flux into the mevalonate pathway, resulting in increased amorphadiene production.

Fig. 5
figure 5

Schematic representation of the engineered artemisinic acid biosynthetic pathway in S. cerevisiae strain EPY224 expressing CYP71AV1 and CPR in the box. Genes from the mevalonate pathway in S. cerevisiae that are directly upregulated are shown in blue (Chemler et al. 2006); those that are indirectly upregulated by upc2-1 expression are in purple; and the red line denotes repression of ERG9 in strain EPY224. The pathway intermediates IPP, DMAPP and GPP are defined as isopentenyl pyrophosphate, dimethyl allyl pyrophosphate and geranyl pyrophosphate, respectively. Green arrows indicate the biochemical pathway leading from farnesyl pyrophosphate (FPP) to artemisinic acid, which was introduced into S. cerevisiae from A. annua. The three oxidation steps converting amorphadiene to artemisinic acid by CYP71AV1 and CPR are shown

Recently, a partial taxol biosynthetic pathway was constructed in S. cerevisiae by expressing five sequential pathway steps leading from primary isoprenoid metabolism to intermediate taxadien-5α-acetoxy-10β-ol (DeJong et al. 2006). The Taxus cuspidate genes that the yeast host expressed included geranylgeranyl disphosphate synthase (GGPPS), taxadiene synthase (TS), cytochrome P450 taxadiene 5α-hydroxylase (TYH5a), taxadienol 5α-O-acetyl transferase (TAT) and taxoid 10β-hydroxylase (THY10b). The recombinant strain produced taxadiene at 1 mg/l while taxadien-5α-ol was produced in very small amounts (∼25 μg/l). These results suggest that the first two enzymes (GGPPS and TS) cooperated well with each other, and that the metabolic flux was reduced at the 5α-hydroxylation step, which is catalyzed by cytochrome P450 hydroxylase (DeJong et al. 2006). It is anticipated that overexpressing Taxus P450oxygenases with their corresponding P450 reductases in the yeast host would improve the overall production amounts (Jennewein et al. 2005).

Polyketides

Polyketides are a diverse group of natural products with significance in human and veterinary medicine. In their work, Mutka et al. (2006) describe the introduction into S. cerevisiae of pathways for the production of methylmalonyl-coenzyme A (CoA), a precursor for complex polyketides, by both propionyl-CoA-dependent and propionyl-CoA-independent routes. For propionyl-CoA-dependent methylmalonyl-CoA production, they expressed Salmonella typhimurium propionyl-CoA synthetase (PrpE) to catalyze the production of propionyl-CoA from propionate(included in the production medium) and endogenous ATP (Horswill and Escalante-Semerena 1999), and the propionyl-CoA carboxylase (PCC) pathway from Streptomyces coelicolor consisting of the transcarboxylase subunit, PccB, and the biotin carrier protein/biotin carboxylase subunit, AccA (Rodriguez and Gramajo 1999). Furthermore, they demonstrated the possibility of using methylmalonyl-CoA produced in the engineered yeast strains for the in vivo production of a polyketide product, triketide lactone (Fig. 6).

Fig. 6
figure 6

Pathways for the biosynthesis of propionyl-coenzyme A (CoA) and (2S)-methylmalonyl-CoA. Designations: PrpE: propionyl-CoA synthetase; PCC: propionyl-CoA carboxylase; PccB: encoding the transcarboxylase subunit of PCC complex; AccA: encoding the biotin carrier protein/biotin carboxylase subunit of PCC complex (Rodriguez and Gramajo 1999); MatB: malonyl/methylmalonyl-CoA ligase

Vitamin C

Most eukaryotic organisms produce l-ascorbic acid (l-AA or vitamin C), a water-soluble powerful antioxidant asscavenger of ROS (Levine 1986; Padh 1990, 1991) to prevent or at least alleviate deleterious effects caused by ROS. It is common knowledge that production of ROS is common in many types of cancer cells, and that vitamin C has a positive effect in reducing the occurrence of stomach, lung and colorectal cancer (Knekt et al. 1991; Valko et al. 2006). On the other hand, vitamin C can increase the generation of ROS under certain circumstances and could have detrimental effects at least under certain circumstances (Naidu 2003; O’Brien et al. 2003; Trommer et al. 2002). However, yeast cells naturally lack the ability to produce l-AA. Instead, erythro-ascorbic acid, a structurally related compound with chemical properties similar to those of l-AA, occurs at a low concentration in yeast cells (Huh et al. 1998). Its role in stress resistance has been established but to what extend it is important is still a question for scientific debate (Huh et al. 1998, 2001; Spickett et al. 2000). Animals and plants employ two different metabolic pathways to synthesize l-ascorbic acid (Wheeler et al. 1998; Banhegyi et al. 1997). Branduardi et al. (2007) reported for the first time the biosynthesis of vitamin C by recombinant S. cerevisiae cells starting from d-glucose utilizing a plant pathway (shown in Fig. 7). Accumulation of ascorbic acid was proven to be successful in two different strains, which were examined in parallel to elucidate the effect of a different genetic background.

Fig. 7
figure 7

Ascorbic acid biosynthetic pathway. Schematic representation of the pathway of l-AA production from d-glucose in plants. The following enzymes are involved: (A) hexokinase (2.7.1.1), (B) glucose-6-phosphate isomerase (5.3.1.9), (C) mannose-6-phosphate isomerase (5.3.1.8), (D) phosphomannomutase (5.4.2.8), (E) mannose-1-phosphate guanylyltransferase (2.7.7.22), (F) GDP-mannose-3,5-epimerase (5.1.3.18), (G) GDP-l-galactose phosphorylase (E.C.C not assigned), (H) l-galactose 1-phosphate phosphatase (3.1.3.25), (I) l-galactose dehydrogenase, (J) l-galactono-1,4-lactone dehydrogenase (1.3.2.3)

How many more secondary metabolites are produced by S. cerevisiae?

The present review shows that S. cerevisiae can produce a large number of secondary metabolites, including representatives of many of the principal classes of secondary metabolites found in nature. In addition, a variety of structures are often present from within a single class, a feature typical of secondary metabolism.

How many more secondary metabolites can be produced by S. cerevisiae? The progress in recent research suggests that many more new compounds would be produced in the next few years.

First of all, more biosynthetic pathways or secondary metabolism can be elucidated and more effective targets for genetic manipulation can be identified. Secondly, the spectrum of heterologous production is limited to a shortage of genes involved in secondary metabolic biosynthesis. There is a lack of information about enzymes involved in the biosynthesis of secondary metabolites, and mechanisms underlying the immense complex regulatory network of pathways have not been completely clarified. Improvements in gene synthesis technology will facilitate the construction of increasingly large sets of genes that comprise natural and artificial biosynthetic pathways, and less costly DNA sequencing technologies will drive the expansion of sequencing projects to identify biosynthetic genes for uncharacterized small molecules. Thirdly, a large number of transcription factors involved in secondary metabolic biosynthesis are better known, and the most recent biotechnological advances using genes encoding these factors have been reported. Further, considering the network of the biosynthetic pathway of plant secondary metabolites, the same metabolite can be a member of several different pathways and may also have regulatory effects on multiple biological processes. We can use combinatorial techniques to optimize the pathways, change the expression levels of an enzyme, engineer the specificity, alter post-translational regulation, or alter the metabolic flux through a particular biosynthetic pathway.

Conclusions and future directions

We herein introduce some recent developments of natural products in S. cerevisiae, and describe some emerging genetic manipulations to enrich the chemical diversity in nature. S. cerevisiae has enormous potential for the production of low and high molecular weight compounds as a heterologous host by expressing biosynthetic enzymes or even pathways.

The production of natural products in S. cerevisiae would probably provide cost-effective sources of drugs. Many top-selling drugs are natural products, accounting for approximately 40% of the top twenty drugs. It is anticipated that natural products will provide an increasing number of new drugs in the future, while engineering of biosynthetic enzymes will give access to a vast variety of “unnatural natural products” with improved properties or new biological activities. Developments in bioinformatics have speeded up the process of gene cloning and transformation. Besides, a number of powerful analytical techniques have been employed in metabolite analysis and analyses of cellular properties, which include gas chromatography and mass spectrometry(GC-MS), nuclear magnetic resonance (NMR), and high performance liquid chromatography fingerprint (HPLC fingerprint) (Nielsen 2001). Some of these different tools are reviewed below.

A key point in the study of heterologous production is the analysis at the cellular level to understand the metabolic function of biosynthetic enzymes in details, and this involves several different techniques (Nielsen 1998), including DNA chip and analytical techniques like GC-MS or liquid chromatography coupled with tandem mass spectrometry (LC-MS-MS). With enzyme assays, many different metabolites can be measured in complex matrices and with high sensitivity, as is illustrated in analysis of glycolytic intermediates of S. cerevisiae (Theobald et al. 1993, 1997; Vaseghi et al. 1999). The results may suggest the changes in flux distribution through different branches of the metabolic network. The methods and models used to calculate intracellular fluxes can now be directed toward determining how to manipulate cells to achieve the desired phenotype. After measuring the fluxes through the metabolic network, it is necessary to identify the pathways and enzymes that will most drastically improve the phenotype. Metabolic control analysis (MCA) provides a framework to help understand how flux control is distributed in a bioreaction network. Finding enzyme (gene) targets having the greatest influence on a product rate can be difficult because a rate-limiting step is often not found in biological networks. Instead, the limitations are spread over many enzymes in the network. The flux control coefficient (FCC) of an enzyme is defined as the relative effect of modulating the amount of an enzyme on the flux through the desired pathway. Genome-scale metabolic models for S. cerevisiae (Forster et al. 2003) have been developed in recent years and can be used in directed manipulation of the cellular network to predict changes that are required in the genotype of microorganisms in order to obtain efficient microbial strains (Patil et al. 2004).

These advances will improve our ability to control the shape and topology of secondary metabolites, allowing us to create new conformers that interact specifically with biological targets.

Future advances in directed biosynthesis would be driven by a mixture of technology and creativity. For example, the most interesting enzymes of secondary metabolism are expressed only during differentiation, and can be used in mixed chemical-enzymatic (transformation) production processes after overexpression in a microbial host, as is the case with 11β-hydroxylation. While metabolic engineering to increase the availability of FPP and diversion of this pool of intermediate to sesquiterpene hydrocarbon production is a critical and the first step in establishing any terpene production platform. Introduction of downstream biosynthetic modifications to the hydrocarbon scaffolds is of equal importance because of the structural complexity and biological activity afforded by these modifications (Takahashi et al. 2005). Many of these secondary modifications arise from the regio- and stereochemicalin sertion of hydroxyl groups into the hydrocarbon scaffolds, which then serve as sites for tertiary modifications like methylation, acetylation, aryl-group addition, and the formation of epoxides and lactones. Hence, efficient coupling of terpene hydroxylation reactions with hydrocarbon production is the first essential step for morecomplete recapitulation of the entire terpene biosynthetic repertoire.