Introduction

Steroidal saponins are a class of oligosaccharides derived from spirostanes, which are formed by the condensation of steroidal sapogenin and sugars. They are mainly produced in Dioscoreaceae, Liliaceae, Smilaxaceae, Solanaceae, Dracaenaceae, Agaveaceae, etc. (Sidana et al. 2016; Sparg et al. 2004; Tian et al. 2017), and play a defensive role in plants (González-Lamothe et al. 2009; Osbourn 1996). A small amount of steroidal saponins are also found in certain marine animals and microorganisms (Iorrizzi et al. 2001; Tang et al. 2020; Tian et al. 2017).

The main use of steroidal saponins is as precursors of steroid drugs, which are second most in demanded in the global market after antibiotics (Fernández-Cabezón et al. 2018). Steroidal saponins have extensive pharmacological effects. Di’ao Xinxuekang capsule (steroidal saponins of Dioscorea panthaica or Dioscorea nipponica as the main components) has significant effects on cardiovascular and cerebrovascular diseases and was the first botanical drug approved for marketing in Europe (Dutch Medicines Evaluation Board 2012; Yu et al. 2014). Drugs with similar pharmacological effects include dioscornin tablets and Xinnaoshutong capsule, etc.. Steroidal saponins are also widely used to treat acute and chronic inflammation (Wang et al. 2015; Yang et al. 2017a, b; Yuan et al. 2016b; Zhu et al. 2017) and have therapeutic effects on a variety of high-incidence cancers and clinically difficult cancers (de Oliveira et al. 2019; Lu et al. 2018; Zhao et al. 2018; Zhu et al. 2018b), in addition to being used to solve problems stemming from multidrug resistance and poor prognosis that are difficult to overcome in cancer treatment (Song et al. 2019; Yuan et al. 2016). In addition, steroidal saponins can enhance the immune system, affect oxidative stress, and regulate lipid metabolism. Since saponins have foaming and emulsifying properties, they can be used as emulsifiers, foaming agents and preservatives in the food, cosmetic and pharmaceutical industries (Moses et al. 2014). Yucca extract is the main commercial source of steroidal saponins used as surfactants. It has been approved by the FDA as GRAS (generally recognized as safe) and has received widespread attention in various industries (Güçlü-Ustündağ and Mazza 2007; Tenon et al. 2017).

With the severe destruction of the natural environment, the natural materials from which steroidal saponins originate have been irreversibly decreased in number (Li et al. 2015; Shen et al. 2018). For example, Paris polyphylla, a famous natural medicine rich in steroidal saponins, has been listed as a vulnerable species by the IUCN (Chauhan 2020). Currently, the annual harvest of this natural medicine is becoming increasingly insufficient to meet the rising demand in the market (Cunningham et al. 2018). In addition, the extraction of steroidal saponins from plants also faces problems such as difficulty in separation and low abundance. For example, the highly anticipated OSW-1 possesses better anticancer effects than paclitaxel and camptothecin; however, it has the drawback of having high acquisition costs and limited supplies (Mimaki et al. 1997). To alleviate the dependence on plants for production of steroidal saponins and obtain the target products in a fast, effective and inexpensive way, it is necessary to develop in vitro technology to expand the sources of steroidal saponins.

In vitro plant culture is an important way for the production of biological active compounds (Espinosa-Leal et al. 2018). Currently, some biotechnological methods of plant cell and tissue culture, root culture and somatic embryogenesis to generate steroidal saponins have been investigated (Basu and Jha 2013; Chauhan et al. 2018; Nazir et al. 2021; Nikam et al. 2009; Puente-Garza et al. 2021; Rizvi et al. 2010). Using Agrobacterium tumefaciens-mediated transformation technology, a robust and reproducible method for transforming fenugreek hairy roots was established, and a promoter suitable for strong expression of transgenes was identified (Garagounis et al. 2020). Through the CRISPR/Cas system, a fast and effective new method for targeted gene editing of Dioscorea zingiberensis has been created (Feng et al. 2018; Zhu et al. 2009). However, these technologies are still at the laboratory level, and their transfer into industrial production will be difficult. In recent years, the heterogeneous expression of natural product biosynthetic pathways has become an important focus of bioactive compound production. Because the genetic manipulation of plant cells is very difficult, constructing engineered strains seems to be the most effective and economical way to produce natural medicines. A variety of engineered strains have been used to achieve in vitro production of natural and nonnatural active ingredients (Bai et al. 2016; Dai et al. 2014; Galanie et al. 2015; Nakagawa et al. 2016). For instance, the famous natural anti-malarial drug artemisinin, which is produced by semi-synthetic methods through engineered strains, has been put into industrial production as a precursor that meets the World Health Organization's requirements for drug quality and purity (Corsello and Garg 2015). However, to date, there have been no reports of strains engineering for the steroidal saponin.

An adequate understanding of natural product biosynthesis processes and deep mining of key enzymes are the basis for the utilization of various biotechnologies to produce natural medicines. At present, studies of the biosynthetic pathway of steroidal saponins mainly revolve around plants, and attention to the key enzymes required is focused on the upstream process. This article provides a comprehensive review of the biosynthetic pathways of steroidal saponins in plants and microorganisms and the key enzymes involved in the downstream biosynthesis process and summarizes the microorganisms involved in the transformation of steroidal saponins. The information supplied will benefit the development and application of biotechnology to promote the industrial production of steroidal saponins.

Biosynthetic pathway of steroidal saponins

With the advancement of research technology, such as high-throughput sequencing, gene expression analysis, and functional gene analysis of nonmodel plants, the biosynthetic pathways of steroidal saponins have been further improved and supplemented in recent years. Referencing and analysing the current research literature, provides a more comprehensive understanding of the biosynthesis process of steroidal saponins. The pathways related to the biosynthesis of steroidal saponins are shown in Fig. 1.

Fig. 1
figure 1

Biosynthetic pathways of ergosterol, cholesterol, sitosterol, stigmasterol, and campesterol in microorganisms and plants, and possible pathways for further biosynthesis of steroidal saponins from these sterols

The biosynthesis process of steroidal saponins is divided into three stages: the production of sterols, the synthesis of sapogenins, and the connection of different sugars to synthesize steroidal saponins. The cytosolic mevalonate (MVA) pathway and the plastidial methylerythritol 4-phosphate (MEP) pathway initiate of steroidal saponin biosynthesis. These two pathways are very common in nature and are a common process for the biosynthesis of terpenes, steroids and other secondary metabolites, being carried out in the cytoplasm and plastids in plants (Liao et al. 2016; Newman and Chappell 1999). The two pathways can generate isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP) from acetyl-CoA and pyruvate, respectively as starting points. Subsequently, geranyl diphosphate, farnesyl diphosphate, squalene and 2,3-oxidosqualene are synthesized. After further catalysis by various enzymes, different sterols are generated. Phytosterols are a major component of cell membranes and have important physiological functions in living organisms (Moreau et al. 2018). Although both pathways are involved in the biosynthesis of cholesterol and sitosterol, the precursors of steroidal saponins, the MVA pathway is thought to be the more dominant pathway (Sawai and Saito 2011).

Sitosterol is an important metabolite in plants, while cholesterol is the main sterol in animals. Although the cholesterol content in plants is very low, cholesterol is indispensable (Behrman and Gopalan 2005). The biosynthesis precursors of these two sterols are mainly lanosterol and cycloartenol. The production of cholesterol from lanosterol is the main process in animals and microorganisms; however, lanosterol synthase is only found in a few parts of dicotyledonous plants (Mehrafarin et al. 2010; Suzuki et al. 2006). Many studies have shown that in some plants, lanosterol can participate in the biosynthesis of plant sterols and steroid glycoalkaloids (Kumar et al. 2017; Ohyama et al. 2009). However, there is no direct evidence that lanosterol can be converted into steroidal saponins in plants. Conversion from cycloartenol to sitosterol and then to steroidal saponins is the most commonly accepted biosynthetic pathway for steroidal saponins. This process requires a series of methyltransferases, demethylases, reductases, dehydrogenases, and isomerases to produce sitosterol, followed by hydroxylase and glycosyltransferase to finally produce steroidal saponins (Singh et al. 2017; Upadhyay et al. 2014; Zhu et al. 2018a). Cycloartenol can also biosynthesize cholesterol and further form steroidal saponins, and this process is similar to that of phytosterols. One possibility is that the pathway lacks one-step methylation of cycloartenol. The other is the direct biosynthesis of cholesterol after cycloartenol produces isofucosterol along the phytosterol synthesis pathway (Ciura et al. 2017b; Sonawane et al. 2016). However, the second approach needs further verification (Yang et al. 2019). Comparing the changes in the contents of diosgenin, cycloartenol, cholesterol and sitosterol in Dioscorea nipponica before and after treatment with methyl jasmonate showed that both cholesterol and sitosterol in plants could participate in the biosynthesis of steroidal saponins, and the two approaches might exist simultaneously. The content of phytosterols in plants is much higher than that of cholesterol. Based on this, the phytosterol pathway may occupy a dominant position (Ciura et al. 2017a; Sun et al. 2017).

Microorganisms are a resource pool for the production of natural products. In particular, endophytes can produce the same metabolites as the host during their coexistence (Lata et al. 2018). In Paris polyphylla, more than 30 microbial strains, including fungi, bacteria and actinomycetes that can produce steroidal saponins and their analogue, have been isolated. Among them, the content of such components in the most capable strains was measured to reach 208.62 mg/L (Cao et al. 2005; Yang et al. 2018b; Zhang et al. 2007; Zhao et al. 2005; Zhou et al. 2004). Up to now, there is no direct research on the biosynthesis process of steroidal saponins in microorganisms. But the pathways of biosynthesizing a variety of sterols in microorganisms may be closely related to the production of steroidal saponins. For most fungi, ergosterol is the main sterol. A small number of fungi do not synthesize ergosterol, but can produce cholesterol, sitosterol, stigmasterol and other sterols (Fontaine et al. 2002; Grandmougin-Ferjani et al. 1999; Torres et al. 2017; Wang et al. 2012). The biosynthetic pathway of ergosterol has been clear. Compared with the biosynthetic process of sterols in plants, ergosterol is biosynthesized by lanosterol instead of cycloartenol through a series of reactions. The intermediate products of this process also include 4,4-dimethyl-cholesta-8,14,24-trienol, 4,4-dimethyl-zymosterol, zymosterol, feocosterol, episterol, ergosta5,7,24(28)-trienol, ergosta5,7,22,24(28)-tetraenol, etc. (Lees et al. 1995). Studies have shown that the way that microorganisms biosynthesize cholesterol is similar to that of animals, and the biosynthesis process before zymosterol is the same as that of ergosterol. The intermediates involved in the next biosynthesis process depend on the catalytic sequence of the enzyme, including cholest-8-enol, lathosterol, 7-dehydrocholesterol, cholesta-7,24-dienol, cholesta-5,7,24-trienol and desmosterol, etc. (Madoui et al. 2009; Weete et al. 2010). Studies on the biosynthetic pathway of phytosterols in microorganisms have shown that brassinosteroids can be biosynthesized through ergosta5,7,24(28)-trienol or ergosterol (Pereira et al. 2010; Visbal et al. 2003; Weete et al. 2010). However, the biosynthetic pathways of other phytosterols have not been reported so far. Microalgae are promising cell factories for the production of phytosterols and terpenes. Lanosterol, cycloartenol and cholesterol were found in some microalgae. The process of phytosterol synthesis in microalgae is similar to fungi and terrestrial plants, but there are some unique key genes involved (Fabris et al. 2014; Jaramillo-Madrid et al. 2019; Xie et al. 2017). Prokaryotes do not have cell membranes, and it is generally believed that they cannot produce sterols. However, there was evidence that a small number of bacteria could synthesize sterols. This phenomenon may be caused by the horizontal transfer of related genes from eukaryotes to prokaryotes during evolution (Bouvier et al. 1976; Desmond and Gribaldo 2009; Pearson et al. 2003). Studies have shown that the sterol biosynthesis pathways of some bacteria and the catalytic enzymes involved are very different from those of eukaryotes (Lee et al. 2018). Steroidal saponins also exist in marine organisms, but studies have mainly focused on the separation of compounds and the exploration of pharmacological effects. There is no relevant research on the in vivo steroidal saponin biosynthesis pathway, which deserves attention.

Although plants can produce steroidal saponins through different pathways, the enzymes involved in the synthesis are functionally consistent (Sonawane et al. 2016). Comparing of multiple high-throughput analysis results and systematic genomic analysis of eukaryotic steroid and sterol synthesis genes, it can be concluded that the enzymes that form sterols are homologous (Haubrich et al. 2014). During the continuous evolution of plants, some enzymes have continuously acquired new functions and the selection specificity has become stronger but still retains certain pre-evolution functions. The same enzyme may be able to choose multiple substrates to play a role. When the enzymes involved in cholesterol synthesis in plants are consistent with phytosterols, the phytosterol pathway may catalyse similar cholesterol substrates in a mixed manner. Plants and microorganisms synthesize sterols with different starting components. Although the intermediates and products produced are different, and the genes involved in regulating the necessary enzymes are diverse, the functions of these enzymes are very similar. Therefore, some enzymes with low selectivity can catalyse other similar substrates.

By exploring the mechanisms by which endophytes produce natural products, we believe that the endophytes can produce natural medicine either in the same way as in plants or in unique ways (Heinig et al. 2013; Venugopalan et al. 2016). Although some of the key enzymes of the biosynthesis process in endophytes are similar to the functions in plants, the amino acid sequence correlation between them is very low, proving that endophytes can independently evolve channels to produce natural products (Hedden et al. 2002). In addition, the amino acid sequences encoding key genes and transcription factors present in endophytes are highly homologous to those of the host. It has been proved that some genes can be transferred horizontally during the coexistence process. There is no report at the gene level of the mechanism of endophyte production and conversion of steroidal saponins, but it can be speculated that steroidal saponin-producing fungi also possess these mechanistic characteristics (Sah et al. 2017).

Key enzymes in the biosynthetic pathways of steroidal saponins in plants

The MVA and MEP pathways involved in the biosynthesis of steroidal saponins have been fully described and studied, and it has been found that the key genes involved can regulate the accumulation of sterols (Chaudhary et al. 2015; Suza and Chappell 2015). The key enzymes involved in the downstream biosynthetic pathways mainly include geranyl diphosphate synthase (GPPS), farnesyl diphosphate synthase (FPPS), squalene epoxidase (SQE), cycloartenol synthase (CAS), sterol 24-C-methyl transferase (SMT1), sterol 4α-methyl oxidase 1 (SMO1), cyclopropylsterol isomerase (CPI1), sterol C-14 demethylase (CYP51), sterol C-14 reductase (FK), sterol 8,7 isomerase (HYD1), sterol-C-24-methyl transferase 2 (SMT2), sterol 4α-methyl oxidase 2 (SMO2), sterol C-5(6) desaturase 1(DWF7), 7-dehydrocholesterol reductase (DWF5), delta24-sterol reductase(DWF1), CYPs, and UGTs. CPI, CYP51, FK and HYD1 are shared enzymes in the phytosterol and cholesterol pathways (Sonawane et al. 2016) (Fig. 1) At present, a variety of enzymes in plants related to steroidal saponins biosynthesis have been cloned and studied using gene silencing and other methods. The following content mainly describes some downstream genes that control the biosynthesis of steroidal saponins, reviews the latest research progress, and explores the regulatory roles of these genes.

Sterol methyltransferase

Sterol methyltransferase (SMT) is responsible for the methylation of C-24 alkyl sterols in different modes and is an important gene in the process of phytosterol biosynthesis. In plants, SMT1 and SMT2 are the two main types of SMT. SMT family sequences are highly similar and are considered to have a common ancestor. However, the specificity of the two types of SMT substrates is significantly different. At the same time, different SMTs may produce respective sterols when catalysing the same substrate (Guan et al. 2018; Nes 2005). SMT1 participates in the first step of sterol biosynthesis and is used to catalyse the conversion of cycloartenol to 24-methylenecycloenol. The biosynthesis of 24-methylenecycloenol is a common attribute of all photosynthetic plants. This step is also a bifurcation of phytosterol and cholesterol biosynthesis pathways. SMT1 has been characterized in model plants such as tobacco seed and Arabidopsis thaliana (Holmberg et al. 2002; Husselstein et al. 1996), and has been cloned and functionally identified in plants such as Tripterygium wilfordii (Guan et al. 2017), soybean (Neelakandan et al. 2009), upland cotton (Luo et al. 2008), and Paris polyphylla (Guan et al. 2018). This enzyme has different characteristics in the stereoscopic selection of substrates (Venkatramesh et al. 1996b), and differences in its expression are mainly reflected as changes in cholesterol levels. Based on studies of Withania somnifera, tobacco seed and Paris polyphylla, it was found that overexpression of SMT1 can result in undetectable cholesterol levels, but significant increases in other phytosterols (Guan et al. 2018; Holmberg et al. 2002; Pal et al. 2018). For plants lacking the SMT1 gene, the expression of genes related to the MVA and MEP processes and sterol biosynthesis process are affected, and various CYP450 genes are also suppressed to varying degrees, while cycloartenol and various transcription factors are significantly increased (Ciura et al. 2017a; Diener et al. 2000). By inhibiting the accumulation of cholesterol caused by SMT1, plant growth and diseases responses can be regulated (Chen et al. 2018a).

SMT2 is mainly involved in the biosynthesis of 24-ethylsterol and is responsible for converting 24-methylidenelophenol to 24-ethylidenelophenol. This enzyme plays a decisive role in regulating the contents of campesterol and sitosterol to meet plant growth and membrane integrity requirements. The study found that the initial appearance of SMT1 before SMT2 in phytosterol biosynthesis arose recently in plant evolution. A more primitive SMT gene may have been bifunctional and catalytically promiscuous (Neelakandan et al. 2009). Three SMTs were found in Arabidopsis, of which SMT2 and SMT3 both catalyse the biosynthesis of structural sterols and signal brassinosteroid derivatives and are highly regulated. By constructing SMT-deficient plants, it was found that the substrate specificity of SMTs is somewhat mixed. Although the gene sequence similarity of sterol methyltransferases in fungi and higher plants is not high, SMT2 and SMT3 in Arabidopsis can promote the addition of C1 and C2 in yeast Erg6p mutants blocked by SMTs (Diener et al. 2000).

Methyl oxidase and reductase

The phytosterol biosynthesis process is catalysed by SMO1 and SMO2 at the C4 position twice for demethylation. These two methyl oxidases have low sequence homology and functional differences (Darnet and Rahier 2004). SMO1 is responsible for converting 24-methylenecycloartanol to cycloeucalenol, while SMO2 converts 24-methylenelophenol and 24-ethylidenelophenol to episterol and delta7-avenasterol, and finally produces brassinosterol and sitosterol (Sonawane et al. 2016). DWF5 and DWF1 are the two main reductases in plant sterol biosynthesis. In the biosynthesis of stigmasterol, DWF5 first reduces delta-5,7-avenasterol to isofucosterol, and DWF1 further catalyses the production of stigmasterol. Moreover, these two enzymes are also involved in the biosynthesis of campesterol, which is considered to be the main precursor of brassinosteroids (Tsukagoshi et al. 2016; Youn et al. 2018b).These enzymes are key genes controlling auxin biosynthesis in plants and are related to embryonic development. The current research on these two enzymes mainly focuses on their regulation of plant growth hormones. SMO is a key gene controlling auxin synthesis in plants and is related to embryo development (Zhang et al. 2016b). The synthesis of bioactive brassinosteroids in mutants lacking DWF1 was reduced and led to dwarfism (Youn et al. 2018a). Studies on tomatoes have shown that DWF5 can cause dwarfing of tomato plants by affecting the synthesis of brassinosteroids (Rahim et al. 2018). However, there is currently no report on the regulation of steroidal saponin biosynthesis in plants by such reductases.

Cytochrome P450 monooxygenase

Cytochrome P450 monooxygenase is a membrane-bound enzyme that originated in the prokaryotic era and is capable of various oxidation reactions such as C-H hydroxylation, epoxidation, heteroatom oxidation, aromatic epoxidation, and dealkylation. In the process of continuous evolution, enzymes exhibit different substrate specificity to adapt to different needs (Yoshida et al. 2000). In the process of steroid biosynthesis, CYP51 is the only family that appears in the fungal, mammalian and plant kingdoms and can convert obtusifoliol into delta 8,14-sterol. The P450 enzyme found in the downstream stage mainly plays a role in the hydroxylation of C-16, C-22 and C-26 (Ohnishi et al. 2009). CYP90B1, also known as DWF4, is a C-22 steroid hydroxylase that can catalyse C27/C28/C29 sterols. CYP90B1 has the strongest catalytic activity for cholesterol, followed by brassinosteroid, and the weakest activity for sitosterol, and is involved in the first step in cholesterol production of steroidal saponins (Ciura et al. 2017b; Fujita et al. 2006). PpCYP90B27 was the first P450 enzyme cloned from Paris polyphylla that participates in the biosynthesis of steroidal saponins, but its catalytic specificity and substrate specificity need further study (Yin et al. 2018). CYP734A1 and CYP18A1 are both 26-hydroxylase enzymes. CYP18A1 is mainly a key enzyme in the biosynthesis of plant ecdysis hormone, but the gene found in fenugreek has also been found to be involved in the biosynthesis of steroidal saponins through induction experiments (Ciura et al. 2017a). CYP734A1 is involved in the biosynthesis of brassinosteroids and auxin (Youn et al. 2015). Transcriptome analysis of allium plants was used to investigate the upregulation of CYP734A1 and CYP72B1 (Abdelrahman et al. 2017). There are few studies on C-16 hydroxylase. In exploring the biosynthesis of steroid alkaloids in potatoes, Masaru Nakayasu et al. discovered the 16DOX gene, which that can regulate the C-16 hydroxylation of sterols. This gene may be a suitable target for controlling the levels of toxic steroid alkaloids (Nakayasu et al. 2017).

Glycosyltransferase

Uridine diphosphate-dependent glycosyltransferases (UGTs) are the most common type of glycosyltransferases, and are responsible for transferring UDP-sugar donors to small molecule saponins and steroid receptors in the synthesis of steroidal saponins (Wilson and Tian 2019). Glucose, rhamnose, arabinose, xylose, glucuronic acid, fucose, apiose, etc. can be used as glycogen for UGTs, and multiple UGTs gradually catalyse the extension of oligosaccharides of saponins. UGTs are among the key enzymes that increases the diversity of secondary metabolites (Inoue and Ebizuka 1996a; Louveau et al. 2018). This type of enzyme has a catalytic effect on endogenous and exogenous substrates (Messner et al. 2003), and there is a certain evolutionary relationship between enzymes that catalyse monosaccharides and disaccharides (Huang et al. 2018). At the same time, these enzymes have a high degree of regioselectivity and stereoselectivity (Singh et al. 2018) and can identify a series of substrates with common characteristics , and its expression module shows a high tissue specificity (Song et al. 2015). SaGT4A, isolated from belladonna, was the first cloned functional glucosyltransferase involved in the biosynthesis of steroidal saponins (Moreau et al. 2018). Two rhamnosyltransferases, Dz3GT1 and Dz3GT2, identified from Dioscorea zingiberensis can glycosylate diosgenin and cholesterol (Zhu et al. 2018a). The UGT80 gene family mainly includes UGT80A2 and UGT80B1. UGT80B1 has been found to be involved in the biosynthesis of steroidal saponins (Singh et al. 2017), while UGT80A2 is mainly involved in the accumulation of minor sterol glycosides (SGs) and acyl sterol glycosides (ASGs) (Jiang et al. 2017). At present, the number of glycosyltransferases that can participate in the biosynthesis of steroidal saponins is limited. The identification and isolation of glycosyltransferases as well as the study of their characteristics and functions and use of protein modifications to construct enzymes with similar structures but more functions, will have a positive effect on expanding the source of saponins and enriching their types.

Glucosinase

A variety of glycosidic bonds exist in steroidal saponins. Among them, 26-O-β-glucosidase is a key enzyme in the formation of furosesteric saponins into spirosteroidal saponins. Kentaro et al. isolated and purified F26 from Costus speciosus Sm. Compared to common hydrolases, it was found that this enzyme had a high specificity for the recognition of furosteroid and spirosteroid saponins (Inoue and Ebizuka 1996a, b). Masaru Nakayasu et al. successfully cloned a 26-O-β-glucosidase in Dioscorea zingiberensis and conjectured that this physiological characteristic is a kind of defence mechanism of plants. Furosteroid saponins are the precursor forms of spirosteroid saponins. Under normal physiological conditions, these two types of metabolites are distributed in different regions. When encountering biological stress and other factors, the spatial and chemical relationships between the two are disrupted, and the substrate is transformed by the corresponding enzymes to resist unstable conditions (Nakayasu et al. 2015). The β-glycosidase purified from Solanum torvum also has the characteristic of recognizing natural products better than synthetic products (Arthan et al. 2006). The expression of this enzyme in yeast cells has been used for the hydrolysis of furanosyl glycoside (Suthangkornkul et al. 2016). This enzyme is the most widely studied glycosidase in the biosynthesis of steroidal saponins.

Enzymes related to the biosynthesis and transformation of steroidal saponins in microorganisms

The enzymes related to sterol biosynthesis in fungi and plants are quite different, but they possess similarities in function and homology. This is related to their co-evolution (Ohyama et al. 2009). The most important sterol in fungi is ergosterol. At present, metabolic engineering has been utilized to transform the key enzymes of fungal ergosterol synthesis to produce industrially relevant sterols and steroids using yeast-engineered strains (Wriessnegger and Pichler 2013). Using the synthetic pathway of microbial biosynthesis of ergosterol, an engineered yeast strain that can biosynthesize cholesterol with only a simple carbon source has been successfully constructed (Pompon et al. 2012; Souza et al. 2011). During its biosynthesis, a methyltransferase named ERG6, which is mainly responsible for converting cryptosterol to coprosterol was found. At present, SMT1 has been isolated from microorganisms such as Saccharomyces cerevisiae and Pseudomonas spp. (Desta et al. 2019; Venkatramesh et al. 1996a). The process of ergosterol biosynthesis also includes the methyl oxidase ERG25, and two reductases, ERG4 and ERG24. In contrast to the process in plants, the two demethylation processes are completed by ERG25 (Bard et al. 1996). By verifying the functional experiments of the SMO and SMT genes in Arabidopsis thaliana, the genes of the same family in fungi and plants showed low sequence homology. These genes in plants can express their function in mutant strains, but there are differences in their activities and ways of functioning (Darnet et al. 2001; Ganapathy et al. 2008). CYP51 is named ERG11 in fungi and is responsible for catalysing the formation of lanosterol 4-4-dimethylcholesterol-8,14,24-trienol. Although there are no relevant reports on the key enzymes for the direct synthesis of steroidal saponins by microorganisms, certain enzymes involved in the biosynthesis of ergosterol can participate in the synthesis of cholesterol, the precursor of steroidal saponins. UGTs isolated from microorganisms show high substrate flexibility and poor regional specificity, and can be used as effective biocatalysts for the glycation of natural products. The amino acid length of UGT sequences in fungi changes in different plants. The evolutionary conservation of sequence similarity of the two is up to 60–70%. This difference reflects the common evolution of fungi as part of plant systems (Singh et al. 2018). There are also various glycosidases in microorganisms. Ding et al. found that 26-O-β-glucosidase can be produced in an endophyte that can increase the content of dioscin (Ding et al. 2014). Pgase-1, which was isolated and purified from Aspergillus oryza can hydrolyse a variety of sugars at different positions, thereby participating in the biosynthesis of steroidal saponins (Liu et al. 2013b). Zhang et al. obtained β-galactosidase from Lactobacillus bulgaricus L3, which can make steroidal saponin galactose (Zhang et al. 2016a). GiGly obtained from Gibberella intermedia can specifically hydrolyse multiple-3-O-β-glycosides (Li et al. 2016).

The pharmacological effects of steroidal saponins are closely related to the configuration of steroidal sugar groups, and the number and mode of chain sugars (Podolak et al. 2010; Prawat et al. 2016; Yang et al. 2018a; Zhang et al. 2016c). Some microorganisms can use their own enzyme systems to biotransform steroidal saponins, which not only increases the types of steroidal saponins, but also promotes the production of target saponins and glycosides. The microorganisms found to promote the conversion of steroidal saponins are shown in Table 1. Enzymes that biotransform the natural products of microorganisms usually have strong catalytic ability and specificity, and their stereoselectivity can complement chemical synthesis. To date, the research on the transformation of steroidal saponins by these functional strains has mainly focused on the screening of target strains and elucidation of their transformation pathways. The regulatory mechanism of steroidal saponin biotransformation deserves further exploration.

Table 1 Microorganisms involved in the transformation of steroidal saponins

Conclusion and perspectives

This article details the three pathways of steroidal saponin biosynthesis in different plants and the key enzymes required in downstream processes. The biosynthesis of steroidal saponins starts with MVA and MEP pathways, and then generates squalene, 2,3-oxysqualene, and cycloartenol. Through the action of a series of methyltransferases, oxidases, and reductases, sitosterol or cholesterol is biosynthesized. Finally, different steroidal saponins are produced under the catalysis of a variety of cytochrome P450 enzymes, glycosyltransferases and glycosidases. Although a pathway to synthesize cholesterol through lanosterol has been found in some plants, the ability to produce steroidal saponins through this pathway has not yet been discovered. In addition, it is believed that enzymes with the same function in these pathways may catalyse similar substrates in a mixed manner. Although the biosynthetic pathway of steroidal saponins is clearly understood, there is little research on how cholesterol and sitosterol are converted into steroidal saponins, differences in the biosynthetic pathways of saponins with different configurations, and biosynthetic pathways in nonplant organisms. It is necessary to further verify the pathway from lanosterol to cholesterol then to steroidal saponins and from sitosterol to steroidal saponins, to fully discover the biosynthesis of steroidal saponins in microorganisms and marine organisms, and to pay more attention to the effect of configurations on biosynthesis.

To date, some microorganisms have been found to have the ability to produce plant sterols, cholesterol and even steroidal saponins, and approximately 40 strains promote the conversion of steroidal saponins in the laboratory or in industrial production. Through studies on the biosynthetic pathway of microbial sterols and the mechanisms by which endophytes produce natural products, it can be speculated that the biosynthesis of steroidal saponins in microorganisms may utilize ergosterol biosynthetic enzymes or unique pathways. As genes from microorganisms are relatively easy to obtain and have a strong catalytic ability (Cheong et al. 2016; Fan et al. 2017), these functional strains are an important source of key enzymes in the construction of engineered strains that produce steroidal saponins.

At present, by integrating relevant genes in plants, animals and yeast, engineered strains for the production of diosgenin have been successfully constructed (Cheng et al. 2020). Regrettably, there is no report on the construction of engineering strains for steroidal saponin production, but the engineering strains for triterpene saponin production have not only improved the production capacity of ginsenosides but also achieved the acquisition of new saponins (Yang et al. 2015; Yu et al. 2017; Zhang et al. 2018). These successful practices provide ideas and promise for the construction of strains engineered to produce steroidal saponin. There are still many challenges in cloning and expressing biosynthetic gene clusters by heterologous enzymes in the host. Therefore, it is necessary to use genome mining technology to improve the understanding of the biosynthetic pathway of steroidal saponins, increase the discovery and utilization of biosynthetic gene clusters, and apply epigenetic modification and targeted genome modification to enhance gene silencing and certain specific gene activation mechanisms in steroidal saponin biosynthesis. Research achievements on the biosynthetic pathways of steroidal saponins and key enzymes will promote the industrial production of steroidal saponins by biotechnology.

Author contribution statement YYC compiled and wrote the manuscript and created the figures. XWD conceptualized, designed and edited the manuscript. JKW and DY provided scientific feedback and revised the content. All authors read and approved the manuscript.