Introduction

So far, microbial strains have been widely used to produce various valuable products related to agricultural, biofuel, chemical, environmental, food and pharmaceutical industries (Zeng et al. 2020). But most of the originally isolated strains cannot be directly used for industrial production because of low productivity and weak stress tolerance, leading to an increasing interest in strain improvement over the last several decades (Zeng et al. 2020).

The strategies of strain improvement mainly include random mutagenesis, protoplast fusion, genome shuffling (GS) and rational genetic engineering approaches. Random mutagenesis followed by screening, can lead to desired mutants. As random mutagenesis is easy to operate, and not need the genetic background of microbe strains, it has succeeded in breeding many microbial strains, but it is laborious and time-consuming (Gong et al. 2009; Gu et al. 2017). Rational genetic engineering approaches, such as recombinant DNA technology, metabolic engineering, systematic engineering and genome editing, can modify the specific genes of the target strain in a rational manner (Magocha et al. 2018). But these approaches require deep understanding of the genetic background of the target strain and need necessary genetic tools, which have limited the wide application of the rational approaches.

GS, first used for strain improvement in 2002, has been applied for phenotypic improvements of many important strains (Magocha et al. 2018; Zhang et al. 2002). This practical technology has been considered as a novel whole-genome improvement method for the rapid improvement of the complex phenotypes (Gong et al. 2009; Zhang et al. 2002). GS allows for recombination between multiple parents at each generation, and through several rounds of recursive protoplast fusion, the fusants successful fused the genetic traits from multiple parental strains, significantly improving the genetic diversity of “complex progeny” and remarkably increasing the opportunity for obtaining the high-performance fusants (Gong et al. 2009). Compared with random mutagenesis, GS is more effective in obtaining desired phenotypes (Magocha et al. 2018). Compared with rational genetic engineering approaches, GS also offers more advantages. Firstly, GS is more convenient, easy to operate and can be used in most laboratories without expensive equipment (Gong et al. 2009). Furthermore, GS can break through the limits of species even genus to accelerate the directed evolution of important microbes (John et al. 2008; Luna-Flores et al. 2017), and it circumvents the essential requirements of comprehensively genetic background or metabolic network information of the target strain, therefore this technology has a wider range of application than rational approaches (Gong et al. 2009; Zhang et al. 2002). Most importantly, based on natural homologous recombination via protoplast fusion, fusants obtained by GS are not considered ‘genetically modified’, overcoming the big obstacles of the application of genetically modified organisms (Gong et al. 2009; Zhang et al. 2002).

In this review, we introduce the procedure of GS, discuss the aid strategies of GS, summarize the applications of GS for increasing metabolite yield, improving strain tolerance, enhancing substrate utilization, and put forward the outlook to the future development of this technology.

The procedure of genome shuffling

The procedure of GS mainly consists of selection of initial strain, construction of parental library, recursive protoplast fusion, screening of desired fusants and stability evaluation (Gong et al. 2009; Zhang et al. 2002) (Fig. 1).

Fig. 1
figure 1

The general process of genome shuffling

First of all, the initial strains should be selected, and then used to construct the parental library. The initial strains can be a single strain, or strains from different species, or strains from different genus (Jetti et al. 2019; John et al. 2008; Luna-Flores et al. 2017). To construct the parental library, mutagenesis is still the main choice, and in this step, the initial strains would be subjected to one or several rounds of mutagenesis. Then the selected mutants from the parental library were sent to prepare protoplasts, which would be similarly fused and regenerated. Protoplasts fusion is mainly induced by polyethylene glycol (PEG) and/or electrical pulses. The above process would be repeated several rounds, namely recursive protoplast fusion. Recursive protoplast fusion is the distinctive operation of GS, which successfully completes the efficient shuffling of diverse genes from different parental strains, ensuring the construction of desired phenotypes. The next step is to screen the desired phenotypes, which is the crucial step to ensure the success of GS, but how to efficiently screening the desired fusants is difficult and complex task. Generally, the screening methods are varied by the target of strain improvement, traditionally rely on physiological or biochemical characters of desired phenotypes. For instance, the hydrolysis zone, clear zones or inhibition zone is always employed for screening yield improved fusants, selective medium with corresponding tolerant substances is always used for screening tolerance improved fusants. Additionally, auxotroph and inactivated parental protoplast fusion were also applied in GS for screening the fusants (Gong et al. 2009). Simply, the more efficient the screening method is, the more rapidly the desired phenotypes are obtained, thus the development of the high-throughput screening (HTS) method is important for GS. The last step of GS is to evaluate the genetic stability of the selected fusants, which is always determined by evaluating the performance after continuous passage. Only the genetically stable fusants are of practical value.

Aid strategies for genome shuffling

Although GS has been successfully used for strain improvement of many microbial strains, there are still some practical needs, such as low mutant diversity, low fusion rate, low screening efficiency (Wang et al. 2016; Zhang et al. 2002). In recent years, several effective and sophisticated tools and/or methods have been developed, and introduced into GS as aid strategies to make the technology simpler, more efficient and less time consuming. Here, we describe these aid strategies to present the development potential of this practical technology.

Aid strategies for increasing the mutant efficiency

To date, random mutagenesis is still the common method for generating the mutants, however, it has some inherent limitations such as long time and low mutant efficiency. In recent years, for increasing the mutant efficiency, some novel mutation methods have been developed and applied in GS, such as atmospheric and room temperature plasma (ARTP) (Gu et al. 2017; Shi et al. 2018), atmospheric pressure non-equilibrium discharge plasma (APNEDP) (Xu et al. 2012), femtosecond laser (Liu et al. 2013), ribosome engineering (Liu et al. 2020) and low-energy ion implantation (LEII) (Xu et al. 2012) (Table 1). For example, Liu et al. applied a femtosecond laser to treat Micromonospora sagamiensis for increasing micronomicin yield (Liu et al. 2013). Under the optimized irradiation conditions of 75 mW and 180 s, a maximum of positive mutation rate of 17.8% was obtained, and maximum yield from mutant MX3004 of 263 U/ml was achieved, which was increased by 484% compared with the parent strain (Liu et al. 2013). Besides, to construct mutant libraries more rationally, the random assembly-based strategies, such as multiplex automated genome engineering (MAGE), CRISPR, and multiplex automated genome engineering for eukaryotes (EMAGE), can also be introduced into GS for generating large-scale libraries with higher diversity and different functions (Zeng et al. 2020).

Table 1 Genome shuffling increase the production of microbial metabolites

Aid strategies for enhancing fusion rate

Protoplast fusion is the key event in GS, and the higher the fusion rate is, the more rapidly the desired phenotypes are obtained. However, traditional protoplasts fusion mainly relies on the presence of PEG and/or electric pulse, leading to a low fusion rate. Here we present some new technologies to enhance the fusion rate, such as femtosecond laser (Gong et al. 2008; Liu et al. 2013), nanosecond pulsed UV laser (Steubing et al. 1991), microfluidic device (Skelley et al. 2009) and optical tweezers (Steubing et al. 1991; Zhong et al. 2013). For example, femtosecond laser, possessing ultra-high temporal and spatial resolutions, was employed to induce protoplasts fusion of Phaffia rhodozyma (Gong et al. 2008). Under the optimal condition of 1.38 × 104 W and 0.25 s, the maximal fusion rate achieved 80%, providing increased efficiency in generating fusants and suggesting the great potential application of femtosecond laser in protoplasts fusion and GS (Gong et al. 2008). While Skelley et al. developed a microfluidic device that can trap and properly pair thousands of cells, and then more than 50% properly paired and fused cells were achieved (Skelley et al. 2009).

By the way, a simulation method called Monte Carlo simulation, which was used to estimate the significance level of any test statistic, was introduced into GS to simulate the simplified processes of protoplast fusion, indicating that the highest fusion rate would be achieved from 8 to 12 different parental protoplasts (Wang et al. 2016).

Aid strategies for improving screening efficiency

Recently, some developed schemes, HTS devices and/or analytical technologies have been designed for screening of desired fusants, improving the screening efficiency and making the operation of GS simpler, such as sexual and asexual reproduction (Hou 2009), drug resistance markers (Zheng et al. 2011), microplate reader (Gong et al. 2007; Liu et al. 2013), liquid chromatography-mass spectrometry (LC-MS) and fluorescence-activated cell sorting (FACS) (Fields et al. 2019; Zhang et al. 2015). To be specific, by using yeast sexual and asexual reproduction by itself instead of protoplasts fusion, a novel GS method was developed to increase ethanol production of S. cerevisiae (Hou 2009). After three rounds of GS, the obtained fusant distinctly improved ethanol resistance, but also increased ethanol yield by up to 13% compared with the control (Hou 2009). Zheng et al. introduced two drug resistance markers into GS for screening S. cerevisiae fusants with high acetic acid tolerance, with the aid of G418- and Zeocin-resistance markers, high screening efficiency was achieved (Zheng et al. 2011). While Gong et al. combined a microplate reader with 96-well microtiter plates, developing a HTS method for high epothilones-producing fusants (Gong et al. 2007). The fusants were inoculated in two parallel 96-well microtiter plates, the first plate was cultured to produce epothilone, while the second one was cultured to maintain the fusants. By using a microplate spectrophotometer, the yield of each fusant in the first plate was assayed, then the high-yield fusants were recovered from the second plate for further assay (Gong et al. 2007). Zhang et al. developed a FACS method to efficiently screen fusants of nonconventional yeast Pichia anomala for improved sugar alcohol production through GS. Parent strains were first labeled with different fluorescent stains, then fusants were selected based on their dual fluorescence by flow cytometry, achieved the efficient screening of fusants without complementary genetic markers (Zhang et al. 2015).

Application of genome shuffling for strain improvement

In 2002, GS was first applied to the improvement of tylosin production in Streptomyces fradiae (Zhang et al. 2002). Since then, GS has been extensively used to improve a variety of microbial strains for desired phenotypes (Table 1), including bacteria such as Acetobacter (Wei et al. 2012), Bacillus (Chen et al. 2020; Gu et al. 2017; John et al. 2008), Clostridium (Gu et al. 2017; Otte et al. 2009), Lactobacillus (John et al. 2008) and Zymomonas (Wang et al. 2019), actinomyces such as Streptomyces (Liu et al. 2020; Zhang et al. 2002), yeasts such as Saccharomyces (Pinel et al. 2011; Yin et al. 2016), Candida (Wei et al. 2008) and Yarrowia (Zhao et al. 2014a, b), moulds such as Aspergillus (El-Gendy et al. 2016), Aureobasidium (Kang et al. 2011) and Penicillium (Cheng et al. 2009), and microalgae such as Chlamydomonas (Fields et al. 2019) (Tables 1 and 2). Importantly, not only for strain improvement of intraspecific microbes, but GS has been also successfully applied to the improvement of interspecific microbes (Li et al. 2013; Luna-Flores et al. 2017) and intergeneric microbes (Gu et al. 2017; Jetti et al. 2019; John et al. 2008). Here, we summarize the main examples of the application of GS, especially for increasing metabolite yield, enhancing strain tolerance and improving substrate utilization, fully demonstrating the power and potential of this technology (Tables 12).

Table 2 Genome shuffling improve the physiological characteristics of microbes

Increase metabolite yield

The most important application of GS is to increase metabolite yield. For its first application, only after 2 rounds of protoplasts fusion, the generated fusants showed remarkable improvement in tylosin production, equivalent to those obtained achievement previously requiring about 20 years of effort (Zhang et al. 2002). Chen et al. used GS to increase surfactin yield in B. velezensis, after 3 rounds of GS, a high-yield fusant F34 was obtained, exhibiting a dramatic increase in surfactin yield (from 229.60 ± 7.10 mg/L to 908.15 ± 5.65 mg/L) (Chen et al. 2020). Liu et al. employed GS and ribosome engineering to improve tiancimycin-A production, then fusant CB03234-GS26 was obtained, 1.6 times higher than that of the initial Streptomyces sp. CB03234 (Liu et al. 2020). Similarly, for eucaryotic microbes, S. cerevisiae YS86, a glutathione-producing strain, was carried out 2 rounds of GS, then a 32-folds yield improvement of fusant YSF2-19 was obtained (Yin et al. 2016). P. decumbens JU-A10, a cellulase-producing strain, was also carried out 2 rounds of GS, and fusant GS2-21 achieved the maximum yield of 102.63 FPU/L/h, producing more cellulase (1.42-folds) much earlier (44 h) than that of the original strain (90 h) (Cheng et al. 2009). Furthermore, by GS, a high lovastatin-producing fusant F3/7 was obtained from A. luchuensis MERV10, which produced 57.0 mg/gds lovastatin, 6.0-folds higher than that of the starting strain (El-Gendy et al. 2016). Even for microalgae strains, GS also exhibited powerful effect, a Chlamydomonas reinhardtii fusant was obtained by GS in less than 3 months to express a green fluorescent protein (GFP) over 2% total soluble protein (TSP), which was 15-folds higher than that of the initial strain (Fields et al. 2019).

Enhance strain tolerance

Microbial cells always suffer from many environmental stresses during fermentation, such as strong acid, thermal and osmotic stresses, toxic products, and feedback inhibition, severely affecting their metabolic activity and productivity (Zhu et al. 2018). However, cell tolerance of microbial strains is regulated by distributed polygenes in the genome, thus the improvement of strain tolerance is a complex and laborious task (Zhu et al. 2018). Fortunately, GS has been reported to successfully improve stress tolerance in many industrial strains (El-Bondkly 2012), such as acetic acid-tolerant Zymomonas mobilis (Wang et al. 2019), acid-tolerant Lactobacillus (Patnaik et al. 2002; Wang et al. 2007), thermo-tolerant S. cerevisiae (Shi et al. 2009), sulphite liquor-tolerant (Pinel et al. 2011) and ethanol-tolerant S. cerevisiae (Jetti et al. 2019), glucose-tolerant L. rhamnosus (Yu et al. 2008) (Table 2).

Improve substrate utilization

Effective utilization of substrates is one of the most desired traits of industrial strains, while GS was also proved to successfully improve substrate utilization of many microbial strains (Table 2) (Jetti et al. 2019; John et al. 2008; Magocha et al. 2018). For example, Jetti et al. constructed a S. cerevisiae hybrid SP2-18 by GS, which can utilize hexose sugars as well as pentose sugars, whereas the parental strain cannot utilize xylose (Jetti et al. 2019). John et al. chose a lactic acid-producing L. delbrueckii and an amylase-producing B. amyloliquefaciens as the parental strains, after three cycles of GS, fusant F2 was obtained, which directly produced 40 g/L lactic acid from 83 g/L cassava bagasse (starch content 50% w/w) with 96% of starch conversion (John et al. 2008).

In addition, GS was proved to effectively enhance the degradation of toxic compounds for many strains (Table 2) (Dai and Copley 2004; Yi et al. 2019). For example, Pentachlorophenol (PCP) is a highly toxic pesticide, Dai and Copley used GS to obtain some high PCP-degrading fusants, which can grow on broth plates containing 6 to 8 mM PCP and completely degrade 3 mmol/L PCP in 48 h, while the original Sphingobium chlorophenolicum strain cannot grow in the presence of PCP at concentrations higher than 0.6 mmol/L (Dai and Copley 2004). Perfluorooctanoic acid (PFOA) is an emerging persistent organic pollutant, which is hard to be degraded by conventional methods because of its stable physical and chemical properties. Yi et al. successfully employed GS to improve the PFOA-degrading bacterium Pseudomonas Parafulva YAB-1, the PFOA degradation rate of fusant F3-52 was up to 58.6%, 1.8-fold higher than that of strain YAB1 (Yi et al. 2019).

Other applications

Gene clone, mutation or protoplast fusion may activate the silent genes in the genome, then the strain would produce new active metabolites (Hopwood and Chater 1980). In this respect, some researchers have proved that GS is also efficient in activating some metabolisms, thus GS could be used as a new strategy for the mining of new genes and metabolites (Cao et al. 2010; Wang et al. 2010). Wang et al. reported that fusant G-444 produced 8 new compounds, which were different in structure types and substitutions from those of the original Tubercularia sp. TF5, indicating some silent genes were activated after GS (Wang et al. 2010). Cao et al. used GS to improve the salt tolerance of Zygosaccharomyces rouxii, while the obtained fusant S3-2 not only grew well in high-salt medium, but also produced high amino acid nitrogen and ethyl acetate, and even produced a new flavor component, 4-ethylguaiacol (Cao et al. 2010). Additional, GS has also shown the powerful effects in another desired phenotypes in microbial strains, such as adhesive property in L. plantarum (Zhao et al. 2017), antimicrobial activity in Pediococcus acidilactici (Han et al. 2017) and biocontrol activity in S. bikiniensis (Zhao et al. 2014a, b). To sum up, GS is a wide range and powerful technology for strain improvement.

Future outlooks

Since the first application in 2002, GS has experienced almost two decades, and has successfully finished the improvement of many industrial strains for desired phenotypes, but it still has much room for development. Firstly, GS can combine with the rational genetic engineering techniques, for instance, with the help of rational techniques, a single gene, metabolic pathway or regulatory network with the clear genetic background can be directionally modified in advance, and then the modified phenotypes can be used for further shuffling, which will make the improvement more efficient and rational. Similarly, the fusant obtained by GS, can be sent for further directional modification in virtue of rational techniques, to harvest the better producers. Furthermore, with the development of omics and bioinformatics, the molecular mechanism of high-performance fusants can be explored, therefore GS became an important bridge to link other techniques, providing an opportunity to illustrate the metabolic networks and regulatory mechanisms.

In addition, now the availability of HTS methods is still a serious drawback for GS, while the establishment of the HTS methods is complex. In recent years, some novel devices such as biosensors, multilabel plate reader, Raman spectra, Fourier transform IR spectra, near-IR spectra and flow cytometry, have been developed and can be used for implementing the HTS methods (Zeng et al. 2020). For example, based on the spectra devices, the HTS methods for detecting a wide range of chemicals can be established, while with the introduction of flow cytometry, an HTS method can quickly analyze multiple traits of cell and rapidly classify target groups in multiple ways.

Therefore, with the development of automatic devices and rapid assay methods, the HTS method will be updated accordingly.

Conclusions

GS is a practical and effective technology for rapid improvement in microbial strains, especially for these strains with unclear genetic background. On the side, GS also offers many high-performance fusants as novel resources of rational manipulation, building an effective bridge between traditional recombination and rational manipulation. Predictably, with the development and combination of GS, it will play a much more important role in the improvement of industrial microbial strains.