Introduction

3,4-Dihydroxybutyric acid (3,4-DHBA) is an important C4 compound containing hydroxyl and carboxyl groups that can be modified to produce antibiotics (Choi et al. 2015), α- and β-amino acids and peptides (Sang et al. 2010). 3-Hydroxy-γ-butyrolactone (3HBL), the lactone of 3,4-DHBA, is also a multifunctional chiral building block for various chiral drugs, including the antilipemic agent atorvastatin (Brower et al. 1992), neurotransmitter l-carnitine (Tetrahedron 1990), HIV protease inhibitor Amprenavir (Kim E E 1995), dermatological medicine 12-HETE (Corey E J 1978), and anti-cancer drug aplysistatin (Shieh H M 1982).

Currently, 3,4-DHBA is mainly produced using an H2O2-catalyzed reaction between glucose and alkali metal hydroxides conducted at 70 °C for 24 h (Hollingsworth 1994), or from the hydrocyanation and hydrolyzation of R-3-chloro-1,2-propanediol (Inoue et al. 1991). However, these approaches usually generate low yields, in addition to involving harsh reaction conditions, complex product purification processes, high costs, and environmental pollution. Therefore, the biosynthesis of 3,4-DHBA from renewable sources has attracted considerable attention. For instance, efforts have been made to produce 3,4-DHBA by microbial fermentation.

There have been no reports of a natural pathway for 3,4-DHBA production. The first de novo biosynthesis of 3,4-DHBA, developed in E. coli, involved a six-step enzymatic reaction using glucose, and acyl-CoA as a substrate and acyl donor (Liu et al. 2009; Schweiger and Buckel 1984; Taguchi et al. 2008). After optimizations by Dhamankar et al., a yield of 0.7 g/L 3,4-DHBA (0.105 g 3,4-DHBA/g glucose) was achieved (Dhamankar et al. 2014). However, the numerous reaction steps, low catalytic efficiency, and low yield make this pathway unsuitable for industrial production.

d-xylose has a high proportion of lignocellulose, which is as the most abundant renewable biomass in energy production found in nature (Choi et al. 2014; Kawaguchi et al. 2016). A novel five-step biosynthetic pathway for the generation of 3,4-DHBA from d-xylose has been constructed in E. coli (Wang et al. 2017), in which xylose is catalyzed to 3,4-DHBA by double dehydrogenation followed by oxidation, dehydration and decarboxylation. In shake flask experiments, the titer of 3,4-DHBA has achieved 1.27 g/L (0.079 g 3,4-DHBA/g d-xylose), with 0.18 g/L (0.012 g 1,2,4-butanetriol/g d-xylose) of by-product of 1,2,4-butanetriol (BTO). This yield has remained low because of the lack of highly active enzymes to efficiently decarboxylate 2-keto-3-deoxy-d-xylonate and dehydrogenate 3,4-DHB, and because intermediate products are consumed by competing pathways.

In this study, a four-step biosynthetic pathway using d-xylose as a precursor was developed to produce 3,4-DHBA (Fig. 1). First, several aldehyde dehydrogenases (ALDHs) were screened and characterized to identify potential catalysts suitable for maximizing 3,4-DHBA accumulation. Second, competing pathways were disrupted by deleting xylA, ghrA, ghrB and adhP. Finally, a fusion construct containing PpMdlC and YagF was created to improve the titer of 3,4-DHBA. This study provided novel strategies for achieving a high titer and yield of 3,4-DHBA.

Fig. 1
figure 1

The pathway from xylose to produce 3,4-DHBA in E. coli. Enzymes: BsGDH: glucose dehydrogenase; YagF: d-xylonate dehydratase; PpMdlC: decarboxylase; ALDH: aldehyde dehydrogenase. Genes: xylA encoding xylose isomerase; ghrA, ghrB encoding glyoxylate reductases, respectively; adhP encoding ethanol dehydrogenase

Methods and materials

Strains, plasmids and culture conditions

The E. coli BL21(DE3) was used for expressing, purifying target proteins and producing 3,4-DHBA. Strains used in this study were showed in Table 1. The plasmids pET28a and pACYC184 were used for cloning and expressing the target genes. E. coli cells were cultured at 37 °C in lysogeny broth medium added with appropriate concentration antibiotics. And plasmids in this study were showed in Table S1.

Table 1 E. coli strains used in this study

Construction of plasmids

The primers sequences used are listed in Table S2. pE01 harboring yagF from E. coli and PpmdlC from P. putida was constructed. Similarity, pE04, pE05 and pE06 carrying gdh from B. subtilis and different ALDH genes, respectively.

The gene segment of T7 promoter-PpmdlC was amplified from pET28a harboring PpmdlC. The backbone of pET28a harboring yagF with SpeI and SalI of multiple cloning site was obtained by PCR. A 5 nm rigid α-helical ER/K linker(Sivaramakrishnan and Spudich 2011) was selected to connected to these two segments to create a new plasmid which carrying a fusion construct (Fig. S1).

The disruption of competing pathways

The disruption of competing pathway was accomplished by CRISPR-Cas9. The donor DNA was connected by overlapping PCR from 500 bp upstream and downstream homologous arm. The E. coli BL21(DE3) carrying pCas vector was inoculated in 30 °C and then added into 10 mM arabinose for inducing the expression of Cas9. When OD600 reached 0.5–0.6, the E. coli cells were arranged on ice for 30 min. After washed by 10% glycerol sterile three times, it was split into 100 μL per tube. The competent cells carrying pCas were mixed with 100 ng pTargetT vector and 400 ng donor DNA to have an electro-transformation (2.5 kV, 200 Ω, 25 μF) with an electrical shock time within 5 ms. Then they were cultivated into 30 °C and confirmed by colony PCR.

E. coli cells were cultured in 30 °C with addition of 0.5 mM IPTG overnight to eliminate pTargetT series because IPTG could induce the other sgRNA, which targets one PAM site of pTargetT. The pCas vector could be cured by cultivating it in 37 °C to remove because its temperature sensitivity.

Protein expression and purification

E. coli cells were cultured at 37 °C until the OD600 was approximately 0.6–0.8, and then 0.2 mM IPTG was used to induce target proteins expression at 20 °C for 16 h. Then the cells were collected by centrifugated at 8000 rpm for 10 min, washed by 0.9% NaCl twice and resuspended in appropriate volume PB solution at pH 7.5 (20 mM Na2HPO4 and NaH2PO4). The resuspended cells were centrifuged to collect the supernatants for purification after ultrasonication. The supernatants were filtered, loaded in the pre-equilibrated (20 mM imidazole, pH 7.5) Ni-chelating column with and then washed with different concentration of imidazole (20 mM and 50 mM, pH 7.5). The bound proteins including YneI, Gox0499 and Gox1122 would be eluted by 250 mM imidazole (pH 7.5). The protein samples were then put into dialysate (20 M PB solution, 200 mM NaCl and 5% glycerol, pH 7.5) overnight, and then identified by 12% SDS-PAGE.

Enzymes activity assay

The in vitro enzyme assays of the fusion protein and tandem enzyme reaction system containing YagF and PpMdlC were performed in a 50 mM PB solution (pH 7.5) containing 20 mM d-xylonate, 10 mM MgCl2, 1 mM TPP, and 1 mM NAD+. The reaction mixture was incubated at 30 °C for 5 min to allow the accumulation of 3,4-DHB produced by the five-fold excess of YagF, PpMdlC or the YagF and PpMdlC fusion protein. The absorbance of NADH at 340 nm was measured after 1 mM YneI was added to the reaction.

The in vitro enzyme assay of NAD+-dependent YneI, Gox0499, and Gox1122 activity was performed in a 50 mM PB solution (pH 7.5) containing 20 mM D-xylonate, 10 mM MgCl2, 1 mM TPP, 1 mM NAD+, and a five-fold excess of YagF and PpMdlC. The reaction mixture was incubated at 30 °C for 5 min to allow the accumulation of 3,4-DHB, after which 1 mM YneI, 1 mM Gox0499, and 1 mM Gox1122 were added. The NADH absorbance at 340 nm was then measured.

Whole-cell catalysis for 3,4-DHBA production

Whole-cell catalysis was performed in 20 mL scale, containing 20 g/L of D-xylose, 50 mM PB solution (pH 7.5), 50 g/L of E. coli cells, 2 mM TPP, 2 mM NAD+, 10 mM Mg2+ at 30 °C. The products were sampled about 1 mL in every 12 h and analyzed via HPLC.

Analytical methods

Metabolite analysis was performed by HPLC on a Transgenomic 87H3 column, using refractive-index detection (RID). The mobile phase was 0.08 N H2SO4 with a flow rate of 0.38 mL/min. The temperatures of the RID detector and column were 35 °C. HPLC–MS was used to distinguishing 3,4-DHBA from the supernatants of reaction (Fig. S2). 3,4-DHBA (C4H8O4) was corresponded to the peak of 119.03 Da under the negative ion mode.

Results and discussion

Selection of enzymes for 3,4-DHBA dehydrogenation

The biosynthetic pathway for producing 3,4-DHBA from d-xylose comprises four steps. First, BsGDH from B. subtilis catalyzes d-xylose to produce d-xylonate (Li et al. 2017). Second, YagF from E. coli converts d-xylonate to 2-keto-3-deoxy- d-xylonate. Third, PpMdlC from P. putida produces 3,4-dihydroxybutanal (3,4-DHB) from 2-keto-3-deoxy-d-xylonate(Sun et al. 2016).

Finally, YneI from E. coli is used to convert 3.4-DHB to 3,4-DHBA(Wang et al. 2017). However, YneI generates 3,4-DHBA and BTO as byproducts. To obtain highly active oxidases to minimize the production of BTO, we identified two NAD+-dependent ALDHs (Gox0499 and Gox1122 from Gluconobacter oxydans) and compared their catalytic activity with that of YneI, using in vitro enzyme assays performed in cell lysates. Gox0499 had a higher specific activity (4.26 U/mg protein) than YneI (2.67 U/mg protein) and Gox1122 (2 U/mg protein).

To evaluate the catalytic activity of the enzymes in vivo, the plasmids pE01 (carrying yagF and PpmdlC), pA01 (carrying yneI and gdh), pA02 (carrying gox0499 and gdh), and pA03 (carrying gox1122 and gdh) were constructed to assemble the entire 3,4-DHBA biosynthetic pathway in E. coli BL21(DE3). Figure 2A shows the SDS-PAGE results of target proteins expressed by the strains E0-E01-A01, E0-E01-A02, and E0-E01-A03 after cultivation and target induction. The expression levels of BsGDH, YagF, and PpMdlC were the same among the three strains, but those of the ALDHs differed; Gox1122 levels were higher than those of YneI and Gox0499.

Fig. 2
figure 2

Selection of ALDHs for 3,4-DHBA production. A SDS-PAGE analysis of E0-E01-A01, E0-E01-A02 and E0-E01-A03. Lane 1: whole cell of E0-E01-A02; Lane 2: crude extract of E. coli E0-E01-A02; Lane 3: whole cell of E0-E01-A03; Lane 4: crude extract of E0-E01-A03; Lane 5: whole cell of E0-E01-A01; Lane 6: crude extract of E0-E01-A01; M: protein molecular weight marker. B Several strains were constructed to compare the titer of 3,4-DHBA. The highest titer of 3,4-DHBA (3.04 g/L; yield of 0.190 g 3,4-DHBA/g d-xylose) was produced by E0-E01-A02 containing BsGDH, YagF, PpMdlC and Gox0499, while the lowest titer of BTO was obtained, which means the Gox0499 has the highest oxidative activity and lowest reductive activity among these three ALDHs. These three reactions were performed in 20 mL scale, containing 20 g/L of d-xylose, 50 mM PB solution (pH 7.5), 2 mM TPP, 2 mM NAD+, 10 mM Mg2+ at 30 °C

Next, strains E0-E01-A01, E0-E01-A02, and E0-E01-A03 were used for whole-cell catalysis to produce 3,4-DHBA. As seen in Fig. 2B, strain E0-E01-A02 produced approximately 3.04 g/L 3,4-DHBA (0.190 g 3,4-DHBA/g d-xylose) after a 60 h whole-cell catalytic process, with 0.56 g/L of the by-product BTO. Compared with E0-E01-A01 and E0-E01-A03, E0-E01-A02 had the highest and lowest titers of 3,4-DHBA and BTO, respectively. Since Gox0499 was shown to be the most efficient enzyme for the dehydrogenation of 3,4-DHB, functioning at a lower level of protein expression, it was selected for 3,4-DHBA production.

Disruption of competing pathways to improve 3,4-DHBA titer

To improve the titer of 3,4-DHBA synthesized from d-xylose, several genes in competing pathways were deleted using CRISPR/Cas9. Xylose isomerase, encoded by xylA, catalyzes the production of d-xylulose from d-xylose. d-xylulose then enters the pentose phosphate pathway, facilitating bacterial growth and metabolism. 2-keto-3-deoxy-d-xylonate aldolases, expressed by yagE and yjhG of the Dahms pathway, always consumed the intermediate product 2-keto-3-deoxy-d-xylonate in E. coli (Valdehuesa et al. 2014). A search of the NCBI database showed that the genome of E. coli BL21(DE3) did not contain yagE or yjhG. However, ghrA and ghrB of E. coli BL21(DE3) were found to encode glyoxylate reductase, which consumes the intermediate product 2-keto-3-deoxy-d-xylonate. In addition, the ALDH encoded by adhP has a high level of reduction activity, favoring the production of the by-product BTO from 3,4-DHB (Wang et al. 2017).

After knocking out these competing genes, the xylA-deficient strain E1-E01-A02; xylA and ghrA double-deficient strain E2-E01-A02; xylA and ghrB double-deficient strain E3-E01-A02, xylA, ghrA and ghrB triple-deficient strain E4-E01-A02, and xylA, ghrA, ghrB and adhP quadruple-deficient strain E5-E01-A02 were constructed using CRISPR/Cas9. There was no significant difference in the expression of target proteins among the engineered strains (Fig. S3). This indicated that the deletion of xylA, ghrA, ghrB and adhP had no effect on the expression of target proteins in the host cell.

E1-E01-A02 produced a 56% higher 3,4-DHBA titer than E0-E01-A02 (Fig. 3). This showed that the deletion of xylA minimized the by-product formation of d-xylulose and increased the flux of the conversion of d-xylose to d-xylonate. The 3,4-DHBA titers produced by E2-E01-A02, E3-E01-A02, and E4-E01-A02 were 5.05, 5.24, and 5.38 g/L, respectively. This result indicated that the knockout of ghrA and ghrB increased the titer of 3,4-DHBA, but this effect was negligible. The 3,4-DHBA titer produced by the xylA, ghrA, ghrB and adhP quadruple-deficient strain E5-E01-A02 was 5.69 g/L, and the BTO titer decreased from 0.56 g/L to 0.33 g/L, suggesting that adhP disruption was essential for 3,4-DHBA production. E5-E01-A02 produced the highest titer, at 5.69 g/L (0.356 g 3,4-DHBA/g d-xylose), 87% higher than that produced by E0-E01-A02. We concluded that disruption increased the titer and yield of 3,4-DHBA.

Fig. 3
figure 3

Disruptions of genes of xylA, ghrA, ghrB and adhP for improving titer of 3,4-DHBA. Titer of 3,4-DHBA was produced by different engineered strains. Reaction conditions and systems are as shown in Fig. 2B

Construction of a fusion protein containing PpMdlC and YagF to increase carbon flux toward the 3,4-DHBA pathway

PpMdlC was the rate-limiting enzyme in this pathway because of its lower catalytic activity (Wang et al. 2017; Zhong et al. 2019), which causes the accumulation of intermediate products and consumption by competing pathways. Fusion proteins that serve as multifunctional enzymes can be constructed (Chen et al. 2016); previous studies have shown that enhancing the spatial proximity of enzymes by constructing fusion proteins can increase catalytic efficiency and the titer of products in multi-enzyme cascade reaction (Albertsen et al. 2011; Lu et al. 2006; Ryosuke et al. 2018). Here, a fusion construct containing PpMdlC and YagF with a 5 nm rigid α-helical ER/K motif was created, and the resulting recombinant plasmid pE02 was co-transformed into E. coli BL21(DE3) among with pA02 to generate the engineered strain E5-E02-A02.

The protein expression of PpMdlC and YagF in E5-E02-A02 was lower than that in the engineered strain E5-E01-A02, which harbored plasmids pE01 and pA02 (Fig. 4A). Meanwhile, a new protein of over 120 kDa was observed, indicating that the fusion protein expressing PpMdlC and YagF was constructed successfully. The catalytic activity of PpMdlC and YagF was compared to that of a tandem enzymatic reaction system containing PpMdlC and YagF by monitoring the change in NADH absorbance at 340 nm, with D-xylonate used as a substrate and Gox0499 used to catalyze the subsequent reaction in the cascade. The catalytic activity of the fusion protein containing PpMdlC and YagF was only 1.09 U/g wet cell, compared to 2.53 U/g wet cell produced by the tandem enzymatic reaction system containing PpMdlC and YagF.

Fig. 4
figure 4

Protein fusion of PpMdlC and YagF to improve 3,4-DHBA production. A The SDS-PAGE analysis of E5-E01-A02 and E5-E02-A02. Lane 1: whole cell of E5-E01-A02; Lane 2: crude extract of E5-E01-A02; Lane 3: precipitate of E5-E01-A02; Lane 4: whole cell of E5-E02-A02; Lane 5: crude extract of E5-E02-A02; Lane 6: precipitate of E5-E02-A02; M: protein molecular weight marker. B E5-E02-A02 was engineered to have PpMdlC and YagF fusion expression to enhance the reaction efficiency. The highest titer of 3,4-DHBA was 7.71 g/L (yield of 0.482 g 3,4-DHBA/g d-xylose) produced by E5-E02-A02, with same reaction condition and system as Fig. 2B

The 3,4-DHBA titer was further compared between the engineered strains E5-E01-A02 and E5-E02-A02. The rates of consumption of the substrate d-xylose in these two engineered strains were similar (Fig. 4B). Additionally, the 3,4-DHBA titer in E5-E02-A02 (7.71 g/L; 0.482 g 3,4-DHBA/g d-xylose) accumulated was higher and that in E5-E01-A02, and the reaction in E5-E02-A02 ended 12 h earlier. E5-E02-A02 exhibited a higher titer and yield of 3,4-DHBA than E5-E01-A02 at a lower level of PpMdlC and YagF enzymatic activity. The close spatial proximity of PpMdlC and YagF might have prevented the diffusion of the intermediate 2-keto-3-deoxy-d-xylonate, indicating that the constructed variant containing fusion enzymes effectively regulated catalytic efficiency. At the same time, there are no differences between E5-E01-A02 and E5-E02-A02 on cell growth (Fig. S4).

Conclusion

In this study, we developed a four-step enzymatic reaction pathway in E. coli to efficiently produce 3,4-DHBA from d-xylose. In addition to YagF from E. coli and PpMdlC from P. putida, used in the 3,4-DHBA biosynthetic pathway reported previously (Valdehuesa et al. 2014), BsGDH from B. subtilis was selected to catalyze the conversion of d-xylose to d-xylonate. Additionally, the ALDH Gox0499 from G. oxydans which exhibited higher activity than YneI from E. coli (Taguchi et al. 2008) was chosen to catalyze the conversion of 3,4-DHB to 3,4-DHBA, resulting in a higher titer of 3,4-DHBA and a lower titer of the by-product BTO. E. coli strains containing recombinant plasmids were modified by deleting genes in competing pathways and introducing fusion construct expressing PpMdlC and YagF. Whole cells of the engineered strain E5-E02-A02 produced 7.71 g/L 3,4-DHBA (0.482 g 3,4-DHBA/g d-xylose), the highest titer and yield reported to date.