1 Introduction

Mycobacterium species, including the causative agents of tuberculosis (TB) and leprosy, possess enzymes that metabolize myo-inositol derivatives rarely found in bacteria [13]. In these organisms, myo-inositol is utilized as a precursor for the production of phosphatidyl-myo-inositol (PI) and mycothiol (MSH), which carry out important cellular functions [2, 3]. PI is further elaborated to phosphatidyl-myo-inositol mannosides (PIMs), lipomannan (LM), and lipoarabinomannan (LAM), which are unique glycolipids found in the mycobacterial cell envelope [2, 3]. PIMs and LAM play important roles in the interference of phagosome maturation during TB infection, resulting in adaptation to the host and a decreased inflammatory immune response [4]. MSH is the primary reducing agent in mycobacterial species, and is also involved in drug detoxification and TB persistence [5, 6]. Consequently, enzymes involved in the metabolism of inositol derivatives are considered attractive drug targets for the treatment of TB.

Inositol found in mycobacteria can be obtained from the environment using an active inositol-transport system or synthesized de novo through a conserved pathway involving two enzymatic reactions [2, 3, 7, 8]. Myo-inositol-1-phosphate synthase (MIPS) converts glucose-6-phosphate (Glc-6P) to myo-inositol-1-phosphate (Ins-1P), which is then further hydrolyzed by inositol monophosphatase (IMP) to generate myo-inositol (Fig. 1). The Mycobacterium tuberculosis MIPS enzyme (MtMIPS) is encoded for by the Rv0046c (ino1) gene [9, 10]. MIPS enzymes are clustered into two distinct phylogenetic branches, one containing the smaller bacterial and archaeal enzymes (~40 kDa) and another containing the larger (~60 kDa) eukaryotic orthologs [11]. Sequence analyses suggest that the ino1 gene encoding for mycobacterial MIPS was recruited from archaea [12]. Inhibition of mRNA expression of MtMIPS results in enhanced susceptibility to antibiotics demonstrating the importance of MtMIPS function [13]. Additionally, a M. tuberculosis mutant lacking a functional ino1 can only be isolated when the growth medium is supplemented with exogenous inositol (77 mM), indicating that MtMIPS is an essential enzyme for inositol synthesis [8]. Rapid killing of ino1 mutants may result from reduced levels of MSH that render the bacteria susceptible to oxidative stress in macrophages [8]. The virulence of M. tuberculosis ino1 mutants is severely attenuated such that they are incapable of causing disease in a SCID mouse model [14, 15].

Fig. 1
figure 1

Inositol biosynthetic and metabolic pathways

The crystal structure of MIPS from M. tuberculosis containing a bound NAD+ and Zn2+ has been solved (Table 1) [16]. The MtMIPS monomer (Fig. 2a; PDB 1GR0) contains a Rossmann fold domain that is responsible for binding NAD+, as well as a tetramerization domain comprised primarily of β-sheets. The assembled tetramer is shown in Fig. 2b. The presence of a bound a zinc ion in the active site lead to questions regarding whether the mycobacterial enzyme functions through a type III aldolase mechanism requiring monovalent (i.e., ammonium ions) like eukaryotic MIPS enzymes or a type II aldolase mechanism that involves a divalent cation (e.g., zinc or manganese) as observed for archaeal MIPS enzymes [2, 1721]. Both mechanisms require (catalytic) NAD+. Initially, a hydride is transferred from Glc-6-P to NAD+ to generate the keto-Glc-6P (KPG) intermediate and NADH, and in a later step the hydride is transferred back from NADH to an inosose intermediate to generate Ins-1P and NAD+ [22, 23]. In order address these mechanistic questions, we sought to recombinantly express and purify sufficient quantities of MtMIPS that would allow for detailed biochemical studies of this important enzyme.

Table 1 MIPS crystal structures referenced in these studies
Fig. 2
figure 2

Crystal structure of M. tuberculosis MIPS (PDB 1GR0) containing bound NAD+ and Zn2+. a Monomer; b Assembled tetramer

Herein we report on the cloning and recombinant expression of MtMIPS and MsMIPS in E. coli and M. smegmatis. We constructed the mycobacterial expression vector pHALOsmg (Fig. 3) that is derived from the E. coli-Mycobacterium shuttle plasmid pYUB1049 [24] and Flexi® vector pFN18K to allow for recombinant expression in Mycobacterium smegmatis mc24517. Recombinant proteins produced in this study are linked to the N-terminal affinity tags (i.e., His8-tag, Halo-tag, and His6-Halo-tag) via a TEV-protease site, which allows for rapid purification and removal of the tag following purification. Recombinant expression of mycobacterial MIPS in E. coli yields high quantities of soluble purified protein with no detectable MIPS activity using the Glc-6P substrate. In contrast, recombinant expression of mycobacterial MIPS in M. smegmatis yields lower quantities of soluble purified protein with measurable MIPS activity using the Glc-6P substrate. Available structural data suggest that the lack of activity observed for the E. coli expressed protein may be attributed to incorrect folding of the protein as a helix containing two critical (conserved) lysine residues is not observed in the MtMIPS crystal structure. Results from these experiments suggest that detailed mechanistic studies on MtMIPS may be feasible following optimization of the recombinant expression protocol in M. smegmatis.

Fig. 3
figure 3

Arrangement of genetic elements in the pHALOsmg/MIPS expression vector

2 Materials and Methods

2.1 Materials

All solutions were prepared using milliQ water. M. smegmatis and Mycobacterium bovis BCG genomic DNA were purchased from ATCC. Primers were purchased from Integrated DNA Technologies. Restriction enzymes were purchased from Promega and New England Biolabs. DNA sequencing was performed at the Virginia Bioinformatics Institute DNA Sequencing Facility (Virginia Tech). Plasmids and PCR products were purified using the Wizard Plus SV Minipreps DNA Purification System and Wizard SV Gel and PCR Clean-up kits (Promega), respectively. All chemicals were purchased from Gold Biotechnology, Sigma-Aldrich, and ThermoFisher Scientific except where noted. IMP from bovine brain was purchased from Sigma. d-myo-inositol-3-phosphate sodium salt (equivalent to l-myo-inositol-1-phosphate) was purchased from Cayman Chemical. 2-amino-6-mercapto-7-methylpurine riboside (MESG), purine nucleoside phosphorylase (PNP) and phosphate standard for the IMP-coupled assay were purchased from Invitrogen (EnzChek Phosphate Assay Kit). Phosphate standard (HPLC grade, 1000 mg/L) for the periodate assay was purchased from Dionex. Absorbance and fluorescence measurements were made using a SpectraMax 5Me plate reader. All protein assays were carried out in 96-well plates (UV or fluorescence, ½ area, Corning).

2.2 Cloning

The open reading frame (ORF) encoding MIPS from M. bovis BCG (Rv0046c) and M. smegmatis (MSMEG_6904) were cloned from genomic DNA into the E. coli expression vectors pVP55A (N-terminal His8-TEV-tag) [25] and pFN18K (N-terminal Halo-TEV-tag, Promega) [26] using Flexi® technology (Promega) to generate pVP55A/MIPS and pFN18K/MIPS as previously described for MshB (see Supplemental Material) [27]. The purified recombinant ino1 proteins from M. bovis BCG and M. smegmatis are termed MtMIPS and MsMIPS, respectively. To transfer the cloned genes into a mycobacterial expression system, the primer pair of Halo_1 (CATGCCATGGCAGAAATCGGTACTGGCTTTCCAT) and Halo_2 (CGATAAGCTTGGTACCGAGCCCGAATTCGTTTAAAC) was used to amplify the ORF of Halo-TEV-MIPS from the pFN18K/MIPS vector. The resulting PCR fragment contains NcoI and HindIII restriction sites at 5′ and 3′ ends, respectively. The NcoI/HindIII digested PCR product was cloned to the E. coliMycobacterium shuttle vector pYUB1049 (provided by Dr. Jacobs, Albert Einstein College of Medicine) [24] to produce pHALOsmg/MIPS (Fig. 3). The resulting expression vector pHALOsmg contains a replication origin of E. coli (ori E) and M. smegmatis (ori M), a copy of the hygromycin resistance gene, the inducible lac operon, and the T7 promoter (from pYUB1049), and yields a recombinant protein with a N-terminal His6-HaloTag that can be removed following cleavage with TEV protease (from pFN18K). Since the PmeI and SgfI restriction sites were cloned to the pYUB1049 plasmid along with the MsMIPS ORF, the MIPS ORF from M. bovis BCG was cloned to pHALOsmg using Flexi® technology.

2.3 Recombinant Expression of MIPS

Recombinant expression and purification of the target proteins was carried out as previously reported for MshB (see Supplemental Material) [27, 28]. For recombinant expression in M. smegmatis, electrocompetent M. smegmatis mc24517 (provided by Dr. Jacobs, Albert Einstein College of Medicine) cells were transformed with pHALOsmg/MIPS expression vectors by electroporation [29]. Briefly, electrocompetent M. smegmatis mc24517 cells (40 μL) were added to a 2 cm cuvette containing pHALOsmg/MIPS (1 μL) and 10 % glycerol (260 μL). Electroporation using a Gene Pulser (BioRad) electroporation system was performed using the following parameters of: R = 1000 Ω, Q = 25 μF, and V = 2.5 kV. After one pulse, cells were transferred into 1 mL of 7H9/ADC/Tween and allowed to recover at 37 °C for 3 h. Positive transformants were selected from 7H10 agar plates supplemented with kanamycin (50 μg/mL) and hygromycin (50 μg/mL) after 3–4 days of growth at 37 °C. A single colony of the transformed cells was used to inoculate 5–10 mL 7H9/ADC/Tween and the cells were grown with shaking (250 rpm) at 37 °C for 36 h. For small-scale feasibility studies, the starter culture (4 mL) was used to inoculate LBT (100 mL). The cells were grown with shaking (250 rpm) until an OD600 of ~0.6 was reached. Protein expression was induced by the addition of 0.6 mM IPTG, and the cells were continued to be shaken at 37 °C for an additional 3 days. For large-scale expression, the starter culture (1 mL) was added to 1 L ZYP-5052 auto-induction medium and cells were allowed to grow for an additional 3 days at 37 °C with shaking (250 rpm) [24, 29, 30]. Cells were harvested by centrifugation and resuspended in either Buffer A (30 mM HEPES, 150 mM NaCl, 0.5 mM imidazole, pH 7.5) for IMAC purification (large-scale expression) or in Buffer B (30 mM HEPES, 150 mM NaCl, and 1 mM TCEP, pH 7.5) for HaloLink™ purification (small-scale preparation). Resuspended cells were lysed using an Emulsiflex-C3 high-pressure homogenizer (Avestin) and the resulting cell lysates were clarified by centrifugation (18,000 rpm, 4 °C). Clarified cell lysates were loaded onto pre-equilibrated affinity columns and purified as previously described for MshB (see Supplemental Materials) [27].

2.4 Gel Filtration

The solution molecular weight of purified MIPS proteins were determined using size exclusion chromatography with a Superdex 200 10/300 GL column pre-equilibrated with 50 mM sodium phosphate and 150 mM NaCl (pH 7.5) (see Supplemental Materials) [31]. Elution volumes were used to calculate K av values [K av  = (V e  − V 0)/(V t  − V 0), where V 0 is the void volume of the column, V t is the total volume of the column, and V e is the elution volume of the protein]. Standard curves were prepared with the following protein standards (GE Healthcare): aprotinin, 6.5 kDa; ovalbumin, 44 kDa; and conalbumin, 75 kDa.

2.5 MIPS Enzyme Activity

MIPS activity, specifically the conversion of Glc-6P to Ins-1P, was measured using the periodate (stopped-point) and IMP-coupled (continuous) assays as previously described (see Supplemental Materials) [32, 33]. Briefly, the periodate assay is a stopped-point assay that uses NaIO4 to oxidize and chemically release phosphate from the Ins-1P product and not the Glc-6P substrate. Assay mixtures (100 mM Tris–acetate, 2 mM DTT pH 7.5; 20 mM NH4Cl; 1–18 µM MIPS; 300 µM NAD+) were pre-incubated at 30 °C, and the reactions were initiated with the addition of Glc-6-P (0–40 mM). Reactions aliquots were quenched at various time points by the addition of 20 % trichloroacetic acid and the phosphate group on Ins-1P was released with the addition of 0.2 M NaIO4. The released phosphate group was detected following reaction with ammonium molybdate/ascorbic acid by monitoring the increase in absorbance at 820 nm. In the IMP-coupled assay, the conversion of Glc-6P to Ins-1P is continuously measured by sensing phosphate formation following enzymatic cleavage of Ins-1P using IMP (Fig. 1). Inorganic phosphate released by IMP was detected following reaction with 2-amino-6-mercapto-7-methylpurine riboside (MESG) and purine nucleoside phosphorylase (PNP), which leads to an increase in absorbance at 360 nm. Assay mixtures (50 mM Tris, 1 mM MgCl2, pH 7.5; containing 20 mM NH4Cl; 300 µM NAD+; 0.6 mg IMP; 0.2 mM MESG; 4 U/mL PNP; 0.5–18 µM MIPS) were pre-incubated at 30 °C, and the reactions were initiated with the addition of Glc-6P (0–40 mM). Phosphate production was monitored by measuring the absorbance at 360 nm at various time points, and the rate of phosphate production (μM min−1) was calculated from a phosphate standard curve. Relative rates of phosphate production correspond to rate of phosphate production (μM min−1) per 1 μM MIPS.

The affinity of NAD+ for MIPS was measured by monitoring the decrease of intrinsic MIPS fluorescence intensity upon NAD+ binding [17, 19]. In these experiments, NAD+ (0–1 mM) was added to a solution of MIPS (40 μM) in buffer (100 mM HEPES, 400 μM EDTA, pH 7.5; total volume 100 μL). The fluorescence signal of the enzyme mixture (Excitation wavelength = 280 nm, Emission wavelength = 334 nm) was recorded following incubation at room temperature for 5 min in the dark. The decrease in the intrinsic fluorescence of MIPS upon the addition of NAD+ was plotted as ΔI fl  = I 0  − I (ΔI fl , decrease in MIPS intrinsic fluorescence; I 0 , fluorescence of MIPS before adding NAD+; I, fluorescence of MIPS after adding NAD+). The apparent K D of MIPS for NAD+ was obtained by fitting a binding isotherm equation to the resulting data (Eq. 1). Binding of NAD+ to MIPS from Archaeoglobus fulgidus and Arabadopsis thaliana using this method are included in Supplemental Material (Figure S2).

$$\Delta Fl =\Delta Fl_{\hbox{max} } *\frac{{\left[ {NAD^{ + } } \right]_{Total} }}{{\left( {K_{D} + \left[ {NAD^{ + } } \right]_{Total} } \right)}} + \Delta Fl_{initial}$$
(1)

2.6 Computational Studies

Sequence alignment was carried out using the Align feature in ExPASy [34]. Three-dimensional homology models of MIPS from M. tuberculosis and M. smegmatis were generated using the Phyre 2 protein fold recognition server [35]. Structural alignment of the generated models with MIPS from M. tuberculosis (PDB 1GR0), A. fulgidus (PDB 3QVT) and S. cerevisiae (PDB 1P1J) were carried out using the MatchMaker program in the UCSF Chimera package [36, 37]. For clarity, following alignment figures were generated for the MIPS model, MtMIPS, AfMIPS, and ScMIPS for each orientation by displaying one protein ribbon at a time with KPG and NADH ligands. All molecular graphics images were created using UCSF Chimera [36, 37].

3 Results

3.1 Recombinant Expression of MIPS in E. coli Yields an Inactive Protein

Recombinant MsMIPS and MtMIPS proteins were successfully expressed in E. coli and purified using a simple His-tag (yield: ~60 mg/L, Supplementary Material). Purified MIPS proteins were assayed for the ability to convert Glc-6P to Ins-1P using two different methods—a periodate (stopped-point) assay and an IMP-coupled (continuous) assay [32, 33, 38]. Two different assays were used for measuring MIPS activity since each has limitations—the periodate assay is labor intensive, but tolerates additives such as divalent metal ions, while the IMP-coupled assay is much less labor intensive, but is significantly affected by the presence of divalent metal ions [32]. Unfortunately, the purified MsMIPS and MtMIPS enzymes recombinantly expressed in E. coli exhibited no detectable activity with either assay even in the presence of various additives (0–50 mM ZnSO4 (ZnCl2, Zn-acetate), 0–50 mM MnCl2, 0–50 mM CoCl2, 0–1 mM EDTA, 0–40 mM NH4Cl, or 0–20 % glycerol). Additionally, the purified proteins did not have any measurable activity using a NADH production assay [19]. Therefore, we attempted to obtain active MIPS using the following approaches: (1) adding a DEAE column to the purification protocol (Supplemental Material Figure S1) prior to the initial IMAC column to remove potential contaminants or proteases that affect MIPS activity, (2) using phosphate-buffered saline and/or glycerol (10 %) containing buffers during the purification protocol to enhance protein stability, and (3) expressing the protein as a Halo-tagged protein and purifying it on HaloLink™ resin to circumvent possible complications from improperly folded proteins or the presence of metal ions (Ni2+). While (soluble) purified MIPS enzyme was obtained using all of these approaches, none yielded a protein with the ability to convert Glc-6P to Ins-1P.

The observed lack of activity with the recombinant MIPS proteins was unexpected given that the crystal structure of MtMIPS was solved using a His-tagged protein recombinantly expressed in E. coli [16]. Since a bound NAD+ is observed in the MtMIPS structure (Fig. 2), we probed whether the purified MIPS proteins were capable of binding NAD+ using an assay that monitors the decrease in the intrinsic fluorescence of MIPS upon NAD+ binding [17]. Following excitation at 280 nm, the fluorescence spectrum of MtMIPS exhibits a maximum emission peak at 334 nm. The intensity of this intrinsic fluorescence, but not the maximum emission, decreases upon the addition of NAD+ (300 μM) with an apparent K NAD+ D of 36 ± 4 µM (Fig. 4). Subsequent addition of Glc-6P (50 mM) to the MtMIPS/NAD+ mixture results in a slight increase in the fluorescence intensity. While the apparent K D value indicates that NAD+ does bind to the mycobacterial MIPS, it is weaker than the apparent K D values for NAD+ binding to MIPS from A. fulgidus (1 µM) [19] and A. thaliana (~0.2 µM, Supplementary Material).

Fig. 4
figure 4

NAD+ Binding to MsMIPS. NAD+ (0–0.1 mM) binding to MsMIPS (40 μM) was determined at pH 7.5 by monitoring the decrease in intrinsic fluorescence (Ex. 280 nm, Em. 334 nm) as described in Sect. 2. The apparent K NAD+D value was determined to be 36 ± 4 μM using this approach

3.2 Recombinant Expression of MIPS in M. smegmatis Yields a Functional Protein

Recombinant expression of M. tuberculosis proteins in E. coli often results in improperly folded proteins, which can be circumvented by recombinant expression of the proteins using a M. smegmatis host [24, 39]. Recombinant expression in M. smegmatis offers the following advantages over E. coli for the expression of mycobacterial proteins: the presence of mycobacterial chaperones (differ from E. coli chaperones) to ensure proper folding, ability to carry out post-translational modification of mycobacterial proteins, and presence of ligands not present in E. coli [39]. To probe whether the lack of activity we observed with the recombinant MtMIPS and MsMIPS proteins could be attributed to expression in an E. coli host, we recombinantly expressed the MIPS proteins using a M. smegmatis expression system. To this end, we constructed the pHALOsmg/MIPS expression vectors (Fig. 3) that yield recombinant proteins with N-terminal His6-HaloTags that can be removed after purification following cleavage with TEV protease.

Small-scale expression (100 mL) of the proteins in M. smegmatis was used to test feasibility of this approach using purification with the HaloLink™ resin (Supplemental Material Figure S5). In this approach, the Halo-tag on the Halo-MIPS constructs forms a covalent bond to the HaloLink™ resin, and the MIPS is “eluted” by cleavage of Halo-MIPS using TEV protease. This approach yields highly purified protein in a single step, likely due to the fact that the tag forms a covalent bond to the resin. The purified MIPS proteins were assayed for the ability to convert Glc-6P to Ins-1P using the periodate (stopped-point) assay [32, 33, 38]. To control for the possibility of the HaloLink resin purifying a contaminant protein from the M. smegmatis cell lysate that is capable of phosphate production under the assay conditions used, we also measured phosphate production for MshB deacetylase expressed in M. smegmatis and purified using the HaloLink™ resin as a control. The MtMIPS and MsMIPS expressed in M. smegmatis show significant activities (~0.1–0.3 µM/min) with 2 mM Glc-6P compared to MIPS expressed from E. coli (<0.01 µM/min) using the periodate assay (Fig. 5a). These experiments were done in the presence of zinc (50 µM) since a bound zinc ion is bound to MtMIPS in the crystal structure [16]. No significant activity is observed with the MshB control protein (Fig. 5a, Lane 5) indicating that the observed phosphate production can be attributed to the MIPS enzyme. The relative rate of ~0.3 µM/min corresponds to a specific activity of ~1.5 × 10−3 µmol/min/mg for MtMIPS, which is somewhat lower than the specific activities for MIPS from other organisms characterized to date, such as A. fulgidus (11.8 µmol/min/mg [17]), Arabidopsis thaliana (~0.1 µmol/min/mg [33]), Saccharomyces cerevisiae (0.41 µmol/min/mg [40, 41]), and Synechocystis sp. (0.02 µmol/min/mg [40]).

Fig. 5
figure 5

MIPS activity. a Relative activity of mycobacterial MIPS from small-scale expression in M. smegmatis using the periodate assay. Rates reflect the phosphate production catalyzed by 1 μM MIPS (in assay buffer containing: 2 mM Glc-6P, 300 μM NAD+, 20 mM NH4Cl, 50 μM ZnSO4) as described in Sect. 2. (1) MtMIPS (no NH4Cl); (2) MtMIPS; (3) MsMIPS (no NH4Cl), (4) MsMIPS; (5) MshB (control); and (6) MsMIPS (expressed in E. coli). b Relative activity of mycobacterial MIPS from large-scale expression in M. smegmatis using the periodate (gray bars) and IMP-coupled (white bars) assays. Rates reflect the phosphate production catalyzed by 1 μM MIPS with (300 μM NAD+, 20 mM NH4Cl, no ZnSO4) as described in “Supporting Materials”

In light of these promising results, large-scale expression (1 L) of MtMIPS and MsMIPS in M. smegmatis was carried out and the resulting proteins were purified using the His-tag (IMAC). The activity of purified MtMIPS from the large-scale expression with 2 mM Glc-6P using the periodate assay (no zinc) is comparable to the rate observed for the MtMIPS from the small-scale protein expression that was purified using the Halo-tag in the presence of zinc (50 µM, Fig. 5b). The activity is increased at higher Glc-6P concentrations (20 mM) yielding specific activities that are closer to values reported for other MIPS enzymes (S. cerevisiae and Synechocystis sp.). MIPS activity was also measured using the IMP-coupled assay (Fig. 5b). The relative activities using the IMP-coupled assay are ~twofold higher than the activities measured using the periodate assay.

3.3 Solution Molecular Weight

The majority of MIPS proteins from different species are known to be tetramers in solution, including A. fulgidus and S. cerevisiae, [16, 17, 20, 21, 4244] while MIPS from Neurospora crassa is a hexamer [18] and MIPS from various plant sources [45] and rat testis [46] are trimeric proteins. The solution molecular weight of recombinant MIPS expressed in E. coli and M. smegmatis were determined using size exclusion chromatography (Supplemental Materials Figure S3) [31]. Results from these experiments indicate that the MIPS purified from E. coli has a solution molecular weight of 140 kDa, or ~3.5 monomers, suggesting that some of the MIPS is present as a trimer in solution. MIPS purified from M. smegmatis had two peaks in the elution profile. One peak corresponds to a molecular weight of ~41 kDa, while the second corresponds to a relatively broad peak spanning the range of 248–283 kDa. The protein with the observed molecular weight of ~41 kDa is likely the monomeric Halo-tag protein (34 kDa) [26], and not the MtMIPS monomer, since there are two distinct bands visible on the SDS-PAGE following IMAC purification and the lower band is not observed when the Halo-MIPS protein is purified using the Halo-Link resin that covalently binds to the Halo-tag protein (Supplemental Figure S5). The broad peak spanning the range of 248–283 kDa suggests that the MtMIPS purified following recombinant expression in M. smegmatis is a tetramer–hexamer.

3.4 Structural Insights into MIPS Activity

We examined available structures of MIPS enzymes to probe whether they could offer insights into the molecular basis of our findings (select structures summarized in Table 1). Studies examining the catalytic mechanism of archeal/bacterial MIPS (Note: the chemical proposed mechanism for eukaryotic MIPS enzymes is distinct from the archeal/bacterial enzymes [21]) have identified residues (AfMIPS numbering) Asp225, Lys274, Lys278, Lys306, and Lys367 as important for proton transfer reactions in the chemical mechanism [19, 21, 47]. Additional residues that may play roles in catalysis are: Asp332 (metal ligand), Asn 255 (active site clustering/crowding), Leu257 (active site clustering/crowding), and Asp261 (metal ligand) [19, 47]. Sequence alignment of the mycobacterial proteins with AfMIPS and ScMIPS (Supplemental Material Figure S4) indicate that these catalytic residues are conserved in the mycobacterial MIPS proteins with the exception of Leu257 (Gly in MtMIPS and MsMIPS). However, residues 242–267 (Af numbering residues 268–293) are not observed in the MtMIPS structure suggesting that this helix is not formed in the protein fold captured in the crystal structure. In structures of MIPS enzymes from other organisms, this helix (residues 242–267) bridges the Rossmann fold domain and tetramerization domain. The AfMIPS tetramer highlighting the location of this helix is shown in Supplemental Material Figure S5. This finding is important since this helix includes Lys248 and Lys 252 (Af numbering: Lys274 and Lys278), which are critical for substrate binding and catalysis [19, 47]. Therefore, we prepared homology models of MtMIPS and MsMIPS using the Phyre 2 server and the top hits from this analysis are shown in Table 2.

Table 2 Homology model predictions using Phyre 2

To gain insights into ligand binding, the MtMIPS homology model (c1u1iC, template = AfMIPS) was structurally aligned with AfMIPS (PDB 3QVT), MtMIPS (PDB 1GR0), and ScMIPS (PDB 1P1J). The bound NADH and 5-keto-Glc-6P intermediate (KPG) ligands from the AfMIPS structure are displayed in all figures to highlight the locations of the cofactor and substrate binding sites. There is little difference in the binding orientation of the NADH/NAD+ in the overlayed structures (Supplemental Material). Figures 6a, c depict the MtMIPS homology model with residues 242–267 (not observed in MtMIPS crystal structure) highlighted in pink. Comparison to the MtMIPS crystal structure (PDB 1GR0) in Fig. 6c, d reveals the importance of residues 242–267 in forming the binding site for the KPG ligand. Figures of AfMIPS and ScMIPS in similar orientations are shown in Supplemental Material.

Fig. 6
figure 6

MIPS with bound KPG (green) and NADH (cyan) from PDB 3QVT. a MtMIPS model in orientation 1 with helix containing residues 242–267 highlighted in pink. b MtMIPS (PDB 1GR0) in orientation 1. c MtMIPS model in orientation 2 with helix containing residues 242–267 highlighted in pink d MtMIPS (PDB 1GR0) in orientation 2 (Color figure online)

4 Discussion

Recombinant MIPS expressed E. coli is capable of binding NAD+ with a K D of 36 ± 4 µM, but cannot turnover the Glc-6P substrate. Although these findings were initially unexpected, they are consistent with previously reported studies. Specifically, structural analysis reveals that a key helix comprised of residues 242–267 is missing in the MtMIPS crystal structure. This finding is important because this helix contains K248 and K252 (equivalent to AfIMPS K274 and K278), which are proposed to be important mechanistically for the proton transfer reactions in the conversion of Glc-6P to Ins-1P [47]. Additionally, mutagenesis studies on these side chains indicate that K274A has weakened/no NAD+ binding and no ability to produce Ins-1P or NADH, while K278A has weakened NAD+ (K D  = 70 ± 21 µM) and Glc-6P binding and no ability to produce Ins-1P or NADH [19]. Consequently, one possible explanation for the weak NAD+ binding and the lack of observed activity with the Glc-6P substrate is that the MIPS proteins expressed in E. coli, while soluble, are not folded in a catalytically competent conformation. The finding that the solution molecular weights of the recombinant MIPS proteins expressed in E. coli and M. smegmatis are different is also consistent with the explanation that variations in protein folding/assembly account for the observed differences in the in vitro activities for proteins expressed in E. coli and M. smegmatis.

Recombinant MIPS expressed M. smegmatis is a functional enzyme that is capable of catalyzing the conversion of Glc-6P to Ins-1P. As discussed above, the observed activity for the protein expressed in M. smegmatis is attributed to folding of the enzyme in a catalytically competent conformation. The specific activity reported here for the mycobacterial enzyme is lower than that for the MIPS from A. fulgidus, but comparable to MIPS from other organisms (S. cerevisiae and Synechocystis sp.). This result is also consistent with previous studies. The catalytically important Leu257 (Af numbering) is a glycine in Mt and MsMIPS (Supplemental Materials). The L257A mutation results in a loss of Glc-6P binding, weakened NAD+ binding (K D  = 6.5 ± 3.4 µM), and an inability to produce Ins-1P/NADH [19]. Consequently, it is not surprising that the mycobacterial enzymes would have reduced activity compared to AfMIPS. Finally, the observed activity of MtMIPS with the IMP-coupled assay is higher than the activity with the periodate assay. Since the activity of AtMIPS is the same using both assays [32], these results may suggest that MtMIPS is stimulated by the presence of the MgCl2 in the IMP-coupled assay buffer or that there are contaminating metal ions from the IMAC column in the purified MIPS that are stimulating enzyme activity.

5 Conclusions

Herein we demonstrate the feasibility of the recombinant expression of MtMIPS and MsMIPS in M. smegmatis. Recombinant proteins expressed in M. smegmatis are functional, while those expressed in E. coli lack measurable activity with the Glc-6P substrate. Structural analysis suggests that MIPS proteins expressed in E. coli are not folded in a catalytically competent conformation. Therefore, it is likely that expression in the mycobacterial host enables the protein to fold in a catalytic conformation. Optimization of the MIPS expression and purification protocol in M. smegmatis will allow for detailed biochemical studies to probe the catalytic mechanism and molecular recognition properties of this important enzyme.