Introduction

Bifidobacteria including Bifidobacterium bifidum, Bifidobacterium longum subsp. longum, B. longum subsp. infantis, Bifidobacterium breve, and Bifidobacterium animalis subsp. lactis are recognized as probiotics that are defined as “live microorganisms which, when administered in adequate amounts, confer a health benefit on the host.” Bifidobacteria produce both lactic acid and acetic acid from sugars via hetero lactic acid fermentation and thereby lower the intestinal pH to prevent the growth of harmful bacteria (Fukuda et al. 2011). In addition, they stimulate the intestinal immunity to enhance antiviral and antibacterial activities (Picard et al. 2005; Trebichavsky et al. 2009). Since they originally reside in the lower intestines of mammals where the sugars are highly limited, they possess various glycosidases to hydrolyze indigestible glycans, such as O-glycans of mucin produced by epithelial cells of the gastrointestinal tract. B. bifidum is one of the most frequently found bacteria in the intestines of newborn infants (Turroni et al. 2012). This bacterium also resides in intestines of adults, although the population diminishes over time (Turroni et al. 2012). We previously reported that B. bifidum possesses unique enzymes acting on the core structures in mucin O-glycans: glycoside hydrolase (GH) family 101 endo-α-N-acetylgalactosaminidase (EngBF) specific for core 1 structure, also called T-antigen (Galβ1-3GalNAcα1-Ser/Thr) (Fujita et al. 2005), and GH129 α-N-acetylgalactosaminidase (NagBb) specific for Tn-antigen (GalNAcα1-Ser/Thr) (Kiyohara et al. 2012). Furthermore, this bacterium expresses a series of glycosidases to uncover these core structures of mucin O-glycan (Katayama et al. 2004; Wada et al. 2008; Ashida et al. 2009; Miwa et al. 2010; Kiyohara et al. 2011; Sakurama et al. 2012, 2013). Whole genomic analysis also revealed that B. bifidum is highly adapted to acquire nutrients from O-glycans in mucin by producing related glycosidases (Turroni et al. 2010). The nonreducing termini of O-glycans on gastrointestinal mucin are usually covered with various glyco-epitopes, which confer resistance to digestive enzymes of the host and of general commensal bacteria. Terminal α-linked GlcNAc is one of the major and unique glyco-epitopes on gastroduodenal mucin (Fig. 1a) and is implicated as a host defense mechanism against colonization of Helicobacter pylori by inhibiting the synthesis of cholesteryl-α-glucopyranoside, an essential component of the cell membrane (Kawakubo et al. 2004). Gastroduodenal mucin flowing into the intestines becomes nutrients to some kind of commensal bacteria including bifidobacteria. To assimilate this type of mucin glycans, the first elimination of α-linked GlcNAc is essential. Therefore, we assessed the enzyme activity of α-N-acetylglucosaminidase (EC 3.2.1.50) in various bifidobacteria and found that several strains possess this enzyme. In this paper, we describe the cloning and characterization of α-N-acetylglucosaminidase from B. bifidum.

Fig. 1
figure 1

AgnB α-N-acetylglucosaminidase from B. bifidum JCM 1254. a Typical hexasaccharide structure of gastric mucin O-glycan and the action of α-N-acetylglucosaminidase. b Domain structure of AgnB from B. bifidum JCM 1254. c SDS-PAGE of the recombinant AgnB expressed in E. coli. M, protein marker; lane 1, crude extract; lane 2, affinity purified AgnB

Materials and methods

Bacterial strains and culture

The bifidobacterial strains (Table 1) were obtained from the Japan Collection of Microorganisms (JCM, RIKEN Bioresource Center, Japan). The bacteria were cultured in Gifu Anaerobic Medium (GAM) broth (Nissui Pharmaceutical, Japan) for 16 h at 37 °C under anaerobic conditions using AnaeroPack-Anaero (Mitsubishi Gas Chemical, Japan).

Table 1 α-N-Acetylglucosaminidase activity in various bifidobacterial cells

Genome sequence of B. bifidum JCM 1254

Draft sequencing of the genome of B. bifidum JCM 1254 was performed using a Genome Sequencer 20 System (Roche Applied Science, IN, USA). The details will be reported elsewhere.

Cloning and expression of α-N-acetylglucosaminidase in Escherichia coli

To construct the α-N-acetylglucosaminidase (AgnB) expression vector, a DNA fragment encoding amino acids (aa) 51-1926 (without an N-terminal signal peptide and C-terminal transmembrane region) was amplified by high-fidelity DNA polymerase (PrimeSTAR Max DNA Polymerase, Takara Bio, Japan) using genomic DNA from B. bifidum JCM 1254 as a template and the primers (AgnB-F and AgnB-R, Table S1), digested with EcoRI and XhoI, and ligated into pET23b(+). The nucleotide sequence was confirmed by sequencing. E. coli BL21(λDE3)ΔlacZ (Miwa et al. 2010) was transformed with pET23b/agnb and cultured in Luria-Bertani liquid medium containing 100 μg/mL ampicillin at 37 °C until the optical density at 600 nm reached 0.5. Then, isopropyl β-d-1-thiogalactopyranoside (IPTG) was added to the culture at a final concentration of 0.5 mM, and the culture was continued for more 3 h at 37 °C. Cells were harvested and lysed by BugBuster Protein Extraction Reagent (Novagen, Germany). After centrifugation, the supernatant was applied to a HisTrap HP column (1 mL, GE Healthcare, UK). The column was washed with 5 mM imidazole in 50 mM sodium phosphate buffer, pH 7.0, containing 250 mM NaCl, and then the adsorbed proteins were eluted by 250 mM imidazole in the same buffer. The active fraction was collected, concentrated, and desalted using an Amicon Ultra 30K (Merck Millipore, MA, USA). Protein concentration was determined by Protein Assay (Bio-Rad, CA, USA) using bovine serum albumin as a standard.

Preparation of AgnB mutants

To construct AgnBΔC (aa 51-940) expression vector, a DNA fragment was amplified using pET23b/agnb as a template and the primer set of AgnB-F and AgnB-N-R (Table S1). For AgnBΔN (aa 913-1926), the same template and the primer set of AgnB-C-F and AgnB-R were used. In both mutant construction, the DNA fragments were digested with EcoRI and XhoI and ligated into pET23b(+), resulting pET23b/agnbΔC and pET23b/agnbΔN, respectively. For AgnB(E638A) and AgnB(E638A)ΔC, PCR was performed with pET23b/agnb and pET23b/agnbΔC as templates, respectively, and the primer set of E638A-F and E638A-R (Table S1). After DpnI digestion, mutant plasmid was used to transform E. coli DH5α, and the mutation was confirmed by sequencing. The mutant proteins were expressed in E. coli BL21(λDE3)ΔlacZ and purified under the same conditions as the wild-type AgnB.

Sodium dodecyl sulphate-polyacrylamide gel electrophoresis

Sodium dodecyl sulphate-polyacrylamide gel electrophoresis (SDS-PAGE) was performed in 6 % polyacrylamide gel under reducing condition. Proteins were stained with EZ stain AQua (ATTO, Japan). Precision Plus Protein Dual Color Standards (Bio-Rad) were used as markers.

Enzyme assay

p-Methoxyphenyl (pMP)-disaccharides with GlcNAcα1-2/3/4/6Galβ were chemically synthesized as previously described (Fujita et al. 2011). Substrates were incubated with the enzyme at 37 °C for an appropriate time in 50 mM sodium acetate buffer (pH 5.5). The reaction mixture was analyzed by silica-gel TLC (Merck 5553, Germany) with 1-butanol/acetic acid/water (2:1:1, by volume) as a developing solvent and visualized with spraying diphenylamine-aniline-phosphoric acid reagent (0.1 g diphenylamine, 0.1 mL aniline, and 1 mL phosphoric acid dissolved in 10 mL acetone) followed by heating at 140 °C for 15 min. Released GlcNAc was quantified by the Morgan-Elson method after stopping the reaction by heating. In the case of 4-methylumberiferyl-α-GlcNAc (GlcNAcα1-MU, Sigma-Aldrich, MO, USA) as a substrate, the reaction was stopped by addition of 1.5 times volume of 1 M Na2CO3 (pH 10.9) and released 4-methylumberiferon was measured fluorimetrically (excitation 365 nm; emission 445 nm). For p-nitrophenyl (pNP)-glycosides (Sigma-Aldrich) as substrates, the reaction was stopped by addition of 1.5 times volume of 1 M Na2CO3 (pH 10.9) and released pNP was measured by absorbance at 405 nm.

Preparation of GlcNAcα1-4Gal from porcine gastric mucin

Recombinant GlcNAcα1-4Gal-releasing endo-β-galactosidase (GngC) from Clostridium perfringens (Ashida et al. 2001, 2002) was used for preparation of GlcNAcα1-4Gal. Porcine gastric mucin (PGM, 2.0 g, type III, Sigma-Aldrich) previously dialyzed thoroughly against water was incubated with the recombinant GngC (4.75 mg) in 20 mM sodium acetate buffer (pH 5.6). The reaction mixture was dialyzed against water and the outer solution was concentrated to 7.0 mL of 140 mM GlcNAcα1-4Gal. The concentration of the disaccharide was determined with phenol-sulfuric acid method using equal molar mixture solution of GlcNAc and Gal as the standard. The sample (100 μL) was added with equal volume of 5 % phenol aqueous solution and 500 μL concentrated H2SO4, and then absorbance at 495 nm was measured.

Dot-blot overlay assay

Either 2 μg PGM or bovine serum albumin (BSA) was blotted onto a nitrocellulose membrane (Protran BA 85, GE Healthcare, UK), followed by blocking with 10 % skim milk (Wako Pure Chemical Industries, Japan) in Tris-buffered saline (TBS), pH 7.5, containing 0.05 % Tween 20 (TBS-T). The membranes were then treated with 1 μM of each of purified AgnB mutant proteins in 2 % skim milk in TBS-T for 2 h at 25 °C. After washing the membrane with TBS-T, subsequent detection was carried out by a rabbit anti-His-probe primary antibody (1/800, MBL, Japan) and horseradish peroxidase-conjugated anti-rabbit IgG (1/5000, Santa Cruz Biotechnology, CA, USA). Visualization was carried out using 0.5 mM of 3,3′,5,5′-tetramethylbenzidine in 0.2 M citrate buffer (pH 4.0) with 0.06 % H2O2.

Results

Distribution of α-N-acetylglucosaminidase activity in bifidobacteria

To examine whether bifidobacteria possess α-N-acetylglucosaminidase, we incubated GlcNAcα1-MU with various bifidobacterial cells cultured in GAM. Among those tested, B. bifidum JCM 1254, B. bifidum JCM 7004, and Bifidobacterium scardovii JCM 12489 degraded the substrate, while the other bifidobacterial species/strains did not hydrolyze it (Table 1). Unexpectedly, B. bifidum JCM 1255, the type strain of this species, did not show the activity. The diminished ability of the strain as compared with the other B. bifidum strains to utilize host-derived glycans has already been reported (Asakuma et al. 2011). No enzyme activity was detected in the culture supernatant of any strain, suggesting that three positive strains expressed the enzyme on the cell surface or in the cytosol.

Identification of a candidate gene of α-N-acetylglucosaminidase in B. bifidum

We searched the genome of B. bifidum JCM 1254, which was previously sequenced by ourselves, and found a candidate gene encoding a putative GH89 α-N-acetylglucosaminidase. The gene, termed agnb (accession number AB986539), consists of a 5874-bp open-reading-frame and encodes a polypeptide with 1957 amino acids (aa) containing the following putative sequences/domains: an N-terminal signal sequence (aa 1-51), four carbohydrate-binding module (CBM) 32 domain (aa 60-180, aa 978-1065, aa 1105-1213, aa 1237-1361), a GH89 domain (aa 266-913), and a C-terminal transmembrane region (aa 1928-1957) (Fig. 1b). The presence of an N-terminal signal sequence and a C-terminal transmembrane region indicates that AgnB is a membrane-anchored protein with a large extracellular region that includes GH89 and CBM32 domains. This is consistent with the presence of the major α-N-acetylglucosaminidase activity on the cells but not in the culture supernatant of B. bifidum JCM 1254.

Expression of AgnB

A DNA fragment of agnb lacking the sequences encoding the putative N-terminal signal peptide (aa 1-51) and the C-terminal transmembrane region (aa 1928-1957) was amplified by high-fidelity PCR and ligated into a pET-23b(+) expression vector to produce N-terminally T7-tagged and C-terminally 6×His-tagged AgnB. E. coli BL21(λDE3)ΔlacZ transformed with pET-23b/agnb were cultured, and the expression was induced with IPTG. The 6×His-tagged protein was purified from cell lysate using immobilized Ni2+ affinity chromatography. The purified protein migrated as a single protein band of around 200 kDa on reducing SDS-PAGE, which coincides with the calculated molecular mass (212 kDa) (Fig. 1c).

Substrate specificity of AgnB

First, we incubated the purified recombinant enzyme with various pNP-monosaccharides (GlcNAcα1-pNP, GlcNAcβ1-pNP, GalNAcα1-pNP, GalNAcβ1-pNP, Glcα1-pNP, Glcβ1-pNP, Galα1-pNP, Galβ1-pNP, Fucα1-pNP, Fucβ1-pNP, Manα1-pNP, Manβ1-pNP, Xylα1-pNP, and Xylβ1-pNP). Among these, only GlcNAcα1-pNP was slowly hydrolyzed by this enzyme (data not shown). We confirmed the release of GlcNAc from GlcNAcα1-pNP using TLC, but not from GlcNAcβ1-pNP (Fig. 2a). GlcNAcα1-MU was efficiently hydrolyzed under the same condition (Fig. 2b). Then, we incubated the enzyme with pMP-disaccharides containing α-GlcNAc with various linkages (Fig. 2c). GlcNAc was readily released from GlcNAcα1-4Galβ1-pMP and very slowly from GlcNAcα1-6Galβ1-pMP. However, α1,2- and α1,3-linked GlcNAc were resistant to this enzyme. Since GlcNAcα1-4Galβ1-R structure is specifically distributed in O-glycans of gastroduodenal mucin, we incubated AgnB with PGM and detected the release of GlcNAc (Fig. 2d). The disaccharide GlcNAcα1-4Gal prepared from PGM using GlcNAcα1-4Gal-releasing endo-β-galactosidase from C. perfringens (Ashida et al. 2001, 2002) was also readily hydrolyzed into GlcNAc and Gal (Fig. 2e). These results indicate that AgnB has strict glycone and linkage specificity toward terminal α1,4-linked GlcNAc. The specific activities toward hydrolyzed substrates were determined by measuring released GlcNAc (Table 2). The activities for the substrates containing GlcNAcα1-4Gal structure were nearly the same to each other, whereas those for GlcNAcα1-MU and GlcNAcα1-pNP were 20 % and less than 10 %, respectively. The activity for GlcNAcα1-6Galβ1-pMP was very low compared to those for the other substrates. These results suggest that AgnB is essentially specific to the terminal GlcNAcα1-4Gal with rather strict recognition for the aglycone Gal residue.

Fig. 2
figure 2

Substrate specificity of AgnB. Various substrates were incubated with recombinant AgnB and then analyzed by TLC. a GlcNAcα1-pNP and GlcNAcβ1-pNP. b GlcNAcα1-MU. c GlcNAcα1-4Galβ1-pMP (α1-4), GlcNAcα1-3Galβ1-pMP (α1-3), GlcNAcα1-6Galβ1-pMP (α1-6), and GlcNAcα1-2Galβ1-pMP (α1-2). d Porcine gastric mucin (PGM). e GlcNAcα1-4Gal. Plus sign indicates with AgnB; minus sign indicates without AgnB; GN, GlcNAc

Table 2 Specific activities of AgnB and AgnBΔC toward various substrates

General properties of AgnB

The general properties of AgnB were determined using GlcNAcα1-MU as a substrate (Fig. S1). The highest activity was observed at pH 5.5, and more than 80 % activity was detected between pH 4.0 and 6.5. The enzyme was stable between pH 5.0 and 9.5. The optimum temperature was 50 °C and stable up to 40 °C. The divalent cations, Ca2+, Mg2+, Zn2+, Mn2+, Ni2+, and Co2+, did not affect enzyme activity at 5 mM concentration, whereas 5 mM Cu2+ reduced the activity to 10 % (data not shown).

CBM32 domains enhance the affinity of AgnB toward mucin

AgnB possesses four CBM32 domains in the molecule: one near N-terminus and tandem three in C-terminal part (Fig. 1b). To analyze the function of these domains, we first deleted C-terminal three CBM32 domains to make AgnBΔC (aa 52-941) (Fig. 3a). We compared the specific activities of full-length AgnB and AgnBΔC toward various substrates (Table 2). The specific activities of AgnBΔC toward the monomeric substrates were approximately half of those of the full-length AgnB, whereas the activity toward PGM was dramatically reduced to 1/5–1/7. Since this result suggested that C-terminal CBM32 domains enhance the affinity of AgnB toward multivalent substrate, we further made three mutants: AgnBΔN (aa 914-1927), including only C-terminal CBM32-2, CBM32-3, and CBM32-4 domains, and E638A point mutants of full-length AgnB and AgnBΔC (Fig. 3a). E638 is the putative nucleophilic residue, and the enzymatic inactivation in AgnB(E638A) and AgnB(E638A)ΔC mutants was confirmed (data not shown). Then, we performed dot-blot overlay assay to evaluate the binding activity toward PGM (Fig. 3b). Among the constructs, full-length AgnB(E638A) and AgnBΔN bound PGM, showing that C-terminal CBM32 domains contribute the binding. AgnBΔC and AgnB(E638A)ΔC did not bind PGM, suggesting that N-terminal CBM32-1 does not have binding activity. Interestingly, enzymatically active AgnB hardly bound PGM, probably because active enzyme hydrolyzed α-linked GlcNAc, the binding epitopes of CBM32 domains. Taken together, these results suggest that either of C-terminal CBM32 domains specifically binds GlcNAcα1-4Gal-epitope of O-glycans and enhance enzyme activity toward multivalent substrate such as PGM.

Fig 3
figure 3

Dot-blot overlay assay of mutant constructs. a Schematic representation of mutants. b Dot-blot overlay assay. Either PGM or BSA was dot-blotted onto nitrocellulose membrane and then various mutants were overlaid. After washing, bound proteins were detected with anti-His antibody

Discussion

In this study, we identified GH89 α-N-acetylglucosaminidase from bifidobacteria for the first time. The GH89 family that is exclusively composed of α-N-acetylglucosaminidase is distributed from bacteria to higher eukaryotes. Human GH89 enzymes have been well characterized as a lysosomal enzyme that is involved in degradation of heparan sulfate (NAGLU, AAB06188) (Weber et al. 2001). In contrast, those from other organisms have been poorly investigated, except for the enzyme from C. perfringens (AgnC, BAB80572; CpGH89, ABG84150) (Ficko-Blean et al. 2008; Fujita et al. 2011; Ficko-Blean and Boraston 2012). AgnC has multi-domain structure similar to AgnB; the enzyme consists of a CBM32 domain, a GH89 domain, and five CBM32 domains from its N-terminus in this order. The striking difference is that AgnC does not have C-terminal transmembrane region, indicating that AgnC is an extracellular soluble enzyme. GH89 domain of AgnB shares 57 % amino acid identity with that of AgnC. Acid/base and nucleophilic residues are conserved in both enzymes (E519 and E638 in AgnB, E583 and E601 in AgnC, respectively). Their substrate specificities are also similar to each other: both enzymes essentially specific to GlcNAcα1-4Gal structure. However, AgnB slowly hydrolyzed GlcNAcα1-6Gal, which has unnatural structure and is completely resistant to AgnC. GH89 family contains several uncharacterized enzymes of other intestinal bacteria, such as Bacteroides fragilis, Bacteroides thetaiotaomicron, and Akkermansia muciniphila. These GH89 enzymes including AgnB and AgnC may play an important role in obtaining sugars from host’s mucin and in microbe-host interaction.

AgnB has four CBM32 domains, while AgnC has six. Among the six CBM32 domains in AgnC, the third and fourth ones (AgnC-CBM32-3 and AgnC-CBM32-4) were shown to bind GlcNAcα1-4Gal as revealed by glycan array and isothermal titration calorimetry. AgnC-CBM32-5 bound Gal residue, which is the most common feature of CBM32 family. AgnC-CBM32-1 and AgnC-CBM32-6 were concluded to be nonfunctional (Ficko-Blean et al. 2012). Phylogenetic analysis of CBM32 domains revealed that AgnB-CBM32-2, AgnB-CBM32-3, and AgnB-CBM32-4 are relatively close to AgnC-CBM32-3 and AgnC-CBM32-4 (Fig. 4). Thus, each of these three CBM32 domains in AgnB might bind α-linked GlcNAc specifically. Since AgnB-CBM32-1 is very close to AgnC-CBM32-1, this may have no binding activity. CBM32 is one of the most diverse CBM families, whose members were reported to recognize Gal, Galβ1-4GlcNAc, GalNAc, GlcNAc, blood group H trisaccharide, and GlcNAcα1-4Gal (Grondin et al. 2014). Among these, the domains recognizing GlcNAcα1-4Gal have so far been found only in GH89 AgnC and AgnB. It is very interesting that these domains have evolved binding specificity complementary to the catalytic specificity.

Fig. 4
figure 4

Phylogenetic tree of CBM32 domains in AgnB and AgnC. Tree was constructed using the ClustalW program using neighbor-joining method. The sequences of CBM32 domains of AgnC (BAB80572) used are as follows: AgnC-1 (aa 26-154), AgnC-2 (aa 918-1057), AgnC-3 (aa 1063-1201), AgnC-4 (aa 1206-1343), AgnC-5 (aa 1356-1493), and AgnC-6 (aa 1496-1621)

The core-1 disaccharide of O-glycans, Galβ1-3GalNAc, released by bifidobacterial GH101 endo-α-N-acetylgalactosaminidase is highly selective bifidogenic factor, because bifidobacteria possess a specific assimilation pathway (Kiyohara et al. 2009). Galβ1-3GalNAc is resistant to general bacterial β-galactosidases (Yoshida et al. 2012). The disaccharide is, however, transported into the bifidobacterial cells through a specific ABC transporter (Suzuki et al. 2008) and degraded in the cytosol by a specific phosphorylase (Kitaoka et al. 2005). We attempt to produce this disaccharide from PGM, a meat waste, by using a combination of glycosidases and to apply it as a functional food ingredient, so-called prebiotics. For this purpose, bifidobacterial glycosidases, but not clostridial ones, are suitable, because bifidobacteria are safe probiotic bacteria. We have already identified bifidobacterial glycosidases degrading nonreducing terminal glyco-epitopes such as α-sialidase (Kiyohara et al. 2011), 1,2-α-l-fucosidase (Katayama et al. 2004), and blood group B antigen-specific α-galactosidase (Wakinaka et al. 2013). AgnB α-N-acetylglucosaminidase might be indispensable for high-yield production of bifidogenic factor Galβ1-3GalNAc from PGM, because a large proportion of O-glycans are attached by α-linked GlcNAc.