Introduction

Cholera toxin (CT) from Vibrio cholerae exhibits the multi-functional attributes of a single toxin A and five identical B subunits (CTB). The toxin A subunit is composed of two distinct domains, A1 and A2, which are responsible for the enzymatic activity and anchoring of A1 to pentameric CTB, respectively, whereas CTB has no functional enzymatic activity but contributes to the binding of CT to the apical membrane [14]. CT uptake, delivery, and activation are initiated via GM1 ganglioside–CTB binding [1, 4]. GM1 ganglioside is one of the glycosphingolipids and is composed of five sugar residues (Galβ1,3-GalNAcβ1,4-(NeuAcα2,6-)Galβ1,4-Glcβ1,1-ceramide) and distributed to the lipid raft fraction of cell surface. CTB has an affinity against this GM1 ganglioside, CTB binding mediates the transportation of holotoxin to the host cells. CTB itself lacks toxicity but has the affinity for the GM1 ganglioside and is highly immunogenic by an oral route. Therefore, CTB has been widely used as a carrier protein and mucosal adjuvant to increase the uptake of CTB fusion protein and/or antigen to stimulate the mucosal immunity. In addition to this function, CTB has potential as an immunogen for use in oral vaccines.

Recent studies have shown that CTB fusion proteins are expressed in plants [5]. Plants have received attention as a platform for a heterologous protein expression system [68]. Plant molecular farming enables cost-effective and environmentally friendly protein production and is free from human pathogen. Molecular farming in plants is expected to be a challenging task compared with the use of already established protein production systems such as bacteria, yeast, and cultured mammalian cells due to its low expression rate and time requirements [911]. For efficient plant molecular farming, several strategies are now under development: large-scale protein production, a high-expression system, and optimization of post-translational modifications including N-glycosylation [9]. These strategies enable the effective protein production in plants. CTB and its fusion proteins were also produced in tobacco plants [12, 13], rice seeds [14, 15], potatoes [1618], and lettuce plants [19, 20]. These CTB fusion proteins showed in vitro and in vivo activities as expected. Thus, CTB fusion proteins were considered to be effective and useful molecules for human use. However, plant and plant-made CTB fusion proteins have potential as agents for the induction of immune responses mainly mediated by plant-specific β1,2-xylose and/or α1,3-fucose residues on N-glycans.

In most cells, a majority of proteins are synthesized as glycoproteins through a post-translational modification, glycosylation. Glycosylation is a highly sophisticated post-translational modification in the endoplasmic reticulum (ER) and Golgi apparatus and is categorized into two types: N-glycosylation and O-glycosylation. N-Glycosylation is now well characterized and the synthetic pathways of N-glycans in mammals, insects, yeasts, and plants have been proposed [2125]. In mammals, N-glycans contain β1,4-galactose and N-acetylneuraminic acid (NeuAc) residue(s) at the non-reducing terminus of oligosaccharide and bisecting GlcNAc residue, whereas plant complex N-glycans do not have NeuAc residues, but rather carry β1,2-xylose and α1,3-fucose residues. In addition to the β1,2-xylose and/or α1,3-fucose residues, Lewis a structure that is found on the cell-surface glycolipid antigens of red blood cells in humans has also been observed in plants [26, 27]. These plant-specific sugar residues were transferred to N-glycan and the transfer was mediated by β1,2-xylosyltransferase (XYLT), two α1,3-fucosyltransferase(s) (FUCT), α1,4-FUCT, and β1,3-galactosyltransferase. A recent study provided evidence that these plant-specific β1,2-xylose and/or α1,3-fucose residues on plant-derived glycoprotein may induce immune responses and, in some cases, result in induction of allergy because these glycan-epitopes are absent in humans [28, 29]. Furthermore, β1,2-xylose and α1,3-fucose residues are considered as a IgE-binding carbohydrate determinants, the so-called cross-reactive carbohydrate determinants of plant allergens [29, 30]. Hence, an improved understanding of the N-glycans on proteins expressed in plants is critical for human use to avoid induction of immune response.

N-Glycosylation analysis of CTB fusion proteins expressed in plants has been performed in a few studies [13], revealing that the expression of CTB fusion proteins with a C-terminal ER-retention signal, (SE)KDEL, was an effective system for the suppression of β1,2-xylose and/or α1,3-fucose modification on N-glycans. In addition, these expression systems led to an accumulation of N-glycoproteins with a high-mannose-type structure, which are less immunogenic for human. However, these studies analyzed the CTB fusion proteins but not CTB itself. CTB has one potential N-glycosylation site (Asn-Xxx-Ser/Thr) in its amino acid sequence, which is also considered to be N-glycosylated. The N-glycosylation of the site and the N-glycan structures are still unknown. To gain insights into the correlation between the CTB proteins expressed in plants and the N-glycan structures of these proteins for medical use, an understanding of the N-glycan modification of functional CTB molecules is indispensable.

In this study, to clarify the N-glycosylation of CTB, we expressed CTB in rice seeds with a GluB-1 signal peptide and KDEL sequence at the N- and C-terminus, respectively, and the analyzed N-glycosylation and N-glycan structures of CTB. CTB expressed in rice seeds was N-glycosylated at Asn32 with immunoactive glycan structures against anti-HRP antibody, but some of the CTB was not glycosylated. In addition, N-glycosylated CTB carried PNGase F- and Endo H-insensitive N-glycan structures, indicating that the N-glycans of CTB were modified with β1,2-xylose and/or α1,3-fucose residue(s). The detailed N-glycan analysis revealed that the majority of the structures were plant-specific M3FX and that the N-glycans contained small amounts of high-mannose-type N-glycan. These results suggested that the CTBs expressed in rice seeds were N-glycosylated through the ER and Golgi N-glycosylation machineries without the ER retrieval even though the CTBs expressed in rice seeds contain the C-terminal KDEL sequence.

Materials and Methods

Construction of CTB Expression Vector and Rice Transformation

The gene cassette consisted of the endosperm-specific 2.3-kb GluB-1 promoter, a coding sequence for the signal peptide of GluB-1 and the codon-optimized CTB coding gene of V. cholerae, and a 0.65-kb GluB1 terminator [31, 32], and it was cloned into the EcoRI–Sse8387I site of pTL7 [33] to produce pTL-CTB. The Sse8387I fragment of pNPI130Hm [33] was cloned into the Sse8387I site of pTL-CTB to generate pTL-CTB130Hm. A binary vector, pTL-CTB130Hm, was transformed in rice plants, Oryza sativa L.cv. Nipponbare, using an agrobacterium-mediated method. Transformation of rice plants and regeneration of shoots were carried out according to the procedure described previously [34]. The transgenic rice plants were grown under controlled environment conditions and water with a nutrient solution in a closed green house. Plants were grown with a 12-h photoperiod and 50 % relative humidity in 23–28 °C air during the photo-/dark period.

Expression Analysis of CTB and Its Purification

Soluble CTB proteins were extracted as described previously [35]. The crude protein samples were subjected to 15 % SDS-PAGE under reducing conditions, followed by CBB staining and western blotting. In western blotting, the proteins fractionated by SDS-PAGE were electroblotted onto a PVDF membrane. The membrane was incubated in phosphate-buffered saline (PBS) with 5 % skim milk at room temperature and probed with anti-CTB antibody and anti-KDEL antibody (StressMarq, Canada) as primary antibodies and anti-mouse IgG antibody and anti-rabbit IgG conjugated to horseradish peroxidase (GE Healthcare, Buckinghamshire, England) as secondary antibodies, respectively. Specific bands were visualized using the POD Immunostain Set (Wako, Osaka, Japan).

Rice seeds were ground to a fine powder and total proteins were extracted in 50 mM Tris–HCl buffer (pH 7.5), 500 mM NaCl, 5 mM imidazole, and 5 % 3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulphonate (CHAPS) to solubilize CTB by vortex and centrifuged at 17,000×g for 15 min at room temperature. The supernatant was subjected to a TALON® Resin column (TaKaRa, Shiga, Japan) and purified following the manufacturer’s protocols.

N-Glycosylation Analysis by Western Blotting and Endoglycosidase Digestions

The purified CTB was separated by 15 % SDS-PAGE and detected by CBB staining or western blotting using anti-CTB antibody or anti-HRP antibody as described above. For N-glycosylation analysis, anti-HRP antibody (Sigma, St. Louis, MO) and anti-rabbit antibody conjugated to horseradish peroxidase (GE Healthcare) were used as a primary antibody and a secondary antibody, respectively, and visualized as described above.

The purified CTBs were digested with peptide:N-glycosidase F (PNGase F, TaKaRa) or endoglycosidase H (Endo H) (NEB, Tokyo, Japan) following the manufacturer’s protocols, followed by separation by 15 % SDS-PAGE and CBB staining.

NanoLC–MS/MS Analysis of CTB for De Novo Sequencing

The purified CTB was separated by 15 % SDS-PAGE and stained by CBB. The CBB-stained bands were excised from the gel, destained with 50 mM NH4HCO3 in 50 % acetonitrile and dehydrated using acetonitrile, followed by reduction and alkylation. The CTB was digested using chymotrypsin (Sigma) in ProteaseMAX™ Surfactant (Promega, Madison, WI) at 37 °C for 3 h.

The chymotrypsinized samples containing 5 % acetonitrile were analyzed using an Agilent Technologies 1200 series nanoLC system (Agilent Technologies, Santa Clara, CA) equipped with micrOTOF-QII (BRUKER DALTONICS, Bremen, Germany). For the liquid chromatography portion of the analysis, a ZORBAX 300SB-C18 (5 μm, 0.3 mm × 5 mm) and ZORBAX 300SB-C18 (3.5 mm, 75 μm × 150 mm) (Agilent Technologies) were used as a trapping column and an analytical column, respectively. The mobile phase was composed of 0.1 % formic acid (solvent A) and acetonitrile containing 0.1 % formic acid (solvent B) for nanoLC and 0.1 % trifluoroacetic acid (solvent C) for peptide trapping in the column. Following injection, the flow was directed to the trapping column in solvent C at a flow rate 10 μl/min. The CTB peptides were separated by linearly increasing the solvent B concentration from 8 to 30 % over 30 min at a flow rate of 300 nl/min, followed by washing with 95 % solvent B for 5 min, and equilibration for 18 min with initial flow. In the mass spectrometry portion of the analysis, the MS/MS parameters were as follows: scan range m/z 50–4500, nebulizer flow of 1.0 bar, dry gas flow rate of 5.0 l/min, and dry temperature of 180 °C in the positive-ion mode. The MS data were analyzed using DataAnalysis 4.0 software (BRUKER DALTONICS).

Preparation of N-Glycans and Their Structural Analysis by HPLC and LC–MS/MS

The N-glycopeptide from CTB was obtained by in-gel digestion as described above and completely dried out. The N-glycans were released from the in-gel-digested CTB by hydrazinolysis at 100 °C for 10 h. The released N-glycans were 2-aminopyridine (PA)-labeled as described previously [36]. The reaction products of PA-N-glycans were purified by size fractionation (SF)-high-performance liquid chromatography (HPLC) using a Shodex Asahipak NH2P-50 4E column (4.6 mm ID × 250 mm; SHOWA DENKO Co., Ltd., Tokyo, Japan). The N-glycans were detected by reverse phase (RP)-HPLC. The mobile phase was composed of 0.02 % trifluoroacetic acid (solvent A) and acetonitrile/0.02 % trifluoroacetic acid (solvent B) (20/80, v/v). RP-HPLC was performed using a Cosmosil 5C18-AR-II column (4.6 × 250 mm; Nacalai Tesque, Kyoto, Japan) with an HITACHI LaChrom HPLC System by linearly increasing the solvent B concentration from 0 to 20 % over 35 min at a flow rate of 0.7 ml/min. The eluted fractions were monitored by measuring the fluorescence intensity using excitation and emission wavelengths of 310 and 380 nm, respectively.

The molecular masses of the PA-N-glycans and the number of their sugar moieties were estimated by LC–MS/MS using an Agilent Technologies 1200 series instrument (Agilent Technologies, Santa Clara, CA) equipped with HCT plus software (BRUKER DALTONICS). For the LC, the mobile phase was composed of acetonitrile/acetic acid (solvent A: 98/2, v/v) and water/acetic acid/triethylamine (solvent B: 92/5/3, v/v/v). The PA-sugar chain was separated using a Shodex Asahipak NH2P-50 2D column (2.0 mm ID × 150 mm; SHOWA DENKO Co., Ltd.) by linearly increasing the solvent B concentration from 20 to 55 % over 35 min at a flow rate of 0.2 ml/min. The MS/MS parameters were as follows: scan range m/z 350–2750, nebulizer flow of 5.0 psi, dry gas flow rate of 3.0 l/min, dry temperature of 300 °C, target count of 200,000, and MS/MS Frag. Ampl. of 1.0 V in the positive-ion mode. The relative amount of N-glycan detected in each plant was calculated on the basis of the peak area of the LC.

To determine the PA-N-glycan structures, PA-N-glycans were further separated by SF-HPLC under the following conditions; the mobile phase was composed of 80 % acetonitrile (solvent A) and 20 % acetonitrile (solvent B) (20/80, v/v) using a Shodex Asahipak NH2P-50 4E column with the an HITACHI LaChrom HPLC System by linearly increasing the solvent B concentration from 10 to 60 % over 25 min at a flow rate of 0.7 ml/min. Each structure of PA-N-glycans was compared with authentic PA-N-glycans purchased from TaKaRa or prepared from Arabidopsis thaliana Col-0 WT as previously reported [37].

Results

CTB was Expressed in Rice

A binary vector, pTL-CTB130Hm (Fig. 1), was introduced into rice plants (Oryza sativa L.cv. Nipponbare) mediated by agrobacterium to produce transgenic rice plants expressing CTB. The insertion of the expression cassette into genome DNA and CTB expression was analyzed by PCR using genomic DNA as a template and western blotting using anti-CTB antibody, respectively (data not shown). The transgenic rice plants expressing the highest level of CTB were grown in a closed green house for about 4 months, and mature seeds that accumulated CTB 3.37 mg/g rice seeds were harvested.

Fig. 1
figure 1

Schematic representation of the plant expression vector, pTL-CTB130Hm. The original vector used for expression of CTB was the MAT vector (indicated in gray) [32]. CTB was expressed under the control of the glutelin B-1 promoter (GluB-1PRO) and the glutelin B-1 terminator (GluB-1 T) with the signal peptide of GluB-1 at the N-terminus and KDEL at the C-terminus, respectively, for enhancement of the stable expression and the accumulation of CTBs. This construct also contains the hygromycin phosphotransferase gene, hpt, as a selection marker and isopentenyl transferase gene, ipt, and recombinase gene, Rec, for the production of marker-free transgenic rice [32]. RS recognition sequence; 35S PRO cauliflower mosaic virus 35S promoter, NosT nopaline synthase terminator, RB and LB right and left borders, respectively

CTB was Expressed in Two-Forms: An N-Glycosylated Form and Non-N-glycosylated Form

CTB contains a potential N-glycosylation site, Asn32, in its expressed sequence. This indicates that CTB might have been expressed as an N-glycoprotein in rice. Furthermore, western blotting analysis of transgenic rice using an anti-CTB antibody demonstrated that CTB was expressed in two forms in rice, which led us to hypothesize that one of the CTBs, i.e., the one-detected by the upper band (band 1), had been N-glycosylated in rice (Fig. 2a). To ascertain whether these two forms of CTB were derived from N-glycosylated and/or non-N-glycosylated CTB and to further characterize them, rice-derived CTB was first purified using a Co2+ column. A previous study demonstrated that CTB could be purified with Co2+ column, based on the formation of pentameric CTB [13]. In this study, CBB staining and western blotting using anti-CTB antibody and anti-KDEL antibody showed that the CTB expressed in rice carried KDEL tag at their C-terminus and successfully purified (Fig. 2b, c). Furthermore, western blotting analysis using an anti-HRP antibody that recognizes plant-specific N-glycoproteins carrying β1,2-xylose and/or α1,3-fucose residue(s) revealed that CTB band 1 was immune-sensitive against anti-HRP antibody. The lower band (band 2), on the other hand, was not immune-insensitive (Fig. 2d). These results indicated that one of the forms of CTB expressed in rice was N-glycosylated with plant-specific sugar residue(s) and that the other CTB forms were not N-glycosylated. Furthermore, CTB was expressed as pentameric protein with N-glycosylated and/or non-N-glycosylated CTBs in rice seeds (Fig. 2e). This agreed well with the result of western blotting which indicated that CTBs could be purified using Co2+ column by the formation of pentameric complex.

Fig. 2
figure 2

Detection and analysis of the post-translational modification of CTB. Total proteins from rice seeds of WT and transgenic rice were extracted and analyzed by CBB staining (a) or western blotting using anti-CTB antibody (b) and anti-KDEL antibody (c). Black triangles indicate the CTB. d Detection of the N-glycosylated-form of CTB. Purified CTB was used for detection of the immunoreactivity against the anti-CTB antibody and anti-HRP antibody. Black and gray triangles indicate the N-glycosylated form and non-N-glycosylated form of CTB, respectively. e Pentameric forms of CTB. Purified CTB was analyzed by SDS-PAGE and native-PAGE. N-Glycosylated and non-N-glycosylated CTB were detected at approximately 15 and 13 kDa, respectively, in SDS-PAGE, whereas pentameric CTB was detected at approximately 70 kDa in native-PAGE

For further characterization of the N-glycan moieties of the CTBs expressed in rice, purified CTB was digested with two N-glycan-specific endoglycosidases, peptide:N-glycosidase F (PNGase F) and endoglycosidase H (Endo H). PNGase F cleaves N-glycans off at the GlcNAc-Asn linkage of non-α1,3-fucosylated N-glycoprotein, but does not cleave N-glycan off if the N-glycan contains α1,3-fucose residue (Fig. 3a). Endo H recognizes the terminal mannose residue of Manα1,3-Manα1,6-Manβ1,4-GlcNAcβ1,4-GlcNAc of high-mannose-type and/or hybrid-type N-glycans and cleaves the acetylchitobiose core of oligosaccharide (Fig. 3a). Therefore, we can easily evaluate the N-glycan structure(s) of CTB by PNGase F and Endo H digestion. PNGase F or Endo H digestion of CTB resulted in no obvious band shift and no accumulation of the non-glycosylated CTB form, respectively (Fig. 3b). These results indicate that most of the CTB carried α1,3-fucosylated N-glycan(s) but did not N-glycosylate with high-mannose-type N-glycan(s).

Fig. 3
figure 3

N-Glycosylation analysis of CTB. a Schematic representation of the substrate specificity and recognition site of endoglycosidases. b Endoglycosidase digestion of CTB. Reaction products of endoglycosidase digestion were detected by CBB staining. Black and gray triangles indicate the N-glycosylated form and non-N-glycosylated form of CTB, respectively

Asn32 of CTB was N-Glycosylated

As mentioned above, the CTB expression sequence used in this study contained a consensus N-glycosylation sequence, Asn-Xxx-Ser/Thr, at Asn32 to Thr34. This sequence and the results of western blotting using anti-HRP antibody (Fig. 2d) suggested that CTB was expressed as an N-glycoprotein. To determine the N-glycosylation site of CTB, in-gel digestion of purified CTB using chymotrypsin was performed for de novo sequencing using nanoLC–MS/MS analysis. A MASCOT search (http://www.matrixscience.com/search_form_select.html) revealed that both of the two forms of CTB expressed in rice corresponded to the database sequence at a sequence coverage of 51 % (data not shown). Furthermore, deconvoluted MS/MS signals showed two fragmentation patterns derived from free N-glycan and N-glycopeptide from CTB, respectively (Fig. 4). Fragmentation analysis of free N-glycan exhibited that the N-glycan of CTB mainly consisted of five sugar residues—three hexoses, one deoxyhexose, and one pentose in the core structure of GlcNAc2-Asn. This indicates that CTB was considered to be N-glycosylated with xylose and fucose residues. In addition, fragmentation analysis of N-glycopeptide showed that the fragmented m/z 3395.51 agreed well with the calculated molecular mass of weight of amino acids Met27 to Tyr55 with single methylation. In this amino acid sequence, Asn32 was a good candidate for N-glycosylation. Therefore, these results strongly suggested that CTB was N-glycosylated at Asn32 with the plant-specific sugar residues, xylose and fucose.

Fig. 4
figure 4

Determination of the N-glycosylation site and the components of sugar residues in CTB N-glycan. Chymotrypsin-digested N-glycosylated CTB was subjected to nanoLC–MS/MS analysis. The upper panel shows deconvoluted MS/MS spectrums of free-sugar residues from the CTB N-glycan and N-glycosylated peptide of CTB. The lower panels show the enlarged spectrums of free-sugar residues and N-glycosylated peptide, respectively. The black arrow, black dotted arrow, gray arrow, and gray dotted arrow indicate hexose, deoxyhexose, N-acetyl-hexosamine, and pentose, respectively. m/z 204 and 3395 are the calculated masses of the [HexNAc + H]1+ and peptide sequence of Met27 to Tyr55

CTB Expressed in Rice was Mainly Modified with Plant-Specific N-Glycans

The results of both western blotting using anti-HRP antibody and de novo sequencing suggested that one of the forms of CTB (detected as band 1) was N-glycosylated. To determine the CTB profile in greater detail, the N-glycan structures attached at Asn32 were determined by hydrazinolysis of N-glycopeptide obtained by in-gel digestion of both bands detected in CBB staining, followed by PA-labeling, RP-HPLC and SF-HPLC analysis, and LC–MS/MS analysis (Fig. 5a, b; Table 1). RP-HPLC analysis of PA-N-glycan showed that band 1 (the upper band) contained three peaks, but band 2 (the lower band) contained no obvious peaks, indicating that band 2 was not N-glycosylated. This result further supported the results of the western blotting using anti-HRP antibody, which demonstrated that band 2 was the non-N-glycosylated form of CTB. The major N-glycan in the band 1 was M3FX, which is generally the most abundant plant-specific N-glycan (Fig. 5c, d). Furthermore, M3X was also detected as a plant-specific N-glycan. The total quantity of N-glycans carrying plant-specific sugar residues, β1,2-xylose and/or α1,3-fucose, accounted for approximately 70 % of the total N-glycans detected in CTB. In general, these β1,2-xylosylations and α1,3-fucosylations took place in the Golgi. Therefore, most of the CTB was N-glycosylated via Golgi-N-glycosylation machinery. These findings also suggested that the KDEL sequence at the C-terminus was not functional in the CTB retrieval for the ER. Interestingly, in addition to the plant-specific N-glycan structures, CTB also contained terminal-mannosylated-N-glycans, M3, M4, M5, and M6B. These structures were synthesized by the Golgi- or vacuole-localized glycosylhydrolases, α1,2-mannosidase I and/or α-mannosidase. Together, these results strongly supported the idea that CTB was N-glycosylated through the Golgi and/or vacuole N-glycosylation machineries against the ER retrieval mediated by the C-terminal KDEL sequence.

Fig. 5
figure 5

N-Glycan analysis of CTB. N-Glycosylated CTB was excised from a gel and chymotrypsinized. The N-glycan was prepared by hydrazinolysis of the N-glycopeptide, followed by labeling with PA, and analyzed. a RP-HPLC analysis of the PA-sugar chain using a C18 column. Band 1 was prepared from N-glycosylated CTB and band 2 was prepared from non-N-glycosylated CTB. Black triangles indicate non-PA-sugar derivatives. b Secondary chromatographic separation of PA-derivatives from RP-HPLC using an amide column. Peak 1, peak 2, and four significant peaks in peak 3 were collected, followed by further structural analysis by LC–MS/MS and comparison with authentic PA-sugar chains. Arrows indicate the collected peak, followed by LC–MS/MS analysis and comparison with authentic PA-sugar chains using RP-HPLC. c. LC–MS/MS analysis of the predominant PA-sugar chain from peak 1. The labeled signal represents (M + H)+ ions. The value of m/z 503 detected in the MS/MS spectra agrees well with the calculated mass of HexNAc2-PA, and the mass of the precursor ion, m/z 1268.5, was considered to correspond to Hex3DeoxyHex1Pentose1HexNAc2-PA. The black arrow, black dotted arrow, and gray dotted arrow indicate hexose, deoxyhexose, and pentose, respectively. A black diamond indicates the precursor ion of MS/MS fragmentation. d RP-HPLC profiles of Hex3DeoxyHex1Pentose1HexNAc2-PA. The PA-sugar chain from peak 1 corresponded to an authentic sugar chain M3FX prepared from A. thaliana

Table 1 Relative amounts of N-glycan structures detected in CTB

Discussion

CTB is an ideal and a good-candidate molecule as not only carrier protein, adjuvant, but also the elicitor of immune response. Thus, the large-scale and low-cost production of CTB or its fusion proteins for use in medical applications is a challenging task. Previously, the CTB fusion protein of MPR649–684, which is membrane proximal region of gp41 from HIV virus, was successfully expressed in N. benthamiana plants and its N-glycosylation and N-glycans were analyzed [13]. However, the N-glycosylation and N-glycans of plant-derived CTB itself remain to be clarified. Here, focusing on the large-scale protein production in plants and its N-glycosylation, we expressed CTB in rice seeds and analyzed to evaluate whether the N-glycosylation occurred at the potential N-glycosylation site, and whether the plant-derived CTB contains high-mannose-type structures like those detected in CTB–MPR694–684 or immunogenic β1,2-xylose and/or α1,3-fucose residue(s). Recombinant CTB was N-glycosylated at Asn32 with a plant-specific N-glycan carrying β1,2-xylose and/or α1,3-fucose residue(s). The presence of β1,2-xylose and/or α1,3-fucose residue(s) on N-glycans strongly indicated that CTB was N-glycosylated with Golgi N-glycosylation machinery, because XYLT and FUCTs are considered to localize at medial-Golgi [3840]. Furthermore, high-mannose-type N-glycan was not observed in RP-HPLC analysis of PA-sugar chain, providing evidence that the CTB expressed in rice seeds escaped from ER-retrieval, even though this CTB had a KDEL sequence at its C-terminal, and that the CTB was transferred to the Golgi and vacuole N-glycosylation machineries.

Why did the CTB expressed in rice seeds escape from ER-retrieval? In general, the subcellular localization of heterologous proteins expressed in plants is determined by appropriate signal tag or endogenous targeting and/or sorting sequences. Recombinant proteins expressed by the driven with the GluB-1 promoter and the GluB-1 signal peptide were accumulated in protein bodies (PBs), especially in glutelin-containing protein storage vacuoles (PSVs) in rice seeds [32, 41]. Another well-known PBs is the prolamin-containing PBs. Prolamins accumulate in PBs inside the rough ER without the typical ER-retrieval signal, whereas glutelins are transported from the cisternal ER through the Golgi to the PSVs. In PSVs, just as in glutelins, proteins are considered to be accumulated with the processing of macromolecular complexes formation. Escherichia coli heat-labile enterotoxin B (LTB) and immunoglobulin, for instance, required the multimeric protein formations and assembly for localization [4244]. Thus, correct folding and assembly as multimeric proteins are indispensable for the protein subcellular destination and localization. The localization of CTB expressed in this study was not clear, but one possible hypothesis was considered: CTB was accumulated in PSVs through glutelin-like processing. As showed in Fig. 2e, CTB seemed to be pentameric with N-glycosylated and/or non-N-glycosylated CTBs in rice seeds. This suggests that CTBs were assembled before they reached PSVs, resulting in CTB transition through ER-Golgi processing to PSVs. In addition to this pentameric assembly, accumulation of CTB in PSVs was considered to be enhanced by the GluB-1 signal sequence. GluB-1 signal sequence mediates the ER targeting of the mRNA, which activates the interactions of synthesized protein with molecular chaperons in the ER, which results in the accumulation of protein same as glutelin. This hypothesis was further supported by the results of a previous study which showed that some of CTB was accumulated in PSVs [31] and by the N-glycan analysis in this study. The CTB expressed in rice seeds had M3FX as a predominant structure, which is synthesized via contributions of Golgi- and vacuole-localized enzymes. The formation of this M3FX, first XYLT and FUCTs transfer β1,2-xylose and α1,3-fucose residues to N-glycan, which results in the synthesis of GN2M3FX, and further maturation occurred by the subsequent trimming of two GlcNAc residues by vacuole-localized hexosaminidases [45]. Therefore, the pentameric complexes of CTB and the presence of M3FX N-glycan strongly suggested that CTB expressed in rice seeds was transported through a glutelin-like post-translational protein modification, such as multimeric protein assembly and ER–Golgi transition to PSVs. To examine this hypothesis, further investigations, such as inhibition of the assembly of pentameric CTB, CTB expression with a constitutive promoter without GluB-1 signal sequences, and localization analysis are required and in progress.

Many recombinant proteins and CTB fusion proteins have been expressed and their N-glycosylations analyzed. Previously, Zhang et al. [46] showed that the recombinant human transferrin (rhTF) expressed in rice seeds was not N-glycosylated. It was of interest that rhTF was not N-glycosylated, though hTF potentially has two N-glycosylation sites which are, in general, N-glycosylated in the mammalian expression system. However, the absence of N-glycan on rhTF was not consistent with the result which showed that human lactoferrin (rhLF) expressed in rice seeds carried N-glycans [47, 48], though the amino acids sequence between rhTF and rhLF is quite similar: the amino acid identity and similarity were 59 and 85 %, respectively. Unfortunately, the reasons for this discrepancy are still unclear. On the other hand, Yang et al. [35] showed that a major house dust mite allergen, Der p 1, expressed in rice seeds under the same expression construct as that of CTB was N-glycosylated with the predominant N-glycan of high-mannose-type structure, but some of Der p 1 also was not N-glycosylated, just as seen in the CTB in this study. However, N-glycan structures and subcellular localization analysis demonstrated that Der p 1 was specifically localized in prolamin-containing PBs, suggesting that the C-terminal KDEL sequence of Der p 1 successfully worked in rice seeds, though the GluB-1 signal sequence seemed not to be functional for the targeting of Der p 1 to PSVs. Interestingly, though the CTBs expressed in rice seeds seemed to be transported through the ER–Golgi machineries by escaping from the ER retention and finally accumulated in PSVs, some of them were not N-glycosylated (Fig. 2d). The results in this study were thus opposed to the results of Der p 1. What was the reason for this disparity? One key factor was the 3D structure and native forms of CTB. As mentioned above, a native CTB showed pentameric complexes. In this form, the CTBs seem to be an inaccessible to KDEL receptors, whereas the Der p 1 forms to be accessible to these receptors (data not shown). Der p 1 is monomeric in acidic solution, but forms a dimer in alkaline solution and in the crystal [49, 50], which indicates that the Der p 1 expressed in rice seeds is dimeric. Despite this dimeric solution, the KDEL signals are located opposite each other and are exposed to the surface. Therefore, Der p 1 was considered to be successfully binding to the KDEL receptor and retrieved to the ER. Each C-terminus of pentameric CTB, on the other hand, projects to same sides and make it difficult to access toward KDEL receptor. Therefore, CTB was speculated to have escaped from the ER-retrieve and to have N-glycan structures different from those of Der p 1. Furthermore, the pentameric forms of CTB and GluB-1 signal sequence also enhanced rather the ER–Golgi transit to PSVs by glutelin-like machinery than the ER retrieval mediated by KDEL signal, as mentioned above.

In this study, we focused on the expression of CTB in rice seeds and showed the post-translational modifications of CTB, including N-glycosylation, N-glycan analysis, and multimeric assembly. These results suggested that heterologous protein productions in plants, especially in rice seeds, are influenced by not only the endogenous signal sequence, promoter-specific expression, and the ER-retrieval sequence but also the multimeric assembly of proteins. These issues may be solved by the far-reaching choice of a target protein and its expression systems using a signal sequence, the expression of appropriate fusion proteins, and protein localization in plants.