Introduction

Streptococcus pneumoniae causes diseases as pneumonia, meningitis, and sepsis, representing a major cause of mortality worldwide, mainly among young children and elderly people (Walker et al. 2013). Conjugated pneumococcal vaccines based on capsular polysaccharides (PS) effectively prevent invasive diseases, but only of the serotypes included in formulations. As a consequence, the vaccine serotypes are being replaced by non-vaccine serotypes, which have emerged as a new challenge (Pichon et al. 2013), because the need for multiple PS in formulations increases the costs associated with the development of wide-spectrum vaccines and hinders their use in low- and middle-income countries.

The use of protein antigens shared by different S. pneumoniae serotypes should allow for the elaboration of serotype-independent pneumococcal vaccines. Among candidates, pneumococcal surface protein A (PspA) represents one of the most promising. Every pneumococcal strain evaluated to date expresses this protein attached to the cell wall and exposed outside the polysaccharide layer. PspA inhibits complement deposition on the pneumococcal surface (Ren et al. 2004) and binds to apolactoferrin, preventing its bactericide action in mucosal areas (Mirza et al. 2004). Immunization with PspA protected mice in challenge experiments (Briles et al. 2000; Briles et al. 2003; Darrieux et al. 2008) and sera from human volunteers immunized with PspA protected mice in passive immunization assays (Briles et al. 2000). Moreover, pre-existing serum antibodies against PspA, but not PS, protected humans against carriage after intranasal inoculation with S. pneumoniae (McCool et al. 2002).

The PspA structure comprises four distinct domains: a coiled-coil α-helix-rich N-terminal domain that is exposed outside the polysaccharide layer, a proline-rich domain, a choline-binding domain, and a short hydrophobic tail at the C-terminal domain (Jedrzejas et al. 2001). PspA variants are divided into three families and six clades depending on the identity of amino acid sequences of the α-helix region (Hollingshead et al. 2000). Several studies have shown that more than 90% of pneumococcal strains express PspA belonging to families 1 or 2 (Coral et al. 2001; Croney et al. 2012; Qian et al. 2012). Some N-terminal fragments of PspA elicit a broadly protective cross-response against several clades (Darrieux et al. 2008; Oliveira et al. 2010; Moreno et al. 2010; Goulart et al. 2011).

In large-scale recombinant protein production, downstream processes account for up to 80% of total production costs (Banki et al. 2005). These costs, while potentially acceptable for high-end products, pose an obstacle to the large-scale production of vaccines that should benefit entire populations in countries with different investment potentials. Therefore, the development of simple and reliable purification processes for recombinant proteins represents an important goal in vaccine production.

Tags are often added to recombinant proteins to facilitate the purification process, improve yield, increase solubility, and aid refolding. However, tags may also have detrimental effects such as changing protein conformation, inhibiting enzyme activity, altering biological activity, and toxicity (Arnau et al. 2006). Even commonly used small tags, like histidine tags, may interfere with protein properties (Wu and Filutowicz 1999). Because of these negative effects, ideally, tags should be cleaved off and an additional purification step added to eliminate leftover-tagged proteins, cleaving enzymes, and cleaved tags. Tag removal poses some technical difficulties: the cleavage may not be complete or the proteases may cleave the target protein at secondary sites; the separation often has low yield and depends on expensive chromatography resins due to the similarity between hydrolyzed and tagged molecules (Arnau et al. 2006). Furthermore, the affinity step alone does not always suffice, and further purification steps are required for a product with the desired purity (Hou et al. 2004). An alternative and novel strategy was developed using self-cleaving tags, thus eliminating the need for proteases (Lu et al. 2011; Shi et al. 2014). However, cleavage may be incomplete and further purification steps must be included to separate the target protein from the cleavage products.

At industrial scale, on the other hand, a tag is adopted only if it presents clear advantages in production or to the therapeutic, because of extra time and cost associated with tag removal and the subsequent purification steps required when a tag system is used (Bell et al. 2013). Furthermore, few studies describe the production of untagged recombinant proteins (Hou et al. 2004; Wear et al. 2005; Ludwig et al. 2010).

In the present work, we describe a fully scalable process for the production and purification of a recombinant fragment of PspA from clade 4 (PspA4Pro), which contains the N-terminal α-helix domain and the first block of the proline-rich domain and cross-reacts with PspA variants from clades 1 to 5 (Moreno et al. 2010). This fragment, cloned without tags, was produced in the intracellular soluble fraction of Escherichia coli. The purification strategy was designed to reach at least the purity recommended by the World Health Organization (WHO 2009) for carrier proteins used in pneumococcal conjugated vaccines with low endotoxin content.

Materials and methods

Materials

The shovel mixer X-520 was supplied by Ingenieurbüro CAT M. Zipperer GmbH (Staufen im Breisgau, Germany). The cell disruption system was composed of a high-pressure continuous homogenizer from APV GAULIN (Crawley, UK), a tube-and-shell heat exchanger from Exergy (NY, USA), a jacketed reservoir (ArtePeças, São Paulo, Brazil), and a chiller from PolyScience (Nilles, USA). The centrifuge Avanti J-251 was from Beckman Coulter (Brea, USA). All preparative chromatographic media and preparative chromatographic system (Äkta Avant 150) were from GE Lifescience (USA). The densitometer GS-800 was from BioRad (Hercules, USA). A size exclusion high-performance chromatography (HPSEC) column TSK-GEL G2000SWXL (TOSOH BIOSCIENCE, Minato, Japan) connected to a high-performance liquid chromatography (HPLC) system SCL-10AVP (SHIMADZU, Kyoto, Japan) was used for lactoferrin binding and molecular mass assays. Circular dichroism spectra were obtained in a spectropolarimeter J-810 (Japan Spectroscopic, Tokyo, Japan) equipped with a Peltier unit for temperature control. Human milk lactoferrin, phenylmethilsulfonyl fluoride (PMSF), Triton X-100, cationic detergent cetyltrimethylammonium bromide (CTAB), globular protein molecular mass standards, and dextran linear polymer standards were purchased from Sigma-Aldrich (St. Louis, USA). Phytone peptone and yeast extract were provided by BD/Difco (Sparks, USA). All other reagents were of analytical grade, and water for injection was used in the process.

PspA4Pro production

The N-terminal domain plus the first proline block of PspA4Pro gene (GenBank EF649969.1) was inserted in the plasmid pET37b+ (Novagen) and expressed in E. coli BL21(DE3) (Invitrogen, Carlsbad, CA, USA). A 5-L bioreactor described by Horta et al. (2011) was monitored and controlled by the software SuperSys_HCDC (Horta et al. 2014). The pH was controlled at 6.7 by addition of NH4OH (30%). Temperature was set at 31 °C, and dissolved oxygen concentration was kept no lower than 30% by a controller, which automatically changed the stirrer speed between 200 and 900 rpm and added pure oxygen to the air stream supplied to the bioreactor. The total gas flow rate was maintained at 4 L/min. Two strategies were applied for cultivation: a simple batch with an auto-induction medium (see medium composition in Campani et al. 2016) and a fed-batch with chemically defined medium using glycerol as sole carbon source and lactose as inducer, according to Horta et al. (2012).

Cell concentration was measured by culture broth optical density (λ = 600 nm) and dry cell weight (g/L). Acetate and carbon source concentrations were analyzed by HPLC using an Aminex HPX-87H column (Bio-Rad) and 5 mM sulfuric acid as the mobile phase (0.6 mL/min) at 60 °C, with a refraction index detector (Waters 410) for sugars and glycerol, and UV-detector (Waters 486) at 210 nm for organic acids. For plasmid stability measurement, diluted samples of culture broth were plated on LB agar and incubated for 24 h at 37 °C. Then, at least 50 colonies from each sample were replicated in LB agar with and without kanamycin. After incubation at the same conditions, plasmid stability was calculated as the percentage of antibiotic-resistant colonies in relation to the number of colonies formed in plates without antibiotic.

PspA4Pro purification

The protein was purified with the following steps: cell separation, cell rupture, clarification, anion exchange chromatography, cryoprecipitation at pH 4.0, and cation exchange chromatography. Three clarification procedures were evaluated: CTAB or acidic pH precipitation and centrifugation. Mix mode chromatography was evaluated as an alternative to the two ion exchange chromatographies. After the purification process was established, it was repeated once with the biomass produced with auto-induction medium, and a second time with the biomass from fed-batch cultivation with defined medium and induction with lactose.

Cell separation

Biomass was harvested from culture broth by centrifugation, at 4 °C and 6693×g, for 30 min. The cell pellet was then transferred to plastic bags and stored at −80 °C until use.

Cell rupture

For cell lysis, 100 g of frozen wet cell pellet was suspended in 1 L of lysis buffer: 10 mM sodium phosphate buffer pH 6.5 with 2.5 mM EDTA, 0.1% (m/v) Triton X-100, and 1 mM PMSF, using the shovel mixer at 4 °C and 11,000 rpm until no more clumps were observed. The cell suspension was then recirculated through the high-pressure continuous homogenizer system. Chiller temperature was adjusted to 4 °C and cell rupture was performed circulating the suspension into the close loop for 8 min at 500 bar and 1 L/min. Samples were collected from the homogenizer reservoir every 2 min to analyze the evolution of cell rupture by optical density (OD) at 600 nm and protein release in the medium (absorbance at 280 nm) during the process. The recovered fraction was named Homogenate.

Establishing clarification conditions

Three different pH values and the detergent CTAB were evaluated to improve debris removal efficiency: pH 4.0, 5.5, and 6.5, and 0.1% (m/v) CTAB at pH 6.5. Samples (10 mL) from the Homogenate of the first purification process were adjusted to the desired pH or mixed with CTAB for 60 min at room temperature. These samples were centrifuged at 17,696×g and 4 °C for 90 min, and PspA4Pro purity and recovery were analyzed in the supernatant. The best condition for clarification was then applied to the entire Homogenate volume and to the two following purifications. The Homogenate was centrifuged at 17,696×g and 4 °C for 90 min. The supernatant containing PspA4Pro was called Clarified fraction.

Anion exchange chromatography

Anion exchange chromatography was carried out using 200 mL Q-Sepharose FF resin packed in a XK 50/30 column. The volumetric flow was 2.50 mL/(min cm2) (50 mL/min). The column was equilibrated with five column volumes (CV) of 10 mM sodium phosphate buffer, pH 6.5, and conductivity 1.2 mS/cm. The elution was performed with a discontinuous gradient of NaCl (150, 300, and 1000 mM) in 10 mM phosphate buffer pH 6.5, using 5CV for each step.

Cryoprecipitation

The pH of PspA4Pro fraction eluted from Q-Sepharose was decreased to 4.0 using glacial acetic acid. After pH adjustment, the sample was frozen (−20 °C) for 24 h. After this period, it was thawed and centrifuged (17,696×g, 60 min, 4 °C). The supernatant was recovered and called Cryo-pH 4.0.

Cation exchange chromatography

For cation exchange chromatography, 75 mL of SP-Sepharose FF resin was packed in an XK 26/20 column. The volumetric flow was 2.3 mL/(min cm2) (12 mL/min). The column was equilibrated with 5 CV of 25 mM sodium acetate buffer pH 4.0. The elution was performed by a discontinuous gradient of NaCl (500, 800, and 1000 mM) in 25 mM acetate buffer pH 4.0, using 5CV for each step.

Mix mode chromatography

For the mix mode chromatography, 25 mL of CAPTO-MMC resin was packed in a XK 26/20 column. The flow was 0.75 mL/(min cm2), i.e., 4 mL/min. This chromatography was used at two time points in the process: after clarification and after cryoprecipitation. In the first experiment, 150 mL of Clarified fraction was applied to the resin previously equilibrated with 5 CV of 10 mM sodium phosphate buffer, pH 6.5, and conductivity 1.5 mS/cm. A discontinuous gradient of l-arginine (100, 200, 300, 500, and 1000 mM) in 10 mM phosphate buffer pH 6.5 was performed using 5CV for each elution step. In the second experiment, 450 mL of Cryo-pH 4.0 fraction was applied to the resin previously equilibrated with 5 CV of 25 mM acetate buffer with 300 mM NaCl, pH 4.0, and conductivity 40 mS/cm. The elution was performed by a discontinuous gradient of l-arginine (500, 800, and 1000 mM) in 25 mM acetate buffer pH 4.0, using 5CV for each step.

Protein quantification and PspA4Pro purity and recovery determination

Protein concentration was determined by the Lowry methodology (Lowry et al. 1951). PspA4Pro purity was determined by densitometry of the bands in 12% SDS-PAGE with 2-mercaptoethanol (Laemmli 1970) and calculated as the percentage of PspA4Pro band in relation to the sum of all other bands in the lane. Lipopolysaccharide (LPS) was measured as endotoxin unities by the Limulus amebocyte lysate test (Bang 1956).

The purification factor (PF) and protein recovery (Rc) were calculated according to Eqs. (1) to (3) and the percentage of endotoxin reduction (Er) according to Eq. (4).

$$ PF=\frac{\mathrm{PspA}4\mathrm{Pro}{\left(\%\right)}_n}{\mathrm{PspA}4\mathrm{Pro}\ {\left(\%\right)}_h} $$
(1)

where PspA4Pro (%) is the purity determined by densitometry of PspA4Pro band in the Homogenate fraction (h) and in the step (n).

$$ \mathrm{PspA}4\mathrm{Pro}\left(\mathrm{g}\right)=\frac{\mathrm{Prot}\left(\mathrm{g}\right)\times \mathrm{PspA}4\mathrm{Pro}\ \left(\%\right)}{100} $$
(2)

where PspA4Pro(g) is the PspA4Pro total amount and Prot(g) is the total protein amount measured according to Lowry.

$$ Rc\ \left(\%\right)=\frac{\mathrm{PspA}4\mathrm{Pro}\left(\mathrm{g}\right)n\times 100}{\mathrm{PspA}4\mathrm{Pro}\ \left(\mathrm{g}\right)h} $$
(3)

where PspA4Pro(g)h is the total mass of PspA4Pro in the Homogenate fraction and PspA4Pro(g)n is the total mass of PspA4Pro in the step.

$$ Er\ \left(\%\right)=100-\frac{E\left(EU/\mathrm{mL}\right)n\times \mathrm{V}\left(\mathrm{mL}\right)n\times 100}{E\left(EU/\mathrm{mL}\right)h\times V\left(\mathrm{mL}\right)h} $$
(4)

where E(EU/mL) is the endotoxin concentration and V(mL) is the volume in the Homogenate fraction (h) and in the step (n).

Binding of PspA4Pro to human lactoferrin

The binding ability of PspA4Pro to human lactoferrin was analyzed to verify if the protein preserved its biological function after purification. In this assay, lactoferrin was added to the purified PspA4Pro in three different proportions: 2 mg PspA4Pro plus 0.5 mg lactoferrin, 1 mg PspA4Pro plus 1 mg lactoferrin, and 1 mg PspA4Pro plus 2 mg lactoferrin. The mixtures were incubated at 37 °C for 1 h and applied to the HPSEC. The mobile phase was 300 mM sodium phosphate buffer pH 7.0 with 500 mM NaCl, and the flow rate was 0.6 mL/min.

Determination of PspA4Pro molecular mass

To evaluate the molecular mass of PspA4Pro, two standard curves were prepared in HPSEC. The first curve included the globular proteins β-amylase (220 kDa), albumin (66 kDa), carbonic anhydrase (30 kDa), cytochrome c (12 kDa), and aprotinin (6 kDa). The second curve included linear dextran polymers with molecular masses of 6, 10, 70, and 229 kDa. The molecular mass of PspA4Pro was calculated using the linear regression equation of the molecular mass logarithm versus the retention time of both standard curves.

Secondary structure and stability of purified PspA4Pro

Secondary structure and stability of PspA4Pro at different temperatures and pH values and in the presence of CTAB were verified by circular dichroism (CD). Measurements were obtained from 185 to 260 nm, and final CD spectra resulted from the mean of five measurements. PspA4Pro samples (0.1 mg/mL) were prepared in 10 mM sodium phosphate buffer pH 7.0 as follows: with buffer only, with 0.15% CTAB, and with buffer at pH 3.5 to pH 9.0. The sample in buffer only was heated (1 °C/min) from 15 to 95 °C and cooled back to 15 °C (1 °C/min). Deconvolution was calculated based on the Dichroweb online database (Whitmore and Wallace 2004) with Contin and CDSSTR algorithms (Sreerama and Woody 2000).

Results

PspA4Pro production

The auto-induction animal component-free medium employed in the simple batch cultivation has three carbon sources: glucose, consumed first; glycerol, the main carbon source during the induction phase; and lactose, the inducer of recombinant protein synthesis. Glucose exerts a catabolic repression effect on lac promoter and inhibits PspA4Pro synthesis. Glucose was exhausted in the medium within 8 h of cultivation, and the small amount of acetate produced was completely consumed after 6 h (Fig. 1). Glycerol consumption started after 4 h of cultivation, while lactose consumption started after 8 h, i.e., only after glucose exhaustion (Fig. 1). The combination of glycerol and lactose resulted in an almost unnoticeable diauxic growth. Lactose consumption increased after 9 h cultivation (Fig. 1), in parallel with the beginning of protein synthesis (Fig. 2a). In this process, the biomass concentration reached 39 g/L of dry cell weight in 16 h of cultivation (Fig. 2a). The plasmid remained quite stable, as the percentage of plasmid retention was higher than 88% within 14 h of cultivation and dropped to 70% at the end of the process (Fig. 2a). The volumetric production reached 7.6 g/L of PspA4Pro, the productivity 0.45 g/(L h) and the specific production 200 mg PspA4Pro/ g biomass (Fig. 2a).

Fig. 1
figure 1

Production of PspA4Pro in 5-L bioreactor batch cultivation. Time profiles of biomass (stars), acetate (solid triangles), glucose (open squares), glycerol (open circles), and lactose (solid squares) in Phytone auto-induction medium

Fig. 2
figure 2

PspA4Pro volumetric production (circles), PspA4Pro productivity (triangles), specific PspA4Pro production (diamonds), and plasmid retention (squares) in Phytone auto-induction simple batch (a) and in glycerol-limited fed-batch with lactose induction (b)

For the fed-batch strategy, the biomass increased up to 104 g/L within 36 h cultivation, leading to a PspA4Pro production reached 11.9 g/L after 11 h of induction. The plasmid retention was 95% at the end of cultivation (Fig. 2b). As expected, the fed-batch cultivation reached higher biomass concentration, but the cultivation was longer. Therefore, the PspA4Pro productivity was lower than in auto-induction batch, 0.33 g/(L h), as well as the specific production, 100 mg PspA4Pro/g biomass (Fig. 2b).

Cell rupture and clarification

Soluble PspA4Pro was produced in the cytoplasm of recombinant E. coli. Frozen cells were suspended in lysis buffer and ruptured in a high-pressure continuous homogenizer. During the circulation of cells inside the closed loop of the homogenizer, samples were taken to measure cell lysis and protein release, respectively, by OD at 600 nm and absorbance at 280 nm. There was a rapid decrease in OD, which indicates that most of the cells were ruptured in the first 2 min. After 4 min, both OD and absorbance at 280 nm reached an equilibrium plateau, indicating that cell rupture was completed after four residence times.

After cell lysis, four conditions were evaluated for the clarification step: pH 4.0, pH 5.5, pH 6.5, and 0.1% CTAB at pH 6.5. The purity and recovery increased with increasing pH (Table 1), but the pellet produced after centrifugation at pH 6.5 was soft and difficult to separate from the supernatant. The clarification with CTAB showed the greatest increase in purity and recovery of PspA4Pro (Table 1), and a hard and compact pellet was produced and easily removed, suggesting that the centrifugation time could be decreased.

Table 1 PspA4Pro recovery and purification factor under different clarification conditions

CTAB also decreased endotoxin concentration (Table 2), because this cationic detergent interacts with negatively charged contaminants and parts of LPS, which is the main endotoxin molecule.

Table 2 Evaluation of the PspA4Pro purification process a

Comparison between ion exchange and mix mode chromatography

Both anion and cation exchange chromatographic steps allowed higher PspA4Pro recovery with similar purification factor than mix mode chromatography (data not shown). The low recovery in mix mode chromatography was associated with the combined interaction modes of the resin: ionic and hydrophobic, which led to a very strong binding with PspA4Pro that was not eluted from the resin even with 1 M l-arginine. Due to this low recovery, mix mode chromatography was removed from the purification process and the sequence of chromatographic steps was limited to an anion exchange followed by a cation exchange. It is important to emphasize that the opposite sequence is not possible, since pH reduction of the Homogenate led to a huge loss of PspA4Pro in the clarification step (Table 1).

Cryoprecipitation

A precipitate was observed when the pH of the fraction eluted from Q-Sepharose was adjusted to 4.0 in order to apply it to the cation exchange chromatography, but PspA4Pro remained in the supernatant and the purity increased from 65.2 ± 10.7 to 93.7 ± 4.9%. The freezing/thawing of the Q-fraction without pH adjustment also precipitated contaminants, while PspA4Pro was recovered in the supernatant. When the clarification step was performed at pH 4.0, it resulted in the loss of almost all PspA4Pro (Table 1). On the other hand, PspA4Pro did not precipitate at pH 4.0 after Q-Sepharose. Hence, pH adjustment and freezing were combined in a cryoprecipitation step, which resulted in a great increase of purity and high step recovery (Table 2).

Final purification process and endotoxin removal

The complete purification process was established based on the steps selected above: cell rupture in a continuous high-pressure homogenizer, clarification in the presence of the cationic detergent CTAB, anion exchange chromatography, cryoprecipitation at pH 4.0, and cation exchange chromatography. At the end of the three purification processes, 35.3 ± 2.5% of the initial PspA4Pro was recovered, with 97.8 ± 0.36% purity. In addition, the purification process successfully removed more than 99.9% of the endotoxin unities (EU) from the product (Table 2). The clarification step removed 96.5% of the EU, while centrifugation of Homogenate without CTAB removed 32.1% of the EU. Almost all EU that remained after clarification with CTAB was removed during anion exchange chromatography and cryoprecipitation at pH 4.0. The increase in EU/microgram observed in the cation exchange chromatography resulted from the loss of PspA4Pro during this step (Table 2). However, a slight EU reduction from 4.51 to 4.48 log was detected (data not shown). Figure 3 shows a representative SDS-PAGE of all purification fractions and the Western Blot of the purified protein. PspA4Pro has no enzymatic activity; thus, Western Blot and the lactoferrin binding assay were performed to verify, respectively, if PspA4Pro antigenicity and biological activity were preserved after the purification process.

Fig. 3
figure 3

a SDS-PAGE of untagged recombinant PspA4Pro in the purification process: 1 molecular marker, 2 Homogenate, 3 CTAB Clarified fraction, 4 anion exchange chromatography fraction, 5 Cryo-pH 4.0 fraction, 6 cation exchange chromatography fraction, 7 purified PspA4Pro (10 µg). b Western Blot of different amounts of purified PspA4Pro: 8 2 μg, 9 5 μg, and 10 10 μg

PspA4Pro binding to lactoferrin and molecular mass

After mixing PspA4Pro and human lactoferrin in different proportions, distinct peaks were observed in HPSEC: one corresponding to the PspA4Pro-lactoferrin complex (RT = 12.56 min) and the other to excess PspA4Pro (RT = 13.05 min) or excess lactoferrin (RT = 15.28 min). These results show that purified PspA4Pro maintained the capacity to bind to human lactoferrin, suggesting that biological activity was unchanged (Fig. 4). Based on the mass of human lactoferrin (80 kDa) and PspA4Pro (43 kDa), our results suggest that two PspA4Pro molecules can bind to each lactoferrin molecule (Fig. 4b), which is not in accordance with a previous calculated stoichiometry of approximately 1:1 for this binding (Senkovich et al. 2007). It is also important to notice that the PspA4Pro binding capacity was not affected by the high ionic force of the mobile phase, which contains 300 mM NaCl and 500 mM phosphate buffer, indicating high affinity between the two molecules.

Fig. 4
figure 4

Binding of PspA4Pro to human lactoferrin. a Two micrograms per microliter purified PspA4Pro (gray) and 1 mg/mL human lactoferrin (black). b Mixture of PspA4Pro and human lactoferrin. Two micrograms per microliter PspA4Pro and 0.5 mg/mL lactoferrin (black curve). One microgram per microliter PspA4Pro and 1 mg/mL lactoferrin (gray curve). One microgram per microliter PspA4Pro and 2 mg/mL lactoferrin (bold curve)

In order to calculate the PspA4Pro molecular mass, two different standard curves were prepared: one using globular proteins and the other using linear dextran polymers of different sizes (curves not shown). The theoretical mass of PspA4Pro is 43 kDa (PROTEIN CALCULATOR 2013). However, only the molecular mass calculated by the dextran standard curve was close to this value. PspA4Pro displayed a retention time of 12.85 min (Fig. 4a), which corresponds to a molecular mass of 111 kDa when calculated based on the globular protein standard curve, and 49 kDa when calculated based on the linear dextran standard curve. PspA4Pro is rich in α-helix and has a coiled-coil structure, which gives it a lengthened shape, like a rod. Thus, the Stokes radius is larger and retention time is lower for PspA4Pro than for globular proteins, which results in the overestimation of PspA4Pro mass when calculated with the globular protein standard curve. These results are in accordance with previous works that characterized recombinant N-terminal fragments of PspA and found that those fragments had linear coiled-coil structures (Jedrzejas et al. 2000; Jedrzejas et al. 2001).

Secondary structure and stability of purified PspA4Pro

CD spectra were obtained to determine the secondary structure of purified PspA4Pro in the presence or absence of CTAB, during thermal treatment and at different pH values. In all spectra, two valleys were observed, with minimum values at 208 and 222 nm (Fig. 5), which suggests α-helix-rich protein, as expected for the N-terminal fragment cloned, as demonstrated for N-terminal fragments of PspA from different pneumococcal strains (Jedrzejas et al. 2000; Lamani et al. 2000; Jedrzejas et al. 2001; Haughney et al. 2013). In the presence of CTAB, the molar ellipticity at 222 nm increased from −45,881 to −19,739, which indicates a loss of α-helix structure (Fig. 5). This loss, however, seems to be reversible, since the spectrum in the absence of CTAB was obtained with a PspA4Pro that was treated with CTAB at the beginning of the purification process.

Fig. 5
figure 5

Circular dichroism spectra of purified PspA4Pro in the presence (dotted line) or absence (solid line) of CTAB. Molar ellipticity from 185 to 260 nm

The overlap of heating and cooling curves obtained at 222 nm during the thermal treatment, in which PspA4Pro was heated from 15 to 95 °C and cooled back to the initial temperature, indicates that although PspA4Pro loses α-helix structures during heating, it regains its original structure when cooled (Table 3; Fig. 6). In addition, a region where PspA4Pro presented an intermediate structure that was more resistant to temperature was observed between 40 and 50 °C (Fig. 6). In order to calculate the Tm, the temperature at which the protein loses half of its original structure, the curve was divided in two parts: one between 15 and 50 °C, another between 40 and 95 °C. The similarity of Tm values obtained for heating and cooling confirms that PspA4Pro recovered its original structure after thermal treatment (Fig. 6). This property could be advantageous for downstream processing, as thermal treatment could be employed to remove contaminant proteins.

Table 3 Secondary structure of PspA4Pro, before and after thermal treatment, and at different pH values according to Contin and CDSSTR algorithms
Fig. 6
figure 6

Changes in PspA4Pro molar ellipticity at 222 nm during heating (solid line) and cooling (dotted line) from 15 to 95 °C and from 95 to 15 °C, respectively, at 1 °C/min. The arrows indicate Tm during heating (h) and cooling (c) between 15 and 50 °C, Tmh1 = 36.781 °C, Tmc1 = 35.827 °C, and between 40 and 95 °C, Tmh2 = 67.501 °C, Tmc2 = 65.215 °C

PspA4Pro also maintained its secondary structure in a wide range of pH values according to Contin and CDSSTR algorithms (Table 3). The maintenance of secondary structure in face of pH changes suggests that the PspA4Pro loss during clarification at pH 4.0 (Table 1) was due to a dragging effect of other contaminants precipitating at acid pH. Therefore, precipitation steps based on pH variation are suited for PspA4Pro purification, as long as a certain level of purity has already been reached, as found here after anion exchange chromatography.

Discussion

In industrial bioprocesses, glucose-based mineral salt media are normally used and the cell metabolism is controlled by glucose limited fed-batch to reach high cell densities and high-level recombinant protein production (Krause et al. 2016). Due to the reduced acetate formation, our group has been using glycerol as carbon source in fed-batch cultivation, and lactose instead IPTG as inducer for recombinant protein production with important results (Carvalho et al. 2012; Horta et al. 2011, 2012, 2014; Vélez et al. 2014). Another strategy explored to improve protein production was the auto-induction system, which was originally developed for shaken flasks (Studier 2005) but was successfully adapted for bioreactor cultivation using an animal-free origin peptone to avoid the risks of bovine spongiform encephalopathy transmission (Vélez et al. 2014; Campani et al. 2016). Here, we show that the auto-induction system for bioreactor cultivation has the advantage of being simpler and faster than the glycerol limited fed-batch, with higher specific production and productivity (Fig. 2). Moreover, both cultivation strategies gave similar results in downstream processing (Table 2), which means that the purification process developed is robust enough to deal with differences in the raw material.

According to WHO (2009), diphtheria CRM197, a carrier protein commonly used in pneumococcal conjugate vaccines, should present more than 90% purity. Among the purification processes of recombinant protein candidates for vaccines, a purity ranging from 90 to 95% was described (Frace et al. 1999; Zhang and Pan 2005; Cheng et al. 2009). Thus, our process is among those with higher purity.

In the purification process developed, 35% of PspA4Pro was recovered and the purity increased from 33 to 97.8%, which translates into an overall purification factor of 2.96-fold. Hou et al. (2004) devised two protocols to purify His-tagged and untagged HIV-1 reverse transcriptase, obtaining similar activities and yields in both processes. Although we did not purify tagged PspA in this work, our group described purification processes for obtaining His-tagged PspA from families 1 and 2. The first one used anion exchange chromatography and IMAC Sepharose to increase the purity of PspA from 34.9 to 96.6%, while recovering 23% of the initial PspA (Barazzone et al. 2011). The second consisted of an anion exchange chromatography, metal affinity chromatography, and a cation exchange chromatography, increasing PspA purity from 22.1 to 96.5% with 22% recovery (Carvalho et al. 2012). Compared to the previous processes, the present one managed to obtain higher purity and better recovery of an untagged PspA employing two precipitation steps and two chromatographic steps. Moreover, this process is cheaper than the traditional purification using His-tag for metal affinity chromatography, since the elimination of the IMAC-Sepharose would reduce 4.5 times the process cost, without taking into account the costs of the steps to remove the tag.

Another important aspect of the purification process is the LPS removal in the clarification step. The detergent CTAB has been used for cell disruption and precipitation of polysaccharides and nucleic acids (Huang and Kim 2013; Tomanee et al. 2004; Lander et al. 2002). In addition, Panda and Chakraborty (1998) demonstrated that purified LPS can bind to cationic detergents forming mixed micelles. Yoshihisa (1999) employed CTAB for LPS removal in the purification process of RTX toxin family proteins in a process where these toxins were precipitated with detergent, leaving LPS in the supernatant. However, these studies did not evaluate the capacity of CTAB to precipitate LPS and other contaminants during protein purification.

Branston et al. (2015) described LPS removal from pre-purified Phage M13, which was precipitated three times with 2% (w/v) PEG 6000 and 500 mM NaCl with 97% yield and 90% endotoxin removal in the supernatant. After performing three new rounds of precipitation with 2% (w/v) PEG 6000, 500 mM NaCl, and 2% (v/v) Triton X-100, a 5.7 log endotoxin reduction was achieved. In contrast, we showed here that after only one precipitation step with CTAB, which was added to the crude raw material, the Homogenate, 100% of PspA4Pro was recovered in the supernatant and 96.5% of endotoxin was removed in the precipitate (Table 2). The purification process presented in this work reached a 5.3 log endotoxin reduction. In addition, Branston and colleagues do not state if the treatment with PEG and Triton X-100 removed contaminants other than endotoxins, while the clarification step described here increased PspA4Pro purity with high recovery, removed a large quantity of endotoxins, and helped centrifugation by forming a compact pellet. Characteristics of PspA, such as high solubility and a rod shape with well-defined electronegative and electropositive regions, might have contributed to the success of CTAB in the process. Removal of LPS in early steps of the purification process is especially important, as more free binding sites are available in the anion exchange resin to interact with PspA4Pro. It is noteworthy that due to CTAB positive charge, any remaining detergent is eliminated in the non-adsorbed fraction of Q-Sepharose.

Briles et al. (2000) immunized healthy human adults with two doses of 125 μg PspA in a phase 1 clinical trial. Thus, considering 42 weeks of work in 5-L auto-induction batches and 35% final recovery, the whole process described here would produce 4.46 million doses annually, which would be enough to immunize all Brazilian newborns in 2015 (IBGE 2016) with the same dose and immunization schedule. It is worth to note that the purification process was performed with 100 g of wet biomass, so in order to purify the whole biomass produced in 5-L (around 700 g wet biomass), the process have to be scaled seven times up, which means only 7 L of Homogenate. Therefore, this volume can be processed using fundamentally the same equipment as described here and no major problems are envisaged. In addition, scaling up ion exchange chromatography is extensively studied and easily done by increasing diameter or height of the column (Rathore and Velayudhan 2003; Al-Jibbouri 2006).

In conclusion, the auto-induction simple batch provides higher PspA4Pro specific production and productivity than the glycerol limited fed-batch. The purification process developed for untagged PspA4Pro is simple, easy to scale up, and adequate for vaccine production. The purification strategy adopted here allowed us not only to obtain a superior purity and yield in comparison to our previous works but also to eliminate >99.9% of the LPS. The protein displayed a CD spectrum that resembled previously reported spectra. The purified PspA4Pro was stable in a wide range of pH values and temperatures and maintained its capacity to bind human lactoferrin.