Introduction

The diversity of complex microbial populations and community structure are very important to ecosystem, where species richness and community evenness are considered the most important factors (Loreau et al. 2001; Wittebolle et al. 2009). High performance of a biological treatment system is often achieved through bioaugmentation of one or more strains of microorganisms. Bioaugmentation has been demonstrated to enhance the removal of many specific pollutants such as phenols, chloroaniline, chlorobenzoate, resin acid, etc. (Singer et al. 2005; Thompson et al. 2005; Damsa et al. 2007). These researches mainly focused on bioaugmentation of one strain, which has been documented well in a recent review (Fantroussi and Agathos 2005). However, there are few reports regarding the bioaugmentation of multiple strains of microorganisms. Bioaugmentation of a group of microorganisms may improve reliability of bioaugmentation (Jiang et al. 2007). The most important step for the newly introduced strains is to establish the positive mutualistic relationship with each other. Wittebolle et al. (2009) also confirmed that initial community evenness play a key role in preserving the functional stability of an ecosystem. However, to date, most of the related studies have focused mainly on treatment performance of bioaugmentation without considering the relationship of bacterial communities involved (Briones and Raskin 2003; Smith et al. 2003). Insights into the bacterial relationship of microbial communities are essential for improving treatment performance and stability against inhibitory compounds. However, it is very difficult to clarify the roles of each strain and the relationships among strains of a community.

Some researches used rRNA/rDNA-based molecular techniques to determine the diversity of complex microbial populations and community structure during the degradation process, especially in the field of pollutant biodegradation and biological investigations of bacterial communities (Calli et al. 2006; McMahon et al. 2001; Sekiguchi et al. 1999). These methods are powerful and accurate. However, some drawbacks such as long run times, high instrument expenses, and ignorance of the roles of each member and the interactions of the mixed microorganisms make the molecular techniques method unsuitable in some cases.

In contrast, statistical experimental design can analyze the interactions among the members of bacterial communities, and optimize community structure for improving treatment efficiency. Such a technique has been widely used in various fields such as in biochemistry for optimization of culture medium and conditions (Vohra and Satyanarayana 2002), in a microextraction for the determination of benzene, toluene, ethylbenzene and xylenes material (Gaujaca et al. 2008). However, to our knowledge, there are few reports on using the statistical method to investigate the effect of mixed culture inoculums on the performance of degrading synthetic wastewater.

The aim of this work is to apply statistical experimental design to investigate the role of each individual member and the mutualistic relations in a mixed culture consisting six bacteria during biodegradation of synthetic wastewater. We expect our experiment will facilitate better understanding of the mechanisms to construct a defined mixed culture consisting of specific microorganisms with high capability.

Materials and methods

Microorganism, nutrient medium, and solutions

Six strains of bacteria, i.e., Agrobacterium sp., Bacillus sp., Enterobacter cloacae, Gordonia, Pseudomonas Stutzeri, and Pseudomonas putida were selected based on an established mixed culture in a portable wastewater treatment system. The stock cultures were stored at 4°C. The nutrient medium contained 3 g/l of beef extract, 5 g/l of peptone, and mineral salt medium (MSM) at pH 7. The composition of MSM in g/l was KH2PO4 (0.42), K2HPO4 (0.375), (NH4)2SO4 (0.244), NaCl (0.015), CaCl2·2H2O (0.015), MgSO4·7H2O (0.05), and FeCl3·6H2O (0.054). A phosphate buffer (pH 7) was prepared by dissolving NaCl (8 g/l), KCl (0.2 g/l), K2HPO4 (1.15 g/l), and KH2PO4 (0.2 g/l) in deionized water (Millipore, Milli-Q), and was used for diluting concentrated cells. All chemicals were analytical reagent grade.

Bacteria cultivation and experimental procedure

Six bacteria were activated at 30°C in the nutrient medium. The activated cells in the late exponential phase were then harvested as inoculum, respectively. The cells collected after centrifugation (6,000 rpm) for 10 min were resuspended in phosphate buffer and re-centrifuged. After cleaning, six inoculums were separately prepared by inoculating the activated six strains into the synthetic domestic wastewater to give an optical density at 600 nm (OD600), cells concentrations (L), 0.1 or 0.3. In each fractional design experiment (detailed in “Statistical methods” section), each 20-ml inoculum containing L = 0.1 or 0.3 bacteria was added aseptically to Erlenmeyer flasks (250 ml), the final volume was 120 ml. Then the flasks were capped with cotton plugs and placed in a shaker controlled at 100 rpm and 30°C. The pH of the mixture was kept at 7.0 during the course of all experiments by adding NaHCO3. Samples were withdrawn every hour, and the OD600 of cells (L) and TOC were measured as described below.

Analysis of cells and substrates

The concentration of biomass in the sample, X (mg VSS/l), was analyzed by measuring OD at 600 nm, L using an UV/Vis spectrophotometer (Pharmaspec UV-1700, SHIMADZU) and 1-cm path length square quartz cuvettes with the synthetic domestic wastewater as the blank. Samples with 0.1 OD corresponds to about VSS of 0.039 g/l. In the following experiment, X was analyzed by measuring L, then converted it into VSS. The samples exceeding 0.8 OD were suitably diluted with the culture medium so that the Beer–Lambert law is applied.

Total organic carbon (TOC) was analyzed on filtered samples (0.45 μm Magna nylon membrane filters, SELECTRON, USA) with a Model 100 Benchtop TOC Analyzer (Thermo Electron Corporation, USA) using the combustion/non-dispersive infrared gas analytical method. Triplicate analyses were performed for all assays.

Kinetic analysis

The specific growth rate of cell, μ (h−1): with the hypotheses of small initial cell concentration and of large initial substrate concentration (S 0), μ in a batch reactor was determined according to conventional procedures for kinetic analysis from batch culture data under the assumption of first-order growth kinetics with respect to biomass concentration in the exponential phase. In order to avoid the influence of mortality, specific growth rate of cell, μ is determined by measuring the initial growth rate and is calculated as:

$$ \mu = 1/X{\text{ d}}X/{\text{d}}t = {\text{d ln}}X/{\text{d}}t $$
(1)

where X is the biomass concentration (mg VSS/l). The μ value is determined based on the exponential phase of growth curve. Therefore, the specific substrate utilization rate, q (g TOC/(d-gVSS)), can be expressed as:

$$ q = \left( {{\frac{{{\text{d}}S}}{{{\text{d}}t}}}} \right)\left( {\frac{1}{X}} \right) = \left( {\frac{1}{X}} \right)\left( {{\frac{{{\text{d}}X}}{{{\text{d}}t}}}} \right)\left( {{\frac{{{\text{d}}S}}{{{\text{d}}X}}}} \right) = \left( {{\frac{{{\text{d}}\ln X}}{{{\text{d}}t}}}} \right)\left( {\frac{1}{Y}} \right) = {\frac{\mu }{Y}} $$
(2)

where Y is the yield, biomass produced per unit mass of substrate consumed, determined as:

$$ Y = {\frac{{X - X_{0} }}{{S_{0} - S}}} $$
(3)

where X 0 and S 0 represent the initial biomass and substrate concentrations, respectively, and S is the residual wastewater concentration (mg COD/l).

Statistical methods

To obtain the optimized community constitution for biodegradation, a fractional factorial design (FFD) was employed to evaluate the preliminary significance of the factors, as well as the interactions between them. The cell concentrations measured from OD600 of the six microorganisms, L were the study factors and q was as the response. All factors were evaluated at two levels, low (denoted as −1) and high (denoted as +1). The high and low levels of L were selected to be 0.1 and 0.3 OD600, respectively. A three replicate center point (level 0, L = 0.2) was included in the design to estimate the experimental variance and check the loss of linearity between the levels chosen for each variable (curvature check). All analyses were performed using MINITAB Software (Release 14, Minitab Inc., State College, PA).

A full factorial design for the six bacteria (factors) requires 64 (26) experiments to evaluate the whole set of main effects as well as all interaction effects. The main effects and the lower-order interactions, however, are usually the most significant terms. In this work, a 26−2 fractional factorial design consisting of 16 factorial runs along with three experiments at the center of the design for the analysis of curvature in each replicate was performed as a screening process to identify the significant factors and interactions.

Once critical factors were identified via screening and significant gross curvature has been detected in the design space, the central composite design (a response surface methodology (RSD) based on a highly fractionalized three-level factorial design) was proceeded to obtain a quadratic model, consisting of trials plus a star configuration to estimate quadratic effects and central points to estimate the pure process variability and reassess gross curvature, with the overall specific TOC degradation rate as response. The central composite design used was generated by MINITAB. According to this design, 30 experiments were conducted containing six replicates at the center point for estimating the purely experimental uncertainty variance in triplicates. Each variable was designated as ±2, ±1, and 0. The significant variables indicated by the Pareto chart (obtained from multiple linear regression and analysis of variance) were optimized using a central composition design. A quadratic equation that correlates the variables and the response was obtained, and the critical points of this equation were estimated through this mathematical relationship.

Results and discussion

Screening design

The characterization of individual strains has been presented in our previous published work (Chen et al. 2009). The screening design was to identify the major effects and mutual biodegradation effects of six strains of bacteria, i.e., Agrobacterium sp. (R 1), Bacillus sp. (R 2), E. cloacae (R 3), Gordonia (R 4), P. Stutzeri (R 5), P. putida (R 6). Table 1 shows the design, along with the results of three replicates of FFD for the overall specific rate of substrate utilization q (gTOC/(d-gVSS)). The q varied considerably from 1.92 to 5.36 g TOC/(d-gVSS) under different composition of bacteria in the cultivation. It suggests that these variables significantly affected bacterial growth and TOC degradation. A three-replicate center point (level 0, L = 0.2) was included in the design to estimate the experimental variance and check the loss of linearity between the levels chosen for each variable (curvature check). The order in which the experiments were performed was selected at random, making this a completely randomized design.

Table 1 Experimental design and results of the three replicates of the overall specific substrate utilization rate

The experimental values of q in the triplicate runs of the FFD were used for statistical analysis of the design. The mathematical correlation between analytical response and the variables was performed by a linear regression and curvature check. Figure 1 shows the Pareto chart.

Fig. 1
figure 1

Fractional factorial design Pareto graph. Response is the overall specific substrate utilization rate (q)

From Fig. 1, only Gordonia (R 4) produced significant positive effect on q as a single strain, whereas the other strains were not significant or had negative effects on their own within the levels and conditions tested. In contrast, the positive interaction among some strains should be given consideration due to its synergistic effects. The composition of Agrobacterium sp. and E. cloacae (R 1 R 3) produces the largest effect on q. Other interactions among the strains, such as Agrobacterium sp., and Gordonia (R 1 R 4), Agrobacterium sp., Bacillus sp and Gordonia (R 1 R 2 R 4) and Agrobacterium sp., P. putida (R 1 R 6), were also significant based on a 5% level of significance. The curvature P value of 0.965 indicated that the mean are not significantly different from the average of the center points and the average of the factorial points. As a result, the optimum point should be near or within the experimental design space.

Base on the above results, a more detailed study using a response surface method, central composition design, was conducted with strains R 1, R 3, R 4, R 6, considering their significant impact on q. Strains R 2 and R 5, which did not have significant, were excluded.

Optimization with central composite design and response surface analysis

The objective of this set of experiment is to build a second-order (quadratic) model for the response variable (q). The four main effects variables, Agrobacterium sp. (R 1), E. cloacae (R 3), Gordonia (R 4) and P. putida (R 6), were further investigated using a response surface design within the design space introduced in the FFD.

RSD is appropriate when the optimal region for running the process has been identified. A central composite design with eight-star points and five replicates at the center point for each of four factors was carried out to further optimize the microbial community. To make the design rotatable, the axial distance at the higher star points was assigned a value of 2, while the axial distances of these factors at the lower star point were assigned to be −2. Table 2 showed the experimental design and results.

Table 2 Central composite design with four variables and the corresponding response (the overall specific substrate utilization rate)

The coefficients, t test and P value for the linear, quadratic and interaction terms are shown in Table 3. The results of Analysis of Variance (ANOVA) are shown in Table 4.

Table 3 Estimated effects and corresponding coefficients of the overall specific substrate utilization rate, q
Table 4 Analysis of variance for the overall specific substrate utilization rate, q

The four variables tested expressed strong linear effect on the response (P = 0.001) (Tables 3, 4), whereas Agrobacterium (R 1) displayed negative effect while E. cloacae (R 3), Gordonia (R 4) and P. putida (R 6) showed positive effect. The four strains all presented significant negative quadric effect with P value of 0.000 on response.

Significant negative interactions were noted between Agrobacterium and Gordonia (R 1 R 4), E. cloacae and Gordonia (R 3 R 4), Gordonia and P. putida (R 4 R 6) based on a 5% level of significance (Table 3). The significant negative effect implied that the specific substrate utilization rate initially increased to the maximum, and finally decreased at the even higher levels of variables. The results were different from our FFD experiments (“Screening design” section), which showed that significant interactions between Agrobacterium sp. and E. cloacae (R 1 R 3), Agrobacterium sp. and Gordonia (R 1 R 4), Agrobacterium sp., Bacillus sp. and Gordonia (R 1 R 2 R 4) and Agrobacterium sp. and P. putida (R 1 R 6), were significant. This can be explained by the fact that FFD experiments were performed with six strains while RSD was with four strains, the actual interaction between strains may be more complicated than that we have reported here.

Model F value is calculated as ratio of mean square regression and mean square residual. The high F value and the associated low P values indicated that the strong significance of the model predictors based on the experimental results. The four variables tested expressed strong effect on the regression (Table 4). There is high significance in the linear, square and interaction terms existed by the four strains (Table 4), indicating that the degradation of synthetic domestic wastewater was primarily controlled by four selected strains.

By applying the multiple regression analysis on the experimental data, a second-order polynomial model in coded unit explains the role of each variable and their second-order interactions in q.

$$q = 5.47 - 0.63L_{1} + 0.26L_{3} + 0.37L_{4} + 0.22L_{6} - 0.47L_{1}^{2} - 1.07L_{3}^{2} - 0.86L_{4}^{2} - 0.74L_{6}^{2} - 0.19L_{1} L_{3} - 1.06L_{1} L_{4} + 0.16L_{1} L_{6} - 0.37L_{3} L_{4} + 0.21L_{3} L_{6} - 0.44L_{4} L_{6}$$
(4)

where L i is cells concentration, i = 1, 3, 4, 6 for 4 strains respectively.After neglecting the insignificant terms based on 5% level of significance, it can be expressed as following:

$$ q = 5.47 - 0.63L_{1} + 0.26L_{3} + 0.37L_{4} + 0.22L_{6} - 0.47L_{1}^{2} - 1.07L_{3}^{2} - 0.86L_{4}^{2} - 0.74L_{6}^{2}- 1.06L_{1} L_{4} - 0.37L_{3} L_{4} - 0.44L_{4} L_{6}$$
(5)

The results by RSD showed that the coefficient of determination (R2) was 97.4%, which ensured satisfactory adjustment of the quadratic model to the experimental data and indicated that 97.4% of the variability in the response variables could be explained by the second-order polynomial prediction equation given in Eq. 4. The value of adjusted determination coefficient (R 2adj ) was calculated to be 94.7%, which is also very high, indicating a high significance of the model. Lack of fit P value of 0.017 and low pure error demonstrated the experiments conducted were precise and reliable within the range of variables employed.

Response surface plot provides a method to predict the yields of active substances for different values of the test variables and the contours of the plot are helpful in identification of the type of interactions between test variables.

The two-dimensional contour plot and three-dimensional response surface from the calculated response surface in the overall specific substrate utilization rate was plotted in Fig. 2, which further explains the results of statistical analyses. Each subplot presented the effect of two variables on q, while the other two variables were held at zero level.

Fig. 2
figure 2

Two-dimensional (2-D) contour plot and three-dimensional (3-D) response surface curve predicted by the full quadratic model of a Agrobacterium sp. and Gordonia (R 1 R 4), b E. cloacae and Gordonia (R 3 R 4), and c Gordonia and P. putida (R 4 R 6)

The circular contour plots of response surfaces suggest that the interaction is negligible between the corresponding variables. The significant perfect interaction between the independent variables can be shown when elliptical or saddled contours are obtained (Muralidhar et al. 2001). The 2-D contour plots of R 1 R 4, R 3 R 4 and R 4 R 6 had remarkable interaction, and the 3-D response surface curves of R 3 R 4 and R 4 R 6 were convex in nature (Fig. 2), which suggested that there were optimized concentrations of R 3 R 4 R 6 strains. However, there were not convex in R 1 R 4 response surface curve. At low Gordonia (R 4) concentration, q increased as increase in Agrobacterium (R 1) concentration, and decrease at high Gordonia (R 4) concentration.

The optimum concentrations of the variables were calculated from the data obtained using RSM system. The optimal values of each test component in coded units were as follows: R 1 = −2.00, R 3 = −0.054, R 4 = 1.63 and R 6 = −0.63. Their corresponding L values were L 1 = 0, L 3 = 0.146, L 4 = 0.363 and L 6 = 0.137. At this point, the model predicted q max of 6.88 g TOC/(d-gVSS), and the composite desirability is 0.81.

Validation of the experimental design

To understand the composite effects of the six strains of bacteria on the TOC degradation performance and to confirm the model adequacy for predicting the maximum overall specific rate of substrate utilization, additional three batch comparison experiments were performed.

In each experiment, six inoculums were separately prepared by inoculating the activated six strains into the synthetic domestic wastewater to give the specific applied level of cell concentration L, OD600 (Run I: L 1 = 0, L 3 = 0.146, L 4 = 0.363 and L 6 = 0.137; Run II: L 1 = 0, L 3 = L 4 = L 6 = 0.162; Run III: L 1 = L 2 = L 3 = L 4 = L 5 = L 6 = 0.162). Then each 30-ml (four strains) or 20-ml (six strains) inoculums containing specific cell concentration was added aseptically to Erlenmeyer flasks (250 ml), which made all three batch experiments have the same initial cell concentration (L initial = 0.162) and the same volume (120 ml). The first batch experiment was performed with the optimal OD600 values of the selected three strains, the strain concentration are the same in the second (four strains) and third (six strains) control experiments.

Although the model was developed using q as the response, some other parameters such as the initial cell density (L intial), the time to reach stationary phase (i.e., at stable TOC concentration, T s), the maximum total organic carbon (TOC) removal efficiency (R max) and the yield (Y) were observed to evaluate the biodegradation process. The experimental results are showed in Table 5.

Table 5 Degradation of the verification experimentation

It is found that although L initial of the three batch experiments were the same (0.162), the shortest T s (3.5 h) value, the highest R max (92%) and q (7.13 ± 0.12 g TOC/(d-gVSS)) were achieved by the optimum composite of three strains (Run I).

The three replicates with the optimal concentration composition of three strains (Run I) yielded an average q of 7.13 ± 0.12 g TOC/(d-gVSS), which was closer to the predicted response (6.88 g TOC/(d-gVSS)). The value was 1.5 and 1.97 times greater than those microbial communities of four (Run II) and six strains (Run III) without optimization. Therefore, statistical analysis and the application of optimal conditions enhanced the overall specific rate of substrate utilization as well as other biodegradation parameters.

Interestingly, the q and R max of both three and four strains (Run I, II) were higher than those of six strains (Run III). That implied that the biodegradation process inoculated with R 3 R 4 R 6 performed better than that composed of six strains. This suggests that the optimizing the inoculums is important for improving treatment performance.

The good agreement between the predicted and experimental results validated the model and reflected the existence of an optimal point. The results derived from this study demonstrate that the FFD and RSD are powerful tools for elucidating the individual factors and synergistic relationship among multiple strains and optimizing the composition microbial community in bioaugment treatment.

Conclusions

Statistics-based experimental design methods proved to be valuable tools in optimizing the strains composition for synthetic domestic wastewater degradation. FFD was employed in the first step to screen out the significant factors affecting substances degradation by six strains of microorganisms. Central composite designs and response surface analysis revealed that Agrobacterium sp., E. cloacae, Gordonia and P. putida had the most significant linear, square and interaction terms effects on the response, q. The optimum values of each strains concentration were L 1 = 0, L 3 = 0.146, L 4 = 0.363 and L 6 = 0.137. At this point, the model predicted q of 6.88 g TOC/(d-gVSS). The actual q is 7.13 ± 0.12 g TOC/(d-gVSS), which was 1.5 and 1.97 times higher than those without optimization. The results derived from this study demonstrated that the FFD and RSM are powerful tools for optimizing individual factors.