Introduction

Pigments are one of the most important external qualities of foods that determine their acceptance by consumers. Currently pigments of various kinds and forms have been used as additives or supplements in food industries, cosmetics, pharmaceuticals, livestock feed and other applications (Boo et al. 2011). Food industry adds pigments to manufactured products in order to intensify their character. Color is an important factor for appeal, a major criterion of identification, indicator of quality and freshness and is a determinant of consumer acceptability, image, market size and value. In the past, synthetic pigments were used indiscriminately to manipulate food color. However, synthetic pigments are hardly nutrient, and have been blamed for toxic to different extents and some are found to be carcinogenic (Zhang et al. 2006). The use of synthetic food colourants is restricted by legal regulations of both international and national authorities (Masár et al. 1996). Hence, there is a growing demand for eco-friendly/non-toxic colorants, specifically for health sensitive applications such as coloration of foods and dyeing of child textile and leather garments.

Extraction of bioactive compounds from permeable solid plant materials using solvents constitutes an important step in the manufacture of phytochemical-rich products. The application of this low-cost technology to obtain molecules to be used as food additives or nutraceutical products is an appropriate strategy for the exploitation of some under-utilized plants (Prakash Maran and Manikandan 2012). Indian blackberry (Syzygium cumini L.), commonly known as jamun fruit and it is an under-utilized minor fruit of the Indian subcontinent. The plant belongs to Myrtaceae family and grows naturally in clayey loamy soil in tropical as well as sub-tropical zones of Indo-gangetic plains. The fruit concentrate of jamun has a very long history of use for various medicinal purposes and currently it has been used for the treatment of chronic diarrhea and other enteric disorders, including its use as an antimicrobial agent (Achrekar et al. 1991; Migliato 2005). These beneficial effects are most probably due to the presence antioxidant properties of anthocyanin pigments. Anthocyanins are considered as potential replacements for synthetic colouring agent because of their bright attractive colors and water solubility (Kong et al. 2003). The incorporation of anthocyanin in to the food systems not only increases the color intensity but also increases the medicinal and therapeutic values of the food products.

Development of an economic and efficient conventional extraction method for the extraction of biologically active compounds from jamun fruit provides an opportunity to replace the synthetic colour. From the analysis of lierature it is found that, no publication is reported to the extraction of anthocyanin and colors from jamun fruit. Hence in the present study an attempt has been made to investigate and optimize the individual and the interactive effect of process variables such as extraction temperature, time and solid–liquid ratio on the maximum extraction yield of total anthocyanin and colors from jamun fruit pulp using Box-Behnken response surface design coupled with Derringer’s desired function methodology.

Materials and methods

Raw materials and chemicals

Freshly harvested jamun fruits (Syzygium cumini– with similar maturity and weight) were used as raw materials and they were procured from the local venders near Erode, TamilNadu, and were stored at 4 °C prior to the experiments. Cyanidin-3-glucoside (HPLC grade) was purchased from Sigma chemicals, Mumbai. Potassium chloride (AR grade) and sodium acetate (AR grade) were supplied by Merck chemicals, Chennai.

Aqueous extraction of pigments and colors

Fruits were washed thoroughly in running tap water to get rid of the impurities adhered over its surface. Pulp of the fruits were peeled manually and pulp was dried in the hot air oven at 40˚C until it attains constant weight. The dried pulps were ground using a grinder and sieved (150 mesh) to < 1 mm particle size. Because, size of the particle is considered as an important variable in an extraction process. Previous studies have shown that, the finest particles results with the highest extraction yield, because of the increase in exchange surface area and decrease in path length travelled by the solute to reach the surface, which in turn helps to reduce the extraction time (Pinelo et al. 2005; Silvaa et al. 2007). Accordingly a dried milled fruit sample (particle size of < 1 mm) was used in this study, which would not hinder the experimental work at the filtration step. Aqueous extraction process was performed in a temperature controlled water bath and distilled water was used as a solvent under different aqueous extraction conditions (extraction temperature (40–60 ˚C), time (20–100 min) and solid–liquid (SL) ratio (1:5–1:15 g/ml). Different solid–liquid ratios of samples (1:5, 1:10 and 1:15) were taken in 150 ml Erlenmeyer flasks and it was incubated at different temperature (40, 50 and 60 ˚C) in a temperature controlled water bath. Extracts were taken at different time intervals (20, 60 and 100 min). During the extraction process, Erlenmeyer flasks were covered with a plastic wrap in order to prevent the evaporation of solvent. After extraction for a selected time, the mixture was centrifuged at 6500 rpm for 15 min (Remi R-24 Centrifuge, India) and supernatant liquid were collected for the determination of pigments and colors from jamun fruits. Experiments were performed randomized in order to minimize the effects of unexplained variability in the observed responses due to extraneous factors. All experiments were done in triplicates and the average values were recorded and are shown in Table 1.

Table 1 Un-coded and coded box-behnken experimental design matrix with experimental and predicted response values

Determination of total monomeric anthocyanin content (TMAC)

pH differential method (Maran et al. 2013a) was used to estimate the total monomeric anthocyanin content (TMAC) of the extracts. The method was based on the anthocyanin structural transformation that occurs when pH changes. Aliquots of the extracts were diluted with pH 1.0 and 4.5 buffers using potassium chloride (0.025 M) and sodium acetate (0.4 M) respectively, and then placed in the dark conditions for 1 h. The absorbance of each solution was recorded using a spectrophotometer (Shimadzu UV-1800, Kyoto, Japan) calibrated with distilled water as the blank at the wavelength of 530 and 700 nm. Difference in absorbance between pH values and wavelengths was calculated using the formula:

$$ A={\left({A}_{530 nm}-{A}_{700 nm}\right)}_{pH1.0}-{\left({A}_{530 nm}-{A}_{700 nm}\right)}_{pH4.5} $$
(1)

The total monomeric anthocyanin content was obtained by

$$ TMAC=\frac{A\times MW\times DF\times 1000}{\varepsilon \times 1} $$
(2)

where, A is the absorbance; MW is the molecular weight of cyanidin-3-glucoside (449.2 g mol−1); DF is the dilution factor; ɛ is the molar absorptivity of cyanidin-3-glucoside (26,900 L cm−1 mol−1) and 1 is path length.

Analysis of total color in solution (TCS)

After extraction process, the determination of mass of total color in solution (TCS) was analyzed according to the method described by Sinha et al. (2012). The obtained extracts were filtered using filter paper (What man No.1, England) and filtered extracts were dried in a hot air oven (NSW, India) until all the water gets evaoparated. The dishes were then cooled in a desiccator and weighed. The drying, cooling and weighing procedure was repeated to get a constant weight and thus weight of the extract was determined. The weight of the colorant extract obtained per gram of the jamun fruit pulp was then calculated.

Experimental design

In this study, Box-Behnken response surface experimental design (BBD) with three factors at three levels was used to optimize and investigate the influence of process variables such as extraction temperature (40 - 60 ˚C), time (20–100 min) and solid–liquid ratio (1:5–1:15 g/ml) on the TMAC and TCS of the extracts. Process variables and their ranges were determined based on the single factor experimental analysis. After selection of process (independent) variables and their ranges, experiments were established based on a BBD and the complete design consists of 15 experiments with three centre points (used to estimate the experimental error). The total number of experiments was calculated from the following equation (Prakash Maran et al. 2013a).

$$ N=2K\left(K-1\right)+{C}_0 $$
(3)

where, K is number of factors and C0 is the number of central point. For statistical calculations, the process variables were coded at three levels (-1, 0 and +1) and the coding was done by the following equation (Prakash Maran et al. 2013a).

$$ {x}_i=\frac{X_i-{X}_z}{\varDelta {X}_i}i=1,2,3...........k $$
(4)

where x i , is the dimensionless value of an independent variable; X i , the real value of an independent variable; X z , the real value of an independent variable at the centre point; and ∆X i , step change of the real value of the variable i.

For predicting the optimal point after performing experiments, a second-order polynomial equation was fitted to correlate the relationship between independent variables and responses, which accounts for variations caused by linear, quadratic and interactive effect of the process variables. The mathematical form of second-order polynomial equation is given below

$$ Y={\beta}_0+{\displaystyle \sum_{j=1}^k{\beta}_j{X}_j+{\displaystyle \sum_{j=1}^k{\beta}_{jj}{X}_j^2}}+{\displaystyle \sum_i{\displaystyle \sum_{<j=2}^k{\beta}_{ij}{X}_i{X}_j+{e}_i}} $$
(5)

where, Y is the response; Xi and Xj are variables (i and j range from 1 to k); β0 is the model intercept coefficient; βj, βjj and βij are interaction coefficients of linear, quadratic and the second-order terms, respectively; k is the number of independent parameters (k = 3 in this study); and ei is the error (Prakash Maran et al. 2013b).

Statistical analysis

The experimental data was analyzed by multiple regression analysis through the least square method. Sequential sum of squares and model summary statistics was carried out on the experimental data to evaluate the adequacy of various models (linear, interactive (2FI), quadratic and cubic). Regression coefficients of linear, quadratic, and interaction involved in the model and their effects were analyzed by Pareto analysis of variance (ANOVA) and ANOVA tables were generated. All the terms in the model were tested by student’s F-test and significance of the F-values at probability levels (p ≤ 0.05) were analyzed. The experimental data were analysed with various statistical analysis such as sum of squares (SS), determination coefficient (R2), adjusted determination of coefficient (R 2 a ), predicted determination of coefficient (R 2 p ) and coefficient of variation (CV) to reflect the statistical significance of the fitted polynomial equation. All the statistical analyses were done with the help of Stat ease Design Expert 8.0.7.1 statistical software package (Stat-Ease Inc., Minneapolis, USA). After fitting the data to the models, the models were used for the construction of three dimensional (3D) response surface plots to predict the relationships between independent and dependent variables.

Determination of optimal conditions

After analyzing the polynomial equation depicting the effect of independent variables on the responses, optimization process was carried out by Derringer’s desired function methodology (Maran et al. 2013a). This numerical Optimization technique will optimize any combination of one or more goals; these may be either process variables or responses. The possible goals are: maximize, minimize, target, within range, none (for responses only) and set to an exact value (factors only). In this study, goals of the process variables were selected as in a range and the response goals were selected as maximize. A weight factor of 1 was chosen for the response, which can be used to adjust the shape of its particular desirability function. The default value of one creates a linear ramp function between the low value and the goal or the high value and the goal. Default importance of 3 was chosen for the response, which can represent the goals to be equally important.

Validation of optimized conditions

After optimization, adequacy of the model equation for predicting the optimum response values was validated under the optimal conditions. Triplicate verification experiments were performed under the optimal conditions and the average value of the experiments was compared with the predicted values of the developed model equations.

Results and discussions

Natural pigments and colors from jamun fruit pulps were extracted by aqueous extraction method and the extraction process was carried out at different extraction temperatures, extraction times and solid–liquid ratio with an explicit objective of studying the effect of process variables and determining the optimum extraction conditions on the extraction yield of pigments and colors. In this study, three factors with three levels BBD was used to evaluate the effect and optimize the process variables on the responses. A total number of 15 batch experiments including three centre points were carried out in triplicates using statistically deigned experiments and the results (means values) are shown in Table 1.

BBD analysis

The experimental data was analyzed by two different tests namely the sequential model sum of squares and model summary statistics in order to obtain regression models and decide about the adequacy of various models (linear, interactive, quadratic and cubic) to represent the extraction of pigments and color from jamun fruit pulp. The results are listed in Table 2. From the Table 2, it is found that, linear and interactive (2FI) models are exhibited lower R2, adjusted R2, predicted R2 and also having high p-values, when compared with quadratic model. Cubic model is found to be aliased. Therefore the quadratic model is chosen to describe the effects of process variables on the extraction of natural pigments and colors from jamun fruit pulp. Further more, analysis of variance (ANOVA) is also used to check the adequacy of quadratic model.

Table 2 Adequacy of the model tested for the responses

Model fitting

The results obtained from BBD experiments were evaluated by multiple regression analysis method for total extraction of natural pigments and colors from Jamun fruit pulp. An empirical relationship between the response and independent variables has been expressed by a second-order polynomial equation with interaction terms was fitted between the experimental results obtained on the basis of BBD, which will help to predict the extraction efficiency of different sets of combinations of three process variables on the responses. Two empirical models were developed to understand the interactive correlation between the responses and process variables. The final model obtained in terms of coded factors is given below

$$ TMAC=9.59+0.21{X}_1+0.40{X}_2+0.74{X}_3-0.30{X}_1{X}_2-0.87{X}_2{X}_3+0.24{X}_2{X}_3-1.03{X}_1^2-0.31{X}_2^2-0.22{X}_3^2 $$
(6)
$$ TCS=9554.94+259.75{X}_1+465.97{X}_2+686.74{X}_3-368.7{X}_1{X}_2-926.71{X}_2{X}_3+192.58{X}_2{X}_3-909.14{X}_1^2-249.77{X}_2^2-232.04{X}_3^2 $$
(7)

Adequacy of models

The adequacy of models was evaluated by constructing diagnostic plots such as predicted versus actual and normal probability plots for the experimental data obtained from this study and it is shown in Fig. 1. From the Figs. 1 a-b, it is observed that, the data points on this plot lie very close to the diagonal line, because residuals for the prediction of each response is minimum and it indicated a good adequate agreement between experimental data and the data predicted by the developed models (Maran et al. 2013b). Normal probability plot is a suitable graphical method for judging residuals normality. The observed residuals are plotted against the expected values, as they lie reasonably close on a straight line and show no deviation of variance (Fig. 1 c-d). This confirms the normal distribution of the observed data and adequacy of the developed models.

Fig. 1
figure 1

Diagnostic plots for the model adequacy

Statistical analysis

Pareto analysis of variance (ANOVA) and multiple regression analysis were used to analyze the observed data. Significance of the developed model equations was evaluated by their corresponding F and p-values and it is shown in Table 3. The higher model F values (100.05 for TMAC and 284.80 for TCS) and lower p-values (p < 0.0001) demonstrated that, the developed model was highly significant. The goodness of fit of the model was evaluated by the determination co-efficient (R2), adjusted determination co-efficient (R 2 a ), predicted determination co-efficient (R 2 p ) and co-efficient of variance (CV) and signal to noise ratio (S/N). The high R2 values (0.9945 for TMAC and 0.9981 for TCS) revealed that, the models are statistically significant and only small variations (0.0055 for TMAC and 0.0019 for TCS) are not explained by the models (Maran et al. 2013c). The value of R 2 p (0.9125 for TMAC and 0.9689 for TCS) is in reasonable agreement with the value of R 2 a (0.9845 for TMAC and 0.9945 for TCS) and also shows that the form of the model chosen to explain the relationship between the factors and the response is well-correlated (Maran et al. 2013d). Lower CV values (1.41 for TMAC and 0.82 for TCS) clearly stated that, the deviations between experimental and predicted values are low and also showed a high degree of precision (Maran et al. 2013e) and reliability of the conducted experiments (Prakash Maran et al. 2013c). In this present study, the ratio was S/N found to be > 38 (Table 3), which indicates the best fitness of the developed models (Maran et al. 2014).

Table 3 ANOVA analysis and statistical parameters of the responses

Effect of process variables

Three dimensional (3D) response surface plots were plotted from the developed models in order to study the individual and interaction effect among process variables on the responses (Maran et al. 2013f) and also used to determine the optimal condition of each factor for maximum extraction yield of natural pigments and colors of jamun fruits. In this study, the model has more than two factors and hence 3D plots are drawn by maintaining one factor at a constant level (in turn at its central level), whereas the other two factors are varied in their range (Prakash Maran et al. 2013d).

Effect of extraction temperature

Experiments were carried out to study the effect of temperature (40, 50 and 60 ˚C) over the extraction yield of pigments and colors from jamun fruit pulps. From the results, it was observed that, the yield was increased linearly with increasing temperature from 40–50 ˚C (Fig. 2 a-b and 3 a-b). Increase in temperature up to 45 ˚C would cause softening of the plant tissue, accelerate the molecular movements and penetration of solvent in to the plant material and enhances the solubility and diffusivities of the pigments and colors into the solvent and increases the extraction yield (Yong et al. 2006). However, temperature beyond 45˚C resulted in lower recovery of pigment and color, due to heat sensitive anthocyanin pigment.

Fig. 2
figure 2

Effects of process variables on the extraction yield of TMAC

Fig. 3
figure 3

Effects of process variables on the extraction yield of total color in solutions

Effect of extraction time

Extraction time is one of the important factors for extraction of pigments and colors from plant materials. It is associated with the final concentration of pigments and colors and the efficiency of extraction. From the results, it was found that, the extraction yield was increased rapidly at the initial stages of extraction, but ends with a slower extraction rate. This phenomenon could be explained by that, the mass transfer rate is high at intial stage (Fig. 2b, 3b) due to high concentration gradient during the diffusion of solutes from internal material into the solution, which promotes the transmission of anthocyanin and colors and increases the extraction rate..

Effect of solid–liquid ratio

The solvent quantity is also an important factor to influence the extraction yield. From the results, it was observed that, extraction yield was increased with the increasing liquid–solid ratio (Fig. 2 b-c and 3 b-c). Amount of solute required to saturate the liquid increases with increasing quantatity of the liquid and hence extraction of anthocyanin and colour yield are increases with liquid–solid ratio. Similar results were reported during extraction the phenolic compounds from milled berries (Cacace and Mazza 2003), extraction of anthocyanin from red grape pomace (Mantell et al. 2002) and extraction of andrographolide from plants (Wongkittipong et al. 2004).

Determination and validation of optimized conditions

Simultaneous optimization of the multiple responses was carried out using Derringer’s desired function methodology. This numerical optimization technique evaluates a point that maximizes the desirability function (Prakash Maran et al. 2013e). According to BBD results, optimal extraction conditions to obtain the maximum extraction yield of TMAC and TCS from jamun fruit pulp were determined and it is found to be: extraction temperature of 44 ˚C, extraction time of 93 min and solid–liquid ratio of 1:15 g/ml. Under these conditions, the predicted extraction yield of TAMC and TCS was found to be 10.58 mg/100 g and 10618.3 mg/l with a desirability value of 0.926. A desirability ramp was developed from optimal points via numerical optimization technique (Fig. 4). This set of conditions was determined to be optimum and validated experimentally (Prakash Maran et al. 2013f). Triplicate experiments were performed under the optimized conditions and the mean values (10.49 ± 0.23 for TMAC and 10615.72 ± 3.51 for TCS) obtained from real experiments, demonstrated the validation of the optimized conditions.

Fig. 4
figure 4

Desirability ramp for optimization

Conclusion

In this study, BBD was employed to study and optimize the process variables such as extraction temperature, time and solid–liquid ratio on the extraction yield of pigment (anthocyanin) and color from jamun fruit pulp. From the results, it was observed that, the extraction conditions have significant effects on the extraction yield of pigments and color. ANOVA showed that the effects of all variables (Extraction temperature, extraction time and solid–liquid ratio) were significant and quadratic models were developed for predicting the responses. Optimum set of the independent variables was obtained by derringer’s desired function methodology in order to extract the maximum levels of pigment and color. The optimal conditions were found to be: extraction temperature of 44 ˚C, extraction time of 93 min and solid–liquid ratio of 1:15 g/ml respectively. Under these optimal conditions, the experimental yield (10.49 ± 0.23 mg/100 g for TMAC and 10615.72 ± 3.51 mg/l for TCS) was closely agreed with the predicted values (10.58 mg/100 g for TMAC and 10618.3 mg/l for TCS).