1 Introduction

During the past decade, polycaprolactone (PCL) has attracted considerable attention in the biomaterial field. PCL is a semicrystalline hydrophobic polymer, which is biodegradable and has long-term degradation. Some of the advantages of PCL are relatively inexpensive production, ease of manipulation, suitable and tailor-made physico-mechanical properties, flexibility in surface modification, and FDA approval [1]. Such special characteristics make PCL and its blends well suited for applications including tissue engineering, drug delivery, wound dressing, and fixation devices [25]. To date, PCL in the form of nano- and microsphere [3, 6], micelle [7], foam [8], nanofiber, and microfiber [9, 10] has been fabricated, which a nanofiber structure seems to be the most common form of PCL.

Although several available methods, such as phase separation [11] and template synthesis [12], have been developed for the fabrication of nanofibers, electrospinning has remained the most popular technique of nanofiber production. Electrospinning is the process of continuous fiber production by exploiting an electric field that affects a polymer solution or a melt extruding through a needle. Ease of use, low cost, versatility, and fiber production from many materials are some of the major advantages of electrospinning technique [13].

Electrospun nanofibers are advanced materials that introduce some individual characteristics, such as large surface area, good mechanical performance, and high porosity, which are properties largely dependent on the diameter and morphology of fibers [14]. Approximation of fiber diameter before the process of electrospinning is challenging because of the complicated patterns of variables affected process output. Electrospinning variables include polymer solution factors like concentration and conductivity, along with processing parameters such as voltage, tip-to-collector distance, feed rate, and ambient condition [15]. Exploring the quantitative relationship between process variables and their output could be performed through modeling techniques.

Response surface methodology (RSM) is very popular for modeling a process and for optimization studies. RSM is a collection of mathematical and statistical techniques that are extensively applied for analyzing and optimizing processes. It determines the effect of the independent variables, alone or in combination, on the processes. The major advantage of RSM is that it minimizes the number of experiments required for statistically analyzing a process with multiple factors. In this modeling approach, polynomials are applied as local approximation; subsequently, a mathematical equation expressing the relationship between the response and the independent variables is created [16].

Furthermore, artificial neural networks (ANN), a modeling tool for solving linear and nonlinear multivariate regression problems, have also been broadly applied for modeling different kinds of processes, such as electrospinning and fabrication of composites [1720]. This modeling technique is inspired by the biological brain. In ANN’s first step, a neural network composed of two or more layers with a defined number of artificial neurons in each layer is created. A simple artificial neuron has two key components: weight and transfer function. The weight of an input is multiplied by the input units, and the weighted input is introduced to the neuron transfer function to estimate the output. In the next step, by using several input-response data points, the network is trained to approximate the relationship between the two sets of values. In the training approach, the mean square error (MSE) of experimental and predicted outputs is minimized by continuously adjusting the weights between neurons. Validating and testing the data sets, which are necessary steps for exploring the network performance, are the final steps of ANN modeling [21].

In recent years, several studies have focused on the analysis of electrospinning process using RSM and ANN. Table 1 shows the summary of researchers’ studies on modeling of electrospinning aimed at predicting the diameter of nanofiber. Although several polymers have been studied thus far, no attempt has been made to model electrospinning process of PCL nanofiber, which is widely used in biomedical applications. The fiber diameter has a major role in the mechanical properties and porosity of the PCL scaffold as well as in attachment and proliferation of cells on the scaffold [39].

Table 1 Summary of studies in modeling of electrspinning process to predict nanofiber diameter

In the present study, two modeling techniques are employed to describe the nonlinear effect of selected electrospinning factors on the diameter of PCL, which is one of the most utilized polymers in biomaterials research. The assessed parameters of electrospinning are polymer concentration, voltage, and tip-to-collector distance. During this study, different conditions of fiber production are evaluated using the Box-Behnken design (BBD) method. Subsequently, the relationship between the diameter of polymer fibers and the above-mentioned factors is analyzed using RSM and ANN methods. Additionally, the two proposed models are compared and discussed in this paper.

2 Materials and methods

2.1 Materials

PCL (Mw = 80,000) and chloroform were obtained from Sigma-Aldrich, and N,N-dimethylformamide (DMF) was purchased from Merck.

2.2 Preparation of PCL solution

PCL was dissolved in a chloroform/DMF solution (9:1) at concentrations of 8, 10, and 12%. Each solution was stirred via magnetic stirrer for 3 h.

2.3 Electrospinning setup

Electrospun PCL scaffolds were fabricated using an electrospinning device (Nanoazma Co., Tehran, Iran). The prepared solution was extruded from a 5-ml syringe attached to a 21-G blunted needle. The feed rate was 0.5 ml/h, and a speed of 100 RPM was selected for collecting fibers. The electrospinning was performed at 25 °C. After electrospinning, the fibrous mats were dried for 48 h. Other electrospinning parameters to prepare different samples according to the Box-Behnken method are summarized in Table 2.

Table 2 Actual and coded values of the selected parameters

2.4 Characterization

The morphology of electrospun fibers was assessed by scanning electron microscope (SEM; TESCAN, Brno, Czech Republic) at an accelerating voltage of 30 kV under magnification of ×1500 after sputter coating with a gold sputter coater. The average fiber diameter was measured from the SEM micrographs using ImageJ2 software (maintained by C. Rueden on GitHub). Fifty fibers in each sample were measured. A typical SEM photograph of the electrospun nanofiber mat and its corresponding histogram of fiber diameter distribution are shown in Fig. 1a, b.

Fig. 1
figure 1

SEM micrograph of electrospun PCL nanofibers and their diameter distribution

2.5 Design of experiments

In this study, the selected design of experiments (DOE) was RSM coupled with Box-Behnken design. Among different types of DOE methods, Box-Behnken design has maximum efficiency and needs fewer experiments when three factors with three levels are involved [40, 41].

The initial step in the RSM study is the determination of the input parameters and their levels. This step was performed applying previous reports [38, 42]. Based on the literature, three independent variables were detected as more effective parameters on PCL fiber’s diameter, including concentration, voltage, and needle-to-collector distance. In addition, the range of each variable was defined in such a way to produce PCL fiber with no bead.

Since the parameters have different units, the factors were first normalized according to Eq. 1:

$$ x=\frac{x-\left({x}_{\max }+{x}_{\min}\right)/2}{\left({x}_{\max }-{x}_{\min}\right)/2} $$
(1)

where x is the coded variable, x is the natural variable, and xmax and xmin are the maximum and minimum values of the natural variable. Each of the coded variables is forced to range from −1 to 1. The resultant coded and actual values are listed in Table 2.

Then a three-level, three-variable Box-Behnken design was performed and, consequently, 17 experiments were obtained, as shown in Table 3. The number of required experiment is given by:

$$ N=2 k\left( k-1\right)+{C}_0 $$
(2)
Table 3 Experimental conditions designed by Box-Behnken design and corresponding responses

where N, k, and C0 are the number of experiments, variables, and center points, respectively.

The statistical software Design-Expert® (trial version of v. 7.0.0, Stat-Ease Inc., Minneapolis, MN, USA) was utilized for the design and regression analysis of the experimental data and for plotting the subsequent graphs.

2.6 Artificial neural network

In this study, multilayer perceptron ANN with two hidden layers and one output layer according to Kolmogorov’s theorem was applied [43]. To achieve the optimum topology of the network, different numbers of hidden layers and neurons were examined using a trial and error approach. The optimal structure was obtained based on the lowest value of MSE and the highest correlation coefficient (R) of the data set.

Among the algorithms that have been recommended for learning MLP networks, back propagation (BP) in conjugation with the Levenberg-Marquardt method was selected. BP is the fastest algorithm, although it requires more memory than the others [43]. The hyperbolic tangent sigmoid transfer function and the linear transfer function were employed for the hidden layers and output layer, respectively (Eqs. 3 and 4).

$$ f(x)=\frac{e^x-{e}^{- x}}{e^x+{e}^x} $$
(3)
$$ g(x)= x $$
(4)

The experimental data were randomly divided into three groups, including training, validating, and testing sets, containing 70, 15, and 15% of the samples, respectively. All experimental data were normalized before ANN modeling. The calculations were performed in MATLAB mathematical software (v. 7.1) using the ANN toolbox.

In order to compare the adequacy of ANN and RSM models, the R2, root-mean-square error (RMSE), model predictive error (MPE(%)), and mean absolute error (MAE), were calculated according to Eqs. 5 to 8:

$$ {R}^2=\frac{\sum_{i=1}^n\ {\left({Y}_{i o}-{Y}_{i p}\right)}^2-{\sum}_{i=1}^n\left({Y}_{i o}-{Y}_{i p}\right)\ }{\sum_{i=1}^n{\left({Y}_{i o}-{Y}_{i p}\right)}^2} $$
(5)
$$ \mathrm{RMSE}=\sqrt{\frac{\sum_{i=1}^n{\left({Y}_{i o}-{Y}_{i p}\right)}^2}{n}} $$
(6)
$$ \mathrm{MPE}\left(\%\right)=\frac{100}{n}\sum_{i=1}^n\left|\frac{Y_{i o}-{Y}_{i p}}{Y_{i p}}\right| $$
(7)
$$ \mathrm{MAE}=\frac{i}{n}{\sum}_{i=1}^n\left|{Y}_{i o}-{Y}_{i p}\right| $$
(8)

where Yio and Yip are the observed and predicted values of fiber diameter and n represents the number of experiments.

3 Results and discussion

3.1 Statistical results obtained by RSM

To evaluate the experimental results, we examined multiple regression models. The best model, which fits all the design points was a quadratic model. Analysis of variance (ANOVA) was used to estimate the variables’ effects and their interactions, as displayed in Table 4.

Table 4 Analysis of variance (ANOVA) for response surface quadratic model of PCL fiber diameter

The significant terms of the model, as indicated in the ANOVA table, consist of A, B, C, AC, A2, B2, and C2, which had P-values less than 0.05. Other model terms showed no significant effect on the electrospun PCL fiber diameter and thus were eliminated from the model. Afterwards, the following second-order polynomial function was determined to approximate the PCL fiber diameter:

$$ Y=318.4+101\ A+30.38\ B+10.13\ C-53\ A C-12.58\ {A}^2 - 35.83\ {B}^2+43.68\ {C}^2 $$
(9)

where Y, A, B, and C are the average diameter of PCL fibers, coded forms of polymer concentration, voltage, and tip-to-collector distance, respectively.

As shown in the ANOVA table (Table 4), the model P value (<0.0001) and the insignificant lack of fit (0.059) implied that the experimental data have a suitable agreement with the model. Additionally, the high adjusted and predicted R-squared values, which are 0.986 and 0.949, respectively, validated the goodness of the model.

3.2 Effect of electrospinning variables on diameter

According to the ANOVA results, the response surface model clearly indicates that the most effective parameter on the PCL electrospun fiber diameter is the polymer concentration. As corroborated in the literature, increasing polymer concentration will result in greater viscoelastic force, consequently leading to larger diameter polymer fibers [22]. The dependence between concentration and diameter, while other parameters are at their mean point, is shown in Fig. 2a. As can be seen, any change in the value of concentration is expected to lead to a significant change in the value of fiber diameter.

Fig. 2
figure 2

Main effects of the defined variables on PCL fiber diameter based on coded values: a concentration, b voltage, and c distance

There is a less strong dependence between the applied voltage and diameter with a mildly increasing trend, although in the high range of voltage the trend is almost reversed (Fig. 2b).

Furthermore, tip-to-collector distance had only a mild effect on PCL fiber diameter with concave dependence (Fig. 2c). However, the interactive effect of distance with concentration is more important than its main effect. The contour plot and 3D surface plot of PCL fiber diameter at different values of concentration and distance in the middle level of applied voltage are given in Fig. 3a, b. As indicated in the contour plot, the simultaneous effect of concentration and distance on the fiber diameter is similar to conic curves, although in higher range of variables, the dependence would be stronger.

Fig. 3
figure 3

Effect of variables on electrospun fiber diameter in a 2D and b 3D views

3.3 Artificial neural network results

ANNs are computational models inspired by biological nervous system. ANN includs several types such as feedforward network, radial basis function network, fuzzy network, and more [4446]. As corroborated by Hornik et al. [43], multilayered feedforward networks with a few as one hidden layer are capable of approximating any function from n dimentions to m dimentions to any desired degree of accuracy. Thereupon, this type of ANN was applied.

In this study, the parameter of interest is PCL fiber diameter as an output variable of electrospinning process, and three independent variables were selected as input factors, namely polymer concentration, voltage, and tip-to-collector distance. The final structure of the ANN model was obtained by examining a series of different topologies (Fig. 4a). The optimal model, determined based on the minimum value of MSE of the training and testing sets, was a three-layered perceptron feedforward ANN model with 1, 11, and 5 neurons in output, first, and second hidden layers, respectively. Figure 4b schematically illustrates the proposed ANN structure.

Fig. 4
figure 4

Schematic view of the represented MLP network (a) and the effect of number of neurons on the performance of the ANN (b). (Note that graph (b) is obtained for one hidden layer with tansig transfer function)

After creation of the network architecture through the learning process, the weights between the connections were adjusted until the optimum weights for output prediction were obtained or, in other words, when the mean square error is minimized in all training experiments. The learning process was carried out employing the back propagation algorithm in conjugation with the Levenberg-Marquardt training function.

It is necessary to evaluate the performance of the network while training is underway. This is done using the validating data set, which contains 15% of the samples in our study. In the validating step, the goal is reaching the desirable error.

To test the performance of the trained network in training and validating processes, the experimental values were compared with the output of ANN. The correlation between experimental data and training, and validating and testing sets, is depicted in Figs. 5 and 6. The perfect correlation (output exactly equal to target) is indicated by slope and y-intercept of the parity plot, equal to one and zero, respectively. The correlation coefficients (R value) for the training, validating, and testing sets were 0.987, 0.989, and 0.998, respectively, which demonstrated a good fit.

Fig. 5
figure 5

Regression analysis between ANN responses and the experimental results for a training, b validation, c test datasets, and d MSE plot with respect to the number of epochs for training, validation, and test samples for the PCL fiber diameter response

Fig. 6
figure 6

Regression analysis between actual and predicted diameter of PCL fibers using a RSM (R2 = 0.984) and b ANN (R2 = 0.970)

The MSE plot for all 17 samples is shown in Fig. 5d. As can be seen, the MSE of the network has a descending trend that indicates the network is learning. The best validation performance was at MSE of 471.11, obtained at epoch one.

3.4 Comparison between RSM and ANN models

In this section, the aim is to determine which modeling technique shows better predictive accuracy. Hence, the experimental data and predicted values obtained via RSM and ANNs were compared to obtain the goodness of fitting. For this purpose, error analysis was carried out and R2, RMSE, MPE, and MAE were calculated according to the formulas mentioned in the previous section. As shown in Table 5, both models had good agreement with experimental data, which indicated by R2 value, equal to 0.98 and 0.97 for RSM and ANN, respectively. The RSM model had slightly greater R2 value and lower error than the ANN model (Fig. 6). This fact implied more predictive accuracy of RSM model. On the other hand, the ANN model itself provides little information about the input parameters and their contribution to the response if further analysis has not been done, whereas the main advantage of RSM is its ability to exhibit the factor contributions via coefficients in the regression model.

Table 5 Comparison between experimental and predicted values of electrospun fiber diameter by RSM and ANN models

As the last step, to confirm the validity of the models, we carried out a validation study. Accordingly, additional experiments with different levels of input factors (polymer concentration, voltage, and distance) were conducted and the obtained responses were compared with predicted values. As shown in Table 6, there is no remarkable difference between experimental data and corresponding predicted values (Fig. 7a–c).

Table 6 Validation study of RSM and ANN models using different levels of input variables
Fig. 7
figure 7

SEM micrograph of electrospun PCL nanofiber and corresponding fiber diameter distribution. ac Nanofibers from corresponding experiments in Table 4; d nanofibers at optimized condition (minimum diameter)

3.5 Optimization of electrospun PCL fiber

In a response surface optimization study, the objective is to find a desirable location in the design space. The demanded point could be a minimum, a maximum, or a special region where response is stable. In our study, the goal was to minimize the polymer diameter, which may provide maximum fiber surface area. The conditions for obtaining the finest PCL fiber diameter as determined by RSM model were polymer concentration = 8% (w/v), voltage = 15 kV, and distance = 8 cm.

Figure 7d shows the nanofiber morphology acquired in the optimized condition. The average fiber diameter was 129 nm, and the predicted fiber diameter under similar conditions estimated using RSM was 120 nm with a desirability value of 1. Comparison of the experimental value and the response provided by the model shows that the RSM model has an acceptable performance in the optimization study.

4 Conclusion

This paper presents a study on the effect of electrospinning variables including polymer concentration, voltage, and nozzle-to-collector distance on the average diameter of PCL nanofibers. Box-Behnken design has been carried out to create minimum number of experiment needed for analysis of all the given parameters. RSM and ANN are applied in order to model and optimize nanofiber diameter. RSM analysis indicates that polymer concentration is the most effective factor on PCL fiber diameter.

Voltage and distance are in the next level of importance and the distance effect depends on the range of concentration. The optimum structure of ANN, based on the smallest MSE, is determined to be a three-layer network with Levenberg-Marquardt back-propagation learning algorithm, tangent sigmoid transfer function in the hidden layers containing 11 and 5 neurons, and linear transfer function in the output layer. The high regression coefficient between the experimental and predicted values reveals excellent evaluation of the data sets via the ANN and RSM models. Owing to the lower error of the RSM model, the performance of this model is better than that of the ANN model. In the optimization study, the condition for achieving the minimum diameter is defined. The observed diameter (129 nm) and the theoretical value predicted by RSM (120 nm) are very close, and the error percentage is very low. In addition, the validation study is performed as the last step to confirm the performance of both models. Overall, the collected results suggest that modeling techniques such as ANN and RSM can be useful tools for controlling the diameter of electrospun fibers, which is a key factor in determining properties of nanofibrous scaffolds.