1 Introduction

In general practice, modeling of soil–geosynthetic interface behavior is done by introducing interface elements. This can be accomplished by a number of ways like, node compatibility spring element; composite layer model; joint element of zero or non-zero thickness; etc. (Andrawes et al. 1982; Love et al. 1987; Gens et al. 1988; Poran et al. 1989; Burd and Brocklehurst 1990; Wilson-Fahmy and Koerner 1993; Abdel-Baki and Raymond 1994; Yamamoto and Otani 2002). But all these methods involve horizontal and vertical stiffness values which are very difficult to determine experimentally. To overcome this difficulty Poulos and Davis (1974) and Basudhar et al. (2008) modeled interface as a contact problem and assumed that failure would occur due to a slip at the interface between the soil and the geosynthetic and not due to yielding of the reinforced soil. It is very essential to provide an accurate estimate of the properties of the interface for an accurate prediction. The properties of the soil–geosynthetic interface depend on many factors, like, nature of force along the interface (direct shear or pullout mode); physical and mechanical properties of the soil (density, grain shape and size, grain size distribution, moisture content, cohesive strength, frictional angle, etc.), mechanical properties of the geosynthetic (ultimate tensile strength, stress at 5% strain) and shape and geometry of the geosynthetic. However, the most important parameters are the density of the soil, moisture content of the soil, soil–geosynthetic adhesion, soil–geosynthetic interface frictional angle, properties of the geosynthetic and normal stress on the shear plane. It is observed that the interface shear resistance at the high density polyethylene (HDPE)–clay interface is maximum when the clay is in the compacted condition and minimum when the clay is near saturation (Mitchell et al. 1990; Lopes 2002). Previous studies conducted at the Louisiana Transportation Research Center (LTRC) on geogrid reinforcement in cohesive soils also showed that increased moisture content resulted in decreased pullout resistance (Farrag and Morvant 2003a, b). Large direct shear tests showed that increase in molding moisture content and/or decrease in dry density caused an appreciable reduction in the cohesive soil–geosynthetic interface shear resistance (Farrag and Griffin 1993; Farrag 1995; Farsakh et al. 2007).

Even though these methods are widely used, reliable predictions are uncertain due to the variability of soil properties. To overcome the problem, ANN model can be used as a suitable alternative. Since it is a data driven process, it can learn from history. It can also adapt to any anomaly which gets unnoticed by most of the traditional techniques for the prediction of soil–geosynthetic interface behavior. Several researchers have used the ANN model to predict different soil behaviours like skin friction of the piles in clay (Goh 1995); ultimate bearing capacity of piles (Lee and Lee 1996); slope stability prediction (Sakellariou and Ferentinou 2005); undrained lateral load capacity of piles (Das and Basudhar 2006); the residual frictional angle of clay (Das and Basudhar 2008); elastic modulus of jointed rock mass (Maji and Sitharam 2008); prediction models for soil compaction and permeability (Sinha and Wang 2008); modeling sulfuric acid induced swell in carbonate clays (Sivapullaiah et al. 2009); the compression index of soils (Park and Lee 2011); effective stress parameters of unsaturated soils (Ajdari et al. 2012); ground settlement prediction (Kanayama et al. 2014); prediction of unconfined compressive strength of geopolymer stabilized clayey soil (Mozumder and Laskar 2015) etc.

From the above study it can be observed that the ANN has been used successfully in solving various geotechnical engineering problems. The detailed discussion of ANN is refrained in the present study and can be found in the literature (Minsky and Papert 1969; Lippman 1987; DARPA 1988; Caudill 1989; Caudill and Butler 1992). The ANN is considered as a black box system as it is unable to explain the fundamental principles of prediction. Therefore, interpretation of weights may be considered the subject of future research (Goh et al. 2005). Keeping in mind the above limitations the present study uses a feed forward back propagation neural network model with 4, 10 and 15 hidden nodes to predict the peak shear stress at the cohesive soil–geosynthetic interface. Three learning algorithms, namely Bayesian regularization (BR), Levenberg–Marquardt (LM) and scaled conjugate gradient (SCG) have been used. A comparison study has been done to obtain the best learning algorithm. Sensitivity analysis is employed to quantify the important input parameters for prediction of the peak shear stress (PSS).

2 Development of Prediction Model

2.1 Development of ANN Model

The ANN process mainly consists of four steps, namely, collection of data, selection of a network type, selection of a learning algorithm and selection of a criterion for stopping the process. In the present study, the data are taken from the research paper of Farsakh et al. (2007). The data consist of dry density of cohesive soil (γ), moisture content of cohesive soil in percentage (%w), normal stress on the shear plane (σn), soil–geosynthetic adhesion (ca) and soil–geosynthetic interface frictional angle (δ) as the inputs and peak shear stress at cohesive soil–geosynthetic interface (T) as the output. Similar approach of input–output pattern has already been studied by other researchers (Monjezi et al. 2006; Sarkar et al. 2010; Sobhani et al. 2010; Yaprak et al. 2013). In the present study, out of 90 data points, 70% are considered for training the ANN model (shown in Table 1) and rest 30% for testing the model (shown in Table 2) as suggested by Shahin et al. (2004). Statistical parameters of the training and testing data are given in Table 3. Since the input parameters have different units of measure, normalization of their values is necessary, hence all the variables (inputs and output) are normalized in the range [−1, 1]. ANN model has been implemented in MATLAB R2013a environment with neural network toolbox, with three different learning algorithms as mentioned above. A three-layer feed-forward back-propagation neural network with one hidden layer is adopted as suggested by previous researchers (Rumelhart et al. 1986; Lippman 1987; Sonmez et al. 2006) while the number of hidden neurons is determined using a heuristic model as suggested by Sonmez et al. (2006). The heuristic model is shown in Table 4. The model shows that the hidden neurons may vary from 1 to 15. Figure 1 shows the relationship between the numbers of neurons against MSE during the training. It is observed from the figure that 4, 10 and 15 numbers of hidden neurons give the optimum architecture of ANN model based on the minimum value of mean square error (MSE) of the training data set. Hence, in the present study the optimum numbers of hidden neurons are taken as 4, 10 and 15. Once network weights and biases are initialized randomly, training of the network starts. Hyperbolic sigma transfer function is used as the transfer function for calculating the output from a certain neuron from its inputs. During training, the progress is constantly updated in the training window. The major challenge in the successful application of ANN is when to stop the training. If training is inadequate, then the network will not be fully trained, whereas if training is extreme then it will memorize the training pattern or learn noise. Normally the network stops when gradient of performance reaches a threshold value or when the number of failed validation reaches a prefixed number or when the number of epoch reaches a certain specified number. The gradient will become very small as the training reaches a minimum of the performance and validation checks represent the number of successive iterations that the validation performance fails to decrease. An epoch is a cycle of forward and backward process through which the network weights and biases are modified. For the present study, the stopping criterion for the number of epoch is 1000, the magnitude of gradient of performance is 10−7 and the number of failed validation checks is 6.

Table 1 Summary of training data for peak shear stress of soil geosynthetics interface (from Farsakh et al. 2007)
Table 2 Summary of testing data for peak shear stress of soil geosynthetics interface (from Farsakh et al. 2007)
Table 3 Statistical parameters of training and testing data of ANN model
Table 4 Different heuristics used for calculate the number of nodes in hidden layer (from Sonmez et al. 2006)
Fig. 1
figure 1

Relationship between the number of neurons and MSE

The performance of ANN model is reported in terms of five statistical parameters, namely the coefficient of determination (R 2), mean square error (MSE), mean absolute percentage error (MAPE), variance absolute relative error (VARE) and median absolute error (MEDAE).

2.1.1 Bayesian Regularization (BR) Learning Algorithm

To advance the generalization capabilities of the conventional back-propagation algorithm MacKay (1991) and Neal (1992) introduce Bayesian back propagation neural networks. It is based on the Bayesian statistical approach (Box and Tiao 1973) and originated in the field of highest entropy (Gull 1988). To create a well generalized network, the Bayesian regularization learning algorithm involves constraining the number of the network parameters through a regularized that penalizes the more complex weight functions in favor of simpler functions. This procedure is carried out by adding a penalty term to the sum squared error. Instead of just giving a single “optimum” prediction, the Bayesian approach provides a probability distribution over the predicted value. This is frequently very important as it provides information on the characteristic error of the prediction that arises from the uncertainty associated with interpolating noisy data. In order to create a prediction model with the most accurate response, three artificial neural networks with 4, 10 and 15 hidden neurons are developed. Comparison of predicted peak shear stress, PSS (Y), with experimental peak shear stress, PSS (T), for training and testing sets, using ANN with 4, 10 and 15 hidden neurons, is shown in Fig. 2. From the figures it is observed that the ANN model with four hidden neurons gives the highest correlation coefficient R being equal 0.995 for the training set and 0.994 for the testing set. The standard error is also observed to be the smallest, 1.11 and 1.14 respectively as shown in Fig. 2a, b. In order to validate the predicted PSS values obtained from proposed ANN model with experimental values, a 99% prediction interval is chosen. 99% prediction interval is defined as an interval within which 99% of Y values for a certain X value will lie around the linear regression line. An established equation (Verschuuren 2007) to obtain the upper and lower bound prediction interval values is used in the present study. Figure 2b shows that all the testing data lie well within the 99% prediction interval band, which confirms the validity of the proposed ANN model. It is clear that ANN model with BR learning algorithm with four hidden nodes gives the most accurate predictions that fall within the 99% prediction interval.

Fig. 2
figure 2

Comparison of predicted and experimental values of peak shear stress (kPa) for training and testing sets using Bayesian regularization learning algorithm. a Comparison with 4 hidden nodes for training data, b comparison with 4 hidden nodes for testing data, c comparison with 10 hidden nodes for training data, d comparison with 10 hidden nodes for testing data, e comparison with 15 hidden nodes for training data, f comparison with 15 hidden nodes for testing data

2.1.2 Levenberg–Marquardt (LM) Learning Algorithm

The LM learning algorithm is usually considered as the fastest method for training moderate-sized feed-forward neural networks (Singh et al. 2005; Khandelwal and Singh 2010) and it is the best choice for solving the problems of supervised learning, which is the case in the present analysis. Comparison of predicted peak shear stress, PSS (Y), with experimental peak shear stress, PSS (T), for training and testing sets, using ANN with 4, 10 and 15 hidden neurons, is given in Fig. 3. From Fig. 3a–f it is clear that the ANN model with four hidden neurons has the highest correlation coefficient R, and it is obtained as 0.99 for the training set and 0.982 for testing set with a small value of the standard error, SE = 2.33 for testing. It means that the average distance of the data points from the fitted line is 2.33 kPa. Figure 3b also shows that the upper bound and the lower bound correspond to 99% prediction interval i.e. all the data points lie well within the 99% prediction interval band, which confirms the precision of the proposed model.

Fig. 3
figure 3

Comparison of predicted and experimental values of peak shear stress (kPa) for training and testing using Levenberg–Marquardt learning algorithm. a comparison with 4 hidden nodes for training data, b comparison with 4 hidden nodes for testing data, c comparison with 10 hidden nodes for training data, d comparison with 10 hidden nodes for testing data, e comparison with 15 hidden nodes for training data, f comparison with 15 hidden nodes for testing data

2.1.3 Scaled Conjugate Gradient (SCG) Learning Algorithm

SCG learning algorithm belongs to the group of conjugate gradient optimization methods which are well suited to handle the large-scale problems in a successful way (Fletcher 2000). This method represents one of the four most often used algorithms of this group, besides Polak–Ribiére Update, Fletcher–Reeves Update and Powell–Beale Restarts algorithm. Each one of these conjugate gradient algorithms requires a line search at every iteration step, which is computationally expensive, since the network reply to all training inputs has to be computed several times for each search. The SCG algorithm was designed to avoid this time-consuming process, by combining the model-trust region approach (used in LM algorithm) with the conjugate gradient algorithm (Moller 1993). Comparison of predicted peak shear stress, PSS (Y), with experimental peak shear stress, PSS (T), for training and testing sets, using ANN with 4, 10 and 15 hidden neurons, is given in Fig. 4. From Fig. 4a–f it is clear that the ANN model with four hidden neurons has the highest correlation coefficient R, obtained as 0.994 for the training set and 0.988 for testing set with a small value of the standard error, SE = 1.96 for testing. It means that the average distance of the data points from the fitted line is 1.96 kPa. Figure 4b shows that the upper bound and the lower bound correspond to 99% prediction interval i.e. all the data points lie well within the 99% prediction interval band, which confirms the precision of the proposed model.

Fig. 4
figure 4

Comparison of predicted and experimental values of peak shear stress (kPa) for training and testing sets using Scaled conjugate gradient learning algorithm. a Comparison with 4 hidden nodes for training data, b comparison with 4 hidden nodes for testing data, c comparison with 10 hidden nodes for training data, d comparison with 10 hidden nodes for testing data, e comparison with 15 hidden nodes for training data, f comparison with 15 hidden nodes for testing data

2.2 MVR Model Development

In the present study, a multi-variable regression analysis (MVR) is also conducted to predict the peak shear stress of the soil geosynthetic interface. Similar to ANN model, 70% of the total data are used for developing the MVR model. Rest 30% data are used to evaluate the prediction efficacy of the model. The generalized linear relationship between the dependent variable and the independent variables takes the form as shown in Eq. (1).

$$Y = c_{0} + c_{1} X_{1} + c_{2} X_{2} + c_{3} X_{3 } + \cdots + c_{n} X_{n } \pm e$$
(1)

where Y is dependent variable, c 0 is the Y intercept. c 1 , c 2 , c 3 and c n are the slopes associated with the independent variables X 1 , X 2 , X 3 and X n and e is the error. MVR model is developed with T as dependent variable and γ, %w, σ n , ca and δ as independent variables.

3 Performance Evaluation of the Proposed ANN Model

Performances of the developed prediction models can be further evaluated using different standard statistical criteria given in Table 5 (Monjezi et al. 2013). Calculated statistical errors are shown in Table 6. The ANN model with BR learning algorithm with four hidden nodes has the highest value of coefficient of determination R2 and lowest values of mean square error (MSE), mean absolute percentage error (MAPE), median absolute error (MEDAE) and variance absolute relative error (VARE), in comparison with other proposed ANN models. The optimum architecture of ANN model was characterized by four neurons in hidden layer with hyperbolic tangent sigmoid function as the transfer function and a pure linear transfer function at the output layer. Out of three learning algorithms Bayesian regularization back propagation learning algorithm is used for its better generalization for the training data and lowest statistical error.

Table 5 Statistical error formulation used for models’ evaluation (from Monjezi et al. 2013)
Table 6 Statistical errors of different models for predicting PSS

4 Results and Discussions of ANN and MVR Models

From Table 6 it is observed that out of three types of ANN model, The Bayesian regularization (BR) model with four hidden neurons is the best one showing higher value of R2 and lesser values of statistical errors. These results are compared with MVR model in Table 7. R2, MSE, MAPE, VARE and MEDAE value of the BR model for training and testing data are found to be 0.991, 1.21, 2.43, 1.17 and 0.53 & 0.988, 1.48, 3.31, 1.27 and 0.60 respectively, while for MVR model these values are found to be 0.956, 5.52, 8.04, 6.54 and 1.45 & 0.965, 4.73, 7.08, 5.58 and 0.93 respectively. The statistical values in Table 7 suggest that the ANN model is superior to MVR model in learning and predicting the experimental data. The analysis of variance (ANOVA) and statistical information of predictor variables of MVR model are shown in Tables 8 and 9. Regression analysis data in Tables 8 and 9 are made with the help of t test and F-test at 95% confidence level. From Table 8, it is observed that the P value (2.01E−37) is very small which suggests that the confidence level i.e. (1 − P) is almost 100% and at least one of the coefficients of MVR model is important. However, to identify the significant coefficients in the MVR model in addition to F-test, t tests are conducted. The t-stat and corresponding P values of individual coefficients are given in Table 9. It is observed that P values of coefficients of γ and %w are very high with corresponding low confidence levels [(1 − P) < 0.95] and hence these coefficients are insignificant for the MVR model. On the other hand P values of σn, ca and δ are very low with high confidence level [(1 − P) > 0.95] and hence it is proposed that these coefficients are significant in MVR model. Lower and upper limits of 95% confidence interval are also shown in Table 9. Confidence intervals of γ and %w include zero falls and hence are not consistent with the significance of t test. Confidence intervals of σn, ca and δ do not include zero and hence are consistent with the significance of t tests. It is evident that MVR model fails to generalize the cohesive soil–geosynthetic interface mechanism as it rejects the significance of γ, and %w in PSS prediction.

Table 7 Performance of ANN and MVR model
Table 8 Analysis of variance (ANOVA) of MVR model
Table 9 Statistical information of predictor variable of MVR model

5 Sensitivity Analysis

Sensitivity analysis is a major concern for selecting the important input variables. Different methods have been subjected to select the significant input variables. However, methods such as Garson’s algorithm and Connection weight approach have been successfully used by some researchers for assessing the variable contribution in geotechnical engineering problems (Goh 1994; Das and Basudhar 2006, 2008; Das et al. 2011). In the present study, aforementioned two methods have been used to identify significant input variables in PSS prediction. Both the approaches use optimized weight vector to identify the important input variables, details of which are available in the literature (Garson 1991; Olden and Jackson 2002). Optimized weight vectors of the ANN model are presented in Table 10. Based on the weights listed in Table 10 the importance and relative ranking of different input variables of ANN using Garson’s algorithm and Connection weight approach are shown in Table 11. It is observed from Table 11 that both the approaches namely Garson’s algorithm and Connection weight approach rank normal stress (σn) as the most important parameter. However, the Connection weight approach ranks percentage water content (%w) as the least important parameter whereas; Garson’s algorithm ranks dry density (γ) as the least important parameter. However it is to be mentioned here that researchers have shown an indirect effect of soil moisture content and direct effect of dry density on cohesive soil–geosynthetics interaction (Farsakh et al. 2007). The interface efficiency decreases with the increase in moisture content and decreasing dry density. The results of the present study showed that out of five input parameters chosen, the dry density and moisture content have the least effect on peak shear stress at cohesive soil–geosynthetics interface.

Table 10 Connection weights and biases for predicting PSS with Bayesian learning algorithm with four hidden nodes
Table 11 Sensitivity analysis results

Ranking given by connection weight approach seems to be more realistic and acceptable due to following reasons:

  • With the increase in normal stress the confining pressure at the cohesive soil–geosynthetic interface is increased which causes an increase in the interface efficiency resulting in higher value of PSS. Farsakh et al. (2007) also showed that with increase in normal stress and keeping the dry density and moisture content constant, the PSS increases. Therefore, ranking of the normal stress as the most important parameter for PSS as per connecting weight approach is confirmed.

  • Using Mohr–Coulomb failure criteria following equation is deduced (Farsakh et al. 2007):

    $$\tau_{s - g} = c_{a} + \sigma_{n} \tan \delta$$
    (2)

    where τ sg is the shear stress along the soil–geosynthetic interface.

    Equation (2) clearly shows that shear stress is directly related to soil–geosynthetic interface friction angle and adhesion. Hence, interface friction angle and adhesion are the important parameters for PSS next to normal stress.

  • It is known that the normal stress and dry density are correlated and thus the dry density parameter should have come second in the ranking, but the connecting weight approach shows the dry density to be the fourth important parameter. In a direct shear box the weight of soil sample becomes insignificant in comparison to the applied normal stress which is acting as a surcharge. For an example Farsakh et al. (2007) used a large size shear box consisting of an upper box and lower box both having dimensions 300 mm long, 300 mm wide and 65.4 mm deep. The normal stress due to dry density acting on soil–geosynthetic interface comes around 1 kPa where as the normal stresses applied to the clay sample were 25, 50 and 75 kPa. Hence, the ranking of dry density as fourth important parameter is justified.

  • The interface shear failure envelopes generally decreases with the increase in molding moisture contents as relevant in the sensitivity analysis with negative sign as shown in Table 11. The increase in water content develops excess pore water pressure resulting in decrease of PSS. Thus ranking water content to be the least important parameter is justified.

Mozumder and Laskar (2015) also reported that ranking given by Connection weight approach is more realistic and acceptable than the Garson’s algorithm.

6 Neural Interpretation Diagram (NID)

The neural interpretation diagram (NID) was proposed by Ozesmi and Ozesmi (1999) for visual interpretation of the connection weights among the neurons. The lines joining the input-hidden and hidden-output neurons in the NID represent the magnitude of weights and their directions. The positive and negative weights are represented by black and gray lines respectively, and the thickness of the lines is proportional to their relative magnitude. The input and output relationship is determined in two steps. The positive input hidden and positive hidden-output weights or a negative input-hidden and negative hidden-output weight depict the positive effect of the input variables. The negative effect of the input variables is depicted by positive input-hidden and negative hidden-output or negative input-hidden and positive hidden-output weights. The input having a direct effect on the output is represented by black circles and that having an inverse effect with gray circles. For the present example the weights are presented in Table 10 and NID is presented in Fig. 5. It is seen from Fig. 5 that the inputs γ, σn, ca and δ have a positive contribution to the Y values and %w has negative effects on the Y value which justifies the physical relationship of the soil–geosynthetic interaction mechanism. It is also seen that with an increase in the moisture content there is an appreciable reduction in interface shear resistance as reported by other researchers (Farrag and Griffin 1993; Farrag 1995; Farsakh et al. 2007). Therefore, it is inferred that γ, σn, ca and δ are directly and %w is inversely proportional to Y value. So it can be concluded that NID is an effective method of indicating the physical relationship between inputs with the output.

Fig. 5
figure 5

Neural interpretation diagram of ANN model

7 ANN Model Equation for Predicting PSS Value Based on Trained Neural Network

The mathematical equation as per the ANN relating the input and the output variables suggested by Goh et al. (2005) can be written as

$$Y_{n} = f_{sig} \left\{ {b_{0} + \mathop \sum \limits_{k = 1}^{h} \left[ {w_{k} \times f_{sig} \left( {b_{hk} + \mathop \sum \limits_{i = 1}^{m} w_{ik} X_{i} } \right)} \right]} \right\}$$
(3)

where Y n is the normalized (in the range −1 to 1 in this case) Y value; b 0 is the bias at the output layer; wk is the connection weight between kth neuron of hidden layer and the single output neuron; b hk is the bias at the kth neuron of hidden layer; h is the number of neurons in the hidden layer; w ik is the connection weight between ith input variable and kth neuron of hidden layer; X i is the normalized input variable i in the range [−1, 1] and f sig is the sigmoid transfer function.

In this study, an ANN model equation for peak shear stress along soil geosynthetics was established using the values of the weights and biases presented in Table 10 the following expression can be written to finally arrive at a correlation of Y with the input parameters.

$$A_{1} = - 0.611 + 0.2241\rho - 0.1281w + 0.5307\sigma_{n} - 0.1079c_{a} + 0.4599\delta$$
(4)
$$A_{2} = 0.0486 + 0.0354\rho + 0.2943w + 1.0981\sigma_{n} - 0.4282c_{a} - 0.0225\delta$$
(5)
$$A_{3} = - 0.3116 + 0.0175\rho + 0.3211w + 0.3505\sigma_{n} - 0.5339c_{a} - 0.3239\delta$$
(6)
$$A_{4} = - 0.2448 - 0.1136\rho - 0.2624w - 0.0649\sigma_{n} - 0.5137c_{a} - 0.0691\delta$$
(7)
$$B_{1} = 0.7687 \times \frac{{e^{{A_{1} }} - e^{{ - A_{1} }} }}{{e^{{A_{1} }} + e^{{ - A_{1} }} }}$$
(8)
$$B_{2} = 0.5707 \times \frac{{e^{{A_{2} }} - e^{{ - A_{2} }} }}{{e^{{A_{2} }} + e^{{ - A_{2} }} }}$$
(9)
$$B_{3} = - 0.9582 \times \frac{{e^{{A_{3} }} - e^{{ - A_{3} }} }}{{e^{{A_{3} }} + e^{{ - A_{3} }} }}$$
(10)
$$B_{4} = - 0.7049 \times \frac{{e^{{A_{4} }} - e^{{ - A_{4} }} }}{{e^{{A_{4} }} + e^{{ - A_{4} }} }}$$
(11)
$$C_{1} = - 0.1726 + B_{1} + B_{2} + B_{3} + B_{4}$$
(12)
$$Y_{n} = \frac{{e^{{C_{1} }} - e^{{ - C_{1} }} }}{{e^{{C_{1} }} + e^{{ - C_{1} }} }}$$
(13)

The Y n value as obtained from Eq. (13) is in the range [−1, 1] and this needs to be denormalized as

$$Y = 0.5\left( {Y_{n} + 1} \right)\left( {Y_{max} - Y_{ min } } \right) + Y_{min}$$
(14)

where Y max and Y min are the maximum and minimum values of Y respectively in the data set.

8 Conclusions

The following are the conclusions from the above studies:

  • Out of four learning algorithms Bayesian regularization back propagation learning algorithm is used for its better generalization to the training and testing data and lowest statistical error.

  • ANN model with Bayesian regularization back propagation training algorithm outperforms MVR model in predicting the PSS.

  • Based on sensitivity analysis both the algorithms, namely Garson’s and connection weight approaches rank σn as the most important parameter influencing PSS prediction followed by γ, %w, c a and δ.

  • MVR model shows that σn, ca and δ are the significant parameters for PSS prediction.

  • According to Connection weight approach, out of five input parameters moisture content (%w) is the least important parameter influencing PSS prediction.

  • NID demonstrates the negative or inverse effect of moisture content (%w) on the PSS prediction i.e. with increase in moisture content PSS decreases.

  • The superiority of ANN model over MVR model in PSS prediction can be attributed to its flexibility and adaptability in generalizing the data.

  • Compared to Garson’s algorithm, Connection weight approach is capable of identifying the true importance of input variables in PSS prediction.

  • A model equation is presented based on the trained weights of the ANN.