Keywords

1 Introduction

Every foundation setup requires satisfying two imperative criteria: ultimate bearing capacity of confinement and limiting settlement of foundations. Between these two criteria, a complete bearing capacity of confinement is managed by shearing nature of the soil and can be evaluated by the theories of Terzaghi (1943), Meyerhof (1963), Vesic (1973), and others. The explanation behind the part of possible theories is the most remote point which adjusted the procedure. In this manner lab results obtained by different researchers were positive. In any case, the trials are generally done on tinier measured models, which are significantly cutback models stood out from real field footings. In this way, various authorities (e.g., (De Beer 1965; Meyerhof 1950)) have prompted that one should be amazingly vigilant while extrapolating disclosures of examinations driven on little footings that have a width of several inches, to the broad evaluated footings. The clarification behind this is credited to the extension in shearing strain along the slip line with the development in width of the foundation and the extent of mean grain size of the soil and the adjust width (Steenfelt 1977). For extensive scale establishments on dense sand, shearing strains demonstrate significant variety along the slip line. The normal assembled point of shearing resistance along the slip line is littler than the greatest estimation of the angle of shearing resistance (φmax) which was obtained via plane strain shear tests.

Henceforth, an ultimate technique is required that gives better gauges of bearing capacity. In the most recent two decades, a few analysts have created successful demonstrating instruments.

One of them is artificial neural networks (ANNs) methodology. In the current past, ANNs have been connected to numerous geotechnical designing issues, including the determination of the bearing capacity of footings, settlement forecasts, liquefaction, and stability of slopes (Shahin et al. 2001). The real preferred standpoint of ANNs is that they reanalyze the old and new information, use the old outline helps or conditions, and additionally propose new conditions (Pande and Shin 2004). This paper shows the relevance of ANN method in building up a compelling nonlinear model for anticipating a definitive bearing capacity of shallow foundations on cohesionless soils. The database, which comprises of load test values of extensive scale footings and smaller estimated demonstrate footings, is utilized to create and check the ANN model. This paper additionally goes for a broad assessment of the ANN model with regression-based models for ultimate bearing capacity using in a total of 100 sets of field data and evaluates their benefits and restrictions. The execution of these models is then occurred to demonstrate the most regularly utilized bearing capacity.

2 Development of Ultimate Bearing Capacity Model

The data used for experimenting and the development of the ANN model were collected from the literatures, which are load test data on real-sized foundations regarding the footing and soil. Among 100 collected field data, 50 are from load tests on large-scale footings and 50 are from small shallow footings. Again from 47 large-scale footing data, 24 were reported by Muhs and Weiß (1971), 11 data by Weiß (1970), 5 of them by Muhs et al. (1969), 2 of them by Muhs and Weiß (1973), 5 of them by Briaud and Gibbens (1999).

One of the challenging initiatives in the model improvement for the prediction of bearing capacity of a shallow foundation is to differentiate parameters those have impact on the bearing capacity, for which a number of the conventional bearing capacity estimation techniques (Terzaghi 1943; Meyerhof 1963; Vesic 1973) are used. In spite of the way that the bearing capacity gotten through these techniques varying abruptly, the fundamental type of condition is the same for every one of the techniques, which is as per the following for establishments in cohesionless soil:

$$ {\text{q}}_{\text{u}} = \upgamma {{\text{DN}}_{\text{q}}} {{\text{S}}_{\text{q}}} {{\text{d}}_{\text{q}}} + 0.5\upgamma {{\text{BN}}_{\upgamma}} {{\text{S}}_{\upgamma}} {{\text{d}}_{\upgamma}} $$

where B—width of footings, D—profundity of footings, c—unit weight of sand (underneath or more the establishment level), Nq, Nɣ—bearing capacity factors, Sq, Sɣ—shape factors, and dq, dɣ—depth factors.

Among the parameters identified with the footings, the principle factors influencing the bearing capacity are its width (minimum horizontal measurement, B), footing length (L), depth of the foundation (D), and shape (square, rectangular and round). The depth of the foundation has the most significant impact on the bearing capacity among all the physical properties of the foundation (Meyerhof GG. The bearing capacity of sand. PhD thesis, University of London 1950).

The fundamental parameters with respect to the soil (sand) are its angle of shearing friction and the unit weights of soil layers adjusted to the water table (if available). There are some different factors, for example, compressibility and thickness of the soil layer underneath the foundation which are responsible for a smaller grade. Among the considerable number of properties of soil, the angle of shearing resistance has most noteworthy impact on the bearing capacity, which decreases with the relative thickness of the soil. The unit weight of the soil is depended on the position of water table and proportional to the bearing capacity.

In light of above discussion, the five input factors used for the model encroachment are width of footing (B), depth of footing (D), unit weight of sand (ɣ), angle of shearing resistance (φ), and ratio of length and width (L/B). By using these five factors, the ultimate bearing capacity (qu) is the final result.

3 ANN Model Development

In this study, feed-forward multilayer perceptron (MLP) is utilized; the portrayal of which can be found in numerous publications (Zurada 2003; Fausett 1994). The data set was arbitrarily separated into two subsets: training and testing. 87% (or 87 cases) of the information was set for training the model, while the remaining 13% (or 13 cases) was utilized for testing or validation the execution of the proposed ANN-based model. A trial-and-error strategy was used to determine the optimum number of hidden layers in the model which was concluded by using ten hidden layer. Because of five input and one output variable, five nodes were used in the input layer and one in the output layer which is a typical way to deal with problem of overfitting in early stopping (Fig. 1).

Fig. 1
figure 1

Architecture of ANN model

This approach includes monitoring the generalization error and stopping training when the minimum validation error is observed. Be that as it may, some care is required when to stop, since the approval mistake surface may have local minima or long flat regions preceding a steep drop-off. To minimize these constraints, Mackay (1991) proposed the use of Bayesian back-propagation neural networks which minimize a combination of squared errors and weights and after that decides the correct combination, in order to create a network which sums up well. This procedure is called Bayesian regularization. In the Bayesian structure, the weights and the biases of the network are assumed to be random variables with indicated appropriation of distribution. The regularization parameters are identified with the unknown variances related to the distribution. These parameters are then evaluated using statistical techniques. In the present investigation, the Levenberg–Marquardt algorithm is utilized under the Bayesian network.

3.1 Multiple-Regression Analysis

To develop a relationship between the single dependent variable, qu, and one or more independent variables ci, hi, and B, multiple-regression is used, and the prediction of qu is accomplished by the following equation:

$$ {\text{q}}_{\text{u}} = {\text{w}}_{ 1} {\text{c}}_{1} + {\text{w}}_{2} {\text{c}}_{2} + {\text{w}}_{3} {\text{c}}_{3} + {\text{w}}_{4} {\text{c}}_{4} + {\text{w}}_{5} {\text{h}}_{1} + {\text{w}}_{6} {\text{h}}_{2} + {\text{w}}_{7} {\text{h}}_{3} + {\text{w}}_{8} {\text{B}} + {\text{b}} $$

where wi are the regression weights and are processed in a way that limits the aggregate of the squared deviations. Here, b is the y-axis intercept. Multiple-regression is associated with multiple correlation coefficient, which is the variance of the dependent variable explained collectively by all of the independent variables. A multiple-regression model was obtained using Stata SE 12 (Fig. 2).

Fig. 2
figure 2

Output result from StataSE 12 for MLR model

The following formulas, using the MLR technique, were found to offer the best fitting:

$$ {\text{q}}_{\text{u}} = 1976.073{\text{D}} + 86.464{\text{h}} - 2833.27 $$

where qu (kPa) is the bearing capacity, s (m), and h(deg) is the angle of friction.

4 Performance Evaluations and Results

The performance of the model is being evaluated by assessing the degree to which the values obtained from the simulations match actual output. The assessment was performed in this study with various goodness-of-fit or correlation statistics. The values of the performance indices or measures for the three models are summarized in Table 2. The correlation statistic (R), which evaluates the linear correlation between the actual and predicted ultimate bearing capacity, is good for all the models, for calibration as well as validation data. The model efficiency that evaluates the performance of the model in predicting the ultimate bearing capacity values away from the mean is found to be more than 98% for calibration, and validation data for the ANNs model was close to where it was first referenced (Figs. 3 and 4) (Table 1).

Fig. 3
figure 3

Distribution of prediction error across different error thresholds

Fig. 4
figure 4

Measured versus predicted ultimate bearing capacity values

Table 1 Performance evaluation and comparison
figure a

5 Conclusion

In this study, artificial neural networks (ANNs) and equation developed by the multilinear regression model (MLR) were used to predict the bearing capacity of circular shallow footings on cohesionless soil. The artificial neural network model serves as a simple and reliable tool for the bearing capacity of shallow foundations in sandy soil. The results produced high coefficients of correlation for the trained and testing data sets, and the values are 0.99,308 and 0.98019, respectively, which have much more higher perfection then MLR model. Nevertheless, this study demonstrates that having some known parameters, we can easily be able to have an idea about the ultimate bearing capacity through the ANN model.