Keywords

1 Introduction

Types of settlement that are generally encountered by shallow foundations include immediate settlement, consolidation settlement and compression settlement, which solely depends on the soil type and conditions in which they are laid. Considering cohesion less soil, the major concern is the former said—immediate settlement. The latter two are mostly related to cohesive soil. Higher degree of permeability exhibited by cohesion less soils lay path for instant occurrence of settlement with applied load [1], which could affect the stability of structure in short notice leaving no time to execute remedial measures. Two major criteria that control the design of shallow foundations on cohesionless soils are bearing capacity and settlement of foundation. In terms of serviceability, excessive settlement is inacceptable. Consequently settlement is considered as the controlling criterion for the foundation design rather than bearing capacity, condition being critical when least width of footing is greater than 1 m [2]. Hence settlement prediction is a major concern and is an essential criterion in the shallow foundation design. Thus accurate estimate of shallow foundation settlement is indispensable.

For the prediction of shallow foundation settlement on cohesionless soils, researchers have introduced a fair number of theoretical, empirical, semi empirical and numerical methods. Most of these available methods incorporate several assumptions related to the factors that influence settlement of shallow foundations and hence simplifies the problem. Consequently, fails to achieve consistent success during settlement prediction. Recently soft computing techniques have emerged as tool for solving such highly complex, multivariate problems. One such technique is Artificial Neural Network (ANN). Although ANN rectifies many of the shortcomings of traditional methods it still poses some drawbacks such as overfitting, getting trapped in local minima etc. If a hybrid model of ANN—PSO is used these drawbacks could be sorted out and a better predictive model could be developed [3].

In this study PSO is used as the training technique for ANN instead of the usual backpropagation method. A total of 6 input parameters and one output parameter is used, the database for which is acquired from the literature. The prediction model is developed and its performance is analyzed using various fitness parameters.

2 Model Development

The methodology considered for developing the model includes model input and output determination, division of data followed by pre-processing, determination of appropriate network architecture, model training, and performance analysis.

2.1 Database

The initial step includes determination of parameters that influences settlement estimation. For this study SPT N value, footing dimensions, depth of water table below the footing and net applied pressure are considered as the input values whereas settlement is the single output value [4]. These data were collected from literature [5]. A total of 236 data were collected, reference to which is listed in Table 1.

Table 1 Reference for database

2.2 Data Division and Preprocessing

These dataset values are normalized using the equation given below [6].

$$~X_{n} = \frac{{X - X_{{{\text{min}}}} }}{{X_{{{\text{max}}}} - X_{{{\text{min}}}} }}$$
(1)

where

Xn:

Normalized value.

X:

Actual value.

Xmax:

Maximum value.

Xmin:

Minimum value.

After normalization the whole dataset is divided to obtain two subsets: training set and testing set. Random division is carried out for two subsets of 80 and 20% data. Data division is done in python using train_test_split function from scikit-learn library. Subset with 80% data comprises the training set whereas the second one with 20% data constitutes test set. It is essential that both of these subsets represent the same population and all the patterns present in the training set are available in testing set also [7]. Thus randomly different combinations are selected and then checked for its statistical consistency. This is continued until both the subsets are statistically consistent. The statistical parameters chosen include mean, standard deviation, maximum and minimum.

2.3 Development of Network

Network architecture can be determined by following a series of steps, starting with selection of number of hidden layers followed by fixing the number of nodes in each of these hidden layers. Studies have shown that one hidden layer is enough in a neural network to approximate any continuous function [8], thus for the present study, one hidden layer is used. Therefore a three layer architecture for neural network is finalised: input layer, one hidden layer and output layer. The number of input nodes is chosen as six which is equal to the number of input parameters and number of output neuron as one which is equal to the number of output parameters i.e. settlement. Caudell (1988) suggested that the maximum number of nodes possible in a single hidden layer model is given by (2I + 1), where I is the number of inputs [9]. Thus the upper limit for number of nodes in hidden layer is taken as 13. For the study 10 different values for number of nodes in hidden layer are considered.

In this study, optimum training of the network is performed using PSO. Here, the objective is to minimize the objective function, i.e., mean square error (MSE) of the network. The parameters associated with PSO are number of particles in the swarm (Swarm size), acceleration factor (c1 and c2) and inertia weight (w). The inertia weight (w) is generally assigned random value between 0 and 1. From various studies carried out, researchers have reported that usually when c1 takes a value within the range 1–2.5 and c2 value between 2 and 3, the network exhibits optimum performance [10]. In this study varying values of these parameters are selected, shown in Table 2, and a considerable number of runs are carried out to determine the optimum combination of network parameters with a higher R value and a lower MSE value. MATLAB R2020b is used for the network development. All the possible combinations of acceleration factors for each swarm size and number of neurons chosen are considered. Rather than adopting sensitivity analysis for determining optimum values of parameters, trial and error method is adopted. The former method has been reported ineffective.

Table 2 Values of different parameters considered for network development

2.4 Stopping Criteria

Stopping criteria considered [11]:

  • Maximum iteration: 20,000

  • If improvement in objective function for 100 successive iterations is below is 10–8.

3 Model Validation

After successfully training the model using PSO it is required to validate the performance of trained model. The major intention behind validating the model is to ensure that the model is not simply memorizing the relationship between input and output available in the training data but has the ability to generalize within the limits of training data. Model validation is carried out using test data.

4 Fitting Examination

The result of fitting examination of PSO-ANN model is given in Table 3. The best performing acceleration factor (c1 and c2) corresponding to a given number of neurons in the hidden layer and swarm size is depicted in Table 3. Results for n = 6, 8, 10, 11, 12 & 13 and swarm size = 10, 20, 30, 50 & 100 are reported. Corresponding prediction evaluating indices MSE and R value for training and the testing data-set has been also presented. More the R value closer to 1 and MSE value closer to 0, better will be the performance of model. Figure 1a, b shows actual and predicted settlement values of training and testing set respectively (Fig.1).

Table 3 Best R and MSE for varying number of hidden neurons
Fig. 1
figure 1

Actual and predicted settlement of a training and b testing set (actual value on x-axis and predicted value on y-axis)

Fig. 2
figure 2

MSE Variation with number of iterations

The variation of MSE with number of iterations is shown in Fig. 2. Upto 100 iterations MSE value showed significant change. Later on the value remained constant towards 10,000. Thus maximum iteration is chosen as 20,000. All the runs performed got completed before 20,000 iterations. Iteration number for the final model and the related details are given in Table 4.The highest and lowest value of R and MSE respectively are obtained for 13 number of hidden neurons, c1 = 1.3 and c2 = 2.25. Thus the predictive model is developed as a 6–13-1 network. Architecture of the neural network is shown in Fig. 3.

Fig. 3
figure 3

Architecture of neural network developed (N = SPT N value, B = Breadth of footing, L = Length of footing, D = Depth of footing, WT = Depth of water table below footing, q = Net applied pressure, S = Settlement)

Table 4 Description of the hybrid ANN PSO model developed

5 Performance Analysis

Root mean square error, RMSE and coefficient of determination, R2 are the commonly used parameters to evaluate the performance of model. In order to ensure the efficiency of prediction model, a number of fitness parameters has been considered for this study. Fitness parameters used to assess the model and their corresponding values obtained are given in Table 5.

Table 5 Performance parameters

Nash–Sutcliffe efficiency (NS) indicates the predictive power of the models. The predictive capacity increases as the NS value gets closer to 1. Here 0.909 is obtained for training set indicating good prediction. Root Mean Square Error (RMSE) closer or equal to 0 indicates marginal error in prediction. The obtained value is 0.345 which indicates marginal error. Variance Account Factor (VAF) equal to 100% indicates that model performance gives a reasonable result. Obtained value is 99.98% for training set which is much closer to 100%. R2 (Coefficient of determination) and Adj.R2 (adjusted determination coefficient) values should be closer to 1. Both closer to each other show that the model reflected most of the variability in soil parameters. Here values of R2 and Adj.R2 are closer to 1 and also closer to each other, 0.908 and 0.906 respectively are obtained.

6 Conclusion

From about more than 300 runs the optimum predictive model was developed as a 6–13-1 network ie, with 6 input parameters and 13 hidden neurons. c1 and c2 values for the model developed are 1.3 and 2.25 respectively. An R value of 0.953 for training set and 0.932 for testing set was obtained. MSE value of 0.119 m for training and 0.217 m for testing was obtained. All the other performance parameters also gave values proving the model to be good in prediction. Thus PSO –ANN hybrid model can be used to accurately predict the settlement of shallow foundations on cohesionless soil.