Abstract
Several bridges failed because of scouring and erosion around the bridge elements. Hence, precise prediction of abutment scour is necessary for the safe design of bridges. In this research, experimental and computational investigations have been devoted based on 45 flume experiments carried out at the NIT Warangal, India. Three innovative ensemble-based data intelligence paradigms, namely categorical boosting (CatBoost) in conjunction with extra tree regression (ETR) and K-nearest neighbor (KNN), are used to accurately predict the scour depth around the bridge abutment. A total of 308 series of laboratory data (a wide range of existing abutment scour depth datasets (263 datasets) and 45 flume data) in various sediment and hydraulic conditions were used to develop the models. Four dimensionless variables were used to calculate scour depth: approach densimetric Froude number (Fd50), the upstream depth (y) to abutment transverse length ratio (y/L), the abutment transverse length to the sediment mean diameter (L/d50), and the mean velocity to the critical velocity ratio (V/Vcr). The Gradient boosting decision tree (GBDT) method selected features with higher importance. Based on the feature selection results, two combinations of input variables (comb1 (all variables as model input) and comb2 (all variables except Fd50)) were used. The CatBoost model with Comb1 data input (RMSE = 0.1784, R = 0.9685, MAPE = 10.4724) provided better accuracy when compared to other machine learning models.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
Scour is a natural phenomenon that occurs in alluvial streams as a result of the erosive action of flowing water (Oliveto and Hager 2005; Eghlidi et al. 2020). Abutments are constructed near the stream banks for constructing and supporting a bridge. It is generally recognized that abutments are undermined by river-bed erosion and scouring, which are the leading causes of abutment failure (Pandey et al. 2020; Afzal et al. 2021). Natural processes or man-made interactions can cause local scour in streams (Barbhuiya and Dey 2004; Kothyari et al. 2007; Goyal and Ojha 2011). Abutments and other hydraulic structures are often failed by scour (Pandey et al. 2020). Earlier studies on scour around abutments tended to focus on the prediction of maximum, and time-dependent scour depth (Barbhuiya and Dey 2004; Kothyari et al. 2007). The Federal Highway Administration 1973 report illustrates that more than 400 bridges failed due to pier and abutment scour (Pandey et al. 2018). These collapsed bridges show the importance of realistic prediction of scour around the bridge elements (Kumar et al. 2022). Scour around the abutments in natural streams is still a matter of alarm, while numerous researchers and engineers have proposed numerous mathematical and numerical models using laboratory and field studies (Barbhuiya and Dey 2004; Dey and Barbhuiya 2005). Therefore, improving the abutment scour phenomenon and processes is vital to calculate the maximum abutment scour depth at equilibrium scour conditions. (Oliveto and Hager 2005) carried out an experimental study on abutment scour and stated that the minimum required laboratory dimensions to apply Froude number similarity. They also checked the influence of sloping abutments on scour. As long as the limitations of the computational approach are respected, the outcomes of their study could be applied in practice. (Dey and Barbhuiya 2005) completed an experimental flume study on abutment scour and derived a semi-empirical method to calculate the abutment scour around a short vertical wall, semicircular, and 45° wing wall abutments (length/flow depth \(\le\) 1). By considering the horse-shoe vortex system as the prime agent of abutment scour, they followed the conservation of sediment mass for analyzing the experimental data. The maximum scour depth at equilibrium conditions around abutments is key to the river and bridge engineers (Barbhuiya and Dey 2004).
The scour processes and flow patterns around the abutments are so complicated; thus, it is hard to derive a general empirical relationship (Azamathulla et al. 2010, 2013; Singh et al. 2020). The scour depth around the abutment can be calculated using numerous empirical relationships (Dey and Barbhuiya 2005; Mohammadpour et al. 2017). Each relationship yields good agreements with experimental values just for the specific datasets. Previous studies show the importance of the prediction of abutment scour precisely. Further, insufficient field data would lead to uncertainty of abutment scour equations (Mohammadpour et al. 2016).
With this in view, precise prediction of the abutment is cumbersome and needs to implement robust approaches like gene-expression programming (GEP), artificial neural network (ANN), generalized reduced gradient (GRG), genetic algorithm (GA), evolutionary polynomial regression, modern multi-level ensemble approach and adaptive neuro-fuzzy inference system (ANFIS) to predict abutment scour (Azamathulla et al. 2010; Mohammadpour et al. 2016; Mohammadpour 2017; Aamir and Ahmad 2019; Singh et al. 2022). Azamathulla et al. (Najafzadeh and Azamathulla 2013) applied soft computing approaches to calculate the pipeline scour and illustrated the best results with experimental datasets. (Mohammadpour et al. 2016) applied ANN and ANFIS to calculate the abutment scour. They stated that the ANN and ANFIS could be successfully used to predict the scour depth at different bridge elements.
In this study, the authors aim to examine the existing maximum scour depth data at equilibrium scour conditions for rectangular wall abutments. Existing abutment scour depth datasets (263 datasets) are collected from (Coleman et al. 2003) and (Dey and Barbhuiya 2005). Further, 45 flume experiments are carried out at the National Institute of Technology in Warangal, India. Previously proposed empirical relationships are also examined to analyze the performance based on available datasets (a total of 308 datasets). Although the applications of robust approaches in maximum scour depth prediction have been of interest to several investigators because of their accuracy and simplicity, no research work has been undertaken to develop a machine learning approach to predict the maximum scour depth at the bridge abutment. However, previous studies on other hydraulic structures scour have shown good agreements with observed values. Considering the significance of the concern of bridge abutment scour, which is mainly responsible for the scour hazards. An effort has been made to develop machine learning based models with a wide range of experimental data. The main objectives of this study are: 1- Application of the CatBoost model as new a machine learning approach to model abutment scour depth using a vast experimental database, 2- Compare developed model with common machine learning approaches such as K nearest neighbor and extra tree models. 3- Compare existing empirical equations with the machine learning models.
2 Methodology
2.1 Maximum Scour Depth Relationships
Numerous studies are available on abutment scour. Most studies have examined the maximum scour at equilibrium scour conditions. At equilibrium conditions, maximum scour depth is influenced by flow properties, sediment characteristics, and abutment geometry (Barbhuiya and Dey 2004; Bressan et al. 2011). The variables that influence the maximum abutment scour depth (dse) at equilibrium conditions in uniform sediment beds are as follows:
where dse is the maximum abutment scour depth, V is time-average flow velocity, y is the approach flow depth, \(\rho\) is the mass density of the fluid, \(\upsilon\) is the kinematic viscosity of fluid, d50 is the median diameter of sediment, Vc is the threshold velocity of sediment, \(\rho_{s}\) is the mass density of sediment, L is the transverse length of the abutment and g is the gravitational acceleration.
For sediment-fluid interaction, Eq. (1) should not contain independent parameters \(\rho ,\rho_{s} {\text{ and }}g\) (Dey and Barbhuiya 2005). (Dey and Barbhuiya 2005) gave a better representation of above mentioned parameters, and hence, Eq. (1) becomes
where, densimetric Froude number \(F_{{d_{50} }} = \frac{V}{{\sqrt {(S - 1)gd_{50} } }}\), S is the relative density of sediment.
Many researchers derived mathematical relationships that calculate the maximum abutment scour depth using various parameters (Kandasamy and Melville 1998; Melville and Coleman 2000). (Melville and Coleman 2000) proposed an abutment scour relationship on the basis of different empirical factors or K factors which show the influence of flow, sediment, and abutment characteristics. These K-factors can be calculated by the curve fitting method, and the maximum abutment scour depth (dse) expressed in terms of the product of K-factors is given as
where dse is the maximum abutment scour depth at equilibrium condition, Kd50 is the sediment gradation factor, KI is the flow intensity factor, Ks is the abutment shape factor, Kt is the time factor, Ky is the flow depth–abutment size factor, and \(K_{\alpha }\) is the abutment alignment factor. For vertical wall abutments, Ks and \(K_{\alpha }\) = 1.
K factors can be calculated using different empirical equations, given as
where Va is armor peak velocity.
Other abutment scour depth relationships for vertical-wall abutments under different regimes are given in Table 1.
2.2 Description of Collected Data from Literature and Present Experimental Work
A wide range of existing abutment scour depth datasets (263 datasets) have been collected from (Coleman et al. 2003) and (Dey and Barbhuiya 2005). In addition, 45 flume experiments have been carried out at the NIT Warangal, India. Additional tests were completed in a fixed bed masonry rectangular flume. Flume dimensions were 16.0 m long, 1.0 m wide, and 0.40 m deep. The flume's test section started 8.0 m from its entrance and had dimensions of 3.0 m long, 0.80 m wide, and 0.25 m deep. Uniform sand with a median diameter of 0.27 mm and a geometric standard deviation of 1.17 was used as the sediment bed. We used vertical wall abutment models with transverse lengths viz. 5.0, 6.8, 7.5, 8.4, 9.8, 10.6, 12.5, 15.0, 17.5 and 19.0 cm. All tests were conducted under non-submerged conditions. A valve was fixed into the flume inlet pipe to control the flowrate. At the downstream end of the flume, a tail gate and a pre-calibrated rectangular notch were fixed to maintain the flow depth and measure the flowrate, respectively. The experiments were performed until they reached an equilibrium scour, i.e., no change in scour geometry over time. All tests in this study were carried out for 20 h. Experimentally, it was observed that the maximum scour depth (dse) at equilibrium condition is located at the upstream nose of the abutment. The Vernier point gauge was used to determine the maximum abutment scour depth under equilibrium conditions. Due to the fact that all experiments were conducted in clear-water scour, the threshold velocity ratio was always less than one.
The vertical-wall abutment model was fixed in the test section at the right side of the flume prior to the start of the experiment (as can be seen in Fig. 1), i.e., located 9.5 m from the flume entrance. The sand bed was leveled perfectly with the bed slope and then covered with a 3 mm acrylic sheet to avoid unwanted scouring around the abutment. We achieved desired flow conditions using the inlet valve and flume tail gate. The acrylic sheet was sensibly removed after getting the desired flow conditions. Table 2 summarizes several parameters that influence the maximum abutment scour depth.
Table 3 summarizes the statistical analysis of all data utilized in this investigation. According to the values of kurtosis and skewness presented in Table 3, the densimetric Froude number (\({F}_{d50}\)) has an approximately normal distribution. However, other dimensionless input variables do not follow the normal distribution. The dimensionless output parameter (\({d}_{se}/L\)) also approximately follows the normal distribution.
Correlation and regression analyses should be utilized to determine which variables influence the target variable. Using Pearson correlation coefficients, Fig. 2 depicts the relationship between the dimensionless abutment scour depth (\({d}_{se}/L\)) and the input components. Correlation coefficients with a positive value suggest a direct association between two variables, while those with a negative value imply an inverse relationship. According to Fig. 2, the ratio of depth to transverse length of the abutment (\(y/L\)) with a correlation coefficient of + 0.75 is the most effective input variable in predicting scour. The densimetric Froude number (\({F}_{d50}\)) with a correlation coefficient of + 0.51 is the second effective variable. The lowest correlation coefficient + 0.4 belongs to the mean velocity to critical velocity ratio (\(V/{V}_{c}\)). The ratio of transverse length of the abutment to the diameter of sediment particles with a correlation coefficient of -0.45 has the opposite effect on scour depth, and with increasing it, the amount of scour decreases.
2.3 Gradient Boosting Decision Tree (GBDT) for Feature Selection
Many machine learning algorithms have been proposed and utilized to solve classification and regression problems over the year. But, the gradient boosting decision tree (GBDT) is one of the most popular algorithms for handling the classification and regression issues based on weak decision trees integration (Friedman 2002). In other words, the GBDT model is an ensemble of decision trees that assimilate a series of weak base learners with many leaf nodes and avoid the overfitting problem (Tao et al. 2022). The amount of error in each node is measured using the weak learners, and a test function is utilized for splitting the node (Fan et al. 2018). The comprehensive background of GBDT can be obtained from Friedman (2002). In this study, the GBDT algorithm is exploited for feature selection and to predict the abutment scour depth (ASD).
2.4 Machine Learning Approaches
2.4.1 Categorical Boosting (CatBoost)
CatBoost is a new machine learning algorithm that was exposed by Prokhorenkova et al. (2018) for dealing the categorical features. It is a subset of the gradient boosting decision tree (GBDT) family but is different in working style. CatBoost is more powerful than other machine learning algorithms, i.e., XGBoost (extreme gradient boosting) and LightGBM (gradient boosting machine)(Ke et al. 2017), in handling complex and noisy data. Recently, the CatBoost algorithm has been widely used in hydrological modeling like reference evapotranspiration estimation (Bian et al. 2020), pan-evaporation estimation (Dong et al. 2021), and prediction of flash flood susceptibility (Saber et al. 2021). The enhancement of CatBoost comprises the following aspects:
-
1.
Manage categorical features during the training period instead of pre-processing period. In training, the complete dataset is permitted by CatBoost. The Greedy target-based statistics (Greedy TBS) method is used for handling categorical features with the least information loss. Precisely, for each sample, a random permutation of the dataset was performed by CatBoost to calculate an average label value for the sample with the same category value positioned before the given one in the permutation. Assume a dataset of observations \(D=\left\{{X}_{i}, {Y}_{i}\right\} i=1,\dots ,n\) and if a permutation is \(\theta ={\left({\sigma }_{1},{\sigma }_{2},\dots ,{\sigma }_{n}\right)}_{n}^{T}\), it is changed with (Prokhorenkova et al. 2018):
$${x}_{{\sigma }_{p,k}}=\frac{\sum_{j=1}^{p-1} \left[{x}_{\sigma j,k}={x}_{{\sigma }_{p,k}}\right]\times {Y}_{{\sigma }_{j}}+\beta \times P}{\sum_{j=1}^{p-1} \left[{x}_{{\sigma }_{j,k}}={x}_{{\sigma }_{p,k}}\right]+\beta }$$(7)
Here, \(\beta\) = weight of prior, \(P\) = prior value. In the dataset, the prior is the average label value, which helps in reducing the low-frequency category noise.
-
2.
Feature combinations. CatBoost integrates all categorical features and their combinations in the current tree with all categorical features in the dataset using a greedy method.
-
3.
Unbiased boosting with categorical features. CatBoost used an ordered boosting method to overcome the gradient bias (Prokhorenkova et al. 2018) and improve the generalization ability of the model.
-
4.
Fast scorer. CatBoost uses oblivious trees as base predictions since they use the same splitting criterion across the tree's levels. These trees are less prone to overfitting and have a more balanced growth pattern. Each leaf index in oblivious trees is represented as a binary vector with a length equal to the tree's depth. This rule is employed to compute model predictions in CatBoost model evaluators because all binaries comprise float, statistics, and one-hot encoded features. Figure 3 shows the architecture of the CatBoost algorithm.
2.4.2 K-Nearest Neighbor (KNN)
K-nearest neighbor (KNN) is a non-parametric regression technique that was first proposed by Fix and Hodges (1951) for optimizing classification and prediction problems (Karlsson and Yakowitz 1987). The background history of the KNN exposes its effective application in hydrology (Sikorska-Senoner and Quilty 2021). In KNN, the independent variables (or predictors) are the input for the prediction objective.
2.4.3 Extra Tree Regression (ETR)
Geurts et al. (2006) proposed the idea of extra tree regression (ETR), which is a new ensemble machine learning model to perform regression or classification tasks based on many united decision trees (DT). A classical top-down procedure is used to construct the ETR model (Geurts et al. 2006). Many applications of the ETR model have been found in different fields (Heddam et al. 2020; Seyyedattar et al. 2020; Asadollah et al. 2021). The ETR is a highly randomized version of random forest (RF) with two main differences. The \(K\) (the number of features randomly nominated at each node), and \({n}_{min}\) (the minimum sample size for splitting a node) are the two main parameters of the ETR, which avoids the overfitting and enhance the prediction accuracy of the model (Asadollah et al. 2021). In this research, ETR model was developed in scikit-library of Python programming for ASD prediction by tuning its parameters.
2.5 Performance Indicators
he efficacy of the applied machine learning paradigms i.e., KNN, Extra Tree, and CatBoost, for predicting the abutment scour depth (ASD) was evaluated by employing the five different performance indicators, including MAPE (mean absolute percent error), RMSE (root mean square error), R (coefficient of correlation), IA (Willmott agreement index) (Willmott 1982), and U95% (uncertainty coefficient with 95% confidence level) (Patino and Ferreira 2015). The mathematical formulas of R, RMSE, MAPE, IA, and U95% indicators are listed as follows:
Here, \({ASD}_{meas,i}\) and \({ASD}_{pred,i}\) are measured and predicted abutment scour depth (ASD) values for ith data points, values; \(\overline{{ASD }_{meas}}\) and \(\overline{{ASD }_{pred}}\) are means of measured and predicted ASD values, \(SD\) is the standard deviation, and \(N\) is the total number of observations.
2.6 Model Development and Configuration
The data were normalized using the following equation to equalize the data scale:
where \({x}_{max}\) and \({x}_{min}\) denote the maximum and minimum values of the dataset used to generate the prediction models, respectively. Then 70% of the data for training and 30% of the data for evaluating the models' performance were considered test data sets.
The CatBoost model was used to predict scour depth around the bridge abutment in the present study. Two powerful models, including Extra Tree Regression (ETR) and KNN models, were used to compare the performance of the CatBoost model. The Gradient Boosting Decision Tree (GBDT) method was used to select the effective features in scouring prediction. Figure 4 shows the importance of each of the input variables in the scour estimation. According to Fig. 7, the \(y / L\) ratio is the most effective factor in predicting scouring, and Fd50 is the least important compared to other input variables. As a result, according to the feature selection results, two combinations, namely: comb1 (all variables (\(V/{V}_{c}\), \({F}_{d50}\), \(y/L\), \(L/{D}_{50}\))) and comb2 (all variables except Fd50 (\(V/{V}_{c}\), \(y/L\), \(L/{D}_{50}\))), were considered.
The proper adjustment of model parameters is one of the most important aspects of using machine learning models. Fine-tuning the parameters leads to higher accuracy. The grid search method was used to find the optimal value of machine learning model parameters. All models are implemented in a PC with an Intel-core i7-10750H 2.6 GHz processor and 32 GB of RAM. Extra Tree Regression (ETR) and KNN and the feature selection algorithm were developed in the Scikit-Learn package (Pedregosa et al. 2011), and the CatBoost package (Dorogush et al. 2018) in Python was used to implement the CatBoost model. In the CatBoost model, the four main learning rate (learning_rate), tree depth (depth), number of iterations (iterations), and L2 regularization (l2_leaf_reg) parameters must be set. In the present study, the range of learning rates [0.001–0.5], depth [2–20], iterations between [200–2000], and L2 regularization in the range [0.5–1.5] were considered. Table 4 presents the optimal parameters of the CatBoost model for the two input combinations, comb1, and comb2. In the Extra Tree model, there are two important parameters, the number of trees in the forest (n_estimators) and the maximum depth of the tree (max_depth), that need to be adjusted. The maximum depth of the tree was considered between [2–20], and the number of trees in the forest was set between [10–150]. Table 4 presents the optimal parameters of the ETR model for both input combinations. The only parameter of the KNN model is the number of neighbors, which is considered between [1–10], and its optimal values are presented in Table 4 for both input combinations. Figure 5 indicates the schematic flowchart of the present study.
3 Results and Discussion
In this research, a comprehensive ML-based investigation was performed on the normalized scour depth (\({d}_{se}/L\)) of the abutments in uniform bed based on four dimensionless input features, namely \(V/{V}_{c}\), \({F}_{d50}\), \(y/L\), and \(L/{D}_{50}\). The feature selection process was addressed to determine the most significant candidate input combinations using a tree-based, namely the Tree decision FS method. The outcomes of pre-processing indicate that two scenarios comprised all features (Comb1) and all features except the \({F}_{d50}\)(Comb2) were considered to feed the ML model to model the normalized scour depth around abutments. Table 5 summarizes the goodness-of-fit statistics of the simulation of the normalized scour depth at the abutment. According to Table 5, the CatBoost model in Comb 1 (comprised of all features), regarding the best accuracy in training (R = 0.9993, RMSE = 0.0290, MAPE = 0.5069%, and U95% = 0.0569) and testing (R = 0.9685, RMSE = 0.1784, MAPE = 10.4724%, and U95% = 0.0612) outperformed the Extra Tree (R = 0.9491, RMSE = 0.2236, MAPE = 11.3301%, and U95% = 0.0979 for the testing phase) and KNN models (R = 0.9251, RMSE = 0.2778, MAPE = 18.4478%, and U95% = 0.1503 for the testing phase). Moreover, in the second variant of input combination (i.e., Comb 2), the CatBoost model owing to the highest values of R = 0.9608 and IA = 0.9790 and least diagnostic metrics (RMSE = 0.1995, MAPE = 12.2251%, and U95% = 0.0759) in the testing stage had the best performance among three considered models followed by Extra Tree (R = 0.9378, RMSE = 0.2482, MAPE = 12.4341%, and U95% = 0.1193), and KNN (R = 0.9122, RMSE = 0.2936, MAPE = 21.3313%, and U95% = 0.9545) models, respectively. Overall, regarding the goodness-of-fit statistics reported in Table 5, it can be concluded that the predictive performance of Comb 1, including all input features, is superior to Comb 2.
For validation of the provided model to the estimation of the normalized scour depth at the abutment, several infographic tools, and diagnostic analyses were addressed in the forms of scatter plots, Rug-Histograms density distribution function, trend variation graphs, Taylor diagrams, and the violin plots of residual and relative deviation error.
Figure 6 demonstrates the scatter plots of Comb 1 to compare the predicted and measured values of normalized scour depth at the abutments. According to Fig. 6a, it can be seen that all three models in the training phase of Comb 1 have perfect performance regarding the best fitness with actual values of \({d}_{se}/L\). Although in the testing phase, the Catboost model representation with green color (R = 0.9685) led to the best agreement with actual values of \({d}_{se}/L\) compared with the Extra Tree (R = 0.9491) and KNN (R = 0.9251) approaches. Besides, in Comb 2, (Fig. 6b) the best compatibility between the predicted and measured \({d}_{se}/L\) is related to both ensemble-based ML models (CatBoost and Etra Tree) and KNN stands on the last rank of accurateness of simulation. A closer comparison of scatter plots indicates that the performance of all three methods in Comb 1 is better than Comb 2 due to the better alignment around the 45° line. In this stage of evaluating the capability of the models, the Taylor diagrams in Fig. 7 are employed to qualitatively assess the accuracy of the representatives of each model in comparison with the actual values, which criterion are the correlation coefficient and standard deviation (Xu et al. 2016). Based on Fig. 7, it can be seen that the representative of the CatBoost method for both combinations is located in the range of 0.95 to 0.99 from a smaller physical distance than other methods with a reference point. Although, the precision of Comb 1 for all three considered models is superior to Comb 2 for estimating the \({d}_{se}/L\) values. Figure 8 displaces the box plot of residual error in the testing stage, which reveals that the CatBoost models regarding the lowest residual error range in Comb 1 (1.08) and Comb 2 (1.37) yielded the most reliable outcomes than KNN (Comb 1|1.486 and Comb 2|1.575) and Extra Tree (Comb 1|1.634 and Comb 2|1.616). Overall, the diagnostic analyses evident that Comb 1 results in more accurate outcomes than Comb 2, and the Extra Tree as the second-best predictive model can be considered a reliable model for precise modeling of the \({d}_{se}/L\) values. Furthermore, the results presented in this research showed that the novel ensemble-based machine learning methods (CatBoost and Extra Tree) have a good performance in solving significant non-linear scour depth estimation problems at the hydraulic structures, which is also confirmed by previous research.
3.1 Extra Discussion and Comparison
Here, the performance of provided ML models in the superior candidate input combination is examined for the prediction of the normalized scour depth at abutments. Figure 9 displaces the trend variation of predicted \({d}_{se}/L\) values using CatBoost, Extra Tree, and KNN versus the measured \({d}_{se}/L\). Best performance in capturing non-linear behavior of the scour depth data points (\({d}_{se}/L\)) in the testing phase is related to the CatBoost method, and the Extra Tree and KNN methods are in the next ranks, respectively. Also, the residual distribution indicates that the KNN model has the highest oscillation among the three ML approaches. Figure 10 depicted the Rug-Histogram of the predicted normalized scour depth (\({d}_{se}/L\)) obtained by the CatBoost, Extra Tree, and KNN models vs. the measured \({d}_{se}/L\) for the superior input combination for whole data points. Here, the density distribution function of the CatBoost model appeared to be more matched with the measured \({d}_{se}/L\) in comparison with the Extra Tree and KNN models. Also, the greater conformity of the compression band of the predicted and actual data in the CatBoost model ascertains a better performance than the other methods in estimating scour depth at abutments.
3.2 Comparison of Previously Proposed Relationships and Present Approaches
A comparison has also been done using experimental and computed scour depths for previously proposed empirical relationships, viz. Melville and Coleman (Melville and Coleman 2000) and Dey and Barbhuiya (Dey and Barbhuiya 2005) and soft computing approaches viz. CatBoost, ETR, and KNN. Figure 11a illustrates scatter plots between observed and computed values of normalized maximum scour depth around the abutment with the error line bands of ± 20%. One can easily identify that novel ensemble-based data-intelligence paradigms, viz. CatBoost, ETR, and KNN predict abutment scour depth more precisely than empirical relationships. Melville and Coleman (Melville and Coleman 2000) equation overestimates the abutment scour depth and shows maximum error for used datasets, as can be seen in Fig. 11a. Figure 11b illustrates the variation between the error and total datasets. For CatBoost, ETR, and KNN, approximately 95% of the datasets were found to have less than ± 20% error than whereas more than 60% of datasets for Dey and Barbhuiya (Dey and Barbhuiya 2005) were found to have less than ± 30% error, while only 40% datasets for Melville and Coleman (Melville and Coleman 2000) were found to have less than ± 40% error, as can be seen in Fig. 11b.
4 External Validation
Tropsha et al. (Tropsha et al. 2003) established new criteria for model external validation. These criteria are derived from the known prediction performance of the model. The validation criteria and supporting data for the suggested models are summarized in Table 6. K and K' must be between 0.85 and 1.15, and m and n must be smaller than 0.1 to meet the requirements. For the test dataset, CatBoost's correlation coefficient (R) is 0.968, while for the training dataset, it is 0.999. CatBoost's m and n coefficients (n = -0.064 and m = -0.061 for the test dataset) are greater than those of the other models. As seen in Table 6, all three models in the training and testing datasets met all important criteria, demonstrating that the models' predictive quality and significant connection between goal and output are not coincidental.
5 Conclusion
The purpose of this work was to undertake a complete machine learning analysis to predict the scour depth around the bridge abutment. Three machine learning models were created for this purpose: CatBoost, Extra Tree Regression (ETR), and K-nearest neighbor (KNN). The models were developed using 308 samples series of laboratory data (a wide range of existing abutment scour depth datasets (263 datasets) and 45 flume experiments data at the NIT Warangal, India). Four dimensionless parameters including upstream densimetric Froude number (Fd50), the upstream depth (y) to abutment transverse length ratio (y/L), the abutment transverse length to the sediment mean diameter (\(L/d50\)), and the mean velocity to the critical velocity ratio (\(V/Vc\)) were considered as the model inputs, and the normalized scour depths (\({d}_{se}/L\)). Based on the GBDT feature selection method, two combinations: comb1 (\(V/{V}_{c}\),\({F}_{d50}\),\(y/L\), \(L/{D}_{50}\)) and comb2 (\(V/{V}_{c}\),,\(y/L\), \(L/{D}_{50}\)), were selected as input for models. The results of this study showed that the use of input combination 1 (comb1), which includes all input variables, provided more accurate results. Comb1 has provided better results in the test phase in all three models used. The CatBoost model performed best in predicting scour depth in both input combinations 1 and 2 (RMSE = 0.1784 and R = 0.9685 for comb1 and RMSE = 0.1995 and R = 0.9608 for comb2). In the ET model, input combination 1 (RMSE = 0.2236 and R = 0.9491) also performed better than input combination 2 (RMSE = 0.2482 and R = 0.9378). The ET model performed worse than the CatBoost. The KNN model has the weakest results among the models used (RMSE = 0.2778 and R = 0.9251 for comb1 and RMSE = 0.2936 and R = 0.9122 for comb2). Additionally, a comparison of purposed intelligent models to prior empirically-based research demonstrates the superiority of all established machine learning models. Finally, external validation established that all prediction techniques were constrained to values of\(0.85<K, {K}^{^{\prime}}<1.15\), and\((m, n)< 0.1\). The performance of the created model on test data reveals its ability to generalize effectively.
Data Availability
Data are available from the corresponding author upon reasonable request.
References
Aamir M, Ahmad Z (2019) Estimation of maximum scour depth downstream of an apron under submerged wall jets. J Hydroinformatics 21:523–540
Afzal MS, Holmedal LE, Myrhaug D (2021) Sediment transport in combined wave–current seabed boundary layers due to streaming. J Hydraul Eng 147:4021007
Asadollah SBHS, Sharafati A, Motta D, Yaseen ZM (2021) River water quality index prediction and uncertainty analysis: a comparative study of machine learning models. J Environ Chem Eng 9:104599. https://doi.org/10.1016/j.jece.2020.104599
Azamathulla HM, Cuan YC, Ghani AA, Chang CK (2013) Suspended sediment load prediction of river systems: GEP approach. Arab J Geosci 6:3469–3480. https://doi.org/10.1007/s12517-012-0608-4
Azamathulla HM, Ghani AA, Zakaria NA, Guven A (2010) Genetic programming to predict bridge pier scour. J Hydraul Eng 136:165–169. https://doi.org/10.1061/(ASCE)HY.1943-7900.0000133
Barbhuiya AK, Dey S (2004) Local scour at abutments: a review. Sadhana - Acad Proc Eng Sci 29:449–476. https://doi.org/10.1007/BF02703255
Bian C, He H, Yang S, Huang T (2020) State-of-charge sequence estimation of lithium-ion battery based on bidirectional long short-term memory encoder-decoder architecture. J Power Sources 449:227558. https://doi.org/10.1016/j.jpowsour.2019.227558
Bressan F, Ballio F, Armenio V (2011) Turbulence around a scoured bridge abutment. J Turbul 12:1–24. https://doi.org/10.1080/14685248.2010.534797
Coleman SE, Lauchlan CS, Melville BW (2003) Développement de l’affouillement en eau claire aux butées de pont. J Hydraul Res 41:521–531. https://doi.org/10.1080/00221680309499997
Dey S, Barbhuiya AK (2005) Time variation of scour at abutments. J Hydraul Eng 131:11–23. https://doi.org/10.1061/(ASCE)0733-9429(2005)131:1(11)
Dong L, Zeng W, Wu L et al (2021) Estimating the pan evaporation in Northwest China by coupling catboost with bat algorithm. Water 13:256. https://doi.org/10.3390/w13030256
Dorogush AV, Ershov V, Gulin A (2018) CatBoost: gradient boosting with categorical features support. arXiv Prepr arXiv181011363
Eghlidi E, Barani G-A, Qaderi K (2020) Laboratory investigation of stilling basin slope effect on bed scour at downstream of stepped spillway: physical modeling of javeh RCC dam. Water Resour Manag 34:87–100
Fan J, Yue W, Wu L et al (2018) Evaluation of SVM, ELM and four tree-based ensemble models for predicting daily reference evapotranspiration using limited meteorological data in different climates of China. Agric for Meteorol 263:225–241. https://doi.org/10.1016/j.agrformet.2018.08.019
Fix E, Hodges JL (1951) Discriminatory Analysis. Nonparametric Discrimination: Consistency Properties. USAF Sch Aviat Med Randolph Field, Texas. https://doi.org/10.2307/1403797
Friedman JH (2002) Stochastic gradient boosting. Comput Stat Data Anal 38:367–378
Geurts P, Ernst D, Wehenkel L (2006) Extremely randomized trees. Mach Learn 63:3–42
Gill MA (1972) Erosion of sand beds around spur dikes. J Hydraul Div 98(9):1587–1602
Goyal MK, Ojha CSP (2011) Estimation of scour downstream of a ski-jump bucket using support vector and M5 model tree. Water Resour Manag 25:2177–2195
Heddam S, Ptak M, Zhu S (2020) Modelling of daily lake surface water temperature from air temperature: Extremely randomized trees (ERT) versus Air2Water, MARS, M5Tree, RF and MLPNN. J Hydrol 588:125130. https://doi.org/10.1016/j.jhydrol.2020.125130
Kandasamy JK, Melville BW (1998) Maximum local scour depth at bridge piers and abutments. J Hydraul Res 36:183–198. https://doi.org/10.1080/00221689809498632
Karlsson M, Yakowitz S (1987) Nearest-neighbor methods for nonparametric rainfall-runoff forecasting. Water Resour Res 23:1300–1308. https://doi.org/10.1029/WR023i007p01300
Ke G, Meng Q, Finley T et al (2017) LightGBM: A highly efficient gradient boosting decision tree. In: Advances in Neural Information Processing Systems. pp 3149–3157
Kothyari UC, Hager WH, Oliveto G (2007) Generalized approach for clear-water scour at bridge foundation elements. J Hydraul Eng 133:1229–1240. https://doi.org/10.1061/(ASCE)0733-9429(2007)133:11(1229)
Kumar L, Afzal MS, Ahmad A (2022) Prediction of water turbidity in a marine environment using machine learning: A case study of Hong Kong. Reg Stud Mar Sci 52:102260
Laursen EM (1963) An Analysis of Relief Bridge Scour. J Hydraul Div 89:93–118. https://doi.org/10.1061/JYCEAJ.0000896
Liu H-K, Chang FM, Skinner MM (1961) Effect of bridge constriction on scour and backwater. CER; 60–22
Melville BW, Coleman SE (2000) Bridge scour. Water Resources Publication
Mohammadpour R (2017) Prediction of local scour around complex piers using GEP and M5-Tree. Arab J Geosci 10. https://doi.org/10.1007/s12517-017-3203-x
Mohammadpour R, Ab. Ghani A, Zakaria NA, Mohammed Ali TA (2017) Predicting scour at river bridge abutments over time. Proc Inst Civ Eng Water Manag 170:15–30. https://doi.org/10.1680/jwama.14.00136
Mohammadpour R, Ghani AA, Vakili M, Sabzevari T (2016) Prediction of temporal scour hazard at bridge abutment. Nat Hazards 80:1891–1911. https://doi.org/10.1007/s11069-015-2044-8
Najafzadeh M, Azamathulla HM (2013) Group method of data handling to predict scour depth around bridge piers. Neural Comput Appl 23:2107–2112. https://doi.org/10.1007/s00521-012-1160-6
Oliveto G, Hager WH (2005) Further results to time-dependent local scour at bridge elements. J Hydraul Eng 131:97–105. https://doi.org/10.1061/(ASCE)0733-9429(2005)131:2(97)
Pandey M, Sharma PK, Ahmad Z, Karna N (2018) Maximum scour depth around bridge pier in gravel bed streams. Nat Hazards 91:819–836. https://doi.org/10.1007/s11069-017-3157-z
Pandey M, Valyrakis M, Qi M et al (2020) Experimental assessment and prediction of temporal scour depth around a spur dike. Int J Sediment Res:1–13. https://doi.org/10.1016/j.ijsrc.2020.03.015
Patino CM, Ferreira JC (2015) Confidence intervals: a useful statistical tool to estimate effect sizes in the real world. J Bras Pneumol. https://doi.org/10.1590/s1806-37562015000000314
Pedregosa F, Varoquaux G, Gramfort A et al (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830
Prokhorenkova L, Gusev G, Vorobev A et al (2018) Catboost: Unbiased boosting with categorical features. In: Advances in Neural Information Processing Systems. pp 6637–6647
Saber M, Boulmaiz T, Guermoui M et al (2021) Examining LightGBM and CatBoost models for wadi flash flood susceptibility prediction. Geocarto Int:1–26. https://doi.org/10.1080/10106049.2021.1974959
Seyyedattar M, Ghiasi MM, Zendehboudi S, Butt S (2020) Determination of bubble point pressure and oil formation volume factor: extra trees compared with LSSVM-CSA hybrid and ANFIS models. Fuel 269:116834. https://doi.org/10.1016/j.fuel.2019.116834
Sikorska-Senoner AE, Quilty JM (2021) A novel ensemble-based conceptual-data-driven approach for improved streamflow simulations. Environ Model Softw:105094. https://doi.org/10.1016/j.envsoft.2021.105094
Singh RK, Pandey M, Pu JH et al (2020) Experimental study of clear-water contraction scour. Water Supply 20:943–952. https://doi.org/10.2166/ws.2020.014
Singh UK, Jamei M, Karbasi M et al (2022) Application of a modern multi-level ensemble approach for the estimation of critical shear stress in cohesive sediment mixture. J Hydrol 607:127549. https://doi.org/10.1016/j.jhydrol.2022.127549
Sturm TW, Janjua NS (1994) Clear-water scour around abutments in floodplains. J Hydraul Eng 120:956–972
Tao H, Salih S, Oudah AY et al (2022) Development of new computational machine learning models for longitudinal dispersion coefficient determination: case study of natural streams, United States. Environ Sci Pollut Res. https://doi.org/10.1007/s11356-022-18554-y
Tropsha A, Gramatica P, Gombar VK (2003) The importance of being earnest: validation is the absolute essential for successful application and interpretation of QSPR models. QSAR Comb Sci 22:69–77
Willmott CJ (1982) Some comments on the evaluation of model performance. Bull Am Meteorol Soc 63:1309–1313
Xu Z, Hou Z, Han Y, Guo W (2016) A diagram for evaluating multiple aspects of model performance in simulating vector fields. 4365–4380. https://doi.org/10.5194/gmd-9-4365-2016
Author information
Authors and Affiliations
Contributions
Manish Pandey: Conceptualization, formal analysis, writing up the manuscript; Masoud Karbasi: Validation, investigation, software, writing up the manuscript; Mehdi Jamei: Conceptualization, formal analysis, visualization, writing up the manuscript, Supervision; Anurag Malik: Formal analysis, writing up the manuscript and manuscript editing; Jaan H. Pu: Investigation, and writing up the manuscript, supervision.
Corresponding authors
Ethics declarations
Ethical Approval
The manuscript is conducted within the ethical manner advised by the water resource management journal.
Consent to Participate
Not applicable.
Consent to Publish
The research is scientifically consented to be published.
Competing Interests
The authors declare that he has no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Pandey, M., Karbasi, M., Jamei, M. et al. A Comprehensive Experimental and Computational Investigation on Estimation of Scour Depth at Bridge Abutment: Emerging Ensemble Intelligent Systems. Water Resour Manage 37, 3745–3767 (2023). https://doi.org/10.1007/s11269-023-03525-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11269-023-03525-w