1 Introduction

Rock-socketed piles use a combination of end bearing, shaft resistance, and a hybrid of these two systems to protect against imposed pressures. The use of piles in rock formations has been proposed as a potentially optimal arrangement in scenarios where an unsteady soil foundation overlays bedrock at a shallow depth (Sarkhani Benemaran et al. 2022a). Even if there is minimal pile displacement in these circumstances, the structural frame’s resistance may impact the extraordinary load-carrying capacity (Carrubba 1997). Requirements for socketed piles are rising since there is a rising requirement to improve piles’ structure and design efficiency. Both practical and mathematical methods impact the design of rock-socketed piles. A thorough modeling examination is, thus, planned. The findings show that both methods are suitable for building rock-based socketed piles (Ng et al. 2001).

The application of machine learning-based algorithms to estimate various properties in civil engineering was reported successfully recently (Sarkhani Benemaran et al. 2022b, 2024; Sarkhani Benemaran and Esmaeili-Falak 2023; Esmaeili-Falak and Sarkhani Benemaran 2024). Nevertheless, due to the intricate nature of the pile’s motion, no more dependable alternatives provide inaccurate estimates. Given the significance of pile settlement (\(SP\)) in pile architecture, a novel approach using soft computing techniques is presented to anticipate SP (Carrubba 1997).The precision of the input data plays a vital role in determining the effectiveness of estimating heaps. The following statements aim to reassess the relevant literature to determine the suitable input factors for predicting the progress of \(SP\). Based on an academic publication, several parameters have been identified as influential in the settling of the pile’s foundations, such as the magnitude of the applied force, the length of the pile, the modulus of shear of the soil, the width of the pile, and the angle at which the shear force becomes insignificant (Randolph and Wroth 1978).Within the framework of this specific geological creation, prior research has found that the unconfined compressive strength (\(UCS\)) of rocks acts a pivotal role in ascertaining the capacity of the pile, therefore influencing the \(SP\) (Tirant 1992). The information analysis approach was expanded to include the forecasting of foundation settling by including traditional penetration test findings. The present model was enhanced with around 1000 estimations from other articles, like ground data (Nejad et al. 2009). A neural network-based approach was developed to forecast the settlement of raft foundations on non-cohesive soils. The assessment or ground assessment included soil properties, footing measurements, and strengthening criteria (Soleimanbeigi and Hataf 2006). The evidence indicates that several important variables are critical in determining the settling of rock-socketed piles. These variables are the rock’s \(UCS\), the length-to-diameter ratio, the pile power, the proportion of the pile’s length in the soil to its length in the rock, and amount of \({N}_{SPT}\) (Shahin et al. 2002). Data mining techniques are expanded algorithms to address geotechnical issues that are discussed in many works (Momeni et al. 2015; Armaghani et al. 2016). To accurately forecast the depth of scouring around bridge piers (Najafzadeh and Barani 2011), as well as the formation of scour pile groups resulting from wave action (Najafzadeh and Azamathulla 2015), it is important to possess a comprehensive understanding of the cohesive properties of the soil (Najafzadeh et al. 2013). The genetic algorithm and genetic programming, also referred to as gene expression programming (\(GEP\)), are widely acknowledged as predictive approaches in the field such as liquefaction (Raja et al. 2023a). Through the use of approaches such as \(GEP\), it becomes feasible to create an innovative solution by building a linkage between the input variables and their related output. Several analytic methodologies, including multivariate adaptable regression spline (\(MARS\)), Gaussian method regression, and minimax probability machine regression, have been extensively and correctly used in prior research endeavors (Samui 2019; Le and Le 2021). The systematic application of \(GEP\) has facilitated the development of effective resolutions for several challenges encountered in geotechnical engineering (Dindarloo 2015). \(GEP\) was taken into consideration while trying to determine the axial ability of the piles (Alkroosh and Nikraz 2011). The authors proposed a unique description using the \(GEP\) technique to approximate the deformation moduli of soil (Mollahasani et al. 2011). Additional research was undertaken utilizing three different algorithms, namely the multilayer perceptron, supporting vector machine, and \(GEP\), to forecast the \(UCS\) of rock (Dindarloo 2015). The \(GEP\) approach is used to ascertain the deformation modulus of a stratified sedimentary rock creation (Alemdag et al. 2016).

A novel methodology has been devised to forecast \(SP\) using \(GEP\) in conjunction with multiple linear regression. To receive this data, a comprehensive study was performed on a substantial number of rock samples that were gathered from the project situated in Malaysia. Results suggest a strong probability of achieving success in the \(GEP\) analysis, as demonstrated by \({R}^{2}\) coefficients of 0.872 and 0.861 for the training and testing records, respectively. The root mean square error (\(RMSE\)) values were around 1.29 and 1.65 for the training and testing, respectively, further demonstrating the effectiveness of this method for forecasting \(SP\) (Armaghani et al. 2018). A model derived from a soft computing technique linked to swarming optimization was the goal of several investigations to anticipate \(SP\). The probability stage of the suggested hybrid algorithm’s performance in forecasting pile settlement was shown to have coefficients of determination that had been in the train 0.851 and test 0.079 (Armaghani et al. 2020).

Research uses an optimized Radial Basis Function Neural Network (RBFNN) to achieve its goals. This method is used to evaluate how well the chosen methods perform in the setting of this study. \(RBFNN\) have been proven to be useful in a variety of scientific and technological contexts in several types of research. An effective approach for rainfall predicting involves the use of a hybrid \(RBFNN\) that has been tuned using both Particle Swarm Optimization (PSO) and genetic algorithm techniques (Wu et al. 2015). The allocation of resources has been achieved using quantum PSO and \(RBF\) neural network techniques (Xu et al. 2016). The use of \(RBF\) neural networks in conjunction with the cuckoo search algorithm has been employed to accomplish the modeling of overhead crane systems (Zhu and Wang 2017). The computational analysis of a competitive swarm optimized \(RBF\) neural network has been important in enhancing the accuracy of short-term solar energy generation forecasts. Furthermore, optimizing the \(RBFNN\) using a sine–cosine method has successfully achieved sonar goal sorting.

Researchers in the study hoped to get an understanding of how to use \(SVR\) to predict pile settlement. They may improve the accuracy of the prediction with the addition of meta-heuristic approaches like the Arithmetic Optimization Algorithm (\(AOA\)) or the Grasshopper Optimization Algorithm (\(GOA\)). When compared to the \(SVR-GOA\mathrm{^{\prime}}s\) \(R\) value of 0.99, the \(SVR-AOA\mathrm{^{\prime}}s\) ratio of 0.994 is within a reasonable margin of error (Ge et al. 2023). To precisely calculate the vertical motions of piles under both dynamic and static loading conditions, researchers have turned to using Artificial Neural Networks (\(ANN\)). Using a \(RBFNN\), the Equilibrium Optimizer Algorithm (\(EOA\)), and the \(GOA\), the ideal number of neurons in the hidden layer was calculated. After running simulations, the \(RMSE\) error ratios for \(RBF-GOA\) (0.6312) and \(HRBF-EOA\) (0.5947) were found (Jiang 2022). To find the right number of neurons in hidden layers, this research suggests combining the Hybrid Radial Basis Function (\(HRBF\)) neural network method with the \(AOA\) and Biogeography-Based Optimization (\(BBO\)). The Klang Valley transportation network, including the Mass Rapid Transit system in Malaysia, was the primary subject of the research, emphasizing assessing the settling of piles and ground structures. For this study, we used the \(HRBF-AOA\) and \(HRBF-BBO\) scenarios. Throughout making predictions, the \(R\) values were 0.9825 for \(HRBF-AOA\) and 0.9724 \(HRBF-BBO\) (Zhang et al. 2022). Multivariate adaptive regression splines (\(MARS\)) were proposed as a method of forecasting pile settlement at varying degrees of interface. Results show that the order 3 \(MARS\) framework performs better than the others, with performance index (\(PI\)) levels of training (0.0541) and testing (0.0951) [49], utilizing the \(PI\) as a holistic evaluation of network efficiency (Zuo 2022).

1.1 The objective of this study

The novelty of your study lies in several aspects:

  • Integration of \(ANFIS\) with optimization algorithms: Combining the Adaptive Neuro-Fuzzy Inference System (\(ANFIS\)) with optimization algorithms like the Equilibrium Optimizer (\(EO\)), the Black Widow Optimization Algorithm (\(BWOA\)), and \(PSO\) is innovative. This integration can potentially improve the accuracy and efficiency of predicting pile settlement in rock formations compared to using \(ANFIS\) alone or other traditional methods.

  • Specific application to rock settlement prediction: This study focuses on predicting the settlement of piles in rocks, which is a critical issue in geotechnical engineering. While there are various methods for predicting pile settlement, applying \(ANFIS\) and optimization algorithms specifically to rock formations is relatively novel.

  • Use of local data: This study utilizes data from the Klang Valley Mass Rapid Transit (\(KVMRT\)) project in Kuala Lumpur, Malaysia. Using local data is important because geotechnical conditions can vary significantly from one location to another. Applying these methods to a specific regional project adds a practical and site-specific dimension to the research.

  • Selection of key input parameters: Your study identifies and uses five key input parameters, including the ratio of pile length in different layers, total length-to-diameter ratio, uniaxial compressive strength, standard penetration test results, and final bearing capacity. The selection of these parameters and their combination within the \(ANFIS\) framework represents a unique approach to solving the problem of pile settlement prediction in rocks.

  • Practical implications: The study’s findings can have practical implications for engineering and construction projects in regions with similar geological conditions to Kuala Lumpur. Accurate prediction of pile settlement is crucial for ensuring the stability and safety of structures, and this research could potentially contribute to more reliable foundation design practices.

2 Methodology

2.1 Dataset and pre-processing

Creating a suitable dataset with extremely efficient dependent variables is the first step in constructing a model for forecasting. The most influential factors on a model’s results should be described. The mentioned tests were conducted with a pile evaluation instrument developed by Pile Dynamic, Inc. The rate at which the pile undergoes distortion is governed by its dimension and length, as discussed earlier. This article utilized the dataset related to the Klang Valley Mass Rapid Transit (\(KVMRT\)) project in Malaysia, where the location and circumstances of the project are provided in Fig. 1 (Armaghani et al. 2020). So, the present research looks at the ratio of the length of the pile in the soil layer to the pile length in the stone layer (\({L}_{s}/{L}_{r}\)) and the ratio of total pile length to pile width (\({L}_{p}/D\)) to figure out how pile shape affects things. Moreover, the \(UCS\) was added as the system’s input for the purpose of forecasting pile collapse, owing to its considerable importance. The metric \({N}_{SPT}\) has been generally acknowledged as a significant factor in assessing the condition of the layer of soil. The input for the entire pile-bearing ability (\({Q}_{u}\)) was also supplied since the force placed on the pile directly affects sinking. To assess the settling of the pile (\(SP\)), five input factors were applied. According to the literature (Aghayari Hir et al. 2022; Bardhan et al. 2022; Raja et al. 2023b; Sarkhani Benemaran 2023), several ratios could be considered for dataset division into training and testing portions, such as 70/30, 80/20, 75/25, and 90/10. In the present study, the models were developed using all of these ratios, and the best ratio of 75% / 25% was selected for phases. Table 1 provides an overview of the inputs and outputs used in each model employed for this study, along with their corresponding intensities. Furthermore, Fig. 2 presents a visual representation of the bar chart and lognormal distribution of inputs and output.

Fig. 1
figure 1

Location of the \(KVMRT\) project

Table 1 The statistical values of the input and output variables
Fig. 2
figure 2

The lognormal distribution of parameters in the train and test phases

The Pearson correlation coefficient (Eq. 1) is used to assess the magnitude and degree of the linear connection between two discrete parameters. It measures how effectively a straight line can capture the connection between the two parameters. This statistic has a value that ranges from − 1 to 1. A complete positive linear connection is shown by a coefficient of + 1, which means that if one variable rises, the other rises correspondingly as well. A complete negative linear connection, as shown by a coefficient of − 1, means that when one variable rises, the other falls proportionately. There is no linear link between the two variables, as shown by a correlation coefficient of 0. The findings show that (Fig. 3), with minor exceptions, the majority of the variables show strong linear relationships. In addition to this, a 0.78 positive connection between \({L}_{s} / {L}_{r}\) and \({L}_{p} / D\) was found. In terms of negative values, \(UCS\) and \(SP\) were separated by -0.753.

$${\upgamma }_{z,t} = \frac{{{\text{covariance}}\;\left( {{\text{z}},{\text{t}}} \right)}}{{{\text{standard}}\;{\text{deviation}}\;\left( {\text{z}} \right) \times {\text{standard}}\;{\text{deviation}}\;\left( {\text{t}} \right)}}$$
(1)
Fig. 3
figure 3

The Pearson correlation analysis

2.2 Equilibrium optimizer (\({\varvec{E}}{\varvec{O}}\))

The unique meta-heuristic algorithm known as \(EO\) was introduced by Faramarzi et al. (Faramarzi et al. 2020) as a means to address optimization challenges. This method draws inspiration from the mass balancing mechanism used in control-level systems. \(EO\) has commendable efficacy in addressing evaluate optimization problems with dimensions of up to 200. \(EO\) includes three important facets: the process of initialization involves setting up the balance pool, which consists of four members and their average position. In addition, the updating method incorporates an exponential part and creation ratio. The fundamental principles of \(EO\) are presented in the following manner. During the initialization step, the locations of search agents, often referred to as particles, are randomly created inside the search space. This is typically represented as:

$${X}_{i}^{initial} = {l}_{b} + {r}_{ini}({u}_{b} - {l}_{b})$$
(2)

The initialization location of the \(i-th\) particle, denoted as \({X}_{i}^{initial}\), is determined by a random vector, \({r}_{ini}\), which is created from the interval (0,1). \({l}_{b}\) and \({u}_{b}\) respectively, signify the minimum and maximum values for the design parameter.

The balanced pool of evolutionary optimization (\(EO\)) comprises many agents of search that are in close proximity to the near-optimal answer. The balanced pool consists of five candidates, namely four elements with the highest fitness values and a fifth element determined by averaging the positions of the four best-performing elements. Potential choices inside the balance pool are first produced during the startup step and then adjusted after every repetition during the optimization stage.

$${X }_{eq,pool}= \{{X}_{eq(1)}, {X}_{eq(2)},{ X}_{eq(3)}, {X}_{eq(4)},{ X }_{eq(ave)}\}$$
(3)

The symbol \({X }_{eq,pool}\) denotes a balance pool, whereas \({X}_{eq(i)} (i = 1, 2, 3, 4)\) indicates the four potential candidates with the highest fitness values amid the elements. In addition, \({X }_{eq(ave)}\) refers to the average position of the four best-performing particles seen so far. In each cycle, the optimal particle to update the locations is selected randomly from the balanced pool with the same chance.

The optimization method involves updating the balance pool as well. During the update section, the \(EO\) framework adheres to two primary rules to follow: the focus updating rule, which is regulated by the exponential element (\(F\)), and the equilibrium state rule, which is governed by the creation ratio (\(G\)). The exponential element (\(F\)) has been officially defined as:

$$F = {a}_{1}sign({r}_{1} - 0.5)[exp( - {r}_{2}t) - 1 ]$$
(4)

The variable \({a}_{1}\), as stated by Faramarzi et al. (Faramarzi et al. 2020), is set to a value of 2 to regulate the investigation capability. The variables \({r}_{1}\) and \({r}_{2}\) represent random vectors inside the interval (0,1), whereas t denotes the ratio of \(EO\) and can be mathematically written as:

$$t = {(1 - T/{M}_{iter})}^{(T/{M}_{iter})}$$
(5)

where \(T\) is the current iteration and \({M}_{iter}\) represents the maximum number of iterations. The exploitation potential of \(EO\) may be regulated by adjusting the generation ratio (\(G\)), which is defined as:

$$G = - P( {X}_{eq} - {r}_{2}X(T) )F$$
(6)
$$P =\left\{\begin{array}{l}0.5{r}_{dl}.u {r}_{d2}\ge GP\\ 0 \ {r}_{d2}<GP\end{array}\right.$$
(7)

Let \({X}_{eq}\) represent the candidate chosen from the balance pool as defined in Eq. (3).\(X(T)\) denotes the present location. The equation for \(GP\) may be expressed as \(GP = 0.5{r}_{d1}\), where \({r}_{d1}\) and \({r}_{d2}\) are randomly generated values between 0 and 1. In addition, \(u\) represents the unit vector. The updating method of search tools in \(EO\) is stated as follows:

$$X(T + 1) = {X}_{eq} + ( X(T) - {X}_{eq} )F + (1 - F)G/{r}_{3}V$$
(8)

The new location, denoted as \(X(T + 1)\), is determined by a random vector, \({r}_{3}\), which is uniformly distributed across the range \((0, 1)\). In addition, \(V\) denotes a unit. In Formula (8), the determination of the equilibrium concentration is mostly governed by the first term. At the same time, the second term predominantly influences searchability, while the third term primarily affects the exploitation ability. The procedure of \(EO\) algorithm is provided step by step in Algorithm. 1.

Algorithm 1
figure a

EO's steps Faramarzi et al. (2020)

2.3 Black widow optimization algorithm (\({\varvec{B}}{\varvec{W}}{\varvec{O}}{\varvec{A}}\))

Widows often inhabit forested areas as well as several other regions around the world. One special kind of widow spider is often referred to as the black widow, characterized by its distinctive behavioral patterns. The black widow spider often maintains a concealed presence during daylight hours, reserving its activity for the nighttime when it retreats to its web. The female black widow spider uses pheromones to attract the male counterpart by marking certain areas of her web during the mating process (El-Fergany 2018). By reducing the size of the web, the first male widower to enter the arena is able to steal attractive females from their male rivals. In this particular reproductive behavior, the female feeds the male partner in its entirety, afterward assuming the responsibility of carrying the eggs inside a specialized egg sac. Following the process of egg marking, the offspring partake in a period of sibling living together, during which they temporarily reside inside their mothers' web. This arrangement persists to the point when the offspring may even feed their moms. Consequently, the widows who had strength and physical fitness were more likely to succeed in this regular pattern. The duration of the life of the widow is considered to be a very effective approach for developing an optimal algorithm.

2.3.1 A novel black widow optimization algorithm

In a manner similar to previous evolutionary algorithms, the first phase of the innovative optimization process involves the establishment of an initial population of spiders, and each spider represents a potential solution. The primary objective of these female spiders, in conjunction with their male counterparts, is to ensure the successful reproduction and continuation of their lineage. During the mating period or subsequent to it, the female widow spider engages in cannibalistic behavior by consuming her male partner. Eventually, she transfers preserved sperm into her sperm storage organs and subsequently introduces them into the ovisacs. The gestation period for spiders is around 11 days. During a certain period ranging from a few days to a week, the offspring cohabit inside the maternal web, during which instances of sibling cannibalism have been seen. Finally, the little spiders traverse the air currents.

2.3.2 Initial population

In the context of an optimization problem involving \(N\) dimensions, a window is conceptualized as an array of dimensions (\(1\times {N}_{var}\)) that represents the solution to the optimization problem. The array is represented by the following notation:

$$Widow= [{x}_{1}, {x}_{2} \dots {x}_{Nvar}]$$
(9)

Each of the parameter amounts (\({x}_{1}, {x}_{2} \dots {x}_{Nvar}\)) is represented as a floating-point number. The fitness an amount of a widow is determined by evaluating the fitness function, denoted as \(f\), at a widow consisting of parameters (\({x}_{1}, {x}_{2} \dots {x}_{Nvar}\)).

$$Fitness= f \left(widow\right)= f ({x}_{1}, {x}_{2} \dots {x}_{Nvar})$$
(10)

2.3.3 Generation

It is important to acknowledge that each couple is mutually separate, beginning the process of pairing to produce offspring simultaneously, akin to natural occurrences. Every couple has unique web compatibility. In nature, each successful couple produces more than 1,000 eggs, but only a few healthy spider lings survive. In the suggested process, it is necessary to renew an array called alpha if a widow array holding random numbers is present since this would result in the creation of children. In the equation that follows, the variables \({x}_{1}\) and \({x}_{2}\) are defined as parental entities, whereas the parameters \({y}_{1}\) and \({y}_{2}\) are chosen to represent children.

$$\left\{\begin{array}{c}{y}_{1}=\alpha \times {x}_{1}+(1-\alpha )\times {x}_{2}\\ {y}_{2}=\alpha \times {x}_{2}+(1-\alpha )\times {x}_{1}\end{array}\right.$$
(11)

This process is repeated \({N}_{var}/2\) times, or until it is clear that the random integers being used are unique.

2.3.4 Cannibalism

Within this particular stage, there are two distinct forms of cannibalism. The first phenomenon under discussion is often referred to as sexual cannibalism when the female black widow spider consumes her male partner either during or during the mating process. The suggested method distinguishes between females and men based on their respective fitness scores. Sibling cannibalism is the second form, in which the stronger spider lings use their weaker siblings. Identify if spiderlings are strong or weak using the fitness amount.

2.3.5 Mutation

During this particular stage, the selection of \(Muttpop\) is conducted in a random manner, as seen in Fig. 4.

Fig. 4
figure 4

Mutation

2.3.6 Convergence

Like other methods of evolution, it is possible to consider three-stage scenarios: (\(A\)) a preset amount of rounds, (\(B\)) the requirement that the fitness and amount of the best solution remain unchanged for many iterations, and (\(C\)) the achievement of a certain degree of accuracy.

2.3.7 Parameter setting

One crucial aspect of the suggested algorithm is the identification of several factors to get improved outcomes. The factors in the study are the cannibalism rate (\(CR\)), procreation rate (\(PP\)), and mutation rate (\(PM\)). The procedure of \(BWO\) algorithm is provided step by step in Algorithm 2.

Algorithm 2
figure b

BWOA Hayyolalam and Kazem (2020)

2.4 Adaptive Neuro-Fuzzy Inference System (\({\varvec{A}}{\varvec{N}}{\varvec{F}}{\varvec{I}}{\varvec{S}}\))

As a well-known machine learning technique, the \(ANFIS\) algorithm has seen extensive use in the study and solution of challenging nonlinear problems (Jiang et al. 2019; Masoumi et al. 2020). The present approach effectively combines the neural network with the fuzzy conclusion system. The \(ANFIS\) utilizes the least squares and gradient descent algorithms to facilitate the learning process of the method. It is being discovered that \(ANFIS\) is a useful technique for use in solving forecast issues. This is an explanation of a five-tiered \(ANFIS\) (Shahnazar et al. 2017).

In the first layer, known as the fuzzification layer, every node is considered an adaptable input.

$${O}_{i}^{1}=\mu {A}_{i}\left({x}_{1}\right) \mathrm{for }i=\mathrm{1,2},\dots ,n$$
(12)
$${O}_{i}^{1}=\mu {B}_{i-2}\left({x}_{2}\right) \mathrm{for }i=\mathrm{3,4},\dots ,n$$
(13)

The symbols \(\mu {A}_{i}({x}_{1})\) and \(\mu {B}_{i-2}({x}_{2})\) represent Gaussian membership activities, whereas the quantity n represents the number of fuzzy sets for other input variables (Shahnazar et al. 2017). Using Eq. 14, each node in the second layer (the product layer) determines the efficacy of a given rule by means of its ability to trigger an event.

$${O}_{i}^{2}={\omega }_{i}=\mu {A}_{i}\left({x}_{1}\right)\mu {B}_{i}\left({x}_{2}\right) \mathrm{with }i=\mathrm{1,2}$$
(14)

The normalizing procedure takes place inside the third layer, or the normalized layer, using Eq. 15 and the summing of the firing strength ratio of the \(i\) th condition to the firing strength of all the other rules (Shahnazar et al. 2017).

$${O}_{i}^{3}={\overline{w} }_{i}=\frac{{w}_{i}}{{w}_{1}+{w}_{2}} i=\mathrm{1,2}$$
(15)

The defuzzification procedure is executed inside the fourth layer. Within this particular layer, every individual node has the ability to change in accordance with Eq. 16.

$$O_{i}^{4} = \overline{w}_{i} f_{i} = \overline{w}_{i} (k_{1}^{i} x + k_{2}^{i} y + k_{0}^{i} )$$
(16)

The symbol \({\overline{w} }_{i}\) shows the result of the third layer, whereas \(\left\{{k}_{1}^{i},{k}_{2}^{i},{k}_{0}^{i}\right\}\) indicates the variable ranges associated with the \({\overline{w} }_{i}\) node. The result of the fifth layer, namely the output layer, is generated by summing the result of the preceding layer according to Eq. (17).

$$O_{i}^{5} = {\text{overalloutput}} = \sum\limits_{i} {\overline{w}_{i} } f_{i} = = \frac{{\sum\nolimits_{i} {\overline{w}_{i} f_{i} } }}{{\sum\nolimits_{i} {\overline{w}_{i} } }};\;i = 1,2$$
(17)

\(ANFIS\) involves the use of three distinct simulations, namely fuzzy \(C-\) means clustering (\(FCM\)), subtractive clustering (\(SCM\)), and grid partitioning (\(GP\)). Since \(FCM\) has been shown to be the strongest simulation in the literature, it was selected for use in the \(ANFIS\) algorithm’s predictive capabilities. Nikafshan Rad et al. provide a thorough explanation of \(FCM\) (Rad et al. 2015). In relation to \(ANFIS\) concepts, a primary \(ANFIS\) design was generated employing the initial variables. The \(EO\), \(BWOA\), and \(PSO\) methodologies were then used to improve the \(ANFIS\) framework that had been built. The membership function parameters for the suggested network were improved by applying the \(EO\), \(BWOA,\) and \(PSO\) methods in this study. The \(RMSE\) metric was used as a fitness indicator to evaluate the accuracy of the optimization framework. In conclusion, a hybridized and optimized network consisting of \(ANFE\), \(ANFB\), and \(ANFP\) components were established, with the network parameters, including the quantity of fuzzy words and the maximum iteration count, being specified.

Here is a detailed explanation of the coupling process:

  • The initializing of the \(ANFIS\) model begins with random or predefined initial parameters. These parameters include membership function parameters, rule weights, and input/output scaling factors.

  • \(EO\), \(BWOA,\) and \(PSO\) are integrated into the parameter learning process of the \(ANFIS\) model.

  • During optimization, \(EO\), \(BWOA,\) and \(PSO\) interact with the \(ANFIS\) architecture to update the parameters, such as membership function parameters and rule weights, based on the fitness evaluations.

  • The optimization algorithms modify the parameters of the \(ANFIS\) model iteratively, gradually improving its performance on the training data.

  • The optimization process continues until a termination criterion is met, such as convergence of the objective function or reaching a maximum number of iterations.

  • Once optimization is complete, the performance of the coupled \(ANFIS\) model with \(EO\), \(BWOA,\) and \(PSO\) is evaluated using testing data.

2.5 Metrics

Seven efficiency indicators were used to compare and evaluate the performance of the hybrid ANFE, ANFB, and ANFP models. The indices are:

  • Coefficient of determination (\({R}^{2}\))

  • Root mean square error (\(RMSE\))

  • Normalized root mean square error (\(NRMSE\)

  • Root relative squared error (\(RRSE\))

  • Relative absolute error (\(RAE\))

  • Mean absolute error (\(MAE\))

Lesser values of RMSE, NRMSE, RRSE, MAE, and RAE show more accuracy. Furthermore, a higher \({R}^{2}\) number is a sign of better performance.

  • In addition, the \(OBJ\) metric was taken into account as a complete indication that RMSE, MAE, and R2 values were concurrently calculated for the training and testing phases.

    $${{R}^{2}=\left(\frac{{\sum }_{d=1}^{D}\left({m}_{d}-\overline{m }\right)\left({z}_{d}-\overline{z }\right)}{\sqrt{\left[{\sum }_{d=1}^{D}{\left({m}_{P}-m\right)}^{2}\right]\left[{\sum }_{d=1}^{D}{\left({z}_{d}-\overline{z }\right)}^{2}\right]}}\right)}^{2}$$
    (18)
    $$RMSE=\sqrt{\frac{1}{D}{\sum }_{d=1}^{D}{\left({z}_{d}-{m}_{d}\right)}^{2}}$$
    (19)
    $$NRMSE=RMSE/\overline{z }$$
    (20)
    $$RRSE=\sqrt{\frac{\sum_{d=1}^{D}{({m}_{d}-{z}_{d})}^{2}}{\sum_{d=1}^{D}{({m}_{d}-\overline{m })}^{2}}}$$
    (21)
    $$RAE=\frac{\sum_{d=1}^{D}\left|{m}_{d}-{z}_{d}\right|}{\sum_{d=1}^{D}\left|{m}_{d}-\overline{m }\right|}$$
    (22)
    $$MAE=\frac{1}{D}\sum_{d=1}^{D}\left|{z}_{d}-{m}_{d}\right|$$
    (23)
    $$OBJ = \left[ {\frac{d}{D} \times \frac{{RMSE + MAE}}{{R^{2} + 1}}} \right]_{{train}} + \left[ {\frac{d}{D} \times \frac{{RMSE + MAE}}{{R^{2} + 1}}} \right]_{{test}}$$
    (24)

The variables \({m}_{d}\), \(\overline{m }\), \({z}_{d}\), and \(\overline{z }\) represent the observed values, the mean of the observed values, the simulated values, and the mean of the simulated values, respectively, in the given equations. Moreover, the variable \(D\) denotes the total number of datasets. Uncertainty analysis at 95% confidence level (\({U}_{95}\)) provided further information on scenario efficacy [63,64]. This inquiry compares the real and simulated outcomes, as shown by Eq. 25.

$${U}_{95}=1.96\sqrt{({SD}^{2}+{RMSE}^{2})}$$
(25)

3 Findings and justifications

This research paper shows the findings of the \(ANFE\), \(ANFB\), and \(ANFP\) investigations. These analyses were performed to ascertain the \(SP\) of deep foundations when they are buried in rock. Figure 5 displays the measured and estimated quantities of the \(SP\) throughout the training and evaluation stages of the established \(ANFE\) and \(ANFB\) methodologies. Moreover, the chart presented exhibits an error percentage distribution along with a time series. Furthermore, the present study compared the results of the generated models with the most relevant literature sources to assess the reliability and effectiveness of these models considering order three \(MARS\) (\(MO3\)) (Zuo 2022), ant lion optimizer with multi-layered perceptron (\(ALO-MLP\)) (Hu 2022), ant lion optimizer with \(ANFIS\) (\(ALO-ANFIS\)) (Yu 2022), ant lion optimizer with \(RBF\) (\(ALO-RBFNN\)) (\(neuro-swarm system\)Armaghani et al. 2020; Gao and Jun-Wei 2022), and (\(GEP\)Armaghani et al. 2018).

Fig. 5
figure 5

The performance of systems (correlation, time series, and error histogram)

The findings suggest that all \(ANFE\), \(ANFB,\) and \(ANFP\) have significant promise in properly forecasting the \(SP\). The \(ANFE\) system demonstrated high levels of accuracy, with \({R}^{2}\) values of 0.9977 and 0.996 attained during the phases of training and testing. Similarly, the \(ANFB\) system exhibited strong performance, with \({R}^{2}\) values of 0.9946 and 0.993 in its training and evaluating stages, respectively. Moreover, the third rank belonged to \(ANFP\), getting the \({R}^{2}\) values of 0.9805 and 0.989 attained during the phases of training and testing. To achieve this purpose, it is crucial to do a thorough assessment of the behavior shown by different error-based metrics. By analyzing the divergent values of \(ANFE\) and \(ANFB\) in relation to these parameters, it becomes apparent that \(ANFE\) demonstrates an approximate 50% reduction when compared to \(ANFE\). The aforementioned decrease highlights the dependability of \(ANFE\) in predicting the stock price. Uncertainty analysis values by calculating \({U}_{95}\) as a precious index (the lower values, the more accuracy) depict a better performance of the \(ANFE\) compared to \(ANFB\) by gaining 0.4916 lower than 0.7485 in the train part and at 0.7099 smaller than 0.94 in the test. In addition, a comprehensive index was computed, referred to as the \(OBJ\), which encompasses the values of \(RMSE\), \(MAE\), and \({R}^{2}\) for both the training and test phases. In this context, lower values of these metrics indicate greater precision of the systems. The results show that the \(ANFE\) model received the lowest \(OBJ\) at 0.1649 compared to \(ANFB\) at 0.2444, and compared to \(ANFP\) at 0.2968, approving the applicability and reliability of the models.

A comprehensive comparison is performed to validate the reliability of the models with the literature \(MO3\) (\(ALO-MLPALO-ANFISALO-RBFNNneuro-swarm system\)Armaghani et al. 2020; Zuo 2022; Hu 2022; Yu 2022; Gao and Jun-Wei 2022), and (\(GEP\)Armaghani et al. 2018). After conducting a thorough analysis of Table 2, it becomes apparent that the \(ANFE\) proposed in this study exhibited a superior level of efficacy in comparison to previous investigations reported in the existing scholarly literature. This assessment considered comparable evaluation measures, including \({R}^{2}\), \(MAE\), and \(RMSE\). The findings of this research exhibit more robustness and precision compared to existing literature, as shown by the attainment of higher \({R}^{2}\) values and lower \(RMSE\) and \(MAE\) values. Regarding the \({R}^{2}\), a significant improvement can be observed when attaining values beyond 0.99, starting with beginning values below 0.844 (Armaghani et al. 2020). The ALO–RBFNN [53] has the lowest \(RMSE\) when compared to other developed models (Gao and Jun-Wei 2022). However, it is worth noting that the RMSE values for the ALO–RBFNN model are very high when compared to the \(ANFE\) model, with values of 0.1767 \(mm\) and 0.2535 \(mm\) seen in the train and test stages, respectively.

Table 2 The \(ANFIS\)’s results and comparison with the literature

3.1 Sensitivity analysis

How inputs affect the simulation’s performance may be understood via the use of sensitivity analyses in mathematical models. To find the impact of removing each parameter, several models were created in the present investigation, each employing unique input elements. The generated models were compared with the results of the outperformed models, i.e., \(ANFE\). \(ANFE\) was used to calculate and compare the \({R}^{2}\), \(RRSE\), \(MAE\), and \({U}_{95}\) metrics evaluate the impact of various inputs (Table 3). The impact of variables that have been removed from the conclusion becomes increasingly obvious as measurement discrepancies widen. Based on the findings, it can be concluded that almost every input has a marginally detrimental effect on output as compared to \(ANFE\). It is important to note that deleting the \(UCS\) variable from the input category leads \({R}^{2}\) to decrease and \(RRSE\), \(MAE\), and \({U}_{95}\) to increase. In the training phase, the value of the \({R}^{2}\) showed a reduction from 0.9977 to 0.971. In this phase, \(RRSE\), \(MAE\), and \({U}_{95}\) depict the remarkable increment from 0.0491 to 0.173, 0.1269 to 0.4231, and from 0.4916 to 1.7326, respectively. This trend is valid for the testing phase. Removing \({L}_{s}/{L}_{r}\) has the lowest impact on the output by resulting in lower differences of the metrics with respect to the model with no removal. It is important to point out that the models were made using a set of data that came from tests. So, leaving out any number could make the programs less complete and reliable.

Table 3 Sensitivity analysis

3.2 Limitations, suggestions, and practical usages

The accuracy of the predictive model heavily relies on the quality and quantity of data available. Future research could explore ways to obtain more comprehensive and high-quality data. The generalizability of the proposed approach to other geological conditions and foundation types may be limited. Future research could investigate the adaptability of this model to different scenarios. The choice of optimization algorithms and modeling techniques (\(ANFIS\), \(EO\), \(BWOA\), and \(PSO\)) can significantly affect the results. It is possible that the performance of these algorithms may vary for different datasets or applications. Future work could explore a wider range of optimization techniques to assess their robustness.

By providing more reliable predictive models, the study’s findings can directly contribute to enhancing the design process. Engineers can use these models to optimize pile configurations, sizes, and placement strategies, leading to more cost-effective and structurally sound foundations.

Unforeseen settlement can lead to delays, structural damage, and costly remediation efforts. By leveraging accurate predictive models, project stakeholders can anticipate potential settlement issues early in the design phase and implement preventive measures to minimize risks.

Understanding the anticipated settlement behavior of piles in rock formations allows construction planners to optimize project schedules and sequencing. By incorporating predictive models into construction planning software, project managers can better allocate resources, schedule activities, and mitigate potential conflicts between construction activities and ground settlement constraints.

4 Conclusion

To operationalize hybrid \(ANFIS\) designs, the study employed the results of \(PDA\) tests, together with the properties of both piles and soil. In relation to this matter, five elements, namely \({L}_{s}/{L}_{r}\), \({L}_{p}/D\), \(UCS\), \(N\), and \({Q}_{u}\), were selected as independent variables for the purpose of forecasting the pile settlement (\(SP\)) in rock formations. To improve the precision of the models, the primary essential parameters of the \(ANFIS\) were calculated by integrating the \(EO\), the \(BWOA\), and \(PSO\).

  • The findings suggest that all \(ANFE\), \(ANFB,\) and \(ANFP\) have significant promise in properly forecasting the \(SP\). The \(ANFE\) system demonstrated high levels of accuracy, with \({R}^{2}\) values of 0.9977 and 0.996 attained during the phases of training and testing. Similarly, the \(ANFB\) system exhibited strong performance, with \({R}^{2}\) values of 0.9946 and 0.993 in its training and evaluating stages, respectively. Moreover, the third rank belonged to \(ANFP\) getting the \({R}^{2}\) values of 0.9805 and 0.989 during the phases of training and testing, respectively.

  • By analyzing the divergent values of \(ANFE\) and \(ANFB\) in relation to error-based metrics, it becomes apparent that \(ANFE\) demonstrates an approximate 50% reduction when compared to \(ANFE\).

  • Uncertainty analysis depicts a better performance of the \(ANFE\) compared to \(ANFB\) by gaining 0.4916 lower than 0.7485 in the train part and at 0.7099 smaller than 0.94 in the test part.

  • The results show that the \(ANFE\) model received the lowest \(OBJ\) at 0.1649 compared to \(ANFB\) at 0.2444, and compared to \(ANFP\) at 0.2968, approving the applicability and reliability of the models.

  • The findings of this research exhibit more robustness and precision compared to existing literature, as shown by the attainment of higher \({R}^{2}\) values and lower \(RMSE\) and \(MAE\) values.

  • It Is important to note that deleting the \(UCS\) variable from the input category leads \({R}^{2}\) to decrease and \(RRSE\), \(MAE\), and \({U}_{95}\) to increase. In the training phase, the value of the \({R}^{2}\) showed a reduction from 0.9977 to 0.971. In this phase, \(RRSE\), \(MAE\), and \({U}_{95}\) depict the remarkable increment from 0.0491 to 0.173, 0.1269 to 0.4231, and from 0.4916 to 1.7326, respectively. This trend is valid for the testing phase.

  • The generalizability of the proposed approach to other geological conditions and foundation types may be limited. Future research could investigate the adaptability of this model to different scenarios. The choice of optimization algorithms and modeling techniques (\(ANFIS\), \(EO\), \(BWOA\), and \(PSO\)) can significantly affect the results. It is possible that the performance of these algorithms may vary for different datasets or applications. Future work could explore a wider range of optimization techniques to assess their robustness.

  • By providing more reliable predictive models, the study’s findings can directly contribute to enhancing the design process. Engineers can use these models to optimize pile configurations, sizes, and placement strategies, leading to more cost-effective and structurally sound foundations. Understanding the anticipated settlement behavior of piles in rock formations allows construction planners to optimize project schedules and sequencing. By incorporating predictive models into construction planning software, project managers can better allocate resources, schedule activities, and mitigate potential conflicts between construction activities and ground settlement constraints.