1 Introduction

Fiber-reinforced soil (FRS) has been applied in civil engineering ecosystems for many years. The FRS is a promising alternative material for localized repair of slopes and reinforcement of thin soil veneers where planar reinforcement is difficult to implement due to geometric constraints [1]. However, estimating the FRS behaviors has confirmed a difficult task because shear strength of FRS is influenced complexly by many factors including fiber characteristics, soil properties, and stress conditions. Accurately estimating the shear strength of FRS is vital for civil engineers in designing geotechnical structures and management.

Some researchers have proposed methods to estimate the peak shear strength of FRS. Particularly, a statistical technique [2], a discrete formulation [3], and energy dissipation formulations [4] have been developed for predicting FRS shear strength. Although these conventional methods are convenient, their accuracy is limited because simplified models do not adequately consider complex mechanisms of soil–fiber interaction [5]. Relationship among shear strength of FRS and fiber characteristics, soil properties, and stress conditions is inherently nonlinear. This inherent complexity limits the ability of designers to assess shear strength parameters and has made it difficult to establish a mathematical model for accurately predicting the FRS shear strength.

Because of the importance of improving the prediction performance, machine learning (ML)-based methods have been developed to enhance prediction accuracy and efficiency [6, 7]. ML-based approaches have confirmed the potential solutions for solving real-world engineering problems [811]. Literature shows that no study has used ML-based method to estimate shear strength of FRS in geotechnical systems. Therefore, the ML-based approaches are potential techniques for shear strength prediction of FRS.

Among the ML techniques, the least squares support vector regression (LSSVR) method proposed by Suykens et al. [12] is one of the prevailing AI methods. The LSSVR greatly reduces computation complexity and increases efficiency compared to standard SVR. It has been widely used in civil engineering field [1315]. Most studies agree that the predictive accuracy of a SVR-based model is superior to that of other ML-based models [1618]. The performance of the LSSVR model depends highly on its hyperparameters, namely the regularization parameter (C) and radial basis function kernel parameter (σ). For improving the prediction ability, appropriately selecting the LSSVR hyperparameters is a challenging optimization problem.

Swarm intelligence (SI) has attracted great interest and attention [19, 20]. Examples of SI-based algorithms include artificial bee colony, particle swarm optimization, cuckoo search, and firefly algorithm (FA). The FA is a particularly efficient SI-based algorithm because it has two major advantages over the others: automatic subdivision and the ability to handle multimodality [21]. SI-based algorithms have been successful in solving a wide range of problems in various domains [22]. As a part of this work, an enhanced nature-inspired metaheuristic algorithm was used to dynamically optimize the LSSVR hyperparameters.

In this regard, this study proposes a new computational method, namely a swarm intelligence optimized regression (SIOR) system to estimate the peak shear strength of randomly distributed fiber-reinforced soil. The system integrates an LSSVR and an enhanced SI optimization algorithm, namely smart firefly algorithm (SFA), in which the SFA is used to fine-tune the LSSVR hyperparameters.

As the original scientific contribution, the proposed SIOR system provides a new method of estimating the FRS shear strength in the geotechnical system which is efficient and reliable. In addition, the new SFA algorithm is integrated into the SIOR system, which can efficiently optimize the LSSVR hyperparameters; consequently, the prediction accuracy of the SIOR system is significantly improved. In particular, a user-friendly interface of the SIOR system provides convenience to the users. Finally, a comparison analysis of the prediction performance between the proposed SIOR system, the baseline LSSVR, and empirical methods has been done via cross-fold validation and hypothesis test.

The proposed SIOR system was developed in MATLAB®. To demonstrate its efficacy, FRS dataset collected from 20 studies published during 1983–2015 was used to validate the system. Prediction performance was evaluated in terms of linear correlation coefficient (R), root-mean-square error (RMSE), mean absolute error (MAE), and mean absolute percentage error (MAPE).

The remainder of this paper is organized as follows. Section 2 reviews the relevant literature and empirical methods for calculating shear strength of FRS. The proposed system and evaluation theory are presented in Sect. 3. Section 4 discusses the data collection process, the system evaluation, and analytical results. The last section presents the concluding remarks.

2 Literature review

The FRS was used to strengthen the shear-resistant capacity of the geotechnical systems [23], reduce potential slope failure [24], improve soil rigidity and brittleness [25], repair slope veneer, and patch uneven surfaces, and serve as a more durable alternative to geogrid and geotextile [5]. Many prior researches confirmed the effectiveness of fiber reinforcement [26, 27]. The FRS has shown the effectiveness for repairing failed slopes where the irregular shape of soil patches limits the use of textile reinforcements and stabilizing subgrade in road construction [28].

Accurately predicting the peak friction angle of randomly distributed FRS is essential for efficient engineering design. In the past three decades, studies of FRS behavior [1, 3, 4, 26, 2931] have included Zornberg, who introduced a discrete model for designing FRS slopes. Fibers are treated as discrete elements that contribute to stability by mobilizing tensile stresses along shear planes. Zornberg [3] applied the bilinear shear strength envelope, in which the fiber-induced distributed tension acting along a failure surface is added to the soil shear strength.

Michalowski and Čermák [4] further proposed an energy dissipation model for estimating the ultimate shear strength of randomly distributed FRS in the “fiber slip” confinement ranges. Similarly, Sadek et al. [32] proposed a mathematical model for predicting shear strength of fiber-reinforced sands. Sivakumar Babu et al. [27] examined soil–fiber interaction in FRS by using numerical analysis to capture the overall stress–strain response. Recent studies indicate that peak friction angle of FRS is influenced by several factors, including soil properties, fiber characteristics, and stress conditions.

Instead of numerical modeling, Maher and Gray [30] performed triaxial compression tests to study the effects of soil properties, fiber characteristics, and other variables on sand reinforced with randomly distributed fibers. Ranjan et al. [2] performed triaxial compression tests on soil reinforced with randomly distributed fibers to identify the relationship between the shear strength of reinforced soil and fiber parameters, soil properties, and confining stress. In particular, Ahmad et al. [26] performed a series of triaxial tests to study the effect of oil palm empty fruit bunch (OPEFB) fiber on the shear strength of soil under drained and undrained triaxial test conditions. According to their experimental results, addition of OPEFB fiber significantly increases the peak shear strength of silty sand soil.

Li and Zornberg [1] performed triaxial compression tests and fiber pullout tests to evaluate how fiber tension is mobilized at varying shear strain levels. The experimental results confirmed that (1) full mobilization of fiber-induced tension requires comparatively high strain levels; (2) the initial density of the soil does not significantly affect the shear strength of FRS prepared with comparatively high fiber contents; (3) for FRS prepared with comparatively low fiber content and for conditions where the unreinforced soil stress–strain response shows a post-peak shear strength loss, the discrete framework should be applied when using the peak shear strength of the unreinforced soil to predict the equivalent shear strength of FRS.

Commonly used synthetic fibers include polypropylene (PP), polyethylene terephthalate, polyvinyl alcohol, glass, nylon, and steel. The advantages of these fibers include their high strength, high durability, easily controllable quality and yield, chemical resistance, and climate resistance. However, their disadvantage is their high environmental impact. Since synthetic fibers are resistant to bio-decomposition and chemical corrosion, they prevent soil shrinkage and are resistant to weather and chemical changes. The most common of the above fibers is PP fiber [24]. Al-Refeai [33] indicated that PP fiber was superior to glass fiber and that the optimal length for fiber reinforcement was approximately 76 mm.

Several researchers have proposed empirical formulas for calculating the engineering strength of FRS. For example, Michalowski and Zhao [34] presented a mathematical formula as shown in Eq. (1) to estimate the shear-resistant strength of FRS.

$$\phi_{f} = 2\tan^{ - 1} \sqrt {\frac{{X_{f} \eta_{\text{f}} M\tan \phi_{w} + 6K_{p} }}{{6 - X_{f} \eta_{\text{f}} M\tan \phi_{w} }}} - \frac{\pi }{2}$$
(1)

where \(M = K_{p} \sin \theta_{0}\), \(K_{p} = \tan^{2} (45 + \theta /2)\), \(\theta_{0} = \tan^{ - 1} \sqrt {K_{p} /2}\), η f is fiber aspect ratio, ϕ is friction angle of soil, ϕ w is interfacing friction angle between the sand and fiber, and X f is fiber content (volume percentage).

Alternatively, Zornberg [3] developed a discrete model for predicting shear strength of FRS. Equation (2) is the mathematical formula of the discrete model.

$$\phi_{f} = \tan^{ - 1} ((1 + \alpha \cdot \eta_{\text{f}} \cdot X_{f} \cdot c_{i,\phi } ) \cdot \tan \phi )$$
(2)

where α is empirical coefficient, α ≤ 1, ϕ is friction angle of soil, η f is fiber aspect ratio, X f is fiber content (volume percentage), c i,ϕ is interface friction coefficient of soil (\({{\tan \phi_{w} } \mathord{\left/ {\vphantom {{\tan \phi_{w} } {\tan \phi }}} \right. \kern-0pt} {\tan \phi }}\)), ϕ w is interfacing friction angle between sand and fiber.

Although these empirical methods are convenient for designers, the accuracy of the estimated values is low because these empirical methods explore a limited number of factors and because the relationship between FRS strength parameter and predictive factors is highly nonlinear. Therefore, this study developed the novel computational method for improving accuracy in predicting the shear strength property of FRS. Particularly, this study enhances the optimization algorithm and LSSVR to provide high accuracy in facilitating FRS shear strength design.

3 Methodology

3.1 Least squares support vector regression

The LSSVR proposed by Suykens et al. [12] is a well-developed ML technique with many advanced features that enable good generalization capacity and fast computation. The LSSVR training process uses a least squares cost function to obtain a linear set of equations in the dual space so as to minimize computational cost. Accordingly, iterative methods, such as conjugate gradient efficiently solving a set of linear equations, are used to derive a solution. To reduce the computational burden of the LSSVR for function estimation, the regression model in this study used a quadratic loss function [35]. In a function estimation of the LSSVR, given a training dataset \(\left\{ {x_{k} ,y_{k} } \right\}_{k = 1}^{N}\), the classic linear LSSVR model related inputs X’s with the output Y by minimizing the objective function which is formulated as Eq. (3).

$$\mathop {\hbox{min} }\limits_{\omega ,b,e} J(\omega ,e) = \frac{1}{2}\left\| \omega \right\|^{2} + \frac{1}{2}C\sum\limits_{k = 1}^{N} {e_{k}^{2} } ;\quad {\text{subject}}\;{\text{to}}\;y_{k} = \left\langle {\omega ,\varphi (x_{k} )} \right\rangle + b + e_{k} ,\quad k = 1, \ldots ,N$$
(3)

where J(ω, e) is the optimization function; ω is the parameter of the linear approximator; \(e_{k} \in R\) is error variables; \(C \ge 0\) is a regularization constant that specifies the constant representing the trade-off between the empirical error and the flatness of the function; x k is input patterns; y k is prediction labels; and N is the sample size.

Since this is a typical optimization problem of a differentiable function with constraints, it can be solved by using Lagrange multipliers (α k ). The resulting LSSVR model for function estimation can be expressed as Eq. (4).

$$f(x) = \sum\limits_{k = 1}^{N} {\alpha_{k} K(x,x_{k} )} + b$$
(4)

where \(\alpha_{k} ,b\) are Lagrange multipliers and the bias term, respectively, and K(x, x k ) is kernel function. In the feature space, the kernel function can be described as Eq. (5):

$$K(x,x_{k} ) = \sum\limits_{k = 1}^{m} {g_{k} (x)g_{k} (x_{k} )}$$
(5)

In this study, the radial basis function (RBF) kernel was used not only because it has lower mathematical complexity compared to the polynomial kernel, but also because it effectively solves highly nonlinear problems. Equation (6) is the mathematical expression of the RBF kernel.

$$K(x,x_{k} ) = \exp ( - \left\| {x - x_{k} } \right\|^{2} /2\sigma^{2} )$$
(6)

where σ is the RBF width.

However, the prediction accuracy of the LSSVR is highly dependent on the determination of its hyperparameters. Therefore, as a part of this study, the enhanced SI algorithm was developed to optimize LSSVR hyperparameters, i.e., the regularization parameter \((C)\) and the sigma of the RBF kernel \((\sigma )\).

3.2 Enhanced swarm intelligence optimization algorithm

An enhanced SI optimization algorithm, i.e., SFA, is developed by combining the conventional FA and metaheuristic components, which are used to fine-tuning the conventional FA parameters.

3.2.1 Swarm intelligence-based algorithm

The computational intelligence has received great interest and attention in the literature. In the communities of optimization, computational intelligence, and computer science, bio-inspired algorithms, especially those SI-based algorithms are now become very popular [19, 36, 37]. The FA developed by Yang [38] is a stochastic, SI algorithm that efficiently finds the global optima of an optimization problem. The FA applies three idealized rules: (1) Each firefly is attracted to other fireflies regardless of gender because all fireflies are unisex; (2) the attractiveness of a firefly is proportional to its brightness and decreases as distance increases; additionally, a firefly moves randomly if no other firefly is brighter; and (3) the brightness of a firefly is affected or determined by the search space of the objective function.

For a maximization problem, the brightness value is simply set to be proportional to the value of the objective function. As the attractiveness of a firefly is proportional to the intensity of light that is visible to adjacent fireflies, the attractiveness β of a firefly satisfies Eq. (7).

$$\beta = \beta_{0} e^{{ - \gamma r^{2} }}$$
(7)

where β is the attractiveness of the firefly; β 0 is the attractiveness of the firefly at r = 0; r is the distance between the firefly of interest and any other, e is a constant coefficient, and γ is the absorption coefficient.

The distance between any two fireflies i and j at x i and x j , respectively, is the Cartesian distance, given by Eq. (8).

$$r_{ij} = \left\| {x_{i} - x_{j} } \right\| = \sqrt {\sum\nolimits_{k = 1}^{d} {(x_{i,k} - x_{j,k} )^{2} } } \,$$
(8)

where r ij is the distance between any two fireflies i and j at x i and x j , respectively; x i,k is the kth component of spatial coordinate x i of the ith firefly; x j,k is the kth component of spatial coordinate x j of the jth firefly, and d is the number of dimensions of the search space.

Equation (9) specifies the movement of the ith firefly when attracted to the more attractive (brighter) jth firefly.

$$x_{i}^{t + 1} = x_{i}^{t} + \beta_{0} e^{{ - \gamma r_{ij}^{2} }} \left( {x_{j}^{t} - x_{i}^{t} } \right) + \alpha^{t} ({\text{rand}} - 0.5)$$
(9)

where \(x_{i}^{t + 1}\) is the coordinate of the ith firefly in the (t + 1)th iteration; \(x_{i}^{t}\) is the coordinate of the ith firefly in the tth iteration; \(x_{j}^{t}\) is the coordinate of the jth firefly in the tth iteration; γ is the absorption coefficient and was set to explore global optima. The best result obtained in the sensitivity analysis of γ is γ = 1; β 0 = β min is the attractiveness at r ij  = 0; α t is a randomization parameter; rand is a random-number generator uniformly distributed within [0, 1].

The FA procedure can be summarized as follows. First, the initial firefly positions are randomly generated. Second, a given fitness function is used to evaluate each firefly. Third, fireflies move by using Eq. (9) in which fireflies with worse fitness values move toward fireflies with better fitness values. For the firefly with the best fitness value, the second part of Eq. (9) becomes zero. Therefore, its movement is randomly proportional to the coefficient α t. The above procedure continues until a given number of iterations are reached.

Although FA is highly efficient in many applications, it often becomes trapped in a local optimum [39]. Moreover, setting tuning parameters that improve the convergence of the FA is challenging. The FA control parameters should be optimized for effectively balancing exploitation and exploration. Specifically, the balance between the exploration and exploitation capability of the conventional FA can be improved by enhancing the diversity of initial population of the FA, tuning attractiveness, adjusting randomization, and controlling movement of the FA.

3.2.2 Metaheuristic components in fine-tuning of swarm intelligence algorithm

The conventional FA uses three standard elements to perform an automatic search that proceeds toward global optimality: initial population, the attractiveness parameter, and the movement. These three terms are adjusted by the following metaheuristic components: chaotic maps, adaptive inertia weight (AIW), and Lévy flight. First, the logistic chaotic map and the Gauss/mouse chaotic map provide a highly diverse initial population and tune an attractiveness parameter in the FA implementation, respectively. Second, AIW is dynamically adjusted in the optimization process to control the local exploitation and the global exploration capabilities of the FA. Third, Lévy flight speeds up the local search by generating new optimal neighborhoods around the obtained best solution.

3.2.2.1 Logistic chaotic map for enhancing initial solution

The FA uses a typical random approach to generate an initial solution. The two major disadvantages of this approach are its slow convergence and its tendency to become trapped in local optima because of reduced population diversity. For improving initial solution diversity and the quality of the initial population, logistic chaotic map is used to generate a highly diverse of fireflies in the initial stage. Equation (10) is the logistic map formula.

$$X_{n + 1} = \eta X_{n} (1 - X_{n} )$$
(10)

where n denotes the number label of the individual firefly and X n is the chaotic value for the nth firefly.

Since Eq. (10) is clearly deterministic, it exhibits chaotic dynamics when η = 4.0 and X 0 ∉ {0.0, 0.25, 0.5, 0.75, 1.0}. It exhibits strong dependence on initial conditions and describes travel with ergodicity, irregularity and pseudorandomness. A minute difference in the initial values of populations can cause a considerable difference in its long-term behavior [40]. Thus, in all experiments in this work, initial firefly positions are generated using the logistic map equation, and parameter η was set to 4.0.

3.2.2.2 Gauss/mouse chaotic map for tuning attractiveness

The Gauss/mouse map is the best way to tune the attractiveness parameter (β) of the FA [41]. Equation (11) describes the Gauss/mouse map used in this study instead of random parameters used in the conventional FA.

$${\text{The}}\;{\text{Gauss}}/{\text{mouse}}\;{\text{map:}}\;\beta_{chaos}^{t + 1} = \left\{ {\begin{array}{*{20}l} 0 \hfill & {\quad \beta_{\text{chaos}}^{t} = 0} \hfill \\ {1/\beta_{chaos}^{t} \bmod (1)} \hfill & {\quad {\text{otherwise}}} \hfill \\ \end{array} } \right.$$
(11)

Eq. (7) is then updated to

$$\beta = (\beta_{chaos}^{t} - \beta_{0} )e^{{ - \gamma r_{ij}^{2} }} + \beta_{0}$$
(12)
3.2.2.3 Adaptive inertia weight for adjusting randomization

The swarm-based algorithm can be improved by reducing the randomness of its parameters as the iteration proceeds. In the early stages of the search process, a high inertia weight can boost global exploration (in searches of new areas). However, in the late stages, the adaptive inertia weight, which is reduced in each stage, can improve the local exploitation of the optimal solution (finely tuning the current search area). Inertia weight accelerates convergence to the optimal solution and reduces the execution time of the simulation. The adaptive inertial weight controls the local exploitation and global exploration capabilities of the swarm algorithm.

The above issue is addressed by using a monotonically decreasing function of the inertia weight to change the randomization parameter α in the conventional FA. Since the adaptive inertia weight was used to adjust the randomization parameter α, the distances between fireflies are reduced to maintain the α within a reasonable range [Eq. (13)].

$$\alpha^{t} = \alpha_{0} \theta^{t}$$
(13)

where α 0 is the initial randomization parameter; α t is the randomization parameter at tth iteration; \(\theta\) is the randomness reduction constant (0 < \(\theta\) < 1); and t is the number of iterations. The selected value of \(\theta\) is 0.9 in this implementation based on several trials and experiments, and t ∈ [0, t max], where t max is the maximum number of iterations.

3.2.2.4 Lévy flights for controlling movement

Random walk theory plays a critical role in modern swarm intelligence and evolutionary optimization algorithms. Lévy flights are a random walk in which the step length is a Lévy distribution. The step lengths have no characteristic scale, i.e., the second moment or even the first moment may diverge, and the distribution exhibits self-affine properties. The steps form a random walk process with a power-law step-length distribution with a heavy tail.

Lévy flights are defined such that each jump, regardless of size, takes one unit of time [42]. Lévy flights are used to generate random numbers in two steps: random selection of a direction and generation of steps that obey the selected Lévy distribution. In this work, directions were generated with uniform distributions. The Mantegna algorithm, which is the most efficient, is used to generate steps from a symmetric Lévy stable distribution. The step length s in the Mantegna algorithm is calculated using Eq. (14).

$${\text{L}}{\text{\'e}}{\text{vy}}\,\sim\,s = \frac{u}{{\left| v \right|^{1/\tau } }}$$
(14)

where u and v are drawn from normal distributions, as follows:

$$u\,\sim\,N\left( {0,\sigma_{u}^{2} } \right),\quad v\,\sim\,N\left( {0,\sigma_{v}^{2} } \right)$$
(15)
$$\sigma_{u} = \left\{ {\frac{\varGamma (1 + \tau )\sin (\pi \tau /2)}{{\varGamma [(1 + \tau )/2]\tau 2^{(\tau - 1)/2} }}} \right\}^{1/\tau } ,\quad \sigma_{v} = 1$$
(16)

where \(\varGamma (z)\) is the Gamma function

$$\varGamma (z) = \int\limits_{0}^{\infty } {t^{z - 1} e^{ - t} {\text{d}}t}$$
(17)

A Lévy walk speeds up the local search by generating new solutions around the best solution obtained so far. This behavior is applicable for optimizing the search in optimization problems, and preliminary results verify the promise of so doing [43]. Therefore, Eq. (9) is revised to Eq. (18).

$$x_{i}^{t + 1} = x_{i}^{t} + \beta (x_{j}^{t} - x_{i}^{t} ) + \alpha^{t} {\text{sign[rand}} - 0.5] \otimes {\text{L}}{\text{\'e}}{\text{vy}}$$
(18)

where the second term is the attraction; the attractiveness parameter (β) is updated by Eq. (12); the third term denotes the randomization that is associated with the Lévy flights [Eqs. (14)–(17)]; and α t is the randomization parameter [Eq. (13)]. The product operator \(\otimes\) indicates entry-wise multiplication. The term sign[rand − 0.5] with rand ∈ [0, 1] denotes a random sign or direction when the random step length is drawn from a Lévy distribution. Figure 1 presents the pseudocode of the enhanced swarm intelligence optimization algorithm.

Fig. 1
figure 1

Pseudocode for the enhanced swarm intelligence optimization algorithm

3.3 Swarm intelligence optimized regression system

This study developed the SIOR system that integrates LSSVR with the enhanced swarm intelligence optimization algorithm (i.e., SFA). The SIOR system is used to predict the peak shear strength (peak friction angle) of composite FRS, in which the SFA is used to fine-tune the LSSVR hyperparameters (i.e., C and \(\sigma\)). For ensuring the generalization capacity of the proposed regression system, 10-fold cross-validation algorithm as recommended in the literature [44] was used in this study.

Specifically, the dataset was randomly divided into mutually exclusive 10 folds. Of these, 9 were used for model learning, and the tenth fold was used for model testing. This procedure was repeated 10 times with a rotary fold as test data. Each time, P% (0 < P < 100) of learning data (training data) was used to train the SIOR system, (100 − P)% of learning data (validation data) was used to validate the optimal LSSVR hyperparameters. Notably, the P value can be customized by trial experiments for a particular dataset and is usually set as 60–90 for most cases. Test data were then used to evaluate the learning performance of the LSSVR model after optimization. Figure 2 is a flowchart of the process used to build and evaluate the proposed SIOR system.

Fig. 2
figure 2

Flowchart of system building and evaluation process

The objective function of the SFA is derived from the RMSE value of validation data. Equation (19) describes the fitness function of the SFA.

$${\text{The}}\;{\text{objective}}\;{\text{function:}}\;f(C,\sigma ) = {\text{RMSE}}_{Validation-data}^{\text{Optimization}} = {\sqrt {\frac{1}{n_{v}}\sum\limits_{i = 1}^{n_{v}}(y^{\prime } - y)^{2} } }$$
(19)
$$\begin{aligned} & {\text{Search}}\;{\text{space:}}\;C_{\hbox{min} } \le C \le C_{\hbox{max} } \;{\text{and}}\;\sigma_{\hbox{min} } \le \sigma \le \sigma_{\hbox{max} } \\ & {\text{Maximum}}\;{\text{iteration:}}\;t_{\hbox{max} } \\ \end{aligned}$$

where f(C, σ) is the fitness functions of the SFA optimization algorithm; \({\text {RMSE}}_{Validation-data}^{\text{Optimization}}\) is the root-mean-square error calculated according to the predicted (y′) and actual (y) values based on the validation data; n v is the sample size of validation data; C min, C max, σ min , σ max, and t max are user-defined constants based on a series of trial experiments.

Figure 3 presents the pseudocodes for the SIOR model. The firefly search parameters are initialized via a logistic chaotic map operator [Eq. (10)]. The inside loop of the firefly search process is performed as follows: (1) to dedicate the “AlphaNew” function, modify the initial value of parameter α by multiplying it by 0.9t (adaptive inertia weight) [Eq. (13)]; (2) use the Gauss/mouse chaotic map to compute the “BetaNew” function [Eq. (12)]; (3) use the “EvaluateSFA” algorithm to evaluate LSSVR solution quality via the fitness function f(C, σ) [Eq. (19)]; (4) use the “SortSFA” function to select the individual with the best fitness in the population; finally, (5) use “MoveSFA” to move the firefly toward more attractive individuals via Lévy flight in the search space [Eq. (18)].

Fig. 3
figure 3

Pseudocode for the swarm intelligence optimized regression model

The proposed SIOR system was implemented in MATLAB environment, and MATLAB GUIDE was used to design a user-friendly interface. Figure 4 illustrates the operating procedure with the system interface built in the MATLAB GUIDE environment. The SIOR system interface has three main parts: (1) initial parameter settings for the swarm intelligence optimized regression model; (2) data preparation, selection of objective function, and option of evaluation method; and (3) performance or prediction results. The initial parameter settings can be set to default values, or they can be customized by users. In the SIOR system, each firefly represents a possible solution, and its light intensity is inversely proportional to its objective function value. After the stopping criteria are reached, the best solution is defined as the firefly with the highest light intensity.

Fig. 4
figure 4

SIOR system interface

3.4 Evaluation theory

3.4.1 Performance measures

The prediction accuracy of the proposed system was evaluated in terms of R, RMSE, MAE, and MAPE. Table 1 summarizes the calculation formulas for these indexes. The R is a common measure of the fit of the curve to the actual data. The RMSE is computed to find the square error of the prediction compared to actual values and to find the square root of the summation value. That is, the RMSE is the average distance of a data point from the fitted line measured along a vertical line. This tool efficiently identifies undesirably large differences. The MAE is a measure of the difference between a forecasted value and the actual value. It computes the average magnitude of errors between predicted and actual values, disregarding direction of errors. The MAPE is a statistical measure of the performance of predictive models. Because it gives relative values, it is unaffected by the size or unit of actual and predicted values for fairly comparing models.

Table 1 Mathematical formulas for performance measures

3.4.2 Cross-fold validation method

Predictive performance is often validated by using a k-fold cross-validation algorithm to minimize bias associated with randomly sampled training and test data. Since cross validation randomly assigns individual cases into distinct folds, the folds themselves are often stratified. In stratified k-fold cross validation, the proportions of predictor labels (responses) in the folds approximate those in the original dataset. Since 10 folds are considered optimal [44], 10-fold cross validation was used to justify the predictive consistency of the model. The performance of the proposed model is measured in terms of average R, RMSE, MAE, and MAPE.

4 System evaluation

4.1 Data collection and descriptive statistics

The experimental data for the 300 valid samples were collected from 20 FRS-related studies published during 1983–2015 [1, 4, 2527, 30, 3234, 4555]. The shear strength of FRS is affected by many factors, including fiber characteristics, soil properties, and stress conditions. These factors have been conducted by various researchers in the above 20 literatures. Since these factors have been validated in several studies, they were used to predict FRS shear strength.

The complete dataset was divided into two parts, using the cross-validation algorithm: a learning dataset containing 270 samples and a test dataset containing the remaining data (30 samples). The 70% of learning dataset (189 samples) serving as training data was used to train the SIOR system. The remainder of learning dataset (81 samples) serving as validation data was used to determine the optimal LSSVR hyperparameters C and σ. After optimal hyperparameters were identified by the SFA, test data were then used to evaluate the learning performance of the LSSVR.

Table 2 describes the statistical attributes of the dataset. The predictor variables are fiber type (X 1), fiber length (X 2), fiber diameter (X 3), fiber content—volume percentage (X 4), fiber content—weight percentage (X 5), fiber gravity (X 6), soil type (X 7), soil grade (X 8), soil D50 (X 9), soil friction angle (X 10), cohesion (X 11), unit weight of dry soil (X 12), soil interface coefficient (X 13), fiber–soil interface friction angle (X 14), confining pressure/normal stress (X 15). Fiber types (X 1) in the dataset included nine types: polypropylene or polyprop (PP), polyamide, steel, glass, reed, coir, palm, polyester (PET), and unreinforced. Soil types (X 7) include fine sand, medium sand, medium dune, medium mortar, and coarse sand. Soil grade (X 8) consists of SP (poorly graded sand) and SW (well-graded sand) according the Unified Soil Classification System (USCS). The response is FRS friction angle (Y).

Table 2 Statistical parameters for fiber-reinforced soil dataset

4.2 Analytical results and discussion

As discussed above, obtaining accurate solution for the shear strength of FRS requires optimization of the LSSVR hyperparameters, including the regularization parameter (C) and sigma (σ) of the RBF kernel. Consequently, this is a two-dimensional optimization problem. In this study, the SFA was served as a search engine to optimize the C and σ.

Table 3 shows the setting for the SFA parameters during the optimization process. Particularly, the values of C and σ are set to large ranges of [0.001, 1012] and [0.001, 103], respectively. Population size of the SFA (i.e., the number of LSSVR hyperparameter pairs, C and σ) is set 50. The optimal hyperparameters of the LSSVR are obtained when the stopping criteria are reached, which is set as maximum iteration of 25 or consecutive rates of change in the objective function values less than 0.000001.

Table 3 Parameter settings for the enhanced swarm intelligence optimization algorithm

Table 4 presents the optimal values of LSSVR hyperparameters, including C and σ for 10 folds of the dataset. The results show that for predicting the peak shear strength of FRS, SFA optimizes the ranges of C and σ as [445.8886, 10e+6] and [4.1334, 10e+3], respectively, in the cross-fold validation procedure.

Table 4 Optimal values for least squares support vector regression hyperparameters

Table 5 shows the performance measures obtained by the proposed SIOR system during the learning and test phases. The SIOR system had average R, RMSE, MAE, and MAPE values of 0.958, 2.125°, 1.222°, and 2.86%, respectively, for the FRS test data, which demonstrated the effectiveness of the system. Notably, in the cross-fold validation procedure, the best MAPE value obtained by the proposed system was 2.01% at fold 8, and the best R value was 0.986 at fold 6.

Table 5 Performance measures obtained by the SIOR system

Figure 5 displays the observed (actual) and predicted peak friction angles of FRS with their performance measures in test data for 10 folds. The linear correlation patterns demonstrated the good prediction performance for the peak shear strength property of randomly distributed FRS.

Fig. 5
figure 5figure 5

Observed and predicted peak friction angles of FRS in test data for 10 folds

The performance of the proposed system is also compared with that of the baseline LSSVR and widely used empirical methods proposed by Zornberg [3] and Michalowski and Zhao [34]. Moreover, Najjar et al. [5] suggested that the empirical methods are unsuitable when confining pressure is higher than 400 kPa or aspect ratio of fibers is larger than 200 which is a very thin fiber. Thus, FRS samples with the above properties were excluded from the dataset when using empirical methods to calculate FRS shear strength.

All inputs shown in Table 2 are used in comparison with the proposed SIOR system and the baseline LSSVR (namely group 1). To fairly compare performance between the proposed system and the empirical method proposed by Michalowski and Zhao [34], the considered inputs are fiber aspect ratio η f (\(L_{\text{f}} /D_{\text{f}}\)), soil friction angle ϕ, fiber–soil interface friction angle ϕ w , and fiber content–volume percentage X f (namely group 2). Likewise, to compare performance between the SIOR system and the empirical method proposed by Zornberg [3], the inputs are soil friction angle ϕ, fiber aspect ratio η f, fiber content–volume percentage X f , soil interface coefficient c i,ϕ , and fiber–soil interface friction angle ϕ w (namely group 3).

For the three above groups, Table 6 shows the performance evaluation results obtained by the proposed system and those obtained by other methods. Figure 6 depicts the prediction accuracy of the methods. The comparison results for group 1 confirmed that performance of the SIOR system is better than that of the baseline LSSVR in terms of all performance indexes. The accuracy of predictions of peak shear strength of FRS obtained by the SIOR system is significantly improved and reaches 33.8%, 63.1%, 70.4%, and 71.9% in R, RMSE, MAE, and MAPE, respectively.

Table 6 Performance measures for the proposed system, the baseline model, and methods reported in the literature
Fig. 6
figure 6

Prediction performance comparisons between the proposed system and the others

Similarly, compared with the empirical methods used in group 2 and group 3, the numerical results indicate that the proposed SIOR system outperformed the methods proposed by previous studies [3, 34]. Particularly, the proposed system improved R, RMSE, MAE, and MAPE by 34.5/7.8%, 52.5/19.0%, 68.8/64.5%, and 99.7/44.7%, respectively, with the significant level higher than 1 or 5% (Table 6). Therefore, the hypothesis tests confirmed that the accurate and reliable prediction of shear strength of FRS by the SIOR system was superior to those of other predictive methods.

5 Conclusions

Fiber-reinforced soil is a promising alternative material for civil engineering systems. Shear strength of FRS is influenced complexly by many factors including fiber characteristics, soil properties, and stress conditions. Accurately estimating the shear strength of FRS is vital for civil engineers in designing geotechnical structures and management. Thus, this study developed a SIOR system in MATLAB environment to predict the peak shear strength friction angle of randomly distributed FRS.

This study makes three main contributions: Metaheuristic components are used to improve the efficacy of the conventional swarm algorithm; the enhanced algorithm effectively optimizes LSSVR hyperparameters. Thus, accuracy in predicting FRS shear strength can be improved. The proposed system was established by integrating the LSSVR and the advanced swarm intelligence optimization algorithm to assist civil engineers in designing optimally FRS.

A cross-fold validation method was then used to justify its generalization capability. In experiments performed using FRS data collected from 20 studies published during the 1983–2015, the results for the learning and test stages suggested that the system is efficient for forecasting the shear strength of randomly distributed FRS. The SIOR system also simplifies analyses of how the shear strength of FRS is affected by each mechanical and geometric characteristic of soils and fibers.

Compared with the baseline LSSVR and previous empirical methods, the proposed system yielded more convincing results. Particularly, R, RMSE, MAE, and MAPE improvements were 33.8%, 63.1%, 70.4%, and 71.9%, respectively, in comparison with the baseline LSSVR. The system achieves MAPEs of 5.42% and 3.62%, and improves prediction accuracy by 44.7% and 99.7% compared to the conventional empirical methods [3, 34]. The SIOR system is a potentially superior alternative for predicting FRS shear strength property. Therefore, civil engineers can use the novel system to achieve accurate and reliable prediction of shear strength in randomly distributed FRS.