Introduction

Soil with high contents of clay minerals like Montmorillonite and Verimculite is usually difficult to deal with, due to undesirable properties like absorbing high amounts of water, having low shear strength, and performing high compressibility. These types of clay are called expansive soils due to the swelling and shrinkage properties they display when water is present or absent, respectively (Gokul et al. 2020; Mohanty 2018). Thus, it is necessary to stabilize these soils to enhance the engineering properties of the soil such as mechanical strength, permeability, compressibility, durability, and plasticity. In the literature, chemical stabilization of soil has been practiced to improve the engineering characteristics and enhance the load-bearing capacity of weak soil (Coudert et al. 2019; Jayawardane et al. 2020).

In general, stabilization is referred to chemical improvements in the properties of the soil by adding soil additives. Several geotechnical research has been carried out on the effect of adding soil additives on the engineering properties of the soil. For example, Rai et al. (2020) investigated the effect of waste marble powder and magnesium phosphate cement on the properties of the soil. They found that marble powder and magnesium phosphate cement improved the stability of the soil. Gokul et al. (2020) experimentally examined the effect of Ground Granulated Blast Furnace Slag (GGBFS) on the stability of the clayey soil. Their results showed that the UCS of the stabilized soil was increased by increasing the percentage of GGBFS. Tanegonbadi and Noorzad (2017) improved the stability of the clay soil using Lignosulfonate. They found that the Lignosulfonate-treated soil increased the Unconfined Compressive Strength (UCS) and stiffness of the soil without leading to a considerable brittle behavior based on the stress–strain curves of UCS tests. Ghadir and Ranjbar (2018) determined the compressive strength and compared the mechanical performance of clay soil-geopolymer and clay soil-ordinary Portland cement. They found that geopolymer treatment and Portland cement were more efficient in dry conditions and wet environments, respectively. Furthermore, adding a chemical additive such as sodium hydroxide (NaOH) into clay soil will improve clayey soil characteristics such as undrained shear strength, Unconfined Compressive Strength, USC, and liquid limit (Neeladharan et al. 2007; Hu et al. 2009; Modmoltin and Voottipruex 2009; Rashad and Zeedan 2011; Davoudi and Kabir 2011; Cristelo et al. 2012; Phummiphan et al. 2016; Parhi et al. 2017; Miao et al. 2017; Fasihnikoutalab et al. 2019; Ghavami et al. 2020; Murmu et al. 2020). Generally, NaOH is one of the important inorganic chemical raw materials that are commonly utilized in industrial manufacturing, wood and paper products, the food industry, and many other fields (Hassan et al. 2019; Madhav et al. 2018). Recently, NaOH is applied to reduce the stabilization cost of the material used for road and building construction by mixing the soil with the required amount of NaOH (Phummiphan et al. 2017). The reviewed studies have shown that the UCS value was significantly influenced by sodium hydroxide concentration in the mixture.

Given the fact that stabilized soil behavior can be a challenge in a geotechnical context, the investigation of the stress–strain behavior of soils is important for evaluating safety in geotechnical engineering such as hazard prevention during tunneling and groundwater-soil interaction in excavations (Zhang et al. 2017; Lyu et al. 2019, 2020). The behavior of stress–strain graphs of these samples show high nonlinearity (Zheng et al. 2020). Several empirical models including artificial neural networks, supervised Learning Methods and mathematical have been used to predict engineering properties of soil, concrete, and so on (Nassr et al. 2018; Zheng et al. 2020; Molaabasi et al. 2019; Debnath and Dey 2016; Yao et al. 2018; Ahmad et al. 2020; Dutta et al. 2020, Ahmad et al. 2021). For example, Zheng et al. (2020) predicted the stress–strain behavior using the Long Short-Term Memory deep learning method, feed-forward neural network, and feedback neural network. The results indicated that the LSTM deep learning method was superior in predicting the stress–strain behavior with reported values of 8.25, 6.1, and 0.955 for the parameters of the mean absolute error (MAE), the mean absolute percentage error (MAPE), and R-squared. Debnath and Dey (2016) utilized a feed-forward back-propagation neural network with various learning algorithms to estimate the peak shear stress along with the cohesive soil–geosynthetic interface. It was observed that the Bayesian regularization back-propagation learning algorithm has the lowest statistical error compared to other learning algorithms. Dutta et al. (2020) evaluated the performance of three techniques including techniques ANN, Multiple regression analysis, and M5P model tree for the prediction of clay friction angle. The results indicated that the developed ANN model was superior to the one obtained using MRA and to M5P model tree technique, which is based on statistical parameters. Moreover, Table S1 as supplementary material captures some of the notable work that has been conducted in this field (Johari et al. 2011; Javadi et al. 2012; Kohestani and Hassanlourad 2016; Asr et al. 2018; MolaAbasi et al. 2019; Pham et al. 2022; Zhang et al. 2021; Kim et al. 2022). As the parameters of the current research show, some have had a broad scope, modeling the stress behavior of a general group of soils, and some focused on less widely used soil treatment methods aiming to understand the effect of specific stabilizers on soil behavior.

To the best of our knowledge, there are no studies in the literature about the effect of NaOH on the engineering properties of clay soil in the Northern part of Cyprus. In general, due to swelling minerals such as smectite or illite, clay soils may cause severe damage to the foundations and cracks on the surface of a structure, particularly when the soil is exposed to wetting and drying conditions (Shalabi et al. 2017). According to Abdeh (2018), swelling clay formations are mostly found in the western and northern parts of Nicosia, Cyprus. The reduction in strength and high expansion takes place in these areas due to the increase in water content in clay soil after wet winters. Thus, utilizing additives and chemical stabilizers can improve the engineering properties of the soil.

The purpose of this paper is to investigate the effect of NaOH on the engineering properties of clay soil. Clay soil-NaOH blends were mixed in various percentages (0.5, 10, 15, and 20%). The compaction properties (Atterberg’s limits, Maximum Dry Density, Optimum Moisture Content, and Unconfined Compressive Strength) of untreated and treated soil were measured. In this study, each test was repeated three times and then the average was determined to reduce the experimental error. Moreover, according to the authors’ review, no studies had evaluated the accuracy of the Quadratic model (QM) to predict the stress–strain behavior of the NaOH-soil mixture. The present work, therefore, consists of establishing a stress–strain behavior estimation model that aims to estimate stress–strain behavior of treated soil from more easily measured variables obtained at the same instant as the desired forecast. Based on the analysis of the experimental results, the accuracy of the Quadratic model (QM) for modeling the stress–strain behavior of NaOH treated clay soil. Moreover, the proposed model (QM) was compared with the most common machine learning models such as MFFNN, CFNN, RBFNN, and ENN, and MLR to show the predictive accuracy of the proposed model.

Materials and methods

In this section, the preparation of the clay soil-NaOH mixtures and measurement of their properties according to standards are discussed. Figure 1 illustrates the analysis procedure of the present study.

Fig. 1
figure 1

Flowchart of analysis procedure in the present study

Soil characterization

The soil used in the present study was collected from Haspolat village in the eastern suburb of North Nicosia, Cyprus. The color is gray and it is originated from mudstone. The soil samples were first dried at 50 °C in an oven, and then manually crushed and sieved through a sieve with openings of 2.35 mm to remove the unwanted organic particles and gravel fraction. The particle size distribution of soil was then investigated using the Hydrometer test following the ASTM D422 standard.The tests for determining the physical properties of the selected soil in terms of Atterberg limits (liquid limit, LL, plastic limit, PL), specific gravity, grain size distribution, and soil classification were done according to the ASTM D 4318, the ASTM D 422 and the Unified Soil Classification System, respectively, as shown in Table 1. Moreover, the standard Proctor compaction test following ASTM D 698 was conducted to estimate the maximum dry density (\({\rho }_{\mathrm{max}}\)) and the optimum moisture content (OMC) of the soil. In this work, each test was repeated three times and the average was determined to reduce the experimental error. It was found that the values of \({\rho }_{\mathrm{max}}\) and OMC are 1.72 g/cm3 and 20%, respectively.

Table 1 Engineering properties of clay soil

Sodium hydroxide (NaOH) pellets

Sodium hydroxide pellets are inorganic compounds. This material is also known as caustic soda with a chemical formula of NaOH. This inorganic compound consists of sodium cations Na+ and hydroxide anions OH. Sodium hydroxide is a highly caustic base and is used in industries such as the manufacture of pulp and paper, textiles, drinking water, soaps and detergents, and as a drain cleaner.

Preparation of mixtures

In this study, a series of mixtures were prepared by mixing the clay soil with various percentages of NaOH. The percentage of NaOH added to the soil were varying from 0 to 20% by weight.

In the present study, five different mixtures of clayey soil (CS) with a different percentage of NaOH (CS0, CS5, CS10, CS15, and CS20) were prepared. These mixtures were prepared as follows:

  • CS0 contains 100% of soil and 0% of NaOH

  • CS5 contains 95% of soil and 5% of NaOH

  • CS10 contains 90% of soil and 10% of NaOH

  • CS15 contains 85% of soil and 15% of NaOH

  • CS20 contains 50% of soil and 50% of NaOH

In this experimental work, the maximum dry density, and the optimum moisture content were measured for all samples. Then, the samples were compacted at their maximum dry density and optimum moisture content to be tested for their unconfined compressive strength. The unconfined compression test apparatus was used with a load ring capacity of 6.6 kN to analyze the stress–strain behavior of the samples. The monitoring of percentage strain during the test was done using a digital transducer connected to a computer for measuring the deformation of the sample by the accuracy of 0.01 mm. The heights of samples were measured before each test using a digital caliper. The load was transferred to the sample with a ratio of 1 mm/min.

Empirical models

Artificial neural networks (ANN)

Machine learning models are used as alternative tools to describe a complex system (Livingstone 2009; Keat et al. 2015; Kassem et al. 2019, 2018; Gökçekuş et al. 2020). They are utilized in a wide variety of applications in engineering and science. In this study, four empirical models [Multilayer feed-Forward Neural Network (FFNN), Cascade Forward Neural Network (CFNN), Radial Basis Function Neural Network (RBFNN), and Elman neural network (ENN)] are developed to predict the stress–strain behavior of the soil. In this work, the value of clay soil content, NaOH content, axial strain, area, and force are used as explanatory input variables. The data are divided into training and testing groups and the results by the models are compared with each. In this study, the training was done using data for 0, 5, 10, and 20% NaOH treated soil, and the developed model was used to predict the stress–strain plot for 15% NaOH treated soil, and then compared with the stress–strain plot obtained by performing laboratory test on actually prepared 15% NaOH treated soil.

Multilayer feed-forward neural network (MFFNN)

The MFFNN consists of three layers (input layer, hidden layer, and output layer). The number of neurons and the number of hidden layers should be carefully selected because they affect the accuracy of training. TRAINLM is utilized as a training function. Also, Mean squared error (MSE) is estimated to find the best performance of the training algorithm. The declining gradient of the back-propagation algorithm is utilized to reduce the value of MSE between the actual and estimated output. The description of the developed model was given in Kassem and Gokcekus (2021). Figure 2 illustrates the explanation process of the proposed MFFNN method.

Fig. 2
figure 2

The proposed algorithm of predicting stress–strain behavior of the soil using MFFNN

Cascade feed-forward neural network (CFNN)

CFNN represents a static neural network where the signals move in the forward direction only (Alkhasawneh and Tay 2018). It is similar to a feed-forward neural network but it contains a connection from the input and every previous layer to the layers of the following layer (Hedayat et al. 2009; Zheng et al. 2020). The advantage of this model is that it can show the nonlinear relationship without eliminating the linear relationship between input and output. The optimum number of neurons is based on the minimum value of RMSE. The description of the developed model was given in Kassem and Gokcekus (2021). Figure 3 describes the steps of the proposed model (CFNN).

Fig. 3
figure 3

The proposed algorithm of predicting stress–strain behavior of the soil using CFNN

Radial basis neural networks (RBFNN)

RBFNN is a feed-forward network, which consists of one input layer, one hidden layer, and one output layer. It is used radial basis functions as activation functions (Barati-Harooni and Najafi-Marghmaleki 2016). Speed and efficiency are the most important advantages of RBFNN models compared to other multi-layer perceptron models due to their simple structure. The description of the developed model was given in Kassem and Gokcekus (2021). Figure 4 illustrates the steps of the proposed model (RBFNN).

Fig. 4
figure 4

The proposed algorithm of predicting stress–strain behavior of the soil using RBFNN

Elman neural network (ENN)

The ENN is a simple type of recurrent neural network. It includes four main layers namely, the input layer, context layer, hidden layer, and the output layer (Yu et al. 2019). The main ENN structure is similar to the multi-layer neural network. As mentioned previously, there is a context layer in ENN; the inputs of this layer come from outputs of the hidden layer, which were utilized to store the hidden layer’s output values of the previous time (Yu et al. 2017; Ren et al. 2018).

Quadratic model (QM)

The QM is a mathematical model that can be simultaneously model the input variables and output variables affected by input variables. The aim of using this model is to estimate the degree of effect of the input variables (percentage of clay soil (CSc), percentage of NaOH (NaOHc), axial strain (AS), area (A), and force (F)) on the output parameter (deviator stress (q)) of the treated soil. Equation (1) presents the relationship form between the input parameters and output parameters.

$$q=f\left(PCS, PNaOH, LS, \mathrm{A}, \mathrm{F}\right).$$
(1)

Based on the actual data, regression analysis was carried out by the following quadratic polynomial model:

$$Y={\beta }_{0}+\sum_{i=1}^{n}{\beta }_{i}{x}_{i}+\sum_{i=1}^{n}{\beta }_{ii}{x}_{i}^{2}+\sum_{i}^{n-1}\sum_{i=i+1}^{n}{\beta }_{ij}{x}_{i}{x}_{j},$$
(2)

where \(Y\) is the predicted response, \({\beta }_{0}\) a constant, \({\beta }_{i}\) the linear coefficient, \({\beta }_{ii}\) the squared coefficient, and \({\beta }_{ij}\) the cross-product coefficient, \(n\) is the number of factors, xi and xj are the independent variables.

Multiple linear regressions (MLR)

MLR is a classical method, which attempts to model the correlation between independent variables (x) and dependent (y). It explores how the dependent and independent variables are correlated. The MLR model is

$$Y={\beta }_{0}+{\beta }_{1}{x}_{1}+\dots +{\beta }_{i}{x}_{i} i=\mathrm{1,2}\dots n,$$
(3)

where \(Y\) is the predicted response, \({\beta }_{0}\) a constant, \({\beta }_{i}\) the intercept and \({x}_{i}\) where i = 1,2,..,n, denotes the explanatory or independent variables.

Model performance criteria

Coefficient of determination, root mean squared error and mean absolute error were used to measure the estimation success of the models. The following equations were used for evaluation.

Coefficient of determination (R 2)

$${R}^{2}=1-\frac{\sum_{i=1}^{n}{\left({a}_{a,i}-{a}_{p,i}\right)}^{2}}{\sum_{i=1}^{n}{\left({a}_{p,i}-{a}_{a,\mathrm{ave}}\right)}^{2} }.$$
(4)

Mean squared error (MSE)

$$\mathrm{MSE}=\frac{1}{n}\sum_{i=1}^{n}{\left({a}_{a,i}-{a}_{p,i}\right)}^{2}.$$
(5)

Root mean squared error (RMSE)

$$\mathrm{RMSE}=\sqrt{\frac{1}{n}\sum_{i=1}^{n}{\left({a}_{a,i}-{a}_{p,i}\right)}^{2}}.$$
(6)

Mean absolute error (MAE)

$$\mathrm{MAE}=\frac{1}{n}\sum_{i=1}^{n}\left|{a}_{a,i}-{a}_{p,i}\right|.$$
(7)

Nash–Sutcliffe efficiency (NSE)

$$\mathrm{NSE}=1-\frac{\sum_{i-1}^{n}{\left({a}_{a,i}-{a}_{p,i}\right)}^{2}}{\sum_{i-1}^{n}{\left({a}_{a,i}-{a}_{a,\mathrm{ave}}\right)}^{2}}.$$
(8)

Willmott’s index of agreement (d)

$$d=1-\frac{\sum_{i-1}^{n}{\left({a}_{a,i}-{a}_{p,i}\right)}^{2}}{\sum_{i-1}^{n}{\left(\left|{a}_{p,i}-{a}_{a,\mathrm{ave}}\right|+\left|{a}_{a,i}-{a}_{a,\mathrm{ave}}\right|\right)}^{2}},$$
(9)

where \(n\) is the number of data, \({a}_{p,i}\) is the predicted values, \({a}_{a,i}\) is the actual values, \({a}_{a,\mathrm{ave}}\) is the average actual values and i is the number of input variables.

Results and discussion

Soil stabilization and characterization tests

Atterberg limits

Figure 5 shows the results of Atterberg Limits in terms of liquid limit (LL), plastic limit (PL), and plasticity index (PI) tests of the five samples. Additionally, the results of the Atterberg limits are plotted on the Holtz and Kovacs diagram and workability chart as shown in Fig. 6. The Liquid Limit (LL), Plastic Limit (PL), and Plasticity Index (PI) following ASTM D4318 are within the range of 68–30%, 32–17% and 36–5%, respectively. It was found that all Atterberg parameters, LL, PL, and PI, decreased almost linearly with the increase in NaOH content.

Fig. 5
figure 5

Atterberg limits of Clay soil-NaOH mixture

Fig. 6
figure 6

Position of soil-NaOH samples; a on the Holtz and Kovacs diagram (Holtz and Kovacs 1981) and b Casagrande diagram

Except for the CS20, which was in the moderate plastic region, all clay soil-NaOH samples were found in the highly plastic region, as the reference clay sample as shown in Fig. 3a. CS0, CS5, CS10, and CS15 samples were found in the Illite clay region. It shows that these samples have higher LL and PI values than CS20. CS10 has acceptable extrusion properties similar to the CS5 sample, whereas the CS15 sample has optimal properties. CS20 demonstrates lower cohesion and is an outlier as shown in Fig. 3b. In general, according to Murray (2007), the plasticity value of the soil is an essential parameter utilized in estimating the application of the clay body. In clay soil-NaOH, PL value is within the range of 32–17%. According to Monterio and Vieira (2004), the high value of PL is associated with greater mechanical strength. Furthermore, Siddiqui and Osman (2012) concluded that the differences in PL of the soil samples are usually understandable from the particle size distribution data.

Unconfined compressive strength and optimum compaction

It is well known that the estimation of shear strength of the soil is essential in civil engineering for various applications like highway and airfield design, the stability of slopes and cuts, and the design of coastal structures. In this study, Unconfined Compressive strength (UCS) tests were carried out on samples with 0%, 5%, 10%, 15%, and 20% NaOH contents, compacted at their at \({\rho }_{\mathrm{max} }\) and OMC, which were obtained from the standard Proctor compaction curves. Figure 7 shows the compaction curves of the clay soil-NaOH at different NaOH contents. It was observed that the sample with 15% NaOH has the maximum value of \({\rho }_{\mathrm{max}}\) and minimum value of OMC comparing to other samples (see Table 2).

Fig. 7
figure 7

Compaction curves of the clay soil at different NaOH content

Table 2 Results of \({\rho }_{\mathrm{max} }\) and OMC for clay soil-NaOH samples

The unconfined compression test apparatus was used with a load ring capacity of 6.6 kN. The monitoring of percentage strain during the test was done using a digital transducer connected to a computer for measuring the deformation of the sample by the accuracy of 0.01 mm. The height of samples was measured before each test using a digital caliper. The load was transferred to the sample with a ratio of 1 mm/min.

Figure 8 illustrates the effect of various percentages of NaOH on the results of the UCS of the samples. Comparing with the sample without NaOH, the UCS of the sample with 15% NaOH increased from 97.5 to 201.7 kPa, i.e. 15% NaOH increased the UCS to peak value as shown in Fig. 8. Additionally, it was found that by increasing the NaOH content to 20% the value of UCS was reduced from 201.7 to 142.0 kPa. It was observed that the unconfined strength value increases 107% with the addition of 15% stabilizer, and the further addition of NaOH has an adverse effect on UCS and hence the shear strength of the soil. The witnessed strength gain can be described by the reaction of sodium hydroxide with the clay mineral. The alkaline attack changes the clay mineral lattice and results in producing sodium silicate and sodium aluminates. Further addition of NaOH is thought to disturb the clay mineral structure and therefore decrease the UCS. According to Das and Sobhan (2014) consistency/UCS correlation chart, the compacted untreated clay samples consistency can be classified as “firm” based on the obtained value of UCS, while the UCS of samples treated with 5%, 10%, and 20% NaOH classify them as “stiff” and with 15% NaOH as “very stiff”.

Fig. 8
figure 8

Unconfined compressive strength (UCS) and undrained shear strength (USS) of the clay soil with NaOH content

Given that the undrained shear strength is independent of the confining pressure in saturated undrained clays, the shear strength at failure,\({\tau }_{f},\) of the samples would be equal to the cohesion of them and could be calculated by dividing the failure stress by two (Das and Sobhan 2014).

It was found that the sample with 15% NaOH has the maximum average \({\tau }_{f}\) with a value of 99 kPa as shown in Fig. 8.

Artificial models

As mentioned previously, four neural network models were employed to predict the deviator stress (q) for clay soil-NaOH samples. Thus, the value of clay soil content, NaoH content, axial strain, area, and force are used as explanatory input variables. The data were divided into training and testing groups and the results by the models were compared with each. In this study, the training was done using data for 0, 5, 10, and 20% NaOH-treated soil, and the developed model was used to predict the stress–strain plot for 15% NaOH-treated soil, and then compared with the stress–strain plot obtained by performing laboratory test on actually prepared 15% NaOH treated soil. The summary statistics of the independent variables, which are considered as input and dependent variables (output), are given in Table 3.

Table 3 Statistical parameters of used variables

A series of models were examined to estimate the optimum number of hidden layers (HL), the number of neurons (NN), and transfer function (TF) for the MFFNN, CFNN, and ENN models. It should be noted that the number of HLs and NNs in the MFFNN, CFNN and ENN models was determined by utilizing trial and error approaches. Based on the value of MSE, it was found that two hidden layers and ten neurons are selected as the best for the MFFNN model (5:1:1) with an MSE value of 6.20 \(\times \) 10–6. While it found that two hidden layers and five neurons were chosen as an optimum number for the CFNN model (5:2:1) with an MSE value of 274 \(\times \) 10–6. Also, it was observed that the ENN model (5:3:1) with ten neurons has the minimum MSE with a value of 2.26 \(\times \) 10–6.

Moreover, the 10-th order root of the input data was used instead of actual input data to provide better performance for the RBFNN model. This helps to smooth the variation of the input data points within a narrower range and this leads to better accuracy of the implemented model. Then, the data points were randomly divided into training and testing subsets. The random division was carried out several times to prevent the aggregation of data points in the desired domain of the problem and to provide a smooth distribution of data points within the training and testing sets. In general, the spread and the maximum number of neurons (MNN) are important parameters in the structure of RBFNN as the performance and accuracy of the implemented model are significantly affected by the values of these parameters. Similarly, the optimum values of these parameters were estimated by a trial and error approach. It was observed that the optimum values that provide the most accurate performance for the RBFNN model are 0.001 and 200 for the spread and MNN, respectively.

Figure 9 illustrates the R-squared for training data of deviator stress for clay soil-NaOH mixtures. R-squared was used to evaluate the performance of artificial models. R-squared means the degree of the linear relationship between the observed and modeled values. The line is almost straight with a 45° angle and this proves the accuracy of the provided model. For the training phase, the R2 value was found to be approximately 1 as shown in Fig. 6. The results obtained from the ANN models show that the use of ANN is enough to predict deviator stress for clay soil-NaOH mixtures.

Fig. 9
figure 9

Comparison between predicted data with actual data of deviator stress using various ANN models

Mathematical models

The developed mathematical models (MM) including QM and MLR were implemented to predict the deviator stress for the clay soil-NaOH mixtures. The data of clay soil content, NaOH content, LS, A, and F were used to generate a mathematical equation based on QM and MLR for q as given in Eqs. (8) and (9), respectively.

$$q=-0.003054-0.000039\bullet CSc-0.000277\bullet LS-0.000147\bullet A+1.07364\bullet F-0.000770\bullet {CSc}^{2}-0.005534\bullet {LS}^{2}+0.003696\bullet {A}^{2}-0.003347{F}^{2}+0.000809\bullet CSc\bullet LS+0.004298\bullet CSc\bullet A-0.001933\bullet CSc\bullet F-0.001844\bullet LS\bullet A+0.006123\bullet LS\bullet F-0.116304\bullet A\bullet F,$$
(8)
$$q=0.023284-0.005336\bullet CSc+0.006188\bullet LS-0.051475\bullet A+1.01040\bullet F.$$
(9)

Moreover, the results of the analysis of variance are presented in Table 4. The results of the actual data and the corresponding values predicted by Eqs. (7) and (9) are displayed in Fig. 10. To test the fit of the model, R-squared was determined. For higher modeling accuracy, the R-squared value should be closer to 1. In this case, the values of R-squared for training data are 0.99999 for QM and 0.9995 for MLR.

Table 4 Analysis of variance of QM for deviator stress responses
Fig. 10
figure 10

Comparison between predicted data with actual data of deviator stress using QM and MLR

Moreover, Fig. 11 presents two-dimensional (2D) contour plots and three-dimensional (3D) surfaces for predicting the deviator stress of the treated soil. It shows the influence of input variables (CSc, AS, F, A) on the deviator stress (q) of the treated soil. The number written on each contour area indicates the deviator stress in the specified conditions.

Fig. 11
figure 11

Contour plots of deviator stress data

Performance evaluation of artificial models and mathematical models for testing data

The data were divided into training and testing groups and the results by the models were compared with each. In this study, the training was done using data for 0, 5, 10, and 20% NaOH treated soil, and the developed model was used to predict the stress–strain plot for 15% NaOH treated soil, and then compared with the stress–strain plot obtained by performing laboratory test on actually prepared 15% NaOH treated soil. Furthermore, the R-squared, RMSE, MAE, and ADD were determined to select the best model for predicting the behavior of stress–strain of clay soil-NaOH mixtures. R-squared is a measure of how well the regression line represents the data, while RMSE and MAE are direct methods for describing deviations. For high accuracy, R-squared must be close to 1.0, and the RMSE and MAE between the observed and predicted values must be as small as possible. Table 5 shows the results of the R-squared, RMSE, and MAE values for all models. It was observed that all models gave good predictions according to the R-squared values for the testing data. Also, it was found that the QM and MFFNN have the highest value of R2 and the lowest value of RMSE and MAE for the testing data. Moreover, Fig. 12 shows the comparison of the estimated and observed values of the deviator stress for all models. Additionally, the values of deviator stress for CS15 are reported in Table S2 as supplementary material. The deviation between the estimated and experimental results of deviator stress is shown in Fig. 13. It can be seen that most of the deviations were positive and only a few were negative for QM and MFFNN models.

Table 5 Performance evaluation of the models
Fig. 12
figure 12

Comparison of the predicted and observed values of the deviator stress of CS15 for all models

Fig. 13
figure 13

The relative error for deviator stress

Conclusion

In this study, the effect of NaOH concentrations on the stabilization of high plasticity clay was investigated. Therefore, to address the main objective, the authors proposed a Quadratic model to predict the behavior of stress–strain of treated and untreated soil as a function of the percentage of clay soil, percentage of NaOH, axial strain (AS), area (A), and force (F). Besides, the behavior of stress–strain of soils was evaluated through four artificial models, namely Multilayer Feed-Forward Neural Network, Cascade Feed-forward Neural Network, Radial Basis Neural Network and Elman neural network based on the experimental data. Furthermore, the proposed models were then compared in terms of predictive accuracy to select the best model. The most important results can be summarized in the following cases:

  • The treatment of studied soil with NaOH increases maximum dry unit weight and decreases the optimum moisture content.

  • From the UCS test, it was observed that with increasing the percentage of NaOH from 0 to 15%, the peak axial stress increased from 97.54 to 201.71 kPa (i.e., the related maximum increment was up to 107%). Nevertheless, this value was reduced when the percentage of NaOH exceeded the optimum content of 15% NaOH. Similar trends were followed in the case of undrained shear strength values from stress–strain curves of UCS tests.

  • Based on the stress–strain curves of UCS tests, it was observed that this treatment increases the stiffness and UCS of the soil without leading to considerable brittle behavior up to 15% addition of NaOH.

  • The results indicate that the all developed models are valid in predicting the stress–strain behavior of treated soil. Furthermore, the findings demonstrated that the QM model performed well and presented high accuracy in modeling the stress–strain behavior.

Based on our findings, the QM model developed for this study to predict the stress–strain behavior of NaOH stabilized clay soil could be utilized further to test soil with other stabilizers or characteristics. Moreover, future research should focus on evaluating the effects of curing time on the engineering properties of the soil.