Predicting the maximum dry density and optimum moisture content from soil index properties using efficient soft computing techniques

Ali, Hunar Farid Hama; Omer, Bashdar; Mohammed, Ahmed Salih; Faraj, Rabar H.

doi:10.1007/s00521-024-09734-7

Predicting the maximum dry density and optimum moisture content from soil index properties using efficient soft computing techniques

Original Article
Published: 15 April 2024

Volume 36, pages 11339–11369, (2024)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Neural Computing and Applications Aims and scope Submit manuscript

Predicting the maximum dry density and optimum moisture content from soil index properties using efficient soft computing techniques

Download PDF

313 Accesses
Explore all metrics

Abstract

This study aims to achieve the densest possible state of soil for constructing dams and roads. This requires assessing the compaction characteristics, Optimum Moisture Content (OMC), and Maximum Dry Density (MDD) to determine the soil's suitability for earthworks. However, this process is resource-intensive and time-consuming. To streamline the assessment, the study incorporates six parameters: gravel (G), sand (S), fine (F) contents, plastic limit (PL), liquid limit (LL), and plasticity index (PI). Four different models are used to predict compaction characteristics: artificial neural network (ANN), nonlinear regression (NLR), linear regression (LR), and multilinear regression (MLR). The study utilized a substantial dataset of 2162 entries, considering various soil gradation and plasticity properties as input variables. To evaluate the models' effectiveness, several statistical measures, including coefficient of determination (R²), scatter index (SI), root mean squared error (RMSE), mean absolute error (MAE), a20-index, and Objective (OBJ) value, were employed. The ANN model outperformed other models in predicting OMC, with RMSE, MAE, OBJ, SI, a20-index, and R² values of 3.51, 2.31, 4.26, 0.202, 0.7, and 0.92%, respectively. However, for predicting MDD, the ANN model had the highest R² value (R² = 0.87), but the minimum RMSE (1.01), MAE (0.8), a20-index (0.998), and OBJ (1.07) were obtained from the MLR and LR models. Furthermore, sensitivity analyses revealed that the plastic limit significantly influences the OMC, while the gravel content plays a dominant role in predicting MDD.

Advanced Mathematical Models to Predict the Compaction Properties of Coarse-Grained Soils from Various Physical Properties

Article 12 April 2018

Prediction of California Bearing Ratio Using Soil Index Properties by Regression and Machine-Learning Techniques

Article 01 November 2022

Artificial Neural Network Equations for Predicting the Modified Proctor Compaction Parameters of Fine-Grained Soil

Article 31 January 2022

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Soil compaction is critical in geotechnical engineering, particularly in construction projects like roads and embankments. Accurate prediction of the Maximum Dry Density (MDD) and Optimum Moisture Content (OMC) is essential to achieve the desired level of compaction. In recent years, researchers have increasingly turned to soft computing models and artificial neural networks (ANN) to improve the accuracy of such predictions. This literature review provides an overview of the existing research in this field.

Soil compaction plays a vital role in geotechnical engineering by enhancing the mechanical properties of soil. Proper compaction ensures structural integrity and prevents settlement and instability [18].

Historically, compaction characteristics were predicted using empirical and semi-empirical methods, such as the Proctor test. These methods, while practical, often lack the accuracy and flexibility required for various soil types and conditions [57].

Soft computing models, including artificial neural networks (ANN), fuzzy logic, and genetic algorithms, have gained popularity for predicting soil compaction characteristics. These models can handle complex, non-linear relationships within large datasets [116].

ANN is a popular choice for predicting soil properties due to its ability to learn from data and adapt to various soil types. Studies have successfully applied ANN to predict MDD and OMC from soil index properties [83].

Previous research has shown that the accuracy of predictions is highly dependent on the selection of input parameters. Parameters like gravel (G), sand (S), plastic limit (PL), liquid limit (LL), and plasticity index (PI) have been identified as significant predictors of compaction characteristics.

Researchers often evaluate the performance of predictive models using various statistical metrics, including R-squared (R2), root mean squared error (RMSE), mean absolute error (MAE), and scatter index (SI). These metrics help assess the accuracy and reliability of the models.

Recent studies have focused on enhancing the efficiency and accuracy of predictive models. Some have incorporated advanced optimization techniques and ensemble modeling to improve results.

The choice of an appropriate model is essential and varies based on the specific application. ANN may perform well in some cases, while linear or multilinear regression models may be more suitable in others. Understanding each model's strengths and limitations is crucial.

Accurate predictions of MDD and OMC from soil index properties have practical implications in geotechnical engineering, enabling engineers to optimize construction processes, reduce costs, and enhance the quality and durability of infrastructure projects. There are several methods to stabilize soils. One of the most significant methods to densify the soil is compaction. Soil compaction results in the decrease of the voids between soil particles. Consequently, soil shear strength observably increases along with the declining compressibility and pearmeability of the soil [33]. To determine both compaction characteristics, Maximum Dry Density (MDD) and Optimum Moisture Content(OMC), in the laboratory, either the Standard Proctor (SP) or Modified Proctor (MP) methods are used [26].

To assess the suitability of a soil for an earthwork, the compaction characteristics of the soil must be identified. For an earthwork project, large quantities of soil are necessitated. Obtaining the massive volume with a desired compaction characteristic from a single borrow source might be tedious and time-consuming. Compaction characteristics must be obtained from a laboratory compaction test to determine the suitability of soils collected from various borrows sources. Nevertheless, for laboratory compaction tests, sufficient effort and time are required. Consequently, for any such project, the suitability of the required soils is preliminarily assessed by establishing correlations between the compaction characteristics and simple physical properties obtained through simple index tests [104]. There have been several attempts to determine the compaction characteristics indirectly. For this reason, several correlations have been developed to estimate the compaction characteristics through soil index properties [7, 23, 28, 31, 44, 51, 52, 89, 104, 117] and because of soil fractions [45, 99]. Some studies indicated that an individual input parameter could not sufficiently predict the compaction characteristics. Therefore, multiple linear regression (MLR) models were utilized to predict the compaction characteristics considering different basic soil parameters [9, 24, 26, 45, 49, 67, 98]. In addition to this, the machine learning technique was used in some other studies to develop more accurate correlations [48, 79, 101, 110]. An early study employing artificial neural networks (ANN) by Shook and Fang [97] to predict the soil compaction characteristics indicated that the ANN technique outperforms the traditional statistical models, and a reliable prediction can be attained. Further, Sivrikaya and Soycan [100] utilized the ANNs for estimating the compaction characteristics of 108 fine-grained soil samples.

Sivrikaya [98] performed the multilinear regression model (MLR) to predict the compaction characteristics of fine-grained soils from soil index properties and soil gradations. The considered input parameters were combinations of gravel content (G), sand content (S), fine-grained content (F), plasticity index (PI), liquid limit (LL), and plastic limit (PL). The study concluded that the compaction characteristics could correlate with Plastic limit well compared to other index properties. [31] employed different techniques: simple—multiple analysis and artificial neural networks to forecast the compaction characteristics based on the soil gradations. The study highlights that reliable correlations (R² = 0.70–0.95) for preliminary design can be achieved from both techniques. Furthermore, [67] developed multiple regression analysis models for experimental data collected from 110 sandy soils to predict the compaction characteristics in terms of the uniformity coefficient (Cu) and Compaction Energy (CE). Considering the liquid limit (LL), the plasticity index (PI), and compaction energy (CE). Tenpe and Kaur [108] Investigated artificial neural network (ANN) modeling performance for forecasting compaction parameters based on soil index properties. Moreover, Omar et al. [77] utilized advanced mathematical models and an innovative solution to predict the compaction characteristics of fine-grained soils from numerous physical properties. In this study, multiple linear regression (MLR), artificial neural networks (ANNs), and support vector regression (SVR) are employed. In addition, Farooq et al. [26] utilized a multiple regression model to forecast the optimum moisture content. Also, Saikia et al. [89] developed a set of regression models for predicting the compaction characteristics concerning the consistency limits.

In their study, Karimpour-Fard et al. [53] utilized artificial neural networks (ANNs) and multilinear regression (MLR) techniques across 728 datasets to develop predictive models for compaction characteristics. These models were designed to estimate compaction characteristics based on soil type, grain size distribution (including LL, PL), and specific gravity at varying energy levels. Their findings emphasized the significant impact of fine content on compaction characteristics compared to other factors. Additionally, they underscored the potential advantages of MLR models in predicting compaction characteristics despite the greater effectiveness of ANN models, attributing this to the inherent "black box" nature of ANN models. Taking into account input parameters such as compaction energy and plastic limit, these models were evaluated.

A novel application of artificial neural network (ANN) was utilized in a recent study Verma and Kumar [110] to estimate fine-grained soil's modified Proctor compaction parameters. From a highway construction work site, from 532 in situ soil samples, several geotechnical parameters were obtained from the laboratory testing. Besides the index properties test, modified Proctor compaction tests were conducted on the collected soil samples. Python V3.7.9 platform was adopted to write the ANN algorithm code for the analysis. Such useful geotechnical parameters as gravel (%), sand (%), fine content (FC), and percent material retained on 2.0 mm (R2.0 mm), 0.425 mm (R0.425 mm), and 0.075 mm (R0.075 mm), coarse sand (CS), medium sand (MS), fine sand (FS), liquid limit (LL), plastic limit (PL), and plasticity index (PI) were considered as input parameters. Verma and Kumar [111] focused on developing a multi-layer perceptron neural network model to predict the modified compaction characteristics of coarse and fine-grained soils. Similarly, the Python V3.7.9 platform was adopted to write artificial neural network (ANN) algorithm code. One hundred seventy-nine datasets of coarse-grained and 69 datasets of fine-grained soils are examined, considering gravel (%), sand (%), FC (%), LL (%), PL (%), PI (%) as input parameters to predict the modified compaction characteristics. In the developed models, the high correlation coefficient (R was obtained; as the value was more than 0.80 and 0.90 for coarse-grained and fine-grained soil respectively.

In the context of existing literature, various statistical and machine learning models were employed to forecast compaction characteristics based on diverse soil properties, primarily for assessing soil suitability in earthworks. Statistical models offer ease of use, delivering output predictions in equations that can prove valuable for practical field applications. On the other hand, machine learning models excel in processing substantial datasets and identifying nuanced trends and patterns that might elude human perception. Machine learning algorithms exhibit exceptional capability in handling complex, multi-dimensional, and multifaceted data, even in dynamically changing and uncertain conditions, as demonstrated by studies conducted by Mahmood et al. [61], Piro et al. [82], Abdalla and Mohammed [2], and Hama Ali [34]. These models have been employed in a few research studies to predict compaction characteristics.

Furthermore, in rare studies, large quantities of data covering soil index properties and particle—size were considered. In this work, a large volume of the dataset (2162 datasets) has been compiled from previous studies. Four different models of Linear Regression, Nonlinear Regression (NL), Multilinear Regression (ML), and Artificial Neural Network (ANN) have also been utilized to predict the compaction characteristics (OMC, MDD) concerning soil index properties and soil particle—sizes. In addition to predicting the compaction characteristics, this work evaluates the performance of the utilized models.

1.1 Related studies

Using various modeling techniques, the literature review provides valuable insights into predicting soil compaction characteristics, specifically the Maximum Dry Density (MDD) and Optimum Moisture Content (OMC). Below are the essential findings and trends observed in the related studies: soil compaction is a critical aspect of geotechnical engineering, ensuring the structural integrity of construction projects such as roads and embankments. Accurate predictions of MDD and OMC are essential for achieving the desired level of compaction. Researchers have increasingly turned to soft computing models, including Artificial Neural Networks (ANN), to improve the accuracy of predictions. These models are valued for handling complex, non-linear relationships within large datasets. Traditional methods, such as the Proctor test, were historically used for predicting compaction characteristics. However, these methods often lack accuracy and flexibility across different soil types and conditions.The accuracy of predictions depends significantly on the selection of input parameters. Parameters like gravel content, sand content, plastic limit, liquid limit, and plasticity index have been identified as significant predictors of compaction characteristics. Researchers employ various statistical metrics, including R-squared (R2), root mean squared error (RMSE), mean absolute error (MAE), and scatter index (SI), to evaluate the performance of predictive models. These metrics help assess the accuracy and reliability of the models. Recent studies have focused on enhancing the efficiency and accuracy of predictive models. Some have incorporated advanced optimization techniques and ensemble modeling to improve results. Choosing the appropriate model is essential and depends on the specific application. While ANN performs well in some cases, linear or multilinear regression models may be more suitable in others. Understanding the strengths and limitations of each model is crucial. Accurate predictions of MDD and OMC from soil index properties have practical implications in geotechnical engineering, allowing engineers to optimize construction processes, reduce costs, and enhance the quality and durability of infrastructure projects. Some studies have employed machine learning techniques, such as ANNs, to predict soil compaction characteristics, with promising results regarding model performance and accuracy. Large datasets and model evaluation: the work under review stands out for its use of a large dataset (2162 datasets) and for applying four different models, including ANN, to predict compaction characteristics. The study not only predicts these characteristics but also rigorously evaluates the performance of the utilized models. Comparison of soft computing models and statistical models: the literature reveals a trend of comparing soft computing models (e.g., ANN) with traditional statistical models (e.g., linear regression) to determine which one offers better predictive capabilities for soil compaction characteristics. Several studies have indicated that relying on a single input parameter is insufficient for accurate predictions. Hence, models that consider multiple basic soil properties, especially those related to particle size distribution and consistency limits, have been developed and found to provide more reliable predictions. In summary, the reviewed literature emphasizes the importance of accurate predictions of MDD and OMC for soil compaction in geotechnical engineering. Soft computing models, particularly ANNs, have shown promise in improving the accuracy of predictions. The choice of input parameters, model evaluation metrics, and the practical implications of these predictions are significant considerations. Additionally, recent studies have incorporated advanced techniques to enhance the efficiency and accuracy of predictive models. The field is continually evolving, with an increasing focus on harnessing the potential of machine learning and large datasets for more precise predictions.

2 Scope of the work

In light of the literature, an individual input parameter is unlikely to be applicable to predict the compaction characteristics of different soil types. Therefore, numerous datasets from the literature were compiled to develop models to predict the compaction characteristics of soil by including simple soil properties such as the G, S, F, LL, PL%, and PI%. Subsequently, four different models were developed. This enables evaluating the models' performance and the identification of the effects of various soil properties. Consequently, the datasets in two groups of training and testing are examined in the models to reach the main objectives of the study:

(i)
The influence of the physical soil properties on the compaction characteristics will be investigated. The input parameter with the most influential role in measuring the value of the OMC and MDD will be assigned from a sensitivity analysis.
(ii)
Concerning the statistical evaluation tools, the model outperforming the prediction of the compaction characteristics of soil from basic soil properties will be determined and compared with the other models.

The major contributions and novelty in this work can be summarized as follows:

Multi-parameter approach: the study's innovation lies in adopting a multi-parameter approach to predict the compaction characteristics of soils. Instead of relying on a single input parameter, it incorporates six different parameters: gravel (G), sand (S), fine content (F), plastic limit (PL), liquid limit (LL), and plasticity index (PI). This approach recognizes that a single parameter may not be universally applicable across different soil types, and therefore, a more comprehensive set of input variables is considered.
Model Diversity: The work employs four distinct models: artificial neural network (ANN), nonlinear regression (NLR), linear regression (LR), and multilinear regression (MLR) models. This diversity in modeling techniques allows for a robust assessment of each model's predictive capabilities and performance, thus ensuring a comprehensive evaluation of the data.
Extensive dataset compilation: to support the development of these models, a comprehensive dataset of 2162 entries is compiled, encompassing various soil gradation and plasticity properties as input variables. This dataset's size and comprehensiveness contribute to the reliability and generalizability of the models.
Statistical evaluation tools: the study employs a range of statistical evaluation tools, including the coefficient of determination (R²), scatter index (SI), root mean squared error (RMSE), mean absolute error (MAE), a20-index, and Objective (OBJ) value. These tools provide a thorough assessment of the models' performance and ability to predict compaction characteristics accurately.
Identification of influential parameters: through sensitivity analyses, the study identifies the specific input parameter with the most significant influence on measuring the values of Optimum Moisture Content (OMC) and Maximum Dry Density (MDD). This insight can be crucial for understanding the key factors affecting soil compaction characteristics.
Comparative analysis: the research develops these models and rigorously compares their performance. It concludes that the ANN model excels in forecasting OMC, while for MDD prediction, the MLR and LR models outperform the ANN model. This comparative analysis aids in selecting the most appropriate model for specific applications also, Given the study's objective to streamline the assessment of soil compaction characteristics for constructing dams and roads, along with the use of multiple parameters and models, a novel method could involve the following approaches:
Hybrid model integration: combine the strengths of different models to improve prediction accuracy. For instance, a hybrid model that integrates the advantages of artificial neural networks (ANN) in capturing complex relationships with the interpretability of linear regression (LR) or multilinear regression (MLR) could be developed. This hybrid model could potentially enhance predictions by leveraging the ANN's ability to capture nonlinear patterns and the interpretability of LR/MLR for a better understanding of the impact of individual parameters.
Feature selection techniques: utilize advanced feature selection methods to identify the most influential parameters. Techniques like recursive feature elimination, feature importance ranking from tree-based models, or Lasso regularization can help narrow down the most critical parameters affecting soil compaction characteristics. This could streamline the assessment process by focusing on key parameters and reducing resource consumption.
Ensemble learning: implement ensemble techniques such as bagging, boosting, or stacking to combine predictions from multiple models. Ensemble methods often lead to better generalization and robustness by leveraging the strengths of different models. This approach could enhance prediction accuracy for Optimum Moisture Content (OMC) and Maximum Dry Density (MDD).
Sensitivity analysis optimization: utilize sensitivity analysis to identify influential parameters and optimize the assessment process. By understanding the impact of different parameters on OMC and MDD, the study can propose specific guidelines or thresholds that indicate when certain parameters have a significant effect. This could help streamline decision-making in soil compaction assessments.
Automated Data Processing: Implement automation in data processing and model selection. Leveraging machine learning pipelines or frameworks can streamline data preprocessing, feature engineering, model training, and evaluation, reducing the time and resources needed for analysis.

Combining the strengths of different modeling techniques, optimizing parameter selection, leveraging ensemble methods, and automating processes can streamline the assessment of soil compaction characteristics, making it more efficient and less resource-intensive while maintaining or improving prediction accuracy.

2.1 Components in the block diagram

Input data: this represents the dataset containing soil properties such as Gravel (G), Sand (S), Fine content (F), plastic limit (PL), liquid limit (LL), and plasticity index (PI).
Preprocessing module: this stage involves data cleaning, normalization, and feature scaling to prepare the input data for modeling.
Modeling phase:
LR model: linear regression model based on soil physical properties.
NLR model: nonlinear regression model.
MLR model: multilinear regression model.
ANN model: artificial neural network model.
Evaluation and comparison: this part includes the assessment of model performance using various metrics like R-squared (R2), Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), Scatter Index (SI), and Objective (OBJ) values.
Results and analysis: this section interprets and analyzes the model outputs, compares predictions, and identifies influential parameters for OMC and MDD.
Sensitivity analysis module: this component pinpoints the significant parameters affecting OMC and MDD predictions.
Conclusion and recommendations: this stage presents conclusions drawn from the analysis, recommendations for model selection, and potential areas for further research.

3 Methodology

Overall, 2162 datasets from the literature were gathered. The data were randomly mixed and divided into two groups: training datasets and testing datasets. The training datasets comprised 70% of the dataset, while 30% was for the testing ones. The training data groups were used to develop the models. The foremost objective of the models was to predict the standard Proctor compaction characteristics (OMC and MDD). Then, using the testing data, the models were evaluated. Table 1 includes the number of used data from different studies and ranges of the input parameters: gravel content (G %), sand content (S %), fine content (F %), liquid limit (LL %), plastic limit (PL %) and plasticity index (PI %).

Table 1 The range and the number of the used datasets

Full size table

Further, in the table, the ranges of the measured values of optimum moisture content (OMC %) and maximum dry density (MDD kN/m³) are contained within, compared with the predicted values obtained from the models. These input parameters are used to develop the models and the actual values of output parameters. The procedure of the work is illustrated in a flowchart in Fig. 1 and summarized as:

In total, 2162 datasets were compiled from the available literature. These datasets were randomly combined and then divided into two sets: one for training the models, consisting of 70% of the data, and the other for testing the models, which accounted for the remaining 30%. The primary aim of these models was to forecast the standard Proctor compaction characteristics, specifically the Optimum Moisture Content (OMC) and Maximum Dry Density (MDD). Subsequently, the models' performance was assessed using the testing data. Table 1 presents the quantity of data sourced from various studies and the ranges of the input parameters, including gravel content (G %), sand content (S %), fine content (F %), liquid limit (LL %), plastic limit (PL %), and plasticity index (PI %).

Furthermore, Table 1 provides a rangeds of measured values of OMC (%) and MDD (kN/m³), with which the obtained values from the models will be compared later. These input parameters are the foundation for model development, while the actual output parameter values are also included. The workflow of the study is visually represented in Fig. 1 and can be summarized as follows:

Stage 1: Collecting data.
Stage2: Correlating input and output parameters.
Stage 3: Splitting data into two groups: 70% training and 30% testing.
Stage 4: Developing different models.
Stage 5: Evaluating the performance of the models.

4 Correlation between soil index properties and compaction parameters

The illustration demonstrates the validity of the connection between input parameters and compaction characteristics. Each input parameter was graphically depicted and scrutinized concerning OMC and MDD. Figure 2 presents a matrix plot elucidates the link between input parameters and compaction characteristics. The figures unmistakably reveal that, except for the correlation between PL and OMC, with an R-value of 0.82 (as shown in Fig. 3), there are generally insufficient correlations between compaction characteristics and the input parameters. This particular correlation can be considered robust but may apply primarily to fine-grained soil. In other words, PL may not accurately represent the soil's true properties for soils where the percentages of G (gravel) and S (sand) dominate. Moreover, it is worth noting that some types of purely coarse-grained soil may not possess a plastic limit (PL) at all.

Additionally, Fig. 4 illustrates the typical distribution of OMC and MDD. Moreover, Table 2 presents various statistics, including minimum, maximum, mean, standard deviation, skewness, kurtosis, and variance. In the context of kurtosis, a strongly negative value suggests shorter tails compared to a normal distribution, while a positive value indicates longer tails. As for the skewness parameter, a negative value signifies an extended left tail, while a positive value signifies an extended right tail.

Table 2 Statistical parameters for input and output variables

Full size table

5 Modeling

In the above figures, each parameter of G, S, F, LL PL, and PI is connected to the OMC and MDD so that a direct relationship between these parameters can be developed to predict the compaction characteristics from one of these parameters. However, because the R² value is extremely low, a reliable direct correlation between the compaction characteristics and the input parameters cannot be attained, following the statistical analysis and Figs. 2, 3 and 4. Therefore, four separate models are developed considering the influence of soil gradation and index properties to overcome this shortcoming and establish a reliable correlation to forecast OMC and MDD from the input parameters.

In this work, the models are utilized to predict the OMC and MDD, and each model's performance is evaluated according to the original data. The following evaluation criteria are used to evaluate and compare the performance of the four models: For a model to be scientifically accurate, there should be a minor percentage error between observed and predicted data (higher a20 – index) and a higher R² value with a lower RMSE, Objective (OBJ), MAE, and SI values (Figs. 5, 6, 7, 8, 9, 10, 11, 12).

Complexity analysis of Multiple Linear Regression (MLR) is relatively straightforward compared to more complex models like neural networks. MLR is a simple, linear modeling technique to establish relationships between dependent and multiple independent variables. Here are the key aspects to consider when analyzing the complexity of an MLR model:

5.1 Model complexity

MLR models are linear, assuming a linear relationship between the dependent and independent variables. This simplicity contributes to low model complexity.

The complexity of MLR, LR, and NLR models increases with the number of independent variables (features) included in the model. Adding more components can lead to a more complex model.

Each independent variable has a coefficient (weight) associated with it in an MLR model. The number of independent variables determines the total number of parameters in the model. As the number of independent variables grows, so does the number of model parameters.

MLR, LR, and NLR can become more complex if interactions (cross-product terms) or higher-order terms (quadratic or cubic terms) are included in the model. These additions increase the number of parameters and model complexity.

Increasing model complexity by adding more features or terms can lead to overfitting. Overfitting occurs when the model fits the noise in the data rather than the underlying patterns. Complexity should be carefully balanced to avoid overfitting. Proper variable selection is crucial to managing complexity. Including irrelevant or redundant variables can increase model complexity without improving predictive accuracy.

Techniques like L1 (Lasso) and L2 (Ridge) regularization can be applied to penalize large coefficient values to control model complexity and reduce overfitting.

Simplicity in an MLR model can enhance its interpretability. Complex models with numerous variables and interactions can be challenging to interpret.

MLR models are computationally efficient and require minimal memory and processing power resources, making them suitable for large datasets.

Model complexity should be balanced with model performance. Evaluation metrics like R-squared (R2), adjusted R-squared, and RMSE can help assess model fit and complexity.

Multiple Linear Regression is a relatively simple and interpretable modeling technique. Its complexity primarily depends on the number of features and the inclusion of interactions or higher-order terms. Proper variable selection and regularization techniques are essential for managing complexity and preventing overfitting. MLR suits situations where a straightforward linear relationship between variables is sufficient, and model interpretability is a priority.

MLR is straightforward, interpretable, and computationally efficient. It is well-suited when you clearly understand the relationships between variables and want a simple model that can provide insight into how each independent variable affects the dependent variable.

NLR is employed when the relationship between variables is nonlinear, meaning that the effect of an independent variable is not constant but varies across the range of the variable.

NLR allows more flexibility in modeling complex, nonlinear relationships. It can capture relationships that MLR cannot. NLR can provide more accurate predictions when the data suggests a nonlinear relationship.

LR is a simplified form of MLR with only one independent variable. It's used when you want to model the relationship between a dependent variable and a single independent variable linearly.

Why Use LR: LR is helpful when you're interested in understanding the impact of a single variable on the dependent variable. It's a basic and interpretable model for simple relationships.

ANNs are used when dealing with complex, nonlinear relationships involving many independent variables. ANNs can capture intricate patterns and interactions within the data.

Researchers and data scientists often start with simpler models like MLR or LR to understand the data and establish a baseline. If the relationships are nonlinear or involve complex interactions, NLR or ANN might be more appropriate. Model selection is often an iterative process involving testing multiple models to determine which best fits the data and produces accurate predictions.

5.2 Linear regression model

The linear regression model (LR) [7, 31, 32, 34, 35, 44], as demonstrated in Eq. (1), is the most common technique for predicting the compaction characteristics of soils:

$${\text{OMC}},{\text{ MDD}} = {\text{a}} + {\text{bX}}$$

(1)

A and b are constants, and x might be one of the G, S, F, LL, PL, or PL. The previous formula does not include the combination of other variables that might affect OMC and MDD, including particle size and soil plasticity. Equation (2) is presented to combine all different parameters and factors that might influence OMC and MDD to obtain more reliable scientific findings.

$${\text{OMC}},{\text{ MDD}} = {\text{a}} + {\text{bG}} + {\text{cS}} + {\text{dF}} + {\text{eLL}} + {\text{fPL}} + {\text{gPI}}$$

(2)

where G is gravel content (%), S is sand content (%), F is fine content (%), LL is the liquid limit, PL is the plastic limit, and PI is the plasticity index.

Moreover, the model parameters are a, b, c, d, e, f, and g. All variables can be changed linearly. Hence Eq. (2) can be considered an expansion of Eq. (1). This combination might not always be the case as all factors are unlikely to affect the compaction characteristics and interact with one another. Consequently, the model should be updated often to accurately estimate the OMC and MDD [45, 76, 99].

5.3 Nonlinear regression model

Generally, to develop a nonlinear model, the following Eq. (3) can be applied [39]. To forecast the OMC and MDD, the connection among various variables in Eqs. (1) and (2) can be expressed in Eq. (3):

$${\text{OMC}},{\text{ MDD}} = {\text{a}} + {\text{b}}({\text{G}})^{{\text{c}}} + {\text{d}}({\text{S}})^{{\text{e}}} + {\text{f}}({\text{F}})^{{\text{g}}} + {\text{h}}({\text{LL}})^{{\text{i}}} + {\text{j}}({\text{PL}})^{{\text{k}}} + {\text{l}}({\text{PI}})^{{\text{m}}}$$

(3)

where G stands for gravel content (%), S stands for sand content (%), F stands for fine content (%), LL stands for liquid limit, PL stands for plastic limit, and PI stands for plasticity index. Moreover, the model parameters are a, b, c, d, e, f, g, h, I, j, k, l, and m, calculated based on the least square method.

5.4 Multiple linear regressions

The MLR, also a regression procedure, can be employed when the expected variable has a parameter greater than two stages. MLR is a statistical approach that is comparable to multiple linear regressions.

Equation (4) can be utilized to find the variance among predictable and independent variables.

$${\text{OMC}},{\text{ MDD}} = {\text{a}}({\text{G}})^{{\text{b}}} ({\text{S}})^{{\text{c}}} ({\text{F}})^{{\text{d}}} ({\text{LL}})^{{\text{e}}} ({\text{PL}})^{{\text{f}}} ({\text{PI}})^{{\text{g}}}$$

(4)

If soil is purely coarse-grained (F% = 0) or fine-grained (G% + S% = 0), Eq. (4) cannot be applied to forecast the compaction characteristics. Therefore, there is a limitation in applying this model as it cannot represent the real formation of the soil. The least-square method was implemented to find the parameters (a, b, c, d, e, f, and g) and model variables.

The datasets are divided into two groups for the above models: Training and Testing. 70% of the datasets are used for training, while 30% go to the testing phase.

5.5 ANN model

ANN is a robust simulation software developed for data analysis and computing to process and analyze information similarly to a human brain. Construction engineering frequently uses this machine learning method to forecast the future behavior of numerous numerical issues [110, 111]. The ANN model has three primary layers: input, hidden, and output. Depending on the intended problem, each input and output layer may consist of one or more layers. The hidden layer is often extended for two or more layers. The objective of the designed model and the data collected often decide the input and output layers, while the rating weight, transfer function, and the bias of each layer toward other layers determine the hidden layer. Based on a combination of proportions, weight/bias, and various parameters, including (G, S, F, LL, PL, and PI) as inputs, a multi-layer feed-forward network is constructed, and output ANN here is either the OMC or MDD. For designing the network architecture, no standard approach is available. The number of hidden layers and neurons is determined based on the trial and error test. The prime goal of the training process of the network is to achieve the optimized number of iterations (epochs) from which the minimum mean absolute error (MAE), root means square error (RMSE), and the high R²-value are provided. Several studies have examined the influence of iteration on reducing the MAE and RMSE. In order to prepare for the designed ANN, the obtained dataset (a total of 2162 data) has been separated into two groups. Approximately 70% of datasets were utilized as trained data for training the network. For the testing data, 30% of the overall data was used for the trained network [34]. The constructed ANN was trained and tested for several hidden layers to select the best network structure based on the compatibility of the predicted compaction characteristics with the obtained data, for predicting OMC, the best-trained network that offers the highest R² and the lowest MAE and RMSE as the ANN structure with one hidden layer, eight neurons, and a hyperbolic tangent transfer function (as illustrated in Fig. 13, Table 3). As far as the prediction of MDD is concerned, the best-trained network was three hidden layers, five neurons, and a hyperbolic tangent transfer function.

Table 3 The tested ANN architecture for OMC

Full size table

Analyzing the complexity of an Artificial Neural Network (ANN) involves considering several factors related to its architecture and operation. Below the following is the breakdown of the key aspects to consider when conducting a complexity analysis of an ANN:

i.
Number of layers: the number of layers in the network, including input, hidden, and output layers, significantly impacts complexity. Deeper networks are generally more complex.
ii.
Layer size: the number of neurons or units in each layer, especially in the hidden layers, affects model complexity.
iii.
Connectivity: the extent of connections between neurons, also known as the network's topology, plays a role in complexity.
iv.
Number of parameters: the number of weights and biases in the network. The total parameters in an ANN increase with the number of neurons and layers.
v.
Input features: the dimensionality of the input data contributes to the complexity. Higher-dimensional input data require larger networks.
vi.
Activation functions: each layer's choice of activation functions can influence complexity. Non-linear activation functions can add to the complexity of the model.
vii.
Training algorithm: the algorithm used for training the ANN can impact complexity. Some training algorithms, like backpropagation, require more iterations to converge.
viii.
Hyperparameters: the settings of hyperparameters, such as learning rate, batch size, and dropout rate, can impact training time and model complexity.
ix.
Regularization techniques: techniques like L1 and L2 regularization can add complexity to the model by penalizing large weights.
x.
Pruning: pruning techniques can reduce complexity by eliminating unnecessary connections and neurons after training.
xi.
Parallelization: distributed training across multiple GPUs or CPUs can change the complexity of the training process.
xii.
Complexity metrics: several complexity metrics, like FLOPs (floating-point operations), are used to quantitatively measure the computational complexity of ANN models.
xiii.
Overfitting: the potential for overfitting increases with the complexity of the model. Complexity can lead to the model fitting noise in the data rather than the underlying patterns.

Equations (5)–(7) display the ANN model's General Equation.

From linear node 0:

$${\text{OMC}},{\text{ MDD}} = {\text{Threshold}} + \left( {\frac{{{\text{Node }}1}}{{1 + {\text{e}}^{{ - {\text{B}}1}} }}} \right) + \left( {\frac{{{\text{Node }}2}}{{1 + {\text{e}}^{{ - {\text{B}}2}} }}} \right) + \cdots$$

(5)

From sigmoid node 1:

$${\text{B}}1 = {\text{Threshold}} + \sum \left( {\text{Attribute*Variables}} \right)$$

(6)

From sigmoid node 2:

$${\text{B}}2 = {\text{Threshold}} + \sum \left( {\text{Attribute*Variables}} \right)$$

(7)

5.6 Pseudocode/algorithm

6 Model assessment tools

The suggested models were evaluated using a variety of metrics, including coefficient of determination (R²), a20-index, scatter index (SI), Objective (OBJ), root mean squared error (RMSE), and mean absolute error (MAE), which may be calculated using the formulas below:

$${\text{R}}^{2} = \left( {\frac{{\mathop \sum \nolimits_{{{\text{p}} = 1}}^{{\text{p}}} \left( {{\text{yi}} - {\text{yi}}^{\prime}} \right)\left( {{\text{yp}} - {\text{yp}}^{\prime}} \right)}}{{\sqrt {\left[ {\mathop \sum \nolimits_{{{\text{p}} = 1}}^{{\text{p}}} \left( {{\text{yi}} - {\text{yi}}^{\prime}} \right)^{2} } \right]\left[ {\mathop \sum \nolimits_{{{\text{p}} = 1}}^{{\text{p}}} \left( {{\text{yp}} - {\text{yp}}^{\prime}} \right)^{2} } \right]} }}} \right)^{2}$$

(8)

$${\text{RMSE}} = \sqrt {\frac{{\mathop \sum \nolimits_{{{\text{p}} = 1}}^{{\text{p}}} \left( {{\text{yp}} - {\text{yi}}} \right)^{2} }}{{\text{p}}}}$$

(9)

$${\text{MAE}} = \frac{{\mathop \sum \nolimits_{{{\text{p}} = 1}}^{{\text{p}}} \left| {{\text{yp}} - {\text{yi}}} \right|}}{{\text{p}}}$$

(10)

$${\text{a}}20 - {\text{index}} = \frac{{{\text{m}}20}}{{\text{N}}}$$

(11)

$${\text{SI}} = \frac{{{\text{RMSE}}}}{{{\text{y}}^{\prime}}}$$

(12)

$$\begin{aligned} {\text{OBJ}} & = \left( {\frac{{{\text{n}}_{{{\text{tr}}}} }}{{{\text{n}}_{{{\text{all}}}} }}{*}\frac{{{\text{RMSE}}_{{{\text{tr}}}} + {\text{MAE}}_{{{\text{tr}}}} }}{{{\text{R}}_{{{\text{tr}}}}^{2} + 1}}} \right) + \left( {\frac{{{\text{n}}_{{{\text{tst}}}} }}{{{\text{n}}_{{{\text{all}}}} }}{*}\frac{{{\text{RMSE}}_{{{\text{tst}}}} + {\text{MAE}}_{{{\text{tst}}}} }}{{{\text{R}}_{{{\text{tst}}}}^{2} + 1}}} \right) \\ \quad + \left( {\frac{{{\text{n}}_{{{\text{val}}}} }}{{{\text{n}}_{{{\text{all}}}} }}{*}\frac{{{\text{RMSE}}_{{{\text{val}}}} + {\text{MAE}}_{{{\text{val}}}} }}{{{\text{R}}_{{{\text{val}}}}^{2} + 1}}} \right) \\ \end{aligned}$$

(13)

From the formulas above, yp and yi are the expected and actual values of the path pattern, and yp’ and yi’ are the averages of the actual and forecasted values. Training, testing, and validating datasets are tr, tst, and val, respectively; the number of patterns (collected data) in the associated dataset is denoted as n. In contrast to R², which has an optimal value of one, the other evaluating factors have optimal values of zero. About the SI parameter, a model performs poorly when it is [0.3, fairly when it is between 0.2 and 0.3, well when it is between 0.1 and 0.2, and very well when it is < 0.1. To assess the effectiveness of the recommended models, the OBJ parameter was also used as an integrated performance parameter in Eq. (12). m20 is the ratio of experimental to the predicted value, which varies between 0.8 and 1.2, and H is the total number of data samples. The prime benefit of the a20-index is that the model predicts values with a deviation of ± 20% compared to actual values. In order to graphically show how each model overestimates and underestimates the expected results of OMC and MDD compared to the actual values from the experiments, positive and negative error margin lines were added to the model findings. A positive value is represented for the overestimated percentage of OMC and MDD, while a negative value means the underestimated percentage of OMC and MDD.

7 Results and analysis

7.1 The LR model

Figure 5 exhibits the relation between actual and estimated OMC and MDD for all training and testing datasets. In order to develop the model, 2162 data sets have been utilized. Concerning the developed equations, the two plasticity parameters of LL and PI play a substantial role in determining OMC and MDD. By optimizing the sum of error squares and the least square approach, which was performed in Excel using Solver to obtain the ideal value for the Equation, the present model’s weight of each parameter on the OMC and MDD was found. The following are the Equations for the LR model with various weight parameters [Eqs. (13) and (14)]:

$${\text{OMC}} = 0.96 - 0.05{\text{G}} - 0.04{\text{S}} + 0.03{\text{F}} + 0.72{\text{LL}} + 0.07{\text{PL}} - 0.74{\text{PI}}$$

(14)

$${\text{MDD}} = 27.13 - 0.025{\text{G}} - 0.045{\text{S}} - 0.073{\text{F}} - 0.215{\text{LL}} + 0.04{\text{PL}} + 0.216{\text{PI}}$$

(15)

As seen in the above Equations, among other input parameters, PI can have more influence on lowering the value of OMC; on the contrary, MDD value declines with the increase of LL. This seems to be consistent with the experimental findings reported in the literature that soil with higher LL has a lower MDD value. The obtained R², RMSE, and MAE assessment parameters for the OMC are 0.82, 3.99, and 2.98%, respectively. For the MDD, R², RMSE, and MAE are 0.76, 1.05, and 0.84 kN/m³, respectively. Furthermore, as shown in Figs. 11 and 12, the current model’s OBJ and SI values for OMC for the training dataset are 3.69 and 0.23, respectively, while these values are 1.07 and 0.062, correspondingly, for MDD.

7.2 NLR model

The NLR model is another model that might be interesting to examine for its performance among other models. Figure 6a, b depict the compaction characteristics values for predicted and real data obtained from the literature for training and testing datasets. According to the developed model, F plays a considerable role in changing the value of both OMC and MDD. The following are the suggested formulas for the NLR model with various variable parameters [Eqs. (15) and (16)]:

$$\begin{aligned} {\text{OMC}} & = 777.3 + 2.13({\text{G}})^{ - 0.213} + 0.000028({\text{S}})^{ - 6.31} - 784.6\left( {\text{F}} \right)^{ - 0.003} \\ \quad - 0.137\left( {{\text{LL}}} \right)^{0.93} + 0.144({\text{PL}})^{1.38} + 2\left( {{\text{PI}}} \right)^{0.47} \\ \end{aligned}$$

(16)

$$\begin{aligned} {\text{MDD}} & = 35.7 - 4({\text{G}})^{ - 0.046} - 94.4({\text{S}})^{ - 0.0013} + 60.8\left( {\text{F}} \right)^{ - 0.012} \\ \quad - 0.63\left( {{\text{LL}}} \right)^{0.83} + 23.68({\text{PL}})^{0.077} + 0.35\left( {{\text{PI}}} \right)^{0.92} \\ \end{aligned}$$

(17)

It should be noted that this equation cannot be used for pure fine-grained soil since it provides zero OMC and MDD. In order to overcome this issue, the value of G and S can be approximated to zero (not zero).

For OMC, this model's R², RMSE, and MAE assessment parameters are 0.79, 4.22%, and 2.72%, respectively, whereas these values for MDD are 0.69, 1.062, and 0.84 kN/m³ correspondingly. Furthermore, the models’ OBJ for OMC and MDD is 3.38% and 1.09 kN/m³, respectively. Moreover, the SI values for OMC and MDD are 0.214 and 0.055, respectively.

7.3 MLR model

The comparison between the real compaction characteristics values from literature and predicted values are illustrated in Fig. 7a, b for both phases (training and testing datasets). The following equation shows that PL is the most influential parameter affecting the OMC. Changing the PL value can substantially change the value of OMC. However, both G and S contents have more influence on MDD. Equations (18) and (19) can be used to forecast the OMC and MDD, respectively, for the MLR model with various variable parameters:

$${\text{OMC}} = 0.632{*}({\text{G}})^{ - 0.0314} ({\text{S}})^{ - 0.0142} \left( {\text{F}} \right)^{0.08} \left( {{\text{LL}}} \right)^{0.04} ({\text{PL}})^{0.955} \left( {{\text{PI}}} \right)^{ - 0.0153}$$

(18)

$${\text{MDD}} = 39.74{*}({\text{G}})^{0.01} ({\text{S}})^{0.01} \left( {\text{F}} \right)^{ - 0.03} \left( {{\text{LL}}} \right)^{ - 0.12} ({\text{PL}})^{ - 0.12} \left( {{\text{PI}}} \right)^{0.02}$$

(19)

The R², RMSE, and MAE assessment parameters for this model are 0.81, 4.098%, and 3.02%, respectively, for OMC, while, for MDD, these are 0.66, 1 kN/m³, and 0.8, respectively. Furthermore, the current model’s OBJ for OMC and MDD are 3.78 and 1.15, respectively. In addition, the SI values of the training dataset for both OMC and MDD are 0.239 and 0.054, respectively.

7.4 ANN model

The authors investigated several hidden layers, neurons, momentum, learning rate, and iterations to achieve high ANN efficiency. Tables 3 and 4 indicate several ANN architectures examined to obtain the most optimum ANN model for OMC and MDD. Consequently, it was detected that when the ANN contains one hidden layer, eight neurons on the left side (as shown in Fig. 8a), 0.1 momentum, 0.2 learning rate, and 2000 iterations, the OMC is best predicted. However, MDD was best predicted with three hidden layers and five neurons on each side (Fig. 8b). The comparison of both predicted and real values of the compaction characteristics are demonstrated in Fig. 9a, b for both phases (training and testing datasets). The ANN model performs better in predicting OMC value than other models. Nevertheless, compared to the other models, the ANN model only performed better in providing R² value in predicting MDD. Regarding the OMC, the ANN model's R², RMSE, and MAE assessment parameters are 0.92, 3.51, and 2.574%, respectively. However, for predicting MDD, R², RMSE, and MAE values are 0.86, 1.164 kN/m³, and 2.31 kN/m³, respectively. Moreover, the present model’s OBJ for OMC and MDD are 3.29 and 2.03, respectively. Further, SI values for the training dataset for both OMC and MDD are 0.202 and 0.068, respectively.

Table 4 The tested ANN architecture for MDD

Full size table

8 Model comparisons

In order to categorize the effectiveness of the constructed models, six different quantitative tools, MAE, R², RMSE, OBJ, a20-index, and SI, were used. Figures 13, 14, and 15 illustrate R², RMSE, and MAE, respectively.

As far as the OMC is concerned, according to the figures, the ANN outperforms LR, NLR, and MLR models as it has a higher R² and lowers RMSE and MAE values. To assess the reliability of the models, a new engineering index, the a20—index, was used. The advantage of the proposed a20 – index is that it states the number of samples that match expected values with a variance of less than 20% from experimental values [11]. Concerning this index, Fig. 10a compares the predicted and measured OMC of the models. The figure indicated that NLR provided a higher a20- index value of 0.73, and it was 0.7 for the ANN model, which means that 73% of samples with the predicted values have deviated 20% from the actual values of OMC. Thus, the NLR model predicts OMC with the least percentage of error. The OBJ for all the models is shown in Fig. 11. The OBJ values of LR, NLR, MLR, and ANN models are 3.69, 3.38, 3.78, and 3.29, respectively. Even though the OBJ values for all the models are very close, the ANN model has the least OBJ value compared to the others. Concerning all the assessment tools, the ANN approach provides more reliable outcomes and accurately forecasts OMC.

The SI values are exhibited in Fig. 12 for the training and testing phases. Figure 12a shows that the testing phases for LR, NLR, and MLR are between 0.2 and 0.3, indicating fair performance. Nevertheless, the SI value for the ANN model is on a merge of good performance. This contradicts the other phases, as LR, NLR, and MLR perform well. In light of these analyses, although all models can, to some extent, be used to forecast the OMC from physical soil properties, the ANN models are proven to have a distinct performance.

Concerning the prediction of MDD, in Fig. 10b, all the models provide a distinction prediction of MDD, and more than 99% of the predicted samples were within $\pm$ 20% deviation from the actual data. ANN model provided a slightly higher a20-index. Regarding Fig. 11b, the OBJ values of LR, NLR, MLR, and ANN are 1.07, 1.09, 1.15, and 2.03, respectively. LR model has the least OBJ compared to the other models. In this regard, the LR, MLR, and ANN somewhat perform well in predicting MDD. According to Fig. 12b, the SI values for LR, NLR, MLR, and ANN models are 0.062, 0.055, 0.057, and 0.068, respectively. These values show that all the models are excellent at predicting MDD.

Regarding all the statistical assessment criteria, all the statistical assessment criteria, all the models perform well in predicting MDD, and the percentages of errors are less than 20%. However, the ANN model provides a higher R² value, and LR can still be a reliable model to predict MDD with an R² value of 0.76.

9 Sensitivity analysis

In order to obtain the evident influence of each parameter on the OMC and MDD, sensitivity analyses were performed for the models. The MLR model was selected since the value of all input parameters is greater than zero; this can provide the real contribution of each parameter in the model and the influence of each parameter on OMC and MDD. All training data were combined throughout these analyses, and a single input variable was excluded each time. The RMSE, R², and MAE were individually determined so that the influence of each parameter could be observed. Tables 5 and 6 indicate the variation of evaluation criteria for predicting OMC and MDD, respectively.

Table 5 Sensitivity analysis for MLR model on OMC

Full size table

Table 6 Sensitivity analysis for MLR model on MDD

Full size table

The most efficient input parameter is the one that can remarkably change the values of statistical tools (increasing RMSE, MAE, and decreasing R²). Table 5 indicates the outcomes of the sensitivity study on predicting OMC. The findings show that PL is the most dominant parameter influencing the value of OMC. The R² value of the model changed from 0.808 to 0.615 when excluding PL. On the contrary, the value of RMSE has increased from 4.098 to 5.8.06. Table 6 clarifies the influence of a single input parameter on MDD. When removing G, the R² value declined from 0.66 to 0.62; however, the values of RMSE and MAE increased to 1.052 and 0.83, respectively. Therefore, PL can be considered a dominant input parameter that significantly influences the value of OMC, while, for quantifying MDD, G plays a remarkable role.

10 Recomandations for future work

i.
Refinement of model integration: explore methods to integrate different models (ANN, NLR, LR, MLR) to harness the strengths of each model in a combined predictive approach. Ensemble techniques or hybrid models could potentially yield more accurate and robust predictions by leveraging the unique capabilities of individual models.
ii.
Additional input parameters: consider expanding the input parameters beyond the six currently utilized factors. Incorporating other relevant soil properties or environmental factors that might influence compaction characteristics could enhance the models' predictive capabilities.
iii.
Data augmentation and validation: continuously collect new data and augment the existing dataset with additional samples to improve the models' training and validation. This can help ensure the models remain accurate and effective when applied to a wider range of soil variations.
iv.
Regional variability analysis: investigate how the predictive models perform across regions with varying soil compositions and climates. Understanding how these models generalize to geographical areas can enhance their applicability and reliability in diverse construction contexts.
v.
Dynamic modelling for real-time applications: develop dynamic models that can continuously adapt and predict compaction characteristics based on real-time data from sensors embedded in construction equipment. This could facilitate on-site decision-making and optimize the compaction process as it progresses.
vi.
Incorporation of advanced ai techniques: explore the integration of advanced artificial intelligence techniques, such as deep learning or reinforcement learning, to further improve the predictive accuracy of the models. These techniques might capture more complex nonlinear relationships within the dataset.
vii.
Field validation and case studies: conduct field validation studies where predictions from the models are compared against actual on-site compaction results. Case studies in real construction scenarios can provide practical insights into the models' effectiveness and help refine their application in practical settings.
viii.
Optimization algorithms: develop optimization algorithms that utilize the predictive models to suggest the most efficient and effective combinations of soil properties to achieve the desired compaction characteristics, considering both resource utilization and construction timelines.

11 Conclusions

Developing a predictive model to anticipate Optimum Moisture Content (OMC) and Maximum Dry Density (MDD) is crucial for leveraging fundamental soil characteristics such as soil gradation and plasticity. These parameters are pivotal in gauging soil suitability, simplifying the selection process from various soil sources, and circumventing the requirement for standard Proctor tests. An analysis of 2162 datasets, encompassing Gravel (G), Sand (S), Fine content (F), Plastic Limit (PL), Liquid Limit (LL), and Plasticity Index (PI) parameters, unveiled significant revelations:

Several models—Linear Regression (LR), Nonlinear Regression (NLR), Multilinear Regression (MLR), and Artificial Neural Networks (ANN)—were devised to predict OMC and MDD. Evaluation using diverse metrics highlighted the superior performance of the ANN model. When it came to predicting OMC, it had higher R-squared (R2) values and lower Objective (OBJ), Root Mean Squared Error (RMSE), Scatter Index (SI), and Mean Absolute Error (MAE) values.

All models demonstrated robust predictive accuracy for OMC and MDD, with SI values below 0.2.

The ANN model particularly excelled in predicting OMC, displaying an OBJ value approximately 12% lower than LR, 2.7% lower than NLR, and 15% lower than MLR. Conversely, for MDD prediction, the ANN model presented a higher OBJ value compared to LR, NLR, and MLR.

Although the ANN model showcased the highest R2 value, LR, NLR, and MLR models also exhibited commendable R2 values, hovering around 0.8, signifying reliability in predicting OMC based on parameters G, S, F, PL, LL, and PI. The LR model's R2 value (0.78) for MDD implies relative reliability in this context.

The model equations underscore the importance of Plastic Limit (PL) and Liquid Limit (LL) in projecting OMC, while the proportions of Gravel (G) and Sand (S) emerge as pivotal factors in determining MDD.

Sensitivity analyses identified Plastic Limit (PL) as the primary influencer in predicting OMC, while Gravel (G) holds substantial sway in quantifying MDD.

While the ANN model outshines others in OMC and MDD prediction, LR models might offer more practicality owing to their transparent equations grounded in soil physical properties. This contrasts with the ANN's inherently complex "black box" approach.

Availability of data and materials

The data supporting the conclusions of this article are included.

References

Abasi N (2013) A new empirical equation for compression behavior of unconsolidated clayey soils. J Civ Eng Ferdowosi Univ Mashhad (Persian Lang) 24:41–56
Google Scholar
Abdalla A, Mohammed AS (2022) Hybrid MARS-, MEP-, and ANN-based prediction for modeling the compressive strength of cement mortar with various sand size and clay mineral metakaolin content. Arch Civil Mech Eng 22(4):1–16
Article Google Scholar
Adeoye AS, Alo BA, Abdu-Raheem YA (2018) Assessment of geotechnical properties of migmatite-derived residual lateritic soil from Ado-Ekiti South Western Nigeria as Barrier in Sanitary Landfill. IJIRSET 7:7454–7462
Google Scholar
Ahmed SS, Hossain N, Khan AJ, Islam MS (2016) Prediction of soaked CBR using index properties, dry density and unsoaked cbr of lean clay. Malays J Civ Eng 28:270–283
Google Scholar
Al-Khafaji AN (1993) Estimation of soil compaction parameters by means of Atterberg limits. Q J Eng Geol Hydrogeol 26:359–368. https://doi.org/10.1144/GSL.QJEGH.1993.026.004.10
Article Google Scholar
Albrecht BA, Benson CH (2001) Effect of desiccation on compacted natural clays. J Geotech Geoenviron Eng 127:67–75. https://doi.org/10.1061/(ASCE)1090-0241(2001)127:1(67)
Article Google Scholar
Ali HF, Hama RAJ, Hama Kareem MI, Muhedin DA (2019) A correlation between compaction characteristics and soil index properties for fine-grained soils. Polytech J 9:93–99. https://doi.org/10.25156/ptj.v9n2y2019.pp93-99
Article Google Scholar
Ali TS, Fakhraldin MK (2016) Soil parameters analysis of Al-Najaf City in Iraq: case study. J Geotech Eng 3:56–62
Google Scholar
Alzabeebee S, Mohamad SA, Al-Hamd RKS (2021) Surrogate models to predict maximum dry unit weight, optimum moisture content and California bearing ratio form grain size distribution curve. Road Mater Pavem Des. https://doi.org/10.1080/14680629.2021.1995471
Article Google Scholar
Arshid MU, Kamal MA (2020) Appraisal of bearing capacity and modulus of subgrade reaction of refilled soils. Civ Eng J 6:2120–2130. https://doi.org/10.28991/cej-2020-03091606
Article Google Scholar
Asteris PG, Skentou AD, Bardhan A, Samui P, Pilakoutas K (2021) Predicting concrete compressive strength using hybrid ensembling of surrogate machine learning models. Cem Concr Res 145:106449
Article Google Scholar
Bekele A (2017) Correlation of CBR with index properties of soils in Sululta Town. MSc thesis, Addis Ababa University
Bellezza I, Fratalocchi E (2006) Effectiveness of cement on hydraulic conductivity of compacted soil–cement mixtures. Proc Inst Civ Eng—Gr Improv 10:77–90. https://doi.org/10.1680/grim.2006.10.2.77
Article Google Scholar
Benson CH, Trast JM (1995) Hydraulic conductivity of thirteen compacted clays. Clays Clay Miner 43:669–681. https://doi.org/10.1346/CCMN.1995.0430603
Article Google Scholar
Bera A, Ghosh A (2011) Regression model for prediction of optimum moisture content and maximum dry unit weight of fine grained soil. Int J Geotech Eng 5:297–305. https://doi.org/10.3328/IJGE.2011.05.03.297-305
Article Google Scholar
Vipulanandan C, Mohammed A (2020) Effect of drilling mud bentonite contents on the fluid loss and filter cake formation on a field clay soil formation compared to the API fluid loss method and characterized using Vipulanandan models. J Pet Sci Eng 189:107029
Article Google Scholar
Burroughs VS (2001) Quantitative criteria for the selection and stabilisation of soils for rammed earth wall construction. University of New South Wales Sydney, Australia
Google Scholar
Das BM (2021) Principles of geotechnical engineering. Cengage Learning
Daita R, Drnevich V, Kim D (2005) Family of compaction curves for chemically modified soils. Jt Transp Res Prog 98
David Suits L, Sheahan T, Inci G et al (2003) Experimental investigation of dynamic response of compacted clayey soils. Geotech Test J 26:10750. https://doi.org/10.1520/GTJ11328J
Article Google Scholar
Demiralay I, Guresinli YZ (2010) A study on the consistency limits and compactibllity of the soils of Erzurum Plain. J Ataturk Univ Fac Agric (Turk Lang) 10:77–93
Google Scholar
Di Sante M (2020) On the compaction characteristics of soil-lime mixtures. Geotech Geol Eng 38:2335–2344. https://doi.org/10.1007/s10706-019-01110-w
Article Google Scholar
Djokovic K, Rakic D, Ljubojev M (2013) Estimation of soil compaction parameters based on the Atterberg limits. Min Metall Eng Bor. https://doi.org/10.5937/mmeb1304001D
Article Google Scholar
Duque J, Fuentes W, Rey S, Molina E (2020) Effect of grain size distribution on California Bearing Ratio (CBR) and modified proctor parameters for granular materials. Arab J Sci Eng 45:8231–8239. https://doi.org/10.1007/s13369-020-04673-6
Article Google Scholar
Dway SMM, Thant DAA (2014) Soil compression index prediction model for clayey soils. Int J Sci Eng Tech Res 3:2458–2462
Google Scholar
Farooq K, Khalid U, Mujtaba H (2016) Prediction of compaction characteristics of fine-grained soils using consistency limits. Arab J Sci Eng 41:1319–1328. https://doi.org/10.1007/s13369-015-1918-0
Article Google Scholar
Felt EJ (1965) Compactibility. In: Methods of soil analysis: part 1 physical and mineralogical properties, including statistics of measurement and sampling. Wiley Online Library, pp 400–412
Firomsa W, Quezon ET (2019) Parametric modelling on the relationships between Atterberg limits and compaction characteristics of fine-grained soils. Int J Adv Res Eng Appl Sci 8:1–20
Google Scholar
Fondjo AA, Theron E, Ray RP (2021) Estimation of optimum moisture content and maximum dry unit weight of fine-grained soils using numerical methods. Walailak J Sci Technol 18:22722–22792
Article Google Scholar
Foreman DE, Daniel DE (1986) Permeation of compacted clay with organic chemicals. J Geotech Eng 112:669–681. https://doi.org/10.1061/(ASCE)0733-9410(1986)112:7(669)
Article Google Scholar
Gunaydin O (2009) Estimation of soil compaction parameters by using statistical analyses and artificial neural networks. Environ Geol 57:203–215. https://doi.org/10.1007/s00254-008-1300-6
Article Google Scholar
Gurtug Y, Sridharan A (2015) Prediction of compaction behaviour of soils at different energy levels. Int J Eng Res Dev 7:15–18. https://doi.org/10.29137/umagd.379757
Article Google Scholar
Gurtug Y, Sridharan A (2002) Prediction of compaction characteristics of fine-grained soils. Géotechnique 52:761–763. https://doi.org/10.1680/geot.2002.52.10.761
Article Google Scholar
Ali HFH (2023) Soft computing models to predict the compaction characteristics from physical soil properties. Eng Technol J 41(05):698–715
Google Scholar
Hama Ali HF (2023) Utilizing multivariable mathematical models to predict maximum dry density and optimum moisture content from physical soil properties. Multiscale and Multi Model Exp Des 6(4):603−627
Article Google Scholar
Han Z, Vanapalli SK (2016) Relationship between resilient modulus and suction for compacted subgrade soils. Eng Geol 211:85–97. https://doi.org/10.1016/j.enggeo.2016.06.020
Article Google Scholar
Harris MT (1969) A study of the correlation potential of the optimum moisture content, maximum dry density, and consolidated drained shear strength of plastic fine-grained soils with index properties
Hassan J, Alshameri B, Iqbal F (2021) Prediction of California Bearing Ratio (CBR) using index soil properties and compaction parameters of low plastic fine-grained soil. Transp Infrastruct Geotechnol. https://doi.org/10.1007/s40515-021-00197-0
Article Google Scholar
Vieira e Silva, A., Leme, R. F., da Silva Filho, F. C., Moura, T. E., & Ayala, G. R. L. (2021). Empirical models to predict compaction parameters for soils in the State of Ceará, Northeastern Brazil.
Hong L (2008) Optimization and management of materials in earthwork construction. PhD thesis, Iowa State University
Hopkins TC (1970) Relationship between soil support value and Kentucky CBR. Res Report, Commonw Kentucky, Dep Highw Frankfort, Kentucky
Horpibulsuk S, Katkan W, Apichatvullop A (2008) An approach for assessment of compaction curves of fine grained soils at various energies using a one point test. Soils Found 48:115–125. https://doi.org/10.3208/sandf.48.115
Article Google Scholar
Horz RC (1983) Evaluation of revised manual compaction rammers and laboratory compaction procedures. Army Eng Waterw Exp Stn Vicksbg MS Geotech Lab
Hussain A, Atalar C (2020) Estimation of compaction characteristics of soils using Atterberg limits. IOP Conf Ser Mater Sci Eng 800:012024. https://doi.org/10.1088/1757-899X/800/1/012024
Article Google Scholar
Hussain AHA (2016) Prediction of compaction characteristics of over-consolidated soils. MSc thesis, Near East University
Ibrahim SF (2013) Baghdad subgrade resilient modulus and liquefaction evaluation for pavement design using load cyclic triaxial strength. J Environ Earth Sci 3:125
Google Scholar
Isik F, Ozden G (2013) Estimating compaction parameters of fine- and coarse-grained soils by means of artificial neural networks. Environ Earth Sci 69:2287–2297. https://doi.org/10.1007/s12665-012-2057-5
Article Google Scholar
Jalal FE, Xu Y, Iqbal M et al (2021) Predicting the compaction characteristics of expansive soils using two genetic programming-based algorithms. Transp Geotech 30:100608. https://doi.org/10.1016/j.trgeo.2021.100608
Article Google Scholar
Jesmani M, Manesh AN, Hoseini SMR (2008) Optimum water content and maximum dry unit weight of clayey gravels at different compactive efforts. EJGE 13:1–14
Google Scholar
Ahmed C, Mohammed A, Tahir A (2020) Geostatistics of strength, modeling and GIS mapping of soil properties for residential purpose for Sulaimani City soils, Kurdistan Region, Iraq. Model Earth Syst Environ 6(2):879–893
Article Google Scholar
Alim MA, Akhi AH, Alam MA, Roknuzzaman M (2021) Prediction of compaction characteristics of soil using plastic limit. In Proceedings of International Conference on Planning, Architecture & Civil Engineering held at Rajshahi University of Engineering & Technology, Rajshahi, Bangladesh (pp. 215-220)
Karakan E, Demir S (2018) Effect of fines content and plasticity on undrained shear strength of quartz-clay mixtures. Arab J Geosci 11:1–12. https://doi.org/10.1007/s12517-018-4114-1
Article Google Scholar
Karimpour-Fard M, Machado SL, Falamaki A et al (2019) Prediction of compaction characteristics of soils from index test’s results. Iran J Sci Technol Trans Civ Eng 43:231–248
Article Google Scholar
Kaushal V, Guleria SP (2015) Geotechnical investigation of black cotton soils. Int J Adv Eng Sci 5:15–22
Google Scholar
Kolay E, Baser T (2014) Estimating of the dry unit weight of compacted soils using general linear model and multi-layer perceptron neural networks. Appl Soft Comput 18:223–231. https://doi.org/10.1016/j.asoc.2014.01.033
Article Google Scholar
Korde M, Yadav RK (2015) A study of correlation between CBR value and physical properties of some soils. Int J Emerg Technol Adv Eng J 5:237–239
Google Scholar
Kulhawy FH (1991) Drilled shaft foundations. Foundation engineering handbook. Springer, Boston, pp 537–552
Chapter Google Scholar
Liang RY, Rabab’ah S, Khasawneh M (2008) Predicting moisture-dependent resilient modulus of cohesive soils using soil suction concept. J Transp Eng 134:34–40. https://doi.org/10.1061/(ASCE)0733-947X(2008)134:1(34)
Article Google Scholar
Luczak-Wilamowska B (2004) Basic soil properties of a number of artificial clay-sand mixtures determined as a function of sand content. In: Engineering geology for infrastructure planning in Europe. Springer, pp 308–315
Machado SL, Carvalho MF, Carvalho ZS et al (2020) Optimal ranges of soil index properties for diesel containment using compacted barriers. Environ Geotech 7:540–553. https://doi.org/10.1680/jenge.18.00076
Article Google Scholar
Vipulanandan C, Mohammed AS (2021) 3-dimension stresses and new failure model to predict behavior of clay soils in various liquid limit ranges. Arab J Geosci 14(3):160
Article Google Scholar
Mazari M, Nazarian S (2017) Mechanistic approach for construction quality management of compacted geomaterials. Transp Geotech 13:92–102. https://doi.org/10.1016/j.trgeo.2017.08.001
Article Google Scholar
Mejias-Santiago M, Berney I V, Ernest S, Bradley CT (2013) Evaluation of a non-nuclear soil density gauge on fine-grained soils
Miller CJ, Yesiller N, Yaldo K, Merayyan S (2002) Impact of soil type and compaction conditions on soil water characteristic. J Geotech Geoenviron Eng 128:733–742. https://doi.org/10.1061/(ASCE)1090-0241(2002)128:9(733)
Article Google Scholar
Mohammad LN, Huang B, Puppala AJ, Allen A (1999) Regression model for resilient modulus of subgrade soils. Transp Res Rec J Transp Res Board 1687:47–54. https://doi.org/10.3141/1687-06
Article Google Scholar
Mujtaba H (2015) Development of correlations between various geotechnical parameters for granular soils in Punjab. PhD Thesis, Univ Eng Technol Lahore, Pakistan
Mujtaba H, Farooq K, Sivakugan N, Das BM (2013) Correlation between gradational parameters and compaction characteristics of sandy soils. Int J Geotech Eng 7:395–401. https://doi.org/10.1179/1938636213Z.00000000045
Article Google Scholar
Nagaraj HB, Reesha B, Sravan MV, Suresh MR (2015) Correlation of compaction characteristics of natural soils with modified plastic limit. Transp Geotech 2:65–77. https://doi.org/10.1016/j.trgeo.2014.09.002
Article Google Scholar
Nagaraju TV, Gobinath R, Awoyera P, Abdy Sayyed MAH (2021) Prediction of California bearing ratio of subgrade soils using artificial neural network principles. In: Communication and intelligent systems. Springer, pp 133–146
Mawlood Y, Mohammed A, Hummadi R, Hasan A, Ibrahim H (2022) Modeling and statistical evaluations of unconfined compressive strength and compression index of the clay soils at various ranges of liquid limit. J Test Eval 50(1):551–569
Article Google Scholar
Nesamatha R, Arumairaj PD (2015) Numerical modeling for prediction of compression index from soil index properties. Electron J Geotech Eng 20:4369–4378
Google Scholar
Ng K, Chew YM, Osman MH, SK MG (2015) Estimating maximum dry density and optimum moisture content of compacted soils. Int Confer Adv Civ Environ Eng B 1–8
Nwaiwu CMO, Mezie EO (2021) Prediction of maximum dry unit weight and optimum moisture content for coarse-grained lateritic soils. Soils Rocks 44:1–10. https://doi.org/10.28927/SR.2021.054120
Article Google Scholar
Ogbuchukwu PO, Okeke OC, Ahiarakwem CA, Ozotta OO (2019) Geotechnical properties of expansive soils in Awka and environs, Southeastern Nigeria, in relation to engineering problems. Int J Appl Sci Res 2:79–94
Google Scholar
Olmez A (2007) Determination of compaction parameters by means of regression approaches. Master’s thesis, Nigde University (Turkish Lang)
Omar M, Shanableh A, Basma A, Barakat S (2003) Compaction characteristics of granular soils in United Arab Emirates. Geotech Geol Eng 21:283–295. https://doi.org/10.1023/A:1024927719730
Article Google Scholar
Omar M, Shanableh A, Mughieda O et al (2018) Advanced mathematical models and their comparison to predict compaction properties of fine-grained soils from various physical properties. Soils Found 58:1383–1399. https://doi.org/10.1016/j.sandf.2018.08.004
Article Google Scholar
Ören AH (2014) Estimating compaction parameters of clayey soils from sediment volume test. Appl Clay Sci 101:68–72. https://doi.org/10.1016/j.clay.2014.07.019
Article Google Scholar
Othman K, Abdelwahab H (2021) Prediction of the soil compaction parameters using deep neural networks. Transp Infrastruct Geotechnol. https://doi.org/10.1007/s40515-021-00213-3
Article Google Scholar
Parkoh EA (2016) Prediction of compaction characteristics of lateritic soils in Ghana. Unpubl Master’s thesis, Near East University
Pillai GS, Vinod P (2019) A framework for prediction of compaction parameters in standard proctor tests. In: Recent advances in materials, mechanics and management: proceedings of the 3rd international conference on materials, mechanics and management (IMMM 2017), July 13–15, 2017, Trivandrum, Kerala, India, p 9
Piro NS, Mohammed AS, Hamad SM (2022) The impact of GGBS and ferrous on the flow of electrical current and compressive strength of concrete. Constr Build Mater 349:128639
Article Google Scholar
Prashanth JP, Sivapullaiah PV, Sridharan A (2001) Pozzolanic fly ash as a hydraulic barrier in land fills. Eng Geol 60(1–4):245–252
Article Google Scholar
Raju NV, Srimurali M, Prasad KN (2014) Functional correlations between compaction characteristics, un-drained shear strength and Atterberg limits. IOSR J Mech Civ Eng 11:109–115
Article Google Scholar
Ramasubbarao G, Sankar SG (2013) Predicting soaked CBR value of fine grained soils using index and compaction characteristics. Jordan J Civ Eng 7:354–360
Google Scholar
Ramiah BK, Viswanath V, Krishnamurthy H V (1970) Interrelationship of compaction and index properties. In: Proc. 2nd South East Asian Conf on Soil Eng, p 577
Ratnam UV, Prasad KN (2019) Prediction of compaction and compressibility characteristics of compacted soils. Int J Appl Eng Res 14:621–632
Google Scholar
Ring G, Sallberg JR, Collins WH (1962) Correlation of compaction and classification test data. Hwy Res Bull 325:55–75
Google Scholar
Saikia A, Baruah D, Das K et al (2017) Predicting compaction characteristics of fine-grained soils in terms of Atterberg limits. Int J Geosynth Gr Eng 3:18. https://doi.org/10.1007/s40891-017-0096-4
Article Google Scholar
Verdugo R (2008) Singularities of geotechnical properties of complex soils in seismic regions. J Geotech Geoenviron eng 134(7):982–991.
Article Google Scholar
Sathawara JK, Patel AK (2013) Comparison between soaked and unsoaked CBR. Int J Adv Eng Res Stud 2:132–135
Google Scholar
Sawangsuriya A, Edil TB, Bosscher PJ (2009) Modulus-suction-moisture relationship for compacted soils in postcompaction state. J Geotech Geoenviron Eng 135:1390–1403. https://doi.org/10.1061/(ASCE)GT.1943-5606.0000108
Article Google Scholar
Sen B, Pal SK (2017) Compaction and consolidation characteristics of soils and correlations of parameters. Int J Eng Technol Sci Res 4:874–886
Google Scholar
Setiawan B (2016) The preliminary study on the effect of coarse particles content on OMC and maximum dry unit weight: a case of Aceh’s fill materials. Aceh Int J Sci Technol 5:75–81. https://doi.org/10.13170/aijst.5.2.4877
Article Google Scholar
Shirur NB, Hiremath SG (2014) Establishing relationship between CBR value and physical properties of soil. IOSR J Mech Civ Eng 11:26–30
Article Google Scholar
Shook JF, Fang HY (1961) Cooperative materials testing programs at the AASHO road test. Highw Res Board Spec Rep (66)
Sinha SK, Wang MC (2008) Artificial neural network prediction models for soil compaction and permeability. Geotech Geol Eng 26:47–64. https://doi.org/10.1007/s10706-007-9146-3
Article Google Scholar
Sivrikaya O (2008) Models of compacted fine-grained soils used as mineral liner for solid waste. Environ Geol 53:1585–1595. https://doi.org/10.1007/s00254-007-1142-7
Article Google Scholar
Sivrikaya O, Kayadelen C, Cecen E (2013) Prediction of the compaction parameters for coarse-grained soils with fines content by MLA and GEP. Acta Geotech Slov 10:29–41
Google Scholar
Sivrikaya O, Soycan TY (2011) Estimation of compaction parameters of fine-grained soils in terms of compaction energy using artificial neural networks. Int J Numer Anal Methods Geomech 35:1830–1841. https://doi.org/10.1002/nag.981
Article Google Scholar
Sivrikaya O, Soycan YT (2009) Estimation of compaction parameters of fine-grained soils using Artificial neural networks. In: Proc. 2nd international conference on new developments in soil mechanics and geotechnical engineering, pp 406–412
Soltani A, Deng A, Taheri A, Sridharan A (2019) Consistency limits and compaction characteristics of clay soils containing rubber waste. Proc Inst Civ Eng—Geotech Eng 172:174–188. https://doi.org/10.1680/jgeen.18.00042
Article Google Scholar
Sotelo, M. J. (2012). Evaluation of non-nuclear devices in measuring moisture content and density of soils. The University of Texas at El Paso.
Sridharan A, Nagaraj HB (2005) Plastic limit and compaction characteristics of finegrained soils. Proc Inst Civ Eng—Gr Improv 9:17–22. https://doi.org/10.1680/grim.2005.9.1.17
Article Google Scholar
Sridharan A, Rao SM, Joshi S (1990) Classification of expansive soils by sediment volume method. Geotech Test J 13:375–380. https://doi.org/10.1520/GTJ10181J
Article Google Scholar
Taha OME, Majeed ZH, Ahmed SM (2018) Artificial neural network prediction models for maximum dry density and optimum moisture content of stabilized soils. Transp Infrastruct Geotechnol 5:146–168. https://doi.org/10.1007/s40515-018-0053-2
Article Google Scholar
Talukdar DK (2014) A study of correlation between California Bearing Ratio (CBR) value with other properties of soil. Int J Emerg Technol Adv Eng 4:559–562
Google Scholar
Tenpe A, Kaur S (2015) Artificial neural network modeling for predicting compaction parameters based on index properties of soil. Int J Sci Res 4:1198–1202
Google Scholar
Tsegaye T, Fikre H, Abebe T (2017) Correlation between compaction characteristics and Atterberg limits of fine grained soils found in Addis Ababa. Int J Sci Eng Res 8:357–364
Google Scholar
Verma G, Kumar B (2022) Artificial neural network equations for predicting the modified proctor compaction parameters of fine-grained soil. Transp Infrastruct Geotechnol. https://doi.org/10.1007/s40515-022-00228-4
Article Google Scholar
Verma G, Kumar B (2022) Multi-layer perceptron (MLP) neural network for predicting the modified compaction parameters of coarse-grained and fine-grained soils. Innov Infrastruct Solut 7:78. https://doi.org/10.1007/s41062-021-00679-7
Article Google Scholar
Wang MC, Huang CC (1984) Soil compaction and permeability prediction models. J Environ Eng 110:1063–1083. https://doi.org/10.1061/(ASCE)0733-9372(1984)110:6(1063)
Article Google Scholar
Wells JER (2014) Calibration of non-nuclear devices for construction quality control of compacted soils. MSc thesis, University Kentucky
Wesley LD (1973) Some basic engineering properties of halloysite and allophane clays in Java, Indonesia. Géotechnique 23:471–494. https://doi.org/10.1680/geot.1973.23.4.471
Article Google Scholar
Winta A (2016) Prediction of soaked california bearing ratio (Cbr) of fine grained soils from index properties. Case of Silty Clay Soils from Addis Abeba
Yoon H, Hyun Y, Ha K, Lee KK, Kim GB (2016) A method to improve the stability and accuracy of ANN-and SVM-based time series models for long-term groundwater level predictions. Comput Geosci 90:144–155
Article Google Scholar
Zhang K, Frederick CN (2017) Experimental investigation on compaction and Atterberg limits characteristics of soils: Aspects of clay content using artificial mixtures. KSCE J Civ Eng 21:546–553. https://doi.org/10.1007/s12205-017-1580-z
Article Google Scholar

Download references

Funding

None.

Author information

Authors and Affiliations

Department of Civil Engineering, University of Halabja, Halabja, Iraq
Hunar Farid Hama Ali & Rabar H. Faraj
Civil Engineering Department, University of Raparin, Rania, Sulaymaniyah, Iraq
Bashdar Omer
Civil Engineering Department, College of Engineering, University of Sulaimani, Kurdistan, Iraq
Ahmed Salih Mohammed

Authors

Hunar Farid Hama Ali
View author publications
You can also search for this author in PubMed Google Scholar
Bashdar Omer
View author publications
You can also search for this author in PubMed Google Scholar
Ahmed Salih Mohammed
View author publications
You can also search for this author in PubMed Google Scholar
Rabar H. Faraj
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Hunar Farid Hama Ali or Ahmed Salih Mohammed.

Ethics declarations

Conflict of interest

None.

Ethical approval and consent to participate

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Ali, H.F.H., Omer, B., Mohammed, A.S. et al. Predicting the maximum dry density and optimum moisture content from soil index properties using efficient soft computing techniques. Neural Comput & Applic 36, 11339–11369 (2024). https://doi.org/10.1007/s00521-024-09734-7

Download citation

Received: 13 May 2023
Accepted: 25 March 2024
Published: 15 April 2024
Issue Date: July 2024
DOI: https://doi.org/10.1007/s00521-024-09734-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Predicting the maximum dry density and optimum moisture content from soil index properties using efficient soft computing techniques

Abstract

Similar content being viewed by others

Advanced Mathematical Models to Predict the Compaction Properties of Coarse-Grained Soils from Various Physical Properties

Prediction of California Bearing Ratio Using Soil Index Properties by Regression and Machine-Learning Techniques

Artificial Neural Network Equations for Predicting the Modified Proctor Compaction Parameters of Fine-Grained Soil

Explore related subjects

1 Introduction

1.1 Related studies

2 Scope of the work

2.1 Components in the block diagram

3 Methodology

4 Correlation between soil index properties and compaction parameters

5 Modeling

5.1 Model complexity

5.2 Linear regression model

5.3 Nonlinear regression model

5.4 Multiple linear regressions

5.5 ANN model

5.6 Pseudocode/algorithm

6 Model assessment tools

7 Results and analysis

7.1 The LR model

7.2 NLR model

7.3 MLR model

7.4 ANN model

8 Model comparisons

9 Sensitivity analysis

10 Recomandations for future work

11 Conclusions

Availability of data and materials

References

Funding

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Conflict of interest

Ethical approval and consent to participate

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation