Introduction

Over the last three decades, polymeric foams have been used in the construction industry for various applications such as road embankment filing [1,2,3,4], compressible inclusion [5,6,7,8,9], vibration and noise isolation [10,11,12,13] and thermal insulation [14,15,16]. The simplicity, ease and faster pace of construction make it the preferred material for construction, including embankments. These foams are 20–100 times lighter than conventional embankment fill material [17]. Geofoam is a generic term for polymeric foams used in geotechnical engineering applications. Due to its ease of manufacture, Expanded Polystyrene (EPS) foam is the geofoam most commonly used for these applications. As of 2022, EPS geofoam has a global market of USD 900 million in the construction industry and expected an annual growth of 7% [18].

The term “compressive strength” of geofoam is generally attributed to the compressive resistance at 10% strain or yield stress, whichever is lower [19]. However, ASTM D7180 [20] limits the compressive strain of EPS geofoam to 1% for geotechnical applications. Thus, the compressive resistance at 1% strain, which may be termed permissible compressive stress, is the most significant factor governing the design of EPS geofoam in various geotechnical applications. Yield stress and compressive resistance at 10% strain are essential parameters for characterising the plastic response of geofoam. Thus, an initial assessment of these compressive parameters under site-specific conditions will enable the designer to select the geofoam of suitable grade.

Several investigations [21,22,23,24,25,26,27,28,29] examined the compressive characteristics of EPS geofoam with varying nominal densities, which is referred to the density of blocks obtained during production and post-processing in the industrial stage. However, the compressive responses of the geofoams having the same nominal density, reported by different authors, exhibited a wide variation. However, the actual density, also known as apparent density, may differ from the nominal density, leading to discrepancies in the results. This variation could also be due to other influencing factors such as geometry, ambient temperature, or rate of straining. Several researchers assessed the selective, if not comprehensive, impact of some of these influencing factors on the compressive behaviour of EPS geofoam [30,31,32,33,34,35]. Nevertheless, there is a scarcity of thorough research that investigates the various factors that influence the compression behaviour of geofoam, a crucial aspect in the design process.

Amongst these parameters, temperature has a significant influence on compression behavior of EPS geofoam. As per Žiliūtė et al. [36], the temperature of the subgrade is typically similar to the ambient temperature. Since the ambient temperature in the Indian subcontinent can often exceed 46 °C [37], a thorough evaluation of the compression behaviour of EPS geofoam under elevated temperatures is necessary to develop suitable design parameters for its application as embankment fill. Zou and Leo [38] investigated the confined compression behaviour at varying temperatures from 23 to 45 °C for EPS geofoam having a nominal density of 20 kg/m3. They observed a minor reduction in the initial elastic modulus and compressive response at 10% strain of the specimen. Krundaeva et al. [39] reported a decrease in dynamic compressive strength for elevated temperatures and an increase for subzero temperatures for EPS geofoam with a 10 kg/m3 nominal density. Based on the literature review, it is observed by the authors that there are limited studies that have been conducted to study the role of ambient temperature on compressive behavior of EPS geofoam.

Of late, researchers have explored the application of machine learning methods to characterise several geomaterials. While some researchers [40,41,42] employed Artificial Neural Networks (ANN) models to predict the vibration and energy absorption characteristics of geofoam, Akis et al. [43] developed ANN models for compressive stresses of geofoam at 1%, 5%, and 10% strains. However, none of the studies included the model dependent evaluation of influencing factors.

In the present study, the effect of density, strain rate, geometry, and ambient temperature on the Modulus of elasticity (Ei), compressive stress at 1% Strain (\(\sigma_{1\% }\)), yield stress (\(\sigma_y\)), and compressive stress at 10% strain (\(\sigma_{10\% }\)) of geofoam has been experimentally evaluated by conducting a series of uniaxial unconfined compression tests. In addition, the experimentally evaluated data of this study is also combined with the results from the literature to develop generalised prediction models using ANN and XGBoost algorithms. XGBoost is a latest data-driven advanced ensemble learning-based predictive Machine learning (ML) model [44, 45] that consists of sequential models. The error reported by a preceding model is successively reduced by the following sequential model, resulting in a robust predictive ML model. Statistical checks such as R2, mean absolute percent error (MAPE) and root mean square error (RMSE) were incorporated in the study to evaluate the model performances. The study concluded by performing Model Dependent Feature Analysis to assess the factors that influence the compression behaviour of geofoam.

Materials and methods

Materials

EPS geofoam blocks manufactured from three distinct manufacturing units were used in the present study. Geofoam blocks of four different nominal densities, namely 15 kg/m3, 20 kg/m3, 25 kg/m3, and 30 kg/m3 were collected. These densities are denoted as 15D, 20D, 25D, and 30D, respectively. The specimens of requisite sizes and shapes were cut from the blocks using a hot wire. The apparent density of all cut specimens was determined as per ASTM D1622 [46] to investigate its relevance in compressive behaviour.

Method

Uniaxial compression tests were performed as per ASTM D1621 [47] on 50 mm specimens in a universal testing machine (UTM) (Model: Shimadzu AGSJ) with a capacity of 5 kN and an accuracy of 0.01 N. Another UTM (Model: Shimadzu UH-2000 kN) with a maximum loading scale range of 20 kN and 0.01 kN accuracy is used for testing larger specimens. The test setup is shown in Fig. 1. The built-in data acquisition system records the load-deformation behaviour seamlessly at 3 ms. Before testing, a seating load of 10–30 N, which corresponds to 0.5% strain, was applied to adjust the alignment of the geofoam. The compressive responses were measured and reported up to 90% strain in the author’s earlier study [48]. However, the present study does not include the compressive responses over 10% strain because it exceeds the serviceability criteria for most geotechnical applications [49].

Fig. 1
figure 1

Compression test set up for elevated temperatures

To assess the influence of strain rate on compression behaviour, tests were also conducted for two different strain rates i.e., 6% and 1%, in addition to the ASTM-prescribed [39] testing strain rate of 10%. The role of temperature on compression behaviour was evaluated by conducting tests at 23 °C, 37 °C and 50 °C. The specimens were kept inside a hot air chamber within the UTM at the testing temperature during and two hours before testing for uniform distribution of temperature throughout the test material. To study the effect of geometry, specimens of cubical and cylindrical specimens of varying sizes (diameter or width), namely 50 mm, 100 mm and 150 mm, were used. The aspect ratio of the cylindrical specimen was maintained constant at 2. Thus, a total of 122 specimens of varying densities, shapes and sizes subjected to varying loading strain rates and ambient temperatures were assessed. Table 1 summarises the testing strategy adopted for the study.

Table 1 Specimen details and testing conditions

Test results and analysis

The measured compression responses of various EPS geofoam grades at different testing conditions are shown in Fig. 2. It can be observed that the compressive response of geofoam is linear until 2% strain, followed by yielding, resulting in a plateau region with minimal stress increment on increasing strain. Thus, the overall compression behaviour can be classified into three categories: elastic response, yielding and post-yield. For the analysis, stress at three strain levels was considered: Compressive resistance at 1% Strain (\(\sigma_{1\% }\)), yield stress (\(\sigma_y\)) and compressive resistance at 10% strain (\(\sigma_{10\% }\)). The yield point was calculated by employing the double tangent method. The yield strain ranges from 2.3 to 4.7%.

Fig. 2
figure 2

Measured compressive behaviour of geofoam under varying a strain rate, b temperature, c cube sizes, d cylinder sizes

The variation of the stresses and initial modulus with apparent density under varying test conditions is shown in a scatter matrix plot as shown in Fig. 3. The figure shows the importance of different variables considered in the study on the compression parameters. Each plot shows the correlation between the input (γf) and output parameters (\(\sigma_{1\% }\), \(\sigma_y ,\) \(\sigma_{10\% }\) or Ei) as well as with each variable (test conditions). The data scatter is quantified using the coefficient of determination (adjusted R2) value of the linear interpolation between the input and output parameters. Adjusted R2 is used for the evaluation, as it gives a better indication of the accuracy of the linear correlation for multivariable regression problems compared to other coefficient of determination indicators. 95% confidence ellipses, which assume a bivariate normal distribution, are used as visual indicators of correlations between the parameters considered. The confidence ellipse collapses diagonally as the correlation between the input and output parameters approaches 1, whereas they are more circular when input and output parameters are uncorrelated. A lower scatter with a high R2 value and slender ellipses indicates a strong correlation between the input and output parameters and has a negligible influence of other variable conditions considered, such as strain rate, temperature, size, etc. However, a significant influence from the variable parameter on the response is indicated by a higher scatter, low R2 value, and wide ellipses with systematic shifts in the data points with the variables.

Fig. 3
figure 3

Scatter matrices of the test results for varying parameters a Nominal density, b Strain rate, c Temperature, d Size, e Shape

Effect of density

As shown in Fig. 2a, the compressive stresses and yield strain increase with an increase in nominal density. The variation of Ei, \(\sigma_{1\% }\), \(\sigma_y\), and \(\sigma_{10\% }\) is plotted with respect to apparent and nominal densities in the scatter matrices (Fig. 3a). These stresses and initial modulus significantly depended on and increased with the apparent density. The observed linear correlations between Ei and γf as well as \(\sigma_{1\% }\) and γf indicate only a reasonable convergence (adjusted R2 < 90%). In contrast, the linear correlations between \(\sigma_y\) and γf as well as \(\sigma_{10\% }\) and γf indicate excellent convergence (adjusted R2 > 90%). The confidence ellipse is extremely slender for \(\sigma_y\) and \(\sigma_{10\% }\). Thus, it can be inferred that the dependency of \(\sigma_y\) and \(\sigma_{10\% }\) on apparent density is more pronounced than that of \(\sigma_{1\% }\) and Ei. The following empirical correlations relating to compressive resistance and apparent density are derived for standard test conditions prescribed as per ASTM [39] specifications.

$$ \sigma_{1\% } = 2.2\gamma_f - 10.7 $$
(1)
$$ \sigma_y = 6.5\gamma_f - 32 $$
(2)
$$ \sigma_{10\% } = 7.3\gamma_f - 33.7 $$
(3)

A theoretical expression correlating the initial modulus (E) and density of cellular foams (γ), derived by Gibson and Ashby [50], is indicated in Eq. 4. On similar lines, an attempt was made here to ascertain the mathematical constant associated with the expression pertaining to EPS geofoam and derive an expression for predicting the initial modulus. Thus, an empirical power expression was developed for the variation of the modulus of elasticity of geofoam (Ei) with the apparent density (γf) as shown in Fig. 4. Considering Young's modulus (Ep) and density of solid polystyrene polymer (γs) as 2600 MPa and 1050 kg/m3, respectively [51], the correlation can be expressed as shown in Eq. 5. Thus, for EPS geofoam having densities ranging from 15 to 30 kg/m3, the correlation constant C can be arrived at as 3.5 for standard testing conditions.

$$ \frac{{\text{E}}}{E_p } = {\text{ C}}\left( {\frac{{\upgamma }}{\gamma_s }} \right)^2 $$
(4)
$$ \frac{E_i }{{E_p }} = { }3.5\left( {\frac{\gamma_f }{{\gamma_s }}} \right)^2 $$
(5)
Fig. 4
figure 4

Variation of modulus of elasticity with apparent density

Effect of strain rate

It can be observed that the initial response and yield strain are independent of strain rates; however, yield and post-yield compressive stresses increase with an increase in strain rates, as shown in Fig. 2a. This increase in \(\sigma_y\) and \(\sigma_{10\% }\) with strain rate is similar to all geofoam grades and can be attributed to the creep response of the cellular foam. This variation is also indicated by the wide confidence ellipse and low R2 value in the scatter plot (Fig. 3b).

The variation of compressive response parameters with strain rate (Fig. 5) indicates an average increase of \(\sigma_{10\% }\) at a rate of 1%, 1.6%, 2% and 2.2% per 1%/min strain rate increment, respectively, for nominal densities 15D, 20D, 25D and 30D. Other researchers have also observed an increase in \(\sigma_{10\% }\) with strain rates in the range of 0.5%–2.2% per 1%/min strain rate increment [31, 43, 52]. Therefore, it is advisable to perform the tests at a strain rate of 1% per minute in order to avoid overestimating the compression parameters. This is in variation with the recommendation of the strain rate in the ASTM specification [39], which specifies a testing strain rate of 10%. It is important to note that the aforementioned specification was primarily formulated for the general use of rigid cellular plastics. However, the present investigation specifically concentrated on load-bearing applications in geotechnical contexts.

Fig. 5
figure 5

Variation of compressive stresses with strain rate

Effect of temperature

The compressive behaviour in the elastic response range is significantly influenced by temperature, as depicted in Fig. 2b. It has been observed that the yield strain exhibits a twofold increase as the temperature rises from 23 to 37 °C. The variation of the initial compression parameters is also indicated by the very low R2 value and wide confidence ellipse in the scatter matrix (Fig. 3c). \(\sigma_{1\% }\) and Ei decrease with the increase in temperature. \(\sigma_y\) and \(\sigma_{10\% }\) also decrease marginally with an increased temperature; however, more pronounced for denser samples.

The variation plots (Fig. 6) demonstrate a decrease in compressive response, as evidenced by a reduction of \(\sigma_{1\% }\) ranging from 40 to 60% with increasing temperature. This corresponds to an average decrease of 3.7% per degree Celsius increment and can be attributed to the thermal-induced softening of polystyrene strands. The effect of temperature is more pronounced for denser specimens. Thus, for the 30D specimen, the reduction of \(\sigma_{1\% }\) and Ei is higher than in the 15D specimen. The reduction in the \(\sigma_{10\% }\) value for the 15D, 20D, 25D, and 30D specimens were determined to be 0.34 kPa, 0.53 kPa, 0.84 kPa, and 2.0 kPa per degree Celsius, respectively. Zou and Leo [38] reported the corresponding value as 0.46 kPa per degree Celsius for the 20D specimen.

Fig. 6
figure 6

Variation of compressive stresses with temperature

Effect of size

Figure 2c and d show the stress–strain curves of geofoam under varying sizes for cubical and cylindrical specimens, respectively. It can be inferred that the size significantly impacts the initial elastic response of geofoam. This is also indicated by the low R2 value of the scatter plot (Fig. 3d). In the case of cube-shaped specimens, negligible effect of size is observed for \(\sigma_y\) and \(\sigma_{10\% }\). However, for cylindrical specimens, due to an increase in bead population, a marginal increase in \(\sigma_y\) and \(\sigma_{10\% }\) with size is observed, as shown in Fig. 7. Ei increases by 60% when size increases from 50 to 100 mm, after which negligible increment is observed. This is in consistent with the observations reported by Atmatzidis et al. [26]. Elragi et al. [30] further observed an Ei value increment of 100% when the size increases from 50 to 600 mm.

Fig. 7
figure 7

The variation of compressive stresses with block size and shape

Effect of geometry

From Fig. 7, it can be inferred that shape does not have significant influence on the initial elastic response of geofoam. The \(\sigma_{1\% }\), \(\sigma_y\) and \(\sigma_{10\% }\) of cubical specimens are slightly higher than cylindrical specimens of the same size for all the geofoam grades. This can be attributed to the increased bead quantity for cubical specimens relative to the cylindrical specimens, having diameter same as the cube size. However, the high value of R2 in the scatter plot (Fig. 3e) indicates a relatively uniform distribution. Consequently, it is reasonable to infer that both shapes exhibit comparable responses. Testing on cubical samples is recommended because of the ease of moulding and cutting.

Data analysis and identification of influencing factors

The data analysis of the laboratory test results on geofoam is conducted to analyse the input and output data quality. The statistical description of the experiments conducted in the laboratory is elucidated in Table 2. The count refers to the number of specimens available, and the mean is the representative central tendency of the data, Std. Dev. expresses the deviation from the mean; the variance is the spread between numbers in a feature in a data set which assesses the dispersion of data points around the mean; min is the minimum value of the feature in the dataset; max refers to its maximum value followed by varying percentile values of each feature. Since the shape is a categorical parameter, this study indicates its influence through aspect ratio. It is worth noting that, with the exception of apparent density, all other independent variables are interval data points. The skewed distribution of these independent variables is evident from the quartiles. The high variance in the dependent variables, specifically the compressive stresses, emphasises the strong necessity for the development of prediction models.

Table 2 Statistical analysis of input data from the experimental study

In addition to statistical analysis of the experimental data, correlation coefficients were developed to observe the interdependency of the input and output parameters considered in the study. Given that the majority of the independent variables are of the interval data type, it can be argued that rank correlation provides a more accurate measure of the interdependence between these variables as compared to Pearson's correlation. Therefore, Spearman's rank correlation coefficient (rs) is employed for assessing the correlation between two parameters and is calculated using the following formula [53].

$$ r_s (X_i - Y_i ) = 1 - \frac{{6{ }\sum_{i = 1}^n \left( {X{^{\prime}}_i - Y{^{\prime}}_i } \right)^2 }}{n(n^2 - 1)} $$
(6)

where X’i and Y’i are the ranks of Xi and Yi, respectively, and n is the number of datasets. The Spearman’s rank correlation coefficient for all the input and output parameters is shown as a heat map in Fig. 8. A positive coefficient indicates direct proportionality, whereas a negative coefficient indicates inverse proportionality. The perfect correlation is indicated by a correlation coefficient of 1.0. The significance of the correlation coefficient, explained by Chan [54], is shown in Table 3.

Fig. 8
figure 8

Heat map of correlation coefficients

Table 3 Interpretation of Spearman’s rank correlation coefficient

It can be said that the apparent density of geofoam is the most positive influential parameter for the compressive resistance of geofoam. The effect of density on \(\sigma_{y }\) and \(\sigma_{10\% }\) is the strongest, whereas it is moderate on \(\sigma_{1\% }\) and Ei. The second most important parameter, which plays a significant role in the compressive behaviour, is size. The initial compressive behaviour is found to be fairly affected by the specimen size. Temperature negatively influences the initial compressive behaviour, whereas the strain rate positively influences the compressive behaviour after yielding. Based on the results of the correlation analysis, it can be inferred that there is no significant impact of the shape and aspect ratio on the compressive behaviour.

ML predictive models

Based on the test results, ML models were developed for the prediction of \(\sigma_{1\% }\), \(\sigma_{10\% }\), and \(\sigma_y\) with independent input parameters of uniaxial compression tests. Data-driven models based on ANN and XGBoost were developed and checked for accuracy. ANN is one of the basic prediction model widely used for the compressive strength of different materials in the past two decades [43, 55,56,57]. Neural networks are highly flexible and powerful in modeling non-linear relationships and complex patterns in the data. Thus, in the present problem statement, where the relationships between different input features are not pre-determined and well-defined, ANN has been used by the authors. Furthermore, the study chose the XGBOOST algorithm because it stood out as one of the most refined among boosting machine learning (BML) algorithms, driven by its superior performance documented in literature for characterizing the compressive strength of diverse geomaterials [58,59,60,61]. XGBoost algorithm is based on boosting technique and is an implementation of gradient-boosted trees that is most effective in handling a variety of data types, distributions, and relationships through an ensemble of decision trees and continuous reduction of errors. Since the features in the problem statement have non-linear relationships, the authors have also used XGBoost for the development of the model in the present study. Eight key processes are involved in the development of the optimised model, and each stage is described in detail as follows:

  1. i.

    Database preparation: Initial database was prepared from the experimental study.

  2. ii.

    Preliminary data evaluation: Preliminary data analysis was conducted using statistical evaluation. Correlation analysis for the interdependency of various parameters was also assessed.

  3. iii.

    Data matrix expansion: More data was collected from relevant literature and compiled appropriately.

  4. iv.

    Data pre-processing: Data was analysed using statistical evaluation, and missing values were identified to arrange the acquired data. It was followed by normalising the dataset in preparation for model building.

  5. v.

    Model selection: ANN and XGBoost were utilised for the prediction and evaluation of the models.

  6. vi.

    Hyperparameter optimisation: Hyperparameters that lead to the development of the most accurate and generalised ML models were selected.

  7. vii.

    Model evaluation: All models were compared, and the best-performing algorithms were selected based on evaluation metrics, i.e., R2, RMSE, MAE, MSE, and MAPE.

  8. viii.

    Analysis and reporting: The findings were reported by comparing various models considered, optimisation parameters, and evaluation metrics.

Data compilation

To facilitate the development of a comprehensive model, it is imperative to acquire a large dataset wherein ML models can be trained and tested. A commonly used rule of thumb for dataset sample size requirements for neural networks is that the sample size should be at least 10 to 20 times the number of trainable parameters in the model [62, 63]. Some researchers suggested that, for real-world problems, the minimum data size requirement should be increased to 30 times the number of trainable parameters as a precautionary measure [64]. This guideline helps ensure that the model has sufficient data to learn meaningful patterns without overfitting to the training data. Consequently, the findings documented in existing literature pertaining to the uniaxial compressive strength test conducted on geofoam were collected and integrated with the test results presented by the authors of this study. This assertion is supported by the evidence that the manufacturing process has a minimal impact on the mechanical characteristics of EPS blocks [65]. The stress–strain characteristics of geofoam have been extensively investigated with respect to apparent density, size, aspect ratio, and strain rate. However, the influence of temperature on these properties is yet to be fully investigated. Until now, no prior research has investigated such a large number of test variables as the present study, with a specific focus on incorporating temperature as a variable. Therefore, to achieve a larger dataset for developing models, the influence of temperature observed in the experimental datasets was excluded. A database of 555 test results was compiled, which comprised the test results conducted by the authors and the results reported in the literature [24,25,26, 43, 66,67,68]. A subset of these studies did not include data on yield stress. These can be considered as missing value. In such cases, \(\sigma_y\) is estimated by dividing \(\sigma_{10\% }\) using a reduction factor of 1.3 [40]. Therefore, the imputation technique employed in the present study involves replacing missing values with information derived from previous domain knowledge. In general, Ei is estimated to be 100 times \(\sigma_{1\% }\); thus, it is not considered in the ML models. The statistics of the expanded dataset are presented in Table 4. It may also be noted that the prediction models are valid only for range of input parameter data considered in the table.

Table 4 Statistical description of the expanded database

Development of ML models

In this study, ML models, based on ANN and XGBoost algorithms, were developed to predict the values of \(\sigma_{1\% }\), \(\sigma_{10\% }\), and \(\sigma_y\) on Python 3.10. The dataset comprising 555 test results was randomly divided into training and test datasets in a 70:30 ratio. The models were developed on 70% of the data, the accuracy of which was tested on the remaining 30% of the dataset.

Statistical metrics were used to test the accuracy of the models in terms of coefficient of determination (R2), mean square error (MSE), root mean square error (RMSE), mean absolute error (MAE) and mean absolute percent error (MAPE). The mathematical formulations of these parameters are defined in Eqs. 37.

$$ R^2 = 1 - \frac{{\sum_{i = 1}^{{ }n} \left( {y_i - y{\prime}_i } \right)^2 }}{{\sum_{i = 1}^{{ }n} \left( {y_i - \overline{y}} \right)^2 }} $$
(7)
$$ MSE = \frac{1}{n}\mathop \sum \limits_{i = 1}^n \left| {y_i - y_i {\prime} } \right|^2 $$
(8)
$$ RMSE = \sqrt {{\frac{1}{n}\mathop \sum \limits_{i = 1}^n \left| {y_i - y_i {\prime} } \right|^2 }} $$
(9)
$$ MAE = \frac{1}{n}\mathop \sum \limits_{i = 1}^{{ }n} \left| {y_i - y_i {\prime} } \right| $$
(10)
$$ MAPE = \frac{1}{n}\mathop \sum \limits_{i = 1}^{{ }n} \left| {\frac{{y_i - y_i {\prime} }}{y_i }} \right| $$
(11)

where y′ and y are the predicted and measured values, respectively, while n is the total number of test datasets and \(\overline{y} \) is the mean of measured values. R2 assesses how well the model reproduces the observed outputs. The values of R2 range from 0 to 1, with higher fitting optimisation reported if the value is closer to 1. The values MSE, MAE, MAPE, and RMSE are used to evaluate modelling error, wherein the smaller the value, the lesser the difference between the predicted and measured values.

ANN

ANN is a highly parallel distributed processor that has a natural propensity for storing experiential knowledge and making it available for use [69]. It is widely used to predict output from a dataset comprising independent variables. The neural network of the ANN model consists of several neurons stacked in layers, which act as the primary processing element of the neural network. Data is received, weighed, processed, and subsequently transferred from neurons of one layer to the next.

In this study, three independent ANN models were developed to predict the values of \(\sigma_{1\% }\), \(\sigma_{10\% }\), and \(\sigma_y\) based on the independent input parameters of uniaxial compression tests. Each model was trained on the same input parameters, namely the size and aspect ratio of the geofoam specimen, the density of the material, and the strain rate at which the uniaxial compression test was performed.

A hidden layer often results in giving reasonable accuracy [70]. In the current study, as the dataset is not very large but can be considered a medium-sized dataset, a simple network architecture comprising only a single hidden layer with 6 neurons has been considered to avoid overfitting. According to Erzin et al. [71], the maximum number of neurons in the hidden layer cannot be greater than 2i + 1, where i is the number of input parameters. Since the number of input parameters for each model is 4, the limiting number of neurons in the hidden layer for the models has been restricted to 9. To achieve an optimum number of neurons in the hidden layer, one neuron was fitted in the hidden layer for each model and was gradually increased to the upper limit. Based on the comparison of the measured and predicted values during the trials, six neurons in the hidden layer of each ANN model were used in this study, the structure of which is illustrated in Fig. 9. Thus, the total number of trainable parameters for this feed-forward neural network is 30. In this study, the ratio of data size to the number of trainable parameters or weights stands at 18, which is deemed adequate considering similar models for material strength predictions [43, 72,73,74]. The combination of exponential linear unit (ELU) and rectified linear unit (ReLU) has been used as the activation function for the neural layers. Activation functions convert the weighted input received at a neuron of a layer to an output, which is then used as input by the subsequent layer.

Fig. 9
figure 9

Architecture of the ANN model

XGBOOST

XGBoost is an ensemble learning algorithm developed by Chen and Guestrin [75] and belongs to the boosting family. A model based on the boosting principle produces sequential models to solve the same problem such that each sequential model focuses on the training data values that the previous model inaccurately predicted. The models developed in each sequence are known as weak learners, which, when combined, develop a strong learner. The weak learner is usually defined as a decision tree, which is a supervised ML technique. The tree structure consists of internal nodes, branches, and leaves [76]. Internal nodes correspond to the dataset’s features, branches represent the decision rules, and leaves signify the output variables. The algorithm of XGBoost improves the traditional boosting algorithms to minimise overfitting and provide better predictions. The gradient boosting algorithm consists of a loss function to be optimised, a weak learner to make predictions and an additive model to add weak learners to minimise the loss function. XGBoost model, through an iterative procedure, optimises the objective function by updating parameters in a sequential step using residuals from the previous step. The most significant benefit of XGBoost is its scalability across any condition [77]. In general, the XGBoost algorithms are the evolution of decision tree algorithms that were improved over time.

Three XGBoost models were developed for the prediction of \(\sigma_{1\% }\), \(\sigma_{10\% }\), and \(\sigma_y\) values. The hyperparameter tuning of the models was performed using fivefold cross-validation, and the values were optimised using the “Random-SearchCV” function in Scikit-learn 0.24.2 [78].

Results and discussion

A comparison of the predicted values of \(\sigma_{1\% }\), \(\sigma_{10\% }\), and \(\sigma_y\) obtained from ANN and XGBoost models used in this study is made against the values obtained through laboratory experiments for both training and testing datasets and is illustrated in Fig. 10. Higher the alignment of the values against the diagonal, the better the ML model’s performance. It can be observed that the performance of both ANN and XGBoost models is reasonable as well as comparable. On comparing the models, it can also be observed that for the training dataset of the XGBoost model, the points are near the ideal, i.e., y = x line, indicating that the predicted values are close to the experimental results and thereby result in a high R2 value. For the ANN model, a relatively higher data scatter can be observed away from the diagonal, indicating a significant difference between true and predicted output values by the corresponding model. For the testing data, the XGBoost model presents higher accuracy than ANN, as it can be seen that the scattering of points is more widely spread than XGBoost, which results in lower RMSE and MAPE values for the XGBoost model. The predicted results of the testing dataset that disperse away from the diagonal indicate less accuracy in the prediction of a testing dataset. It is thus evident that the XGBoost model predicts the values of \(\sigma_{1\% }\), \(\sigma_{10\% }\), and \(\sigma_y\) with high accuracy, robustness, and generalisation. While the models perform accurately for both training and testing datasets to predict \(\sigma_y\) values, the performance is relatively inaccurate for the prediction of \(\sigma_{1\% }\) as observed from the statistical metrics of the models summarised in Table 5.

Fig. 10
figure 10

Plot between the observed and the predicted values for training and testing datasets for \({\sigma }_{1\%}\), \({\sigma }_{10\%}\), and \({\sigma }_{y}\)

Table 5 Summary of comparison between various prediction models

The R2 value for each model is greater than 0.8 for both the training and testing dataset. This indicates that the predictions obtained from the models are significantly correct. As expected, the accuracy of all the testing dataset is equal to or lower than that of the training dataset, except for the ANN and XGBOOST models of \(\sigma_{1\% }\). However, the R2 value, which assesses a regression problem better, is lower for the testing dataset compared to the training dataset for all the models. Relative to the average values of \(\sigma_{1\% }\), \(\sigma_{10\% }\), and \(\sigma_y\), the RMSE value lies within 5–20% of the actual values, with a higher error being observed in the prediction of \(\sigma_{1\% }\). The low R2 value achieved for testing the \(\sigma_{1\% }\) using ANN and XGBOOST indicates that the issue is primarily due to the variability in the data rather than a modelling error. This suggests that to enhance the reliability of the developed model in predicting values of \(\sigma_{1\% }\) improved data incorporating additional input parameters is required, which stands as a limitation of the current study. The omission of the input parameter temperature in the development of ML models is another clear limitation of the current study. Additionally, the developed models are incapable to predict the long-term compressive strength of geofoam. Further experimental data is necessary to anticipate the compressive behaviour of geofoam under sustained loading conditions and to predict its creep effects.

Visualization of errors

To visualise the error in the predictions of \(\sigma_{1\% }\), \(\sigma_{10\% }\), and \(\sigma_y\) with respect to true values, residual error plots were plotted. Figure 11 shows the residual error plots of the models. The residual error is the measure of the dispersion of a point vertically from the regression line. It measures the error between the predicted and true values of the target. The higher the residual dispersion from the origin (y = 0 lines), the lower the accuracy of the model. The residual plot of the XGBoost model has the highest density of points close to the origin with a minimum dispersion of points away from the origin. On the other hand, the residual plots of the ANN models demonstrate high density away from the origin and low density close to the origin. No model can exhibit null residual errors; however, a good model, such as XGBoost, exhibits only minor random errors.

Fig. 11
figure 11

Residual Plots for ANN and XGBOOST

Relative importance of influencing factors

The ‘feature importance’ of different parameters is evaluated using the XGBoost model, which determines the degree of usefulness of a specific input parameter for the model and prediction. XGBoost model was selected for this analysis, as it outperformed the ANN model. This gives an insight into the relative importance of the influence factors on the compressive response at different strains considered in the study. In this analysis, the variations of each input parameter are analysed by eliminating a single parameter from the dataset exclusively while keeping the remaining input parameters unchanged. The XGBoost model is then run on these datasets after randomly shuffling the values of a selected input while keeping the remaining input unchanged to predict the new output. The relative importance of each input parameter is determined by comparing the RMSE associated with it to that of other parameters. Other researchers used a similar approach for predicting the relative importance of variables on the performance of various geomaterials [44, 79, 80]. It may be noted that this analysis is a model-dependent feature importance study, whereas Spearman’s correlation developed earlier is a model-agnostic feature importance study. This analysis considers the contribution of all the input variables in a combined form to predict the output. In contrast, the heat map developed using Spearman’s correlation only considers the correlation between two parameters. This also considers a generalised model of a wider data range in contrast to Spearman’s correlation developed for the results of the experimental study conducted by the authors.

The relative importance of the parameters considered in this study for the compressive response prediction is indicated in Fig. 12. Apparent density is the most significant influencing factor for all the stress levels. However, its relative importance is more for \(\sigma_{10\% }\), and \(\sigma_y\). The XGBoost model also indicates significant importance in aspect ratio followed by size on \(\sigma_{1\% }\) output. However, its effect diminishes during yield and post-yielding. It may also be noted that the specimen's height has a greater impact on the value of \(\sigma_{1\% }\) than the specimen's width (expressed as size). Strain rate has only a minor influence on the compressive responses of geofoam.

Fig. 12
figure 12

Relative importance of different input variables

Conclusions

In the present study, the effect of density, temperature, strain rate, and geometry on the compressive behaviour of EPS geofoam was characterised using a series of uniaxial compression tests. Machine learning models based on ANN and XGBoost were then developed to predict the values of \(\sigma_{1\% }\), \(\sigma_y\), and \(\sigma_{10\% }\) using four input parameters, namely size of the geofoam, apparent density of the material, aspect ratio of the specimen, and the strain rate of testing. As the size of database is critical for developing a robust predictive model, 433 tests results from the literature were compiled, making a database of 555 number of tests that included the experiments conducted by the authors in this study. Following are the salient observations made from the study:

  • The apparent density was the most important parameter influencing the compressive behaviour of geofoam. The influence of apparent density was more pronounced post-yield than the initial elastic response. However, the influence of apparent density on yield strain was found to be insignificant.

  • An increase in testing strain rate marginally increases the post-yield stresses. A strain rate of 1%/min is recommended for the quality testing of geofoam for geotechnical applications.

  • The elastic response of geofoam was found to be significantly influenced by the ambient temperature. It was observed that the modulus of elasticity of geofoam decreased by 3.7% for every one-degree Celsius increase in temperature. The yield strain doubled as the temperature increased from 23 to 37 °C. Therefore, it is advisable to conduct testing at the highest ambient temperature of the proposed site in addition to the conventional temperature conditions.

  • The size of the specimen demonstrated a positive influence on the initial elastic response of geofoam. The modulus of elasticity increased by 1.6 times when the size was increased from 50 to 100 mm, after which a negligible increment was seen. The geometry and aspect ratio were found to have an insignificant influence on compressive behaviour.

  • Developed prediction models using ANN and XGBoost, provide reasonable and reliable results for the geofoam compression parameters. XGBoost outperforms ANN in predicting the compressive behaviour parameters with MAPE values of 12.2, 4.15 and 4.66 respectively for \({\sigma }_{1\%}\), \({\sigma }_{y}\), and\({\sigma }_{10\%}\). The R2 values for each model in both the training and testing dataset were greater than 0.8, with an accuracy higher than 80% in each case. The success of the developed models is underscored by the respective overall accuracies achieved for the XGBoost models, which were 88%, 96% and 95% for \({\sigma }_{1\%}\), \({ \sigma }_{y}\), and\({ \sigma }_{10\%}\).

  • The feature importance analysis using the XGBoost model revealed that density is a significant parameter influencing stresses at all strain levels with relative importances of 36%, 90% and 86% respectively for \({\sigma }_{1\%}\), \({\sigma }_{y}\), and\({\sigma }_{10\%}\). Additionally, the size and aspect ratio of the geofoam also have an impact on the initial response of its compression behaviour with relative importances of 22% and 35% respectively. Test strain rate also has an influence on the overall compressive response. The result of this study depicts that systematically trained ML models can be easily utilised to comprehend the compressive behaviour of geofoams by employing the experimental data parameters.

  • The constraints of the present study encompass inadequate supplementary experimental data required for developing models to predict the behaviour across broader temperature ranges, along with the absence of creep effects consideration in the developed models’ analysis of the compressive response of geofoam.