Introduction

The most frequently used construction material is concrete (Abdullah et al. 2021; Faraj et al. 2022). Yet, Portland cement is the primary cementing material used to bind the ingredients of the concrete composites (Sharif 2021). However, producing Portland cement necessitates a significant quantity of energy and raw materials, which creates a large amount of total carbon dioxide (about 7%) into the atmosphere (Mahasenan et al. 2003; Hamah Sor et al. 2021). However, cement-based concrete remains the most widely used material in the global building industry (Shaikh 2016). Therefore, it has become mandatory for all nations to consider CO2 emission regulations and reductions (Yildirim et al. 2015). As a result, extensive research has been conducted to develop a new material that can be used as an alternative to Portland cement (Provis et al., 2015); among them, geopolymer technology was developed in France by Professor Davidovits (Abdel-Gawwad and Abo-El-Enein 2016). Due to the high consumption of waste materials in mixed proportions, GPC emits approximately 70% less green gas than conventional concrete (Weil et al. 2009).

Geopolymers are an inorganic alumino-silicate polymer family produced through alkaline activation of various aluminosilicate virgin or waste materials rich in silicon and aluminum (Davidovits, 2015; Qaidi et al. 2022). The mixed proportions of the GPC consist of aluminosilicate source binder materials, fine and coarse aggregates, alkaline solutions, and water. The polymerization between the alkaline solutions and source binder materials produces solid concrete, almost like traditional concrete composites (Ahmed et al. 2021a). Many factors influenced the properties and performances of GPC, including the molarity of NaOH, the ratio of Na2SiO3/NaOH, curing regime and ages, water to solids ratio, alkaline solution to binder ratio, elemental composition and type of source binder materials, ratio of Si to Al in the geopolymer system, mixing time and rest period, superplasticizer dosage and extra water contents, and coarse and fine aggregate contents (Mohammed et al. 2021).

Nanotechnology is the ability to monitor and restructure matter at the atomic and molecular levels in the range of 1 to 100 nm, as well as the contribution to the distinct properties and phenomena at that size that are equivalent to those associated with individual atoms and molecules or bulk behavior (Roco et al. 2000). Nanotechnology is a burgeoning field of research, with novel science and practical applications gradually gaining prominence over the last two decades. Recently, efforts have been made to incorporate nanoparticles (NPs) into construction materials to improve the properties and performance of concrete (Lazaro et al. 2016). NPs were introduced with geopolymer matrices to enhance durability issues, physical structure, and mechanical properties of the geopolymer mixture (Assaedi et al. 2016). Because NPs have a higher surface area to volume ratio, they are highly reactive and have an effect on reaction rates (Wiesner and Bottero 2017). As a result, NPs modify the microstructure of GPC at the atomic level, resulting in significant improvements in both the fresh and hardened states, as well as microstructural behavior of geopolymer composites (Adak et al. 2017). In the literature, a wide range of NPs like nano-silica (nS) (Mustakim et al., 2020), nano-clay (nC) (Ravitheja and Kumar, 2019), nano-alumina (nA) (Shahrajabian and Behfarnia, 2018), carbon nanotubes (CNT) (Kotop et al. 2021), nano-metakaolin (Nm) (Rabiaa et al., 2020), and nano-titanium (nT) (Sastry et al. 2021) were consumed to improve various properties of the geopolymer composites, with nS being the most frequent as shown in Table 1. Since nano-silica was the most used nanomaterial to prepare geopolymer concrete composites (Ahmed et al. 2022) among all other NP types, therefore, this study was devoted to proposing different models to estimate the CS of GPC composites incorporated with nS.

Table 1 Properties and dosages of NPs reported in the literature to modify geopolymer concrete

Compression strength is a critical characteristic of all concrete composites, including GPC. The CS provides a broad assessment of the quality of the concrete (Ahmed et al. 2021b). However, the concrete’s CS at 28 days is critical in structural design and construction. As a result, creating a credible model for estimating the CS of concrete is crucial in terms of modifying or validating the concrete mix proportions (Golafshani et al., 2020). Several factors influence the CS of GPC, resulting in a wide range of compression strength results; consequently, estimating CS is a problematic issue for scholars and engineers. As a result, new numerical and mathematical models are required to clarify this issue (Shahmansouri et al., 2020a). Machine learning methods have been utilized in the literature to model various features of concretes, such as the CS of green concrete (Velay-Lizancos et al., 2017), essential mechanical properties of recycled concrete aggregate (Gholampour et al., 2020), recycled concrete aggregate modulus of elasticity (Golafshani and Behnood, 2018), the CS of environmentally friendly GPC using natural zeolite and silica-fume (Shahmansouri et al., 2020b), the CS of nS-modified self-compacting concrete (Faraj et al., 2021), and the CS of fly ash-based GPC composites (Ahmed et al. 2021c).

In the literature, there is a shortage of studies examining the impact of different mixture proportion parameters on the CS of GPC incorporated with nS at various curing temperatures and ages. Also, according to a complete and systematic assessment of GPC, the construction industry rarely uses an authoritative and developed model that uses numerous parameters to estimate the CS of GPC incorporated with nS. Most efforts have focused on a single-scale model that does not account for a wide range of experimental data or factors. Moreover, the CS of GPC is influenced by several factors. As a result, in a single equation and model structure, the effects of eleven variables such as the alkaline solution to binder ratio (l/b), binder content (b), fine (FA) and coarse (CA) aggregate content, sodium hydroxide (SH) and sodium silicate (SS) content, the ratio of SS/SH, the molarity of SH (M), nS content, curing temperatures (T), and ages of concrete specimens (A) were considered and quantified on the CS of GPC incorporated with nS by using different model techniques, namely artificial neural network (ANN), M5P-tree (M5P), linear regression (LR), nonlinear regression (NLR), and multi-logistic (MLR) models. Finally, different statistical tools, such as the root mean squared error (RMSE), mean absolute error (MAE), scatter index (SI), OBJ value, and the coefficient of determination (R2), were used to evaluate the created models’ accuracy.

Research significance

The primary goal of this paper is to create multiscale models for estimating the CS of GPC incorporated with nS. Thus, a diverse range of laboratory work data, approximately 207 tested specimens with a variety of l/b, b, FA, CA, SH, SS, M, SS/SH, nS, T, and ana A, were collected and reviewed using a variety of analytical approaches with the goal of (i) providing most effortless equations to be used by practicing engineers and scholars in their GPC mix design works; (ii) clarifying the effects of each mix proportion parameter and curing temperature and age on the CS of GPC incorporated with nS; (iii) quantifying and offering systematic multiscale models for forecasting the CS of GPC utilizing eleven variable input parameters; (iv) using statistical assessment methods such as MAE, RMSE, R2, OBJ, and SI to find the most authoritative model to forecast the CS of GPC composites incorporated with nS from various model strategies (LR, NLR, MLR, ANN, and M5P).

Methodology

The authors conducted an extensive search of several databases, including Research Gate, Science Direct, Google Scholar, Scopus, and the Web of Science. A wealth of papers was discovered discussing the effect of various NP types on the properties of geopolymer paste composites. However, a limited number of documents were found regarding the impact of NPs on the properties of GPC composites. In total, 207 datasets of the CS values were obtained. In the literature, a wide range of NPs like nS, nC, nA, CNT, nM, and nT were consumed to improve various properties of the GPC composites, with nS being the most frequent, as can be seen in Table 1. Therefore, in this study, the authors take those articles that used nS to improve various properties of the GPC composites.

In the modeling process, eleven input parameters were used, limiting the authors’ ability to utilize a greater number of data in the created models. The gathered datasets were statistically analyzed and classified into three groups. The models were built using the larger group, which included 135 datasets. The second group is made up of 36 datasets that were used to test the created models, and the final group is made up of 36 datasets that were consumed to validate the suggested models (Golafshani et al. 2020; Faraj et al. 2021). Table 2 shows the dataset ranges, including all significant parameters and the observed CS of the GPC incorporated with nS. The input dataset contains the following values: l/b ranges from 0.4 to 0.4, b ranges from 300 to 500 kg/m3, FA ranges from 490 to 990 kg/m3, CA ranges from 810 to 1470 kg/m3, SH ranges from 18.17 to 159.75 kg/m3, SS ranges from 40.8 to 187.5 kg/m3, M ranges from 4 to 16 M, SS/SH ranges from 0.33 to 3, nS ranges from 0 to 60 kg/m3, T ranges from 23 to 70 °C, A ranges from 0.5 to 180 days, and CS ranges from 3.2 to 81.3 MPa. The previous datasets were used to propose various models such as LR, NLR, MLR, ANN, and M5P to estimate the CS of GPC incorporated with nS; then, the developed models were evaluated using statistical criteria such as R2, RMSE, MAE, SI, and OBJ to determine the most reliable and accurate model. Additional details about this work’s methodology are shortened in a flowchart, as illustrated in Fig. 1.

Table 2 Summary of the constitutes of geopolymer concrete mixes reported in the literature
Fig. 1
figure 1

The flowchart diagram process followed in this study

Statistical assessment

Sufficient information about each variable input model parameter is provided in the following sections through “Alkaline solution to binder ratio (l/b)” to “Compressive strength (CS).”

Alkaline solution to binder ratio (l/b)

Based on the collected datasets, the ratio of l/b of the GPC mixtures modified with nS was in the range of 0.4 to 0.6, with the average and standard deviations of 0.49 and 0.05, respectively. Also, regarding other statistical analyses, it was found that the variance was 0.002, skewness was 0.66, and the kurtosis was − 0.25. Figure 2 depicts the relationship between CS and l/b with histograms of GPC mixtures incorporated with nS.

Fig. 2
figure 2

Correlations between CS and l/b ratio with histogram of GPC mixtures incorporated with nS

Binder content (b)

According to Table 1, F, GGBFS, MK, SF, RHA, and NP are those ashes that scholars used as source binder materials to produce GPC composites. The ranges of these binders were between 300 and 500 kg/m3, with the average and standard deviations of 417 kg/m3 and 51.8 kg/m3, correspondingly. At the same time, other statistical assessment tools like variance, skewness, and kurtosis were 2689, 0.11, and − 0.81, respectively, for the collected datasets. Figure 3 illustrates the CS and b content variation and frequencies of the gathered data of GPC mixtures incorporated with nS.

Fig. 3
figure 3

Correlations between CS and binder content with histogram of GPC mixtures incorporated with nS

Fine aggregate content (FA)

Like traditional concrete mixtures, natural and crushed sands were used as the FA in GPC mixtures. The FA should be satisfied with the requirements of ASTM standards. According to gathered datasets from the literature article, it was found that the range of FA was between 490 and 990 kg/m3, with an average of 681 kg/m3 and standard deviations of 135.2 kg/m3. More information regarding other statistical assessment tools can be found in Fig. 4.

Fig. 4
figure 4

Correlations between CS and FA content with histogram of GPC mixtures incorporated with nS

Coarse aggregate content (CA)

Natural, crushed, and recycled aggregates are those forms of aggregates that were used as the CA in geopolymer concrete mixtures, just like conventional concrete mixtures. Same as FA, the CA should have all the properties which are required by ASTM standards. Regarding the ranges of CA, it was concluded that the contents of CA in past research varied between 810 and 1470 kg/m3 with an average of 1113.8 kg/m3 and standard deviations of 183.2 kg/m3. On the other hand, the variance, skewness, and kurtosis were 33,580, − 0.19, and − 0.71, respectively. Also, the correlations between the CS of tested datasets and the CA contents can be found in Fig. 5.

Fig. 5
figure 5

Correlations between CS and CA content with histogram of GPC mixtures incorporated with nS

NaOH content (SH)

Pellets and flakes are two forms of SH in a solid state with a purity above 97%. This material is mixed with the required amount of water to prepare a solution of SH with the required molarity. In this study, according to the collected datasets, the amount of SH in 1 m3 of GPC mixtures incorporated with nS was in the range between 18.1 and 159.7 kg/m3, with an average of 71.3 kg/m3 and a standard deviation of 33.9 kg/m3. Extra information about other statistical assessment criteria and correlations between the CS and SH content can be found in Fig. 6.

Fig. 6
figure 6

Correlations between CS and SH content with histogram of GPC mixtures incorporated with nS

Na2SiO3 content (SS)

Water glass or sodium silicate is present in a liquid form which mainly consists of Na2O, SiO2, and H2O. Based on the previous research conducted on the GPC mixtures incorporated with nS, the range of SS was found between 40.8 and 187.5 kg/m3, with an average of 134.4 kg/m3 and the standard deviations of 35.6 kg/m3. In comparison, other stats information like variance, skewness, and kurtosis were 1268, − 1.42, and 1.55, correspondingly. Furthermore, the correlations between the CS and the SS contents of GPC can be found in Fig. 7.

Fig. 7
figure 7

Correlations between CS and SS content with histogram of GPC mixtures incorporated with nS

Molarity (M)

In the field of GPC science, the concentrations of sodium hydroxide inside water were called molarity. The authors of this study found that the molarity of SH in the collected papers was in the range between 4 and 16 M, with an average of 11.9 M and standard deviations of 3.3 M. Also, it was found that the variance of the reviewed datasets was 11.1, the skewness was − 1.4, and kurtosis was 1.3. The variations between the CS and M with the frequency of their datasets of GPC incorporated with nS are presented in Fig. 8.

Fig. 8
figure 8

Correlations between CS and molarity of SH with histogram of GPC mixtures incorporated with nS

Na2SiO3/NaOH (SS/SH)

This parameter consists of a mixture of SS and SH with the required molarity. Usually, it is prepared about 24 h before mixing the GPC ingredients. According to the gathered datasets, this parameter was used in the range between 0.33 and 3, with an average of 2.05 and standard deviations of 0.76. Also, the other statistical criteria were found to be 0.59, − 1.2, and 0.22 for the variance, skewness, and kurtosis, respectively. Moreover, correlations between the CS and the SS/SH are illustrated in Fig. 9, with the frequencies of their datasets.

Fig. 9
figure 9

Correlations between CS and SS/SH ratio with histogram of GPC mixtures incorporated with nS

Nano-silica (nS) content

As mentioned earlier, nS was the most frequently NPs that scholars used to improve various properties of GPC composites. It was used as a binder replacement or just by the addition. Table 1 shows the different properties of nS and other NP types that were utilized in GPC composites. Regarding the values of this input model parameter, it was found that the range of nS was used to improve GPC composites in the range between 0 and 60 kg/m3, with an average of 11.6 kg/m3, and the standard deviation of 14.5 kg/m3. Similarly, other statistical criteria with the correlations between the CS and the nS content can be found in Fig. 10.

Fig. 10
figure 10

Correlations between CS and nS content with histogram of GPC mixtures incorporated with nS

Curing temperatures (T)

Ambient, steam, and oven curing regimes were commonly used to cure GPC composites. One of the reasons behind using NPs in GPC composites is to take away from the oven and steam curing methods and go toward ambient curing methods. Based on the collected datasets, GPC specimens modified with nS were cured in the temperature ranges between 23 and 70 °C, with an average of 42.05 °C and the standard deviations of 17.4 °C. Also, other statistical assessment tolls like variance, skewness, and kurtosis were 303.9, 0.11, and − 1.92, respectively. The variations of the CS with the nS content and the frequencies of nS datasets are presented in Fig. 11.

Fig. 11
figure 11

Correlations between CS and T with histogram of GPC mixtures incorporated with nS

Age of specimens (A)

To gain sufficient early and late CS, the curing ages should be extended to promote the polymerization process, which strengthens geopolymers. Thus, based on the collected datasets, the cure time for GPC incorporated with nS ranged from 0.5 to 180 days, with an average of 28 days and standard deviations of 31.8 days. Similarly, the published datasets’ variance, skewness, and kurtosis were 1012.8, 2.36, and 6.96, respectively. The relationships between the CS and the specimen ages with the frequencies of collected data are shown in Fig. 12.

Fig. 12
figure 12

Correlations between CS and A with histogram of GPC mixtures incorporated with nS

Compressive strength (CS)

An applied vertical load per unit area of the GPC specimens was known as normal stress or compressive strength. This property is one of the critical mechanical properties of GPC composites. As shown in Table 2, the range of the CS for the gathered datasets was in the range between 3.2 and 81.3 MPa, with an average of 36.2 MPa and standard deviations of 17.52 MPa. At the same time, other statistical criteria like variance, skewness, and kurtosis were 307, 0.15, and − 0.75, respectively.

Modeling

Based on the coefficients of the determinations (R2) of the collected input model parameters, as shown in Figs. 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, and 12, there is no direct relationship between the CS and any individual input model parameters. Therefore, multiscale model techniques, including M5P, MLR, ANN, LR, and NLR, are employed to develop empirical models to forecast the CS of GPC composites incorporated with nS in different mix proportion parameters, curing regimes, and specimen ages.

For creating the models, the collected datasets are split into three categories. The models were built using the larger group, which included 135 datasets. The second group is made up of 36 datasets that were utilized to test the created models, and the final group is made up of 36 datasets that were consumed to validate the suggested models (Golafshani et al. 2020; Faraj et al. 2021). The forecasts of various models were compared employing these criteria: (1) The model’s validity should be established scientifically; (2) between estimated and tested data, it should have a lower percentage of error; (3) the RMSE, OBJ, and SI values of the suggested equations should be low, while the R2 value should be high.

  1. (a)

    Linear regression model (LR)

LR is one of the standard methods that scholars used to estimate and forecast the CS of concrete composites (Faraj et al., 2021). This model has a general form, as depicted in Eq. (1) (Ahmed et al. 2021c).

$$CS=a+b\left(x1\right)$$
(1)

where, CS, \(x1\), \(a,\;\mathrm{and}\;b\) represent the compressive strength, one of the variable input parameters, and model parameters, respectively. This equation contains just one variable of input data, so to have more practical and reliable investigations, Eq. (2) is suggested, which contains a wide range of input variable data parameters that can cover all of the geopolymer concrete mixture proportions and curing conditions, as well as curing ages.

$$CS=a+b\left(\frac{l}{b}\right)+c\left(b\right)+d\left(FA\right)+e\left(CA\right)+f\left(SH\right)+g\left(SS\right)+h\left(M\right)+i\left(\frac{SS}{SH}\right)+j\left(nS\right)+k\left(T\right)+l\left(A\right)$$
(2)

As mentioned earlier, all these main variables in Eq. (2) were described except that the a, b, c, d, e, f, g, h, i, j, k, and l are the model parameters. Equation (2) is a one-of-a-kind equation because it incorporates a large number of independent variables to generate GPC incorporated with nS that may be extremely useful in the construction industry. On the other hand, because all variables can be adjusted linearly, the proposed Eq. (2) can be considered an extension of Eq. (1).

  1. (b)

    Nonlinear regression model (NLR)

In terms of the NLR, Eq. (3) may be regarded as a general form for proposing an NLR model (Ahmed et al. 2021c). The interrelationships between the variables in Eqs. (1) and (2) can be used to calculate the CS of normal geopolymer concrete mixtures and geopolymer concrete mixtures modified with nS using Eq. (3).

$$CS=a*{\left(\frac{l}{b}\right)}^{b}*({b)}^{c}*{\left(FA\right)}^{d}* {\left(CA\right)}^{e}*{\left(SH\right)}^{f}*{\left(SS\right)}^{g}*{\left(M\right)}^{h}*{\left(\frac{SS}{SH}\right)}^{i}*{\left(T\right)}^{j}*{\left(A\right)}^{k}+l*{\left(\frac{l}{b}\right)}^{m}*{\left(b\right)}^{n}*{\left(FA\right)}^{o}*{\left(CA\right)}^{p}*{\left(SH\right)}^{q}*{\left(SS\right)}^{r}*{\left(M\right)}^{s}*{\left(\frac{SS}{SH}\right)}^{t}*{\left(T\right)}^{u}*{\left(A\right)}^{v}*{\left(nS\right)}^{w}$$
(3)

where: all of the variables in this equation were provided earlier, except that the a, b, c, d, e, f, g, h, i, j, k, l, m, n, o, p, q, r, s, t, u, and v are described as a model parameter.

  1. (iii)

    Multi-logistic regression model (MLR)

As with the previous models, the collected datasets were subjected to multi-logistic regression analysis, and the general form of the MLR is shown in Eq. (4) based on the research conducted by Ahmed et al. (2021c) and Faraj et al. (2021). MLR is used to distinguish a nominal predictor variable from one or more independent variables.

$$CS=a\ast\left(\frac lb\right)^b\ast({b)}^c\ast\left(FA\right)^d\ast({CA)}^e\ast\left(SH\right)^f\ast\left(SS\right)^g\ast\left(M\right)^h\ast\left(\frac{SS}{SH}\right)^i\ast\left(nS\right)^j\ast\left(T\right)^k\ast\left(A\right)^l$$
(4)

where: all of the variables in this equation were provided earlier. Moreover, in this equation, the value of nS should be greater than 0.

  1. (iv)

    Artificial neural network (ANN)

ANN is a powerful simulation software designed for data analysis and computation that processes and analyzes data similarly to a human brain. This machine learning tool is widely used in construction engineering to forecast the future behavior of a variety of numerical problems (Mohammed 2018; Sihag et al. 2018).

An ANN model is generally divided into three main layers: input, hidden, and output. Depending on the proposed problem, each input and output layer can be one or more layers. On the other hand, the hidden layer is usually ranged for two or more layers. Although the input and output layers are generally determined by the collected data and the purpose of the designed model, the hidden layer is determined by the rated weight, transfer function, and bias of each layer to other layers. A multi-layer feed-forward network is constructed using a combination of proportions, weight/bias, and several parameters as inputs, including (l/b, b, FA, CA, etc.), and the output ANN is compressive strength.

There is no standardized method for designing network architecture. As a result, the number of hidden layers and neurons is determined through a trial and error procedure. One of the primary goals of the network’s training process is to determine the optimal number of iterations (epochs) that provide the lowest MAE, RMSE, and best R2-value close to 1. The effect of several epochs on lowering the MAE and RMSE has been studied. For the purpose of training the designed ANN, the collected dataset (a total of 207 data) was divided into three parts. Approximately 70% of the collected data was used as training data to train the network. The dataset was tested with 15% of the total data, and the remaining data were used to validate the trained network (Demircan et al., 2011). The designed ANN was trained and tested for various hidden layers to determine the optimal network structure based on the fitness of the predicted CS of GPC incorporated with nS with the CS of the actual collected data. It was observed that the ANN structure with two hidden layers, 24 neurons, and a hyperbolic tangent transfer function was a best-trained network that provides a maximum R2 and minimum both MAE and RMSE (shown in Table 3). As a part of this work, an ANN model has been used to estimate the future value of the CS of GPC incorporated with nS. The general equation of the ANN model is shown in Eqs. (5), (6), and (7).

Table 3 The tested ANN architectures

From linear node 0:

$$CS=Threshold+\left(\frac{Node 1}{1+{e}^{-B1}}\right)+\left(\frac{Node 2}{1+{e}^{-B2}}\right)+\dots$$
(5)

From sigmoid node 1:

$$B1=Threshold+\sum \left(Attribute*Variable\right)$$
(6)

From sigmoid node 2:

$$B2=Threshold+\sum \left(Attribute*Variable\right)$$
(7)
  1. (e)

    M5P-tree model (M5P)

The M5P model tree reconstructs Quinlan’s M5P-tree algorithm (Quinlan 1992), a decision tree with a linear regression function added to the leaf nodes. The decision tree encapsulates the algorithms in a tree structure formed by nodes formed during training on data. The nodes of the decision tree are classified as root nodes, internal nodes, and leaf nodes. Nodes are interconnected through branches until the leaves are reached (Malerba et al., 2004). Mohammed (2018) also introduced the M5P-tree as a robust decision tree learner model for regression analysis. The linear regression functions are placed at the terminal nodes by this learner algorithm. Classifying all datasets into multiple sub-spaces assigns a multivariate linear regression model to each sub-space. The M5P-tree algorithm operates on continuous class problems rather than discrete segments and is capable of handling tasks with a high number of dimensions. It reveals the developed information of each linear model component constructed to estimate the nonlinear correlation of the datasets. The information about division criteria for the M5-tree model is obtained through the error calculation at each node. The standard deviation of the class entering that node at each node is used to analyze errors. At each node, the attribute that maximizes the reduction of estimated error is used to evaluate any task performed by that node. As a result of this division in the M5P tree, a large tree-like structure will be generated, which will result in overfitting. The enormous tree is trimmed in the followed step, and linear regression functions restore the pruned subtrees. The general equation form of the M5P-tree model is the same as the linear regression equation, as shown in Eq. (8).

$$CS=a+b\left(\frac{l}{b}\right)+c\left(b\right)+d\left(FA\right)+e\left(CA\right)+f\left(SH\right)+g\left(SS\right)+h\left(M\right)+i\left(\frac{SS}{SH}\right)+j\left(nS\right)+k\left(T\right)+l\left(A\right)$$
(8)

where: the descriptions of all of the variables in this Eq. (8) were provided earlier.

Model efficiencies

To rate and assess the proposed models’ accuracy, various performance stats tools such as R2, RMSE, MAE, SI, and OBJ were used, which they have the following equations:

$${R}^{2}={\left(\frac{ \sum_{p=1}^{p}\left({y}_{p}-y\right)\left({x}_{p}-{x}^{^{\prime}}\right)}{\sqrt{\left[\sum_{p=1}^{p}{\left({y}_{p}-{y}^{^{\prime}}\right)}^{2}\right]\left[\sum_{p=1}^{p}{\left({x}_{p}-{x}^{^{\prime}}\right)}^{2}\right]} }\right)}^{2}$$
(9)
$$RMSE=\sqrt{\frac{\sum_{p=1}^{p}{({x}_{p}-{y}_{p})}^{2}}{n}}$$
(10)
$$MAE=\frac{\sum_{p=1}^{p}|({x}_{p}-{y}_{p})|}{n}$$
(11)
$$SI=\frac{RMSE}{{y}^{^{\prime}}}$$
(12)
$$OBJ=\left(\frac{{n}_{tr}}{{n}_{all}}*\frac{{{RMSE}_{tr}+ MAE}_{tr}}{{R}_{tr}^{2}+1}\right)+\left(\frac{{n}_{tst}}{{n}_{all}}*\frac{{{RMSE}_{tst}+ MAE}_{tst}}{{R}_{tst}^{2}+1}\right)+\left(\frac{{n}_{val}}{{n}_{all}}*\frac{{{RMSE}_{val}+ MAE}_{val}}{{R}_{val}^{2}+1}\right)$$
(13)

where:

xp and yp are estimated and tested CS values; y′ and x′ are averages of experimentally tested and the estimated values from the models, respectively. tr, tst, and val referred to the training, testing, and validating datasets, respectively, and n is the number of datasets. Except for the R2 value, 0 is the optimal value for all other evaluation parameters. However, 1 is the highest benefit for R2. When it comes to the SI parameter, a model has bad performance when it is > 0.3, acceptable performance when it is 0.2 SI 0.3, excellent performance when it is 0.1 SI 0.2, and great performance when it is 0.1 SI 0.1 (Faraj et al., 2021). Furthermore, the OBJ parameter was employed as a performance measurement parameter in Eq. (13) to measure the efficiency of the suggested models.

Results and analysis

  1. (a)

    LR model

The output of this model revealed that the l/b, SS/SH, and M are those parameters that have a greater impact on the CS of GPC incorporated with nS than other parameters. This result was confirmed by a wide range of published experimental works in the literature (Hardjito et al. 2004; Deb et al. 2014; Oyebisi et al. 2020). Equation (14) with the weight of each model parameter is the output of this model. Optimizing the sum of error squares and the least square method, which were implemented in an Excel program using Solver to calculate the ideal value for the equation in one cell designated the objective cell, were used to determine the weighting of each parameter on the CS of GPC mixtures incorporating nS. The values of other equation cells constrained this object cell in the worksheet (Faraj et al., 2021).

$$CS=110.06-66.6\left(\frac{l}{b}\right)+0.06\left(b\right)-0.01\left(FA\right)+0.005\left(CA\right)-0.34\left(SH\right)+0.12\left(SS\right)-3.12\left(M\right)-13.9\left(\frac{SS}{SH}\right)-0.18\left(nS\right)+0.11\left(T\right)+0.23\left(A\right)$$
(14)

Figure 13a, b, and c depict the relationship between estimated and real CS of GPC mixtures incorporated with nS for training, testing, and validating datasets, respectively. Moreover, this model was evaluated by some statistical assessment tools, and it was observed that the R2 and RMSE for the training datasets were equal to 0.7989 and 7.65 MPa, respectively, and as illustrated in Figs. 25 and 26, the other statistical criteria like OBJ and SI were 8.05 MPa and 0.209. Finally, utilizing training, testing, and validating datasets, the residual CS for the LR model for the forecasted and observed CS is displayed in Fig. 14.

  1. (b)

    NLR model

Fig. 13
figure 13

Comparison between tested and predicted CS of GPC mixtures incorporated with nS using LR model: a training data, b testing data, c validating data

Fig. 14
figure 14

Residual error diagram of CS of GPC mixtures incorporated with nS using training, testing, and validating datasets for LR model

The correlations between the actual and forecasted CS of GPC mixtures incorporated with nS are presented in Fig. 15a, b, and c for the training, testing, and validating datasets, correspondingly. As shown in Eq. (15), the weight of the model parameters demonstrated that the l/b, SH, and M are those input variable parameters that significantly affect the CS of geopolymer concrete mixtures modified with nS. This result was also well-validated in the previous experimental laboratory research works (Hardjito et al. 2004; Aliabdo et al. 2016; Ghafoor et al. 2021).

Fig. 15
figure 15

Comparison between tested and predicted CS of GPC mixtures incorporated with nS using NLR model: a training data, b testing data, c validating data

$$CS=-4643005*{\left(\frac{l}{b}\right)}^{3.78}*({b)}^{1.34}*{\left(FA\right)}^{-1.87}* {\left(CA\right)}^{0.08}*{\left(SH\right)}^{5.5}*{\left(SS\right)}^{-7.85}*{\left(M\right)}^{2.72}*{\left(\frac{SS}{SH}\right)}^{-6.06}*{\left(T\right)}^{-0.53}*{\left(A\right)}^{0.02}+73.89*{\left(\frac{l}{b}\right)}^{0.37}*{\left(b\right)}^{-0.03}*{\left(FA\right)}^{-0.45}*{\left(CA\right)}^{0.44}*{\left(SH\right)}^{-0.15}*{\left(SS\right)}^{-0.001}*{\left(M\right)}^{-0.02}*{\left(\frac{SS}{SH}\right)}^{-0.15}*{\left(T\right)}^{0.0002}*{\left(A\right)}^{0.15}*{\left(nS\right)}^{0.002}$$
(15)

Similar to the LR model, this model was also assessed by some statistical criteria, and it was found that the R2, RMSE, OBJ, and SI of the training datasets were equal to 0.8792, 5.92 MPa, 5.80 MPa, and 0.162, respectively. Furthermore, the difference between actual and estimated CS of geopolymer concrete mixtures modified with nS can be found in Fig. 16 for all the validating, testing, and training datasets.

  1. (iii)

    MLR model

Fig. 16
figure 16

Residual error diagram of CS of GPC mixtures incorporated with nS using training, testing, and validating datasets for NLR model

Equation (16) shows the generated models for the MLR model with various variable parameters. The most significant independent factors that impact the CS of the geopolymer concrete mixtures modified with nS in the MLR model were SS content, age of the specimens, and curing temperatures, which are matched with some experimental studies published in the past articles (Jindal et al. 2017; Hassan et al. 2019; Ghafoor et al. 2021).

$$CS=29.0347208*{\left(\frac{l}{b}\right)}^{-1.47}*({b)}^{0.04}*{\left(FA\right)}^{-0.21}* {\left(CA\right)}^{-0.26}*{\left(SH\right)}^{-0.09}*{\left(SS\right)}^{0.56}*{\left(M\right)}^{-0.49}{* \left(\frac{SS}{SH}\right)}^{-0.22}*{\left(nS\right)}^{-0.005}*{\left(T\right)}^{0.13}*{\left(A\right)}^{0.2}$$
(16)

Figure 17a was created by utilizing training datasets to depict the anticipated and measured CS correlations for the GPC mixtures incorporated with nS. Furthermore, similar to the earlier models, this model was tested using two parts of data (validating and testing data) to demonstrate its efficacy for variables not included in the model data (training data). The findings indicate that by substituting the independent variables into the established equation, this model can predict the CS of GPC, as illustrated in Fig. 17b and c. The values of R2 and RMSE for this developed model are 0.7787 and 8.02 MPa, respectively, for the training datasets. Also, as depicted in Figs. 25 and 26, the value of other statistical assessment tools like OBJ and SI values was observed at 8.8 MPa and 0.22, respectively. Lastly, utilizing validating, training, and testing data, the residual CS for the MLR model for predicted and observed CS of GPC incorporated with nS is displayed in Fig. 18.

  1. (iv)

    ANN model

Fig. 17
figure 17

Comparison between tested and predicted CS of GPC mixtures incorporated with nS using MLR model: a training data, b testing data, c validating data

Fig. 18
figure 18

Residual error diagram of CS of GPC mixtures incorporated with nS using training, testing, and validating datasets for MLR model

In this study, the authors tried a lot to get the high efficiency of the ANN by applying different numbers of the hidden layer, neurons, momentum, learning rate, and iteration, as can be seen in Table 3. Lastly, it was observed that when the ANN has two hidden layers, 24 neurons (12 for left side and 12 for the right side as shown in Fig. 19), 0.2 momenta, 0.1 learning rate, and 2000 iterations give the best-predicted values of the CS of the GPC mixtures incorporated with nS. The ANN model was equipped with the training datasets, accompanied by testing and validating datasets to predict the compression strength values for the correct input parameters. The comparisons between estimated and experimentally tested CS of GPC mixtures incorporated with nS for training, testing, and validating datasets are presented in Fig. 20a, b, and c. The consumed data have + 10% and − 20% error lines for the training and testing datasets, and ± 10% for the validating datasets, which is better than the other developed models. Furthermore, this model has a better performance than other models to predict the CS of the GPC incorporated with nS based on the value of OBJ and SI illustrated in Figs. 25 and 26. Also, the value of R2 = 0.9771, MAE = 2.83 MPa, and RMSE = 3.89 MPa. Finally, the differences in the value of the CS for estimated and tested GPC mixtures incorporated with nS can be found in Fig. 21 by consuming all the datasets.

  1. (e)

    M5P model

Fig. 19
figure 19

Optimal network structures of the ANN model

Fig. 20
figure 20

Comparison between tested and predicted CS of GPC mixtures incorporated with nS using ANN model: a training data, b testing data, c validating data

Fig. 21
figure 21

Residual error diagram of CS of GPC mixtures incorporated with nS using training, testing, and validating datasets for ANN model

The predicted and observed CS of the GPC mixtures incorporated with nS for whole the datasets are shown in Fig. 22a, b, and c. Similar to the other models, it was discovered that the l/b and M of the GPC mixtures incorporated with nS have the greatest impact on the CS of the GPC mixtures incorporated with nS, which agrees with experimental findings in the past studies (Hardjito et al. 2004; Aliabdo et al. 2016; Ghafoor et al. 2021). Figure 23 shows the tree-shaped branch correlations. Also, the model (in Eq. (17)) parameters are summarized in Table 4, and the model variables will be selected based on the linear tree registration function.

Fig. 22
figure 22

Comparison between tested and predicted CS of GPC mixtures incorporated with nS using M5P-tree model: a training data, b testing data, c validating data

Fig. 23
figure 23

M5P-tree pruned model tree

Table 4 M5P-tree model parameters (Eq. (17))
$$CS=a+b\left(\frac{l}{b}\right)+c\left(b\right)+d\left(FA\right)+e\left(CA\right)+f\left(SH\right)+g\left(SS\right)+h\left(M\right)+i\left(\frac{SS}{SH}\right)+j\left(nS\right)+k\left(T\right)+l\left(A\right)$$
(17)

For all of the training, testing, and validation datasets, there is a 20% error line. Finally, for all datasets, the residual CS for the M5P model was displayed in Fig. 24 for both predicted and observed CS. Furthermore, this model’s R2, RMSE, MAE, OBJ, and SI evaluation criteria are 0.9454, 5.59 MPa, 4.45 MPa, 6.0 MPa, and 0.153, respectively, for the training datasets.

Fig. 24
figure 24

Residual error diagram of CS of GPC mixtures incorporated with nS using training, testing, and validating datasets for M5P model

Proposed models’ performance

As early mentioned, the efficiency of the developed models was evaluated by employing these five stats tools: RMSE, MAE, SI, OBJ, and R2. When compared to the LR, NLR, MLR, and M5P models, the ANN model has a higher R2 with lower RMSE and MAE values, as well as lower OBJ and SI values.

In addition, Fig. 27 shows a comparison of model predictions of the CS of GPC mixtures incorporated with nS based on the testing datasets. Furthermore, Figs. 14, 16, 18, 21, and 24 display the residual errors for the CS by consuming all the datasets. The whole figures show that the estimated and tested CS values for the ANN model are close, indicating that the ANN model is more accurate than other models.

Figure 25 shows the OBJ values for all of the proposed models. The OBJ is 8.05, 5.8, 8.8, 3.59, and 6.0 for LR, NLR, MLR, ANN, and M5P, respectively. The ANN model has a lower OBJ value, about 124% less than the LR model, 61.5% less than the NLR model, 145% less than the MLR model, and 67% less than the M5P model. This also emphasized that the ANN model better forecasts the CS of GPC incorporated with nS.

Fig. 25
figure 25

The OBJ values of all developed models

In addition, Fig. 26 shows the SI values for the created models during the training, validating, and testing phases. The SI values for NLR and ANN models for the entire training, testing, and validating datasets were between 0.1 and 0.2, signalizing good accuracy for these models. While, for the other remaining models, the values of SI were between 0.2 and 0.3, this result revealed that the performance of the LR, MLR, and M5P models is in fair condition. Similar to other statistical assessment criteria, the ANN model has smaller SI values among the entire models. The ANN model has lower SI values (for training datasets) than the LR, NLR, MLR, and M5P models by 97.2%, 52.8%, 107.5%, and 44.3%, respectively. This also demonstrated that when forecasting the CS of GPC mixtures incorporated with nS, the ANN model is more efficient and performs better than the other models (Fig. 27).

Fig. 26
figure 26

Comparing the SI performance parameters of different developed models

Fig. 27
figure 27

Compression between model predictions of CS of GPC mixtures incorporated with nS using testing datasets

Conclusions

Using new scientifical technics like empirical correlations and neural networks to estimate the CS of GPC mixtures incorporated with nS can save time and money. LR, NLR, MLR, ANN, and M5P models were used in this article to develop predictive models for estimating the CS of GPC mixtures incorporated with nS. Based on the extensive revision and data gathering, the following conclusion can be drawn:

  1. (i)

    The average amount of nS used in GPC mixtures was 11.6 kg/m3, or approximately 3% of the binder content. Additionally, the percentage of nS substituted with binder varied between 0 and 60 kg/m3.

  2. (ii)

    The LR, NLR, MLR, ANN, and M5P models were all successfully utilized to create predictive models for the CS of the GPC mixtures incorporated with nS. The estimated CS closely matched experimentally measured CS of GPC mixtures that contained nS.

  3. (iii)

    The whole created models satisfied all the statistical assessment criteria such as R2, RMSE, MAE, OBJ, and SI.

  4. (iv)

    The ANN model outperforms the other three models based on statistical evaluation and sensitivity analysis. For the training, testing, and validating datasets, the R2 values are 0.9771, 0.9777, and 0.9923, respectively. Furthermore, the RMSE, MAE, OBJ, and SI stats criteria for the training dataset for the ANN model are 3.892 MPa, 2.832 MPa, 3.59 MPa, and 0.106, respectively. Consequently, the ANN model has greater generality and suitability in the initiatory design of GPC mixtures incorporated with nS.

  5. (v)

    The two-layer ANN model with twelve neurons in each layer is the best model combination for estimating the CS of GPC mixtures incorporated with nS.

  6. (vi)

    The sequence for suitability and having greater performances of the proposed models is as follows: ANN, NLR, M5P, LR, and MLR.

  7. (vii)

    The obtained results indicate that the most significant variable parameters for estimating the CS of GPC mixtures which contained nS are the l/b, SS/SH, M, T, and A.