Introduction

Concrete is the main component in the construction industry. However, ordinary concrete mixtures cannot fulfill the industrial demand for all structural projects, wherein, special combinations and performance is needed for some structural elements. Therefore, in some projects, the use of high-performance concrete (HPC) is enviable (Aïtcin, 2011). HPC excelled in many physical characteristics in comparison with the ordinary concrete throughout permeability, durability, bleeding, workability, and extreme compression strength.

Compressive strength is an extremely vital property of concrete. The adopted method of compressive strength test included soaking the sample in water for 28 days and a carefully controlled environment (Khademi, 2017). Numerous factors affect the compressive strength of concrete including compaction, age, additives, aggregate size, water-cement ratio, etc. The compressive strength assay and assessment is time-consuming especially for ongoing buildings and infrastructural projects (Khademi, 2016). The importance of conducting research targeting the development of useful toll to predict concrete essential properties comes from the continuous need to save energy and time which caused by conventional laboratory work. (Yu, 2018),

Recently, many fields of science are adopting artificial neural networks (ANN) (Cao, 2017; Fiyadh, 2017, 2018; Fiyadh, et al. 2017, 2018; Shahin et al. 2001; Shi, 2016), and concrete research is not an exception to this field. (Akbari & Jafari Deligani, 2020; Atici & U.J.E.S.w.a., 2011; Hola & Schabowicz, 2005; Lee, 2003; Li, 2019; Öztaş, 2006a; Reddy, 2018; Zhang, 2020). Several studies, conducted over the last two decades, have shown that the use of ANN approaches to estimate the physical and chemical properties of concrete has enhanced the quality control of this field (Ni & Wang, 2000). Machine learning approaches such as ANN have gain popularity in solving complex engineering problems and capture the nonlinear relations where the empirical and casual approach may not address these problems well (AlOmar et al. 2020; Kaveh et al. 2001, 2008; Rofooei et al. 2011). For the last two decades, the backpropagation algorithm is one of the most efficient algorithms used to train ANN and correct the weight and bias values (Iranmanesh & Kaveh, 1999; Kaveh & Dehkordi, 2002; Kaveh & Elmieh, 2001; Kaveh & Iranmanesh, 1998; Kaveh & Servati, 2001).

Many researchers used traditional ANN techniques for such purposes including, feed-forward neural network (Ni & Wang, 2000; Öztaş, 2006b; Słoński, 2010), the back-propagation (Naderpour et al. 2018), multiple linear regression, and fuzzy logic (Khademi, 2016, 2017). It is worth mentioning, that the prediction of compressive strength of concrete by conventional ANN is difficult and inaccurate due to the many parameters involved in building the model (Nikoo, et al. 2015). Unlike ordinary concrete, the compressive strength test of HPC and high-strength concrete (HSC) is more complicated due to many additive materials(Hameed, 2021). Thus, reliable forecasting of the HPC compressive strength is considered a major challenging mission facing researchers in the recent decade. The traditional method such as pure ANN models are no longer efficient in addressing concrete properties prediction. To overcome this issue, researchers start to use combined models and techniques (Nikoo, et al. 2015). Eventually, in this study, the principal component analysis (PCA) method is adopted using experimental data containing concrete samples of different variables. The data set consists of more than one thousand HPC samples with a variety of ages; 1, 3, 7, 14, 28, 90, 270 and 365 days.

This research aims to constitute a robust predictive model that can forecast the compressive strength of concrete with fewer forecasted errors. This study suggests employing ANN coupled with principal components analyses (PCA) technique and cross-validation (CV) method to constitute a more reliable effective model abbreviated as (ANN-CVPCA). For further assessment, the proposed model performance was compared with several and novel models developed in previous studied to ensure the reliability of the proposed model. Besides, a sensitivity analysis procedure was applied to study the effect of each input parameter and to identify the one which has the greatest effect on the compressive strength test of HPC.

Methodology

The dataset

Modeling HPC is a difficult task due to the complexity of its components. In this study, dozens of HPC data collected from many sources by (Yeh (1998)), were used to inspect the reliability and accuracy of the model. The 1030 samples obtained contained several characteristics including blast furnace slag (kg/m3), superplasticizer (kg/m3), cement (kg/m3), age (days), coarse aggregate (kg/m3), water (kg/m3), fine aggregate (kg/m3), fly ash (kg/m3), and concrete compressive strength (MPa). The details of all variables used in this study are presented in Table 1.\({S}_{X},{{X}_{min,}X}_{max},,{C}_{sx},{C}_{V}{,C}_{c}\) denote the stander deviation, minimum, maximum, skewness, coefficient of variation, and coefficient of correlation with compressive strength of HPC.

Table 1 Descriptive analysis of compressive strength of concrete variables

Back propagation artificial neural network

In machine learning, the backpropagation (BP) algorithm is broadly used as a learning algorithm to train a single layer of ANN. During the training phase, the BP algorithm iteratively adjusts and updates the bias and connection weight values (Afan, 2015; Yaseen, 2016). Figure 1 shows the classical structure of ANN. The ANN training method can be carried out as follows, with the ANN initiating input parameters and the BP algorithm starting to compute the sum square error by calculating the gap between the actual and predicted ANN values. The BP algorithm minimizes the sum square error in the second stage by setting weights and bias values by propagating the expected error from the output layer back to the input layer. (Fiyadh, 2019a, b; Hameed, 2017; Heidari et al. 2017; Kewalramani & Gupta, 2006). Since the BP algorithm is known to be a gradient descent-based algorithm, the variance in weight values is directed to the gradient in the negative path, refer to Eq. (1).

$$ \Delta W_{k} = \alpha \Delta W_{k - 1} - \eta \frac{\partial E}{{\partial W}} $$
(1)
Fig. 1
figure 1

Classical structure of ANN


where W is the weight and bias values, \(\alpha \) is the momentum parameter, \(\Delta {W}_{k}\) and \(\Delta {W}_{k-1}\) refer to the changes in the weight W at k, and k-1 stands for iteration, η stands for the learning rate factor, and \(E\) is the forecasted error.

Model development

Principal Component Analysis (PCA) can be defined as a dimension reduction procedure usually used for converting the large input set variables to smaller ones that contain the most beneficial information from the whole large data set. PCA is a mathematical technique that converts high dimensional and correlated variables to smaller dimensional and uncorrelated (orthogonal) variables known as principal components (PCs) which are assigned as a linear combination’s procedure of the original parameters. The eigenvalues of the PCs measured the associated variance while the sum of eigenvalues corresponded with the total number of parameters in the data set. The first PCs considered the most important ones since it contains most of the variability in the original data set and the following PCs considered for another possible residual variability. PCA decreases the characteristic space from a higher number of parameters to a smaller number of variables since it is considered a non-dependent method which means. Besides, PCA does not assume a dependent variable as defined. PCA is considered as data compression or dimensionality reducing method where the main goal for using this technique is to ensure the presented data can be effectively interpretable. In this study, the input variables are transformed into uncorrelated variables and compressed before introducing them to the ANN model.

One of the main steps in developing an advanced and accurate model is data partition. This reality arises from the complex phenomena of compressive strength of HPC, and it is therefore mostly inaccurate to construct a single predictive model that is capable to capture and mimic the entire system characteristic using only one group. Accordingly, in this study, the K-fold cross-validation approach was used for reducing the uncertainty in modeling outcomes. In this current study, the dataset was randomly divided into five equaled-size subsets (K) which were all used for training the model, except one, which is used for testing the model accuracy. The process repeated 5 times, and the final estimations computed by averaged the obtained results, refer to Fig. 2. The most accurate model is subsequently selected to simulate the compressive strength of HPC using the training and testing dataset.

Fig. 2
figure 2

K-fold cross validation approach

The scheme in Fig. 3 illustrates the use of PCA coupled with ANN and CV technique to constitute the suggested model (ANN-CVPCA) in predicting HPC compressive strength. Thus, in this study, eight models were developed to predict the compressive of high-strength concrete. The input parameters which already discussed were introduced to the PCA approach to get the principal component variables. Accordingly, the first model was developed using one principle component variable while the second model was also established using two principal components only. This process continued until establishing the eight models developed based on principal component variables obtained from the PCA algorithm. These essential variables were introduced to CV and ANN to build the predictive models.

Fig. 3
figure 3

ANN-PCA joins with CV methodology

It is important to mention that the input data were standardized before introducing it to the predictive model based on Eq. (2).

$$ X_{{{\text{new}}}} = \frac{{X - \overline{X}}}{\sigma } $$
(2)

where,\({X}_{\mathrm{new}},\) standardized value, X is the original data value and \(\stackrel{-}{X}\) and \(\sigma \) are mean and standard deviation, respectively.

On top of that, a single-layer feed-forward has been used equipped with a BP algorithm (Levenberg–Marquardt) to train the model and updates the weights and biases. In the hidden layer and output layer, the tangent sigmoid function and linear function have been used, respectively. The codes were written using MATLAB 2018b languages.

Performance evaluation

The data set was utilized to construct a prediction model of the HPC compressive strength. ANN incorporated with the PCA and CV method to constitute (ANN-CVPCA) model for forecasting the HPC compressive strength. Later, the results compared with four different models were carried out in previous studies. The compared models were: the robust modeled called classification and regression tree coupled with support vector machine, linear regression and cross-validation (CART + SVM + LR) (Chou, 2014) and other classical and novel forecasting models such as extreme learning machine (ELM), regularized extreme learning machine (RELM) combined with cross-validation method, and other powerful modeling approached (Al-Shamiri & Yuan, 2020; Chou & Pham, 2013).

To assess the validity of the adopted model (ANN-CVPCA) against the referenced models, different statistical criteria have been applied namely; correlation coefficient (cc), Root Mean Square Error (RMSE), Mean Absolute Error (MAE) and Normalized Root Mean Square Error (NRMSE)(AlOmar, 2020). Refer to Eqs, (3), (4), (5), and (6).

$$ {\text{cc}} = { }\frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {m - m^{ - } } \right)\left( {{ }p - p^{ - } } \right)}}{{\sqrt {\mathop \sum \nolimits_{i = 1}^{n} ({ }m - m^{ - } )^{2} { }\mathop \sum \nolimits_{i = 1}^{n} ({ }m - p^{ - } )^{2} { }} { }}} $$
(3)
$$ {\text{RMSE}} = { }\sqrt {\frac{1}{n}{ }\mathop \sum \limits_{i = 1}^{n} (m - p)^{2} } $$
(4)
$$ {\text{MAE}} = \frac{{\left| {\mathop \sum \nolimits_{I = 1}^{N} \left( {m - p} \right)} \right|}}{n} $$
(5)
$$ {\text{NRMSE}} = \frac{{\sqrt {\frac{1}{n}{ }\mathop \sum \nolimits_{i = 1}^{n} (m - p)^{2} } }}{{m^{ - } }} $$
(6)

whereas the number of samples is represented by n, the measured and predicted data is represented by m and p, while \({{\varvec{m}}}^{-}\) and \({{\varvec{p}}}^{-}\) refer to the mean value of actual and predicted data set respectively. For more detailed analyses, a graphical figure (boxplot) was used to provide more description of the prediction accuracy of each model and helps to get fast and summarized information which is very significant in the evaluation step(Tao, 2021).

Additionally, sensitivity data analyses have been carried out to evaluate the performance of the predictive model (ANN). Sensitivity analysis is an approach used to distinguish between the causes and consequences of the input and output phases. The basic principle of this approach is that the input parameters applied to the ANN system are slightly omitted and the resultant output adjustment is recorded as a percentage.

Results and discussion

In this study, ANN combined with PCA and CV Technique constituted the ANN-CVPCA model that was employed to predict the compressive strength of HPC. PCA method is used to reduce and reconstruct the input variables and then introduce them to ANN. K-fold CV procedure used to separate data into two sets: a training set and testing set where training set make up 75% of the data and 30% of data were employed for testing the functioning of the model. It is important to mention that the trial and error method is used to determine the number of hidden nodes in the single layer of ANN since there is no specific approach can be applied to establish the best possible number of hidden neurons. The capability of artificial intelligence (AI) modeling techniques to predict the compressive strength of HPC is investigated for the set of data used in the current research. For verification purposes, the attained results from the adopted model (ANN-CVPCA) are compared with the previous studies that used CART classification and regression tree + SVM(support vector machine) + LR(linear regression) (Chou, 2014), extreme learning machine(ELM), regularized extreme learning machine( RELM) and other novel predictive models (Al-Shamiri & Yuan, 2020; Chou & Pham, 2013; Hameed & AlOmar, 2020).

K-fold CV method was used to split the data into two phases namely, training set for calibration and testing set were utilized to test the accuracy of the ANN-CVPCA model. Hameed & AlOmar investigate the prediction of compressive strength of HPC using ANN coupled with CV method which is used to select the best quality data set for the training and testing phase (Hameed & AlOmar, 2020). The mentioned study showed that the ANN-CV can forecast efficiently the compressive strength test compared with the Multilinear Regression model. Although this study presented high-accuracy forecasting results, it lacks some important things that may benefit researchers in the field of developing concrete mixtures. One of the most important issues that concrete mixture developers may encounter is to determine which factors with the most significant impact on the compressive strength of high-performance concrete. In this study, the sensitivity analysis method was applied to the ANN model in order to uncover more concrete properties and determine any of the eight variables, mentioned in the methodology part, that have an effective impact on the HPC compression strength. Figure 4 showed the result of sensitivity analyses for the best predictive ANN model. Based on Fig. 4, the ANN showed all variables are significant and highly associated with the target. Moreover, the most two efficient parameters are superplasticizer and cement, respectively. These two parameters made up about 28%. The observation according to this percentage confirms that the sensitivity analysis of each parameter is consistent with the complexity HPC. The relationship between these variables is characterized as highly nonlinear since most input factors have an important influence on the compressive strength of HPC. Therefore, it is very essential to use sophisticated and advanced techniques for getting a comprehensive understanding of the compacted behavior of HPC mixture. Additionally, superplasticizers and cement parameters can play an important role in affecting HPC compressive strength. Therefore, taking into consideration these factors is vital in designing a more efficient HPC mixture.

Fig. 4
figure 4

Sensitivity analysis percentage for each parameter contribute to HPC compressive strength: ANN approach with Cross-Validation (4 k-fold)

Table 2 illustrates the predicted results of the proposed modeling approach. In the first seven models listed in Table 2, the PCA approach was utilized to minimize and compress the original input variables whereas, the PCA approach was used for the reconstruction of the input variables in the last model. It is noticeable, the accuracy of ANN-CVPCA models improved by increasing the number of PCs. For example, when only one PC variable is used as input to ANN, the model led to inadequate predicted values according to statistical criteria (RMSE = 15.33(Mpa), MAE = 12.39(Mpa), NRMSE = 0.43 and CC = 0.18). On the other hand, the performance of ANN-CVPCA is improved when 7 PCs are used and recorded less error (RMSE = 5.33(Mpa), MAE = 4.07(Mpa), NRMSE = 0.16 and CC = 0.95). According to Table 2, it can be observed that M8 with using 8 PCs provides the best accuracy with less error reported in the testing set. For more comparative information, Fig. 5 exhibits a graphical presentation using a boxplot diagram. The mentioned figure summarizes the forecasting accuracy for each predictive model and can be concluded that the M8 has the lowest calculated error among the other eight models. Moreover, Using the PCA technique for compressing the variables and reconstructing them enhance the accuracy of the predictive model and eventually made the current model superior to other forecasting models reported in publication (Chou (2014); Chou & Pham, 2013; Al-Shamiri & Yuan, 2020; Hameed & AlOmar, 2020). The main reason behind the superiority of the pressed model (ANN-CVPCA) is the existence of CV and PCA approaches. Based on the given results, the ANN-CVPCA model was stable and capable to capture nonlinear relations between target and input parameters. Moreover, the two approaches enhance the generalization efficiency and reduce the uncertainty in modeling results. The PCA method improved the prediction capacity of the proposed model by introducing a non-redundant variable to the model hence, providing high accurate prediction estimations. Additionally, comparing the obtained results to other models, reported in other publications, showed an improvement in the accuracy of the current ANN-CVPCA model based on many statistical matrices. The current ANN-CVPCA model outperformed different models such as ANN + CHAID (artificial neural network with chi-squared automatic interaction detector), ANN + SVM (ANN with support vector machine), GENLIN (generalized linear model), and other models have been reported in the literature as listed in Table 2. PCA method played an important role in reducing the sum square errors compared to several powerful models like ANN + SVM, and GENLIN. The evidence of the better performance obtained by ANN-CVPCA is shown based on several statistical measures such as and fewer RMSE (4.64 Mpa) is obtained compared to ANN + SVM (6.174 Mpa) and GENLIN (11.375 Mpa).

Table 2 evaluating ANN-PCA models with cross validation: Testing set
Fig. 5
figure 5

Boxplot diagram describing the comparative analyses for eight different models

Moreover, the improvement in prediction results compared also with previous studies, in terms of reducing sum square error, could also be noticed when the CV method is combined with ANN-CVPCA. Consequently, the proposed model (ANN-CVPCA) also outperforms complicated sophisticated models (ANN and CART (classification and regression tree) + SVM(support vector machine) + LR(linear regression)) (Chou, 2014). The ANN-CVPCA model recorded the uppermost value of the CC and the lowermost value of RMSE (see Table 2). In summary, the used ANN-CVPCA model yielded lower value RMSE than the advanced forecasting model (CART + SVM + LR). Wherein, there is a good and promising improvement in reduction RMSE which reached 9.48%.

Additionally, the ANN-CVPCA model outperformed and achieved an excellent improvement in generating more accurate results in comparison with recent and novel predictive models such as extreme learning machine (ELM), regularized extreme machine (RELM) approaches combined with CV strategy (Al-Shamiri & Yuan, 2020), and other production models which can be summarized in Table 2. It can be seen there was a satisfactory enhancement in minimizing RMSE index, thereby, improving the accuracy of ANN-CVPCA performance by 23.15% and 15.75% over CV + ELM (cross-validation with extreme learning machine) and CV + RELM (cross-validation with regularized extreme learning machine), respectively. The comparison with robust models gave the impression that the suggested model in this study gave satisfactory estimates. Besides, the proposed model was found to produce more accurate prediction than 19 models with had done in previous studies. In general these models gave correlation of coefficient (CC) values ranged from 0.779 to 0.9403. However, the capacity of the proposed model for this study generate excellent prediction with higher value of CC (0.96). CC criteria is considered the most significant parameters than other statistical criteria when it comes to compare a certain model with others were developed in previous studies. The reason behind this fact that the most statistical parameters except CC are highly affected with the range of dataset and normalization approach. Therefore, these criteria such as RMSE, MSE, and so on may give misleading comparisons.

It is believed that removing the correlation between input variables, by using the PCA method, could be an important reason for improving the result. Once the correlation between raw input variables is removed, these variables are transformed as PCs components. Later, the redundant data are generally removed. The performance of the ANN technique seemed to be improved because the forecasting model considered all input variables since they became independent variables that have significant impacts on the targets. Additionally, the CV method could select the proper data set to train ANN, which led to minimizing the sum square error of the predicted values of the HPC compressive strength. Selecting good quality data for both training and testing phases by applying the CV approach is very important for improving the accuracy of ANN.

Figure 6a and b shows the performance of all the forecasted results for the testing phase obtained by the predictive models. It is noticeable that the estimation of the ANN-CVPCA model using eight PCs is less scattered and closer to the trend line compared to other models. For M8, the first PC accounts for about 28.337% of the total variability, and the first 7 PCs can summarize the majority of the input data set and explains about 99.610% of the total variability as shown in Table 3.

Fig. 6
figure 6figure 6

a Comparative and scatter plots between observed and predictive compressive strength test of concrete using different input (PCs) variables: Testing set of 1, 2, 3 and 4 PCs. b Comparative and scatter plots between observed and predictive compressive strength test of concrete using different input (PCs) variables: Testing set of 5, 6, 7, and 8 PCs

Table 3 Descriptive statistics of eight principal components

For a more detailed analysis of the ANN-CVPCA model based on 8 PCs, quantitative analysis has been carried out to identify the performance of ANN in predicting extreme values of HPC compressive strength. Figure 7 illustrates the peak ten predicted values of HPC compressive strength obtained by using ANN coupled with the CV method. The peak predicted readings are very close to actual ones. For instance, the three highest actual (predicted) values of HPC compressive strength are 77.3, 77.3, and 76.80 (73.04, 73.04, and 73.6) MPa. Generally, there is a good match between predicted and actual results. Also, the relative error percentage indicator (RE), which can be expressed by Eq. 7, is generally small (see Fig. 7).

$$ {\text{RE\% }} = 100 \times { }\frac{{\text{actual value - predective value}}}{{\text{actual value}}} $$
(7)
Fig. 7
figure 7

Peak values estimated using ANN-PCA model

Conclusion

This study investigated the ability of the artificial intelligence technique to predict the compressive strength test of HPC based on several input parameters such as coarse aggregate, fine aggregate, water, cement, age as well as some chemical additives like blast furnace slag, fly ash, and superplasticizer. ANN was incorporated with the PCA method and CV constituted (ANN-CVPCA) model, which was utilized for predicting the compressive strength of HPC. The obtained results illustrated that there was a fair match between forecasted and real values of the HPC compressive strength test. The target values of the employed model were compared with four models from the literature that used the same data set. For evaluating and assessing the adopted ANN-CVPCA model, the accuracy of the adopted model was validated against many different models including, (GENLIN, ANN + SVM, CART + SVM + LR), and other recent and novel models like (CV + ELM and CV + RELM) according to serval statistical measures. The result showed that the accuracy of the proposed model (ANN-CVPCA) outperformed all the other models. The adopted model showed superiority in comparison with the conventional and novel models in published literature due to the combination of ANN with PCA as well as the contribution of CV in data organization which eliminated the correlation between input data hence, providing significant and non-redundant information to the model. Moreover, the two approaches, CV and PCA play an efficient role in increasing the generalization ability of the adopted model and reducing the uncertainty in the modeling results compare to other AI models used in this current study. The given results of the suggested model are more stable and accurate. Additionally, the sensitivity analysis has been carried out to provide an important overview of the most important variables that may be significant to develop a concrete mixture thus, attaining a high compression strength. This analysis showed that the superplasticizer and cement are the most effective parameters on the compressive strength of HPC, with 28% effectiveness. This study recommends intensifying the PCA approach with ANN which is trained by bio-inspiration algorithm (Bui, 2018, 2020; Zhang, 2021) to enhance the performance capacity of the hybrid models thereby, obtaining more accurate estimates.