1 Introduction

Design of non-strengthened reinforced concrete (RC) members is an interesting problem for researchers since there are different approaches to their analysis and design. To design non-strengthened RC members, designers should consider two important parameters: shear and flexural failures. When the internal shear forces in RC beams become large, it could lead to shear destruction, and therefore an abrupt brittle fracture might occur. This type of failure is not intended by designers. In fact, when the internal shear force is larger than the flexural ones, the shear strengthening of RC structures has an essential role to stop the shear destruction. Therefore, the measurement of the shear capacity of RC is an important factor to select a suitable method for the strengthening of RC structures (Naderpour and Alavi 2017).

Fiber-reinforced polymer (FRP) is a combination of high-strength fibers and matrix. The matrix binds these fibers to fabricate structural shapes. There are four common fiber types: aramid (AFRP), carbon (CFRP), glass (GFRP) and high-strength steel. Also, two common matrices exist named epoxy and ester (Karbhari 2015; Wu and Eamon 2017). The use of FRP in repairing and rehabilitation of the existing structures or in the construction of new structures has introduced a new area in civil engineering (Cai and Aref 2015; Xu et al. 2019). In addition to increasing the speed of the construction and creation of continuous confinement for the concrete member, FRP increases the member’s resistance to corrosion without changing the shape or size of the cross section (De Lorenzis and Tepfers 2003). Therefore, FRP sheets are widely used in the world for the aim of repairing or retrofitting the structural members (Colomb et al. 2008; Niroomandi et al. 2010; Ozcan et al. 2008; Promis et al. 2009).

There are enormous researches on the behavior of the RC members retrofitted by FRP sheets. Berset (1992) has carried out preliminary research on the shear capacity of beams strengthened with FRP sheets. The study presented an analytical model based on some experimental data to predict the shear capacity of FRP in the strengthened reinforced concrete beams. Also, Uji (1992) has proposed a model to estimate the shear contribution of CFRP in the strengthened reinforced concrete members. In a similar way, Dolan et al. (1993) have been studied different types of composites (i.e., AFRP, CFRP, and GFRP) in the improvement in the shear capacity of concrete members. The effect of different types of FRP on the improvement in the shear capacity of concrete members has been widely investigated by various researchers. Three different bounded and unbounded FRP wraps have been also studied by Teng et al. (2009) in order to improve the shear capacity of RC beams. In addition, several CFRP sheets have been used by Gamino et al. (2010) to improve the load capacity and ductility of concrete members. Moreover, the effect of strip-width-to-strip-spacing ratio on the behavior of the RC beams retrofitted by FRP composites has been investigated (Mofidi and Chaallal 2011). Furthermore, Li and Leung (2016) have evaluated the effect of shear span–effective depth ratio on the shear capacity of the retrofitted RC beams by full-wrapping FRP strip.

Nowadays, intelligence approximate systems such as the artificial neural network (ANN), adaptive neuro-fuzzy inference system (ANFIS), genetic programming (GP) and so on have provided a special tool to solve the problems in engineering (e.g., FRP-strengthened concrete structure, water resources management, bridge engineering, structural optimization, vibration control, etc.).Various researchers work on these fields (Abualigah and Khader 2017; Abualigah et al. 2018a, b, c, 2019; Abualigah 2019; Abualigah and Hanandeh 2015; Al-Janabi 2018; Al-Janabi et al. 2019a, b; Al-Janabi and Alkaim 2019; Al-Janabi and Alwan 2017; Al-Janabi and Mahdi 2019; Ali 2012a, b, 2013; Alkaim and Al-Janabi 2019; Ebrahimpour Komleh and Maghsoudi 2015; Hüsken et al. 2005; Kaghed et al. 2006; Kamgar et al. 2018; Mahdi and Al-Janabi 2019; Salman and Ali 2007). These algorithms are applicable to the complex problems with time-consuming mathematical solutions and the experimental problems with no clear logical relation between their parameters. A review of the literature shows the ability of ANN, ANFIS and GP to predict the behavior of the studied systems because of their positive features. These features are: (I) controlling the existing uncertainties between a large number of complex data, (II) finding the sensitivity between the inputs data and finally (III) devoting a relation between input and output data to predict the behavior of the studied system. Proposing an optimized formula is the main advantage of GP in comparison with ANN and ANFIS. In ANN and ANFIS, no formula or algorithm is presented. They are a trained black box to predict the targets. The proposed formula by GP could be used with a calculator or Excel file. But, it is worth to note that a computer with professional software such as MATLAB should be used for applying the ANFIS and ANN.

The determination of the shear capacity of RC beams is still a big challenge. This complexity is even enhanced when FRP sheets are applied to augment the shear capacity of RC beams (Sas et al. 2009). The total shear capacity of RC beam can be computed by adding the contributions of the concrete (Vc), the steel stirrups (Vs) and the FRP sheets (Vf). So far, numerous numerical studies have been provided to produce a mathematical equation for the estimation of the shear contribution of the FRP sheets. These equations are used in the guidelines such as CIDAR, Fib.TG9.3, ACI and CSA. A comprehensive comparison was presented by Folani (Sas et al. 2009) to evaluate the reliability of the proposed analytical equations. They illustrated that the proposed equations have considerable errors when they are compared with the experimental results. This proves that the use of intelligent approximation systems in this field could be very effective. Also, Tanarslan (2011) proposed a model to predict the shear contribution of FRP using ANN based on the experimental data. Also, Naderpour and Alavi (2017) predicted the shear capacity of the FRP sheets using ANFIS.

In this paper, multigene genetic programming (MGP) is used to propose a formula for the prediction of the shear capacity of FRP in strengthened RC beams. For this purpose, four steps including data gathering, inputs definition, MGP training and comparing are done. These steps are briefly shown in Fig. 1. Also, the sections of the paper are organized as follows. In the second section, the conducted studies about the application of ANN, ANFIS and GP to predict the capacity of elements combined with the composite material such as FRP are presented. In the third section, the MGP theory, construction, limitation, hypothesis and definitions are presented. Then, MGP is trained using the experimental data, and a formula is produced. In the fourth section, the error of the proposed formula is investigated in comparison with the actual values. Also, the MGP model is compared with the ANFIS model proposed by Naderpour and Alavi (2017). In the fifth section, the equations proposed by various researchers and guidelines, and their parameters are defined. In the sixth section, the results of MGP and the previous equations are compared with each other, and their errors are calculated. The results prove the excellent performance of the MGP model.

Fig. 1
figure 1

Main conducted steps

2 Related work

As mentioned, predicting the capacity of composite elements is complex. The conducted studies have indicated that the ability of intelligence approximation systems to predict the strength capacity of composite elements is more than the ability of formulations obtained by experiences. A summary of these studies is presented in Table 1. Mousavi et al. (2010) have been presented a formulation to predict the compressive strength of carbon fiber-reinforced polymer-confined concrete cylinders using GP, simulated annealing (SA) and multi-expression programming (MEP) methods. The linear genetic programming (LGP) has been used by Gandomi et al. (2011) to evaluate the shear resistance of steel fiber-RC beams (SFRCB). The model has been trained using the experimental results including normal and high-strength concrete beams. Kara (2011) presented a simple improved model to predict the shear strength of concrete slender beams without stirrups and reinforced with FRP bars using GP. Lee and Lee (2014) used ANN to predict the shear strength of RC flexural members reinforced with FRP bars. A new formulation has been also presented to predict the curvature ductility factor of FRP-strengthened reinforced high-strength concrete beams by Ebrahimpour Komleh and Maghsoudi (2015) using ANFIS and multiple regression methods. Also, GP has been utilized by Kalfat et al. (2016) to present a formulation for the anchorage strength model. This model can be used to predict the bond performance of FRP which leads to a better performance in retrofitting the structures. In the same way, Al-Mosawe et al. (2017) have used GP to present a formulation to predict the bond strength of CFRP-steel double-strap joints subjected to the impact loads.

Table 1 Overview of the related work

The prediction of the shear contribution of FRP in retrofitted RC beams was studied by Tanarslan (2011) and Naderpour and Alavi (2017). Tanarslan (2011) predicted the shear contribution of FRP by using ANN. The model was developed using a database containing 103 test results of FRP-RC beams including rectangular and T-shaped beams. The model was trained using 83 data sets, and then it was tested with 20 sets of test data. Guideline equations, Fib14, ACI 440.2R, CIDAR, CNR-DT 200 and CHBDC were selected for evaluation the ANN model. Naderpour and Alavi (2017) approximated the shear contribution of FRP by using ANFIS. They used a data set that includes 89 samples of rectangular RC beams strengthened with FRP sheets for training and testing the ANFIS model. The model was trained using 76 data sets (85%) and then tested using 13 test data sets (15%). The proposed model was compared with the existing guideline equations such as Fib-TG9.3, CIDAR, ACI 440.2R-08 and CSA-S806(12).

As shown in Table 1, two recent works are completely related to this study. In these studies, it proved that the use of the artificial intelligence methods (i.e., ANN and ANFIS) could be effective in the prediction of shear capacity of FRP sheets. The accuracy of the artificial intelligence in comparison with the formulas provided by guidelines is very suitable. In the study conducted by Tanarslan et al. (2012), some of the samples are belonged to the T-shape beams and contain the mechanical anchors, and they cause errors in their approximation and prediction. The ANFIS model presented by Naderpour and Alavi (2017) was trained based on the rectangular beams with no additional mechanical equipment such as anchors. It caused the errors of ANFIS reduced in comparison with the ANN model. However, in both models, no formula was presented and the trained systems (ANN and ANFIS) were a black box while the result of a trained GP model is a formula for predicting the output based on the inputs.

3 Multigene genetic programming

3.1 Theory and construction

In 1954, the genetic program (GP) was introduced by Nils Aall Barricelli (Koza 1992). GP is an evolutionary algorithm and a learning machine to generate a function between the inputs and output. This generation is done using the Darwinian principles in survival and reproductive theory and with genetic operators for the mating formulas (Koza 1994). If it is possible to have different formulas and relationships to solve a problem, GP is a system for creating a function to solve the problem. The diagram of GP for generating a formula (a function of a computer program) is indicated in Fig. 2.

Fig. 2
figure 2

Diagram of GP

GP creates a function based on tree form. Each tree (gene) has a value of depth that defines the length of the function. In multigene genetic programming, each individual includes a number of weighted genes (d1, d2, …, dn) and a bias term (d0) (Brameier and Banzhaf 2007). Each tree (gene) involves the input variables (x, y, z, etc.) and predefined mathematical operators (see Fig. 3).

Fig. 3
figure 3

A sample of multigene genetic programming

In the first step of MGP, the initial population is created based on the predefined values for depth, the number of genes and mathematical operators. This generation in the first iteration is random. The number of individuals in each generation is based on the population size. There are three generation methods named full method, growth method and half-and-half method (see Fig. 4). The half-and-half method applies the full method for 50% of the population, and for the other 50% the growth method is employed (see Fig. 4) (Nedjah et al. 2009).

Fig. 4
figure 4

Full and growth method (max depth = 3)

The root-mean-squared-error (RMSE) between the output of MPG and real values are calculated as a fitness function. Based on the fitness function, the individuals are collected for the next generation. The tournament selection method is used to choose individuals for the next generation. This method randomly selects individuals from the population. Individuals participate in a tournament based on the best fitness function. The tournament size defines the number of individuals that have participated in the tournament (Brameier and Banzhaf 2007; Searson et al. 2010).

Three genetic operators, including crossover, mutation and direct copy are applied to create the next generation. The crossover operator consists of low- and high-level forms. In the low-level form, two individuals are selected as parents (Searson et al. 2010). A tree from each parent is selected and a branch of them is swapped, and then the offsprings are created (see Fig. 5). In high-level form, one or more genes are swapped between the parents for the creation of the offsprings (see Fig. 6). In mutation operator, some parts of genes in each individual are randomly changed (see Fig. 7). These changes include: (a) subtree replacing, (b) switch terminals and (c) Gaussian perturbation constant. In the direct copy operator, the individual is transferred to the next generation without any change. These operators have a probability of occurrence for the individuals. The crossover is the main and stable operator in GP as in a reproductive of animals and humans. The increase in the probability of the mutation and direct copy leads to errors enhancement in approximation. So, the probability of the crossover should be considerably more than the mutation and direct copy operators (Nedjah et al. 2009; Searson et al. 2010).

Fig. 5
figure 5

Low-level crossover operator

Fig. 6
figure 6

High-level crossover operator

Fig. 7
figure 7

Mutation operator

Using the selected individuals and the genetic operators, the next generation is created. The number of individuals in each generation should be equal to the population size. This process is iteratively performed based on the number of generations. Simultaneously, the weights of trees (genes) in each individual are modified by the least squares procedure (Searson et al. 2010). The algorithm of GP is presented in Fig. 8.

Fig. 8
figure 8

Multi-genetic programming algorithm

3.2 Limitations and hypothesis

The aim of the proposed MGP is to create a formula that is easy to use and has acceptable accuracy. Also, the formula should be compared with the ANFIS model presented by Naderpour and Alavi (2017). In addition, the training of MGP is based on the experimental results presented in past studies for rectangular beams. Based on these reasons, the following hypothesis is considered here:

  1. 1.

    Since there are extensive experimental data for FRP sheets in 45° and 90° (two common used degree in practice), the obtained formula is valid for these degrees.

  2. 2.

    The proposed formulation can be used to predict the shear capacity of FRP in strengthened reinforced concrete beams for all full and U-wrap models with a single equation.

  3. 3.

    In deriving the proposed formulation, only some of the mathematical operators (not all of them) have been used in MGP models which results in a formula with high accuracy and ability to calculate manually. So, the use of bi-sectional commands or If-then (conditional) commands is prevented.

  4. 4.

    Given that the proposed model should be compared with the ANFIS model, the samples selected for training and testing are in accordance with the study conducted by Naderpour and Alavi (2017).

3.3 Variables definition

Herein, the parameters of MGP are set. The defined values are based on the trial and errors to obtain the best results. The input variables are presented in Table 2. The training and testing data are defined according to Table 9 (see “Appendix”). For comparing the MGP model (in this study) and ANFIS model (Naderpour and Alavi 2017), the input variables and test data are selected according to the model proposed by Naderpour and Alavi (2017). The defined values of the MGP parameters and the used mathematical operators are presented in Tables 3 and 4, respectively.

Table 2 Introducing data and input variables
Table 3 Defined parameters in MGP
Table 4 Defined mathematical operators in MGP

3.4 Results

The values of RMSE for the best formula in each generation are shown in Fig. 9. The results illustrate that RMSE in the first iteration is equal to 40.43, while it reduces to 3.66 in the 300th iteration. The genes of the best formula at the end of the training procedure are presented in Table 5. As shown, the number of genes has reached 67. It should be noted that the presented genes in Table 5 are the weighted genes (dn× treen). The tree form of Gene17 is indicated in Fig. 10. It can be seen that the depth of the tree has reached 10. The weight of Gene17 is equal to − 30.37. The formula obtained by MGP has been presented in Fig. 11.

Fig. 9
figure 9

Values of RMSE during MGP process

Table 5 Genes of the proposed formula by MGP
Fig. 10
figure 10

Tree form of Gene 17

Fig. 11
figure 11

The proposed formula by MGP

4 Comparing the MGP and ANFIS models

In this section, the proposed formula using MGP is compared with the ANFIS model proposed by Naderpour and Alavi (2017). For this purpose, the values predicted by MGP and ANFIS are plotted in Figs. 12 and 13 for the aims of training and testing data, and they have been compared with the actual values. The results illustrate that the MGP model has a good agreement with the actual values. In addition, the errors of the MGP and ANFIS models are calculated and compared in terms of mean absolute percentage error (MAPE), absolute fraction of variance (R2) and root-mean-squared-error (RMSE) according to Eqs. (1)–(3).

Fig. 12
figure 12

Comparison between the MGP, ANFIS and actual outputs for training data

Fig. 13
figure 13

Comparison between the MGP, ANFIS and actual outputs for testing data

$$ {\text{MAPE}} = \frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {\frac{{V_{{{\text{f}}({\text{predict}})}} - V_{{{\text{f}}({\text{experiment}})}} }}{{V_{{{\text{f}}({\text{experiment}})}} }}} \right)} \times 100 $$
(1)
$$ R^{2} = 1 - \frac{{\sum\nolimits_{i = 1}^{n} {\left( {V_{{{\text{f}}({\text{predict}})}} - V_{{{\text{f}}({\text{experiment}})}} } \right)^{2} } }}{{\sum\nolimits_{i = 1}^{n} {\left( {V_{{{\text{f}}({\text{predict}})}} } \right)^{2} } }} $$
(2)
$$ {\text{RMSE}} = \sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {V_{{{\text{f}}({\text{predict}})}} - V_{{{\text{f}}({\text{experiment}})}} } \right)^{2} } } $$
(3)

As shown in Table 6, the values of MAPE and RMSE for the training data of the MGP model are reduced approximately by 60% in comparison with those of the ANFIS model. Also, these reductions are about 48% (MAPE) and 28% (RMSE) for the whole data.

Table 6 Values of RMSE, R2 and MAPE errors for MGP and ANFIS

Therefore, it can be concluded that MGP can predict the actual shear capacity of FRP by the proposed formula with reasonable accuracy. Proposing a formula is the main characteristic of MGP in comparison with ANFIS and ANN. It helps MGP to become very usable while ANFIS is a black box that cannot present any equation or algorithm. In fact, ANFIS should be used in computing for new samples. Also, ANFIS should be applied by software such as MATLAB or other programs. But, the proposed formula by MGP could be utilized easily using a calculator and simple software like Microsoft Excel.

5 Overview of some existing models

There are some formulas which can be used to approximate the shear capacity of FRP in strengthened reinforced concrete beams. In these formulas, some parameters are used as the input parameters [i.e., elasticity modulus of FRP (Ef), effective strain of FRP (εfe), ultimate strain of FRP (εfu), FRP reinforcement ratio (ρf), concrete compressive strength (fc), FRP thickness (tf), effective depth of RC beam (d), minimum width of RC beam over the effective depth (bw), FRP strips width (wf), FRP strips spacing (sf), effective stress of FRP (ffe), ultimate stress of FRP (ffd), angle of diagonal crack with respect to the member axis (θ) and the angle between fiber orientation and longitudinal axis member (α)]. The existing formula to compute the shear capacity of FRP in strengthened reinforced concrete beams is listed in Table 7. Also, Fig. 14 shows a beam strengthened by FRP sheets and the parameters that have been used to compute the shear capacity of FRP.

Table 7 Some of the existing and developed formulas to predict the shear capacity of FRP in strengthened RC beams
Fig. 14
figure 14

A strengthened beam using FRP sheets

6 Comparison of the proposed formula with several existing formulas

To show the ability of the proposed formula, four common guidelines (ACI Committee 440 2008; CIDAR 2006; CSA-S806-12 2012; Fib-TG9.3 2001) are considered, and the results are compared. It should be noted that the existing guidelines (ACI Committee 440 2008; CIDAR 2006; CSA-S806-12 2012; Fib-TG9.3 2001) have two different formulas based on FRP-configuration in RC members (full-wrap and U-wrap), while the proposed formula by MPG covers these two configurations by a single formula. It can be considered as a positive point of the proposed formula. The values of shear capacity of FRP obtained by the proposed and the existing formulas are shown in Fig. 15. Also, Table 8 provides the values of MAPE, RMSE and R2 for the studied formulas.

Fig. 15
figure 15

Comparison between MGP and experimental results for the shear capacity of FRP

Table 8 Compression between the values of MAPE, RMSE and R2 for the different studied models

Based on Table 8, the values MAPE, RMSE and R2 for the proposed formula are 8.92%, 7.16% and 0.989%, respectively. These values for the CSA S806, ACI 440.2R, CIDAR and Fib-TG9.3 methods are more than 35.91%, 25.06% and 0.302%, respectively. This indicates that for the proposed formula, the simulated results are exactly being consistent with the experimental results more than 91%. This shows the accuracy of the proposed formula while the accuracy of other existing formulas is lower than 64%. It needs to be noted that the proposed formula is able to predict the shear capacity of FRP with a different configuration for FRP (full-wrap and U-wrap). In fact, it can be concluded that the proposed formula has a higher precision compared with the existing formulas in the prediction of FRP shear capacity.

7 Conclusion

In this paper, the MGP method has been used to predict the shear capacity of FRP in strengthened RC beams. For this purpose, a set of previously published and available experimental data (89 instances) have been collected. Then, 85% of the data (76 instances) has been used for training the MGP and 15% (13 instances) for testing. Finally, a new formulation has been proposed utilizing MGP. The correlation coefficients of the proposed formula are equal to 0.997, 0.956 and 0.989 for training, testing and whole data, respectively. The results make it obvious that the MGP model has a good agreement with the actual values. To study the accuracy of the proposed formula, the results obtained by MGP have been compared with those of the ANFIS model. The errors of the MGP and ANFIS models have been calculated and compared in terms of MAPE, R2 and RMSE. The results indicated that the errors were decreased in the MGP model for training and whole data in comparison with the ANFIS model. This decrease in training step was about 62% for MAPE error. For the whole data, MAPE and RMSE errors were reduced by about 48% and 29%, respectively. But the errors for testing data in the ANFIS model were less than the MGP model. The MAPE and RMSE errors in the ANFIS model were, respectively, reduced by about 36% and 33% in comparison with the MGP model. Furthermore, the proposed formula has been compared to the formulas from the available guidelines including CSA-S806, ACI 440.2R, CIDAR and Fib-TG9.3. The results reveal that the formula proposed by MGP is able to predict the shear capacity of FRP with higher precision compared to the existing formulas. The equations CIDAR and Fib.TG9.3 had more accuracy in the prediction of the shear capacity of FRP in comparison with the ACI and CSA equations. The MAPE error for the MGP formula was reduced by about 74% in comparison with the CIDAR equations. Also, the RMSE error of MGP formula was decreased near 71% in comparison with the Fib.TG9.3 equations.

Besides, it is shown that the MGP is a formula which could be used by a calculator, while the ANFIS model should be applied by a computer and professional programs. Also, the MGP formulation could be used for both full and U-wrap FPR models, while CSA S806, ACI 440.2R, CIDAR and Fib-TG9.3 have two different formulas for full and U-wrap models. There is a low experimental test on the retrofitting of the beam sections with FRP sheets by angle 30°, 60° and 75° that could be tested experimentally to propose a formula that covers all over the region of FRP angles. It can be worked in the future.