1 Introduction

Retrofit of the existing concrete buildings and bridges is an urgent need worldwide. The retrofit is conducted to improve the seismic resistance to meet newly developed design codes or to upgrade the load carrying capacity against the increased gravity load. Fiber-reinforced polymer (FRP) composites have been extensively applied in retrofitting of structures and are known as a superior technique comparing other traditional strengthening methods due to their lightweight, high strength, easy installation, and excellent durability. In order to enhance flexural or shear strength of reinforced concrete beams and slabs by means of FRP, there are two methods of application as follows. First, the FRP sheets/plates are bonded to the external surface of the concrete, that is, called the externally bonded reinforcement (EBR) [1]. Second, FRP rods/strips are mounted into the groove cut near the surface of the concrete and filled with the resin. The latter is briefly called as near-surface-mounted (NSM) FRP reinforcement [2], and its geometric properties are illustrated in Fig. 1. The NSM technique is newer than EBR method and is less investigated. The advantages of NSM over EBR are considered as follows: the protection of FRP bars against vandalism or environmental effects as a result of its embedment in the concrete cover [3] and the removal of need for preparation of concrete surface prior to FRP installation [4]. The bond capacity and failure mode of NSM FRP are affected by variety of factors including mechanical properties of FRP, epoxy, and concrete; surface treatment and dimensions of FRP rod/strip; and groove depth and width [5].

Fig. 1
figure 1

Details and geometrical characteristics of NSM FRP technique

It is of importance to engineers to accurately estimate the bond capacity of NSM FRP so as to take full advantage of materials and to avoid debonding failure. The bond capacity here refers to the pullout strength of NSM FRP bonded to a concrete block. Unlike the EBR method, very few models have been reported in the literature to predict the bond strength of NSM FRP technique. The existing models on the pullout capacity of NSM FRP have been developed by regression of experimental results combined with a fracture mechanics-based approach or outcomes of finite element models [6,7,8]. The available models on NSM differ from each other in the way they incorporate a number of parameters such as concrete strength or groove dimensions. Besides, the mechanical properties of bonding agent have not been accounted for in some models. Moreover, some of the models were developed only for FRP strips and thus are not applicable to FRP rods.

With the aim of developing an alternative and reliable method to predict the pullout strength of NSM FRP rods and strips, the fuzzy logic approach is employed in this paper. The approach of fuzzy logic has not yet been applied to the present topic. The main advantage of fuzzy logic system stems from its interpretability [9, 10], which may not be pronounced in other neural computing techniques. Neuro-fuzzy methods have been applied to the field of concrete [11, 12] and FRP-wrapped concrete [13]. The Takagi–Sugeno fuzzy inference system has been already employed by the author [14, 15] in other applications of FRP in concrete structures. The Mamdani type of fuzzy logic model has been used in the field of concrete [16] and FRP applications [17, 18]. Also, fuzzy logic expert systems have been utilized recently to evaluate other design issues related to NSM FRP such as deflections and cracks of concrete beams strengthened by NSM FRP [19, 20]. In the present work, the two available types of fuzzy logic systems, namely Mamdani and Takagi–Sugeno (or Sugeno for short), are investigated. Classification decision tree is used to develop Mamdani model, whereas subtractive clustering is employed to derive Takagi–Sugeno model. Both fuzzy models are inferred from 384 experimental data on the direct pullout tests of NSM FRP, which have been collected from the open literature. Detailed procedures to develop each of the fuzzy models are explained, and the outcomes of the proposed fuzzy models are compared with the predictive equations published in the literature.

2 Experimental database

The experimental data, which are employed in this paper to develop fuzzy models, contain 384 direct pullout tests on NSM FRP rods or strips bonded to concrete blocks. The test data were assembled from several experimental campaigns reported in the open literature and were conducted by different researchers [3,4,5,6,7, 21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39]. Since all experimental data used here are extracted from papers published in the well-known journals, it is believed that the data are reliable and the instruments are accurate enough.

As shown in Fig. 1, the pullout test contains a prismatic concrete block in which a groove is cut. The FRP strip/rod is inserted in the groove which is later filled with the resin. After the resin hardens, the FRP rod/strip is subjected to the tensile force exerted by a jack, while the concrete block is restrained by appropriate supports at a face perpendicular to the direction of the applied force. The concrete block may be placed horizontally, or alternatively, vertically in the test setup during the loading test. The main characteristics of the pullout test (as illustrated in Fig. 1) are common among the experimental campaigns employed here. Thus, the testing conditions for determining the pullout capacity are similar among the studies considered herein.

The detailed information on the experimental data including the geometrical/mechanical properties of FRP bars/strips, groove, epoxy, and concrete as well as other testing conditions such as the experimental pullout capacity and bonded length is presented in “Appendix 3.” The FRP reinforcements used in the specimens were different in cross-sectional shape (round bar, and square or narrow strip), surface treatment (smooth, sand coated, or spirally wound), and type of the fiber (carbon, glass, or basalt). The majority of NSM FRP in the test samples was of CFRP type. In the case that some samples were duplicated, only one test sample with the minimum pullout capacity among the replica samples was taken into account. By putting aside the duplicate samples, the whole database decreased from the original 384 experiments to 222 test specimens. Besides, 78 specimens out of 222 samples were FRP strips, and the rest were rounded bars (i.e., rods). The statistics of properties for the experimental database are listed in Table 1.

Table 1 Statistics of the experimental data

3 Selection of input variables for fuzzy logic models

Selection of input parameters is a major stage in developing a fuzzy logic model. According to the NSM FRP bond models currently available in the literature [7, 8], the parameters assumed to have an influence on the bond capacity are as follows: the bonded length (L), concrete compressive strength (fc′), the axial rigidity of FRP rod or strip (AfEf), and the groove depth-to-width ratio (Dg/Wg). In the case of NSM strips, another parameter representing the length of debonding failure plane, which is defined as summation of three side lengths of the groove [8] or sum of twofold of strip width and its thickness plus 4 mm [7], is also considered in some models. Besides, the FRP tensile strength, though not affecting the bond capacity, is used in certain models to separate the FRP rupture failure from the bond failure mode. Furthermore, the rod/strip surface treatment has been considered as a test variable in experimental papers [24, 33]. However, the surface treatment, due to its qualitative characteristics, is not included in the mathematical expressions of the existing bond models. Another important factor is related to the mechanical properties of the adhesive used between NSM FRP and concrete. Among different properties of adhesive, its tensile strength (fe) has been reported in majority of the experimental campaigns.

It is also noted that some of the existing NSM bond models have been developed only for FRP strips and not for rods, whereas the present paper is aimed at proposing a model which is applicable to both cases of strip and rod. In order to distinguish between strip and rod, a quantitative term is introduced herein as the ratio of the normalized perimeter to the normalized cross-sectional area of the NSM reinforcement, denoted as Pnorm/Anorm, in which the perimeter and cross-sectional area are divided by their own maximum values. In the experimental database used in the present paper, the maximum value for the rod/strip perimeter and area is 45.5 mm and 132.7 mm2, respectively. It is obvious that the value of Pnorm/Anorm for the rod shape is smaller than that for the strip shape.

Regarding the above-mentioned explanations, six independent variables are selected as inputs for the proposed fuzzy models, as follows: the bond length (L), concrete compressive strength (fc′), the axial rigidity of FRP rod/strip (AfEf), the groove depth-to-width ratio (Dg/Wg), tensile strength of epoxy (fe), and the ratio of Pnorm/Anorm. The statistical descriptors of these variables for the experimental database employed in the current paper are listed in Table 1.

4 Overview of fuzzy logic-based models

In order to derive a relationship between the input variables (namely, L, fc′, AfEf, Dg/Wg, fe, and Pnorm/Anorm) and the output (i.e., the pullout capacity, Pf), fuzzy logic approach is utilized in this paper. The objective is to develop fuzzy logic models inferred from the available experimental data to predict the pullout capacity of NSM FRP rod/strip bonded to concrete. A fuzzy logic model consists of several rules; each has an antecedent together with a consequent part. The antecedent is composed of some membership functions; each assigned to an input variable and are interconnected by use of logical operators such as AND or OR [40]. The role of membership function is to map the value of an input variable to a number in the range of 0–1, expressing the fuzzy nature of the model. Depending on the format of the consequent part, there are two types of fuzzy models, namely Takagi–Sugeno fuzzy models [41] and Mamdani fuzzy models [42].

In the present paper, both Mamdani and Takagi–Sugeno (or Sugeno for short) fuzzy models are applied to the problem in hand. The merit of Sugeno model, as developed herein, lies in its closed-form presentation of the pullout capacity. On the other hand, Mamdani model is proposed here with the aim of providing highly interpretable rules. In order to develop models and run the related codes, MATLAB is used in the present paper. The different steps in building both fuzzy models are explained in the followings.

4.1 Development of Mamdani fuzzy model based on decision tree

In the current paper, the main purpose from developing a Mamdani-type model is to enhance the interpretability of the fuzzy logic system. Interpretability is known as the advantage of fuzzy modeling as compared to other soft computing techniques such as neural network [9, 10]. Interpretability of fuzzy model means that an expert can easily understand the rule base of the model and modify or extend it if needed. It is particularly important when the model is inferred from data to ensure that the reasoning is understandable to the user, as opposed to the case that the automatic rule induction through supervised learning may result in the models containing meaningless coefficients obtained only on the basis of minimizing error between experimental output and model prediction. The interpretability can be achieved at two levels: membership function and rule induction technique, which will be discussed as follows.

4.1.1 Selection of membership functions by uniform partitioning the input/output space

A simple approach to reach an interpretable membership function is to divide the practical range of the variable into several equal partitions. The number of partitions is the same as the number of membership functions whose coordinates are determined based on the length of each partition, as will be discussed later. The practical range of each variable is presented in Table 1 according to the available experimental database. In here, the range of all input variables is divided into five equal intervals except for the variable Dg/Wg, which is partitioned into three equal intervals. The number of partitions can be decided by looking at the histograms for distribution of experimental data, as demonstrated in Fig. 2 for Dg/Wg and fc′ as examples of input parameters. For instance, according to Fig. 2, there are gaps in the experimental data points in the range of Dg/Wg, suggesting that three divisions would be sufficient to cover the whole range of this input variable, whereas for fc′, five partitions are considered due to a relatively uniform distribution of data points within the range of fc′. It goes without saying that increasing the number of partitions may result in a more complex model. In the next step, membership functions, which correspond to different linguistic concepts, are constructed. In the case of three partitions for Dg/Wg, the linguistic classes of low (L), medium (M), and high (H) are considered as illustrated in Fig. 3. Also shown in Fig. 3, for fc′ with five partitions, the linguistic classes of very low (VL), low (L), medium (M), high (H), and very high (VH) are defined. As for the shape of membership functions, trapezoids are assigned to the first and last classes, while triangles are allocated to the intermediate classes (see Fig. 3). Triangular and trapezoidal shapes are the most common membership functions reported in the literature [9, 10, 17]. It is also noted that the right corner and top and left corner of a triangular membership function lie in the midpoint of three consecutive partitions, respectively, resulting in overlapping parts between two successive membership functions. As for the trapezoidal functions, the top base spans between the midpoint of the first (or last) partition and the minimum (or maximum) value in the range of the considered input variable. In order to build the membership functions for the model output (i.e., Pf), a similar procedure is employed except that six intervals are considered in order to achieve a higher accuracy in prediction of the bond capacity. The linguistic terms for Pf are designated as very low (VL), low (L), almost low (AL), almost high (AH), high (H), and very high (VH), as demonstrated in Fig. 3. The coordinates of each membership function are presented in Tables 2 and 3 for the inputs and output, respectively. The membership functions, as described above, are constructed only by use of the given range of each variable without any need for optimization or other a priori assumptions, leading to enhance the interpretability of the model.

Fig. 2
figure 2

The statistical distributions of Dg/Wg (top) and fc′ (bottom) in the experimental database

Fig. 3
figure 3

The membership functions and linguistic labels for Dg/Wg, fc′, and Pf

Table 2 The coordinates of membership functions for the input variables
Table 3 The coordinates of membership functions for the output variable

4.1.2 Rule induction by classification decision tree and its application to the present problem

Another aspect of interpretability of fuzzy model is related to the rule induction method. As a technique to induce fuzzy rules, the classification decision tree has been suggested by several researchers [10, 43]. In this paper, the rule base of the proposed fuzzy model is extracted from the binary decision tree, which is employed for classification of the experimental observations in terms of the linguistic labels considered for the variables. First, each variable should be converted to a linguistic label, i.e., the so-called fuzzification process. Given a value for an input or output variable, the linguistic label corresponding to that is determined based on its value falling in which partition of the variable range. It is recalled that the range of each variable is partitioned into a number of equal intervals. For instance, if a given value of fc′ is within the third partition out of five partitions of the experimental range of fc′, then the linguistic label of medium (M) is assigned. In other words, among the considered membership functions for fc′, the one denoted by medium (M) yields the largest degree of membership for the given fc′.

The experimental database used in this study contains 222 numerical data points, as mentioned earlier. However, if these data are expressed in terms of the relevant linguistic labels, 165 unique data remain, while the rest are duplicate. Next, the algorithm of decision tree [44] is applied to the 165 labeled experimental data. It is also noticed that the membership labels (and not the numeric values) of the experimental data are employed in developing the decision tree and that no parameter is tuned during the model development. The idea is to select an input variable and choose one of its linguistic labels as a splitting level, which divides the database into two parts: One is larger or equal than the splitting level, and the other is less than that. The criterion for selection of the best variable and its splitting level is to maximize the information gain (IG) in a node, defined as follows:

$${\text{IG}}\left( {\text{node}} \right) \, = G\left( {\text{node}} \right) \, {-}G_{\text{L}} \times w_{\text{L}} {-}G_{\text{R}} \times w_{\text{R}}$$
(1)

where a node refers schematically to a point in the decision tree which shows the selected input variable with its splitting level and G stands for Gini index and is calculated for either a node [i.e., G (node)] or the left/right branch of the considered node (i.e., GL or GR) with the following formula:

$$G\left( {\text{node}} \right) \, = 1 - \mathop \sum \limits_{i = 1}^{6} \left( {\frac{{n_{i} }}{n}} \right)^{2}$$
(2)

where ni is the number of experimental observations whose outputs’ level is the ith linguistic label. In the current paper, i = 1, 2, …, 6, which refer to the linguistic labels of VL, L, AL, AH, H, and VH, respectively. Also, n is the total number of the observations in the node, that is, \(= \,\sum_{i = 1}^{6} n_{i}\). The Gini index [45] is a measure of impurity of classification in a node, meaning that a node, in which all observations belong to only one class of output, will have a Gini index of equal to zero, that is, a pure node. In Eq. (1), wL (or wR) refers to the ratio of the number of observations in the left (or right) branch to that in the original node (wL + wR = 1).

The algorithm of the classification decision tree starts by sorting the experimental observations based on the ascending order of the linguistic labels of each input variable. For example, the linguistic labels of fc′ in the ascending order are as follows: VL, L, M, H, and VH. Then, any linguistic label is examined as a potential splitting point. Thereby, IG from Eq. (1) is calculated for each sorted input variable and each linguistic label as a potential splitting point. The final decision on selection of the best input as well as the best linguistic label for the splitting level is made on the basis of the maximum value of IG which is obtained. Once an optimum input with the corresponding splitting label (such as, fc′ as an input and [M] as the splitting level) is selected, the database is divided into two parts designated by the left and right branches. As an example, a branch designated by fc′< [M] refers to the values of fc′ belonging to linguistic labels of either [VL] or [L], whereas the branch on opposite side is allocated to fc′ ≥ [M], referring to the linguistic labels of either [M], [H], or [VH]. Then, the above algorithm is repeated for the experimental observations belonging to the left (or right) branch. Again, in each branch, an optimum input variable with the best splitting label is determined by maximizing the value of IG. By proceeding the algorithm, the number of branches and eventually depth of the tree increase. The procedure continues till one of the following thresholds is met: (a) The number of observations related to a node shall be greater than 5 and that related to the left or right branch shall be at least 1; (b) the algorithm can no longer improve the value of IG.

The decision tree derived for classification of the experimental database of the current paper is illustrated in Fig. 4. The uppermost node in the decision tree of Fig. 4 is called as the root node, whereas the lowest nodes are known as leaves which correspond to the linguistic labels of the output variable (i.e., Pf). For instance, a leaf showing the class of [H] means that the majority of experimental observations falling in this particular branch possess the linguistic label of HIGH for Pf. By following consecutive branches starting from the root node down to a leaf, a fuzzy rule is established. The rule coming from a decision tree is not necessarily complete, meaning that its antecedent may not include all the input variables. Finally, the classification decision tree shown in Fig. 4 is equivalent to a Mamdani-type fuzzy logic model, in which the number of fuzzy rules is equal to the number of leaves in the decision tree, that is, 35 rules. The rules derived from a decision tree are clear and easy to understand, leading to enhance the interpretability of the fuzzy model.

Fig. 4
figure 4

The original classification decision tree (35 rules)

In order to assess the misclassification error of a node, denoted as E (node), the following relation is used:

$$E\left( {\text{node}} \right) = 1 - \frac{{n_{j} }}{n}$$
(3)

where the index j refers to the output class label whose number of experimental observations (i.e., nj) is the largest one among the observations of different classes existing in the considered node. The total number of observations in a node is denoted by n. As an example, if the majority of observations in a node or a leaf have an output belonging to the label of almost high (i.e., AH), the value of j becomes 4 since [AH] is the fourth linguistic label of Pf. Furthermore, if the number of observations related to [AH] is 20 out of the total observations of 50, then nj = 20 and n = 50, and the error of the node is E = 0.6 according to Eq. (3). It is recalled that in a leaf, the final class of the output is decided based on the value of nj, meaning that the class of a leaf is the linguistic label with the largest occurrences in the experimental observations. If the error in a leaf node, evaluated by Eq. (3), exceeds a user-specified threshold, then that leaf can be eliminated by merging the leaves in that level with a node located just in one level higher than it. Repeating the process of merging nodes of lower level with the node of higher level (i.e., the so-called pruning procedure) can help decrease the depth of a decision tree and eventually reduce the number of the corresponding fuzzy rules.

By pruning the decision tree of Fig. 4 at several consecutive levels, the simpler tree shown in Fig. 5 is obtained, which is finally chosen as the basis to generate the Mamdani model with 17 fuzzy rules. The formats of all rules derived from the decision tree of Fig. 5 are listed in “Appendix 1.” An example of one rule is as follows: IFL is Low or Medium, and AfEf is not Very Low, and fc′ is Medium or High or Very High, and fe is Medium or High or Very High, and Pnorm/Anorm is Very Low, THENPf is Almost Low. It is observed that a rule may contain a set of membership functions (i.e., linguistic variables) interconnected by Boolean operators of OR and NOT, which are defined as the minimum and complement, respectively. For example, the membership function for not Very Low can be constructed as complement of that for Very Low. Similarly, the membership function for the combined variable of Medium or High is built by taking into account the minimum of both membership functions for Medium as well as High. The schematic graphs for the combined membership functions are illustrated in Fig. 6.

Fig. 5
figure 5

The pruned decision tree (17 rules)

Fig. 6
figure 6

Membership functions combined by logical operators OR and NOT (VL, L, M, H, VH stand for linguistic variables of very low, low, medium, high, and very high, respectively)

4.2 Development of Sugeno fuzzy model based on subtractive clustering

The only difference between the rule base of a Sugeno fuzzy model and that of a Mamdani-type fuzzy model lies in the consequent part. In Sugeno model, the consequent part is either a constant value (i.e., the so-called zeroth order) or a linear combination of all input variables plus a constant term (i.e., the so-called first order). The unknown coefficients of the consequent part in Sugeno fuzzy model are commonly determined following an optimization procedure through training against the known experimental data points. However, this is considered as a drawback in the sense that optimization reduces the interpretability of the fuzzy model. Another issue is the definition of membership functions for the input variables and the induction of rules in Sugeno model. In the current paper, a well-known procedure called as subtractive clustering [46] is employed to generate the rule base of the fuzzy model. The application of subtractive clustering-based Sugeno model to the field of FRP composites has been already reported by the author [14, 15]. Another widely used technique for clustering of numerical data is grid partitioning. The disadvantage of grid partitioning is that the number of generated rules may increase a lot when the number of input variables increases. For instance, in the case of the current problem, where there are six input variables, if only two membership functions are assigned to each input, then the number of generated rules becomes 64 (i.e., 2 to the power of 6). Increased numbers of rules yield more complex models with larger number of parameters to be tuned, which in turn reduce the model generalization ability. On the other hand, if subtractive clustering is utilized, then the number of rules, the number of clusters, and the number of membership functions would be all the same. For instance, by considering three membership functions for each input variable, the number of rules still remains as low as three. This is the main reason that subtractive clustering method is employed in this paper so that the simpler models with less number of rules can be created. In the following, a brief overview of the method of subtractive clustering is explained.

4.2.1 Overview of subtractive clustering algorithm

The subtractive clustering algorithm is a technique to extract cluster centers from an assembly of numerical data points (i.e., the experimental database) such that wherever more numbers of the data are concentrated, a cluster center is found. To do so, a potential of being a cluster center, denoted by Pi, is assigned to each of the data (i.e., the ith experimental datum), as follows:

$$P_{i} = \mathop \sum \limits_{j = 1}^{n} \exp \left( {\frac{ - 4}{{r_{a}^{2} }} \cdot \left\| {{\hat{\mathbf{x}}}_{i} - {\hat{\mathbf{x}}}_{j}} \right\|^{2} } \right)$$
(4)

where ‖ ‖ stands for the Euclidean distance between two of data points, which are expressed as vectors whose coordinates contain input and output values, n is the total number of data points, and ra is a user-specified value (in the range of 0–1) for the radius of a cluster. In the problem in hand, xi = [L, AfEf, fc′, fe, Pnorm/Anorm, Dg/Wg, Pf]i, and \({\hat{\mathbf{x}}}_{i}\) is normalized with respect to the range of each coordinate. The datum with the maximum potential value is selected as the first cluster center. In order to find the next cluster centers, an amount of \(P_{k}^{*} \cdot \exp \left( {\frac{ - 4}{{r_{b}^{2} }} \cdot \left\| {{\hat{\mathbf{x}}}_{i} - {\hat{\mathbf{x}}}_{k}^{*} } \right\|^{2} } \right)\) is subtracted from the potentials of all data points, where Pk* and \({\hat{\mathbf{x}}}_{\varvec{k}}^{\varvec{*}}\) are the potential and the vector of normalized coordinates for the kth cluster center and rb is a multiplier of ra. Thereafter, the point with the highest remaining potential is the next cluster center. The procedure ends if \(P_{k}^{*} < \underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\varepsilon } P_{1}^{*}\), where \(\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\varepsilon }\) is a user-defined value, and the current cluster center (i.e., the kth center) is not accepted. Besides, if \(\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\varepsilon } P_{1}^{*} < P_{k}^{*} < \bar{\varepsilon }P_{1}^{*}\), where \(\bar{\varepsilon }\) is another user-defined value, then the following condition is evaluated: When \(\frac{{d_{\hbox{min} } }}{{r_{a} }} + \frac{{P_{k}^{*} }}{{P_{1}^{*} }} \ge 1\), where dmin is the shortest of the distances between the kth cluster center and all of the other cluster centers, then the current data point is accepted as the kth cluster center and the procedure continues; otherwise, the current cluster center is rejected and Pk* is set to 0, and the data point with the largest remaining potential is chosen as the new kth cluster center, and the above process is repeated. In the above-mentioned algorithm, there are several user-defined parameters including ra, rb, \(\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\varepsilon }\), and \(\bar{\varepsilon },\) which affect the number and coordinates of the generated cluster centers. In this paper, varying effects of each of them are examined, and the best values are selected based on the minimum error between the model output and experimental results.

4.2.2 Membership function selection and rule induction by subtractive clustering and its application to the present problem

Each of the identified cluster centers forms a fuzzy rule as follows: “IF an input variable is close to the cluster center THEN its output is close to the cluster center’s output.” The level of truth for this rule depends on Euclidean distance between the given variable and the cluster center, meaning that the degree of truth increases as the distance decreases. The following relation is used to determine the level of truth (or weight), wi, of the ith fuzzy rule:

$$\begin{aligned} w_{i} & = \exp \left\{ {\frac{ - 4}{{r_{a}^{2} }}\left[ {\left( {\frac{{L - L_{i}^{*} }}{{L_{ \hbox{max} } - L_{ \hbox{min} } }}} \right)^{2} + \left( {\frac{{A_{f} E_{f} - \left( {A_{f} E_{f} } \right)_{i}^{*} }}{{\left( {A_{f} E_{f} } \right)_{ \hbox{max} } - \left( {A_{f} E_{f} } \right)_{ \hbox{min} } }}} \right)^{2} } \right.} \right. \\ & \quad + \,\left( {\frac{{f_{\text{c}}^{\prime } - f_{{{\text{c}}i}}^{\prime *} }}{{f_{{{\text{c}},\hbox{max} }}^{\prime } - f_{{{\text{c}},\hbox{min} }}^{\prime } }}} \right)^{2} + \left( {\frac{{f_{\text{e}} - f_{{{\text{e}}i}}^{*} }}{{f_{{{\text{e}},\hbox{max} }} - f_{{{\text{e}},\hbox{min} }} }}} \right)^{2} \\ & \left. {\left. {\quad + \,\left( {\frac{{\frac{{P_{\text{norm}} }}{{A_{\text{norm}} }} - \left( {\frac{{P_{\text{norm}} }}{{A_{\text{norm}} }}} \right)_{i}^{*} }}{{\left( {\frac{{P_{\text{norm}} }}{{A_{\text{norm}} }}} \right)_{\hbox{max} } - \left( {\frac{{P_{\text{norm}} }}{{A_{\text{norm}} }}} \right)_{\hbox{min} } }}} \right)^{2} + \left( {\frac{{\frac{{D_{\text{g}} }}{{W_{\text{g}} }} - \left( {\frac{{D_{\text{g}} }}{{W_{\text{g}} }}} \right)_{i}^{*} }}{{\left( {\frac{{D_{\text{g}} }}{{W_{\text{g}} }}} \right)_{\hbox{max} } - \left( {\frac{{D_{\text{g}} }}{{W_{\text{g}} }}} \right)_{\hbox{min} } }}} \right)^{2} } \right]} \right\} \\ \end{aligned}$$
(5)

where the denominators are the range of input variables in the experimental database, and in the numerators, the variables without star refer to the input coordinates of a given variable, and those with a star refer to the input coordinates of the ith cluster center. The index i varies between 1 and c, where c is the number of rules that is equal to the number of cluster centers. Equation (5) implies an antecedent with Gaussian membership functions interconnected by AND operators defined as multiplication. The consequent part of the ith rule in Sugeno fuzzy model (first order) is as follows:

$$P_{f,i} = a_{i} L + b_{i} (A_{f} E_{f} ) + c_{i} f_{\text{c}}^{{\prime }} + d_{i} f_{\text{e}} + e_{i} \left( {\frac{{P_{\text{norm}} }}{{A_{\text{norm}} }}} \right) + f_{i} \left( {\frac{{D_{\text{g}} }}{{W_{\text{g}} }}} \right) + g_{i}$$
(6)

The final output (i.e., the pullout capacity Pf) of the fuzzy model is calculated as:

$$P_{f} = \mathop \sum \limits_{i = 1}^{c} \frac{{w_{i} }}{{\mathop \sum \nolimits_{j = 1}^{c} w_{j} }}P_{f,i}$$
(7)

where ai, bi, ci, di, ei, fi, and gi (i = 1,…, c) are unknown coefficients to be determined by minimizing the error between the model output and experimental value for Pf.

As mentioned earlier, in the developed Sugeno model, which is based on subtractive clustering algorithm, the membership function is of Gaussian type (i.e., exponential) as proposed by Chiu [46]. The reason for selecting the Gaussian membership function stems from the method of subtractive clustering, in which the membership value of a data point in a cluster is determined based on an exponential function of the Euclidean distance between the given point and the center of the cluster under study, as presented in Eq. (5). In fact, each input coordinate of each cluster center forms a Gaussian membership function, where the mean is the cluster center coordinate and the standard deviation is proportional to the range of the values of the given input variable. The membership functions considered for Sugeno models are illustrated in Fig. 7 (bottom part). The controlling parameters of the membership function are the same as the parameters affect the subtractive clustering algorithm, namely ra, rb, \(\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\varepsilon }\), and \(\bar{\varepsilon }\). The effects of varying values of these parameters are discussed in the following parts of the paper.

Fig. 7
figure 7

The diagram of fuzzy rules for the proposed Mamdani model (top) with 17 rules and Sugeno model (bottom) with 3 rules (the experimental value = 30.46 kN)

The first step to derive a fuzzy model is to decide the values for the clustering parameters (i.e., ra, rb, \(\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\varepsilon }\), and \(\bar{\varepsilon }\)) which result in identifying the cluster centers within the domain of the experimental database. Then, the IF-part of the rule base is materialized by use of the cluster centers (see Eq. 5), while the THEN-part is formed after optimizing the coefficients of Eq. (6) in terms of the model error evaluated on experimental datasets.

For developing soft computing models which are inferred from experimental data, it is required to use only a portion of data for training process, whereas the rest should be employed to check or validate the generalization ability of the developed model. In this regard, the total database of experiments is divided randomly into two equal portions called as training and checking datasets. The training dataset is used for tuning the parameters, whereas the checking dataset is employed to assess the model generalization on an unfamiliar data to avoid overfitting. Thus, the checking dataset is not used in the training process. For division of data into training and checking datasets, the data points which are assigned to a certain cluster are divided randomly into two equal portions. It is noted that the cluster center which is the closest to a data point is considered as the cluster assigned to that data point. Table 4 presents several error measures obtained on training, checking, and total experimental datasets for several Takagi–Sugeno fuzzy models derived using some selected values of ra, rb, \(\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\varepsilon }\), and \(\bar{\varepsilon }.\) The error measures considered here include: coefficient of variation (CoV) and mean of experiment-to-prediction ratio, as well as the mean absolute percentage error (MAPE) between the experiment and model prediction. Employment of checking dataset, which is here generated randomly, is essential to prove that the developed models are insensitive to the used training dataset considering that one of the criteria to select the proposed models in this study is to assure that the error on training dataset is similar to that on checking dataset (see Table 4) so as to avoid overfitting. As a result of this, the proposed models are indeed the best models which can be extracted from the current database.

Table 4 The effects of varying values of clustering parameters on the generated Sugeno fuzzy models

Each set of values for the clustering parameters (i.e., ra, rb, \(\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\varepsilon }\), and \(\bar{\varepsilon }\)) results in a particular Sugeno fuzzy model. In order to examine the effect of varied values of these user-defined parameters and to select the best values, each of the clustering parameters is varied with a small increment within its possible range so that it is assured that all possible values for each clustering parameter are generated. Then, all possible combinations of the values for the clustering parameters are considered, and as a result, numerous sets of values for ra, rb, \(\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle-}$}}{\varepsilon }\), and \(\bar{\varepsilon }\) are obtained. In the next step, each set of the parameters is used to build a Sugeno fuzzy model by subtractive clustering algorithm. Finally, the best fuzzy model is considered the one whose errors on both training and checking datasets are similar to each other (to ensure the generalization ability of the model) and also are minimum among the other prospective models. Besides, under the same performance in terms of errors, a simpler model with less number of rules is preferred. The above-mentioned procedure is implemented in a MATLAB code. For instance, as seen in Table 4, five different Sugeno models with the same number of rules (i.e., four rules) are built by varying the values of clustering parameters. The models presented in Table 4 serve as just demonstration and are chosen from many models built in this study by varying the clustering parameters. The finally selected set of parameter values for the case of fuzzy model containing four rules is listed in Table 5, which is decided based on the minimum error between the model output and experimental results while developing a high ability of generalization, as explained above. By following this procedure, the most accurate fuzzy models with different numbers of rules are chosen and listed in Table 5, which also reports the associated clustering parameters, the coordinates of the cluster centers, and the constants of consequent part.

Table 5 The coordinates of cluster centers and coefficients in consequent part of several Sugeno models with different clustering parameters

5 Results and discussion

5.1 Comparison of predictions by fuzzy logic models and experiments

The performances of different fuzzy logic models developed in this paper are evaluated in Table 6 in terms of various error measures in comparison with the experimental data. There are three Mamdani-type fuzzy logic models developed here including the one with 17 rules in addition to two others containing 13 and 21 rules. All these Mamdani models are derived from the same classification decision tree having 35 leaves, as explained earlier (see Fig. 4), but by pruning at different levels. As for Sugeno-type fuzzy logic models, five models with 1 up to 5 rules are built herein, as appeared in Table 5 with their characteristics. According to Table 6, the Mamdani model with 17 rules develops definitely less errors in terms of CoV, MAPE, RMSE, and MAE, which are evaluated on the total experimental database, as compared to Mamdani models with 13 or 21 rules. Thus, the Mamdani model with 17 rules is finally chosen. As for Sugeno models reported in Table 6, the difference in one error type among the models with 3, 4, and 5 rules is hardly noticeable. Thereby, all these models may be considered as a candidate for the final selection. However, selection of the best model is a trade-off between the reduced error measures and the simplicity of the model. In other words, if a model with less numbers of rules demonstrates error measures being more or less similar to another model which contains more numbers of rules, the model with less numbers of rules is of more interest since it is simpler. As a result, the Mamdani model with 17 rules and the Sugeno model with 3 rules are finally selected.

Table 6 Error measures of several fuzzy models with different numbers of rules and comparison with the existing models

The schematic diagrams of the rule base and membership functions for Mamdani model (17 rules) and Sugeno model (3 rules) are illustrated in Fig. 7. As a numerical example, also shown in Fig. 7, for an input variable with L = 200 mm, AfEf = 2053 kN, fc′ = 52.8 MPa, fe = 16 MPa, Pnorm/Anorm = 5.28, and Dg/Wg = 3.77, the value of Pf is equal to 30.2 kN in Mamdani model and 30.5 kN in Sugeno model as compared to its experimental value of 30.46 kN. In this specific work, the accuracies offered by both Mamdani and Sugeno models are almost similar. However, a general comparison of these two types of fuzzy logic models is beyond the scope of the current paper. It should be mentioned that although the number of rules in the proposed Sugeno model is much less than that in Mamdani model, the number of parameters need to be tuned in Sugeno model with 3 rules is equal to 21 unknowns in the consequent part (i.e., 7 constants per each rule). As stated earlier, tuning these parameters against experimental data needs an optimization procedure, which in turn reduces the interpretability of the fuzzy logic model. On contrary, in development of Mamdani model based on the classification decision tree, there is no need to optimization, resulting in a highly interpretable model. Furthermore, with reference to Fig. 7, it is observed that for the proposed Sugeno model, the membership functions, which are obtained by subtractive clustering, have no linguistic or qualitative meaning as opposed to the case of the proposed Mamdani model, where the membership functions are essentially attributed to the linguistic concepts. This aspect also contributes to the interpretability of the proposed Mamdani model. As mentioned previously, interpretability is an important feature of fuzzy logic-based modeling when compared to the other soft computing paradigms. The rule base of the proposed Mamdani model is presented in “Appendix 1.” On the other hand, the Sugeno model, in the format developed herein by use of subtractive clustering approach, can be presented as closed-form relations, making it of particular interest to practicing engineers. The closed-form equations of the proposed Sugeno model appear in “Appendix 2.”

5.2 Comparison between the proposed fuzzy models and existing equations for pullout strength of NSM FRP bonded to concrete

In the literature, there are two models developed by Seracino et al. [7] and Zhang et al. [8] to predict the pullout capacity of NSM FRP bonded to concrete blocks. The equations for both models are presented in Table 7. It is noted that Seracino’s model is applicable only for NSM strips (i.e., not including NSM rods). The error measures of both models are evaluated on the experimental database, and the results are reported in Table 6. The data on NSM strips consist of 78 specimens out of 222 experiments of the total database and are employed in Table 6 so as to compare Seracino’s model with the other considered models. According to Table 6, both of the proposed fuzzy logic models (i.e., Mamdani type and Sugeno type) demonstrate superior performance compared to Seracino’s and Zhang’s models. For instance, the CoVs for the ratios of the experimental value to model prediction for the proposed Mamdani model (17 rules) and Sugeno model (3 rules) are equal to 22.9 and 20.3% on the strip database, respectively, as compared to CoVs for Zhang’s and Seracino’s models, which are equal to 30.2 and 39.3%, respectively. Also, other error measures such as MAPE and RMSE decrease for the proposed fuzzy models comparing with the existing equations. Besides, the bond capacity predictions by selected fuzzy models and Zhang’s model versus 222 experimental results are depicted in Fig. 8, which shows that dispersion of the results by Zhang’s model is larger than that by the fuzzy logic models.

Table 7 The expressions of the existing models for bond capacity of NSM FRP
Fig. 8
figure 8

Predictions by different fuzzy models and Zhang’s model [8] for bond (i.e., pullout) capacity of NSM FRP versus experimental data

As a result, the performance of the proposed fuzzy models in terms of errors reported in Table 6 and Fig. 8 is better than the other existing models in the literature. Meanwhile, it is worth emphasizing that this paper seeks not only reducing the errors but also enhancing the interpretability of the proposed fuzzy models. In many cases, very low errors may be achieved but at the cost of reducing the interpretability issues. For instance, the adaptive network-based fuzzy inference system (ANFIS) is expected to provide higher accuracy since the parameters of the membership functions are iteratively adjusted by the training algorithm. On contrary, in both of the fuzzy models used here (namely subtractive clustering-based Sugeno model and classification decision tree-based Mamdani model), the membership functions of input variables are set fixed in the beginning and not altered during training. This feature, itself, enhances the interpretability of the model but may not yield the results as accurate as ANFIS does. Another point which should be noticed is that increasing the number of parameters to be tuned requires a larger database so as to assure that overfitting can be avoided.

Although the fuzzy models proposed here (see “Appendices 1, 2”) are more accurate than the existing regression-based models reported in the literature, they appear to be more complex than Zhang’s or Seracino’s models (see Table 7). However, this is a common issue for most of soft computing models as compared to regression-based models in the sense that a regression-based formula is so simple to be applied. On the other hand, a fuzzy model, though looks more complex, is not a black box and its advantage lies on its rule base which makes an expert easily understand the reasoning behind the model, and extend or modify it when needed. This is apparently opposed to a regression-based formula containing some coefficients suffering from lack of sound meaning.

5.3 Parametric study on the bond length of NSM FRP

In order to assess whether the proposed fuzzy models are able to capture the underlying mechanics of the problem in hand, the varying effects of the bond length L on the bond capacity Pf are examined in Fig. 9, where the other input variables are kept unchanged and equal to their mean values. According to the literature [8], there exists an effective bond length, beyond which the bond capacity no longer increases. As seen in Fig. 9, the proposed Mamdani model (with 17 rules) and Sugeno model (with 3 rules) are able to develop the concept of the effective bond length, meaning that after a certain amount of the bond length of NSM FRP, the pullout capacity remains constant. It is also observed from Fig. 9 that the amount of the effective bond length identified by the proposed Mamdani model is almost the same as that by the proposed Sugeno model. The other fuzzy models illustrated in Fig. 9 can predict, to some extent, the existence of an effective bond length except for the one-rule Sugeno model due to the fact that just one rule is most likely to be insufficient to describe the full behavior of the problem.

Fig. 9
figure 9

Parametric study on the effective bond length using different fuzzy models (common details: AfEf = 7115.5 kN, fc′ = 36.4 MPa, fe = 39.7 MPa, Pnorm/Anorm = 2.14, Dg/Wg = 2.01)

The simultaneous variations of both bond length, L, and concrete compressive strength, fc′, across their own ranges, and their effects on the pullout capacity, Pf, are shown in three-dimensional diagrams of Fig. 10 for the proposed Mamdani and Sugeno models. The other input variables are kept unchanged. According to Fig. 10, the general trend of variation in the proposed Mamdani model is similar to that in the proposed Sugeno model. This conclusion is important in the sense that the rule bases of the two proposed fuzzy models are initiated from two different sources: The former comes from classification decision tree, whereas the latter stems from subtractive clustering algorithm. More specifically, it is revealed from Fig. 10 that the proposed Sugeno model endorses the Mamdani model’s rules, which are extracted from the decision tree of Fig. 5, such as: (1) Given very low values of L, the value of Pf remains very low or low, and (2) if L and fc′ are not very low, while AfEf is not very low as it is the case in Fig. 10, then Pf is not in very low and low ranges.

Fig. 10
figure 10

Prediction of Pf with respect to simultaneous variations of L and fc′ by the proposed Mamdani fuzzy model (top figure) and Sugeno model (bottom figure); (common details: AfEf = 9533 kN, fe = 35 MPa, Pnorm/Anorm = 1.55, Dg/Wg = 1)

6 Conclusions

As an alternative method of determining the pullout capacity of NSM FRP rods or strips bonded to the concrete block, two fuzzy logic models are proposed in the current paper. The first fuzzy model is of Mamdani type, in which the rule base is deduced from a classification decision tree. The second one is of Sugeno type (first order), which is developed by the subtractive clustering algorithm. Several fuzzy logic models of each type with different numbers of rules are built. Among them, a Mamdani model with 17 rules and a Sugeno model with 3 rules are finally selected as they demonstrate the least errors on model predictions. The conclusions are as follows:

  1. 1.

    The decision tree-based Mamdani fuzzy model of the pullout strength, which contains the membership functions corresponding to linguistic labels, provides a highly interpretable rule base as listed in “Appendix 1.” The proposed Sugeno model based on subtractive clustering serves as closed-form formulations for the pullout capacity of NSM FRP, as presented in “Appendix 2.” With reference to the mentioned Appendices, the researchers and engineers can employ the fuzzy logic models proposed in this study.

  2. 2.

    Comparison with a large experimental database including 386 direct pullout tests on NSM FRP, which are collated from the open literature, indicates that the predictions by the proposed fuzzy models are in good agreement with the experimental results. The prediction accuracies of both of the proposed fuzzy models are similar in this particular study. However, a general comparison of Mamdani versus Sugeno fuzzy models is beyond the scope of this paper.

  3. 3.

    The proposed fuzzy models outperform the existing models reported in the literature. Unlike some of the published models which have been developed only for FRP strips, the proposed fuzzy models are applicable to both rods and strips. The coefficient of variation (CoV) for the ratio of the experiment to prediction in the proposed fuzzy models is almost 10% lower than that in Zhang’s model on the total data and about 16% lower than that in Seracino’s model on the strip data.

  4. 4.

    By conducting a parametric study on the bonded length of NSM FRP, it is revealed that the proposed fuzzy models are able to capture the underlying mechanics of the problem in hand since they can develop the concept of the effective bond length, that is, a bond length beyond which the bond capacity no longer increases.

The results obtained in the present work endorse the efficiency of fuzzy logic approach in predicting the pullout strength of NSM FRP bonded to concrete. It is, however, noted that the fuzzy model predictions are valid within the considered range of input variables. Future work may deal with applications of the fuzzy logic approach to other design aspects of NSM FRP reinforcement such as flexural or shear strengthening of the concrete beams.