Introduction

Swelling potential of soils is a parameter required for the design of many geotechnical projects such as foundations, embankments, compacted earth dams and compacted soil liners for containing contaminated solid and liquid wastes. For such structures, swelling properties is desired not to exceed a certain level that is appropriate for the design objectives.

Expansive soils can produce significant heave pressure to the engineering structures such as foundations, roads and sealing system for solid waste, when they are subjected to seasonal moisture changes. These types of soils can lead unexpected damage and large economic loss. In the United States, the economic cost of damage due to expansive soils is approximately 2.3 billion dollars per year (Dhowian et al., 1988). Therefore, the knowledge of swelling characteristics of soils is absolutely needed for the safe and economic construction of engineering structures. The main factor affecting swelling potential is the type of clay minerals in a soil. The soils which have high activity exhibit higher swelling potential. On the other hand, initial water content, dry density and grain size distribution are the other parameters affecting the swelling potential (El-Sohby and El- Sayed, 1981). Many correlation equations have been suggested for estimation of the swelling characteristics in literature (Mowafy and Bauer 1985, Mallikarjuna Rao, 1988, Erzin and Erol 2004). However these correlation equations has been mostly developed using limited number of data which consequently does not provide general identification for a wide range of soils. Therefore, it seems that the alternative methods are needed to obtain better predictions for the swelling potential. The spatial approaches such as artificial neural network (ANN) and fuzzy logic (FL) using black-box type computational process appear to be reasonable for the quick prediction of soil behavior since they enable to establish complex relationship between soil behavior and soil properties with nonlinear functions.

Recently, fuzzy methods has been successfully employed for modeling the complex systems in many engineering applications such as modeling the mechanical behavior of engineering materials, design of structures, automatic control, pattern recognition (Adam 2003). Fuzzy theory was first introduced by Lotfi Zadeh in 1965 developing the classical set theory. According to the classical set theory, a variable belongs to a set or not. In other words it crisply answers to a question. Whereas, fuzzy theory does not give a sharp answer to questions and enables partial belongings of a variable to different sets. The continuous membership functions that vary between 0 and 1 are employed for partial belonging. Mamdani and Takagi-Sugeno models are two types of fuzzy approaches used widely. In Mamdani model, the relationship between input variables and output variables is defined using the linguistic knowledge’s by means of if-then rules. This provides flexible and intuitive feature. There are many similarities between Tagagi-Sugeno model and Mamdani model. The main difference is that the output membership functions are either linear or constant in TS model (Mamdani and Assilian 1975, Tagagi and Sugeno 1985). TS model which is preferred for mathematical analysis and computational efficient seems to be more advantageous. It uses optimization and adaptive techniques to establish the system modeling and also uses less number of rules than Mamdani. (Tütmez and Tercan 2007). ANFIS, a type of the TS model, is a new method in the field of practical engineering. In order to establish a relation between input variables and output variables, it allows using the learning techniques of ANN and adaptively determining the model parameters.

In the content of this work, ANFIS model was considered to predict the swelling potential of compacted soils. The data used for training and validating ANFIS models were obtained from the laboratory tests performed on compacted soils for different geotechnical applications around Nigde in Turkey. Four basic soil parameters, coarse-grained fraction ratio (CG), fine-grained fraction ratio (FG), plasticity index (PI) and maximum dry density (MDD) which are considered as major factors affecting the swelling potential were presented to ANFIS model as inputs.

Overview of adaptive neuro-fuzzy inference system (ANFIS)

Jang (1993) proposed a new fuzzy logic model called ANFIS which employs some properties of artificial neural network (ANN) such as learning and parallelism. ANFIS also makes use of adaptive properties of ANN to determine the fuzzy system parameters, fuzzy rules, and type of the membership function. For that purpose it uses two methods, grid partitioning and clustering (Şen 2001). The structure of the ANFIS model is depicted in Fig 1. In ANFIS model, either linear or constant output membership functions are used to find the consequence parameter. Linear function is named as first-order sugeno type fuzzy inference system and constant function is named as zero-order Sugeno type fuzzy inference system. A typical two if-then rules in first-order Sugeno type is given at following form:

Fig. 1
figure 1

a) First order TS model reasoning b) ANFIS architecture (Padmini et al. 2008)

$$ {\text{If x}} = {\text{A}}_{ 1} {\text{ and y}} = {\text{B}}_{ 1} {\text{ then f}}_{{ 1({\text{x}},{\text{y}})}} = {\text{p}}_{ 1} {\text{x}} + {\text{q}}_{ 1} {\text{y}} + {\text{k}}_{ 1} $$
(1)
$$ {\text{If x}} = {\text{A}}_{ 2} {\text{ and y}} = {\text{B}}_{ 2} {\text{ then f}}_{{ 2({\text{x}},{\text{y}})}} = {\text{p}}_{ 2} {\text{x}} + {\text{q}}_{ 2} {\text{y}} + {\text{k}}_{ 2} $$
(2)

where x (or y) is the input node i, p,q and k are the consequence parameters obtained from the training, A and B are labels of fuzzy set defined suitable membership function.

The membership function is updated by backpropagation learning algorithm or a hybrid learning algorithm which are described in detail in Demuth and Beale (2001). The computations in ANFIS model is performed in five layers. The membership values of an input parameter for the premise parameter are determined in first layer. The output of the ith node of the layer l is defined as;

$$ {\text{U}}_{{ 1,{\text{i}}}} = \eta {\text{A}}_{\text{i}} \left( {\text{x}} \right){\text{ for i}} = 1, 2 {\text{ or}} $$
(3)
$$ {\text{U}}_{{ 1,{\text{i}}}} = \eta {\text{B}}_{{{\text{i}} - 2}} \left( {\text{x}} \right){\text{ for i}} = 3, 4 $$
(4)

Where, η shows the membership function. Typical membership function is as follow;

$$ \eta_{A}^{{}} (X) = \frac{1}{{1 + \left| {\frac{{x - c_{i} }}{{a_{i} }}} \right|^{2b} }} $$
(5)

where a i, b i and c i is the parameter set known as premise parameters

In layer 2 in which each node represents the fire strength of the rule, the output U2,i which is the product of the incoming signals is calculated as shown below;

$$ {\text{U}}_{{ 2,{\text{i}}}} = {\text{w}}_{\text{i}} = \eta {\text{A}}_{\text{i}} \left( {\text{x}} \right)\eta {\text{B}}_{\text{i}} \left( {\text{y}} \right){\text{ i}} = 1, 2 $$
(6)

The ratio of the ith rule’s firing strength versus all rules’ firing strength, called normalized firing strengths is computed in Layer 3, as

$$ {\text{U}}_{{ 3 , {\text{i}}}} = U_{3,i} = \overline{{w_{i} }} = \frac{{w_{i} }}{{w_{1} + w_{2} }},\quad {\text{i}} = 1,2 $$
(7)

In layer 4 the contribution of the ith rule to output is calculated with an adaptive node function:

$$ U_{4,i} = \overline{{w_{i} }} f_{i} = \overline{w}_{i} (p_{i} x + q_{i} y + k_{i} ) $$
(8)

where, \( \overline{w} \) is the normalized firing strength found from layer 3, p i , q i and k i are the consequent parameters.

The single node in layer 5 calculates the overall output of the ANFIS using the equation shown below.

$$ U_{5,i} = \sum {_{i} } \overline{{w_{i} }} f_{i} = \frac{{\sum {_{i} } w_{i} f_{i} }}{{\sum {_{i} } w_{i} }} $$
(9)

Database

Data used for training and validating the neuro fuzzy models compiled for this study were taken from various projects from Nigde region in Turkey. The data base comprises a total of 98 cases. As seen in Table 1 the soils utilized in analyses are classified as CH, CI, CL, GC, GM, MH, MI, ML, according to The Unified Soil Classification System (or USCS). Many study given in previous section revealed that CG, FG, PI and MDD have considerably influence on the swelling potential of soils. Therefore, they were presented to ANFIS model developed in this study as input parameters. Dataset in Table 1 were divided into two groups. While first group data was used for training, the second group which was not included to the training group was used for testing.

Table 1 Data used for training and testing

Development of ANFIS model

In this section the ANFIS model developed for estimation of the swell potential is described. Two types of methods, grid partitioning and scatter partitioning, are generally utilized to generate the membership functions for developing the ANFIS model (Dubois and Prade 1980, Shahin et al. 2003, Abdulsamet et al. 2004, Yılmaz and Mahmut 2006, Topçu and Sarıdemir 2007). In grid partitioning method, the numbers of fuzzy rules are generated exponentially as depending on number of inputs and number of membership functions. This is marked as the inconvenience feature and leads that the ANFIS architecture has more complex structure. It is not mostly preferred in case the number of input parameters is more than 6 (Nayak et al. 2004, Padmini et al. 2008). On the other hand, in scatter partitioning, satisfactory results may not be obtained when any data does not belong to fuzzy cluster and it is away from the centre of cluster. This issue is not in question for the grid partitioning methods. It evaluates the entire range of the data in determination of the ANFIS structure (Padmini et al. 2008).

In this work, the ANFIS model was constructed using grid partitioning method for the estimation of the swell potential. On the other hand, the Gaussian membership function was selected and updated by hybrid learning algorithm in training stage. Fig. 2 shows the shape of the MFs. Surface graphs obtained from ANFIS model is given in Fig. 3. This figure shows the relationships between the inputs parameters and the output parameter. In other words, trend of the ANFIS model is viewed from this figure in terms of variation of the input parameters. The Fuzzy toolbox of MATLAB computer aided Software (Demuth and Beale 2001) was utilized for the necessary computation and for developing the ANFIS models in this work.

Fig. 2
figure 2

Membership functions for the swelling potential

Fig. 3
figure 3

Fuzzy surface showing the relationship between the input parameters and swelling potential

Results and Discussion

In this section, it is focused to examine and discuss the performance of ANFIS models. Of the 98 data sets, 78 were used for training the ANFIS models and 20 were presented to ANFIS models for testing. As can be seen from most of previous studies, it is a common agreement to use percentages varying between %15–25 and many artificial intelligence software uses similar percentages for the test stage (Shahin et al. 2003, Topçu and Sarıdemir 2007, Padmini et al. 2008). In order to find out how accurate the results of developed ANFIS models are, some statistical verification tools were utilized such as coefficient of correlation (R), standard errors (SE) and standard deviation (σ). Furthermore, t-test and F-test to indicate the significance of models were implemented to the results.

Estimated swelling potential values were graphically compared with the experimental counterparts in Fig. 4. As can be seen, the ANFIS models were found to be able to learn the complex relationship between the basic soil properties and the swelling potential. The satisfactory agreement between the ANFIS predictions and experimental data was obtained in terms of the R values. R values of the ANFIS model is 0.93. In statistics, the overall error performances of the relationship between two groups can be interpreted from the R values. According to Smith (1986), if a proposed model gives R > 0.8, there is a strong correlation between measured and estimated values over all the data available in the database.

Fig. 4
figure 4

Comparison beetween the predicted and experimental swelling potential

The reliability of ANFIS model was also evaluated by SE value that is a statistical criterion showing a measurement of deviation around the regression line. This is important for statistic as well as R value, because sometimes a relationship with high R value may exhibit high SE value. It is desired that a model gives high R value and low SE value. The ANFIS model developed herein for the swelling potential yields SE value of 1.06. This is satisfactory value of the SE for reliable model of the swelling potential. On the other the σ value of the model was obtained as 2.82.

In order to evaluate the significance of R values, the t-test was performed. For that purpose both groups, namely experimental values and ANFIS predictions were adopted to be normally distributed. T-test includes the comparison of calculated t-value with the tabulated t-value using the null hypothesis which is set up to test differences in groups. The null hypothesis either is nullified or refuted as a viable possibility with respect to data. According to this, in case the calculated t-value is greater than the tabulated t-value, the null hypothesis is rejected. This indicates R value of a relation is significant. On the contrary, if the calculated t-value is less than the tabulated t-value, the null hypothesis is not rejected and it is accepted that R is not significant (Drapper and Schmidt 1981, Tukey 1997, Rawlings 1998, Myers 1990). In this work, confidence level was taken as 95% and tabulated critical t-values corresponding to 95% confidence level were obtained as 2.093. T-value for the swelling potential was calculated as 4.42. According to this, it is understood that the results obtained from ANFIS model are significant. Analysis of variance was also performed to see the significance of relationship. For that that purpose ANNOVA analysis was applied. Assuming a 95% level of confidence F value for the swelling potential was obtained as 29.03. Since this value greater than tabulated F-value, the null hypothesis is rejected. ANOVA analysis and t-test show that there is acceptable relationship between ANFIS predictions and experimental data.

Summary and conclusion

In the content of this study, the applicability of adaptive neuro fuzzy inference system (ANFIS) for the prediction of the swelling potential of soils was investigated. The experimental data sets of compacted soils taken from different location of Nigde in Turkey were utilized to develop the ANFIS models. CG, FG, PI and MDD values of soils were presented as input parameters to ANFIS models. While the 78 data sets were employed for training, 20 data sets were used for testing. End of the analyses, It was observed that predicted values from ANFIS models are in the satisfactory agreement with the experimental counterparts. Some statistical tools were used to evaluate the performances of models. R value of the relationship between the predictions and experimental values was found to be at sufficient level in terms of the statistical. On the other hand, t-test and F-test was performed assuming a 95% level of confidence and it was observed that the ANFIS model is significant and valid. When the difficulties and limitations in soil testing procedures are considered, it can be concluded that the results obtained from ANFIS are very encouraging for the cases tested and the use of ANFIS is more reasonable method for the prediction of swelling potential of soils.