Statistical and neural network assessment of the compression index of clay-bearing soils

Ozer, Mustafa; Isik, Nihat S.; Orhan, Mehmet

doi:10.1007/s10064-008-0168-8

Statistical and neural network assessment of the compression index of clay-bearing soils

Original Paper
Published: 08 October 2008

Volume 67, pages 537–545, (2008)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Bulletin of Engineering Geology and the Environment Aims and scope Submit manuscript

Statistical and neural network assessment of the compression index of clay-bearing soils

Download PDF

Mustafa Ozer¹,
Nihat S. Isik¹ &
Mehmet Orhan¹

616 Accesses
49 Citations
Explore all metrics

Abstract

The compression index is used to estimate the consolidation settlement of clay-bearing soils. As the determination of compression index from oedometer tests is relatively time-consuming, empirical equations based on index properties can be useful. In this study the performance of widely used single and multi-variable empirical equations was evaluated using a database consisting of 135 test data. New empirical equations were developed utilizing least square regression analysis. In addition, an artificial neural network (ANN) with eight input variables was also developed to estimate the compression index. It was concluded that ANN provides the best results.

Résumé

L’indice de compression est utilisé pour estimer le tassement de consolidation des sols argileux. Comme la détermination de cet indice à partir des essais oedométriques prend quelque temps, des équations empiriques basées sur des indices géotechniques peuvent être utiles. Dans cette étude, l’intérêt d’équations empiriques à une ou plusieurs variables a été évalué à partir d’une base de données comportant 135 résultats d’essais. De nouvelles équations empiriques ont été développées à partir d’une analyse de régression par la méthode des moindres carrés. De plus, un réseau de neurones artificiel (ANN) avec huit variables d’entrée a été développé pour estimer l’indice de compression. La conclusion est que l’ANN donne les meilleurs résultats.

Predicting Compression Index Using Artificial Neural Networks: A Case Study from Dalian Artificial Island

Development of Artificial Neural Networks for Prediction of Compression Coefficient of Soft Soil

Prediction of Coefficient of Compression of Soil Using Artificial Neural Network

Discover the latest articles, news and stories from top researchers in related subjects.

Geoengineering

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The compression index represents the slope of the curve of void ratio versus logarithm of effective pressure and is conventionally determined by oedometer tests. Used for the calculation of consolidation settlement of clayey soils, this parameter directly affects the type and dimensions of the foundation system hence the cost of the construction.

The amount of consolidation settlement of fine grained soils depends on the fabric of the soil, the water absorption capacity of the clay sized particles, the existing stress state, the pre-consolidation pressure of the soil sample, and to some extent the compressibility of the soil grains. Therefore, it would be assumed that any direct or indirect parameters which define these conditions should be related to the compression index. Atterberg limits reflect the relative amount of clay sized particles and their mineralogy; the initial void ratio of the soil is an indication of the existing stress state and the pre-consolidation pressure; the natural water content is a measure of the water attracted to the clay particles and free water present within the voids; and dry unit weight may to some degree be an indication of the compressibility of soil grains. Other weight–volume relationship parameters, such as specific gravity and natural unit weight, are physically related to dry unit weight, natural water content and the void ratio of the soil. For this reason, it seems logical to estimate the compression index from the above mentioned parameters.

As the oedometer test is relatively time-consuming compared with standard index tests, various attempts have been made to estimate the compression index from tests more easily carried out. Empirical equations relating various parameters to the compression index have been presented by many researchers, e.g. for local clays (Helenelund 1951; Cozzolino 1961; Bowles 1989; Yin 1999; Yoon et al. 2004); organic soils (Bowles 1989); low plasticity clays (Sowers 1970; Nakase et al. 1988); and remolded soils (Skempton 1944; Wroth and Wood 1978; Carrier 1985; Nagaraj and Murthy 1986; Sridharan and Nagaraj 2000; Giasi et al. 2003). In addition to these particular conditions, a large number of empirical equations have been published which are applicable to all clays.

Many researchers have used single parameter models for the estimation of compression index, such as liquid limit, natural water content or in-situ void ratio, although others recommend multiple soil parameter models for the estimation of the compression index.

Some widely used equations for the estimation of compression index which are valid for all clays are presented in Table 1. It should be noted that the majority of these equations are linear in form.

Table 1 Some widely used compression index equations applicable to all clays

Full size table

This study compares the performance of some widely used empirical compression index equations applicable to all clays. New empirical equations relating the compression index with other parameters of fine grained soil are proposed using the least squares regression technique. In addition to empirical equations, a neural network model is also developed for the estimation of compression index.

Database compilation

The data includes tests undertaken specifically for this research (77) and 58 results from Herrero (1980). In addition to the oedometer tests, natural water content (w _n), initial void ratio (e _o), liquid limit (w _L), plastic limit (w _p), plasticity index (I _p), specific gravity of soil particles (G _s), dry unit weight (γ_d) and natural unit weight (γ_n) were used. Whilst the new results are from different parts of Turkey, those from Herrero (1980) were obtained on undisturbed soil samples from different parts of America. Although Herrero (1980) presented results from 90 samples, as some physical-index properties were not reported, only 58 were used in this study. The conventional oedometer tests in both studies were undertaken following ASTM D 2435 (1996) and the other tests according to the relevant ASTM standards.

The oedometer test requires undisturbed soil samples. Clearly, some degree of disturbance is unavoidable, which causes deviation on the e–log p plot. Schmertmann (1953) described a procedure to obtain the equivalent of a field (virgin) consolidation curve from the laboratory consolidation curve. This was followed in both the present study and by Herrero (1980).

In order to obtain an empirical equation which is valid for all clays, the database must include a sufficiently wide range of data. In order to assess the adequacy of the database, descriptive statistics and the frequency distribution of each data set were determined. Table 2 displays the descriptive statistics and Fig. 1 represents the histogram of each variable.

Table 2 Descriptive statistics of variables used in the database

Full size table

Comparison of existing equations

In order to evaluate the performance of empirical equations, root mean square error (RMSE) indices (Eq. 1) was used, following for example Alvarez Grima and Babuska 1999; Finol et al. 2001; Gokceoglu 2002; Yılmaz 2006).

$$ {\text{RMSE}} = \sqrt {\frac{1}{n}\sum\limits_{1}^{n} {\left( {C_{{{\text{c}}_{\text{est}} }} - C_{{{\text{c}}_{\text{lab}} }} } \right)^{2} } } $$

(1)

Where, n is the number of data points, Cc_est is the compression index estimated from empirical equations, Cc_lab is the compression index determined in laboratory. The lower the RMSE value, the better the model performance.

Figures 2, 3, 4 illustrate the performance of the single variable equations listed in Table 1; the RMSE are presented in Table 3. It can be seen that the performance of the three equations utilizing w _n as the predictor is very similar (Fig. 2). The equation with the lowest RMSE value (0.077) was that proposed by Azzouz et al. (1976) (Eq. 1 in Table 3). Nevertheless, as can be seen from Fig. 2, it still results in a considerable under-estimate of the compression index for some of the soil samples.

Table 3 RMSE indices of the assessed equations

Full size table

Among the single variable empirical equations utilizing e _o as the predictor, Azzouz et al. (1976)’s equation (Eq. 4 in Table 3) again shows the best performance with the lowest RMSE index. It is clear from Fig. 3 that the Hough (1957) equation generally under-estimates the compression index, while Nishida’s (1956) equation generally gives an over-estimate.

As can be seen from Fig. 4, the general performance of single variable equations based on w _L are not as good as those based on w _n and e _o. All the w _L based single variable equations over-estimate the compression index.

As noted above, some researchers recommended multiple soil parameter models for the estimation of the compression index. This is very logical as a number of different factors control the amount of consolidation settlement of fine grained soils. The performance of the multi-variable equations proposed by Azzouz et al. (1976) are presented in Fig. 5, those proposed by Herrero (1983a, b) in Fig. 6 and those by Al-Khafaji and Andersland (1992), Nagaraj and Murty (1985), and Koppula (1981) in Fig. 7.

All the multi-variable equations proposed by Azzouz et al. (1976) gave good results (Fig. 5); the best using e _o and w _n (Eq. 10 in Table 3). This is consistent with the results obtained using their single variable equation, when that using w _L was found to be less accurate than those using e _o and w _n.

Figure 6 shows that the Herrero 2 (1983b) and Herrero 3 (1983b) equations generally under-estimate the compression index, with the lowest RMSE recorded for Herrero 1 (1983a).

Figure 7 clearly demonstrates that the Koppula (1981) and Nagaraj and Murty (1985) multi-variable equations significantly over-estimate the compression index, with relatively high RMSE indices (Table 3). The equation from Al-Khafaji and Andersland (1992) gives a relatively lower RMSE.

Development of new empirical equations

Using the same database, a correlation matrix was generated (Table 4) to identify the relationship between the parameters. This indicates a strong positive relationship between w _n, e _o, w _L and C _c, and a strong negative relationship between γ_d, γ_n and C _c. Both multiple linear and multiple non-linear regression analyses were performed utilizing the variables with good correlation with the compression index. After trying numerous combinations with all parameters, one linear and one non-linear equation with the highest regression coefficient and lowest RMSE index were determined. These equations and their RMSE indices are given in Table 5 and their performance compared in Fig. 8.

Table 4 Correlation matrix of parameters

Full size table

Table 5 Suggested empirical equations for the estimation of compression index

Full size table

It can be seen that the proposed non-linear equation (Eq. 1) is slightly better than the linear one. However, the RMSE indices of both are lower than those achieved using the single and multi-variable equations previously proposed. The non-linear equation developed in this study is compared with the equations proposed by Herrero 1 (1983a) and Azzouz et al. 1 (1976) in Fig. 9, where it can be seen that the proposed non-linear equation results in points more closely located around the 1:1 line.

Artificial neural network-based compression index estimation

In recent times, artificial neural networks (ANNs) have been applied to many geotechnical engineering tasks and have demonstrated some degree of success (Shahin et al. 2002). They are a form of artificial intelligence which tries to simulate the human brain in a very crude way. The purpose of ANNs is to set a relationship between model inputs and outputs by continuously updating connection weights according to inputs–outputs. The main advantage of ANNs is that they are very flexible, and complex relationships between inputs and outputs can be discovered by changing the model structure and connection weights. In addition, a previously developed (trained) network can easily be updated when new datasets become available. However, ANNs have an important disadvantage in not being transparent as a closed form equation.

Neural network model development involves six main stages: determination of input and output variables; grouping of database as training and validating datasets; determination of network structure; optimization of connection weights, stopping criteria; and validation of the neural network. In order to obtain good predictions from an ANN, a large set of data is needed which must be diverse as ANNs are unable to extrapolate beyond the range of the training data. As described previously the database used in this study covered a wide range. All the available parameters (w _n, e _o, w _L, w _p, I _p, G _s, γ_d, γ_n) were used as input variables, scaled between 0 and 1 as recommended by Masters (1993).

It is common practice to divide the available data into two subsets; a training set to construct the neural network model and an independent validation set to estimate model performance (Twomey and Smith 1997). Approximately 80% of the data were used for training and 20% for validation. The validation data were selected to cover a wide range of compression index values.

Determination of a network structure involves the selection of the number of hidden layer nodes. Hornik et al. (1989) showed that a network with one hidden layer can approximate any continuous function provided that sufficient connection weights are used; therefore, in this study a network with one hidden layer is used and the number of hidden layer nodes was increased until a good model was achieved. During the training stage (i.e. optimization of connection weights) the aim is to find a global solution to what is typically a highly non-linear optimization problem (Shahin et al. 2002). Feed-forward neural networks with back propagation algorithms are the most widely used method (Rumelhart et al. 1986). Therefore, in this study a back propagation algorithm was used during training with a 0.6 momentum and 0.8 learning rate. Stopping criteria are used to decide whether to stop the training process or not; in this study the training process was stopped when the average error was below 0.02.

Figure 10 displays the architecture of the neural network for prediction of the compression index and the relative connection weights. Figure 11 shows the laboratory-determined scaled compression index values versus the ANN estimated scaled compression index values of training data. The RMSE of the validation data was calculated as 0.051, which is considerably better than the RMSE of 0.0661 (Eq. 1 in Table 5) achieved by the best empirical equation. In addition to the lower RMSE of the ANN estimations, laboratory determined versus ANN estimated compression index values are more closely spread around the 1:1 line (Fig. 11).

Summary and conclusions

In this study, the performances of widely used single and multi-variable empirical equations for the estimation of the compression index were evaluated using a database consisting of 135 wide-ranging test data. The results indicate that the single variable model of Azzouz et al. (1976), utilizing initial void ratio as the predictor, has the lowest RMSE index while for the multi-variable models, Herrero’s (1983a) equation gave the best performance (RMSE 0.0740) using initial void ratio and specific gravity of soil particles as predictor variables.

Using the same database, new multi-variable empirical equations (both linear and non-linear) were developed using least square regression analysis. In these equations natural water content, initial void ratio, liquid limit and dry unit weight were used as the predictor variables as a correlation matrix indicated these have a strong relationship with compression index. The new equations gave a slightly lower RMSE index than the existing single or multi-variable empirical equations.

In addition, an ANN was developed using all the parameters in the compiled database as predictor variables. The network had one hidden layer and eight hidden layer nodes. Back propagation algorithm was used to train the network. Approximately 80% of the data were used for training and 20% for validation. The RMSE of the validation data was calculated as 0.051, which is significantly lower than that for of the empirical equations obtained from regression analysis.

It should always be kept in mind that some environmental factors such as fabric and cementation could not be considered in any of the empirical equations. In view of this, it must be pointed out that these equations may be useful only for the preliminary estimation of the compression index. In order to improve the usefulness of the equations, the procedure suggested by Bowles (1996) can be applied, i.e. at least one oedometer test should be performed on a sample taken from the area of interest and an empirical equation which gives the closest estimate selected.

References

Al-Khafaji AWN, Andersland OB (1992) Equations for compression index approximation. J Geotech Eng ASCE 118(1):148–153
Article Google Scholar
Alvarez Grima M, Babuska R (1999) Fuzzy model for the prediction of unconfined compressive strength of rock samples. Int J Rock Mech Min Sci 36(3):339–349
Article Google Scholar
ASTM (American Society for Testing and Materials) D 2435 (1996) Standard test method for one–dimensional consolidation properties of soils. Annual book of ASTM standards, West Conshohocken, United States
Azzouz AS, Krizek RJ, Corotis RB (1976) Regression analysis of soil compressibility. Soils and Foundations. Jpn Soc Soil Mech Found Eng 16(2):19–29
Google Scholar
Bowles JE (1989) Physical and geotechnical properties of soils. McGraw-Hill Book Company, New York
Google Scholar
Bowles JE (1996) Foundation analysis and design, 5th edn. McGraw-Hill Company, New York, p 88
Google Scholar
Carrier WDIII (1985) Consolidation parameters derived from index tests. Géotechnique 35(2):211–213
Google Scholar
Cozzolino VM (1961) Statistical forecasting of compression index. In: Proceedings of the 5th International Conference on Soil Mechanics and Foundation Engineering Paris 1: 51–53
Finol J, Guo YK, Jing XD (2001) A rule based fuzzy model for the prediction of petrophysical rock parameters. J Petrol Sci Eng 29(2):97–113
Article Google Scholar
Giasi CI, Cherubini C, Paccapelo F (2003) Evaluation of compression index of remoulded clays by means of Atterberg limits. Bull Eng Geol Environ 62(4):333–340
Article Google Scholar
Gokceoglu C (2002) A fuzzy triangular chart to predict the uniaxial compressive strength of Ankara agglomerates from their petrographic composition. Eng Geologist 66:39–51
Article Google Scholar
Helenelund KV (1951) On consolidation and settlement of loaded soil layers. Dissertation, Finland Technical Institute, Helsinki, Finland
Herrero OR (1980) Universal compression index equation. J Geotech Eng Div ASCE 106(11):1179–1200
Google Scholar
Herrero OR (1983a) Universal compression index equation; Discussion. J Geotech Eng Div ASCE 109(10):1349
Google Scholar
Herrero OR (1983b) Universal compression index equation; Closure. J Geotech Eng Div ASCE 109(5):755–761
Article Google Scholar
Hornik K, Stinchcombe M, White H (1989) Multilayer feedforward networks are universal approximators. Neural Netw 2:359–366
Article Google Scholar
Hough BK (1957) Basic Soils Engineering. The Ronald Press Company, New York, pp 114–115
Google Scholar
Koppula SD (1981) Statistical estimation of compression index. Geotech Test J 4(2):68–73
Google Scholar
Masters T (1993) Practical Neural Network Recipes in C++. Academic Press, San Diego, p 493
Google Scholar
Mayne PW (1980) Cam-clay predictions of undrained strength. J Geotech Eng Div ASCE 106(11):1219–1242
Google Scholar
Nagaraj TS, Murty BRS (1985) Prediction of the preconsolidation pressure and recompression index of soils. Geotech Test J 8(4):199–202
Google Scholar
Nagaraj TS, Murthy BRS (1986) A critical reappraisal of compression index equations. Géotechnique 36(1):27–32
Google Scholar
Nakase A, Kamei T, Kusakabe O (1988) Constitute parameters estimated by plasticity index. J Geotech Eng ASCE 114:844–858
Google Scholar
Nishida Y (1956) A brief note on compression index of soil. J Soil Mech Found Eng Div ASCE 82:1027–1–1027–14
Google Scholar
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning internal representation by error propagation. In: Rumelhart DE, McClelland JL (eds) Parallel distributed processing, vol. 1, Chap. 8. MIT Press, Cambridge, p 1208
Google Scholar
Schmertmann JH (1953) Estimating the true consolidation behavior of clay from laboratory test results. Proc. ASCE 79, Separate 311, pp 26
Shahin MA, Maier HR, Jaksa MB (2002) Predicting settlement of shallow foundations using neural networks. J Geotech Geoenviron Eng 128(9):785–793
Article Google Scholar
Skempton AW (1944) Notes on the compressibility of clays. Q J Geol Soc Lond 100:119–135
Article Google Scholar
Sowers GB (1970) Introductory soil mechanics and foundations, 3rd edn. The Macmillan Company, Collier-Macmillan Limited, London, p 102
Google Scholar
Sridharan A, Nagaraj HB (2000) Compressibility behaviour of remoulded, fine-grained soils and correlation with index properties. Can Geotech J 37(3):712–722
Article Google Scholar
Terzaghi K, Peck RB (1967) Soil mechanics in engineering practice, 2nd edn. Wiley, New York, p 73
Google Scholar
Twomey JM, Smith AE (1997) Validation and verification. In: Kartam N, Flood I, Garrett JH (eds) Artifical neural networks for civil engineers: fundamentals and applications. ASCE, New York, pp 44–64
Google Scholar
Wroth CP, Wood DM (1978) The correlation of index properties with some basic engineering properties of soils. Can Geotech J 15:137–145
Google Scholar
Yılmaz I (2006) Indirect estimation of the swelling percent and a new classification of soils depending on liquid limit and cation exchange capacity. Eng Geol 85:295–301
Article Google Scholar
Yin J–H (1999) Properties and behavior of Hong Kong marine deposits with different clay content. Can Geotech J 36(6):1085–1095
Article Google Scholar
Yoon GL, Kim BT, Jeon SS (2004) Empirical correlations of compression index for marine clay from regression analysis. Can Geotech J 41(6):1213–1221
Article Google Scholar

Download references

Author information

Authors and Affiliations

Teknik Egitim Fakültesi, Gazi Üniversitesi, Yapi Egitimi Bölümü, Besevler, 06500, Ankara, Turkey
Mustafa Ozer, Nihat S. Isik & Mehmet Orhan

Authors

Mustafa Ozer
View author publications
You can also search for this author in PubMed Google Scholar
Nihat S. Isik
View author publications
You can also search for this author in PubMed Google Scholar
Mehmet Orhan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mustafa Ozer.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ozer, M., Isik, N.S. & Orhan, M. Statistical and neural network assessment of the compression index of clay-bearing soils. Bull Eng Geol Environ 67, 537–545 (2008). https://doi.org/10.1007/s10064-008-0168-8

Download citation

Received: 27 November 2007
Accepted: 01 September 2008
Published: 08 October 2008
Issue Date: November 2008
DOI: https://doi.org/10.1007/s10064-008-0168-8

Keywords

Mots clés

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Statistical and neural network assessment of the compression index of clay-bearing soils

Abstract

Résumé

Similar content being viewed by others

Predicting Compression Index Using Artificial Neural Networks: A Case Study from Dalian Artificial Island

Development of Artificial Neural Networks for Prediction of Compression Coefficient of Soft Soil

Prediction of Coefficient of Compression of Soil Using Artificial Neural Network

Introduction

Database compilation

Comparison of existing equations

Development of new empirical equations

Artificial neural network-based compression index estimation

Summary and conclusions

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Mots clés

Navigation

Statistical and neural network assessment of the compression index of clay-bearing soils

Abstract

Résumé

Similar content being viewed by others

Predicting Compression Index Using Artificial Neural Networks: A Case Study from Dalian Artificial Island

Development of Artificial Neural Networks for Prediction of Compression Coefficient of Soft Soil

Prediction of Coefficient of Compression of Soil Using Artificial Neural Network

Explore related subjects

Introduction

Database compilation

Comparison of existing equations

Development of new empirical equations

Artificial neural network-based compression index estimation

Summary and conclusions

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mots clés

Search

Navigation