Abstract
Under the action of seepage or surface erosion, earth dams, canal foundations, subgrades and embankments filled with dispersive soils can easily cause piping, caves, gullies and other damages and even cause accidents in civil engineering and hydraulic engineering. Thus, how to correctly distinguish the dispersibility of soils is concerned by soils engineers all the time. In this study, the artificial neural network model was developed to recognize dispersive soils by using TensorFlow which is an open-source library of Python. The data samples were from 11 water conservancy projects located in west of China. The results show that the artificial neural network model has a great effect in predicting the dispersibility of soil. A combination of artificial neural network and Python is applicable to solve complex, fuzzy and highly nonlinear problems in civil engineering and hydraulic engineering, which is also one of the important trends in the future study.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
Dispersive soils are widely distributed in all parts of the world, especially in arid and semiarid area (Abbaslou et al. 2016). It is a kind of special soil with water sensitivity, and its water erosion resistance is very low. The existence of dispersive soils is easy to cause piping, caves, gullies and other damages, which poses a serious threat to the safety of buildings and structures.
Dispersion occurs when the repulsive forces between individual clay particles exceed the attractive forces so that when the clay mass is in contact with water, individual clay particles are progressively detached from the surface and go into suspension. If the water is flowing, the dispersive clay particles are carried away (Sherard et al. 1972). Many factors governing the susceptibility to dispersion are physical, chemical and mineral properties indexes of soils and environmental water. The dispersion of soil is positively correlated with the content of dissolved sodium cations, exchangeable sodium cations, montmorillonites and pH and is negatively correlated with the content of organic matters and clay particles. The lower the content of dissolved salts in the water, the greater the susceptibility of the soils to dispersion. Such conclusions have been widely accepted (Richards et al. 1954; Sherard et al. 1972; Holmgren et al. 1977; Chorom et al. 1994; Marchuk et al. 2012, 2013). Zhao et al. (2015) applied the gray system theory to analyze the relationship between the physical, chemical and mineral properties and dispersibility of 123 groups of soil samples. It was concluded that the dispersive soil is routinely observed to have rich exchangeable sodium percent and high pH value and cannot be predicted solely by sodium or montmorillonite contents. Thus, the effect factors of soil dispersibility have been well studied.
Conventional soil mechanics parameters, such as particle relative density, particle gradation, boundary moisture content, cohesion and friction angle, cannot directly reflect the degree of soil dispersion. Different dispersibility discrimination tests (pinhole test, crumb test, double-hydrometer test, pore water test and exchangeable sodium percent test) were designed from different perspectives (simulation model, colloidal properties, particle size distribution and soil chemistry). However, due to the complexity of dispersive soils, dispersive discriminant results of same samples under different dispersive tests are often inconsistent with each other (Maharaj et al. 2015). At present, comprehensive identification methods are mostly used. Bell and Walker (2000) developed a rating system which requires results from pinhole test, exchangeable sodium percentage test, crumb test and pore water test. Fan et al. (2013) and Ju et al. (2016) designed weights analysis method which requires results from pinhole test, crumb test, double-hydrometer test, pore water test and exchangeable sodium percent test. Fan and Kong (2013) established an empirical equation with four soil physical and chemical indicators which are liquid limit, clay content, sodium percentage in the pore water and pH. Maharaj (2013) proposed a complete process which requires results from in situ crumb test, laboratory crumb test, double-hydrometer test and chemical analysis tests.
Artificial neural network (ANN) is a highly self-learning and self-adaptive nonlinear system. Under the condition of unknown physical and mathematical models of objective natural phenomena, the actual output achieved by the self-learning of ANN is completely consistent with the expected output in a certain error range. The aim of ANN is to explore the mystery of human intelligence by studying the mechanism of the thinking of human brain and then to simulate the structure of work of human brain so as to make machines have human-like intelligence. ANN has been applied in pattern recognition, machine learning, expert system and other fields and has become an active field in artificial intelligence (AI) research. In the field of hydraulic engineering and civil engineering, ANN had worked well for deep mixing technique (Hosseini et al. 2020), for identifying and categorization of water and soil (Olawoyin et al. 2013; Wu et al. 2018) and for estimation and prediction of the them indexes of air permeability (Cai et al. 2020), hydraulic properties (Schaap et al. 1998; Tamari et al. 1996), thermal characteristics (Erzin et al. 2008; Ozturk et al. 2011; Jalal et al. 2021), chemical composition (Aitkenhead et al. 2012; Sarani et al. 2016) and mechanical behavior (Attoh-Okine 2004; Mohanty et al. 2019; Jalal et al. 2021).
According to a study published by IEEE Spectrum, Python has become the most popular computing language in the world, with Java and C ranking second and third, respectively. Python is an object-oriented programming language and can easily connect various modules written in other languages (especially C/C++). It has a large number of third-party libraries, so developers only need to be good at using these libraries to complete most of the work.
The aim of this study is to propose a new reliable identification of dispersibility of soils. With the vigorous research of ANN, it has been widely used in various disciplines. The relationship between the comprehensive discrimination result of dispersibility of soils and their physical, chemical, mineral and dispersibility discrimination tests indices forms a complex nonlinear system, essentially. The study used 80 samples information obtained from different provinces in west of China to build ANN model by using TensorFlow which is an open-source library of Python.
2 Materials and Methods
2.1 Data Sources and Selection of Discriminant Parameters
There are 80 sets of soil data information. They were came from Dashixia Hydropower Station, Xijiao and Sanping reservoirs in Xinjiang Province, main channels of irrigation area in Heilongjiang Province, Ningmute and Banduo hydropower stations in Qinghai Province, Wenjiagou, Majiashu and Nanping hydropower stations in Ningxia Hui Autonomous Region, Heihe Water Control Project in Shaanxi Province and Shangma Reservoir in Shanxi Province. In all, there are 11 water conservancy projects. Soil’s detailed information is shown in Table 1.
Based on the double-layer theory of soil–water–electrolyte system, Fan et al. (2012) systematically studied the relationship between basic physical, chemical, mineral indexes and dispersibility of soils. It was considered that the dispersive soils consist of three aspects: low clay content, high sodium content and high pH. Referring to the relevant research results, the input parameters were determined as liquid limit (Ww), plastic limit (Wp), clay content (Pc), dispersive degree (D), pH, organic matter (Wu), montmorillonite content (M), illite content (I), exchangeable sodium percentage (ESP), pore water sodium ion content (CNa+) and sodium percentage (PS). Statistical results of data are shown in Table 2. The output parameter is one of discriminant results: dispersive soil (DS), transitional soil (TS) or non-dispersive soil (NS).
2.2 Artificial Neural Network
The concept of ANN originates from biological neural network. Each neuron in human brain has thousands of channels which are widely interconnected with other neurons to form complex biological neural network. Signals can be transmitted through new connections generating by stimulation to form feedback in biological neural network (BNN). However, ANN cannot generate new connections randomly, so it updates neurons to form a good nervous system.
Diagram of the ANN model used in this paper is shown in Fig. 1. Each circle stands for one neural, and the all circles form an ANN. The information transmission between neurons is expressed by function y = f (weights * x + bias). Different weights (w) and bias(b) make different output results. The computing process consists of two parts, the forward propagation of information and the back propagation of error. In the process of forward propagation, node data can be calculated in accordance with formulas (1), (2) and (3). Sn in formula (3) is the probability of three discriminant results, and the final output result is one of them. The commonly used activation functions are sigmoid, tanh and relu. In the process of back propagation, the gradient descent algorithm is often used to update w and b. Common gradient descent algorithms include gradient descent optimizer, momentum optimizer and Adam optimizer. Simply, the self-learning process of ANN is an iterative process.
Thus, ANN has intellectualized and computerized characteristics. There is no need to establish an empirical statistical relationship between discriminant parameters and predictive objectives based on some theory.
2.3 Modeling Tool
The Python language was invented by Guido van Rossum in 1991. As a powerful cross-platform object-oriented programming language, it is one of the most widely used computer languages. The advantages of this language are as follows: (1) simple grammar, clear code, free and open source; (2) easy to connect with modules made in other languages (C, Java, C++,VB, etc.); (3) easy porting to Linux, Windows; (4) supporting interactive mode, which can input and get results from terminals without compiling environment in the development process, also called dynamic language; and (5) containing more rich and powerful software libraries. The libraries used in this article are TensorFlow, NumPy, Pandas and Scikit learn. The function of min–max scaler and train–test split in Scikit learn library was used to standard and bath data, respectively.
The machine learning flowchart is shown in Fig. 2. Basic flow of ANN discriminant model by using TensorFlow module is as follows: (1) Define function of add layers: Calculation formula is node data = w * upper layer node data + b; discriminant formula of excitation function is that if node layer has excitation function. (2) Define placeholder of node: Node data placeholder of input layer is 11 and of output layer is 3. (3) Define function of layers: The ANN in this study consists of three layers: Input layer only has inputs data without activation function; the activation function of layer1 (hidden layer) and output layer2 (output layer) is tanh and softmax, respectively. (4) Define loss function: Cross-entropy was used; cross-entropy and softmax function are used together to solve the problem of binary classification or multi-classification with mutual exclusion of categories. (5) Choose an appropriate optimizer to minimize the loss: Gradient descent algorithm was used and the learning rate is 0.5. (6) Initialize all variables and start operations. The maximum number of iterations is set to 3001. In order to verify the correctness of the model, 70% of the 80 data samples were randomly selected as training samples and the remaining 30% of the data were used as test samples. The training samples and test samples did not coincide.
3 Results and Discussion
3.1 Data Preprocessing
What is clear from the data is that dimensions of factors affecting dispersion of soils are different. The model is easy not to converge if used different dimension data for training, so the data should be normalized. The function of min–max scaler was used to normalize input layer data to 0.01 ~ 0.99. The scaling principles are shown in formulas (4) and(5). xstandard is the result of standard normalization, as well as normalizing data to [0,1]; xmax and xmin are maximum and minimum value in data, respectively; max and min are maximum and minimum value in range of normalization; and xscaler is the result of normalizing data to [0.01,0.99]. Results of data preprocessing are shown in Table 3. Due to limited space, only some data are listed.
3.2 Model Building and Discussion
The influence of numbers of neurons in hidden layer and iterative steps on dispersivity discriminant ANN model under different activation functions and learning rates is discussed, as shown in Fig. 3. It could be concluded that the speed of gradient descent and finding minimum value of function when learning rate is small. Even if iterative calculation is up to 10,000 steps, the accuracy of ANN model with small learning rate is still low. Therefore, it is suggested that learning rate should be 0.1. As shown in Fig. 3c, e and i, higher and stable accuracy could be obtained, ranging from 0.80 to 0.84 when activation function is relu as well as using less iterative steps. It is further suggested that numbers of neurons in hidden layer should be 29 and iterative steps should be 2500. In general, for ANN discriminant model of dispersive soil, it is suggested that the activation function is relu, the learning rate is 0.1, the number of neurons in hidden layer is 29, and iterative steps are 2500.
3.3 Distribution of Weights and Bias Values
The difficulty of ANN model is to change weights and bias iteratively, so that prediction result is infinitely close to actual result. The histograms of weights and bias in different layers (hidden layer and output layer) which reflect statistical characteristics of weights and bias are shown in Fig. 4. The values of weights in hidden layer are between -3.06 and 3.06 and the most common values are −0.106 and 0.528 and the values of weights in output layer are between -3.05 and 3.71 and the most common value is −0.717, as given in Fig. 4a. The values of bias in hidden layer are between −0.655 and 1.31 and the most common value is 0.0898 and the values of bias in output layer are between −0.209 and 0.417 and the most common value is −0.209, as given in Fig. 4b.
3.4 Model Validation
We listed the results of discrimination and classification of new ten data which did not participate in the neural network model system construction with the knowledge information of dispersive soil prediction, so as to test the reliability of the established machine identification system, as given in Table 4. Test dates were chosen from Nanyin project, Taoshan Reservoir in Heilongjiang Province and a dam in Northeast China. The prediction results are in good agreement with the actual results, which verifies ANN model has good application effect in predicting the dispersibility of soil as shown in Table 5.
4 Conclusion
ANN has the characteristics of intelligence and computerization. It is not necessary to establish empirical statistical relationship between discriminant parameters and prediction targets based on some theory. It is very effective in soil dispersion prediction.
Python is a high-level language, which has characteristics of fast computing speed, high computing ecology and easy grammar. TensorFlow library has the advantages of good visualization effect and easy to understand the model calculation process.
The combination of Python and ANN is suitable for the identification and simulation of highly complex and nonlinear problems, which is one of the trends of hydraulic engineering and civil engineering in the future.
References
Abbaslou H, Hadifard H, Poorgohardi A (2016) Characterization of dispersive problematic soils and engineering improvements: a review. Comput Mat Civ Eng 1:65–83. https://doi.org/10.4236/msa.2011.26085
Aitkenhead MJ, Coull MC, Towers W, Hudson G, Black HIJ (2012) Predicting soil chemical composition and other soil parameters from field observations using a neural network. Comput Electron Agr 82:108–116. https://doi.org/10.1016/j.compag.2011.12.013
Attoh-Okine NO (2004) Application of genetic-based neural network to lateritic soil strength modeling. Constr Build Mater 18:619–623. https://doi.org/10.1016/j.conbuildmat.2004.04.006
Bell FG, Walker DJH (2000) A further examination of the nature of dispersive soils in Natal. South Africa Q J Eng Geol Hydroge 33:187–199. https://doi.org/10.1144/qjegh.33.3.187
Cai W, Kumar H, Huang S, Bordoloi S, Garg A, Lin P, Gopal P (2020) ANN Model Development for Air Permeability in Biochar Amended Unsaturated Soil. Geotech Geol Eng 38:1295–1309. https://doi.org/10.1007/s10706-019-01091-w
Chorom M, Regasamy P, Murray R (1994) Clay dispersion as influenced by pH and net particle charge of sodic soils. Aust J Soil Res 32:1243–1252. https://doi.org/10.1071/SR9941243
Erzin Y, Rao BH, Singh DN (2008) Artificial neural network models for predicting soil thermal resistivity. Int J Therm Sci 47:1347–1358. https://doi.org/10.1016/j.ijthermalsci.2007.11.001
Fan HH, Gao MX, Li P, Wu PT (2007) Experimental study on clay dispersibility. J chang’an Uni (nat Sci) 27:76–80. https://doi.org/10.1360/yc-007-0745
Fan HH, Kong LW (2013) Empirical equation for evaluating the dispersivity of cohesive soil. Can Geotech J 50:989–994. https://doi.org/10.1139/cgj-2012-0332
Fan HH, Kong LW, Guo MX, Li P, Liu RM (2009) Dispersivity and impermeability of dam soil in Wenjiagou Reservoir. Chin J Geotech Eng 31:458–463
Fan HH, Kong LW, Li HL, Lu XQ, Yin PJ (2010) Study of dispersive identification and treatment with lime of dam soil in Majiushu Reservoir. Rock Soil Mech 31:193–198
Fan HH, Li HL, Zhao GW (2012) Relation among dispersive mechanism, physical-chemical and mineral properties of clayey soil. Chin J Geotech Eng 34:1740–1745
Fan HH, Wu PT, Li P, Jia L, Zhang S (2005) Study on identification of dispersive clay soils. Chin J Geotech Eng 27:1310–1316
Fan HH, Zhao GW, Lu LN, Li Z (2013) Comprehensive criterion of dispersive soil and improvement of pinhole test. J Hydrelc Eng 32:248–253
Gao MX, Li P, Wang GD, Fan HH (2009) Mechanism and reason of dispersivity of dam soil in Nanping Reservoir. Chin J Geotech Eng 31:1303–1308
Holmgren GGS, Flanagan CP (1977) Factors affecting spontaneous dispersion of soil materials as evidenced by the crumb test. ASTM Spec Tech Publ 623:218–239. https://doi.org/10.1520/STP26990S
Hosseini SAA, Mojtahedi SFF, Sadeghi H (2020) Optimisation of deep mixing technique by artificial neural network based on laboratory and field experiments. Georisk Asses Manage Risk Eng Syst Geohazards 14:142–157
IEEE Spectrum (2020) Top Programming Languages 2020. https://spectrum.ieee.org/static/interactive-the-top-programming-languages-2020?utm_source=spectrum-hero. Accessed 13 May 2021
Jalal FE, Xu YF, Iqbal M, Javed MF, Jamhiri, (2021) Predictive modeling of swell-strength of expansive soils using artificial intelligence approaches: ANN, ANFIS and GEP. J Environ Manage 289:1–17. https://doi.org/10.1016/j.jenvman.2021.112420
Ju JL, Fan HH, Liu JM (2015) Establishment of clay dispersion degree evaluation model based on principal component analysis. Yangtze River 46:59–62
Ju JL, Fan HH, Liu JM (2016) Study on dispersivity identification and modification of dam soil in Dashixia Hydropower Station. J Hydrelc Eng 42:114–119. https://doi.org/10.3969/j.issn.0559-9342.2016.11.029
Ju JL, Liu JM, Yan BW (2008) Study on dispersive test for impervious soil of Ningmute hydropower station dam. Subg Eng 2:33–35. https://doi.org/10.3969/j.issn.1003-8825.2008.02.016
Li Z, Xing YC, Fan HH (2006) Study on dispersibility by physical and chemical properties analysis and dispersive test for soil core wall. J Liaoning Tech Uni 25:123–125. https://doi.org/10.3969/j.issn.1008-0562.2006.z1.042
Liu J, Qiu MX, Miu LJ (1984) Mechanism, identification and protection of dispersive soils. Scientific research papers of water resources and Hydropower Research Institute, Beijing. 27:149–157
Maharaj A (2013) The Evaulation of test protocols for dispersive soil identification in Southern Africa. Dissertation, University of Pretoria
Maharaj A, Rooy V, Paige-Green P (2015) Revised test protocols for the identification of dispersive soils. J S Afr Inst Civ Eng 57:31–37
Marchuk A, Rengasamy P (2012) Threshold electrolyte concentration and dispersive potential in relation to CROSS in dispersive soils. Soil Res 50:473–481. https://doi.org/10.1071/SR12135
Marchuk A, Rengasamy P, Mcneill A (2013) Influence of organic matter, clay mineralogy, and pH on the effects of CROSS on soil structure is related to the zeta potential of the dispersed clay. Soil Res 51:34–40. https://doi.org/10.1071/SR13012
Mohanty S, Roy N, Singh SP (2019) Estimating the strength of stabilized dispersive soil with cement clinker and fly ash. Geotech Geol Eng 37:2915–2926. https://doi.org/10.1007/s10706-019-00808-1
Olawoyin R, Nieto A, Grayson RL, Hardisty F, Oyewole S (2013) Application of artificial neural network (ANN)-self-organizing map (SOM) for the categorization of water, soil and sediment quality in petrochemical regions. Expert Syst Appl 40:3634–3648. https://doi.org/10.1016/j.eswa.2012.12.069
Ozturk M, Salman O, Koc M (2011) Artificial neural network model for estimating the soil temperature. Can J Soil Sci 91:551–562
Qiu MX (1984) Influence of sodium montmorillonite on stability of soil. Scientific research papers of water resources and Hydropower Research Institute, Beijing, 20:120–126
Richards L (1954) Diagnosis and Improvement of Saline and Alkali Soils. Soil and Water Conservation Research Branch Agricultural Research Service, Washington
Sarani F, Ahangar AG, Shabani A (2016) Predicting ESP and SAR by artificial neural network and regression models using soil pH and EC data (Miankangi Region, Sistan and Baluchestan Province, Iran). Arch Agron Soil Sci 62:127–138. https://doi.org/10.1080/03650340.2015.1040398
Schaap MG, Leij FJ, Van Genuchten MT (1998) Neural Network analysis for hierarchical prediction of soil hydraulic properties. Soil Sci Soc Am J 62:847–855. https://doi.org/10.2136/sssaj1998.03615995006200040001x
Sherard JL, Decker RS, Ryken NL (1972) Piping in earth of dispersive clay. In: Proceedings of the ASCE speciality conference on the performance of earth-supported structures. 1:589-626
Tamari S, WöSten JHM, Ruiz-Suárez JC (1996) Testing an artificial neural network for predicting soil hydraulic conductivity. Soil Sci Soc Am J 60:1732–1741. https://doi.org/10.2136/sssaj1996.03615995006000060018x
Wu W, Li AD, He XH, Ma R, Liu HB, Lv JK (2018) A comparison of support vector machines, artificial neural network and classification tree for identifying soil texture classes in southwest China. Comput Electron Agr 144:86–93. https://doi.org/10.1016/j.compag.2017.11.037
Yue BR, Jin YH (1998) Crack cause and control of earth dam of Shangma Reservoir in Shanxi Province. Tech Seep Prev 4:1–14
Yue C, Tang H, Yao HL, Liu J, Yang YH (2020) Laboratory test of classification and identification for dispersive clay. Yellow River:11–16. https://kns.cnki.net/kcms/detail/41.1128.TV.20200621.1709.006.html
Zhang L, Fan HH, Che WF, Zhang Y, Meng L, Zhao HW (2018) Experimental analysis of engineering properties of foundation soil of canal works in Heilongjiang area. Hydro-Sci Eng 168:85–92
Zhao GW, Fan HH, Shi M, Ma XL, Li YG, Wei F (2015) Dispersion mechanism of soils revealed by grey system theory and verification tests. Chin J Geotech Eng 37:186–190
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhang, L., Du, YH., Yang, XJ. et al. Application of Artificial Neural Network in Predicting the Dispersibility of Soil. Iran J Sci Technol Trans Civ Eng 46, 2315–2324 (2022). https://doi.org/10.1007/s40996-021-00754-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40996-021-00754-0