Abstract
A realistic analysis of rock deformation in response to any change in stresses is heavily dependent on the reliable determination of the rock properties as analysis inputs. Young’s modulus (E) provides great insight into the magnitude and characteristics of the rock mass/material deformation, but direct determination of Young’s modulus in the laboratory is time-consuming and costly. Therefore, basic rock properties such as point load strength index, P-wave velocity and Schmidt hammer rebound number have been used to estimate Young’s modulus. These rock properties can be easily measured in the laboratory. The main aim of this study was to develop two intelligent models based upon fuzzy logic and biological nervous systems in order to estimate Young’s modulus of sandstone for a set of known index properties drawn from laboratory tests. The database required to construct these models comprised a series of drill cores (96 samples of sandstone) from site investigation operations for a hydroelectric roller-compacted concrete (RCC) dam located in the Malaysian state of Sarawak. In the final stage of the present study, using the same data sets, multiple regression (MR) analysis was also proposed for comparison with the prediction results of both the fuzzy inference system (FIS) and artificial neural network (ANN) models. The ANN model was found to be far superior to FIS and MR in terms of several performance indices including root-mean-square error and ranking. Thus, from the results of this study, it was concluded that the models proposed herein could be utilised to estimate the E of similar rock types in practice.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
In general, providing a reliable estimate of rock mass characteristics (i.e. strength and deformation) is of primary importance in analysing and designing rock engineering applications such as slope, foundation and underground excavation. In particular, the elasticity constants of a rock mass (E, \(\upsilon\)) are considered as two main inputs for analysing rock deformation behaviour. Typically, rock mass deformability parameters such as Young’s modulus are measured directly by field tests as in situ modulus (denoted by E rm) and indirectly by laboratory tests as intact modulus (denoted by E). Moreover, since true deformation analysis of rock must be carried out according to site conditions, it is important to calculate the rock deformation parameters by taking both lab and field circumstances into consideration. For this purpose, the intact material modulus derived from lab experiments should be correlated with the rock mass modulus using an appropriate classification scheme.
The most widely used methods for determining Young’s modulus include plate loading (equipped with a multi-point extensometer in the rock mass) and flat jack testing for measurements in the field (Hoek and Diederichs 2006) and uniaxial compressive strength (UCS) testing in the laboratory. However, the results of these tests may be susceptible to uncertainty due to the discontinuity and anisotropic behaviour of rock mass subjected to diverse field stresses (Yazdani Bejarbaneh et al. 2015; Armaghani et al. 2015, 2016a). For example, rock samples extracted from the failure zone around a tunnel free face might be exposed to grain-scale damage (micro-cracking) as a result of either stress relaxation or blasting (Martin and Stimpson 1994). On the other hand, both testing methods are time-consuming and require costly equipment, particularly for field tests (Mishra and Basu 2013; Armaghani et al. 2016b).
Several polynomial regression models using field and lab test data have been proposed to overcome these limitations. Table 1 lists a series of regressions that utilise correlations between field data and rock mass classification systems, including rock mass rating (RMR; Bieniawski 1973), tunnelling quality index (Q-system; Barton et al. 1974) and geological strength index (GSI; Hoek and Brown 1997), in order to predict deformation modulus values, E rm, for an isotropic rock mass. Findings from most of these correlations show a relatively good fit to the field data, despite the fact that the exponential relationships and modulus-based equations, as suggested by Mitri et al. (1994) and Sonmez et al. (2004), deliver poor performance in predicting rock mass deformation moduli.
Additionally, Table 2 provides a number of typical simple regression equations based on the data from lab measurements. In fact, these regressions were developed by relating a range of data from simple index tests, including Schmidt hammer (Yilmaz and Sendir 2002; Dincer et al. 2004), ultrasonic velocity (Yasar and Erdogan 2004; Armaghani et al. 2014), point load strength (Yilmaz and Yuksek 2008, 2009) and porosity (Lashkaripour 2002; Beiki et al. 2013), to Young’s modulus E. However, these statistical models suffer from low generalisability, such that they are not able to be generalised to a wide range of data from different engineering applications (e.g. Beiki et al. 2013; Rezaei et al. 2012).
Over the last 20 years, there has been a marked increase in the successful application of intelligent methodologies such as artificial neural networks (ANNs), fuzzy inference system (FIS) and evolutionary computation for the preliminary stage of rock engineering design and rock mechanics modelling (Feng and Hudson 2004, 2010; Hudson and Feng 2007; Mishra and Basu 2013; Mishra et al. 2015). Gokceoglu and Zorlu (2004) validated the results of a FIS model in predicting the E and UCS of greywacke samples. The proposed fuzzy model benefited from 54 fuzzy rules to map four input variables (rock index properties) to two output variables (E and UCS). The authors analysed the performance of the fuzzy model using multiple regression, and concluded that the predicted results were in good agreement with the lab test results compared with a statistical model. Kahraman et al. (2009) trained an ANN model to predict the levels of UCS and E for Misis fault breccia, and their prediction results revealed the highest accuracy when compared with the regression models. Yagiz et al. (2012) examined the effect of slake durability cycles on the UCS and elasticity constant of carbonate rocks by developing ANN and multivariate regression models based upon 54 carbonate rock cores. They reported more reliable prediction of ANN than that with multivariate regression. An ANN model together with a multiple regression using several intact rock properties of gypsum were developed by Yilmaz and Yuksek (2008) to predict Young’ modulus of gypsum, with results demonstrating that the proposed ANN model was able to predict the proposed rock modulus with reasonable accuracy. Yilmaz and Yuksek (2009) undertook another study of gypsum rock samples obtained from the Sivas basin in Turkey in order to assess the predictability of two engineering properties of gypsum; UCS and E. For this purpose, they constructed ANN and hybrid neuro-fuzzy systems from the basic and index properties of the samples (input data), including water content, porosity, sonic velocity, Schmidt hammer rebound number and point load index, to predict both E and UCS (output data). They reported successful prediction for both models compared with multiple regression. Tonnizam Mohamad et al. (2015) and Momeni et al. (2015) showed the successful application of hybrid particle swarm optimization (PSO) and ANN in predicting the UCS of rocks. Several recently developed models for predicting the rock modulus, E, using soft computation techniques are shown in Table 3.
This study attempts to estimate Young’s modulus for a series of known index properties for sandstone samples. To this end, two soft computing techniques, FIS and ANN, were designed such that the three index properties act as inputs to the systems, with Young’s modulus as the target. More specifically, this fitting problem involves matching a set of numeric lab measurement inputs, including Schmidt hammer, rebound number (R n), P-wave velocity (V p), and point load index (I s(50)) to an associated set of numeric targets E. In the final stage of this study, a multiple regression (MR) model using the same data set is constructed for the sake of comparison. This statistical model is employed to provide a measure of how well both systems (FIS and ANN) fit the data in terms of their performance indices.
Methods
Fuzzy inference system
Zadeh (1965) established the fundamental mechanism of fuzzy logic theory on which mapping the input space onto the output space is based. The concept of fuzzy logic is principally founded on a fuzzy set. In such a set, there are a number of components with partial membership in that set, as opposed to crisp or well-defined boundaries for a classical or ordinary set. In the case of fuzzy logic, the truth of a conditional expression is measured by degrees (between 0 and 1) resulting from an appropriate membership function (MF). In other words, characterisation of a fuzzy set is carried out by the specific MF which conveys a sense of ambiguity (Zadeh 1973). In fact, the MF implies how each of the crisp values from the input dimension is connected to a membership grade (values within the interval from 0 to 1). The mathematical relationships are capable of constructing any form of MF in a succinct and straightforward way. In addition, each of the various classes of MF is denoted by a specific designation which is directly dependent on the shape and formula of that function, with triangular, trapezoidal, Gaussian, generalised bell and sigmoidal MFs as the most common types (Jang et al. 1997).
Fuzzy conditional statements
Fundamentally, in a fuzzy system, the input–output mapping can make some inferences based upon a set of rule statements using parallel evaluation. Overall, an if–then rule structure is divided into two differentiable parts, the "if" part and the "then" part, which are referred to as the antecedent or premise and the consequent or conclusion, respectively (Sivanandam et al. 2007). With regard to the configuration of the fuzzy rule, there are various primary elements in defining the rules which involve input and output variables in conjunction with descriptive adjectives pertaining to those variables. For instance, a statement for the if–then rule is represented as follows:
where two variables x and y are the universes of discourse for fuzzy sets describing linguistic labels or values A and B, respectively.
Fuzzy reasoning (or approximate reasoning)
In general, fuzzy reasoning attempts to specify conclusions by applying various inference operations to a certain number of fuzzy statements (Bai et al. 2007). More specifically, the fuzzy reasoning (or approximate reasoning) mechanism comprises five steps:
-
1.
The first step (fuzzification of inputs) consists in interpreting the input values through the specific input membership functions.
-
2.
The second step (application of fuzzy operators) involves the application of a fuzzy logic operator to the membership values from the “if–then” parts of a rule in order to yield a single firing strength per rule.
-
3.
The third step (implication method), in which a fuzzy set is assigned to the output variable, refers to an implication method that involves truncating that fuzzy set to a degree derived from the preceding stage.
-
4.
The fourth step (aggregation) involves aggregating all the truncated fuzzy sets for each rule in order to obtain an overall consequent membership function.
-
5.
The fifth step (defuzzification) consists in converting the resultant fuzzy set (i.e. overall consequent membership function) into a single crisp output using a competent method selected from a list of defuzzification techniques, e.g. centroid, bisector, mean of maximum.
Fuzzy rule-based models
The combination of fuzzy sets, fuzzy logic operators (fuzzy reasoning) and fuzzy statement constitutes the backbone of the FIS, also referred to as a fuzzy model or rule-based model. FIS applications cover a broad scope of research areas including pattern recognition, data and image classification, management, economics, automatic control, robotics, signal processing, computer vision, decision-making, expert systems and prediction of chaotic time series (Zadeh 1965; Rutkowski 2004).
Overall, the most frequently used FIS models for various applications can be categorised under two headings: the Mamdani fuzzy model (Mamdani and Assilian 1975) and the Sugeno fuzzy model (Takagi and Sugeno 1985). These systems differ based on the type of function being used in the consequent part of their fuzzy statements, as demonstrated in Fig. 1. As a result, the operations employed in the consequent part of fuzzy rules (i.e. aggregation and defuzzification) vary in accordance with this distinction (Jang et al. 1997).
Sugeno-type fuzzy model
Takagi and Sugeno (1985) developed a systematic approach for generating fuzzy statements, commonly referred to as the Sugeno fuzzy inference method (other designations include the Takagi–Sugeno method or TSK method, hereinafter called the Sugeno method). In the Sugeno fuzzy model, the consequent function is expressed mainly in polynomial form (defined by the input variables) rather than any of the MFs previously mentioned. For this reason, the fuzzy operations are applied only to the antecedent part of the if–then rule. In other words, unlike the Mamdani type, the fuzzy reasoning process in the Sugeno model cannot be rigorously executed in both parts of the fuzzy rule statement. Since the resulting outputs inferred from each rule are crisp values, the total single-value output for each target variable is equal to a weighted average of outcomes from all the rules. These two versions of the fuzzy model are graphically illustrated in Fig. 1.
The Sugeno model substitutes the weighted average method for the defuzzification techniques employed in the Mamdani model. In fact, the defuzzification of the resulting output MF into a single crisp value suffers from two major drawbacks, the failure to reach an exact result mathematically and the time-consuming nature of the computations (Shams et al. 2015). Therefore, the Sugeno-type fuzzy inference is far and away the preferred choice for modelling a fuzzy system from any given input–output data set.
Data clustering
Data clustering is a quick, one-pass approach for developing a FIS from any given number of data sets. In most cases, it is advisable to apply a clustering algorithm to the data-based fuzzy model, particularly when there is little to no prior information as to the underlying behaviour of the data being analysed (Jain and Dubes 1988; Jain et al. 1999). Most fuzzy systems generated without clustering suffer from an excessive number of rules, especially those systems with relatively large input variables. In contrast, the rules produced by data clustering are extremely well suited to the input data points such that the number of rules is optimised according to the number of identified clusters.
The extraction of fuzzy rules from a set of data is usually undertaken by either subtractive or mountain clustering analysis. However, the mountain clustering approach involves high computational expense in comparison to the subtractive method. Therefore, in this research, all fuzzy rules were propagated based on subtractive clustering in order to avoid the extra computational cost. Further information on these clustering paradigms can be found in the authors' publications (Chiu 1994; Yager and Filev 1994a, b).
Artificial neural network
In general, an artificial computational system, or ANN, is designed by simulating various organisational principles upon which a nervous system functions are based. Unlike traditional expert systems, ANN is inherently capable of learning from any given training pattern to find the underlying relationship between input and output data for a mapping problem (Zurada 1992). Artificial neurons are regarded as the constitutive units of an ANN computing system and enable the parallel processing of information in the same way as a biological brain.
Pioneering work in neural network modelling by McCulloch and Pitts (1943) led to the development of a binary threshold logic unit (binary decision unit) for modelling artificial neuron behaviour. Every artificial node of the network captures a weighted sum of incoming signals, and then passes the signals through a specific activation function to produce a more useful output. Structurally, ANNs can be viewed as highly parallel systems in which a network of interconnected computational units, called neurons or nodes, are organised into successive layers. Each pattern of connection between neurons affects network behaviour and also defines the network class (Kanellopoulas and Wilkinson 1997).
As mentioned previously, it is possible to train the network so that network performance can be effectively improved. More precisely, in the course of network training, both the architecture and connection weights are iteratively modified to minimise the error from the output layer node. In fact, the produced output error is computed by a squared error function, given as
where the parameters t and y represent the target value and actual produced value, respectively. The parameter P denotes the number of training patterns.
Network learning tasks are commonly undertaken through a gradient-based learning procedure, referred to as a back-propagation (BP) learning algorithm, especially for multilayer feedforward networks. Basically, each training period in BP learning is a twofold procedure comprising a forward stage and a backward stage. During the forward stage, input signals move forward through the network, sending out error signals for each output-layer node. Subsequently, in the next stage, the resulting error rates will be passed backward along the network to modify the network weights and biases.
Depending on the network architecture, ANNs are classified into two functional groups: feedforward and feedback. One of the most commonly used variants of multilayer feedforward networks is the multilayer perceptron (MLP), in which successive layers of processing units (neurons) exchange and process information (signals) through weighted links and activation functions, respectively (Haykin 1999). In general, hidden and output neurons can perform certain specific activation functions of net input in order to produce neuron outputs. Note that each neuron output is regarded as input to the next layer of neurons. Generally speaking, the type of activation function should be selected according to the complexity of the problem to be solved. In the case of nonlinear problems, therefore, it is advisable to employ the sigmoid transfer functions, e.g. log-sigmoid and tangent sigmoid. Each of the hidden neurons is fed with the total net input in which each incoming signal (x i ) from the previous layer is multiplied by an associated adaptive weight coefficient (w ij ) to yield weighted input signals. A summation function is then applied to these weighted signals, and finally a small amount of bias is added to the aggregate signal. This process is repeated for each layer until the system’s overall output is produced. Mathematically, the total net input to every hidden or output neuron is expressed as:
For each neuron output, the resulting total net input is squashed into the activation function (e.g. sigmoid). Thus the output for every hidden or output neuron is derived as:
Figure 2 briefly demonstrates data processing operations for a typical artificial neuron.
Case study and experimental work
The data set employed in the present study relates to a hydroelectric power project which includes the construction of a roller-compacted concrete (RCC) dam, located in the Malaysian state of Sarawak (see Fig. 3). The state of Sarawak benefits from an abundant supply of water, thanks to average annual precipitation of up to 4000 mm. In addition to the high annual rainfall levels, the presence of appropriate geographical and geological conditions in the state provides the foundation for the development of hydroelectric power (HEP) dams. Given these two attributes, Sarawak is ideally positioned as a sustainable source of renewable energy.
The Lawas RCC dam is designed with output capacity of about 100 MW to meet a portion of the current state demand for electrical power. As part of exploration and subsurface investigations at the proposed site, a total of 20 boreholes were drilled to a depth of 150 m by means of a wash boring machine. This subsurface survey revealed that the substrata profiles situated below the RCC dam foundation consisted primarily of a range of sedimentary rocks with degrees of weathering ranging from fresh to moderately weathered zones. These sedimentary rocks comprise sandstone, shale, and mudstone at the foundation level, which indicates an RMR number of 40.
The data set analysed herein was developed based on the core samples of a sandstone layer at depths varying from approximately 13.50 to 81.50 m. A sufficient number of core specimens of sandstone were collected from the boreholes of ZKB1, ZKB2, ZKB3, ZKB4 and ZKB7 using the NX core barrel (54-mm core diameter). Next, the extracted core samples were packed out and then transported to the laboratory, where the geotechnical properties of the rock would be characterised numerically using several laboratory tests, including Schmidt hammer, point load, P-wave velocity and UCS.
Specimen preparation
Each core sample employed in this study was trimmed by a diamond disc cutter in order to obtain a standard cylindrical shape with a 54-mm diameter, which allows for height/diameter (H/D) ratios within an acceptable range of 2.5–3 such that troublesome size effects are eliminated. After the core samples were cut, a grinding machine was used to grind the end planes of the specimen to provide parallelism and flatness, facilitating the axial loading condition. Some typical core samples prepared for laboratory tests are shown in Fig. 4. In the present study, the drill core preparation and all testing procedures fully conformed to the guidelines of the International Society for Rock Mechanics (ISRM 2007). It is also worth pointing out that all laboratory tests were performed on air-dried core samples.
Point load index test (PLT)
A group of diametral tests were conducted to classify the strength of the core samples investigated in this study. The testing machine is equipped with a loading system of 100 kN capacity, two measuring systems (load and displacement records) and a controller unit. A loading frame, pump, ram and a pair of conical platens constitute a loading system in which the load is applied incrementally to the core specimen such that a sudden rupture occurs within 10–60 s. During the load application, the corresponding records for both failure load and distance (between core sample and platen contacts points) were monitored through a hydraulic pressure gauge and a displacement transducer, respectively, and these records were also simultaneously transferred to the controller unit, a data logger, for producing the data as an indication of the strength of the sample being tested.
Schmidt/rebound hammer test
An L-type Schmidt hammer with an impact energy of 0.74 Nm was employed to assess the surface hardness of NX core samples. In order to avoid movement and vibration during the test, the core specimens were securely clamped to a semi-cylindrical slot embedded in a steel base. The test was performed by employing a spring-driven steel hammer with vertical downward axis orientation. When a steel plunger rod is pushed against the prepared core surface, an internal spring-controlled mass with a predetermined amount of energy impacts the plunger and rebounds a certain distance. The rebound distance travelled by the mass is measured on a graduated scale as the rebound number. Based on ISRM guidelines (2007), 20 representative points with even spacing equal to at least the diameter of the plunger were determined on the surface of the core sample. Accordingly, an average of 20 valid readings for each specimen was calculated and used.
Ultrasonic velocity test
In this study, a high-frequency ultrasonic pulse technique using transducers with a frequency range of 100 kHz to 2 MHz was adopted to measure the compressional wave velocity (denoted by V p). In order to meet the full coupling condition, the end planes of specimens were uniformly covered with a thin film of a specific gel. In accordance with ISRM (2007), the transducers are first pressed against the core samples with a small stress up to 10 N/cm2 (seating force), and a pulse generator subsequently sends out an input signal of compression waves along the core axis. A direct pulse transmission method was utilised to calculate P-wave velocities by recording the time during which the waves travel from the transmitter to the receiver.
Uniaxial compressive strength test
In the present study, the uniaxial compressive strength (UCS) and deformation attributes of the rock materials were verified under uniaxial compression by means of a servo-controlled 3000 kN compression machine. All tests were conducted under a stress-controlled state in which the compressive load was applied at a constant rate of stress around 0.5–1 MPa/s. In accordance with ISRM guidelines (2007), the rock samples under compression were ruptured within 5–10 min. During the test, a set of measurements on load cell and axial strains (linear variable differential transformers [LVDTs]) were recorded for the core samples at regular intervals until failure in order to determine various rock material properties, including UCS, strain at failure \(\varepsilon\) and elastic modulus E. Similar to igneous and metamorphic rocks, these medium-grained sedimentary core samples typically display a brittle behaviour under uniaxial compression, resulting in sudden failure in the form of distinct fracture planes (see Fig. 5).
Calculating the modulus of elasticity
In engineering practice, the deformation behaviour of any rock material/mass is commonly described either through tangent elastic modulus (E tan) or secant elastic modulus (E sec). These values are determined using an analysis of the stress–strain relationship for any given rock material/mass which is subject to unconfined compression. In the case of rock material deformation measurement, it is customary to use the tangent modulus (E tan; also called the modulus of elasticity), which represents the slope of a stress–strain curve at one-half the ultimate strength (50% UCS), whereas for specifying rock mass deformation, most engineers prefer to employ the secant modulus (E sec; also known as modulus of deformation), which represents the slope of a straight line from origin (0, 0) to a certain stress–strain point corresponding to either ultimate strength or one-half the ultimate strength. The procedures representing both the E tan and E sec are illustrated in Fig. 6. In the present study, the former procedure was utilised for analysing the stress–strain curve of each sandstone material to produce the relevant tangent elastic modulus (hereinafter referred to as the modulus of elasticity and denoted by E).
In this study, a total database of 96 data samples, including R n (in a range of 20–40.5), V p (in a range of 1.67–3.16 km/s), and I s(50) (in a range of 1.43–4.29 MPa) as predictors and E (in a range of 10.5–32.22 GPa) as output, were prepared to construct the predictive models. Figure 7 demonstrates three input and one output variable with their respective data used in the modelling process for all systems under consideration. In addition, basic descriptive statistics of the database are presented in Table 4.
Simple regression analysis
In order to examine the effect of input parameters, simple regression analysis was carried out between the E and other input parameters including R n, V p and I s(50). To obtain equations with higher performance capacity, various types including linear, exponential, power and logarithmic equations were performed. In this study, the coefficient of determination (R 2), variance accounted for (VAF) and root mean square error (RMSE) were calculated to control the capacity performance of all developed models:
where y and y′ are the measured and predicted values, respectively, \(\tilde{y}\) is the mean of the y values, and N is the total number of data. The model will be excellent if R 2 = 1, VAF = 100 and RMSE = 0. The selected equations for predicting E using the above-mentioned predictors together with their performance indices are presented in Table 5, which shows that the power, linear and logarithmic equation types give the best results for predicting E using R n, V p and I s(50), respectively. The R 2 values obtained for the equations are 0.503, 0.545 and 0.445, respectively. The purposed relationships between the E and relevant parameters of the rock are given in Fig. 8. The results revealed that these relationships were statistically meaningful, but in order to obtain higher-performance models for predicting E in practice, multi-input parameters may be needed. Therefore, three types of modelling techniques—FIS, ANN and MR—were also constructed and developed.
Multi-input predictive models
Designing the Sugeno fuzzy system
This section presents the fuzzy predictive technique for predicting the E of sandstone using results of R n, V p and I s(50). For this purpose, as a first stage of modelling, the proposed data set was normalised into a unit interval [0, 1] using the following equation:
where X and X norm represent the measured and normalised values, respectively, and X min and X max are the minimum and maximum values of the measured parameters, respectively. Modelling and validating the fuzzy systems are accomplished by dividing the complete data set into training and test sets, each of which is determined as a percentage of the original data: 80% is designated to design the systems, and 20% is designated to measure the accuracy of the systems. This was implemented according to the work of several scholars, such as Swingler (1996) and Looney (1996). Therefore, in the present study, 77 data sets were randomly chosen from which to develop the models, and the remaining 19 data sets were assigned to test these models.
In this research, several FIS models of first-order Sugeno class were created based upon the subtractive clustering algorithm as an effective preprocessor to those data-based models. As a result of this preprocessing, a number of clusters were identified to produce the MFs and fuzzy conditional statements for each fuzzy system. Actually, the clustering-induced fuzzy system attempts to form a pattern for physical behaviour of the proposed empirical data set by relating the observations of three-dimensional input space (R n, V p and I s(50)) to their corresponding targets (E). Figure 9 presents a schematic of these components associated with each of the proposed fuzzy systems.
In contrast to the linear MFs, with sudden changes and breaks at the intersection points of straight lines, the MFs representing nonlinear relationships allow for a gradual, smooth movement among fuzzy sets (Jang et al. 1997). Consequently, a bell-shaped function with normal distribution, referred to as Gaussian MF, was defined to characterise each fuzzy set on the premise part of an if–then rule. The following formula expresses a Gaussian MF by its two geometric parameters c and σ:
where the parameters c and σ represent the center and spread coefficient for the Gaussian curve, respectively. As mentioned above, the identified clusters within the proposed data set provide both geometric parameters for each input MF. In addition, each of the output MFs is a first-order polynomial composed of the input variables and expressed by the following equation. The coefficients of this linear relationship are drawn from training data samples based on least squares estimation.
The fuzzy operation for each rule in the antecedent part was accomplished by AND operator: prod (product). The AND method was then used to perform the implication function whereby the antecedent part outcome (firing strength) for each rule defined a level of corresponding linear output MF, E i . Finally, a weighted average method was employed to combine all rules outcomes into a single value, representing the final output of the proposed system, E, as follows:
where the number of each rule is designated by n. In total, seven Sugeno-type fuzzy models were constructed from the training patterns given diverse parametric values of design. The resulting models were then validated using only the test set. Generally, the first stage in developing any data-based FIS is to adjust the data clustering parameters. The two characteristics associated with center coordinate and number of identified clusters can be controlled by altering the extent of the cluster radius parameter (indicated by R a ). In most cases, assigning a minimum value to the cluster radius parameter increases the size and quantity of identified clusters, and vice versa (Chiu 1994). The optimal range of the cluster radius is usually set between 0.2 and 0.5 (MATLAB user guide 2007).
In addition to cluster radius, another parameter of subtractive clustering, known as cluster neighbourhood (indicated by R b ), can be tuned to govern the range of influence of each cluster as well as the cluster numbers in the input space under consideration. More precisely, increasing the neighbourhood for each cluster allows subtractive clustering to find the centers for those clusters with larger intermediate distances. Accordingly, a number of various cluster radii (R a ) ranging from 0.2 to 0.5, along with a fixed cluster neighbourhood value (usually greater than cluster radius) for all the data dimensions, were used to create the seven fuzzy models. Table 6 presents the design parameters along with the number of fuzzy rules for each of the seven fuzzy models.
The training set and then testing set performance capacities of the proposed models are evaluated by means of R 2, VAF and RMSE, as shown in Table 7. However, selecting the best model based upon only these performance indices can be difficult, due to small differences among the pertinent statistics. For this reason, a simple ranking approach was used (Zorlu et al. 2008), in which each of the fuzzy models was graded separately according to its performance on training and test sets. As shown in the assigned total rank scores listed in Table 7, the FIS1 model was the most successful of the seven fuzzy models in predicting E values.
For the FIS1 model, the training data points were grouped into six clusters to construct the fuzzy system. Figure 10 demonstrates this natural grouping for just two selected dimensions of input space, I s(50) against E. As a result of the clustering process, the geometric parameters were produced to form Gaussian-type input MFs. These Gaussian parameters used in the model FIS1 configuration are arranged into a matrix (denoted by c) in relation to center coordinates and into a row vector (denoted by σ) in relation to spread coefficients, as indicated below:
Since the input MFs produced are equal in number to the identified clusters, six fuzzy sets with their respective linguistic labels (denoted by corresponding cluster number) are assigned to each of the input variables. Figures 11, 12 and 13 graphically depict these six antecedent fuzzy sets for the model FIS1. Similar to the input MFs, the number of propagated fuzzy rules and identified clusters would be equal. Therefore, six fuzzy conditional statements are propagated for the model FIS1, as summarised in Table 8. Functionally, each row of the rules tries to form a direct relationship between a cluster from the premise part and a cluster from the conclusion part.
Additionally, Fig. 14 shows how the antecedent and consequent MFs interact with each other in the fuzzy system FIS1. In other words, this graphical diagram simulates the system’s behaviour in mapping three-element input to a one-element target. The prediction performance of the FIS1 will be further discussed in a later section.
Designing the ANN model
Like the FIS systems, each of the ANN models utilises the same distribution of normalised original data for estimating the elastic modulus of sandstone, that is, 80% of the database is set in the training part and the remaining data sets are devoted to testing of the models. A challenging task in designing any ANN models entails adopting an optimal ANN architecture based upon the number of hidden layer(s) and the number of nodes per hidden layer. In practice, an ANN model benefiting from a single hidden layer can solve any complex fitting problem given sufficient nodes for that layer (Cybenko 1989; Hornik et al. 1989). Furthermore, theoretically, the number of hidden neurons per layer should be proportional to the problem complexity. This means that a higher degree of complexity requires additional nodes in hidden layer(s), so that they can capture the true underlying relationships of the modelled data, although an excessive increase in neuron numbers may cause certain difficulties, including overfitting and longer computation time. Thus, specifying an optimal number of nodes for each hidden layer is crucial (Sonmez et al. 2006). Table 9 lists chronologically several empirical expressions with respect to hidden node numbers. Based on the above discussion and the information in Table 9, the initial network connection structure used in this research will be composed of three input nodes, one output node and a single hidden layer with neurons ranging in number from one to seven.
Hence, the suggested network for this research will be a two-layer feedforward network consisting of sigmoid hidden nodes and a single linear output node. Based on the resulting neuron range, seven networks with different hidden node numbers are established. The neural network models are thus configured to be trained and then tested to find the optimal number of nodes in the proposed hidden layer. The RMSE is considered as convergence criterion for the training process (Simpson 1990). In addition to RMSE, other statistics are designated to assess the predictive performance of these trained networks, including R 2 and variance accounted for (VAF).
After training the proposed models, testing samples are used to put the trained models to the test and also to validate the model generalisation according to the performance results on the test data set. The task of selecting the best network performance on both training and test sets is accomplished by means of a simple ranking approach (Zorlu et al. 2008), as described earlier. Most scholars involved in the field of ANN put considerable emphasis on the importance of the learning algorithm utilised for training purposes. With regard to the efficiency of these algorithms, several studies have shown that the Levenberg–Marquardt (LM) BP algorithm has a number of distinct advantages over conventional gradient descent approaches (Hagan and Menhaj 1994). Accordingly, all the ANN models developed in this research are trained using the LM BP algorithm. Table 10 summarises the trained and tested ANN models along with their respective performance measures (RMSE, R 2, VAF) and lists rank values for both the training and testing parts of randomly chosen data.
As shown in Table 10, a set of performance measures with a maximum rank value of 36 demonstrates the superiority of model ANN4 over the others. Consequently, the optimal number of hidden nodes will be equal to four, according to the selected ANN model. Figure 15 illustrates a schematic of the optimal structure for the ANN model under consideration. Evaluation of the ANN model will be provided later.
Designing the multiple regression
MR analysis is used to determine the values of parameters for a function such that the function will best fit a provided set of data observations. With this technique, the function is a linear (straight-line) equation. MR solves engineering problems by performing a least squares fit, which constructs simultaneous equations through the creation of a regression matrix. By employing this technique, coefficients are suggested by means of a backslash operator.
Using the established normalised data set, an MR equation was developed to predict E, as shown in Eq. 11. Values of R 2 of 0.588 and 0.715 were obtained for training and testing of the proposed MR, respectively. In these models, R n, V p and I s(50) were considered as inputs, and the E was then estimated as a function of these inputs. The statistical package SPSS 11.5 (SPSS 2007) was used to construct the MR models. The predictive performance of the MR models will be examined in greater detail in the following section.
Comparing predictive performance
This section presents an evaluation of the capacity of the developed models for predicting the E. Simple regression analysis revealed the need to develop E predictive models with higher accuracy using multi-input parameters. Hence, FIS, ANN and MR models were also proposed for estimating the E of the sandstone samples. In the FIS, ANN and MR modelling procedures, all 96 data sets were randomly divided into two sets for model development and evaluation. As mentioned previously, in this study, R 2, VAF and RMSE were considered and calculated to evaluate the performance of the predictive models.
Table 11 presents the results of the models in predicting E. Based on these results, the performance of the ANN model is superior to that of both the FIS and MR for all items, and the performance of the FIS predictive model is superior to that of the MR model in most of the items. Based on the predictive performance results for both superior FIS and ANN models, the proposed ANN model is able to estimate the sandstone elastic modulus, E, for both training and testing samples with better accuracy compared to the fuzzy model. As an example, R 2 values of 0.715, 0.670 and 0.818 for the testing data sets of MR, FIS and ANN, respectively, indicate that the ANN is the best predictive model for estimating the E of the sandstone samples.
The high predictive ability of the proposed ANN model is essentially attributable to its use of iterative optimisation in predicting the response data E. In contrast, the poor predictive performance of the fuzzy model is principally due to its rule-based, one-pass mechanism that does not implement any iterative optimisation for capturing the underlying behaviour of the training data. However, to improve the predictive capability of the FIS, two common overall techniques have been proposed, both of which emphasise the fine-tuning of MF parameters of the fuzzy system (i.e. premise and consequent MFs) over a training period. In the first of these, which is referred to as adaptive network-based fuzzy inference system (ANFIS), a class of adaptive networks is converted to an equivalent FIS being used as a whole. In the second technique, a hybrid system can be developed to combine the FIS with any optimisation algorithm, such as particle swarm optimisation.
Conclusions
Several laboratory tests, including uniaxial compressive strength, Schmidt hammer, point load strength and P-wave velocity, were conducted on 96 samples of sandstone. These core samples were acquired from sites in the state of Sarawak, Malaysia, and sample preparation and testing were carried out in accordance with ISRM guidelines. As a target of this study, elastic modulus values were obtained after conducting UCS tests.
Based on simple regression analysis, the relationships between the E and other predictors were found to be acceptable. Nevertheless, in order to obtain models with higher accuracy, MR, FIS and ANN models were also developed. Based on model performance indices and using a simple ranking method, the best FIS and ANN models were chosen from among the group of models constructed, and then using the same data sets, an MR model was developed to predict the E of the rock. The indices R 2, VAF and RMSE were utilised to check the predictive performance of the models, with results revealing the ANN to be the best predictive model. Based on RMSE, results of 0.167, 0.151 and 0.127 were obtained for testing data sets of the MR, FIS and ANN models, respectively, demonstrating the higher capacity of the ANN model in estimating modulus of elasticity of the rock. It should be noted, however, that the predictive models proposed in this study were designed based on the properties of sandstone rock samples; hence, direct implementation of the models must be undertaken with caution and for similar conditions.
References
Armaghani DJ, Amin MFM, Yagiz S, Faradonbeh RS, Abdullah RA (2016) Prediction of the uniaxial compressive strength of sandstone using various modeling techniques. Int J Rock Mech Min Sci 85:174–186
Bai Y, Zhuang H, Wang D (2007) Advanced fuzzy logic technologies in industrial applications. Springer Science & Business Media, London
Barton N (2002) Some new Q value correlations to assist in site characterisation and tunnel design. Int J Rock Mech Min Sci 39:185–216
Barton N, Lien R, Lunde J (1974) Engineering classification of rock masses for the design of tunnel support. Rock Mech 6(4):189–236
Beiki M, Majdi A, Givshad AD (2013) Application of genetic programming to predict the uniaxial compressive strength and elastic modulus of carbonate rocks. Int J Rock Mech Min Sci 63:159–169
Bieniawski ZT (1973) Engineering classification of rock masses. Trans S African Inst Civ Engrs 15(12):335–344
Bieniawski ZT (1978) Determining rock mass deformability—experience from case histories. Int J Rock Mech Min Sci Geomech Abstr 15:237–247
Chiu SL (1994) Fuzzy model identification based on cluster estimation. J Intell Fuzzy Syst 2:267–278
Cybenko J (1989) Approximations by superpositions of a sigmoidal function. Math Control Signal Syst 2:303–314
Dehghan S, Sattari GH, Chehreh CS, Aliabadi MA (2010) Prediction of unconfined compressive strength and modulus of elasticity for Travertine samples using regression and artificial neural. New Min Sci Technol 20:0041–0046
Diederichs MS, Kaiser PK (1999) Stability of large excavations in laminated hard rockmasses: the Voussoir analogue revisited. Int J Rock Mech Min Sci 36:97–117
Dincer I, Acar A, Cobanoglu I, Uras Y (2004) Correlation between Schmidt hardness, uniaxial compressive strength and Young’s modulus for andesites, basalts and tuffs. Bull Eng Geol Environ 63:141–148
Feng XT, Hudson JA (2004) The ways ahead for rock engineering design methodologies. Int J Rock Mech Min Sci 41(2):255–273
Feng XT, Hudson JA (2010) Specifying the information required for rock mechanics modelling and rock engineering design. Int J Rock Mech Min Sci 47(2):179–194
Gokceoglu C, Zorlu K (2004) A fuzzy model to predict the unconfined compressive strength and modulus of elasticity of a problematic rock. Eng Appl Artif Intell 17:61–72
Hagan MT, Menhaj MB (1994) Training feed forward networks with the Marquardt algorithm. IEEE Trans Neural Netw 5:861–867
Haykin S (1999) Neural networks, 2nd edn. Prentice-Hall, Englewood Cliffs
Hecht-Nielsen R (1987) Kolmogorov’s mapping neural network existence theorem. In: Proceedings of the first IEEE international conference on neural networks, San Diego, CA, USA, pp 11–14
Hoek E, Brown ET (1997) Practical estimates of rock mass strength. Int J Rock Mech Min Sci 34(8):1165–1186
Hoek E, Diederichs MS (2006) Empirical estimation of rock mass modulus. Int J Rock Mech Min Sci 43:203–215
Hornik K, Stinchcombe M, White H (1989) Multilayer feedforward networks are universal approximators. Neural Netw 2:359–366
Hudson JA, Feng XT (2007) Updated flowcharts for rock mechanics modelling and rock engineering design. Int J Rock Mech Min Sci 44(2):174–195
ISRM (2007) The complete ISRM suggested methods for rock characterization, testing and monitoring: 1974–2006. In: Ulusay R, Hudson JA (eds) Suggested methods prepared by the commission on testing methods, international society for rock mechanics, compilation arranged by the ISRM Turkish National Group, Ankara, Turkey
Jahed Armaghani D, Tonnizam Mohamad E, Momeni E, Narayanasamy MS, Mohd Amin MF (2014) An adaptive neuro-fuzzy inference system for predicting unconfined compressive strength and Young’s modulus: a study on Main Range granite. Bull Eng Geol Environ. doi:10.1007/s10064-014-0687-4
Jahed Armaghani D, Mohamad ET, Hajihassani M, Yagiz S, Motaghedi H (2015) Application of several non-linear prediction tools for estimating uniaxial compressive strength of granitic rocks and comparison of their performances. Eng Comput. doi:10.1007/s00366-015-0410-5
Jahed Armaghani D, Mohamad ET, Momeni E, Monjezi M, Narayanasamy MS (2016) Prediction of the strength and elasticity modulus of granite through an expert artificial neural network. Arab J Geosci 9(1):1–16. doi:10.1007/s12517-015-2057-3
Jain AK, Dubes RC (1988) Algorithms for clustering data. Prentice-Hall, Englewood Cliffs
Jain AK, Murty MN, Flynn PJ (1999) ACM Computing Surveys, Vol. 31, No. 3, September
Jang JSR, Sun CT, Mizutani E (1997) Neuro-fuzzy and soft computing: a computational approach to learning and machine intelligence. Prentice Hall, Inc, Englwood Cliffs, NJ
Kaastra I, Boyd M (1996) Designing a neural network for forecasting financial and economic time series. Neurocomputing 10:215–236
Kahraman S, Gunaydin O, Alber M, Fener M (2009) Evaluating the strength and deformability properties of Misis Fault Breccia using Artificial Neural Networks. Expert Syst Appl 36(3):6874–6878
Kanellopoulas I, Wilkinson GG (1997) Strategies and best practice for neural network image classification. Int J Remote Sens 18:711–725
Lashkaripour GR (2002) Predicting mechanical properties of mudrock from index parameters. Bull Eng Geol Environ 61:73–77
Leite MH, Ferland F (2001) Determination of unconfined compressive strength and Young’s modulus of porous materials by indentation tests. Eng Geol 59:267–280
Looney CG (1996) Advances in feed-forward neural networks: demystifying knowledge acquiring black boxes. IEEE Trans Knowl Data Eng 8(2):211–226
Majdi A, Beiki M (2010) Evolving neural network using a genetic algorithm for predicting the deformation modulus of rock masses. Int J Rock Mech Min Sci 47(2):246–253
Mamdani EH, Assilian S (1975) An experimental in linguistic synthesis with a fuzzy logic control. Int J Man-Mach Stud 7:1–13
Martin DM, Stimpson B (1994) The effect of sample disturbance on laboratory properties of Lac du Bonnet granite. Can Geotechn J 31(5):692–702
Masters T (1994) Practical neural network recipes in C++. Academic, Boston
MATLAB user’s guide (2007) Fuzzy logic Toolbox user’s guide (MATLAB CD-ROM). Mathworks Inc, Natick, p 299
McCulloch WS, Pitts W (1943) A logical calculus of the ideas immanent in nervous activity. Bull Math Biophys 5:115–133
Mishra DA, Basu A (2013) Estimation of uniaxial compressive strength of rock materials by index tests using regression analysis and fuzzy inference system. Eng Geol 160:54–68
Mishra DA, Srigyan M, Basu A, Rokade PJ (2015) Soft computing methods for estimating the uniaxial compressive strength of intact rock from index tests. Int J Rock Mech Min Sci 80:418–424
Mitri HS, Edrissi R, Henning J (1994) Finite element modeling of cable bolted stopes in hard rock ground mines. In: Presented at the SME annual meeting, New Mexico, Albuquerque, pp 94–116
Mohamad ET, Armaghani DJ, Momeni E, Abad SVANK (2015) Prediction of the unconfined compressive strength of soft rocks: a PSO-based ANN approach. Bull Eng Geol Environ 4:745–757
Momeni E, Jahed Armaghani D, Hajihassani M, Amin MFM (2015) Prediction of uniaxial compressive strength of rock samples using hybrid particle swarm optimization-based artificial neural networks. Measurement 60:50–63
Moradian ZA, Behnia M (2009) Predicting the uniaxial compressive strength and static Young’s modulus of intact sedimentary rocks using the ultrasonic test. Int J Geomech 9(1):14–19
Nefeslioglu HA (2013) Evaluation of geo-mechanical properties of very weak and weak rock materials by using non-destructive techniques: ultrasonic pulse velocity measurements and reflectance spectroscopy. Eng Geol 160:8–20
Nicholson GA, Bieniawski ZT (1990) A nonlinear deformation modulus based on rock mass classification. Int J Min Geol Eng 8:181–202
Paola JD (1994) Neural network classification of multispectral imagery. MSc thesis, The University of Arizona, USA
Read SAL, Richards LR, Perrin ND (1999) Applicability of the Hoek–Brown failure criterion to New Zealand greywacke rocks, vol 2. In: Proceedings 9th Int. Society for Rock Mechanics Congress, Paris, pp 655–660
Rezaei M, Majdi A, Monjezi M (2012) An intelligent approach to predict unconfined compressive strength of rock surrounding access tunnels in longwall coal mining. Neural Comput Appl 24(1):233–241
Ripley BD (1993) Statistical aspects of neural networks. In: Barndoff- Neilsen OE, Jensen JL, Kendall WS (eds) Networks and chaos-statistical and probabilistic aspects. Chapman and Hall, London, pp 40–123
Rutkowski L (2004) Flexible neuro-fuzzy systems: structures, learning and performance evaluation. Kluwer Academic, New York
Sachpazis CI (1990) Correlating Schmidt hardness with compressive strength and Young’s modulus of carbonate rocks. Bull Int Assoc Eng Geol 42:75–84
Serafim JL, Pereira JP (1983) Considerations of the geomechanics classification of Bieniawski. In: Proceedings of the International Symposium on Engineering Geology and Underground Construction, Laboratorio National De Engenharia Civil, Lisbon, Spain. A.A. Balkema, Rotterdam, the Netherlands, vol 1, pp 1133–1142
Shams S, Monjezi M, Johari Majd V, Jahed Armaghani D (2015) Application of fuzzy inference system for prediction of rock fragmentation induced by blasting. Arab J Geosci 8:10819–10832
Simpson P (1990) Artificial neural system: foundation, paradigms, applications and implementations. Pergamon, New York
Singh R, Kainthola A, Singh TN (2012) Estimation of elastic constant of rocks using an ANFIS approach. Appl Soft Comput 12(1):40–45
Sivanandam SN, Sumathi S, Deepa SN (2007) Introduction to fuzzy logic using MATLAB. Springer, Berlin, Heidelberg
Sonmez H, Gokceoglu C, Ulusay R (2004) Indirect determination of the modulus of deformation of rock masses based on the GSI system. Int J Rock Mech Min Sci 1:849–857
Sonmez H, Gokceoglu C, Nefeslioglu HA, Kayabasi A (2006) Estimation of rock modulus: for intact rocks with an artificial neural network and for rock masses with a new empirical equation. Int J Rock Mech Min Sci 43:224–235
SPSS Inc (2007) SPSS for Windows (Version 16.0). SPSS Inc, Chicago
Swingler K (1996) Applying neural networks: a practical guide. Academic, New York
Takagi T, Sugeno M (1985) Identification of systems and its application to modeling and control. IEEE Trans Syst Man Cybern 15:116–132
Wang C (1994) A theory of generalization in learning machines with neural application. PhD thesis, The University of Pennsylvania, USA
Yager RR, Filev DP (1994a) Generation of fuzzy rules by mountain clustering. J Intell Fuzzy Syst 2:209–219
Yager RR, Filev DP (1994b) Essentials of fuzzy modeling and control. Wiley, New York
Yagiz S, Sezer EA, Gokceoglu C (2012) Artificial neural networks and nonlinear regression techniques to assess the influence of slake durability cycles on the prediction of uniaxial compressive strength and modulus of elasticity for carbonate rocks. Int J Numer Anal Method 36(14):1636–1650
Yasar E, Erdogan Y (2004) Correlating sound velocity with the density, compressive strength and Young’s modulus of carbonate rocks. Int J Rock Mech Min Sci 41(5):871–875
Yazdani Bejarbaneh B, Armaghani DJ, Amin MFM (2015) Strength characterisation of shale using Mohr-Coulomb and Hoek-Brown criteria. Measurement 63:269–281
Yilmaz I, Sendir H (2002) Correlation of Schmidt hardness with unconfined compressive strength and Young’s modulus in gypsum from Sivas (Turkey). Eng Geol 66(3):211–219
Yilmaz I, Yuksek AG (2008) An example of artificial neural network (ANN) application for indirect estimation of rock parameters. Rock Mech Rock Eng 41(5):781–795
Yilmaz I, Yuksek G (2009) Prediction of the strength and elasticity modulus of gypsum using multiple regression, ANN, and ANFIS models. Int J Rock Mech Min Sci 46(4):803–810
Zadeh LA (1965) Fuzzy sets. Inform. Control 8:338–353
Zadeh LA (1973) Outline of a new approach to the analysis of complex systems and decision processes. IEEE Trans Syst Man Cybern 3:28–44
Zorlu K, Gokceoglu C, Ocakoglu F, Nefeslioglu HA, Acikalin S (2008) Prediction of uniaxial compressive strength of sandstones using petrography-based models. Eng Geol 96(3):141–158
Zurada JM (1992) Introduction to artificial neural systems. West, St Paul
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bejarbaneh, B.Y., Bejarbaneh, E.Y., Amin, M.F.M. et al. Intelligent modelling of sandstone deformation behaviour using fuzzy logic and neural network systems. Bull Eng Geol Environ 77, 345–361 (2018). https://doi.org/10.1007/s10064-016-0983-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10064-016-0983-2