1 Introduction

Determining the geotechnical properties of rocks is considered to be the most important component in different projects such as tunnels, dams, mining, rock mass classification (Khajevand and Fereidooni 2018; Diamantis et al. 2021; Hassan et al. 2021), rock slopes stability (Johari et al. 2013, 2015), oil and gas reservoirs (Wang et al. 2020), and damage to historical buildings (Hemeda and Pitilakis 2010). Ultrasonic measurement is one of the non-destructive geophysical methods commonly used in engineering projects that are standardized by the American Society for Testing and Materials (ASTM) and the International Society for Rock Mechanics (ISRM). In rock engineering, ultrasonic testing has become very popular due to its low cost, simplicity, portable nature, and high precision (Chawre 2018). This method can be applied in the laboratory and field (Ajalloeian et al. 2020). Until now, ultrasonic wave velocity has been widely used as a reliable geotechnical test by various researchers for predicting the mechanical and physical properties of different rocks (Tuğrul and Zarif 1999; Kahraman 2001; Yasar and Erdogan 2004; Kilic and Teymen 2008; Diamantis 2019; Kianpour et al. 2019; Wang and Han 2020; Arman and Paramban 2021). Evaluating the strength of rocks by performing direct tests is the most expensive and requires high-quality core specimens and considerable time. Therefore, indirect laboratory tests such as ultrasonic wave velocity are often used for predicting the mechanical properties of rocks (Khajevand and Fereidooni 2019).

In engineering geology and rock mechanics, various simple and multivariate regression models have been developed for correlation between P-wave velocity and some physical and mechanical properties (e.g., Cobanoğlu and Çelik 2008; Diamantis et al. 2011, 2012; Yagiz 2011; Fener 2011; Tandon and Gupta 2015; Kurtulus et al. 2016; Shirole et al. 2020; Yang et al. 2020; Benavente et al. 2020; Khajevand and Fereidooni 2022). Several researchers attempted to determine the uniaxial compressive strength (UCS) of different rocks by wave velocity (Khandelwal and Singh 2009; Altindag 2012; Azimian and Ajalloeian 2015; Ferentinou and Fakir 2017; Heidari et al. 2018; Aboutaleb et al. 2017; Armaghani et al. 2018; Aldeeky and Al Hattamleh 2018; Mohammed et al. 2019; Aliyu et al. 2019; Salehin et al. 2020; Vasanelli et al. 2020; Teymen and Mengüç 2020). Some researchers have utilized soft computing approaches, including artificial neural network (ANN) (Sarkar et al. 2010; Ceryan et al. 2013; Momeni et al. 2015; Mozumder and Laskar 2015; Armaghani et al. 2016; Sharma et al. 2017; Paul et al. 2018; Mahdiabadi and Khanlari 2019; Moussas and Diamantis 2021) and adaptive neuro-fuzzy inference system (ANFIS) (e.g., Singh et al. 2013; Basarir and Dincer 2019; Jing et al. 2021) for estimating geotechnical characteristics of different rocks based on ultrasonic wave velocity.

In this regard, Singh et al. (2017) attempted to predict the compressive and shear strength of rocks from point load index, tensile strength, unit weight, and ultrasonic velocity using statistical analysis and neural networks. This study showed ANN can predict the same strength parameters with better reliability than regression analysis. Shen et al. (2017), using P-wave modulus, estimated the mechanical properties of the rock mass. Jamshidi et al. (2018) evaluated the effect of porosity and density on the correlation between UCS and P-wave velocity. Ghafoori et al. (2018) established correlations between the physical, mechanical, and dynamic properties of limestone through statistical analysis. A strong correlation between UCS, E, and wave velocity was found. Asheghi et al. (2019) predicted UCS of different quarry rocks by a generalized feed-forward neural network (GFFN) and some engineering properties, including density, porosity, P-wave velocity, point load index, and water absorption. Saedi et al. (2019) used multiple regression and fuzzy inference system to estimate UCS and E of migmatites samples. Some index tests including porosity, cylindrical punch index, block punch index, Brazilian tensile strength, point load index, and P-wave velocity were determined and employed to develop predictive models. Ersoy et al. (2019) studied the effect of sample dimension on ultrasonic wave velocity. The results of this study indicated that wave velocity is more controlled by material homogeneity rather than sample length or diameter. Abdelhedi et al. (2020) used a non-destructive ultrasonic technique for studying the mechanical strength of carbonate rock aggregates. Heidari et al. (2020) investigated the dynamic characteristics of pyroclastic rocks by ultrasonic tests during loading in triaxial tests. Ding et al. (2020) investigated the P-wave velocity of anisotropic slate with various foliation orientations and indicated VP increases with increasing foliation angle. Amirkiyaei et al. (2020) determined the VP of some carbonate building stones including, limestone, travertine, and marble, during freeze–thaw (F–T) cycles and the performance of the obtained experimental equations was evaluated using statistical indices. Ajalloeian et al. (2020) studied the effects of petrographic properties on ultrasonic wave velocity including, P-wave and S-wave velocity and dynamic elastic constants of granitic rocks. The results showed that the grain size of minerals and the ratio of quartz to feldspar are reliable parameters for estimating the VP and VS.

In this research, geotechnical tests such as dry density, porosity, and ultrasonic P-wave velocity were used to determine the mechanical properties including, point load strength, Brazilian tensile strength, block punch strength, and uniaxial compressive strength of some sedimentary rocks. Studied samples, including sandstone, limestone, travertine, and conglomerate, were collected from north and northwest parts of Damghan in northeast Iran. Considering the advantages of the ultrasonic wave velocity test mentioned in the literature, the main aim of this research is to determine the strength of sedimentary rocks most simply and cheaply possible in the laboratory based on ultrasonic P-wave velocity. During field investigations, the lithological units were described, and the studied samples were classified according to mineralogical studies and laboratory investigations. Finally, the results of this research were analyzed using multiple regression analysis, artificial neural network, and adaptive neuro-fuzzy inference system to propose some predictive models for estimating the strength of rocks.

From a practical point of view, the present research is generally significant because construction materials and building stones in this region are dominantly composed of sedimentary rocks. These rocks are extracted from quarries and mines and have been used in various engineering projects, such as pavement, foundations, kerb, railway ballast, the rip-rap of dams, and facade stone. The novelty of this research is in the comprehensiveness of geotechnical tests and strength parameters so that all parameters have been reviewed and compared with each other by statistical methods and soft computing approaches, which is not seen in previous research.

2 Material and Methods

2.1 Field Investigations

Northern and northwestern parts of Damghan in northeast Iran were selected as study areas. From a geological point of view, the area is located in the Alborz-Azarbayejan structural zone, and a complete succession of Paleozoic to Neogene rocks exists. Suitable sampling locations were selected from the 1:100,000 geological map prepared by the Geological Society of Iran (GSI 1977). Four different sedimentary rocks, including limestone, sandstone, travertine, and conglomerate, were selected for the study. Sampling has been carried out from the quarries, trenches, and road cuttings. Firstly, the specimens were inspected macroscopically, and only the homogeneous, isotropic, un-weathered, and free of visible joints, cracks, fissures, or other discontinuities were considered.

2.2 Mineralogical and Petrographic Analysis

In this research, thin sections were prepared for studying mineralogical and petrographic properties by optical microscopic observations using a modern polarizing microscope based on ISRM (2007). The mineral content of the rock samples was quantified based on the point-counting method. Based on the results of the mineralogical investigations, the studied samples were commonly composed of quartz and calcite. The investigated rock types and the average modal abundance of their minerals are presented in Table 1. X-ray diffraction analysis was conducted according to ISRM (2007) and compositions of the samples were determined in the 2θ ranges from 4° to 70°. Results show that calcite and quartz are the two most important minerals in all samples. Microscopic images of the samples in polarized light are shown in Fig. 1a–d. Also, the diffractograms of XRD analysis for the studied samples are shown in Fig. 2a–d.

Table 1 Mineral composition of the studied rocks
Fig. 1
figure 1

Microscopic images of a sandstone, b limestone, c travertine, and d conglomerate

Fig. 2
figure 2

The XRD diffractograms of a sandstone, b limestone, c travertine, and d conglomerate

2.3 Sample Preparation

Rock blocks were cored in the laboratory using 54 mm diameter (NX core) diamond coring bits. The cut end faces of cores were smoothed and made perpendicular to the core axes with a polishing and lapping machine. Point load strength, Brazilian tensile strength, block punch strength, and uniaxial compressive strength were performed as mechanical tests on high-quality prepared cylindrical core specimens. Before destructive tests, physical properties and ultrasonic P-wave velocity (UPV) tests were carried out as non-destructive tests. To avoid any influence on mechanical properties, the length-to-diameter ratio of the tested specimens was prepared carefully following the suggested method of ISRM (2007). All tests were conducted at room temperature and under the same conditions.

2.4 physical Properties

Physical Properties including dry density (γdry) and effective porosity (ne) were determined using prepared NX core specimens following the standard procedures recommended in ISRM (2007). The dry density of the studied rocks was obtained from the ratio of sample weight to bulk volume. The values of effective porosity for each specimen were obtained from the ratio of pore volume to the sample volume. The dry mass of the studied samples was determined after drying in an oven at a temperature of 110 °C for a period of 24 h. The average values of physical properties are presented in Table 2. According to the results, the TS6-1 and LS1-1 samples have minimum and maximum values of dry density equal to 2.11 and 2.63, respectively. Also, the samples of TS3-2 and SS3-2 have minimum and maximum values of effective porosity, equal to 2.24 and 10.95, respectively.

Table 2 Values of physical and mechanical properties of the studied rocks

2.5 Point Load Strength

Point load test is widely used to evaluate the strength of rocks, as it is easy to prepare and test the specimens. In the present research, the axial point load test was performed to measure the point load index (IS(50)) of the studied samples. Obtained NX core samples with a length-to-diameter ratio equal to 1 were prepared, and all tests were performed using automatic point load apparatus. The formula of IS is (ISRM 1985; ASTM 2008):

$$I_{S} = \frac{P}{{D_{e}^{2} }}$$
(1)

where P is the applied force and De is the distance between the platens at failure. The point load index for a core diameter equal to 50 mm (Is(50)) was calculated from the following formula:

$$I_{s(50)} = F \times I_{s} = \left( {\frac{{D_{e} }}{50}} \right)^{0.45}$$
(2)

where F is the size correction factor, F = (De/50)0.45; Is is the uncorrelated point load strength, MPa; and De is the equivalent core diameter in mm.

The point load test was performed based on ISRM (2007) and values of the point load strength are presented in Table 2. According to the results, the CS2-2 and LS3-2 samples had the minimum and maximum values of the point load index, equal to 4.87 and 12.06 MPa, respectively. According to the histogram, the average and standard deviation (Std. Dev.) of IS(50) are equal to 7.91 MPa and 1.98, respectively (Fig. 3). Based on Broch and Franklin (1972) classification of the strength of intact rocks, the studied samples were considered as high to extremely high strength rocks. The failure mode of the study samples is presented in Fig. 4 and shows tested specimens fail in typical tensile splitting failure modes under point load tests. As regards axial point load tests were performed, in most samples, a unique failure mode was observed and rock specimens were split into two pieces. Also, a few specimens fail to split into three blocks.

Fig. 3
figure 3

Histogram of point load strength and normal curve distribution

Fig. 4
figure 4

Failure mode of the studied samples under the point load test

2.6 Brazilian Tensile Strength

The Brazilian tensile strength test (also known as indirect tensile strength) was performed on NX-prepared disk-shaped core specimens with lengths-to-diameter ratios from 0.5 to 0.75, according to ISRM (2007). The Brazilian tensile strength was calculated by the following formula:

$${\text{BTS}} = \frac{2P}{{\pi Dt}}$$
(3)

where P is the maximum load recorded during the test and D is the diameter and t is the thickness of the specimen.

The obtained values of BTS for the studied rock samples are presented in Table 2. According to the results, the CS2-2 and LS3-2 samples had the minimum and maximum BTS values, equal to 2.18 and 9.70 MPa, respectively. The histogram of BTS which is presented in Fig. 5, indicated the average and standard deviation of BTS are equal to 5.91 MPa and 2.02, respectively. Two types of failure modes under the BTS test were observed, which are presented in Fig. 6. In sandstone and travertine samples, central and central multiple failure modes were found. While in limestone and conglomerate specimens, central and layer activation failure patterns were observed.

Fig. 5
figure 5

Histogram of Brazilian tensile strength and normal curve distribution

Fig. 6
figure 6

Failure mode of the studied samples under the Brazilian tensile strength test

2.7 Block Punch Strength

To carry out the block punch strength test, the thin disk-shaped NX core specimens are cut into discs of various raw thicknesses ranging between 5 and 15 mm using a diamond saw perpendicular to the core axis. This test was performed based on the ISRM (2007) suggested method using the block punch apparatus that was made in this research. The corrected block punch index (BPIC) can be calculated as follows:

$$BPI_{C} = 3499D^{ - 1.3926} \times t^{ - 1.1265} \times P$$
(4)

where D and t are the diameter and thickness of the disc specimen in mm and P is the failure load in kN.

Results of the block punch test are presented in Table 2 and indicated the CS2-2 and LS1-3 samples had the minimum and maximum values of BPS, equal to 2.18 and 10.54 MPa, respectively. Based on the histogram of BPS, the average and standard deviation are equal to 6.37 MPa and 1.65, respectively (Fig. 7). According to Ulusay et al. (2001), rock classification, TS3, and CS2 are weak, and other tested samples were classified as moderate rocks.

Fig. 7
figure 7

Histogram of block punch strength and normal curve distribution

The failure modes of the study rocks under the block punch test are presented in Fig. 8 and show that after failure, the specimens are broken into three parts. Two marginal parts of the specimens were fixed in the punch apparatus and the middle part was punched out. As can be seen, regular failure occurred and fractured planes were parallel together. In a few travertine samples, cross joints developed during the block punch test, and unexampled fracture planes as irregular failure was observed. For this reason, the test was rejected as invalid results and the block punch test was repeated.

Fig. 8
figure 8

Failure mode of the studied samples under the block punch strength test

2.8 Uniaxial Compressive Strength

The uniaxial compressive test was conducted with a length-to-diameter ratio of 2 to 2.5 based on ASTM (1995) and ISRM (2007) suggested methods. By using the following equation, the UCS of the rock samples can be computed:

$${\text{UCS}} = \frac{P}{A}$$
(5)

where P is the failure load and A cross-sectional area of the cylindrical specimen.

The 50 kg/Sec stress rate was applied to the UCS test and the automatic machine having 200 tons of capacity was used. The UCS test was repeated at least five times for each rock type, and the average value was calculated as the UCS that is presented in Table 2. According to the results, the CS2-2 and LS1-1 samples had the minimum and maximum values of UCS, equal to 14.07 and 53.49 MPa, respectively. The mean values and standard deviation of UCS are equal to 29.07 MPa and 11.61, respectively. Normal distribution curves of the UCS test are also shown in the histogram in Fig. 9. Higher standard deviation values of UCS indicated that studied samples had various ranges of strength. Based on ISRM (1981) rock classification, the tested samples were classified as low to moderate strong rocks.

Fig. 9
figure 9

Histogram of uniaxial compressive strength and normal curve distribution

Various failure patterns of the studied rocks were observed under the UCS test that are presented in Fig. 10. Results of the UCS test show that most of the sandstone specimens failed in a Y-shaped. The failure mode of limestone samples is axial splitting. In the case of travertine samples, most specimens failed along layering. Most conglomerate specimens have axial splitting, whereas in many samples, shearing along a single plane was observed.

Fig. 10
figure 10

Failure mode of the studied samples under uniaxial compressive strength test

2.9 Ultrasonic P-wave Velocity

The ultrasonic P-wave velocity (VP) of the studied rocks was determined using a Portable Ultrasonic Non-destructive Digital Indicating Tester (PUNDIT) in the laboratory, based on ASTM (1996) and ISRM (2007). The PUNDIT has a pulse generator, transducers, and an electronic counter for time interval measurements. An accuracy of 0.1 mm to the length of the measuring base was used. Before the measurements, the cut ends of the samples were polished to provide a flat and smooth surface. A thin layer of petroleum jelly (Vaseline) was fulfilled to the surface of the transducers (receiver and transmitter) to provide complete contact and to remove the air gap between the transducers and the specimen surface. In the tests, the ultrasonic pulse time was measured with an accuracy of 0.1 ms and at ultrasonic frequencies of 500 kHz. The UPV was obtained by dividing the length of the NX-sized core and the travel pulse time. P-wave velocity apparatus and prepared core specimens are presented in Fig. 11a and b, respectively.

Fig. 11
figure 11

a P-wave velocity measurement and b core specimens

Results of the P-wave velocity test show that the CS2-2 and TS1-1 samples had the minimum and maximum values of wave velocities, equal to 2826 and 6053 m/s respectively, as presented in Table 3. The histogram of VP (Fig. 12) indicated that the average and standard deviation are equal to 4303.38 m/s and 907.65, respectively. Based on IAEG (1979) rock classification, the P-wave velocity values of the studied rocks ranged from low to very high. Also, according to Bell (2000), the studied samples were classified as weathered to fresh rocks based on P-wave velocity values.

Table 3 Values of P-wave velocity, pulse time, and P-wave modulus and description of VP
Fig. 12
figure 12

Histogram of P-wave velocity test and normal curve distribution

Travel pulse time and P-wave modulus for studied samples are also calculated and presented in Table 3. Pulse times are in the range of 19.9–42.9 μs. P-wave modulus was calculated by the following formula, which is in the range of 20.92 to 69.87 GPa.

$$M = \rho V_{p}^{2}$$
(6)

where M is P-wave modulus in GPa, and ρ is density in g/cm3.

Obtained echo signal of P-wave velocity, pulse time, and P-wave modulus is presented in Fig. 13a–c. these figures show the first echo arrives at approximately 19.9 μs and corresponds to the TS1-1 with the highest wave velocity equal to 6053 m/s. A lesser echo wave appears in 42.9 μs which corresponds to CS2-2. The correlation between P-wave velocity values, pulse time, and P-wave modulus was performed by simple regression analysis and is presented in Fig. 14. According to Fig. 14a, the relation between VP and time is a reverse power relation, with R2 = 0.99. Therefore, when pulse time travel increased, the VP decreased. Whereas the relation between VP and P-wave modulus is, a direct linear relation with R2 = 0.98 that shows these parameters increased together (Fig. 14b).

Fig. 13
figure 13

a Echo signal of P-wave velocity, b pulse time, and c P-wave modulus

Fig. 14
figure 14

Correlation between P-wave velocity with a pulse time, and b P-wave modulus

3 Results

In the present research, mechanical properties including point load strength, Brazilian tensile strength, block punch strength, and uniaxial compressive strength of some sedimentary rocks have been predicted by dry density, porosity, and ultrasonic P-wave velocity by using multivariate regression analysis and soft computing methods including artificial neural network and adaptive neuro-fuzzy inference system. The results of this research are described and discussed in the next sections.

3.1 Multivariate Regression Analysis

In recent years, simple and multivariate regression analysis (MRA) are widely used as a statistical technique in various fields of geotechnical engineering and engineering geology to develop mathematical expression between significant geotechnical parameters of soils and rocks (Sharma and Singh 2018). Regression analysis is a statistical tool for estimating the relationship between variables and predicting unknown from known parameters (Yagiz 2011). Multivariate regression is a classic linear statistical forecasting technique for understanding the relationship between a dependent variable and two or more independent variables. The multivariate regression technique formulates a model to obtain an estimation of the values of the dependent variable by fitting a linear equation to observed variables. Generally in the form of the regression model is expressed as follows:

$$y = \alpha_{0} + \alpha_{1} x_{1} + \alpha_{2} x_{2} + \cdots + \alpha_{n} x_{n} \pm \beta$$
(7)

where y is the value of the variable that is to be determined, α0 is the y-intercept by the regression line, x1, x2, …, xn are the values of dependent variables, β represents the error, α1, α2, …, αn are the regression coefficient, defined as the amount of the response variable changes due to change in the corresponding dependent variable by 1 unit (Khandelwal et al. 2017; Sharma et al. 2017).

In this research, multivariate regression analyses were modeled by IBM SPSS Statistics (v. 24). In this method, γdry, ne, and VP are considered as independent variables or predictors, and mechanical properties including PLS, BTS, BPS, and UCS are considered as the dependent variables or outputs and were modeled in models 1 to 4, respectively. Table 4 presents the summary of coefficients of multivariate regression results including the Beta coefficient, standard error of Beta, significant level (Sig.), lower and upper confidence interval, tolerance, and VIF of established models. Table 5 presents the summary of the multivariate correlation coefficient including R, R square, adjusted R, standard error, and Durbin-Watson of established models. The coefficient of determination (R2) is equated in Eq. 8, as follows:

$$R^{2} = \left( {\frac{{\sum_{i = 1}^{n} \left( {y_{i} - \overline{y} } \right)\left( {y^{\prime}_{i} - \overline{{y^{\prime}}} } \right)}}{{\left( {\sqrt {\sum_{i = 1}^{n} \left( {y_{i} - \overline{y} } \right)^{2} } } \right)\left( {\sqrt {\sum_{i = 1}^{n} \left( {y^{\prime}_{i} - \overline{{y^{\prime}}} } \right)^{2} } } \right)}}} \right)$$
(8)

where y and y′ are the experimental and predicted values, respectively. \(\overline{y}\) is the average of the set y and n is the number of data. The coefficient of determination measures the proportion of variance in the dependent variable that is predictable from the independent variables. It is the square of the correlation coefficient (R) and ranges from 0 to 1.

Table 4 Coefficients of multiple regression analysis
Table 5 Summary of the results of multiple regression coefficients

By obtaining coefficients from multivariate regression analysis, four experimental models were developed as follows:

$${\text{PLS}} = - 8.15 + 2.70\gamma_{d} + 0.29n_{e} + 0.002V_{p}$$
(9)
$${\text{BTS}} = 2.52 - 2.08\gamma_{d} + 0.06n_{e} + 0.002V_{p}$$
(10)
$${\text{BPS}} = - 11.21 + 3.60\gamma_{d} + 0.43n_{e} + 0.001V_{p}$$
(11)
$${\text{UCS}} = - 71.18 + 18.23\gamma_{d} + 2.52n_{e} + 0.01V_{p}$$
(12)

3.2 Artificial Neural Network

An artificial neural network (ANN) is an intelligent soft computing method that has been used to predict various complex parameters in recent years. It consists of numerous simple processing elements (neurons) capable of performing complex data processing and knowledge representation (Sarkar et al. 2010). The neural network is normally trained by processing a large number of input and output patterns to achieve matching and prediction. The structure and data processing technique of ANN is inspired by the human brain. The configuration of ANN is composed of a highly layered structure that constitutes many simple processing elements capable of performing highly complex computations for data processing and knowledge representation (Singh et al. 2017).

In this research, due to no time-dependent parameter used in the ANN, the feed-forward (FF) neural network by MATLAB Version 9.9.0 (R2020b) (MATLAB 2020) was used to establish predictive models. For ANN modeling, similar to multiple regression analysis, γdry, n, and VP are constant inputs in all models, and mechanical properties including PLS, BTS, BPS, and UCS are considered outputs that are predicted in models 1 to 4, respectively. Architectures of ANN that were used in this study are presented in Fig. 15 which is composed of an input layer, hidden layer, and output layer. All data were then divided into three data sets such as training, validation, and testing, which are 70% for the training, 15% for the testing, and 15% for the validation, respectively. The Levenberg–Marquardt algorithm was applied for prediction and 10 hidden neurons were utilized in models.

Fig. 15
figure 15

Schematic representation of the constructed neural network

Figure 16a–c shows the results of ANN modeling. Figure 16a shows the training state of the ANN model 1. In model 1 after epoch number 9 the errors are happening 6 times and the test is stopped at epoch number 15. In model 2 for predicting BTS, the test is stopped at epoch number 13 and the errors happen 6 times. In model 3, which predicted BPS, after epoch number 4 the errors are happening 6 times and the test stopped at epoch number 10. Finally, in models of UCS, after epoch number 3 the test is stopped at epoch number 9. Therefore, results indicated in all model errors happen 6 times. Figure 16b shows the validation curves in ANN models for predicting the output PLS. In model 1 the best validation performance is 0.039 which happened at epoch number 9. In models 2, 3, and 4; the best validation performances are equal to 0.68, 0.043, and 40.44 which happened in epoch number 7, 4, and 3 which predicted BTS, BPS, and UCS, respectively. The values of mean squared error (MSE) that can be seen in the Y-axis of the graph of Fig. 16b, are different in ANN models, as in model 3 for predicting BPS was less than other models. Figure 16c is a cross-correlation between the predicted (output) and measured (target) data that shows a regression graph for the ANN model 1. The correlation coefficient (R) for models 1, 2, 3, and 4 are equal to R = 0.975, 0.983, 0.986, and 0.955, respectively, which shows the accuracy of the model for predicting PLS, BTS, BPS, and UCS.

Fig. 16
figure 16

a The training state for the ANN model 1 b Best validation performance in ANN model 1 and c regression graph of the ANN model 1

3.3 Adaptive Neuro-Fuzzy Inference System

Adaptive neuro-fuzzy inference system (ANFIS) is an admixture of two soft computing methods, including ANN and fuzzy logic (FL). Jang (1993) developed ANFIS based on the Takagi–Sugeno fuzzy inference system (FIS). Fuzzy logic, which was proposed by Zadeh (1965), can change the qualitative aspects of human knowledge and insights into the process of precise quantitative analysis. Unlike ANN, it has a higher capability in the learning process to adapt to its environment. Adaptive network is one example of a feed-forward neural network with multiple layers. This type of FIS was first introduced by Takagi and Sugeno (1985).

In the ANFIS modeling, the same data sets that were used in MRA and ANN were also employed. Therefore, γdry, n and VP are constant inputs, and PLS, BTS, BPS, and UCS as outputs predicted in models 1 to 4, respectively. The ANFIS models are developed by MATLAB (R2020b) (MATLAB 2020). The structure of the ANFIS models is shown in Fig. 17. The training of the ANFIS models was carried out using 50 epochs. In the training process, the dataset was randomly used. In this study, a hybrid learning algorithm that was proposed by Jang (1993), was applied for predicting UCS, and grid partition was applied to make the propositions of the fuzzy rules. Also, a linear type of Gaussian membership function (gaussmf) was used to predict the dependent mechanical properties. Finally, using the trained model, the training and testing datasets were simulated and assessed. The plot showing the relationship between the models’ inputs with the 27 rules used in predicting the UCS is shown in Fig. 18.

Fig. 17
figure 17

ANFIS model structure for predicting mechanical properties with three inputs

Fig. 18
figure 18

Fuzzy rules

4 Discussion

The predicted values of PLS, BTS, BPS, and UCS that were obtained by models of multiple regression, artificial neural network, and adaptive neuro-fuzzy inference system are presented in Table 6. Based on the comparison between predicted and experimental data that are presented in Table 2, results show obtained values of ANN and ANFIS models with very low errors approximately similar to experimental values. Whereas, obtained values from MRA models have differences from experimental data. Also, some statistical parameters including mean, standard deviation (Std. D.), median, mode, and variance are calculated for the output of MRA, ANN, and ANFIS models that are presented in Table 7. Based on the results, standard deviation, and variance values were obtained from 1.14 to 8.63, 1.59 to 10.84, 1.65 to 11.49, and 1.30 to 74.52, 2.52 to 117.60, and 2.72 to 132.00 for MRA, ANN, and ANFIS models, respectively.

Table 6 Predicted values of PLS, BTS, BPS, and UCS by MRA, ANN, and ANFIS models
Table 7 Statistical description for output of MRA, ANN, and ANFIS models

For assessing the performance of models, statistical parameters including correlation coefficient (R), root mean square error (RMSE), variance account for (VAF), mean absolute percentage error (MAPE), and performance index (PI) were calculated by the following equations:

$$R = \frac{n(\sum xy) - (\sum x)(\sum y)}{{\sqrt {\left[ {n(\sum x^{2} ) - (\sum x)^{2} } \right]\left[ {n(\sum y^{2} ) - (\sum y)^{2} } \right]} }}$$
(13)
$${\text{RMSE}} = \sqrt {\frac{{\sum\limits_{i = 1}^{N} {(y - y^{\prime}} )^{2} }}{N}}$$
(14)
$${\text{VAF}} = \left[ {1 - \frac{{{\text{var}} (y - y^{\prime})}}{{{\text{var}} (y)}}} \right] \times 100$$
(15)
$${\text{MAPE}} = \left[ {\frac{1}{N}\sum\nolimits_{i = 1}^{N} {\left| {\frac{{y - y^{\prime}}}{y}} \right|} } \right] \times 100$$
(16)
$${\text{PI}} = \left[ {R^{2} + \left( {\frac{VAF}{{100}}} \right) - {\text{RMSE}}} \right]$$
(17)

where y and y′ are the experimental and calculated values, respectively; and N is the total number of data. The model will be excellent if R = 1, RMSE = 0, VAF = 100, MAPE = 0, and the PI has the biggest value. When an RMSE value approaches zero, the predicted values from the regression equation are closer to the experimental values.

Statistical parameters for models of multiple regression, artificial neural network, and adaptive neuro-fuzzy inference system were calculated and presented in Tables 8, 9, and 10, respectively. Also, comparative diagrams of R, RMSE, VAF, MAPE, and PI for the results of MRA, ANN, and ANFIS models are presented in Fig. 19a–e. Results show R values were obtained between 0.73 and 0.85, 0.96 and 0.99, and equal to 0.99 for MRA, ANN, and ANFIS models, respectively. RMSE values were obtained between “8.04 and 0.11”, the best value dependent on model 3 of ANFIS. Based on the diagrams, the ANFIS model obtained the best results in R, RMSE, VAF, MAPE, and PI values. Also, it was seen, ANN and ANFIS models have better results than MRA with highly different.

Table 8 Performance of the models of multivariate regression analysis
Table 9 Performance of the models of artificial neural network
Table 10 Performance of the models of adaptive neuro-fuzzy inference system
Fig. 19
figure 19

Comparative diagram of a R, B) RMSE, c VAF, d MAPE, and e PI for the results of the MRA, ANN, and ANFIS models

For comparing the results of MRA, ANN, and ANFIS for predicting mechanical properties of studied rocks, experimental and predicted values of PLS, BTS, BPS, and UCS; also a 45° line (y = x) has been plotted in diagrams that are shown in Fig. 20a–d, as models 1 to 4, respectively. In this method, the adoption rate of MRA, ANN, and ANFIS trend lines with a 45° line (y = x) shows the validity of experimental equations. Based on these diagrams, the trend lines of ANN and ANFIS data completely fit the 45° line. Whereas, trend lines of MRA in models of PLS and UCS (Fig. 20a and d) slightly intercept the 45° line. Also, the trend lines of BTS and BPS models (Fig. 20b and c) are partly parallel to 45° lines. Therefore, the results of the correlation between experimental and predicted values indicated that ANN and ANFIS models presented more accurate results than MRA.

Fig. 20
figure 20

Correlations between experimental and calculated values of a PLS, b BTS, c BPS, and d UCS obtained by MRA, ANN, and ANFIS models

Another method for assessing and comparing the results of MRA, ANN, and ANFIS models, is calculating the residual error of calculated data. Residual error was calculated by subtraction of the experimental values from the calculated values for all models. For comparing the obtained value of residual error, graphs of residual error are presented in Fig. 21a–d, for PLS, BTS, BPS, and UCS models, respectively. Based on diagrams of Fig. 21, obviously can be seen trend lines of residual error for ANFIS and ANN more fit with a horizontal line (X-axis) than MRA trend lines. Also, some statistical parameters including mean, standard deviation, median, mode, and variance are calculated for obtaining values of residual error that are presented in Table 11. Based on the results, standard deviation, and variance values were obtained from 1.03 to 7.99, 0.28 to 3.45, 0.11 to 1.08, and 1.06 to 63.94, 0.08 to 11.88, and 0.01 to 1.17 for MRA, ANN, and ANFIS models, respectively. Results show standard deviation and variance were decreased from MRA to ANN and ANFIS models. Therefore, from a statistical point of view, ANFIS models presented very good results for predicting the mechanical properties of the studied rocks. Also, it noted that ANN shows better results than MRA models.

Fig. 21
figure 21

Residual error diagrams for MRA, ANN, and ANFIS models for predicting a PLS, b BTS, c BPS, and d) UCS

Table 11 Statistical description of residual error for models of MRA, ANN, and ANFIS

5 Conclusion

In this research, the north and northwest parts of Damghan, northeast of Iran are selected as the study area. Some sedimentary rock samples with different lithologies including sandstone, limestone, travertine, and conglomerate were collected from these areas. For determining geotechnical properties laboratory testing including dry density, effective porosity, point load strength, Brazilian tensile strength, block punch strength, uniaxial compressive strength, and ultrasonic P-wave velocity were performed based on recommended standard methods by the ISRM and ASTM. Results of petrography and mineralogy studies by thin sections and XRD analysis show that studied samples are dominantly composed of quartz and calcite. Results of the mechanical testing show, the studied samples were classified as weak to extremely strong rocks. Ultrasonic P-wave velocity values ranged from low to very high and were classified as weathered to fresh rocks based on the VP. The reverse power relation with R2 = 0.99 and direct linear relation with R2 = 0.98 were found between VP, pulse time travel, and P-wave modulus, respectively. To establish predictive models, γdry, ne and VP are considered as inputs, and mechanical properties including PLS, BTS, BPS, and UCS are considered as outputs. Four predictive models were established using multivariate regression analysis, artificial neural network, and adaptive neuro-fuzzy inference system. Results of MRA show the average values of the variance inflation factor and tolerance obtained, 1.18 and 0.86, respectively. The ANN models were constructed using a feed-forward neural network and trained using the Levenberg–Marquardt algorithm. The rules of ANFIS models were obtained by fuzzy if–then rules in a Takagi–Sugeno type inference system, and the Gaussian membership function is also used. Statistical parameters including R, RMSE, VAF, MAPE, and PI were calculated and compared to assess the performance of MRA, ANN, and ANFIS models. Results show R values were obtained from 0.73 to 0.85, 0.96 to 0.99, and equal to 0.99 for MRA, ANN, and ANFIS models, respectively. RMSE values were obtained between “8.04 and 0.11”. Based on the results of comparative diagrams, ANFIS models obtained the best results in R, RMSE, VAF, MAPE, and PI values. Also, it was found, ANN models have better results than MRA. The correlations between experimental and predicted values of PLS, BTS, BPS, and UCS that were obtained by MRA, ANN, and ANFIS models and compared with a 45° line (y = x) indicated that the trend lines of ANN and ANFIS models completely fit the 45° line. Whereas, trend lines of MRA slightly intercept or are partly parallel to the 45° line. Calculating residual error shows trend lines of ANFIS and ANN are fitter to the horizontal line (X-axis) than MRA trend lines. Results indicated standard deviation and variance of residual error were decreased from MRA to ANN and ANFIS models. Therefore, statistical analysis and soft computing methods presented acceptable results, but it was noted that ANN and ANFIS are more reliable methods for establishing predictive models for predicting the mechanical properties of the studied rocks. Considering that the mechanical behavior of rocks depends on the type of rock, this research has limitations. Therefore, the results of the current research are valid and feasible for sandstone, limestone, travertine, and conglomerate. In this regard, it is recommended that similar modeling for other rock types be applied.