Introduction

The assessment of rock geo-mechanical properties plays a pivotal role in understanding the behavior of subsurface formations and optimizing various engineering applications (Abdelhedi et al. 2023; Liu et al. 2023; Ameen et al. 2009). The elastic modulus represents the rigidity and resistance of the rocks to failure. This parameter is one of the important parameters for analyzing the stability of engineering structures, especially dam sites. Recent advancements in the field have witnessed a diverse array of studies employing innovative approaches to characterize rock dynamic properties (Li and Dias 2023; Bouchaala et al. 2024; Motahari et al. 2022; Pappalardo and Mineo 2022). This introduction explores key findings from prominent research articles that delve into the intricacies of rock mechanics and elastic modulus estimation. The focus is on factors affecting dynamic properties and various methodologies, including traditional experimental approaches and innovative machine learning (ML) techniques.

Zhang et al. (2023) summarized factors affecting static and dynamic properties and proposed a method for extracting static Young’s moduli (SYM) by analyzing elastic wave velocities of the various rocks. Rahman and Sarkar (2023a) explored correlations between uniaxial compressive strength and dynamic elastic properties, emphasizing diverse rock types. Shahani et al. (2022) applied machine learning (ML) models for intelligent predictions at Thar Coalfield, while Manda et al. (2023) used a machine learning approach to predict geo-mechanical properties from well logs. Additionally, Li and Dias (2023) assessed rock elasticity modulus using hybrid random forest regression (RFR) models, combining data-driven and soft techniques. Daraei and Zare (2019) predicted the static elastic modulus of limestone using downhole seismic tests, demonstrating the applicability of ML to diverse geological formations. Mahmoud et al. (2020) applied backpropagation multilayer perceptron (BPMLP), support vector regression (SVR), and Mamdani fuzzy inference system (MFIS) for SYM predictive models of sandstone formations. Khan et al. (2022) utilized multivariate statistics alongside ML models to forecast uniaxial compressive strength (UCS) and SYM (static Young's modulus). They stated that the random forest regression (RFR) has the best performance for predicting. Researchers have increasingly turned to artificial neural networks (ANNs), multivariate linear regression (MLR), and support vector regression (SVR) for their efficacy in estimating the dynamic and mechanical characteristics of rocks (Motahari et al. 2022; Mahmoud et al. 2020; Rastegarnia et al. 2021). Pappalardo and Mineo (2022) predicted static elastic modulus through regression models and BPMLP, showcasing the versatility of predictive modeling in rock mechanics. Rastegarnia et al. (2021) focused on the engineering characteristics and static properties of clay-bearing rocks, contributing valuable insights into this specific rock type. They stated that the BPMLP method has a conservative trend for forecasting SYM. Fang et al. (2023) assessed the applicability of BPMLP, SVR, ANFIS, and multiple linear regression (MLR) methods to estimate the SYM and UCS of the rock samples. Khosravi et al. (2022) contributed to the evaluation and prediction of both static and dynamic parameters in rocks, offering a holistic perspective on rock behavior. Additionally, Onaloa et al. (2018) provided a SYM model for formation evaluation, enriching the understanding of rock properties for practical applications in the petroleum industry. Guo et al. (2023) conducted a comprehensive assessment of rock geo-mechanical properties and wave velocities, providing valuable insights into the complex interplay between various rock parameters. They stated that the SVR method has a higher accuracy than ANFIS, BPMLP, and MLR models. Soustelle et al. (2023) contributed to the understanding of carbonates by investigating the relationship between static and dynamic elastic moduli, shedding light on the nuanced behavior of these formations. In the realm of carbonate rocks, Hadi and Nygaard (2023) focused on estimating UCS and SYM using petrophysical properties, offering practical applications for the petroleum industry. Abdi et al. (2023) employed the RFR for forecasting the SYM of weak rock samples, addressing challenges associated with rocks of lower strength. Ebrahimi et al. (2023) verified the use of daily drilling reports in conjunction with ML methods for the estimation of Young’s modulus, showcasing the potential for real-time applications in the oil, gas, and petrochemical industries. Kheirollahi et al. (2023) integrated traditional well data with computational techniques to enhance velocity estimations. Ghafoori et al. (2018) used statistical methods to forecast SYM in the Asmari formation, promising improved accuracy. Their results showed that the dynamic Young's modulus was 5 times larger than the static Young's modulus. Motahari et al. (2022) presented a new relationship between SYM and DYM of the feldspathic litharenite samples. They stated that by increasing the amount of quartz in the samples, the compressional and shear wave speeds increased. Also, BPMLP, SVR, and MLR methods are conservative in estimating Vs. Comparing the performance of the methods in estimating Vs showed that SVR has higher accuracy than other models. Xie et al. (2024) investigated the factors affecting shear wave velocity (Vs) of the limestone samples using Gaussian process regression (GPR), BPMLP, and MLR models. They stated that the GPR showed higher accuracy compared to the BPMLP and MLR. Table 1 summarizes some of the previous relationships to estimate the SYM.

Table 1 Some of the previous relationships to estimate the SYM

Considering the challenges in measuring the mechanical properties of rocks in civil and mining projects, this research prioritizes SYM estimation using simple and cost-effective methods. After assessing petrography, physical and mechanical properties of the sandstone samples types, simple regression (SR), MLR, and artificial intelligence methods including SVR, ANFIS, and backpropagation multilayer perceptron (BPMLP) were employed to build SYM prediction models. The models utilized mineralogy, physical properties, and dynamic Young’s modulus (DYM) from obtained sandstone samples at two large dam sites in China to enhance precision and overcome limitations in predicting SYM.

Case studies

This study investigates the engineering geological characteristics of the two large dam sites in China. Situated on the Yellow River in China, the Xiaolangdi Dam (XLD) was completed in 1995, boasting dimensions of 154 m in height and 1650 m in length. Nestled within Henan province, approximately 20 km northwest of Luoyang, the XLD serves a diverse array of functions, including electricity generation and flood control. The geological makeup of the dam site comprises sedimentary rocks, primarily sandstone, and mudstone, forming its foundational base. These rock formations significantly influence both the stability and design of the dam. Beneath the surface lie Permian and Triassic bedrock, overlaid by Quaternary deposits. The Permian formation features reddish and brown silty claystone and argillaceous siltstone, interspersed with layers of fine calcareous sandstone and medium-coarse siliceous sandstone. Similarly, the Triassic formation consists of red-brown limestone sandstone, alongside fine siliceous or calcareous sandstones and argillaceous siltstone, occurring in stratified layers. Understanding these geological characteristics is paramount for ensuring the durability and functionality of the Xiaolangdi Dam. Engineers and geologists meticulously consider these factors during the design, construction, and ongoing maintenance processes to safeguard its integrity and effectiveness for its intended purposes.

Constructed in 2006 on the Yangtze River, the Three Gorges dam (TGD) serves a multifaceted role, encompassing electricity generation, augmentation of the river's shipping capacity, and mitigation of downstream flooding risks. The dam site presents a rich geological tapestry, characterized by sandstone, shale, and limestone formations. These sedimentary rocks exhibit variances in strength, permeability, and stability. The geological profile of the TGD site encompasses a blend of sedimentary rock formations, along with considerations for seismic activity risks, challenges associated with slope stability, and strategies for sedimentation management. This study focused on the analysis of sandstone types sourced from the dam site. Figure 1 depicts the specific locations of the studied dam sites, with a predominant emphasis on sampling from the TGD. Further details regarding the sampling locations, gathered from both the dam axis and reservoir, are illustrated in Fig. 1 and Table 2. The sampling locations along the reservoir and TGD axis cover depths ranging from 0.50 to 38 m. These areas primarily comprise sandstone formations, including arenite, litharenite, and feldspathic litharenite types, predominantly sourced from the Badong formation dating back to the Jurassic period (Table 2).

Fig. 1
figure 1

Studied dams: Xiaolangdi dam cross-sectional view (a part; abbreviations: SIS: sandstone interbedded with siltstone, FS: fine sandstone, SSCM: siltstone interbedded with sandstone, conglomerate and silty mudstone, FSCS: fine sandstone interbedded with siltstone and conglomerate, S: siltstone, SC: silty claystone, SSC: fine sandstone interbedded with siltstone and silty claystone, al Q3: alluvium deposit of sand and gravel of the Pleistocene, F: fault ), and Three Gorges dam site and reservoir geological map (b part)

Table 2 Geological properties of the collected samples

To ensure precise mechanical property measurements, samples exhibiting cracks and joints were intentionally excluded. This precaution was taken due to the anisotropic behavior of these features, which can lead to stress concentration and potential measurement errors (Yu et al. 2021). Consideration of anisotropy is particularly critical in the design of rock structures, especially when dealing with metamorphic rocks like schist, which are characterized by their sheet-like structure and mechanical variability (Fang et al. 2023).

Materials and methods

Laboratory tests

In the laboratory studies, we examined 122 rock samples with a length to diameter ratio ranging from 2.5 to 3 under natural conditions. To mitigate stress concentration concerns, specimens exhibiting joints and cracks were omitted. Prior to testing, the ends of the specimens were parallelized and polished. For petrography analysis, 122 thin sections were meticulously prepared and scrutinized in the microscopic laboratory using transmitted light. Thin section identifications of the samples were conducted based on Folk (1980) instructions. X-ray diffraction (XRD) analysis employed a D8-Advance, Bruker AXS equipment via copper Kα beam (λ = 1.5406Å) with a 2θ angle range of 2 to 60 degrees, using powder of samples with 50-micron size.

The determination of the quartz ratio (Qzr) was derived from XRD results using Eq. 30, incorporating the percentages of quartz (Qz), fragment (Ft), and feldspar (Fr).

$$\text{Qzr}=\frac{\text{Qz\%}}{(\text{Qz\%}+\text{Ft\%}+\text{Fr\%})}$$
(30)

Density (D), porosity (n), Schmidt hardness number (SHN), and water uptake by weight (U) were determined using methods specified in ISRM standards (ISRM 1981). The porosity test was determined based on the saturation and immersion method. This method is used for rock samples with regular, irregular geometric shapes or rock pieces. Also, the sample should not be crisp and fragile and should not be swollen and should not disintegrate or disintegrate due to exposure to water or a greenhouse. In this test, the sample is washed with water to remove the dust on the surface of the water. The sample is placed in water and in a vacuum of less than 800 pascals for at least one hour to be saturated. During this period, the sample should be moved alternately to remove the air bubbles. Then the sample is placed in the basket and enters the water tub. In this case, the basket is hung on the scale with a wire. The saturated mass of the basket and the sample is measured with an accuracy of 0.1 g. Finally, the result of dividing the volume of pores by the total volume of the sample, which is expressed as a percentage, is called porosity.

$$n\left(\%\right)=\frac{{V}_{v}}{{V}_{t}}*100$$
(31)

In this Eq. n is porosity, Vv is volume of pores in the sample, and Vt is total volume of rock sample.

In the present investigation, we employed the N-type Schmidt hammer, featuring a tensioned spring exerting force near the specimens. This device was consistently used in a vertical orientation for all samples in this study. The obtained hardness is affected by the hammer's orientation, using upright, horizontal, or upright positions within a ± 5 degrees deviation (ISRM 1981).

Compressional (Vp) and shear (Vs) wave tests followed the ASTM D2845 standard at a frequency of 0.50MHz (ASTM 2008). The device used to measure the Vp and Vs is the Pundit Plus device made by Proceq, Switzerland. This device has a transducer with pressure and shear wave speed, frequency 1 MHz, diameter and length 50 mm, maximum force tolerance 220 kilonewtons and temperature range 0 to 70 degrees. Prior to mechanical tests, sample ends were parallelized. Uniaxial compressive strength (UCS) tests were conducted per ASTM's recommended method with a constant loading rate of 0.70MPa/S (ASTM 2002). In the UCS test, when a static load was gradually applied to the sample, the axial strain was measured using a strain gauge. Then the stress–strain curve was drawn. Various definitions of Young’s modulus are provided in Fig. 2. The secant static Young’s modulus (SYM) was determined using the stress–strain curve. The DYM was determined using Eq. 32 (Pereira et al. 2021).

$${\text{DYM = }}\frac{{{\text{D*Vs}}^{{2}} {\text{(3*Vp}}^{{2}} {\text{ - 4*Vs}}^{{2}} {)}}}{{{\text{Vp}}^{{2}} {\text{ - Vs}}^{{2}} }}$$
(32)

where, Vp, Vs, and D are in km/s and g/cm3, respectively.

Fig. 2
figure 2

Various definitions of Young’s modulus

The SVR approach

The SVR algorithm aims to identify the optimal regression function, minimizing prediction error while adhering to specified ε-tube bounds (Hussan et al. 2023; Fang et al. 2023). This method looks for a border that, in addition to separating the data of two groups, has the greatest distance to the closest data of the two groups (Kookalani and Cheng 2021; Maleki and Emami 2019). The closest data of each group to the decision boundary are called support vectors. In some problems, the model input data are not linearly separable for classification. In such cases, the support vector machine (SVM) uses a non-linear imager to transfer the data to a new and larger space (Fang et al. 2023; Dutta et al. 2024; Guo et al. 2023). Upon reviewing published articles on estimating using SVR, it was evident that among various kernel functions, the radial basis function (RBF) outperforms others in terms of efficiency (Fang et al. 2023; Khosravi et al. 2022). Consequently, we adopted this specific kernel function to estimate SYM in our research.

The ANFIS

The ANFIS, a hybrid machine learning (ML) approach merging ANNs and fuzzy logic, facilitates data-driven inference and predictions (Khajevand 2022). In this study, ANFIS based on the Gaussian membership function (GMF) and Sugeno fuzzy system was employed to estimate SYM. ANFIS-GMF incorporates fuzzy logic, managing uncertain information through linguistic variables and membership functions (Khajevand 2023a). Utilizing feedforward neural networks (FFNN), often with a hidden layer, this method maps input variables to output values, harnessing the strengths of fuzzy logic and NNs (Tashayo et al. 2020). Fuzzy logic handles linguistic rules and input-to-fuzzy set mapping, while the neural network learns fuzzy set parameters from data (Guo et al. 2023; Hasheminezhad and Sadeghi 2023; Khajevand 2023b).

The BPMLP method

The BPMLP refers to a type of ANN that uses the backpropagation algorithm for training. In a multilayer perceptron, information moves through an input layer, one or more hidden layers, and an output layer (Shamsashtiany and Ameri 2018). The BPMLP method was completely described in previous studies (Guo et al. 2023; Khajevand 2023b, 2023a). In the current research, a feed-forward back-propagation algorithm with Levenberg Marquart (LM) training algorithm was used for the BPMLP training. We utilized collected equations from previous studies to predict the number of neurons using the BPMLP method (Fang et al. 2024). Accordingly, neurons 1 to 4 were investigated. MATLAB software was employed for BPMLP implementation to estimate the SYM.

Normalizing data and evaluation metrics

In this research, for intelligent modeling, normalization of data between -1 and 1 was done using Eq. 33.

$${\text{A}}_{{\text{n}}} { = 2(}\frac{{{\text{A}}_{a} {\text{ - A}}_{{{\text{min}}{.}}} }}{{{\text{A}}_{{{\text{max}}{.}}} {\text{ - A}}_{{{\text{min}}{.}}} }}{) - 1}$$
(33)

Here, An, Aa, Amin, and Amax​ represent the normalized, actual, minimum, and maximum values for the A parameter, respectively. To evaluate the effectiveness of the methods, calculated performance metric (CPM), mean absolute percentage error (MAPE), variance accounted for (VAF), root mean squared error (RMSE), and determination coefficient were employed (Eqs. 3437). These metrics were extensively used for evaluating models (Shakir 2023; Xiao et al. 2023; Song et al. 2022).

$$CPM=R{.}^{2}-RMSE+VAF$$
(34)
$$\text{RMS}\left(\sqrt{\frac{1}{{\text{N}}_{\text{SYM}}}\sum_{\text{i}=1}^{{\text{N}}_{\text{SYM}}}{\left({\text{A}}_{\text{SYM}}-{\text{P}}_{\text{SYM}}\right)}^{2}}\right)\text{E}=$$
(35)
$${\text{VAF = }}\left[ {{1 - }\frac{{{\text{var(A}}_{{{\text{SYM}}}} {\text{ - P}}_{{{\text{SYM}}}} {)}}}{{{\text{var(A}}_{{{\text{SYM}}}} {)}}}} \right]$$
(36)
$${\text{MAPE = }}\frac{{\sum\limits_{{\text{i = 1}}}^{{{\text{N}}_{{{\text{SYM}}}} }} {\left| {{(}\frac{{{\text{A}}_{{{\text{SYM}}}} {\text{ - P}}_{{{\text{SYM}}}} }}{{{\text{A}}_{{{\text{SYM}}}} }}{)}} \right|} }}{{{\text{N}}_{{{\text{SYM}}}} }}$$
(37)

where, NSYM, ASYM, and PSYM represent the total experiments of SYM, actual and predicted SYM values, respectively.

Results and discussion

In this section, we present and compare laboratory test results of intact rock at the investigation sites, followed by the introduction of various experimental relationships and models for estimating the SYM.

One of the indirect methods of estimating rock properties at the construction site of construction projects is the use of indirect methods such as empirical relationships and intelligent models (Teshnizi et al. 2021). For this reason, in recent years, the presentation of models and relationships to estimate mechanical and dynamic properties, and destructive tests have been of interest to researchers. These methods are more important in the stages of identifying constructions and especially when sampling weak rocks (Pappalardo et al. 2022).

Analysis of laboratory results

The petrographic and geological features of the samples under examination are displayed in Table 2. Figures 3 and 4 present examples of laboratory activities, including an XRD pattern of a Feldspathic litharenite sample and thin sections from the studied samples.

Fig. 3
figure 3

An example of litharenite thin section from the TGD (a), feldspathic litharenite thin section from the XLD (b), litharenite thin section from the TGD (c), and litharenite thin section from the TGD (d): (Q, FM, FP, Ch, and Fm-m are quartz, rock fragments, feldspar, chert, and metamorphic fragments, respectively)

Fig. 4
figure 4

An example of a feldspathic litharenite XRD pattern

Table 3 presents the geo-mechanical properties of 122 samples. Through XRD analysis (evidenced by a sample in Fig. 4) and thin section examination (some images in Fig. 3), the 122 sandstone samples were classified into arenite, litharenite, and feldspathic litharenite per Folk's (1980) categorization, encompassing components like feldspar, murky minerals, quartz, albite, carbonate and metamorphic fragments, and chert. Gypsum and calcite, as types of cement, were characterized by semi-rounded to angular grains and poor to medium sorting.

Table 3 Statistical statistics of the measured properties

The examined samples (refer to Table 3) exhibit a broad spectrum of properties. SYM values range from 8.62 GPa to 35.02 GPa, while DYM values span from 29.81 GPa to 85.26 GPa. Porosity levels vary between 0.001% and 13.08%, and Schmidt hardness number (SHN) varies from 29 to 55. Density values fall in the ranges of 2.35 g/cm3 to 2.77 g/cm3. Based on the results, arenite and litharenite samples, incorporating Gypsum and clay elements, exhibit the lowest P-wave velocity, density, SHN, DYM, UCS, and SYM values. Conversely, specimens with high Qtr demonstrate the highest dynamic and static values.

The mineralogy of sandstone significantly influences its mechanical properties. For instance, quartz-rich sandstones tend to be harder and more durable, while clay-rich varieties may exhibit lower strength. The arrangement and interlocking of minerals impact factors like compressive strength, porosity, and permeability in sandstone rocks, thereby affecting their overall mechanical behavior (Diaz-Acosta et al. 2023; Liang et al. 2024). Sandstones with a higher quartz content generally have higher hardness and abrasion resistance (Motahari et al. 2022; Rastegarnia et al. 2022). Quartz is a hard and durable mineral, contributing to the overall strength of the rock. The minerals binding the sand grains together (cement) influence the rock's strength. Silica, calcite, and iron oxides are common cementing materials (Etemadi et al. 2020). Silica cements often result in harder and more competent sandstones. The presence of clay minerals can affect the cohesion and porosity of sandstone (Tofighkhah et al. 2023). Higher clay content may reduce the overall strength of the rock and increase its susceptibility to weathering (Bagherzadeh Khalkhali et al. 2019).

The arrangement of minerals influences the porosity of sandstone. Well-sorted, tightly packed grains generally lead to lower porosity and higher strength, while poorly sorted or loosely packed grains can result in higher porosity and lower strength. The size and sorting of minerals within the rock impact its permeability and strength (He et al. 2021; Li et al. 2023). Well-sorted and fine-grained sandstones often exhibit higher shear strength. The mechanical properties of sandstone are intricately linked to its mineral composition, with minerals like gypsum and clay playing pivotal roles in shaping the rock's behavior. The inclusion of gypsum may render the rock less robust, and its solubility in water can contribute to increased porosity over time, affecting both strength and durability. In contrast, the presence of clay minerals introduces a different set of influences on the mechanical properties of sandstone. Clay minerals, such as kaolinite, illite, or smectite, bring cohesion to the rock, impacting its strength (Kafash Bazari 2023). The plasticity inherent in clay minerals also plays a role in shaping the deformation behavior of the sandstone. Moreover, these minerals contribute to porosity and permeability, affecting fluid flow and storage properties within the rock (Sharifi et al. 2023; Motahari et al. 2022; Khajevand 2023c).

The interaction between gypsum, clay minerals, and the cementing materials in sandstone is a dynamic process that further modulates the mechanical characteristics of the rock. Gypsum's susceptibility to dissolution in water and the potential for certain clay minerals to expand or contract with changes in moisture can lead to long-term changes in the sandstone's strength and stability (Ghavami and Rajabi 2021; Chen et al. 2023; Taheri and Ziad 2021; Rahman and Sarkar 2023b). Figure 5 presents box plots and normal curves illustrating the variables. The data exhibit normal distributions, supporting statistical analysis. Moreover, with a dataset exceeding 25 data points (122 samples), statistical analysis and Pearson correlation can be applied across the entire dataset, assuming a normal distribution.

Fig. 5
figure 5

Box plots and normal curves of the variables

Effects of failure modes and sandstone types on mechanical properties

Efforts have been undertaken to categorize various failure patterns observed in cylindrical samples. Figure 6 illustrates the different types of failure modes observed during UCS tests (Basu et al. 2013). Analyzing these failure patterns offers insights into the predominant stress states within the rock samples under examination. Tensile fractures typically form perpendicular to the direction of the minimum principal stress, while aligning with the direction of the average and maximum stresses (Basu et al. 2013; Bouchaala et al. 2023). However, the presence of microscopic discontinuities introduces a scale effect, wherein the strength decreases as sample dimensions increase. In larger samples, discontinuities have a more pronounced impact on strength, leading to predominantly shear-type fracture patterns (Basu et al. 2013).

Fig. 6
figure 6

Fracture pattern under UCS test (Basu et al. 2013)

In feldspathic litharenite samples, the prevalent fracture pattern observed in most cases is axial fracturing (AF) type, as depicted in Fig. 7. Conversely, the Y-shaped fracture (YSF) pattern is the least common among the samples examined. Similarly, the majority of fracturing patterns observed in both arenites and litharenites are of the AF type. The wide variability in compressive strength from axial torque contributes to the diverse failure patterns observed under axial pressure (Basu et al. 2013). Consequently, analyzing these failure patterns can elucidate the underlying reasons for such broad variations. As the resistance of the samples increases, transitioning from arenites to feldspathic litharenites, there is an escalation in multiple fracturing (MF) behaviors while the occurrence of axial fracture (AF) decreases. In essence, samples with lower strength exhibit a heightened tendency towards AF as the predominant failure mode.

Fig. 7
figure 7

Fracture pattern observed for each sandstone types (values are in percent): Arenite (a), Litharenite (b), and Feldspathic litharenite (c)

The comparison of the characteristics of arenite and feldspathic litharenite and litharenite samples shows that the highest values of the average dynamic properties are related to feldspathic litharenite samples. Also, the highest value of density and compressive strength is related to litharenite feldspathic. The high specific weight of the feldspathic litharenite type is related to the high amounts of opaque heavy minerals in this type. Feldspathic litharenite type sandstones showed the highest average values of static and dynamic Young's modulus. The low resistance of arenite and litharenite types is mainly due to the presence of sulfate cement, clay minerals, high porosity, and carbonate fragments in this type. For example, the results related to the physical and mechanical parameters of samples 1 to 27 of each sandstone type are presented in Fig. 8.

Fig. 8
figure 8

Comparing physical and mechanical properties of the litharenite (L) and feldspathic litharenite (FL) for samples 1–27

The quartz content in sandstone is a key determinant of the rock's durability and its resistance to weathering processes (Wang et al. 2024). Moreover, the type and amount of cementing minerals, such as silica, calcite, or iron oxides, in sandstone play a crucial role in determining the rock's strength. A comprehensive understanding of the mineral composition of sandstone is vital for assessing their engineering geological characteristics. These insights into the role of calcite, clay minerals, and other components contribute to informed decision-making in construction and various engineering applications where these rocks are utilized (Rastegarnia et al. 2020).

Correlation matrix between inputs and SYM

To explore the influence of geo-mechanical parameters on the static Young’s modulus (SYM), we analyzed the correlation matrix and the two-variable regression models (as shown in Tables 4, and Figs. 9 and 10). The results illustrated the potential for estimating SYM based on various factors including DYM, water uptake (U), porosity(n), quartz ratio (Qzr), and SHN as indicated in Fig. 9, due to their correlation coefficients exceeding 0.84. Notably, water uptake exhibited the least impact on SYM. Following this, both simple and multivariate regression and machine learning (ML) techniques were employed to scrutinize parameter effects and determine the most suitable equation for SYM estimation.

Table 4 Regression equations between static Young’s modulus and inputs
Fig. 9
figure 9

Correlation matrix of the variables

Fig. 10
figure 10

Schematic view of the relationships between dynamic and static Young’s modulus: All fits (a) and the best fit (b)

Figure 10 shows a high correlation between the dynamic and static Young's modulus so that it is possible to estimate the static modulus based on the dynamic modulus. Based on the results of the current study, SYM is 2.5 times greater than DYM. The static and dynamic elastic modulus, which describe a material's stiffness, are influenced by various factors (Ghafoori et al. 2018; Li and Fan 2024; Zhang et al. 2023):

  • Material composition: different rock types have different stiffness due to their atomic and molecular arrangements.

  • Crystal structure: crystalline samples’ elastic modulus depends on their crystal lattice arrangement.

  • Temperature: elastic modulus decreases as temperature rises due to increased thermal vibrations.

  • Strain rate: higher strain rates often lead to stiffer rock responses.

  • Microstructure: features like grain size affect modulus; finer grains typically result in higher stiffness.

  • Presence of defects: cavities and defects like vacancies and dislocations can alter modulus by hindering or facilitating atomic movement.

  • Moisture content: moisture absorption can change internal structure and stiffness.

  • Applied stress/strain: material moduli can vary with the type and direction of applied stress or strain.

  • Loading history: past mechanical loading affects modulus, with cyclic loading or creep causing microstructural changes.

  • Frequency: In dynamic testing, frequency affects moduli due to viscoelastic effects.

Understanding these factors is essential for predicting and controlling rock behavior in various applications, using experimental and computational methods for characterization.

Evaluating previous empirical Eqs.

Table 1 outlines various relationships proposed by prior researchers for predicting SYM. Utilizing these relationships, SYM values were calculated for each sample, and the laboratory test outcomes were then compared with the estimated values derived from these empirical relationships. Evaluation results for the previous relationships are depicted in Figs. 11 and 12. Multiple metrics were employed to evaluate the predictive accuracy of the earlier relationships. In terms of the correlation coefficient, most relationships demonstrated satisfactory accuracy in SYM estimation. Notably, according to the CPM, among the examined background equations, the one proposed by Pereira et al (2021) based on Vp showcased superior precision in predicting SYM compared to other equations from the research background (Fig. 12). Despite having a high correlation coefficient, the CPM values of all previous relationships are negative, indicating low accuracy for SYM estimation (Fig. 12). Additionally, the values derived from most previous relations deviated significantly from the diametric line, resulting in their mean values differing substantially from the mean results of this study. Consequently, the development of local models for each area is deemed necessary (Li and Dias 2023). Thus, we explored the suitability of various modeling techniques for SYM estimation.

Fig. 11
figure 11

Correlation between measured and predicted SYM

Fig. 12
figure 12

Performance of previous proposed Eqs. to estimate SYM

MLR Analysis

The MLR analysis was conducted utilizing Minitab software (Version 20) to construct predictive equations for forecasting SYM, as shown in Table 5. Model accuracy was assessed through analysis of variance (ANOVA), with findings detailed in Tables 5 and 6, demonstrating significance below the 5% threshold and confirming the equation's efficacy in estimating SYM. The t-test was also employed to evaluate constant coefficients, with specific instances presented in Tables 5 and 6. The results show that, statistically, the variables of water uptake (U) and porosity (n) do not have much effect on the static modulus. Because the level of confidence in the t-test is more than 5%. These two variables were removed to check their effect on SYM. Removing these two variables has a very slight effect on the relationship determination coefficient (Table 5). Therefore, SYM estimation with three inputs of Schmidt number (SHN), dynamic modulus (DYM), and quartz ratio (Qzr) is more economical and preferred. In this way, these three variables were also considered as inputs in ML methods to estimate SYM. An example of coefficients and results of analysis of variance and t-test is presented in Table 6. These tests were extensively used in the literature to validate models and empirical relationships (Zhao et al. 2024; AlHamad et al. 2021; Shirnezhad et al. 2021).

Table 5 Results of MLR to estimate SYM
Table 6 Features of the ideal MLR model (i.e., Eq. 46 presented in Table 5)

The ANFIS model results

Utilizing MATLAB software, we developed an ANFIS-GMF model incorporating a Sugeno system. The ANFIS-GMF analysis utilized Gaussian membership functions distributed across rules, with linear membership functions employed for the SYM output layer. Through trial-and-error testing, appropriate membership degrees for input combinations were determined. To train the fuzzy inference system (FIS), we employed a hybrid learning algorithm that combines backpropagation with the least squares method. The developed ANFIS-GMF summarizes that the FIS type is GENFIS2 with input membership function type Gaussian and output membership function type linear. There are 4 membership functions (MFs) for the input, distributed across 4 rules of the Sugeno type (Fig. 13). The error goal is set at 0.00, with training continuing for 180 epochs. The influence radius is specified as 0.53. Figure 14 shows the developed ANFIS performance in SYM prediction.

Fig. 13
figure 13

Rule diagram using ANFIS method in SYM prediction

Fig. 14
figure 14

The ANFIS performance in SYM prediction

The BPMLP results

In this study, a BPMLP was utilized with data normalized to the range of [-1,1] like other ML models. The input layer consisted of 4 neurons, corresponding to the independent variables, while the output layer featured a single SYM output. To address the complexity of the problem, we opted for a single hidden layer with minimal neurons, as suggested by previous research (Shahani et al. 2022; Alsalami et al. 2023). The dataset was split into training (93 cases) and testing (31 cases), with training determining weights and testing evaluating model performance. Relationships based on the number of dependent and independent variables have been presented by previous researchers to determine the number of neurons to be examined. Following equations proposed by various researchers (Fang et al. 2024), we examined neurons 1 to 4 to identify the best neuron for predicting SYM using BPMLP (Table 7). We applied a Sigmoid transfer function between the input and hidden layers and a linear transfer function between the hidden and output layers. The BPMLP training utilized the LM algorithm. Results revealed the highest correctness in approximating SYM for the third neuron with 3 inputs and LM training algorithm, as detailed in Table 7. The LM algorithm's adaptive learning rate adjustment capability proved advantageous in technical and engineering problems (Rastegarnia et al. 2021; AlHamad et al. 2022). Figure 15 shows the results of the ideal BPMLP model in SYM prediction.

Table 7 Results of neuron evaluation using the BPMLP model
Fig. 15
figure 15

The BPMLP performance in SYM prediction

The SVR results

SVR modeling results are summarized according to Table 8:

Table 8 Results of the SVR method to estimate SYM
Fig. 16
figure 16

The SVR performance in SYM prediction

Factors affecting SVR are summarized below (Joseph and Swalih 2023; Fang et al. 2023).

  • Curse of dimensionality: As the number of input variables increases, the feature space expands, leading to sparser data points in high-dimensional spaces. This makes it difficult for SVR to find an optimal hyperplane that effectively separates classes.

  • Overfitting: introducing more input variables can result in overfitting, where the model becomes overly complex and captures noise from the training data instead of underlying patterns. Consequently, the model may struggle to generalize well to unseen data.

  • Increased computational complexity: A higher number of input variables can escalate the computational requirements for SVR training and prediction. This can lead to longer processing times and resource-intensive operations, impeding practical application.

  • Difficulty in kernel selection: SVR often relies on kernel functions to map data into higher-dimensional spaces. With an increasing number of input variables, selecting an appropriate kernel and fine-tuning its parameters becomes more challenging, impacting the model's overall performance.

  • To tackle these challenges, it is crucial to employ feature selection or dimensionality reduction techniques. These methods help identify and retain only the most relevant input variables, mitigating overfitting, alleviating the curse of dimensionality, and enhancing the accuracy of the SVR model.

Method comparisons and discussions

In our evaluation of various models, we examined different statistics, as illustrated in Table 9. Notably, SVR demonstrated the highest accuracy, characterized by CPM and %MAPE values in forecasting SYM. All methods displayed high R2 values, typically around 0.95 or 0.98, indicating a robust ability to explain variance in SYM prediction. Based on the results, the performance of the SVR is better than the BPMLP and other models (Table 9). BPMLP uses the risk minimization method (ERM) but SVR uses the structural risk minimization method. Also, the precision of these approaches is influenced by the number of inputs, the number of samples, and the choice of training algorithm (Fang et al. 2024; Gautam et al. 2023). Khajevand (2023b) evaluated MLR, ANFIS, and BPMLP in predicting UCS of various rock samples. The outcomes showed that the ANFIS produced the most precise results. Khan et al. (2022) stated that the RFR outperforms KNN (K-nearest neighbors), MLR, and BPMLP in guessing rock UCS. Fang et al. (2024) demonstrated that the SVR provides the most correct outcomes for forecasting rock tensile strength among BPMLP, MLR, RFR, and CART.

Table 9 Results of statistical criteria to estimate SYM based on all data sets

Each of the methods successfully displayed the correlations between input parameters and their impacts on SYM, showcasing strong generalization abilities for new data. The average values of the static modulus using intelligent methods were found to be equal to 22.46GPa, which is about 0.04% different from the experimental value (22.47GPa). This discrepancy is less than 1% and indicates the efficiency of the methods used in estimating Young's modulus.

Conclusions

In this research, after evaluating petrography, physical and mechanical characteristics of the sandstone samples, intelligent and statistical analyses including SVR, ANFIS, MLR, and BPMLP were employed to estimate the SYM (static Young’s modulus) of different types of sandstones found at the Three Gorges and Xiaolangdi dam sites. Petrography studies classified samples into arenite, feldspathic litharenite, and litharenite categories. The distinct sandstone types between the two dam sites resulted in variations in their geo-mechanical properties. Samples of feldspathic litharenite at the dam sites exhibited the highest strength, attributed to the influence of feldspar, while litharenite and arenite types showed variability in strength based on their mineral compositions. Analysis of fracture mode patterns revealed that samples with low strength values (arenite and litharenite) predominantly exhibited axial fractures, transitioning to multiple fracture types in feldspathic litharenite samples as UCS and SYM increased. Linear models demonstrated the highest correlation coefficients of inputs with SYM estimation. The ratio of quartz and water uptake showed the highest and lowest impacts on SYM, respectively.

The SVR model outperformed other methods in terms of accuracy (R2 = 0.98, RMSE = 0.11GPa, CPM =  + 1.84) to estimate SYM. Notably, intelligent models demonstrated no overfitting, with test data correctness equal to or greater than training data. The average difference between the values of static modulus using ML methods and its experimental value is about -0.04%, which is less than 1% and indicates the efficiency of the methods used in estimating static Young's modulus.

In summary, the study highlights effective approaches for selecting the most suitable machine learning methods for accurately predicting rock SYM, particularly beneficial in the site investigation stages of civil engineering projects. Testing on larger datasets and data from diverse sites and lithologies is recommended for enhanced prediction accuracy in future models.