Introduction

Mining may create substantial surface subsiding, leading to landslides, collapse, and other disasters (Fernandez et al. 2020; Kwinta and Gradka 2020; Salmi et al. 2017; Tajdus et al. 2018). Therefore, mining subsiding is a serious threat to the ecosystems of mining regions. Many scholars believe that accurate prediction of mining subsidence is beneficial for providing safe and efficient guidance in underground coal mining and scientifically evaluating the mining-induced environmental damage (Liu et al. 2022; Qin et al. 2023; Wang et al. 2022; Wei et al. 2023; Xu et al. 2023).

Currently, there are many kinds of prediction methods for mining subsidence. (1) Scholars regard bedrock and alluvium as two different media. A predictive model for mining subsidence is established by combining two separate models: an overlying strata movement model developed based on mechanics principles, and an alluvium movement model developed based on stochastic media theory. (Guo et al. 2020; Ma et al. 2021; Qin et al. 2023; Wang et al. 2022; Yang and Luo 2021). (2) Simulation method. Academics constructed finite element or discrete element numerical models according to stratigraphic information. Then, a surface subsidence prediction can be implemented by simulating coal mining (Alam et al. 2022; Gong et al. 2022; Jirankova et al. 2020; Liu et al. 2022). (3) The third method uses neural networks, fuzzy mathematics, and other theories to forecast subsiding (Alaee et al. 2019; Kumar et al. 2022). (4) The fourth method is the profile function approach. This method obtains a specific mathematical function by fitting the surface subsidence curve of a mining area and then using this function to predict surface subsidence within the mining area (Haciosmanoglu 2004; Saeidi et al. 2022). One drawback of this method is its limitation in predicting two-dimensional subsidence profiles rather than the entire three-dimensional subsidence basin (Saeidi et al. 2022). (5) The fifth method is the influence function method (IFM). This approach relies on theoretical study or other ways to calculate the subsidence caused by mining basic mining units (represented by influence functions). Then, the subsiding of a particular place is considered to be the summation of the subsidence caused by the mining of basic mining units, according to the superposition principle. (Kratzsch and Fleming 1983; Malinowska et al. 2020; Saeidi et al. 2022). Compared to other methods, IFM has numerous advantages. This approach offers high computational efficiency (Malinowska et al. 2020). The IFM allows for rapid prediction of surface subsidence at any point within the gob using computer programs (Deng et al. 2014). Furthermore, compared with the mechanics method, this method requires fewer model parameters, and acquiring these parameters is relatively straightforward. The predictive parameters of the IFM can be obtained by inverting surface movement observation data (Chi et al. 2021). Many mining areas in China have abundant surface movement observation data, which provides convenient conditions for obtaining predictive parameters of the IFM. Therefore, this study will employ the theory of influence functions to establish the prediction model.

In China, the probability integration method (PIM) is the most commonly used method for predicting mining subsidence (Wei et al. 2023). This method also is a specific application of the IFM. According to the definition of PIM (Deng et al. 2014), mining with infinitesimal mining thickness and width is defined as element extraction. The subsidence basin at the surface induced by the element extraction is called the element basin. The mathematical expression for the element basin is called the element subsidence function. The element subsidence function of PIM follows a normal distribution. Then, using the element subsidence function as the influence function, according to the superposition principle, the PIM can be built using mathematical derivation. In other words, PIM is a prediction model developed based on the normal distribution function. Although PIM is widely used, it does not achieve satisfactory prediction results in some mining areas, especially those with thick alluvium (Jiang et al. 2020, 2022; Wei et al. 2023).

Compared to thin alluvium mining areas, thick alluvium mining areas exhibit larger surface subsidence areas and smaller limit angles. At the edges of subsidence basins, predicted surface subsidence values from PIM are significantly lower than actual measured subsidence values (Jiang et al. 2020, 2022). This phenomenon is referred to as the long-tailed distribution characteristic of surface subsidence. However, PIM is a prediction method developed based on the assumption of a normal distribution, which does not exhibit a long-tailed distribution. As a result, the predictive accuracy of PIM is lower at the periphery of the subsidence basin (Wei et al. 2023). Therefore, some scholars have also developed subsidence prediction models based on other function models to improve the accuracy of subsidence prediction (Chi et al. 2021; Wang et al. 2013; Zhang et al. 2022).

The aforementioned research indicates that prediction models developed based on the assumption of a normal distribution may not be the optimal choice for certain mining areas. To enhance the accuracy of subsidence prediction, this study attempted to extend the PIM to a prediction model for mining subsidence based on an unskewed continuous probability distribution over an infinite interval. This subsidence prediction model is named CPM. CPM is an extension of PIM, sharing the same fundamental principles and application scope. CPM relates the probability distribution function to the subsidence prediction. Unlike other prediction methods, it does not have a specific function expression. Instead, engineers are allowed to screen some appropriate probability distribution functions to incorporate into this model and establish subsidence prediction equations. The objective of this study is to enhance the accuracy of subsidence prediction. By not solely considering one prediction method developed based on a specific function, it avoids limiting the improvement of prediction accuracy.

Methods

CPM

The element subsidence function of the PIM can be expressed by Eq. (1).

$$ \left\{ {\begin{array}{*{20}c} {w_{pime} \left( x \right) = 1/r\;\exp \left( { - \pi x^{2} /r^{2} } \right)} \\ {r = H_{0} /\tan \beta } \\ \end{array} } \right. $$
(1)

where x indicates the position of a certain point on the ground (m), r indicates the main influence radius (m), and H0 indicates the average mining depth (m); tanβ indicates the tangent of main influence angle, which is a parameter of PIM.

The relationship between this element subsidence function and the probability density function (PDF) fN of normal distribution is shown in Eq. (2).

$$ \left\{ {\begin{array}{*{20}c} {f_{N} \left( {x,\mu_{n} ,\sigma_{n} } \right) = 1/\sqrt {2\pi } \sigma_{n} \exp \left( { - \left( {x - \mu_{n} } \right)^{2} /2\sigma_{n}^{2} } \right) \sim N\left( {\mu_{n} ,\sigma_{n}^{2} } \right)} \\ {w_{pime} \left( x \right)\sim N\left( {0,\left( {r/\sqrt {2\pi } } \right)^{2} } \right)} \\ \end{array} } \right. $$
(2)

where μn indicates the location parameter of the normal distribution, and σn indicates the scale parameter.

When \(\mu_{n} = 0,{ }\sigma_{n} = {\raise0.7ex\hbox{$r$} \!\mathord{\left/ {\vphantom {r {\sqrt {2\pi } }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\sqrt {2\pi } }$}}\), the PDF of the normal distribution is equivalent to the element subsidence function of PIM. The so-called "main influence radius r" is also essentially the scale parameter of a probability distribution. Therefore, the PIM has a close connection to the normal distribution. The normal distribution has the following two properties: (1) The independent variable is a single continuous random variable; (2) The interval of the independent variable is infinite. (3) The PDF presents an unskewed symmetrical distribution. We define this class of probability distributions as unskewed continuous probability distributions over an infinite interval. This section will extend the PIM to a prediction model of mining subsidence based on an unskewed continuous probability distribution over an infinite interval. This subsidence prediction model is named CPM.

PIM has two application conditions (Deng et al. 2014).

(1) It is appropriate for forecasting ground subsiding with continuous distribution characteristics. (2) It can be used to predict the surface subsidence law with non-skewness characteristics. In mining areas with flat terrain, surface subsidence patterns of horizontal or gently inclined coal seam extraction generally present non-skewness characteristics. Thus, it is appropriate for forecasting ground subsiding induced by extracting horizontal coal or gently inclined seams (the dip angle is less than 15°).

The CPM proposed in this study is an extension of the PIM. Therefore, CPM has the same scope of application as PIM. The CPM is not a generalized model, and its applicability is limited. In addition, the fundamental principles of PIM and CPM are influence function theory. The basic principles of this theory are as follows (Deng et al. 2014; Guo and Chai 2013).

The IFM divides the panel into multiple infinitesimally small mining units and calculates the impact of each unit’s mining on surface subsidence. This impact is typically expressed using a function called the influence function. The influence function used in the CPM is an unskewed continuous probability distribution over an infinite interval. The surface subsidence caused by large-scale underground mining can be considered as the linear superposition of subsidence caused by all the small mining units. This superposition process can be seen as an integration process in probability theory. Therefore, the theoretical foundation of the IFM is based on the principle of linear superposition.

An element subsidence function

According to the definition of the unskewed continuous probability distribution over an infinite interval, its PDF f(x, μ, σ) and its cumulative distribution function (CDF) F(x, μ, σ) have the following properties.

$$ \left\{ {\begin{array}{*{20}c} {f \ge 0} \\ {\mathop \smallint \limits_{ - \infty }^{ + \infty } f\left( {x,\mu ,\sigma } \right)dx = 1} \\ {\left\{ {\begin{array}{*{20}c} {F\left( {x, \mu , \sigma } \right) = \mathop \smallint \limits_{ - \infty }^{x} f\left( {t, \mu , \sigma } \right)dt} \\ {F\left( {\mu , \mu , \sigma } \right) = 0.5} \\ \end{array} } \right.} \\ \end{array} } \right. $$
(3)

When μ = 0, f(x, 0, σ) is the element subsidence function of CPM. In Eq. (3), σ is defined as the main influence radius. The connection between the PIM’s main influence radius r and the CPM’s main influence radius σ is:\(r/\sigma = \sqrt {2\pi }\). σ is calculated according to the following formula.

$$ \sigma = H_{0} /\tan \gamma $$
(4)

where tanγ indicates the tangent of main influence angle, which is a parameter of CPM.

Although there are various element subsidence functions, the curve shape of element subsidence functions generally presents the distribution characteristics of low at both ends and high at the middle (see Fig. 1), called a bell curve. Therefore, when screening the element subsidence function, engineers should screen the PDF whose morphological characteristic is a bell curve. Furthermore, this paper only discusses the forecasting model for mining subsidence developed based on an unskewed continuous probability distribution over an infinite interval. Thus, the PDF chosen by the researchers must present an unskewed symmetric distribution (see Fig. 1).

Fig. 1
figure 1

Morphology of element subsidence function (take the PDF fN(x,0,10) of the normal distribution as an example)

A prediction equation of subsidence on a major section for semi-infinite extraction of horizontal coal seams

In a surface subsidence basin, the vertical section along the strike or dip of the coal seam through the maximum subsidence point on the surface is called the major section. The horizontal coal seam’s extraction depth is H0 (see Fig. 2). When the horizontal coal seam on one side of the open-off cut has not been extracted, and the coal seam in other directions has been fully extracted, this mining condition is named as semi-infinite mining of horizontal coal seam. This section will derive a prediction equation of subsidence on the major section for the semi-infinite extraction of horizontal coal seams. The derivation process is similar to that of PIM. First, the coordinate system for the surface needed to be established (see Fig. 2). Point O above the boundary of the working face is used as the origin of the x-axis. The x-axis points toward the gob. W(x) is the vertical axis, expressed as a given point’s subsidence. In addition, a coordinate system for the coal seam needed to be established (see Fig. 2). The O1 of the coal seam roof is the origin of the s-axis. The s-axis points to the gob. Both axes have the same scale. The maximum sinking w0 is not equal to the mining height m due to the swellability of broken rock—w0 = mq, where q indicates the subsidence factor, which is a CPM parameter. Thus, the mining height can be considered as mq. Referring to the derivation process of the PIM (Deng et al. 2014), the derivation process of the prediction equation of subsidence on a major section for semi-infinite extraction of horizontal coal seams was as follows.

Fig. 2
figure 2

Subsidence on the major section for semi-infinite extraction of horizontal coal seams

As shown in Fig. 2, the coordinate of any point A on the surface is x, and the abscissa of the extraction element is s. The sinking value on point A induced by the element extraction is given by:

$$ w_{e} \left( {x - s} \right) = f\left( {x - s,0,\,\,H_{0} /\tan \gamma } \right) $$
(5)

The sum of point A subsiding caused by the mining of each element in the range of s = 0 ~  + ∞ was given by:

$$ W\left( x \right) = \int\limits_{0}^{{ + \infty }} {mqf\left( {x - s,0,{\mkern 1mu} {\mkern 1mu} H_{0} /\tan \gamma } \right)ds} $$
(6)

Set xs = t. When s = 0, t = x; when s =  + ∞, t = -∞; d(x-s) = -dt, therefore:

$$W\left( x \right) = - mq\int\limits_{x}^{{ - \infty }} {\int_{{}}^{{}} {f\left( {t,0,{\mkern 1mu} {\mkern 1mu} H_{0} /\tan \gamma } \right)dt} = mq\int\limits_{{ - \infty }}^{x} {f\left( {t,0,{\mkern 1mu} {\mkern 1mu} H_{0} /\tan \gamma } \right)dt} } $$
(7)

From Eq. (3), we can get the following.

$$ W\left( x \right) = mq\int\limits_{{ - \infty }}^{x} {f\left( {t,0,{\mkern 1mu} {\mkern 1mu} H_{0} /\tan \gamma } \right)dt = mqF\left( {x,0,{\mkern 1mu} H_{0} /\tan \gamma } \right)} $$
(8)

Equation (8) is the prediction equation of subsidence on a major section for semi-infinite extraction of horizontal coal seams. This equation is expressed as the product of the maximum subsidence value mq and the CDF F(•).

A prediction equation of subsidence on a major section for finite extraction of gently inclined seams

As shown in Fig. 3 (a) and (b), when all coal along the strike (or the dip of the coal seam) has been extracted, but the mining degree along the dip (or along the strike) is still subcritical mining, this mining condition is defined as finite extraction of the coal seam.

Fig. 3
figure 3

Definition of finite extraction: a Finite extraction along the strike of the coal seam, b Finite extraction along the dip of the coal seam, c Distribution positions of four kinds of deviations of inflection point

Before deriving the relevant equations, the term “the inflection point” was introduced. It is the cut-off point of the concave-convex change on the sinking curve, i.e., the point where the curvature is 0 (Deng et al. 2014). According to Eqs. (3) and (8), if the element subsidence function of a mining area follows an unskewed symmetrical distribution, the inflection point of the subsidence curve on a major section for semi-infinite extraction of horizontal coal seams will be located directly above the gob boundary. In fact, the inflection point will be offset to the left or right along the direction of the coal seam, influenced by the cantilever roof or the adjacent gob. Define this offset distance as the deviation of inflection point, which is the parameter of the CPM. When calculating ground subsidence, the offsetting of the gob boundary can be used to achieve the offsetting of the inflection point, referring to the derivation process for PIM. Thus, the deviation of inflection point can be regarded as the distance that the gob’s computational boundary deviates from the real boundary (see Fig. 3 (a) and (b)). If this parameter is positive, it means that the computational boundary is on one side of the gob. At this point, the computational length of the gob will be shortened. If this parameter takes a negative value, the computational boundary is on the coal pillar’s side. At this point, the gob’s computational length will be enlarged. If this parameter is 0, the computational boundary of the gob coincides with the actual boundary of the gob. The deviation of inflection point is generically positive or zero if no other gobs are around the gob. Otherwise, it is negative. As shown in Fig. 3 (c), according to the different positions, the deviation of inflection point is divided into the deviation of inflection point along the dip side of the coal seam (s1), the deviation of inflection point along the rising side of the coal seam (s2), the deviation of inflection point along the left side (sl), and the deviation of inflection point along the right side (sr). If the coal seam is horizontal, the inflection point after offset will be directly above the gob boundary after offset (computational boundary of gob).

This section will first derive a prediction equation for subsidence on a major section along the dip of a coal seam in the case of a finite extraction of a gently inclined seam.

As shown in Fig. 4 (a), A and B are the actual boundaries of the gob, and C1 and D1 are the computational boundaries of the gob. s1 and s2 are the deviations of inflection point along the dip and rising sides of the coal seam, respectively. When the coal seam and overlying strata are inclined, mining influence spreads at an angle to the surface rather than vertically upward. Due to the inclination of the coal seam and overlying strata, the inflection point will not be located directly above the computational boundary of the gob, but will be offset toward the dip direction of the coal seam (Deng et al. 2014). The inflection points after the offset are located at O1 and O2, respectively. In addition, the maximum subsidence point is not located at G1 directly above the center O of the gob, and will be offset toward the dip of the coal seam (see Fig. 4 (a)). G2 is the maximum sinking point after the offset. As shown in Fig. 4 (a), the angle on one side of the dip direction of the coal seam between the line connecting the inflection point after offset and the calculation boundary on the same side and the horizontal line is defined as the influence transference angle θ. θ is a parameter of the CPM. The length of G1G2 is H0cotθ.

Fig. 4
figure 4

Computational principles of subsidence on a major section along the dip of the coal seam for finite extraction of gently inclined seams: a Create a coordinate system, b Relationship between length and height along gently inclined seams and length and height along horizontal coal seams

According to the calculation principle of the PIM (Deng et al. 2014), the calculation of surface subsidence induced by finite extraction of gently inclined seams is essentially the difference between the surface subsidence induced by semi-infinite extraction of two horizontal coal seams. As seen in Fig. 4 (a), the surface subsidence caused by finite extraction with the computational boundary C1D1 (equivalent to the actual boundary AB) is equal to the difference between the ground subsiding induced by semi-infinite extraction of horizontal coal seams DE and CF.

As shown in Fig. 4 (b), after a gently inclined seam is naturalized into a horizontal coal seam, the mining height of the coal seam is mqcosα. As seen in Fig. 4 (b), according to the law of sine, the relationship between the length AA along the inclination direction of the coal seam and the length BB along the horizontal direction is:

$$ AA/\sin \theta = BB/\sin \left( {180 - \alpha - \theta } \right) = BB/\sin \left( {\alpha + \theta } \right) $$
(9)

Thus:

$$ BB = AA\sin \left( {\alpha + \theta } \right)/\sin \theta $$
(10)

According to Eq. (10), the gob length L, s1, and s2 along the horizontal direction are Lsin(α + θ)/sinθ, s1sin(α + θ)/sinθ and s2sin(α + θ)/sinθ.

As shown in Fig. 4 (a), a surface coordinate system was established. Taking the G1 point just above the center of the gob as the origin, the y-axis points to the left (the y-axis generally points toward the dip direction). The coordinate system’s origins for two horizontal coal seams are u1 and v1, respectively. They lie on the same vertical line as the surface coordinate system’s origin. In addition, the coordinate systems u and v of the coal seams have the same orientation and scale as the surface coordinate system. The extraction range of horizontal coal seam DE is u ∈ ((-L/2 + s2)sin(α + θ)/sinθ + H0cotθ, + ∞). The extraction range of horizontal coal seam CF is v ∈ ((L/2−s1)sin(α + θ)/sinθ + H0cotθ, + ∞).

The ground sinking W0(y) induced by finite extraction with the computational boundary C1D1 is equal to the difference between the surface subsidence induced by semi-infinite extraction for horizontal coal seams DE and CF; the relevant equations were derived as follows:

$$ \begin{gathered} W_{\left( y \right)}^{0} = \int_{{H_{0} \cot \theta + \,\left( {{\raise0.7ex\hbox{${ - L}$} \!\mathord{\left/ {\vphantom {{ - L} 2}}\right.\kern-0pt} \!\lower0.7ex\hbox{$2$}} - s_{1} } \right)\sin \left( {\alpha + \theta } \right)/\sin \theta }}^{ + \infty } {mq\cos \alpha f\left( {\left( {y - u} \right),0,\,\,H_{0} /\tan \gamma } \right)du} - \\ \int_{{H_{0} \cot \theta + \left( {{\raise0.7ex\hbox{$L$} \!\mathord{\left/ {\vphantom {L 2}}\right.\kern-0pt} \!\lower0.7ex\hbox{$2$}} - s_{1} } \right)\sin \left( {\alpha + \theta } \right)/\sin \theta }}^{ + \infty } {mq\cos \alpha f\left( {\left( {y - v} \right),0,\,\,H_{0} /\tan \gamma } \right)dv} \\ \end{gathered} $$
(11)

Set yu = t. When u =  + ∞, t = − ∞; when u = H0cotθ + (− L/2 + s2)sin(α + θ)/sinθ, t = yH0cotθ + (L/2-s2)sin(α + θ)/sinθ; d(y-u) = − dt. Set y v = t. When v =  + ∞, t = − ∞; when v = H0cotθ + (L/2−s1)sin(α + θ)/sinθ, t = y-H0cotθ-(L/2-s1)sin(α + θ)/sinθ; d(y-v) = − dt. Substitute them into the upper and lower limits of the integral to obtain the following:

$$ \begin{gathered} W_{\left( y \right)}^{0} { = }mq\cos \alpha \left( { - \int_{{y - H_{0} \cot \theta + {\raise0.7ex\hbox{${\left( {L/2 - s_{2} } \right)\sin \left( {\alpha + \theta } \right)}$} \!\mathord{\left/ {\vphantom {{\left( {L/2 - s_{2} } \right)\sin \left( {\alpha + \theta } \right)} {\sin \theta }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\sin \theta }$}}}}^{ - \infty } {f\left( {t,0,\,\,H_{0} /\tan \gamma } \right)dt} } \right) + \\ mq\cos \alpha \left( {\int_{{y - H_{0} \cot \theta - {\raise0.7ex\hbox{${\left( {L/2 - s_{1} } \right)\sin \left( {\alpha + \theta } \right)}$} \!\mathord{\left/ {\vphantom {{\left( {L/2 - s_{1} } \right)\sin \left( {\alpha + \theta } \right)} {\sin \theta }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\sin \theta }$}}}}^{ - \infty } {f\left( {t,0,\,\,H_{0} /\tan \gamma } \right)dt} } \right) \\ \end{gathered} $$
(12)

After transforming the upper and lower bounds of the integral, we can obtain the following.

$$ \begin{gathered} W_{(y)}^{0} = mq\cos \alpha \left( {\int_{ - \infty }^{{y - H_{0} \cot \theta + {\raise0.7ex\hbox{${\left( {L/2 - s_{2} } \right)\sin \left( {\alpha + \theta } \right)}$} \!\mathord{\left/ {\vphantom {{\left( {L/2 - s_{2} } \right)\sin \left( {\alpha + \theta } \right)} {\sin \theta }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\sin \theta }$}}}} {f\left( {t,0,\,\,H_{0} /\tan \gamma } \right)dt} } \right) - \\ mq\cos \alpha \left( {\int_{ - \infty }^{{y - H_{0} \cot \theta - {\raise0.7ex\hbox{${\left( {L/2 - s_{1} } \right)\sin \left( {\alpha + \theta } \right)}$} \!\mathord{\left/ {\vphantom {{\left( {L/2 - s_{1} } \right)\sin \left( {\alpha + \theta } \right)} {\sin \theta }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\sin \theta }$}}}} {f\left( {t,0,\,\,H_{0} /\tan \gamma } \right)dt} } \right) \\ \end{gathered} $$
(13)

Having solved the definite integral, we can obtain the following result.

$$ \begin{gathered} W_{\left( y \right)}^{0} = mq\cos \alpha F\left( {y - H_{0} \cot \theta + {\raise0.7ex\hbox{${\left( {L/2 - s_{2} } \right)\sin \left( {\alpha + \theta } \right)}$} \!\mathord{\left/ {\vphantom {{\left( {L/2 - s_{2} } \right)\sin \left( {\alpha + \theta } \right)} {\sin \theta }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\sin \theta }$}},0,\,H_{0} /\tan \gamma } \right) - \\ mq\cos \alpha F\left( {y - H_{0} \cot \theta - {\raise0.7ex\hbox{${\left( {L/2 - s_{1} } \right)\sin \left( {\alpha + \theta } \right)}$} \!\mathord{\left/ {\vphantom {{\left( {L/2 - s_{1} } \right)\sin \left( {\alpha + \theta } \right)} {\sin \theta }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\sin \theta }$}},0,\,H_{0} /\tan \gamma } \right) \\ \end{gathered} $$
(14)

Equation (14) is the prediction equation for subsidence on a major section along the dip of the coal seam in the case of finite extraction of gently inclined seams.

This section will then derive the prediction equations for subsidence on a major section along the strike of a coal seam in the case of a finite extraction of a gently inclined seam.

First, a coordinate system was established for the surface. The origin O1 of the x-axis coordinate is located just above the center of the gob, and the x-axis points to the right (see Fig. 5). The mining length along the strike of the coal seam is l. sl and sr are the deviations of inflection points along the left and right sides of the coal seam, respectively. The origin of the coal seam coordinate system’s s-axis, O, is set in the middle of the gob and points to the right. In addition, the coordinate system for the coal seam has the same orientation and scale as the coordinate system for the surface. Therefore, when the excavation range is s ∈ (− 0.5 l + sr,0.5 l-sl), the sinking at a point x on the ground is given by:

$$ W_{\left( x \right)}^{0} = mq\cos \alpha \int_{{ - 0.5l + s_{r} }}^{{0.5l - s_{l} }} {f\left( {x - ,0,\,H_{0} /\tan \gamma } \right)ds} $$
(15)
Fig. 5
figure 5

Computational principles of subsidence on a major section along a strike of a coal seam in the case of a finite extraction of a gently inclined seam

Set x - s = t. When s = − 0.5 l + sr, t = x + 0.5 l-sr; when s = 0.5 l-sl, t = x-(0.5 l-sl); d(x-s) = − dt. Thus:

$$ \begin{gathered} W_{\left( x \right)}^{0} = mq\cos \alpha \int_{{x + 0.5l - s_{r} }}^{{x - \left( {0.5l - s_{l} } \right)}} { - f\left( {{\text{t}},0,\,H_{0} /\tan \gamma } \right)} dt = \\ mq\cos \alpha \int_{{x - \left( {0.5l - s_{l} } \right)}}^{{x + 0.5l - s_{r} }} {f\left( {{\text{t}},0,\,\,H_{0} /\tan \gamma } \right)} dt \\ \end{gathered} $$
(16)

Having solved the definite integral, we can obtain the following result.

$$ W_{\left( x \right)}^{0} = mq\cos \alpha \left( {F\left( {\left( {x + 0.5l - s_{r} } \right),0,\,H_{0} /\tan \gamma } \right) - F\left( {\left( {x - 0.5l + s_{l} } \right),0,\,H_{0} /\tan \gamma } \right)} \right) $$
(17)

Equation (17) is the prediction equation for subsidence on a major section along the strike of the coal seam in the case of finite extraction of gently inclined seams. It is also essentially the difference between the surface subsidence induced by the semi-infinite extraction of two coal seams.

A prediction equation of subsidence at any point on the surface induced by the extraction of a rectangular panel

First, a coordinate system was established for the surface and coal seams. As illustrated in Fig. 6, the origin of the coal seam’s coordinate system, sO1t lies in the gob’s center, with the t-axis pointing towards the coal seam’s dip. The horizontal projection of the coordinate system sO1t for the coal seam coincides with that of the coordinate system xOy for the surface. Under two-dimensional conditions, the forecasting formula for ground subsidence induced by element extraction is shown in Eq. (5). Under three-dimensional conditions, the ground sinking is influenced by the combined influence of extraction along the coal seam’s strike and dip (Deng et al. 2014). According to the influence function theory, the subsiding for point A (coordinates (x, y)) on the ground induced by the extraction of element B (coordinates (s, t)) is the following.

$$ w_{e} \left( {x,y} \right) = f\left( {x - s,0,H_{0} /\tan \gamma } \right)f\left( {y - t,0,H_{0} /\tan \gamma } \right) $$
(18)
Fig. 6
figure 6

Coordinate system for the surface and coal seam

The mining length along the strike is l. The mining length along the dip is L, as illustrated in Fig. 6. The extraction range along the strike is s ∈ (− 0.5 l + sr,0.5 l-sl) (see Fig. 6). Due to the inclination of the coal seam, the mining length and height along the dip of the coal seam needs to be converted to the mining length and height along the horizontal coal seam. Therefore, the mining height of the coal seam is mqcosα. The extraction range along the coal seam’s dip is t ∈ ((-L/2 + s2) sin(α + θ)/sinθ + H0cotθ, (L/2-s1)sin(α + θ)/sinθ + H0cotθ). The sinking at point A on the ground induced by the extraction of a rectangular panel is as follows.

$$ \begin{gathered} w\left( {x,y} \right) = \int_{{ - 0.5l + s_{r} }}^{{0.5l - s_{l} }} {\int_{{H_{0} \cot \theta + \left( { - L/2 + s_{2} } \right)\sin \left( {\alpha + \theta } \right)/\sin \theta }}^{{H_{0} \cot \theta + \left( {L/2 - s_{1} } \right)\sin \left( {\alpha + \theta } \right)/\sin \theta }} {mq\cos \alpha f\left( {x - s,0,H_{0} /\tan \gamma } \right)} } \times \\ f\left( {y - t,0,H_{0} /\tan \gamma } \right)dtds \\ \end{gathered} $$
(19)
$$ \begin{gathered} w\left( {x,y} \right) = mq\cos \alpha \int_{{ - 0.5l + s_{r} }}^{{0.5l - s_{l} }} {f\left( {x - s,0,H_{0} /\tan \gamma } \right)ds \times } \\ \int_{{H_{0} \cot \theta + \left( { - L/2 + s_{2} } \right)\sin \left( {\alpha + \theta } \right)/\sin \theta }}^{{H_{0} \cot \theta + \left( { - L/2 - s_{1} } \right)\sin \left( {\alpha + \theta } \right)/\sin \theta }} {f\left( {y - t,0,H_{0} /\tan \gamma } \right)dt} \\ \end{gathered} $$
(20)

Equation (21) can be obtained based on the derivation process of Eq. (14) and (17).

$$ \begin{gathered} w\left( {x,y} \right) = mq\cos \alpha \left( \begin{gathered} F\left( {x + 0.5l - s_{r},0,H_{0} /\tan \gamma } \right) \hfill \\ - F\left( {x - 0.5l + s_{l} ,0,H_{0} /\tan \gamma } \right) \hfill \\ \end{gathered} \right) \times \\ \left( \begin{gathered} F\left( {y - H_{0} \cot \theta + \left( {L/2 - s_{1} } \right)\;\sin \left( {\alpha + \theta } \right)/\sin \theta ,0,H_{0} /\tan \gamma } \right) \hfill \\ - F\left( {y - H_{0} \cot \theta - \left( {L/2 - s_{1} } \right)\sin \left( {\alpha + \theta } \right)/\sin \theta \;\;,0,H_{0} /\tan \gamma } \right) \hfill \\ \end{gathered} \right) = \\ W_{\left( x \right)}^{0} W_{\left( y \right)}^{0} /mq\cos \alpha \\ \end{gathered} $$
(21)

Equation (21) is the prediction equation of subsidence at any point on the surface induced by the extraction of a rectangular panel. In other words, Eq. (21) is the prediction model for mining subsidence at any point on the surface based on an unskewed continuous probability distribution over an infinite interval. This equation is a forecasting formula for the ground subsidence induced by the extraction of the rectangular panel. This prediction model has at least seven parameters. They are, respectively, the tangent of main influence angle (tanγ), the subsidence factor (q), the influence transference angle (θ), the deviation of inflection point along the dip side of the coal seam (s1), the deviation of inflection point along the rising side of the coal seam (s2), the deviation of inflection point along the left side (sl), and the deviation of inflection point along the right side (sr).

Application method of CPM

The scope of application for the CPM is identical to that of PIM. It is suitable for predicting the continuous and non-skewed subsidence patterns caused by exploiting horizontal or gently inclined coal seams. Therefore, engineers need to determine whether the subsidence pattern in a mining area aligns with the applicability of the CPM. The proposed CPM in this paper is a prediction model that does not have specific expressions, making it unsuitable for directly predicting surface subsidence. First, engineers should select specific probability distribution functions. Second, these distribution functions can be incorporated into the CPM to transform the CPM into a subsidence prediction model based on the specific distribution function. Finally, the prediction of surface subsidence in mining areas can be achieved using the subsidence prediction model based on the specific distribution function.

The chosen probability distribution function needs to satisfy two criteria. First, the selected probability distribution function must be based on an unskewed continuous probability distribution over an infinite interval. Second, the accuracy with which the selected probability distribution function is fitted to the surface subsidence curve needs to meet the corresponding requirements. In practical engineering, the root-mean-square error RMSE (mm) and the relative error RE (%) are generally used to evaluate the fitting accuracy. Their calculation methods are shown in Eqs. (22) and (23), respectively.

$$ RMSE = \sqrt {\sum\limits_{i = 1}^{N} {\left( {w_{im} - w_{ip} } \right)^{2} } /N} $$
(22)
$$ RE = RMSE/W_{\max } $$
(23)

where N represents the total number of observation points. wim and wip indicate the measured and predicted values of an observation point i, respectively (mm). Wmax is the maximum subsidence value (mm).

Relationship with the PIM

The following is the normal distribution’s CDF:

$$ F_{N} = {\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 2}}\right.\kern-0pt} \!\lower0.7ex\hbox{$2$}}\left( {1 + erf\left( {x - u_{n} /\sigma_{n} \sqrt 2 } \right)} \right) $$
(24)

Set \(\sigma = \sigma_{n} = {\raise0.7ex\hbox{$r$} \!\mathord{\left/ {\vphantom {r {\sqrt {2\pi } }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\sqrt {2\pi } }$}}\), r indicates the main influence radius for the PIM. r = H0/tanβ. tanβ indicates the tangent of main influence angle for the PIM. The relationship between tanβ and tanγ is as follows.

$$ \tan \gamma = H_{0} /\sigma = H_{0} /\left( {r/\sqrt {2\pi } } \right) = \sqrt {2\pi } \left( {H_{0} /r} \right) = \sqrt {2\pi } \tan \beta $$
(25)

Replace CDF F(•) in Eq. (21) with the CDF FN of normal distribution. Set μn = 0, \(\tan \gamma = \sqrt {2\pi } \tan \beta\), can be obtained:

$$ \begin{gathered} W_{pim} \left( {x,y} \right) = mq\cos \alpha \left[ {W_{PIM} \left( {x + 0.5l - s_{r} } \right) - W_{PIM} \left( {x - 0.5l + s_{l} } \right)} \right] \times \\ \left[ \begin{gathered} W_{PIM} \left( {y - H_{0} \cot \theta +^{{\left( {{\raise0.7ex\hbox{$L$} \!\mathord{\left/ {\vphantom {L 2}}\right.\kern-0pt} \!\lower0.7ex\hbox{$2$}} - s_{1} } \right)\sin \left( {\alpha + \theta } \right)}} /\sin \theta } \right) \hfill \\ - W_{PIM} \left( {\sin \theta -^{{\left( {{\raise0.7ex\hbox{$L$} \!\mathord{\left/ {\vphantom {L 2}}\right.\kern-0pt} \!\lower0.7ex\hbox{$2$}} - s_{1} } \right)\sin \left( {\alpha + \theta } \right)}} /\sin \theta } \right) \hfill \\ \end{gathered} \right] \\ \end{gathered} $$
(26)
$$ W_{PIM} \left( X \right) = \frac{1}{2}\left( {1 + erf\left( {\frac{\tan \beta X\sqrt \pi }{{H_{0} }}} \right)} \right) $$
(27)

Equation (27) is the prediction equation for sinking at any point on the ground based on the PIM. It is a special case of the CPM.

Method validation

As shown in Fig. 7, taking two coal mines in Jining and Heze cities, Shandong province, China, as examples, these mines have previously extracted panels numbered 6304 and 2301, respectively. The roof management method employed is caving. The geological and mining conditions of the two panels are shown in Table 1. Two observation lines were set up above each panel to study the subsidence mining law (see Fig. 7). These lines include one along the strike direction of the panel and another along the dip direction. The terrain above the gob is relatively flat. Engineers utilized leveling measurements to monitor surface subsidence when it ceased. Both mining areas exhibit a continuous distribution pattern of surface subsidence. Table 1 shows that both mining areas extracted gently inclined seams, fulfilling the conditions for using the CPM model to predict surface subsidence in these mining areas. Decision-makers have requested that the engineer select appropriate distribution functions to incorporate into the CPM model to establish a suitable surface subsidence prediction model for these two mining areas.

Fig. 7
figure 7

Geographic location of the study area and relative positions of the longwall panel and observation line

Table 1 The geological and mining conditions

According to Table 1, it can be observed that both mining areas have thick alluvium. Based on experience, subsidence in mining areas with thick alluvium exhibits a long-tailed distribution characteristic. Therefore, probability distributions with long-tailed characteristics should be selected. In this study, the Logistic distribution, T Location-Scale distribution, Laplace distribution, and Cauchy distribution were chosen to construct subsidence prediction models and fit the subsidence data (Fang and Xu 2016; Li 2016; Yin et al. 2022).

The main influence radius r of the PIM and the main influence radius σf of the CPM are both scale parameters of probability distributions. Their relationship is given by \(\sigma_{f} = {\raise0.7ex\hbox{$r$} \!\mathord{\left/ {\vphantom {r {\sqrt {2\pi } }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\sqrt {2\pi } }$}}\). To ensure numerical proximity between the main influence radius σf of various subsidence prediction models and the main influence radius r of the PIM, let the location parameter and scale parameter for these distribution functions be set as \(\mu_{f} = 0,{ }\sigma_{f} = {\raise0.7ex\hbox{$r$} \!\mathord{\left/ {\vphantom {r {\sqrt {2\pi } }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\sqrt {2\pi } }$}}\), respectively. All subsidence prediction models used r as the main influence radius. The transformed PDFs and CDFs for each distribution function are shown in Table 2.

Table 2 Transformed PDF and CDF

In Table 2, Γ(•) denotes the gamma function, and 2F1(•) represents the Gaussian hypergeometric function. In the T Location-Scale distribution, the parameter ν represents the degree of freedom. A smaller value of ν indicates a longer tail in the distribution (Yin et al. 2022).

Then, substitute the CDFs from Table 2 into Eq. (21) to establish the subsidence prediction models based on the Logistic distribution, Cauchy distribution, T Location-Scale distribution, and Laplace distribution. These subsidence prediction models are named LOM, CM, TM, and LAM, respectively. The LOM is equivalent to the subsidence prediction model based on the Boltzmann function (Wang et al. 2013). The observed subsidence data of mining areas in Jining and Heze were fitted using these prediction models. The particle swarm optimization algorithm was used to match these prediction models with observed surface subsidence data. A comparison was made between the predictive accuracy of these forecasting models and that of the commonly used PIM among Chinese engineers. The fitting results for the two mining areas are shown in Fig. 8 and Fig. 9, while the fitting accuracy is presented in Table 3 and Table 4.

Fig. 8
figure 8

Fitting performance of subsidence in the Jining mining area: a Line A, b Line B

Fig. 9
figure 9

Fitting performance of subsidence in the Heze mining area: a Line E, b Line N

Table 3 Parameters and accuracies of the model obtained from the fit (Jining mining area)
Table 4 Parameters and accuracies of the model obtained from the fit (Heze mining area)

Table 3 and Table 4: In both tables, “mt” refers to the overall fitting accuracy of the subsidence basin, while “me” represents the fitting accuracy of the subsidence data outside the gob (i.e., at the outer edge of the basin). This accuracy is used to describe the fitting performance of the model at the peripheral locations of the subsidence basin.

According to Table 3, all models have a relative error of less than 4% in fitting in the Jining mining area. The ranking of overall fitting accuracy for the entire subsidence basin, from highest to lowest, is as follows: TM > LAM > LOM > PIM > CM. Among the other models, the TM exhibits the highest accuracy in fitting the subsidence in the mining area. Conversely, the CM has the lowest fitting accuracy. Compared to the conventional PIM, the TM shows a 13.8% improvement in fitting accuracy. As shown in Fig. 8, the subsidence impact range in the Jining mining area is significant. At the peripheral locations of the subsidence basin, the predicted values of the PIM model are lower than the measured values. On the other hand, at these locations, the predicted values of the other models are higher than those of the PIM. The ranking of fitting accuracy at the peripheral locations of the subsidence basin, from highest to lowest, is as follows: TM > LAM > LOM > PIM > CM. The TM demonstrates a 28.3% improvement in fitting accuracy compared to the traditional PIM. Therefore, using the TM to predict the boundary of surface subsidence in the mining area provides more precise results. Despite having the lowest fitting accuracy, the CM predicts higher values at the peripheral locations of the subsidence basin compared to other models. Hence, using the CM to predict the boundary of surface subsidence in the mining area ensures a safer approach.

According to Table 4, in the Heze mining area, all models have a relative error of less than 3% in fitting. Among the models, the TM exhibits the highest accuracy in fitting the subsidence in this mining area. Compared to the PIM, the TM shows a 22.2% improvement in the overall fitting accuracy of the entire subsidence basin. As shown in Fig. 9, the surface subsidence in the Heze mining area still follows a long-tail distribution pattern. At the peripheral locations of the subsidence basin, the TM demonstrates the highest fitting accuracy. Compared to the PIM, the TM shows a 63% improvement in fitting accuracy at these peripheral locations. Similarly, the CM predicts higher values at the peripheral locations of the subsidence basin compared to other models.

The Jining and Heze mining areas have thick alluvium, and subsidence in these areas typically exhibits a long-tail distribution pattern. The PIM, developed based on a normal distribution, does not accurately capture this long-tail distribution. Consequently, the accuracy of predicting surface subsidence using the PIM is relatively poor. It is crucial to accurately predict surface subsidence at the peripheral locations of the subsidence basin to determine the boundaries of mining-induced damage more precisely. After screening, the TM has been selected as the subsidence prediction model for both the Heze and Jining mining areas.

The experimental results indicate that, compared to the PIM, CPM can improve the accuracy of mining subsidence prediction.

Discussion

(1) Comparison of CPM with mechanics and numerical simulation methods.

Numerical simulation methods refer to using mechanics software developed based on finite element and discrete element methods to make predictions. It is evident that numerical simulation methods are essentially a form of mechanics methods. Relevant studies have shown that predictive models based on mechanics methods can effectively predict the movement patterns of overlying strata and surfaces (Qin et al. 2023). Additionally, mechanical methods can reveal the mechanisms behind surface subsidence. On the other hand, the CPM, which is based on influence function theory, is a mathematical method that does not consider the mechanical properties of rock mass. However, In China, it should be noted that the application of mechanics methods in predicting mining subsidence is not even as extensive as the use of PIM. There are several reasons for this:

Mechanics methods heavily rely on the existing data on geological conditions, subsurface formations, and soil properties in the study area. Taking a Jining mining area in Shandong Province, China as an example, Fig. 10 shows the stratigraphic information exposed by a specific borehole in this mining area. The number of rock layers in the stratigraphy of this mining area is more than 60. Engineers need to accurately test the mechanical strength and stiffness parameters of rock in each layer. Therefore, mechanics methods involve a significantly larger number of model parameters compared to PIM and CPM. This work is extremely challenging. Moreover, for some rocks with poor cementation properties, obtaining the core for some rocks is impossible, making it impossible to test their mechanical parameters. Even if the mechanical parameters of rocks are accurately measured, they may not be consistent with the rock mass parameters. In-situ testing experiments are required to measure the mechanical parameters of rock masses. There are still difficulties in accurately obtaining the mechanical parameters of rock mass in each layer. Additionally, the occurrence conditions of the overlying strata are extremely complex (Peng et al. 2015). Even two adjacent boreholes may have significant differences in the stratigraphic information they expose (Peng et al. 2015). Even rock masses with the same burial depth can have different lithology and thickness, so constructing a mechanical model based solely on the stratigraphic information from one borehole is inadequate. Additionally, the mechanical strength and stiffness parameters of different rock cores from the same rock layer, as measured in the laboratory, are inconsistent (Peng et al. 2015). As a result, conducting three-dimensional subsidence prediction using mechanics or numerical simulation methods requires extensive computational work. These limitations restrict the widespread adoption of this approach.

Fig. 10
figure 10

Stratum information revealed by a borehole

However, CPM has fewer parameters compared to mechanics methods. The predictive parameters of the CPM can be obtained by inverting surface movement observation data. Many mining areas in China have abundant surface movement observation data, which provides convenient conditions for obtaining predictive parameters of the CPM. Furthermore, as CPM is an extension of PIM, the parameters in both models have the same meaning, and their values are dependent on local geological and mining conditions. For instance, the subsidence factor q in the PIM and CPM is related to the mechanical strength and stiffness of rocks. The lower the strength of the overlying strata and the smaller the elastic modulus, the larger the subsidence factor. By adjusting the parameters in the model, it is possible to alter the shape of the predicted curve to achieve a closer fit with the measured subsidence curve.

Compared to mechanics methods, CPM is closer to an empirical approach as it abstracts the surface subsidence patterns into mathematical functions. Although this model does not consider the mechanical properties of rock masses, the fitting effect of this model on subsidence law is better than that of PIM commonly used in Chinese engineering. This viewpoint has been supported by experiments conducted in this study.

(2) Compared to other methods, such as PIM.

Many subsidence prediction equations, such as PIM or the subsidence prediction equation based on the Boltzmann function, are predictive models developed based on specific functions. However, these methods are not necessarily the optimal prediction methods in some mining areas. On the other hand, CPM does not have a specific expression and encourages engineers to screen appropriate probability distribution functions and incorporate them into CPM to establish prediction equations. This is a significant difference between CPM and other methods. This approach can further improve the accuracy of mining subsidence prediction and avoid limiting the improvement of prediction accuracy due to considering only one prediction method developed based on a specific function.

In the “Method Validation” section, based on the long-tailed characteristics of surface subsidence in the Heze and Jining mining areas, the TM model was constructed by selecting the T Location-Scale distribution and incorporating it into CPM. The TM is constructed based on the T Location-Scale distribution and has a specific mathematical expression. Compared to the traditional PIM, the TM shows an improvement of 13.8% and 22.2% in fitting accuracy for subsidence in Jining and Heze, respectively. Compared to the PIM, the TM even shows a 63% improvement in fitting accuracy at peripheral locations of the subsidence basin. Additionally, TM exhibits higher prediction accuracy compared to LAM, LOM, and CM. The improved accuracy of subsidence prediction in these mining areas can be attributed to the CPM approach, which allows engineers to screen probability distribution functions instead of relying solely on PIM as the predictive method. In conclusion, the experimental results demonstrate that CPM contributes to improving the accuracy of subsidence prediction.

This study links prediction models for mining subsidence to probability theory. The research findings of this study indicate that the PDF can serve as the influence function (element subsidence function). CPM, fundamentally, is a mathematical expression composed of the CDF. Engineers only need to substitute the CDF of a given probability distribution into the CPM to develop a corresponding prediction model for mining subsidence. They do not have to ab-initio derive prediction equations for mining subsidence based on a specific function, reducing their workload. Therefore, this work also helps researchers understand the mathematical mechanisms underlying prediction models for mining subsidence.

Conclusions

  1. (1)

    This study developed the CPM according to the influence function theory. CPM is an extension of PIM, sharing the same fundamental principles and application scope. CPM relates the probability distribution function to the subsidence prediction. CPM does not have a specific function expression. Instead, engineers are allowed to screen some unskewed continuous probability distribution functions to incorporate into this model and establish subsidence prediction equations. This approach can further improve the accuracy of mining subsidence prediction. This method provides engineers with more opportunities for selection. By not solely considering one prediction method developed based on a specific function, it avoids limiting the improvement of prediction accuracy.

  2. (2)

    In the mining areas of Jining and Heze in China, surface subsidence exhibits a long-tail distribution characteristic. A T Location-Scale distribution was selected and incorporated into the CPM, and the TM model was constructed. Compared to the traditional PIM, the TM shows an improvement of 13.8% and 22.2% in fitting accuracy for subsidence in Jining and Heze, respectively. Additionally, the fitting precision of TM is higher than that of LAM, LOM, and CM when fitting mining subsidence with long-tailed distribution characteristics. This experiment confirms that CPM contributes to improving the accuracy of subsidence prediction.