Introduction

China possesses substantial coal resources, while the hydrogeological conditions of mining are among the most complicated in the world (Li et al. 2018; Wu et al. 2011b; Yin et al. 2019; Zhao et al. 2018). In North China, Permo-Carboniferous coal seams are mined underground, and the thick Ordovician limestone strata containing an abundant supply of water are often formed below the bottom of the Permo-Carboniferous coal-bearing formations (Wang and Shi 2019; Xu et al. 2019). The Ordovician limestone aquifer acts as the main source of water recharge. Karst water hazards from the Ordovician limestone aquifer pose a severe threat to the exploitation of deep coal seams in North China-type coalfields, causing equipment damage, project delays, and even casualties (Bu et al. 2019; Sun et al. 2015; Wang and Meng 2018; Zhang et al. 2017). Water inrush is controlled by multiple factors, such as water pressure, water yield property, geological structures, floor aquifuge, and mining conditions (Li et al. 2015; Wu et al. 2017b; Zhao et al. 2014), among which the water yield property is an essential factor that refers to the productivity (quantity and duration) of the aquifer (Qiu et al. 2017; Yin et al. 2018). The severity of water hazards depends on the water yield property (Wu et al. 2011a, b). It will not cause substantial harm, even if water passage can reach an aquifer in a poor water yield property region during mining, however, it is more likely to cause serious water inrush in a rich water yield property zone (Qiu et al. 2017; Wu et al. 2017a, c). Hence, an objective and comprehensive assessment of the water yield property of an aquifer is essential for safely mining in deep coal seams and managing the mining (Wu 2014; Shi et al. 2019c).

Specific field (represented by q) has been widely used to evaluate the water yield property of aquifers in China based on regulations for mine water prevention and control (State Administration of Work Safety and State Administration of Coal Mine Safety 2009), which is defined as water production of boreholes per meter of drawdown during pumping tests operated in surface boreholes. However, the operability of this specific field method is minimal in practice because of the absence of borehole data from a pumping test in the Ordovician limestone aquifer which costs a lot. As a result, a scientific and accurate partition map of the water yield property of the Ordovician limestone aquifer drawn using q is difficult to achieve. The assessment of the water yield property will be greatly limited if we only consider the specific yield. Another traditional approach to evaluating the water yield property of aquifers is the geophysical method (Gao et al. 2019; Shi et al. 2019b). These traditional approaches are usually time-consuming and expensive. Hence, it is of great management importance to develop an effective assessment method of the water yield property with the available geological and hydrogeologic data.

Recently, numerous scholars and coal workers have devoted themselves to the research and approach emerged one after another. A water-richness index method that has been widely promoted was proposed based on the analytic hierarchy process by a geographic information system to evaluate the water yield property of aquifers (Wu et al. 2011a; Yan 2015; Yin et al. 2018). A fuzzy analytic hierarchy process has been used to analyze the water yield property based on GIS (Han et al. 2012; Yang et al. 2017). These methods provide a powerful tool for predicting the water yield property of aquifers, but at the same time, the methods are not appropriate because the weights of the controlling factors are obtained by the analytic hierarchy process, which is relatively subjective (Wu et al. 2017c). To address this disadvantage, weighted grey relational degree theory and the fuzzy analytic hierarchy process were used to evaluate the water yield capacity of a fracture aquifer (Xu et al. 2016; Wu et al. 2017c). These methods provided good results, but the models were linearly weighted. In addition, numerical modeling was used to predict the permeability of the fracture networks (Mourzenko and Thovert 2011), and remote sensing technology and geographic information system modeling were used to map the groundwater potential zones (Dar et al. 2010). However, with the buried depth increasing of the fracture aquifer, these approaches are not always applicable because they are restricted by topography and geomorphology, and the degree of hydrogeological exploration of new mining areas is usually low. Thus, the use of accessible geological and hydrogeologic data to assess aquifer water yield property is a necessary and challenging task.

Accordingly, this paper proposed an innovative method predicting the water yield property of aquifers using accessible geological and hydrological data based on the grey relative analysis (GRA), fuzzy analytic hierarchy process (FAHP), and the technique for order performance by similarity to ideal solution (TOPSIS), applied to predict the water yield property of the upper Majiagou group limestone in the Ordovician top (Ordovician top aquifer) in the Xinwen coalfield, Shandong, China. The GRA and FAHP methods were combined to determine the weights of the assessment factors, a process that helps decrease the subjectivity of traditional expert analysis based on the data of the factors and q. Thus, the TOPSIS method, an effective nonlinear multicriteria decision-making method for ranking performance of the technique based on similarity to an ideal solution, was then applied to establish a water yield property index (WYPI) model. Finally, the relatively poorer or richer water yield property zones were predicted by the WYPI model, which provided a scientific and reasonable partition map of the water yield property of the Ordovician top aquifer.

Study area

Geological conditions

The Xinwen coalfield lies in the midwestern part of Shandong Province, China, where there are seven main coalmines from west to east: Shengquan, Xiezhuang, Zhaizhen, Liangzhuang, Suncun, Huayuan, and Huaheng coalmines. This paper focuses on the Xiezhuang, Zhaizhen, and Liangzhuang coalmines, and the study area is shown in Fig. 1. The Xinwen coalfield is a typical North China coalfield of the Permo-Carboniferous period. The coal-bearing strata consist of the Benxi Formation (F), Taiyuan F, Shanxi F, and Shihezi F, from bottom to top. The mined coal seams belong to Taiyuan F and Shanxi F, of which nos. 2, 4, 6, 11, 13 and 15 are minable seams. The coal-bearing strata are covered by the Jurassic System (S) (developed locally in the south of the F10 fault), Paleogene S, and Quaternary S. The sedimentary basement of the coal-bearing strata is Ordovician S.

Fig. 1
figure 1

Geological and structural map of the study area

The Xinwen coalfield is a faulted and folded monocline dipping NNE, with gentle dips less than 30°. Generally, the faults are well developed in the area. The major faults are oriented NW, NEE, and NE, and others are oriented NNW. In particular, the Lianhuashan fault, oriented NW, forms the north border of the coalfield, and the no. 10 fault (F10 in Fig. 1) and its branch faults, in the NW direction, cut the Xinwen coalfield into a south zone and a north zone.

Hydrogeological conditions

The Xinwen coalfield is located in the sunken area between the Lianhuashan and Mengshan Mountains. The main surface water system is the Chaiwen River and its branches, which flow through the coalfield east to west. The aquifers of the area are mainly the water-bearing gravel layer of the Quaternary S, the gravel-stone layer aquifer of Paleogene S, the sandstone fractured aquifers of the Permo-Carboniferous S, the thin limestone aquifers of the Carboniferous Taiyuan F, and the Ordovician limestone karst-fractured aquifer, forming a multilayer structural groundwater system.

After 60 years of coal mining in the Xinwen coalfield, deep coal seams have formed, mainly the nos. 11, 13 and 15 coal seams. The Ordovician limestone aquifer seriously threatens the mining production of the nos. 11, 13, 15 coal seams. Vertically, the thickness of the Ordovician limestone of the coalfield in northern China varies greatly. For the Xinwen coalfield, the thickness of the Ordovician limestone strata is greater than 800 m. Vertically, many paleokarst systems formed and evolved in the Ordovician limestone strata in the process of geological evolution, but karst-fractured groundwater developed in the Ordovician top aquifer has had an impact on production safety. Therefore, the proposed study focuses on the Ordovician top aquifer.

South of the coalfield are mountains of Ordovician limestone and Cambrian limestone, where the Ordovician limestone, exposed to the surface, is mainly recharged by rainfall. The north border of the coalfield is the Lianhuashan fault (the throw of the fault is more than 3000 m). The footwall of the Lianhuashan fault is gneiss of the Pre-Sinian S of the Archaean group, cutting off the recharge of the aquifers in the Xinwen coalfield by rainfall. Only in the local area south of the Lianhuashan fault is Ordovician limestone exposed on the surface. However, in the Xinwen coalfield, the Ordovician limestone is covered by coal-bearing strata and Paleogene strata, which cut off the hydraulic relationship between the Ordovician limestone aquifer and the water on the surface, the water-bearing gravel layer of the Quaternary S, and the gravel-stone layer aquifer of the Paleogene S. With the increase in the burial depth of the Ordovician limestone from south to north in the Xinwen coalfield, the recharge conditions of the groundwater of the Ordovician limestone aquifer are gradually becoming limited (Fig. 1).

Because Ordovician limestone is buried deep underground, detecting the location and water yield property of the Ordovician top aquifer is restricted. In general, the specific field, obtained by a pumping test of the surface borehole, is used to evaluate the water yield property of aquifers in China. In the study area, seven pumping tests of the Ordovician top aquifer were carried out, mostly in the central and southern study areas. According to the regulations for mine water prevention and control (State Administration of Work Safety and State Administration of Coal Mine Safety 2009), the water yield property of aquifers can be defined by four categories: poor water yield property when q ≤ 0.1 L/s m; medium water yield property when 0.1 L/s m < q ≤ 1.0 L/s m; rich water yield property when 1.0 L/s·m < q ≤ 5.0 L/s m; and very rich water yield property when q > 5.0 L/s m. Accordingly, the water yield property of the Ordovician top aquifer in the study area can be assessed based on q as shown in Fig. 2. The water yield property of the Ordovician top aquifer in the southwestern part of the study area was classified as rich, in the north-central part as medium, and the eastern part as poor. However, three water inrushes from the underlying Ordovician top aquifer occurred during mining the nos. 11 and 13 coal seams in the poor water yield property zone based on q, and their water inflow rates were as high as 772.2 m3/h, 1920 m3/h, and 78 m3/h. The occurrence of water inrushes clearly was the opposite of having partitions of the water yield property based on q because large-scale water inrush is impossible in poor water yield property zones during mining. Therefore, it is necessary to objectively and comprehensively assess the water yield property of the Ordovician top aquifer for safely mining in deep coal seams.

Fig. 2
figure 2

Water yield property zone map of the Ordovician top aquifer based on q

Data

Selected relevant factors of the water yield property of Ordovician top aquifer

Usually, the water yield property of the Ordovician limestone aquifer is mainly controlled by the storage space of water, the quantity of water in the storage space and the water supply. The main potential water yield property conditioning factors in this study are the water inflow of borehole (WIB), fault impact factor (FIF), drilling fluid consumption (DFC) and burial depth (BD) of the Ordovician top aquifer. These hydrologic and geologic data were gathered from boreholes and seismic exploration before and during mining.

Relevant factors

Water inflow of borehole (WIB)

The water inflow of boreholes is the most direct indicator of an aquifer water yield property, which is the maximum water gushing when leaving the Ordovician limestone strata. The larger the value of WIB is, the richer the water is. During mining the nos. 11, 13, and 15 coal seams, supplementary hydrogeological explorations of the Ordovician top aquifer were carried out. Ninety underground hydrogeologic boreholes were drilled into the Ordovician limestone aquifer to explore the conditions of the Ordovician top aquifer in the study area. The WIB was obtained by observing the value of each underground hydrogeologic borehole, and then, the contour map of WIB in the Ordovician top aquifer was drawn as shown in Fig. 3a. The change in WIB was large in the study area. In the south-central and north-central regions, the WIB was larger than that in the other regions.

Fig. 3
figure 3

Thematic maps of every relevant parameters

Fault impact factor (FIF)

Water in the Ordovician limestone strata mainly exists in fractures. Faults have a great influence on groundwater storage and migration, which is closely related to the groundwater potential (Mourzenko and Thovert 2011; Yin et al. 2018). The analysis of water inrush indicates that rock is more crushed and fractured at the intersections and endpoints of faults than at other areas of faults because of the concentration of ground stress (Shi et al. 2019a; Yin et al. 2018). The fractured zone further develops and forms the main space for accumulating groundwater in the fractured aquifer (Wu et al. 2011a). The greater number of and larger the faults are, the more space for storage and conduction of groundwater (Yin et al. 2018). Therefore, the intersections and endpoints of faults, extension length, and fault throw were taken into account by the fault impact factor to evaluate the development degree of a fracture. The fault impact factor is calculated as follows:

$${\text{FIF}} = \frac{{\sum\nolimits_{i}^{n} {l_{i} h_{i} } }}{S} + M,$$
(1)

where FIF is the fault impact factor; h is the fault throw; l is the corresponding strike length; S is the area of the grid cell; n is the number of faults encountered in the grid cell; M is the normalization value of fault intersections and endpoints. According to fault distribution characteristics, the study area was organized into 500 m × 500 m grid cells, and then, the fault intersections and endpoints, and fault throw and its corresponding strike length were counted in each grid cell to calculate FIF. A contour map of FIF is shown in Fig. 3b.

Drilling fluid consumption (DFC)

When boreholes are drilled into an aquifer, drilling fluid consumption will clearly increase. Differences in consumption indicate the distinction of cracks in terms of development degree and connectivity. The DFC is an index that reflects the connectivity of the crack and permeability of an aquifer (Wu et al. 2008). The DFC was obtained by observing the value in each exploratory borehole, and the contour map of DFC was drawn by Golden Software Surfer 13.0 as shown in Fig. 3c.

Burial depth (BD)

The burial depth of the Ordovician top aquifer is related to the water supply. With the increase in the burial depth, the recharge conditions become gradually limited in the study area. In addition, the lithostatic pressure increases with increasing depth, as does the degree of crack compaction, which decreases the connectivity slightly by possibly closing the fractures present. The burial depth of the Ordovician top was acquired from geoexploration, 3D-seismic exploration, and workface construction, and its contour map is shown in Fig. 3d.

Methodology

Procedures

Predicting the water yield property potential zone of the Ordovician top aquifer included four main steps: (a) selecting the relevant factors of the water yield property and data collection, (b) determining the evaluation factor weights, (c) establishing the water yield property index model, and (d) validating and applying the results.

Determination of the factor weights

The fuzzy analytic hierarchy process is a fuzzy group decision-making method that comprehensively uses fuzzy assessment and an analytic hierarchy process. This interactive determination process enables decision makers to fully participate in the determination and analysis of weights. However, there are similarities between the traditional analytic hierarchy process and fuzzy analytic hierarchy process; the relative importance of the evaluation factors was determined by experts based on the Saaty rating scale, and the data were subjective and did have a scientific basis. To avoid these disadvantages, the GRA method was applied to improve the traditional FAHP method.

Calculation of the comparison matrix based on the GRA method

GRA is a significant part of the grey system theory that professor Deng created, which is a simple and scientific process used to determine relevancy between evaluation factors and a target (Deng 1982, 1989). This method is used to judge the degree of the relationship based on the geometrical distance between the reference sequence (specific field q reflecting the water yield property) and the compared sequences (the evaluation factors including WIB, FIF, DFC and BD). The data of q and the evaluation factors in seven hydrologic boreholes were collected where pumping tests were carried out in the study area (Table 1).

Table 1 Data of q and the evaluation factors in seven hydrologic boreholes

To eliminate the impact of the different dimensions of the original data, the compared sequences were normalized using the following formula:

$$x_{t}^{{{(1)}}} {(}i{)} = {{x_{t}^{{{(0)}}} {(}i{)}} \mathord{\left/ {\vphantom {{x_{t}^{{{(0)}}} {(}i{)}} {\left[ {\frac{{1}}{n}\sum\limits_{t = 1}^{n} {x_{t}^{{{(0)}}} {(}i{)}} } \right]}}} \right. \kern-\nulldelimiterspace} {\left[ {\frac{{1}}{n}\sum\limits_{t = 1}^{n} {x_{t}^{{{(0)}}} {(}i{)}} } \right]}},$$
(2)

where i = 1, 2, …, m, t = 1, 2, …, n; \(x_{t}^{{{(0)}}} {(}i{)}\) is the original data of the ith evaluation factor in the tth hydrologic borehole. The reference sequence \(x_{t}^{{{(0)}}} {(0)}\) was normalized as follows:

$$x_{t}^{{{(1)}}} {(0)} = {{x_{t}^{{{(0)}}} {(0)}} \mathord{\left/ {\vphantom {{x_{t}^{{{(0)}}} {(0)}} {\left[ {\frac{{1}}{n}\sum\limits_{t = 1}^{n} {x_{t}^{{{(0)}}} {(0)}} } \right]}}} \right. \kern-\nulldelimiterspace} {\left[ {\frac{{1}}{n}\sum\limits_{t = 1}^{n} {x_{t}^{{{(0)}}} {(0)}} } \right]}}.$$
(3)

After normalization, the grey relational coefficients between the evaluation factors and q were solved to determine the influencing degree of the evaluation factors for the water yield property using the following formula (Deng 1989):

$$r_{{i{,0}}} = \frac{{\Delta _{{{\rm min}}} + \xi\Delta _{{{\rm max}}} }}{{\Delta _{{i{,0}}} {(}i{)} + \xi\Delta _{{}} }},$$
(4)

where \(\Delta _{{{\rm max}}} = {\rm max}_{\forall i} {\rm max}_{\forall t} \left| {x_{t}^{{{(1)}}} {(}i{) - }x_{t}^{{{(1)}}} {(0)}} \right|\) is the absolute maximum difference and \(\Delta _{{{\rm max}}} = {\rm min}_{\forall i} {\rm min}_{\forall t} \left| {x_{t}^{{{(1)}}} {(}i{) - }x_{t}^{{{(1)}}} {(0)}} \right|\) is the minimum difference; \(\Delta _{{i{,0}}} {(}i{)} = \left| {x_{t}^{{{(1)}}} {(}i{) - }x_{i}^{{{(1)}}} {(0)}} \right|\) is the absolute difference of the tth hydrologic borehole between the ith evaluation factor and the specific field q. ξ ∈ [0, 1] is the resolution coefficient, and the value is 0.5 in this study.

According to the GRA method, each evaluation factor’s grey relational coefficient of the seven pumping tests was calculated indicating the influencing degree on the aquifer water yield property. The relative importance of the evaluation factors based on the GRA method is shown in Table 2.

Table 2 Relative importance of the evaluation factors based on GRA method

Based on the relative importance of the evaluation factors above as shown in Table 2, we established seven 4 × 4 pair-wise comparison matrixes based on the following equation:

$$A = \left( {\begin{array}{*{20}c} {1} & {a_{{{12}}} } & {a_{{{13}}} } & {a_{{{14}}} } \\ {{1/}a_{{{12}}} } & {1} & {a_{{{23}}} } & {a_{{{24}}} } \\ {{1/}a_{{{13}}} } & {{1/}a_{{{23}}} } & {1} & {a_{{{34}}} } \\ {{1/}a_{{{14}}} } & {{1/}a_{{{24}}} } & {{1/}a_{{{34}}} } & {1} \\ \end{array} } \right),$$
(5)

where aij = Fi/Fj represents a judgment of the relative importance of factors Fi and Fj.

Determination of the total weight based on the FAHP method

First, group fuzzy comparison matrix B was generated (Table 3) based on pair-wise comparison matrixes using a fuzzy triangular number (Ataei et al. 2012):

$${\varvec{B}} = \left( {\begin{array}{*{20}c} {{(1, 1, 1)}} & {{(}\alpha_{{{12}}} {, }\beta_{{{12}}} {, }\gamma_{{{12}}} {)}} & {{(}\alpha_{{{13}}} {, }\beta_{{{13}}} {, }\gamma_{{{13}}} {)}} & {{(}\alpha_{{{14}}} {, }\beta_{{{14}}} {, }\gamma_{{{14}}} {)}} \\ {{1/}\gamma_{{{12}}} {, 1/}\beta_{{{12}}} {, 1/}\alpha_{{{12}}} } & {{(1, 1, 1)}} & {{(}\alpha_{{{23}}} {, }\beta_{{{23}}} {, }\gamma_{{{23}}} {)}} & {{(}\alpha_{{{24}}} {, }\beta_{{{24}}} {, }\gamma_{{{24}}} {)}} \\ {{1/}\gamma_{{{13}}} {, 1/}\beta_{{{13}}} {,1/}\alpha_{{{13}}} { }} & {{1/}\gamma_{{{23}}} {, }\beta_{{{23}}} {,1/}\alpha_{{{23}}} { }} & {{(1, 1, 1)}} & {{(}\alpha_{{{34}}} {, }\beta_{{{34}}} {, }\gamma_{{{34}}} {)}} \\ {{1/}\gamma_{{{14}}} {, 1/}\beta_{{{14}}} {, 1/}\alpha_{{{14}}} {)}} & {{1/}\gamma_{{{24}}} {, 1/}\beta_{{{24}}} {, 1/}\alpha_{{{24}}} } & {{1/}\gamma_{{{34}}} {, 1/}\beta_{{{34}}} {, 1/}\alpha_{{{34}}} } & {{(1, 1, 1)}} \\ \end{array} } \right),$$
(6)

where (αij, βij, γij) is the fuzzy triangular number, and αij ≤ βij ≤ γij; αij, βij, and γij indicate the lower bound, the geometric mean, and the upper bound, respectively.

Table 3 Group fuzzy comparison matrix B

Then, using the fuzzy algorithm, the group fuzzy weight of any evaluation factor (Table 4) was calculated based on the data in Table 3 using the following equation:

$$w_{i} = r_{i} \otimes {(}r_{{1}} \oplus r_{{2}} \oplus r_{{3}} \oplus r_{{4}} )^{ - 1} ,$$
(7)
$$r_{i} = {(}b_{{i{1}}} \otimes b_{{i{2}}} \otimes b_{{i{3}}} \otimes b_{{i{4}}} ),$$
(8)

where operators ⊕ and ⊗ denote the addition and multiplication of fuzzy numbers, respectively; \(w_{i} { = }(w_{i}^{L} ,w_{i}^{M} ,w_{i}^{U} )\) represents the fuzzy weight vector of the ith factor; i = 1, 2, …, 4.

Table 4 Group fuzzy weights and total weights of each evaluation factor

Finally, we obtained the total weights of each evaluation factor by defuzzification and normalization, as shown in Table 4.

Establishment of the water yield property index model

In this paper, we use a nonlinear method, the TOPSIS method, to assess the water yield property of the aquifer. TOPSIS is an evaluation method used to solve multicriteria decision-making problems and was originally developed by Hwang and Yoon (1981). The WYPI model was established using the TOPSIS method based on the values of factors and the weights calculated by GRA and FAHP methods. According to the TOPSIS method, we can first construct one imaginary sample with the richest water yield property, which is defined as a positive ideal solution (PIS), and one imaginary sample with the poorest water yield property, which is defined as a negative ideal solution (NIS). The richest sample of all the assessment samples is the one that has the nearest distance to the positive ideal solution and is farthest from the negative ideal solution, and the poorest sample of all the assessment samples is the one that has the nearest distance to PIS and is farthest from NIS. Then, the distances of each sample to the negative and positive ideal solutions were calculated to establish the water yield property index model, with which we can rank the relative water yield property values of the assessment samples in the study area. Based on the TOPSIS theory, the water yield property index model was conducted as follows:

First, the study area was organized into 500 m × 500 m grid cells using latitudinal and longitudinal lines. Four evaluation factor values in each of 209 grid cells and 7 pumping test hydrologic boreholes (which constituted 216 assessment samples) were collected using the basic functions of data acquisition in the Golden Software Surfer 13.0 to establish a 216 × 4 decision matrix, D216×4. Thus, a weighted normalized decision matrix was established through normalizing and weighting of the decision matrix V as follows:

$${\varvec{V}}={({v}_{pi})}_{216\times 4}={({W}_{i}\times {c}_{pi})}_{216\times 4},$$
(9)
$${c}_{pi}={d}_{pi}/\sqrt{\sum_{p=1}^{216}{d}_{pi}^{2}},$$
(10)

where Wi is the total weight of the ith evaluation factor as shown in the third column in Table 4; dpi, indicating the collection value of the pth assessment sample’s ith evaluation factor, is the element of the decision matrix D216×4; and cpi indicates the normalized value of the pth assessment sample’s ith evaluation factor. i = 1, 2, 3, 4; p = 1, 2, …, 216.

Then, based on the weighted normalized decision matrix, NIS and PIS for the water yield property were determined by Hayaty et al. (2014):

$${\varvec{V}}^{ - } = \left\{ {{(}\mathop {{\rm max}}\limits_{{{1} \le p \le l}} v_{pi} \left| {\;i \in {\varvec{J}}_{{1}} } \right.{),}\;{(}\mathop {{\rm min}}\limits_{{{1} \le p \le l}} v_{pi} \left| {\;i \in {\varvec{J}}_{{2}} } \right.{)}} \right\},$$
(13)
$${\varvec{V}}^{ + } = \left\{ {{(}\mathop {{\rm min}}\limits_{{{1} \le p \le l}} v_{pi} \left| {\;i \in {\varvec{J}}_{{1}} } \right.{),}\;{(}\mathop {{\rm max}}\limits_{{{1} \le p \le l}} v_{pi} \left| {\;i \in {\varvec{J}}_{{2}} } \right.{)}} \right\},$$
(14)

where V and V+ are the negative ideal solution and positive ideal solution, respectively; J1 indicates the set of negative factors, including BD; and J2 is the set of positive factors, including WIB, FIF and DFC.

The distance of each assessment sample to the negative and positive ideal solutions was calculated by the following Eqs. (15)–(16):

$$D_{p}^{ + } = \sqrt {\sum\limits_{{i = {1}}}^{{4}} {{(}v_{pi} - v_{i}^{ + } {)}^{{2}} } }$$
(15)
$$D_{p}^{ - } = \sqrt {\sum\limits_{{i = {1}}}^{{4}} {{(}v_{pi} - v_{i}^{ - } {)}^{{2}} } } .$$
(16)

Finally, the water yield property index of each assessment sample was calculated as:

$${\mathrm{W}\mathrm{Y}\mathrm{P}\mathrm{I}}_{p}=\frac{{D}_{p}^{-}}{{D}_{p}^{+}+{D}_{p}^{-}}$$
(17)

where WYPIp is the water yield property index of the pth assessment sample. The larger the value of WYPI is, the richer the water yield property of the Ordovician top aquifer.

Results and discussion

Results

The WYPI was calculated for each grid cell and each pumping test hydrologic borehole. The data were then processed using the Golden Software Surfer 13.0. Based on the coordinates and the WYPI values of the grid cell, the WYPI contour map was established using the kriging function interpolation technique, as shown in Fig. 4. The water yield property of the Ordovician top aquifer as expressed by the WYPI value ranged in the study area from 0.014 to 0.766. Based on the geometric mean method of the seven pumping tests’ WYPI values, the area was divided into three water yield property zones (Fig. 4):

  1. (a)

    Poor water yield property zone: WYPI < 0.232.

  2. (b)

    Medium water yield property zone: 0.232 ≤ WYPI < 0.332.

  3. (c)

    Rich water yield property zone: WYPI ≥ 0.332.

Fig. 4
figure 4

Water yield property zone map of the Ordovician top aquifer based on the WYPI

As shown in Fig. 4, in the Ordovician top aquifer, poor water yield property areas are mainly distributed in the southwestern and northeastern areas of the study area; the medium water yield property areas are mainly distributed in the central and southwestern areas of the study area; and the rich water yield property areas are mainly distributed in the north-central, south-central and southwestern parts of the study area.

Validating the prediction of the WYPI model

After obtaining the prediction result, the next step was validation, which was key to verifying the results. Therefore, validation tests were carried out to verify the accuracy of the water yield property zone prediction based on WYPI for the study area by processing practical engineering data, including pumping tests, water inrushes, water source wells, and leakage characteristics of the Ordovician top aquifer.

Analysis of the pumping tests

There were seven pumping tests for the Ordovician top aquifer, listed in Fig. 4. Based on the value of q, the water yield properties of no. l-12 and no. l-618 hydrologic boreholes were classified as poor; nos. z-9, z-350, and z-55 hydrologic boreholes were classified as medium; and nos. x-301 and l-45 were classified as rich (Table 1). As shown in Fig. 4, the prediction results based on WYPI are consistent with the data for these seven pumping tests.

Figure 5 involves the scatter plot of WYPI and q. This figure shows that the R2 was 0.9633, indicating a high degree of fitting between WYPI and q.

Fig. 5
figure 5

Scatterplot of q versus WYPI

Distribution of water inrushes and water source wells

As shown in Fig. 4, no. 2 water inrush, whose water inflow was as high as 1920 m3/h, lies in the rich water yield property zone based on the WYPI. The nos. 1 and 3 water inrushes, whose water inflows were as high as 772.2 m3/h and 78 m3/h, respectively, lie in the medium water yield property zone based on the WYPI.

Located in the southwestern part of the Liangzhuang coalmine, two water source wells (Chengjialou-1 and Chengjialou-2), whose supply is from the Ordovician top aquifer, are the main source of this coalmine. These two water source wells with a 2400 m3/day water supply are located in the rich water yield property zone based on the WYPI (Fig. 4). The Taocichang water source well is also the Ordovician top aquifer water source well, located south of the Xiezhuang coalmine. This water source well with a 1250 m3/day water supply is located in the medium water yield property zone based on the WYPI (Fig. 4).

Therefore, the distributions of the three water inrushes and three water source wells are in agreement with the WYPI prediction.

Leakage characteristic of the Ordovician top aquifer

There were 67 surface boreholes that revealed the Ordovician limestone aquifer, of which 49 boreholes were not taken into consideration because their thicknesses were small. Six of the remaining 18 boreholes, located in the medium or rich water yield property zone based on the WYPI, had leakage issues (Fig. 4). The remaining 12 boreholes had no leakage, and 11 of these were located in the poor water yield property zone based on the WYPI. However, one borehole with no leakage was located in the rich water yield property zone based on the WYPI. Data from 17 boreholes were consistent with the WYPI prediction, and data from one borehole was not consistent with the WYPI prediction (Table 5). Therefore, the accuracy of the WYPI model was 94.4% based on these 18 boreholes.

Table 5 Water leakage of Ordovician top aquifer in different boreholes

Based on the analysis of the distribution of three water inrushes and three water source wells and the leakage characteristics of the 18 surface boreholes, only one prediction result from the WYPI was not consistent with practical engineering data, so the prediction accuracy based on the WYPI was 95.8%. A WYPI prediction is considered to be accurate and reliable and can be used to guide the process of dewatering or other water control measures.

Comparison of the predictions by the WYPI model, q, and the linear regression model

Comparison of the predictions by the WYPI and q

The specific field q, obtained by a pumping test, is traditionally and widely used to evaluate the water yield property of an aquifer in China based on regulations for mine water prevention and control (State Administration of Work Safety and State Administration of Coal Mine Safety 2009). Figure 2 shows the water yield property zone of the Ordovician top aquifer based on q. As shown in Fig. 2, nos. 1, 2, and 3 water inrushes are located in the poor water yield property zone, which is not consistent with practical engineering; three water source wells are located in the rich water yield property zone, which is consistent with practical engineering; nine surface boreholes without leakage occur in the rich or medium water yield property zones; and three surface boreholes with leakage are located in the poor water yield property zone, which is not consistent with practical engineering. Based on these data, the prediction accuracy based on q is only 37.5%. The prediction accuracy based on the WYPI is 95.8%, indicating that the WYPI performed better than q.

Comparison of the predictions by the WYPI and the linear regression model

Another simple method is a linear regression model of the water yield property, which was solved by the least square method based on the normalized data in Table 6, as shown in the following:

$${\text{LR}}\, = \,a_{{1}} \, \times \,{\text{WIB}}\, + \,a_{{2}} \, \times \,{\text{FIF}}\, + \,a_{{3}} \, \times \,{\text{DFC}}\, + \,a_{{4}} \, \times \,{\text{BD}}\, + \,e = \,\,0.{3418}\, \times \,{\text{WIB}}\, + \,{3}.{3114}\, \times \,{\text{FIF}}\, + \,{9}.{9369}\, \times \,{\text{DFC}} - {6}.{7187}\, \times \,BD - 0.{2537,}$$
(18)

where LR is the water yield property calculated by the linear regression model; a1, a2, a3, and a4 are the weights of WIB, FIF, DFC, and BD, respectively; and e is the error.

Table 6 Normalized data of q and the evaluation factors in seven hydrologic boreholes and the prediction results by least squares

Table 6 shows the prediction results of LR. Figure 6 is the scatter plot of LR and q. Figure 7 is the water yield property zone map of the Ordovician top aquifer based on LR.

Fig. 6
figure 6

Scatterplot of q versus LR

Fig. 7
figure 7

Water yield property zone map of the Ordovician top aquifer based on LR

From the prediction results (Table 6, Figs. 5, 6), the following can be concluded:

a. The correlation coefficient R2 is 0.9943 (Fig. 6), indicating that the linear fit of the model is good. However, we can see that the prediction value of the no. 2 sample in Table 6 is negative, and this situation is widespread in the prediction results (Fig. 7). A total of 89 of 216 samples were predicted to be negative, ranging from − 0.571 to − 0.005. The predictions do not match actual values.

b. Apart from the above-noted problem, validation tests were carried out to verify the accuracy of the linear regression model by three water inrushes, three water source wells, and the leakage characteristics of 18 surface boreholes.

As shown in Fig. 7, there are three water inrushes and three water source wells, of which the no. 3 water inrush lies in the poor water yield property zone. The prediction accuracy of the linear regression model is 83.3% based on the distributions of the water inrushes and water source wells. Four surface boreholes without leakage are in the rich or medium water yield property zones, and two surface boreholes with leakage are located in the poor water yield property zone. The prediction accuracy of the linear regression model is 66.7% based on these 18 surface boreholes. Therefore, the prediction accuracy based on LR is 70.8% given all the test data. The prediction accuracy based on the WYPI is 95.8%, indicating that the WYPI is more accurate and realistic than LR.

Discussion

Predicting the water yield property of the Ordovician top aquifer is a challenging and difficult task because of deep burial depths and complex geo-circumstances. The water yield property of the Ordovician top aquifer mainly depends on three aspects: the storage space of water, the quantity of water in the storage space, and the water supply. Four factors, WIB, FIF, DFC, and BD, were identified, spatially interpolated, plotted with Golden Software Surfer 13.0, and analyzed. The weights of the four factors were 0.262, 0.227, 0.257, and 0.254, respectively.

WIB intuitively reflects the quantity of water in a storage space. Its weight was 0.262, indicating that WIB has a significant influence on the water yield property. In the south-central and north-central regions, the water inflow of the borehole is larger than that in other regions, and the WYPI is also larger in these regions.

The storage spaces of the Ordovician karst water are mainly cracks and dissolved fissures. FIF indicates the development degree of the cracks and fissures partly, but some of the fissures are filled at a late stage. DFC was adopted to reflect the connectivity of the fissures. Therefore, these two factors comprehensively reflect the development and connectivity of the fissure. Although the weight of the FIF or DFC is not large, the sum of the two factors’ weights is 0.484, which indicates that the development and connectivity of the storage space of water have the greatest influence on the water yield property. The WYPI map shows that faults have a great influence on the water yield property of the Ordovician top aquifer. In the southern part of the study, faults mainly strike to the NE, extending to the south mountain, where the Ordovician limestone strata are exposed to the surface. The distribution direction of the rich water yield property zones based on the WYPI is also the NE direction. The strike of the faults in the deep areas is mainly in the NW direction, which is similar to the distribution direction of the water yield property zone of the Ordovician top aquifer as well. As shown in Fig. 4, the distribution of the Ordovician top aquifer water yield property is very inhomogeneous. Rich water yield property zones are mainly located in the northern and central-eastern part of the study area where faults are developed or intersected densely. Poor water yield property zones are mainly located in regions where faults are not developed.

The weight of BD is 0.254 indicating that water supply has relatively little effect on the water yield property of the Ordovician top aquifer. Storage space has the greatest influence on the water yield property. Controlled by faults, the underground water flows from south of the mountain to the deep part of the study area along the faults.

Based on the WYPI method, a water yield property evaluation of the Ordovician top aquifer was achieved. Practical engineering data, including from seven pumping tests, three water inrushes, three water source wells, and leakage characteristics of 18 surface boreholes, successfully validated the WYPI model. The proposed model was established based on multi-influencing factors of the water yield property that performed better in predicting the water yield property zone of the Ordovician top aquifer than the q and LR methods. The prediction results of the WYPI depend on the abundance of geological and hydrogeological exploration data and will change with the quantity and distribution of all the influential factors. With additional data, the quality of the WYPI prediction will be further improved.

Conclusions

It is essential to assess the water yield property of the Ordovician top aquifer to manage and ensure safe mining in deep coal seams. Due to a minimal number of pumping tests for the Ordovician top aquifer, accurately predicting the water yield property zone was a challenging task. A WYPI model was applied to predict the water yield property zone of the Ordovician top aquifer in the Xinwen coalfield, in which four factors, including water inflow of borehole, fault impact factor, drilling fluid consumption and burial depth of the Ordovician top aquifer, were used. The GRA and FAHP methods were combined to determine the weights of each factor. The weights of the four factors, WIB, FIF, DFC, and BD, were 0.262, 0.227, 0.257, and 0.254, respectively. The development and connectivity of the storage space of water (FIF and DFC) had the greatest influence on the water yield property. WIB also had a significant influence on the water yield property, followed by BD.

A water yield property zone map was built based on the WYPI using the kriging function interpolation technique. The water yield property zones were divided into three zones: poor (WYPI < 0.232), medium (0.232 ≤ WYPI < 0.332), and rich (WYPI ≥ 0.332). Practical engineering data, including the water inrushes, water source wells, and leakage characteristics of the Ordovician top aquifer, were used to verify the accuracy of the water yield property zone prediction of the WYPI. The prediction accuracy of the WYPI was 95.8%, while the prediction accuracy of the q and LR methods were 37.5% and 70.8%, respectively. Compared to the prediction results of the q and LR methods, the prediction result of the WYPI model is more consistent with the practical engineering data, which provided guidance on dewatering or other water control measures.

The case study of the Xinwen coalfield shows that the water yield property of the Ordovician top aquifer was predicted with good precision based on the new comprehensive method. The new method can be applied to predict the water yield property of the Ordovician top aquifer for other underground mines in North China where pumping tests are lacking. Notably, the specific relevant factors should be derived from datasets collected in a particular coal mine, because the geological and hydrological conditions are very different in different regions, and then, the method used in this study can be used to establish a similar model for that specific coal mine.