Introduction

The rapid growth in economic benefits along with the increased capacity to extract mineral resource around the globe has contributed to damaging the geo-environment (Monjezi et al. 2009; Mayes et al. 2009; Jordan and Project 2009; Schellenbach and Krekeler 2012; Huang et al. 2012; Howladar 2013). For example, mining and related industries have long operated in the Changjitu economic zone (CEZ) of Jilin Province, Northeast China. However, while the long-term exploitation of mineral resource has created enormous economic benefits, it has also caused serious environmental problems such as geological disasters, resource damage, and environmental pollution that cannot be ignored (Das 1999; Jiang et al. 2014). Geological disasters caused by underground mining include ground subsidence, ground fissures, and the inrush of mine water, while open pit mining has led to collapses, landslides, and debris flows (Xu et al. 2006; Liu et al. 2008). Further, mining activities have damaged land/water resources and the local landscape (Brown 2005), while the widespread discharging of mining waste residues, wastewater, and waste gas represents a major source of pollution that threatens the health of residents (Fessler et al. 2003; Ghose 2003; Mars and Crowley 2003; Pierce et al. 2004; Ryser et al. 2005; Mendez et al. 2006; Krekeler et al. 2008, 2010; Geise et al. 2011; Schellenbach and Krekeler 2012).

The evaluation of mining geo-environmental quality (MGEQ) has important theoretical and practical value based on the comparative analysis of the major geological problems caused by different types of mineral development, different regions, and different mining approaches in the CEZ. It can provide not only a scientific basis for restoring the geological environment and rationally exploiting mineral resources, but also geo-environmental conditions for the suitable distribution of economic and social benefits in the CEZ, which in turn promotes the coordinated development of the economic, social, and geological environments (Li 2013).

In the past few years, numerous authors have researched the selection of evaluation factors, establishment of evaluation indicators, and choice of evaluation methods (Mejía-Navarro et al. 1994; Sarkar et al. 2007; Turer et al. 2008; Huang et al. 2012). For example, in China, Cai et al. (1998) used the analytic hierarchy process (AHP) to determine the weights of evaluation factors (Cai et al. 1998), while Liao and Wu (2004) applied a fuzzy mathematics method to evaluate the mining environment (Liao and Wu 2004). Xu et al. (2003, 2006) classified mining geo-environmental problems in Northwest China by applying the multi-level fuzzy synthetic evaluation, gray situation assessment, and comprehensive evaluation methods to assess MGEQ (Xu et al. 2003, 2006). Further, Ma (2013) determined an MGEQ evaluation system based on five aspects, namely natural geography, basic geology, mining development area, mining-related geological disasters, and mining environment, and selected AHP combined with a fuzzy mathematics method to evaluate MGEQ in two mining areas (Ma 2013).

In the UK, Legg (1990) used remote sensing technology to qualitatively evaluate the environmental and land reclamation problems caused by surface mining (Legg 1990). In the mid-1970s, the famous operational research expert Saaty (1980) proposed the AHP (Saaty 1980), which combines quantitative and qualitative analysis, to calculate the weights of the elements in a pairwise manner and ultimately evaluate MGEQ. Similarly, Zadeh (1965) created a fuzzy evaluation method based on the maximum membership and fuzzy transform principles to evaluate MGEQ (Zadeh 1965). In addition, various researchers have examined a number of other methods for evaluating MGEQ such as graphics overlay, scoring superposition, comprehensive index method (CIM), and Delphi methods (Ma 2013).

While significant achievements and progress in mining geo-environmental assessment have been made by previous studies, some questions remain to be resolved. First, the evaluation results presented thus far are not objective enough, because the selection of the evaluation factors has not been sufficiently comprehensive (Liang 2012). Second, previous works have focused on the qualitative research of mining geo-environments and there has been a lack of quantitative studies (Wang and Chen 2011). Third, unreasonable comprehensive evaluation methods with subjective factors and complex weight calculations have led to the evaluation results lacking comparability. Finally, the complex mapping between the evaluation factors and MGEQ has not been processed well. Therefore, it is of upmost importance to formulate a scientific, comprehensive, reliable, and practical quantitative evaluation system as well as a multi-index, multi-level evaluation model that can handle the complex mapping between the evaluation factors and MGEQ.

The present study establishes such a quantitative evaluation system for the CEZ based on the approaches of previous researchers and the characteristics of the mining geo-environment. Based on the CIM, which was developed to evaluate MGEQ, we propose a new model termed the support vector machine (SVM) model. Objective weighting methods (e.g., the variation coefficient, entropy, and Kantiray assignment methods) and subjective weighting methods (e.g., the AHP) are combined to determine the integrated weights of the evaluation indicators. The proposed SVM model is capable of mapping out the relationship between input and output variables by using a kernel function in a high-dimensional space (Liang 2009) as well as treating pattern recognition. SVM has been widely used in the fields of human face recognition, signal processing, the prediction of rainfall runoff (Dibike et al. 2001), water quality assessment, text classification, remote sensing image analysis (Vapnik 1995; Yoon et al. 2011), stream flow or stage (Liong and Sivapragasam 2002; Asefa et al. 2006; Yu et al. 2006), and lake water level assessment (Khalil et al. 2006; Khan and Coulibaly 2006). However, it has not thus far been applied in the field of MGEQ evaluation. Consequently, the present study applies the SVM model to assess MGEQ and explores the modeling process to evaluate its effects.

Specifically, the aims of the present study are fourfold: (1) to rationally construct a more holistic and scientific evaluation framework; (2) to compare the advantages and disadvantages of the CIM and SVM models for the MGEQ evaluation in the study area, where the geo-environment has been damaged over the past decade because of mining activities; (3) to provide some guidance and be of practical importance for the popularization and application of SVM technology in the field of MGEQ assessment; and (4) to provide reliable data and a basis for the reasonable exploitation and sustainable utilization of mineral resource in the CEZ.

This research is novel in the following three ways. First, the evaluation framework constructed in this study is original. Second, we apply the SVM model to assess MGEQ and obtain good evaluation results. Third, the variation coefficient, entropy, and Kantiray weighting methods as well as the AHP are combined to determine the optimal weights, which not only reflect expert knowledge and opinions, but also use objective information.

Study area

The study area is located in the east of Jilin Province, Northeast China, which includes the cities of Changchun, Dehui, Jiutai, Jilin, and Jiaohe, the counties of Nongan and Yongji, and the Yanbian Korean Autonomous Prefecture. It lies between the geographic coordinates of 42°00′20″N and 44°56′24″N latitude, and 124°32′24″E and 131°18′36″E longitude (Fig. 1), with an area of 73,000 km2. The study area is characterized by a temperate continental monsoon climate. Average annual temperature is 2–6 °C and average annual rainfall is 600–1,400 mm, with 80 % or more of the rainfall occurring between June and September. Mining has become a pillar industry of economic development in the Changjitu region, but it also causes complex and serious mining-related geological problems, which can be divided into three categories: mining-related geological disasters, resource damage, and environmental pollution.

Fig. 1
figure 1

Location of the study area

Materials and methods

MGEQ evaluation system

The MGEQ evaluation system is a complex system involving many factors. Mining, dressing, smelting, and other types of engineering activities can induce and aggravate various types of geological problems at different levels, while geological background and natural conditions are additional factors that may lead to mining-related geological problems. Simultaneously, the difficulty of mining geo-environmental recovery and management is a key factor for MGEQ evaluation (Liu et al. 2008).

The MGEQ evaluation system proposed herein was established based on the comparative analysis of the major mining-related geological problems in the CEZ. The evaluation framework considers five criteria, namely basic mining information, the geo-environmental background conditions of the mining area, mining-related geological problems, the importance of the evaluation area, and the difficulty of mining geo-environmental recovery, along with 15 indicators such as production scale, mining methods, topography, geological conditions, and mining-related geological disasters (Fig. 2).

Fig. 2
figure 2

The MGEQ evaluation system

The determination of geological environment elements and indicator weights

Objective weighting methods, such as the variation coefficient, entropy, and Kantiray weighting methods, and subjective weighting methods such as the AHP were used to determine the comprehensive weights of the elements and indicators. The comprehensive weights not only include expert knowledge and opinions, but also use objective information. Therefore, this method reflects the diversity of the practical data, overcoming the shortcomings of using either objective or subjective weights, making subjective and objective weights consistent, and leading to evaluation results that are sound and scientific (Zhu 2012).

Variation coefficient method

The variation coefficient evaluation method can be used when each factor is relatively independent. Suppose there are n participating samples, with each sample described by p indicators (\( X_{1} \;,X_{2} \;, \cdots ,X_{P} \)). The mean (\( \overline{{X_{i} }} \)) and variance (\( S_{i}^{2} \)) are calculated by using Eqs. (1) and (2):

$$ \overline{{X_{i} }} = \frac{1}{n}\sum\limits_{j = 1}^{p} {X_{ji} } $$
(1)
$$ S_{i}^{2} = \frac{1}{n - 1}\sum\limits_{j = 1}^{p} {(X_{ji} - \overline{{X_{i} }} )}^{2} . $$
(2)

The variation coefficient of each indicator is

$$ V_{i} = {{S_{i} } \mathord{\left/ {\vphantom {{S_{i} } {\overline{{X_{i} }} }}} \right. \kern-0pt} {\overline{{X_{i} }} }}\;\;\;\;i = 1\;,2\;, \ldots ,\;p. $$
(3)

The weight of each indicator \( W_{i} \) is obtained after normalizing \( V_{i} \):

$$ W_{i} = {{V_{i} } \mathord{\left/ {\vphantom {{V_{i} } {\sum\limits_{j = 1}^{p} {V_{j} } }}} \right. \kern-0pt} {\sum\limits_{j = 1}^{p} {V_{j} } }}\;\;\;\;j = 1\;,2\;, \ldots ,p. $$
(4)

Entropy method

The entropy method determines its weights based on the amount of information that the various indicators transmit to policymakers. The greater the difference in the evaluation indicators, the smaller is the entropy, while the weights scale is proportional to the information content of the indicators. The detailed steps are as follows:

Step 1. Normalize the value of the indicators:

$$ a_{ij} = {{X_{ij} } \mathord{\left/ {\vphantom {{X_{ij} } {\sum\limits_{i = 1}^{n} {X_{ij} \;\;\;\;\;\;i = 1,\;2\;, \ldots ,\;n\;\;\;\;j = } }}} \right. \kern-0pt} {\sum\limits_{i = 1}^{n} {X_{ij} \;\;\;\;\;\;i = 1,\;2\;, \ldots ,\;n\;\;\;\;j = } }}1,\;2\;, \ldots ,\;m. $$
(5)

Step 2. Calculate the entropy of the evaluation indicators:

$$ H = - K\sum\limits_{i = 1}^{n} {a_{ij} \ln a_{ij} \;\;\;\;(K = {1 \mathord{\left/ {\vphantom {1 {\ln n}}} \right. \kern-0pt} {\ln n}})} . $$
(6)

Step 3. Transfer the entropy into the weight:

$$ W_{i} = {{(1 - H_{i} )} \mathord{\left/ {\vphantom {{(1 - H_{i} )} {(m - \sum\limits_{j = 1}^{m} {H_{j} } )}}} \right. \kern-0pt} {(m - \sum\limits_{j = 1}^{m} {H_{j} } )}}. $$
(7)

Although the entropy method requires a certain number of sample units, it has a strong mathematical theory and provides a close relationship between the entropy and indicator value itself (Ma 2009).

Kantiray weighting method

This approach was proposed by Kantiray (1989). The weight of each indicator can be derived using the following equation:

$$ (RS - \lambda I)W = 0, $$
(8)

where \( R \) is the correlation matrix of the original variables, \( S \) is the diagonal matrix of the standard deviation, and \( \lambda \) is the largest eigenvalue. \( W \) is the eigenvector that corresponds to the maximum eigenvalue, which is affected by the standard deviation and correlation coefficient. Therefore, the matrix \( RS \) contains both the interaction information between the indicators in the original data and the degree of indicator variation (Kantiray 1989; Ma 2009).

AHP

The AHP has been proven to be an effective decision analysis method in multiple-criteria assessment. The present study uses the AHP to determine the subjective weights of the elements and indicators. First, a judgment matrix is used and then the weights of the indicators are obtained after calculating the eigenvalue and eigenvectors. Next, the normalized eigenvector that corresponds to the maximum eigenvalue-passed consistency check is taken as the weight of each index factor (Hao 2011). The equations describing the above-mentioned procedures are as follows:

$$ AW = \lambda_{ \hbox{max} } W, $$
(9)
$$ {\text{CI}} = \frac{{\lambda_{\hbox{max} } - N}}{N - 1}, $$
(10)
$$ {\text{CR = }}\frac{\text{CI}}{\text{RI}}, $$
(11)

where \( W \) is the eigenvector as well as the weight vector, \( \lambda_{\hbox{max} } \) is the maximum eigenvalue of matrix A, \( {\text{CI}} \) is the consistency index, \( N \) is the order of the judgment matrix, \( {\text{RI}} \) is the average random index of the order of the matrix, and \( {\text{CR}} \) is the consistency ratio. \( {\text{CR}} \) should be lower than 0.1 to consider the eigenvector \( W \) to be an acceptable weight. Otherwise, the comparison and calculation should be redone.

Comprehensive weight method

First, an appropriate objective and subjective weighting method is chosen based on the samples and data and then comprehensive weights are calculated according to a certain ratio of the column (Wang 2013). The equation is written as follows:

$$ w^{ * } = \alpha w + (1 - \alpha )\phi , $$
(12)

where \( w^{ * } \) is the comprehensive weight, \( w \) is the subjective weight, \( \phi \) is the objective weight, and \( \alpha \) is the subjective preference factor. In this study, the value of \( \alpha \) is set as an empirical value of 0.5, while the comprehensive weight is the average of the subjective and objective weights.

Proposed evaluation model

The developmental characteristics, distribution, and extent of damage of mining-related geological problems are analyzed comprehensively based on the investigation of typical mine fields and data collection in the CEZ. Specifically, the CIM and SVM are used to evaluate MGEQ.

CIM

According to this method, we first determine the level of the mining geo-environment by using a weighted sum. This sum is calculated based on the status of the geological environment after assigning basic mining information, the geo-environmental background of the mining area, mining-related geological problems, the importance of the evaluation area, and the difficulty of mining geo-environmental recovery as well as the other 15 indicators. The grading standards for the MGEQ evaluation criteria are given in Table 1.The mathematical formulation is as follows (Liu et al. 2008):

$$ F = \sum\limits_{i = 1}^{n} {F_{\text{i}} W_{\text{i}} } , $$
(13)

where \( F_{\text{i}} \) is the individual score for each indicator and \( W_{\text{i}} \) is the comprehensive weight of each indicator.

Table 1 Indicator scores and ratings of the MGEQ evaluation criteria

Mines that have a comprehensive quality index greater than or equal to 6.0 are classified into level III, between 3.32 and 6.0 level II, and less than 3.32 level I. The influence degree corresponding to each class is shown in Table 2.

Table 2 The influence degree of each class

SVM model

(1) Theory of SVM

SVM (Vapnik 1995) is an emerging machine learning technology that has been extensively used as a classification tool. The theory is based on the structural risk minimization principle (Liang et al. 2011; Yoon et al. 2011). SVM attempts to find the optimal separating hyperplane between classes by maximizing the class margin (Harris 2013). SVM can be divided into linear and nonlinear models, with the former a special case of the latter. Therefore, this article only describes the nonlinear SVM.

Nonlinear problems can be transformed into linear problems by using a kernel function and finding the optimal separating hyperplane in the transform space. Because the kernel function that corresponds to an inner product function is \( K(x_{i} ,x_{j} ){ = }\psi (x_{i} )\; \times \;\psi (x_{j} ) \), \( K(x_{i} ,x_{j} ) \) can avoid this so-called “dimension disaster,” which primarily relates to the large number of possible nonlinear mappings and to the computational complexity associated with any high-dimensional space (Baly and Hajj 2012).

The method can be described as follows. Input vector \( x \) is first mapped to a high-dimensional feature space by using the nonlinear mapping pre-selected \( \phi \) and then finding the optimal separating hyperplane in the high-dimensional space (Keerthi et al. 2000; Cristianini and Shawe-Taylor 2000; Sun et al. 2009; Wu and Wang 2009). The sample set is \( (x_{i} ,y_{i} ),\;i = 1,2, \ldots ,n \); \( y = \{ 1,\; - 1\} \) are class labels. The hyperplane can thus be expressed by the following equation:

$$ \omega \; \times \;x_{i} + b = 0. $$
(14)

Next, we add a slack variable to the constraints \( \xi_{i} \ge 0 \), so that the maximum interval hyperplane is now called the generalized optimal separating hyperplane (Leng et al. 2007; Baly and Hajj 2012; Harris 2013). The constraint becomes

$$ {\text{s}} . {\text{t}}{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} y_{i} [(\omega \; \times \;x_{i} ) + b) \ge 1 - \xi_{i} . $$
(15)

The optimization problem is written as

$$\begin{aligned} & \hbox{min} \frac{1}{2}(\omega \; \times \;\omega ) + C\sum\limits_{i = 1}^{n} {\xi_{i} } \hfill \\ &{\text{s}} . {\text{t}} y_{i} [(\omega \; \times \;x_{i} ) + b) \ge 1 - \xi \quad i = 1,2, \ldots ,n, \hfill \\ \end{aligned} $$
(16)

where \( \omega \) is the weight vector, \( b \) is the bias, and \( \xi_{i} \) is the slack variable. In addition, \( C > 0 \) is the penalty factor, where the larger the \( C \), the greater is the penalty. To develop the dual form of the problem, we introduce the Lagrange multipliers \( \alpha ,\beta \)

$$ \begin{gathered} L(\omega ,\xi ,b,\alpha ,\beta ) = \frac{1}{2}(\omega \; \times \;\omega ) + C\sum\limits_{i = 1}^{n} {\xi_{i} } \; - \hfill \\ \;\sum\limits_{i = 1}^{n} {\alpha_{i} } [y_{i} (\omega \; \times \;x_{i} + b) - 1 + \xi_{i} ]\; - \sum\limits_{i = 1}^{n} {\beta_{i} \xi_{i} } . \hfill \\ \end{gathered} $$
(17)

Equation (17) is then minimized with respect to \( \omega \), \( \xi \), and \( b \) by taking the partial derivatives with respect to \( \omega \), \( \xi \), and \( b \) and setting them to zero. Hence, the dual problem is as follows:

$$ \begin{gathered} \mathop {\hbox{max} }\limits_{\alpha ,\beta } Q(\alpha ) = L(\omega ,\xi ,b,\alpha ,\beta ) = \hfill \\ \sum\limits_{i = 1}^{n} {\alpha_{i} - \frac{1}{2}\sum\limits_{i = 1}^{n} {\sum\limits_{j = 1}^{n} {\alpha_{i} \alpha_{j} y_{i} y_{j} (\psi (x_{i} )\; \times \;\psi (x_{j} ))} } } \hfill \\ {\kern 1pt} {\kern 1pt} = \sum\limits_{i = 1}^{n} {\alpha_{i} - \frac{1}{2}\sum\limits_{i = 1}^{n} {\sum\limits_{j = 1}^{n} {\alpha_{i} \alpha_{j} y_{i} y_{j} K(x_{i} ,x_{j} )} } } \hfill \\ {\kern 1pt} {\text{s}} . {\text{t}}{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} \sum\limits_{i = 1}^{n} {\alpha_{i} y_{i} } = 0,\alpha_{i} \ge 0. \hfill \\ \end{gathered} $$
(18)

The optimal judgment function is

$$ f(x) = \text{sgn} (\sum\limits_{i = 1}^{n} {\alpha_{i}^{*} y_{i} K(x,x_{i} )} + b). $$
(19)

Many types of kernels have been used to establish the SVM model, such as the linear, polynomial, Gaussian radial basis function (RBF), and sigmoid kernels (Chen et al. 2013). The respective equations are listed as follows:

Linear kernel,

$$ K(x,\;x_{i} ) = (x\; \times \;x_{i} ). $$
(20)

Polynomial kernel,

$$ K(x,x_{i} ) = (g(x\; \times \;x_{i} ) + r)^{d} . $$
(21)

RBF kernel,

$$ K(x,x_{i} ) = \exp \left( { - g\left\| {x - x_{i} } \right\|^{2} } \right). $$
(22)

Sigmoid kernel,

$$ K(x,x_{i} ) = \tanh (g(x\; \times \;x_{i} ) + r). $$
(23)

(2) Construction, application, and validation of the SVM model

According to the classification standard of MGEQ in this study, random training samples were generated by using the rand function in MATLAB. There are three levels of MGEQ, and each of the elements can be evaluated by the following standard levels: less than or equal to 2.0 is level I, between 2.0 and 6.0 is level II, and between 6.0 and 10.0 is level III. We randomly generated 200 pairs of training samples under these three levels (i.e., 600 pairs of training samples), while the test samples were 116 groups of data on the important mines in the CEZ.

The SVM model was built as follows:

Step 1. Normalize the data (Eq. 24)

$$ \tilde{X}_{i} = \frac{{X_{i} - \hbox{min} (X_{i} )}}{{\hbox{max} (X_{i} ) - \hbox{min} (X_{i} )}}. $$
(24)

Step 2. Determine the structure of the SVM model. In this study, basic information (X 1), geological background (X 2), mining-related geological problems (X 3), the importance of the evaluation area (X 4), and the difficulty of mining geo-environmental recovery (X 5) are the input variables, with the level of MGEQ as the output variable.

Step 3. Determine the parameters. The present study uses the C-support vector classification model, while the penalty parameter C and kernel parameter g need to be set. To optimize the two SVM parameters (C and g) simultaneously, a cross-validation parameter optimization program was performed. First, training data were separated into several folds. Sequentially, a fold is considered to be the validation set and the rest are used for training. The average accuracy for predicting the validation sets represents the cross-validation accuracy. We provided a possible interval of log2 C (or log2 g) with the grid space (−10, 10). Then, all the grid points of (C; g) were examined to assess which one provided the highest cross-validation accuracy. After training, the optimal penalty parameter C and optimal kernel parameter g were obtained with values of 1,024, and 64, respectively. Then, we used the best parameter to train the whole training set and generated the final model.

Step 4. Determine the kernel function and establish the evaluation model. By using the MATLAB program to train the random learning samples, the optimal penalty and optimal kernel parameters as well as the linear, polynomial, Gaussian RBF, and sigmoid kernels were developed to establish the SVM model to evaluate MGEQ. The 116 groups of test samples were then placed into the evaluation model to obtain the MGEQ evaluation results, and the evaluation results were compared by using these different kernel functions.

Results and discussions

Results of the MGEQ evaluation based on the CIM

Because each evaluation indicator is relatively independent, the Kantiray weighting method is unsuitable in this case; therefore, the variation coefficient and entropy methods were applied as the objective weighting methods and the AHP as the subjective weighting method. The five evaluation elements of the MGEQ were correlated in some areas, but also showed certain differences. The Kantiray weighting method was more appropriate than the variation coefficient method. Therefore, we selected the Kantiray and entropy weighting methods as the objective weighting methods and the AHP as the subjective weighting method to determine the comprehensive weights of the five elements. The weights of the elements and indicators are listed in Table 3. We finally obtained the evaluation results of CIM using Eq. 13.

Table 3 The comprehensive weights of the elements and indicators

The vector represented the relative importance of the assessment criteria for the geological environment: A3 > A5 > A1 > A4 > A2. The assessment for the indicators under level A1 (i.e., basic information) was B2 > B1, that for the indices under level A2 (i.e., geological background) was of the order of B3 > B4 > B5 > B6, and that under level A3 (i.e., mining-related geological problems) was of the order of B7 > B9 > B8. In the present study, we thus determined the optimal weights that not only reflect expert knowledge and opinions, but also use objective information.

Results of the MGEQ evaluation based on the SVM model

The accuracy of the four SVM models with their different kernel functions is illustrated in Fig. 3. The selection of kernel function was significant for pattern recognition. Figure 3 also shows that the accuracy of the linear, polynomial, RBF, and sigmoid SVM models was 93.10, 92.24, 52.59, and 0 %, respectively, illustrating that the linear SVM model is slightly more accurate than the polynomial SVM model, with both well ahead of the RBF and sigmoid SVM models. Therefore, this finding proves that the most appropriate method for evaluating the mining geo-environment in the study area is the linear SVM.

Fig. 3
figure 3

The accuracy of linear SVM, polynomial SVM, RBF SVM, and sigmoid SVM

Discussion

Conventional evaluation methods such as the CIM, fuzzy mathematics method, and gray clustering method cannot address the complex relationships between the evaluation elements and MGEQ, and the evaluation results are greatly affected by subjective factors (Liao and Wu 2004; Huang et al. 2012). Compared with conventional evaluation methods, SVM is a small-sample machine learning method based on statistical learning theory that uses the structural risk minimization principle. This approach allows generalization and can map the evaluation elements and MGEQ using a kernel function in a high-dimensional space. Hence, it can overcome the shortcoming of conventional methods when evaluating MGEQ as well as the defects of slow training speed, poor network generalization, and low learning accuracy in artificial neural networks (Liao et al. 2012). In addition, the SVM model calculates weights automatically and the evaluation results are comparable. Therefore, as an important pattern recognition method, SVM is appropriate for MGEQ evaluation, which is a typical pattern recognition issue.

According to the CIM and SVM evaluation results presented herein, Figs. 4 and 5 show that the majority of the MGEQ in the study area ranks as level II, which corresponds to moderately affected, accounting for 75.86 % of total eligible mines and 52.6 % of those located in coal mining areas. By contrast, level III (seriously affected) accounts for 21.55 % (87.50 % in coal mining areas) and level I (lightly affected) 2.59 %, located in mineral water, zeolite, iron, and geothermal mines.

Fig. 4
figure 4

The levels of the mining geo-environment in the CEZ

Fig. 5
figure 5

The statistical results of the MGEQ evaluation

In summary, these results indicate that most mining geo-environments in the region are moderately or seriously affected by mining activities, especially coal mining areas (Fig. 5). The long-term exploitation of coal in the study area has created enormous mined-out areas, causing surface collapses and ground fissures. Indeed, there are more than 35 collapsed areas of different sizes, the largest up to 2.346 km2. These surface collapses and ground fissures have damaged the mining geo-environment, threatening the safety of residents and destroying the local landscape. Therefore, related recovery and management measures should be implemented.

Conclusions

Based on the findings of domestic and foreign research and the characteristics of the mining geo-environment of the CEZ, a new MGEQ evaluation system was proposed in this study. The evaluation framework considered five criteria, namely basic mining information, the geo-environmental background conditions of the mining area, mining-related geological problems, the importance of the evaluation area, and the difficulty of mine geo-environmental recovery. The evaluation framework established in this study is thus a relatively comprehensive system for evaluating MGEQ in the CEZ.

CIM and SVM models based on different kernel functions were developed to evaluate MGEQ. The results showed that the accuracy of the linear, polynomial, RBF, and sigmoid SVM models was 93.10, 92.24, 52.59, and 0 %, respectively, confirming that the linear SVM is the most appropriate method for evaluating the mining geo-environment in the CEZ.

The study also demonstrated that the SVM model was appropriate for MGEQ assessment. Compared with existing MGEQ assessment methods, SVM adopts the structural risk minimization principle, which classifies quality into multiple groups. This research thus provides some guidance and is of practical importance for the popularization and application of SVM technology in the field of MGEQ assessment.

The evaluation results also showed that most of the MGEQ is classified into level II in the study area (75.86 % of total eligible mines), with levels I and III accounting for 2.59 and 21.55 %. This finding suggests that the mining geo-environment in the region is moderately affected by mining activities. The results also provide reliable data and a basis for the reasonable exploitation and sustainable utilization of mineral resource in the CEZ.