Introduction

Generally, the soil layer of karst areas in southwest China is thin with a surface-ground bilayer structure, so that it is easy for pollutants to enter into aquifers through weak overlying strata and sinkholes. Compared with other non-karst aquifers, the karst groundwater pollution is increasingly significant and, therefore, how to reasonably and effectively protect karst groundwater has been an urgent problem to solve (Kaçaroǧlu 1999; Shi et al. 1999; Tiwari et al. 2012; Zhang et al. 2017). The risk prediction of groundwater pollution is an important mean to prevent the groundwater pollution. Carrying out the risk identification on groundwater pollution not only can comprehensively explore the relationship between groundwater pollution and human social practices but also can timely determine the key areas of groundwater pollution, which provides a scientific basis for the management and protection of regional groundwater resources (Li et al. 2015, 2016; Masciopinto et al. 2017; Kourakos et al. 2012).

As an open system, the groundwater system shows a close and complex relation with the external factors and, therefore, the risk evaluation on groundwater pollution is a complex problem (Secunda and Collin 1998; Tiwari et al. 2015; Sangam et al. 2015). It is found through literature review that hydrogeologists predicted groundwater pollution risk using different methods from different perspectives according to actual conditions of various areas due to complexities of evaluation factors and limitations of research levels. For example, the United States Environmental Protection Agency put forward the DRASTIC evaluation model in 1987, which is mainly designed for investigating the vulnerability evaluation of pores and fractured aquifers (Aller et al. 1987); Martin and Abraham of Israel comprehensively considered the influences of crucial environmental factors and land use intensity in risk evaluation on groundwater pollution. The four basic environmental factors involve hydrology, landform, soil and vegetation (Martin and Abraham 2001); Sappa and Vitale of Italy has established a pollution impact map for different agrochemicals through water quality simulation mode base on geographic information system (GIS), and chose different key parameters to evaluate groundwater pollution risk (Sappa and Vitale 2001); The UK uses the GOD method designed by Foster and Skinner in the risk assessment of groundwater source pollution (G is the groundwater outlet, O is the overall lithology, D is the depth of the groundwater) (Foster and Skinner 1995); Liu et al. constructed a risk assessment index system for groundwater pollution including influencing factors such as the exposure of the evaluation subject, anti-pollution of unsaturated zone, aquifer vulnerability and so on. In addition, the weight of each evaluation index was determined by analytic hierarchy process (AHP) method (Liu et al. 2012). Current researches have realized significant achievements about risk prediction on non-karst groundwater pollution while it is difficult to establish a uniform evaluation index system owing to the motion laws of karst groundwater in different karst areas show great disparities. Therefore, the subjective (such as Delphi, AHP) and objective (such as entropy, grey relational analysis) weighting methods are generally employed to determine weights of factors in current risk prediction models of groundwater pollution. However, these methods show shortcomings: subjectivity is strong and weights cannot reflect the actual significance of factors. To solve the aforementioned problems, the study established a risk evaluation system for karst groundwater pollution applicable to karst areas based on systematically investigating the typical study area. The study also introduced artificial neural network (ANN) method with learning mechanism containing weights. On this basis, the risk of karst groundwater pollution in Huaxi district of Guiyang city, Guizhou province, China, was evaluated using the data processing and display functions of the GIS, attaining a favorable effect. The research can provide bases and references for similar researches.

General situation of the study area

Geographical location

Huaxi district is located in southwest of Guiyang city in Guzihou province within the geographic coordinate of 106°26′40″–106°41′56″ (east longitude) and 26°15′28″–26°31′03″ (north latitude). The overall research area is 359.79 km2. The geographic location is displayed in Fig. 1.

Fig. 1
figure 1

Geographic location

Meteorological and hydrological conditions

The study area is located in the central Guizhou plateau, which has the subtropical plateau monsoon moist climate in the northern hemisphere with various characteristics of plateau, humid and monsoon climates. According to statistical observation data of Huaxi meteorological station since 1956, the average precipitation for years, the annual maximum and minimum precipitations in the study area were 1099.4, 1450.9 (in 2000) and 765.6 mm (in 1986), respectively. The precipitation mainly takes place in the summer half year, accounting for about 77.6% of the total.

The study area is located in the upstream zone of Qingshui River, a major tributary in Wujiang River of Yangtze River basin, which mainly includes Chetian, Lengfan, Gan and Kailun rivers, with the total runoff in the length of 60.1 km and the overall area of 225.91 km2.

Topography

Located in the middle section of Miaoling Mountain, the study area gradually declines from southwest to northeast in terrain, which mainly appears as karst landform. The combination forms mainly include slopes, depressions, and clusters and valleys as well as eroded middle–low mountains and denudated hills with relative smaller areas.

Aquifer rock formations

The primary exposed strata in the study area from old to new are separately displayed as Permian, Triassic and Quaternary systems. According to the lithology of exposed strata and characteristics of aquifer media, the groundwater in the area is divided into carbonate karst water and pore water in loose Quaternary rocks, where the former can be further divided into fracture–cavern water, karst cave–fracture water, and dissolved pore–fissure water. The aquifer rock formations and the water-abundance characteristics are shown in Table 1.

Table 1 Water abundance of aquifer rock formations

Chemical properties of groundwater

The pH value of groundwater in the study area generally ranges from 7.2 to 8.04, appeared as neutral–weakly alkaline water. The total hardness (as CaCO3) is in the range of 286– 611 mg/L generally, shown as slightly–extremely hard water. As to the water quality, the groundwater in the study area mainly consists of various cations (such as Ca2+, Mg2+, K+ and Na+) and anions (including HCO3−, SO42− and Cl) in chemical compositions. The chemical types of groundwater are mainly shown as HCO3Ca, HCO3Ca·Mg and HCO3·SO4Ca.

The risk evaluation indexes of groundwater pollution

Compared with other groundwater systems, the karst groundwater system in Huaxi district shows significant characteristics: non-uniform aquifer media, diverse landforms, large topographic slope and poor contamination resistance of unsaturated zones. Consequently, the optimal evaluation effect cannot be acquired by simply applying current index systems. Thus, by fully investigating the characteristics of groundwater environment, the study established a risk evaluation index system for karst groundwater pollution in the study area from inherent and special factors of groundwater. The cause of the groundwater pollution is the conjunction of two kinds of factors, the first is the inherent factor which refers to the sensitivity to pollution of groundwater in a natural state and it is static, immutable and uncontrollable; the second is the special factor which refers to the sensitivity to specific pollutants or human activities, and it is related to human activities and is dynamic and can be controlled. Then each evaluation factor was marked with a certain score (1–10) in different ranges. The larger the score is, the higher the risk degree (Tables 2, 3). The details are displayed as follows:

Table 2 Division of karst development degrees
Table 3 Scoring criteria of inherent evaluation indexes of groundwater

Inherent evaluation factors of groundwater

(1) Types of aquifers

The types of aquifers control the runoff path and length of groundwater, determine the contact condition of pollutant-bearing groundwater and aquifers and influence the migration, adsorption, dilution and dispersion of pollutants in the groundwater. Generally, the larger the scales and numbers of fractures and karst caves contained in an aquifer are, the stronger the permeability, the weaker the filtering capacity for pollutants and the poorer the degradation and dilution degrees. According to the drilling and field investigation, the types of aquifers in the study area can be divided into carbonate, clastics and carbonate–clastic aquifers.

(2) Topographic slope

The topographic slope influences the rainfall infiltration. Under a low slope, water gently flows and plenty of pollutants are infiltrated owing to the rainfall has sufficient contact areas with the ground surface so as to pollute the groundwater. On the contrary, the rainfall flows out as surface runoff so that there is a small quantity of infiltrations of rainfall with pollutants under a large topographic slope. Majorities of the study area show gentle terrain with the slope of 1°–15° while local areas are steep with the slope of 36°–62°.

(3) Thickness of unsaturated zones

The thickness of unsaturated zones determines the path and time of pollutants from the ground surface to the aquifer as well as the degradation and adsorption degrees. In general, the thicker an unsaturated zone is, the longer and the more complex the infiltration path of pollutants. Additionally, the probability that pollutants are degraded and adsorbed increases with the growing contact time between infiltration pollutants and unsaturated zones. Therefore, the thickness of the unsaturated zone exerts a protection effect on the groundwater system and the thicker an unsaturated zone is, the more favorable for preventing the groundwater from pollution.

(4) Precipitation

Precipitation is the primary carrier for liquid and solid pollutants to infiltrate and recharge to the groundwater system. Pollutants diffuse into aquifer through horizontal runoff after vertically migrating to aquifers through precipitation. The more the precipitations are, the more the infiltrating pollutants carried by precipitation and the larger the possibility of groundwater to be polluted. The average precipitation of the study area for years is found to be lowest in the southwest (990 mm) while most in the northeast (1140 mm), that is, the precipitations gradually rise from the southwest to the northeast.

(5) Karst development degree

The karst development degree is an important index for evaluating the risk of karst groundwater pollution. The higher the karst development degree is, the larger the development scales of karst caves and fractures in aquifers. Under the condition, it is easier for pollutants to flow in aquifers and more difficult to degrade and dilute the pollutants, which thereby leads to a higher possibility of groundwater pollution. The karst development degree in the study area is divided into five grades (Table 2): strong, moderate–strong, moderate, weak–moderate and weak by taking lithology and dissolved rates of boreholes as inference indexes. Different grades correspond to different scores.

(6) Hydraulic conductivity

The hydraulic conductivity depends on the property of rocks. The fractures inside rocks and size and connectivity of karst conduits influence the flow rate of groundwater and migration rate of pollutants to some extent and further affect the contact time between groundwater and aquifer media. The larger the hydraulic conductivity is, the stronger the water permeability of an aquifer, the faster the pollutants flow into the aquifer, the smaller the probability of the pollutants to be diluted and degraded during migration and the wider the pollution scope. According to the data obtained through pumping test, the hydraulic conductivity in the study area gradually declines from the southeast to the northwest.

(7) Types of karst landforms

Karst landforms are diverse and groundwater infiltrates in different approaches and modes under different landforms. For example, in karst depressions without soil layer or with thin soil layer, precipitations directly flow into underground rivers through depressions and sinkholes. In this way, groundwater is likely polluted if precipitations contain pollutants. However, in karst valleys, precipitations slowly flow out along the surface and mainly filtrate through small holes and karst caves. The filtration mode takes a long time. The combination types of karst landforms in the study area are divided into five types: karst landform types, karst depression, karst valley, slope, middle–low mountains, and eroded hills.

(8) Types of media of unsaturated zones

The media of unsaturated zones determine the filtration paths and approaches. Generally, the larger the particles of media of unsaturated zones are, the lower the clay contents and, therefore, the poorer the adsorptive capacity for pollutants while the larger the possibility of pollutants flowing into groundwater.

Special evaluation factors of groundwater

(1) Land use types

Human activities greatly influence groundwater system with the social progress. The most significant human behavior is shown as land use and land use types affect the capacity of pollutants to flow into groundwater. By analyzing satellite images, the study divides the land use types into land for construction, arable land, forest land and waters. And also, the scoring criteria of land use types in the study area are established according to the producing condition and capacity to flow into groundwater of pollutants under different land use types.

(2) Pollution sources of groundwater

The pollution sources of groundwater refer to the sources of various substances causing groundwater pollution. The pollution sources in the study area are divided into agricultural, domestic and industrial and mineral pollution sources according to data. The first two pollution sources separately refer to agricultural livestock farms and domestic sewage while the industrial and mineral pollution source denotes waste coal fields, sewage treatment plants and polluting enterprises. According to the migration rate and mode of groundwater, the types and influence scopes of pollution sources are scored. The corresponding scores are added up when random two or three scopes of pollution sources in the research study are superposed.

Weight determination

Back propagation neural network

In the comprehensive evaluation of groundwater pollution risk, the rationality of weights of factors directly influences the reliability of the comprehensive evaluation results. Currently, the weights of evaluation factors are generally determined using subjective and objective weightings. Subjective weight method reflects the empirical judgment of decision-makers and the weights determined using the method generally conform to the practices. However, it is easy to exaggerate or reduce the effects of certain factors owing to different decision-makers having diverse empirical judgments (Wang and Li 2006; Fan et al. 2002). In contrast, the weights obtained by applying the objective weighting method are not the true importance of factors but measurements of useful information provided by data of various factors although the method effectively utilizes the data information of evaluation factors. Moreover, the more the information is, the larger the corresponding weights of factors and viceversa (Lu et al. 2008). Additionally, as a comprehensive evaluation system is a complex nonlinear system, the influence degrees of various factors on the evaluated problems vary with time and space. It means that weights of majorities of evaluation factors change with the development of evaluated objectives and progress of human awareness. Therefore, it is necessary to establish a learning mechanism for weights to adapt to the ever-changing evaluation requirements. Artificial neural network (ANN) has special advantages in solving the problem (Dawson and Wilby 1999; Snell et al. 2000; Du and Zhao 2006). Back propagation (BP) neural network is a multilayer feedforward neural network established based on BP algorithm whose topology structure is composed of input, hidden and output layers. The completeness theorem of mapping capacity of BP neural network indicates that a three-layer BP neural network can approximate to any continuous function at random precision (Yang and Ma 2016; Ni et al. 2017). The study attempts to systematically identify the unknown relations between various evaluation factors using BP neural network to establish the neural network model and determine weights of various evaluation factors. The method for determining weights of factors using BP neural network is displayed as follows (Wang 2013; Deng et al. 2016):

(1) Establishment of BP neural network

Data are chosen as training samples. The training samples contain m factors, each of which has n samples. Thereby, an input data matrix X = (x ij )n×m, (i = 1, 2, …, n; j = 1, 2, …, m) can be established in which x ij refers to ith factor of ith sample.

The network model contains three layers: input, hidden and output layers. There are m nodes in the input layer, namely m evaluation factors, while the input vector is expressed as X = (x1, x2, …, x i , …, x m )T. There is one layer in the hidden layer with k nodes (the value of k can be calculated according to Formula (1) or (2)). The output vector is shown as Y = (y1, y2, …, y j , …, y k )T and the output layer contains one node. The output value is O while the desirable output is expressed as D(d1, d2, …, d j , …, d k ). The connection weight matrix between the input and the hidden layers is displayed as V = (v ij ) m×k in which v ij denotes the connection weight between ith node in the input layer and jth node in the hidden layer. The connection weight vector between the hidden and the output layers is expressed as Z = (z1, z2, …, z j , …, z k ) in which z j refers to the connection weight between the output layer and jth node in the hidden layer.

$$k=\left( {{\text{nodes in input layer}}+{\text{nodes in output layer}}} \right)/{\text{2}}$$
(1)
$$k={\left( {{\text{nodes in input layer}} \times {\text{nodes in output layer}}} \right)^{{\text{1}}/{\text{2}}}}$$
(2)

There is the following relation between the input and the hidden layers:

$${y_j}=f\left( {\mathop \sum \limits_{{i=1}}^{m} {v_{ij}}{x_i}} \right)~~~~j=1,2, \ldots ,k~$$
(3)

The following relation exists between the hidden and the output layers:

$$O=f\left( {\mathop \sum \limits_{{j=1}}^{k} {z_j}{y_j}} \right)$$
(4)

\(f\left( x \right)\) in Formulae (3) and (4) is an activation function and the nonlinear transfer function sigmoid is generally applied, namely

$$f\left( x \right)=\frac{1}{{1+{e^{ - x}}}}.$$
(5)

Samples are chosen to train the neural network, namely the factor of the first sample is input after randomly taking a group of small data. The output error E i (the output error of ith sample) occurs when the output result does not conform to the desirable value. The formula is expressed as follows:

$$~{E_i}=\frac{1}{{2\left( {{d_i} - {o_i}} \right)}}~\quad ~i=1,2, \ldots ,n$$
(6)

The total error is expressed as follows:

$$E=\frac{1}{2}\mathop \sum \limits_{{i=1}}^{n} {E_i}.$$
(7)

To improve the generalization ability of the network, it is necessary to train the network by employing regularized adjustment method. The mean square error (MSE) of the network is employed as the performance function of the BP network. The formula is shown as follows:

$${\text{mse}}=\frac{1}{N}\mathop \sum \limits_{{i=1}}^{N} {\left( {{e_i}} \right)^2}$$
(8)

where e i refers to the training error of ith training sample (\({e_i}={t_i} - {a_i}\)) while t i and a i denote the target output and network output of ith training sample, respectively. The network performance function undergoing regularized adjustment is expressed as follows:

$${\text{msereg}}=\gamma {\text{mse}}+\left( {1 - \gamma } \right){\text{msw}}$$
(9)

where γ refers to the performance function while \({\text{msw}}=\frac{1}{n}\mathop \sum \nolimits_{{j=1}}^{n} {\left( {{w_j}} \right)^2}\).

The error signal \({\delta ^o}\) is obtained by calculating output desirable and actual values to adjust the connection weight between hidden and output layers. The specific method refers to Formulae (10)–(14). All samples are trained and the training ends when E < Emin (the training precision is generally a positive decimal fraction). Otherwise, the connection weight is modified and the calculation is repeated until satisfying the requirement.

$${\delta ^o}=\left( {d - o} \right)o\left( {1 - 0} \right),$$
(10)
$$\delta _{j}^{y}={\delta ^o}{z_j}{y_j}\left( {1 - {y_j}} \right),$$
(11)
$${z_j}\left( t \right)={z_j}\left( {t - 1} \right)+\eta {\delta ^o}{y_j}+\mu \Delta {z_i}\left( {t - 1} \right),$$
(12)
$${v_{ij}}\left( t \right)={v_{ij}}\left( {t - 1} \right)+\eta \delta _{j}^{y}{x_i}+\mu \Delta {v_{ij}}\left( {t - 1} \right),$$
(13)

where the learning rate η ∈ (0,1) and the momentum µ ∈ (0,1).

After the training ends and reaches required precision, the sum of absolute values of all connection weights (V) is calculated according to Formula (14), followed by normalization, to further obtain the weights of all factors.

$${w_j}=\frac{{\mathop \sum \nolimits_{{l=1}}^{k} \left| {{v_{ji}}} \right|}}{{\mathop \sum \nolimits_{{i=1}}^{m} \mathop \sum \nolimits_{{l=1}}^{k} \left| {{v_{il}}} \right|}}\quad j=1,2, \ldots ,m$$
(14)

Weight calculation

If too many input samples are chosen, the calculation result may not meet the requirement for high precision. However, if only few are chosen, it would be hard to satisfy the accuracy requirement for the evaluation. After many attempts, 10 sets of data have been chosen as input samples. The locations of these data are distributed evenly in the study area, and then the quantization of input data is realized according to Tables 3 and 4. Meanwhile, to achieve contrast between input and output samples, we have further normalized the quantified data and converted into the numbers between 0 and 1, and Table 5 is the input sample used for network training. The three-layer structure composed of an input layer, a hidden layer and an output layer is employed. The hidden layer contains 9 nodes and the network training precision is 0.001. The MATLAB software is used in calculation.

Table 4 Scoring criteria of special evaluation indexes of groundwater
Table 5 Input samples for training the neural network

The desirable output sample is the risk level of groundwater pollution of each input sample. Values between 1 and 5 were selected to quantify the output samples, in which 1 represents very low risk, while 5 represents very high risk. Similarly, we have normalized the quantified data and Table 6 is the desirable output samples according to artificial neural network training.

Table 6 Desirable output samples for the training of the neural network

The blue line and the dotted line in Fig. 2 denote the precision condition and target of training, respectively. After training for 2378 times, the training ends owing to the network satisfies the requirement, namely, MSE < 0.001. At this time, the predicted values calculated by artificial neural network are consistent with the desirable output samples (Fig. 3). By the connection weigh matrix V between the input layer and the hidden one, we can get the weight of each evaluation factor shown in Table 7 according to formula 14.

Fig. 2
figure 2

Network training precision

Fig. 3
figure 3

Prediction of BP neural network

Table 7 Weights of various evaluation factors

Maps of various evaluation factors based on GIS

The thickness of unsaturated zone, the lithologic combination of unsaturated zone and aquifer, karst corrosion ratio, hydraulic conductivity and so on are calculated using the collected data of geological drilling and pumping test about maps and records in the study area. To compare different factor data, it is necessary to remove the influences of different dimensions and units of various factors (Wu et al. 2016). Therefore, the dimensionless treatment is conducted on raw data according to the scoring criteria in Tables 3 and 4 to convert the data in the range of 1–10. Finally, thematic maps can be drawn by the interpolation method combined with GIS software which include the types of aquifers, the thickness of unsaturated zones, the media type of unsaturated zones, the hydraulic conductivity and the degree of karst development. In the same way, the distribution map of atmospheric precipitation has been made according to the years of monitoring data of the rain station in the study area. According to the existing topographic map of the study area, the topographic slope map has been drawn. According to the remote sensing image of the study area, the land use situation was divided into four types including construction land, arable land, woodland and water area, and the land use type map was made on this basis. Thematic map of Karst landform and influence scopes of pollution sources map have been drawn according to field investigation data, the influence scopes of pollution sources are harder to unify, the influence scopes of the representative pollution sources in Huaxi district was used as a basis in this study. According to the observation, the further the pollution distance is, the lighter the degree of pollution is. The evaluation factor thematic maps are as follows, based on which the spatial distribution law of the quantized values of various factors can be institutively mastered (Figs. 4, 5, 6, 7, 8, 9, 10, 11, 12, 13).

Fig. 4
figure 4

Precipitation

Fig. 5
figure 5

Topographic slope

Fig. 6
figure 6

Types of landforms

Fig. 7
figure 7

Land use types

Fig. 8
figure 8

Karst development degree

Fig. 9
figure 9

Hydraulic conductivity

Fig. 10
figure 10

Thickness of unsaturated zones

Fig. 11
figure 11

Media types of unsaturated zones

Fig. 12
figure 12

Types of aquifers

Fig. 13
figure 13

Influence scopes of pollution sources

Risk prediction of karst groundwater pollution in Huaxi district

The risk evaluation model of karst groundwater pollution

Establishing a risk evaluation model for karst groundwater pollution is to establish a mathematical model which shows the influences of various evaluation factors on groundwater pollution. Thereby, the risk index VI is introduced to evaluate the risk of karst groundwater pollution. The risk evaluation is summarized to the sum of superposed influences of difference evaluation factors on groundwater pollution in an element of a section in an area, which can be expressed using the following model formula:

$$VI=\mathop \sum \limits_{{k=1}}^{n} {W_k}~ \cdot ~{f_k}\left( {x,y} \right)$$
(15)

where VI, W k , f k (x,y), x, y and n refer to risk index, weights of diverse evaluation factors, the score of kth evaluation factor, geographic coordinates, and the number of evaluation factors, respectively.

Risk evaluation of karst groundwater pollution in Huaxi district

On the basis of drawing maps of multiple evaluation factors, the maps of different evaluation factors are superposed by employing the information overlap technology of GIS to form the superposed cells combining multi-source information. Afterwards, the comprehensive risk evaluation indexes in different superposed cells are calculated using the established risk evaluation model, and then graded using Natural Breaks of GIS software to get the optimal five-level grading result. Eventually, the zoning map of comprehensive evaluation about the risk of karst groundwater pollution in Huaxi district combining multiple factors is prepared (Fig. 14).

Fig. 14
figure 14

The vulnerability zoning of karst groundwater

The high-risk and relatively high-risk zones are mainly distributed in the northeastern and central regions of the study area with an area about 150 km2, which takes up 43.9% of the total study area. The zone with low slope is abundant with precipitation and mainly shown as karst hills and depressions in landform. Moreover, the zone mainly has carbonate aquifers in terms of types of aquifers, showing high karst development degree, large-scale karst caves and sinkholes, and thin unsaturated zones in local areas. Thus, it is easy for the surface pollutants to infiltrate into groundwater with precipitation. Especially, owing to intensive urban constructions and distributions of waste coal mines in this zone, karst groundwater is likely to be seriously polluted once the pollution is improperly prevented.

Low-risk and relatively low-risk zones are mainly distributed in the northwest, north, and southwest of the study area with an area about 100 km2, taking up 30.56% of the total area. The precipitation is at a low–moderate level while the zones have large–moderate topographic slopes. These zones are mainly shown as monadnocks, slopes, peak clusters and valleys while carbonate rock with clastics and clastic aquifers are types of aquifers most widely seen. The karst poorly develops mainly appearing as karst fissures and holes and the thickness of unsaturated zones is at a moderate–thick level. Moreover, the lands are mainly used for forest and waters.

The moderate–risk zone is mainly distributed in the area between relatively low-risk and relatively high-risk zones covering an area about 92.06 km2, which accounts for 25.54% of the total study area. In this zone, the karst favorably develops and there is a gentle topographic slope. The precipitation retains for a long time while the media of the unsaturated zones in the zone are not in favor of infiltration of groundwater.

Conclusions

Currently, the determination method of factor weights in the risk evaluation model of groundwater pollution is usually determined by subjective or objective weight; however, all of these methods are either too subjective or the weight value can not reflect the actual meaning of the factor. The artificial neural network method has the characteristics of obtaining the approximate relationship of system from the limited data set, which can better reflect the complicated relationship among groundwater pollution evaluation factors and, therefore, this study constructed the risk prediction model of groundwater pollution based on artificial neural network. Factor values of ten different locations in the study area were chosen as the training samples, the training sample’s groundwater pollution risk as the output samples and the BP neural network was used to systematically identify unknown relationship among various factors, and then the factor weights were calculated. On this basis, combined with GIS technology, the risk evaluation of groundwater pollution in Huaxi District of Guiyang City was completed, and the method proved to be feasible through the analysis of the evaluation results. The groundwater pollution risk evaluation based on GIS-ANN technology can comprehensively and objectively reflect the characteristics of the groundwater pollution which is controlled by many factors and has a complicated nonlinear formation mechanism. At the same time, the evaluation results are intuitive, visual, and can provide a certain basis and reference for related research.

In southwest China, karst groundwater pollution is caused by the natural factors and human activities. Compared with other regions, the thickness of unsaturated zone is extremely uneven and corrosion effect is stronger, the unique karst development structure can connect groundwater with surface water. Furthermore, the unsaturated zone water retention capacity is very weak, which makes pollutants easier to wind its way to the groundwater, so in the process of prevention and control of groundwater pollution in this area, we must fully consider the particularity of the pollution approach and takes special prediction and control measures.