Introduction

With the rapid development of industrialization and urbanization, heavy metals pollution in soil has attracted extensive attention and research (Zhang et al. 2018a; Dong et al. 2019; Baitas et al. 2020; Liu et al. 2020; Wen et al. 2020). Heavy metals can migrate into groundwater and are absorbed by plants into the food chain, which causes a threat to ecosystems and human health (Zhu et al. 2012; Dai et al. 2014; Burges et al. 2015; Yuswir et al. 2015; Shang et al. 2016). Lead, for example, may cause anemia, kidney disease, gastrointestinal colic, and central nervous system diseases (Pareja-Carrera et al. 2014). Arsenic may cause skin lesions, skin cancer and vascular disease (Tchounwou et al. 2012; Tan et al. 2016; Xiao et al. 2017). Although arsenic is a metalloid not a metal, it is often grouped with heavy metals in ecological-health risks studies (Zhang et al. 2018a; Wang et al. 2019b; Liu et al. 2020). In addition, even the metal elements required by the human body, if overexposed, will damage the health. For example, excessive copper will cause metabolism disorders, and even lead to liver cirrhosis (Ameh and Sayes 2019). Therefore, it is important to figure out the harmful effects of heavy metals pollution on ecosystems and human health (Dai et al. 2012; Huang et al. 2018a). Previous studies have found that heavy metals pollution is serious in population-intensive areas (Gülten 2011; Cai et al. 2019; Li and Jia 2018; Shi et al. 2019; Ye et al. 2019), especially in the vicinity of industrial areas (Harvey et al. 2017; Li et al. 2018; Lü et al. 2018; Zhang et al. 2018b; Antoniadis et al. 2019; Zhou and Wang 2019), which may have serious negative effects on ecosystems and human health (Machender et al. 2011; Xue et al. 2014; Qing et al. 2015; Wu et al. 2018a).

Soil parent material itself contains a certain amount of heavy metals, so it is necessary to identify whether the sources of heavy metals are natural or anthropogenic. Receptor models can be used in the quantitative analysis of pollution sources. Commonly used receptor models are: Chemical Mass Balance (CMB) (Zheng et al. 2005), Positive Matrix Factorization (PMF) (Hu et al. 2018), UNMIX (Lang et al. 2015) and Factor Analysis-Multiple Regression Model (FA-MLR) (Pekey et al. 2004). Among them, PMF model showed an advantage in dealing with the missing data, error estimate, and contribution rate of each source at each sampling site, which makes it widely used in source apportionment (Jang et al. 2013; Kim et al. 2018; Men et al. 2018a; Li et al. 2019; Su et al. 2019; Zanotti et al. 2019; Cheng et al. 2020).

Soil heavy metals data are often complex, in order to accurately explain the relationship between the variables, it is usually necessary to reduce the dimensions and classify the data. Traditional dimensionality reduction and classification methods such as PCA and HA (Pathak et al. 2013; Li et al. 2014; Lu et al. 2017) sometimes fail to accurately identify the relationships between variables, which leads to misinterpretations, and they are not useful for the visual assessment of the results (Astel et al. 2007; Lischeid 2009; Wongravee et al. 2010). To overcome the shortcomings of traditional methods, Self-Organizing Map (SOM) has been increasingly used to classify data (Han et al. 2016; Tao et al. 2017; Nakagawa et al. 2020; Zhu et al. 2020). SOM is a powerful non-linear projection tool that visualizes complex high-dimensional data structures onto two-dimensional surfaces by organizing input data into a smaller number of neurons (Kohonen 1982). SOM can reveal local relationships between variables and can be used to classify widely dispersed data (Hong and Rosen 2001). In addition, SOM supports techniques that use the reference vectors to give an informative picture of the data, which can clearly demonstrate the interdependence of variables (Skwarzec et al. 2009; Pearce et al. 2011; Zelazny et al. 2011). This study tried to combine SOM with potential ecological-health risks assessment to help to identify the high-risk areas to protect the ecosystem and human health reasonably and efficiently.

A previous reconnaissance survey showed that the concentrations of some heavy metals, such as Pb, As, Zn, Cu, were relatively high in topsoil around a lead plant, central China, so the potential soil pollution in the area has become a great concern. The purpose of this study is to evaluate the soil pollution level, identify sources of heavy metals in the topsoil and calculate potential ecological-health risks in the area. The sources of heavy metals were analyzed by PMF model. After using SOM to cluster the data, the ecological-health risks caused by heavy metals were assessed combining the SOM results, and the high risks areas around the factory were identified.

Materials and Methods

Study Area

The study area is around a lead smelting factory, central China, covering an area of approximately 25 km2 (Fig. 1). It is located in a plain area and the terrain is flat with an altitude of 140.83–141.50 m. The study area is temperate continental monsoon climate, with an annual average temperature of 14.5 ℃ and an annual average rainfall of 568 mm. The study area is divided into three zones (residential land, commercial land and farmland). Approximately 800 people live in the east of the factory, and most of them are farmers, so we regarded this area as the residential zone. There are some little shops and supermarkets near the factory for the daily needs of workers and farmers, so we regarded them as commercial zone. The other area was covered by agricultural land, so we regarded it as farmland zone. The soils in the study area are mainly loam and sandy loam. The bulk density of topsoil is 1.48 g cm−3. The soil development based on the parent material of Malan loess, and the leaching and deposition of calcareous substances are relatively weak, so the hierarchical differentiation of soil is not significant, and the soil color, texture, organic matter content and acidity and alkalinity of the core soil layer are relatively uniform (Wang et al. 2019a).

Fig. 1
figure 1

Locations of the soil sampling sites and the factory

The factory was built in 1997, and mainly deals in the smelting of lead. Lead ore mixed with limestone and quartz sand was fed into the oxygen bottom blowing furnace to produce crude lead, accompanying liquid lead slag and dust fume. The crude lead was sent to the refiner, and the lead slag was mixed with stone and coke particles and then put into the reduction furnace for reduction, and dusty flue gas was sent to the acid-making system. The raw materials used in this plant was lead–zinc mineral, so zinc, copper and arsenic are common impurities. Therefore, these four elements become the focus of topsoil heavy metals pollution in the study area.

Soil Sampling and Analyses

Sampling sites were distributed around the factory, and a total of 106 soil samples were collected, and each sample was a mixture of five random subsamples taken from the upper soil horizon (0 to 20 cm) of the 1 × 1 km grid, and the sampling sites were dense near the factory and distributed radially with the factory as the center (CEPA 2004). The original weight of each soil sample was greater than 1000 g. All samples were stored in a polyethylene package to avoid contamination. After removing stones and other debris, all soil samples were air dried at 20 °C and then passed through a 2.0 mm nylon sieve (Wang et al. 2019b) and sent to the laboratory for analyses.

The samples of Pb, Zn, Cu, Cr, Ni, Fe, Mn were digested in crucible by HNO3-HF-HClO4 digestion and analyzed by atomic absorption spectrometry (AAS). The samples of As were digested by HCl-HNO3, and were analyzed by hydride generation atomic fluorescence spectrometry (HG-AFS). Using geochemical standard soil samples provided by the National Certified reference Materials Research Center to ensure quality assurance and quality control (QA/QC), and the recoveries of each element were between 88 and 106%. In addition, duplicate samples were determined for 25% of the collected samples, and the relative standard deviations were less than 5%.

Geo-accumulation Index (\(I_{{{\text{geo}}}}\))

Geo-accumulation Index (\(I_{{{\text{geo}}}}\)) can be used to evaluate the pollution level of heavy metals in soil (Al-Haidarey et al. 2010). The calculation principle of \(I_{{{\text{geo}}}}\) were described in the Text S1.

Positive Matrix Factorization (PMF) Model

The PMF model is a data analysis method based on the principle of factor analysis, which was proposed by Paatero and Tapper (1993). The calculation principle and expression of PMF were described in the Text S2.

Self-Organizing Map (SOM)

SOM is an unsupervised competitive learning neural network method. It builds its own network structure by simulating the response of human brain neurons to external stimuli, which can make a cluster area form near each best matching neuron, so that the input vectors of similar features are classified into a cluster. Heuristic rules can be used to select the appropriate number of neurons, such as \(5\sqrt{n}\) suggested by Vesanto et al. (2000), where n is the number of samples. The Davies-Bouldin index can be used to determine the optimal number of clusters (Wang et al. 2017). Kohonen (1982) proposed the SOM algorithm and introduced the specific implementation process of the algorithm. The specific steps contained in the algorithm were described in the Text S3.

Potential Ecological Risks Assessment

Potential ecological risks assessment results show the toxicological effects of heavy metals and environmental responses (Eq. (1)) (Hakanson 1980).

$$\begin{array}{*{20}c} {R_{{\text{l}}} = \sum E_{{\text{r}}}^{i} = \sum \frac{{C_{n}^{i} }}{{B_{{\text{r}}}^{i} }} \times T_{{\text{r}}}^{i} } \\ \end{array}$$
(1)

\(R_{{\text{l}}}\) is the total potential ecological risks of all metals, \(E_{{\text{r}}}\) is the potential ecological risk of a single metal, \(C_{{\text{n}}} { }\) is the concentration of the metal, \({ }B_{{\text{r}}}\) is the environmental background value of a certain metal, and \(T_{{\text{r}}}\) is the toxicity coefficient of a certain metal. The toxicity parameters of Pb, As, Zn, Cu, Cr, Fe, Mn, and Ni were 5, 10, 1, 5, 2, 1, 1, 5, respectively (Hakanson 1980; Johnbosco 2020).

The grading values of the potential ecological risk, proposed originally by Hakanson, were based on eight pollutants (PCB, Hg, Cd, As, Pb, Cu, Cr, and Zn) which were different from the pollutants in this study. In order to make the grading more accurate, the grading of Hakanson’s was adjusted on the basis of the pollutants in this study according to the previous research results (Fernández and Carballeira 2001; Ma et al. 2011; Chen et al. 2018). The grading method of Hakanson’s was based on the maximum toxicity coefficient of pollutants and the total amount of toxicity coefficient of pollutants (Li et al. 2016). The adjustment of \({E}_{r}\) is as follows: the first grade boundary value = non-pollution index \(\left( {\frac{{C_{{\text{n}}}^{i} }}{{B_{{\text{r}}}^{i} }} = 1} \right)\) × the maximum toxicity coefficient (\(T_{{\text{r}}}\)) among pollutants (10 for As in this study) = 10, and the grade value for each grade is 2 times higher than the last grade. For the adjustment of \(R_{{\text{l}}}\), the unit grading value of toxicity coefficient (UGV) is determined first: UGV = 150 (first grade boundary value of Hakanson method)/133(total toxic coefficient of Hakanson method) ≈ 1.13. The total toxic coefficient of 8 pollutants in this study is 30, so the first grade boundary value = 30 × 1.13 ≈ 35, and the grading value for each grade is 2 times higher than the last grade. The above adjustment method has been proved to be a good way to describe potential ecological risks level (Chen et al. 2018). The comparison of the grading value between the Hakanson standard and this study was shown in Table 1.

Table 1 Comparison of Hakanson classification standard in this study

Human Health Risks Assessment

Heavy metals can enter the human body through three ways: oral ingestion, oral and nasal inhalation, and skin contact. The average daily exposures (ADDs) for these three ways can be calculated by Eq. (2)–(4) (USEPA 1996; Praveena et al. 2015).

$$\begin{array}{*{20}c} {{\text{ADD}}_{{{\text{ing}}}} = \frac{{C \, \times \, IngR \, \times \, {\text{EF}} \, \times \, {\text{ED}}}}{{{\text{BW}} \, \times \, {\text{AT}}}} \, \times \, 10^{ - 6} } \\ \end{array}$$
(2)
$$\begin{array}{*{20}c} {{\text{ADD}}_{{{\text{inh}}}} = \frac{{C \, \times \, InhR \, \times \, {\text{EF}} \, \times \, {\text{ED}}}}{{{\text{PEF}} \, \times \, {\text{BW}} \, \times \, {\text{AT}}}}} \\ \end{array}$$
(3)
$$\begin{array}{*{20}c} {{\text{ADD}}_{{{\text{dermal}}}} = \frac{{C \, \times \, {\text{SA}} \, \times \, {\text{AF}} \, \times \, {\text{ABS}} \, \times \, {\text{EF}} \, \times \, {\text{ED}}}}{{{\text{BW}} \, \times \, {\text{AT}}}} \, \times \, 10^{ - 6} } \\ \end{array}$$
(4)

The formula for calculating non-carcinogenic risk are (USEPA 1989):

$$\begin{array}{*{20}c} {{\text{HQ}} = \frac{{{\text{ADI}}}}{RfD}} \\ \end{array}$$
(5)
$$\begin{array}{*{20}c} {{\text{HI}} = \sum {\text{HQ}} = \sum \frac{{{\text{ADI}}}}{RfD}} \\ \end{array}$$
(6)

When HI < 1, it means that there is no non-carcinogenic risk to the human body. When HI > 1, it indicates that there is a non-carcinogenic risk to the human body, and the non-carcinogenic risk increases with the increase of HI (Qing et al. 2015). The formula for calculating carcinogenic risk is (Diami et al. 2016):

$$\begin{array}{*{20}c} {{\text{RI}} = \sum {\text{ADI}} \, \times \, {\text{SF}}} \\ \end{array}$$
(7)

When the carcinogenic risk RI < 10–6, the carcinogenic risk can be ignored; when 10–6 < RI < 10–4, the carcinogenic risk is acceptable; when RI > 10–4, the carcinogenic risk is unacceptable (Wu et al. 2015). The corresponding parameter values (IngR, EF, ED, BW, AT, InhR, PEF, SA, AF, ABS, RfD and SF) were summarized in the Table S1 and Table S2.

Results and Discussion

Description Statistics of Heavy Metals in Topsoil

The statistical results were shown in Table 2. The average concentrations of the eight elements all exceeded the corresponding background values (CNEMC 1990), and the excess level in descending order was Pb > As > Cu > Zn > Ni > Cr > Mn > Fe. The excess levels of Mn, Ni, Cr, and Fe were lower, implying that they might be less affected by anthropogenic activities. The coefficient of variation from large to small was Pb > As > Cu > Zn > Cr > Mn > Ni > Fe. That is, the coefficient of variation of Pb, As, Zn, Cu concentrations were still ranked in the top four places, indicating that the spatial distributions of these four heavy metals had great variability. The mean contents of all heavy metals in the study area except Pb and As were lower than the corresponding criterion of the Chinese Environmental Quality Standard for Soils (CEPA 2018), which is used to measure the safety in farm produce and human health. However, the contents of Zn and Cu in a few of samples were over the criterion, and Pb, As, Zn, and Cu exceeded the criterion with excess rates of 92.5%, 67%, 1.9% and 5.7%, respectively, indicating that these elements may cause potential harm to safety of agricultural products and human health, therefore, a great attention should be paid to them.

Table 2 Summary of heavy metals concentrations in the topsoil around the factory

Skewness values were applied to identify the heavy metals with normal or abnormal distribution. The heavy metals with skewness values between − 1 and 1 showed normal distribution and slightly positive skewness values of heavy metals were identified as abnormal (Chandrasekaran et al. 2015), and the skewness values of Pb, As, Zn, Cu in this study exceeded 1 (Table 2), which might indicating abnormal distribution of these four metals. And the histograms of the heavy metals could be seen in the Fig. S1.

In order to figure out the pollution level, the heavy metals were evaluated by the Geoaccumulation Index method (\(I_{{{\text{geo}}}}\)) (Table 2). 8.49% of Pb samples (the same below), 57.6% of As, 53.8% of Cu and 65.1% of Zn had \(I_{{{\text{geo}}}}\) values between 0 and 2, respectively; 65.1% of Pb, 19.8% of As and 1.89% of Cu had \(I_{{{\text{geo}}}}\) values between 2 and 4, respectively; and 25.5% of Pb had \(I_{{{\text{geo}}}}\) values greater than 4, and 4.72% of Pb had \(I_{{{\text{geo}}}}\) values greater than 5. The results showed that Pb, As, Zn, Cu caused pollution in different level, and Cr, Fe, Mn and Ni caused no pollution.

Source Identification of Heavy Metals in Topsoil

The PMF model identified three factors (Fig. 2), and residuals were between − 3 and 3. Coefficient of determination (R2) between the measured and modeled concentrations of the models were all greater than 0.75, so it was appropriate to use this model to interpret the information contained in the initial data, and the results were reliable (Yang et al. 2013; Salim et al. 2019). Distribution maps of heavy metal concentrations in the study area which was used to explain the results obtained by PMF were shown in the Fig. S2.

Fig. 2
figure 2

Factors profiles and source contributions of heavy metals from PMF model

The first factor, accounting for 41.3% of the total sources, was predominated by Pb, Zn, As, and Cu with the contributions of 68.8%, 49.2%, 43.3% and 54.5%, respectively. In the Fig. S2, Pb, Zn, As, and Cu all had large areas of high concentrations in the southeast and slowly decreased to the surroundings. As the elements of Zn, As, and Cu were major impurities in the lead ore (Li et al. 2010), so during the lead smelting process, industrial fume containing high concentrations of Pb, Zn, As, and Cu was generated. Industrial fume entered the atmosphere, and then reached the ground through dry deposition to increase the concentrations of heavy metals in the topsoil. The main wind direction of the study area was northwest wind, so the concentrations of Pb, Zn, As, Cu in the southeast of the factory were high, and the range of pollution plumes were extended in the southeast direction. Therefore, factor 1 represented the effect of industrial fume produced by the industrial activities of lead smelters on the heavy metals in topsoil.

The factor 2 (52.1%) was mainly associated with Fe (74.7%), Mn (73.6%), Cr (66.1%), and Ni (61.9%). Fe, Mn, Cr, and Ni in soil were often identified as natural sources of soil parent material, which has been confirmed in many studies (Huang et al. 2018b; Jiang et al. 2020). In the Fig. S2, the concentration distributions of Fe, Mn, Cr, and Ni had no obvious high-concentration region. And the concentrations of Fe, Mn, Cr, and Ni were close to the soil background values in the study area. \({I}_{\text{geo}}\) of soil heavy metals also showed that these four elements did not cause pollution (Table 2). Therefore, Factor 2 was considered as a natural source of heavy metals of topsoil.

The load of As (29.8%) was the highest in factor 3 (6.6%). As shown in Fig. S2, in addition to a large area of high concentrations in the southeast, As had a small area of high concentrations in the northeast and the concentrations were even higher here. According to field investigation, the sewage treatment station of the lead plant was located on the northeast of the plant. In 2017 and 2018, respectively, leakage occurred in the sewage treatment equipment and the sewage pipes, and only As in the sewage discharged by the lead plant exceeded relevant standards, and other elements were not, so that As-containing sewage entered the topsoil by this way. Therefore factor 3 represented the impact of wastewater leakage on the contents of heavy metals in topsoil.

Clustering of Sample Sites

Self-Organizing Map (SOM) was used to classify the sampling sites based on the concentrations of heavy metals. 50 (5 × 10) neurons and 4 clusters were selected according to Vesanto et al. (2000) and Davies–Bouldin index (Wang et al. 2017), and the results were shown in Fig. 3. The color gradients of Pb, Zn, and Cu neurons were consistent, indicating that the distributions of these three elements had a high similarity. Except for the difference of neurons color in the lower right corner, the color gradient of As in the other neurons was roughly the same as those of Pb, Zn, Cu. In addition, the neurons of Pb, Zn, Cu, and As in the first cluster were shown in dark blue, indicating low concentrations of Pb, Zn, Cu, and As; the neurons of Pb, Zn, Cu, and As in the second cluster were all shown in light blue, suggesting that the concentrations of Pb, Zn, Cu, and As in this cluster were higher than those of the first cluster. The neurons color of Pb, Zn, and Cu in the third cluster were still light blue, but the neurons color of As were yellow, indicating that the concentrations of Pb, Zn, and Cu in this cluster were similar to those in the second cluster, but the concentrations of As were higher than those in the previous two clusters; The neurons color of Pb, Zn, and Cu in the fourth cluster were yellow, indicating that the concentrations of Pb, Zn, and Cu were the highest, and the neurons color of As were blue and yellow, which indicated that the concentrations of As in this group were also high but lower than those in the third cluster. The color gradients of Cr, Mn, Ni, and Fe had no obvious correlation with the clustering results, which were not the main reason for controlling clustering. The statistics of the heavy metal concentrations in these four clustering results were shown in Table S3. The statistical results were consistent with the above analyses.

Fig. 3
figure 3

The SOM results: a the clustering pattern in the SOM, the number in a hexagon denotes the sample number; b the SOM visualization of heavy metals, the numbers indicate values of the corresponding variables

In combination with the results of PMF and SOM, it could be found that the first cluster of sampling sites were mostly located at a distance from the lead plant where the Pb, Zn, Cu and As concentrations were low, which could be regarded as the second source of PMF model results (natural source); the concentrations of As in the third cluster were the highest and the sampling sites were located at the northeast of the lead plant, which indicated that this cluster could be regarded as the third source (sewage leakage). The fourth cluster was distributed in the southeast, and the second cluster was distributed around factory other than the southeast. Under the influence of the prevailing northwest wind, the concentrations of the fourth cluster were higher than those of the second cluster. And under the influence of other direction winds, the concentrations of the second cluster were higher than those of the first cluster. As shown in the Fig. S2, southeast was not the only region where the heavy metals concentrations increased. Thus, compared with the results of the PMF model, the SOM model identified the effect of wind direction on the distribution of heavy metals in topsoil in more details. The results of the SOM model could provide a basis for studying the potential ecological-health hazards caused by heavy metals in topsoil around the factory.

Ecological-Health Risks Assessment of Heavy Metals in Topsoil

Ecological Risks Assessment

As shown in Fig. 4, the ecological risks values (\({E}_{\text{r}}\)) of all elements were below 10 (belong to the slight ecological risk) except Pb, As and Cu. As for Pb, \({E}_{\text{r}}\) of 1 (1%) sampling site was lower than 10, which belonged to the slight ecological risk level; 10 (9%) sampling sites had the \({E}_{\text{r}}\) of 10 to 20, which belonged to the moderate ecological risk level; 17 (16%) sampling sites had the \({E}_{\text{r}}\) of 20 to 40, which belonged to the strong ecological risk level; 36(34%) sampling sites had the \({E}_{\text{r}}\) of 40 to 80, which belonged to the quite strong ecological risk level; and 42 (40%) sampling sites had the \({E}_{\text{r}}\) above 80, which belonged to the extremely strong ecological risk level. The \({E}_{\text{r}}\) of As in 5 (5%) sampling sites were lower than 10, belonging to the slight ecological risk level; the \({E}_{\text{r}}\) of As in 39 (37%) sampling sites were between 10 and 20, belonging to the moderate ecological risk level; the \({E}_{\text{r}}\) of As in 30 (28%) sampling sites were between 20 and 40, belonging to the strong ecological risk level; the \({E}_{\text{r}}\) of As in 25 (24%) sampling sites were between 40 and 80, belonging to the quite strong ecological risk level; the \({E}_{\text{r}}\) of As in 7 (7%) sampling sites were higher than 80, belonging to the extremely strong ecological risk level. The \({E}_{\text{r}}\) of Cu in 81 (76%) sampling sites were lower than 10, belonging to the slight ecological risk level; the \({E}_{\text{r}}\) of Cu in 9 (8%) sampling sites were between 10 and 20, belonging to the moderate ecological risk level; the \({E}_{\text{r}}\) of Cu in 7 (7%) sampling sites were between 20 and 40, belonging to the strong ecological risk level; the \({E}_{\text{r}}\) of Cu in 9 (8%) sampling sites were between 40 and 80, belonging to the quite strong ecological risk level; No sample site of Cu was with the extremely strong level.

Fig. 4
figure 4

Box drawing of potential ecological risks

The \({E}_{\text{r}}\) from large to small was: Pb > As > Cu > Ni > Cr > Zn > Mn > Fe, which was different from the order of heavy metals pollution degree (\({I}_{\text{geo}}\)). This due to different ecotoxicity of different heavy metals. Heavy metals with very low pollution level, if the metal itself is highly toxic, it will still have a serious impact on the ecology. For example, the \({E}_{\text{r}}\) of non-polluted Ni was higher than the \({E}_{\text{r}}\) of contaminated Zn, which due to the high toxicity of Ni. In addition, as for the four heavy metals (Pb, As, Zn, Cu) with higher pollution level in 3.1, the risk values of As in the four clusters were ranked as: cluster III > cluster IV > cluster II > cluster I, and the risk values of Pb, Zn and Cu in the four clusters were all ranked as: cluster IV > cluster III > cluster II > cluster I, which were associated with pollution resulting from the As sewage leakage and the main wind direction.

For the total ecological risks (\({R}_{\text{l}}\)), 16 (15%) sampling sites were with \({R}_{\text{l}}\) below 35 (slight ecological risk level); 8 (8%) sampling sites with \({R}_{\text{l}}\) between 35 and 70 (moderate ecological risk level); 37 (35%) sampling sites with \({R}_{\text{l}}\) between 70 and 140 (strong ecological risk level); 35(33%) sampling sites with \({R}_{\text{l}}\) between 140 and 280 (quite strong ecological risk level); 10 (9%) sampling sites with \({R}_{\text{l}}\) higher than 280 (extremely strong ecological risk level). From the total ecological risk distribution map (Fig. 5), the ecological risk value was high around the factory. And the high concentrations of Pb, Zn, Cu and As in the downwind direction of main wind formed highest ecological risks area. The ecological risks of Pb occupied the highest proportion in the area except the northeast part where As occupied the highest proportion because of sewage leakage. It was worth noting that the areas with the extremely strong ecological risk values (\({R}_{\text{l}}\)> 280) were mostly commercial land, only 1 sampling site with such risk value was located in the farmland. However, the areas with the quite strong ecological risks (140 < \({R}_{\text{l}}\)<280) contained 23 sampling sites in the farmland, and the areas with strong ecological risks (70 < \({R}_{\text{l}}\)<140) contained 18 sampling sites in the farmland. So, it was best to keep the farmland away from the factory and to monitor the heavy metals contents of the soil frequently to ensure the safety of the farm produce.

Fig. 5
figure 5

Spatial distribution of potential ecological risks

Human Health Risks Assessment

Heavy metals in the environment could enter human body through a variety of routes, specifically three ways: ingestion, dermal contact, and inhalation. As shown in Table 3 for the average risks, the level of risks to the human body caused by these three ways were in the following order: ingestion > dermal contact > inhalation. This result was consistent with those in the previous studies (Chabukdhara and Nema 2013; Men et al. 2018b; Wu et al. 2018b), and generally, these three ways caused negligible health risks or no health risks to human body, except non-carcinogenic risks to children through ingestion (highlighted in bold in Table 3). The health risks to children and adults caused by different heavy metals in each cluster were shown in Table 4. For non-carcinogenic risks, heavy metals individually showed no risks to adults. However, Pb in the second, third, and fourth clusters, and As in the third and fourth clusters caused potential risks to children (highlighted in bold in Table 4). Cluster IV accounted for the highest proportion of all sampling sites, indicating that children in the downwind direction of the factory might have higher non-carcinogenic risks compared with the other areas. For carcinogenic risks, all heavy metals posed negligible health risks or no health risks for both adults and children.

Table 3 Health risks of heavy metals in the topsoil of three exposure ways
Table 4 Health risks of heavy metals in the soils of each cluster

The health risks distribution maps (Fig. 6) showed the range of total health risks from heavy metals exceeding the standard (non-carcinogenic risk > 1; carcinogenic risk > 10–4). Areas with high health risks were concentrated near the factory, and from the pie chart, Pb and As accounted for important proportion of the non-carcinogenic risks, and As accounted for an important proportion of the carcinogenic risks. The range of non-carcinogenic risks for adults was minimal, mainly in the southeast of the factory. The range of non-carcinogenic risks for children was large, almost covering the entire study area. The carcinogenic risks in the northeast were most serious for both adults and children, which were mainly affected by As sewage. Compared with adults, children were at higher health risks, and the uniqueness of their physiology, developmental stages and behavior leads to higher levels of exposure (NRC 2014). To effectively reduce children health risks, children should avoid the unhealthy habits of pica and allowing their fingers (Tan et al. 2016). Children could wear masks to prevent eating soil particles by mistake. In addition, fully understanding of the surrounding environment and meteorological conditions is necessary when outdoor public places such as parks and playgrounds are constructed. They should not be built in the vicinity of the plant sewage treatment station and the downwind of the factory.

Fig. 6
figure 6

Spatial distribution of human health risks (NCR: non-carcinogenic risk; CR: carcinogenic risk): a NCR for adults; b NCR for children; c CR for adults; d CR for children. Data used for interpolation are the sum of risk values from heavy metals for each sampling site

Conclusions

This study evaluated the heavy metal pollution in the topsoil near a lead plant, central China. It was found that the topsoil was polluted by Pb, As, Cu, and Zn with varying degrees with the order of pollution level of Pb > As > Cu > Zn. And the heavy metals came from three sources: industrial fume emitted from the plant entering the topsoil through dry deposition (41.3%), natural sources of soil parent material (52.1%) and sewage leakage (6.6%). Under the influence of prevailing northwest wind, the ecological risks and health risks were high in the southeast of the factory, and the leakage of As-containing sewage caused high risks of carcinogenesis in the northeast of the factory. Therefore, priority should be given to ecological and health protection in the southeast and northeast of the factory. The data evaluation methods and results of the study may be useful to the soil pollution investigation and assessment in other areas.