Introduction

As one of the most toxic heavy metals in the natural world, mercury (Hg) with the high toxicity and migration has brought wide attention (Campos et al., 2018; Shahid et al., 2020; Zhou et al., 2018). Harm from Hg even occurs in the long distance from its initial source because of the long-distance transport, which expands its distributing range in air, water or soil (Lawrence et al., 2013; Tang et al., 2020). The World Health Organization pointed out that Hg was harmful for the skin, eye, lung, kidney and other organs as well as the digestive, immune and nervous systems. It is recognized as one of the ten harmful substances causing the serious public health problem (WHO, 2019). Except for the direct contact, the consumption of plants containing mercury also threats human health (Shahid et al., 2020). Therefore, it is important to analyze the spatial distribution of Hg and trace its potential risk sources.

Mercury can be normally decomposed and released from rock or soil under the functions of water, wind or volcanic activities in natural state. In addition, it also comes from human activities (Shahid et al., 2020). Previous studies reported that some mining activities lead to Hg contaminant in water or soil, which polluted the adjacent ecosystem (Beckers et al., 2019; Osterwalder et al., 2019). Moreover, the dust generated by vehicle exhaust due to the Hg addicted fuel made the Hg disperse near roads or other traffic facilities with the increase in traffic, which has become an important source of soil Hg pollution and exerts negative impacts on nearby ecological environment (Chary et al. 2008; Liu et al., 2012). Besides, the impact on the Hg in grain from mercury-containing pesticides had also become the research focus, the rice consumption brought high organic Hg exposure risks indirectly because Hg entered human body via food chain (Jiang et al., 2021; Wang et al., 2021; Yeganeh et al., 2013). It can be concluded that the agricultural additions of pesticides and fertilizers, as the important external inputs of Hg, should be given enough attention.

Self-organizing map (SOM) belongs to the unsupervised machine learning method and has superior performance in processing nonlinear problem (Mari et al., 2010). As a classic competitive neural network, SOM produced the best matching units and acquired good classifying and visualizing results for input samples. It has already applied for pattern recognition of heavy metals and pollutants in water or soil (Wang et al., 2020). Geoaccumulation index (Igeo) for evaluating heavy metal contamination can explicit Hg contamination and its distribution heterogeneity effectively and vividly (Bing et al., 2011). The risk assessment for Hg by potential ecological risk index (Er) is helpful for the prediction of subsequent ecological risk and reflects the ecological impact of Hg content under certain environment through combining the toxicological, environmental and ecological effects (Antoniadis et al., 2019; Tabassum et al., 2019; Wang et al., 2020). Besides, positive matrix factorization (PMF) is a classic model for source identification, which is used for the source apportion of pollutants in soil, air and sediment popularly (Lv et al., 2019; Magesh et al., 2021). The environmental parameters for the transfer or dispersion of pollutant as well as the pollutant emission inventory are not necessary for PMF, which avoids the deviations resulted from the meteorological or topographical factors effectively (Taghvaee et al., 2018).

Ganzhou City in Jiangxi Province is not only rich in rare earth resources, but also an important transportation hub connecting four provinces in Southeast China (Cheng, 2009; Yang et al., 2013). The research area located in the south of Ganzhou is adjacent to the mining area as well as traffic facilities, and it has developed agriculture, which probably increased the potential pollution of Hg (Xue et al., 2013). Most previous studies paid attention to the potential pollution of toxic elements in mining areas, while this study about Hg pollution considered multiple pollution sources including traffic, mining and smelting as well as farming activities in a typical area innovatively. Combining with various influencing factors, this study aims to: (i) analyze the spatial distribution of Hg and classify the Hg concentrations; (ii) assess the Hg pollution risk; and (iii) identify the pollution sources of Hg in a typical area in Jiangxi Province. This study will give the better comprehension of Hg pollution and its subsequent health risk in a typical area with several possible pollution sources.

Material and methods

Study area

The typical area for this research (114° 34′ 27″ E–115° 30′ 55″ E, 24° 51′ 56″ N–25° 32′ 30″ N) is in Ganzhou City in the south of Jiangxi Province, China, which is mainly in two counties—Xinfeng and Anyuan as shown in Fig. 1. This typical area is adjacent to mining area and traffic facilities. Part of it in Xinfeng was mainly planted with rice, and part of it in Anyuan was mainly planted with fruit trees. The altitude of the study area ranges from 109 to 1152 m, and there is mainly red soil. This typical region belongs to subtropical monsoon climate, with hot summer and warm winter as well as four distinct seasons. There is also simultaneous rain and heat, and developed monsoon in summer (Yu et al., 2020). Its subtropical climate is suitable for the plantation of subtropical crops, especially the rice and orange. Besides, Ganzhou with superior geographical location has become an important transportation hub, which connects many essential economic regions of China. The south region of Ganzhou with rich rare earth reserves has formed developed industries and significantly contributed to local economic development (Zhao et al., 2019). Of course, the relative resource mining has put heavy burden on local ecological environment in many areas such as Anyuan County as previous study reported (Li et al., 2020).

Fig. 1
figure 1

Study area

Soil sampling and analysis

This study collected 78 surface soil samples (0–20 cm) in Xinfeng (46 soil samples) and Anyuan (32 soil samples) in the south part of Ganzhou, and their geographical positions are in Fig. 1. The systematic random distribution method was adopted when the sample positions were determined. The actual geographical sites for sampling were adjusted according to the local environment in the south of Ganzhou. The soil samples were collected and saved in plastic bags in the cool space with no direct sunlight before transport to laboratory for further analysis. After small stones as well as organic debris removed and air-dried, the soil samples were broken up and sifted through the fine sieve of 1 mm. Then, the Hg concentration was measured by the analyzer named Lumex (RA915M) with the detection limit of 0.3 ng·g−1. The GBW07447 (GSS18) was chosen as the reference sample for quality control.

Self-organizing map

SOM belongs to unsupervised neural network, with which high-dimensional data is able to be projected to two-dimensional self-organizing network called “feature map” (Farsadnia et al., 2014; Li et al., 2018). Thanks to the good perform on classification and visualization, it has become an flexible and robust approach to detect the characteristics of the observing data (Melo et al., 2019; Tsuchihara et al., 2020). SOM consists of both input and output layers, and the n-dimensional input variables are stored in the input layer. Output layer neurons mapped from input variables have the nodes arranged in hexagonal lattice, of which the topology as well as numbers can be defined subjectively. Besides, the input as well as output neurons are connected by weight vectors, whose dimension is corresponding to the number of input variables (Rahman et al., 2022). The SOM training begins with weight initialization using random numbers, and the Euclidian distance between the input variable and weight for each output neuron is calculated. The distant neurons restrain each other while the neighboring ones stimulate each other during the SOM adjustment. The final winning neuron is seen as the best matching unit (BMU) of the input (Wang et al., 2020). The topographic error and quantization error are for the accuracy measurement of SOM. In this study, the 9 × 6 SOM was constructed for Hg concentration classification and visualization combined with k-means clusering, which was conducted in MATLAB 2012 using SOM Toolbox (http://www.cis.hut.fi/projects/somtoolbox/).

Geoaccumulation index

Geoaccumulation index performs well in evaluating pollution degree for the heavy metals in soil (Bing et al., 2011), which is calculated as:

$$ I_{{{\text{geo}}}} = \log_{2} \frac{{C_{{{\text{Hg}}}} }}{{1.5{\text{BG}}_{{{\text{Hg}}}} }} $$
(1)

where CHg denotes the Hg concentration, and BGHg is the background value of Hg in Jiangxi Province from Chinese Environmental Protection Administration (1990).

Potential ecological risk

The potential ecological risk index is adopted to evaluate the heavy metal pollutions as well as their toxicological effects (Wang et al., 2020). In this study, the Er of Hg can be calculated as:

$$ {\text{Er}} = \frac{{{\text{TR}}{}_{c} \cdot C_{{{\text{Hg}}}} }}{{{\text{BG}}_{{{\text{Hg}}}} }} $$
(2)

where TRc denotes the toxic response coefficient (Hg = 40), and CHg as well as BGHg are the Hg concentration and its background value is as same as formula 1.

Positive matrix factorization

PMF, initially adopted for the source identification of particulate matter, has already become a classic approach for quantitative pollutant source apportionment (He et al., 2022). It is expressed as:

$$ x_{i} = \sum\limits_{k = 1}^{p} {u_{{{\text{ik}}}} \cdot v_{k} } + \varepsilon_{i} $$
(3)

where xi denotes the measured value for the ith sample, and p, u, v, ε are the number of factors, the profile for each source, the contribution to each sample from each factor and the residual of each sample, respectively. By minimizing the objective function Q, PMF obtains the factor contributions:

$$ Q = \sum\limits_{i = 1}^{n} {\left[ {\frac{{x_{i} - \sum\limits_{k = 1}^{p} {u_{ik} \cdot v_{k} } }}{{\varepsilon_{i} }}} \right]}^{2} $$
(4)

Besides, the input uncertainty for each sample i is:

$$ {\text{uu}} = \sqrt {(E \times c)^{2} + (0.5 \times M)^{2} } \, (c > M) $$
(5)
$$ {\text{uu}} = \frac{5}{6}M \, (c \le M) $$
(6)

where c, E and M are the Hg concentration, error fraction of instrument and detection limit, respectively.

PMF 5.0 from US Environmental Protection Agency was adopted to quantify the contribution of pollution sources for Hg in this research.

Results and discussion

Summary Statistics for Hg concentration

The description for Hg concentration in the study area is shown in Table 1. It can be seen that the Hg concentration in Anyuan ranged in 1.4–345.6 μg kg−1 (n = 32) with the average of 41.1 μg kg−1, which indicated that the distribution of mercury content was uneven particularly. This is supported by the high concentration variance (CV = 138.4%). Of note, one site showed Hg concentrations as high as 345.6 µg kg−1 Hg, over eight times the average of Anyuan and higher than Hg concentrations reported near coal-fired power plants (Song et al., 2021). By contrast, the Hg concentration ranged in 8.5–239.0 μg kg−1 (n = 46) in Xinfeng (Table 1), showing a lower variance than that of Anyuan, which could also be learned from Figure S1. The standard deviation (SD) and the coefficient of variation (CV (%)) of Xinfeng were also lower. Besides, the average Hg concentration of Xinfeng was 53.9 μg kg−1 (Table 1, Figure S1), which exceeded the mean mercury concentration of Anyuan. This indicated that the overall Hg content of the farmland planted with rice was higher than that of the farmland planted with fruit trees. The Hg average of Xinfeng also increased the average of the whole study region, which was 48.7 μg kg−1. The averages of Hg concentration for Anyuan, Xinfeng and the whole study area were 0.49, 0.64 and 0.58 times of the background value in Jiangxi Province (84 μg kg−1 Hg), respectively (Table 1), which verified that the farming area near mining activities had no heavy mercury pollution on the whole (Chinese Environmental Protection Administration, 1990).

Table 1 Statistical description for Hg concentrations (μg kg−1)

Spatial distribution of Hg concentration

The result of inverse distance weighting (IDW) interpolation for Hg concentration is shown in Fig. 2. It could be seen that there was no obvious Hg pollution in the most region of Anyuan except for the south region, where pollution level was close to the average (344 μg kg−1) of certain reported typical pollution areas like the coal-fired power plants area in Heilongjiang (Li et al., 2017). According to on-the-spot investigation, there was farmland around polluted sites. Thus, the high Hg concentration probably caused by the solid wastes as well as the wastewater produced by farmland activities. Moreover, the excessive application of pesticides, organic manures and fertilizers could also contribute to soil mercury content (Cai et al., 2015; Feng et al., 2022; Zhong et al., 2016). The north region of Xinfeng also showed obvious Hg pollution, while most other areas of Xinfeng had relatively lower Hg concentration (Fig. 2). The sampling site with the highest Hg concentration in this region was near some railway stations, bus stations and petrol stations according to the online satellite map (https://map.baidu.com/@12959765,4842486,13z). As the important transportation hubs of Xinfeng, the railway stations and bus stations gathered passengers from various regions, who carried different motor vehicles. A large number of traffic vehicles, together with the running of trains, caused the mercury enrichment in nearby soil (Liu et al., 2012). The high soil mercury concentration was also discovered near the petrol stations and highway in the most north of Xinfeng on the basis of online map (Figs. 1 and 2), where large amount of vehicles gathered every day. The traffic discharged toxic tail gas into atmosphere continuously, and pollutants entered soil with falling dust (Lenka et al., 2017). The spatial distribution analysis of Hg concentration showed that traffic pollution was the important pollution source for Hg in Xinfeng, while the inappropriate agricultural activities caused heavy mercury pollution in some areas of Anyuan. Besides, the Hg pollution in Anyuan was concentrated in the south, while that of Xinfeng was in the north.

Fig. 2
figure 2

Inverse distance weighted interpolation for Hg concentrations

Classification for Hg concentration using self-organizing map

A two-dimensional self-organizing map of Hg concentration (left) and a U-matrix of SOM (right) were constructed (Fig. 3a). The research also classified the samples into four clusters combined with k-means cluster algorithm according to their SOM result (Fig. 3b), and the sample positions in each cluster were identified in Fig. 3c. The topographic error and the quantization error for SOM were 0.590 and 3.374, respectively.

Fig. 3
figure 3

Self-organic map and k-means cluster for Hg concentrations

It could be learnt that most samples with low Hg concentration belonged to Cluster I (Fig. 3ab), which also evidenced that there was no Hg pollution in most study area (Fig. 3c). Many other samples were divided into Cluster II, which scattered throughout the whole rice field in Xinfeng (Fig. 3c), while Anyuan had barely soil samples belonged to this cluster. This spatially reflected that the mercury concentration in the rice fields of Xinfeng was slightly higher than that of the fruit fields in Anyuan as well as the background value of the whole Jiangxi Province (Chinese Environmental Protection Administration, 1990). It was probably because the artificial additions like Hg-containing pesticides facilitate mercury enrichment to a small extent (Yuan et al., 2021). The conversion to sustainable organic farming may avoid the deeper enrichment of mercury. The samples near petrol stations and highway in the most north part of Xinfeng was classified into Cluster III (Fig. 3c). The sample close to the railway stations in Xinfeng as well as the sample in the southwest of Anyuan belonged to Cluster IV (Fig. 3c), which reflected the significant contributions from traffic source and agricultural source (Hu et al., 2021; Shang et al., 2018; Wang et al., 2022). Hence, the traffic facility or farmland with Hg pollution was supposed to be given enough attention or even the relative treating measures. It could also be excavated that the agricultural source promoted minor increase of Hg content in the large scale of Xinfeng, while led to serious Hg enrichment in a small scale in the south of Anyuan.

A1-A32 and X1-X46 denote the samples in Anyuan and Xinfeng, respectively.

Assessment for Hg pollution risk

The geoaccumulation index for each sample and the interpolation of it is shown in Fig. 4, and the correspondence between Igeo and pollution degree is in Table S1. Most samples and their nearby areas had negative Igeo due to the low Hg concentration. There was no high Igeo near rare earth mining and smelting activities, which indicated that mining and smelting rarely caused mercury pollution to surrounding areas. The rice planting had no local pollution as well. Besides, the individual sampling points with the Igeo between 0 and 1 (blue circle) were at the pollution degree of “No pollution ~ Moderate pollution” (Table S1). The two blue circles of this category at the north part of Xinfeng (Fig. 4) were at the sides of petrol stations and railway stations, thus the traffic source significantly contributed to their pollution degree. However, the mercury-containing compounds in vehicle fuels such as oil, coal and minerals were volatilized into air in the combustion process and made the air contain mercury vapor as well as dust containing mercury compounds. Moreover, mercury further settled in the surrounding soil with the release of tail gas (Debnárová & Weissmannová, 2010; Yaylalı-Abanuz, 2011). Besides, the highest Igeo marked with red circle appeared in farmland in the southwest of Anyan, which belonged to the moderate pollution. As reported that the use of mercury-containing pesticides and fertilizers would increase the soil mercury concentration and cause soil pollution (Dong et al., 2017), thus the locally artificial addition was very likely the primary reason leading to this risk. By contrast, the same level pollution did not exist in the vast area of Xinfeng. The absence of Hg pollution in the most farmlands of the research region may be due to the restrictions on the production and usage of the mercury-containing pesticides in recent decades.

Fig. 4
figure 4

Igeo and IDW interpolation for Igeo

The potential ecological risk index of Hg for Xinfeng and Anyuan and the corresponding interpolation result is in Fig. 5, and the relative criteria for ecological risk degrees are in Table S2. Most samples in both Anyuan and Xinfeng had the Er lower than 40, which indicated the low potential ecological risk in the areas near these sampling positions (Fig. 5). The yellow circles had the Er between 40 and 80, which reflected the moderate ecological risks in the central and north areas of the study region. These areas of moderate ecological risk were near the petrol stations, railway stations and highway in Xinfeng, which meant that the traffic pollution not only threatened human health through air pollution, but also promoted soil Hg enrichment and caused ecological risk as well as human health risk indirectly (Li & Tse, 2015). Moreover, the sampling site marked with orange circle had the considerable ecological risk, which was also near the railway stations. In contrast, neither the low ecological risk nor the moderate ecological risk appeared in Anyuan. Therefore, it must be noted that the sampling position in the southwest farmland of Anyuan marked with red circle had the high ecological risk (160 ≤ Er < 320). This local serious pollution may be caused by excessive pesticide additions, and it would bring about high human health risk (Raj et al., 2017; Tutton et al., 2022). The mercury enriched in the plants when its content was too high, which had a toxic effect on plants. It affected the absorption of nutrients by the plant roots, and then impacted the growth of their aboveground parts. It would also damage the human function when it entered human body through food chains (Jiang et al., 2021; Yeganeh et al., 2013). Thus, the local mercury pollution should be managed targetedly in Anyuan. On the contray, Xinfeng had no so serious pollution risk in the rice field.

Fig. 5
figure 5

Er and IDW interpolation for Er

Pollution source identification for Hg

The uncertainty result (Figure S2) for the input of PMF model was calculated (Fig. 6) according to formula 5 since the concentrations of all mercury samples were above the detection limit. The predetermined number for the explanatory factors was 3 in PMF. The Qrobust was approach to Qtrue after ten runs of the PMF calculation using the Hg concentration data and uncertainty file, which acquired an R2 > 0.9. These results suggested that PMF had good performance for quantifying the explanatory factors.

Fig. 6
figure 6

Factor contribution according to positive matrix factorization

In term of PMF result (Fig. 6), factor 1 had the largest contribution (49.8%) to Hg pollution. It could be learnt that most samples with lower Hg concentrations had negative Igeo and lower potential ecological risk indexes. The main sources of the soil Hg contents in most area of the research region were originally from the soil parent material, rock weathering and other natural sources (Carvalho et al., 2019; Nezhad et al., 2014; Rodríguez Martín et al., 2013). Therefore, the factor 1 with the contribution of 49.8% was most likely the natural sources.

Combined with the analysis of spatial distribution of Hg concentration as well as the ecological risk assessment for Hg pollution, most samples with higher geoaccumulation indexes and higher potential ecological risk indexes were close to highway or transportation facilities such as railway station and petrol station. Ganzhou City has been the important transportation hub in the southeast of China, which connects four provinces including Fujian, Jiangxi, Guangdong and Hunan (Cheng, 2009). Roads and other traffic facilities were under great pressure from traffic, which made large amount of harmful gas in the automobile exhaust discharged into the air continuously, and polluted surrounding soil at a certain degree. Therefore, it was concluded that factor 2 with the contribution of 33% was the traffic source, which was the large pollution source. In the face of serious dust pollution near roads, railway stations and petrol stations, it is suggested to build more green belts with strong adsorption capacity to gradually reduce air and soil pollution, decrease ecological risk and improve the quality of ecological environment (Gupta et al., 2016; Islam et al., 2012; Jiang et al., 2020).

Factor 3 contributed 17.2% calculated by PMF, which was the factor that contributed the least to mercury pollution. Except for traffic pollution, there was also local Hg pollution risk in farmland due to the inappropriate agricultural additions. Excessive use of pesticide and fertilizer had caused mercury enrichment in the soil. The cumulative Hg entered the human body and other organisms through the food chains, which caused serious consequences to human health and the whole ecosystem because of the highly toxic of mercury (Liu et al., 2021; Ma et al., 2017; Mahbub et al., 2017). The agricultural pollution had the smaller scale compared with the traffic pollution, which may be attribute to the excessive use of pesticides by individuals. Besides, Hg pollution was not detected in many other farmlands, especially the rice field in Xinfeng. It could be concluded that there was no excessive addition of mercury-containing substances that caused heavy Hg pollution at the large scale. Thus, factor 3 was possibly the agricultural pollution source.

Conclusions

This research investigated the spatial heterogeneity, clustering characteristics, ecological risks as well as pollution source contributions for Hg in a typical area. Results showed that there was no large-scale and heavy mercury pollution near mining or smelting areas of rare earth. However, some areas near railway stations, petrol stations and highway in Xinfeng had high Hg concentrations, and high Hg enrichment also appeared in farmland in the southwest of Anyuan according to spatial analysis and SOM. It reflected the serious impact on local soil environment from heavy traffic and inappropriate agricultural additions. The SOM classification also indicated that most samples in the rice field of Xinfeng had a slight increase of mercury compared with the most field of Anyuan, but this had not caused obvious pollution.

Besides, the geoaccumulation index distribution displayed that mercury from automobile exhaust emission entered the soil, causing certain pollution accumulation in the soil near highways, railway stations and gas stations in Xinfeng. The nearby potential ecological risk had increased as well. In contrast, the sampling position in the farmland in Anyuan had moderate pollution measured by Igeo and high ecological risk measured by Er. The enriched Hg harmed both human and ecological health, which was supposed to be given targeted pollution treatment. In addition, it could be concluded that the natural source like parent material and rock weathering was the factor with the largest contribution of the soil Hg contents. The traffic source causing different pollution degrees and health risks was factor 2 because of the larger-scale pollution than that of agricultural source. The agricultural pollution source appeared in Anyuan at a small scale was factor 3 in this typical area. This research gives good comprehension of Hg pollution in the rare earth mining area with developed traffic and agriculture and will provide scientific evidence for targeted Hg pollution treatment.