Introduction

Soil heavy metal pollution severely threatens agro-ecosystem stability and food security. Different from organic matter, most heavy metals are prone to transform to more toxic methyl compounds and do not degrade naturally. Heavy metals in agricultural soils can not only affect the soil security, grain output, and quality, but also endanger the health of human body through the food chain (Hou et al., 2020). Some research indicated that excessive exposure to the heavy metals could cause body damages and increase the cancer risks (Setia et al., 2021; Wang et al., 2020a, b, cd). Long term accumulation of soil heavy metals can bring about the soil nutrient loss and degradation of soil quality and soil function (Semenkov & Koroleva, 2020). Furthermore, soil heavy metals can cause secondary pollution by transferring to water (Karaouzas et al., 2021) and atmosphere (Wan et al., 2016). As a result, it is important to identify and map the source, ecological risks, and human health risks of soil heavy metals.

To analyze the effects of heavy metals on ecological environment and human health, multiple indices were proposed for heavy metal pollution assessment. Some indices including index of geoaccumulation (Muller, 1969), the enrichment factor (Buat-Menard & Chesselet, 1979), the contamination factor (Hakanson, 1980), and individual contamination factor (Ikem et al., 2003) were utilized to evaluate the individual soil heavy metal pollution. Some indices including Pollution Load Index (Wang et al., 2020a, b, cd) were utilized to assess the multi-element heavy metal pollution status. Other indices including Ecological Risk Index (Hakanson, 1980; Zeng et al., 2020) and total cancer risk (Wu et al., 2020a, b) have been applied to evaluate the effects of heavy metals on environment and human health.

The sources of heavy metals in agricultural soils are either related to the anthropogenic activities or natural process. Intensive input from human activities including fertilizers, agrochemicals, wastewater irrigation, and automobile exhaust emissions is the major driving forces of heavy metal accumulation in agricultural soils (Fei et al., 2019). Apportioning the corresponding contributions of soil heavy metals is of great importance to implement effective source reduction and site-specific sustainable soil management measurements. The techniques which can identify and quantify the sources of heavy metals in soils can be classified into source identification and source quantification methods. Source identification methods, which aim to define the types of pollution sources, include principal component analysis (Yang et al., 2020) and cluster analysis (Pandey et al., 2014). Source quantification methods, which aim to perform the quantitative analysis of contributions from different sources, include chemical mass balance (CMB) (Shi et al., 2019) and positive matrix factorization (PMF) (Wu et al., 2020a, b). Among multiple receptor models, PMF method is an effective and widely used method for soil heavy metal apportionment. It does not require the prior knowledge of source profiles, and it is appropriate to be applied in areas with similar parent materials and agricultural management practices (Lv, 2019). By integrating PMF and self-organizing map (SOM), Bhuiyan et al. (2021) identified four sources of soil heavy metals in Dhaka district in Bangladesh. Christensen et al. (2018) analyzed the anthropogenic and geogenic sources of heavy metals in moss and natural surface soils in Norway by PMF and Principal Component Analysis (PCA) methods. The source contributions of heavy metals are affected by various factors including climate, hydrology, and topography. Most research just analyzed the concentrations of source contribution, while ignoring the associated health and ecological risks. As a result, this research combined source apportionment methods and ecological and human health risk assessment to quantify and map the sources and ecological and health risks of soil heavy metals.

Although receptor models can help identify the potential sources of heavy metal and explain the relationships between different soil heavy metals, they cannot explain the spatial correlations of soil heavy metals. Characterizing the spatial pattern of soil heavy metals can help identify and demarcate the areas with relatively high heavy metal concentration. Generally, geostatistical and data mining methods are two major methods that are used to characterize the spatial pattern of soil heavy metals. Random forest and land use regression model had been utilized to predict the spatial pattern of soil heavy metals in Dongli District, Tianjin (Wang et al. 2020a, b, c, d). Geostatistical methods such as ordinary kriging and Inverse Distance Weight (IDW) methods had been used to analyze the spatial correlations of soil heavy metals (Gujre et al., 2021; Sergeev et al., 2019). Geostatistical methods can also reveal the spatial structure and spatial variability of factors derived from receptor models. However, most of the research just analyzed the spatial pattern of soil heavy metals and did not explore the spatial pattern of source apportionment factors, health risks, and ecological risks (Kharazi et al., 2021; Liu et al., 2021a, b; Sergeev et al., 2019). Most research could not provide site-specific recommendations to the reginal ecological security and food security management.

Songnen Plain, an important part of the northern Songliao Plain, is an important commodity grain base in China. To feed the growing population, multiple natural lands were converted to cropland in the Songnen Plain in the past decades (Mao et al., 2019). In addition, the increasing use of chemical fertilizer and excessive reclamation have also led to soil degradation and soil nutrient loss (Song et al., 2018). However, few research has performed the source apportionment, ecological risk assessment, and health risk assessment of soil heavy metals in this area before. The primary goals of this study were to (1) determine the concentration of soil heavy metals; (2) assess the source apportionment, ecological risk, and health risk of soil heavy metals; (3) map the spatial pattern of soil heavy metals and related source apportionment factors, ecological risks, and healthy risks; and (4) make recommendations to the soil and agricultural management in Hulan River Watershed in Harbin, Heilongjiang province, China.

Materials and methods

Study area description

Hulan River is a tributary of Songhua River located in the central part of Heilongjiang province, China. It origins from Xiao Hinggan Mountains, and then flows into the Songhua River in Hulan District of Harbin City, capital of Heilongjiang province. Hulan River Watershed in Harbin City experienced the northern temperate continental monsoon climate with a warm summer and a cold winter. The frozen period is from late October to late April in this area. The average temperature in July and January is 20–23 °C and − 21– − 26 °C, respectively. The average annual precipitation is 574.7 mm. The soil type in Hulan River Watershed is Moollisols according to the USDA Taxonomy. Maize (Zea mays. L.) was the main cropping system in the region. In recent years, more and more corn fields are transformed to soybean field due to the high economic value and market demand of the soybean in China.

Soil sampling and analysis

The top soils (20 cm depth from the surface) of 144 soil samples were collected across the study area (Fig. 1). Each soil sample was a composite of 3 subsamples acquired from a 10 m × 10 m grid of land. All samples were dried at 25 °C for 2 weeks, and then grounded under 100 mesh, and passed through a 2-mm sieve. Approximately 0.25 g of soil sample was digested with concentrated HNO3-HF-HClO4 on an electric hot plate (Dang et al., 2021; Wu et al., 2019). The contents of soil Cd, Cr, Cu, Pb, Ni, and Zn were determined by inductively coupled plasma-mass spectrometry (ICP-MS) in Geological Bureau of Hunan province. The contents of heavy metals including As and Hg were determined by atomic fluorescence spectrometry (AFC) in Geological Bureau of Hunan province.

Fig. 1
figure 1

Soil sampling sites in Hulan River Watershed, Harbin, China

Assessment of soil heavy metal pollution

Pollution assessment of soil heavy metal

Single factor pollution index (PI) and Pollution Load Index (PLI) were applied to calculate the pollution level from each heavy metal and all heavy metals, respectively (Zang et al., 2017). PI and PLI were estimated as Eqs. (1) and (2), respectively,

$$\mathrm{PI}=\frac{{C}_{i}}{{B}_{i}}$$
(1)
$$\mathrm{PLI}=\sqrt[i]{{PI}_{1}\times {PI}_{2}\times {PI}_{3}\times \dots \, \dots \times {PI}_{i}}$$
(2)

where Ci the concentration of the ith heavy metal in soil, and \({B}_{i}\) is its geochemical baseline concentration in the study area (Sun et al., 2013). PIi is the single factor PI of ith heavy metal in soil (Jiang et al., 2021). The categories of PI and PLI are shown in Table S1.

Potential ecological risk assessment of heavy metal pollution

Ecological risk index (EI) and Comprehensive ecological risk assessment (\(\mathrm{RI}\)) can reflect the ecological impacts of a single heavy metal and all the soil heavy metals, respectively (Hakanson, 1980). The formula of EI and RI was calculated as follows:

$${EI}_{i}={T}_{i}\times {PI}_{i}$$
(3)
$$\mathrm{RI}={\sum}_{i=1}^{n}{EI}_{i}$$
(4)

where \({T}_{i}\) is the toxic-response factor (Zeng et al., 2020). The categories of RI and EI are also shown in Table S2.

Potential health risk evaluation of soil heavy metal

The human exposure risks from soil heavy metals were estimated by three different pathways including ingestion (ing), inhalation (inh), and dermal contact (der) (Zhao et al., 2019). In this study, the average daily intake (ADI) (mg/kg/day) of soil heavy metals for anthropic health risks through the following three pathways was estimated with Eqs. (5)–(7):

$${ADI}_{ing}=\frac{{C\times R}_{ing}\times EF\times ED\times CF}{BW\times AT}$$
(5)
$${ADI}_{inh}=\frac{{C\times R}_{ink}\times EF\times ED}{PEF\times BW\times AT}$$
(6)
$${ADI}_{der}=\frac{C\times SA\times AF\times ABS\times EF\times ED\times CF}{BW\times AT}$$
(7)

The definition and values of exposure parameters are summarized in Table S3.

The total non-carcinogenic risk (THI) and total cancer risk (TCR) (Wu et al., 2020a, b) were calculated with Eqs. (8) and (9).

$$\mathrm{THI}={\sum }_{i=0}^{n}\frac{{ADI}_{i}}{{RfD}_{i}}$$
(8)
$$\mathrm{TCR}={\sum }_{i=0}^{n}AD{I}_{i}\times {SF}_{i}$$
(9)

where n is the number of soil heavy metals, ADIi is the average daily intake for the ith soil heavy metal, RfDi is the reference dose for the ith soil heavy metal, and SFi is the slope factor for the ith soil heavy metal. If the THI value is higher than 1, there are potential non-carcinogenic effects. If the THI value is lower than 1, there is no risk of adverse health effect. If the TCR value is higher than 1E − 04, the cancer risk is unacceptable. If the TCR value is lower than 1E − 04, an acceptable risk for cancer can be expected (Wu et al., 2020a, b).

Statistical analysis

The descriptive analysis of the data was carried out in R software. The spatial distribution of heavy metals was characterized by Inverse Distance Weighted (IDW) method in ArcGIS 10.4.1. The factor scores from PMF analysis, PI, PLI, EI, RI, THI, and TCR were interpolated by IDW to show the pollution status and ecological and health risks in the study area.

PMF analysis

PMF (version 5.0) is a multivariate receptor model developed by Paatero and Tapper (1994). It was widely utilized to identify and quantify the sources of heavy metals in soil, sediments, and air (Brown et al., 2015; Christensen et al., 2018; Wu et al., 2020a, b). The calculation principle and expression of PMF are described as follows:

$${X}_{ij}={\sum}_{k=1}^{p}{g}_{ik}{f}_{kj}+{e}_{ij}$$
(10)

where i is the soil sample number, j is the heavy metal category, p is the number of potential sources, \({X}_{ij}\) is the sample data matrix, \({g}_{ik}\) is the source contribution matrix, \({f}_{kj}\) is the source profile matrix, and \({e}_{ij}\) is the residual for each soil sample.

The purpose of the PMF is to achieve the minimum value of the objective function (Q(E)). The Q(E) is defined as:

$$\mathrm{Q}(\mathrm{E})={\sum}_{i=1}^{n}{\sum}_{j=1}^{m}{(\frac{{e}_{ij}}{{u}_{ij}})}^{2}$$
(11)

where \({u}_{ij}\) is the uncertainty. \({u}_{ij}\) is calculated as the following formula:

$${u}_{ij}=\left\{\begin{array}{c}\frac{5}{6}\times MDL,(c\le MDL)\\ \sqrt{{\left(\sigma \times c\right)}^{2}+{\left(0.5\times MDL\right)}^{2}},\;\;(c>MDL)\end{array}\right.$$
(12)

where c is the soil heavy metal concentration, MDL is the method detection limit, and σ is a percentage of the measurement uncertainty.

Self-organizing map (SOM) analysis

A self-organizing map (SOM) is a data-driven method which requires no preliminary knowledge developed by (Kohonen, 1998). It is an unsupervised artificial neural network model that was widely applied in complex data analysis without preliminary knowledge. The mechanism of SOM resembles the process of human brains that it can self-organize neurons with the same weight vector as the input vectors, and cluster the spatial distribution of similar-function neurons (Wang et al., 2020a, b, c, d). The ordered neurons were connected through a neighborhood relation on a SOM map. Similar color gradients represent positive correlations and antiparallel color gradients show negative correlations. Two error parameters, including quantization error (QE) and topological error (TE), were used to control the SOM modeling quality. Firstly, the number of input samples (N) was 130, which showed 130 soil samples collected in the study area. Secondary, the number of neurons (M) was calculated by the formula as follows: M = 5 \(\sqrt{\mathrm{N}}\)≈55 according to the empirical equation. Finally, the final number of neurons (M) in the neural matrix was composited of 7*8 neurons by considering QE and TE. SOM Toolbox in MATLAB was utilized to apportion the sources of soil heavy metals.

Results and discussion

Descriptive analysis of heavy metal concentrations in Hulan River Watershed

The descriptive analysis of the soil heavy metal concentrations is shown in Table 1. The concentration ranges of heavy metals of As, Hg, Cr, Ni, Cu, Zn, Cd, and Pb were 3.52–27.97, 0.01–0.21, 16.12–78.12, 6.99–249.98, 5.64–940.28, 21.6–314.28, 0.04–1.21, and 15.77–38.5 mg/kg, with corresponding mean values of 10.00, 0.03, 51.91, 26.76, 27.71, 68.39, 0.15, and 24.64 mg/kg, and median values of 9.66, 0.03, 54.3, 25.99, 20.61, 64.65, 0.13, and 24.68 mg/kg, respectively. Approximately all the heavy metals of the soil samples exceeded the corresponding background value of the soil in Heilongjiang province (Sun et al., 2013). In addition, there were 0.77% and 3.08% of sampling locations showing higher concentration values than the risk screening values for Ni/Cu/Zn and Cd, respectively. According to the Costa’s classification (Costa et al., 2019) (high variation: CV > 35%, medium variation: 16 < CV ≤ 35%, low variation: CV ≤ 16%), Cd, Ni, Hg, Cu, and Zn in the study area showed high variations, implying the anthropogenic activities such as industrial emission, traffic emission, and fertilizers might be the important sources for these heavy metals. As and Cr showed median variations, suggesting they might be affected by both anthropogenic activities and natural factors.

Table 1 Descriptive statistics of soil heavy metals in the study area (mg/kg)

Evaluation of soil heavy metal pollution

Pollution evaluation of soil heavy metals

The summary of PLI and PI is shown in Table 2. The mean PI values of As, Hg, Cr, Ni, Cu, Zn, Cd, and Pb were 1.16, 1.56, 0.97, 1.15, 1.49, 1.22, 1.64, and 1.14, respectively, suggesting all the heavy metals except Cr in this area were slightly polluted. The percentages of soil samples showing PI values larger than 1 for As, Hg, Cr, Ni, Cu, Zn, Cd, and Pb were 74.01%, 80.77%, 53.85%, 73.85%, 73.85%, 81.54%, 88.46%, 87.69%, and 81.54%, respectively, suggesting most of the soil sampling sites were heavy metal polluted. According to the classification of PLI, 83.08% of the soil samples were slightly or mildly polluted, and 1.54% of the soil samples were severely polluted, and only 16.92% of the soil samples were unpolluted.

Table 2 Summary statistics of the Single factor Pollution Index (PI) and Pollution Load Index (PLI) in the agricultural soils of Hulan River Watershed

Ecological evaluation of soil heavy metals

The order of mean EI values of the heavy metals in soil samples was Hg (62.30) > Cd (49.22) > As (11.63) > Cu (7.45) > Pb (5.68) > Ni (5.74) > Cr (1.94) > Zn (1.22) (Table 3). All soils experienced low ecological risk with all the heavy metals except Hg and Cd. As the toxic-response factor of Hg and Cd is higher than other soil heavy metals, EI mean values of Hg and Cd were larger than 40, showing both heavy metals reached moderate risk level. A total of 80.77% and 60.77% of the soil samples exceeded the low risk level for Hg and Cd, respectively. RI combined ecological risk from all heavy metals and they ranged from 45.63 to 553.15 with the mean value of 145.19, indicating low combined ecological risk of soil heavy metals across the Hulan River Watershed. Only 24.62% of the soil samples exceeded the low risk level of RI (Table 3).

Table 3 Summary statistics of Ecological risk index (EI) and Comprehensive ecological risk assessment (RI) in the agricultural soils of Hulan River Basin
Table 4 Mean value of non-carcinogenic risk index and carcinogenic risk index of soil heavy metals based on three exposure pathways

Health risk evaluation of soil heavy metals

The average HI values were all below 1 for adult and children, and the average THI and TCR values were also higher than 1 and 1E − 04 for children, respectively (Table 4). The results suggest the non-carcinogenic and carcinogenic risks of the soil heavy metals in the study area were not in an acceptable range for the children. The values of three exposure pathways for average THI and TCR were in the order of THIder > THIing > THIinh, and TCRing > TCRder > TCRinh, respectively, suggesting ingestion and derma contact were two major pathways for heavy metal health risks. For As, Cr, and Ni, the average HI accounted for THI was in the order of As (62.21%) > Cr (27.47%) > Ni (1.37%), and the average CR accounted for TCR was in the order of Ni (42.60%) > As (32.92%) > Cr (24.49%). The non-carcinogenic and carcinogenic risk indices for children were higher than those for adult males and females, which is confirmed by other research (Amin et al., 2013; Li et al., 2021).

Source analysis of soil heavy metals based on PMF and SOM

Figure 2 shows the contributions of sources of soil heavy metals. Factor 1 was dominated by Hg with the loadings of 99.8%. Factor 1 was also moderately loaded with Cd (34.7%), As (21.0%), and Pb (21.0%) (Fig. 2). The high concentration of Hg was shown in the central part of the study area where coal steam-electric plant is built (Fig. 4). Multiple research also indicated that coal burning and atmospheric deposition were the major sources of Hg (Cooke et al., 2020; Liu et al., 2021a, b). Some research proved Cd, As, and Pb were important elements imported into the soil during atmospheric deposition (Luo et al., 2009; Zhao et al., 2019). As a result, factor 1 is considered as industrial emission and atmospheric deposition.

Fig. 2
figure 2

Profiles and contributions of sources of soil heavy metals from PMF model

Factor 2 was moderately loaded by Pb (33.6%) and Cd (24.6%). Previous studies showed that Pb in soil was regarded as one of the contaminants of traffic emission. Pb can not only be released from vehicle tires and brake pad (Xiao et al., 2020), but also from road dust deposit (Yesilkanat & Kobya, 2021). The area close to the major highways and roads in the study area had relative higher Pb concentration compared with other areas (Fig. 3). Some research proved that traffic was an important source of Cd (Jayarathne et al., 2018). Therefore, factor 2 is considered as traffic emission.

Fig. 3
figure 3

SOM map of concentration of soil heavy metals

Factor 3 was highly dominated by Cu and Zn with the loadings of 80.6% and 80.0% (Fig. 2). Previous studies have shown that farming practices, such as the application of fertilizer, pesticide, and livestock manure, were perceived to be the predominant sources of Cu and Zn (Manoj & Kawsar, 2020; Shen et al., 2020). Relatively high concentration of Cu was shown in the northern part of the study area where the major land use type is upland planted with maize. In addition, high concentration of Zn was overlapped with the spatial distribution of livestock farms, suggesting the livestock manure and effluent discharge from the livestock farms may lead to the Zn pollution in those areas. As a result, factor 3 is considered as agricultural source.

Factor 4 was described by Cr, As, Ni, Pb, and Cd concentration with the loadings of 65.3%, 56.7%, 54.7%, 42.3%, and 29.2%. Therefore, Cr, As, Ni, and Pb were the principal elements to demonstrate the source of pollution in factor 4. It is well known that those heavy metals could stem from mixed sources including natural source, sewage irrigation, industrial activities, and other anthropogenic origins (Chai et al., 2021; Heidari et al., 2021). Relatively high Cr concentration was shown in the central part of the study area where urban area of Hulan District is located. Multiple factories, residential areas, roads, and commercial buildings could be the important anthropogenic sources of Cr in the central part of the study area. The farmland in the northeastern part of the study area and Hulan River estuary area also had relatively high As concentration. Various research already had proved that fertilizer, pesticide, and manure were the major sources of Cr, As, and Ni in agricultural area (Kharazi et al., 2021; Setia et al., 2021). As a result, factor 4 is poised to be a combination of agricultural source, natural source, industrial emission, traffic emission, and atmospheric deposition.

Figure 3 is the SOM diagram of soil heavy metal concentration in the study area. The image shows the weight of each soil heavy metal of the input vector. The lighter the color, the greater the weight. If two soil heavy metals are similar in color on a component plane, it indicates that the two soil heavy metals are positively correlated and have a common source. It can be seen from Fig. 3 that the color gradients of Cr and Pb are similar, indicating that Cr and Pb are positively correlated and may have a common source of pollution, which is confirmed by PMF analysis as a mixed source. The color gradients of Zn and Cu are comparable and highly distributed at the upper left corner, which are very different from those of Cr and Pb. The left neurons of the maps show overall high concentration of Hg and Cd compared with right neurons of the maps.

Spatial analysis of soil heavy metals

Spatial pattern of soil heavy metal concentrations

Spatial pattern of all the heavy metals is shown in Fig. 4. Multiple research already concluded that anthropogenic activities such as fertilizers, pesticides, herbicides, and sewage irrigation can contribute to the enrichment of Cr and Pb in agricultural soil (Kharazi et al., 2021; Liu et al., 2021a, b). Relatively high Cr concentration was shown in the eastern part of the study area where the coal steam-electric plants, plastic steel factories, pharmaceutical factories, building materials factories, residential areas, commercial buildings, and roads could be the major anthropogenic sources of Cr. Relatively higher Pb concentration was displayed in the areas close to the major highways and roads. The spatial distributions of Ni and Cu were similar. Some farmlands in the northern part of the study area showed remarkable higher Ni and Cu concentration compared with other areas. It is highly possible that farmlands in this area experienced accumulation due to the agricultural activities such as fertilizer, pesticide, and sewage irrigation. High Zn concentration displayed spotty pattern across the study area. Hot spots of As emerged in the northeastern part of the Hulan River Watershed where there are multiple food processing plants located in Baikui town and Huangjia town in Hulan District (Fig. 4f). High Hg concentration was mainly distributed in the central part of the study area, which is overlapped with the spatial distribution of the coal steam-electric plants (Fig. 4g). Relatively high Cd concentration was shown in the Hulan Estuary. The increasing discharge of industrial and domestic effluents from the Hulan River may contribute to the accumulation of Cd in the study area. Multiple research also found the accumulation of Cd appears in other river estuaries such as Yangtze River Estuary of China (Wang et al., 2021), Scheldt estuary of Europe (Gaulier et al., 2021), and Mlalazi estuary of South Africa (Adeleke et al., 2020).

Fig. 4
figure 4

Spatial distribution  maps of soil (a) Cr, (b) Pb, (c) Ni, (d) Cu, (e) Zn, (f) As, (g) Hg, and (h) Cd 

Spatial pattern of the source contributions to the soil heavy metals

The factor scores were plotted using IDW method (Fig. 5). Factor 1 (atmospheric deposition and industrial emission) was highly concentrated in the central part of the study area where coal steam-electric plants discharged heavy metals such as Hg. Factor 1 was also highly loaded in the southeastern part of the study area where urban areas had high contribution of the atmospheric deposition and industrial emission. Factor 2 (traffic source) was widely distributed in the western and southern part of the study area where multiple expressways and highways were built. Factor 3 (agricultural source) was highly loaded in a small farmland area located in the northern part of the study area where farmlands in this area had significantly higher Ni, Cu, and Zn concentration compared with other areas. Factor 4 was widely distributed in the study area as it was a combination of agricultural source, industrial emission, atmospheric deposition, and natural source.

Fig. 5 (a-d)
figure 5

source contribution to soil heavy metals in the study area

The spatial distribution of

Spatial pattern of the pollution, ecological risk and health risk indices of soil heavy metals

Figure 6 shows the spatial pattern of PI and PLI of heavy metals in the study area. According to Fig. 6, most of study area was unpolluted and slightly polluted for As, Cr, Cu, Ni, Pb, and Zn. Hulan Estuary area was highly polluted by Cd, and some farmlands in the study area were also mildly or moderately polluted by Cd. Most of the study area was moderately contaminated by soil heavy metals according to the spatial pattern of the PLI (Fig. 6i).

Fig. 6
figure 6

Spatial distributions of pollution index (PI) (a-h) and pollution load index (PLI) (i) for heavy metals in Hulan River Basin

Figure 7 shows the spatial pattern of ecological risk indices of soil heavy metals in the study area. As, Cu, Cr, Ni, Pb, and Zn posed low ecological risk in most of the study area. Considerable and high ecological risk from Hg existed in the central part of the study area, and considerable and high ecological risk from Cd existed in the Hulan River estuary. RI values in the eastern part of the study area were higher compared with the western part of the study area (Fig. 7). In general, the spatial pattern of RI revealed that the eastern part of the study area experienced moderate ecological risk of soil heavy metals.

Fig. 7
figure 7

Spatial distributions of pollution ecological risk indices (EI) (a-h) and integrated ecological risk index (RI) (i) for heavy metals in Hulan River Basin

The average THI ranged between 0.08 and 0.47 for adult males, 0.09–0.55 for adult females, and 0.44–2.38 for children (Fig. 8a–c). From Fig. 8c, the average THI for children in most of the study area exceeded 1, suggesting that children experienced non-carcinogenic risk in most of the study area. From Fig. 8d–f, the average TCR ranged between 1.81E − 05 and 2.34E − 04 for adult male, and 2.11E − 05–2.72E − 04 for adult male, and 3.22E − 05–5.06E − 04 for children. The average TCR for adult males and females in a small area located in the northern part of the study area exceeded 1.0E − 04, suggesting adults in this area experienced carcinogenic risks. The spatial pattern of TCR for children also revealed that most of the study area showed relatively high TCR (> 1.0E − 04), suggesting children in these areas also experienced carcinogenic risks. In general, the non-carcinogenic and carcinogenic risks for adult and children along the Hulan River were lower than other areas, and children in most of the study area experienced carcinogenic and non-carcinogenic risks.

Fig. 8 (a-f)
figure 8

Spatial distributions of health risks for heavy metals in Hulan River Basin

In general, considerable and high ecological risk from Hg existed in the area close to the coal steam-electric plant, and considerable and high ecological risk from Cd existed in the Hulan River estuary area. Children in most of the Hulan River Watershed experienced carcinogenic and non-carcinogenic risks. Zonal pollution control and soil restoration measures should be implemented in the areas with high ecological and health risks.

Comparison with other soil heavy metal studies

Compared with other Northeastern China cities (Table S4), the mean concentration of Cu in this study area was the highest. In this area, the mean As concentration is higher than Changchun and Baicheng-songyuan Area, and the mean concentration of Ni showed lower concentration compared with Shenyang. Mean Cr concentration is higher than Changchun, Dehui, and Baichang-Songyuan area. Compared with the research conducted in Eastern China, the average concentration of Cd, Hg, Pb, and Zn was lower than those research in Eastern China. In North China, As and Cd of Tianjin and Shanxi were higher than Hulan River Watershed. The concentrations of all the soil heavy metals in the study areas of Central China were higher than those in Hulan River Watershed. Compared with the study areas in South China, the concentrations of all the soil heavy metals except Hg and Pb in Hulan River Watershed were higher than those in Guandong, while the concentrations of all the soil heavy metals except As in Hulan River Watershed were higher than those in Sanya. The concentrations of all the soil heavy metals in Hulan River Watershed were lower compared with other study areas in other developing countries such as India and Nigeria.

Conclusion

The concentrations of all the heavy metals in the soil samples exceeded the background values in the study area. The mean PI value of As, Hg, Cr, Ni, Cu, Zn, Cd, and Pb was 1.16, 1.56, 0.97, 1.15, 1.49, 1.22, 1.64, and 1.14, respectively, suggesting all the heavy metals except Cr in this area were slightly polluted. According to PLI, 83.08% of the soil samples were slightly or mildly polluted, and 1.54% of the soil samples were severely polluted, and only 16.92% of the soil samples were unpolluted. All soils belonged to the low ecological risk categories except Cd and Hg based on EI values. In addition, non-carcinogenic and carcinogenic risks for the children owed to the soil heavy metals were not in an acceptable range. Based on the PMF and SOM analysis, four potential sources were revealed including (1) atmospheric deposition and industrial emission; (2) traffic source; (3) agricultural source; and (4) a combination of agricultural source, industrial emission, traffic emission, and natural source. Considerable and high ecological risk from Hg existed in the area close to the coal steam-electric plant, and considerable and high ecological risk from Cd existed in the Hulan River estuary area. The eastern part of the study area experienced higher non-carcinogenic and carcinogenic risks for adults and children than the western part of the study area. Zonal pollution control and soil restoration measures should be taken in the areas with high ecological and health risks in Hulan River Watershed.