Introduction

Soil is a pivotal component of human life and serves as a vital nutrient source for plants, a stabilizing agent for environmental changes, and a resource for water preservation. However, it is also a site where numerous pollutants accumulate (Naveedullah et al., 2013). Many environmental contaminants accumulate in the soil and pose significant risks to the soil environment. Among these contaminants, trace metal pollution is particularly detrimental to the soil health (Chen et al., 2005; Govil et al., 2008).

Numerous domestic and international studies have yielded positive results in the investigation of soil trace metal pollution. In Moscow, where potential contamination sources exist, certain forested regions and areas near human-made structures maintain acceptable levels of trace metal contamination in the soils. Nevertheless, human activities significantly influence soil pollution (Guseva and Petrov, 2023). Taghipour et al. (2013) conducted a quantitative analysis of 46 soil samples from the outskirts of a northwestern Iranian city, revealing the impact of the city on adjacent agricultural zones through controlled experiments. Muhammad (2022) employed the ecological risk index (ERI), pollution load index (PLI), and Medalus model to characterize land degradation in Southeastern Serbia, offering a robust methodological foundation for land management. Delavar and Safari (2016) investigated the spatial distribution of Pb, Cu, and Zn near a zinc plant in Zanjan, Iran, identifying intense trace metal contamination near the factory and spreading outward in all directions from its center. A study of soil trace metal contamination in a Malaysian industrial city indicated a gradual decrease from the industrial and residential sectors to rural areas (Hossain et al., 2015). Jadoon et al. (2020), and Din et al. (2022) assessed the distribution and potential risks of trace metals in Pakistani soils by utilizing the load contamination index and human health risk assessment methodology, highlighting a higher susceptibility to carcinogenic risks in children than in adults. Ullah and Muhammad (2020) explored trace metal contamination and phytoremediation potential in soils and plants in Baluchistan, Pakistan, attributing geological sources as the primary cause. Khan et al. (2020) investigated trace metal risks in particulate matter from residential and industrial sites in Islamabad, Pakistan, identifying oral ingestion as the primary exposure route, with children at higher risk than adults. In the Hassimessaoud area of Algeria, industrial sectors exhibited higher trace metal concentrations than rural areas (Mohammed et al., 2014). Lead primarily originates from human activities, such as mining, while nickel and copper are primarily derived from the soil matrix, mining, and agricultural practices (Yang et al., 2022; Sun et al., 2010). Lee et al. (2020) quantified anthropogenic impacts on trace metal contamination in Korean soils using multiple contamination indices and cluster analysis based on a nationwide survey.

The significant presence of soil trace metal pollution in the MRGB is a matter of grave concern owing to its extensive industrial, agricultural, and human habitation significance. As the local economy has grown, there has been a notable increase in pressure due to environmental soil contamination, particularly in recent years. However, previous studies in the MRGB have predominantly focused on crops and agro-economics, neglecting comprehensive investigations into soil pollution characterization, health implications, and contamination sources. Therefore, this study rigorously examined the soil trace metals in the MRGB using conventional geochemical evaluation techniques. We investigated the geographical distribution of trace metals, assessed health-related concerns, and identified the pollution sources of soil trace metals in the region. We aimed to determine the current levels of soil trace metal pollution in the MRGB, its sources, and the potential risks to human health. This study can significantly affect efforts related to soil trace metal pollution prevention and control, ecological environmental protection, and population health within the MRGB.

Materials and methods

Study area

The MRGB, situated at the center of the Shandong Peninsula, exhibits varying topographies with high and low areas in the north, west, and east. It has a mild temperate, semi-humid climate, characterized by distinct seasons. The mean annual temperature is 12.3℃, while annual precipitation and evaporation average 792.5 mm and 1912 mm, respectively. Geologically, the stratigraphy of the Changle County aligns with the Luxi stratigraphic division within the North China stratigraphic zone. Various boundaries such as the Neo-Paleocene, Neo-Paleozoic, and Neo-Cenozoic are distributed throughout the region. Predominantly, Changle County features acidic rocks, with concentrated development of intrusive rocks in the southern and southeastern regions, whereas basal and neutral rocks occur in packages or veins. The Changle is located within the North China Plate of the Lusi Rise and Jiao Liao Rise and is characterized by an active tectonic fault system. According to the hydrogeological zoning of Shandong Province, the northern part of Changle County falls within the loose salt hydrogeological zone of the Northwest Lu Plain, and the southern part belongs to the carbonate-based hydrogeological zone in the middle and low hills to the south of Lu. Soil types in the region predominantly include tidal, coarse bone, brown, and brown soils. Figure 1 shows a visual representation of the location of the MRGB.

Fig. 1
figure 1

Geographic overview of the middle reaches of the Gui River Basin

Samples collection and processing

A total of 98 soil surface samples were collected in the MRGB using a grid arrangement based on a 1:50,000 land-use status map. The average sampling density was 5 pieces/km2, with samples collected at a depth of 0–20 cm. In four equal portions, the root system, straw, stones, insect remains, and other detritus were removed to maintain a weight between 1 and 1.5 kg in a clean, sturdy cotton bag. Subsequently, a mixed-sample field collection approach was employed for sample deployment. For rectangular-shaped sampling plots, sub-sample points were arranged in an “S” shape, while square-shaped plots utilized an “X” or checkerboard layout. In the case of a rectangular sample plot, the sub-sampling points were arranged in an “S” shape. For approximately square sampling plots, sub-sampling points were arranged in either an “X” shape or a checkerboard shape. The roughly square sampling plots were maintained at the exact same sampling location, depth, and weight. The soil samples were collected from the wild, naturally air-dried, crushed using a wooden stick, and then passed through a 10-mesh nylon sieve, resulting in 500 g sieved samples. Throughout the soil collection, air drying, grinding, transportation, and storage processes, precautions were taken to prevent any contact with metal utensils that could lead to contamination. Soil pH values were determined using a magnetic PXSJ-226 electrode (Shanghai Yidian Scientific Instrument Co., Ltd.), while the As and Hg contents were analyzed using an atomic fluorescence spectrometer (AF-630A). The Cu, Pb, Zn, Ni, Cr, Cd, V, Mn, P, and K contents in the soil samples were quantified using an inductively coupled plasma mass spectrometer (Nexlon 2000) (Shanghai PerkinElmer Instrument Co., Ltd.).

To maintain stringent quality control during testing and analysis, a standard substance was included for every 50 soil samples, in line with the quality criteria outlined in the Specification for Geochemical Evaluation of Land Quality (DZ/T0295-2016). This process yielded an accuracy and precision of 100%. In addition, to assess the relative deviation (RD), one duplicate sample was introduced for every 50 samples, resulting in a statistical pass rate exceeding 97%.

Assessment of trace metal pollution

The primary method for assessing soil trace metal contamination involves comparing contamination levels with established reference values. Therefore, selecting appropriate parameters is imperative for accurately evaluating pollution. In this study, both the geoaccumulation index and PERI methods utilized the WFC background values as reference benchmarks. The following values were employed for Cd (0.081 mg·kg−1), Hg (0.015 mg·kg−1), As (8.1 mg·kg−1), Pb (20.3 mg·kg−1), Cr (65 mg·kg−1), Cu (20.1 mg·kg−1), Ni (27.7 mg·kg−1), and Zn (52.8 mg·kg−1) to assess the level of trace metals contamination in the soil and its associated risks.

Geoaccumulative index method and enrichment factor

The geoaccumulation index, also referred to as the Muller index, provides a comprehensive assessment of environmental impacts such as natural geological processes and human activities (Tian et al., 2017). The correction coefficient, denoted as “K,” is commonly used to account for variations in background values resulting from differences between soil and rock properties and is typically set at 1.5 (Lin et al., 2001). The formula is as follows:

$${I}_{{\text{geo}}}={log}_{2}({C}_{i}/K\times {C}_{Bi})$$
(1)

where Igeo is the ground accumulation index of trace metal i; Ci is the measured value of trace metal i in soil (mg·kg−1); CBi is the background value of trace metal i (mg·kg−1), with the background value of soil in Weifang City as the reference value. The evaluation process is displayed in Fig. S1.

Reimann and Caritat (2005) employed an enrichment factor approach to evaluate soil trace metal enrichment and the presence of anthropogenic sources. This method relies on the ratio between the average trace metal concentration and background soil value. As established by previous studies (Flora et al., 2009), an enrichment factor exceeding 1.5 indicates the presence of anthropogenic sources for soil trace metals.

PERI methodology

The PERI, developed by Swedish scientist Hakanson, integrates the environmental interactions of trace metals with its toxicological effects. This index effectively portrays the potential impact of trace metals on the ecological environment (Yang et al., 2016; Hakanson, 1980). The calculation formulas are as follows:

$${P}_{i}=\frac{{C}_{i}}{{C}_{Bi}}$$
(2)

The potential ecological hazard indices for individual trace metals is as follows:

$${E}_{i}={T}_{i}\times {P}_{i}$$
(3)

The combined potential ecological hazard indices for multiple trace metals is shown below:

$${R}_{i}=\sum\nolimits_{1}^{n}{E}_{i}$$
(4)

where Pi is the single-factor pollution index of trace metals; CBi is the background value of trace metal i (mg·kg−1); Ti is the toxicity coefficient of the given trace metal i. The toxicity coefficients are as follows: Cd (30), Hg (40), As (10), Pb (5), Cr (2), Cu (5), Ni (5), and Zn (1) (Hakanson, 1980). The evaluation process is depicted in Fig. S2.

Human health risk assessment

In human health risk assessments, the potential for adverse effects on the population is directly linked to environmental pollutant levels. Following the model outlined by the US Environmental Protection Agency (USEPA, 1996), which evaluates the health risks associated with soil trace metal exposure in adults and children through three pathways, i.e., oral ingestion, oral nasal inhalation, and dermal contact, the health risks for both adult and child populations were assessed.

Daily exposure to trace metals (mg/kg/day) through three routes, oral ingestion, oral nasal inhalation, and dermal contact, was determined using daily average exposure values (Hossain et al., 2015; USEPA, 1996). SFi represents the slope factor for metal i, RfDi represents the reference dose for metal i, and CS represents the trace metals concentration in the soil (mg·kg−1). The relevant equations for calculating the daily exposure through oral ingestion, oral-nasal inhalation, and dermal exposure are as follows:

$${{\text{ADD}}}_{{\text{ing}}}=\frac{{C}_{s}\times IngR\times EF\times ED}{BW\times AT}\times {10}^{-6}$$
(5)
$${{\text{ADD}}}_{{\text{inh}}}=\frac{{C}_{s}\times InhR\times EF\times ED}{PEF\times BW\times AT}$$
(6)
$${{\text{ADD}}}_{{\text{dermal}}}=\frac{{C}_{s}\times SA\times AF\times ABS\times EF\times ED}{BW\times AT}\times {10}^{-6}$$
(7)
$${\text{HI}}=\sum {HQ}_{i}=\sum \frac{{ADD}_{i}}{{RfD}_{i}}$$
(8)
$${\text{THI}}=\sum HI$$
(9)
$${{\text{CR}}}_{i}=\sum {ADD}_{i}\times {SF}_{i}$$
(10)
$${\text{TCRI}}=\sum CRI=\sum (\sum {CR}_{i})$$
(11)

where ADDing, ADDinh, and ADDdermal are the average daily exposure (mg/kg/day) to trace metals by three routes: oral ingestion, oral-nasal inhalation, and dermal contact, respectively (Hossain et al., 2015; USEPA, 1996); CS is the content of trace metals in soil (mg·kg−1); SFi is the slope factor for trace metal i; and RfDi is the reference dose for metal i. Specific calculation parameters and values are shown in Tables 1 and 2. The evaluation process is depicted in Fig. S3.

Table 1 Calculation parameters and values of trace metal exposure for adults and children (Agency, 2001)
Table 2 RfD and SF parameters of trace metals under different exposure pathways (Hall et al., 2000)

Cu, Zn, and Hg do not pose a carcinogenic risk to humans; therefore, no carcinogenic risk calculations are performed for Cu, Zn, and Hg in this paper (Xu et al., 2020; Agency, 2001).

Results and discussion

General characteristics of soil trace metals

Table 3 shows the mean concentrations of soil trace metals in the MRGB, ranked from highest to lowest as follows: Cr (70.88) > Zn (56.8) > Ni (28.63) > Pb (19.40) > Cu (17.64) > As (9.31) > Cd (0.19) > Hg (0.03). Notably, Cr, Zn, Ni, As, Cd, and Hg exhibited higher mean concentrations than background levels in the WFC, indicating varying degrees of accumulation in the MRGB. The coefficient of variation (CV) was significant when the mean value was 0. In our analysis, we observed weak variability for CVs at 0.1, and moderate variability for CVs at 0.11, with Cd (1.14) > Hg (0.8) > Ni (0.67) > Cu (0.39) > Cr (0.36) > Zn (0.33) > As (0.26) > Pb (0.23). Notably, Cd exhibited substantial variability, indicating a considerable anthropogenic interference. The degree of trace metal contamination across the MRGB displayed significant spatial heterogeneity, particularly influenced by external factors and potentially altered by human activities, whereas other factors exhibited medium variability. Moreover, the skewness and kurtosis of soil Cd in the MRGB exceeded the typical distribution characteristics of soil trace metals, signifying a more pronounced skewed distribution with elevated concentration extremes. In addition, the kurtosis values of Zn, Ni, As, and Hg are also large, which indicates a flat data distribution, which is quite different from the normal distribution. This deviation may be attributed to the accumulation of trace metals resulting from human activities, which disrupts the normality of the natural distribution.

Table 3 General characteristics of soil trace metals in the MRGB

According to the “Risk Control Standards for Soil Pollution on Agricultural Land of Soil Environmental Quality (Trial)” (GB 15618-2018), the soil in the MRGB exceeds the risk screening values for Ni, Cd, and As by two points each and one point, respectively.

The soil pH significantly affects its physical and chemical characteristics and influences the behavior of trace metals ions, including their enrichment, mobility, and fugacity. The trace metal content in soil varies with pH, with most trace metals indicating a negative correlation with pH, although certain metals such as Cu and Zn exhibit a positive correlation with pH. In the MRGB, soil pH ranged from 6.79 to 8.69, with an average of 7.85, indicating a weakly alkaline soil. As depicted in Fig. S4, alkaline soils dominated the central region, constituting 97% of the MRGB, whereas acidic soils were concentrated in the northern and southern areas.

Spatial analysis and patterns of soil trace metals content

Geoaccumulation index method

The MRGB generally maintains a non-polluting status, according to the geoaccumulation index approach. However, moderate contamination was observed for Hg (15.31%), Cd (5.10%), Ni (4.08%), Zn (1.02%), and As (1.02%). Hg (2.04%) and Cd (1.02%) exhibited moderate to strong contamination and strong contamination, respectively, indicating that Hg and Cd are the primary pollutants requiring attention and remediation measures. Fig. 2 a illustrates the contamination levels assessed using the geoaccumulation index. Notably, Hg exhibited significant anisotropy in its maximum and minimum values, with Cd and Hg showing notably higher anomalies than the other six trace metals.

Fig. 2
figure 2

Box plot of geoaccumulation index method (a) and PERI class distribution (b)

We generated a cumulative distribution map for Cd ad Hg in the MRGB through interpolation based on inverse distance weights calculated from 98 sampling points. As illustrated in Fig. 3 d and e, Cd pollution was predominantly concentrated in Qianjiazhuang and the vicinity of the Xinfeng Iron and Steel Plant. In contrast, Hg pollution covered a wider area, with notable contamination in Qianjiazhuang and Shouyang Chemical Plant, which served as the most severely affected regions.

Fig. 3
figure 3

Distribution of Cd and Hg in relation to PERI and geoaccumulation index

The enrichment coefficients for the mean contents of the remaining six trace metals relative to the background values of Weifang soil ranged from 0.89 to 1.15. Notably, with the exception of Cd and Hg, the levels of Cu, Ni, and Cr in the MRGB closely mirrored WFC’s background values of WFC, suggesting primarily natural origins according to the guidelines outlined by Dai et al. (2011) for interpreting enrichment coefficients and source attribution. Conversely, Pb, Zn, and As exhibited enrichment coefficients ranging from 0.96 to 1.15, signifying relative enrichment, implying a potential interplay between natural and anthropogenic sources or purely natural origins. Cd and Hg, on the other hand, demonstrated pronounced enrichment coefficients of 2.30 and 2.11, respectively, indicating conspicuous enrichment and a stronger likelihood of anthropogenic sources.

PERI methodology

The PERI values for trace metals in the MRGB were computed using the formula and Excel. A hierarchical evaluation following the grading criteria in Fig. S2 was conducted for the trace metal PERI values. The ecological risk values for trace metals in the soil samples are presented in Fig. 2 b and ranked as follows: Hg (Ei=84.47) > Cd (Ei=68.90) > As (Ei=11.49) > Ni (Ei=5.17) > Pb (Ei=4.78) > Cu (Ei=4.39) > Cr (Ei=2.18) > Zn (Ei=1.08). Within the MRGB, Cu, Ni, Cr, As, Pb, and Zn posed low ecological risks. However, Hg and Cd exhibited varying levels of risk, with 8.38% of Hg samples at moderate risk, 38% at very strong risk, 70% of Cd and Hg samples at moderate risk, 9% at strong risk, 2% at very strong risk, and 1% at extreme risk. In the case of Hg, 38% of the samples exhibited a medium ecological risk, whereas 33% displayed a strong risk, with 5% categorized as very strong and 2% as extremely strong. This distribution was predominantly influenced by the toxicity coefficients of the pollutants, with Cd and Hg demonstrating higher contamination levels and larger toxicity coefficients, resulting in an elevated potential ecological risk. Across all sample sites in the MRGB, the ecological RI ranged from 62.20 to 962.64, with an average value of 182.44, indicating a moderate ecological risk overall. Specifically, 44.9%, 51.02%, and 2.04% of the samples were classified as having low, moderate, and high ecological risks, respectively. Moreover, applying the load pollution index method revealed that all MRGB samples fell within the category of moderate pollution, aligning with assessments of potential ecological risks (Izah et al., 2017; Zhang et al., 2019).

Cd and Hg were the primary contributors to the potential ecological risk of trace metals in the entire MRGB. Therefore, we conducted interpolation and analysis of Cd and Hg to create pollution distribution maps, as shown in Fig. 3 a and b. Cd pollution was predominantly concentrated around the Xinfeng Iron and Steel Factory, Qianjiazhuang, and Jiangyuan Textile Factory, while Hg pollution was most pronounced in the eastern area near Qianjiazhuang and Shouyang Chemical Factory, exhibiting more severe contamination. Fig. 3 c shows these areas, including Daming Biotechnology Co. Ltd., Xinfeng Iron and Steel Plant, Qianjiazhuang, Jiangyuan Textile Plant, and Shouyang Chemical Plant, exhibit high values for the comprehensive ecological hazard index. In addition, the southern region of the MRGB shows a relatively lower ecological risk than the more heavily impacted eastern portion.

Evaluation of the potential ecological hazards of soil trace metals throughout the entire research region with regard to human health

Non-carcinogenic risks

In the non-carcinogenic risk assessment of eight trace metals (Cu, Pb, Zn, Ni, Cr, As, Cd, and Hg), the non-carcinogenic risk values were found to be THI <1 for both adults and children. Specifically, the overall THI non-carcinogenic risk values for adults and children were 9.33 × 10−2 and 8.41 × 10−1, respectively. These results indicate that the non-carcinogenic risk levels for both adults and children in the MRGB remained within acceptable limits, as outlined in Table 4.

Table 4 Evaluation of non-carcinogenic and carcinogenic health risks associated with soil trace metals

Among all three exposure routes, oral intake was identified as the predominant route of non-carcinogenic risk for both adults and children, ranking higher than dermal contact and oral nasal inhalation. This assessment was based on the non-carcinogenic HQ, where each metal exhibited an HQ value of 1 for both adults and children across all three exposure routes. Oral consumption accounted for a substantial portion of the non-cancer risk quotient (HI), contributing to 99.21% and 98.68% in children and adults, respectively. Notably, children exhibited a higher HI for trace metals Cr and As (>0.1), warranting consideration due to their elevated HQ compared to adults. The relative contributions of the HI values of each trace metals to the overall non-carcinogenic risk index THI in adults and children are presented in Fig. S5. It is evident that Cr and As have a significantly greater impact than other trace metals. Overall, this study indicates that neither adults nor children in the MRGB face significant noncarcinogenic risks. However, it also highlights Cr and As as the most influential non-carcinogenic risk factors for both adults and children, necessitating preventive measures.

Risk of carcinogens

Table 4 reveals that the TCRIs for the five carcinogenic trace metals in the MRGB, considering three exposure pathways, were 2.84E-05 for adults and 5.83E-05 for children. These values fall within the acceptable range for carcinogenic risk to human health (1×10−6–1×10−4) to human health. This study focused on evaluating the carcinogenic risks of five trace metals: Ni, Pb, As, Cr, and Cd. Notably, the TCRI values were higher for children than for adults, signifying that the carcinogenic risk associated with soil trace metals in the MRGB was within acceptable limits for both local adults and children.

For each exposure route, the CR values of the five trace metals fell within the range of no carcinogenicity risk (< 1×10−6) and tolerable carcinogenicity risk for humans. Comparative analysis revealed that children had lower CR values than adults for each trace metals in both oral and dermal exposure routes, as well as lower CR values for oral-nasal exposure. Additionally, for As, Cr, and Cd, the risk hierarchy was oral consumption > skin contact > oral/nasal inhalation, highlighting oral intake as the predominant route of carcinogenic health risk.

The oral route significantly contributed to carcinogenicity risk, accounting for 85.55% of adults and 91.14% of children, which was notably higher in children. In each exposure pathway, the CR values for the five trace metals fell within the range of no carcinogenic risk (< 1×10−6) and a tolerable level for human health. The analysis indicated that children exhibited lower CR values than adults for both oral and dermal exposure routes, as well as lower CR values for oral-nasal exposure. Specifically, for As, Cr, and Cd, the risk hierarchy was oral consumption > skin contact > oral/nasal inhalation, underscoring oral intake as the primary route of carcinogenic health risks.

Table 4 presents the CRIs for the adults and children. Notably, the CRIs for all trace metals, except Ni, were lower in adults, as only the CRI for Ni was calculated for the oral and nasal inhalation routes. The CRI values for both adults and children followed the order Cr > As > Pb > Cd > Ni. Specifically, CRI values for As and Cr in adults and children were within acceptable cancer risk levels, whereas Cd, Pb, and Ni CRI values posed no health risks. Fig. S6 illustrates that Cr made a substantial contribution to the overall cancer risk, accounting for 74.33% and 73.09% in adults and children, respectively. Based on these findings, Cr, a trace metals present in the soil, has emerged as the primary contributor to cancer risk in MRGB. It is evident that neither adults nor children in the MRGB face significant carcinogenic risk.

A comparison of trace metals levels between the MRGB and Pakistani regions reveals that the MRGB experienced lower contamination levels. It primarily exhibited mild contamination in terms of geoaccumulation index and potential ecological risk, resulting in a lower risk to human health. In contrast, the Pakistani region demonstrated higher geoaccumulation index values, ecological risks, and greater threats to human health. However, oral intake remained the primary route of health risks in both regions, with children being more vulnerable to trace metal exposure. Notably, As emerged as a significant non-carcinogenic risk factor in both areas.

Tracing the sources of trace metals in soil

Analysis of correlation

The correlation analysis of soil trace metals in the MRGB, conducted using Spearman’s correlation test, is detailed in Table 5. The analysis indicated significant relationships between various trace metals. Notably, strong correlations existed among Cu, Zn, Ni, Cr, As, and Cd, as well as between Hg and Pb, As, and Zn. These findings are visually represented in Fig. 4, which depicts the regional distribution of pollution for each trace metal component.

Table 5 Correlations among trace metals in soil
Fig. 4
figure 4

Spatial distribution of soil trace metals

Principal component analysis

Principal component analysis (PCA) effectively integrates influencing factors, identifies the most significant contributing factors, simplifies the data structure, and enhances source identification. Notably, the KMO value was 0.643, exceeding the threshold of 0.5, and Bartlett’s sphericity test result of 0.001 confirmed the suitability of PCA for analyzing the dataset.

The results are presented in Table 6. Our analysis extracted three principal components that collectively explained 81.1% of the total data variance. These components form the basis for understanding the sources of trace metals in the soil and their distribution patterns. The first principal component, explaining 36.75% of the variance, featured key loadings for Cu (0.881), Zn (0.626), Ni (0.930), and Cr (0.925), all of which exhibited significant positive correlations at P≤0.01 level. Notably, the high loadings and correlations of Cu, Ni, and Cr suggested the same source. Additionally, Zn demonstrated substantial loading (0.703) in the second principal component. The second principal component, responsible for 28.41% of the total variance, exhibited significant loadings in Pb (0.816), Zn (0.703), Cd (0.739), and Hg (0.658). The third principal component, which explained 12.94% of the variance, was primarily loaded with As (0.904).

Table 6 PCA of surface soil trace metals

Cluster analysis

Cluster analysis leverages the strong similarity among trace metals within the same category and their minimal similarity across different categories to classify the data. The results of the cluster analysis for the eight trace metals in the soil, utilizing the intergroup linkage method, are depicted in Fig. 5. The data were categorized into three groups, each corresponding to one of three principal components: PC1, PC3, and PC2. The separation between these groups was maintained within the range of 15–20 units.

Fig. 5
figure 5

R-type clustering of trace metals in surface soils

Source identification

Soil trace metal sources can be categorized as anthropogenic or natural, and their identification relies on geostatistical analysis and spatial distribution patterns of trace metals. The PC1 factor loadings were predominantly associated with Cu, Zn, Ni, and Cr. Examining the spatial arrangement in Fig. 4, the regions with high values for these three trace metals, Cu, Ni, and Cr, were roughly distributed in specific areas. These areas included the east-west belt near Daming Biotechnology Co. Ltd., the central region near the Jiangyuan Textile Factory and Shouyang Chemical Factory, and the southern region between Yite Technology Co. and Xiquan Village. Additionally, variations in the contents of these four trace metals across different regions suggested that they primarily originated from soil parent materials. Other studies have demonstrated the influence of soil parent materials and rocks on trace metal distribution (Bhupander et al., 2012., Gasiorek et al., 2017).

The factor loadings of PC1 were mainly associated with trace metals Pb, Zn, Cd, and Hg concentrations. The spatial distribution of these four trace metals exhibited a relatively consistent pattern, with most of the concentration centers overlapping. These concentrations were predominantly located in the northern region near Daming Biotechnology Co. and Xinfeng Iron and Steel Factory, as well as in the central-eastern part of Qianjiazhuang, Jiangyuan Textile, and Shouyang. This suggests that the primary contributing factors were industrial pollution, transportation, and the application of agricultural manure and fertilizers. Therefore, in addition to natural sources of Zn, anthropogenic influences, such as industrial emissions and transportation, play a significant role in the distribution of these trace metals.

The spatial distribution of arsenic is concentrated primarily in the vicinity of the Huayi Electric Wire Factory, Jiangyuan Textile Factory, and Shouyang Chemical Factory in the central region, as well as in the southern part near Yite Technology and West Circle Village. Unlike Cu, Ni, and Cr, which are identified as anthropogenic source factors with a clear distribution in the northern region, As’ distribution is not well-defined in the north. This suggests that the primary contributing factors are industrial pollution, transportation, and the application of manure and fertilizers in agriculture (Yang et al., 2022). Transportation, industrial emissions, and coal combustion were the main anthropogenic sources of As. Given the prevalence of mining, industrial activities, and transportation operations in the area, coupled with the fact that communities and industries constitute the majority of the anomalous area for As, these three factors are considered the primary anthropogenic sources of As in the MRGB.

Conclusions

Given the significance of agricultural activities in the MRGB, soil quality is of paramount importance. This study comprehensively assessed soil trace metal pollution, including its characteristics, underlying causes, and associated health risks, in the MRGB. The key findings are summarized as follows:

  1. (1)

    Except for a few isolated points with Ni, Cr, and As, the average soil trace metal levels in WFC exceeded the background values, whereas Cu and Pb contents were lower. For all other trace metals at every point, the levels were below the national risk screening values, signifying localized high trace metal concentrations in certain areas of the MRGB.

  2. (2)

    The geoaccumulation index indicated that Cd and Hg were the most frequently occurring pollutants. More than 90% of the points exhibited non-pollution levels for Cu, Pb, Zn, Ni, Cr, and As. In terms of potential ecological risk, Hg had the highest single potential ecological risk index (84.47), reaching a strong ecological hazard level, followed by Cd (68.90) at a moderate ecological hazard level. The other six trace metals had low ecological hazard levels. Overall, the composite ecological hazard index for the MRGB fell into the medium ecological risk category.

  3. (3)

    The non-carcinogenic risk assessment indicated that soil trace metals in the MRGB posed acceptable health risks to both adults and children. As and Cr were the main non-carcinogenic risk factors that significantly contributed to the risk in both groups. In terms of carcinogenic risk, the overall risk for both adults and children was within tolerable limits, with Cr being the primary contributor. Oral ingestion was the main exposure route for both carcinogenic and non-carcinogenic hazards.

  4. (4)

    In the MRGB’s soil, Cu, Zn, Ni, and Cr were predominantly sourced from natural parent rocks, except for Zn, which also exhibited influence from industrial emissions and traffic-related anthropogenic sources. Pb, Cd, and Hg were primarily affected by traffic and the application of manure fertilizers and agricultural chemicals. As, on the other hand, was primarily attributed to coal combustion and industrial emissions.