Introduction

Polycyclic aromatic hydrocarbons (PAHs) are a class of organic compounds that are fused from two or more benzene rings, and they are persistent organic pollutants that are carcinogenic, teratogenic, and mutagenic (Menzie et al. 1992; Davis et al. 1993). The sources of PAHs in the environment include natural and anthropogenic sources and are mainly anthropogenic, such as human activity-related fossil fuels, industrial plant burning, and coking industries (Edwards 1983). Because PAHs are lipophilic, semi-volatile, and migratory, PAHs are released into various environmental media and are accumulated within the food chain, causing harm to the regional ecological environment and human health (Kelly et al. 2007). Therefore, it is necessary to study the concentration, distribution, and origin of PAHs in different regions and environmental conditions.

In recent years, a wealth of information has been accumulated from the investigation and source analysis of PAHs in multi-environment media such as rivers (Yunker et al. 2002; Shen et al. 2017; An et al. 2016), coastlines (Foster et al. 2015; Li et al. 2016), cities (Banger et al. 2010; Vane et al. 2014; Wang et al. 2013; Jiang et al. 2016), and farmland (Ping et al. 2007; Zhang et al. 2016; Jiang et al. 2011). The Issyk-Kul Lake Basin is located in the Kyrgyz Republic and is an important agricultural area. It is also one of the most important scenic spots in Central Asia (ADB 2009). With regard to the socio-economic development of the region, the emission of pollution caused by industry, tourism, and residential daily activities is bound to affect the level and distribution of PAHs in the region. In addition, relative to other environmental media, soils bear most of the PAH environmental load, and the majority of PAHs are absorbed by humans through soil pathways (Edwards 1983; Cachada et al. 2016). Therefore, it is of great practical significance to carry out research into regional soil PAH levels. However, the topsoil PAH pollution in this area has not been previously reported.

In this paper, the concentration and composition of 16 PAH compounds under the priority control of the US Department of Environmental Protection (US EPA) in the 30-point topsoil of the Issyk-Kul Lake area were first investigated, after which the PAH pollution sources were determined through diagnostic ratios and principal component analysis (PCA). The potential ecological risks of the regional PAHs were evaluated, and the toxicity of PAHs in the soil was evaluated using the benzo[a]pyrene (BaP) toxicity equivalent method. Through this study, basic data can be provided for regional environmental protection purposes, and the research results will be useful for the control and regional ecological environmental planning of PAHs.

Materials and methods

Sampling

The soil within the Issyk-Kul Lake Basin is primarily classified as arid soil in the west and semi-arid soil in the middle and the east, and it is mainly composed of loam, clayey loam, and sandy clay loam (Kulenbekov and Merkel 2012). The land type of the upstream region is largely wooded tundra while grassland, irrigated land, shrub land, dry land, and settlements typify the downstream surface cover (Kulenbekov and Merkel 2012). There are two main cities around the Issyk-Kul Lake Basin, including Karakol (the capital of the Issyk-Kul oblast) and Balykchy, and the total population amounts to 450,000 within the Issyk-Kul oblast (ADB 2009). A random sampling strategy was adopted to examine the contamination of PAHs in the basin (42.11506° N, 75.81626° E ~ 42.7839° N, 78.39108° E), and a total of 30 surface soil samples were collected in October 2014 (Fig 1). A clean, stainless steel iron spoon was used for sampling, and GPS was used for positioning. The samples were placed into clean polyethylene bags and taken back for laboratory analysis.

Fig. 1
figure 1

Geographical location of Issyk-Kul Lake and the sampling locations in the basin

Experimental analysis

An SPE column was selected as a silica gel column (6 cm3, 1 g, 30 μm, Waters Company, Milford). Silica gel and aluminum oxide were extracted, respectively, for 48 h with methanol and methylene chloride. The silica gel (100–200 mesh) was activated under 180 °C for approximately 12 h. The aluminum oxide was activated under 250 °C for approximately 12 h in a muffle. The working standard solution contained the following 16 US EPA priority PAHs: naphthalene (Nap), acnaphthylene (Acy), acenaphthene (Ace), fluorene (Flu), phenanthrene (Phe), anthracene (Ant), fluoranthene (Flu), pyrene (Pyr), benzo(a)anthracene (BaA), chrysene (Chr), benzo(b)fluoranthene (BbF), benzo(k)fluoranthene (BkF), benzo(a)pyrene (BaP), dibenzo(a,h)anthracene (DahA), benzo(g,h,i)pyrene (BghiP), and indeno(1,2,3-c,d)pyrene (IcdP). The deuterated recovery surrogates consisted of naphthalene-D8, acenaphthene-D10, phenanthrene-D10, chrysene-D12, and perylene-D12; and the internal standard was hexamethylbenzene. All of the glasswares were soaked with a chromic acid lotion, washed with tap water and deionized water, then dried. The glassware and anhydrous sodium sulfate were solvent-rinsed and heated for 4 h at 450 °C prior to use. All of the organic solvents were HPLC grade and were obtained from Supelco. The anhydrous sodium sulfate was of analytical grade.

The collected samples were freeze-dried, after which any large stones or noticeable organic matter in the soil samples, including leaves, roots, and plant matter, were picked out, and the soil samples were ground into 100 mesh. Accurate 5-g samples were weighed, 6 g of quartz sand was added with 3 g of activated copper powder to remove any sulfur, and the samples were spiked with chemical surrogates (four deuterated PAHs, i.e., naphthalene-d8, acenaphthene-d10, phenanthrene-d10, and chrysene-d12) to calculate the recovery rates. The mixtures were thoroughly incorporated and placed into an accelerated solvent extraction system (ASE, Dionex 100) for extraction. The extracting agent was methylene chloride, with which the mixtures were statically extracted twice under conditions of 150 °C and 1500 psi with an extraction time of 6 min each. After extraction, the whole eluting system was washed with methylene chloride, the extract was all collected, dried with anhydrous sodium sulfate to remove moisture, and then evaporated with a rotary evaporation apparatus (Buchi R-200, Flawil, Switzerland) until nearly dry, after which 5 mL of the solvent n-hexane was added for replacement.

The silica column (with a silica gel/aluminum oxide ratio of 2:1) was activated before sample extraction, successively leached with 10 mL of methylene chloride and 10 mL of n-hexane, and the eluent was rejected. The extract was subsequently applied to the silica gel glass column for cleanup and fractionation. First, the column was leached with 15 mL of hexane to remove compounds such as n-alkanes, after which the column was eluted with a 70 mL mixture of n-hexane (20 mL) and methylene chloride (50 mL), and the components were collected. A rotary was used to evaporate the components into a small volume, which were then redissolved in 5 mL of acetonitrile, blown into 0.5 mL under a gentle N2 stream, and finally stored at a temperature of −20 °C for instrumental analysis. Known quantities of the internal standard were added to the sample prior to instrumental analysis.

The PAH contents were analyzed with an Agilent 1200 high-performance liquid chromatograph (HPLC) that was equipped with a diode array ultraviolet detector (DAD) and coupled with a serial fluorescence detector (FLD). The separation column was a WATERS PAH C18 (4. 6 mm × 250 mm, size: 5 μm, Supelco, USA), the column temperature was 25 °C, the pump flow rate was 1 mL min−1, and the injected volume was 20 μL. The gradient elution conditions were as follows: 0 ~ 15 min, 60% acetonitrile and 40% pure water; 15 ~ 30 min, 60% ~ 100% acetonitrile; 30 ~ 40 min, 100% acetonitrile; and 40 ~ 40.1 min, 100% ~ 60% acetonitrile.

Based on the peak sequences of the standard substance and the retention time, the PAH components were qualitatively identified. Six different linear levels of standard PAHs were used to establish the standard curves, and the correlation coefficients between the concentration and the peak area reached higher than 0.999 for each compound. Individual PAH concentrations in the sample were obtained through a comparison of the peak area with the standard curves. Blanks, duplicate samples, matrix-spiked blanks, and internal standard additions were examined for quality assurance during the analysis. The recoveries of the surrogate standard addition were 75–100%, and the target compounds were not detected in the blank experiment. The procedure was also checked for recovery efficiencies by analyzing uncontaminated samples spiked with PAH standards with an average recovery of 73–116%. The limits of detection (LD) of PAHs were based on a signal-to-noise ratio (S/N) of 3:1 and ranged from 0.4–1.2 ng g−1, while levels observed at less than the LD were expressed as not detected (nd). The data obtained from the experiment were corrected by the recovery rate in this study.

The soil total organic carbon (TOC) content is an important factor in controlling the soil adsorption of PAHs (Wilcke and Amelung 2000), and thus, the TOC was also assessed in the soil samples. The freeze-dried samples were ground into 100 mesh, after which 0.3 ~ 0.5 g of the subsamples was weighed and the potassium dichromate volumetric method-external heating method was used to determine the organic matter content (Walkley and Black 1934).

Data treatment and statistical analysis

An IBM SPSS 20.0 (SPSS Inc., USA) was employed for the data treatment and multivariate statistical analysis. Kolmogorov–Smirnov (K–S) tests were used to determine the normality of the data. Spearman correlations were used to evaluate the relationships between the TOC and individual and total PAH concentrations, which were considered statistically significant if the p value was less than 0.05. PCA was performed with a varimax rotation to identify the potential sources of the PAHs. Kaiser–Meyer–Olkin (KMO) and Bartlett tests were used to evaluate the validity of the PCA. The source diagnostic indices were calculated from the PAH concentration ratios.

The health risk assessment of PAHs is based on the concentration of BaP, which poses the highest risk of cancer (Tsai et al. 2004; Halek et al. 2008). The toxicity of the 16 PAHs (TEQ16) in the study area was evaluated using BaP as a reference substance, and the toxicity equivalent factor (TEF) was introduced (Nadal et al. 2004). The calculation formula is given as follows:

$$ \mathrm{TEQ}16={\Sigma \mathrm{C}}_i\times {\mathrm{TEF}}_i $$

where C i is the concentration of each of the 16 PAHs (i = 1–16) (ng g−1), and TEF i represents the relative concentration of a single PAH relative to that of BaP (Nisbet and LaGoy 1992).

Results and discussion

Compositional characteristics

The concentrations of the 16 collective PAHs (Σ16PAHs), of the individual PAH, and the total concentration of the seven carcinogenic components (Σ7PAHs, including BaA, Chr, BbF, BkF, BaP, IP, and DBA) are presented in Table 1. The Σ16PAHs concentration was 68.58–475.95 ng g−1 with an average of 134.45 ng g−1. The Σ7PAHs concentration was 7.66–76.04 ng g−1 with an average of 30.97 ng g−1. Overall, the collective concentration of the four-ring PAHs was high and accounted for 43.2% of the total, followed by the combined concentration of the two- and three-ring PAHs at 23%. The five- and six-ring PAHs had the minimum proportion with a total of 15.8%. The average concentrations of the individual PAH were sorted in descending order, which was Pyr > Phe > Nap > IcdP > BaA > Ant > Ac > Flu > BbF > BghiP > DahA > Ace > Chr > BkF > BaP > Fl. The main individual PAH pollutants were Pyr and Phe, which accounted for 34.96 and 18.52% of the Σ16PAHs concentration, respectively. The coefficient of variation indicated that the Σ16PAHs concentration and the individual PAH concentrations were highly variable. The soil TOC was between 39.78 and 276.08 g kg−1 with an average of 77.99 g kg−1. A single-sample K–S normal distribution test showed that the TOC, Σ16PAHs, and individual PAH concentrations did not conform to a normal distribution.

Table 1 Concentrations (g ng−1) of 16 individual PAH in the typical topsoil of the Issyk-Kul Lake Basin

A PAH is a type of hydrophobic organic compound. Because of the strong adsorption capacity of soil organic matter onto PAHs, it is generally considered that there is a positive correlation between the soil organic carbon content and the concentrations of PAHs. The results of Spearman correlation analysis showed that there was no correlation among the TOC, Σ16PAHs, and individual PAH concentrations, which was probably because the local topsoil organic matter and PAHs did not achieve equilibrium adsorption, and because the concentrations of PAHs are affected more by the source intensity or other environmental factors (e.g., soil pH, soil particle size, black carbon content, soil disturbance, and leaching effects) (Wang et al. 2013, Jiang et al. 2011). The specific factors in this location need to be further studied and confirmed.

According to a classification based on the value of ∑16PAHs proposed by Maliszewska-Kordybach (1996), 75% of the samples in this study were classified as not contaminated and 25% were classified as weakly contaminated. The topsoil pollution level of PAHs in this study area was higher than that of the Bosten Lake Basin in Xinjiang, China (where ∑16PAHs ranged from 6.6 to 128.2 ng g−1 with an average of 57.8 ng g−1) (Shen et al. 2016), which is lower than that of the Yangtze River Delta farming soil in China (where Σ15PAHs ranged from 8.6 to 3881 μg kg−1 with an average of 397 μg kg−1) (Ping et al. 2007) and is also lower than that of densely populated urban areas, such as Shanghai (where Σ16PAHs ranged from 83.3 to 7220 μg kg−1 with a mean value of 1970 μg kg−1) (Wang et al. 2013), Shenzhen (with an average Σ16PAHs of 479.4 ng g−1) (Zhang et al. 2016), and Lanzhou (where Σ16PAHs ranged from 82.2 to 10,900 μg kg−1 with a mean value of 2360 μg kg−1) (Jiang et al. 2016) in China, Miami in the USA (where Σ16PAHs ranged from 1508 to 2364 μg kg−1 with a mean value of 1869 μg kg−1) (Banger et al. 2010), and London in the UK (where Σ16PAHs ranged from 4 to 67 mg kg−1) (Vane et al. 2014). This comparative analysis showed that PAHs in the study area were affected by regional human activities, but the overall pollution is low.

Diagnostic ratios for source analysis

The diagnostic ratio method is widely used in the determination of PAH pollution sources, of which LMW/HMW, Ant/(Ant+Phe), Flu/(Flu+Pyr), BaA/(BaA+Chr), and IcdP/(IcdP+BghiP) are the most commonly used (Yunker et al. 2002; Tobiszewski and Namieśnik 2012). At present, the double ratio method is commonly used for the source analysis and determination of PAHs (Tobiszewski and Namieśnik 2012; Jiang et al. 2016; An et al. 2016).

Through the diagnostic ratios of PAHs in the soil of the Issyk-Kul Lake area, including those of BaA/(BaA+Chr) and Ant/(Ant+Phe), IcdP/(IcdP+BghiP) and Ant/(Ant+Phe), and BaP/BghiP and COMB/Σ16PAHs, the sources of the PAHs were analyzed (Fig. 2). According to the summary analyses of previous studies (Yunker et al. 2002; Tobiszewski and Namieśnik 2012), a BaA/(BaA+Chr) ratio of <0.2 indicates a petrogenic source, >0.35 indicates a combustion source, and a ratio between the two represents a mixed source. An Ant/(Ant+Phe) ratio of <0.1 usually indicates a petrogenic source, whereas >0.1 indicates a source of combustion. An IcdP/(IcdP+BghiP) of <0.2 indicates a petrogenic source, whereas >0.5 indicates a source of grass, firewood, or coal combustion, and a ratio between the two indicates a source of fossil fuel combustion. The ratios of Ant/(Ant+Phe) in the study samples were in the range of 0.1 to 0.3, the ratios of BaA/(BaA+Chr) were from 0.69 to 0.96, and the ratios of IcdP/(IcdP+BghiP) were from 0.46 to 0.95. As shown in Fig. 2a and b, the diagnostic ratio plots of BaA/(BaA+Chr) and Ant/(Ant+Phe) and of IcdP/(IcdP+BghiP) and Ant/(Ant+Phe) indicated that the main sources of the PAHs were coal, wood, and grass combustion. A COMB/Σ16PAHs ratio of <0.3 indicates a petrogenic source, whereas a ratio between 0.3 and 0.7 indicates a mixed source, and ratio of >0.7 indicates a high-temperature combustion source (Ravindra et al. 2008). Conversely, BaP/BghiP >0.6 indicates fuel combustion, whereas <0.6 indicates non-traffic sources (Katsoyiannis et al. 2007). In the BaP/BghiP and COMB/Σ16PAHs diagnostic ratio plots (Fig. 2c), >80% of the points fell within the range of non-traffic sources to mixed combustion sources, whereas <20% were from traffic sources. This indicates that the sources of PAHs in the regional topsoil mainly originated from the mixed combustion of grass, wood, and coal, while traffic sources also contributed a small amount.

Fig. 2
figure 2

(a) Diagnostic ratio plots for BaA/(BaA+Chr) and Ant/(Ant+Phe) in the topsoil samples; (b) diagnostic ratio plots for IcdP/(IcdP+BghiP) and Ant/(Ant+Phe) in the topsoil samples; (c) diagnostic ratio plots for BaP/BghiP and COMB/PAHs in the topsoil samples

Principal component analysis

To further investigate the possible sources of PAH pollution in the study area, PCA was also performed on the data. A KMO value of >0.5 and a significant Bartlett test (<0.001) indicates that the results of the PCA analysis are effective. The KMO of the samples in the study area was 0.656 and the Bartlett significance test was 0, and thus, the PCA analysis was valid. The PCA results provided four principal components, which can explain 77% of the variances. The component matrix is shown in Table 2.

Table 2 Component matrix of a principal component analysis of PAHs in the topsoil samples of the Issyk-Kul Lake Basin

The first principal component variance percentage was 45.9%. The component matrix showed that BbF, BkF, BaP, and BghiP of five- and six-ring PAHs and Phe, Fl, Pyr, and Chr of three- and four-ring PAHs had higher contribution rates. Moreover, the high molecular weight four- to six-ring PAHs were mainly derived from high-temperature pyrolysis (Harrison et al. 1996), and the higher load components were mostly characteristic compounds of coal combustion at high temperatures (Simcik et al. 1999; Harrison et al. 1996). State agriculture is the backbone of the national economy, accounting for 30–40% of the GDP (ADB 2009). Other than the mining industry, however, industry has barely developed in the study region, so industrial input is not the main source of PAHs in the regional topsoil. Coal is currently the main good that is transported via the Issyk-Kul Lake (ADB 2009). Therefore, coal that is used for heating is the main source of PAHs in the topsoil of the area. The first principal component of PAHs in the PCA was interpreted as the high-temperature combustion of coal.

The second principal component variance percentage was 15.9%, and the component matrix showed that the loads of Nap, Ac, Ace, Flu, Phe, and Ant of two- and three-ring PAHs were high. The low molecular weight two- and three-ring PAHs were derived mainly from low-temperature combustion or petrogenic sources (Harrison et al. 1996; Jenkins et al. 1996). Nap is a common characteristic compound of the incomplete combustion of biomass (Khalili et al. 1995; Simcik et al. 1999), Ac is the characteristic compound of wood burning (Khalili et al. 1995), and Ace, Phe, Ant, and Flu are the characteristic compounds of coal that is not fully burned (Larsen and Baker 2003). Approximately 70% of the state’s population lives in rural areas, and they usually use coal, wood, and dry manure as fuel for heating in the winter (ADB 2009). Therefore, the combustion of coal, wood, and dry manure for heating in rural areas is another important source of PAHs in the topsoil. The second principal component in the PCA was interpreted as the medium- and low-temperature combustion of coal, grass, and wood.

The third principal component variance percentage was 8.7%; the loads of BaA and DahA in the component matrix were high. BaA is one of the markers of gasoline fuel combustion (Khalili et al. 1995; Harrison et al. 1996), and DahA is one of the markers of diesel fuel combustion (Simcik et al. 1999; Harrison et al. 1996). The beautiful natural environment attracts millions of tourists each year for sightseeing and drives the development of regional transportation (ADB 2009). Therefore, the combustion of liquid fossil fuels also contributes to the PAHs in the topsoil of this area. Thus, the third principal component of the PCA in this area was interpreted as traffic sources.

The fourth principal component variance percentage was 6.5%. The Ace load in the component matrix was high, and Ace is a major component of coking (Khalili et al. 1995; Simcik et al. 1999). Coking production is an important part in the development of the modern steel industry. The topsoil PAHs also showed a small amount of input from industrial sources, and thus, the fourth principal component was interpreted as regional industrial development.

The results of the above-mentioned PCA analysis were essentially the same as those of the diagnostic ratio method. Based on the integration of the two analysis results, it can be concluded that the PAHs in the study area were mainly derived from the mixed combustion of coal, wood, and grass. Meanwhile, traffic and industrial development also contributed, albeit in a smaller proportion.

Ecological risk assessment

There are two types of commonly used ecological risk assessment criteria: the ecological effects range low (ERL) plus the biological effects range median (ERM) (Long et al. 1995) and the threshold effects level (TEL) with the predicted effects level (PEL) (Macdonald et al. 1996). A value less than the ERL (or TEL) indicates that the pollutants rarely produce ecological risks. A value between the ERL and ERM (or between the TEL and PEL) indicates that the pollutants occasionally produce ecological risks. When the value is greater than the ERM (or PEL), it indicates that the pollutants often produce ecological risks. The concentrations of the 16 individual PAHs at the sampling points were all lower than the ERL value (Table 3). The Nap and BaP values at the sampling points were lower than the TEL (Table 3). The Ac values were lower than the TEL value and accounted for 73.3% of the total, and those between the TEL and PEL accounted for 26.7%. The Pyr values that were lower than the TEL value accounted for 96.7%, and those between the TEL and PEL accounted for 3.3%. The DahA values that were lower than the TEL value accounted for 96.7%, and those between the TEL and PEL accounted for 3.3%, as shown in Table 3. It can be seen that 26.7% of the Ac points and 3.3% of the Pyr and DahA points occasionally produce ecological risks.

Table 3 Ecological risk assessment of PAHs in the topsoil of the Issyk-Kul Lake basin

Toxicity evaluation

The results of calculations reveal that the TEQ16 value in this region was between 2.48 and 13.78 ng g−1 with an average value of 6.23 ng g−1, which is lower than that of the Dutch agricultural PAH pollution threshold of 33 ng g−1, and thus, it currently represents no health risk. The contributions of the 16 individual PAHs to the total toxic equivalent were as follows: DahA (35.02%) > IcdP (20.15%) > BaP (18.25%) > BaA (13.87%) > BbF (7.16%) > BkF (2.27%) > Ant (0.86%) > Pyr (0.78%) > BghiP (0.48%) > Phe (0.47%) > Nap (0.24%) > Chr (0.22%) > Flu (0.09%) = Ac > Ace (0.04%) > Fl (0.01%). PCA showed that DahA and BaA originated mainly from traffic discharge, and IcdP and BaP were sourced mainly from coal and wood combustion. This suggests that the toxicity of PAHs in this area originated mainly from traffic fuel combustion, followed by coal, wood, and other biomass.

Conclusions

PAHs in soil samples from the Issyk-Kul Lake Basin were studied in conjunction with the total organic carbon (TOC) for the first time. The Σ16PAHs concentration in the topsoil of the Issyk-Kul Lake area was 68.58–475.95 ng g−1, for which the average value was 134.45 ng g−1, and they showed no correlation with the TOC. Pyr and Phe were the two main individual PAHs. The topsoil PAHs were mainly derived from the mixed low- to high-temperature combustion of coal, wood, grass, and other biomass. Traffic and industrial input only had small contributions to the regional PAHs. Compared to the pollution level proposed by Maliszewska-Kordybach in 1996, uncontaminated soil samples accounted for 75% and slightly contaminated soil samples accounted for 25% of the studied soils. An ecological risk assessment showed that 26.7% of Ac and 3.3% of Pyr and DahA in the sampling points had potential ecological risks, and a toxicity assessment based on BaP indicated that the topsoil PAHs in the region currently pose no health risks.