Introduction

Polycyclic aromatic hydrocarbons (PAHs), a family of compounds which possess two or more fused benzene rings, are a common cause of environmental contamination (Pietzsch et al. 2010). Their specific hazard derives from their known carcinogenic, teratogenic, and mutagenic potential (Y. Liu et al. 2009; Ruiz et al. 2011). Owing to their high hydrophobicity and slow rate of degradation, PAHs are readily adsorbed onto particulate matter, which may end up being incorporated into river and lake sediments. The bulk of anthropogenic pollution by PAHs derives from incomplete combustion of biomass and fossil fuel, to which overlap occasional oil spills (Tobiszewski and Namieśnik 2012). As each of the various sources of pollution is associated with a characteristic profile of PAHs, it is often possible to infer the origin of the pollution (Mostafa et al. 2009; Yunker et al. 2012). Establishing the relative presence of low-molecular-weight PAHs carrying two or three fused benzene rings compared with higher-molecular-weight PAHs carrying four to six rings allows the differentiation between pyrogenic and petrogenic sources of PAH pollution (Brown and Peake 2006). Another widely used technique to identify sources is the diagnostic ratios (Bouloubassi and Saliot 1993; Dickhut et al. 2000). A more sophisticated statistical approach used for source identification is represented by principal component analysis (PCA) in conjunction with multivariate linear regression (MLR); this allows both the identification and apportioning of the various PAHs present in sediments, suspended matter, and aerosols.

The Pearl River Delta (PRD) is an area with a high concentration of organic contaminants, including PAHs, affecting air and water quality and the whole food chain (Fu et al. 2003). The Beijiang River, the second largest tributary of the Pearl River, provides a major source of drinking water (Chen et al. 2009), as well as for irrigation and industry in the PRD and Guangdong province. The rapid industrial growth in the region has increased the discharge of waste materials into the Beijiang River (Song et al. 2011). Environmental protection requires knowledge of both the impact and the source of pollution. Here, we provide basic information regarding the level and origins of the PAHs accumulated in the Beijiang River sediment, using a variety of identification techniques, namely diagnostic ratios and PCA and MLR analysis.

Materials and methods

Study area and sampling strategy

The sampling sites were defined using Global Positioning System (Table 1), and a set of 12 sediment samples was collected over the period 10–15 January 2009 (Fig. 1). The top 5 cm of the sediment surface was collected and transferred into a 100 ml screw cap glass jar, then homogenized and stored at 4 °C in the dark (Shukla and Upreti 2009). The material was then air-dried by exposing it to room temperature (25 ± 1 °C) for 2 weeks and finally sieved through a 2 mm mesh. A further 30 samples were analyzed by Jing et al. (2009).

Table 1 Details of the sampling stations
Fig. 1
figure 1

Map of study area and sampling stations

Chemical analysis and quality control

The sample extraction and PAH clean-up methods followed the relevant protocols given in US Environmental Protection Agency (EPA) Standard Methods 3540c and 3620b (USEPA 1996a, 1996b). Each 10-g sample was mixed with 10 g anhydrous sodium sulfate, then extracted by the addition of 80 ml of dichloromethane/acetone (v/v 1:1) for 16–24 h at 65 °C to which activated copper was added for desulfurization. The extract was concentrated to 2 ml by a rotary evaporator and the solvent exchanged to hexane. The hexane extract was fractionated and cleaned up by a silica gel/alumina (v/v 1:2) column. The column was eluted with 75 ml of n-hexane/dichloromethane (v/v 3:7) to obtain PAH. The PAH fraction was finally concentrated to 1.0 ml under a gentle stream of nitrogen (USEPA 1996c).

The 16 PAHs identified by US EPA as priority pollutants (naphthalene (NAP), acenaphthylene (ANY), acenaphthene (ANA), fluorene (FLU), phenanthrene (PHE), anthracene (ANT), fluoranthene (FLT), pyrene (PYR), benzo[a]anthracene (BaA), chrysene (CHR), benzo[b]fluoranthene (BbF), benzo[k]fluoranthene (BkF), benzo[a]pyrene (BaP), indeno[1,2,3-cd]pyrene (IPY), dibenz[a,h]anthracene (DBA), and benzo[g,h,i]perylene (BPE)) were detected in sequence, using analytical methods based on US EPA Method 8270c (USEPA 1996d). Quantification of individual PAHs was achieved by gas chromatography (Hewlett Packard 5890) coupled to a GC/MS device (Hewlett Packard 5973). A silica fused capillary column (HP-5, 50 m × 0.32 mm (i.d.) × 0.17 μm (film thickness)) was used to separate the target analytes. The oven temperature was raised from 45 °C to 200 °C at a rate of 3 °C per min, then 200 °C to 285 °C at 5 °C per min, and held for 30 min. The injector temperature was maintained at 280 °C. Each 2-μl sample was delivered by manual splitless injection. Additional details of the chromatographic and spectrometric conditions are described elsewhere (Mai et al. 2002).

The identification of compounds was based on retention times recorded for calibration standards. The concentration of each analyte was determined in the form of nanograms per gram dry weight of sediment. Quality control was ensured by including blanks (solvent only), duplicate samples, and spiked blanks. The instruments were re-calibrated daily. The PAH recovery rates were quantified by using standard reference materials provided by NRCCRM (China National Research Center for Certified Reference Materials) and lay in the range 83–133 %. The total organic matter content (TOM) of each sample was determined by the method of loss in weight by ignition, as described by Byers et al. (1978).

Diagnostic ratio analysis

Since PAH concentration ratios can provide an important insight into the source of PAH contaminants, the ratios ANT/(ANT + PHE), FLT/(FLT + PYR), BaA/(BaA + CHR), and IPY/(IPY + BPE) were used to infer the likely origin of the sedimentary PAHs (Sienra et al. Sienra and d. R., Rosazza, N. G., and Préndez, M. 2005; Jiang et al. 2009; Yunker et al. 2012). An ANT/(ANT + PHE) greater than 0.1 implies a petrogenic (associated with petroleum) source, while a ratio below 0.1 is indicative of combustion. Similarly, an FLT/(FLT + PYR) of less than 0.4 implies a petrogenic origin, of 0.4–0.5 indicates petroleum combustion, and >0.5 grass, wood, or coal combustion. A BaA/(BaA + CHR) ratio below 0.2 indicates a petrogenic source, while one of 0.2–0.35 implies petroleum combustion. An IPY/(IPY + BPE) ratio of less than 0.2 indicates a petrogenic source, of 0.2–0.5 implies petroleum combustion, and above 0.5 implies grass, wood, and coal combustion (Jiang et al. 2009).

Principal component analysis and multiple linear regression

PCA and MLR analysis were employed to quantify the relative contribution of each potential PAH source (Li et al. 2012; Yunker et al. 2011). The following seven steps are involved: (1) Bartlett’s test of sphericity and the Kaiser–Meyer–Olkin (KMO) measure of sampling adequacy to test the appropriateness of the data for PCA analysis (Bagozzi and Yi 1988); (2) the calculation of the correlation coefficients and the resulting covariance matrix; (3) the calculation of eigenvalues and their corresponding eigenvectors; (4) the discarding of components which only accounted for a small proportion of the variation; (5) the development of the PCA loading matrix and the performance of a Varimax rotation on the factor loading matrix to infer the principal components (PCs); (6) the calculation of the absolute PC scores (APCS) (Guo et al. 2004; Kowalkowski et al. 2006); and (7) the implementation of MLR using the de-normalized APCS values produced by the PCA as the independent variables and the measured concentrations of the particular pollutant (Simeonov et al. 2003). A more detailed explanation of the procedure can be found in Thurston and Spengler (1985) and Ouyang et al. (2006).

The PCA was performed using routines within SPSS v17.0 software (SPSS Inc, USA), selecting only Varimax rotation and PCs associated with an eigenvalue >1. Following Liu (2003), factor loadings of >0.75, 0.50–0.75, and 0.3–0.5 were interpreted as representing, respectively, “strong,” “moderate,” and “weak”. The factor scores were used to determine the relative contribution of each source by MLR. In all, 42 sediment samples were analyzed in this way.

Results and discussion

Pollution characteristics

Concentration values and basic statistics of the 16 key PAHs are synthetically presented in the box–whiskers plots of Fig. 2. Among the metals, PYR has the highest values (2.56–1,262 ng g−1), followed by BkF (0.66–1,090 ng g−1), BbF (2–1,060 ng g−1), BaA(1.1–1,038 ng g−1), and FLT (2–1,049 ng g−1). The aggregate PAH concentrations ranged from 44 to 8,921 ng g−1 (mean 1,132.3). According to Baumard (2009), values <100 ng g−1 can be considered to reflect a low level of pollution, those in the range 100–1,000 ng g−1 a moderate level, those in the range 1,000–5,000 ng g−1 a high level, and those >5,000 ng g−1 a very high level. On this basis, only five of the samples were classified as being polluted at a low level and 23 as moderately polluted, and 12 samples are at high pollution level. The pollution levels of as highly polluted and two (S5 and BS3 are considered to be) as very high, and it clearly associated with point source pollution, because it was highly polluted. The S5 site is located in a major industrial zone of Qingyuan City and the BS3 site downstream of the outfall of the Shaoguan smelting plant. Overall, the PAH contamination level in the Beijiang River sediment was moderate to high. The Beijiang River sediments are more polluted than those from the Dongjiang (100–3,400 ng g−1, mean 880 ng g−1 (K. Zhang et al. 2011)), from the Yellow River (111–205 ng g−1, mean 116 ng g−1 (Ning-jing et al. 2010)), and from the Daliao River (62–840 ng g−1, mean 287 ng g−1, (Li et al. 2012)), but lower than those from the Fenhe River (539–6,282 ng g−1, mean 2,215 ng g−1). Since the predominance of low-molecular-weight PAHs indicated a petrogenic source and the higher-molecular-weight PAHs, a pyrogenic source, the relative amounts of these fractions were then determined (W. Zhang et al. 2008; Lichtfouse et al. 1997; Soclo et al. 2000). The composition of the PAHs is illustrated in Fig. 3. Higher-molecular-weight species predominated, contributing >50 % of the PAH content in 34 of the 42 sites. This suggests that the origin of the PAHs in the study area has derived mainly from the incomplete combustion of fossil fuel and wood (W. Zhang et al. 2008).

Fig. 2
figure 2

Concentrations of PAHs in sediments samples

Fig. 3
figure 3

Percentage level of two- to three-ring and four- to six-ring PAHs in sediment of Beijiang River

Diagnostic ratio analysis

The results have been presented in the form of a cross plot contrasting ANT/(ANT + PHE) with FLT/(FLT + PYR), and BaA/(BaA + CHR) with IPY/(IPY + BPE). The FLT/(FLT + PYR) ratio was >0.4 except at sites S1 and S7 (Fig. 4a), while the ANT/(ANT + PHE) ratio varied rather widely (although it was generally >0.1). Site S1 and S7 stood out as a result of their low level of PAH pollution. Neither of these sites lies in industrial zones. The ratios indicated a mixed origin of the PAHs, although the predominant one appears to be pyrogenic, with some contribution from petrogenic sources, probably accidental oil spills. Thus, the BaA/(BaA + CHR) ratio was >0.2 for most of the samples while the IPY/(IPY + BPE) ratio varied from 0.35 to 0.55, diagnostic of a pyrolytic origin (Fig. 4b). The predominance of pyrolysis as the major source of PAHs has been previously noted by Pies et al. (2008) and Men et al. (2009). Other diagnostic ratios, used to distinguish petrogenic from pyrolytic origin include FLT/(FLT + PYR), BaA/(BaA + PYR), IPY/(IPY + BPE), which are described elsewhere (Ravindra et al. 2008; Tobiszewski and Namieśnik 2012).

Fig. 4
figure 4

Diagnostic ratio plot of ANT/(ANT + PHE) versus FLT/(FLT + PYR) (a) and BaA/(BaA + CHR) versus IPY/(IPY + BPE) (b)

PCA/MLR

The KMO value was 0.76, indicating that the PCA can be expected to be informative in reducing the dimensionality of the data. The component loadings of the three PCs retained are given in Fig. 5a, b, c and the correlations between the measured concentrations of the 16 PAHs in Table 2. The correlation efficient matrix demonstrated that nearly all of the variables were significantly and positively correlated with one another; the exception was ANA, a low-molecular-weight (two- to three-ring) group of compounds. Its poor correlation with the other PAHs may reflect a rather different origin for this pollutant, perhaps from quite remote locations via atmospheric transportation (Chung et al. 2007). As semi-volatile organic compounds, the property of PAHs vapor pressure determined the compounds vapor–particle partitioning, representing the available fraction for long-range atmospheric transport and phase transfer into sediment organic pollutions (Agarwal et al. 2009; W. Wang et al. 2011b). The correlations between the various PAHs may aid in identifying the sources of the pollution.

Fig. 5
figure 5

Component loadings for three components (PC1 (a), PC2 (b), PC3 (c))

Table 2 Correlations matrix of the 16 priority PAHs

Table 3 summarizes the PCA. Three PCs were identified, individually accounting for 77.8 %, 11.3 %, and 8.2 % of the total variance, and taken together for 97 %, and individually. Each of the PCs showing a loading value >0.75 is highlighted. PC1 had a high contribution from ANT, FLT, PYR, BaA, CHR, BaP, and IPY, all of which (except for ANT) are high-molecular-weight PAHs. FLT, PYR, and BaP are markers of coal combustion (Yang et al. 2013), IPY of automobile exhaust, BaA of diesel exhaust, and CHR of diesel and natural gas combustion. Thus, PC1 was largely reflective of fossil fuel combustion. PC2 was heavily influenced by NAP and BkF; the latter is associated with diesel exhaust and the former with oil spillage (C. Wang et al. 2011a; Y. Liu et al. 2009). As suggested by the correlation co-efficient matrix, NAP was significantly correlated with BkF; a possible explanation for this correlation is that both pollutants emanated from shipping along the river, in the form of exhaust emission and oil leakage. As a result, PC2 may well reflect diesel emissions plus oil spillage. PC3 was dominated by ANY and ANA, both of which are low-molecular-weight PAHs, diagnostic of coke combustion (Bixiong et al. 2006). The PCA indicated that the Beijiang River sediment PAHs have arisen mostly from combustion, in good agreement with the analysis based on PAH ratios.

Table 3 Rotated component matrix of 16 PAHs

The relative contributions of the three major origins of PAH pollution were estimated using MLR, based on the model:

$$ y={\displaystyle \sum {a}_i{X}_i}+b $$
(1)

This model is valid only if there is no collinearity in X i. Non-collinearity, by definition, is ensured by selecting the PCA factor scores as the independent variables, X i. The dependent variable y is the overall PAH concentration. The influence of each dependent variable on the independent variable can be directly compared via the regression coefficients, provided that the independent and dependent variables are normally standardized, at which point, the regression coefficients can be represented as B i (the intercept b is at Y = 0). The standardized form of the model is represented by:

$$ Y={\displaystyle \sum {B}_i}{X}_i $$
(2)

where Y is the standardized normal deviate of the overall PAH concentration. The normal standardization equation is:

$$ Y=\left({\displaystyle \sum PAH}-\overline{{\displaystyle \sum PAH}}\right)/{\sigma}_{{\displaystyle \sum PAH}} $$
(3)

Where ∑ PAH is the total PAH concentration, \( \overline{{\displaystyle \sum PAH}} \) is the ∑ PAH mean of ∑ PAH, and σ ∑ PAH is the standard deviation of ∑ PAH.

The regression of the three leading PCs (X 1X 3), which represent fossil fuel combustion, diesel emission plus oil spillage, and coke combustion, were regressed against Y to give the relationship:

$$ Y=0.786{X}_1+0.536{X}_2+0.305{X}_3 $$
(4)

The associated R 2 value was 0.999, and the P values associated with the regression coefficients were all <0.05, indicating that the MLR expression gave a good representation of the relative contributions. The percentage contributions to the mean for the three factors were calculated from 100*Bi/ƩBi, producing estimates for the mean relative contributions of fossil fuel combustion, diesel emission plus oil spillage, and coke combustion of, respectively, 48 %, 33 %, and 19 %.

Conclusions

The present study has provided important data on the distribution characteristics of PAHs in the sediments of the Beijiang River. To minimize the selection uncertainties of source profiles and obtain the higher model performance, diagnostic ratio analysis and PCA/MLR were applied to estimate source contributions for sediment PAHs from the Beijiang River system. The derived individual PAH ratios suggested that most of this pollution has arisen from pyrogenic activity, while the PCA/MLR indicated that fossil fuel combustion, oil spillage, and coke burning have provided most of the locally produced PAH, with the former activity being the single most important contributor. Identifying the major emission sources is an essential step in seeking to control, manage, and ameliorate environmental pollution.