Introduction

A geochemical baseline of trace elements provides the means of defining the natural spatial variations in the geochemistry of the earth’s surface materials like soil and sediment. Meanwhile, it can distinguish between the pedogenic origin and the anthropogenic origin of trace elements in the environmental compartments (Darnley 1997; Salminen and Tarvainen 1997). However, the term ‘geochemical baseline’ has not yet been defined clearly. Since the first definition proposed by Tidball et al. (1974), many ways of defining the geochemical baseline have been suggested (Beus and Grigorian 1977; Rose et al. 1979; Ebens and Connor 1980). Among them, the most commonly used was the natural abundance of an element in a particular material such as soil, sediment, and rock with respect to a particular area or dataset. However, this definition was used mainly in exploration rather than environment because it generally refers to a single value (Salminena and Gregorauskienè 2000). Prior to a definition of geochemical baseline in environment, it is crucial to acknowledge the natural variability of any analyte in a given matrix under the aspect of the migration of chemical compounds and elements within and between individual environmental compartments. Therefore, a geochemical baseline or geochemical background should be characterized by regional variability and it is a function of time (Matschullat et al. 1999).

The normalization procedure is a widely applied method to obtain the regional geochemical baseline particularly in sedimentology recently (Covelli and Fontolan 1997; Donoghue et al. 1997; Tam and Yao 1998; Hamon et al. 2004). In this approach, the measured element concentrations have to be fitted with the reference element that is not influenced by anthropogenic activities. This is done by linear regression so that the baseline can be estimated for every single point that confirms to the regression conditions. The samples that lie beyond the confidence interval (95%) will be posted as anthropogenically influenced. However, the selection of the reference element is not universal but mostly depends on local geological and mineralogical features (Tam and Yao 1998; Hamon et al. 2004). Aluminum, a major constituent of the aluminosilicate mineral fraction, is often used as the reference element (Covelli and Fontolan 1997; Donoghue et al. 1997; Miko et al. 1999). Iron, a clay mineral indicator element, has been used as an alternative to Al by Tam and Yao (1998) and Tsai et al. (2003). In addition, Ti and Sc have also been used as reference elements due to their characteristic of geochemical stability in the environmental compartments (Teng et al. 2003). Another approach of calculating the baseline is relative cumulative frequency (RCF), which is widely used as well (Bauer et al. 1992; Bauer and Bor 1995). This technique does not require any assumption concerning the distribution function and simply applies the curves of the individual elements to display the RCF linearly. The first turning point of the curve can therefore be defined as the upper limit of any background data collective (Matschullat et al. 1999).

Since the geochemical baseline for the soils of Hong Kong was not investigated in the past, the target value of the Dutch List that was used to distinguish between clean and polluted soil in The Netherlands was usually referred in screening the polluted soil (Wong et al. 1996; Chen et al. 1997; Tam and Yao 1998). The calculation of the mean value for the heavy metals that have been collected from the countryside is an alternative way to determine the background of the soils (Li 2001; Li et al. 2001). However, such methods seem to be inappropriate in determining the soil pollution precisely. The main objective of this study is to establish a geochemical baseline for the soils of Hong Kong by application of the normalization procedure. Furthermore, it was assumed to be useful in defining the geochemical baseline for the soils under a similar environment.

Materials and methods

Study area descriptions

Hong Kong is located between 22°08′–22°35′ north latitude and 113°49′–114°31′ east longitude, with a total land area of 1,076 km2 in the southeastern tip of the Chinese mainland. It covers Hong Kong Island, Kowloon, New Territories and a number of outlying islands and Lantau Island is the biggest one. Three–quarters of the land areas is covered by countryside where most of the areas are managed as country parks and natural reserve areas. The limited agricultural land is scattered irregularly in the marginal land of New Territories (AFCD 2002). Geologically, Hong Kong is located in the extension to the South China Caledonian folding-orogenic belt and belongs to the part of west Pacific magmatic mobile belt of Mesozoic. The pre-Devonian basement is unexposed in the district and only a few strata of middle Devonian, lower Carboniferous, Permian, Jurassic, and Cretaceous are exposed (Lee et al. 1999). The middle and later Jurassic volcanic rocks in particular of the tuffs are the most widely distributed in Hong Kong, and Granite, Granodiorite, adamellite, feldspathic, porphyry of middle-late Yanshanian phase are well developed (Peng 1986). There is an anticlinorium with the axial direction of northeast–southwest in the regional tectonics. The mineral resources are composed of metallic and non-metallic deposits, with the former found in Hong Kong that includes pyrometasomic, pegmatite, hydrothermal replacement, and fissure filling, while the latter includes graphite, kaolin, clay, feldspar quartz, building stone, and sand (Peng 1978).

Major soil types of Hong Kong consist of Ferralisols, Ferrisols, Luvisols, Cambisols, Gleysols, Entisols, and Anthrosols according to the classification of Chinese soil taxonomy (CASNNFC 2001). The climate of Hong Kong is generally described as subtropical monsoon. The mean monthly temperature ranges from 15.9°C in January to 28.9°C in July, with the extreme minimum and the maximum of 0 and 36.2°C, respectively, based on the records of the Hong Kong Observatory (2001). Rainfall is also highly seasonal and 77% falls between May and September, with the mean annual rainfall of 2,214 mm.

Soil sampling and chemical analyses

Soil profiles were collected based on the parent material, land using, vegetation types, and elevations from the countryside of Hong Kong including the New Territories, Hong Kong Island and Lantau Island. Soil layers were sampled according to the soil genesis sequence in each profile and a total of 271 horizon samples representative of 51 soil profiles were collected (Fig. 1). The collected samples were air-dried at room temperature for a few days, screened through a 2-mm sieve and then stored in glass bottles prior to analysis. Samples were digested with 60% perchloric acid, 40% hydrofluoric acid, concentrated nitric acid and concentrated hydrochloric acid (Page et al. 1986). Concentrations of Al, Co, Cr, Cu, Fe, Mn, Ni, Pb, Ti, Sc, and Zn were analyzed by inductively coupled plasma atomic emission spectroscopy (ICP-AES; Perkin-Elmer 3300 DV), while concentrations of As, Cd, Hg, and Se were analyzed either by a graphite furnace atomic absorption spectroscopy (GFAAS; Varian 220Z) or cold vapor atomic absorption spectroscopy (CVAAS; CETAC M-6000A). For quality control, reagent blanks, replicates, and standard reference material (GSS-4 soil purchased from the National Research Center for Geoanalysis of China) were incorporated to detect contamination and assess precision and bias. The results showed no sign of contamination and revealed that the precision and bias of the analysis were generally below 10%. The recovery rates for the elements in the standard reference material (GSS-4) ranged from 75 to 115%. Mineralogical analyses of the selected soil samples were performed by X-ray diffraction (XRD) (D/max-IIIC).

Fig. 1
figure 1

Map of soil samples locations

Statistical analysis

Statistical analysis such as cluster analysis and cumulative frequency distribution was performed by the SPSS 11.0 for Windows. The Kolmogorov–Smirnov (K–S) test was applied to test the normal distribution for both raw and log-transformed data. The non-normal distributed data were log-transformed prior to further data analyses. Since the concentration of elements varied to a great extent, the raw data were standardized to the Z score (with a mean of 0 and a standard deviation of 1) before execution of the cluster analysis. Cluster was performed with the method of weighted average linkage between the groups. Pearson correlation for the cluster intervals and the elements that showed a close correlation were identified and grouped for further analysis.

Results and discussion

Mineral composition of Hong Kong soils and the reference element selection

Particle size analysis revealed that the texture of Hong Kong soil varied to a great extent from clay to sand most of which was loam. Kaolinite dominated the clay mineralogy in the soils, other clay minerals such as vermiculite, micas, gibbsite, quartz, goethite, montmorillonite, and mixed-layer clay minerals were also identified by XRD. It indicated that a high degree of weathering had occurred in the soils of Hong Kong (Duzgoren-Aydin et al. 2002; Duzgoren-Aydin 2003).

In general, as the intensity of weathering increases, Ca, Na, K, and Si decrease, whereas Fe, Ti, and Al together with some trace elements increase. The relative enrichment of these elements can be mostly attributed to the formation of secondary phases, particularly sesquioxides and clay minerals, upon which most major elements and some trace elements can be fixed during weathering (Malpas et al. 2001). In addition, the geochemical behavior of trace elements was also controlled by the type and abundance of clay minerals. Duzgoren-Aydin and Aydin (2003) observed the enrichment of Cu, Ni, and Pb in the clay minerals with a higher degree of weathering in Hong Kong as a result of the relative abundance of Mn-oxides, which hold a high-sorption affinity. Since the reference element for the determination of the geochemical baseline must be an important constituent of one or more of the major trace element carries, Fe, Ti, Al, and Mn were therefore selected to be the potential reference elements based on the result of clay mineralogy analysis. Meanwhile, Sc was also selected as the potential reference element due to its relative immobility. Correlation analysis between the potential reference elements and elements of concern was performed as an aid to making the correct choice (Table 1). On the whole, Fe seems to be the best reference element as it has a significant positive relationship with all the elements of concern and would be used in the following normalization procedure.

Table 1 Correlation matrix for potential reference elements with concerned elements

Geochemical baseline of Hong Kong soil

The linear relationship between the contents of each element of concern and Fe contents was determined in the form of y = ax + b, where y is the value of the element, x is the Fe content in the samples. As proposed by Covelli and Fontolan (1997), the data lying out of the 95% confidence band were eliminated, and then build a new linear equation with the updated dataset until all the data were within the 95% confidence band. Table 2 shows the final equations for predicting the geochemical baseline of Hong Kong soil. Based on the equations and the measured Fe contents, a predicated value for each element at each sample location was computed. It was therefore defined that the predicated mean value as the critical value and mean ± 2δ as the range of the geochemical baseline for each element (Covelli and Fontolan 1997; Reimann and Filzmoser 1999), where δ is the standard deviation of predicated dataset. Table 3 is the predicted geochemical baseline of Hong Kong soil. It could be observed that the predicted baseline was close to the measured data according to the mean values for most of the elements, as all the samples were collected from the countryside areas, where the effect of industrial activities was relatively low to the soils on the whole. However, an exception was found with As, the measured mean value was over two times higher than the predicted data. This may probably result from the agricultural activities in the past as the inorganic forms of As were used extensively in agriculture such as insecticides, herbicides, fungicides, desiccants, defoliants and as additives to animal feeds prior to 1970s in Hong Kong (AFCD 2002). Chen et al. (1997) also found a relative higher concentration of As in the soils of agricultural lands and forestlands in Hong Kong. However, the upper limit derived from the normalization procedure for each element was much lower than the counterpart of the measured data. It implied that the impact of anthropogenic activities on the trace element contents of the soil could be eliminated by the normalization procedure, whereas the natural variability caused by the pedogenic process was not covered up. It could be represented by the range of the baseline with such a method.

Table 2 Linear relationship between trace metals and Fe
Table 3 Geochemical baselines computed by the normalization procedure and measured results

Geochemical baseline obtained from relative cumulative frequency method as a comparison

The technique of RCF curve was conducted in order to make a validation for the results of the normalization procedure. Figure 2 shows the RCF curves for each element. As shown in the curves, at least one turning point was indicated in each curve. The mean value of the first band of the slope was regarded as the critical value of the baseline as proposed by Matschullat et al. (1999). The result is shown in the Table 4. It could be observed that the baseline obtained from the RCF method was much identical to that of the normalization procedure based on the mean value. This implied that it is feasible to use the element of iron as the reference element in estimating the geochemical baseline of Hong Kong soil.

Fig. 2
figure 2

Relative cumulative frequency curves indicates the upper limits of the geochemical baseline for the trace elements of Hong Kong

Table 4 Geochemical baseline estimated by relative cumulative frequency and the reference background applied for the Hong Kong soil

In addition, the reference background value applied usually to the soils of Hong Kong in the past was provided as a comparison in Table 4. It is obvious that the predicted geochemical baseline is much lower than the A-value of the Dutch List (VROM 2000), which means that heavy metal pollution might be underestimated for the Hong Kong soil when the A-value was referred to. The predicted geochemical baseline was also different from the background of Guangdong soils for most of the elements, although the two areas are related geographically and geologically. As for the background value of Guangdong soils, a mean value was computed from the dataset covering the whole territory in which the land uses and parent rock were much more complex than those of Hong Kong (Xia et al. 1987). Therefore, it could vary greatly between the two areas in geochemistry of the trace elements as shown in Table 4. However, the background (or baseline) would not be estimated appropriately without a proper sampling scheme or data processing even if the dataset were derived from the local soils. Li et al. (2001) applied the mean value of the heavy metal from the countryside as a background. In this case, the soil pollution for Cd, Cu, and Pb might be overestimated since the samples collected were not distributed as extensively as the whole area of Hong Kong. On the other hand, log-normal distribution occurred for most of the elements in the soils, hence, the mean value would be affected by the extreme of the dataset. This might also result in the over- or under-estimation of the background values (Reimann and Filzmoser 1999).

Geochemical association of the trace elements in Hong Kong soil

Geochemical association of trace element in the soil is mainly determined by the regional geological features, pedogenesis and the characteristics of trace element. It would influence the value of regional geochemical baseline (Baize and Sterckeman 2001). Figure 3 is the result of the cluster analysis for the 15 elements of the Hong Kong soil and three major geochemical associations were distinguished in the dendrogram. The first group is composed of Cd, Fe, Ti, Sc, Co, Ni, Zn, Pb, Mn, and Cr. This indicated that trace elements such as Cd, Ni, and Zn were mainly associated with Fe–Mn oxides and were relatively immobile in geochemistry. Similar geochemical association was also found in the stream sediment of Hong Kong in which Cd and Co were related to Fe, while Pb was correlated to Mn significantly (Parry 2000). This implied that the first association was mainly influenced by local geological features. The element of chromium was also incorporated into the first group although it showed a weak correlation to the other elements. Chromium is frequently found to be associated with Fe in natural lateritic Fe-oxide crust. The trivalent form chromium (Cr3+) probably accumulated during weathering through the way of incorporation into the octahedral sites in the structures of kaolinite, gibbsite, hematite, and goethite, the minerals that make up most of the clay fraction (Marques et al. 2004). The second group contained Hg, Se, Al, and Cu, indicating this association was connected with the Al oxides or with the Al-bearing clay minerals mostly. In addition, Hg and Cu were always presented in the form of sulfides such as HgS and CuFeS2 in geochemistry, while Se occurred among these sulfides as an associated mineral (Peng 1986). The last group was the element of arsenic, which occurs as a sulfide generally in geochemistry. However, it was not associated with Hg, Se, Cu, and Al of the second group in the dendrogram. This might be attributed to the anthropogenic sources of arsenic in the soil as arsenic-bearing pesticides were applied in the past (Chen et al. 1997). Figure 4 shows the relationship between the content of As and that of S in the soil with the raw dataset and the dataset within the baseline range, respectively. For the former, no relationship was found between them, which was reflected by the low value of r 2 (0.0009), whereas, a significant (r 2 = 0.4066, p < 0.01) positive correlation was observed when only the data within the baseline range were focused on.

Fig. 3
figure 3

Dendrogram using average linkage (between groups) to display the elements association in the soil of Hong Kong

Fig. 4
figure 4

Correlations between As contents and S contents in the soils. a and b represents raw dataset and dataset within the geochemical baseline, respectively

Conclusions

This study indicates that the element of Fe is a good reference element in the determination of the geochemical baseline for the soil of Hong Kong as when compared to the other four elements Al, Ti, Sc, and Mn. The baselines of ten trace elements that were of concern in the survey of soil pollution were predicted through the linear regression between iron and each element. The predicted values were much closer to those obtained from the RCF curve. All the estimated baselines were much lower than the A-values of the Dutch List, which was used as an alternative background in screening the contaminated soil of Hong Kong. This implied that the trace elements in pollution might have been underestimated for the Hong Kong soils. A geochemical association of the trace element would play an important role in the geochemical baseline. Approximately three types of geochemical associations could be identified in Hong Kong soils including associations with Fe–Mn oxides, Al oxides or Al-bearing clay minerals and sulfides.