Introduction

Mumbai (Latitude 18°53′ to 19°04; Longitude 72°48′ to 72°53′E) is a heavily populated industrial island city on the west coast, which is known as the commercial capital of India. During the past 30 years, it has grown from a small city to an international city. In 1971, Mumbai had a population of 5,970,575, which increased to 8,227,382 in 1981, 9,990,000 in 1991 and 11,430,000 in 2001 (Dept of Public Health, Govt. of Maharashtra 2000). Congruent to this population expansion, the number of registered motor vehicles in Mumbai city has also increased. Fundamental to this rapid urbanization are the numerous industries at Thane–Belapur industrial area, which is one of the most rapidly developing industrial areas around Mumbai city, developed by Maharashtra State Government. This is also one of the contaminated sites identified by Central Pollution Control Board (CPCB), New Delhi. About 400 industries are in a 20 km area with all types of process industries, including chemical, pharmaceutical, textile, steel, paper, plastic, fertilizers, etc. Some of the well-known industries are E-Merck (I) Ltd., Hindustan Lever Ltd. and Pidilite industries Ltd., which have flourished in the area during recent years. Many of these industries discharge their treated or untreated effluents either on the open land or into unlined water streams. Average annual rainfall in the study area is about 2,500 to 3,500 mm, which falls during the period from of June to October. The rock formation in the area is derived from Deccan basalt and also from granites, gneisses and laterite. The soils of this region are highly saline. They are calcareous, neutral to alkaline (pH 7.5–8.5) and clays with a high amount of bases and have water-holding capacity (Shankar et al. 1994).

Pollution is a common concern in many modern, industrialized urban communities around the globe (Zhang et al. 1999; Moon et al. 2000). The connection between pollution and various health problems is now well documented and reasonably well known by the general public (Barzilay et al. 1999). Soils are usually the interface between human activities and other parts of the environment that should be preserved and protected (Krzyztof et al. 2004). Soil samples represent an excellent media to monitor heavy metal pollution because anthropogenic heavy metals are usually deposited on topsoils. Furthermore, soils not only serve as sources for certain metals but also function as sinks for metal contaminates (Alloway 1995; Govil et al. 2001). Therefore, analyses of heavy metals in soils offer an ideal means to monitor not only the pollution of soils itself, but also the quality of the overall environment as reflected in soil. Trace metal distribution is well documented for many countries (USA, Germany, Japan; Thuy et al. 2000); furthermore, a number of recent works (Manjunatha et al. 2001; Rawlins et al. 2002) evaluated the accumulation of heavy metals in various developed and developing countries. The situation is more serious in developing countries because of a sudden increase in industrialization and urbanization during the last decade.

The main objectives of the present investigation were: (1) to determine the concentration of heavy metals (Cu, Cr, Co, Ni and Zn) and to find the extent of their distribution in soils within the area, and (2) to provide a scientific basis for pollution control and further monitoring.

Methodology

Sample collection

A total of 28 soil samples were collected, mostly from the upper surface (0–50 cm) to differentiate between natural and anthropogenic sources of pollutants. Normally industrial pollutants contaminate the upper layer of the soil (0–40 cm). In case of natural pollutant, the entire soil shows high metal enrichment at all depths. The samples were collected in self-locking polythene bags and sealed in double bags. Use of metal tools was avoided and a plastic spatula was used for sample collection. Figure 1 shows the location of the soil samples collected from the area.

Fig. 1
figure 1

Sample location map of the study area

Sample preparation

Soil samples were dried for 2 days at 60°C. The dry soil sample was disaggregated with mortar pestle. The sample was finely powdered to −250 mesh size (US standard) using a swing grinding mill. Sample pellets were prepared for analysis by X-ray fluorescence (XRF) spectrometry, using a backing of boric acid and pressing at 25 tons of pressure. A hydraulic press was used to prepare the pellets for XRF analysis to determine trace elements.

Instrumentation

Along with a 4 kW X-ray generator, a Philips PW-2440 fully automatic, microprocessor- controlled, 168-position automatic PW 2540 vrc sample changer X-ray Spectrometer was used for the determination of major and trace metals (Cu, Cr, Co, Ni and Zn) in soil samples. Suitable software ‘Super Q’ was used to define dead-time correction and inter element matrix-effects. Reference samples from the US Geological Survey, Canadian Geological Survey, International Working Group France (Govindaraju 1994), and NGRI India (Govil and Narayana 1999; Govil 1993) were used to prepare calibration curves for major oxides and trace metals (for details of XRF procedure, see Govil 1985; Rao and Govil 1995). The trace- element data is reported in Table 1 and major-element data in Table 2.

Table 1 Heavy metal XRF data in soils (Values given are in mg/kg)
Table 2 Major element XRF data in soils (Values given are in wt%)

Processing of analytical data

Summary statistics of the heavy metal data (Table 3) set were first calculated to evaluate the distribution of elements. Frequency distribution of each of the elements was examined based on comparison with the threshold values, cluster analysis and calculation of skewness. Skewness measures the asymmetry of the observations. Geochemical variables, especially trace elements, have low concentrations in the environment, while the anomalies resulting from natural processes and anthropogenic activities cause the occurrence of high concentration in data sets. This makes the probability distribution positively skewed (Webster and Oliver 2001). The correlation between the elements was estimated by the scatter plots (Huisman et al. 1997).

Table 3 Statistical summary of metal concentration in soil (mg/kg)

Results and discussions

Total heavy metals concentrations

To determine the variability of the content of heavy metals in soils, comparative analysis of the elements was first performed. Table 3 summarizes the statistics of the data. Maximum permitted threshold soil concentrations of potentially toxic metals prescribed by WHO guidelines (WHO 1996) are also given. Distributions of Cu, Cr, Co, Ni and Zn are skewed (the skewness coefficient exceeded 1.0). Regardless of skewed distributions, the mean value and median are similar for most metals, considering that the median value was lower (Table 3). The highest skewness is found in Zinc, with the g1 value of 2.97. However, after the logarithmic transformation, the mean value and the median equalize, g1=0.47 indicating that these values can be further tested by applying more complex statistical methods. The environmental geochemical phenomena, including heavy metal concentrations in soils, commonly correlated, are caused by the physico-chemical features of elements and geochemical processes (Zhang and Selinus 1998). Also, some of the major oxide data, as shown in Table 2 revealed high concentrations of Fe and Mn in the area. The study also showed concentrations of the metals that are to a large extent linearly correlated. Table 4 is the correlation matrix. It shows strong correlation between Cr and Co (r=0.75361), Cu and Zn (r=0.71477), Co and Ni (r=0.79984).

Table 4 Correlation matrix table, lower triangle for Thane-Belapur soils

Copper

The average concentration of copper in the investigated area is 104.6 mg/kg and lies between 3.10 mg/kg and 271.2 mg/kg (Table 1). The normal threshold value prescribed in soils is 30 mg/kg. Figure 2 shows a marked copper accumulation in soils around the industrial area, which may be from application of agrochemicals in the agro-based industry in the area. Copper is, however, characterized by the so-called point sources of contamination, namely uncontrolled, active or untended waste dumps. Such places imply great contamination risks for not only soil but also groundwater (Romic et al. 2001; Govil et al. 1999). From the aspect of groundwater hazards in this area, only the quantities of metals leached to the sorum/gravel substratum contact are dispersed in the aquifer, thus potentially contaminating it. The transfer of contamination and its dispersion in the aquifer is a very slow process, and changes in metal concentration in the upper aquifer layer are not manifested in a short time. However, the smaller the sorum depth, the higher the risk.

Fig. 2
figure 2

Distribution of copper in soil

Chromium

Chromium levels around the study area ranged from 177.9 mg/kg to 1,039 mg/kg, with an average of 521.3 mg/kg (Table 2), which is considerably high as shown in the distribution map of chromium (Fig. 3). The normal range of Cr in soils is 100 mg/kg (Ferguson 1990). Chromium and its compounds are primarily used in the production of steel and other alloys, chrome plating and pigment production, which is the major industrial activity in the Thane–Belapur study area. However, inadequate disposal of waste containing chromium at industrial sites in the past decade has contaminated the land and groundwater. Although small amounts of naturally occurring trivalent chromium (Cr+ 3) are essential for nutrition, the presence of industry produced (Cr+ 6), in water or the food chain is considered to be highly hazardous. The source of Cr appears to be anthropogenic—from some industries producing steel, textiles in the area. Ansari et al. (1999) found very high Cr levels in soil, i.e. up to 1,220 mg/kg in some industrial areas of India.

Fig. 3
figure 3

Distribution of chromium in soil

Cobalt

The concentrations of cobalt in the analyzsed samples of soil in the study area are generally high. In almost all the samples the concentrations were above the normal threshold value of 17 mg/kg with average concentration of 68.7 mg/kg ranging from 44.8 mg/kg to 101.6 mg/kg. Cobalt usually occurs in association with other metals, such as copper, nickel, manganese and arsenic. One of the most important soil properties is its acidity. The more acidic the soil, the greater the potential for cobalt toxicity (Ferguson 1990; Govil et al. 1998). The source of cobalt to the environment could be the burning of coal and the industrial processes that use the metal or its compounds. The highest concentration, 101.6 mg/kg, in the study area was determined at a location in the vicinity of the metal industries (Fig. 4), indicating the source as anthropogenic.

Fig. 4
figure 4

Distribution of cobalt in soil

Nickel

The geochemical map in Fig. 5 shows the spatial distribution of Ni in the study region. The high average concentration of nickel (183.6 mg/kg) is related to some anthropogenic input. However, the highest concentrations up to 537.8 mg/kg are found in the area which is higher than the normal threshold content of Ni in soils from 1 mg/kg to 80 mg/kg (Aswathanarayana 1995). Nickel accumulation in the soil is obviously influenced by the Thane–belapur ring road and some of the industries. Anthropogenic sources, such as combustion of fuel or waste released from industries where nickel is extracted from ore, may have some effect on increase in Ni concentrations, as confirmed in this study.

Fig. 5
figure 5

Distribution of nickel in soil

Nickel is known for its considerable spatial variation in soil, even on a small scale (Proctor and Baker 1994), but mention should also be made of its temporal variability, characteristic of this metal which are not typical of heavy metals, in general (Xiangdong Li et al. 2004). In such cases, the fluctuation of metals in soil can be monitored and their bioavailability and mobility estimated solely by stationery trials. Nickel concentrations available to plants are much lower than the total, but still there is no universally accepted method for evaluation of its bioavailabilty, the main reason being a great variability in Ni concentrations in soils.

Zinc

Zinc belongs to a group of trace metals, which is an essential element for the growth of humans and plants. The main sources of pollution are industry and the use of liquid manure, composed materials and agrochemicals such as fertilizers and pesticides in agriculture. The average zinc concentration in the soil of Thane–belapur is 191.3 mg/kg, ranging from 96.6 mg/kg to 763.2 mg/kg. Zinc showed a distribution pattern similar to copper in soils. There are clearly marked locations having higher concentrations of zinc in soils in the vicinity of the industrial area (Fig 6). However, only two samples showed zinc concentrations higher than the normal threshold value (Fig 7).

Fig. 6
figure 6

Distribution of zinc in soil

Fig. 7
figure 7

Heavy metal content and its variations with respect to background value shown with it dark thick line for a Copper (30 mg/kg) b Chromium (100 mg/kg) c Cobalt (17 mg/kg) d) Nickel (80 mg/kg) e Zinc (200 mg/kg)

Cluster analysis

R type cluster analysis is used to elucidate the complex relationships between the different heavy metals studied. Numerous factors control their relative abundance, including the original concentrations in rocks, during the process of soil formation, as well as anthropogenic factors such as pollution. The correlation coefficient “r” was used as the measurement parameter of relationships, and the single linkage clustering method was used to divide the elements into clusters (Davis 1986). All the samples were used for the calculation of the correlation coefficient of the elements using the following equation

$$r_{jk} = {{\sum\limits_{i = 1}^N {\left( {X_{ij} - \bar X_j } \right)\left( {X_{ik} - \bar X_k } \right)} } \over {\surd \sum\limits_{i = 1}^N {\left( {X_{ij} - \bar X_j } \right)^2 \sum\limits_{i = 1}^N {\left( {X_{ik} - \bar X_k } \right)^2 } } }}$$

where j, k = 1–5 are the number of elements, i is the number of soil samples; \(0 \leq\vert r \vert\leq1. \)

The results of the calculations are listed in the Table 4 and plotted in Fig. 8.

Fig. 8
figure 8

Scatter plots showing correlation between Cr vs. Co, Ni; Co vs. Ni, Zn; Cu vs. Zn, Ni

Table 4 and Fig. 8 indicate that Cr–Co and Cr–Ni are closely related with a correlation coefficient of 0.75361 and 0.72041; Co–Ni also have a close relationship as shown by a correlation coefficient of 0.79948, respectively. Cu–Zn also showed correlation with the correlation coefficient of 0.71477. Overall all elements, Cr, Co, Cu, Ni and Zn, are grouped together, indicating that the anthropogenic sources of these heavy metals are closely related in soil of the study area.

In a recent study, Romic and Romic (2003) prepared cluster analysis on numerous heavy metals present in the agriculture topsoil in an urban area of the Croatian capital of Zagreb. They identified the group of clustered elements, including Cd, Pb, Cu, Zn and Ni as heavily influenced by anthropogenic means. However, in this study area the cluster group of heavy metals shows the influence of the anthropogenic input from industries such as steel, agrochemicals, fertilizers, waste released from industry.

Conclusion

Soils in the Thane–Belapur industrial development area were found to be significantly contaminated by Cu, Cr, Co, Ni, Zn and some of the major elements such as Fe and Mn. The detected levels of total metal contamination in many of the samples were found to exceed international threshold values. By the cluster analysis applied to determine associations of heavy metals in soils of Thane–Belapur, the groups of Cu, Cr, Co, Ni and Zn showed a strong anthropogenic influence. High nickel concentrations are also connected to morphogetic characteristics of the greater region, primarily basic rocks of the surrounding mountain range. However, it has been assumed that the anomalous concentrations of nickel in the vicinity of the highway and industry are of anthropogenic input. Copper is characterized by strongly scattered anthropogenic influence. This particularly relates to the uncontrolled waste dumps and liquid waste from industries. With the analytical results of Cr, Co and Zn obtained in the soils of the study area it is also indicated that the pollution of these elements in the area originated mainly from industries in the area, either from dumping of industrial waste or from release of untreated effluents on to the ground which contaminate the underground water table. High values of Fe and Mn in the area also indicate anthropogenic origin from some petrochemical-based industry. According to the environmental quality criteria for soils, the site would require remediation. It is obvious that systematic and continuous monitoring for heavy metal pollution should be instituted, and certain steps (like phyto-remediation by growing some plants in the area) should be carried out as soon as possible to minimize the rate, and extent of future pollution problems.