Rivers are crucial freshwater resources due to their wide applications in domestic, industrial, agriculture, and recreational activities (Razamkhah et al. 2010; Mustapha et al. 2012). The water quality of a river is influenced by multiple factors ranging from catchment’s lithology, topography, and climate, atmospheric deposition to several anthropogenic factors (Shrestha and Kazama 2007; Iqbal and Shah 2014). The domestic sewage and industrial effluents represent a regular anthropogenic source of pollution, which contributes largely through the point sources. On the other hand, surface run-off, which is mainly a season-dependent phenomenon, represents a diffuse source of pollution that carries various pollutants not only from urban areas but a major portion also is contributed from agriculture fields (Singh et al. 2004). The diffuse or non-point sources of pollution are difficult to manage due to their complicated mode of mobility in river catchment and influx in rivers through wide surface area (Kotti et al. 2005). Rivers flanked by urban areas are more susceptible to pollution, especially in developing countries where the natural waterways are the ultimate sinks for urban wastewater (Singh et al. 2004; Jadoon et al. 2012). Downstream sections of rivers are particularly at greater risk of degradation due to collective impact of pollution influx from upstream sections (Chang 2008).

It is estimated that 40 × 109 m3/year of wastewater is generated by Rawalpindi, which is either used for irrigation of crops or discharge in River Soan. Approximately 1000 tons of garbage and construction material is disposed daily in Nullah Lei. The bed of Soan River also is used as a garbage station for disposal of solid waste on a temporary basis. Islamabad is the capital and the most organized city of Pakistan, but industrial effluents are discharged from its two industrial zones directly or indirectly into the Soan River, because it lacks an appropriate system for treatment of industrial effluents (Azizullah et al. 2011). Only a few industrial units have acquired antipollution equipment, but those are not functioning properly due to lack of maintenance. These units release slags, gases, and untreated effluents that fall directly or indirectly in River Soan. One ton of particulate matter is emitted daily from the steel furnaces. A survey conducted by the Environmental Protection Agency has reported that DO as low as 0.9 mg/L, BOD as 63.5 mg/L, COD as 162.5 mg/L, whereas Pb and Cd concentrations were found exceeding NEQ’s (Pb: 0.5 mg/L and Cd: 0.1 mg/L) in streams passing through an industrial area. It has been reported earlier that effluents released from the Industrial area of Humak model town has contaminated River Soan with Ni, Cr, Cd, and Pb (Mian et al. 1998). Jalil and Khan (2012) analyzed the physicochemical and bacteriological conditions of Soan River and reported that the Soan River and specifically the Kaurang River has become unfit for human consumption and existence of aquatic life.

To sustain imperative role of rivers in both aquatic and terrestrial ecosystems, it is inevitable to restore and conserve these resources through their effective management, which primarily rely on assortment of consistent data on their water quality (Mustapha and Abdu 2012; Wang et al. 2013; Nazeer et al. 2014). Assessment of water quality on the basis of water-quality index (WQI) was recently reported for River Soan, which drained one of the most urbanized catchment of Pakistan (Nazeer et al. 2014). Although variations in water quality could be predicted considering WQI, the pollution sources could not be well described on the basis of WQI (Wunderlin et al. 2001). Hence, a multivariate approach was needed to evaluate the water quality of Soan River. The present study was designed to evaluate the influence of anthropogenic and natural processes on water quality and the apportionment of different sources for their contribution in affecting water quality. Furthermore, it is attempted to determine the similarities in sampling sites on the basis of water quality and to sort out those variables that best described spatiotemporal trends in data. The findings will be useful for the water-quality manager to design an effective monitoring program for pollution control in River Soan.

Methodology

Study Area

The River Soan is located in northern Pakistan between 71°45′ and 73°35′ east longitude and 32°45′ to 33°55′ north latitude (Fig. 1). The river has its origin in lower Himalayan foothills and drained catchment area of approximately 10,999 km2. Study area exhibits cold Himalayan climate in northern part while it has severe hot climate in southern region. The climatic condition of the study area is largely categorized as “Subtropical Triple Season Moderate Climate Zone.” The region receives average rainfall of 1789 mm in north to 783 mm per year in southwestern part, whereas the major rainfall (>50 %) occurs during monsoon season, which extends from July to September (Hagler 2007). The annual temperature also is highly variable across the study area, ranging from 9 to 17 °C in north to 15 to 35 °C in southwest. Study area exhibits diverse lithological features based mainly on limestone, shale, continental sand stones, and clay belong to eocene, miocene epoch, Pliocene, and lower pleistocene ages (Sheikh et al. 2007). The river receives numerous tributaries on its way to the River Indus, among which the River Kaurang, River Ling, and Nallah Lei are noticeable. The major domestic need for water of more than 3 million populations of Rawalpindi and Islamabad is accomplished through Simly and Rawal Lake reservoirs built on River Soan and its tributary the River Kaurang respectively.

Fig. 1
figure 1

Map showing landuse patterns and population density in study area and location of sampling sites on River Soan and its tributaries (Color figure online)

Upstream of River Soan is characterized by distinct topography of lower Himalayan foothills having high elevated (>550 m) hilly terrain with numerous valleys and steep gradient. Forest is the predominant land use at upstream reaches of River Soan. Upstream catchment slopes towards southward to the piedmont fold, which constitute the plain area of the Rawalpindi and Islamabad. Upper and lower zones of eastern section are characterized by rainfed and irrigated agriculture landuses respectively. Western section of midstream exhibit intense urbanization, because two major cities (Rawalpindi and Islamabad) are located here. More than 1200 registered industrial units are working in Rawalpindi and Islamabad, which include pharmaceutical industry, re-rolling mills, flour mills, marble cutting and polishing units, and oil and ghee industries. Downstream zone lies in the southwestern part of the catchment area and Pothar plateau. This region is composed of low elevated hills, plateaus, and undulating plains of Attock and Chakwal districts. Rainfed agriculture and wasted land due to deep gullies and gorges are the main landuse at river downstream.

Sampling and Physicochemical Analysis

Water samples were collected from 26 sampling located on the River and its tributaries during pre-monsoon (April–May) and post-monsoon season (September–October) in 2008 (Fig. 1). The survey conducted during the base flow periods and daylight. Triplicate water samples were collected from 25-cm depth where applicable using the grab method. Different parameters, such as temperature, pH, DO, TDS, specific conductivity, and salinity, were recorded in the field using portable Hydrolab Surveyor (MS 5 Surveyor Hach Environmental) and turbidity was measured with 2100P turbiditimeter (HACH). Water samples were transported in 2-L polythene plastic bottles first cleaned with metal-free soap and then rinsed with distilled water and finally kept in 10 % HNO3 overnight. All samples were transported in ice boxes conserved at 4 °C and delivered to the laboratory on the same day. All samples were kept at 4 °C until further lab analysis.

Water samples were analyzed for different parameters in laboratory. Analyses conducted in lab included Alkalinity (Acid–Base titrometry), chlorides (Silver-nitrate titrometry), COD kits (HACH kits based spectrometric Methodology), orthophosphates (ammonium molybdate method), nitrate-nitrogen (phenol disulphonic method), ammonia–nitrogen (phenate method), and sulphates (barium chloride method) as described previously (APHA 1998). Water samples were filtered through Whatman filter paper (pore size = 0.45 μm) for estimation of dissolved metals and acidified with nitric acid to adjust pH <2 to preserve samples (Sadeghi et al. 2012). Water samples were run on fast sequential Atomic absorption spectrometer for analysis of dissolved metals.

Concentration of metals was determined through calibration curve of standard solutions using Atomic Absorption Spectrometry (Variance FSS-240). The detection limits determined for Cd, Cr, Cu, Fe, Pb, Ni, and Zn were 0.009, 0.004, 0.002, 0.005, 0.003, 0.002, 0.003, 0.0007, and 0.009 mg/L, respectively. The blanks for both procedure and reagents were treated and measured in parallel to the samples using the same procedure. Standard reference material SRM-1673e was used to assess the precision (Relative Standard Deviation-RSD) and recovery of running procedure followed for quantification of metals in water. Precision for different metals was detected in the range of 5 % to 10 %, whereas recoveries for all the metals were within range of 92–111 %. Results of quality control measures validate the reliability and repeatability of analytical procedure followed in present study.

Multivariate Analysis

The multivariate statistical techniques have been widely used in various studies for the explanation of spatiotemporal variations and interpretation of chemical/physical characteristics of water-quality parameters (Simeonov et al. 2000; Wang et al. 2013) compared with univariant techniques, which usually fail to give adequate information on multivariate dataset (Santos-Roman et al. 2003). In the present study, four multivariate techniques, such as HACA, PCA/FA, DFA, and APCS–MLR, were used for the water-quality assessment and interpretation of the results.

In the present study, CA was performed on the normalized dataset using Euclidean similarity matrix as a distance measure and Wards method as a linkage technique. The linkage method was reported as Dlink/Dmax where Dlink and Dmax correspond to the linkage distances for a specific object and maximum distance respectively (Singh et al. 2005a, 2005b). To standardize the linkage distance along y-axis, the Dlink/Dmax measure was multiplied by 100 (Wunderlin et al. 2001). Cluster significance was based on the criterion of 0.66 Dmax.

Principal component analysis (PCA) is a widely used ordination technique for unfolding the ecological gradients and to explore spatiotemporal trends in limnological studies (Perona et al. 1999). Data transformation is performed to reduce the set of intercorrelated variables to the set of uncorrelated (orthogonal) variables (Yusuf et al. 2013; Retnam et al. 2013). Factor analysis is designed to reduce the contribution of less significant variables to decrease the complication in data that have passed through PCA (Shrestha and Kazama 2007; Chen et al. 2015). Varimax (variance maximizing) rotation is the most common rotation approach, which is generally applied to reallocate the variance and present the better interpretations of the factors that after varimax rotation are termed as varifactors (VF) (Murillo et al. 2013).

The discriminant function analysis (DFA)—a supervised pattern recognition technique—is used to determine whether prior defined groups vary in terms of a mean of a variable and afterwards DFA find out the group that particular variable is associated (Qadir et al. 2008; Chen et al. 2015). The DFA yielded discrimination functions (DFs) for each group using standard, forward, and backward stepwise modes. In DFA-based spatial analysis of water-quality data, sites were used as grouping variable while temporal analysis used seasons as grouping variables. These grouping variables were taken as dependent variables, while different water-quality attributes were taken as independent variables.

The number and characteristics of possible sources were determined through PCA/FA after which the source contribution to the characteristic pollutant was determined through absolute principal component scores using multiple regression method (APCS–MLR) (Guo et al. 2004; Singh et al. 2005a, 2005b). The quantitative contributions from each source for each variable/pollutant were then compared with measured concentrations.

Results

Temporal Trends in Water Quality

The physicochemical parameters of water quality recorded during pre- and post-monsoon at different sampling sites of River Soan and its associated tributaries are illustrated in Table 1. Seasonal variations determined through Spearman correlation matrix for season-variables showed that among different parameters, COD, ammonia, temperature, pH, Cd, Pb, Ni, Cu, and Cr were significantly correlated with season. Strong correlations were found for Cu (r = 0.72) followed by Ni (r = 0.65), ammonia (r = 0.57), Pb (r = − 0.5), pH (r = − 0.48), COD (r = − 0.42), temperature (r = − 0.4), Cd (r = 0.37), and Cu (r = 0.34). Temporal trends were further analyzed through DFA. Raw data divided into pre- and post-monsoon seasons was used for temporal DFA. Discriminant functions of forward stepwise and backward stepwise modes are presented in Table 2. Forward stepwise mode involves the stepwise addition of significant variables until no significant variation is found, whereas backward stepwise mode involves the removal of insignificant variables until no significant variation is found. The standard, forward stepwise, and backward stepwise assigned 96, 92, and 90 % respectively of the cases correctly using 23, 9, and 6 variables respectively. Temporal trends as identified through DFA revealed that COD, DO, pH, Cu, Cd, and Cr could be attributed for major seasonal variations in water quality (Table 2). Box and whisker plots revealed that COD and pH were high during pre-monsoon, whereas DO and concentrations of heavy metals, such as Cu, Cd, and Cr, were high during post-monsoon (Fig. 2).

Table 1 Mean and standard deviation of physicochemical parameters of water quality of River Soan
Table 2 Classification functions of spatial and temporal trends in water quality of River Soan (all physicochemical variables were selected by standard stepwise mode hence classification function for standard stepwise mode is not presented here)
Fig. 2
figure 2

Box and whisker plots illustrating the spatial (orthophosphates, nitrates, chlorides, alkalinity, and Ca) and temporal (COD, Cu, dissolved oxygen, Cd, Cr, and pH) trends of water quality of River Soan

Spatial Trends in Water Quality

In order to analyze the similarity among different sampling sites along the spatial scale, cluster analysis was conducted which grouped 26 sampling sites into three spatial groups ((Dlink/Dmax) × 100 < 60) on the basis of water quality (Fig. 3). Cluster 1 included sites SM, SN-0, SN-2, SN-3, SN-4, SN-5, LG-1, LG-2, LG-3, J, LH-4, and KD, which were located in river upstream and suburban midstream. Cluster 2 included downstream sites, such as SN-7, SN-8, SN-10, SN-11, and SN-12, and sites located in close proximity to urban areas, such as GM, KG-3, and SN-6. Cluster 3 was comprised of sites LH-1, LH-2, LH-3, KG-1, KG-2, and SN-9, which are located either in urban or close proximity to urban area except SN-9, which is located in river downstream. Spatial variations in the data were interpreted through DFA. Analysis was conducted on raw data set comprising of three clusters formed by cluster analysis. Discriminant functions of forward stepwise and backward stepwise modes are given in Table 2. Standard and forward stepwise mode DFs based on 23 and 12 variables assigned 100 % cases correctly while backward stepwise mode based on six variables assigned 96 % of cases correctly. Backward stepwise mode sorted total alkalinity, orthophosphates, nitrates, Cl, and Ca as those variables which significantly discriminate among three groups (Table 2). Box and Whisker plots revealed that orthophosphates, Cl, alkalinity, and Ca were higher in cluster 3, which included highly polluted sites of urban midstream, followed by moderately polluted sites of cluster-2 and least polluted sites of cluster-1 (Fig. 2). Nitrates were higher at moderately degraded sites of cluster-2 followed by cluster-3 and cluster-1 (Fig. 2).

Fig. 3
figure 3

Dendrogram showing the grouping of sampling sites into three clusters on the basis of water quality of River Soan

Source Identification of Different Pollutants

To identify the potential source of pollution in river Soan, PCA/FA was applied to normalized data (Table 3). Total of six factors having eigen value >1 were sorted which explained 81 % of the variations in the dataset. Factor 1 explains 40 % of total variations in the data set and showed high loadings of total alkalinity, orthophosphates, specific conductivity, salinity, Ca, Mg, and K. Factor 2 accounted for 16 % of total variations and showed high loadings of temperature, pH, and DO. Factor 3 explained 8 % of total variations in data and showed high loadings of Fe and Cu and moderate loadings of COD, Cd, ammonia, orthophosphates, and Ni. Factor 4 explained 6 % of total variations in data and had high loadings of nitrates. Factors 5 and 6 explained 5.6 % and 5.3 % of variation in data, respectively, and showed high loadings of turbidity and Zn respectively. To verify the PCA interpretations, factor scores of different groups of sampling sites (identified through cluster analysis on the basis of water quality) were examined so that source contributions could be confirmed through factor scores (Fig. 4). Source profile of factor 1 also showed high scores at highly polluted sites which are located in urban areas (Cluster-3) (Fig. 4). Factor 2 showed high scores of moderately polluted sites (Cluster-2) located in the river downstream, which suffer from hyper eutrophic conditions (Nazeer et al. 2014) (Fig. 4). Source profile of factors 3 showed high scores of urban midstream (Cluster-3) (Fig. 4). Factor 4 represents the impact of agriculture activities on water quality and showed the high scores of sub urban part of midstream and downstream sites, which have agriculture as main catchment land use (Fig. 4). High scores for factor 5 were noted mainly at suburban midstream sites included in cluster-2 (Fig. 4). Factor scores for factor 6 were measured less than 0 at each cluster (Fig. 4).

Table 3 Factor loadings of different water physico-chemical variables on significant varimax factors (strong loadings are italicized and bold)
Fig. 4
figure 4

Factor scores of different groups of sampling sites (identified through cluster analysis on the basis of water quality)

Source Apportionment

The APCS–MLR technique was utilized to determine the contribution of different sources in affecting water quality and the results are presented in Table 4. Predicted concentrations of different parameters hold good accuracy with measured concentrations, which also is confirmed by regression coefficients and the ratio between predicted and measured values. The values of the correlation coefficient (R 2) are the measure of the variance of the observed concentration which could be attributed to the variance of predicted concentration. Organic pollution was mainly added by factor 1 (mineralization and sewage waste), which contributes 27 % followed by factor 3 (urban runoff), 4 (agriculture activities), and 5 (livestock interruptions and bank erosion), which contributes 17, 16, and 14 %, respectively (Fig. 5a). Factor 1 (mineralization and sewage waste) and 6 (industrial activities) accounts for 28 % contribution for heavy metal pollution in water, followed by factor 3 (urban runoff) and 4 (agriculture activities), which contribute 15 and 13 %, respectively (Fig. 5b). Nutrients were mainly added by factor 4 (agriculture activities), which contributes 35 %, followed by factor 1 (mineralization and sewage waste), 5 (livestock interruptions and bank erosion), and 3 (urban runoff), which contribute 28, 17, and 10 %, respectively (Fig. 5c). Factor 3 (urban runoff) contributes 39 % to macro elements, followed by factor 1 (mineralization and sewage waste) and 2 (ecological response factor and natural processes), which contribute 30 and 17 % respectively (Fig. 5d).

Table 4 Source contribution to the different water quality parameters
Fig. 5
figure 5

Source apportionment for different water quality parameters a organic pollution (COD + Ammonia), b heavy metals (Pb, Cu, Cr, Ni, Zn, Fe, and Cd), c nutrients (orthophosphates and nitrates) and macroelements (Mg, Ca, and K). (Factor 1 sewage waste and mineralization. Factor 2 ecological response factor and natural variability. Factor 3 urban runoff. Factor 4 agriculture activities. Factor 5 livestock interruptions, bank erosion, and soil erosion. Factor 6 industrial activities)

Different physicochemical factors, such as DO, turbidity, and specific conductivity, showed characteristic pattern according to their association with catchment land uses (Table 4). DO was mainly affected by elevated growth of phytoplankton (factor 2: 63.41 %), whereas some unidentified sources (36.58 %) also were controlling DO in river. These sources might be related to hydrological factors like water flow which affects DO in river. Turbidity was mainly affected by livestock interruptions and bank erosion (factor 5: 75.92 %), whereas phytoplankton (factor 2: 24.07 %) also played their role in making water turbid. Specific conductivity was mainly attributed to sewage waste along with natural processes (factor 1: 59.5 %); however, surface runoff from agriculture fields (factor 4: 19.6 %) and urban areas (factor 3: 20.89 %) also raised the specific conductivity of water.

Discussion

Temporal analysis of water quality revealed that COD, DO, pH, Cu, Cd, and Cr could be attributed to major seasonal changes in water quality, which suggests that anthropogenic activities are major sources of seasonal variations in water quality. The impact of anthropogenic activities might be due to the monsoon precipitation which facilitates the soil erosion and accelerates the influx of organic waste and agrochemical runoff from catchment through flooding. Chen et al. (2013) has reported that pollution was considerably high in Jinjiang River, China during the period of high flow while pollution was low to moderate during low-flow period. In Soan Basin, increase in chemical oxygen demand during pre-monsoon was due to sewage waste, which on degradation enhanced the chemical oxygen demand and thus lowered the concentration of dissolved oxygen in water. However, increase in water discharge from reservoirs during monsoon improves flow in the river and enhanced the dissolution of oxygen in water in monsoon and post-monsoon seasons (Chang 2008). Chen et al. (2015) have reported similar findings that increase in flow during the wet season, lowering the COD in Danjiangkou Reservoir Basin in Central China. During post-monsoon, pH was relatively low and concentrations of heavy metals, such as Cu, Cd, and Cr, were high. Monsoon precipitation mobilizes various contaminants from terrestrial ecosystem to aquatic ecosystem by accelerating surface runoff, which brings waste from dumping sites for domestic solid waste, industrial waste, and poultry farms (Nazeer et al. 2014). Diffuse sources of pollution actively play their role during high flows through surface runoff (Kotti et al. 2005). Chang (2008) has reported that the influx of solutes in river increases through surface run off during intense monsoon precipitation. Low pH noted during post-monsoon might be due to the influx of surface runoff, which adds organic waste from solid waste dumping sites as well as agrochemical wastes in river and increases the cationic concentrations in water. Moreover atmospheric depositions also play an additive role in enhancing metal concentration particularly during post-monsoon.

Major spatial variations in water quality were attributed to total alkalinity, orthophosphates, nitrates, Cl, and Ca, which were mainly high at sites grouped in cluster-3 with the exception of nitrates, which was high in cluster-2. Cluster-1 was comprised of the sites located in urban part of river midstream which were characterized by dense urbanization in their catchment and presence of several point sources of pollution in the form of drainage pipes. The sewage and industrial wastes of twin cities of Rawalpindi and Islamabad are directly drained through these pipes into the freshwater streams and rivers mainly in untreated form. In urbanized catchments, municipal and industrial wastes are the major pollutants of aquatic ecosystem (Li et al. 2011). Deliberate release of sewage and industrial wastes through point and diffuse sources at sites of cluster-1 has resulted in increase in organic load in river. Biochemical degradation of organic compounds increases concentration of carbonates and bicarbonates which elevates water alkalinity (Khan et al. 2014). Sewage waste and carbonate geology of the study area could be account for relatively high alkalinity at sites of cluster-3 (Jalil and Khan 2012). High Ca at sites of cluster-3 is attributed to the local geology as the sites clustered in group are located on Lei Nallah, which drains the Margalla hills rich in limestone (Munir et al. 2011). High alkalinity at these sites could be partly attributed to the weathering of the parent rock material of Margalla hills, which adds bicarbonate ions to the surface water. It is to be noted that some of the sites located at Lei Nallah exhibit DO as low as 0.5 mg/L, but pH did not decrease accordingly as normally occurs. The stability in pH also could be attributed to the weathering of calcareous rocks present in this region. Domestic use of detergents and soaps makes sewage waste an important source of orthophosphates in streams and rivers (Perona et al. 1999). In Soan drainage, sewage pollution is the major source of chlorides and orthophosphates in surface water. Chloride is regarded as a conservative chemical species and serves as an indicator for sewage pollution (Varol et al. 2012; Kumar et al. 2015). Beside sewage pollution, natural sources, urban runoff, and industrial effluents also add chloride to the surface water (Oketola et al. 2013).

Downstream sites were grouped in cluster-2. Nitrates were high in cluster-2, which might be attributed to the agricultural activities which is the major land use at these sites. The organic waste entered in river from midstream region causes major environmental hazard for the aquatic environment at downstream. The downstream sites showed extreme growth of phytoplankton due to influx of nutrients in river mainly from urban midstream. In previous study, hyper eutrophic conditions were reported in downstream region of River Soan in response to the enrichment of river with high nutrient loadings (Nazeer et al. 2014). Some of the sites located in moderately polluted suburban part of river midstream also were grouped in cluster-2. The clustering of downstream sites with moderately impacted sites of suburban midstream showed that due to self-purification and assimilative capacity of the river, water quality was improved from highly degraded condition (river urban midstream) to a moderate one (Kannel et al. 2007). Upstream and some of the suburban midstream sites were grouped in cluster-3. It was earlier reported that water quality of River Soan at these sites could be classified into good-fair quality classes, whereas none of these site had poor or very poor quality of water (Nazeer et al. 2014). Least deteriorated quality of water at these sites is attributed to relatively less urbanization in the catchment of these sites. Since the pollution increases downstream the gradient as the river passes through different land uses, the upstream reaches often are less polluted than downstream reaches (Kotti et al. 2005; Wang et al. 2015). Poultry farming, tourism, grazing, and deforestation are major environmental threats at river upstream.

The grouping pattern revealed through CA was consistent with classification of river done earlier on the basis of WQI (Nazeer et al. 2014). The results indicate that the CA technique is useful in offering reliable classification of surface waters in the whole region and will make it possible to design a future spatial sampling strategy in an optimal manner, which can reduce the number of sampling stations and associated costs (Shrestha and Kazama 2007).

The sources of pollution in riverine ecosystems can be determined through factor scores estimated through factor analysis (Bu et al. 2010). The high scores correspond to high influence of the factor on the sampling sites (Kannel et al. 2007). Source profile of different factors helps in identification of polluted areas in river catchment which could be useful in assessment of water quality at watershed level (Yan et al. 2015). High loadings of alkalinity, Mg and Ca on first factor point out towards the natural phenomenon of mineralization. Munir et al. (2011) has reported that Ca and Mg in surface water of this region are mainly contributed by natural sources. Present study revealed the concentrations of Mg and Ca in 1:2, which is consistent with the findings reported for other regions round the globe (Ndungu et al. 2014). Conversely, high loadings of orthophosphates and moderate loadings of COD and ammonia indicate the role of anthropogenic activities, which magnifies the natural processes in altering the water quality. These findings suggest that first factor could be termed as mineralization and sewage pollution factor. Source profile of factors-1 also showed the high scores at highly polluted sites located in urban areas (cluster-3) and drain the domestic sewage and industrial effluents from urban areas of Rawalpindi and Islamabad (Fig. 5). Addition of sewage waste from the twin cities results in increase in orthophosphates and organic contents. Degradation of organic compounds enhances the ammonia concentration as well as biochemical oxygen demand (Shrestha and Kazama 2007). COD is considered as an indicator of organic pollution which occurs primarily due to degradation of organic waste of sewage and industrial discharges (Ullah et al. 2013; Chen et al. 2015). The dissolved oxygen thus decreases in response to increase in biochemical oxygen demand, which is evident from negative loading of dissolved oxygen on the first factor. Various ions, such as carbonates, bicarbonates, sulfates, phosphates, Na, K, and Cl, are added to aquatic ecosystem mainly through sewage waste. These ions and decomposed organic matter increase the dissolved solids and conductivity of water (Yousafzai et al. 2013). The presence of shales, industrial effluents, and oxidation of sulfite ores are the common sources of sulfate in surface water (Chen et al. 2013). In River Soan, leaching of gypsum, domestic use of detergents, and geology of the area has been identified as major sources of sulfates (Jalil and Khan 2012; Mehmood et al. 2013). Calcareous rocks and marble processing units in the urban midstream are greatly attributed for high mineral contents of River Soan. Effluent discharge from manufacturing and processing units could be an important source of K in river water (Mehmood et al. 2013).

The second factor represents the role of ecological response of aquatic ecosystem as a source of variability in water physico-chemistry. Temperature, DO, and pH showed high loadings on this factor. DO and pH are factors that are quite sensitive to water pollution, hence greatly vary under the influence of other environmental factors (Wang et al. 2007). Factor 2 showed high scores of moderately polluted sites (cluster-2) located in the river downstream, which suffer from hyper eutrophic conditions (Nazeer et al. 2014) (Fig. 5). Midstream catchment of river enriches the river with nutrients through sewage pollution and urban and agriculture runoff, which creates hyper eutrophic conditions at river downstream. Primary productivity of the river is increased by eutrophication and subsequently increased the photosynthetic process which releases oxygen. The role of eutrophication as a source of variability also has been discussed earlier for other rivers (Zhao and Cui 2009). Due to climatic variability in the study area, downstream catchment of the river has relatively high temperature compared with upstream and midstream zones. The association of DO and temperature along second axis corresponds to increase in organic pollution and subsequent algal growth from upstream to downstream, which coincides with the longitudinal gradient in temperature. Factor analysis also revealed close association with pH. High photosynthetic activity of algal communities consumes carbon dioxide and raises the water pH (Chandra et al. 2014). Nutrients showed weak negative loadings on this factor, which is due to the conversion of nutrients into phytoplankton biomass (Table 3).

Factor 3 has high loadings of Cu and Fe along with moderate loadings of COD, Cd, ammonia, orthophosphates, and Ni. Source profile of factors 3 showed high scores in urban midstream (cluster-3), which showed the role of urbanization in contribution of those physicochemical parameters associated with factor 3 (Table 3; Fig. 5). In freshwater, copper form non-labile soluble complexes with Fe, which has mainly origin of parent rock material, whereas ammonia and chlorides also form stable ligands with copper. Concentrations of Fe and Cu were particularly high during post-monsoon as determined through temporal DFA analysis. It was discussed earlier that intense monsoon precipitation transports various contaminants from terrestrial ecosystem to aquatic ecosystem through surface runoff. Hence, Fe and Cu also could be contributed to Soan basin through surface runoff. On the basis of aforementioned reasons, the factor 3 could be termed as urban runoff factor. Livestock farming, leachate from dumps of both industrial and domestic solid waste, and road runoff constitute the urban runoff. In urban runoff, Cu mainly comes from domestic wastewater, combustion process, corrosion, leaching of plumbing, fungicides, wood preservatives, and antifouling agents. Moderate loadings of COD and NH3 indicate the presence of organic matter in association with other contaminants (Ndungu et al. 2014).

High loadings of nitrates on the fourth factor indicate the role of agriculture runoff. Agrochemical runoff, industrial effluents, municipal landfills, livestock waste, and atmospheric deposition constitute the major sources of nitrates in surface water (Kotti et al. 2005; Chen et al. 2013). Factors 4 represents the impact of agriculture activities on water quality and showed the high scores of suburban part of midstream and downstream sites, which have agriculture as main catchment land use (Table 3; Fig. 5). In River Soan, nitrates are mainly added by agrochemicals and regional geology through diffuse sources, whereas sewage and industrial waste partly contribute to nitrate loadings of river (Jalil and Khan 2012). Pb and Cr showed moderate to weak loadings on this factor, which showed the influence of vehicular emission. These findings are consistent with the previous studies in the region which have reported Pb deposition in water at sites located in close vicinity of roads (Ahmed 1998; Mehmood et al. 2013). Abbasi et al. (2013) has reported the common sources of Pb in this region are heavy traffic, industries, and refineries. Atmospheric fallout from vehicular emissions is usually the most important source of Pb in the freshwaters (Qadir et al. 2008, 2013; N’ Diaya et al. 2013). Car washing activities in streams and rivers are prominent anthropogenic activities in a suburban part of midstream of River Soan, which also could contribute Pb to freshwater ecosystem.

Turbidity has high loadings on fifth factor and showed the impact of sediment loadings in altering the water quality. High scores for this factor were noted mainly at suburban midstream sites included in cluster-2 (Table 3; Fig. 5). Human settlements and livestock farming are major anthropogenic activities that were performed either in riparian zone or in close proximity of the river at these sites. Grazing in riparian zone is an important factor to increase turbidity either through the bank erosion or in-stream disturbances by grazing animals. Weak loadings of COD and nitrates on factor 5 could be attributed to animal waste associated with grazing phenomenon (Wan et al. 2014). Moreover, the poultry waste at suburban part of midstream and domestic sewage from rural areas constitute an additional source of pollution (Yan et al. 2015). Construction activities in new housing societies and dumping of construction waste along the river bank also might be responsible for fluvial transport of sediments (Babar et al. 2014). Moderate association of COD with turbidity on factor 5 might be attributed to the sewage waste discharge. Sewage waste of these housing societies is drained directly into river without any prior treatment. Because the area is not properly developed and hence the population density is low, consequently the impact of urbanization is not as strong as experienced in urban part. Sixth factor has high loadings of Zn (Table 3; Fig. 5). Galvanizing units operating in the region most likely attributed for release of Zn in the air, which entered freshwaters through atmospheric deposition (Pak EPA 2006).

Sewage discharge along with natural processes, such as mineralization of parent rock material, was attributed for enrichment of river with nutrients, macroelements, heavy metal, and organic pollution. Surface runoff from urban areas was attributed for adding macroelements, organic pollution, and heavy metals. Agriculture runoff has appeared to be main role in adding nutrients to the river; however, heavy metals and organic pollution also might contribute to agriculture activities. Livestock interruptions and bank erosion have a main role in adding nutrients and increasing organic load of the river. Industrial activities seemed to be affecting water quality mainly by adding heavy metals.

Conclusion

Spatiotemporal variation in water of River Soan was analyzed using multivariate analysis. Results revealed that spatial heterogeneity in water quality is primarily related to anthropogenic activities concomitant to urbanization. Temporal variations in water quality showed the influence of monsoon precipitation, which mobilizes different contaminant from terrestrial environment to regional water bodies. Multivariate techniques identified six main sources for causing variations in water quality, which includes mineralization of parent rock material, sewage waste, overgrowth of phytoplankton, urban and agrochemical runoff, physical disturbances at river channels, riparian zone, and at its catchments and industrial activities.

To address the issues linked with water-quality deterioration, the installation of wastewater treatment plants is inevitable. A wastewater treatment plant for treatment of wastewater from Islamabad was made functional in 2009, a year subsequent to duration that the study was conducted. This treatment plant is provided with primary treatment facility for sewage wastewater from different sectors of Islamabad. Wastewater is passed through settling and aeration tanks before its discharge into mainstream of Nallah Lei. The performance of the treatment plant has been severely affected by discontinuous electric power supply. Iram et al. (2013) have reported water quality data of Nallah Lei after the installation of the treatment plant. The concentrations of Ca (75 mg/L) and sulphates (20 mg/L) measured at Nallah Lei by Iram et al. (2013) was lower than present study (Ca: 113 mg/L and sulphates: 61 mg/L), whereas no considerable change was noted for other parameters, such as Pb, Cr, Zn, specific conductivity, and nitrates. To restore water quality, installations of more treatments plants are needed particularly at those sites where sewage drains fall in river mainstream. Furthermore, restoration of riparian buffer zones along the river and effective solid management practices also are needed to cope with diffuse sources of pollution. Likewise livestock farming also needs effective measures like proper handling of silage and slurry and effective utilization of livestock waste as manure.