Introduction

Rivers play an important role in controlling the hydrological cycle (Garrels et al. 1975). Rivers carry dissolved or suspended elements and deposit them at different locations (Gaur et al. 2005; Yao and Xu 2013). Several workers (Riedel et al. 2000; Koklu et al. 2010; Akbal et al. 2011; Wan et al. 2013) have assessed the water quality in different rivers. Multivariate statistical techniques such as factor analysis (FA), principal component analysis (PCA), cluster analysis (CA), multiple linear regression analysis (MLR) and artificial neural networks (ANN) are effective analytical tools for water quality assessment (Kuppusamy and Giridhar 2006; Wu and Wang 2007; Chau and Muttil 2007; Chen and Mynett 2006; Alkarkhi et al. 2009). PCA and CA are helpful in the identification of factors that affect the quality of water (Shrestha and Kazama 2007). MLR is used to find out the relationship between one dependent and two or more independent variables. Remote sensing (RS) has been used for monitoring and assessment of water quality by various workers (Chopra et al. 2001; Choubey 1992; Kloiber et al. 2002; Nelson et al. 2003). Studies by Alparslan et al. (2007) and Dewidar and Khedr (2001) have shown that DO, pH, total nitrogen, total phosphorus, Na, K and salinity show a good relationship with radiance data from the satellites. Mabwoga and Thukral (2014) reported land use changes in Harike wetland over a span of 21 years. The present study is aimed at physiochemical characterization of water from river Beas between the towns of Beas and Harike in Punjab at four chosen sites and three seasons. Water quality parameters were correlated with reflectance values of bands 2 (green, G), 3 (red, R) and 4 (near infrared, NIR). The study finds its importance in the fact that Platanista gangetica minor R., a freshwater river endangered dolphin, is found only in this river (Khan 2013). Recently, about 12 dolphins were sighted in this river (Rana 2013).

Study area

River Beas originates in the Himalayas in central Himachal Pradesh, India, at 31.51′ N lat., 77° 05′ E long. and altitude 2050 m above sea level, and flows for about 470 km to merge with river Sutlej. The study area comprised a 63-km stretch of the river between the towns of Beas and Harike. The river on its either side has wild growth followed by agricultural fields. The villages around the river dump their sewage and solid wastes into the river. On the basis of approachability to the river, samples were collected from the river adjoining the following towns (Fig. 1):

Fig. 1
figure 1

Location of study area

  1. 1.

    Beas (31.510′ N, 75.305′ E and 211 m asl): Located in district Amritsar, the town is the headquarters of a religious sect, Radha Soami. Every year, millions of pilgrims come to Beas.

  2. 2.

    Kishanpura (31.409′ N, 75.189′ E and 219 m asl): Located in the Kapurthala district, it is a major agriculture area in this region.

  3. 3.

    Goindwal Sahib (31.376′ N, 75.162′ E and 217 m asl): Located in the Tarn Taran district, a thermal power plant is located on the banks of river Beas.

  4. 4.

    Harike (31.150′ N, 74.951′ E and 210 m asl): Harike is located at the confluence of the rivers Beas and Sutlej. It has its boundaries with Amritsar, Ferozepur and Kapurthala districts. It is one of the international Ramsar wetlands and has a wildlife sanctuary.

Materials and methods

Satellite imagery

Landsat (TM) data dated 7 June 2013, 14 October 2013 and 26 February 2014 were obtained from the U.S. Geological Survey (USGS) (http://glovis.usgs.gov/). Erdas Imagine 11 and Arc GIS 9.3 software were used for the processing of the images. An eTrex Garmin Global Positioning System (GPS) receiver was used to determine the geo-coordinates of the study sites. The images were geometrically rectified with reference to the ground control points collected from the field with the help of a GPS receiver. Geo-coordinates along the river were recorded and converted into shape files using Erdas Imagine software from which a base map was prepared. The image was subset to extract the area of study which was used for further analyses. Reflectance values from three bands (G, R and NIR) of the satellite imagery were extracted from the image. The wavelengths of the bands used for the study were G (0.53–0.61 μm), R (0.63–0.69 μm) and NIR (0.78–0.90 μm). The other specifications of the images were spatial resolution (30 m), temporal resolution (16 days), radiometric resolution (8 bits) and swath width of 185 km.

Water sampling and analysis

Water samples were collected on May 30, 2013, October 10, 2013 and February 20, 2014 between 8 am and 11 am from the sites in triplicate. The collection, preservation and analysis of the water samples were carried out as prescribed in the standard methods (APHA 2005). pH was tested using a SDFCL micro pH Analytica pH meter. The temperature of the river water was measured in situ at a depth of 5 cm. Turbidity was measured with a portable turbidity meter (PC compact turbidity meter, Aqualytic). Electrical conductivity was determined by conductivity meter (Systronics, Model 304). TDS was determined by HM digital portable TDS meter. BOD was determined by 5-day BOD test at 20 °C. COD was estimated by using the closed reflux titrimetric method. DO was determined by azide modification technique. NO3 −1 was determined using spectrophotometric method at 220 nm (Thermo, Genesys 10 UV). Hardness and alkalinity were determined by EDTA titrimetric method and titration method respectively. Cl−1 determination was done by argentometric method. Na and K were estimated using (Systronics flame photometer-128). Ca and Mg were determined by using EDTA titration method. Water samples were fixed with HNO3 and filtered with Whatman #1 filter paper for heavy metal analysis using atomic absorption spectrophotometery (Agilent Technologies 200 -Series AA).

Multivariate statistical methods

One-way and two-way analysis of variance (ANOVA) and Tukey’s multiple comparison test were applied to find honestly significant difference (HSD at p ≤ 0.05), for 25 water quality parameters (Beaver et al. 2012; Sokal and Rohlf 1995). The null hypothesis tested was that at least two means are significantly different from each other. The heavy metals which were not detected or had very low concentrations were not used for multivariate analysis. Cluster analysis for 21 water quality parameters was performed using Euclidian distance as a measure of distance. Principal component analysis for 19–23 water quality parameters for pre-monsoon, post-monsoon and winter seasons was performed to determine the principal components which explain maximum variance of the data. In factor analysis, new factors which cause variations in the physiochemical parameters (11) and metal concentration (4) were extracted by varimax rotation of the PCA.

Multiple linear regression analysis was used to determine the effect of two or more independent variables on a dependent variable. β-regression coefficients are unitless partial regression coefficients which give the relative effects of different independent variables on a dependent variable. The artificial neural network consists of an input layer, one or more hidden layers of neurons and an output layer. The ANN model was developed between water quality parameters and reflectance values in bands 2 (G), 3 (R) and 4 (NIR) using the Statistica-12 software. The number of neurons was selected by the software when the maximum variance was explained. It is generally 70 to 90 % of the size of the input layer (Boger and Guterman 1997). The options selected were multilayer perceptron, 1000 iterations, ten neurons and tanh transfer function, both at the input layer and the hidden layer. ANN models were also developed between BOD, turbidity and water quality parameters (Fig. 2). Software used for calculations was Microsoft Office Excel-2007, Statistica-12, Minitab-14, PAST and self-coded software.

Fig. 2
figure 2

Artificial neural network

Results and discussion

Tables 1, 2 and 3 give water quality parameters in the river water samples collected from different sites. Differences among the means of BOD and pH were found to be significant for all the three seasons. The pH of the water was neutral to slightly alkaline. Variations in Cl−1 were found to be significant during the post-monsoon and pre-monsoon seasons. Mg levels were significantly different between the winter and pre-monsoon seasons. Na and Mn differences were found to be significant during the post-monsoon and pre-monsoon seasons. Turbidity, NO3 −1, hardness, Ca and Cr showed significant variations during the post-monsoon season. Concentration of K varied significantly during the winter season, whereas conductivity and COD showed significant differences in the means of samples during the pre-monsoon season. Two-way ANOVA between different sites and seasons revealed significant interaction between the sites and the seasons for pH, temperature, turbidity, BOD, NO3 −1, hardness, Cl−1, Na, K, Ca, Mg, Mn, Cr and Co, implying that the trends of variation of parameters change with site and season (Table 4).

Table 1 One-way ANOVA and multiple comparison test for different water quality parameters from different sites of river Beas for the pre-monsoon season
Table 2 One-way ANOVA and multiple comparison test for different water quality parameters from different sites of river Beas for the post-monsoon season
Table 3 One-way ANOVA and multiple comparison test for different water quality parameters from different sites of river Beas for the winter season
Table 4 Two-way ANOVA for water quality parameters of river Beas

Our results are in accordance with Moza and Mishra (2007) who analysed pH, DO, Cl−1 and PO4 −3 in water from river Beas at Beas bridge and the Harike lake. The results show slight variations from those of Central Pollution Control Board, New Delhi (2011) for pH, conductivity, DO and BOD in water from river Beas. Increase in dissolved and suspended solids in water increase the biological and chemical oxygen demands (Jonnalagadda and Mhere 2001). Wastewater possesses high concentration of Cl−1 than raw water because NaCl, the common component of human diet, passes unchanged through the digestive system (WHO 2006). In rivers, the lower content of DO may be due to the microbial activity in water (Palma et al. 2010). BOD is the amount of oxygen required for aerobic bacteria for the oxidation of organic matter present in the waste and is a part of COD (Cieszynska et al. 2012). Large amounts of biodegradable organic compounds affect the quality of surface waters because the organisms use the available DO for breakdown of wastes in the water, and as a result, the supply of oxygen in the water is reduced, thereby affecting the overall quality of water (Carpenter et al. 1998).

The multivariate analysis of water quality parameters of river water has been used for the first time in this region. In cluster analysis, the data from the post-monsoon season formed one group and data from winter and pre-monsoon seasons formed another group. Our studies on cluster analysis find support from Gnanamoorthy et al. (2013) who worked on Parangipettai, southeast coast of India, and observed that each season forms a separate group. PCA generates the number of principal components equal to the number of variables. In the present study, the first three principal components explained 100 % of the total variance for all the seasons (Table 5). COD and alkalinity had maximum loadings during the pre-monsoon season, whereas turbidity, COD and alkalinity had maximum loadings during the post-monsoon season. In the winter season, maximum loadings were on turbidity, COD and Na. Results of factor analysis are presented in Table 6. Factor 1 accounted for 46 % of the total variance and had strong positive loadings on conductivity, DO, PO4 −3 and hardness, and strong negative loadings on BOD, COD and Cl−1. Since all these parameters are pollution related from sewage and domestic wastewater, the first factor is largely a pollution loading factor leading to eutrophication. Factor 2 accounted for 24 % of the total variance and had strong loadings on TDS, alkalinity and turbidity due to the natural quality of water and industrial pollutants. Factor 3 accounted for 16 % of the total variance and had strong positive loadings on NO3 −1. This factor indicates the influence of NO3 −1 fertilizers used in agriculture dominated the catchment area of the river. In case of metal analysis, factor 1 accounted for 53 % of the total variance and had strong loadings on Na, K and Mg. This factor indicates the influence of domestic pollutants. The contribution of Na and K to this factor may be attributed to cation exchange processes in the soil-water interface (Guo and Wang 2004). Factor 2 accounted for 41 % of the total variance and had strong loadings on Ca.

Table 5 Variance explained in principal component analysis of different water quality parameters of river Beas
Table 6 Factor analysis of different water quality parameters of river Beas

Multivariate analysis of spectral radiance in relation to water quality parameters revealed that the reflectance values in three bands, 2 (G), 3 (R) and 4 (NIR), are governed by two factors (Table 6). Factor 1 had maximum loadings on bands 2 (G) and 3 (R), and this factor is mainly due to abiotic components. Factor 2 had maximum loadings on band 4 (NIR), and this factor may be attributed to green algae and other plants present in the water. Mabwoga et al. (2010) worked on water quality parameters in Harike wetland, and reported that turbidity shows correlation with the reflectance data. Our findings are in conformation with Coskun et al. (2008) on drinking water of Omerli reservoir. PO4 −3 is an indicator of pollution which enters into the river through domestic wastewater and causes eutrophication. Turbidity may be caused by two factors, i.e., sand and silt carried by water from the mountains and sewage and industrial effluents discharged into the river. TDS mainly consists of inorganic salts such as Na, K, PO4 −3, NO3 −1, Ca, Mg, SO4 −2, Cl−1, etc. and lesser amount of organic matter. MLR was done between the reflectance values in bands 2 (G), 3 (R) and 4 (NIR) and water quality parameters PO4 −3, turbidity and TDS (Table 7). These parameters were selected on the basis of PO4 −3 in sewage discharged into the river, turbidity due to physical and biological phenomena, and TDS due to industrial and agricultural water discharges. It was found from the multiple and β-regression analyses that reflectance in band 2 (G) is decreased due to the TDS content of water, and enhanced due to PO4 −3 content of water. Turbidity does not have much effect on reflectance in band 2 (G). Band 3 (R) also gives results similar to band 2 (G). In band 4 (NIR), turbidity decreases the spectral radiance. PO4 −3 increases the spectral radiance of all the bands. PO4 −3, turbidity and TDS were also regressed on BOD. It was found from multiple and β-regression analysis that the value of BOD is enhanced due to PO4 −3 content of water, whereas turbidity and TDS decreased the value of BOD. The ANN model fitted well with the simulated and the observed data. The correlations between target and output values from ANN for BOD (validation: r = 0.926), turbidity (validation: r = 0.995) and reflectance values in bands 2 (G) (validation: r = 0.999), 3 (R) (validation: r = 0.998) and 4 (NIR) (validation: r = 0.999) were highly significant (p < 0.001).

Table 7 Multiple linear regression between reflectance values or BOD and independent variables

Conclusions

The results suggested that domestic, industrial and agricultural activities have significant effect on the water quality parameters in river Beas. The water quality of the river deteriorates as it flows down from Himachal Pradesh to Punjab. Parameters having significant influence are BOD, DO, COD, TDS, turbidity, conductivity, hardness, PO4 −3, NO3 −1, Cl−1, Ca, Na and K. In the present study, reflectance values in FA revealed two underlying factors. Factor 1 comprised the abiotic components, and factor 2 comprised green algae and other plants present in the water. Cluster analysis showed that water quality parameters of each season form a separate group. PO4 −3 contributed positively to the reflectance values of all the bands, whereas TDS contributed negatively. PO4 −3 contributed positively to the BOD, whereas turbidity and TDS contributed negatively. ANN can be used to stimulate water quality parameters using reflectance values. The study warrants that appropriate water pollution prevention measures should be taken to treat the domestic and industrial wastewater, before it flows down to the river. This will help in protecting the endangered biodiversity of the river and the Harike wetland.