Abstract
The lower course of the Damodar River in West Bengal is one of the most polluted stretches in the Ganga River basin. Research covering the whole course of the Damodar is lacking, and parameter-level analysis has received little attention. Eleven monitoring sites, chosen on the basis of potential pollution sources, were studied over 6 years (2014–2019). Multivariate statistical techniques (factor analysis (FA), cluster analysis (CA), and discriminant analysis (DA)) are used to evaluate the spatial and temporal variation of Damodar River water quality across 24 parameters. Factor analysis extracts the most influential seasonal parameters, and stepwise DA identifies ammonia, DO, potassium, temperature, total coliform, TFS, and turbidity as the parameters most responsible for seasonal variation in water quality. CA classifies the sampling stations into three groups, which helps to identify the spatial variation of water quality. Stepwise DA identifies ammonia, BOD, calcium, chloride, conductivity, DO, sodium, sulfate, temperature, alkalinity, TDS, hardness, TSS, and turbidity as the most influential variables for spatial variation. The monsoon season shows a higher pollution level owing to contributions from both point and non-point sources. The middle course is more polluted because of high-density urban areas and large-scale industries. The Canadian Council of Ministers of the Environment (CCME) water quality index (WQI) is used to assess water quality on temporal and spatial scales, and the resulting pattern matches the results of the multivariate analysis. Water quality is poor at all sampling sites in all seasons.
Introduction
Water is a lifeline for the development of civilization. Untreated discharge from point and non-point sources contaminates river water. Natural and anthropogenic sources such as weathering, agricultural runoff, minerals, and effluents from municipalities and industries are responsible for this pollution (Grzywna & Bronowicka-Mielniczuk, 2020; Hajigholizadeh & Melesse, 2017; Zhang et al., 2020; Zhou et al., 2007). A river is thus one of the most vulnerable components of the environment: it can easily be degraded by careless human activities, which in turn threaten human lives. The annual reports published by the West Bengal Pollution Control Board (WBPCB) show clear seasonal and site-to-site variation in water quality. Characterizing this variation together with the potential sources of pollution gives a detailed account of a river that helps to maintain water quality and to manage polluted stretches properly (Bhat et al., 2018; Grzywna & Bronowicka-Mielniczuk, 2020; Li et al., 2009; Platikanov et al., 2019; Salim et al., 2019; Singh et al., 2004; Varol, 2020; Varol & Şen, 2009; Zhong et al., 2018).
Multivariate statistical analysis is widely used for the spatial and temporal analysis of water quality (Simeonov et al., 2003; Singh et al., 2004; Varol & Şen, 2009; Vega et al., 1998; Xiaolong et al., 2010). A water quality index (WQI) is a mathematical method that determines water quality by combining multiple variables and transforming them into a single value (Akkoyunlu & Akiner, 2012; Lkr & Neizo, 2020; Sharma & Kansal, 2011; Zeinalzadeh & Rezaei, 2017). Multivariate statistical techniques and the WQI complement each other: together they identify pollutants, specific patterns of change in pollution levels, and overall water quality.
Rapid deterioration of river water quality has been a significant environmental concern in recent years. The Damodar River in West Bengal, like many other Indian rivers, passes through industrially and agriculturally developed areas. The Damodar is the water source for this area and one of the most important tributaries of the lower Ganga (Hooghly). This stretch is one of the polluted river stretches (category I) in West Bengal, India (CPCB, 2017). Earlier studies were site-specific and analyzed the effects of anthropogenic sources and pollutants on habitat. This study demonstrates the alteration of water quality on temporal and spatial scales. Assessing a large quantity of data with multiple variables presents particular challenges. Multivariate techniques such as factor analysis, cluster analysis, and discriminant analysis are required to present such data in an interpretable form (Bengraı̈ne & Marhaba, 2003; Chatterjee et al., 2010; Helmreich, 2015; Kotti et al., 2005; Reghunath et al., 2002). A comprehensive study of the Damodar River has therefore been carried out using multivariate statistics to identify the variables responsible for water pollution, the seasonal and spatial variation of water quality, and the pollution sources, and to estimate water quality with the WQI.
Materials and methods
Study area
The Damodar River basin is a sub-basin of the Ganga basin and extends across Jharkhand and West Bengal (WB). The Damodar is one of the major rivers of the south Gangetic plain in West Bengal (Fig. 1). It flows for 260.48 km through the Purba Bardhaman and Paschim Bardhaman districts of West Bengal. Paschim Bardhaman is a predominantly industrial and urbanized district. It is located in the west of West Bengal, between 22°27′ and 23°49′ North and between 86°48′ and 87°55′ East. The average slope of the basin is 2.34°. The basin is composed of sandstone and shale (Gondwana formation), laterite (Tertiary period), and alluvial formations (Mondal et al., 2018).
Agricultural, residential, industrial, and mining areas are the dominant land-use categories in the Damodar River basin. The Durgapur and Asansol municipal corporations, municipalities such as Kulti, Burdwan, Jamuria, and Raniganj, and many small census towns are situated along the river. The Asansol-Kulti township extends along 36 km of the upper reach of the Damodar River (DCO, 2011). Four sampling stations are located along this stretch: S1 (Barakar), S2 (Dishergarh), S3 (Asansol), and S4 (IISCO). Durgapur Municipal Corporation encompasses a 16.5-km stretch of the river, with sampling stations S9 (Durgapur) and S10 (Mujhermana). Four sampling stations lie in the middle: S5 (Narainpur), S6 (Raniganj), S7 (Andal US), and S8 (Andal DS). Station S11 (Burdwan) is located in the lower reach of the Damodar.
Site S1 receives outlets from the Kulti industrial and residential areas. Site S2 receives water from a tributary near the state of Jharkhand. Sites S3 and S4 are located near congested urban areas and large-scale industrial clusters. Site S5 receives a drainage outlet from the Asansol residential area. Sites S6, S7, and S8 are located in residential areas and receive sewage. Sites S9 and S10 are located in a highly congested area of industrial and residential clusters. Site S10 receives treated and untreated effluents from industries and municipalities in the Durgapur region, which drain into the Tamla Nala (drain) and finally join the Damodar River (Mukhopadhyay & Mukherjee, 2013). Site S11 is located in the lower reach of the Damodar River in the Burdwan municipality region. Industrial clusters, urbanized residential areas, and the outlets from these centers are shown in Fig. 2. Most of the industries fall in the red category (the most polluting). These districts are also referred to as the rice bowl of West Bengal and are therefore among the most agriculturally productive regions. The maps were generated in the ArcGIS environment.
Monitoring sites
This study is based on data collected by the WBPCB (West Bengal Pollution Control Board) under the Central Pollution Control Board (CPCB) of India. The WBPCB selected a total of 11 sampling sites (Fig. 1). The selection of sites is based on the potential sources of pollution (Guidelines for water quality monitoring). All sites are concentrated around industrial sites or municipal areas. The Central Pollution Control Board describes the methods and sampling procedure (Guide Manual: Water and Wastewater analysis). Data were collected monthly from 2014 to 2019 at all 11 stations. Of the 27 parameters analyzed, 24 are used to determine changes in water quality. The other three parameters (boron, phenolphthalein alkalinity, and total Kjeldahl nitrogen) were below the detection level or nil in most samples. The measured parameters are ammonia-N (ammonia), biological oxygen demand (BOD), calcium, chloride, chemical oxygen demand (COD), conductivity (Cond), dissolved oxygen (DO), fecal coliform (F.Coliform), fluoride, magnesium, nitrate-N (nitrate), pH, phosphate, potassium, sodium, sulfate, temperature in °C (Temp.), total alkalinity (alkalinity), total coliform (T.Coliform), total dissolved solids (TDS), total fixed solids (TFS), total hardness (hardness), total suspended solids (TSS), and turbidity.
Twenty-four water quality parameters from the eleven sampling locations are analyzed and grouped into three seasons to examine the temporal variation of pollution load from 2014 to 2019. Factor analysis is used to identify the most influential of the 24 parameters in each of the three seasons. Spatial variation of pollution load is analyzed through cluster analysis. Only a few of the 24 parameters are crucial for seasonal and spatial variation, and discriminant analysis is conducted to distinguish those variables. The WQI is used to assess overall water quality on seasonal and spatial scales. The detailed research design is shown in Fig. 3.
Rainfall pattern
Rainfall data for the Damodar River basin are extracted from the interpolated raster map (Pai et al., 2014) obtained from the India Meteorological Department (IMD). Monthly rainfall data are extracted and summarized by season from 2014 to 2019 (Table S1). Rainfall is one of the controlling factors of pollution load because it carries pollutants through surface runoff. Table S1 gives the mean rainfall and standard deviation (SD) in mm. For the analysis, a year is divided into three seasons: pre-monsoon (March to May), monsoon (June to September), and post-monsoon (October to February). Discharge declines from the monsoon to the pre-monsoon, and the lowest values prevail during the months from October to February (Bhattacharyya, 2011).
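The seasonal grouping described above can be illustrated with a short Python sketch. The month boundaries are the ones stated in the text; the function names and the rainfall values in the test are hypothetical and are not taken from Table S1.

```python
from statistics import mean

# Month-to-season mapping as defined in the text:
# pre-monsoon (March-May), monsoon (June-September), post-monsoon (October-February).
SEASON_BY_MONTH = {
    3: "pre-monsoon", 4: "pre-monsoon", 5: "pre-monsoon",
    6: "monsoon", 7: "monsoon", 8: "monsoon", 9: "monsoon",
    10: "post-monsoon", 11: "post-monsoon", 12: "post-monsoon",
    1: "post-monsoon", 2: "post-monsoon",
}


def season_of(month):
    """Return the season name for a calendar month number (1-12)."""
    if month not in SEASON_BY_MONTH:
        raise ValueError(f"month must be 1-12, got {month!r}")
    return SEASON_BY_MONTH[month]


def seasonal_means(monthly_records):
    """Average (month, value) records by season.

    `monthly_records` is an iterable of (month_number, value) pairs,
    for example monthly rainfall in mm (illustrative input format).
    """
    grouped = {}
    for month, value in monthly_records:
        grouped.setdefault(season_of(month), []).append(value)
    return {season: mean(values) for season, values in grouped.items()}
```

The same mapping could be applied to the monthly water quality records before running the seasonal factor and discriminant analyses.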
Statistical techniques
Factor analysis
Factor analysis is one of the most common and useful methods for multidimensional data and has been used in many water quality analyses (González et al., 2014; Kükrer & Mutlu, 2019; Mutlu, 2019; Ouyang et al., 2006; Singh et al., 2004). It transforms the original variables into a few latent variables without compromising the original character of the data. Each factor is a latent variable that carries as much variance as possible and bears some unique characteristics. As many factors are generated as there are input variables, and those with an eigenvalue greater than 1 are retained. Factor analysis also produces a uniqueness value for each variable, which is the share of that variable's variance that the common factors cannot explain. A "varimax" axis rotation makes the output factor loadings easier to read. The factor loadings can be classified as strong (> 0.75), moderate (0.5–0.75), and low (< 0.50) (Liu et al., 2003).
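The retention rule and loading classes described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the eigenvalues and loadings are assumed to come from an existing factor analysis, the function names are hypothetical, and the uniqueness formula (one minus the sum of squared loadings) holds for orthogonal rotations such as varimax.

```python
def retained_factors(eigenvalues):
    """Indices of factors kept by the eigenvalue-greater-than-1 rule."""
    return [i for i, value in enumerate(eigenvalues) if value > 1.0]


def explained_variance_pct(eigenvalues, retained):
    """Percentage of total variance carried by the retained factors."""
    return 100.0 * sum(eigenvalues[i] for i in retained) / sum(eigenvalues)


def loading_strength(loading):
    """Classify a loading by absolute size, using the thresholds quoted in the text."""
    size = abs(loading)
    if size > 0.75:
        return "strong"
    if size >= 0.50:
        return "moderate"
    return "low"


def uniqueness(loadings_of_variable):
    """Variance of one variable left unexplained by the retained orthogonal factors."""
    return 1.0 - sum(value * value for value in loadings_of_variable)
```

For example, a variable with loadings of 0.6 and 0.3 on two retained factors would have a uniqueness of about 0.55, meaning that more than half of its variance is not captured by those factors.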
Cluster analysis
Cluster analysis is a multivariate technique that performs the grouping of sampling stations depending on the similarity of the pollution load. The clusters show high homogeneity within-cluster and high heterogeneity between clusters (Hair et al., 2010). It has widely been used in many studies (Alberto et al., 2001; Chang, 2005; Hajigholizadeh & Melesse, 2017; Simeonov et al., 2003; Singh et al., 2004; Vega et al., 1998).
We used agglomerative hierarchical clustering, in which each observation starts as its own cluster and clusters are merged successively until one large cluster contains the whole set of observations (Maechler et al., 2005). Data are standardized before clustering. Euclidean distance is used to calculate the distance among stations, and the stations are clustered using Ward's minimum variance method. The Hopkins statistic (Lawson & Jurs, 1990) is used to determine the suitability of the data for cluster analysis.
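The clustering procedure described above (standardization, Euclidean distance, Ward's minimum variance criterion) can be illustrated with a small pure-Python sketch. It is not the R routine used in the study: it is a minimal implementation based on the Lance-Williams update applied to squared Euclidean distances, with sample standard deviations assumed for the z-scores and hypothetical function names.

```python
import math


def standardize(rows):
    """Z-score every column (sample SD) so that no parameter dominates the distance."""
    n = len(rows)
    if n < 2:
        raise ValueError("need at least two observations")
    columns = list(zip(*rows))
    means = [sum(col) / n for col in columns]
    sds = [math.sqrt(sum((v - m) ** 2 for v in col) / (n - 1))
           for col, m in zip(columns, means)]
    return [[(v - m) / s if s > 0 else 0.0
             for v, m, s in zip(row, means, sds)] for row in rows]


def ward_clusters(rows, k):
    """Merge observations with Ward's criterion until k clusters remain.

    Returns a list of sets of row indices, ordered by their smallest index.
    """
    if not 1 <= k <= len(rows):
        raise ValueError("k must be between 1 and the number of observations")
    z = standardize(rows)
    clusters = {i: {i} for i in range(len(z))}
    # Squared Euclidean distance between every pair of singleton clusters.
    dist = {(a, b): sum((x - y) ** 2 for x, y in zip(z[a], z[b]))
            for a in clusters for b in clusters if a < b}
    next_id = len(z)
    while len(clusters) > k:
        a, b = min(dist, key=dist.get)          # closest pair under Ward's criterion
        d_ab = dist.pop((a, b))
        n_a, n_b = len(clusters[a]), len(clusters[b])
        merged = clusters.pop(a) | clusters.pop(b)
        for c in clusters:
            n_c = len(clusters[c])
            d_ac = dist.pop((min(a, c), max(a, c)))
            d_bc = dist.pop((min(b, c), max(b, c)))
            # Lance-Williams update for Ward's minimum variance method.
            dist[(c, next_id)] = ((n_a + n_c) * d_ac + (n_b + n_c) * d_bc
                                  - n_c * d_ab) / (n_a + n_b + n_c)
        clusters[next_id] = merged
        next_id += 1
    return sorted(clusters.values(), key=min)
```

Each row would hold the parameter values of one station (for instance station means over the study period), so the returned index sets correspond to groups of stations with similar pollution load.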
Discriminant analysis
Factor analysis extracts the low-dimensional factors that represent most of the variance of a multivariate dataset. Discriminant analysis (DA), in contrast, divides the dataset into the best possible groups. It is also called a supervised pattern recognition model because it uses multiple explanatory variables to predict a categorical response variable. DA assumes that all classes are linearly separable by hyperplanes defined by the explanatory variables. The number of hyperplanes depends on the number of groups. This study is based on three seasons, so two hyperplanes are generated to classify the data. Each hyperplane passes through the midpoint between the group means and is calculated from the individual sample covariance matrices. The discriminant function has the form presented in Eq. (1) (Alberto et al., 2001).
\(f\left({G}_{i}\right)={k}_{i}+\sum_{j=1}^{n}{w}_{j}{p}_{j}\) (1)
where i is the number of groups (G; three in the temporal analysis), ki is the constant inherent to each group, n is the number of analytical parameters, and wj is the weight coefficient that DA assigns to each selected parameter (pj).
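As an illustration of how a discriminant function of this form (a group constant plus a weighted sum of the selected parameters) is used for classification, the following Python sketch scores one sample against every group and assigns it to the group with the highest score. The constants and weights are hypothetical placeholders, not the coefficients estimated in this study, and the sketch assumes that each group has its own set of weights.

```python
def discriminant_scores(sample, constants, weights):
    """Score one sample for every group: constant plus weighted sum of parameters.

    sample    -- {parameter: measured value}
    constants -- {group: constant k for that group}
    weights   -- {group: {parameter: weight w for that parameter}}
    """
    return {
        group: constants[group]
        + sum(w * sample[name] for name, w in weights[group].items())
        for group in constants
    }


def classify(sample, constants, weights):
    """Assign the sample to the group with the highest discriminant score."""
    scores = discriminant_scores(sample, constants, weights)
    return max(scores, key=scores.get)


# Hypothetical coefficients for illustration only.
CONSTANTS = {"pre-monsoon": -10.0, "monsoon": -12.0, "post-monsoon": -8.0}
WEIGHTS = {
    "pre-monsoon": {"temperature": 0.5, "turbidity": 0.02},
    "monsoon": {"temperature": 0.4, "turbidity": 0.2},
    "post-monsoon": {"temperature": 0.3, "turbidity": 0.01},
}
```

With these placeholder values, a warm and turbid sample (temperature 30, turbidity 50) scores highest for the monsoon group, which mirrors the kind of seasonal assignment summarized later in the confusion matrix.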
DA is performed in standard and stepwise modes to select the variables that contribute significantly to maximizing the distance between the group means. DA is carried out for both the temporal and the spatial analysis, with seasons and clusters as the response variables and the observed water quality parameters as the explanatory variables. Model performance is shown through the confusion matrix. SPSS software is used for the discriminant analysis, and R software is used for the other statistical analyses and for graphical representation.
Water quality index
The Canadian Council of Ministers of the Environment (CCME) developed a water quality index (WQI) that determines water quality from multiple variables. Its essential feature is flexibility in choosing water quality parameters according to requirements and availability. The index is based on three elements (CCME, 2017): scope (F1), frequency (F2), and amplitude (F3). The water quality index is expressed as
\(\mathrm{CCME\ WQI}=100-\frac{\sqrt{{F}_{1}^{2}+{F}_{2}^{2}+{F}_{3}^{2}}}{1.732}\); here, the divisor 1.732 normalizes the result to a range of 0–100.
where F1 is the percentage of parameters that fail to meet the water quality standard relative to the total number of parameters, F2 is the percentage of failed tests relative to the total number of tests, and F3 is an asymptotic function that normalizes the sum of excursions to a range between 0 and 100. Before F3 can be calculated, the excursions and the normalized sum of excursions (nse) must be calculated.
An excursion is calculated by dividing the failed value by the objective when the concentration is greater than the permissible limit, and by dividing the objective by the failed value when the concentration is less than the required minimum permissible limit; in both cases, 1 is subtracted from the ratio.
A minimum of four parameters and four sampling frequencies are required for this WQI. Because the variables and permissible limits can be chosen freely, the WQI is computed here for the CPCB-assigned water category A (drinking water source without conventional treatment but after disinfection). The permissible limits of the Indian standard IS 2296:1992 are used for this analysis. Ten parameters determine the water quality: BOD, chloride, DO, fluoride, nitrate, pH, sulfate, total coliform, TDS, and hardness. The WQI is classified as poor (0–44), marginal (45–64), fair (65–79), good (80–94), and excellent (95–100).
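The calculation can be made concrete with a short Python sketch of the standard CCME procedure: F1 and F2 are percentages of failed parameters and failed tests, each excursion is the ratio of failed value to objective (or the inverse for minimum-type objectives) minus 1, nse is the sum of excursions divided by the total number of tests, and F3 = nse / (0.01 nse + 0.01). The function names and the objective values in the test are illustrative assumptions and are not the IS 2296:1992 limits; measured values are assumed to be positive.

```python
import math


def ccme_wqi(samples, objectives):
    """Compute F1, F2, F3 and the CCME WQI.

    samples    -- list of {parameter: measured value}, one dict per sampling event
    objectives -- {parameter: (minimum or None, maximum or None)}
    """
    tested, failed = set(), set()
    total_tests = failed_tests = 0
    excursion_sum = 0.0
    for sample in samples:
        for name, value in sample.items():
            if name not in objectives:
                continue
            low, high = objectives[name]
            tested.add(name)
            total_tests += 1
            if high is not None and value > high:
                excursion = value / high - 1.0   # value must not exceed the objective
            elif low is not None and value < low:
                excursion = low / value - 1.0    # value must not fall below the objective
            else:
                continue
            failed.add(name)
            failed_tests += 1
            excursion_sum += excursion
    if total_tests == 0:
        raise ValueError("no tests matched the objectives")
    f1 = 100.0 * len(failed) / len(tested)       # scope
    f2 = 100.0 * failed_tests / total_tests      # frequency
    nse = excursion_sum / total_tests            # normalized sum of excursions
    f3 = nse / (0.01 * nse + 0.01)               # amplitude
    # 1.732 is a rounded sqrt(3), so the index is clamped at 0 for the extreme case.
    wqi = max(0.0, 100.0 - math.sqrt(f1 ** 2 + f2 ** 2 + f3 ** 2) / 1.732)
    return {"F1": f1, "F2": f2, "F3": f3, "WQI": wqi}


def wqi_category(wqi):
    """Map an index value to the classes listed in the text."""
    if wqi >= 95:
        return "excellent"
    if wqi >= 80:
        return "good"
    if wqi >= 65:
        return "fair"
    if wqi >= 45:
        return "marginal"
    return "poor"
```

A range-type objective such as pH can be expressed by giving both a minimum and a maximum in the same tuple.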
Results and discussion
Correlation analysis
Pearson correlation analysis was performed (Table 1) to identify significant correlations among the 24 parameters. COD has a significant positive correlation with ammonia, calcium, conductivity, phosphate, potassium, sodium, sulfate, TDS, TFS, and TSS, which suggests that these constituents share similar sources. COD is most strongly associated with ammonia and TFS, and it is negatively correlated with DO and pH. COD originates from effluent discharged from residential, industrial, and agricultural areas (Bellos & Sawidis, 2005), so an increase in nutrients leads to a decrease in the level of DO. DO has a highly significant positive correlation with pH and is negatively correlated with temperature, turbidity, TFS, and TSS. An increase in temperature intensifies the biological processes that consume oxygen and thereby lowers the DO level in the water (Brandt et al., 2017). TDS and TFS are highly correlated with constituents such as ammonia, chloride, sodium, potassium, sulfate, and alkalinity. The alkalinity of water reflects the amounts of calcium, magnesium, sodium, and potassium, which in turn control the levels of TDS (which contains Ca2+, Mg2+, K+, Na+, SO42−, Cl−, etc.; Çadraku, 2021) and TFS in the water (Brandt et al., 2017). This dissolved load is strongly affected by land washing (Bengraı̈ne & Marhaba, 2003) in the wet (monsoon) season, by drainage from urban areas (Alberto et al., 2001), and by irrigation discharges (Liu et al., 2019). Ions such as sodium, potassium, and magnesium have a strong positive correlation with hardness. These ions control water mineralization (Varol, 2020), and both natural and anthropogenic sources are responsible for their variation.
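For reference, the Pearson coefficient behind a correlation table of this kind can be computed as in the following Python sketch. The function names are illustrative, and the parameter series in the test are made-up numbers rather than values from Table 1.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equally long series."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("series must have the same length of at least 2")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant series")
    return sxy / math.sqrt(sxx * syy)


def correlation_matrix(columns):
    """Pairwise Pearson coefficients for a {parameter name: series} mapping."""
    names = list(columns)
    return {(a, b): pearson_r(columns[a], columns[b]) for a in names for b in names}
```

A coefficient close to +1 between two parameters (for example COD and ammonia) is what the text reads as a shared source, while a coefficient close to −1 (for example COD and DO) indicates an inverse relationship.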
Spatial variation
Ammonia, BOD, and COD concentrations are higher at site S5 (Narainpur) than at the four upstream stations (Table 2). Site S5 is located near a tributary that receives sewage from a vast residential area. Domestic effluents increase the ammonia concentration in water (Brandt et al., 2017; Kotti et al., 2005), and oxidation of ammonia contributes to the increase in COD levels (Gradilla-Hernández et al., 2020). The concentrations gradually decrease from site S6 onward because of the self-purification process (Varol, 2020). Site S10 receives the largest volume of sewage from large industrial clusters and congested residential areas, so the concentration of pollutants is relatively high at this site. Calcium, magnesium, sodium, potassium, TDS, and alkalinity increase from site S1 to site S6, then decrease, and increase again at site S10. Hardness is controlled by the calcium and magnesium concentrations in water and thus follows the same pattern as calcium and magnesium. Nitrate, phosphate, potassium, alkalinity, chloride, conductivity, TDS, TFS, and hardness are significantly higher from site S6 to site S8 because agricultural fields and residential areas are concentrated in this zone. TSS, turbidity, and coliform (both fecal and total) are significantly higher at sites S1 and S2 than at site S3 because of the vast agricultural fields and residential areas.
Cluster analysis is used to detect the variation of pollution content along the river from the upper to the lower reach. The Hopkins statistic (Lawson & Jurs, 1990) gives an H value of 0.236; thus, the null hypothesis is rejected, and the dataset is suitable for cluster analysis. The result is presented as a dendrogram (Fig. 4). The best number of clusters is chosen on the basis of 30 indices (Charrad et al., 2014), which indicate three clusters. Sites S1, S2, S3, and S4 in the upper reach form the first cluster; sites S5 to S9 and S11 constitute the second cluster; and site S10 forms a separate cluster. Site S10 (Mujher Mana) receives the effluents from the Tamla drain. Sixty percent of Durgapur town’s habitation slopes toward the Tamla drain (Mukhopadhyay & Mukherjee, 2013).
Seasonal variation
COD, fluoride, potassium, sulfate, TSS, and turbidity are significantly higher in the monsoon season than in the other seasons. Fecal and total coliform counts almost double in the monsoon season. These parameters increase as a result of surface runoff from non-point sources. The conductivity level, on the other hand, decreases in the monsoon season, and the overall mineral composition of the river water improves. Hardness is significantly lower in the monsoon than in the other seasons, although the water remains in the same hardness class (slightly hard) in all three seasons (Brandt et al., 2017). Chloride, magnesium, phosphate, alkalinity, and TDS do not show any significant seasonal variation.
Seasonal factor analysis was performed to identify the most critical seasonal parameters (Mohanty & Nayak, 2017; Ouyang et al., 2006; Pejman et al., 2009). Bartlett’s test and the Kaiser–Meyer–Olkin (KMO) statistic were used to test the suitability of the data for FA. Bartlett’s test is significant (p < 0.001), and the KMO statistic exceeds 0.73 in all seasons. Components with an eigenvalue greater than 1 are retained. Seven components are selected for the pre-monsoon and post-monsoon seasons, explaining 75.99% and 72.02% of the variance, respectively, and six are selected for the monsoon season, explaining 63.55%. In every season the retained components explain more than 60% of the variance, which is considered sufficient for environmental datasets (Hair et al., 2010). A parameter with a factor loading above 0.75 is considered significant for seasonal variation. Parameters with factor loadings below 0.75 show very high uniqueness values, which means that factor analysis fails to explain them.
In the pre-monsoon season, component 1 (Table 3(a)) has high loadings on chloride, conductivity, sodium, TDS, and TFS, which suggests pollution related to ionic and salt concentration. Natural and anthropogenic sources are responsible for these constituents. Ammonia, phosphate, and TSS show high loadings on component 2, which is connected to runoff from agricultural fields and sewage effluents (Aliyu et al., 2020; Brandt et al., 2017; Pejman et al., 2009). Magnesium and hardness correlate strongly with component 3, which indicates the mineral composition of the water. Component 4 reveals the bacteriological characteristics of the water; domestic sources, agricultural fields, and animal farms are responsible for the coliform bacteria. Alkalinity correlates strongly with component 5, which represents the salt concentration of the water. Component 6 is correlated with pH. BOD has a very high loading on component 7, which denotes organic pollution caused by effluents from residential areas and industries.
In the monsoon season, component 1 (Table 3(b)) is dominated by conductivity, sodium, TDS, and TFS, which are controlled by erosion and high surface runoff. Component 2 is characterized by the mineral composition of the water (Singh et al., 2004; Vega et al., 1998). The presence of dolomite and anhydrite in the study area (Bengraı̈ne & Marhaba, 2003; Salifu et al., 2012) is responsible for the high contribution of minerals to component 2. Calcium, which controls the hardness of the water, comes naturally from rocks. Along with the natural sources, anthropogenic activities also increase the level of hardness. Component 3 represents the bacteriological character of the water. Component 6 is highly correlated with alkalinity, which comprises the sum of all salts (Brandt et al., 2017).
In the post-monsoon season (Table 3(c)), component 1 is dominated by conductivity, sodium, and TDS, which relate directly to the salt characteristics of the water. Ammonia, nitrate, phosphate, and TSS load highly on component 2. Ammonia comes from sewage from industrial and agricultural sites (Brandt et al., 2017) and from the decomposition of plant and animal matter. Increasing nitrate in water is attributed to sewage pollution (Kotti et al., 2005), agricultural runoff (Bu et al., 2010; Kotti et al., 2005), and oxidation of ammonia (Brandt et al., 2017), so an increase in ammonia may increase the amount of nitrate. Phosphate comes from multiple sources, including industries, cropland where phosphate-based inorganic fertilizers are used, and phosphate-based household detergents (González et al., 2014). Phosphate-based fertilizers are the most common and are used heavily in all seasons except the monsoon, when the smallest amounts of fertilizers and pesticides are applied. In India, paddy cultivation uses the largest share of fertilizer (about 31.8%) among all agricultural products, of which irrigated cultivation uses 22.2% (FAO, 2005). Phosphate is therefore not an essential parameter in the monsoon season. Components 3 and 4 are dominated by the mineral composition of the watershed and are controlled by natural and organic compounds of wastewater (Potasznik & Szymczyk, 2015). Component 5 indicates bacteriological pollution in the Damodar River. Components 6 and 7 are highly correlated with temperature and BOD, respectively.
Bacteriological pollution is prominent in all seasons. A very high BOD loading (> 0.9), related to organic pollution, is found in the pre-monsoon season. Municipal waste discharge (Saksena et al., 2008; Vega et al., 1998) is the potential source of this organic pollution. BOD is not an influential parameter in the monsoon season, which suggests that the increased volume of water reduces the oxidation of organic pollutants.
Discriminant analysis
Discriminant analysis (DA) is used to evaluate the temporal variation of water quality by dividing the dataset into three seasons: pre-monsoon, monsoon, and post-monsoon. Both the standard and the stepwise methods are used. The standard method discriminates the seasons, and the stepwise method extracts the variables responsible for the temporal discrimination on the basis of the Wilks’ lambda criterion (at a significance level of p < 0.05). The overall significance tests of Wilks’ lambda are presented in Table 4. The p values indicate a significant temporal classification in both standard and stepwise modes. In the standard mode, the first DA function explains 80.1% of the variance and the second explains 19.9%. The stepwise method identifies ammonia, DO, potassium, temperature, total coliform, TFS, and turbidity as the parameters responsible for the seasonal variation of Damodar River water quality (Table 5(a)). In the stepwise mode, the first DA function explains 83% of the variability and the second explains 17%, so the first stepwise function accounts for a larger share of the between-group variability than its standard counterpart. The accuracy of the models is presented in the confusion matrix: standard DA and stepwise DA predict the temporal classes with 74.4% and 71.5% accuracy, respectively (Table 6(a)).
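The accuracy figures quoted from the confusion matrix are the share of samples whose predicted season matches the observed season. A minimal Python sketch of that calculation follows; the labels and predictions in the test are invented for illustration and do not reproduce Table 6(a).

```python
def confusion_matrix(actual, predicted, labels):
    """Count samples by (actual, predicted) class; rows are actual classes."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for a, p in zip(actual, predicted):
        matrix[index[a]][index[p]] += 1
    return matrix


def overall_accuracy(matrix):
    """Percentage of samples on the diagonal, i.e. classified correctly."""
    total = sum(sum(row) for row in matrix)
    if total == 0:
        raise ValueError("confusion matrix is empty")
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return 100.0 * correct / total
```

Off-diagonal cells show which seasons are confused with each other, which is often more informative than the single accuracy percentage.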
The parameters extracted by stepwise DA are shown in box-and-whisker plots (Fig. 5). Ammonia is lowest in the monsoon season and highest in the post-monsoon season. Ammonia is associated with agricultural runoff and sewage effluents. Rainfall is lowest in the post-monsoon season (11.46 ± 5.09 mm). In the monsoon season, when average rainfall is 117.44 mm with a standard deviation of 18.23 mm, dilution reduces the amount of ammonia in the water (Varol, 2020). The DO level is highest in the post-monsoon season and lowest in the monsoon season. DO is associated with water temperature, which is more consistent in the monsoon season. An increase in temperature enhances the biological activity that consumes DO, and a decrease in temperature improves the DO condition in the post-monsoon season (Hajigholizadeh & Melesse, 2017). Municipal and industrial sewage discharges and agricultural runoff are the common sources of potassium in river water (Skowron et al., 2018). Potassium therefore increases in the monsoon season because of the non-point source (agricultural area), whereas it remains the same in the pre- and post-monsoon periods because of the constant supply of pollutants from point sources. TFS denotes the fixed (non-volatile) solids in water, which do not increase in the monsoon season as sharply as turbidity does. In the monsoon season, surface runoff carries a large quantity of solids, which increases turbidity. The coliform bacteria population increases with surface runoff from non-point sources in addition to point sources, so it is highest in the monsoon season and lowest in the post-monsoon season.
DA is also performed on the spatial variation in the cluster dataset. The significant p values of standard and stepwise DA indicate a good classification of the cluster dataset (Table 4). In standard DA, the first function explains 87.8% of the variance and the second explains 12.2%, which represents an efficient classification of the clusters. In stepwise DA, the first and second functions explain 89.1% and 10.9%, respectively. The confusion matrix shows accuracies of 77.8% and 76.9% for standard DA and stepwise DA, respectively (Table 6(b)). Stepwise DA selects ammonia, BOD, calcium, chloride, conductivity, DO, sodium, sulfate, temperature, alkalinity, TDS, hardness, TSS, and turbidity as the parameters responsible for the variation among clusters (Table 5(b)). The variation of these parameters is shown in box-and-whisker plots (Fig. 6). Ammonia, BOD, calcium, chloride, conductivity, sodium, sulfate, alkalinity, TDS, hardness, and TSS show the same pattern: the pollution level increases from cluster 1 to cluster 3. The DO level is almost the same in clusters 1 and 2 but differs in cluster 3, which has the lowest DO level and is highly affected by organic pollution. Temperature also increases from cluster 1 to cluster 3. An increase in pollution load increases the temperature of the water, as it is controlled by conductivity and other pollutants (Brandt et al., 2017). TSS and turbidity are much higher in cluster 3 than in clusters 1 and 2. Water quality is much better in the upper course than in the lower course of the Damodar River.
Water quality index
The WQI is based on ten water quality variables. Water quality is determined both seasonally and spatially (Fig. 7). Poor water quality is found at all sites and in all seasons. The first four sites have almost the same WQI value. Site S10 is the most polluted station on the Damodar River. The lowest water quality is found in the monsoon season. All stations fall into the poor category, which indicates that water quality is almost always threatened.
Conclusions
The multivariate statistical techniques show high spatial variation of pollution levels across the sites. Point and non-point sources are primarily responsible for the seasonal variation in pollution. Ionic concentration does not vary significantly by season, but it varies spatially. The river is heavily affected by bacteriological pollution, and the bacterial population almost doubles in the monsoon season. FA successfully extracted the seasonally important parameters. Seasonal variation is found mainly in the parameters related to anthropogenic sources. Stepwise DA efficiently extracts the parameters that are sensitive to seasonal and spatial change. The WQI for each station indicates that the river water is not suitable for use. Water quality is worst in the areas dominated by congested urbanization and clusters of large-scale industries; thus, site S10 is the most polluted of all the sites. Water quality deteriorates further in the monsoon season.
The present study offers several suggestions for maintaining the water quality of the Damodar River: (1) Water quality can be improved by controlling direct discharges into the river. (2) Fertilizer use should be controlled because non-point sources (such as agricultural fields) strongly influence the water quality of this river. (3) Along with regular monitoring, direct action is required to restore the water quality.
Data availability
Rainfall data were collected from the India Meteorological Department (IMD), Pune, under the Ministry of Science, Govt. of India. Water quality data were collected from the West Bengal Pollution Control Board (WBPCB) under the Central Pollution Control Board (CPCB), Govt. of India. Industrial information was taken from the district industrial profile of Paschim Bardhaman published by the Ministry of Micro, Small and Medium Enterprises (MSME), Govt. of India, and from the West Bengal Industrial Development Corporation and the Asansol Durgapur Development Authority under the Govt. of West Bengal. SRTM DEM data were downloaded from USGS (US Geological Survey) Earth Explorer to generate the river basin and channels.
References
Akkoyunlu, A., & Akiner, M. E. (2012). Pollution evaluation in streams using water quality indices: A case study from Turkey’s Sapanca Lake Basin. Ecological Indicators, 18, 501–511. https://doi.org/10.1016/j.ecolind.2011.12.018
Alberto, W. D., del Pilar, D. M., Valeria, A. M., Fabiana, P. S., Cecilia, H. A., & de Los Ángeles, B. M. (2001). Pattern recognition techniques for the evaluation of spatial and temporal variations in water quality. A case study: Suquía River basin (Córdoba–Argentina). Water Research, 35(12), 2881–2894.
Aliyu, A. G., Jamil, N. R. B., Adam, M. B. B., & Zulkeflee, Z. (2020). Spatial and seasonal changes in monitoring water quality of Savanna River system. Arabian Journal of Geosciences, 13(2), 55. https://doi.org/10.1007/s12517-019-5026-4
Bellos, D., & Sawidis, T. (2005). Chemical pollution monitoring of the River Pinios ( Thessalia — Greece ). Journal of Environmental Management, 76, 282–292. https://doi.org/10.1016/j.jenvman.2005.01.027
Bengraïne, K., & Marhaba, T. F. (2003). Using principal component analysis to monitor spatial and temporal changes in water quality. Journal of Hazardous Materials, 100(1–3), 179–195. https://doi.org/10.1016/S0304-3894(03)00104-3
Bhat, B. N., Parveen, S., & Hassan, T. (2018). Seasonal assessment of physicochemical parameters and evaluation of water quality of river Yamuna, India. Advances in Environmental Technology, 1, 41–49. https://doi.org/10.22104/aet.2018.2415.1121
Bhattacharyya, K. (2011). The Lower Damodar River. Understanding the human role in changing fluvial environment. Springer Dordrecht Heidelberg. https://doi.org/10.1007/978-94-007-0467-1
Brandt, M. J., Johnson, K. M., Elphinston, A. J., & Ratnayaka, D. D. (2017). Twort’s water supply. Elsevier. https://doi.org/10.1016/c2012-0-06331-4
Bu, H., Tan, X., Li, S., & Zhang, Q. (2010). Temporal and spatial variations of water quality in the Jinshui River of the South Qinling Mts., China. Ecotoxicology and Environmental Safety, 73(5), 907–913. https://doi.org/10.1016/j.ecoenv.2009.11.007
Çadraku, H. S. (2021). Groundwater quality assessment for irrigation: Case study in the Blinaja River basin, Kosovo. Civil Engineering Journal (Iran), 7(9), 1515–1528. https://doi.org/10.28991/cej-2021-03091740
CCME. (2017). CCME Water Quality Index user's manual 2017 Update. In Canadian Water Quality Guidelines for the Protection of Aquatic Life. https://ccme.ca/en/res/wqimanualen.pdf. Accessed 23 May 2021.
Chang, H. (2005). Spatial and temporal variations of water quality in the Han River and its tributaries, Seoul, Korea, 1993–2002. Water, Air, & Soil Pollution, 161(1–4), 267–284. https://doi.org/10.1007/s11270-005-4286-7
Charrad, M., Ghazzali, N., Boiteau, V., & Niknafs, A. (2014). NbClust: An R package for determining the relevant number of clusters in a data set. Journal of Statistical Software, 61(6), 1–36. https://www.jstatsoft.org/v061/i06. Accessed 5 June 2021.
Chatterjee, S. K., Bhattacharjee, I., & Chandra, G. (2010). Water quality assessment near an industrial site of Damodar River, India. Environmental Monitoring and Assessment, 161(1–4), 177–189. https://doi.org/10.1007/s10661-008-0736-1
CPCB. (2017). Restoration of Polluted River Stretches: Concept and Plan. 56.
DCO. (2011). District Census Handbook Barddhaman.
FAO. (2005). Fertilizer use by crop in India.
González, S. O., Almeida, C. A., Calderón, M., Mallea, M. A., & González, P. (2014). Assessment of the water self-purification capacity on a river affected by organic pollution: Application of chemometrics in spatial and temporal variations. Environmental Science and Pollution Research, 21(18), 10583–10593. https://doi.org/10.1007/s11356-014-3098-y
Gradilla-Hernández, M. S., de Anda, J., Garcia-Gonzalez, A., Meza-Rodríguez, D., Yebra Montes, C., & Perfecto-Avalos, Y. (2020). Multivariate water quality analysis of Lake Cajititlán, Mexico. Environmental Monitoring and Assessment, 192(1), 5. https://doi.org/10.1007/s10661-019-7972-4
Grzywna, A., & Bronowicka-Mielniczuk, U. (2020). Spatial and temporal variability of water quality in the Bystrzyca River Basin, Poland. Water, 12(1), 190. https://doi.org/10.3390/w12010190
Hair, J. F., Black, W. C., Babin, B. J., & Anderson, R. E. (2010). Multivariate Data Analysis. In Pearson. Pearson Education.
Hajigholizadeh, M., & Melesse, A. M. (2017). Assortment and spatiotemporal analysis of surface water quality using cluster and discriminant analyses. Catena, 151, 247–258. https://doi.org/10.1016/j.catena.2016.12.018
Helmreich, J. E. (2015). Statistics: An introduction using R (2nd Edition). Journal of Statistical Software, 67(Book Review 5), 1–353. https://doi.org/10.18637/jss.v067.b05
Kotti, M. E., Vlessidis, A. G., Thanasoulias, N. C., & Evmiridis, N. P. (2005). Assessment of river water quality in Northwestern Greece. Water Resources Management, 19(1), 77–94. https://doi.org/10.1007/s11269-005-0294-z
Kükrer, S., & Mutlu, E. (2019). Assessment of surface water quality using water quality index and multivariate statistical analyses in Saraydüzü Dam Lake, Turkey. Environmental Monitoring and Assessment, 191(2). https://doi.org/10.1007/s10661-019-7197-6
Lawson, R. G., & Jurs, P. C. (1990). New index for clustering tendency and its application to chemical problems. Journal of Chemical Information and Computer Sciences, 30(1), 36–41. https://doi.org/10.1021/ci00065a010
Li, S., Gu, S., Tan, X., & Zhang, Q. (2009). Water quality in the upper Han River basin, China: The impacts of land use/land cover in riparian buffer zone. Journal of Hazardous Materials, 165(1–3), 317–324. https://doi.org/10.1016/j.jhazmat.2008.09.123
Liu, C., Lin, K., & Kuo, Y. (2003). Application of factor analysis in the assessment of groundwater quality in a blackfoot disease area in Taiwan. Science of the Total Environment, 313(1–3), 77–89. https://doi.org/10.1016/S0048-9697(02)00683-6
Liu, X., Zhang, G., Sun, G., Wu, Y., & Chen, Y. (2019). Assessment of Lake Water quality and eutrophication risk in an agricultural irrigation area: A case study of the Chagan Lake in Northeast China. Water. https://doi.org/10.3390/w11112380
Lkr, A., & Neizo, M. R. S. (2020). Assessment of water quality status of Doyang River, Nagaland, India, using Water Quality Index. Applied Water Science, 10(1), 1–13. https://doi.org/10.1007/s13201-019-1133-3
Maechler, M., Rousseeuw, P., Struyf, A., & Hubert, M. (2005). Cluster analysis basics and extensions. In Unpublished.
Mohanty, C. R., & Nayak, S. K. (2017). Assessment of seasonal variations in water quality of Brahmani river using PCA. Advances in Environmental Research, 6(1), 53–65. https://doi.org/10.12989/aer.2017.6.1.053
Mondal, G. C., Singh, A. K., & Singh, T. B. (2018). Damodar River Basin: Storehouse of Indian Coal. 259–272.
Mukhopadhyay, S., & Mukherjee, R. (2013). Physico–chemical and microbiological quality assessment of groundwater in adjoining area of Tamla Nala, Durgapur, District: Burdwan (W. B.). International Journal of Environmental Sciences, 4(3), 360–366. https://doi.org/10.6088/ijes.2013040300012
Mutlu, E. (2019). Evaluation of spatio-temporal variations in water quality of Zerveli stream (northern Turkey) based on water quality index and multivariate statistical analyses. Environmental Monitoring and Assessment, 191(6), 335. https://doi.org/10.1007/s10661-019-7473-5
Ouyang, Y., Nkedi-Kizza, P., Wu, Q. T., Shinde, D., & Huang, C. H. (2006). Assessment of seasonal variations in surface water quality. Water Research, 40(20), 3800–3810. https://doi.org/10.1016/j.watres.2006.08.030
Pai, D. S., Sridhar, L., Rajeevan, M., Sreejith, O. P., Satbhai, N. S., & Mukhopadyay, B. (2014). Development of a new high spatial resolution (0.25° × 0.25°) Long Period (1901–2010) daily gridded rainfall data set over India and its comparison with existing data sets over the region. MAUSAM, 1(January), 1–18.
Pejman, A. H., Bidhendi, G. R. N., Karbassi, A. R., Mehrdadi, N., & Bidhendi, M. E. (2009). Evaluation of spatial and seasonal variations in surface water quality using multivariate statistical techniques. International Journal of Environmental Science & Technology, 6(3), 467–476. https://doi.org/10.1007/BF03326086
Platikanov, S., Baquero, D., González, S., Martín-Alonso, J., Paraira, M., Cortina, J. L., & Tauler, R. (2019). Chemometric analysis for river water quality assessment at the intake of drinking water treatment plants. Science of the Total Environment, 667, 552–562. https://doi.org/10.1016/j.scitotenv.2019.02.423
Potasznik, A., & Szymczyk, S. (2015). Magnesium and calcium concentrations in the surface water and bottom deposits of a river-lake. Journal of Elementology, 20(3), 677–692. https://doi.org/10.5601/jelem.2015.20.1.788
Reghunath, R., Sreedhara Murthy, T. R., & Raghavan, B. R. (2002). The utility of multivariate statistical techniques in hydrogeochemical studies: An example from Karnataka, India. Water Research, 36(10), 2437–2442. https://doi.org/10.1016/s0043-1354(01)00490-0
Saksena, D. N., Garg, R. K., & Rao, R. J. (2008). Water quality and pollution status of Chambal river in National Chambal Sanctuary, Madhya Pradesh. Journal of Environmental Biology, 29(5), 701–710. https://doi.org/10.21172/ijiet.112.07
Salifu, A., Petrusevski, B., Ghebremichael, K., Buamah, R., & Amy, G. (2012). Multivariate statistical analysis for fluoride occurrence in groundwater in the Northern region of Ghana. Journal of Contaminant Hydrology, 140–141, 34–44. https://doi.org/10.1016/j.jconhyd.2012.08.002
Salim, I., Sajjad, R. U., Paule-Mercado, M. C., Memon, S. A., Lee, B.-Y., Sukhbaatar, C., & Lee, C. (2019). Comparison of two receptor models PCA-MLR and PMF for source identification and apportionment of pollution carried by runoff from catchment and sub-watershed areas with mixed land cover in South Korea. Science of the Total Environment, 663, 764–775. https://doi.org/10.1016/j.scitotenv.2019.01.377
Sharma, D., & Kansal, A. (2011). Water quality analysis of River Yamuna using water quality index in the national capital territory, India (2000–2009). Applied Water Science. https://doi.org/10.1007/s13201-011-0011-4
Simeonov, V., Stratis, J. A., Samara, C., Zachariadis, G., Voutsa, D., Anthemidis, A., Sofoniou, M., & Kouimtzis, T. (2003). Assessment of the surface water quality in Northern Greece. Water Research, 37(17), 4119–4124. https://doi.org/10.1016/S0043-1354(03)00398-1
Singh, K. P., Malik, A., Mohan, D., & Sinha, S. (2004). Multivariate statistical techniques for the evaluation of spatial and temporal variations in water quality of Gomti River (India)—a case study. Water Research, 38(18), 3980–3992. https://doi.org/10.1016/j.watres.2004.06.011
Skowron, P., Skowrońska, M., Bronowicka-Mielniczuk, U., Filipek, T., Igras, J., Kowalczyk-Juśko, A., & Krzepiłko, A. (2018). Anthropogenic sources of potassium in surface water: The case study of the Bystrzyca river catchment, Poland. Agriculture, Ecosystems and Environment, 265(July), 454–460. https://doi.org/10.1016/j.agee.2018.07.006
Varol, M. (2020). Use of water quality index and multivariate statistical methods for the evaluation of water quality of a stream affected by multiple stressors: A case study. Environmental Pollution, 266, 115417. https://doi.org/10.1016/j.envpol.2020.115417
Varol, M., & Şen, B. (2009). Assessment of surface water quality using multivariate statistical techniques: A case study of Behrimaz Stream, Turkey. Environmental Monitoring and Assessment, 159(1–4), 543–553. https://doi.org/10.1007/s10661-008-0650-6
Vega, M., Pardo, R., Barrado, E., & Debán, L. (1998). Assessment of seasonal and polluting effects on the quality of river water by exploratory data analysis. Water Research, 32(12), 3581–3592. https://doi.org/10.1016/S0043-1354(98)00138-9
Xiaolong, W., Jingyi, H., Ligang, X., & Qi, Z. (2010). Spatial and seasonal variations of the contamination within water body of the Grand Canal, China. Environmental Pollution, 158(5), 1513–1520. https://doi.org/10.1016/j.envpol.2009.12.018
Zeinalzadeh, K., & Rezaei, E. (2017). Determining spatial and temporal changes of surface water quality using principal component analysis. Journal of Hydrology: Regional Studies, 13, 1–10. https://doi.org/10.1016/j.ejrh.2017.07.002
Zhang, H., Li, H., Yu, H., & Cheng, S. (2020). Water quality assessment and pollution source apportionment using multi-statistic and APCS-MLR modeling techniques in Min River Basin, China. Environmental Science and Pollution Research, 27(33), 41987–42000. https://doi.org/10.1007/s11356-020-10219-y
Zhong, M., Zhang, H., Sun, X., Wang, Z., Tian, W., & Huang, H. (2018). Analyzing the significant environmental factors on the spatial and temporal distribution of water quality utilizing multivariate statistical techniques: A case study in the Balihe Lake, China. Environmental Science and Pollution Research, 25(29), 29418–29432. https://doi.org/10.1007/s11356-018-2943-9
Zhou, F., Huang, G. H., Guo, H., Zhang, W., & Hao, Z. (2007). Spatio-temporal patterns and source apportionment of coastal water pollution in eastern Hong Kong. Water Research, 41(15), 3429–3439. https://doi.org/10.1016/j.watres.2007.04.022
Acknowledgements
The authors are sincerely grateful to the Department of Geography of Vidyasagar University, to the West Bengal Pollution Control Board for providing the dataset, and to the 'Fund for Improvement of S&T Infrastructure of the Department of Science and Technology (DST-FIST)' for providing the necessary support and opportunity to prepare this research work.
Funding
The authors declare that no funding was received from any agency for conducting this research.
Author information
Contributions
Souvanik Maity: conceptualization, resources, methodology, data structuring, statistical analysis, software analysis, writing original draft, and visualization. Ramkrishna Maiti: conceptualization, resources, review draft, supervision, and validation. Tarakeshwar Senapati: conceptualization, review draft, supervision, and validation.
Ethics declarations
Ethics approval
This work is original and has not been published or submitted elsewhere for publication.
Consent to participate
This paper was guided by Dr. Ramkrishna Maiti and Dr. Tarakeshwar Senapati and was written with the consent of both co-authors.
Consent for publication
The paper is submitted for publication with the consent of all authors.
Conflict of interest
The authors declare no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Maity, S., Maiti, R. & Senapati, T. Evaluation of spatio-temporal variation of water quality and source identification of conducive parameters in Damodar River, India. Environ Monit Assess 194, 308 (2022). https://doi.org/10.1007/s10661-022-09955-0
DOI: https://doi.org/10.1007/s10661-022-09955-0