Introduction

Rivers are undoubtedly the potential surface water source for human survival and serve as a socio-economic indicator for the advancement of a country. Additionally, river water essentially serves as a resource for maintaining environmental integrity and is entitled to preserve the fundamental requirements for public health safety and aquatic life that have been recognized by numerous earlier studies globally (Bhuiyan et al., 2015; Islam et al., 2020). Last few decades, several works have been done on evaluation the riverine water quality for the use on drinking, agricultural, industrial, recreational purposes and allow accelerating the monitoring programme for conservation and protection of surface water quality (Şener et al., 2017). The degradation of river water quality has been triggered by numerous anthropogenic activities, including the discharge of domestic wastewater, industrial operations, agriculture runoff, mining operations, vehicular pollution, the dumping of solid waste, etc. (Han et al., 2016; Jabbo et al., 2022; Zhai et al., 2022), as well as numerous natural processes like soil and bedrock erosion, precipitation, volcanic emission, soil–water interaction, and seasonal effects (Kazi et al., 2009; Subramani et al., 2005). Rapid urbanization, industrialization, and several human activities have been identified as the main sources of river water pollution (Kumar et al., 2019a; b). According to the UNEP (2021) report, 3 billion people worldwide are at a high risk of contracting a water-borne illness (Chorol and Gupta, 2023). As a result, assessing and monitoring river water quality based on its hydrogeochemical characteristics is crucial to determining whether it should be used for home, agricultural, or industrial purposes. It also enables planners and decision-makers to take corrective action and restore the river’s ecosystem.

Several approaches have been periodically used to assess the water physicochemical properties, including potentially toxic elements (PTEs) and the status of water quality. The water quality index (WQI) is a widely used method for assessing water quality that aggregates a number of measured water parameters into a single numerical value (Singh et al., 2018). However, solely relying on WQI may not be sufficient to understand the pollution characteristics, especially the risk to human health due to heavy metals and PTEs (Şimşek and Mutlu, 2023). PTEs concentration in riverine ecosystem is now one of the most serious threats due to their higher accumulation, recalcitrance, biomagnification, and toxicity (Ali and Khan, 2019; Sahoo and Sahu, 2022). Among them, a few trace elements (like as Fe, Cu, Co, and Mn) are essential in smaller quantities for several physiological and metabolic activities in living organisms, while larger concentrations of these elements can cause serious health issues (Setia et al., 2020). Excessive use of river water for agricultural purposes is the most prominent route of transfer/bioaccumulation of PTEs in vegetables/crops. Subsequent transportation of such PTEs to human through food chain accelerates their accumulation in different parts of human/organism causing severe health hazards (Panda et al., 2021). As a result, many researchers also evaluate the qualitatively and quantitatively of PTE content in river water and their assimilative impacts on human health (Bhuiyan et al., 2015; Islam et al., 2020; Shan et al., 2021). Several indexing approaches such as degree of contamination (Cd), heavy metal evaluation index (HEI), and heavy metal pollution index (HPI) on PTEs concentration of surface waters explore the level of toxicity and indicate the quality of water (Sahoo and Sahu, 2022). Various tools such as multivariate analysis, viz., Pearson’s correlation analysis, principal component analysis (PCA), hierarchical cluster analysis (HCA), and geospatial interpolation might provide comprehensive information to discriminate the potential sources of PTEs in river water from surrounding point and non-point sources.

Chronic exposure to PTEs through ingestion and dermal absorption can cause cancerous and non-cancerous health effects in humans. Among other PTEs, Cr, Pb, and Cd are known as possible carcinogens and could potentially represent a concern because of their toxic, genotoxic, and cancer-causing actions (Jiang et al., 2021; Sadeghi et al., 2022). According to reports, drinking unsuitable water contributes to roughly 20% of all cancer cases worldwide (Chorol and Gupta, 2023; Balali-Mood et al., 2021). Therefore, estimating the potential health risks associated with total PTE concentrations in river water is becoming increasingly relevant (Ozoko et al., 2022).

Subarnarekha is a rain-fed river in eastern India that provides water to millions of people in Jharkhand, West Bengal, and Odisha. It originates at Nagri village in Ranchi district, Jharkhand, and flows past important industrial centres like Jamshedpur, Chaibasa, and Bhadrak before joining the Bay of Bengal at Kirtania port in Odisha. The river’s total length from its sources to its outlet in the Bay of Bengal is 446.12 kms, of which 79 kms are in Odisha. It is the smallest of India’s 14 major river basins. The rain-fed river covers a drainage area of 18,951 km2, lies between 21° 33 ′N to 23 °32 ′N and 85 °09 ′E to 87 °27 ′E, and comprises 0.4% of total surface water resources in the country (Giri and Singh, 2015). This river is the lifeline for many tribal communities, residing on the riverbanks, following agricultural practises, and fishing for their livelihood. Contamination of the river water with respect to PTEs can be a major environmental concern, as their entry to human body can pose a serious health hazard.

Thus, the primary goals of current investigation were to assess the following:

  • Evaluation of water quality and distribution of PTEs in Subarnarekha River on a spatio-temporal scale.

  • To classify the contamination level and identify the sources and factors that influence the PTEs in Subarnarekha River water.

  • Carcinogenic and non-carcinogenic risk evaluation of PTEs through diverse exposure paths using health risk assessment tool.

  • Monte Carlo simulation approach to validate and reduce uncertainty in health risk analysis

Materials and methods

Sampling, analysis and quality assurance

For this research study, field samples were collected at different sampling stations along the lower stretches of the Subarnarekha River in the State of Odisha, covering about 79 km, starting from Balasore district to its termination to Bay of Bengal. To evaluate the river water quality, 17 sampling sites were strategically selected considering the accessibility of the river in view of land use patterns and landform variability, such as village areas, agricultural areas, fish depots, jetty and port areas (Fig. 1). The river water samples were collected during the period of 2021–2022, covering two distinct seasons, i.e. pre-monsoon and post-monsoon, respectively. The representative samples were collected in sterilized and washed 1 L PVC bottles up to its mouth to prevent entrapment of any air in the sample bottles. The representative water samples were collected at least 0.5 m below the surface level of the water to avoid atmospheric influences. Thereafter, the samples were kept in an ice box under 4 °C and shifted to the laboratory for further physicochemical analysis.

Fig. 1
figure 1

Study area with sampling sites of Subarnarekha River

In this study, a total of 14 physicochemical parameters were analysed, and all analyses were performed in triplicate. Among the parameters, pH, temperature, electrical conductivity (EC), total dissolved solids (TDS), and dissolved oxygen (DO) were immediately analysed in situ by using portable hand analyser (Systronics 371). All the sample bottles were kept in an ice box under 4 °C and shifted to the laboratory immediately. Remaining physicochemical parameters such as total hardness (TH), total suspended solids (TSS), total alkalinity (TA), biological oxygen demand (BOD), chemical oxygen demand (COD), sodium (Na+), calcium (Ca2+), potassium (K+), chloride (Cl), sulphate (SO42−), nitrate (NO3), and phosphate (PO43−) were analysed as per APHA (2012) guidelines. Atomic absorption spectroscopy (AAS) was used to measure six metals: iron (Fe), manganese (Mn), chromium (Cr), copper (Cu), lead (Pb), and zinc (Zn). Throughout the analysis, the analytical protocols of the APHA (2012) Standard Methods for the Examinations of Water and Wastewaters were followed.

The geological composition of the studied basin area primarily consists of recent alluvium deposits. The upper part of the river basin possesses substantial mineral reserves, which has prompted the development of mineral-based industries along the river stretch (Giri and Singh, 2015). Climatically, the region experiences a maximum temperature of approximately 31.5 °C, while the minimum temperature recorded is around 20.5 °C. The overall annual rainfall in this area amounts to 1383.3 mm. To achieve the land use and landcover classification at the 2 km buffer zone on both sides of the river, a supervised classification approach using the maximum likelihood method (ArcMap 10.8) with a Kappa coefficient of 86% was employed. This process helped to categorize and identify different land use and landcover classes within the designated study area.

Quality assurance

Special attention has been made to avoid contamination and to increase the data’s confidence in terms of bias and variability. All glassware and apparatus were sterilized with 10% HCl and rinsed with Millipore ultra-pure water multiple times. Millipore ultra-pure water is used to prepare reagents and stock solutions. Merck-GR grade chemicals were used to prepare the reagent solutions. To achieve reliable outcomes, all instruments were carefully calibrated, and bank samples for every single metal parameter were prepared from their stock solutions. The reference materials (RM 1643E) supplied from the National Institute of Standards and Technology were meticulously followed. For each examined metal, the margin of error was < 5%.

Indexing approach

Water quality index (WQI)

The WQI reflects the combined influences of different physicochemical parameters and assesses the geogenic and anthropogenic influences on several key parameters that eventually effects on drinking water quality (Gupta et al., 2016; Kumar et al., 2019a, b). Hence, to calculate the WQI in this study, 11 physicochemical water parameter such as pH, EC, TDS, TA, TH, DO, Cl, SO42−, NO3, and PO43− have been considered for both pre- and post-monsoon season, and their prescribed limits recommended by WHO (2006) and BIS (2012) are mentioned in Table 1. The definite weightage (Wa) for each parameter is assigned between 1 and 5 according to its relative influence on the overall water quality (Table 1). In step 1, the relative weight (Wr) is calculated from the following equation.

$$W_{{\text{r}}} = {\raise0.7ex\hbox{${W_{{{\text{a}}i}} }$} \!\mathord{\left/ {\vphantom {{W_{{{\text{a}}i}} } {\mathop \sum \nolimits_{i = 1}^{n} W_{{{\text{a}}i}} }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\mathop \sum \nolimits_{i = 1}^{n} W_{{{\text{a}}i}} }$}}$$

where Wr and Wa represent the relative weightage and the assigned weightage of each physicochemical parameter where n denotes the number of parameters.

Table 1 Relative weight of physicochemical parameters of Subarnarekha River

In step 2, a quality rating scale (Qi) for each parameter is formulated as:

$$Q_{i} = \left[ {{\raise0.7ex\hbox{${\left( {C_{i} - V_{i} } \right)}$} \!\mathord{\left/ {\vphantom {{\left( {C_{i} - V_{i} } \right)} {\left( {S_{i} - V_{i} } \right)}}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\left( {S_{i} - V_{i} } \right)}$}}} \right] \times 100$$

where Qi is the quality rating; Ci the concentration of each chemical parameter; Vi is the ideal value of each parameter which is taken 0 for all except pH (7) and DO (14.6); Si is the standard permissible limit prescribed by WHO (2006) and BIS (2012) for each parameter.

Finally in step 3, computing the WQI, the sub-indices (SI) are the firstly determined for each chemical parameter, which is then now used to determine the overall WQI as per the following equation:

$${\text{SI}}_{i} = W_{{\text{r}}} * Q_{i}$$
$${\text{WQI}} = \sum {\text{SI}}_{i}$$

The computed WQI was categorized owing the suggested categorization of water quality (Singh et al., 2018) as: excellent (WQI < 25), good (WQI: 25–50), moderate (WQI: 51–75), poor (WQI: 76–100), and very poor (WQI > 100).

Evaluation of PTEs contamination in river water using pollution indices

Contamination index (C d)

The degree of contamination (Cd) is extensively used to evaluate the drinking water quality and to detect the composite influences of various quality parameters (Panda et al., 2021; Singh et al., 2017). In this study, the composite effects of PTEs are summarized with a single numerical value, suggesting the water quality for the drinking water purposes. The Cd is firstly established by Backman et al. (1997) and is formulated by the below equations:

$$C_{{\text{d}}} = \mathop \sum \limits_{i = 1}^{n} C_{{{\text{f}}i}}$$
(1)

where

$$C_{{{\text{f}}i}} = \frac{{C_{{{\text{A}}i}} }}{{C_{{{\text{N}}i}} }} - 1$$
(2)

where Cfi, CNi, and CAi define the contamination factor, the upper permissible value and the analytical value of ith component, respectively. N and CNi signify the “normative value” and standard permissible concentration (SPC), respectively.

Heavy metal evaluation index (HEI)

The heavy metal evaluation index (HEI) also delivers an overall quality status for the drinking purposes with respect to PTEs concentration like HPI (Edet & Offiong, 2002; Singh et al., 2017), and the HEI is formulated by the following equation:

$${\text{HEI}} = \mathop \sum \limits_{i = 1}^{n} {\raise0.7ex\hbox{${M_{i} }$} \!\mathord{\left/ {\vphantom {{M_{i} } {S_{i} }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${S_{i} }$}}$$
(3)

where Mi defines the examined value of PTEs and Si represent the maximum permissible concentration of the ith component as per standards guideline of BIS (2012).

Heavy metal pollution index (HPI)

The heavy metal pollution index (HPI) is a well-known tool to estimate overall quality of with respect to PTEs concentration (Panda et al., 2021, Ahamad et al., 2020, Kumar et al., 2019a, b). The HPI was initially established by Mohan et al. (1996) and suggested weighted arithmetic mean methods via two step indexing approaches to estimate it. In first step, establish a rating scale with assigned weightage for the studied PTEs (i.e. Pb, Cu, Ni, Cd, Fe, and Cr). Then, the pollution parameters are chosen to calculate pollution index of the study area. The estimated HPI value represented the composite effect of each PTE concentration on the overall quality where the assigned weight on studied PTEs lies in between 0 and 1. The HPI is computed by using the following equations:

$${\text{HPI}} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} W_{i} *Q_{i} }}{{\mathop \sum \nolimits_{i = 1}^{n} W_{i} }}$$
(4)

where Wi and n represent the unit weightage and the number of selected PTEs. Qi denotes the sub-index of the ith component and is formulated by

$$Q_{i} = \mathop \sum \limits_{i = 1}^{n} \frac{{\left\{ {M_{i} \left( - \right)I_{i} } \right\}}}{{(S_{i} - I_{i} )}}*100$$
(5)

where Mi, Si, and Ii denote the monitored value of PTE, the standard value, and the ideal value of the ith component, respectively. The sign (−) defines the numerical divergence among two values, disregarding the algebraic sign. The critical pollution index value of HPI is set at 100 for drinking water.

Statistical analysis

Principal component analysis (PCA) and hierarchical cluster analysis (HCA) were accomplished in SPSS (SPSS V.20.0) software to recognize the likely primary sources of PTEs and physicochemical factors in the Subarnarekha River water samples. In PCA, the “varimax with Kaiser normalization” rotation method and the “principal component” extraction method were applied based on eigenvalues larger than 1. Ward’s approach was used to compute the HCA using squared Euclidian distance.

GIS interpolation

Spatial interpolation of HPI and WQI indices were made by using IDW (inverse distance weighting) interpolation technique in ArcMap 10.8 platform. This method facilitated the creation of spatially continuous maps representing the HPI and WQI values across the study area, based on the sampled data.

Human health risk assessment

Non-carcinogenic risk

According to the USEPA (1989), the health risk assessment (HRA) is a classic tool for assessing the possible effects of harmful chemicals over a given time period. According to Gope et al., (2017, 2020), Yadav et al. (2019) and Rahman et al. (2020), the HRA index has been used to estimate probabilistic hazard in various environmental medium for different inorganic and organic pollutants. The hazard quotient (HQ) was calculated using the non-carcinogenic risk to human health when water is ingested (HQing) and in contact with the skin (HQderm). When HQ is less than 1, there are no harmful consequences; however, if HQ is equal to or more than 1, non-cancerous risk or harmful impacts on human health are a matter of concern (Wang et al., 2017). The hazard index (HI) value is calculated by adding the HQs from each exposure routes for individual PTE. According to Panda et al. (2021), HI > 1 generally denotes a high hazard effect, while HI < 1 symbolizes a low risk. Here, hazard index is calculated for Cd, Cu, Cr, Fe, Pb, and Zn to determine the non-carcinogenic health risk. According to USEPA (2004), the following equations were used to determine chronic/chemical daily intake from ingestion (CDIing) and chronic/chemical daily intake from dermal contact (CDIderm):

$${\text{CDI}}_{{{\text{Ing}}}} = \frac{{C_{m} \times {\text{IR}} \times {\text{EF}} \times {\text{ED}}}}{{{\text{BW}} \times {\text{AT}}}}$$
(6)
$${\text{CDI}}_{{{\text{Derm}}}} = \frac{{C_{m} \times {\text{SA}} \times K_{{\text{P}}} \times {\text{ET}} \times {\text{EF}} \times {\text{ED}} \times {\text{CF}}}}{{{\text{BW}} \times {\text{AT}}}}$$
(7)

where Cm (µg/L) is the concentration of each PTE, IR represents ingestion rate, EF is exposure frequency, ED denotes exposure duration, BW is body weight (BW), AT stands for average time, SA denotes the skin area, ET represents the exposure time, and Kp is the dermal permeability coefficient of the PTEs in water. The values of all the parameters are mentioned in Supplementary Table 1a and b. The unit conversion factor (CF) has value 1 × 10–3 for water (Panda et al., 2021).

By dividing the chronic daily intake (CDI) values of various PTEs by the corresponding chronic reference doses (RfD) (µg/L/kg/day), the hazard quotient (HQ) for non-carcinogenic risk of each PTE was calculated. HI was calculated by summing HQing and HQderm of each PTE (Wang et al., 2017).

$${\text{HQ}}_{{{\text{Ing}}}} = \frac{{{\text{CDI}}_{{{\text{Ing}}}} }}{{{\text{R}}_{{\text{f}}} {\text{D}}_{{{\text{Ing}}}} }}$$
(8)
$${\text{HQ}}_{{{\text{Derm}}}} = \frac{{{\text{CDI}}_{{{\text{Derm}}}} }}{{{\text{R}}_{{\text{f}}} {\text{D}}_{{{\text{Derm}}}} }}$$
(9)
$${\text{HI}} = \sum {\text{HQs}}$$
(10)

Carcinogenic risk (CR)

The incremental lifetime cancer risk (ILCR) model (USEPA, 2004) is used to assess the carcinogenic risk in the present investigation. The ILCR is calculated by multiplying the cancer slope factor (CSF) with chronic daily intake (CDI), as shown in Eq. (11).

$${\text{ILCR}} = {\text{CDI}} \times {\text{CSF}}$$
(11)

In this present study, the carcinogenic toxicity of Pb, Cd, and Cr has been assessed as these substances are thought to be potential carcinogens. The CSF values of Cd, Pb, and Cr (µg//kg/day) for ingestion pathway is mentioned in Supplementary Table 1b. If a single carcinogenic PTE has a CR value > 1 × 10–4, which is considered as intolerable, whereas a CR value < 1 × 10–6 indicates no significant health effects (Panda et al., 2021).

Uncertainty analysis by Monte Carlo simulation (MCS)

To reduce the ambiguity and investigate the possible risks of pollutants in food, air, water, and other environmental components, Monte Carlo simulation (MCS) was performed (Chorol and Gupta, 2023). The probabilistic approach is based on MCS, whereas the deterministic approach is based on point estimations (Kilavi et al., 2021). A statistical tool called MCS measures the inconsistency and uncertainty resulting from the input parameters in human health risk assessment. The oracle crystal ball (version 11.1.2.3.000) software was used in the present study to calculate the uncertainty in the carcinogenic risk analysis data using 1,00,000 iterations with the help of Excel programme. Plotting histograms of the probability distribution of carcinogenic risks and analysing their 95th percentile values helped to diminish the uncertainty in carcinogenic risk estimation. All the parameters used in MCS is illustrated in Supplementary Table 2.

Results and discussions

Characterization and spatio-temporal variations in measured river water parameters

The descriptive statistical summary of physicochemical parameters and PTEs concentration from 17 locations of lower stretches of Subarnarekha River for pre- and post-monsoon seasons is presented in Table 2. Again, the water quality parameter is compared with Indian standards for drinking water quality. The pH is a lead parameter which may influence for precipitation, co-precipitation, mobilization, and sorption of other significant cationic and anionic of water quality parameters in riverine ecosystem. In our study, the pH is ranging 6.25–7.25 and 7.30–7.9 during pre- and post-monsoon, respectively, suggesting the river water in slightly acidic to normal in range. The EC value of river water ranges from 480 to 804 µS/cm in pre-monsoon and 392–514 µS/cm (mean 452.65 µS/cm) in post-monsoon that depicts the clear seasonal variation of ionic load derived by anthropogenic inputs and industrial influx (Gupta et al., 2016). Commonly, the TDS value indicates the sum of cationic and anionic contents in water. TDS ranges 226–367 mg/L and 356–428 mg/L during pre- and pot-monsoon seasons, respectively. Comparatively, the higher TSS and TDS value in post-monsoon suggests the higher influx agricultural runoff, untreated sewerage, and industrial effluents dumping (Suthar et al., 2010). The DO content varies among 3.7–4.4 mg/L in pre-monsoon and 4.7–5.3 mg/L in post-monsoon reflecting the higher dissolution of oxygen in river water during post-monsoon season because lower temperature favours the said phenomenon (Kumar et al., 2013). The concentration of SO42− of river water samples is varied significantly from 126.45 to 176.6 mg/L in the pre-monsoon season and varied from 103.8 to 157 mg/L in the post-monsoon season, respectively. Generally, the SO42− contents in river water indicates the level of pollution and define the anthropogenic contribution. The highest SO42− contents noted at S13 (176.6 mg/L during pre-monsoon) and S15 (157 mg/L during post-monsoon) reveal that these locations are polluted because of industrial discharges in the river stream. Although, 100% sampling locations have the Cl concentration below the recommended guideline for drinking water (BIS, 2012).

Table 2 Statistical description of physicochemical parameters and PTE concentrations of Subarnarekha River during pre- and post-monsoon and comparative standard guidelines for drinking water

Distribution and seasonal variations of PTEs concentration in Subarnarekha River water

The descriptive statistics of measured PTE concentrations in river water samples for pre- and post-monsoon season is presented in Table 2. The mean value of Pb, Cu, Ni, Cd, Fe, and Cr during pre-monsoon season is 27.82, 30.59, 41.53, 0.82, 590.71, and 67.71 µg/L, respectively, while those in post-monsoon is 19.71, 22.94, 18.82, 0.41, 331.76, and 37.47 µg/L, respectively. Analytical results clearly show that the concentration Pb in river water ranges 16–39 µg/L and 11–39 µg/L in pre- and post-monsoon season, respectively, and 100% sampling locations are exceeded the desirable standards of drinking water quality. The variation of Cu concentration is 22–41 µg/L and 10–54 µg/L during pre- and post-monsoon seasons, respectively, which is well under the desirable water quality standards (except S16 in post-monsoon).

Evaluation of water quality of Subarnarekha River using WQI

The WQI values were computed using eleven physicochemical parameters of the river Subarnarekha for pre- and post-monsoon seasons. The calculated WQI values of river water is between 68.43 and 117.30 during pre-monsoon with a mean value of 87.41 which suggests the water quality are lies among “moderate” to “very poor” category. During post-monsoon, the WQI value ranges within 54.43 to 83.04 with an average value of 68.14 that reflects the quality of river water falls in “moderate” to “poor” category. In addition, the WQI value > 100, i.e. “very poor” category was observed at S3(104.29), S16(115.10), and S17(117.30) which may be due to higher discharges of industrial effluents and municipal waste dumping.

Nonetheless, the majority of the river water samples based on WQI value also revealed that 64.70% of sampling location are under “poor” water category during pre-monsoon, while in post-monsoon, 76.47% of sampling locations lie under “moderate” water category (Fig. 2). The result suggests the river water is not suitable for domestic uses without proper treatment. The deterioration of water quality can be attributed to significant industrialization in river banks along with various anthropogenic activities such as the inflow of direct sewerage discharges from residential and commercial establishments, surface runoff from agricultural cropland, and solid waste dumpling. Therefore, kind attention from government authorities is sought to take proper management for restoration of river water quality.

Fig. 2
figure 2

WQI of Subarnarekha River during a pre-monsoon and b post-monsoon seasons

Quantification of PTEs contamination

In this study, the level of metal pollution in Subarnarekha River is evaluated by using three pollution indices such as degree of contamination (Cd), heavy metal pollution index (HPI), and heavy metal evaluation index (HEI) with the reference guideline obtained from Indian standards for drinking water (BIS, 2012). The outcomes of calculated aforementioned pollution indices and their modified classification are described in Table 3.

Table 3 Heavy metal pollution index (HPI), heavy metal evaluation index (HEI), and degree of contamination (Cd) of Subarnarekha River in pre- and post-monsoon

Degree of contamination index (C d)

The estimated values of degree of contamination reflect the combined effects of metal pollution in riverine system, while contamination factor (Cf) highlights the influence of individual metals on total contamination. The calculated Cf values of PTEs in sampled water are represented in Table 4. The Cf values of PTEs in river water are Pb (0.6–2.9), Cu (− 0.99 to − 0.97), Ni (− 0.1 to 2.55), Cd (− 1 to nil), Fe (0.31–2.22) Cr (− 0.16 to 0.84) and Pb (0.1–2.9), Cu (− 0.99 to − 0.96), Ni (− 0.45 to 0.55), Cd (− 1 to nil), Fe (− 0.25 to 0.87), and Cr (− 0.48 to 0.3) in pre- and post-monsoon seasons, respectively. According to mean Cf values, PTEs in the river water are in the sequence of Pb(1.78) > Ni(1.08) > Fe(0.97) > Cr(0.35) > Cd(− 0.73) > Cu(− 0.98) in the pre-monsoon and Pb(0.97) > Fe(0.11) > Ni(− 0.06) > Cr(− 0.25) > Cd(− 0.86) > Cu(− 0.98) during the post-monsoon season, respectively.

Table 4 Site-wise contamination factor (Cf) values of potentially toxic elements in Subarnarekha River

During the pre-monsoon season, the calculated Cd values in river water ranged from 1.52 to 7.96. Approximately, 23.5% of the sampling sites are moderately contaminated, and the remaining 76.5% have low contamination level. Whereas in the post-monsoon season, Cd values ranged from 0.1 to 4.16, and all examined sites (100%) were identified as having low contamination in terms of PTEs (Table 3).

During pre-monsoon studies, the variation of HEI values is 5.27–12.98 with a mean of 8.48 (i.e. moderately contaminated), while in post-monsoon studies, HEI values range from 3.54 to 9.19 with a mean value of 4.92 (i.e. < 8, low contamination). The investigation reveals that 52.94% of sampling sites are moderately contaminated, and 47.06% of sampling sites have a low contamination level in pre-monsoon with respect to PTEs (Table 3)., However, estimated HEI values in post-monsoon studies demonstrate that the majority of sampling sites (94.12%) have low contamination levels with respect to PTEs, with the exception of sampling site S16 (moderately contaminated).

According to the HPI assessment (Fig. 3, Table 5), the range of this index is 58.34–185 (mean value 105.04) during the pre-monsoon and 36.07–167.89 (mean value 64.15) in the post-monsoon season, respectively. However, 64.71% and 35.29% of sampling locations were moderately and highly contaminated with PTEs during the pre-monsoon. The high HPI value in river water is mainly associated with discharges of wastewater and dumping of solid waste, agricultural, and other anthropogenic activities in river basin areas. Contradictorily, the majority of sampling sites (58.82%) are low contaminated with PTEs in post-monsoon season, which reveal good similarities with HEI and Cd values, whereas 29.41% sampling locations are moderately contaminated and only 11.77% of sampling locations are highly contaminated with PTEs during the post-monsoon.  However, relatively low contamination level of PTEs in the majority of sampling locations during post-monsoon for all estimated indices (Table 3) because of the dilution effects of river water.

Fig. 3
figure 3

HPI of Subarnarekha River during a pre-monsoon and b post-monsoon seasons

Table 5 Weightage and sub-index of PTEs for HPI calculation of river Subarnarekha during pre-monsoon and post-monsoon

Source identification of pollutants using multivariate statistical analysis

Principal component analysis (PCA)

Table 6 shows the principal component analysis’s findings. It demonstrates that only five components have been extracted with eigenvalues greater than 1 for post-monsoon season, while for pre-monsoon season, only four components have been disclosed. This is based on the premise made by Chatfield and Collin (1980) that components with eigenvalues less than 1 should be removed. In order to facilitate knowledge and highlight the basic importance of the extracted components to the water quality state of the selected study area, the extracted components were subsequently rotated in accordance with the varimax rotation. The rotation’s outcome also showed that the percentages of the cumulative variances of the four extracted components in pre-monsoon season and five extracted components in post-monsoon season, account for a total 80.0% and 89.5% of the overall variance of the observed variables, respectively (Table 6). This shows that these extracted components in both seasons had successfully accounted for the variation of the observed variables.

Table 6 Principal component analysis (PCA) of water quality parameters along with PTEs for Subarnarekha River

In pre-monsoon season, the first component, i.e. PC1, accounted for 37% of total variation and showed a strong loading of Fe, Cu, COD, and NO3 and moderate loading of Cr, Cl, and TH. The significant positive ion loadings in PC1 pointed out to anthropogenic activities occurring in the basin (Gupta et al., 2016; Gyimah et al., 2021). In post-monsoon season, PC1 has accounted for 42.3% of total variation and exhibited strong loading of COD, Cl, TH, with moderate loading of Cu, Fe, pH, and NO3. Owing to these results, PC1 can be linked to anthropogenic sources such as urban waste disposal and industrial wastewater discharges. Hamed (2019) also reported industrial activities due to the strong loading of Fe.

PC2 in pre-monsoon season has accounted for 25.3% of total variation and pointed out towards strong loadings of BOD and SO4, while moderate loadings of TSS, EC, and pH. PC2 components are derived from mixed sources of organic wastes from adjacent human settlements, agricultural fields, forest dominated areas, and geogenic attributes (Gupta et al., 2016). While in post-monsoon season (accounted for 25.9%), a strong negative loading has been seen for BOD, SO42−, and TSS, whereas positive strong loading of DO along with positive moderate loading of NO3 and PO43−, TA, and pH have been observed. The positive loadings indicated organic waste disposal from domestic, agricultural fields, and fish processing industries, while negative loading pointed out towards geogenic sources.

In pre-monsoon season, the third component (PC3) accounted for 9.8% of total variance, while 8.3% in post-monsoon season. The positive loading of PO4, TDS, DO, and TA has been observed in pre-monsoon season due to poor agricultural practices, discharges from fisheries industries, and sewage treatment plants (Roy and Bickerton, 2014). In post-monsoon studies, PC3 has executed positive loadings for Pb, Ni, Cr, and Cd, which can be attributed to fertilizer and pesticide residues in adjacent agricultural fields as well as discharges from the fisheries industry (Githaiga et al., 2021).

PC4 has accounted for 7.9% and 7.4% of total variance in pre-monsoon and post-monsoon, respectively. In pre-monsoon PC4 have shown strong positive loadings for Pb, Cd, and Ni). The findings ascribe that these toxic metals could have originated from fertilizers and pesticide residues in agricultural fields and can be associated with harbour and dockyards, as well as paints used in boats (Githaiga et al., 2021). However, in post-monsoon, PC4 explained positive loading of TDS only, which can be attributed to weathering and erosion and influx of domestic wastewater and runoff water.

PC5 in post-monsoon season has accounted for 5.7% and indicated the positive loading of only EC. EC demonstrates the presence of dissolved salts and ions in river water. PC5 represents the geogenic sources and describe natural mechanisms such as geochemical weathering and mineral dissolution which are the major contributors to dissolved salts and ions in river water.

Hierarchical cluster analysis (HCA)

In order to identify clusters, hierarchical cluster analysis (HCA) was used. In this analysis, the Wards method and squared Euclidean distance method using Z-scores standardization were employed. The numerous clusters have been represented by a dendrogram. Four groups were established in accordance with the HCA results in pre-monsoon season (Fig. 4). The results showed similar outcomes as PCA. Cluster 1 comprises of COD, Fe, NO3, Cl, Cu, TH, and Cr which directed towards anthropogenic activities. The loading of parameters in each PC was similar with each cluster in pre-monsoon season. Therefore, the sources are also analogous.

Fig. 4
figure 4

Dendrogram of the analysed parameters for a pre-monsoon and b post-monsoon seasons

In post-monsoon season, the HCA have shown five clusters (Fig. 4), and the output was relatively similar to the PCA results (Table 6). The basic grouping of the parameters in cluster and PCA was identical except two or three deviations. pH and NO3 were absent in Cluster 1, while both were loaded in PC1 in post-monsoon season. TDS and EC were independently loaded in PC4 and PC5, respectively; however, their grouping was observed in Cluster 4. So, weathering of rocks and dissolution of salts were the main reasons behind this. Only BOD, SO4, and TSS, which were independently grouped in cluster 5, showed negative loading in PCA. Cluster 5 was found in a separate branch of the dendrogram, indicating that its sources were distinct from those of other parameters and were clearly geogenic.

Health risk assessment

Supplementary Table 3 (a) and (b) displays the results of the evaluation of oral and dermal pathways for the health risk assessment of contaminants (Fe, Ni, Cu, Cr, Cd, and Pb) in water samples from the Subarnarekha River. The results for the dermal absorption hazard quotients (HQderm) also indicated values < 1 for all the elements examined at all the sites in both pre- and post-monsoon seasons for both adults and children. De Carvalho et al. (2021) also reported similar findings in Joanes River, Brazil. The ingestion hazard quotients (HQing) values for both adults and children in both the seasons showed values < 1 for every element inspected except Cr and Pb. In pre-monsoon season, Site 16, 17, and 7 showed HQ values > 1 for children, while for adults, only Site 16 and 17 showed HQ values above 1 for Pb. Site 2, 3, 6, 7, 16, and 17 showed HQing values > 1 for both adults and children for Cr. This displayed that the Cr and Pb posed potential harm over the course of a lifetime exposure at those particular sites in pre-monsoon. In post-monsoon season, only site 16 has shown HQing values above 1 for Pb for adults and children both. Therefore, in post-monsoon season, Pb posed the potential threat for the lifetime exposure at site 16. In comparison with adult values, it was found that children’s HQ and HI values were higher. Children may be more prone to being harmed by these harmful contaminants, according to the health risk assessment. Njuguna et al. (2020) and De Carvalho et al. (2021) also reported identical findings. Except for few sites, the HI values in both the season of all the elements (except Cr and Pb) for both adults and children in the basin are generally below the recommended limits (< 1) (Fig. 5a, b), indicating that the surface water from these basins clearly offered no possible health concern. Eight sites (Site 2, 3, 6, 7, 8, 14, 16, and 17) and six sites (Site 2, 3, 6, 7, 16, and 17) in pre-monsoon season for adult and children, respectively, indicated possible health concern for Cr as the HI values found above 1. Site 16 and 17 and site 7, 16, and 17 for adult and children, respectively, have shown HI values > 1 indicating health concerns for Pb in pre-monsoon season. In post-monsoon season, site 6 has shown HI > 1 in case of Cr for adults only, while site 16 has revealed HI above 1 in case of Pb for both adults and children indicating the presence of non-carcinogenic health risk at these sites.

Fig. 5
figure 5figure 5

a, b Spatial distribution of the hazard index (HI) and carcinogenic risk (CR) values of PTEs for adults and children during a pre- and b post-monsoon season in the study area

According to the present study, the ILCR values indicated that children are more prone to cancer risk from Pb, Cd, and Cr compared to adults. The cancer risks for Cd and Pb were between 10–5 and 10–6 at all the sites of Subarnarekha River for both adults and children. All the sites in pre-monsoon as well as post-monsoon season have showed the chance of occurrence of carcinogenic risk from Pb in nearby future as all the values indicated > 10–6 for both child and adults. In case of Cd, Site 4, 5, 9, 10, 11, 14, 15, 16, and 17 in pre-monsoon and Site 4, 15, 16, and 17 in post-monsoon have showed possibility of cancer risk in upcoming future for children and adults both as the ILCR values indicated > 10–5. ILCR values showed above 10–4 for Cr in pre-monsoon and post-monsoon season at all the sites for both children and adults. Some sites (Site 6, 16, and 17 in post-monsoon and all the sites except Site 1 and 10 for adult and Site 1 for children in pre-monsoon season) also specified the ILCR values for Cr > 10–3. Therefore, these values clearly revealed the presence of carcinogenic risk not only in adjoining future, but in current time also. Yuan et al. (2020) also specified the worries regarding the cancer risk from Cr in river water from Jiangjin district, China.

Monte Carlo simulation for carcinogenic risk

The deterministic carcinogenic risk estimations technique’s (conventional ILCR model) for adults and child exposed to the ingestion pathways for Pb, Cr, and Cd were simulated using the MCS approach. The 5th mean and 95th percentile risk exposure in the current investigation are represented, where 5th and 95th percentile defines the low-end and the worst-case scenario. Histograms comparing the carcinogenic risk associated with exposure to Pb, Cd, and Cr in children and adults showed that the 95th percentile for children was greater than it was for adults for each investigated carcinogenic PTEs in both the season. The MCS has provided the probabilistic carcinogenic health risk by reducing the loop holes in input parameters by running the simulation for 100,000 iterations. The 95th percentile values of Cd are 3.40E-05 and 5.46E-05, Pb are 1.70E-05 and 2.71E-05, and Cr are 2.42E-05 and 3.88E-05, respectively, for adult and child in pre-monsoon season. In post-monsoon season, the 95th percentile values have reduced for each investigated elements, and the values are 1.92E-05 and 3.11E-05 (Cd), 1.27E-05 and 2.04E-05 (Pb), and 1.37E-03 and 2.18E-05 (Cr) for adult and children, respectively. The overall carcinogenic risk levels Cr were greater than tolerable limits (1E-04) in both the child and adult groups, indicating a very high chance of cancer hazards for people in the present time and future (Fig. 6a, b). The 95th percentile values of Pb and Cd indicated that the possibility of occurrence of cancer risk is in near future as the values are higher than the safe limit (1E-06). MCS also revealed that the cancer risk for Cr, Pb, and Cd was found to be lower in post-monsoon compared to pre-monsoon, which is due to the dilution factor by rain.

Fig. 6
figure 6figure 6

a, b Monte Carlo simulation approach for health risk assessment during a pre-monsoon and b post-monsoon seasons in study area

Conclusions

In the present investigation, the quality of Subarnarekha River water is slightly acidic to normal in nature. The physicochemical assessment of river water reveals that most of the parameters are well under the prescribed guidelines of BIS (2012), except TH which has significant seasonal variations. Pollution indices and statistical models such as WQI, Cd, HEI, HPI, HIs, HQs, ILCR, PCA, and HCA were used to appraise the level of PTEs contamination in river water and identify the possible contamination sources.

Pollution indices and statistical models such as WQI, Cd, HEI, HPI, HIs, HQs, ILCR, PCA, and HCA were used to appraise the level of PTEs pollution and determine the possible contamination sources. The mean abundance of PTE concentration of the Subarnarekha River water follows the order of Fe > Cr > Ni > Cu > Pb > Cd during the pre-monsoon and Fe > Cr > Cu > Pb > Ni > Cd in the post-monsoon season. Based on Cd values, 76.50% of sampling sites were low contaminated, and 23.5% of sampling sites were moderately contaminated during the pre-monsoon study. However, post-monsoon studies have shown all sampling stations (100%) fall under the low contamination level. Pre-monsoon studies of HEI reveal that the river water has a moderate (52.94% sampling sites) to low level (47.06% sampling sites) of contamination with respect to PTEs, while in post-monsoon, the majority of sampling sites (94.12% sites) have low contamination level. Moreover, the estimated HPI of PTEs has a moderate-to-high level of contamination during the pre-monsoon season and a low to high level of contamination in the post-monsoon season. The descending trend of PTEs contamination in the post-monsoon season is because of dilution effects.

Non-carcinogenic and carcinogenic health risk assessments indicated the presence of health hazards in the Subarnarekha River. In the basin, the HI values for all the elements during both seasons are typically within the advised limits (except Cr and Pb), for both adults and children. Cr and Pb posed non-cancerous health concerns to both children and adults in some monitored sites of Subarnarekha River basin. Carcinogenic risk results distinctly demonstrate the presence of a cancer risk not only just in the near future, but also in the present. Cr emerged as the most hazardous element in terms of health risk in the Subarnarekha River basin. Children are more prone to cancer and non-cancer risks compared to adults in both seasons, but post-monsoon seasons have shown less risk compared to pre-monsoon seasons owing to the dilution factor due to rain. The findings of this study emphasize the necessity for well-informed policies to stop water quality degeneration and the accumulation of PTEs in order to protect human health. For the purpose of determining the risk levels, MCS projected the probability distribution of the carcinogenic risk. This method eliminates uncertainty, data measurement constraints, and other human errors. The likelihood of a few polluted samples tends to be on the higher end, which raises the mean value and, in turn, the danger for the entire region. To ascertain the severity or greatest risk for the region, however, assessing the risk for higher concentrations is also necessary.