Introduction

Bio-indicators are taxa or functional groups which reflect the state of the environment (Lencinas et al. 2015) as environmental indicators (by either acting as early warning indicators of any environmental change to the local environment) or as ecological indicators (by response to the specific ecosystem stress) or biodiversity indicators (by indicating the levels of taxonomic diversity at a site) (McGeogh 2007). The environmental and ecological bioindicators can be divided into detectors (naturally occurring indicators which are sensitive to environmental change and so decrease with added environmental stress), exploiters (increasing in abundance in response to environmental stress) and accumulators (organisms which take up chemicals such as heavy metals and can be used to measure toxin levels) (Jenkins 1971; Spellerberg 1991). Biomonitoring programs using resident, widespread and adapted species facilitate an efficient and early intervention, and represent a fundamental contribution to ecological risk assessment and environmental management (Conti et al. 2018). Bio-indicators are useful for conservation prioritisation by means of spatial comparisons of a site value, or monitoring of ecosystem recovery or response to management decisions (Peri et al. 2016; Schwerdt et al. 2018). Among the arthropods, the spiders have cosmopolitan distribution, they occupy different ecological niches, they have capacity of exploration and adaptation to various ecological conditions, they show rapid responses to disturbances, sensitive to changes in environmental conditions, and are relatively easy to sample. For having all these features, the spiders worldwide are considered to have potentials for bio-indicators (Pearce and Venier 2006; Mader et al. 2016; Schwerdt et al. 2018).

In recent years, the spiders are successfully used as bio-indicator for air pollution (Rachwał et al. 2018; Rutkowski et al. 2018), different agricultural practices, climatic conditions, or various cultivars grown (Radkova et al. 2019), disturbances of stream ecosystems (Reyes-Maldonado et al. 2018), effect of the canopy openness gradient on biodiversity conservation (Košulič et al. 2016), forest restoration (Borchard 2014), grazing effects (Kaltsas et al. 2019), ground of the central and peripheral area of olive agroecosystem (Benhadi‐Marín et al. 2019), health of natural grasslands (Schwerdt et al. 2018), heavy metal contamination (Yang et al. 2016; Conti et al. 2018; Hansson et al. 2019), low anthropogenic intervention (Rosa et al. 2019) and management strategies in fruit orchards (Mazzia et al. 2015).

The freshwater ecosystems are facing various adversities all over the world such as flow modifications by construction of dams and reservoirs, overexploitation of biotic resources, unlimited extraction of water, invasion of exotic species, organic and inorganic pollution from agricultural, industrial and domestic sources and climate change (Dudgeon et al. 2005; Strayer and Dudgeon 2010; Vörösmarty et al. 2010; Collen et al. 2014). Some recent studies indicated that the fresh water ecosystem of the Ganga river is intimidated by overexploitation, pollution, riverbed mining, physical barriers, change of land use pattern and climate change (Siddiqui and Pandey 2019; Singh and Singh 2019; Kamboj and Kamboj 2019; Dey et al. 2019; Verma et al. 2019; Jain and Singh 2020; Paudel and Koprowski 2020; Santy et al. 2020). But, as these types of works are labour intensive, time consuming and costly, there is a great need for ecological indicators for the Ganga river to know the state of the ecosystem in a short time at low cost. However, no research has been done to find out any invertebrate taxa for bio-indicator of habitat disturbance of riparian zones of the Ganga river until now. With this background, the present work was carried out to find out significant bio-indicator spider taxa for undisturbed habitat for Gangetic riparian zones.

Materials and methods

For the study, 27 sites were chosen on the banks of the Ganga river from Bijnour in Uttar Pradesh to Batanagar in West Bengal with average distance between two successive sites about 75 km (Fig. 1). For detail description of each study sites refer Ali et al. (2019). The fieldwork was performed during summer of 2018 (May and June) and 2019 (May and June) and winter of 2018 (November and December) and 2019 (November and December). Each site was visited once in each season in each year.

Fig. 1
figure 1

Location of 27 sampling sites in the Ganga river

At each site, a 50 m (width) × 100 m (length) plot was chosen beside either left bank or right bank of the river, depending upon accessibility and within each plot 15 quadrates of 4 m × 4 m (3 across the width and 5 across the length) were employed for spider collection. Six different collection methods—pitfall trapping, vegetation beating, litter sampling, ground hand collection, aerial hand collection, and sweep netting (Coddington et al. 1996) were employed for spider collection. For pitfall trapping, at the centre of each quadrate, one plastic bottle of 10 cm diameter and 11 cm depth (Churchill and Arthur 1999) filled with preservative (69% water, 30% ethyl acetate, and 1% detergent) was placed for overnight. For the other approaches, 30 min in day time were spent collecting spiders in each quadrate for each technique. The collected spiders were preserved in 70% ethanol and identified the specimens up to at least genus level from published literatures.

At each site nine types of natural habitat disturbance namely—agricultural activities, boats, effluent discharge, garbage dump, ghats (series of steps descending into the water body), grazing, human settlement, manmade embankment and sand mining were recorded. According to Gezie et al. (2017), score of 1 for any or minimum disturbance (if disturbances were found in 0–5 quadrates), 2 for moderate (if disturbances were found in 5–10 quadrates), and 3 for high disturbance (if disturbances were found in 10–15 quadrates) were assigned. The overall disturbance of each site was calculated by adding individual values (a total of nine different factors). The overall disturbance score ranges from 9 to 22. Based on this disturbance scores, the sites were classified into five groups, namely very low disturbed sites (disturbance score up to 11), low disturbed sites (disturbance score between 12 and 14), moderately disturbed sites (disturbance score between 15 and 17), highly disturbed sites (disturbance score between 18 and 20) and very highly disturbed sites (disturbance score more than 20).

For analysis, summed species matrix (i.e. pooled across all seasons) for each site was used. Prior to analysis the habitat disturbance score and species richness (mean = 0 and SD = 1) were normalized (Miyazono and Taylor 2013; Datry et al. 2016). All the analysis was performed in R (version 4.0.0) language and environment for statistical computing (R Core Team 2020). The non-parametric analysis of similarities (ANOSIM) test (Clarke 1993) and non-parametric permutational multivariate analysis of variance (PERMANOVA) (Anderson 2001) in R package ‘vegan’ (Oksanen et al. 2019) were used to understand similarities of species composition between the groups (of sites based on disturbance score). To facilitate interpretation of the results of ANOSIM and PERMANOVA, the non-metric multidimensional scaling (NMDS) based on spider composition was performed. The redundancy analysis (RDA) in R package ‘vegan’ (Oksanen et al. 2019) was performed to investigate the relationship between the spider distribution and habitat disturbance. To assess the significance of constraints, the permutation test (with 999 permutations) using both direct model (which permutes community data) and reduced model (which permutes residuals of the community data) was performed. The automatic backward stepwise model with 999 iteration steps was used to know which disturbance affected spider distribution in RDA ordination space. This method has the benefit of assessing the joint predictive potential of variables (Chowdhury and Turin 2020) as this process starts with all variables in the model and removes the least important variables early on, leaving only the most important variables in the model (Pham et al. 2020; Chowdhury and Turin 2020). The R package ‘MuMIn’ (Barton 2020) was used to find out best disturbance combination that affected the species richness of the spider community. Then the indicator value (IndVal: Dufrêne and Legendre 1997) was calculated for each species with ‘labdsv’ package (Roberts 2019). For the final list of indicator species, only those samples that were identified up to the species level were selected.

Results

It was found that the disturbance score ranged from 9 to 21 across different sites (Fig. 2). By non-parametric analysis of similarities (ANOSIM) test it was found that there was significant difference between the spider communities in different disturbed sites groups (ANOSIM statistic R = 0.3709, P = 0.001). By non-parametric permutational multivariate analysis of variance (PERMANOVA) test it was found that there was significant difference between the spider communities in different disturbed sites groups (ADONIS, F = 2.219, R2 = 0.287, P = 0.001). With the help of NMDS plot it was found that all the sites that had the similar species composition were clustered together and NMDS plot showed that different species composition were different in the very low disturbed sites and very highly disturbed sites (Fig. 3).

Fig. 2
figure 2

Comparative account of the disturbance score of 27 sites. In figure A, the size and shade of the bubbles (representing location of sampling sites) are proportional to the disturbance score. In figure B, the red horizontal line represents the mean value and the shaded area represents values within the mean ± standard deviation (color figure online)

Fig. 3
figure 3

Non-metric multidimensional scaling (NMDS) ordination plot illustrating differences among groups of sites plotted in species space

The permutation test for redundancy analysis (RDA) under direct model (F = 1.421, P = 0.002) and reduced model (F = 1.421, P = 0.004) were found as significant. The variance of the RDA biplot (Fig. 4) of spider metrics and habitat disturbance based on the first two axes explained 55.10% of the variance. Axes 1 and 2 explained 40.94% and 14.16% of the variation in spider assemblages, respectively. This first axis was positively correlated with the entire habitat disturbances; the second axis was positively correlated with boats, manmade embankment, garbage dump, effluent discharge, ghats and human settlement but negatively correlated with grazing, agriculture and sand mining. The automatic backward stepwise model resulted in 8 steps and it was found that the human settlement and sand mining had significant (P < 0.05) affect on the spider distribution in RDA ordination space (Supplementary file 1). The study found that the agriculture, human settlement, manmade embankment and sand mining were the best disturbance combination (Table 1) that affected the species richness of the spider community.

Fig. 4
figure 4

Ordination diagram of the redundancy analysis (RDA) of the spider community. Arrows represent the direction of change of the habitat disturbance. The length of the arrow indicates the variable's importance in explaining the spider community. Angles between variables indicate the correlation between the habitat disturbances. Sites locations (black dots) and species locations (grey dots) relative to each other indicate their similarity in ordination space (color figure online)

Table 1 Effects of four disturbance types (best predictors) on the species richness of spiders

The study found that eight species can be used as indicator of very low disturbed sites and only one species as bio-indicator for low disturbed sites (Table 2). For other sites, no spider species was found as significant bio-indicator.

Table 2 The spider species with significant (P < 0.05) indicator value (IndVal) for very low disturbed sites and low disturbed sites

Discussion

The ecological guild (following Cardoso et al. 2011) of the eight species of spiders found as indicator species for very low disturbed sites includes orb web builders (three species under families Araneidae and Uloboridae), sheet web builder (one species under family Pisauridae), ground hunter (one species under family Lycosidae) and other hunter (three species under family Salticidae) whereas the indicator species for low disturbed sites was space web builder (one species under family Theridiidae).

No indicator species were found for moderately disturbed sites, highly disturbed sites and very highly disturbed sites, but we did find indicator species for very low disturbed sites and low disturbed sites, which were also present in other sites. This interesting composition of spider community indicates that the community was likely to be nested because in nestedness, small communities form proper subsets of large communities which a typical bio-geographic pattern (Ulrich and Gotelli 2007).

Geographically any river is an ever-changing entity because the rainfall, temperature, amount of forest and the nature of the land use affect the river hydrology and in many cases it is seasonal and this change is accompanied by changes in the river-dependent biota. Prolonged bio-monitoring on a long river like the Ganga is a daunting task for any scientific body. For this, researchers have to rely on the people living on the banks of the river and depend on the river for their livelihood. Such animals and plants should be identified as bio-indicators which are readily available and known to them, so that they can present any visible change in an area to the scientists. Spiders are extremely useful for this task as they are a well-known animal to human. As the habitats of the riparian zones are altered by anthropogenic activities the riparian spider communities respond strongly to urbanization and anthropogenic activities (de Jesús-Crespo and Ramírez 2011; Ramírez et al. 2011; Reyes-Maldonado et al. 2018). As anthropogenic stressors likely make disturbance-tolerant species inhabiting disturbed habitats and disturbance-sensitive species vulnerable to extinction, there is an urgent need to develop methodologies for evaluating riparian ecological health from a multiple and integrated perspective (Zina et al. 2021). Small-scale monitoring, such as protocols and bio-indicators, should be utilised when specific parts of the watershed that are substantially degraded are recognised (Zaimes and Iakovoglou 2020). A vital step in developing such monitoring programmes is selecting cost-effective bio-indicators (Valente-Neto et al. 2021).

This is an initial work on the bio-indicator spiders for the Ganga river and the work should be iterated with the help of new information because scientific community has limited but always evolving knowledge about pattern of diversity and distribution of plants and animals, especially for invertebrates, the change in conservation strategy will be taken place with the increase of information and knowledge about ecology of any species (Abellán et al. 2005).

Conclusion

Through the present work it can be concluded that spiders can be used as bio-indicator of habitat disturbance in the riparian zone of river ecosystem. But as a single taxon cannot indicate overall biodiversity or environmental conditions (Lehmitz et al. 2020), it is expected that researchers will study the bio-indicator capacity of plants and animals living in the water and on the banks of the Ganga river and present new information for effective conservation, restoration and management of this freshwater ecosystem.