Introduction

Diatoms are a large and diverse group of unicellular algae, which are distributed throughout the world in most types of aquatic systems (Potapova and Charles 2002; Smucker and Vis 2011). They are very sensitive to environmental change, and their variation is expressed spatially and temporally (Urrea and Sabater 2009). Their composition and distribution may be influenced by a number of factors which includes geological, hydrological, climatic, physicochemical, and biological factors, such as nutrient concentration (Cardinale 2011), organic contamination (Kwandrans et al. 1998), electrical conductivity (Leira and Sabater 2005; Rimet 2009), pH (Charles 1985), etc. Water pollution caused by nutrients and organic contamination and subsequent environmental impacts are common problems in rivers throughout the world. Therefore, it is necessary to develop effective tools for monitoring rivers. Hambrook Berkman and Canova (2007) used algae biomass to assess water quality in lotic and lentic ecosystems. However, the response of algae biomass to water quality is not always predictable. Changes in biomass are not related to nutrients, which are not limiting factors for algae growth (Montuelle et al. 2010). Hill and Fanta (2008) showed that periphyton growth can be co-limited by phosphorus and irradiance in streams. Stephens et al. (2012) hypothesized that the biomass will increase as nutrients are added, unless it is limited by light. Like algae biomass, the use of algal biodiversity as an indicator of river pollution gave ambiguous results (Ndiritu et al. 2006; Blanco et al. 2012).

In contrast to algae biomass and biodiversity diatom, biomonitoring using diatom indices which use sensitivity and tolerance of diatoms to environmental degradation was useful and robust for pollution assessment (Rimet 2012). Diatoms are used widely as a biological monitoring tool which has been used to assess river pollution for 60 years (Rimet 2012). At least 18 diatom-based indices have been developed and used to assess aquatic environments (Blanco and Becares 2010). The specific pollution sensitivity index (IPS, Coste 1982) and biological diatom index (IBD, Lenoir and Coste 1996) are widely used in France, and IPS which incorporated the largest number of species among all diatom indices has usually been considered as the reference index to evaluate the applicability of other new diatom indices (Descy and Coste 1991). Trophic diatom index (TDI, Kelly and Whitton 1995) was developed in Great Britain for assessing the trophic status of river.

Aquatic ecosystems are faced with many pollution problems due to urbanization and rural development. Recently, studies have focused on the effects of human disturbance on diatom communities, particularly urbanization and rural land use (Newall and Walsh 2005; Urrea and Sabater 2009). However, very little is known about river systems in Central China where the Ganhe River flows through a rural basin and on to an urban area. In this study, we investigated the distribution of benthic diatom assemblages and their relationship with environmental factors based on spatial and temporal gradients and further explored three diatom-based indices to assess the water quality in the Ganhe River. Our major goals were (1) to analyze the relationships between diatom assemblages and environmental variables; (2) to identify suitable indicator species of river pollution; (3) to investigate whether the periphyton biomass and biodiversity correlated with the nutrient level or water pollution; and (4) to validate the applicability of diatom-base indices for water quality assessment in the Ganhe River of China.

Material and methods

Study area

This research was conducted in the Ganhe River watershed, which goes through Xianning City, Hubei Province, which is located in Central China (113° 32′–114° 58′ E, 29° 02′–30° 19′ N) where the climate is subtropical. February, April, July, October, and December correspond to early spring, spring, summer, autumn, and winter, respectively. Low winter temperatures persist for almost three months. Rainfall is rare and the water transparency is the highest in winter. The annual average mean temperature and rainfall are approximately 17 °C and 1500 mm, respectively. Intense rainfall events and floods occur during the rainy season (April-July). The Ganhe River (Fig. 1) is 76.6 km long and the basin area occupies 854 km2. It is divided into two main catchment areas: five tributaries (Huangshui stream, Mingshui stream, Nanchuan stream, Bodun stream, and Longtan stream) are situated in the upstream region (S1–S15), which typically consists of hills (altitude from 88 to 211 m) and rural regions (agricultural area percentage is about 30 %), whereas the downstream region (S16–S21) is a flatland and urbanized area (altitude from 21 to 44 m), which is heavily industrialized and densely populated. Compared to the upstream region, flat terrain of the downstream region resulted in its slower water flow velocity in general. The riverbed matrix in the upstream was rocky whereas depositional habitats with rare rocks were found in the downstream.

Fig. 1
figure 1

Locations of the sampling sites on the Ganhe River basin

Sample collection, preparation, and identification

Twenty-one sampling sites were selected for this study (Fig. 1). Periphyton samples were collected in triplicate from each site during February, April, July, October, and December 2013. To reduce the user error associated with potentially dissimilar estimates made by different people, because their subjectivity may affect their visual estimation of habitats (Smucker and Vis 2010), 3–5 pieces of rocks per sample were collected haphazardly along a zigzag pattern from each transect at the right, left, and center of the transect (Fore and Grafe 2002), thereby yielding a total of 9–15 pieces of rocks. We collected samples from the bottom sediment in these sites (S19 in Feb, S21 in Dec) because rocks were absent. Triplicate sediment samples were collected from areas of low flow by inverted petri dish along the riverbed to collect the surface material. To minimize the effects of light availability on benthic diatom (Hill et al. 1995), the selected sites had a canopy cover of 0–1.0 % and are saturated by light.

Each rock was fitted with a plastic O-ring. The area within the O-ring was scrubbed using a firm toothbrush and rinsed with stream water from near each rock. Small cores (surface area = 8 cm2) were taken from the surface of sediment samples to collect algae using modified 60 mL plastic syringes. Each of the three suspensions of epilithon and the three suspensions of epipelon from each sampling site was collected in a plastic bottle and transported to the laboratory inside a darkened ice box.

During sampling, the water temperature, pH (portable pH meter, PHB-4, China), dissolved oxygen (portable dissolved oxygen meter, JPB-607, China), and conductivity (portable conductivity meter, DDB-303A, China) were measured in the field, and water samples were collected from each location to measure the total nitrogen (TN), total phosphorus (TP), and chemical oxygen demand (COD) in the laboratory, according to APHA (1998).

Each suspension was divided into three portions in the laboratory: one was fixed in 4 % formaldehyde for diatom identification, the second was extracted in 90 % acetone for chlorophyll a analysis according to standard procedures (APHA 1998), while the third was processed to determine the ash-free dry mass based on APHA methods (1998).

One milliliter of captured diatoms were cleaned with a mixture of concentrated sulphuric and nitric acids and 40 μL of the cleaned diatom valves were embedded in synthetic resin as described previously by Poulı’cˇkova’ and Mann (2006). Diatoms were identified and enumerated using microscope fitted with a 100× oil immersion objective (numerical aperture, 1.25). A minimum of 500 valves per sample were counted unless the diatoms were sparse, and diatom identification was carried out according to Prygiel and Coste (2000), Krammer and Lange-Bertalot (1991–1997), Krammer (2002), and Zhu and Chen (2000). Samples with less than 200 valve counts were removed from the analysis. Thus, diatom count information of 12 samples (S6, S12 in Feb, S7, S10 in Apr, S2, S4, S10, S13, S19 in Jul, and S4, S14, S15 in Oct) was not included in the results.

Data analysis

To determine the relationship between the diatom assemblages and the environment gradient, a canonical correspondence analysis (CCA) was performed using CANOCO version 4.5 (Ter Braak and Smilauer 2002). Monte Carlo permutation tests (999 permutations, P < 0.01) were performed on each CCA. We selected the dominant species for the analysis if their relative abundance was >1.0 % in at least two sites. The environmental variables (except pH) and species data were log10 (x + 1) transformed to minimize the effects of extreme values as necessary.

Pearson’s correlation analysis was used to test the relationships between diatom-based indices and environmental factors. Sites were compared using a repeated measure analysis of variance (ANOVA) model to account for differences in the spatial and temporal gradients. A log transformation was applied to the diatom indices when necessary to ensure normality and the homogeneity of variances. All of the statistical analyses were performed using SPSS v.17.0 (SPSS Inc. 2008). The relationship between the concentration of chlorophyll a and total phosphorus was assessed using a simple linear regression. Pielou evenness (Pielou 1966), species richness (Sample contains the total number of species), and Shannon diversity (Magurran 2004) were calculated based on species composition. Diatom-based indices, i.e., the trophic diatom index (TDI: Kelly and Whitton 1995), the specific polluosensitivity index (IPS: Coste 1982), and the biological diatom index (IBD: Lenoir and Coste 1996), were calculated using OMNIDIA 5.3 (Lecointe et al. 1993). To facilitate comparisons, they were all transformed so they varied from 0 to 20.

Results

Aquatic environmental factors

The annual mean values for the aquatic environmental factors are shown in Table 1. The sampling sites were in the Ganhe River along seasonal and rural/urbanized gradients. The annual mean values of the water temperature (WT), electrical conductivity (EC), COD, TP, and TN at the urbanized sites (S16 to S21) were higher than those at the rural sites (S1 to S15), whereas the water pH and dissolved oxygen (DO) were lower at the urbanized sites. In general, the water was slightly alkaline and the pH varied from 7.3 to 9.4. The water temperature, electrical conductivity, and dissolved oxygen varied from 8.7 to 25.7 °C, 154 to 467 μs/cm, and from 6.5 to 19.8 mg/L, respectively. The mean EC and DO were the lowest in summer, whereas the COD and TP were the lowest in winter, indicating seasonal variation. The concentration of AFDM (0.75–88.79 g/m2) was lowest in summer, and it ranged from 0.75 to 12.28 g/m2 (median = 5.24; 25th, 75th percentile = 3.69, 7.49). The annual TP content ranged from 0.002 to 0.253 mg/L and was lower in winter compared with other seasons. The concentration of TN (0.147–5.104 mg/L) was higher in spring (median = 1.171; 25th, 75th percentile = 0.874, 1.613) and the lowest during October (median = 0.604; 25th, 75th percentile = 0.491, 0.822).

Table 1 Annual averages of environmental factors in the Ganhe River (min-max values)

Diatom assemblages in the Ganhe River basin

A total of 487 diatom taxa (including variations) from 36 genera were identified in Ganhe River. The diatom assemblage distribution and composition differed between months (seasons) in rural/urbanized regions (Fig. 2). We divided sampling time into the following: early spring (February), spring (April), summer (July), autumn (October), and winter (December). On the spatial scale, the species such as Achnanthidium minutissimum, Cocconeis placentula, and Cymbella affinis were persistently dominant species at rural sites (S1–S15). However, Eolimna minima (Grunow) Lange-Bertalot, species such as from the genera Navicula and Nitzschia predominated at urbanized sites (S16–S21). The most common species was C. placentula, which was present at all sites in each season. C. affinis was also a common taxon from S1–S15, and it was dominant at S1, S3, and S13. There were clear differences between rural and urban areas, i.e., A. minutissimum was the most abundant species at many rural sites whereas it was rare or absent at urbanized sites throughout the year. Its maximum relative abundance reached 85.5 % at S8 during early spring. On a temporal scale, there were few changes in the dominant species at rural sites. In winter, C. placentula had the highest mean relative abundances whereas A. minutissimum had the lowest mean value. However, the major seasonal variations in diatom assemblages were most apparent at urban sites (S16–S21), i.e., Navicula trivialis and Nitzschia palea were dominant during the early spring decreasing shortly after; Luticola goeppertiana (Bleisch in Rabenhorst) D. G. Mann, Gomphonema olivaceum, and Gomphonema parvulum increased in spring. Eolimna minima was the predominant species in most seasons (except spring). Nitzschia frustulum and G. parvulum were the second most abundant species at these sites in autumn and winter, respectively.

Fig. 2
figure 2

The mean relative abundance of the predominant species in rural/urbanized regions. The relative abundance of the predominant species was >5 % at two sites or >10 % at one site. The species codes and their corresponding full names are given in the Appendix

Environmental factors and diatom assemblages

The predominant species (relative abundance >5 % at two sites or >10 % at one site) were used in the CCA analysis (Fig. 3). During early spring, the first two CCA axes combined explained 78.8 % of the species data, and the species-environmental correlations were 0.986 and 0.914, respectively. The first CCA axis was positively correlated with TP (r = 0.98), TN (r = 0.96), and EC (r = 0.82), while the second axis was correlated with pH (r = 0.50) and DO (r = 0.39). Similarly, the first CCA axis was strongly correlated with TP (r = 0.93, 0.77, 0.84, 0.93), TN (r = 0.94, 0.39, 0.69, 0.89), and EC (r = 0.87, 0.08, 0.79, 0.73) in all seasons. pH (r = 0.60, 0.71) and DO (r = 0.45, 0.64) were correlated with the second axis in spring and winter, but they were usually negatively correlated with the first axis in summer and October. Water temperature was positively correlated with the first axis in all CCA analyses. In general, two main gradients appeared in all CCAs: the first axis represented a nutrient or urbanization gradient and the second axis represented an alkalinity gradient. E. minima and Navicula, Nitzschia, and Gomphonema species were often abundant at sites where the urbanized areas had higher nutrient levels. On the other gradient, Achnanthidium, Coccoenis, and Cymbella were associated with sites in the rural basin.

Fig. 3
figure 3

The relationships between benthic diatom communities and environmental factors in the CCA ordination diagrams. The codes of the diatoms and their corresponding full names are given in the Appendix

Richness, evenness, and diversity

The benthic diatom species richness and Pielou evenness ranged from 18 to 83 and from 0.24 to 0.8, respectively. Shannon diversity index (log base = e) varied from 0.75 to 3.37 (Fig. 4). The maximum values were recorded at site S16. The results indicated a noticeable difference in the spatial gradient (P < 0.01), but there was no significant difference in the temporal pattern (P = 0.436). The biodiversity increased in the urbanized areas with higher nutrient levels (TP and TN).

Fig. 4
figure 4

The biodiversity values at all sampling sites in each season

Diatom-based indices and water quality assessment

The IBD, IPS, and TDI indices were clearly different in the urbanized sites and the rural sites, and they decreased from 19.9 to 4.3 (median = 16.3), 18.7 to 3.3 (median = 16.6), and from 18.8 to 4.8 (median = 14.3), respectively (Fig. 5). These diatom-based indices exhibited a similar pattern of seasonal variation. They had significant correlations with some environmental factors (Table 2). The IBD, IPS, and TDI values decreased with higher electrical conductivity (r = −0.612, −0.660, and −0.572, respectively; P < 0.001), total phosphorus (r = −0.831, −0.826, and −0.635, respectively; P < 0.001), and total nitrogen (r = −0.720, −0.756 and, −0.651, respectively; P < 0.001), whereas they increased with greater pH (r = 0.302, 0.340, and 0.270, respectively; P < 0.05) and dissolved oxygen (r = 0.218, 0.259 and 0.284, respectively; P < 0.05). The mean IBD values were the lowest (mean = 14.0) during early spring and the highest (mean = 15.9) in winter. By contrast, the mean TDI index maximum value occurred in early spring (mean = 14.7) whereas its minimum (mean = 11.0) was in winter. The IBD (P = 0.612) and IPS (P = 0.985) indices were not significantly different among seasons, whereas the TDI index differed significantly (P < 0.05) according to the ANOVA.

Fig. 5
figure 5

The diatom-based indices (TDI, IPS, and IBD) at all sampling sites in each season

Table 2 Pearson correlation coefficients between average environmental factors and diatom indices

Discussion

Factors that affect the composition and distribution of benthic diatoms are complex which includes geological, hydrological, climatic, physicochemical, and biological factors. The upstream region (S1–S15) of the Ganhe basin typically consists of hills with high altitude (altitude from 88 to 211 m) which results in faster drop and higher flow velocity, whereas the downstream region (S16–S21) of the Ganhe basin is a flatland (altitude from 21 to 44 m) which results in its lower current velocity in general. Eutrophication of water bodies must have three conditions, including nutrients, slow current velocity, and suitable temperature, illumination conditions (Lei et al. 2010). Therefore, the downstream of the river eutrophication are the comprehensive results of multiple factors, and nutrition input is one of the important conditions. The specie Achnanthidium, which dominated in rural upstream areas with higher flow velocity, characterized the good water quality (Kwandrans et al. 1998; Fore and Grafe 2002). By contrast, the species (e.g., N. palea, E. minima) that dominated in urbanized downstream with lower flow velocity areas were indicators of bad water quality (Duong et al. 2006; Bere and Tundisi 2011; Raven 1998).

All of the CCAs showed that the TP, TN, and EC arrows pointed in the same direction along the first ordination axis. The conductivity mainly indicated the concentrations of ions, which is consistent with nutrient enrichment (Leira and Sabater 2005). This suggests that the first axis corresponded to a nutrient gradient, particularly TP, which was the principal nutrient gradient. The best indicator species for high nutrient levels or pollution were as follows: N. palea, N. trivialis, and L goeppertiana in spring; E. minima and Gomphonema angustum in summer; E. minima and N. frustulum in autumn; and E. minima in winter. These results confirmed that N. palea is cosmopolitan and a pollution-tolerant species, and it was dominant in polluted rivers in Spain (Urrea and Sabater 2009), Kenya (Ndiritu et al. 2006), Vietnam (Duong et al. 2006), and Brazil (Bere and Tundisi 2011). E. minima, a small cell-sized diatom, can adapt to different conditions (Raven 1998), and it was the dominant taxon at polluted sites after the summer due to harsh conditions such as flooding and excess light. Another major gradient emerged in the CCAs where the second axis represented the pH and DO gradients. The pH has a major effect on diatom assemblage composition (Verb and Vis 2000; Smucker and Vis 2011). C. placentula and C. affinis are alkaliphilous taxa and they prefer alkaline water (Urrea and Sabater 2009), which may explain this effect. The pH is regulated by alkaline metals, and it may be related to local agricultural activities. Some studies have shown that agriculturally impacted rivers have a high pH due to animal waste, liming, erosion, and fertilizers (Johnson et al. 1997; Carpenter and Waite 2000). However, this gradient was disrupted by anthropogenic nutrient stressors when the stream flowed into the urban area.

Algal growth and biomass is widely used to evaluate the trophic status of aquatic ecosystems. A high algae biomass can indicate eutrophication, but it can also accumulate in less productive habitats after long periods of stable flow (Stevenson and Bahls 1999). We found that total phosphorus (TP) explained only 5.07 % of the variability and total nitrogen (TN) only 4.62 % of the variability in benthic algae Chl a (Fig. 6). High or nuisance levels (Chl a >10 μg/cm2, Biggs 1996) of periphyton biomass were also present at sites with low nutrient levels in the Ganhe River. Nutrients, particularly phosphate, have important effects on algal growth (Pan et al. 1999; Camargo et al. 2005), but they are unlikely to be a factor that limits algal growth (Kelly and Wilson 2004; Wu et al. 2007). Our results showed that the concentration of periphyton Chl a was not related to the nutrient levels (TP and TN). There is good evidence that nutrient concentrations are poor predictors of Chl a (Figueroa-Nieves et al. 2006), which has been shown by other researchers (Munn et al. 1989). However, others have shown the opposite effect (LaPerriere et al. 1989). O’Brien and Wehr (2009) observed that the periphyton biomass was significantly greater in urbanized locations with a higher phosphate level. The effects of light availability on algal growth throughout the water column were not monitored, and we support the view that multiple factors may control the biomass of algae. Thus, the interaction between hydrology and light probably controls the periphyton biomass in nutrient-rich, agricultural streams (Figueroa-Nieves et al. 2006). Therefore, we suggest that using the biomass of periphyton as a parameter to assess the water quality in streams is not suitable because it can produce ambiguous results. The biomass was the lowest in July, and there were almost no filaments in a visual observation. This low biomass may have been the result of a recent storm event followed by spate. It may be related to the hydrological characteristics of the Ganhe River, which experiences a wet season in summer (Yang et al. 2009).

Fig. 6
figure 6

Regression showing the relationship between the total phosphorus and chlorophyll a concentration, the relationship between the total nitrogen and chlorophyll a concentration during the investigations

Biodiversity indices (e.g., diversity, species richness, and evenness) are not always predictable in response to environmental conditions (Ndiritu et al. 2006). We found that the diversity was higher in the urbanized area with an intensity of anthropogenic disturbance, with lower values in the upper reaches of the lower trophic level. The community composition exhibited a distinct change of species with the environmental gradients in the Ganhe River. The communities with a relatively low biodiversity in good water quality were dominated by only a few species of such as C. placentula and A. minutissimum.

C. placentula was the predominant species in weak alkaline streams in Italy (Bona et al. 2007) and Spain (Urrea and Sabater 2009). Similarly, it was present at all sites of the Ganhe River that were characterized by alkaline water in all seasons. A. minutissimum is known for its rapid colonization of new habitats because of its short generation time and high immigration rate, and it was prevalent and abundant on substrates due to its short mucilaginous stalks (Round 1990; Jöbgen et al. 2004). A. minutissimum is highly sensitive to organic pollution (Kwandrans et al. 1998), and it is regarded as a good indicator of disturbances (Fore and Grafe 2002). It dominated in the rural sites, as it does in similar areas elsewhere (Zampella et al. 2007; Urrea and Sabater 2009; Pei and Liu 2011).

These diatoms appeared to have many advantages that helped them to live in the environmental conditions present in the upstream sites of the Ganhe River. Their interspecific competitive advantages may be reflected in oligotrophic water bodies. Interspecies competition is not common in nutrient-rich water bodies so more species can coexist in the same environment. This may explain why the biodiversity was higher in the urbanized section of the Ganhe River. However, the river had an appreciable self-purification capacity (Mbuligwe and Kaseva 2005). Biodiversity may help to buffer natural ecosystems against the ecological impacts of nutrient pollution (Cardinale 2011). A higher biodiversity may be a manifestation of river self-purification and a result of long-term interactions between species and environmental factors in polluted rivers.

The diatom indices (IPS, IBD, and TDI) show different water quality classes when comparing upstream and downstream, and the 3 diatom indices tested showed similar results for a given site/time. The diatom indices (IPS, IBD, and TDI) indices were all relatively low in the urban sites, which reflected poor to bad (or meso-eutrophy to eutrophy) water quality. In contrast, the diatom indices from the sites of the rural area revealed high values which represent excellent, good to moderate (or oligotrophy, oligo-mesotrophy to mesotrophy) water quality. The diatom indices indicated differences in the water quality based on the relative abundances of indicators of trophic or saprobic stages and different types of diatom communities (Kwandrans et al. 1998). These indices had similar responses to environmental factors and were significantly negative correlated with the trophic level (TP and TN concentrations). These significant correlations indicated that the water pollution was mainly caused by nutrients. The IPS, IBD, and TDI were developed to assess the nutrient concentrations or eutrophication (Kelly and Whitton 1995), the pollution level (Coste 1982), and the general water quality (Lenoir and Coste 1996), respectively. The dominant diatoms at urban sites were nutrient-tolerant and pollution-tolerant taxa (e.g., N. palea, E. minima) in the Ganhe River basin (Duong et al. 2006; Bere and Tundisi 2011; Raven 1998). The diatom (Achnanthidium minutissimum) that dominated at the rural region was less polluted in the Ganhe River (Kwandrans et al. 1998; Fore and Grafe 2002). However, the most important factors that affected the distribution of diatom assemblages were the nutrient concentrations according to the CCA analysis of the Ganhe River. These diatom indices were also strongly negatively correlated with biodiversity (Table 2). Thus, we suggested that the pollution was due to excessive nutrient input into the Ganhe River.

Our investigations suggested that the diatom indices (IPS, IBD, and TDI) should be able to differentiate sites of various degrees of contamination and therefore seem adequate to use in China. This suggested that the diatom indices, especially IPS and IBD, should be suitable for monitoring rivers. The TDI index detected differences that may be attributed to A. minutissmum, which is considered to be a good water quality indicator in TDI (Rimet et al. 2005), although it was only distributed at urban sites and it had a relatively low abundance in winter. There were 96 % of taxa using for the calculation of the indices in our study. Xiang Tan et al. (2013) demonstrated the applicability of diatom indices for water quality and ecohealth assessment in subtropical rivers. These diatom indices produced good results in the water quality evaluation based on the spatial and temporal patterns, but this survey design only studied one river system for 1 year. Thus, further long-term research is necessary to determine whether these trends are seasonal cycles and whether the diatom assemblages exhibit seasonal species succession.