Introduction

In response to growing concerns about water pollution, flow regulation, and land use changes on aquatic ecosystems, the assessment of ecosystem health and the monitoring of biotic responses to environmental changes have received increasing attention from scientists and catchment managers worldwide. The index of biotic integrity (IBI) has been widely used for ecosystem health assessments across the world since it was developed by Karr (1981). Fish communities are effective indicators of ecosystem health because these integrate environmental variability with different spatial scales (Oberdorff et al. 2002), these are relatively easy to identify, and their position at the top of the food chain helps provide an integrative view of the watershed environment (Karr 1981; Simon and Lyons 1995).

The original IBI method was developed based on the biological conditions in the Midwestern United States by Karr (1981). It was later modified according to the characteristics of different basins. Oberdorff and Hughes (1992) modified and adapted the IBI to the main stem of the Seine River and its major tributaries in France. This successful modification of the IBI to a considerably different fish fauna on a different continent further supports its wider range of use outside the Midwestern United States. Casatti et al. (2009) applied a fish-based index of biotic integrity to lowland streams in a highly deforested region of the Upper Paraná River basin, and revealed that the biological integrity of 91 % of the studied streams was classified as poor or very poor. The IBI method is considered an acceptable tool for the evaluation of ecosystem health and has been widely used in the USA (Dauwalter and Jackson 2004; Tejerina-Garro et al. 2006), Europe (Breine et al. 2004; Siligato and Böhmer 2001), Brazil (Bozzetti and Schulz 2004; Casatti et al. 2009), New Zealand (Joy and Death 2004), and other countries (Ganasan and Hughes 1998; Kleynhans 1999; Qadir and Malik 2009). In addition, several studies have been conducted to evaluate the ecosystem health based on fish assemblages in eastern China such as in the Yangtze River (Liu et al. 2010; Zhu and Chang 2008) and the Liao River (Pei et al. 2010). However, neither an “index of biotic integrity” nor any other quantitative measure of biotic integrity has been applied to the Wei River.

A major reason for this is the intensive human disturbance in the Wei River. Over the past 30 years, intensified human activities have made a huge negative impact on the Wei River basin, which is characterized by decreasing annual runoff, heavy pollution, and a high sediment concentration (Song et al. 2007; Wu et al. 2012). All of the above changes have led to a marked deterioration in the ecosystem function, such as reduced fish richness, shrunk wetland, and decreased forest vegetation. The second reason is the scarcity of published biological community data from the Wei River. Little historical information is available to quantify the influence of human disturbance on the biotic integrity of the Wei River. The most recent investigation of fish assemblages in the Wei River basin was conducted in the 1980s (Xu and Li 1984), and this previous study was the first comprehensive investigation on fish assemblages. However, the changes in the fish assemblages that resulted from the huge human disturbance are still unknown, and few studies have focused on this field in the Wei River. The lack of ecological and life history information on most fishes of the Wei River basin, such as their uncertain functional role in the fish community, have also made it difficult to develop an IBI for the Wei River. In addition, there is a great variability in the environmental conditions caused by natural processes, such as precipitation, soils, and geology. The Wei River basin consists of three rivers: Wei River, Jing River, and Beiluo River, which are the first and second largest tributaries of the Wei River basin. These rivers flow through Loess Plateau, which consists of hyperconcentrated sediment, and are the main source of the sediment concentration in the Wei River basin (Song et al. 2010). The tributaries in the south bank originate from the Qinling Mountains, which are characterized by high flow and low sediment concentrations. The significant geographical features result in major natural difference in the species composition in the whole basin (Xu and Zhang 1996), which produces new challenges in the development of an IBI for the Wei River.

The previous studies on the Wei River basin mainly focused on the alteration of the hydrological regime (Wu et al. 2012), water quality assessments (Lu et al. 2010), and studies on the high sediment concentration in the river (Song et al. 2010). No study has evaluated the ecosystem health based on fish assemblages. The community structure and IBI of the fish assemblage are good indicators of human disturbance. Therefore, it is necessary to identify the ecosystem health based on the fish assemblage under the pressure of human disturbances. It is worth studying whether this tool is suitable for the assessment of ecosystem health in the Wei River. The flexible approaches used for the IBI in the Yangtze River and Liao River (Pei et al. 2010; Zhu and Chang 2008) are a good reference for the study area.

A method was proposed to develop a preliminary index of ecosystem health through three steps: (1) calculate the index of water and habitat quality (IWHQ) to identify the degraded situation of the sample sites, (2) select metrics sensitive to disturbance, and (3) construct a preliminary index of ecosystem health by scoring the identified metrics.

Materials and methods

Fish sampling methods

In the present investigation, 55 sites were sampled across the whole basin in October 2011 and April 2012 (Fig. 1). The data were divided into a calibration data set and a validation data set. The calibration data set consists of 45 sites (23 sites in the Wei River, 13 sites in the Jing River, and 9 sites in the Beiluo River). In addition, 49 sites were used as the validation data set, and 38 of these sites were sampled twice. The validation data set was used to verify the effect of the preliminary IBI. Fishes were collected for 30 min in three habitats (i.e., pools, riffles, and runs) within 200 to 300 m of all sites, and the individuals in the three habitats were combined to represent each site. In the wadeable streams, the fish collection was performed by a two-person fish collection team, i.e., one individuals used the backpack electrofisher with two handheld electrodes and one was responsible for netting fish with dip nets (Barbour et al. 1999). In the unwadeable streams, seines (mesh sizes of 30 and 40 mm) were used for fish sampling by boat, and electrofishing equipment was employed to ensure a good representation of the fish at the site. All of the individuals (with a total length longer than 20 mm) collected were identified by species, counted, examined for external anomalies (i.e., deformities, eroded fins, lesions, and tumors), measured (standard length to the nearest millimeter), and recorded on field data sheets. Moreover, individuals with a total length shorter than 20 mm were not identified or included in the standard samples. When a large number of individuals of any one species were encountered in a sample, the biometric characteristics (e.g., body length) of a representative subsample of 30 individuals were measured. Specimens that could not be identified in the field were preserved in a 10 % formalin solution and stored in labeled jars for subsequent laboratory identification. The fish were identified according to fishes in the Qinling Mountains area (Chen et al. 1987).

Fig. 1
figure 1

Location of the sample sites in the whole basin. The rose, green, and light blue points represent sites in the Wei River, Jing River, and Beiluo River, respectively. The white and green circles represent the main city and the provincial capital in the basin

Environmental measurement

The habitat conditions of each sample site were assessed during the sampling period using a qualitative habitat evaluation index (QHEI). The habitat quality assessment process based on visual observation requires rating the ten habitat parameters as “optimal,” “suboptimal,” “marginal,” or “poor.” Given the ecosystem situation in the Wei River, the habitat parameters in this study were modified based on that provided by Ohio’s EPA (Barbour et al. 1999). The following parameters were studied: substrate, habitat complexity, velocity/depth regimes, bank stability, channel alteration, channel flow status, vegetation protection, water quality, anthropogenic alterations, and riparian land use. The habitat conditions were assessed through the sum of the scores obtained for the ten parameters.

The following environmental variables were measured in situ at each sample site. The altitude (Altt) was determined using a GPS, i.e., Magellan eXplorist 200. The physicochemical variables, including pH, dissolved oxygen (DO), salinity (Sali), saturation (Satu), conductivity (Cond), and total dissolved solids (TDS), were measured by YSI Pro Plus 85 in the field. Other physiochemical variables, including alkalinity (Alk), chemical oxygen demand using KMnO4 (CODMn), water hardness (hardness), total nitrogen (TN), total phosphorus (TP), suspended solid (SS), and SiO3, were tested in the laboratory according to standards from the State Environmental Protection Administration of China (2002).

Selection of reference sites

The reference condition is representative of a group of minimally disturbed sites that were chosen based on selected physical, chemical, and biological characteristics (Reynoldson et al. 1997). The choice of an appropriate reference condition is a major problem in ecological assessments (Hughes 1995). Sample sites that are undisturbed by humans are ideally chosen as reference sites. However, nearly all of the rivers in China have been seriously disturbed by human activities, and it is impossible to find a pristine river. In this paper, the IWHQ was used to assess the environmental quality at each site. IWHQ is a hybrid indicator that integrates the habitat condition with the water quality. The original proposal by Lyons et al. (1995) selected the lower quality score (either water or habitat). Schmitter-Soto et al. (2011) combined both criteria to attain a qualitative scale of 10 points, i.e., IWHQ. IWHQ was later correlated with the final IBI to estimate the latter’s performance. We presume that a lower IWHQ reflects a better environmental quality, which indicates relatively fewer disturbances by human activities. Thus, those sites with better environment quality were selected as the reference sites.

In this paper, the IWHQ was modified based on the previous studies. There are many physiochemical variables in this analysis, and the correlation between some of the variables may be high. To simplify the calculation, principal component analysis (PCA) and Pearson correlation were applied to identify the primary physiochemical variables. The primary water quality variables and QHEI were scored through a single factor water quality identification index proposed by Xu (2005a, b), which is based on five levels of water quality standard in China (Table 1).

Table 1 Scoring criteria of the physiochemical variables and QHEI in the five classes

The minimum values of some of the variables, such as CODMn, SS, Cond, and TP, are indicative of the best water quality. The score can then be calculated as follows:

$$ {\mathrm{IWHQ}}_i=\left\{\begin{array}{cc}\hfill 1\hfill & \hfill {C}_i\le {S}_{\mathrm{i},1}\hfill \\ {}\hfill j+\frac{C_i-{S}_{i,j-1}}{S_{i,j}-{S}_{i,j-1}}\hfill & \hfill {S}_{i,j-1}\le {C}_i<{S}_{i,j},\kern2em j=2,3,4,5\hfill \\ {}\hfill 6+\frac{C_i-{S}_{i,5}}{S_{i,5}}\hfill & \hfill {C}_i\ge {S}_{i,5}\hfill \end{array}\right. $$

In contrast, the maximum values of some of the other variables, such as DO and QHEI, are indicative of the best water quality. In this case, the score can be calculated as follows:

$$ {\mathrm{IWHQ}}_i=\left\{\begin{array}{cc}\hfill 1\hfill & \hfill {C}_i\ge {S}_{\mathrm{i},1}\hfill \\ {}\hfill j+\frac{S_{i,j-1}-{C}_i}{S_{i,j-1}-{S}_{i,j}}\hfill & \hfill {S}_{i,j}\le {C}_i<{S}_{i,j-1},\kern2em j=2,3,4,5\hfill \\ {}\hfill 6+\frac{S_{i,5}-{C}_i}{S_{i,5}}\hfill & \hfill {C}_i\le {S}_{i,5}\hfill \end{array}\right. $$

where C i is the observed value of variable i, i = 1, 2…n, n = 8; S i,j is the j class value of variable i.

The final IWHQ is the average scores of all of the variables:

$$ \mathrm{IWHQ}=\frac{1}{n}{\displaystyle \sum_{i=1}^n{\mathrm{IWHQ}}_i} $$

According to the study conducted by Xu (2005a), the environmental quality is categorized into three classes: least degraded (IWHQ ≤ 2); medium degraded (IWHQ ≤ 4), and highly degraded (IWHQ > 4). The sampling sites with IWHQ ≤ 2 were chosen as the reference sites.

Metric systems for index of biotic integrity

Considering the present situation of the Wei River ecosystem and the application of the IBI (Karr 1991; Oberdorff and Hughes 1992; Pei et al. 2010; Simon and Lyons 1995), 40 metrics (Table 2) were calculated as candidate metrics. A statistical analysis (Baptista et al. 2007; Wang et al. 2005; Zhang et al. 2007) was applied to select the metrics that are sensitive to human disturbance. First, a metric was eliminated if the values for the metric at 90 % of the total sites are zero. Second, the fish community distribution may change with different altitudes (Ding et al. 2012). Certain characteristics of fish assemblages are expected to change with stream order (Yan et al. 2010). To distinguish the effects of stream order, altitude, and environmental quality on the fish assemblage characteristics, the Kruskal–Wallis test was applied to validate the distinction of different stream orders and altitudes based on the reference sites. For those metrics in which the altitude and the stream order did not contribute significantly to the analysis, a Mann–Whitney U test (p = 0.05) was used on the same metrics, and the environmental quality was the only independent classification variable used in this analysis. The sensitivity of each metric that exhibited an obvious difference between the reference and the impaired sites was then judged according to the degree of interquartile overlap in box and whisker plots (Barbour et al. 1999). The Pearson’s correlation was then calculated for the retained metrics. For two metrics with an obvious correlation (r ≥ 0.7), one will be retained to simplify the index because that one metric sufficiently represents the pattern of information. The retained metrics constitute the core index of biotic integrity.

Table 2 Candidate metrics used in the assessment of fish communities

The continuous scoring method (CALU), which produced the best overall index for this set of metrics and data according to the study conducted by Blocksom (2003), was applied to assess the ecosystem health at 45 sample sites. The continuous scoring method uses the 95th or 5th percentile value as the best expected value. The metric scores are truncated to a range between 0 and 100, and the ratios greater than 100 are set to 100. The total score of the IBI at each site was calculated as the sum of all of the metric scores.

The integrity classes were classified into four groups: good, fair, poor, and very poor. The 25th percentile of reference site values was calculated to separate the good and fair conditions (Klemm et al. 2003), and the other two thresholds were obtained by trisecting the 25th percentile to define very poor, poor, and fair conditions (Zhang et al. 2007). To validate the index, the overall IBI scores were correlated with the IWHQ to estimate the performance of the IBI in the Wei River based on the validation data set.

Statistical analysis

The PCA, Pearson correlation (two tailed), Kruskal–Wallis test, and Mann–Whitney U test were performed using the SPSS 16.0 software. The Origin 8.0 program was used for the box and whisker plots. The canonical correspondence analysis (CCA) was conducted using the Canoco 4.5 software program.

Results

Water quality analysis and selection of reference sites

The PCA of the physiochemical variables was used to select the primary variables. The first PCA axis accounted for approximately 30 % of the variance (eigenvalue = 3.91, Table 3) and Cond, sali, and hardness, which indicate the inorganic pollution, exhibited the highest contribution, and had the highest positive loadings. The second PCA axis accounted for 19.5 % of the variance (eigenvalue = 3.91) and saturation, DO, CODMn, and SiO3, which indicate the organic pollution, exhibited the highest contribution to the second axis. The third and fourth axes explained 12.8 and 9.6 % of the variance with eigenvalues of 1.66 and 1.25, respectively. SS, TP, and TDS were the three primary physiochemical variables for these two axes. The redundancy in these ten variables was evaluated via Pearson correlation coefficients. Those metrics with a correlation coefficient r greater than or equal to 0.7 were considered redundant. Because salinity and hardness were redundant with cond, cond was retained to represent the first axis. Saturation was redundant with DO and was thus excluded. DO, CODMn, and SiO3 were retained to represent the second axis. After the correlation analysis, Cond, DO, SS, TDS, TP, CODMn, and SiO3 were used to represent the water quality, and QHEI was added to calculate the IWHQ.

Table 3 Weights of the first four principal components

The environmental quality was evaluated, and the IWHQ was scored as summarized in Table 4 and Fig. 2. In the whole river basin, only seven sites were categorized as least degraded (IWHQ ≤ 2). These sites (W1, W5, W10, W12, W13, W14, and W16) were chosen as the reference sites. These seven sites were all located in the tributaries of the Wei River basin. Twenty sites were classified as the medium disturbed condition (IWHQ ≤ 4), and the remaining 18 sites, which is nearly half of the number of sites, were classified as highly disturbed (IWHQ > 4). In the Wei River, seven sites were in the least degraded condition, ten sites were in the medium degraded condition, and six sites were in the highly degraded condition. In the other two rivers, the environmental quality was much worse than in the Wei River. There was no site classified as least degraded. Seven of the 13 sites in the Jing River were in the highly degraded condition (IWHQ > 4). A similar phenomenon was found in the Beiluo River, in which five of the nine sites were in the highly degraded condition. Thus, the environmental quality in the Wei River was better than that in the Jing River and Beiluo River.

Table 4 Scores of the index of water and habitat quality (IWHQ) and ecosystem health for the calibration data set in the Wei River basin
Fig. 2
figure 2

The IWHQ at the sample sites in the whole river basin. a IWHQ scores of DO and SS. b IWHQ scores of TDS and Cond. c IWHQ scores of CODMn, SiO3 2−, and TP. d IWHQ scores of QHEI and the final IWHQ

IWHQ for variables exhibited spatial difference in the three rivers. The IWHQ of DO was lower, except for W2 and W7, and the W15, W18, W21, and W23 sites, which are located downstream of the main stem in the Wei River, exhibited a higher IWHQ of DO, which indicates that DO was the main polluted variable in the downstream part of the river. The TP of W15, W18, W21, and W23, which are located downstream of the main stem of the Wei River, exhibited a higher value. The SS in the Jing River and Beiluo River was of higher IWHQ than that in the Wei River, and the greatest values reached 5,778 mg L−1 (L3) and 5,516 mg L−1 (J9), respectively. A similar trend was observed with TDS and Cond with lower IWHQ in the Wei River compared with that in the other rivers. The statistical analysis showed that the SS and TDS in the Jing and Beiluo Rivers were significantly higher (p < 0.01). A higher Cond was measured in the Jing and Beiluo Rivers. However, W2, W6, and W7 were outliers with markedly higher TDS and Cond, and a similar phenomenon was found in the Jing and Beiluo Rivers. The CODMn and SiO3 2− in the three rivers exhibited slight differences (p > 0.05). The TP in the other rivers were markedly higher than that in those four sites (Fig. 2c). In addition, the QHEI in the three rivers also exhibited slight differences (p > 0.05).

A preliminary index of biotic integrity

Of the 40 metrics calculated for the sites, seven metrics (P4, P18, P22, P29, P30, P35, and P36) were eliminated before the next procedure because more than 90 % of these sites exhibited a value of 0 for the metrics. The Kruskal–Wallis test indicated that the distinguishing of the stream order and the altitude was not significant based on the remaining 33 metrics in the reference sites (p > 0.05). Based on the Mann–Whitney U test, the metrics of P1, P2, P5, P8, P10, P13, P23, P25, and P39 exhibit a marked difference between the reference sites and the impaired sites (p < 0.05). These nine metrics were confirmed to exhibit no or little overlap between the reference sites and the impaired sites through the box and whisker plots. P5 was redundant with P1. P13 and P23 were redundant with P10 (r > 0.7 and p < 0.01). Thus, six metrics were selected as the primary metrics for the IBI (Fig. 3): P1, P2, P8, P10, P25, and P39.

Fig. 3
figure 3

Box and whisker plots showing the nine candidate metrics that were sensitive to environmental changes. R (green boxplot) and I (red boxplot) represent the reference sites and the impaired sites, respectively. The lines in the boxplot from top to bottom represent the 95, 75, 50, 25, and 5 % percentile. The dot denotes the mean value of all the reference or impaired sites

The overall IBI was the sum of the individual scores for these six metrics, and it potentially ranged from 100 points, which indicates the best possible ecosystem health, to 0 points, which indicates the worst possible ecosystem health. Table 5 provides the thresholds for the four ecosystem health statuses. The IBI scores for the 45 sites in the calibration data set ranged from 0 to 68.59 points (Table 4).

Table 5 Score interpretation for IBI values in the Wei River

Validation of the IBI method

The adaptability of the IBI was validated through the correlation of the IBI scores and the IWHQ based on the validation data set. The overall IBI scores were negatively correlated with the index of environmental quality (IWHQ) (Pearson correlation = −0.64; Fig. 4), which indicates that the index should be useful for biomonitoring and the conservation of biodiversity. The IBI scores decreased linearly with the IWHQ. In addition, the validation in Fig. 5 also showed good discrimination between the reference sites and the impaired sites, which further illustrated the robustness of this index.

Fig. 4
figure 4

Regression of the IBI score as a function of the index of environmental quality (IWHQ) for the 49 study sites in the Wei River basin

Fig. 5
figure 5

Ability of IBI scores to differentiate between the reference and the impaired sites in the whole basin. The abbreviations are denoted in Fig. 3

Ecosystem health in the whole basin

The index of biotic integrity exhibited large variation in the whole basin based on the validation data set (Fig. 6). For the 23 sites in the Wei River, five sites (W1, W10, W12, W14, and W25) were in good condition, five sites (W3, W11, W16, W19, and W26) were in fair condition, eight sites (W0, W5, W8, W9, W15, W17, W20, and W21) were in poor condition, and five sites (W4, W18, W22, W24, and W27) were in very poor condition. The ecosystem health in the Jing River showed a significant difference between all sites (p = 0.000). Two sites (J6 and J10) were categorized to be in good condition, four sites (J3, J4, J5, and J9) were in fair condition, two sites (J11 and J8) were in poor condition, and five sites (J1, J2, J7, J12, and J13) were in very poor condition. No fish were obtained at site J13, which was classified as very poor. The sites in the Beiluo River exhibited the worst ecosystem health, and their IBI values ranged from 2.87 to 32.88. Thus, the ecosystem health was judged as poor or very poor. However, L2, with an IBI score of 2.87, exhibited the worst condition in this river. In this site, only three specimens were captured, which resulted in the low score. The ecosystem health of the Wei River was better than that of the Jing River, and the ecosystem health in the Beiluo River was the worst. Of the 49 sites, 13 sites were in very poor condition, 20 sites were in poor condition, 9 sites were in fair condition, and only 7 sites were in good condition. The sites in poor or very poor condition accounted for 67.35 % of the total sites. In summary, the ecosystem health of the whole basin was not good, but the ecosystem health in the Wei River was better than that in the Jing River, and ecosystem health in the Beiluo River displayed the worst condition.

Fig. 6
figure 6

Ecosystem health in the whole basin. The circle size indicates the ecosystem health and bigger circle indicates better ecosystem health. The others are denoted as in Fig. 1

Discussion

Spatial variation in the water quality

The water quality variables and QHEI exhibited different variations in the whole river basin (Fig. 2). Most of the sites with a lower IWHQ were located upstream or in the tributaries of the Wei River, where there is a low population density and high vegetation coverage. The downstream sites in the Wei River, except W2, W6, and W7, exhibited markedly worse environmental quality with higher TP and CODMn, and lower DO; this trend was particularly observed from Baoji to Weinan. It was common that a lower DO and higher TP and CODMn was measured downstream of the main stem in the Wei River (W15, W18, W21, and W23). There are large cities with large population density, such as Tianshui, Baoji, Xianyang, Xi’an, and Weinan, located downstream of the Wei River, and these caused great nutrient and organic pollution, such as industrial wastewater effluents and domestic disposal plant effluents. This effect was demonstrated by Lu et al. (2010), who found higher CODMn in the water from vicinal reach of Baoji, Xianyang, and Xi’an. A similar result was found in the Taizi River (Wan et al. 2013). However, W2, W6, and W7 were outliers with markedly higher TDS and Cond, and a similar phenomenon was found in the Jing and Beiluo Rivers.

The sites in the Jing River and Beiluo River, including J1–J13 and L1–L9, were characterized by high Cond, SS, TP, and TDS. These were mainly caused by three factors: (1) flowing through the Loess Plateau, which exhibits serious soil erosion and high suspended solid (Han 2003), (2) poor water quality in the natural state and characteristics of high total dissolved solids, saline, and hardness (Shi 2008), and (3) a massive development of the petroleum industry in the upstream of the Beiluo River and the Malian River, which is a tributary of the Jing River (Kang et al. 2008).

Selection of reference sites and development of IBI

The definition of the reference conditions establishes an appropriate benchmark for the assessment of the ecological status and the trends of wadeable streams. Therefore, the selection of reference sites is critical when the IBI is estimated. In many parts of the world, rivers in pristine condition are rare, and reference sites have to be defined as the least affected sites (Tejerina-Garro et al. 2006). Historical data were suggested as an essential source to describe the reference condition, but it is impossible to collect these data. Particularly in the Wei River basin, the latest and comprehensive study on fish assemblages was conducted in 1984 (Xu and Li 1984) and consisted of the species richness and ichthyological fauna in the whole basin. Other studies have been performed, but these were focused upstream of the Wei River (Song and Wang 1983), Shaanxi Province, or some other regions (Xu and Zhang 1996; Liu et al. 2003). The scarcity of information made it difficult to select the reference condition through historical data. Many studies (Qadir and Malik 2009; Zhu and Chang 2008; Casatti et al. 2009) chose the least degraded sites as the reference sites, such as in the case of highly degraded basins, and environmental variables, such as the water and habitat qualities, were the main consideration. Casatti et al. (2009) selected the reference sites according to the physical habitat index, which identified good physical habitats as the reference sites. Pei et al. (2010) selected the reference sites based on their water and habitat qualities. Studies have shown a significant correlation between the fish assemblages and the physical habitat condition (Casatti et al. 2006). It was possible to select reference sites based on the water quality and habitat quality.

In this study, the water and habitat qualities were calculated by the IWHQ, and those sites in the least degraded conditions (IWHQ ≤ 2) were identified as the reference sites. The selected reference sites were mostly located in the upstream reaches of the tributaries in remote, thickly forested mountains with less human disturbance. This result is similar to that reported by previous studies conducted by Qadir and Malik (2009) and Roth et al. (1998). Although these sites cannot characterize a pristine state, these exhibited the least degraded situation in the whole basin. In addition, all of the reference sites in the calibration data set, with the exception of W5, were rated as being in good condition. W5 exhibited a high IBI score of 57.05, which is near the good condition threshold. In the validation data set, all of these reference sites were also categorized as being in good condition. If a “reference site” consistently receives a fair or poor rating, then the site’s suitability for serving as a reference should be re-evaluated (Barbour et al. 1999). The assessment results in the calibration and validation data set indicated that the reference sites in this study were all classified as being in good condition, which confirms that the selected reference sites were suitable.

Metrics that are sensitive to environmental changes were measured to assess the ecosystem health. The Kruskal–Wallis test indicated that the differentiation of the stream order and altitude was not significant in the reference sites (p > 0.05), which indicated that the stream order and altitude exhibited a small effect on the metrics. Although the fish species changed gradually from headwaters to the main stem, the difference in their abundance may be related more to changes in the local microhabitat features than to any pervasive effect of stream order or longitudinal position in the stream system (Matthews 1986). In this basin, the altitude of these sites ranged from 2,400 to 360 m, although most were located at an altitude between 600 and 1,500 m; this variation in altitude caused some difference, but this difference was not significant. The primary metrics of the IBI consists of six metrics (Fig. 3): P1, P2, P8, P10, P25, and P39. Three of the primary metrics for the IBI (the number of species (P1), the proportion of individuals as sensitive species (P25), and the number of individuals in the sample (P39)) were the same or similar to the metrics used by the Karr (1981). Three metrics (the total biomass (P2), the number of Cobitidae species (P8), and the proportion of species in the middle and low tiers (P10)) discriminated significantly between the reference sites and the impaired sites in the Wei River basin. Cobitidae are very common in the Wei River (Xu and Li 1984) and comprise the groups of fishes that thrive under conditions of high human disturbance.

Reliability of the IBI method

The IBI scores decreased linearly with the IWHQ, as shown in Fig. 4. All of the least degraded sites had high index scores, and most of the highly degraded sites had low scores. The box plot in Fig. 5 showed good discrimination between the reference sites and the impaired sites, which further illustrates the reliability of this method. The ecosystem health based on the IBI exhibited a large variation in the whole basin based on the validation data set and accurately reflected a known spatial pattern in the environmental quality of the whole basin. The ecosystem health of the Wei River decreased from upstream to downstream and was better than that of the Jing River and the Beiluo River, similarly to the trends found in the environmental quality based on the IWHQ.

However, two types of errors were displayed in Fig. 4. The ecosystem health measured through the IBI score was much better than that measured through the IWHQ, as displayed in group I, which is represented by W14, W25, J6, and J10. The ecosystem health measured through the IBI score was worse than that measured through the IWHQ, as displayed in group II, which is represented by W4, W18, J1, J2, J7, J12, and L3. These inconsistencies also appeared in the fish-based index of French Guiana (Tejerina-Garro et al. 2006). These errors were mostly focused on those sites with an IWHQ value between 2 and 4. Site W0, which is located at the headwater of the Wei River, was classified as least degraded (IWHQ = 1.55) but exhibited an IBI score of 37.91 (fair condition). Four species and 24 individuals were collected in this site, and only one was a sensitive species, which resulted in a relatively low IBI score. W14 was in the medium degraded condition because of its slightly high TP but received a high index score based on the three species, 328 individuals and one sensitive species sampled. The TP of 0.443 mg L−1 appeared to have little impact on the fish assemblages in W14.

There were various reasons for these differences between the results obtained through the IWHQ and the IBI. First, the IBI was integrated from six metrics, which may react differently to different disturbances. However, the interactions among these factors and the ways in which these factors affect the fish history are not yet clear. Additionally, these environmental variables exhibited different contributions to the fish community structure in different catchments. Rodríguez and Lewis (1997) indicated that the assemblage structure is predictably related to only four descriptors of lakes: transparency, conductivity, depth, and surface area. It was apparent that fish assemblages are not limited by a single factor such as pH or conductivity. Instead, an interaction among many variables is involved (Rahel 1984). Second, our classification of reference and impaired sites could be limited in some cases. The habitat quality was assessed at each sampling site using a QHEI, which is based on visual observation. Ten descriptors (substrate, habitat complexity, velocity/depth regimes, bank stability, channel alteration, channel flow status, vegetation protection, water quality, anthropogenic alterations, and riparian land use) were included in this index. This index was obtained on the basis of visual observation, and inevitably leads to some subjective errors. Some better method, such as physical form condition, is more suitable to reflect geomorphic condition. However, it is more cost-consuming and time-consuming assessments. Therefore, a more rapid visual-based physical form assessment was applied widely according to USEPA’s RBP (Barbour et al. 1999).

Conclusions

The preliminary attempt to calculate a fish index of biotic integrity for the Wei River basin has shown that the ecosystem health of the Wei River is better than that of the Jing River and the Beiluo River. The Wei River has been severely disturbed by human activities, such as untreated industrial wastewater and urbanization, which has had negative impacts on the fish assemblages and diversity. The results obtained in this study show that the IBI is a suitable assessment tool to evaluate the ecosystem health of the Wei River. The IBI is correlated with the water and habitat quality (IWHQ) and is therefore a good indicator of the ecosystem health. However, some limits exist in this index. First, the water and habitat quality may not have accurately measured some types of environmental degradation, such as alteration of the hydrological regime. Second, this preliminary index was established using limited ecological data, and it is necessary to thoroughly test the ability of this index to assess the health of these rivers with an independent set of fish assemblage and environmental quality data. Third, although fish assemblages have been regarded as an effective biological indicator of environmental quality and anthropogenic stress in aquatic ecosystems since they are relatively easy to identify and their position at the top of the food chain helps to provide an integrative view of the watershed environment (Karr 1981), it is still incomplete for only using fish assemblages as the indicator. Biological assemblages respond to many kinds of disturbance, and assessments of biological condition would be based on multiple biological assemblages, thereby providing insight into the overall biological condition of an ecosystem (Carlisle et al. 2008). Some papers (such as Davies et al. 2010) also pointed out that ecosystem health is currently determined primarily on the basis of fish and macroinvertebrate condition, and recommend to include condition of vegetation in the future. Both macroinvertebrate and algae will be added to assess ecosystem health in the study area in the future. This investigation aimed to clarify the ecosystem health of the Wei River, where restoration activities are needed, and to monitor trends in ecosystem health over time.