Introduction

Knowledge of species distribution at multiple spatial scales is crucial to both the management and conservation of aquatic biodiversity and the ability to assess the consequences of anthropogenic changes in the river environment, such as damming and climate change (Root and Schneider 1995). Until recently, fisheries ecologists have attempted to help managers with the task of conserving fish populations by conducting research—primarily at small spatial scales—that focuses on describing species responses to local environmental conditions and is often only weakly linked to the larger-scale problems these managers are asked to address (Allan 2004). Such studies are often targeted at a single river or a specific region, and thus cannot be generalized to cover broader spatial scales (Lammert and Allan 1999). Streams are hierarchical systems in which large-scale climatic, geological and topographical factors determine the context of geomorphic processes (e.g. hydrology, sediment transport, woody debris recruitment) that create and maintain habitats at smaller local scales (Frissell et al. 1986). Modelling the occurrence and counts of fish species should therefore also account for the influence of large-scale regional factors that condition species presence within drainage basins (Huston 1999).

Several studies have examined the relationships between environmental factors and biological data at different spatial scales, particularly from a regional versus local site-based perspective (e.g., Reino et al. 2006; Ferreira et al. 2007a). When results obtained from both approaches are compared, such studies have very often come to conflicting conclusions about the nature of specific ecological systems (e.g. Wiley et al. 1997). This view of the system at very different scales often pays more attention to a number of different types of functional mechanism (e.g. abiotic and biotic) that structure populations, and what is more, may require distinct statistical techniques, thereby leading to divergent hypotheses and conclusions (Root and Schneider 1995). Because species are subjected to processes that operate at both the landscape (regional) and instream (local) habitat scales (Allan et al. 1997), there is a need to better understand their relative importance and assess the spatial scale at which changes in the landscape should be evaluated for their impact on species populations.

Advances in the field of landscape ecology placed a new emphasis on the importance of habitat heterogeneity by introducing the concept of a dynamic landscape model of fish life history (Schlosser 1991, 1995). This model highlights the spatial arrangement of habitats for the different fish life stages and the key role that movement between them plays in complete species life cycles. More recently, riverine landscapes have been recognized as “riverscapes”—a series of complex mosaics of habitat types and environmental gradients in which fishes’ movements play a pivotal role in fulfilling their life cycles (Fausch et al. 2002). Fish life stages require different physical habitats, and their complete life cycles may extend over large spatial scales, depending on the life-history characteristics of each species (Schlosser 1995). It is thus clear that models which describe the influence of landscape attributes on fish populations need to integrate the effect of multiple landscape attributes—i.e. regional and local—on the different life stages of fish. This is of major importance in the case of potamodromous species, which typically move a great deal within freshwater river systems (Lucas and Baras 2001). However, while a lot of information is available for commercial and game species, like salmonids, little is known about the ecology of potamodromous Iberian species—namely their habitat requirements at multiple spatial scales. Moreover, such information has often displayed contradictory results. For example, in terms of habitat guild based on the level of rheophily, the literature has classified the potamodromous Iberian cyprinid species Iberian barbel Barbus bocagei Steindachner as either rheophilic (Ferreira et al. 2007b), limnophilic (Oliveira 2006), or eurytopic (Lobón-Cerviá and Fernandez-Delgado 1984). Such findings raise the question of whether habitat requirements should be determined for the species, as reported in most studies (e.g. Pont et al. 2005; Mesquita et al. 2006), or if attention should instead be focused on analysing size-related (i.e. ontogenetic) differences in habitat use patterns and related instream habitat partitioning. This is particularly important for potamodromous species, given that different life stages are likely to exhibit different habitat requirements, with these habitat-use patterns being mediated by migratory processes (Northcote 1978). Quantifying the relationships between life stages of potamodromous species and environmental gradients measured at multiple spatial scales will improve knowledge of the mechanisms that influence species occurrence and counts, and will help conserve and manage their populations.

The use of different modelling techniques to understand the role of environmental gradients in species abundance has been recognised as a crucial step in conservation and/or restoration frameworks (Olden and Jackson 2002). However, whether they look at presence/absence or counts, most of these studies often exhibit a large proportion of zero values, (i.e. are said to be “zero-inflated”) and/or show evidence of overdispersion (e.g. Potts and Elith 2006), and therefore do not fit standard distributions (e.g. normal, Poisson, and binomial) for which they are routinely modelled. On the other hand, this zero inflation can be derived from the presence of an excess number of “true zero” observations, i.e. caused by unsuitable habitat or environmental processes, or can also be the result of sampling or observer errors during data collection (“false-zero” observations). Failing to account both for the presence of excess zero observations as well as their respective source will cause bias in parameter estimates and may lead to invalid scientific inferences (Martin et al. 2005). Though models that deal with zero inflation have mainly been used in disciplines such as medicine, healthcare, and econometrics, there has recently been an upsurge of interest in such techniques in the fields of ecology and animal science (e.g. Podlich et al. 2002; Potts and Elith 2006).

The present study investigates the environmental correlates at two spatial scales (i.e. regional and local) of three native potamodromous Iberian cyprinid fishes—the Iberian barbel Barbus bocagei Steindachner, the Iberian straight-mouth nase Pseudochondrostoma polylepis (Steindachner) and the Northern straight-mouth nase Pseudochondrostoma duriense (Coelho)—which are partitioned by life stages in Western Iberia using hurdle models. Analyses also focus on how multi-scale influences interplay to shape the distribution of species life stages. Finally, results are used to discuss potential applications in the management of habitat and species populations.

Materials and methods

Study area

The study area has been described in detail elsewhere (Santos et al. 2006; Ferreira et al. 2007a). Briefly, the study was conducted in the eight largest river catchments in central and northern Portugal. The geology is complex and includes granites, schists and quartzites with various degrees of metamorphism, mainly in the inland area; whereas the western coast is dominated by tertiary layers under quaternary deposits; and the southern region, below the main course of the river Tagus, by the flat platforms of the Hesperic Massif. The climate is variable and reflects both the elevation and the proximity of the Atlantic Ocean, ranging from temperate oceanic on the northern border of the study area, to Mediterranean pluviseasonal in the continental and southern regions. The river discharge presents a high intra-annual variability, with high floods occurring in autumn and early winter, followed by a gradual decline towards late spring and summer, when most of the streams, particularly in the southernmost regions, turn into a series of isolated pools or even dry out completely. Land-use varies greatly across the study area: the northern and central part is mostly forested with Pinus and Eucalyptus plantations, and also presents a patchy pattern of orchards, vineyards, olive groves and irrigation crops; southern areas are more homogenous and are characterised by the presence of cork-oak and holm-oak woodlands.

Site collection

Sample sites were extracted in advance from a dataset of more than 500 sampling locations right across the country (Ferreira et al. 2007a). Only undisturbed or minimally disturbed sites were considered. The identification followed the EU-FAME project approach (Melcher et al. 2007; Pont et al. 2007), in which human disturbance is ranked semi-quantitatively using all available field data and GIS information. In the present case five disturbance variables were considered: connectivity of the river segment, hydrological regime, morphological condition, input of toxic and acidic effluents, and nutrient organic input. Each variable was scored from 1 (minimum disturbance) to 5 (maximum disturbance), and only sites scoring 1 or 2 for all five variables were retained for analysis. In addition, only sites spaced out 2 km from each other were retained for analyses to reduce problems of spatial autocorrelation while maintaining a sufficiently large sample size. This yielded a total of 174 sites from eight river catchments (Minho, Lima, Cávado, Douro, Vouga, Mondego, Lis, and Tagus) in western Iberia (northern and central Portugal) (Fig. 1).

Fig. 1
figure 1

Map of the study area and location of the 174 sampling sites in the different river basins across Portugal

Fish sampling

Sampling was conducted during late spring-summer base-flow conditions (May–July) in order to prevent reproductive migration periods (March–April, Santos et al. 2005) and extreme-flow events from causing bias in fish sampling or in the measurement of local habitat variables. Sites were electrofished (DC, 300–700 V, or pulsed DC, 400–1,000 V, SAREL model WFC7-HV, Electracatch International, Wolverhampton, UK) with single passes according to CEN (2003) standards, to encompass multiple habitat types (riffles, pools). The sampled distance was at least 20 times the mean wetted width of the channel. The entire widths of wadeable streams were fished by walking slowly upstream and using one anode for every 5 m of stream width. Rivers with mean depths exceeding 0.6 m were electrofished by boat moving downstream, again sampling all habitat types, but focusing on the margins. Capture efficiency estimates, assumed to be constant across sites, were not available for the area but previous studies indicated that this sampling effort was sufficient to capture a large range of fish sizes (Ferreira et al. 2007a), though it did not catch larvae and small young-of-the-year, for which specific protocols are needed (Copp 1989). After the sampling, fishes were identified, counted, measured (total length, L T, to the nearest 1.0 mm), and returned to the river alive. To account for ontogenetic differences in species counts, fishes were stratified into distinct size-classes according to reported differences in age, growth, and reproduction (Lobón-Cerviá 1982; Lobón-Cerviá and Fernandez-Delgado 1984; Doadrio 2001), although such divisions are somewhat arbitrary and flexible depending on catchment characteristics (Oliveira et al. 2002): ≤10 (juveniles), 10–20 (small adults) and ≥20 cm L T (large adults) for barbel; and ≤11, >11 cm L T for nase, which roughly correspond to the juvenile and adult life stages, respectively. Counts of species lifestages was reported as total number of individuals per ha.

Environmental variables

The analyses of factors associated with species occurrence and counts focused on two sets of environmental variables that reflect the regional context and local habitat attributes (Table 1). Regional variables included catchment area (km2), distance from source (km), altitude (m), gradient slope (%), mean air temperature (°C), mean annual rainfall (mm), average annual runoff (mm), and water mineralization [scored as 1 (low) to 3 (high)]. Catchment area, distance from source, altitude, and gradient slope were derived from digital elevation models. Mean air temperature and mean annual rainfall were determined from climate models based on time series between 1941 and 1942, and 1990 and 1991, from Portuguese weather stations. Average annual runoff (1 = <25 mm; 2 = 25–50 mm; 3 = 50–100 mm; 4 = 100–150 mm; 5 = 150–200 mm; 6 = 200–300 mm; 7 = 300–400 mm; 8 = 400–600 mm; 9 = 600–800 mm; 10 = 800–1000 mm; 11 = 1000–1400 mm; 12 = 1400–1800 mm; 13 = 1800–2200 mm; 14 = >2200 mm) and water mineralization (1 = low; 2 = medium; 3 = high) were obtained from cartographic sources (Agência Portuguesa do Ambiente 2007). Local variables included mean wetted width (m), mean depth (m), water velocity, dominant substrate coarseness, instream cover, shading, and hydrological regime at the site. Width was recorded from five transects, regularly spaced across the channel. Depth was measured with a graduated dip-net pole along each transect at a number of random points (mean number = 25). Water velocity was measured using a calibrated dip-net in randomly selected points along the transects and was quantified using three ordinal categories (Santos et al. 2006): 1, no flow (no movement of the net); 2, weak flow (slow ballooning of the net); and 3, faster flow (moderate to fast ballooning of the net). Dominant substrate coarseness (1 = silt, <2 mm particle size; 2 = sand, <5 mm particle size; 3 = gravel/pebble/cobble, <150 mm particle size, and 4 = boulder/rock, >150 mm particle size) and cover (defined as (a) logs or any submerged structure (other than substrate) in which fish could be hidden from overhead view and (b) undercut banks, submerged or overhanging vegetation <50 cm above water surface) were determined visually in 0.8 × 0.8 m quadrats below each point. Cover was scored as: 0, absent; 1, some (<25%); dense (25–60%); and heavy (>60%) cover. Overhanging tree shading (visually determined as 0 = absent; 1: <25%; 2: 25–60% and 3: >60%) and hydrological regime (0-summer drying; 1-permanent) were also determined for each sampling site.

Table 1 Mean/median and range of environmental variables measured at 174 sites in western Iberia

Statistical analyses

Principal component analyses (PCA) of z-score normalized environmental variables (Ostrand and Wilde 2002) were first performed for each species’ natural area of occurrence (B. bocagei: all river catchments; P. duriense: Douro, Cávado, Lima, and Minho catchments; P. polylepis: Tagus, Lis, Mondego, and Vouga catchments), to extract both regional and local gradients. A varimax rotation was applied to the set of principal components with eigenvalues >1, in order to obtain simpler and more interpretable environmental gradients (Legendre and Legendre 1998). Loadings ≥|0.70| were used for the interpretation of environmental gradients.

A preliminary visual inspection of the histograms of the frequency of counts for the target species revealed data to be zero-inflated (Fig. 2). Species occurrence and counts were modelled using hurdle models, as the extent in terms of the number of sites and sampled area suggests that zero observations are predominantly true zeros (Tyre et al. 2003). The hurdle model (Cragg 1971) consists of two parts. The first is typically a binary (presence/absence) response model (e.g. a logistic regression), while the second is usually a truncated-at-zero count model (see detailed model equations in Potts and Elith (2006)). The hurdle model is therefore a modified count model in which the separate processes generating the zeroes and positive counts are not constrained to be the same. This methodology allows the interpretation of the positive outcomes (>0) that result from passing the zero hurdle (threshold). Overall, these models recognise the possibility that the mechanisms which determine presence can be different to those which determine abundance (Ridout et al. 1998).

Fig. 2
figure 2

Frequency (number of sites) of the number of fish life histories captured at a site in western Iberia: (a) juvenile Barbus bocagei; (b) small adult Barbus bocagei; (c) large adult Barbus bocagei; (d) juvenile Pseudochondrostoma duriense; (e) adult Pseudochondrostoma duriense; (f) juvenile Pseudochondrostoma polylepis and (g) adult Pseudochondrostoma polylepis. Note that over 35% of each dataset is represented by zero counts

Fish life stage responses to regional and local environmental gradients extracted from PCA were estimated using hurdle models, thereby accounting for the presence of zero-inflation. Logistic regression was used to model the binary (presence/absence) component, whereas the negative binomial (NB) distribution modelled the positive counts. Hurdle NB models have been shown to be particularly suitable for modelling data characterized by an excess of zeroes (Welsh et al. 2000). The association between the explanatory variables and the response variable was tested separately for both the regional and local environmental gradients, and subsequently using only the significant ones to produce the final model. First, a set of a priori candidate “partial” models was selected that included all possible combinations of environmental gradients. Given the relatively limited dataset, only linear responses were evaluated in order to avoid the development of overly complex models. The hurdle model approach was then implemented using the number of fish recorded at a site as the dependent variable and the sets of explanatory environmental gradients as independent variables. An information-theoretic approach—the Bayesian Information Criterion (BIC; Schwarz 1978)—was used as a model selection procedure, in which the best-fitting partial model is that with the lowest BIC value. Though BIC is less used than AIC (Akaike Information Criterion, Burnham and Anderson 2002) in ecological studies, it provides more consistent model selection estimates than the latter (see for example Claeskens and Hjort 2008). In addition BIC also produces more parsimonious (Barbottin et al. 2010) and precise (Alfaro and Huelsenbeck 2006) models than AIC. Therefore, given the present complex model framework, multivariate modelling based on the philosophy of model based inference seemed more adequate and ecologically meaningful with BIC rather than with AIC. The BIC permits the simultaneous comparison of multiple models in such a way as to select the most parsimonious model consistent with the data, while balancing precision and bias (Franklin et al. 2000). The BIC values were adjusted for bias due to small sample size (BICc), and the difference between each BICc value and the BICc of the top-ranked model (Δ i ) was calculated. This BIC difference estimates the fit of each model compared to that of the best-fitting model. Models presenting Δ i  ≤ 2 are typically considered to have substantial support, and should be considered along with the best model (Burnham and Anderson 2002). The model ranking was then inspected to determine whether more than one model had substantial support (none had). For the resulting set of models, in order to evaluate model plausibility, the BIC weights (w i ), which represent the relative likelihood of each model given the data, were calculated as the ratio of the likelihood of each model to the sum of all the model likelihoods. Thus the model with the lowest BICc and consequently the highest w i was considered the best model. The significant environmental gradients of the best partial model (i.e. regional and local) were then retained to produce a final global model for each species life stage.

Lastly, to assess the relative importance of regional and local gradients in the selected models, variation partitioning was used to decompose the variation explained by the best models into four different components. The methodology followed is an adaptation of that developed by Borcard et al. (1992) and Legendre and Legendre (1998), but extended here to hurdle models by using Nagelkerke pseudo-R 2 values (Nagelkerke 1991), instead of R 2, in order to have an approximate measure of the variation. For each species life stage the best candidate model was considered—i.e. the model with the lowest BICc value. First, a “partial” hurdle model was fitted with only the regional subset of predictors present in the best candidate model (yielding the quantity a + b). The same procedure was then performed with the difference that a second “partial” model was only fitted with the local subset of predictors (yielding b + c). Finally, the “full” model (with all the predictors) was fitted to obtain the shared effects (a + b + c) (see Reino 2005; Morgado et al. 2010). Following this analysis, the total variation explained by each species life stage model \( \left( {{\text{V}} = \sum {a + b + c + d} } \right) \) was partitioned into four independent components: (i) a pure regional component (a), not explained by the local component; (ii) a shared component, i.e., the regional variation that is locally and spatially structured (b); (iii) a pure local component (c), independent of the regional variables, and; (iv) any unexplained variation not attributed to any component (d). PCA were carried out with the Statistica program (StatSoft Inc. 2000). Hurdle models were fitted in the R 2.7.2 (R Development Core Team 2008) software using the hurdle function of the PSCL package (Zeileis et al. 2008).

Results

Fish species

Species counts ranged from 0 to 7900 individuals, depending on life stages present at a given site (Table 2). Strong evidence of overdispersion was also found for all species life-stages, as the variance was much greater than the mean of the data.

Table 2 Mean, variance, maximum (Max) and minimum (Min) values of counts (number of fish/ha) for the seven species life stages collected at 174 sites in western Iberia

Environmental gradients

Principal component analyses performed on large-scale regional variables for each species area of occurrence, yielded three components with eigenvalues >1, which accounted for between 80.4% (P. duriense) and 82.1% (P. polylepis) of the total variation. The variables retained (i.e. with loadings ≥|0.70|) in each component were similar, in terms of magnitude and direction, for the three species considered. Principal component 1 (regional gradient 1, PCR1) was positively loaded on gradient slope (SLO) and negatively loaded on catchment area (CAREA) and distance from source (DSOUR), and therefore described a longitudinal gradient towards upstream steeper river sites (Table 3). PCR2 was flow-related with high negative loadings on mean annual rainfall (RAIN) and average annual runoff (ROFF), and therefore described a gradient from higher rainfall areas towards sites with lower precipitation and runoff. Altitude (ALT) was positively loaded and mean air temperature (ATEMP) and mineralization (MIN) were negatively loaded on PCR3. This gradient largely reflected a shift towards colder and low-mineralised water rivers.

Table 3 Loadings of regional and local variables on the first three principal components (PC) extracted by PCA and the eigenvalues and proportions of variance accounted for by each axis for the 174 sites sampled in western Iberia

Similarly, PCAs on instream local variables recorded for each species area of occurrence generated three components with eigenvalues >1, which accounted for between 66.0% (P. polylepis) and 80.5% (P. duriense) of the total variation. Again, selected variables in each component were numerically similar for the three species considered. PCL1 (local gradient 1) was a habitat-size gradient with high loadings on mean width (MWID) and mean depth (MDEP), and was inversely related with instream shade (SHAD), thus representing larger and deeper sites with high canopy openness. PCL2 loaded high on instream cover (COV) and substrate coarseness (SUBS), thus reflecting sites with coarser substrates and higher instream cover. The final local gradient extracted (PCL3) was positively related with water velocity (WVEL), thus representing fast-flowing sites.

Multi-scale effects on species life stage occurrence and counts

The hurdle models showed that occurrence and counts of species life stages were significantly related with both regional and local gradients. Table 4 shows the top-ranked “partial” models for each species life stage, as no other models showing substantial support (Δi ≤ 2) were found. Model selection results also suggested a relatively low to moderate plausibility for each of the best ranking models in terms of BICc (0.26 ≤ w i  ≤ 0.58 and 0.29 ≤ w i  ≤ 0.61, for regional and local models respectively) (Table 4). Results also showed some differences in model plausibility when considering different species or life-stages. Model selection plausibility for regional models was slightly higher for juvenile B. bocagei and P. polylepis. On the other hand, model selection plausibility for adult P. duriense was higher than for juveniles (Table 4). Local models showed that model selection plausibility was higher for small adults and adults for B. bocagei and P. polylepis respectively, whereas in the case of P. duriense, juveniles had higher model selection plausibility (Table 4).

Table 4 Summary results of model selection for the relationships between species life-history stages and environmental gradients reflected in each of the three components (PCn) extracted from principal component analyses of regional (R) and local (L) variables

PCR1 underlined a strong tendency for B. bocagei and adult Pseudochondrostoma spp. to occupy low-gradient river reaches, with higher catchment areas and located farthest from sources. Counts of adults of the former species displayed a similar spatial distribution. On the contrary, juvenile P. duriense showed a positive association with high-gradient upstream rivers. A flow-related gradient (PCR2) was found to be inversely associated with the occurrence of juvenile P. duriense, whereas both adult B. bocagei and Pseudochondrostoma were positively related with this regional gradient—i.e. occupied drier and lower runoff areas. The presence and counts of juvenile B. bocagei and P. polylepis was found to be related with colder altitudinal sites presenting low-mineralization waters, as shown by a positive association with PCR3.

Local gradients were also significant correlates for the occurrence and counts of the different species life-stages. Accordingly, the habitat size gradient PCL1 displayed a positive effect on the occurrence of large adult B. bocagei and on the adults from both Pseudochondrostoma species, as they were mainly found at wider and deeper sites. On the other hand, juvenile B. bocagei displayed a negative association with such areas. Sites presenting higher instream cover and substrate coarseness, as shown by PCL2, favoured the counts of juvenile B. bocagei and juvenile Pseudochondrostoma. The latter were also associated with fast-flowing areas—i.e. higher and medium courses of rivers—inasmuch as their count was found to show a positive effect with PCL3.

Variation partitioning for hurdle models

Regional, local and shared components managed to explain between 11.3% (juvenile P. polylepis) and 43.5% (adult P. polylepis) of model variation, but the source of distribution patterns varied according to the species life stage considered (Fig. 3). “Pure” regional attributes represented a higher portion of variation, compared to “pure” local components in all life stages of B. bocagei, particularly large adults, which ranged from 38.49 to 46.61%. In contrast, after accounting for regional components, “pure” local gradients were highly represented in life stages of both nase, especially juveniles, ranging from 45.82% (adult P. duriense) to 68.23% (juvenile P. polylepis), thus suggesting an underlying reach-scale spatial trend in their distribution. The shared effects, which represented the amount of variation that could be explained by the overlap between landscape-scale descriptors (i.e. regional factors) and local attributes, ranged from 12.26% (juvenile P. polylepis) to 51.62% (small adult B. bocagei).

Fig. 3
figure 3

Explained variation partitioning (“pure” regional, shared and “pure” local variation) applied to the hurdle models developed for each species life stage in western Iberia

Discussion

The present study focused on the influence of both regional and local attributes on the distribution of different life stages of native Iberian cyprinid fishes, and in doing so emphasized the need to use this kind of multi-scale approach in order to understand the hierarchical patterns of species spatial organisation. It also differed from many others in that it was performed at a network of minimally disturbed sites, and thus represented a true and accurate picture of the reality and not a consequence of any human-imposed displacement towards sub-optimal conditions (see Ferreira et al. 2007a).

Incorrect model specification can have substantial impacts on model prediction and may lead to erroneous conclusions that then affect decision-making by environmental managers (Potts and Elith 2006). Because the present datasets were found to be zero-inflated and overdispersed, the hurdle model was used rather than the more common and straightforward Poisson model, to correctly deal with the problem of excess zeros that is frequently encountered in many ecological datasets (Martin et al. 2005). The use of these models provided support for the basic hypothesis of our modelling approach: that significant relationships exist between species life stage occurrence and counts and landscape influences at multiple (i.e. regional and local) spatial scales. However, there may have been other factors which were not accounted for in the present analyses and which might increase predictive power. For example, dissolved oxygen, pH, and conductivity, which are often considered in model prediction studies at a local scale (Bain and Robinson 1988; Fausch et al. 1988), could also have been important in this study. Biotic interactions, such as competition and predation, which are inherently local in scale, are likely to affect local species counts and distribution. However, given the broad extent of this study, their contribution to the generation of the observed patterns is unlikely to be significant, as the importance of biotic factors is thought to decrease with increases in spatial scale (Tonn et al. 1990).

The selected models clearly outlined the importance of examining species life stage-environmental gradient relationships at different spatial scales (regional and local), and showed that distribution patterns contrasted depending on life stage. Although in general all barbel life stages were associated with downstream low-gradient river reaches, this pattern was more pronounced at advanced stages (large adults), as shown by the lower regression coefficients on regional gradient 1—the longitudinal river gradient. This is also supported by a positive association of juveniles with colder (upstream) sites. On the other hand, a life-stage spatial segregation—i.e. adults showing negative associations with high-order streams, whereas juveniles favoured such areas—was noted for nases. Although the association of all barbel stages with mid- and lower river courses was to be expected (Santos et al. 2004), the results also provided new insights into the presence of both nase life stages along a longitudinal gradient. Indeed, juvenile nases were found to occupy steeper fast-flowing headwater river reaches, which is not in accordance with the existing literature in which individuals of this species are reported to inhabit mid-lower river courses (Carmona et al. 1999; Santos et al. 2005; Ferreira et al. 2007a). This highlights the need to consider the use of different life stages when making spatial distribution assessments that are particularly important from a conservation point of view. Though the large-scale gradient of longitudinal position has been shown to influence fish distribution within river systems elsewhere (Joy and Death 2002; Mugodo et al. 2006), other factors were also important in shaping the spatial organisation of species life stages.

Physical habitat and channel morphology descriptors also played an important role in species occurrence and counts. Stream width and openness were negatively related with the presence of juvenile and small adult barbel, whereas large adults were mainly found in wider and deeper rivers. Similar findings (i.e. larger fish associated with wider river sections) have been outlined in the literature (Pires et al. 1999; Morán-López et al. 2006), and emphasize the importance of width as an indicator of complexity and stability in structuring stream fish distribution (Gorman and Karr 1978; Schlosser 1995). In addition to width, canopy openness also loaded high on the first local gradient extracted by PCA, and showed a positive association with large adult barbel. In contrast, smaller-sized individuals were found to favour areas with low canopy openness—i.e. those presenting a high shade cover. The importance of shade to fishes—in particular small-sized classes—has been demonstrated in a number of papers, in that it provides refuge from predators, mainly in upper reaches and smaller tributaries (Schiemer et al. 1995; Stauffer et al. 2000). The association of shade with stream width is also known to influence water temperature and therefore metabolic rates, which control growth and determine population size (Jobling 1995), as increased width-to-depth ratio and decreased shade lead to an increase in mean water temperature (Pusey and Arthington 2003). Substrate coarseness and cover were also important correlates of species life stages distribution, as shown by significant associations with the second local PCA component. Accordingly, both variables were positively related with the counts of juveniles of the target species. It is hypothesized that juveniles used high-covered gravel-pebble areas as nurseries in upstream reaches, because of their shoreline structure, high retention of organic material, and stable temperature regimes. Similar findings have been found for other lithophilic barbel and nase in Central Europe (Jurajda 1999; Schiemer et al. 2003), and highlight the need to identify and protect such areas in order to maintain fish populations in Iberian streams. Taken as a whole, the observed patterns are consistent with the general theoretical expectation that considerable complementarity exists in the spatial distribution of life stages along a longitudinal gradient (Schlosser 1995; Fausch et al. 2002). Earlier life stages tended to be found predominantly in upstream higher-gradient river reaches, while later stages were mainly associated with lower and wider slow-flowing river sections (Schlosser 1987; Lobb and Orth 1991). Consequently, this study recognized the need to consider landscape heterogeneity at multiple spatial scales as a fundamental factor among the influences on the distribution of species life stages in Iberian rivers.

Both regional and local descriptors were important correlates of species life stages distribution in western Iberia. The purely local attributes accounted for much of the model variation of both nase, and the juveniles in particular. Previous studies had pointed out the role of physical factors and hydraulic conditions, such as substrate and velocity, in governing the distribution of nase (Santos et al. 2006; Ferreira et al. 2007a). In the present study the contribution of such factors seemed to be greater in early stages (juveniles), as can also be seen from the significant positive associations with such variables, thereby highlighting their importance to the satisfaction of critical life stage requirements. Moreover, given that habitat attributes are in turn likely to be influenced by physical processes operating at larger spatial scales (Frissell et al. 1986; Mugodo et al. 2006), it is possible that a part of the high local variation observed may implicitly have a regional structure. For example, variables linked to river morphology (e.g. substrate coarseness and velocity) may have a longitudinal component, as rivers in the study area flow mostly in the same westward direction, so their effect could be countered by the effect of geography. The higher dependence of juveniles on local-scale factors could also be related with other factors, such as regional dispersal to and from connected rivers. Hitt and Angermeier (2008) found that by regulating fish dispersal, river connectivity to riverine areas affects the relative importance of regional and local factors in shaping the distribution of stream fishes. Accordingly, the observed patterns could also reflect a restriction on the capacity of juvenile nases to explore nearby connected streams, thus suggesting that in high-gradient headwater rivers, these fishes are more limited by local-scale factors than adults inhabiting downstream lower-gradient river reaches farther from sources, due to the latter’s opportunistic use of connected riverine habitats. The use of advancing techniques such as radio-telemetry may help to elucidate species life stage occupancy and the movements of individuals and their use of particular habitats. Such technology may provide the basis for determining detailed information about small-scale environmental conditions and larger-scale use of streams.

On the other hand, factors related with large-scale geography possessed a greater explanatory capacity in relation to the species (barbel) that occupied the largest streams, and particularly adults. Although the dominance of larger individuals in larger lowland habitats is a common occurrence in many Iberian rivers (Godinho et al. 2000; Santos et al. 2004), the relatively small influence of local factors on adult barbel distribution found in the present study may reflect the very large geographic scale—all eight river catchments—in which they were collected (Roth et al. 1996; Marsh-Matthews and Matthews 2000). Unlike adult barbel, which are known to undertake considerable upstream movements within rivers in order to spawn (Santos et al. 2005), the purely local attributes accounted for much of the model variation in the juvenile barbel. Taken together, these results suggest that potamodromous adult barbel, whose spatial distribution is mainly driven by regional attributes, have traits that allow them to pass through habitat filters at the stream and reach scale (sensu Frissell et al. 1986, Dollar et al. 2006), in order to complete their life cycle. These filters determine the spatial dynamics of their progeny (i.e. juveniles) at the reach scale, and their count is then primarily affected by local habitat. The findings of this study therefore point to the need to account for the use of different life stages when modelling the influence of landscape attributes on fish populations at multiple spatial scales. It is true that the explanatory gradients together explained from 11.3 to 43.5% of the total variability, but this left a fairly high portion unexplained. This implies that fish life-stage occurrence and counts may vary over space and time in response to other environmental or biological mechanisms that were not accounted for in the present study (biotic interactions, food abundance, and recruitment effects), and makes it necessary to look in more depth at their role in governing the spatial distribution of species life stages.

Results of the present study showed that the occurrence and counts of cyprinids at different spatial scales was strongly dependent on the life stages of individuals. Juveniles, which primarily responded to local factors, may be more vulnerable to immediate disturbances, such as barrier construction, that restrain their distribution in upstream river reaches. Monitoring juveniles would surely provide sound information on current stock trends and/or habitat changes, and this suggests that their spatial distribution would be a good bioindicator of river connectivity and stock status. Maintaining connectivity and local habitat quality is therefore extremely important for supporting their populations. The use of juveniles as “functional describers” would therefore provide fisheries managers with a cost-effective method with which to assess and predict a river function in terms of fish reproduction and recruitment. As fish reproduction is strongly influenced by environmental conditions, the impact of natural or human disturbances could be assessed by sampling juvenile populations. Moreover, the latter’s monitoring generally requires relatively little effort in the field, as they are relatively easy to identify and count. On the other hand, advanced life stages, such as those of adult barbel, which primarily responded to regional gradients, are likely to be mainly affected by large-scale disturbances, such as climate change and land-use modifications, and can be therefore less responsive to local management actions (Ferreira et al. 2007a). However, the results of this study should be viewed with caution, as they represent an ecological snapshot, and populations of stream fishes generally undertake important aspects (e.g. reproduction, growth, shelter) of their entire life histories at different periods (Fausch et al. 2002; Pollux et al. 2006). Temporal variation, which could not be detected by our single sampling at each site, may play a major role in understanding the importance of the spatial distribution of species life stages. Future studies should consider monitoring fish populations at appropriate temporal scales, in such a way as to record life-history events and important physical disturbances that would certainly provide additional insights for understanding the environmental correlates of potamodromous Iberian species and their implication for species management and conservation.