Introduction

Ecological systems typically show considerable environmental and biological heterogeneity. Streams are particularly heterogeneous systems at multiple spatial scales (Frissell et al., 1986), and this heterogeneity is mirrored in the organization of their biological communities (Heino et al., 2015a). Much of the stream research has traditionally been conducted at small spatial scales (Allan, 2004; Johnson et al., 2007), often within stream reaches of a few hundred meters and in their immediate surroundings (Allan, 2004). However, it has been recognized that local habitat and stream biodiversity are strongly influenced by landforms and land-use in the surrounding valley (Hynes, 1975; Allen & Starr, 1982; Johnson et al., 2007). In fact, fish assemblage structure is driven by small-scale (e.g., reach) and large-scale (e.g., catchment) alterations in environmental characteristics (Fitzpatrick et al., 2001; Strayer et al., 2003), but their relative importance is debated. For instance, local-scale factors (e.g., substratum composition) explained a higher amount of the variation in the structure of stream fish assemblages among sites than large-scale variables (e.g., catchment land-use) (Lammert & Allan, 1999; Diana et al., 2006; Johnson et al., 2007). In contrast, other studies have emphasized the high importance of catchment features as drivers of fish assemblage structure (Roth et al., 1996; Leitão et al., 2018). Although scientists have increasingly adopted a catchment-scale view of streams (Allan et al., 1997; Johnson et al., 2007), the understanding of the relative influence of local-scale versus catchment-level factors on stream biota remains elusive (Angermeier & Winston, 1998; Strayer et al., 2003; Cianfrani et al., 2012; Li et al., 2019).

In Neotropical streams, few studies have considered the effects of environmental variables at different scales on fish assemblages (Roa-Fuentes et al., 2019; Benone et al., 2020). For some studies, catchment and local predictors together explained most of the variation (Bordignon et al., 2015; Leitão et al., 2018), while authors have found no effects of catchment features (Casatti et al., 2015; Gerhard & Verdade, 2016; Roa-Fuentes & Casatti, 2017). A better understanding of the influence of small-scale and large-scale environmental factors on fish assemblages in Neotropical streams could be useful to design conservation strategies and more robust monitoring and restoration programs (Johnson et al., 2007; Feld, 2013; Wahl et al., 2013; Leitão et al., 2018). Moreover, this knowledge can subsidize models for regional stream management, since catchment-scale features may be manipulated to influence factors at local scale, which ultimately would influence the structure of aquatic assemblages (Allan et al., 1997; Wang et al., 2003; Cruz et al., 2013; Feld, 2013). This is also in accordance with recent considerations on the stream restoration effectiveness, where catchment actions are more efficient than interventions focused exclusively on instream habitat restoration at the local scale (Palmer et al., 2010).

Beyond catchment and local-scale factors, assessing the relative contribution of spatial structure (a proxy for dispersal-related processes) to stream fish assemblages may suggest guidelines for biodiversity management, especially in human-dominated landscapes (Bengtsson, 2010; Heino, 2013). For instance, when there are low dispersal rates between sites and local processes determine diversity (i.e., species sorting), the management of local features and local habitat heterogeneity is fundamental to maintain diversity (Bengtsson, 2010). On the other hand, if dispersal has major effects on local biodiversity, it will be necessary to identify and manage source sites and promote landscape heterogeneity to maintain colonization sources for different species (Bengtsson, 2010).

Recently, functional, phylogenetic and taxonomic approaches have been used in (meta)community studies (Roa-Fuentes et al., 2019; Li et al., 2020), because they can improve our understanding on how species interact with each other and with the environment (García-Girón et al., 2020). Functional diversity may reflect the ability of a given assemblage to effectively respond to environmental changes (Díaz et al., 2007), or is linked to biological changes driven by modification of land-use and their consequences for ecosystem functioning (Luck et al., 2013; Leitão et al., 2018). In Neotropical streams, the functional diversity of fish assemblages has been addressed through characterization of traits associated with body size, fish trophic guilds, vertical and horizontal habitat preference, and foraging period, i.e., traits exhibiting response to local and landscape environmental variation (Teresa & Casatti, 2012; Casatti et al., 2015; Dala-Corte et al., 2016; Leitão et al., 2018; Zeni et al., 2019). Phylogenetic diversity is a proxy for the accumulated evolutionary history of a given assemblage and, therefore, might be associated with its ability to generate new evolutionary solutions in the face of disturbance and/or to species persist despite those disturbances (Forest et al., 2007; Faith, 2008; Safi et al., 2011). Phylogenetic diversity has been proposed as a means to prioritize species and areas for conservation (Faith, 2008; Safi et al., 2011), but its use in the context of stream fish assemblages is still rare (but see Aquino & Colli, 2017; Roa-Fuentes et al., 2019). Taxonomic diversity (e.g., species richness) is the most widely used measure for an approximation of biodiversity and to assess the effects of land-use on the structure of stream fish assemblages (e.g., Lammert & Allan, 1999). It has been used for many years in ecological studies, however, there is a general consensus that taxonomic diversity alone cannot adequately describe the processes involved in species coexistence and ecosystem functioning, and the differences in assemblage structure among sites (Safi et al., 2011). From a stream biomonitoring perspective, there is evidence that functional and phylogenetic facets are more sensitive to environmental disturbance, being able to discriminate impacted from preserved streams (Menezes et al., 2010; Saito et al., 2015a). Therefore, multiple and complementary facets should be included in studies describing not only stream biodiversity and their response to multiple-scale environmental changes, but also stream assessment or biomonitoring (Heino et al., 2008; Saito et al., 2015a).

In this study, we assessed the relative contribution of local features, catchment land-use and spatial structure to explain the variation in the taxonomic, functional and phylogenetic α-diversity of stream fish assemblages. Thus, we answered the following questions: (i) Are taxonomic, functional, and phylogenetic facets correlated with local and/or catchment environmental features? And, (ii) are these facets spatially structured? We hypothesized that: (i) both local and catchment features would explain the variation in the three biodiversity facets, since catchment land-use would affect local features, which in turn would influence stream fish assemblages. Moreover, (ii) biodiversity facets will be weakly spatially structured, as dispersal limitation will not be important at this spatial extent. In fact, Heino et al. (2015b) found that environmental control prevails over spatial constraints within single small drainage basins. Furthermore, the taxonomic facet will be more strongly affected by spatial factors, whereas the functional facet often distinguishes stream fish assemblages along habitat gradients, irrespective of spatial position of a site (Hoeinghaus et al., 2007).

Methods

Study area

This study was carried out in the Turvo-Grande and São José dos Dourados basins, Upper Paraná River basin, in the northwestern part of São Paulo State, Brazil (Fig. 1). These two basins belong to the same biogeographic province and, consequently, fish assemblages have a common evolutionary history (Géry, 1969). The region is part of the Serra Geral geological formation and shows a relatively flat slope and plains of quaternary fluvial sedimentary nature (IPT, 1999). The soil has a high erosive potential since it is composed of unconsolidated sediments (e.g., sand and clay) (Silva et al., 2007). The climate is tropical and hot, with one dry season with lower rainfall and cooler temperatures between June and September, and a wet season between December and February with higher rainfall and warmer temperatures (IPT, 1999). The region was formerly covered by semi-deciduous seasonal Atlantic forest (Silva et al., 2007), but the landscape has been changed since the beginning of the last century (1900) through the development of coffee crops, followed by the establishment of cattle (Victor et al., 2005), and more recently sugarcane (Rudorff et al., 2010). Nowadays, the native forest is limited to less than 4% of its original area, distributed in small and unconnected fragments embedded in agricultural matrices (Nalon et al., 2008). Similarly to other São Paulo river basins (e.g., Corumbataí basin; Gerhard & Verdade, 2016), the stream fish fauna of the study area is presumed to have already been homogenized, probably due to habitat simplification (Casatti et al., 2009), species introductions (Rahel, 2002) and an extensive, dynamic and long-lasting history of land-use change (Victor et al., 2005; Rudorff et al., 2010).

Fig. 1
figure 1

Sampling units (85 catchments = 85 reaches) along São José dos Dourados and Turvo-Grande River basins at northwest region of São Paulo State (gray area in the country map), southeastern Brazil

Site selection

For site selection, we mapped the land-use in the São José dos Dourados and Turvo-Grande basins through the digital processing of LANDSAT-5/TM satellite images of 2011 year (221–74, 221–75 and 222–74 scenes; 30 m spatial resolution). We used 2011 data because it was the most recent year available at the time of the analysis. We defined four land-use classes through visual estimate using Google Earth™ program: (i) native forest, (ii) pasture, (iii) sugarcane and (iv) other land-use, which included any other land-use different from native forest, pasture and sugarcane (i.e., towns and villages, rural installations, temporary cultures, highways, exposed soil and others). As the product of this processing, a land-use map was obtained for the study area (unpublished data). Using land-use information, we conducted a catchment pre-selection of catchments with area between 400 and 1,400 ha (corresponding to first-order streams to third-order streams according to the Strahler system, L. Casatti personal observation). Based on this pre-selection, we selected 85 catchments considering the environmental gradient in the region (i.e., pasture-sugarcane transition), accessibility and owners’ consent (Table S1). Finally, to increase the reliability of our land-use data, we refined land-use maps for each selected catchment using orthorectified aerial photographs (‘orthophotos’) with a 1 m spatial resolution (years 2010/2011 and 2012 only for sugarcane; most recent years available at the time of the analysis). We identified eight land-use classes: native forest, herbaceous and shrub vegetation, pasture, sugarcane, perennial crops, forest plantation, urban area and other land-use (see details in Table S2). For the sugarcane land-use class, the CANASAT project (sugarcane crop monitoring in Brazil; Rudorff et al., 2010) provided data about sugarcane area and location of in the São Paulo State.

For the digital preparation, processing, and classification of LANDSAT-5/TM satellite images and orthophotos, we used ERDAS IMAGINE 9.2 and ArcGis 9.3 softwares. The LANDSAT-5/TM satellite images were downloaded from the repository of the Instituto Nacional de Pesquisas Espaciais (INPE, http://www.inpe.br/). Orthophotos were supplied by Empresa Paulista de Planejamento Metropolitano SA – EMPLASA (CLU No. 060/14).

Fish sampling

Stream surveys and fish sampling were carried out in the dry season, between July and September in 2013. In each of 85 selected catchments, we used 5-mm-mesh stop nets to block upstream and downstream of a 75 m-long reach, following the standard method to sample fish in the region (see Casatti et al., 2009). We used two different methods of electrofishing: a stationary generator (AC, 220V, 50–60Hz, 3.4–4.1 A, 1000 W) to sample 42 stream reaches and a Smith Root Model LR-24 backpack electrofishing (pulsed DC, 50-990V, 1–120 Hz, 40 A peak max, 400 W) to sample the remaining 43 reaches. We used the stationary generator method because it was the sampling method of a long-term research project (Zeni et al., 2017). To conduct backpack electrofishing, the settings were adjusted in situ based on environmental conditions (i.e., with the quick set up feature activated, which automatically sets output voltage, frequency and duty cycle) and on the observation of fish behavior and recovery times. In each reach, a two-pass electrofishing technique was conducted during 45 min from downstream to upstream direction, covering from bank to bank to sample all available microhabitats. Moreover, to identify whether the observed pattern in fish assemblage structure was a product of the electrofishing method, we conducted a test for each of the biodiversity facet (i.e., binary variable; 1 = stationary generator, 0 = backpack). We did not observe significant effect of the electrofishing method on taxonomic, functional and phylogenetic diversity (Table S5).

Fish specimens were anaesthetized using a 100 mg/L clove oil solution, fixed in 10% formalin and transferred to a 70% alcohol. Fish were identified to species and voucher specimens were deposited at the fish collection of the Department of Zoology and Botany of São Paulo State University (DZSJRP 19264-19326), São José do Rio Preto, São Paulo, Brazil.

Predictor variables

Catchment-scale variables

For each catchment (i.e., catchment area delimited upstream from the sampled stream reach), we measured 30 descriptors related with land-use composition and heterogeneity quantified through (i) the number of the land-use classes (i.e., richness), (ii) diversity of land-use classes (i.e., Shannon index), and (iii) the relative proportion of each land-use class (Gustafson, 1998) (Tables S3 and S4). Considering that riparian land-use and stream fish assemblages are associated (Pusey & Arthington, 2003; Cruz et al., 2013; Santos et al., 2015), we grouped the catchment descriptors into three different sub-sets: catchment, drainage network and local (according to Strayer et al., 2003). The “catchment” sub-set included land-use in the entire catchment area; the “drainage network” sub-set comprised land-use within 60-m buffer zone (30 m on each stream margin) around the river network (minimum width established by the current Brazilian Forest Code for streams less than 10 m wide); and the “local” sub-set included land-use within a 150-m radius circle from the center of our sampled reach (Roa-Fuentes & Casatti, 2017).

Local scale variables

We quantified for each stream reach 31 local descriptors related with marginal vegetation (e.g., marginal grasses, roots, shrubs), water properties (e.g., water temperature, pH), instream structures (e.g., litter, wood debris), stream channel morphology (e.g., width, depth, flow), habitat composition and heterogeneity (e.g., mesohabitat, substrate composition) (Tables S3 and S4), following standard protocols previously used in the study area (Casatti et al., 2009). These descriptors are commonly used in stream fish ecology studies (Cruz et al., 2013; Carvalho & Tejerina-Garro, 2015a).

Spatial variables

We considered the distance between sites as network distance (i.e., the distance between sites following the riverine dendritic network, sensu Brown & Swan, 2010; Altermatt, 2013). Network distance is able to capture such spatial patterns that overland distance does not account for, and it can also better describe spatial patterns generated by fish dispersal along a riverine dendritic network (Altermatt, 2013; Heino et al., 2017). We calculated the network distance using Hawth’s Analysis Tool (Beyer, 2004) for ArcGIS 9.3. From the network distance matrix, we generated Principal Coordinates of Neighborhood Matrix (PCNM) and retained only PCNM eigenvectors with positive spatial correlation because we were concerned with patterns produced by spatially contagious processes (Borcard et al., 2011). To do this, we used the ‘PCNM’ function from the PCNM package (Legendre et al., 2013). The threshold value used in the PCNM analysis was the minimum distance giving connected network. All analyses were carried out in the R environment (R Development Core Team 2020).

Response variables

Taxonomic facet

For taxonomic facet, we considered species richness in each stream reach as response variable.

Functional traits

We obtained 12 functional traits for each fish species (S = 63; Table S6). Seven traits were ecomorphological indexes associated with fish functional specializations to water flow, to swimming ability and to their position in the water column (i.e., compression index; relative area of pectoral fin; pectoral fin aspect ratio; relative eye position; relative depth; index of ventral flattening; and fineness coefficient). For details about ecological interpretations of ecomorphological indices, see Casatti & Castro (2006) and Ribeiro et al. (2016).

The remaining five traits were related with trophic ecology (i.e., species were grouped in trophic guilds: algivores, detritivores, aquatic insectivores, terrestrial insectivores, lepidophagous, periphytivores, piscivores, and omnivores); size (i.e., three categories of standard length); preference for substrate (i.e., unconsolidated, consolidated); preference for water velocity (i.e., fast, medium, low); and adaptation to anoxic conditions (i.e., unadapted, adapted; Table S6). We used literature to obtain trophic guild (Zeni & Casatti, 2014); size categories (Teresa & Casatti, 2012; Casatti et al., 2015); preference for substrate and water velocity (Casatti et al., 2015); and adaptation to anoxic conditions (Chapman et al., 1995; Graham, 1997; Casatti et al., 2009; Boswell et al., 2009; Scarabotti et al., 2011; Teresa & Casatti, 2012). When information was not available in the literature, we measured at least five adult individuals of each species using fish sampled in 2013 or individuals from study area available in the fish collection of the Department of Zoology and Botany of São Paulo State University (DZSJRP).

The traits used in this study provide important ecological information about species and have been widely used to examine the functional structure in Neotropical stream fish (Teresa & Casatti, 2012; Carvalho & Tejerina-Garro, 2015a, b; Dala-Corte et al., 2016). Because we considered both quantitative and qualitative traits, we used the mixed-variables coefficient of distance, a generalization of Gower’s distance, to extract a between species functional distance matrix and to use it in subsequent analyses (Pavoine et al., 2009). The functional distance matrix was calculated using ‘ktab.list.df’ and ‘dist.ktab’ functions of ‘ade4’ package (Dray & Dufour, 2007) in the R environment (R Development Core Team 2020).

Phylogenetic hypothesis

We constructed a composite phylogenetic hypothesis for all fish species sampled in 2013 (Fig. S1) based in the following studies: Montoya-Burgos (2003), Genner et al. (2007), Near et al. (2012), Betancur-R et al. (2013), Chen et al (2013), Mariguela et al. (2013) and Sullivan et al (2013). The phylogeny was built manually using Mesquite software v.2.75 (Maddison & Maddison, 2011). Because there is no consensus on the timing of divergence among the major actinopterygian and teleostean lineages (Near et al., 2012), the phylogenetic distance between species was estimated using the ‘bladj’ utility in Phylocom software (Webb et al., 2011). Even if only some nodes are dated, the resulting phylogenetic distances are an improvement compared to using only the number of intervening nodes (Webb et al., 2011). We used the constructed phylogeny to extract a phylogenetic distance matrix used in the further analysis. The phylogenetic distance matrix was calculated using the ‘cophenetic’ function in R environment (R Development Core Team, 2020).

Functional and phylogenetic diversity metrics

We used metrics based on functional and phylogenetic distances among species because they provide the same mathematical basis for comparing both facets (Swenson, 2014). We used the standardized effect size of mean pairwise distance (i.e., SES.MPD; Webb, 2000) and standardized effect size of the mean nearest taxon distance (i.e., SES.MNTD; Webb, 2000), because the observed functional and phylogenetic metrics could be correlated with species richness, which can create problems to identify what additional information is actually gained using functional and phylogenetic facets (Swenson, 2014). The SES.MPD and SES.MNTD represent the average functional or phylogenetic differences among taxa in each assemblage (Tucker et al., 2016), indicating functional or phylogenetic clustering when lower than zero or functional or phylogenetic overdispersion when values are greater than zero (Webb et al., 2002). Because SES.MPD calculates all pairwise distances in a sample, it is often considered to be a “basal” metric, i.e., it captures the overall functional or phylogenetic dissimilarity of the taxa in a sample (Swenson, 2014; Geheber & Geheber, 2016). Conversely, SES.MNTD is considered as “terminal” relatedness measure because it detects finer-scale functional or phylogenetic patterns (Webb, 2000; Webb et al., 2002; Swenson, 2014). For these reasons, comparing both metrics is useful (Swenson, 2014).

To calculate standardized effect size (SES), we used the independent swap null model (i.e., maintain the observed species richness and occurrence frequency in the null assemblages; Gotelli & Entsminger, 2003) with the mean value obtained from 999 randomly generated assemblages. We established our species pool as all fish species (S = 63) found in the 85 stream reaches because (i) we assumed that species sampled in each site were not dispersal limited; thus, it was feasible that all species had similar potential to occur in each site (Geheber & Geheber, 2016), and (ii) stream reaches are located in the same biogeographical province (Géry, 1969). All functional and phylogenetic metrics were calculated using incidence-based assemblage data (i.e., presence-absence). Functional and phylogenetic metrics were estimated using ‘ses.mpd’ and ‘ses.mntd’ functions of ‘picante’ package (Kembel et al., 2010) in R environment (R Development Core Team 2020).

Data analysis

Exploratory data analysis (i.e., box-plots and quantile-quantile plots) were used to identify the normality of our predictor variables and the presence of outliers (Legendre & Legendre, 2012; see Fig. S2 for a schematic representation of the analysis procedure, and Roa-Fuentes et al., 2020 for results of exploratory data analysis). We used the logit transformation to the variables representing proportions (Warton & Hui, 2011), while for variables in other units, we applied square root or loge (X) transformations (Roa-Fuentes et al., 2020). After that, all predictor variables were standardized to zero mean and unit variance since these were measured in different units. To reduce strong linear dependencies among our predictor variables, we used two approaches (Fig. S2). First, we conducted Spearman correlations and removed predictor variables with correlations ≥ 0.7 (Dormann et al., 2013). Second, we performed a forward selection procedure with two stopping rules (sensu Blanchet et al., 2008; Table S5; Fig. S2).

To describe the main environmental gradients in the study area, we performed a principal component analysis (PCA) based on local and landscape variables with Spearman correlations < 0.7 (Dormann et al., 2013; Fig. S2). To aid the interpretation, we tested the significance of the correlation coefficient between a component and each variable (P ≤ 0.001) (Lê et al., 2008).

Finally, we performed a series of multiple regression analysis to explain the taxonomic, functional, and phylogenetic facets of biodiversity as a function of local, catchment or spatial variables previous retained by the forward selection (Fig. S2). Additionally, to use the beta coefficients to interpret multiple regression, we considered the structure and commonality coefficients to gain a broader and fuller perspective on the contributions that predictor variables made to the regression equation (Nathans et al., 2012). To do this, we used commonality analysis that decomposes the variance of R2 into unique (the amount of variance in the response variable that is uniquely accounted by a single predictor variable) and common (the amount of variance in the response variable that can be explained by two or more predictors together) effects of predictors (Ray-Mukherjee et al., 2014). Total (Total = Unique effects + Common effects) represents the total contribution of a predictor to the response variable regardless of collinearity with other variables (Prunier et al., 2015). In addition, commonality analysis helps to identify predictor variables with a suppression effect (i.e., variables that indirectly enhances the prediction, by improving the prediction of others, Nathans et al., 2012; Ray-Mukherjee et al., 2014). For forward selection, we used the functions ‘forward.sel’ (‘packfor’ package; Blanchet et al., 2008) and ‘rda’ (‘vegan’ package; Oksanen et al., 2015); for PCA we used the functions ‘PCA’ and ‘dimdesc’ (‘FactoMineR’ package; Lê et al., 2008). Moreover, we used ‘lm’ function for multiple regression analysis, and ‘regr’ function ‘(yhat’ package; Nimon et al., 2008) to calculate structure and commonality coefficients. All analyses and graphical displays were carried out in the R environment (R Development Core Team 2020). The datasets generated and/or analyzed during the current study are available from the authors CARF and LC on a reasonable request.

Results

The first two PCA axes accounted for 34% of the variation in the environmental variables across the study sites (Fig. 2). The first principal component accounted for 22% of variation and differentiated streams with high values for Physical Habitat Index, a high proportion of runs, high mesohabitat heterogeneity and marginal vegetation dominated by large roots, bryophytes and pteridophytes, from streams dominated by unconsolidated substrate (mainly sand) and a high proportion of grasses (mostly Brachiaria spp.) in the marginal vegetation. The second principal component accounted for 12% of variation and differentiated streams with high proportions of pasture in the catchment, higher pH and water temperatures, from streams with a high proportion of sugarcane in the catchment. In general, the PCA differentiated two main environmental gradients, one related to local environmental variation and the other associated with variation in land use in the catchment (Fig. 2).

Fig. 2
figure 2

Principal component analysis representing the main environmental gradients in the study area. Only variables with significant correlation with the PC1 and/or PC2 are shown (P ≤ 0.001). Sampling units represented by points. Codes: bry, bryophytes and pteridophytes; Coth, other land-use in the catchment; Cpas, pasture in the catchment; Cper, perennial crops in the catchment; Csug, sugarcane in the catchment; csu, consolidated substrate; gra, grasses (mostly Brachiaria spp.); lit, leaf litter; Lro, large roots; pH, pH; phi, physical habitat index; pool, pool; run, run; shr, shrubs; sub_H, ecotone diversity; TDS, total dissolved solids; tem, water temperature; usu, unconsolidated substrate; velSD, standard deviation of water velocity

We sampled 63 fish species belonging to 18 families and six orders in the 85 stream reaches (Table S7). In general, local-scale environmental variables explained taxonomic, functional and phylogenetic facets better than catchment-scale and/or spatial variables (Table 1). Catchment-scale variables accounted only for a small portion of variation in the phylogenetic SES.MNTD, whereas network distance was not related to any facet (Tables 1 and S5).

Table 1 Multiple regression and commonality analysis predicting the three facets of stream fish diversity by forward selected environmental variables

For the taxonomic facet, local-scale variables accounted for 33% of the variation (P < 0.001; Table 1). The squared structure coefficients (r 2s ) showed that mean depth, standard deviation of width, the proportion of pools and large roots were able to account for 33%, 32%, 25% and 18% of the regression effect given by the R2. The proportion of large roots and the mean depth were the predictor variables that had the higher unique effects (11% and 10%) to predict variation in species richness (Table 1). The β coefficients for the taxonomic facet multiple regression indicated that an increase in the mean depth, standard deviation of width and the proportion of pools led to an increase in the number of species (β = 1.236; β = 0.959; β = 0.993), while an inverse effect was observed with the increase in the proportion of large roots (β = − 1.336; Table 1).

Functional and phylogenetic SES.MPD were influenced only by local-scale variables, which accounted for ≥19% of the variation (Table 1). Functional SES.MPD was affected by proportion of unconsolidated substrate, mean water velocity and mean depth (R 2Adj = 0.30; Table 1); with squared structure coefficients (r 2s ) indicating that these variables were able to account for 65%, 35% and 17% of the regression effect given by the R2 (Table 1). The proportion of unconsolidated substrate had the higher unique effect (16%) to predict functional SES.MPD variance (Table 1). The β coefficients for SES.MPD multiple regression indicated that an increase of the proportion of unconsolidated substrate and of the depth led to functional clustering of stream fish assemblages (β = -0.454; β = -0.242), while an increase in water velocity led to a functional overdispersion (β = 0.280; Table 1).

On the other hand, phylogenetic SES.MPD was significantly affected by mean depth and proportion of grasses (R 2Adj = 0.19; Table 1). Squared structure coefficients (r 2s ) indicated that mean depth and proportion of grasses accounted for 43% and 30% of the regression effect given by the R2. Mean depth had the higher unique effect (10%) to predict variance in phylogenetic SES.MPD (Table 1). The β coefficients for SES.MPD multiple regression indicated that an increase in mean depth led to phylogenetic clustering of stream fish assemblages (β = − 0.385), while an increase of the proportion of grass led to phylogenetic overdispersion (β = 0.288; Table 1).

Phylogenetic SES.MNTD was the only metric influenced by one catchment-scale variable. Although weak, catchment land-use diversity accounted for 7% of the variation in the SES.MNTD and its unique effect was 6% (Table 1). The β coefficients indicated that high catchment land-use diversity led to phylogenetic overdispersion between closely related species (β = 0.263; Table 1). Functional SES.MNTD was not explained by any of the variables (Table S5).

Commonality analysis identified the proportion of leaf litter as a suppressor variable of the phylogenetic SES.MPD regression, since its unique effect (U = 0.065) was offset by its common effect (C = − 0.064; Table S8). The same was observed for the catchment land-use richness variable on phylogenetic SES.MNTD (U = 0.094; C = -0.094; Table S8). For this reason, these variables were omitted from the multiple regression analysis of phylogenetic SES.MPD and SES.MNTD, respectively.

Discussion

We found that local environmental factors explained a substantial fraction of variance of taxonomic, functional and phylogenetic α-diversity in a highly impacted landscape. Catchment land-use and dispersal-related factors (i.e., network distance) contributed little to the variation in these three biodiversity facets or were not significant at all. Furthermore, each facet was affected by different environmental variables, and functional and phylogenic facets responded differently than the taxonomic facet. For instance, an increase of depth, pools and width variation through stream reach led to an increase in the species richness. However, for the functional facet, we observed that unconsolidated substrate and increase of depth led to more redundant assemblages, while increase of water velocity led to more complementary assemblages. For the phylogenetic facet, an increase of depth resulted in assemblages with species with similar phylogenetic history, while marginal grasses had the opposite effect. Therefore, we would like to highlight that channel depth showed an opposite effect, since it led to an increase in species richness, but resulted in functional and phylogenetic homogenization of fish assemblages. Interestingly, the phylogenetic facet was the only one that responded significantly to landscape variables.

In general, studies addressing the importance of scale-related factors for stream fish have reported that both local and catchment-scale features influence assemblage structure; however, these studies also emphasized that local habitat variables are more important than landscape variables (Lammert & Allan, 1999; Wang et al., 2003; Diana et al., 2006; Barbosa et al., 2019; but see Fitzpatrick et al., 2001). Thus, the effect of local-scale environmental factors on the structure of stream fish assemblages has been recognized widely. For instance, changes in the taxonomic and functional facets of fish assemblages have been associated with channel depth (Sheldon, 1968; Schlosser, 1982; Carvalho & Tejerina-Garro, 2015a, b; Leitão et al., 2018), marginal vegetation propagation (Brachiaria spp. grasses; Casatti et al., 2009; Casatti et al., 2015), substrate composition (Casatti et al., 2015; Leitão et al., 2018), proportion of pool habitats (Schlosser, 1982), stream width (Angermeier & Karr, 1983; Lammert & Allan, 1999) and instream habitat structures (Dala-Corte et al., 2016; Leitão et al., 2018). Our results are consistent with these previous findings. In particular, variables such as mean channel depth, mean and standard deviation of channel width and mean water velocity were important predictors of the three facets, suggesting that species’ niche differences, in terms of environmental characteristics related with stream channel morphology, contribute to the variability in stream fish assemblage structure at the spatial scale evaluated by us. This outcome is also in accordance with species sorting, which is often considered the main mechanism structuring stream communities within single small drainage basins (Heino & Mykrä 2008; Heino et al., 2015b; but see Saito et al., 2015b).

We hypothesized that landscape structure can affect the way fish assemblages respond to the landscape itself. As noted by Allan et al. (1997), contrasting results regarding the importance of environmental variables at different scales could be a consequence of the study design or indicate that mechanisms operating at local and catchment scales are in fact different and uncorrelated. Considering that local and catchment variables were obtained in a standardized way for all streams, we assumed that our sampling design was not biased to detect local-scale effects and, therefore, catchment influence is really weak in this region. In agricultural areas under a long history of land-use change, the past (i.e., decades ago) land-use in the catchment and riparian zone can be a better predictor of present day taxonomic facet of stream biota than the current land-use (Harding et al., 1998; Surasinghe & Baldwin, 2014). In this regard, Zeni et al., (2017) found that current instream habitat and fish assemblages in our study area are also related to past catchment land-use; thus, our streams seem to show evidence of legacy effects and time-lag response. This finding could indicate that stream fish assemblages in heavily modified landscapes already overpassed the threshold of response to catchment modification. It is possible that, in these assemblages, the initial disturbance (e.g., deforestation) filtered the most sensitive species and nowadays they display a weak response to agricultural intensification (Balmford, 1996; Fitzpatrick et al., 2001; Balmford & Bond, 2005). In any case, it should be highlighted that high catchment land-use diversity led to phylogenetic overdispersion assemblages at finer scale (i.e., between closely related species). In other words, catchments with a greater number and evenness of land-uses, even in this long-term agroecosystem, were able to harbor fish assemblages that are phylogenetically complementary and comprising closely related species. This could be explained by the species turnover in an impoverished regional species pool, where most of the taxa belong to a few functionally similar clades (e.g., Characidae; Table S8). However, this pattern should be further explored considering that more than 93% of the variation in phylogenetic structure at finer scale was left unexplained.

Another possible explanation for the weak relationship between assemblage diversity and landscape variables is that homogeneous landscapes can display lesser variability to overwhelm the influence of local environmental variables (Heino et al., 2007; Casatti et al., 2015). Due to the long deforestation process and agricultural land-use development (Victor et al., 2005), our study region is an agroecosystem with high dominance of agriculture, especially pasture and sugarcane, representing 70% of all area. In this context, local environment (i.e., instream habitat) could represent the last ‘shield’ against the effects of extensive agricultural systems for fishes. Thus, local-scale environmental features may become even more important for the maintenance of stream fish biodiversity, and this may be a common phenomenon in heavily modified tropical landscapes (Casatti et al., 2015; Gerhard & Verdade, 2016; but see Wang et al., 2003 for opposite conclusion for temperate streams).

We also found that different local variables were important for the variation in the three diversity facets, and the effect of a common environmental variable could be different depending on the diversity facet analyzed. For example, depth had contrasting effects on taxonomic (i.e., increased species richness) and functional or phylogenetic facets (i.e., decreased diversity). The different effects of local environmental features highlight the importance of addressing complementary facets of biodiversity, since each one can point out different patterns in the structure of stream fish assemblages under the influence of agricultural land-use. According to Lyashevska & Farnsworth (2012), species richness, the most commonly used taxonomic measure to describe biodiversity, may result in the loss of a significant portion of the information (≈ 89%). Thus, taxonomic facet could be a poor substitute for other diversity facets. For instance, by considering taxonomic facet alone, we may underestimate the importance of some environmental variables that affect complementary facets (i.e., water velocity and substrate for functional diversity and proportion of grasses for phylogenetic diversity). In the same way, we may conclude that the increase in stream depth is a good strategy to stream restoration because it is followed by an increase of species richness. However, from functional and phylogenetic perspectives, it is possible to notice that the increase in species richness, mediated by the increase in depth, is due to an addition of functionally and phylogenetically similar species (i.e., functional and phylogenetic clustering).

An increase of species richness mediated by an increase of depth is predicted by the species-area hypothesis. According to this, a large area, or volume in this case, can support more species by increasing habitat heterogeneity (i.e., habitat diversity hypothesis) and/or by increasing colonization probability and decreasing extinction risk (for review, see Connor & McCoy, 2001). However, contrary to what is found in the literature (Tilman, 2001; Petchey & Gaston, 2002), an increase of species richness in our study did not lead to an increase of functional and phylogenetic diversity. It is possible that, in this region impacted by anthropogenic influences for a long time, past environmental filtering excluded most of the distinct species, leading to functional and phylogenetic homogenization of the regional species pool (Zeni et al., 2019). Thus, even if a large stream volume (i.e., increasing depth) is available, only redundant species can colonize the study streams. Although the increase of depth did not lead to complementary assemblages, redundancy in agricultural landscapes can act as a buffer to further functional diversity loss, since one redundant species can be replaced by another one without jeopardizing ecosystem function. In this context, siltation (i.e., decrease of depth) due to watershed and riparian degradation (Allan, 2004) can gradually eliminate redundant species through time, leading to the functional diversity loss of stream fish assemblages in tropical agroecosystems (Dala-Corte et al., 2016). Consequently, watershed management and riparian restoration, even in long-term agroecosystems, are also necessary to protect streams from siltation and further ecosystem function loss.

Through complementary facets of biodiversity, we can also identify environmental variables that are responsible for functional or phylogenetic overdispersion, such as water velocity and proportion of grasses, respectively. The increase of water velocity in our region usually indicates streams that are physically more structured with a higher proportion of riffles (i.e., shallow areas with consolidated substrate and high flow). Riffles could mediate the establishment of functionally distinct species (e.g., species with large pectoral fins, flattened bodies, dorsally situated eyes and ventral mouths), which are usually absent in less structured streams (Ribeiro et al., 2016). Marginal grasses can act similarly to macrophytes by proving shelter and foraging sites to fish species from different linages (i.e., phylogenetically distinct), as gymnotids (Gymnotus spp.) and cichlids (Crenicichla spp.). According to Roa-Fuentes et al. (2015), because some functional traits (e.g., ecomorphological traits) can exhibit strong phylogenetic signal, then the concordance between functional and phylogenetic facets could be expected. Although not observed in our study, marginal grasses have been associated with functional homogenization (Casatti et al., 2015). For this reason, we strongly recommend further studies about the influence of marginal grasses on all stream fish biodiversity facets.

In general, we found no effects of dispersal-related processes (i.e., network distance) on all three biodiversity facets. In a certain way, this is not surprising, since environmental control usually prevails over spatial constraints within small drainage basins (Mykrä et al., 2007; Heino & Mykrä 2008; Heino et al., 2015b). Among the possible reasons for absence of spatial structuring in stream assemblages is that, given enough time, stream biota can readily disperse between sites (Heino & Mykrä 2008). Another possibility that does not necessary exclude the first one is that the fish fauna here has already gone through a homogenization process, through extensive habitat modifications, introductions of non-native species, and extirpation of native ones. Therefore, fish assemblages are probably dominated by widespread habitat generalist species (McKinney & Lockwood, 1999; Devictor et al., 2008), which are closely related phylogenetically. It is also known that the importance of dispersal-related processes (e.g., colonization history or mass effects) is less expected to occur when the regional species pool is small or if it has been degraded by an intense disturbance (Chase, 2003; Goldenberg Vilar et al., 2014), as has probably happened in our present study region.

Stream restoration in human-altered landscapes

One critical factor for the success of stream restoration is the spatial scale of intervention (Lake et al., 2007). Despite that, habitat restoration projects are usually implemented without an understanding of the spatial scale necessary to produce positive ecological effects (Alexander & Allan, 2007; Sheldon et al., 2012). These scale-related effects remain an important and poorly understood question, and they might be one of the reasons for why a part of restoration projects have provided little evidence of ecological success (Alexander & Allan, 2007). One of the most used approaches to restore the ecological integrity of streams is the re-establishment of the native riparian forest (Harding et al., 1998; Sheldon et al., 2012). It has been demonstrated that one conserved riparian forest can regulate water temperature, diminish sediment inputs, stabilize stream banks (Osborne & Kovacic, 1993), provide large wood debris to stream channels in tropical agricultural landscapes (Paula et al., 2013), and maintain biotic integrity of fish assemblages in agricultural streams (Fitzpatrick et al., 2001).

Our results indicated that riparian forest, measured as a 30 m buffer zone around the river network (i.e., the minimum width established by the current Brazilian Forest Code) had weak or no effects on the three analyzed biodiversity facets. We would like to highlight, though, that this does not mean that riparian forest is not important for fish assemblages (for a review, see Pusey & Arthington, 2003). Conversely, since the riparian forest in our study area has been heavily altered over past decades (Silva et al., 2007), this result could indicate a weak riparian buffering effect and, consequently, that ecosystem functions and processes mediated by this adjacent area were already lost. In fact, in the 85 studied catchments only, 21% of the buffer zone along drainage network is composed of forests, whereas 60% is covered by herbaceous and shrub vegetation and 12% by pasture for livestock (unpublished data). However, the areas occupied by forest, herbaceous and shrub vegetation are not pristine because cattle usually can graze there. According to Fitzpatrick et al. (2001), even minor alterations of stream network buffer (e.g., 10% agriculture within a 50-m buffer) may inflict harmful effects on fish fauna in agricultural catchments. Consequently, actions directed toward riparian forest restoration in the drainage network should be a priority in the studied area (Casatti et al., 2012; Casatti et al., 2015).

Our findings also suggest that, in highly altered tropical streams, actions on a local-scale habitat features could produce effects on the three facets of biodiversity. Although this is an interesting finding, we should not neglect restoration actions at the landscape scale; in contrast, landscape restoration should be a priority in heavily modified tropical agroecosystems. Substantial improvements in the ecological integrity of streams are more likely to be accomplished with large-scale actions (Lake et al., 2007; Palmer et al., 2010; Wahl et al., 2013), as suggested by the hierarchy theory (Hynes, 1975; Allen & Starr, 1982; Johnson et al., 2007). For instance, water quality, disturbance regime, regional species pools and hydrological regimes respond to actions on large spatial scales (Palmer et al., 2010).

Finally, it is worth to mention that biodiversity responses to any local environmental variable must be thoroughly investigated, since changes in one environmental feature (e.g., depth) could generate contrasting effects on the different facets of biodiversity (Li et al., 2020). Additionally, the same spatial scale may not be important in every catchment and, therefore, successful restoration is context dependent (Sheldon et al., 2012). For this reason, we strongly recommend that any restoration actions should be correctly monitored over time (Palmer et al., 2010; Sheldon et al., 2012). This is because intensive agricultural practices may severely alter stream biota, and the influence of this disturbance may be long lasting (Harding et al., 1998).

Conclusions

In conclusion, our results indicated that local environmental factors are the most important predictors for variation in taxonomic, functional and phylogenetic facets of stream fish assemblages in a heavily modified tropical landscape. In contrast, catchment features and spatial structuring contributed little to the variation in the biodiversity facets or were not significant at all. It is possible that the long deforestation history in the study region homogenized the landscape and the regional species pool, thereby decreasing the potential importance of factors acting on large spatial scales over fish assemblages. Our results suggest that restoration actions focused on local-scale factors could produce significant effects on the three biodiversity facets. However, landscape restoration should also be a priority in heavily modified tropical agroecosystems, and restoration actions should thus carefully examine the effects of environmental variables, since each factor could show contrasting effects on different biodiversity facets.