Introduction

One of the most extensively investigated large-scale patterns in ecology is the relationship between species local abundance and their regional distribution (Williams, 1960; Brown, 1984; Gaston & Blackburn, 2000). In most cases, the relationship is positive, i.e., locally abundant species tend to be widespread (Gaston et al., 1997). This pattern is stronger for marine than for terrestrial systems, whereas the weakest relationships are found in freshwater systems (Blackburn et al., 2006). However, it tends to hold despite variations in the way data are gathered and analyzed (Gaston, 1994) and the taxonomic group considered (e.g., Gaston et al., 1998; Heino, 2005; Harcourt et al., 2005; Soininen & Heino, 2005; Heino & Virtanen, 2006).

Various hypotheses have been proposed to explain this relationship. These are mainly related to niche-based mechanisms, metapopulation dynamics, range position, and sampling artifacts related to phylogenetic and spatial non-independence (see Gaston et al., 1997). The niche breadth hypothesis (Brown, 1984) and the niche position (or habitat availability) hypothesis (Hanski et al., 1993; Venier & Fahrig, 1996) are among the most investigated with the latter, receiving good support (Gaston & Blackburn, 2000). In short, the former predicts that species occupying broader range of habitats and exploiting diverse environmental conditions and resources (i.e., wide niche breath) would be able to occupy more places. The latter hypothesis predicts that species utilizing common and widespread resources (or habitats) in a given region would be widespread and abundant. Some studies have demonstrated positive correlations between niche breadth and distribution, although few have demonstrated positive correlation between niche breadth and abundance (Gaston et al., 1997).

The niche of a species can be defined as the environmental conditions that allow this species to satisfy its minimum requirements, so that birth rate of a local population is equal to or greater than its death rate (Hutchinson, 1957; Chase & Leibold, 2003). For freshwaters, environmental characteristics of the water body (e.g., water flow and chemistry) are believed to be closely linked to the characteristics of the surrounding landscape (Frissell et al., 1986; Wiens, 2002). According to Poff’s (1997) landscape filter concept, environmental factors act to determine the occurrence and abundance of species at different spatial scales. That is, to be part of a local community, species in the regional pool must have appropriate characteristics to “pass” through the nested filters (Poff, 1997). Therefore, one could hypothesize that niche properties measured at the local and landscape levels would provide similar explanations for the abundance–distribution relationship.

Most studies on the relationship between distribution and abundance were developed in temperate regions and focused on terrestrial ecosystems (see Blackburn et al., 2006). An investigation on tropical freshwater organisms would certainly contribute to reduce system (and, indirectly, taxonomic) biases and to unravel the role of the mechanisms proposed to account for the relationship. Aquatic chironomids (Chironomidae: Diptera) are a useful group for exploring the abundance–distribution relationship due to their ecological importance as well as to applied aspects. Besides representing one of the most species-rich and abundant group in most aquatic environment, chironomids also present a range of life history that differs markedly, for example, in lifespan, locomotion, feeding habits, and physiological tolerance to oxygen deficit (Pinder, 1986).

It has been demonstrated that some assemblage patterns hold for different taxonomic resolutions on stream macroinvertebrates (e.g., species, genus, and family levels: Marchant et al., 1995; Lenat & Resh, 2001; Melo, 2005). Similar to species distributions, the distribution of higher taxa seems to be related to environmental and spatial variables as well (Murphy & Davy-Bowker, 2005). The reliability of the higher-taxa approach to detect general ecological patterns depends on how species within higher taxa respond to environmental gradients. If their responses are correlated, ecological patterns (e.g., abundance–distribution relationship) can be detected independently of the taxonomic resolution. Having this in mind, one could expect that the abundance–distribution relationship would also occur at higher taxonomic levels other than species, which would be desirable bearing in mind the limited knowledge available for Neotropical fauna. This seems especially suitable for chironomids, in which there are a high number of synonyms and unclear descriptions (Spies & Sæther, 2004) that create problems associated with differing species validity (Ferrington, 2008).

In this study, we investigated the relationship between local abundance and regional distribution of lotic chironomids. First, we analyzed the form of the relationship in an attempt to find out if a positive relationship holds at the genus level. Based on a “higher-taxon approach,” we hypothesized that there would be a positive relationship between distribution and local abundance in lotic chironomid genera. If so, we asked if the relationship could be explained by niche characteristics of taxa estimated using local and landscape environmental variables. Following Poff’s (1997) landscape filter concept, we hypothesized that niche breadth (Brown, 1984) and niche position (Hanski et al., 1993) estimated using both local in-stream and landscape variables would explain the positive relationship. We expected that both local abundance and regional distribution of chironomid genera would be positively related to their niche breadth and negatively related to their niche position. Moreover, these relationships should be significant when defining the niche with local and landscape features. However, given that the finest grained environmental filter through which a species go by is the local one (Poff, 1997), we predicted that the amount of variation explained in the relationships should be higher when using niche breadth and position estimated from the local scale.

Materials and methods

Studied sites and measured variables

We used data on chironomid larvae distribution extracted from the “Macroinvertebrates Database” compiled by the research group of the “Laboratório de Entomologia Aquática—Universidade Federal de São Carlos.” This data were collected during the dry seasons of 2001, 2005, and 2006 using Surber sampler (0.1 m2 area and 250 mm mesh size) in 47 forested streams of southeastern Brazil (20–25°S, 44–53°W; Fig. 1). Our research group has been continually visiting all sampling areas since 2001 and did not notice any drastic change in land use during this period.

Fig. 1
figure 1

The geographical location of the studied streams in the São Paulo state, Brazil. Areas in gray within the limits of São Paulo state represent forested areas

Sites were typical of Brazilian Atlantic Forest (sensu Oliveira-Filho & Fontes, 2000) headwater streams with water depths <50 cm, tree canopy coverage exceeding 70% of the channel, and absence of macrophytes. The riparian vegetation along all streams was well preserved. Six sampling units (three samples from pool and three from riffle sites) of chironomid larvae were taken randomly from a 100-m reach within each stream using a Surber sampler. For each stream, sampling units were pooled prior to statistical analysis. We mounted specimens on slides and identified them to genus level.

Environmental data

Local environmental measurements were taken at each site to characterize habitat conditions. Conductivity, pH, and dissolved oxygen were measured in situ using a Horiba U-10 or a Yellow Springs-556 water checker equipped with multiple probes. Percentage of canopy cover and predominant substrates were estimated visually. Substrates were classified as the proportion of the stream bottom covered by boulder and cobble (>256 mm), gravel (2–255 mm), sand (0.125–2 mm), and mud (<0.125 mm).

In order to compute the landscape metrics, we delineated a circular buffer area (500 m radius, ca. 78.5 ha) around a point located in the center of the stream channel of each of the 47 sampling sites (see Umetsu et al., 2008; for application of landscape metrics based on buffers of varying width). Some of our landscape metrics were derived from a land cover map at a scale of 1:50,000 from the Forestry Institute of São Paulo (Metzger et al., 2008). Macro-regional climatic variables, derived from coarse-scale maps, were also included in this set of variables. Landscape metrics derived from the land cover map were: cover area of forest; the total edge contrast index (TECI), and the edge density (LED). The macro-regional variables were: the enhanced vegetation index (EVI), the rainfall, and the solar radiation, all of which are indicative of primary production and biomass accumulation. We included four variables based on EVI measures: the EVI for the autumn (EVI AR) and for the winter (EVI AM), and both the range (EVI WR) and mean (EVI WM) between the two seasonal variables. We also included a measure of elevation in our set of landscape metrics. This metric was calculated using the mean value of the altitude across a 500-m radius circle around each sampling site provided by the Shuttle Radar Topography Mission (SRTM/NASA; http://www2.jpl.nasa.gov/srtm). Cover area of forest, total edge contrast index, and edge density were calculated using FRAGSTATS 3.3 (McGarigal et al., 2002), whereas the enhanced vegetation index and elevation were calculated using the Zonal statistics tool in ArcGis 9.

Niche measures and data analysis

From our database, we randomly selected 23 sites to estimate the taxa distribution and abundance and 24 to generate niche measures. Hereafter, these data sets are referred as “Distribution Data” and “Niche Data”. The reason for using two independent data sets is to overcome the circularity that occurs when one estimates species’ distribution and niche from the same data. This statistical bias is mainly associated with the problem of dissociating sample size effects from real differences between the niche breadth and position of common and rare species (Gaston, 1994). However, only that does not guarantee that our estimations are truly unbiased. For example, let us suppose that, by chance, the “Niche Data” included the most similar streams regarding environmental conditions. This would enhance the chance of reducing niche measures variability among different taxa and, thus, produce a biased model. For this reason, we randomly generated 1,000 Distribution and Niche data sets, and for each one we applied the steps explained below.

For each genus within the Distribution data sets, we calculated the regional distribution by summing the number of sites occupied by that genus and genus local abundance as the number of larvae of a given genus found in each stream. The ecological niche of a taxon can be represented by using its mean position and breadth along various environmental axes (Schoener, 1989). We applied the Outlying Mean Index (OMI) analysis (Dolédec et al., 2000; Thuiller et al., 2005, Broennimann et al., 2006 for other applications of this method) to generate measures of niche position and niche breadth for each taxon using the local and landscape variables measured within the Niche Data. The OMI (niche position hereafter) is a measure of the distance between the mean habitat conditions used by a taxon (centroid) and the mean habitat condition of the entire sampling area (origin of the niche hyperspace). The niche position of the taxon is assessed through their niche deviation from a reference. This reference represents a theoretical ubiquitous taxon that tolerates the most common habitat conditions. Genera that display high values of niche position have marginal niches (its environmental requirements are far from the mean conditions of the study area). Thus, we tested the niche availability hypothesis (Hanski et al., 1993; Venier & Fahrig, 1996) using OMI niche position. The OMI analysis also provides a measure of tolerance or niche breadth. Genera with high values of niche breadth are generalists, occurring across large portions of environmental gradients (wide habitat niche breadth).

Most former studies on the abundance–distribution relationship regarded abundance as the response variable. However, we cannot be sure whether the predominant direction of causality (if present) runs from local abundance to regional distribution or from the opposite (for a discussion, see Gaston & Blackburn, 2000). Thus, we estimated the relationships between both local abundance and regional distribution (response variables) with the explanatory variables niche breadth and niche position (both defined using local and landscape variables) using ordinary least-squares regression. All these variables were log-transformed before analysis.

We did each regression mentioned above 1,000 times—using the randomly partitioned data sets (Distribution and Niche data sets)—producing a distribution of coefficients of determination (r 2), P values, and regression coefficients, which we used to access the average explanatory power of each model. We only considered those significant models with less than 50 non-significant regressions (α = 5%). All analyses were performed in the R-language environment (R Development Core Team, 2006) using the package “ade4” (Dray & Dufour, 2007) for generating niche measures.

Results

We identified 41 chironomid genera occurring in all sampled streams. The most widely distributed taxa were Endotribelos (41 streams), the Tanytarsus/Caladomyia complex (40), Polypedilum (38), Parametriocnemus (35), Rheotanytarsus (33), and Larsia (31). Likewise, the most abundant taxa were the Tanytarsus/Caladomyia complex, Polypedilum, Parametriocnemus, Rheotanytarsus, Endotribelos, and Larsia. As we expected, there was a positive relationship between genus distribution and local abundance. This relationship was significant (P ≤ 0.05) in all 1,000 randomizations, with the models accounting for most of the variability in regional distribution (mean r 2 = 0.88, Fig. 2).

Fig. 2
figure 2

Distribution of coefficients of determination (r 2) of the regressions between regional distribution and local abundance. The arrow indicates the mean value of r 2 (0.88) across all 1,000 data subsets. All regression models between regional distribution and local abundance were significant at P ≤ 0.05

Niche position, as defined by local environmental variables, was not significantly related neither with local abundance nor with regional distribution in 52.4 and 62.4% of all data sets, respectively (Fig. 3a, c). On the other hand, when estimated from landscape variables, there was a significant negative relationship between niche position and local abundance in 99.8% of all data sets (Fig. 3b) and regional distribution in 99.7% of all data sets (Fig. 3d). We found a significant positive relationship between niche breadth (estimated using local and landscape variables) and both local abundance and regional distribution in more than 95% of all data sets (Fig. 4). In general, niche position and niche breadth explained less than a half of total variations in both local abundance and regional distribution (Figs. 3, 4). Therefore, in summary, niche breadth models significantly explained variation in abundance and distribution at both scales, whereas niche position models were significant only when we used landscape variables. Furthermore, in cases where they were significant, niche position and niche breadth models explained similar amounts of total variation in the response variables (Figs. 3, 4).

Fig. 3
figure 3

Distribution of r 2 values generated based on 1,000 regressions between niche position and local abundance (a, b) and regional distribution (c, d). Arrows indicate the mean value of r 2 across all 1,000 data subsets (a r 2 = 0.13*, b r 2 = 0.39, c r 2 = 0.16*, d r 2 = 0.40). * Non-significant models (a 47.6% of regressions with P > 0.05, c 37.6% of regressions with P > 0.05)

Fig. 4
figure 4

Distribution of r 2 values generated based on 1,000 regressions between niche breadth and local abundance (a, b) and regional distribution (c, d). Arrows indicate the mean value of r 2 across all 1,000 data subsets (a r 2 = 0.40, b r 2 = 0.39, c r 2 = 0.42, d r 2 = 0.36). More than 95% of regressions were significant at P ≤ 0.05

Discussion

Results obtained here confirm our expectation that the positive abundance–distribution relationship already observed in many taxa (see Gaston & Blackburn, 2000) also holds true for lotic chironomid genera. This result adds new evidence to the conclusion that higher taxonomic levels can be used to detect local assemblage patterns (e.g., Marchant et al., 1995; Melo, 2005) as well as macroecological relationships (e.g., Harcourt et al., 2005). Unfortunately, we do not have data on species level, thus it was not possible to analyze how much information was lost, if any, when using genus level identification. Nevertheless, the use of a higher taxonomic level aids in minimizing the potential statistical problem of phylogenetic dependency (which is a type of pseudo-replication) making the analyses more conservative. This is especially advantageous here since phylogenetic non-independence is one of the proposed artifactual mechanisms that can generate a positive abundance–distribution relationship (Gaston & Blackburn, 2000).

In general, we found that most of the variation in abundance and distribution was not explained by niche measures. These results are partly similar to previous studies on freshwater organisms. Tales et al. (2004) found support for the niche position hypothesis (Hanski et al., 1993; Venier & Fahrig, 1996) as a mechanism to explain the abundance–distribution relationship. However, they found that niche breadth was not a good explanation for variation in abundance and distribution of riverine fish. On the other hand, Heino (2005) found significant relationships between both niche position and niche breadth and abundance–distribution in stream insects. A general characteristic of these studies and ours is that there is much unexplained variability around these relationships. Our results differ from those reported by Heino (2005) in the sense that in his study niche position explained more variation in both abundance and distribution than niche breadth. Here, niche position estimated using local environmental variables did not explain variation in either local abundance or regional distribution. Therefore, the observed positive relationship between abundance and distribution is not a consequence of taxa niche position, regarding local environmental variables. This result is surprising given that most studies on the abundance–distribution relationship have found stronger support for the niche position hypothesis (see review in Gaston et al., 1997). This lack of support for the niche breadth hypothesis reported by several previous studies is due to, in part, difficulties in generating adequate niche breadth measurements (Gaston, 1994). Given the multidimensional nature of the niche (Hutchinson, 1957), important variables describing niche breadth might be missing from analyses. Also, an artifactual relationship is expected to arise because of the estimation of niche position is always unbalanced (Gaston et al., 1997). That is, rare and less widespread species contribute with a great number of zeros to the species matrix; thus, it is expected that those species will attain high values of niche position, i.e., marginal niches. The use of an independent data to estimate niche breadth through a resampling procedure like ours does not completely solve this particular problem but minimize it.

Here, we adopted a simple analytical procedure never used before—as far as we know—to estimate the average expected relationships between organism’s abundance and distribution, and their niche. A resampling procedure like ours is a useful tool to minimize the problem of dependence explained above and also to avoid the biased choice of two independent data sets. It is important to notice that we found considerable variability in the coefficients of determination (r 2). For example, we showed that it is possible to find, depending on the particular data subset, models with low explanatory power but also models explaining almost 80% of the variance in the data (Fig. 4d). Also, we showed that sometimes almost a half of the possible models can be non-significant (Fig. 3a). In that sense, we think that future studies seeking for such relationships would benefit from an approach similar to the one we used here because it is a step toward leaving particular explanations (related, for example, to a single data set) to a more general view of the processes that might being operating in nature.

Besides explaining only less than a half of the total variation in local abundance and regional distribution, these relationships seem to be affected by the type of variable used to generate niche measures. Local environmental features of a stream can be partially determined by factors acting at the scale of the surrounding landscape (Hynes, 1970). The OMI analyses we used here provided a description of the variability of habitats used by each taxa in the in-stream and surrounding environmental space. Thus, considering the expected association between in-stream and catchment environmental factors, one would expect that niche measures provided similar responses at both local and landscape levels. This was not the case here. For example, we found that niche position did not explain variation in any response variables when it was estimated from local environmental factors, but was related to both abundance and distribution when estimated from landscape variables. Note, however, that this is not a matter of landscape variables being necessarily more suitable to estimate niche measures. Niche breadth estimated from both local and landscape variables explained similar amounts of variation in local abundance and regional distribution. A number of studies have focused on how local and landscape characteristics of a stream can act to influence the distribution and abundance of macroinvertebrates (Corkum, 1992). Some of these studies have suggested that both scales act in structuring local communities, whereas others have found that local stream variables play a major role (see Richards et al., 1997; Death & Joy, 2004). Here, we demonstrated that landscape-based models were significant independently of the response and predictor variables, whereas local-based models were significant only when niche breadth was the predictor variable. Thus, at this moment, we can only agree with the view that stream communities are structured by processes operating at different spatial scales (Vinson & Hawkins, 1998), without ruling out which scales and factors are the most influential.

Given the high amount of unexplained variation in our regression models and assuming that we have measured appropriate environmental variables to represent niche, we postulate that niche-based processes may not be the main causes for the abundance–distribution relationship in lotic chironomids. In addition to niche position and breadth, some other mechanisms have been proposed as possible determinants of the positive relationship between abundance and distribution of species (Gaston et al., 1997). These other mechanisms are generally classified as statistical, range position, and population dynamic explanations. Since they are not mutually exclusive, it is likely that the abundance–distribution relationship could be a result of multiple processes (Gaston et al., 2000). Statistical explanations are mainly related to sampling artifacts, e.g., non-detection of uncommon taxa that are actually present at a given site, and phylogenetic non-independence, i.e., related taxa does not constitute independent data points in analysis. Previous studies have refuted these two statistical explanations by using large data sets of well-known groups and by controlling for phylogenetic non-independence (see Murray et al., 1998). Although we have not controlled for phylogeny here (there is no available phylogeny for Chironomidae), the most abundant and distributed genera belong to different tribes and subfamilies, so we believe this was not a major cause for the observed relationship. Range position explanations state that because, in general, species abundance are higher at the centers of its geographical range, species whose range limits are located within the study region would have lower abundances (Gaston & Blackburn, 2000). This mechanism was also rejected by a number of studies (see Gaston et al., 1997). As our study was based on genus, the likelihood of this mechanism is diminished here, because the sampled genera have geographical ranges that extrapolate our study area. Finally, population dynamic explanations stem from metapopulation models, mainly the rescue effect hypothesis (Hanski, 1991). In this, dispersal between patches decreases the probability of local population extinction and increases the proportion of patches occupied by a given species. Experimental evaluations of this hypothesis have been conducted but lead to equivocal conclusions. Gonzalez et al. (1998) experimentally interrupted dispersal between moss fragments resulting in a decline in both abundance and distribution, and the loss of the positive abundance–distribution relationship. When dispersal was re-established, there was an increase in both abundance and distribution and the positive relationship between them appeared again. On the other hand, Warren & Gaston (1997) found positive relationships in all treatments of an experiment with protists, even in those where patches were isolated by dispersal limitation. Unfortunately, we do not have information on metapopulation dynamics for the analyzed data. Therefore, a direct investigation of this mechanism is not possible at present.

Recent advances in community ecology point that niche assembly and dispersal assembly models are not mutually exclusive in explaining the same community patterns (Chave et al., 2002, Mouquet & Loreau, 2002). For instance, Cottenie (2005) suggested that when both environment and spatial processes are acting, local communities are structured by a combination of species sorting and mass effects dynamics. Species sorting dynamics emphasizes the importance of species niche and environmental heterogeneity in determining community structure and also assumes moderate dispersal rates (Chave et al., 2002). A mass effect describes a sink-source process of dispersal in heterogeneous environments high enough to change population abundances (Holt, 1993). There is a parallel here between these and the proposed mechanisms for the abundance–distribution relationship (niche-based hypothesis and rescue effect hypothesis). Thus, we believe that future studies should investigate if spatial processes, like dispersal, together with environmental processes affect the abundance–distribution relationship in the context of community structure. This perspective has recently been combined in a unified framework for explaining how compositions of local communities vary in space. The metacommunity framework (Leibold et al., 2004) offers the theoretical background and the analytical tools to integrate local and regional processes in a more inclusive concept for understanding community dynamics.

In summary, the main picture emerging here is that niche-based processes are not the unique cause for variation in the abundance and distribution of lotic chironomids. A novel finding of this study was that we found more support for the niche breadth hypothesis (Brown, 1984) than for the niche position hypothesis (Hanski et al., 1993). We also demonstrated the pertinence of using higher taxa data to analyze the relationship between distribution and abundance. Finally, we showed how a simple resampling procedure can be a useful tool to minimize the lack of independence in estimating niche, abundance, and distribution of taxa from the same data set.