Introduction

Across ecological systems, there are multiple, hierarchical determinants of species composition including climate, geology and chemistry. These larger-scale patterns influence fine-scale habitat features which ultimately influence the conditions an organism experiences in their environment. This hierarchy is especially apparent in river ecosystems (Burcher et al. 2007; Frissell et al. 1986) because these ecosystems integrate the landscapes they drain (Hynes 1975) and respond to changing terrestrial conditions (Likens et al. 1978). For example, discharge, solute and seston load interactively respond to land cover and landscape physiography at the watershed scale, while stream hydraulics, light and organic inputs tend to be more sensitive to reach-scale geomorphology and the condition of the riparian zone (Allan 2004; Snelder and Biggs 2002). Thus, stream communities are affected by land cover through multiple mechanisms operating over a range of spatial and temporal scales (Allan 2004; Burcher et al. 2007; Snelder and Biggs 2002; Winemiller et al. 2010). The distinction between the factors that have direct and indirect effects allows us to better predict those effects by establishing linkages between cause and effect of proximate factors that influence species composition (Burcher et al. 2007). Because of this, overarching affects such as landscape structure, shading and anthropogenic impacts directly influence underlying chemical parameters and physical habitat structure, which ultimately, in conjunction with biotic interactions, influences community composition.

Biomass of algae is in part controlled by light availability and the productivity. The species composition of benthic algae is often an important regulating factor determining macroinvertebrate species composition. Both the species composition and abundance of benthic algal communities in streams are a result of both biotic and abiotic factors that operate at several geographic scales as reviewed in Stevenson (1997). Benthic algae species composition has often been used as an indicator of environmental stress or is characteristic of particular habitat types (Greenwood and Lowe 2006; Reavie et al. 2010; Stelzer and Lamberti 2001; Taylor et al. 2004), especially diatoms (Dixit et al. 1992; Hill et al. 2003; Lawson 1999; Potapova and Charles 2002, 2007; Smucker and Vis 2011; Stevenson et al. 2008; Weilhoefer and Pan 2006; Zampella et al. 2007). Furthermore, conductivity of freshwaters has been shown to explain most of the variation in diatom assemblages in the USA (Potapova and Charles 2003) in addition to spatial factors across biogeographic regions (Potapova and Charles 2002).

Qualitative predictions have been made regarding general longitudinal patterns of light availability and species composition of aquatic invertebrates in rivers (Vannote et al. 1980), but explicit tests of these predictions are scarce (but see Julian et al. 2008c). In the framework of the river continuum concept (RCC), light availability is predicted to increase in a downstream direction as the river increases in width, but with benthic light availability dissipating with increased turbidity in downstream reaches. Studies of timber harvest which leads to greater light availability show distinct changes in algae flora with diatoms composing most of the periphyton prior to logging, and a shift toward large mats of green algae following harvest (Hansmann and Phinney 1973). However, the influence of basin-scale disturbances such as timber harvest on diatom species community is also explained in part by higher N, P, turbidity and conductivity in harvested watersheds (Naymik et al. 2005). While the effect of light (both shading and optical water quality) on benthic primary productivity has been investigated (Julian et al. 2008c), no study has investigated the specific role of light availability on algal species composition. This critical gap exists because light has not been widely recognized as a limiting resource in riverine ecosystems in comparison with nutrients and habitat.

Local habitat features such as substrate effects on algae composition and richness may vary in importance depending on scale. It would appear that at large geographic scales, the importance of substrate is minimal in influencing species diversity of stream diatoms (Potapova and Charles 2005). However, smaller-scale studies have found an influence of substrate on diatom species composition (Soininen and Eloranta 2004; Tuchman and Stevenson 1980) perhaps related to protection from grazing due to substrate roughness (Bergey and Weaver 2004). Thus, understanding the distribution and the features underpinning their occurrence is dependent upon the scale of interest.

Following the hierarchical framework proposed by Stevenson (1997) and the cascade concept outlined in Burcher et al. (2007), we test the determinants of benthic algae genus composition to understand the large-scale patterns that dictate local-scale conditions. Although many studies have utilized algal species identification, many have also shown the utility of genus-level assays in assessing stream condition (Hill et al. 2003; Wang et al. 2005). Researchers have shown that ion concentrations in the water determine algae species composition, specifically diatoms (Potapova and Charles 2003). Here we investigate factors that determine algae community assemblage within a stream and how these factors vary longitudinally along the stream continuum. This study investigates the dependence of physicochemical and biotic characteristics on land cover-associated factors at the watershed scale. We were particularly interested in how benthic light availability varied across the river continuum and how it influenced benthic algae community composition. Our results suggest a hierarchy of factors that determine species composition and indicate the dependence of community composition on differing light regimes.

Materials and methods

Study area

We sampled eight sites between June 8 and June 11, 2012, throughout the Kiamichi River in southeastern Oklahoma, USA (Fig. 1). The Kiamichi River, a tributary of the Red River in the Mississippi drainage, begins in the Ouachita Uplands and flows 197 km through a narrow, mainly ridge-and-valley watershed (3686 km2). The mean monthly discharge between 1973 and 2014 at the most upstream site (USGS 07335700) was 1.7 m3 s−1 and the mean monthly discharge near the most downstream sampling site (USGS 07336200) between 1983 and 2014 was 37.1 m3 s−1. This river flows through the Ouachita highland region and is known for its high fish and mussel biodiversity (Matthews et al. 2005). The study area is dominated by forest cover, and agriculture is primarily cattle and chickens. The river is typically shallow, with warm water temperatures during the summer and more moderate temperatures and flow during the remainder of the year (Table 1).

Fig. 1
figure 1

Algae collection sites along the Kiamichi River. Site 1 is the most upstream site, and site 8 is the most downstream site

Table 1 Water chemistry variables (mean), light availability (mean) and watershed land cover for each sample site

Physiochemical parameters

We used a calibrated YSI Professional Plus multiparameter sonde (YSI, Yellow Springs, OH, USA) to measure temperature, specific conductivity, pH and dissolved oxygen at each site. A Hach 2100Q portable turbidimeter (Hach, Loveland, CO, USA) was used to determine nephelometric turbidity units (NTU) at each site. We measured depth (m) and stream width (m) at each of the algae collection points. While we do not have water chemistry data for our sites in 2012, we collected triplicate water samples twice (June and July) in 2010 for total dissolved nitrogen and phosphorus determination at each of these sites except for the most upstream site. Additionally, we measured total dissolved nitrogen and phosphorus at one of these sites (site 7) in July 2011 and noted little difference in nutrient concentrations between 2010 and 2011. Site 1, the most upstream site, was only sampled once in July 2010. Samples were field filtered, acidified, and analyzed for total dissolved nitrogen and phosphorus (following persulfate digestion) within 28 days of collection using a Lachat QuikChem FIA+ 8000 Series flow injection analyzer (Hach Company, Loveland, CO, USA).

Algae sampling and identification

We designated the three main habitat types found in the Kiamichi River for sampling all sites: mid-channel, riffle and water willow (Justicia americana) areas. Within each habitat type, we sampled three rocks of varying sizes to try to minimize the effect of varying hydrologic disturbance on algae colonization: a small (2–16 mm), a medium (16–64 mm) and a large (64–250 mm) rock was sampled in each habitat type. We used a soft-bristled brush in water to remove algae from the entire rock, and the resulting slurry was collected and preserved in 3 % glutaraldehyde. In the laboratory, >300 cells were identified to genus and counted at 100× magnification for each individual sample, which resulted in ~1000 cells identified and counted for each habitat/site. Counts were used to calculate relative abundances (proportions) of algal genera and the distribution of algal groups (green algae, Chlorophyta; diatoms, Bacillariophyceae; and blue-green algae, Cyanobacteria) at each site.

Canopy data and light modeling

We used the benthic light availability model (BLAM; Julian et al. 2008b) to determine the amount of photosynthetically active radiation (PAR) available to the benthic algae community. BLAM (Julian et al. 2008b) calculates the amount of PAR at the stream bed (E bed) by incorporating the terrestrial and aquatic controls on benthic light availability. The first-order control on light availability is above-canopy PAR (E can) in mol m−2 day−1, where one mol equals 6.02 × 1023 photons. E can is the total PAR (as irradiance) available to the river before any shading from topography or riparian vegetation. We obtained E can measurements from a nearby Oklahoma mesonet station (http://www.mesonet.org/) for the week prior to sample collection. We took the average PAR across these days which was equivalent to the daily average PAR for the study period.

Topography and shading decrease the intensity of PAR that reaches the water surface of a stream, reducing E canE s . The ratio of E s :E can is the shading coefficient (s), where s decreases with increased shading. We used the “canopy photo method,” where a hemispherical canopy photograph is overlaid by the sunpath to calculate how much solar radiation is transmitted through openings in the canopy (Fig. 2). Most other methods used to quantify stream shade (e.g., clinometer and densiometer) can underestimate transmitted PAR by as much as 85 % (Chazdon and Pearcy 1991), while the canopy photograph method has been used successfully to quantify stream shading (Taylor et al. 2004). At each site, digital hemispherical canopy photographs were taken using a Nikon Coolpix 4500 with a fisheye lens. Three canopy photographs were taken at each site above each algae collection point (the three habitat types). For each photograph, we processed and analyzed each photograph with gap light analyzer (GLA) software according to Frazer et al. (1999) to obtain E s and s with parameters set in GLA as shown in Table 2.

Fig. 2
figure 2

Canopy photograph from sample site 1

Table 2 Parameters used in gap light analyzer program

Reflection at the air–water interface decreases the intensity of PAR that enters the water column, reducing E sE 0. The ratio of E 0:E s is the reflection coefficient (r) and was estimated using Fresnel’s formula (Kirk 1994). Once light enters the water column, it is attenuated exponentially with depth at a rate defined by the diffuse attenuation coefficient (K d). We estimated K d from turbidity (T n ) measurements, where K d = 0.17T n (Julian et al. 2008b). The PAR at depth (y) on the streambed (E bed) in mol m−2 day−1 at one location in time is:

$$E_{\text{bed}} = \left( {E_{\text{can}} \times s \times r} \right){\text{e}}^{{-K_d}y}$$

E bed was derived for each individual canopy photograph and then averaged to determine site E bed. The E bed predictor integrates total light availability, shading by the stream, water turbidity and average depth.

Land cover

We derived watershed areas for each sampling point using the Spatial Analyst Toolkit in ArcMap 10.0 (Environmental System Research Institute, Redlands, CA, USA) with a 30-m digital elevation model (DEM) from the National Elevation Dataset. Then, we obtained land cover (30-m resolution) for the USA from the 2006 National Land Cover Database (Homer et al. 2004). Grids were lined up, and for each site, we derived the watershed land cover. Land cover classifications included forest, agriculture, grassland, urban, wetland/open water and barren.

Data analysis

We investigated the structure and correlation structure within the abiotic data and land cover data with Spearman rank correlation similar as Burcher et al. (2007). To determine our sampling efficiency, we looked at the species–area curve. We also examined the most abundant genera using a log abundance plot using the mvabund package in R v3.1.2.

We used an analysis of similarity (ANOSIM) to determine whether there were significant differences in algal communities across the three habitat types sampled and/or across sites. Analysis of similarities provides a way to test whether there is a significant difference between two or more groups of sampling units (Clarke 1993). We removed algae genera in which only one cell was counted, and then, each sample was transformed into proportions. ANOSIM was calculated using Bray–Curtis distances with 999 Monte Carlo permutations. Each test in ANOSIM produces an R-statistic, which contrasts the similarities of sites within a habitat with the similarities of sites among habitats or vice versa.

To determine differences in algae across the sample sites and the variables that controlled algae community composition, algae data from all habitats were combined for each site due to algae communities not being different across the habitat types. Variation of algae assemblages across the sites were summarized using non-metric multidimensional scaling (NMDS), a multivariate ordination technique commonly used in ecological community analysis (Tabachnick and Fidell 2007). We used algal relative abundances to calculate Bray–Curtis dissimilarity indices among the sites. NMDS projects each site into a species-defined ordination space with two or more dimensions based on their ranked dissimilarity. The goodness of fit for the NMDS projections was measured as a stress value which quantifies the deviation from a monotonic relationship between the distance among sites in the original Bray–Curtis dissimilarity matrix and the distance among sites in the ordination plot.

We aimed to determine the relationships between land cover, location in the watershed and physiochemical parameters and their control on algal community composition. Following the NMDS, we wanted to understand the hierarchy of the data in the form of a conceptual diagram. We used the envfit function in the vegan package (Oksanen et al. 2011) in Rv3.1.2 to do a post hoc analysis of the physiochemical variables (i.e., conductivity, pH, temperature, light availability and nutrient concentrations) that may be contributing to algal community assemblage. The envfit function finds the direction in k-dimensional ordination space that has maximal correlation with an external variable and assesses the significance of the variables using permutation tests. All analyses were performed using the R software package (R Development Core Team 2012).

Results

Algae genera and predictors

We collected 49–66 algae genera at each of our samples sites and 85 genera total. We accumulated genera richness as we accrued more samples; however, this accumulation reached an asymptote (Fig. 3). The most common genera across all sites were Gomphonema, followed by Navicula and Scenedesmus (Fig. 4). Our ANOSIM indicated that there were no significant differences in algae genera assemblage structure across the three habitat types sampled at the sites (R 2 = 0.02, p = 0.28), but there were differences across the sites (R 2 = 0.49, p < 0.001). Because there were no differences among habitat types, we combined all data for each site into total proportion of algae assemblage. When we ran our NDMS on the combined algae proportions, NMDS explained over 99 % of the variation in the algae genera data structure with low stress (stress = 0.06; Fig. 5). NMDS axis one was negatively related to the abundance of many of the green algae taxa (i.e., Chaetophora, Chlamydomonas, Chlorella and Mallomonas) and positively related to several blue-green taxa (Aphanocapsa, Microcystis, and Rhizoclonium; Fig. 5). We found that NMDS axis two was negatively related to several diatom taxa (e.g., Ellerbeckia, Kirchneriella, Melosira, Nitzschia and Pseudostaurastrum) and positively related to Mougentia, Selenastrum, and Stigeoclonium. Joint plot analysis indicated that algae genera composition was most strongly regulated by light reaching the bottom of the stream (E bed) and conductivity (Fig. 5), while pH, nutrient concentrations and temperature were not significant predictors.

Fig. 3
figure 3

Cumulative algae genera collected during the study going from upstream to downstream in the Kiamichi River during this time period

Fig. 4
figure 4

Figure depicting the most common genera across all sites. The two most common groups, Gomphonema and Navicula, are both diatoms

Fig. 5
figure 5

Non-metric dimensional scaling plot indicating the algal community composition of the sites (points with corresponding site id) with the significant joint plot predictors shown. Conductivity and the light reaching the channel bed (E bed) are the strongest predictors of algae community composition

Abiotic data, land cover data and hierarchy

There was a strong correlation structure within the data. Conductivity was positively correlated with both watershed area and percent agriculture in the basin (Fig. 6; ρ = 0.83, p = 0.02; ρ = 0.90, p = 0.004, respectively). Total phosphorus was negatively correlated with watershed area, but not agricultural land cover (ρ = −0.76, p = 0.03; ρ = −0.51, p = 0.19, respectively). Total nitrogen (N) was not significantly correlated with watershed area or percent agriculture (ρ = 0.47, p = 0.24; ρ = 0.29, p = 0.50, respectively). The pH at the site was significantly correlated with watershed area (Fig. 6; ρ = 0.81, p = 0.02), but was not significantly correlated with agricultural land cover (p = 0.17). Turbidity was lowest at the most upstream site and was highest at site 3 (12.2 NTU) and remained relatively constant until declining downstream of site 5 and was not correlated with the amount of agriculture in the upstream basin (ρ = 0.43, p = 0.30). The amount of light coming though the canopy (% trans) was also significantly positively correlated with the watershed area (ρ = 0.73, p = 0.04), and the stream temperature was significantly positively correlated with the amount of light coming through the canopy (ρ = 0.73, p = 0.04). This suggests that the overall position in the watershed affects the two factors (i.e., conductivity and light reaching the benthic habitat) that most strongly regulated algae community composition in this watershed.

Fig. 6
figure 6

Conceptual diagram showing the controls on algal community assemblage. Significant predictors are shown with a dark solid line, while nonsignificant predictors are shown with a dotted line

Discussion

Our results indicate that landscape factors influence in-stream physiochemical parameters that play a role in controlling algal community composition in streams, supporting Stevenson’s (1997) framework. Both light reaching the stream bed and conductivity were direct regulating factors impacting the broader algal assemblage structure. However, in order to understand how these factors operated over an entire watershed and the landscape features that lended to these patterns, we had to examine watershed-scale factors that influence light and conductivity. By viewing streams as hierarchically organized systems, we were able to focus on a small set of variables at each level that most determine system behaviors and capacities within the relevant spatiotemporal frame. Our hierarchical approach successfully linked land cover to biotic responses through a series of intermediate abiotic links. Unlike bivariate comparisons, between land cover and species composition, our framework allowed for some mechanistic understanding of how landscape features are propagated to ultimately influence organisms. Previous results have shown that diatom assemblages are distributed continuously along gradients of conductivity (Potapova and Charles 2003), but did not investigate higher-order factors. By viewing our stream community as a system organized and developed around spatially defined habitats (Frissell et al. 1986; Poff 1997), we increased our understanding of the spatial structure and levels of organization. Stream communities can be viewed as systems organized within this hierarchical habitat template.

The qualitative expectation of light availability proposed in the RCC (Vannote et al. 1980) is a parabolic distribution in which light is low in the upper and lower reaches and high in the middle reaches, reflecting a transition from shading by riparian vegetation giving way to aquatic light attenuation by increasing turbidity in larger river reaches. While we observed decreased riparian shading as we moved downstream, turbidity did not necessarily get higher as we moved downstream, but was highest in the middle reaches which lead to a negative unimodal pattern in benthic light availability in our study system. Higher turbidity in downstream reaches has been noted in agricultural systems (Julian et al. 2008c), while an asymptotic pattern in turbidity was seen in a larger investigation of several unaltered streams (Julian et al. 2008a), suggesting that these varying patterns may be common and related to watershed condition. Water clarity is primarily dictated by the particulates in the water column rather than by dissolved constituents (Julian et al. 2008a, b, c). While we did not note any correlation between agricultural land cover and water turbidity in our study, there is some logging in the Kiamichi basin (C. Atkinson, personal observation) that could drive some of the patterns in turbidity.

At larger (multiple regions or continental) spatial scales, factors such as geology, ecoregion classification, disturbance and watershed land cover may have a greater impact and perform better in predicting species composition (Allan 2004; Cardinale et al. 2006; Pyne et al. 2007). However, our approach was important in determining the landscape-scale factors that influenced local abiotic conditions that lead to variability in algae genera composition within a watershed. Whether or not algal genera are structured in a similar fashion across broad geographic regions is not fully known. However, Potapova and Charles (2002) found in a national-level study that diatoms were structured by three major ecologic gradients, a gradient associated with the RCC, mineral content and pH and a temperature gradient-related latitude/altitude of sites, but did not investigate light and its association with the RCC. They found that temperature measured at the site at time of sampling had no direct relationship community composition, yet only at a national scale did variation in temperature correlate with species abundance.

Benthic light availability was most strongly correlated with benthic algal assemblages in our study. Our study shows that the BLAM model is effective to understand light availability and was a significant factor in determining algae genera community assemblage. More work across larger scales needs to be done to fully appreciate the importance of light on algal assemblages. This model may be used in future studies to predict the available energy for primary consumers and trade-offs between gross primary productivity and community assemblage. The importance of forest canopy cover can influence light levels received by the stream benthos, reducing incoming energy and nutrients, shifting the balance between algal groups (e.g., those that tolerate low light, such as diatoms, and those tolerate high light such cyanobacteria and green algae). Studies of timber harvest show distinct changes in algae flora with timber harvest (Hansmann and Phinney 1973). The influence of basin-scale disturbances such as timber harvest in previous studies correlates with diatom species community changes in part explained by higher total nitrogen, phosphorus, turbidity and conductivity versus unharvested watersheds (Naymik et al. 2005). We observed timber harvesting occurring in this basin, but it was difficult to map the extent and time period of harvests, and thus, direct data on timber effects could not be incorporated into this study. Yet, using the BLAM allowed us to incorporate both shading and turbidity into a single variable, which was successful in describing community composition.

An interplay between disturbance (flood frequency) and nutrients in flowing waters often results in higher species richness at intermediate levels (Biggs and Smith 2002; Cardinale et al. 2006). While we did not investigate temporal variability in benthic algal assemblages, our study investigated “stable” algae communities during summer flow periods, the time period of the highest biological activity in temperate streams. We observed little variability in overall genera richness across the sites (49–66 genera), there were not differences in the assemblage within the habitat types sampled at a site, and there was not a longitudinal pattern. Overall, this is a low-nutrient system (Atkinson et al. 2013), and we found that many of the genera detected in this system are often associated with nutrient-poor waters (Dillard 1999; Pan et al. 1996; Winter and Duthie 2000). Because it is a relatively nutrient-poor system, nutrients may be taken up at a high rate due to biological demand, potentially preventing us from detecting a nutrient affect. Furthermore, nutrients can be highly temporally variable; in-stream nitrogen concentrations based on a limited sampling frequency in agricultural streams are often poorly correlated with diatom metrics (Porter et al. 2008). Conductivity increased significantly with the percentage of the catchment in agricultural land cover as found in Biggs (1995); however, the correlation between location within the watershed and conductivity was relatively strong as well, making it difficult to determine whether it was a land cover effect.

Our study showed that a set of hierarchical factors affected the benthic genera composition, suggesting that better predictions can arise from the knowledge of the higher-scale factors that constrain lower- and local-scale factors. Many of these higher-scale factors may also have a strong influence on assemblage structure across systems; however, varying disturbance regimes and climate lead to different communities (Power and Stewart 1987). The influence of light, nutrients and consumer density can affect periphyton production through both bottom-up and top-down control (Mallory and Richardson 2005). Freshwater mussels have been found to be important in influencing algal assemblage structure within rivers in this region and can lead to variation in algal communities where they dominate (Atkinson et al. 2013). A multitude of factors operating various spatial and temporal scales that we did not incorporate can influence algal assemblages. Thus, incorporating a hierarchical framework with other models including the temporal framework and spatial heterogeneity across broad geographic areas will increase understanding of complex ecosystems and ability to deal with large-scale environmental changes.