Introduction

Understanding the drivers that shape community structure is a central theme in community ecology. These drivers can be studied in the context of a metacommunity (Leibold et al., 2004). A metacommunity is ‘a set of local communities that are linked by dispersal of multiple potentially interacting species’ (Wilson, 1992; Leibold et al., 2004). The concept of metacommunity is based on the notion that the variation in community structure is affected by both local- and large-scale environmental and spatial processes (Leibold et al., 2004; Holyoak et al., 2005). It has also been recognized that environmental filtering and dispersal are the fundamental processes structuring metacommunities (Lindström & Langenheder, 2012), as are also biological interactions (Cadotte & Tucker, 2017). Thus, metacommunity studies should focus on the relative roles of these processes (Heino et al., 2015).

The metacommunity has often been treated as a whole without any systematic division within different organismal groups (e.g. diatoms, macrophytes and macroinvertebrates). However, there is typically variation in biological and ecological characteristics between different species even if they belong to the same organismal group (Pandit et al., 2009). The effects of environmental and dispersal processes on local communities may depend on the differences in species traits in metacommunities. Thus, dividing data matrices into different groups by species traits can increase understanding of metacommunity organization (Lindström & Langenheder, 2012). This deconstructive approach has been increasingly applied in recent years when studying community patterns (Grönroos et al., 2013; Alahuhta et al., 2014; Algarte et al., 2014; Vilmi et al., 2017). One way to approach this is to split biological data matrices into smaller parts by dividing species into generalists and specialists based on species ecological specialization (Devictor et al., 2008; Pandit et al., 2009). For example, some studies have shown that environmental control is more dominant in specialist species whilst generalist respond mainly to spatial processes (e.g. Pandit et al., 2009), whereas other studies have shown different patterns, such as environmental control being dominantly independent of specialization (e.g. Székely & Langenheder, 2014). Furthermore, several studies have produced divergent results regarding which factors are important in determining variation in community structure. According to Pandit et al. (2009), these divergent results can be due to different ratios of ecological specialization in different systems studied.

In addition to ecological specialization, biological data matrices can be divided into smaller parts using other biological traits, for example, growth forms and cell sizes (Heino & Soininen, 2006; Rimet & Bouchez, 2012). In the study of freshwater algae, one approach is the use of different guild divisions (Göthe et al., 2013; Vilmi et al., 2017). Many of these studies have used guild classification based on Passy’s (2007) study. Originally, Passy (2007) proposed a diatom guild classification based on the potential of species to use nutrient resources and to resist physical perturbation. Rimet & Bouchez (2012) modified the classification and added one new guild corresponding to planktic species.

Different ecological guilds can be expected to respond in different ways to environmental and spatial processes. Several studies have shown that these guilds respond in different ways to environmental conditions both in lotic (Passy, 2007; Berthon et al., 2011; Göthe et al., 2013) and lentic (Gottschalk & Kahlert, 2012; Vilmi et al., 2017) environments. However, the patterns found have not always been similar, as same guilds have shown dissimilar responses to environment in different studies. Also, these studies have been conducted mainly in areas with relatively high nutrient concentrations, and there is a lack of studies in nutrient-poor, harsh subarctic stream environments (but see Berthon et al., 2011).

In the freshwater realm, studying the relative roles of the environmental and spatial components in community composition is a commonly used approach for understanding metacommunity organization (De Bie et al., 2012; Alahuhta et al., 2014; Vilmi et al., 2016, 2017). The environmental components of community variation can be seen as illustrating environmental filtering and the importance of spatial variables may suggest dispersal as determinants of metacommunity structuring (Hájek et al., 2011). Since it is challenging to measure dispersal rates directly (Jacobson & Peres-Neto, 2010), spatial-based dispersal proxies are commonly used (e.g. Grönroos et al., 2013). Specifically, there is very little information available on the dispersal rates of diatom species, and it is particularly difficult to determine dispersal rates of these passively dispersing species directly.

Environmental filtering has been shown to be the main mechanism structuring metacommunities of various organisms in different environments (Van der Gucht et al., 2007; Heino et al., 2017). According to the hierarchical landscape filters model of Poff (1997), species from a regional pool must pass through a series of nested filters in hierarchical order to join a local community. Until recent years, there has been a prevailing idea that unicellular organisms are ubiquitously distributed (Finlay, 2002), environmental filtering is the main mechanism structuring also diatom communities and spatial factors have only minor effects on their community structure (Finlay & Fenchel, 2004; Soininen, 2012). This has been due to the consideration that diatoms have enormous population sizes (Finlay, 2002) and are efficient passive dispersers (Kristiansen, 1996). Nevertheless, spatial factors have been shown to be important structuring elements for diatoms (Hillebrand et al., 2001; Soininen & Weckström, 2009; Heino et al., 2010), and they have been found to be important in determining diatom community structure at continental (e.g. Potapova & Charles, 2002), regional (e.g. Heino et al., 2010) and watershed scale (e.g. Göthe et al., 2013). However, many studies have also found that environmental conditions exceed spatial factors in importance for variation in community structure (e.g. Verleyen et al., 2009; Göthe et al., 2013). It has been suggested that the effects of spatial factors will increase with the spatial extent of the study area (Verleyen et al., 2009), and that the ratio of spatial and environmental components can be related to specific habitats (Soininen & Weckström, 2009). However, these can also be related to different ratios of ecological specialization (Pandit et al., 2009).

In this study, we examined the relative importance of environmental variables at local and catchment scales and spatial factors structuring stream diatom communities. Our aim was to study whether different species trait groups of stream diatoms show divergent responses to spatial and environmental factors and which processes are dominant in structuring a diatom metacommunity in subarctic streams. We tested whether responses to environmental and spatial variables varied between ecological guilds (i.e. high-profile, low-profile, motile and planktic guild) and between groups based on ecological specialization (i.e. generalists and specialists). Based on previous findings, we hypothesized the variation in the structure of the diatom communities as a whole to be related to both environmental and spatial variables (H1), but the environmental control to be more dominant (H2). We hypothesized weaker responses to the spatial variables due to the small study area (i.e. virtually no dispersal limitation). We also hypothesized that there would be variation in responses to environmental and spatial variables between the ecological guilds (H3), and that generalists and specialists would differ strongly in their responses to environmental and spatial variables (H4). We hypothesized that the environmental control would play a more important role in explaining the variation of specialist species (H5), and that the variation of generalist species would depend more on spatial factors (H6).

Materials and methods

Study area

This study was conducted in the Tenojoki drainage basin (centred on 70°N, 26°E). The drainage basin is located in northernmost Finland and Norway, and the main river, the River Tenojoki, flows to the Arctic Ocean (Fig. 1). The total area of the drainage basin is 16 386 km2. The study area had a mean annual temperature of − 1.3°C and a mean annual precipitation of 433 mm in the climatological normal period 1981–2010 (Pirinen et al., 2012). The study area is mainly in the subarctic deciduous birch zone and it is characterized by arctic-alpine vegetation (Hustich, 1961). At higher altitude, barren fell tundra is typical and at low altitude there are mountain birch (Betula pubescens ssp. czerepanovii) woodlands. The study area consists mainly of Precambrian bedrock and the topography of the area is characterized by variable gently sloping fells (i.e. rounded mountains) (Mansikkaniemi, 1970). Peatlands are located mainly in the valleys between fells and they are relatively rare. The percentage of lakes is quite low (3.1%; Korhonen & Haavanlammi, 2012) at the study area, and therefore, the streams have rapid fluctuations in discharge especially in the spring season (Mansikkaniemi, 1970). The area is very sparsely populated and anthropogenic influence is minimal. Thus, headwater streams in the drainage basin range from near-pristine to pristine (Schmera et al., 2013). Stream waters are circum-neutral, and nutrient levels are indicative of highly oligotrophic systems (Heino et al., 2003).

Fig. 1
figure 1

Map showing the location of the Tenojoki drainage basin, the study sites and the catchments of those sites (green). Only the streams from the Finnish side of the Tenojoki drainage basin are presented, with the exception of the main stem of River Tenojoki and the most north-eastern part of the map. Note that all 52 study sites are located in tributary streams and there are no sites in the main stem of the River Tenojoki. Only sites included in the data analyses are visible on the map

A total of 55 streams from the Finnish side of the Tenojoki drainage basin were surveyed in early June 2012. We aimed to sample all easily accessible sites that met the following criteria: (1) The length of a sampled stream must be at least 1 km. (2) The distance from the sampling site to a lake or a pond upstream had to be at least 0.5 km. (3) Only streams with permanent flow were included. (4) Large rivers (i.e. stream width > 25 m, water depth > 50 cm) were not included in order to get reliable and comparable samples. The size of the sampling site at each stream was approximately 50 m2. All 55 sampling sites are located in tributary streams and there are no sites in the main stem of the River Tenojoki. The distance between sampling sites furthest away from each other is 142 km.

Environmental variables

Three types of explanatory variables were used: environmental variables at local and catchment scale (Table 1) and spatial variables. We decided to divide the environmental variables into two separate groups, as stream communities are structured by the hierarchical effects of environmental variables at different scales, e.g. local environmental and catchment variables (Poff, 1997). Local variables were determined at the same time with the diatom sampling. Variables included both physical habitat and water chemistry variables. Mean width of the sampling site (m) was determined based on five cross-channel measurements. Height of the lower stream bank (area of no terrestrial vegetation; cm) and steepness of the stream bank (area of terrestrial vegetation; cm) were measured at the same locations. Height of the lower stream bank was measured from the water level to the start of terrestrial vegetation. Steepness of the upper stream bank (how many centimetres the stream bank rises in two metres’ distance from the stream) was measured perpendicular to the stream. Current velocity (m s−1) and depth (cm) were measured at 30 random locations in a sampling site. Moss cover (%) and particle size classes (%) were visually estimated at 10 1 m2 plots at random locations in each sampling site. A modified Wentworth’s (1922) scale of particle size classes was used: sand (0.25–2 mm), gravel (2–16 mm), pebble (16–64 mm), cobble (64–256 mm) and boulder (256–1024 mm). Based on visual estimates (%) for each plot, mean values for each site were subsequently calculated and used in all statistical analyses. Shading (%) by riparian vegetation at each sampling site was also visually estimated. Conductivity and pH were measured in the field at each sampling site using YSI device model 556 MPS (YSI Inc., Yellow Springs, OH, USA). Water samples taken during fieldwork were analysed for iron, manganese, colour and total nitrogen following European standards. In the study area, concentration of total phosphorus is mainly below the accuracy limits of the analysis methods used (< 5 μg/l) (e.g. Heino et al., 2003). Therefore, it was not analysed in this study.

Table 1 Summary of local and catchment variables across the study sites in the River Tenojoki drainage basin

The catchment variables of each stream were calculated using ArcGIS 10.1 software (ESRI, Redlands, CA, USA), and they were based on maps acquired from the National Land Survey of Finland (Table 1). These variables consisted of drainage basin area (km2), proportion of lakes (%), length of the stream (km) and lake distance index. Lake distance index was formed using the distance to the upstream lake. This index represents the influence of the lake. There were some streams that did not have a lake upstream, and for those streams a value two times the longest distance between sampling site and lake found in the study area was given to reflect zero influence. Additionally, proportion of peatlands (%), proportion of shrub (%), and proportion of rock and cobble deposit (%) were used to mirror natural background concentrations that influence water quality, as nutrients and other chemical components are leached from drainage basin to streams to a variable degree depending on land cover type.

In addition, variables representing productivity in catchment area were used: mean and standard deviation of the NDVI (normalized difference vegetation index; Tucker, 1979 and Tasseled Cap greenness (Crist & Cicone, 1984). The mean and standard deviation of both variables were computed, as it has been proposed that mean values describe the average degree of productivity and standard deviation describes the variation of productivity (Parviainen et al., 2013). In addition to productivity, it has been proposed that these variables act as proxies for nutrients leaching from terrestrial areas to aquatic ecosystems (Soininen & Luoto, 2012). NDVI and greenness indexes were calculated from the Landsat 7 ETM scene (Hjort & Luoto, 2006).

Spearman’s correlation test (cut-off level: rs = 0.8) was performed between all the environmental variables to avoid high correlations between variables. Pebble (16–64 mm), length of stream (km) and NDVI variables were excluded from further analyses based on strong correlations with other variables. There were also high correlations between other variables, but because those variables belong to different variable groups (i.e. local or catchment), these correlations were not taken into account.

Sampling and processing diatoms

Diatom sampling and processing were carried out in accordance with the European standard (SFS-EN 13946, 2003). At each sampling site, diatoms were sampled from randomly collected cobble-sized stones from water depths of approximately 10 to 30 cm. The upper surface of the stones was scrubbed with a toothbrush and stream water, the water being pooled into one composite sample for each sampling site. In the laboratory, the diatom samples were cleaned from organic material using a strong-acid solution (HNO3:H2SO4; 2:1) and mounted in a synthetic resin, Naphrax®. To determine the relative abundance of the diatom species, approximately 500 diatom valves were counted and identified to the lowest possible taxonomical level for each sample. This was done with a light microscope using differential interference contrast (1,000 × magnification). The identification and counting followed standard methods (SFS-EN 14407, 2005) using the Diatoms of Europe series (Lange-Bertalot, 2000, 2001, 2002, 2011) and Lange-Bertalot (2011) flora and other specialized bibliographical data when needed. Taxonomic assignments could not be made for some valves and they were omitted from analyses.

Dividing diatom data matrices into different groups

For dividing data matrices by species traits, diatoms were assigned into four ecological guilds reflecting their growth morphology. This was based on the classification made by Rimet & Bouchez (2012): low-profile, high-profile, motile and planktic guild. The low-profile guild includes species that grow very close to the substrate. These species are adapted to high current velocities and to low nutrient concentrations (Rimet & Bouchez, 2012). The high-profile guild includes species of tall stature. These species are adapted to low current velocities and high nutrient concentrations (Rimet & Bouchez, 2012). The motile guild includes species that can move actively relatively fast (Passy, 2007; Rimet & Bouchez, 2012). The planktic guild includes species that are adapted to lentic environments with morphological adaptations that enable them to resist sedimentation (e.g. Cyclotella spp.), and additionally nearly all filamentous diatom species (e.g. Aulacoseira) (Rimet & Bouchez, 2012).

Diatom species were also assigned into two groups, generalists and specialists, based on their ecological specialization. This was based on niche breadth measures determined previously by Heino & Soininen (2006) in northern Finland. The measure of niche breadth should preferably be based on a dataset different from the focal dataset in community–environment modelling. Heino & Soininen (2006) determined niche breadth that measures amplitude in species habitat distribution using the Outlying Mean Index (OMI; Dolédec et al., 2000) analysis. This multivariate method measures the marginality of species habitat distribution, i.e. the distance between the mean habitat conditions used by a species and the mean habitat conditions across the study area (Dolédec et al., 2000). It provides two relevant niche measures, including OMI (i.e. niche position) and tolerance (i.e. niche breadth). The latter was hence used as a measure of environmental niche breadth in this study, following previous studies (Heino & Soininen, 2006; Heino & Grönroos, 2014).

The sites, in which species from all four guilds and generalist and specialist species were not found, were excluded from data analysis. Thus, there were 52 sites left for further analysis (Fig. 1). Since all the diatom species found in the study area were not included in Rimet & Bouchez (2012) classification and Heino & Soininen’s (2006) data, we formed a matrix that included all the species that belonged to any of the four guilds and another matrix that included all generalists and specialist species. Therefore, there were nine species matrices in total for further analyses (Table 2).

Table 2 Total number of diatom species, and minimum (min), maximum (max), mean (mean) and standard deviation (SD) of local number of species in different species matrices

Statistical methods

To reveal spatial patterns at multiple spatial scales and address complex patterns of spatial variation, the method of Principal Coordinates of Neighbour Matrices (PCNM; Borcard & Legendre, 2002; Borcard et al., 2004; Fig. 2) was used. The PCNM analysis creates a number of spatial variables based on Euclidean (geographical) distances between sampling sites. The Euclidean distance matrix is analysed through a principal coordinate analysis to reveal spatial relationships amongst sites in decreasing order of spatial scale. The results are spatial variables representing spatial structures ranging from small- to large-scale across a study area. The first variables with large eigenvalues represent broad-scale variation and the last ones with small eigenvalues represent finer scale variation (Diniz-Filho & Bini, 2005). The PCNM analysis has been used increasingly to describe spatial patterns in various organism groups (e.g. Vilmi et al., 2017), as it is effective in modelling spatial structures in biological communities at multiple scales (Dray et al., 2012). The spatial structures represented by the PCNM variables can be the result of, for example, dispersal, historical factors, or spatial autocorrelation of environmental variables or biological interaction (e.g. Dray et al., 2012). However, it is also possible that using PCNM variables in variation partitioning overestimates the spatial component (Gilbert & Bennett, 2010; Smith & Lundholm, 2010). Spatial variables were derived from the geographical coordinates of sampling sites using the function pcnm in the R package PCNM (Legendre et al., 2013). In this study, only spatial variables showing positive spatial autocorrelation were employed (Borcard et al., 2011). Analyses were additionally done using east and north coordinates of the sampling sites instead of PCNM variables, but since the PCNM variables explained more of the variation in community structure, the coordinates were excluded from the analyses.

Fig. 2
figure 2

A schematic diagram showing the methodology used. The analyses were done separately for each species data matrix

The effects of local, catchment and spatial scale variables on diatom community structure were quantified using redundancy analysis (RDA; Rao, 1964; Fig. 2). This method evaluates how much of the variation in community structure can be explained by these variable groups. The pure and shared variations were analysed using variation partitioning through the partial redundancy analysis (pRDA; Borcard et al., 1992; Fig. 2). The aim in variation partitioning is to reveal how much of the variance in species community structure can be explained uniquely by each explanatory variable group as well as the shared variance explained by different combinations of these variable groups. Also, the unexplainable variation is revealed. With three groups of explanatory variables, the result is eight different components of variation (Fig. 3; Anderson & Gribble, 1998).

Fig. 3
figure 3

Venn-diagrams showing the fractions of diatom community structure explained by the local variables (Local), the catchment variables (Catchment) and spatial variables (Spatial). All fractions are based on adjusted R2 values shown as percentages of total variation. Values < 0 are not shown. Aall taxa, B ecological guilds, C high-profile guild, D low-profile guild, E motile guild, F planktic guild, G generalist and specialist, H generalist, I specialist

First, all species matrices were Hellinger-transformed, since the species data contained many zeros and this transformation enables the use of linear methods (Legendre & Gallagher, 2001; Fig. 2). The explanatory variables were selected for final analyses using the conservative forward selection method developed by Blanchet et al. (2008; Fig. 2). This method was used to prevent the occurrence of artificially inflated explanatory powers in models. The forward selection was carried out using function ordiR2step in the R package vegan (Oksanen et al., 2013) and it was done separately for each species matrix (i.e. low-profile guild, high-profile guild, etc.). The variation partitioning was done following the protocol of Borcard et al. (1992) using the function varpart in the R package vegan (Oksanen et al., 2013). In this study, only adjusted R2 values were used, as those take into account the number of explanatory variables at each variable group and sample size (Peres-Neto et al., 2006). The significance of each testable fraction was observed using test of fraction which is based on permutation (Fig. 2). This was done using function anova in the R package vegan (Oksanen et al., 2013). All these analyses were performed separately for each species matrices in precisely the same way.

Results

A total of 190 diatom taxa were identified, species richness per site ranging from 19 to 55 (Table 2; Online Resource). The most common species were Achnantidium minutissimum (Kützing) Czarnecki s.l., Rossithidium pusillum (Grunow) F.E.Round & Bukhtiyarova and Fragilaria gracilis Øestrup. The species with the highest average abundance were A. minutissimum s.l., R. pusillum and Fragilaria arcus (Ehrenberg) Cleve var. arcus, which all belong to the low-profile guild and are generalists. From the taxa, 117 species (62%) belonged to the ecological guild classification made by Rimet & Bouchez (2012). In the sampling sites, an average of 77% of species belonged to one of the ecological guilds. In the high-profile guild, there were more species than in the other guilds. Only 57 species of the taxa (33 generalist and 24 specialist species) were found in Heino & Soininen’s (2006) data. However, in the study sites, an average of 60% of species were either generalists or specialists.

Through the PCNM analysis, 15 spatial variables showing positive spatial autocorrelation were formed. The most common local variable included in the RDAs, determined by the forward selections, was moss cover (%) and the most common catchment variable was lake distance index (Table 3). Both variables, as well as the spatial variable describing broad-scale relations amongst sites (PCNM3), were selected for all analyses made for all species matrices. In general, the spatial variables representing the broad- and mid-scale relations amongst the sites were more commonly selected than the spatial variables illustrating finer scale relations amongst sites.

Table 3 The selected variables according to the forward selection procedure and their rank order

The diatom community structure

The local and catchment environmental conditions and the spatial variables all explained the diatom community structure, yet their relative importance varied for different species matrices (Table 4). Variables describing the spatial relations amongst sites at broad and medium scales (PCNM 2, 3, 1, 6, 8) explained slightly more (15.1%) of the variation of the whole community structure than the other two variable groups separately. The local variables that explained the variation of the whole community structure (11.9%) were moss cover (%), proportion of boulders (%), colour (mg Pt/l) and proportion of gravel (%). The catchment variables, lake distance index, standard deviation of greenness, shrub (%), and rock and cobble deposit (%), explained almost the same amount of the variation in community structure (12.2%) than the local variables.

Table 4 A summary of variation partitioning results. The significance of each fraction explained is indicated in the table (*P < 0.05, **P 0.01)

The variation partitioning analyses showed that for the whole community the variation in community structure was better explained by the pure spatial (4.9%) than by the pure local (2.6%) or catchment (2.5%) environmental components (Fig. 3; Table 4). The variations explained jointly by the different pairs of variable groups were approximately 4%–5%. The shared fraction between all variable groups was 1.4%. The amount of unexplained variation was relatively large in all models, with residuals ranging from approximately 65% to 84% for different ecological guilds and from 68% to 85% for generalist and specialist species matrices.

The diatom data matrices divided by ecological guilds

Almost the same pattern as with the whole community matrix emerged when only the species found in the ecological guild classification were included (i.e. ecological guilds matrix). Here, the environmental variable groups separately also contributed less than the spatial variables to the explanation of community variation. The pure catchment component accounted for only 3.6% of the variation, whilst the pure spatial component explained 7.6% of the variation. However, when the different ecological guilds were analysed separately, slightly different patterns emerged. Overall, the variations in different ecological guilds were better explained by the pure effects of the local variables and the spatial variables than by the pure effects of the catchment variables. The pure local and pure spatial variables explained the same amount of the variation in the high-profile guild. The pure local component explained more of the variation in the low-profile guild and motile guild than the spatial component. In explaining the variation in the low-profile guild, the catchment component was also important. Only the variation in the planktic guild was best explained by the spatial component. The shared fractions between all variable groups ranged from approximately 0 to 4% in all guilds, but the shared fractions of the spatial variables and the catchment variables were smallest (0% or negative values to 2%). The variation in the low-profile guild was explained best, as the unexplained variation was approximately 65%.

The diatom data matrices divided by ecological specialization

Almost the same picture as with the whole community emerged when only the species found in the specialist-generalist classification were included (i.e. generalist and specialist matrix). But as with the ecological guilds, when the generalists and the specialists were analysed separately, different patterns emerged. The pure catchment component explained much more of the variation in the generalist species (10.9%) than in the specialist species (0.9%). The specialists were better explained by the pure effects of spatial variables than by the pure effects of local or catchment variables. The amount of variation that could be explained was higher for the generalists (31.9%) than for the specialists (14.7%).

Discussion

In stream environments, local community structure typically portrays the effects of both environmental and spatial processes (Heino et al., 2015). Our results showed that local and catchment conditions and spatial variables all affected the organization of the subarctic diatom metacommunity with different relative importances. Our findings suggest that local conditions do not solely determine diatom metacommunity organization, but that there are also spatially structured patterns. Our findings also suggest that diatom communities are jointly structured by environmental filtering and spatial processes (Soininen & Weckström, 2009; Vilmi et al., 2017). These processes, however, play different roles in different species trait groups.

The factors structuring entire diatom communities

The organization of the entire diatom metacommunity was determined by spatial factors and environmental variables at local and catchment scales (supports H1). Thus, our results are consistent with earlier findings (Pan et al., 1999; see also reviews by Soininen, 2011, 2012 and references therein). However, when examining the environmental variable groups separately, our results showed that spatial variables had a relatively large effect on diatom metacommunity organization (contradicts with H2). In combination, local and catchment variables explained more variation than spatial variables alone. Previous studies have found that environmental factors exceed spatial factors in importance, and that stream communities are mostly under abiotic control (Verleyen et al., 2009; Göthe et al., 2013). Our findings are in contrast with many specific studies that suggest that diatom community structures primarily reflect variation in local conditions (De Bie et al., 2012; Gottschalk & Kahlert, 2012). Strong spatial patterns have previously been found mainly in large-scale studies, as in Heino et al.’s (2010) study concerning boreal stream diatom communities, or in highly connected environments, as in Vilmi et al.’s (2017) study in a boreal lake system. Indeed, these differences in findings may be due not only to different spatial scales (Mykrä et al., 2007) and environmental variables examined, but also to different ratios of ecological guilds (Göthe et al., 2013; Vilmi et al., 2017) and ecological specialization (Pandit et al., 2009).

The factors structuring ecological guilds

Our results showed that there was variation in responses to environmental and spatial variables between the ecological guilds (supports H3). Overall, the variations in different ecological guilds were better explained by the local and spatial variables than by catchment variables. Our findings suggest that the high- and low-profile guilds are simultaneously structured by environmental filtering and spatial processes in subarctic streams. However, environmental filtering plays a more important role for the motile guild, and spatial-related processes are important for planktic species. The planktic guild has shown clear spatial patterns in other studies as well (e.g. Vilmi et al., 2017). In boreal streams (Göthe et al., 2013) and lakes (Vilmi et al., 2017), diatom guilds have also been structured by various metacommunity processes. Göthe et al. (2013) suggested that the dissimilar findings between guilds could be due to diatoms’ traits related to dispersal capacity. According to Algarte et al. (2014), firmly attached algae (i.e. low-profile guild species) show clear spatial patterns, as they resist high current velocities (Passy, 2007). Thus, they have lower dispersal rates. In our study, this was not the case, as the local environmental component explained best the variation in the low-profile guild. It has been also suggested that the degree of attachment and the mobility of micro-organisms can affect the extent of dispersal (Vilmi et al., 2017). This can partly explain the importance of spatial-related processes to planktic guild species in our study. Unfortunately, dispersal capacities of diatom species and what traits determine them—at least in terms of long-distance dispersal—are a subject that has not been studied much (Kristiansen, 1996; Vyverman et al., 2007; Casteleyn et al., 2010; Souffreau et al. 2013; Rimet et al., 2014). However, the use of guild division can give us some indirect indications of dispersal processes.

The factors affecting different groups of ecological specialization

Our results showed that generalists and specialists differ strongly in responses to environmental and spatial variables (supports H4; Pandit et al., 2009; Székely & Langenheder, 2014). We thought that generalists would be structured by spatial-related processes because they can tolerate a wide range of environmental conditions (Devictor et al., 2010). However, the variation in the generalist species was explained mostly by catchment environmental factors (contradicts with H6). According to the hierarchical environmental filtering model (Poff, 1997), regional processes determine the species reaching the local habitat. Thus, it is possible that regional processes are limiting factors to generalist species. Our results also indicated that spatial processes are important to specialist species (contradict H5). Dispersal can be more challenging to specialist species because there are fewer suitable environments for them (Kolasa & Romanuk, 2005). However, it is unlikely that dispersal limitation would explain these spatial patterns due to the relatively small spatial extent of our study area and the fact that this study was conducted within one drainage basin (see Mouquet & Loreau, 2003; Leibold et al., 2004; Heino et al., 2017).

Our results are slightly inconsistent with previous studies (e.g. Pandit et al., 2009). With rock pool invertebrates, habitat generalists respond mainly to spatial factors and habitat specialists mostly to environmental factors (Pandit et al., 2009). On the other hand, community composition of generalist bacteria was best explained by environmental factors (Székely & Langenheder, 2014). In addition, for dragonflies, dispersal restricted the distributions of habitat specialist species (McCauley, 2007). In Alahuhta et al.’s (2014) study, the community compositions of both common and rare macrophyte species were explained by environmental factors, suggesting environmental filtering to be more dominant regardless of the degree of rarity.

In our study, the amount of explained variation was much higher for the generalists than for the specialists. This is not surprising, as specialist species have a narrower niche breadth, and environmental factors can affect different specialist species in different ways (Pandit et al., 2009). Overall, some species can be strongly specialized or clearly generalists, but generally, species are something in between these extreme ends (Heino & Soininen, 2006; Pandit et al., 2009). Thus, the generalist and specialist division in our study is rather coarse. However, our results suggest that even this coarse division can be useful when studying the effects of ecological specialization on community structure.

Spatial processes and scale dependency

Our results showed that spatial variables had a much larger effect on diatom metacommunity organization than we thought based on the relatively small spatial extent of our study area (Verleyen et al., 2009; Bennett et al., 2010). However, Astorga et al. (2012) have found that diatom communities are spatially structured in very similar environments at small scale (< 200 km) but not at larger spatial extents. In studies concerning microbial communities, spatial patterns have been found at the small spatial scale in systems of high connectivity (Lear et al., 2014; Vilmi et al., 2016, 2017). Connectivity probably can also play a role in stream diatom metacommunities. Historical factors are important in explaining geographical patterns found in diatom genus richness at regional to global scales, indicating the vital roles of dispersal limitation in structuring diatom communities (Vyverman et al., 2007). Thus, as the spatial variables used in this study can portray also the historical factors and dispersal (Dray et al., 2012), this could explain the importance of these variables also in our study, although the scale in our study is much smaller. However, spatial structures found in small spatial extent and within a region (i.e. Tenojoki drainage basin) are usually mainly related to homogenizing effects rather than dispersal limitations (Mouquet & Loreau, 2003; Leibold et al., 2004; Heino et al., 2017), even though both can produce spatial patterns (Ng et al., 2009). These homogenizing effects can take place via mass-effects (Mouquet & Loreau, 2003). In the Tenojoki drainage basin, diatom communities seem to be structured by processes active at multiple spatial scales, as they have been in comparable studies (Göthe et al., 2013; Vilmi et al., 2016, 2017). However, interpretation of spatial variables is always dependent on the size and connectivity of the study system (Dray et al., 2012).

Concluding remarks

The results of this study should be interpreted with caution, as the amounts of unexplained variation were relatively high. This was partly due to the statistical methods used (adjusted coefficient of determination; Peres-Neto et al., 2006), and low amount of explained variation is common in these kind of studies (e.g. Pandit et al., 2009; Algarte et al., 2014). Moreover, it is possible that some important explanatory variables are missing from the analysis (e.g. Algarte et al., 2014). For example, this study did not include biotic interaction, e.g. grazing. However, previous studies have shown that grazing has no apparent effects, at least on the structure of diatom guilds (e.g. Göthe et al., 2013; Vilmi et al., 2017). Yet, biotic and trophic interactions would be an interesting addition to the study of northern, nutrient-poor environments. According to Berthon et al. (2011), grazing pressure may be higher in nutrient-poor rivers than in nutrient-rich rivers because biofilms are rare. However, a more likely reason for the low amounts of explained variations is the occurrence of stochastic processes (Vellend et al., 2014), as biological communities are formed through very complex processes and interactions. The guild and ecological specialization information were not available for all species and this can have implications on results. However, we believe that our results are representative, because the reduced overall guild and ecological specialization matrices showed patterns similar to those of the entire community matrix.

Our findings suggested that dividing the whole community into different groups by species traits indeed increases understanding of metacommunity organization. Our study showed that diatom communities in subarctic streams are a result of both environmental filtering and spatial-related processes. Future studies should focus on measuring grazing pressure, especially in nutrient-poor subarctic streams, and dispersal rates of diatom species to acquire more reliable knowledge of the processes structuring diatom communities. Focusing on these biological processes would, however, necessitate experimental approaches, which may be complicated at spatial extents comprising entire drainage basins. Hence, large-scale observational studies offer necessary background information for guiding more detailed experimental work and provide important information for biodiversity assessment research.