Introduction

Freshwater mussels are among the most threatened group of species in the US and across the world (Ferreira-Rodríguez et al., 2019; Haag, 2019). A range of human disturbances, such as water-quality pollution, habitat degradation, invasive species, flow alteration, and habitat fragmentation, contribute to their decline (Downing et al., 2010; Galbraith et al., 2010; Haag, 2012, 2019; Modesto et al., 2018). As freshwater mussels are also dependent on fishes as hosts at the larval stage (glochidia) (Haag, 2012; Taeubert et al., 2012), they are also indirectly impacted by human disturbances through the loss of fish hosts (Fritts et al., 2012; Benson et al., 2017; Galbraith et al., 2018).

The importance of the environment and fish hosts to individual mussel species has been well documented and considered in conserving and restoring endangered or threatened species (Haag, 2012; Modesto et al., 2018). As the number of threatened species increases, single-species conservation has been increasingly replaced by multiple-species or community conservation (Regan et al., 2008; Franklin et al., 2011). This shift is particularly relevant for freshwater mussels when a large proportion of mussel species is threatened, and the whole mussel assemblage has declined in most aquatic ecosystems (Haag, 2019; Haag & Williams, 2014; Hornbach et al., 2019). Because almost all freshwater fish species are the hosts of one or more mussel species (Haag, 2012) and individual mussel species declines have been related to a lack of fish hosts (e.g., Fritts et al., 2012), understanding the relationship between mussel and fish species diversity shall help modeling and conservation planning of overall mussel biodiversity. Resource managers would need to know when their conservation efforts should be focused on the physical environment or fish diversity, or both and whether fish-focused monitoring can also capture the status and trends of mussel diversity. Ecologists may need to know whether fish data are essential to model and predict the changes in mussel diversity.

A positive correlation can be expected between mussel and fish SR just as in other parasite-host systems (Wood & Johnson, 2016). However, the reported correlation strength varies greatly among different studies, from nearly perfect (Watters, 1992; Myers-Kinzie et al., 2001; Haag, 2012) to intermediately strong (Vaughn & Taylor, 2000; Krebs et al., 2010; Cao et al., 2013), and weak (Daniel & Brown, 2013; Bey & Sullivan, 2015). One possible reason is that the correlation strength depends on natural environmental setting and human disturbances associated with a given dataset. For example, the stream size and land-use may differ much more among sites in one study than in others. If so, we need to understand how environmental gradients affect the correlation strength. Mussel diversity and abundance were reported to decline substantially in streams while fish assemblages remain stable and healthy (Haag, 2012, 2019). These observations suggest that freshwater mussel and fish assemblages differ in their environmental preferences. However, how mussel and fish SR differently respond to environmental gradients is poorly understood.

One can compare the relationships of mussel and fish SR with the environment and human disturbances by modeling them separately. However, the response curves of SR to an environmental variable can take a variety of forms, e.g., linear, non-linear, step-functions, and multi-modal (Austin & Gaywood, 1994), making it difficult to quantify the differences accurately and precisely. In the present study, we propose to overcome this difficulty by modeling the ratio of mussel and fish SR (referred to as the SR ratio hereafter). If mussel and fish SR values vary along an environmental gradient in the same way, the SR ratio will not change, while if they vary differently along this gradient, the ratio will change. For example, when mussel SR is negatively affected by an environmental factor more than fish SR, the SR ratio will decrease, and vice versa. In the present study, we focus on the relationships between the SR ratio and a wide range of natural environmental variables and measures of human disturbances, such as land-use and spatial connectivity changes.

The responses of mussels to the environment and fish hosts may also vary among different species guilds (Haag, 2012; Randklev et al., 2015; Ries et al., 2015; Hornbach et al., 2018). A species guild can be referred to as a group of species similar in some ecologically relevant way (Wilson, 1999). In the present study, we modeled how different mussel species guilds respond to environmental variables and fish SR. We focused hosting traits and life-history strategies (Haag, 2012; Moore et al., 2021). Host specialists can only use one or a few species for metamorphosis, often in the same family, while generalists use a wide range of species, normally in multiple families (Haag, 2012). The equilibrium species are mainly characterized by a long life span, slow growth rate, and stable habitat requirement; the opportunistic species by short life, fast growth and unstable habitats. The periodic species fall between the two strategists in these characteristics. We ask two questions: (1) is host-specialist SR more strongly affected by fish SR than host-generalist SR? (2) is SR of mussels with equilibrium strategy more sensitive to environmental changes than the SR for the other two types of strategists? The answers to the questions could help us to understand how fish biodiversity changes would affect different mussel guilds or how the SR of a given mussel guild may indicate the impact of human disturbances.

To accomplish the research goals described above, we compiled mussel and fish datasets for 459 stream sites in Illinois, the USA in this study and used Random-Forests regression, a robust machine learning technique (Breiman, 2001; Cutler et al., 2007) to model the relationships of the SR ratio and mussel guild SR with a range of natural environmental and land-use variables. We also showed how the variability of the mussel-fish SR ratio can be directly related to the strength of mussel-fish SR correlation. The findings of this study shall help to explain why mussel-fish SR correlations can vary among different studies and assist resource managers and researchers to identify where mussel SR is more likely restricted by the physical environment or by fish SR and better understand whether monitoring and conservation planning focused on fish assemblages could protect mussel biodiversity.

Methods

Mussel and fish sampling data

We used two published data sets from stream mussel and fish surveys (Cao et al., 2015, 2016) to address our research objectives. The mussel samples were collected with 4-person-hour timed searches over a 200-m reach in wadeable streams in the State of Illinois, an agriculturally-dominated region of 143,969 km2, during 2009–2012, and all individuals, including freshly died ones, were identified to species on site (Cao et al., 2015). The fish samples were collected with electro-seining, a highly efficient method over a reach of 20 times the wetted channel width, no less than 100 m for small streams and not greater than 300 m for large streams and fishes collected were identified to species mostly on site with hybrid individuals removed (IL-DNR, 2010; Cao et al., 2016). Out of the original 937 fish samples collected between 1995 and 2010, we matched 459 with the mussel samples for the location (Fig. 1). Because of the long lifespans of many mussels (most individuals from field surveys were estimated to be > 10 years old), these two datasets were considered well-matched in time. We also compiled the hosting and life-history traits of 39 mussel species recorded in our dataset based on the literature, mainly Haag (2012) and Moore et al. (2021) (Table 1), and calculated SR for each of the five mussel guilds as well as their percentages out of the total mussel SR at each site.

Fig. 1
figure 1

Study region and the locations of sampling sites for both mussel and fish assemblages in Illinois, USA

Table 1 Hosting and life-history (LH) strategies of 39 freshwater mussel species collected from 459 stream sites, Illinois, USA compiled from the literature

Landscape environmental data

Aquatic assemblages are often affected by multiple environmental processes operating at different scales (Poff, 1997; Jackson et al., 2001). A wide range of environmental variables summarized at the stream reach and watershed levels were considered in examining the variation of the SR ratio and guild SR among sampling sites in this study.

A stream reach is defined as a stream segment from the headwater to the first tributary or between two tributary confluences (Brenden et al., 2006). Each sampling site was assigned to a stream reach. From the GIS database of Illinois Department of Natural Resources (Holtrop et al., 2005), derived based on the Great Lakes Regional Aquatic Gap Analysis Project of the US Geological Survey (Brenden et al., 2006; McKenna et al., 2013), 69 environmental predictors were selected to model individual fish and mussel species in the study region by Cao et al. (2015, 2016). These predictor variables describe a wide range of stream environments, including stream size, climate, geology, soil, spatial connectivity, land use, and topography at the channel, riparian zone, and watershed scales. Multiple variables also are used to describe complex factors, such as thermal regimes (e.g., Max, Min, and mean temperature) and stream size (Strahler order, link, and watershed size). We used the same set of predictors in the present study, but included the latitude and longitude of a sampling site to detect potential spatial patterns (Online Appendix 1).

Mussel–fish SR ratios vs. mussel–fish SR correlation

The number of fish per mussel species has been noted to vary greatly among different biogeographic regions (Haag, 2012). We used the inverse of this metric (i.e., the mussel-fish SR ratio) to show the difference in environmental preferences between mussel and fish SR, as mentioned earlier. The SR ratio also can be linked to the SR correlation strength directly. When the SR ratio is constant, all data points in a mussel–fish SR plot shall fall on a single line, and the mussel–fish SR correlation will be perfect (Pearson r = 1.0), and the ratio becomes the slope of the SR plot. When the variability of the SR ratio increases, i.e., data points spreading more widely, the correlation will become weaker. We demonstrated this relationship by (1) dividing the whole dataset into 4 groups based on the SR ratio (< 0.2, > 0.2–0.4, > 0.4–6, and > 0.6), (2) calculating the coefficient of variation (CV) in the SR ratio and the mussel–fish SR Pearson correlation for each of the four group as well as for the whole dataset, (3) plotting the CV values against the Pearson r. We expected that the correlation coefficient decline with increasing CV.

Modeling the SR ratios- and guild SR–environment relationships

Random forests (RF) regression (Breiman, 2001) was chosen to model the relationships between the SR ratio and environmental variables for its strong performance in analyzing complex relationships (Cutler et al., 2007). RF is an ensemble algorithm built on the classification-regression tree (Breiman et al., 1984). Through bootstrapping all predictors, a large number of small random subsets of predictors are generated and used to build separate tree models. One-third of all samples by default, were selected at random and set aside for model prediction assessment, and the other 2/3 were used to build a tree model. As model performance may vary with mtry, i.e., the size of predictor subsets used, 5000 trees were built for each RF model at mtry = 1–20, and five replicate RF models were built at each mtry level. The model with the highest R2 was selected. The importance of a predictor variable was evaluated based on the percent increase of mean standard error (% Inc. MSE) when the order of the targeted predictor was randomized: The higher % Inc. MSE is the more important is a predictor (Breiman, 2001). Partial-dependence plots were used to interpret the response of the mussel–fish SR ratio to a predictor of interest. It is a tool to infer how a response variable (e.g., the SR ratio) changes with a given predictor (e.g., the watershed size) after averaging out the effects of all other predictors (Friedman, 2001).

RF regression is highly resistant to the collinearity of predictors (Breiman, 2001; Cutler et al., 2007), however, a strong collinearity makes the result interpretation difficult. We, therefore, conducted correlation analysis for a subset of key predictors after the models were built to help to identify the likely drivers of the SR ratio and the % guild SR.

We modeled the SR ratio based on all sites, including mussel-absence sites, as it is important to understand why a stream site may support fish species, but not mussels. The guild SRs were strongly correlated with the SR ratio (Online Appendix 2), and their models will unlikely offer much additional information. We thus focused on modeling the percent guild SR (% guild SR) based on 352 sites where mussels were collected.

Results

Across all 459 sites, 39 mussel and 132 fish species were recorded. In both groups, the relative occurrence frequency varied substantially among species, 0.0022–0.4161 for mussel, 0.0022–0.8545. On average, fish species, however, occurred more widely than mussel species by 34% (15% vs. 11% of all sites). Mussel SR varied more substantially among sites than fish SR (i.e., higher CV, Table 2), with no mussels collected at 107 sites. The total number of fish species is three times as high as the number of mussel species. The SR ratio ranged from 0 to 1.1 (mean = 0.21, standard deviation [SD] = 0.19) across the 459 sites and strongly skewed to the left (i.e., low at most sites and high at a small number of sites, Fig. 2). The SR ratio is more strongly correlated with mussel SR (Pearson r = 0.89, P < 0.01) than with fish SR (Pearson r = 0.10, P > 0.05). Fifteen of 39 mussel species are host generalists, and 24 are specialists. Out of the life history strategies, 14, 13, and 12 species are equilibrium, opportunistic, and periodic, respectively (Table 2). However, at the site level, the host generalists and specialists were almost equally common, and opportunistic species were dominant (Table 2).

Table 2 Statistical summary of fish and mussel SR metrics for 459 stream sites in Illinois, USA (G = host generalists; S = host specialists; E = equilibrium strategists; P = periodic strategists; O = opportunistic strategists, see Haag (2012) for definitions; PT = proportion, based on 352 mussel-present sites only)
Fig. 2
figure 2

Relative frequency distribution of mussel-fish SR ratios across 459 stream sites in Illinois, USA with low values at most sites

The correlations between the SR of the five mussel guilds (two hosting- and three life-history-based) and fish SR are generally low (Online Appendix 2) but slightly stronger for the host specialist guild than for the generalist guild (Pearson r = 0.32 vs. 0.43, P < 0.01), something that partly supports our hypothesis. The correlations between the total mussel SR and the SR of different mussel guilds were all strong (r = 0.74–0.92, P < 0.01), indicating that the SRs of all these guilds strongly co-varied with the total mussel SR across the sampling sites.

Mussel-fish SR ratios vs. mussel-fish SR correlation

The whole dataset and the site group with the SR ratio = 0–0.2 reached the highest CV and weakest mussel-fish SR correlations, while the other three groups showed similarly low CV and strong correlations. As a result, the Pearson r rapidly and linearly declined with increasing CV of the SR ratio (Fig. 3).

Fig. 3
figure 3

Effects of the variability measured with coefficient of variation (CV) in the mussel-fish SR ratio on the mussel-fish SR correlation

SR ratio–environment relationships

The RF models explained 47% and 40% of the variance in mussel (mtry = 5) and fish SR (mtry = 6), respectively. The directional responses of the two types of SR to the top predictors are similar, positive to measures of stream size (WT_Area and Link, see Online Appendix 1 for definitions), percent agricultural land in the watershed (WT_Agri), and deep bedrock in the watershed (WT_BG100), and negative to the watershed slope (WT_Slope) or channel slope (Channel_Slope) and percent shallow bedrock in the watershed (WT_BR50). However, the response curves of fish and mussel SR differed in disparate and complex ways (see Online Appendix 3 for examples) that were difficult to quantify. Therefore, our use of the SR ratio is well justified. Noticeably, fish SR was ranked 6th for the mussel-SR prediction importance in terms of % Inc. MSE, indicating its moderate effect on mussel SR in the study region.

The RF model explained 30.24% of the total variance in the SR ratio (mtry = 5). Many environmental variables contributed to the explanation of the changes in the SR ratio, with 20 of those reaching ≥ 15% in Inc. MSE (Fig. 4). The top 10 predictors are mostly natural environmental variables, including stream-size measures (WT_Area, Link), slope measures (W_Slope, WT_Slope, or Chan_Grad), and air temperature (WT_JMean, WT_JMin, WT_GDD). Other important predictors include percent agricultural-land and forest in the watershed (WT_Agri and WT_Forest), the percent watershed with bedrock at given depths (WT_BR50, WT_BG100), latitude, and longitude.

Fig. 4
figure 4

Importance of the top 20 predictors for mussel-fish SR ratios across 459 streams sites in Illinois, USA, measured with percent increase in mean standard error (MSE) when the values of a given predictor is randomized in the Random-Forests regression (see Online Appendix 1 for predictor descriptions)

Partial-dependence plots showed that the SR ratio strongly increased with the stream size (WT_Area or link), deep bedrock (WT_GB100), and WT_Agri, but decreased with stream or watershed slope (Fig. 5A–C, F). The relationship between the SR ratio and climate variables was more complex; the ratio largely stayed at a high level over the lower range of temperature, followed by a rapid decline (see Fig. 5E for example). The ratio also tended to decrease from the North to the South with a peak in the central Illinois (Fig. 5D) and consistently increased from the West to the East. As a result, the ratio is predicted to be the highest in central-eastern part of the study region.

Fig. 5
figure 5

Partial-dependence plots of Random-Forests regression suggesting the responses of mussel-fish SR ratios to key environmental predictors in streams of Illinois, USA

Most of the key predictors strongly co-varied along latitude and longitude. Stream or watershed slope and shallow bedrock (WT_BR50) increase at a lower latitude, and so do temperature and precipitation (Table 3). In comparison, soil permeability tends to decrease at lower latitudes. WT_Agri is negatively correlated with the slopes and positively with bedrock depth (WT_BG100) and soil permeability (WT_Perm); WT_Forest showed opposite trends along these three environmental gradients.

Table 3 Pearson correlations among key natural environmental, climate, and land-use variables for 459 stream sites at the watershed scale in the State of Illinois, USA, all significant at the P = 0.05 level except those labelled with * (see Wasserstein & Nicole, 2016 for the interpretation of P value)

Mussel guild SR in relation to the environment and fish SR

The % guild SR is weakly correlated with the SR ratio (Pearson r = 0.20–0.65, P < 0.01). The model fitting for the SR of the two hosting guilds are relatively low and equal (R2 = 0.23). For the three life-history strategies, the model fittings for the SR of the equilibrium and opportunistic guilds are similar to each other and better than for the % SR of the periodic guild (R2 = 0.30 vs. 0.16).

The top 15 predictors and the directions of their effects varied among the % guild SR (Table 4), but the natural environmental factors, including stream size, slope, climate, and geology are more important than land-use and spatial connectivity. The responses of the % SR of the equilibrium guild are similar to those of the SR ratio, i.e., positive to stream size, bedrock depth (WT_BG100), Agricultural land (WT_Agri), but negative to stream slope and warm-wet climate. The responses of the opportunist SR are opposite, and those of the periodic SR are mixed. The % SR of host-specialist guilds responded positively to stream size, agricultural land, warm-wet climate, and slope (W_Slope) but negatively to latitude (Table 4). The responses of the % SR of host-generalist guilds are opposite.

Table 4 Response types and importance ranking of the top 15 predictors to percent SR of five mussel guilds based on the partial-dependence plots of Random-Forests regression applied to 459 stream sites in Illinois, USA (%IncMSE = percent increase in mean standard error, a measure of importance; see Online Appendix 1 for descriptions of the predictors); responses are categorized as increase (↑), decrease (↓), peaking (^), U- or V-shaped (V)

Discussion

A great amount of effort has been focused on the relationships between individual mussel species and their fish hosts (Modesto et al., 2018). In comparison, less attention has been paid to the mussel–fish SR relationship, which varies greatly among studies (Watters, 1992; Vaughn & Taylor, 2000; Bey & Sullivan, 2015). Our analysis examined how the SR ratios are affected by environmental gradients and how different mussel species guilds are related to the environment and fish SR. We found that mussel and fish SR differed in their responses to certain environmental gradients, and some mussel species guilds are more closely associated with the environment or fish SR. These results can help us to reconcile the inconsistencies in mussel–fish SR relationships previously reported, provide new insights into the relative importance of fish and physical environment, and may inform effective mussel biodiversity conservation.

First, our analysis provided an explanation of the reported variable correlations between mussel and fish SR at the site scale. We demonstrated the clear and negative relationship between the variability in the SR ratio and the mussel-fish SR correlation (Fig. 3). Statistically, the inconsistent mussel-fish SR correlation is driven by different levels of variability in the SR ratio among studies. The ecological question is why the SR ratio varies among sites, regions or studies. Our RF model for the SR ratio showed that the different responses of mussel and fish assemblages to environmental gradients at least partly explained the variation of the SR ratio among sites. Given that the types and lengths of key environmental gradients for mussel and fish assemblages are essentially region- or dataset-specific, the mussel-fish SR correlations can be expected to vary among datasets.

Second, we showed what specific natural environmental gradients mussel and fish SR appeared to respond differently. Both fish and mussel SR increased with stream size (Online Appendix 3), as observed by others (Osborne & Wiley, 1992; Watters, 1992; Krebs et al., 2010). However, the increase in the SR ratio with the watershed size (Fig. 4A) indicates that mussel SR is more constrained by small stream size than fish SR. In fact, there are no or few mussel species in many small streams (Watters, 1992; Myers-Kinzie et al., 2001; Krebs et al., 2010). These observations are not surprising because many fish species can quickly colonize and recolonize small streams with seasonal or unstable flow (Datry et al., 2017), but mussels cannot. A stable flow in larger streams is preferred by both mussel and fish species (Haag & Warren, 1998, 2008; Troia & Gido, 2014), but it appears to benefit mussel even more. Both mussel and fish SR also decreased with increasing slope in our study (Online Appendix 3), but again the decrease in their ratio (Fig. 4B) suggests mussel SR is more sensitive to slope than fish species. A higher slope often implies less stable substrates and higher erosion (Arbuckle & Downing, 2002), which should affect mussel species as benthos and filters more than fishes. Hewitt et al. (2019) also noted that the number of fish species per mussel species that prefer low-slope streams was low, or the SR ratio is high. The SR ratio further decreased with the increasing % of shallow bedrock in the watershed (Fig. 5C). Watershed geology is known to be important for mussel assemblages, probably by affecting stream substrates, slope, flow, and water chemistry (Arbuckle & Downing, 2002; Newton et al., 2008). Future studies should examine how bedrock depth and other geological characteristics affect stream habitats and flow regimes in the study region. In Illinois, the spatial patterns of the watershed topography, soil, and surficial geology are known to be strongly affected by the Illinois and Wisconsin glaciations (Forbes, 1908; Grimley et al., 2017). The glacial history may also have affected the dispersal processes of both fishes and mussels and then the mussel SR and SR ratio. However, both metrics increased significantly from Northeastern to Central-Eastern Illinois, although both areas were glaciated, and decreased in Southern Illinois, which was not glaciated (Grimley et al., 2017). This pattern implies little direct effect of glaciations on these two metrics. Overall, our results suggest that the responses of mussel and fish species to main environmental gradients often are the same in direction but different in magnitude.

Climate often constrains the abundance and distributions of aquatic species through thermal and flow regimes (Olden & Naiman, 2010). In our study, both the SR ratio and the % SR of the five mussel guilds are strongly associated with temperature and precipitation (Fig. 4, Table 4), often decreasing with increases in both variables. This result seems to suggest that climate is limiting mussel SR in Illinois. However, the climate in the region strongly co-varied with stream slope, surficial geology, and soil permeability (Table 3). Several southern states in the US support higher freshwater mussel diversity in spite of their hotter and wetter climate, such as Tennessee and Arkansas in the US (Haag, 2012). The climate effect observed here is likely to be an artifact, and the real drivers are the co-variables above. Co-variation among environmental variables can present a challenge in modeling and predicting the impacts of climate changes, and modeling based on climate variables alone may be not reliable. However, the climate gradient is global and largely directional, while other environmental gradients, such as those mentioned above, are typically regional or local. Therefore, modeling the effect of climate at the level of continental or a large geographic region, such as the whole Midwestern US (Schartel et al., 2021), may overcome the difficulty associated with the co-variation.

Land-use is a major threat to freshwater biodiversity (Dudgeon et al., 2006; Blann et al., 2009). The negative impact of agricultural land-use on freshwater mussels has been widely reported (Arbuckle & Downing, 2002; Poole & Downing, 2004; Hornbach et al., 2019). However, our analysis contradicted these previous studies, i.e., the SR ratio appeared to increase with the percent agricultural land in the watershed (WT_Agri) and decreased with the increasing percent forested land in the watershed (WT_Forest). This result also disagrees with one previous study in the central-eastern part of the region (Cao et al., 2013). We think that this inconsistency is caused at least partly by the strong co-variation of land use with the stream slope, soil permeability, and depth of bedrock in the watershed in our study region (Table 3). This reasoning should also be applicable to the % SR of the mussel guilds. As a result, we could not effectively assess the effects of land use on the SR ratio and the different mussel guild SR in this study. As with climate impact, researchers need to carefully examine whether or how land use may co-vary with natural environmental gradients when assessing its ecological impacts and strive to overcome the challenge.

How important fish hosts are to mussel species relatively to the physical environment appears debatable. Inoue et al. (2017) hypothesized that mussel distributions are nested within fish-host distribution, and mussel species distribution is largely controlled by abiotic factors (Inoue et al., 2017). We showed that fish species are distributed more widely on average than mussel species, and mussel-fish SR correlation is weak at the site scale. These results support the hypothesis above. The average SR ratio is also low (0.21), which is equal to approximately 5 fish species per mussel species, compared with 1.9–13.7 fish species per mussel species previously reported (Haag, 2012). The low SR ratio suggests generally high fish host availability to mussel species, which agrees with the low importance of fish SR to mussel SR compared with environmental variables in our RF model. Physical environmental data thus may be generally sufficient for mussel-SR modeling and conservation planning, although fish hosts are known to be critical for specific mussel species (Fritts et al., 2012; Benson et al., 2017; Galbraith et al., 2018). We also noted that in some small, but significant number of streams the SR ratios were high (Fig. 1). Fish SR may be more important in those streams because mussel species could compete for fish hosts (Rashleigh & DeAngelis, 2007; Hewitt et al., 2019) and needs to be carefully weighed in conservation planning for those streams.

The wide range of the SR ratios observed in the present study also implies that fish SR is a poor index of mussel SR in the study region. This result agrees with the observations that mussel SR and abundances can change substantially while a fish-based biological index of integrity (IBI) remained stable (Cao et al., 2017; Haag, 2019). To effectively protect mussel biodiversity, mussel assemblages need to be routinely monitored by environmental or natural resources agencies rather than relying on fish SR and IBI.

The relationships between the % SR of the five mussel guilds and the environment and fish SR largely agree with the general expectations (Haag, 2012; Randklev et al., 2016). The positive response of the % SR for the equilibrium guild to stream size and the negative response to stream slope (Table 4) agree with the preference of the guild for generally stable flow in large streams. Daniel & Brown (2013) also reported this species guild was limited to high-order streams. The positive responses of the % SR of the opportunistic mussel guild to slope and negative response to stream size agree with the general observation that the species of this guild often occupy hydrologically unstable or disturbed habitats (Haag, 2012; Randklev et al., 2016). The correlations of fish SR with the SR of host-specialists is only moderately stronger than with the SR of host generalists. We wonder whether the difference in correlation would be greater if phylogenetic diversity instead of SR is used to measure fish diversity. If fish phylogenetic diversity is indeed more closely related to the % SR of host specialists, the higher % SR of host-specialists at low latitude (Table 4) may be explained because fish phylogenetic diversity increases in the South-East in the US (Qian et al., 2020).

Our model explained a significant proportion of the variance in most response variables (e.g., the SR ratio), but a large proportion remained unexplained. Aquatic biodiversity is affected by many processes operating at different spatial and temporal scales (Poff, 1997; Jackson et al., 2001). We focused on the GIS-based landscape-level variables in this study. Local habitat characteristics, water quality, and flow regimes are among other potential drivers of the SR ratio not considered. However, it is challenging to effectively characterize these factors. Water quality is often driven by weather events and flow (Sherson et al., 2015), and continuous monitoring over the long term is needed to assess its impacts on aquatic biodiversity, but unavailable in the study region. Flow data also generally fall short for wadeable streams. With continuous monitoring loggers for water quality and flow increasingly applied to stream surveys (Chapin et al., 2014; Parker et al., 2020), these difficulties could be overcome over time. For the modeling of mussel guild SR, the uncertainty associated with the categorization of biological traits in some species (Moore et al., 2021), may also contribute to the unexplained variance. Finally, biological pathogens could be a key diver of mussel diversity, but are poorly understood (Brian & Aldridge, 2019; Waller & Cope, 2019; Richard et al., 2020). The need for their inclusion in mussel surveys and SR modeling is warranted.

In conclusion, the mussel-fish SR ratios were generally low but highly variable among sampling sites. The high variability appeared caused by the different responses of mussel and fish SR to multiple environmental gradients, including stream size, stream slope, and watershed geology. The type and lengths of key environmental gradients likely vary among datasets, and so are the mussel-fish SR correlation. Overall, the physical environment appeared to be more important to mussel SR than fish SR in the study region. Land-use and climate could strongly co-vary with other environmental factors, and great caution is needed to model and predict their effects on mussel assemblages.