Introduction

Managers of water bodies in Europe are required to assess the water quality of lakes under the terms of European legislation adopted in 2000: the Water Framework Directive 2000/60/EC (WFD). This assessment must be conducted in terms of biological quality elements (BQEs), which include macrophytes (aquatic plants). BQEs are intended to describe subsets of the biological community, which have inherently highly complex and variable distributions, causing uncertainty in their use as biological indicators (e.g. Capers et al., 2010). Consequently, creating reliable assessment methods for these BQEs has been a major challenge for the monitoring authorities across Europe since the adoption of the WFD (Poikane et al., 2011). In general, assessment methods condense the taxonomic and distributional information gained from macrophyte surveys into metrics, which are usually designed to reflect water quality in terms of the water’s biota (e.g. Penning et al., 2008a; Birk and Willby, 2010).

The WFD requires that estimates of confidence and precision be included in the assessment of the status of lake BQEs. Understanding the effect of sampling variation and other sources of uncertainty on the ecological status class assessment and underlying metrics is essential in providing these estimates. Sources of uncertainty include natural spatial and temporal variation, sampling methodology and modelling of reference conditions (Clarke & Hering, 2006; Clarke, 2012). For macrophyte status assessment the sampling methodology is an important source of uncertainty. Standardised, objective, and repeatable monitoring methods are essential in monitoring programs with aims to detect anthropogenic impact on lake ecosystems. Results of macrophyte surveys are extremely sensitive to both vertical and horizontal variability of macrophyte communities (Jensen, 1977; Janauer, 2002; Hurford, 2010). In addition to the spatial variability there are potential errors related to the recognition and identification of individual species and also especially to abundance estimations of vegetation.

Previous work on running waters in the EU STAR (Standardisation of River Classifications) project showed that inter-surveyor differences were low and the influences of temporal variation (years and seasons) and shading were slightly stronger (Clarke & Hering, 2006). The strongest variation was due to habitat modifications, but few metrics were of sufficient precision in terms of sampling uncertainty to be useful for estimating the ecological status of rivers (Staniszewski et al., 2006). However, the probability of misclassification of a site was found to be largely associated with classification methodology (Szoszkiewicz et al., 2007, 2009).

Assessment of variability in macrophyte assemblages across a range of habitats cannot be adequately performed without accounting for the known natural and anthropomorphic determinants of those communities. For example, it is known that community structure is strongly related to alkalinity due to carbon uptake chemistry; high alkalinity waters are suitable for species that can utilise bicarbonate ion as a carbon source instead of, or as well as, carbon dioxide (Vestergaard & Sand-Jensen, 2000). In addition, alkalinity affects nutrient availability, by both reducing the decomposition inhibiting effects of acidification, and by providing bicarbonate ions which can compete with orthophosphate in bonding with cations (Smolders et al., 2006). The consequence of this is that alkalinity, eutrophication and aquatic macrophyte richness are closely linked, with eutrophication and alkalinity having a positive association (e.g. Penning et al., 2008b), and species richness having a hump-back distribution in relation to both nutrient availability and alkalinity (e.g. Murphy, 2002). Similarly to trophic status, water colour related to humic substances also affects the number of available habitats, which are also determined by lake area and length of the shoreline (Rørslett, 1991; Murphy, 2002). In addition, macrophyte richness can be determined by altitude (Jones et al., 2003), latitude (Heino & Toivonen, 2008) and available routes for dispersal and the regional species pool (Heegaard, 2004).

Aquatic macrophyte metrics are often used to describe plant community responses to environmental pressure (e.g. Penning et al., 2008b). In Europe, water managers are required to assess water quality in terms of macrophyte community, and metrics are a tool used to summarise the effects of human pressure. Eutrophication pressure is usually described using total phosphorus (TP) content of water as a proxy. This has lead to development of large numbers of phosphorus sensitive indicators used in assessment and especially in the Europe-wide WFD common intercalibration exercise (Poikane et al., 2011; Birk et al., 2012). In this study, we have used some of the very few known metrics that can be applied to aquatic macrophyte data collected across Europe.

This study aimed to assess the relative importance of different sources of variation in the sampling data on uncertainty in the available metrics. The general aim of this study was to assess uncertainty in various macrophyte metrics, which might be used in assessing status of this BQE in lakes. This has been achieved by analysing a dataset collected as part of the EU WISER (Hering et al., 2012; Water bodies in Europe: Integrative Systems to assess Ecological status and Recovery) project using a common standardized sampling method from 28 lakes in 10 European countries.

In our study, we focussed on four research questions. First, we assessed how qualitative versus quantitative (presence–absence vs. abundance) data affect metric results and their uncertainty. Second, we analysed how the inclusion or exclusion of helophytic taxa affects the results of the metric. Third, we assessed the uncertainty related to surveying only the 0–1 m depth-zone compared to surveying the whole depth range of potentially colonized area. Finally, we evaluated the variability of the different metrics between lakes, within a lake, and between transects. All four questions are relevant to macrophyte assessment methodologies in Europe (Penning et al., 2008a, b; Kolada et al., 2011).

A further, practical aim of this study was to give recommendations on appropriate sampling design and analysis methods that are most likely to reduce uncertainty in the assessment of the status of lake macrophytes.

Methods

Data collection

A sampling campaign was conducted in the summer of 2009, when 28 lowland clear-water (non humic) lakes from 10 countries representing broad geographical and trophic gradients were selected for survey (Table 1). Lakes selected were between 0.3 and 7.2 km2 in surface area, below 250 m altitude and had a mean depth between 3.8 and 18 m. Lakes were selected to represent a range of eutrophication pressures and a range of geographical types. Within each lake, six stations evenly distributed along a shoreline were identified (the first assigned arbitrarily, and the other five at regular intervals around the shore). Within each station three parallel transects were surveyed by boat, each being 5 m from its neighbour and each starting at the shore and proceeding towards the centre of the lake (Fig. 1). Transects followed straight lines as closely as was practicable. Each transect was divided into depth-zones of 1 m depth intervals down to the limit of macrophyte colonisation and in each depth-zone five randomly selected macrophyte sampling sites were used. At each sampling site a single sample was gathered from a rake dragged along the bottom for approximately 2 m, and supplemented by observation through a bathyscope, where this was possible. In each sample all truly aquatic species, and a pre-defined selection of emergent taxa, were identified and recorded. Identifications were performed by experienced field surveyors in all cases and uncertain specimens were referred to taxonomic experts.

Table 1 List of lakes surveyed for macrophytes in 2009. GIG (WFD Geographic Intercalibration Group) regions are Central-Baltic (CB), Nordic (N) and Mediterranean (Med). Alkalinity (Alk.) types are Low (<0.2 meq L−1), Medium (≥0.2 and <1 meq L−1) and High (≥1 meq L−1). Provisional (Prov.) status refers to local assessment of the macrophyte biological element of either High (H), Good (G), Moderate (M), Poor (P), or Bad (B). Eutrophication (Eutro.) and Hydromorphological (Hymo.) pressures were subjective expert judgements of surveyors, of low, medium or high pressure. Where information was not available this is denoted with a dash
Fig. 1
figure 1

Idealised sampling design used in the common field sampling protocol, employed in 2009. Three transects at one of six stations are shown

Additional to the aquatic macrophyte survey data, surface water samples were collected from a central station of each lake at least twice during the growing season and alkalinity and TP were measured from these samples. Averages of these measurements, from each lake, were used in later statistical analyses.

Data collected during the field campaign were compiled into a database format, maintaining the hierarchical structure of the data in a form analogous to the sampling design. In this format, each observation of a macrophyte taxon in a sample was given a separate record. Data were extracted from the database at various levels. These levels were depth-zone within transect, station and whole lake. At the lowest level, a taxon was present if it was recorded at least once. Abundance at each level was measured as a relative point frequency, which for each taxon was the number of observations of the taxon, divided by the total number of observations of all taxa at that level.

Data analyses

Exploratory multivariate analyses

A multivariate clustering analysis using group averaging was performed for quick exploration of the data available within the dataset to assess whether unexpected subsets of data could be distinguished, using the taxonomic composition of the samples, that were linked more to country or location than to the environmental variables of interest (TP and alkalinity). Species abundance-data were averaged per lake and analyses were performed using the statistical software programme PRIMER6. A similarity matrix was calculated using the Bray-Curtis similarity index on the non-transformed abundance-data. Using this similarity matrix, a dendrogram was plotted using group average to visualize specific subgroups of data.

Calculation of metrics

Taxon-specific trophic rank scores, also known as Intercalibration Metric scores and referred to in this report as ICM-LM scores (Intercalibration Common Metric for Lake Macrophytes), were supplied by Nigel Willby, of the University of Stirling, United Kingdom. These scores have been used in the Water Framework Directive Intercalibration Exercise for lake macrophyte BQEs as a means of comparing lake macrophyte status across Member States, where sampling methods and metric derivations are not consistent. For submerged aquatic plant taxa, scores were derived using methods similar to those used by the United Kingdom Technical Advisory Group (UKTAG, 2009), and Birk and Willby (2010). In general, scores were calculated by rescaling the median of the logarithms of the concentration of TP concentrations in lakes across Europe in which each taxon has been recorded as present. These scores are available in Kolada et al. (2011). Scores are combined across sites to form a metric, either as a simple mean, or by some measure of abundance to weight the mean (see Eq. 1, below). The site metric is intended to be representative of the nutrient status of the water.

The ICM-LM scores were only available for real hydrophytes, as this metric was designed to be used with submerged taxa. Ellenberg’s Nitrogen values for soil fertility (scores from 1 to 9; Ellenberg et al., 1991) were compiled for all taxa in the dataset to test the use of a metric with and without helophytes. We supplemented the original values with British values where original values were missing (Hill et al., 1999). Even with these supplements, there were 13 taxa, notably charophytes, for which no Ellenberg score was available, but for which an ICM-LM score was available. Scores for these taxa were inferred using a regression relationship between the ICM-LM and Ellenberg scores for all species with both values (Ellenberg = 0.22 + 0.79 × ICM-LM, n = 51, R 2 = 0.64). These modified taxon-specific Ellenberg scores were then used to calculate an average Ellenberg-N metric per lake. Taxa for which neither Ellenberg nor ICM-LM scores were available and excluded from further analyses (details given in Results section).

For each transect, where at least one taxon for which a score was known, ICM-LM and Ellenberg metrics were calculated from scores both as simple averages of the scores of the taxa found (unweighted), or as weighted averages,

$$ M_{\text{w}} = \frac{{\Upsigma S_{i} \times A_{i} }}{{\Upsigma A_{i} }} $$
(1)

where M w is the weighted metric, S i is the score for taxon i, and A i is the abundance of taxon i, for all taxa in the sample. In many cases, these metrics have been calculated for subsets of the macrophyte community, such as ‘submerged only’, or ‘helophytes’. Because not all transects contained any taxa with scores, it was not always possible to calculate a metric for a transect. For some of the analyses described below this limitation applied to entire lakes, for example for lakes where no taxa were found below 1 m depth, it was not possible to examine the effect of depth-zone.

Species richness was calculated as the number of macrophyte taxa identified at an individual sampling location. Maximum depth of colonisation (C max) was determined as the greatest depth in which rooted plants were found at each transect.

Uncertainty assessments

The WISER lake macrophyte data were used to examine variability associated with the four levels of the nested sampling scheme: transects, stations, lakes and countries. We assessed this for each of the response metrics described above. Values for each metric were calculated for each transect, the metrics for progressively higher nesting levels in the sampling design (sampling stations, lakes, countries) being derived implicitly as part of the model fitting process. We fitted linear mixed effects models (nlme package; Pinheiro and Bates, 2000) to the dataset, using the R environment for statistical computing (R Development Core Team, 2011). The levels of the sampling hierarchy were specified as nested random effects, with the lowest level, variation between transects, forming the residual. As a measure of uncertainty, we used standard deviation, as provided by the nlme package, to calculate absolute and relative variance at each of the nested levels of the overall sampling design.

We used lake-level alkalinity and TP concentration as covariates in the analyses for two reasons. First, these variables define strong gradients in the dataset (Table 2). Second, the response of the macrophyte metrics to lake-level TP, accounting for variations in alkalinity, and for uncertainty in surveying lakes, is of considerable interest in itself (Penning et al., 2008a, b). As these covariates were measured at the lake-level they explain variance between lakes and countries, but not within lakes. As alkalinity information for Étang des Aulnes (France) was not available, data from this lake were excluded from further analyses. For the analyses, values for TP and alkalinity were log transformed. Metric response values were un-transformed except for species richness, which was square-root transformed before model fitting.

Table 2 Parameters for multivariate model of transect-level abundance-weighted and presence–absence ICM-LM metrics as a function of alkalinity and TP, including their covariance at all levels of the model

Models were fitted using Residual Maximum Likelihood Estimation (REML) to produce unbiased estimates of random effect variances, but any comparison of models differing in their fixed effects was undertaken using Akaike’s Information Criterion and models fitted by standard Maximum Likelihood (ML).

To assess how qualitative versus quantitative (presence–absence versus abundance) data affect metric results and their uncertainty, we compared variance components from models using metrics calculated from presence/absence data with those using metrics calculated from scores weighted by point frequency. To assess how the inclusion or exclusion of helophytic taxa affect the results of metrics, we compared the results of models run using all species, with those run using only submerged and floating plants. To assess the uncertainty related to surveying only the 0–1 m depth-zone compared to surveying the whole depth range of potentially colonized area, each transect was divided into two depth-zones, above and below 1 m of water depth. Metrics were calculated at the transect-level for both depth-zones and the variance components, as well as the response to the covariates from both models were compared.

This study does not address the effects of probability of misclassification of water bodies in status classes as common status boundaries have not yet been defined for the metrics used in this study.

Results

There were 124 plant taxa recorded from the 28 lakes surveyed. 15 taxa, including filamentous algae, an undefined moss, woody species (for example Alnus sp. and Salix sp.) and some taxa recorded at genus or higher level, were excluded from all analyses. The remaining 109 taxa included 101 recorded at species level and 8 at genus level (Callitriche, Chara, Fontinalis, Mentha, Nitella, Nymphaea, Sparganium and Utricularia). There were 10 taxa for which neither an Ellenberg-N nor an ICM-LM score was available, and were therefore not used in the calculation of metrics. None of these 10 taxa occurred in more than two lakes.

Exploratory multivariate analyses

Three main groups of lakes were distinguished from the similarity analyses (Fig. 2): (1) mainly higher alkalinity central European lakes (France, Estonia, Poland, Germany, Denmark), with two more eutrophic, moderate alkaline lakes from the Northern GIG (United Kingdom, Finland), (2) a small group of higher altitude lakes (all Italian lakes and two Norwegian lakes) and (3) Nordic moderate and low alkalinity lakes (Finland, Sweden, Norway, UK). Similarity between lakes was never more than 60%. There was one outlying lake, Skirösjön in Sweden. Only one submerged species was recorded from this lake so it was considered to fall outside the three defined groups.

Fig. 2
figure 2

Hierarchical clustering of the lakes (country code followed by lake name), based on the abundance of aquatic plants, showing three main groups outlined in dashed ovals

Qualitative versus quantitative macrophyte data

Unweighted and abundance-weighted LCM-LM metrics for lakes were highly correlated (Table 2; Fig. 3). This correlation was highest at the lake scale, and progressively lower as one moves to the finer scales of station and transect within the lake. Compared to unweighted ICM-LM, the abundance-weighted ICM-LM gave a steeper (0.46 vs. 0.38) but slightly less precise (standard error of 0.30 vs. 0.28) association with TP, while the response to alkalinity was similar (Table 2). The unweighted ICM-LM shows greater variance at the lake scale than weighted ICM.

Fig. 3
figure 3

Comparison of ICM-LM unweighted metrics (based on presence/absence data) and metrics weighted by abundance, calculated at the lake-level for submerged and helophyte taxa

Helophytic taxa

There was a weak relationship between Ellenberg metrics calculated on weighted averages of submerged taxa only and the same metric calculated on helophytes only. When calculated at the lake scale, metrics based on Ellenberg scores for helophytes only had a much smaller range than their counterparts based on scores for submerged species (Fig. 4). Residual correlations between Ellenberg scores calculated for submerged taxa only and for helophytic taxa only were relatively weak (Table 3). This is likely because there is no overlap in the taxa used to calculate the alternative metric formulations. Furthermore, the helophyte metrics had weaker relationships with both pressure variables TP (results not shown) and Alkalinity (Table 3).

Fig. 4
figure 4

Comparison of Ellenberg metrics calculated from only submerged taxa with the same metrics calculated from only helophytes, for each lake in this study. All metrics produced as a weighted average of taxa scores

Table 3 Parameters for multivariate model of Ellenberg metrics based on only helophyte scores and only submerged taxa scores, as a function of alkalinity, including their covariance at all levels of the model

0–1 m depth-zone versus whole depth range

ICM-LM metrics calculated from the scores of plants found in depths greater than 1 m were lower than corresponding metrics calculated from shallower water (<1 m) plants (intercept of model at 4.51 vs. 5.05), indicating that, on average, species found in the shallow zone have higher trophic status (Table 4).

Table 4 Parameters for multivariate model of ICM-LM (abundance-weighted for submerged species only) for depth-zone <1 and >1 m versus alkalinity and TP

ICM-LM metrics calculated from deeper (>1 m) taxa were less variable between stations within lakes but marginally more variable at the station and transect scales (Table 4). There was a fairly high residual correlation between ICM-LM for the different depth-zones at the lake (0.79) and station (0.67) scales, but low correlation at the transect scale (0.1; Table 4). Hence, variation between depth-zones at the finer spatial scale (transect) tends to be averaged out between transects within stations and between stations within lakes.

Variability of metrics between lakes, within a lake, and between transects

For all metrics, the proportion of variance at the transect-level was much smaller (generally around half) than at the station level (Table 5). The proportion of variance at the country and lake sampling levels was dependent on the metric used, and on whether the explanatory driver variables were included in the model. Except for the Richness metric, inclusion of the explanatory variables always reduced the between-lake (country and lake) proportion of variance, mostly by reducing the variance at the country level. ICM-LM, compared to Ellenberg, illustrated a slightly higher proportion of variance between lakes, with correspondingly less variance within lakes. Maximum growing depth also behaved similarly to ICM-LM, although the covariates appeared slightly more successful in explaining between-lake variance. The Richness metric followed a different behaviour; introduction of the covariates reduced the variance between lakes but accentuated the variance between countries. Total between-lake variance remained roughly constant (Table 5).

Table 5 Proportions of variance at different levels of the sampling strategy for four different metrics and two formulations of the model: with and without TP/alkalinity

Although inclusion of the explanatory variables, TP and Alkalinity, reduced variance in the models, their relationships to the metrics were not always significant at the traditional (P < 0.05) level (Table 6). Alkalinity showed strong relationships with all metrics (P < 0.01) except Richness. Relationships between TP and metrics were always in the expected direction (higher TP corresponded to higher ICM-LM and Ellenberg, and lower C max and Richness), but for both ICM-LM and Ellenberg the relationships were relatively imprecise. This general pattern was confirmed through re-fitting models using ML estimation and comparison of Akaike’s Information Criteria (AIC) values. The significant relationships between TP and both C max and Richness metrics were notable. We re-fitted the C max and richness TP/Alkalinity models to the subset of data with an ICM-LM score. The C max–TP relationship was robust to this fitting to a smaller subset of the data, but the richness relationship was not. 49 transects had values for richness but not ICM-LM (meaning that plants were recorded from these transects but these plants had no ICM-LM score, mostly because they were helophytes); these were spread across 12 lakes, the lake with the largest number of transects lost being Glindower See with 15 (only one non-helophyte taxon was recorded from this lake). For C max, it is notable that the strong relationship with TP was entirely dependent on alkalinity also being in the model. Without the inclusion of alkalinity as a co-variate, the C max–TP relationship was very weak (results not shown).

Table 6 Significance (P values) for approximate tests for TP and alkalinity fixed effects for models for each metric in Table 5, and numbers of samples at each level of the model

Discussion

This study assesses the complex spatial variability in macrophyte communities, the effect of this variability on various plant metrics, and the implications to water managers, especially in relation to appropriate survey design. Although the study focuses on the assessment of aquatic macrophytes in European lakes, the results have implications for all BQEs in all WFD waterbody types, and indeed for any assessment of the quality of a biological community that uses a metric derived from taxonomic data, anywhere in the world.

Abundance-weighted metrics are preferable to metrics calculated from presence–absence data, but only when all sampling is done using the same methods.

In this study, metrics calculated as abundance-weighted means of taxon-specific scores provided a steeper relationship with the nutrient pressure (TP), and should therefore be considered better indicators of this pressure. It is arguable that abundance-weighted metrics should be used to assess ecological change because they reflect changes in the abundance of taxa, which cannot be detected by metrics based only on the presence or absence of taxa.

The results in this study are contrary to Penning et al. (2008a), who found that there was a little evidence of benefit in using metrics calculated using mean scores weighted by abundance. In fact, Penning et al. (2008a) found that relationships became weaker when plant abundance was used to weight metric scores. In that study, data were from multiple sources and collected using disparate sampling and quantification methods. The abundance data in the Penning et al. (2008a) study had to be re-scaled to the lowest resolution abundance scale from within the collated datasets, which had only three possible values. This accounts for the observed relative imprecision associated with abundance weighting.

The use of helophytes in the calculation of metrics appeared to provide little additional information, and metrics based on helophytes do not respond as well to nutrient pressure (TP) as do the submerged species.

The data used in this study provide a stronger basis for these conclusions than has been previously available to answer this question, and these conclusions are consistent with Penning et al. (2008b). Helophytes are less affected by water quality than submerged plants as their environment is not sub-aquatic, so their response to eutrophication is obscured by soil trophic characteristics, exposure, shoreline management and especially water level fluctuation dynamics (e.g. Coops et al., 1994). It is possible that the use of large datasets collated from multiple sources will provide spurious answers to this question, as it is likely that bias in sampling is related to trophic status. In regions with lakes where the submerged taxa are highly visible, flourishing and diverse, sampling effort will be concentrated on these plants, in contrast to regions where lakes are more eutrophic, so predominately have few submerged taxa, but a flourishing emergent community, where it is likely that sampling effort will concentrate on the helophyte taxa.

It should be noted that this assessment has been made on the basis that eutrophication pressure has been measured in terms of phosphorus, but the impact has been measured in terms of the Ellenberg metric, which is intended to reflect nitrogen status (Ellenberg et al., 1991). Unfortunately, neither a measure of eutrophication based on nitrogen nor a measure of impact (for helophytes) based on phosphorus, was available for this study.

Surveying only shallow vegetation may result in a worse classification of the macrophyte BQE than surveying the entire depth range of plant colonisation.

Calculation of the ICM-LM metric from only shallow vegetation (<1 m) resulted in a lower ecological status than calculation based on plants from the entire depth range of colonisation. Also, the ICM-LM metric calculated on macrophyte data from deeper samples (>1 m) showed a steeper and more precise relationship with TP than the metric calculated from shallow samples. In this study, higher ICM-LM scores were obtained from shallow zone samples than from deeper zone samples. The shallow littoral zone is often affected by incoming ditches and also provides sheltered habitats for species preferring more nutrient rich conditions (Alahuhta et al., 2011). Deeper areas are also typical habitats for oligotrophy-indicating large isoetids in soft water lakes (Murphy, 2002). This has important implications for the assessment of macrophyte status of lakes. If an assessment method uses only shore-based data (obtained by wading), it is likely to result in an assessment of condition that is worse and less precise than if the method used data from deep water as well (obtained by boat). Overall, assuming eutrophication from excess phosphorus is the stressor of prime interest, including macrophyte data from the full depth range is likely to give a more precise and less biased estimate of status.

Within lakes, metrics were twice as variable between stations as between replicate transects (5 m apart). Variance in metrics between lakes depends on the metric used, and on whether explanatory variables are included.

These results support the use of more sampling stations in macrophyte surveys to improve precision of macrophyte metrics, and show that sampling repeat transects at a station is less important. The results also illustrate that differences in the number of transects for which metrics may be calculated can have a strong influence on the results (Jensen, 1977). In particular, as TP levels increase, taxa richness decreases, but the number of taxa from which metrics such as ICM-LM can be calculated decreases even more rapidly. Increased imprecision of metrics associated with low richness of indicator taxa, and at the most extreme, non-calculability of such indices can have a significant influence on perceived metric performance. Therefore, to maintain the same degree of uncertainty, more sampling is required at either end of the trophic scale, when there is less vegetation to be sampled.

This study highlights the importance of alkalinity in the assessment of aquatic plants, especially when considering TP as an explanatory variable. TP and alkalinity are fairly well correlated in the dataset; hence it is not surprising that in some cases either variable on its own may show apparent relationships with metrics. In particular, in this dataset, for ICM-LM and Ellenberg, alkalinity was clearly the dominant explanatory variable, and although the partial relationships with TP were in the expected direction, they were less precise. Furthermore, the fact that the precision of the relationship between TP and C max was conditional on alkalinity also being in the model is notable and highlights the inter-relatedness of these variables.

Recommendations for sampling, data analysis and assessment methods

This article supports the following recommendations:

  1. 1.

    Assessment methods should include samples from the entire depth range of aquatic vegetation, as using only shallow samples can result in a worse assessment of trophic status.

  2. 2.

    Submerged taxa should be used in the assessment of the status of lake macrophyte communities. Helophytic taxa should not be used when assessing the effects of eutrophication pressure as they do not respond in the same way. Helophytes may still be useful in the assessment of hydromorphological pressure and general degradation.

  3. 3.

    Assessment of lake status should use data sampled from multiple stations around a lake.

  4. 4.

    In order to control metric uncertainty, more sampling is required in lakes where macrophytes are scarce or taxa richness is low. At these lakes, scores of individual taxa can have a much larger impact than in lakes with more macrophyte cover or more taxa.

  5. 5.

    Assessment methods should use quantitative data (not just presence/absence) where possible, but only in cases where all data has been collected using the same methods.

  6. 6.

    Examination of uncertainty in metrics should not be undertaken in the absence of the relationships between metrics and stressors. In the worst case scenario, a metric may illustrate desirable properties of low variance within lakes relative to variance between lakes, but may have undesirably low response to stressors.