Introduction

Diatoms are excellent indicators of environmental change in aquatic ecosystems as many taxa have distinct ecological requirements, limited tolerances to different water quality parameters and are highly sensitive to changes in their environment (Reid et al. 1995). Quantitative methods for relating diatom assemblages and water chemistry include modern calibration datasets and diatom inference models (i.e. transfer functions). These can be applied to sub-fossil diatom assemblages in sediment cores to reconstruct past water chemistry and to investigate past environmental and climate changes (Birks 1998). Much published work has been focused on inland lakes (e.g. Fritz et al. 1991; Bennion et al. 1996; Bigler and Hall 2003) but there are relatively few equivalent datasets for coastal lake and estuarine environments. However, some inference models have been developed and applied (e.g. Juggins 1992; Gehrels et al. 2006; Ryves et al. 2004; Ellegaard et al. 2006; Horton et al. 2006; Weckström et al. 2004; Hassan et al. 2009). In Australia, diatom inference models for coastal lake and estuarine environments have so far been developed for relatively small geographic regions (e.g. Hodgson et al. 1996; Saunders et al. 2007, 2008; Taffs et al. 2008). Yet in some studies it is desirable to have diatom-based inference models that span a wider range of modern analogues, over a wider geographic area to better encompass the range of variability encountered in sub-fossil deposits. This is particularly important in regions that have experienced diverse and extensive environmental changes and/or human impacts, such as parts of coastal Australia. The development of larger, combined datasets has previously been undertaken in Africa (e.g. Gasse et al. 1995), Europe (e.g. Juggins 2004; Bennion 2010) and North America (e.g. Wilson et al. 1996).

Many Australian ecosystems have been highly modified since European settlement and with the majority of the population (85%, SoE 2006) living in coastal regions, water quality is a key issue for environmental managers. Diatoms are recognised as one of the potentially most useful groups for water quality assessments in Australia (ANZECC 2000), but limited information regarding their ecological preferences has, to date, restricted their use. This paucity of data along Australia’s extensive coastline needs to be addressed if diatoms are to be more widely applied to long term monitoring and management of the coastal zone.

This paper presents a new dataset quantifying the relationship between diatom species assemblages and the water chemistry of southeast Australian estuaries and coastal lakes. It extends a previously published dataset from Victoria (Saunders et al. 2008) with 36 sites from eastern Tasmania. The addition of the Tasmanian sites is useful because many of them have experienced only minor or transitory human impacts compared with their Victorian counterparts, and therefore provide additional potential analogues for diatom assemblages associated with pre-impact conditions in sediment cores. The specific aims of this study were to (a) provide species-level data on the composition and distribution of surface sediment diatom species assemblages in coastal water bodies in southeast Australia; (b) investigate and identify the environmental gradients influencing diatom species distributions and individual species ecological preferences; (c) develop inference models (i.e. transfer functions) based on the relationship between diatom species assemblages and environmental variables; and (d) discuss potential applications of the inference model to ecological and paleoecological investigations, with the aim of providing information to facilitate better environmental management.

Materials and methods

Study area

Rising sea levels last submerged the land bridge between Tasmania and Victoria approximately 13,000 years ago and the present coastline started forming near the end of the post-glacial marine transgression approximately 7–8,000 years ago. The coastline of the study area consists of quartzose beaches and dunes with a small number of calcareous beaches and dunes west of Wilson’s Promontory. Geomorphological characteristics of the coastal estuarine habitats range from semi-enclosed bays that are characterize by marine waters with little fresh water inflow through to coastal water bodies that are brackish and rarely, if ever, connected to the ocean. The nature of these connections to the ocean determines the exchange of water and the salinity of these water bodies.

Climatologically southeast Australia has a variable, temperate climate (BOM 2009). Victoria’s climate is typified by warm summers and mild winters and the climate is dominated by the El Niño Southern Oscillation (ENSO), with El Niño events often leading to drought and La Niña events sometimes resulting in flooding (BOM 2009). Tasmania’s climate has four distinct seasons, with rainfall greatest in winter. The eastern half of the island is in the rain-shadow of the west and is more influenced by the ENSO. Victorian coastal systems experience seasonally variable rainfall, are infrequently flushed, not often stratified and tend to be more nitrogen-limited. Tasmanian coastal systems tend to receive more regular rainfall, are more commonly vertically stratified, frequently flushed and more likely to be phosphorus-limited (Davis and Koop 2006). The coastlines of both regions contain many coastal water bodies of high conservation value that range from small, closed lakes to large, open estuarine systems. These systems support the majority of the human population in both regions as well as key industries. They are consequently of significant environmental, societal and economic importance.

Sample collection

Thirty-six sites from 32 coastal water bodies in Tasmania were sampled and combined with the Victorian dataset (Saunders et al. 2008), which consisted of 45 sites from 13 coastal water bodies, creating a merged dataset of 81 sites. Sites spanned >1,500 km of coastline (37.50°S–42.92°S and 144.4°E–149.7°E, Fig. 1, Table 1). Water chemistry was measured both in August 2004 and February 2005 in order to capture some of the seasonal variation in limnology. Salinity, temperature, pH, turbidity and dissolved oxygen were measured in situ with a Horiba U-10 Water Quality Checker at 1 m water depth. Duplicate 10 mL water samples were collected for soluble reactive phosphate (referred to as phosphate hereafter), nitrate + nitrite and silicate analyses. Sample tubes were rinsed several times in lake/estuarine water at each site prior to collection. Samples were then frozen and analyzed using a Lachat Instrument (Injection Flow Analyzer) at the Commonwealth Scientific and Industrial Research Organization Marine and Atmospheric Laboratories, Hobart, within two months of collection. The autoanalyzer uses the principles of colorimetric analysis in a continuous flow system with all manipulations of the samples automated. Each analytical channel was calibrated using standards of known concentration (Eriksen 1997). The water chemistry measurements for each site were combined to determine a mean value for each variable for the statistical analyses.

Fig. 1
figure 1

a Location of study area; b location of sampling sites in Victoria and c Tasmania with specific sites numbered. See Supplementary Material for specific locations and details of sampling sites

Table 1 Summary of estuaries and coastal lakes in the dataset

Surface sediment samples (top 1 cm), which provide a spatially and temporally integrated sample of the diatom communities at each site (Lim et al. 2007), were collected from approximately 1 m water depth using a hand-operated gravity corer (Glew 1991). Surface sediment samples were collected at the end of the water sampling period (February 2005) to ensure that the diatom species identified more closely corresponded with the water chemistry measured. The number of samples taken from each water body was variable (ranging from 1 to 14 sites), depending on size and complexity, to try and account for spatial variability in diatom habitats and water chemistry.

Diatom preparation and analysis

Diatom samples were prepared following standard methods (Battarbee 1986). At least 400 frustules per sample were counted using oil immersion at 1000× magnification on a Zeiss 20 light microscope. The relative abundance of all species (including unidentified forms) was recorded as a percentage of the total number of frustules counted (Battarbee et al. 2001). Taxonomy was principally based on Australian taxonomic literature (i.e. Hodgson et al. 1997; John 1983; Sonneman et al. 2000) and datasets (i.e. Hodgson et al. 1996; Saunders et al. 2007; Taffs 2005), with additional reference to European coastal diatom taxonomic references (e.g. Witkowski et al. 2000). As diatom taxonomy requires further development in this region (see Vyverman et al. 1995) all taxa were photographed and are archived with the author.

Statistical analyses

Multivariate statistical analyses were used to identify major environmental gradients, explore diatom-environment relationships and identify environmental variables that explained independent portions of the variance in the diatom data.

Prior to statistical analyses, each environmental variable was checked for skewness, and nutrients and turbidity were log(x + 1) transformed. Principal Components Analysis (PCA) was performed on the environmental data to determine the major environmental gradients. Detrended Correspondence Analysis (DCA) with detrending by segments and downweighting of rare species was performed on the species data to establish whether species distribution was unimodal or linear. As gradient lengths were greater than 2 standard deviation units, unimodal ordination techniques were used (ter Braak 1995). Species data were log(x + 1) transformed in an attempt to stabilize the variance in the dataset (Birks et al. 2001).

A series of Canonical Correspondence Analyses (CCA) were performed with scaling focused on inter-species distances, biplot scaling and downweighting of rare species. Variance Inflation Factors (VIFs) were identified and any environmental variables with VIFs > 20 were removed. A series of CCAs of each environmental variable alone was performed, followed by CCAs of individual environmental variables with the remainder as covariables (i.e. forward selection) to determine which made independent, significant contributions to explaining the variation in the species data (i.e. p < 0.05, based on 999 Monte Carlo permutation tests without Bonferroni or other adjustments). Variance partitioning was used to determine the amount of variation explained by each variable and the interaction between them. Detrended Canonical Correspondence Analysis (DCCA), where the ratio of the constrained axis (axis 1) to the first unconstrained axis (axis 2) should be >0.5 (Kingston et al. 1992), was used to determine which environmental variable was most suitable for transfer function development (Birks 1998). All ordinations were performed using the software program R (R Development Core Team 2006) and CANOCO 4.5 for Windows (ter Braak and Smilauer 2002).

Species ecological preferences (i.e. optima and tolerances) were determined using simple Weighted Averaging (WA) on untransformed species data. Transfer functions were developed using simple WA with inverse and classical deshrinking and with/without tolerance downweighting, and Weighted Averaging Partial Least Squares (WAPLS) to determine which technique led to the best performing transfer function. Performance was assessed using leave-one-out cross validation (i.e. jackknifing, Birks 1998). When assessing which model to use, a WAPLS model with a higher number of components was only accepted if there was ≥5% improvement in RMSEP over alternatives with less components (Birks 1998). Sites were selected for deletion based on visual inspection of the residual difference between predicted and observed salinity, with outliers removed until model performance did not improve. Species ecological preferences and transfer functions were determined using the software program C2 version 1.4 (Juggins 2003).

Results

Environmental data and diatom assemblages

The water chemistry data for Tasmanian and Victorian estuaries and coastal lakes that were sampled are summarized and outlined in Tables 2 and 3. Salinity ranged from near fresh to hypersaline, while nutrient concentrations ranged from oligotrophic to eutrophic. In general, the Victorian sites had poorer water quality, as represented by higher nutrient and lower dissolved oxygen concentrations than the Tasmanian sites. Information for each site is provided in the Supplementary material.

Table 2 Summary of water quality data for Tasmanian and Victorian sites
Table 3 Water chemistry results for sites in the dataset

PCA of the environmental data showed that the first two axes explained 42% of the variation in the environmental data. Variation along the first axis was explained by nitrate + nitrite, while variation along the second axis was explained by pH (Fig. 2).

Fig. 2
figure 2

Principal Components Analysis of the environmental data. Boxed region in the upper panel expanded below. DO dissolved oxygen, N nitrate/nitrite, P phosphate, Si silicate, Temp temperature, Turb turbidity. See Table 3 for site codes and water chemistry information

In total, 399 diatom species were identified from 53 genera. Of these, 186 species (47%) were common between Tasmania and Victoria. Most of the diatom taxa were benthic. Thirty-five percent were rare (occurring only once), while 46% occurred at low (<1%) relative abundance. Thirty-nine species were abundant (i.e. ≥10% maximum relative abundance in one or more samples, Table 4), while 29 were widespread (i.e. occurring in ≥20 sites). Opephora guenter-grassii (Witkowski and Lange-Bertalot) Sabbe and Vyverman, and Planothidium hauckianium agg. (Kützing) Round and Bukhtiyarova were the most abundant and widespread: Opephora guenter-grassii occurred in 67 (83%) sites with 47.6% maximum relative abundance; Planothidium hauckianium agg. occurred in 78 (96%) sites with 57.5% maximum relative abundance. For the statistical analyses, only those species occurring with a maximum relative abundance ≥2% and in ≥2 samples were included, resulting in a total of 141 taxa. All taxa are listed in the Supplementary material (including their mean and maximum relative abundances).

Table 4 Number of occurrences (N), effective number of occurrences (Hill’s N2) (Hill 1973), maximum relative abundance, and weighted averaging salinity optima (WA opt) and tolerances (WA tol) of the most dominant (i.e. maximum relative abundance ≥ 10%) diatom taxa

Multivariate analyses: diatom-environment relationships

An initial CCA indicated the environmental data explain 16.3% of the variation in the diatom data (Table 5). As all VIFs were <10, all environmental variables were retained. Salinity was correlated to axis 1, while the remaining variables were correlated to both axis 1 and axis 2 (Fig. 3). Nitrate + nitrite, phosphate, salinity and turbidity explained independent portions of the variance in the diatom data (as determined by forward selection) and CCA of these variables indicated they explained 10% of the variation in the diatom data (Fig. 3, Table 5). Variance partitioning indicated that 7.3% of the variation in the diatom data was due to these environmental variables alone and the total interaction between them was 1.3% (Fig. 4). Salinity explained the most variation (2.6%), followed by phosphate (2.0%), nitrate + nitrite (1.7%) and turbidity (1.0%). The greatest interaction occurred between nitrate + nitrite and the other variables (0.6%).

Table 5 Canonical correspondence analyses of (a) all the environmental variables and (b) forward selected variables only (i.e. nitrate-nitrite, phosphate, salinity and turbidity)
Fig. 3
figure 3

Canonical Correspondence Analysis of the dataset with all environmental variables. a Sites displayed; b species displayed. Boxed region in the upper panels expanded below. DO dissolved oxygen, N nitrate/nitrite, P phosphate, Si silicate, Temp temperature, Turb turbidity. See Table 4 and Supplementary Material for species names

Fig. 4
figure 4

Summary of Variance Partitioning results

To determine which, if any, of the environmental variables were suitable for transfer function development, DCCA was performed with axis 1 constrained to each environmental variable in turn, with axis 2 left unconstrained. DCCA indicated that salinity was suitable (i.e. λ12 = 0.6).

Species optima and tolerances and model development

Simple WA was used to determine species optima and tolerances for salinity. Information for all species is provided in the Supplementary material and is summarized in Fig. 5. There were clear differences in the relative abundance of the most common diatoms along the salinity gradient (Fig. 6). For example, Diatomella cf. balfouriana Greville was most abundant at low (<5 ppt) salinity sites, Rhopalodia acuminata Krammer and Cyclotella choctawhatcheeana Prasad were most abundant at brackish (5–30 ppt) sites, while Cocconeis scutellum Ehrenberg var. 1 and Navicula cf. lusoria Giffen were more abundant at high (≥33 ppt) salinity sites.

Fig. 5
figure 5

Species salinity optima and tolerances, derived by simple weighted averaging

Fig. 6
figure 6

Occurrence of most dominant (≥20% maximum relative abundance) taxa in the dataset. Sites ordered by salinity. Species ordered by their salinity optimum

Simple WA with inverse deshrinking resulted in the best performing salinity transfer function. Six sites were identified as outliers and removed. The final model consisted of 75 sites (r 2 = 0.72, r 2jack = 0.58, RMSEP = 0.09 log ppt, Table 6, Fig. 7). WAPLS did not significantly improve transfer function performance.

Table 6 Transfer function results and comparisons with similar studies
Fig. 7
figure 7

Plots of observed versus inferred salinity and observed versus residual salinity based on simple WA regression and calibration

Discussion

The overall aims of this study were to quantify the relationship between diatom species assemblages and the water chemistry of southeast Australian estuaries and coastal lakes, develop regional inference models, and discuss potential applications of the inference models to ecological and paleoecological studies investigating natural variability and human impacts in coastal ecosystems.

Environmental data and diatom assemblages

Water chemistry differences between Tasmania and Victoria were primarily related to differences in nutrient and dissolved oxygen concentrations (Table 2). This is likely associated with different land use types, freshwater inputs, influence from the sea and other local and regional factors that influence estuarine nutrient cycling. Mean nitrate + nitrite and phosphate concentrations were nine and six times greater, respectively, in Victorian coastal water bodies compared to Tasmanian sites and no Tasmanian sites had nitrate + nitrite concentrations ≥55 μg N L−1 or phosphate concentrations ≥ 70 μg P L−1 (Table 2). However, the low median nitrate + nitrite concentration and, to a lesser extent phosphate concentration, in Victorian sites indicated that high nutrient sites were in the minority (Table 2). This was also found by Philibert et al. (2006) in their study of southeast Australian streams, where the majority of sites had low nutrient values. Mean and median dissolved oxygen in Victorian sites was almost half that of Tasmanian sites (Table 2), despite the maximum dissolved oxygen recorded in Victorian sites being greater than in Tasmanian sites. This indicates that, in general, Tasmanian sites have lower nutrient concentrations and higher dissolved oxygen than Victorian sites, which reflects generally better water quality and lower levels of nutrient enrichment in Tasmanian compared to Victorian sites.

The number of taxa identified (399 in total) highlights the wide diversity of diatom taxa in the southeast Australian coastal region. The most common and widespread taxa (Table 4) are considered cosmopolitan species, characteristic of brackish, coastal and marine water bodies (Witkowski et al. 2000) and have been identified in previous Australian coastal diatom studies (e.g. Fluin et al. 2007; Haynes et al. 2007; Saunders et al. 2007; Taffs et al. 2008).

Major environmental gradients and diatom species ecological preferences

This study demonstrated that diatom assemblages in southeast Australian coastal lakes, lagoons and estuaries are primarily influenced by salinity. Salinity has previously been found to be the overriding environmental variable influencing Australian diatom communities and diatom-salinity (or -conductivity) relationships have been described (e.g. Hodgson et al. 1997; Blinn and Bailey 2001; Gell et al. 2002; Philibert et al. 2006; Saunders et al. 2007; Tibby et al. 2007; Taukulis and John 2009). With the exception of Hodgson et al. (1997) and Saunders et al. (2007), none of these studies were conducted in coastal ecosystems.

Other factors not measured, such as grain size, water depth and light availability, can influence the composition of diatom communities (Stoemer and Smol 1999). For example, water depth has previously been found to have an important influence on benthic diatom communities (e.g. Weckström and Juggins 2005). The influence of depth was minimized in this study as all samples were collected from 1 m water depth. Similarly, light availability and grain size have previously been found to have an important influence on coastal diatom communities (e.g. Hassan et al. 2007). However, studies including these factors have still found salinity is the overriding factor in explaining changes in diatom species assemblages (e.g. Hassan et al. 2009).

The covariance of environmental gradients in coastal areas and estuaries (particularly with nutrients, e.g. Underwood et al. 1998) may make it difficult to determine the environmental factor to which species are primarily responding to (Ryves et al. 2004). Although it is possible to separate and test the independent effects of different parameters, there is often a large proportion of the variance unexplained (e.g. Ryves et al. 2004; this study). This is likely due to a combination of factors including potential mismatches between time-integrated sedimentary assemblages compared to ‘spot’ measurements of environmental variables. Additionally, even if spot measurements of environmental variables are repeated they may still fail to adequately capture the variation in physical chemical conditions that affect diatom growth. Therefore, whilst measurements in this study were made both in summer and winter, it is unlikely that the full ranges of the environmental parameters measured was captured at all the sites, which together with unmeasured parameters, contributes to the unexplained portion of diatom species-environment relationships seen in this dataset.

Prior to this study little was known about diatom species ecological preferences in southeast Australian coastal lakes and estuaries. The determination of species optima and tolerances and gradual transitions in species abundances and occurrences along the salinity gradient (Figs. 6, 7) allows the identification taxa that are more abundant at particular salinities. Whilst most of these more abundant taxa have been previously described as ‘brackish-marine’ taxa, this study permits a finer scale discrimination of their salinity preferences. For example, Cocconeis placentula has previously been identified as a brackish, shallow water species (Hodgson et al. 1996; Fluin et al. 2007), and this study is consistent with these observations (i.e. it was most abundant in brackish conditions, 5–30 ppt, Fig. 7). Grammatophora marina (Lyngbye) Kützing, Planothidium hauckianium agg., Mastogloia pusilla Grunow and Opephora pacifica Petit were most abundant at marine (30–35 ppt) sites (Fig. 7). This agrees with previous descriptions of Grammatophora marina and Opephora pacifica which have been classified as marine taxa (Vos and de Wolf 1993; Fluin et al. 2007). Planothidium hauckianium agg. was very common in the dataset and occurred at most sites. This is a consistent with previous descriptions, which have identified this as a cosmopolitan taxon commonly found in brackish sites, especially on sandy substrates (Witkowski et al. 2000).

Evaluation of the diatom-salinity transfer function

The best performing diatom-salinity transfer function was developed using simple WA with inverse deshrinking. The large number of different waterbody types included in this dataset, ranging from small coastal lakes to large open estuarine systems, provided a wide range of species assemblages distributed along long environmental gradients and a good range of modern analogue assemblages with which to interpret sub- fossil data. Six sites were identified as outliers and removed (P4, SC1, ML2, ML3, GL10, GL11). All were from low salinity (≤12 ppt) sites with very high phosphate concentrations (>450 μg PL−1). While the predictive ability of the diatom-salinity transfer function has a 0.09 log ppt (i.e. 4.3 ppt) error, its performance is better than a previously published transfer function based only on Victorian sites (Saunders et al. 2008). In contrast the Tasmanian transfer function presented by Saunders et al. (2007) has slightly better performance (Table 6) than the current study.

Potential applications of the dataset: ecological and paleoecological investigations of natural variability and human impacts

Environmental impacts on coastal lakes and estuaries in southern Australia are a major management concern. Large-scale environmental problems have arisen due to upstream river regulation, water extraction for irrigation, salinization, enhanced sedimentation rates, modifications of openings to the sea, urbanization of coastal catchments, acidification and eutrophication (SoE 2006; Fluin et al. 2007; Taukulis and John 2009). Management efforts are increasingly focused on alleviating environmental problems (SoE 2006). However, historical documentation and water quality monitoring data are usually inadequate for providing the long term perspective needed (SoE 2006). In the absence of long term water quality monitoring, paleoecological methods play a crucial role in establishing ‘baseline’ conditions and rates of change as they can identify the timing and progressive impact of different human activities and provide a long enough temporal perspective to disentangle natural from anthropogenic-related changes (Willis and Birks 2006). They also provide an opportunity to assess whether ecosystems have crossed critical thresholds (e.g. Smol and Douglas 2007) and take into account natural variability, which is essential for being able to evaluate the capacity of coastal ecosystems to adapt and respond to future changes, particularly predicted climate change and further pressure from human activities (Duke et al. 2003; Davis and Koop 2006). The dataset and transfer function developed in this study may be applied to sediment cores collected from coastal lakes and estuaries in southeast Australia to reconstruct past salinity changes and address issues arising from direct (e.g. hydrological modifications to estuary entrances) and indirect (e.g. changes in catchment land use and/or upstream irrigation schemes) human activities, together with the impacts of recent climate change (e.g. reduced rainfall). This provides a tool for understanding whether current conditions are outside the range of natural variability, attribute causes of change (e.g. Saunders et al. 2008) and assist in determining appropriate management strategies.

Conclusion

This study has presented a new diatom dataset quantifing the relationship between diatom species assemblages and the water chemistry of southeast Australian estuaries and coastal lakes. Salinity explained the most variation in the diatom assemblages and a diatom inference model for reconstructing salinity was developed. This is a potentially valuable tool for monitoring both modern changes in salinity using time-smoothed diatom species assemblages (rather than spot measurements) and reconstructing past salinity changes from sub-fossil species assemblages to identify long term natural variability and human impacts in southeast Australian coastal water bodies.