1 Introduction

Secchi disk depth (Z SD) is the optical measure of water clarity that specifies the sighting depth of a submerged target (Tyler 1968; Preisendorfer 1986), most often a 20-cm-diameter black and white quadrant circular disk in lacustrine waters (Kirk 2011). This long-term measurement (Wetzel 2001; Lee et al. 2015) remains closely coupled to the public’s perception of water quality (Effler 1985; Smith and Davies-Colley 1992). The magnitude of Z SD is determined by the composition and concentrations of an array of light attenuating substances, described here as optically active constituents (OACs; Fig. 1). Z SD is an apparent optical property (AOP) in that it depends on the geometry of the light field (Kirk 2011). While the OAC conditions are ultimately responsible for the values of this AOP, the effects of the OACs are mediated quantitatively through attenuating inherent optical properties (IOPs; independent of the geometry of the light field), including the scattering (b, m−1), absorption (a, m−1), and beam attenuation coefficients (c = b + a, m−1, Table 1; Mobley 1994; Kirk 2011). To understand and quantify the dependence of Z SD on the OACs, it is important to consider their connection through a theory-based framework (e.g., mechanistic model) that quantifies both the OAC → IOP (link 1) and IOP → AOP (Z SD; link 2) linkages (Fig. 1; Davies-Colley et al. 2003).

Fig. 1
figure 1

Conceptual layout of mechanistic model to predict Z SD (an AOP) from particulate OACs, with two links that quantify the dependencies of IOPs on OACs (link 1) and, in turn, Z SD on IOPs (link 2). Symbols defined in Table 1

Table 1 List of abbreviation and symbols

Most of the earlier work on the regulation of Z SD in lacustrine waters focused on phytoplankton biomass as the sole or primary OAC, acknowledging that it is accompanied by a retinue of associated organic materials (bacteria, protozoans, associated organic detritus), and used chlorophyll a concentration ([Chl]) as the surrogate metric (Jewson 1977; Megard et al. 1980; Field and Effler 1983; Tilzer 1983). Published empirical relationships reported between Z SD and [Chl] for certain lakes had an inverse form, e.g., Z SD −1 ∝ [Chl], that was reasonably strong in selected cases. However, the potential for noteworthy influence on lacustrine Z SD by OACs other than phytoplankton biomass, particularly other light scattering particulates, is clear when a widely accepted theory-based radiative transfer expression (Tyler 1968; Preisendorfer 1986) is considered (Eq. (1)).

$$ {Z}_{\mathrm{SD}}=\Gamma {\left({K}_{\mathrm{L}}+{c}_{\mathrm{L}}\right)}^{-1} $$
(1)

This equation represents the dependence of Z SD on IOPs, as c (for luminance (L) at the penetrating wavelengths for humans, c L) is equal to the sum of a and b for the L wavelength(s) (λ), and another AOP, the diffuse attenuation coefficient for illuminance/irradiance (K L). Luminance and illuminance are photometric quantities (describing the brightness sensed by the human eye). When making observations at a single wavelength, the differences between the photometric and radiometric quantities are not significant (Zaneveld and Pegau 2003). In lacustrine waters, c L is dominated by particulate b (b p) at the L wavelength(s) (b = b p + b w; scattering by water (b w) is minor); i.e., b p greatly exceeds a at these wavelength(s) (b L ≫ a L). Accordingly, Z SD may be influenced by a broad array of particle types (Kirk 1985; 2011), those that can make noteworthy contributions to b p (=Σb x, Fig. 1). Indeed, cases of weakness in Z SD −1 ∝ [Chl] relationships in both time and space have been widely identified in recent studies (Kirk 1985; Jassby et al. 1999; Davies-Colley et al. 2003), in part associated with substantial and variable contributions to b L (i.e., also c L) by non-phytoplankton particles (e.g., terrigenous soils; Effler et al. 2008). Pursuit of quantitative relationships between multiple OACs and IOPs, and thereby Z SD, in the form of a theoretical model (Fig. 1) has both research and lake management value by guiding (1) improved understanding and quantification of the linkages, (2) identification of appropriate OAC foci to manage Z SD, and (3) model of Z SD from OAC information.

Recent advancements in quantitative partitioning of IOPs according to contributions from multiple constituents (OACs) have established that minerogenic particles, in addition to phytoplankton biomass, are often also important components of b p, and thereby Z SD (Fig. 1; Peng and Effler 2010, 2011; Effler and Peng 2014). Moreover, there is a need to establish appropriate metrics for all of the important particulate OACs, as shortcomings in their representativeness translate to uncertainties in estimates of their effects on IOPs, and thereby on Z SD (Fig. 1). Chlorophyll [Chl] remains the most widely monitored of the metric options for the phytoplankton biomass OAC, though shortcomings have been recognized (Reynolds 2006). The concentration of particulate organic carbon [POC], an alternative proxy for phytoplankton biomass, may have advantages over [Chl] (Fennel and Boss 2003; Stramski et al. 2008). The traditional metric for the OAC of inorganic particulates had been gravimetric, the concentration of inorganic suspended particulate material [ISPM]. However, the various shortcomings of [ISPM] in representing the optical attributes of minerogenic particle populations (Peng and Effler 2015; Gelda et al. 2016) are likely responsible for the limited cases (Stavn and Richter 2008) of the applications of this metric for optics issues. Recently, an individual particle analysis technique, scanning electron microscopy interfaced with automated image and X-ray analyses (SAX), has been used to successfully characterize the light scattering features of minerogenic particle populations, including concentrations, sizes, and composition, and support estimates of their contributions to b p (reviewed by Peng and Effler (2016b). The total projected area of minerogenic particles per unit volume of sample (PAVm) determined from SAX characterizations has been found to be a particularly valuable metric of this OAC (Peng and Effler 2015).

Theory-based resolution of the contributions of minerogenic particles (b m; PAVm-based) vs. phytoplankton biomass (b o; [Chl]-based), as the two primary OACs of overall b p (=Σb x = b m + b o), has been advanced, through demonstrations of reasonable closure with bulk measurements of b p (Peng and Effler 2015).

This represents progress in the first link of a two-link mechanistic model to predict Z SD from OAC metrics (Fig. 1). However, the effects of alternate metrics for these two OACs for b p, and thereby Z SD predictions (e.g., [POC] and ISPM; Fig. 1), have not been considered. Tests of the second link (link 2), the consistency of bulk measurements of IOPs and another AOP (K d) with Z SD observations, in the context of a theory-based radiative transfer expression (Fig. 1), have also been reported (Lee et al. 2015). However, implementation of an overall mechanistic Z SD model that includes both of the two links (Fig. 1, link 1 and link 2), first the IOP estimates (e.g., b p) from OAC metrics (link 1), and, in turn, predictions of Z SD from those IOP estimates (link 2), has been absent for lacustrine waters.

A central goal of this paper is to advance the development and testing of a two-link theory-based mechanistic Z SD model (Fig. 1) to advance understanding, simulation, and management of water clarity that can be applied separately or integrated into a larger multiple parameter water quality model. The issue of modeling Z SD is advanced here based on robust OAC and optical data sets from Cayuga Lake, NY, and theory-guided analyses. The elements of the effort include (1) critical analysis of OAC identities and metric alternatives; (2) descriptions of dynamics of OAC metrics, IOPs, and AOPs, including Z SD, for the lake; (3) testing of OAC → IOP relationships through their closure with bulk IOP measurements (Fig. 1, link 1); (4) application of a theory-based radiative transfer expression (Fig. 1, link 2) to test closure of bulk measurements of IOPs with Z SD (IOPs → Z SD); (5) performance of an overall OAC → Z SD model (links 1 and 2); and (6) example model applications to depict key dependencies and responses of Z SD to hypothetical scenarios.

2 Methods

2.1 Contexts: Cayuga Lake and Radiative Transfer Expression for Z SD

Cayuga Lake (42° 41′ 30″ N, 76° 41′ 20″ W) is the fourth easternmost of the New York Finger Lakes (Fig. 2). It has the second largest volume (9.4 × 109 m3) and surface area (172 km2) of this group of lakes, with maximum and mean depths of 133 and 58 m, respectively (Schaffner and Oglesby 1978). This is an alkaline, hardwater, mesotrophic lake (Effler et al. 2010), with an average residence time of 5.5 years (Gelda et al. 2015). Nearly 40% of the total tributary inflow, including Fall Creek, the largest tributary, enters the southern end of the lake. Fall Creek flow dynamics has qualitative (Peng and Effler 2015; Prestigiacomo et al. 2016) and quantitative (Effler et al. 1989) value as indicative of lake-wide and local tributary input dynamics. The southernmost 2 km of the lake that receives these inflows is shallow (≤6 m) and described as the “shelf” (Fig. 2). The large sediment (soil; i.e., ISPM and PAVm) loads delivered to the shelf during runoff events, dominated by minerogenic particles (Prestigiacomo et al. 2016), cause short-term localized degradation of water quality, including decreases in Z SD (Effler et al. 2010; Effler and Peng 2014) and increases in turbidity (Effler et al. 2014) and particulate phosphorus (Peng and Effler 2015).

Fig. 2
figure 2

Map of Cayuga Lake, position within New York, and the 11 Finger Lakes and the inflow location of Fall Creek. Positions of four lake monitoring sites (nos. 2, 3, 5, and 7), one on the shallow shelf at the southern end (no. 2), with the pelagic sites along the primary axis

The leading radiative transfer expression for Z SD, adopted here Eq. (1), was originally developed by Tyler (1968) and Preisendorfer (1986), with Γ described as a quasi-constant. Much of our link 2 data analysis is conducted in the context of this equation; e.g., its representation of the effects of the IOPs and the AOP on K L. The “L” subscripts identify the dependence of c L and K L on the spectral features of penetrating light in the water column and the receptive response of the human eye in Z SD observations (i.e., photopic optical properties; Priensendorfer 1986). The peak response of the human eye to contrast is at a wavelength of ∼555 nm. This human vision factor, together with the spectral characteristics resolved by instrument measurements of a, b p, and irradiance and calculated results for downwelling diffuse light attentuation (K d), influenced the specification of the wavelengths in our analyses related to Z SD (Eq. (1)).

2.2 Supporting Measurements

Measurements of Z SD were made with a black and white quadrant Secchi disk of 20-cm diameter, between 10:00 and 14:00 h, over the April–October interval of 10 years (Table 2). The Z SD measurements were made according to common protocols, from a height of ∼75 cm above the water surface, from the sunny side of the boat, and as the average of the depths of disappearance (lowering) and reappearance (raising, Davies-Colley et al. 2003). The conduct of measurements was independent of variability in ambient meteorological conditions; i.e., sun availability and wave action varied among sites and monitoring days.

Table 2 Program of measurements supporting the Z SD modeling program for Cayuga Lake

The supporting monitoring program included multiple sites for the spring-fall interval of 10 years, with the most frequent and largest set of laboratory and field measurements in 2013 (Table 2). Both pelagic and shelf sites were included (Fig. 2). The locations supported differentiation between near-shore tributary inflow areas (e.g., site 2) and pelagic waters (the other sites), particularly the effects of high concentrations of terrigenous sediment caused by the inputs from major runoff events (Effler and Peng 2014; Peng and Effler 2015). Standard methods were used for the three OAC metrics that are widely measured: the two for phytoplankton biomass, [Chl] (Parsons et al. 1984, for 1998–2006; Arar and Collins 1997, for 2013) and [POC] (Rice et al. 2012), and [ISPM] for inorganic sediment (Clesceri et al. 1998).

Protocols for SAX analyses have been described in detail previously (Peng and Effler 2007; Peng et al. 2009). Features of natural minerogenic particle populations critical to their optical (e.g., b p) impact, including the number concentration (N), the particle size distribution (PSD), and their composition, are well quantified with SAX (Peng and Effler 2007, 2015). This individual particle analysis protocol places each one into one of six chemical groups that either represent terrigenous (allochthonous; e.g., clays and quartz) or autochthonous (e.g., calcite precipitation) origins (Peng and Effler 2015). More than 2000 individual minerogenic particles were analyzed for each sample. The average coefficient of variation (cv) values for analyses of triplicate samples conducted on 12 occasions from a pelagic site in 2013 were 6, 13, 14, and 37% for [Chl], [POC], PAVm, and [ISPM], respectively.

The beam attenuation coefficient was measured in each year at a wavelength (λ) of 660 nm with the c-star transmissometer (Table 2), identified as c s. The parameter c is acknowledged to be dominated by b (∼97%, and, in turn, b p) at this wavelength (Babin et al. 2003; Effler and Peng 2014). Scalar diffuse photosynthetically active irradiance (400–700 nm) was measured with another sensor (Table 2) attached to the same frame. The corresponding diffuse attenuation coefficient (K o(PAR)) was calculated according to Beer’s Law, by linear regression of the natural logarithm of the irradiance vs. depth (Kirk 2011); the presented values correspond to the 10% light-level depth (within epilimnion). The spectral values of c and a were measured (b p = c − a) within the upper waters with an ac-s meter (Table 2) on most of the monitoring days of 2013; this beam attenuation coefficient value was identified as c a in comparisons with c s. A more limited set (n = 18) of measurements of spectral downwelling irradiance was made at pelagic sites 3 and 5 (Fig. 2) in 2013, to support calculation of the corresponding downwelling diffuse attenuation coefficient (K d(λ)), that specifies the spectral dependence of light penetration.

2.3 Supporting Calculations

Equations adopted here from previous studies for the optical calculations of this study are summarized tabularly (Table 3). Bulk measurements of c(660) (i.e., b p(660)) were adjusted upward to represent the effects of the acceptance angles of the instruments using relationships developed and applied previously for this lake (Table 3, Eqs. (2) and (3)), guided by observations of Boss et al. (2009a) from other systems and the paired c a and c s measurements of 2013 in Cayuga Lake. Forward estimates of b p(660) were made as the summation of two components: one for natural minerogenic particles (b m(660)) and the second representing phytoplankton biomass (and its retinue; b o(660)) (Eq. (4), Table 3). These support closure testing with bulk measurements of b p(660) and Z SD (OACs → Z SD, Fig. 1). In some cases, b m(660) was partitioned according to terrigenous soils (b te(660)) and autochthonous calcite (b ca(660)) particles (Eq. (5), Table 3; Peng and Effler 2015). The overall b m was calculated as the product of SAX-measured PAVm and the average scattering efficiency factor (<Q b,m>) for minerogenic particles, established to be uniform (=2.3) in multiple lakes, including Cayuga Lake (Peng and Effler 2015). Both [Chl] (Eq. (7)) and [POC]-based (Eq. (8)) empirical expressions were used to estimate b o(660). The [POC]-based (i.e., 2013 only) expression was separately developed and successfully tested for b p closure for Cayuga Lake but was quite similar to marine equations (Cetinic′ et al. 2012; Peng and Effler 2016a).

Table 3 Equations supporting Z SD model and its links

The performance of a recent alternate radiative transfer expression for Z SD, developed by Lee et al. (2015), is compared preliminarily to that of the adopted framework (Eq. (1)). The relationship from Lee et al. (2015) is

$$ {Z}_{\mathrm{SD}}\sim 1/{K_{\mathrm{d}}}^{\mathrm{tr}} $$
(9)

where K d tr is the spectrally minimum K d within the visible range, described as the K d of the transparent “window” (most penetrating λ) of the water column (Lee et al. 2015). The wavelength (λ) for the “tr” and the values of K d tr were based on the system-specific measurements of K d(λ) (Table 2). The λ of minimum K d(λ) and the associated K d value correspond to tr and K d tr, respectively.

3 Results

3.1 Bulk Measurements: Spectral Features, Relative Magnitudes of IOPs and K d(550)

Development of this mechanistic model(s) required optical characterizations based on measurements with the instrumentation (Table 2). Spectral dependencies of K d (Fig. 3a), a (Fig. 3b), and b p (Fig. 3c) are presented for each as ratios of the measured spectra and the corresponding spectral average values (<K d(λ)>, <a(λ)>, and <b p(λ)>) for the 400–700-nm λ interval. The K d spectra established that the λ of maximum light penetration was generally recurring, at ∼550 nm, very similar to that of the peak response of the human eye to the contrast of Z SD. Accordingly, the 550-nm observations were adopted as corresponding to the L, embedded in the model’s radiative transfer equation (Fig. 1) and the tr of the Lee et al. (2015) expression, respectively. The K d spectral shape was driven by that of a (Fig. 3b); the average r 2 value for the paired K d and a spectra was 98%. The greatest spectral uniformity was for b p (note the different y-axis scaling; Fig. 3c), though modest variations were observed. The shape was most uniform over the 500 to 550 nm range, for which b p(λ)/<b p(λ)> remained between 1.0 and 1.1. The average value of the b p(550)/b p(660) ratio from the ac-s measurements was 1.15 (cv = 4.3%). The corresponding average value of the c(550)/c(660) ratio from these measurements was 1.30. Thus, the extent to which c (=b p + a) was higher at 550 nm was a result of approximately equal contributions by both a and b p, moving from 660 to 550 nm.

Fig. 3
figure 3

Select optical instrument measurements from pelagic sites made in 2013. Spectral shapes for a K d, b a, and c b. Relationships between selected IOPs at λ = 550 nm, K d(550), and K o(PAR): d K d(550) vs. K o(PAR), e distribution of ratio c(550)/K d(550), f distribution of ratio a(550)/c(550), and g distribution of ratio b p(550)/a(550). Shapes (ac) are the spectral observations dividing by the spectral average values; n is number of observations. Ratios were calculated from paired observations. n number of observations, int intercept from linear regression, s slope from linear regression, x value of mean, cv value of coefficient of variation

The paired observations of K o(PAR) and K d(550) were strongly correlated (Fig. 3d), but K o(PAR) values were substantially greater (average of K o(PAR)/K d(550) = 1.31, cv = 19%) because of its inclusion of the effects of higher a levels at λs below and above 550 nm (Fig. 3b). The relative magnitudes of c(550) vs. K d(550), a(550) vs. c(550), and b p(550) vs. a(550) were evaluated in a ratio format to indicate their relative importance in influencing Z SD observations and predictions, in the context of the radiative transfer equation (Eq. (1)), and to depict variability in the relationships. Of the two denominator components of the expression, c(550) was substantially greater than K d(550). The average of the ratio c(550)/K d(550) for the 16 paired observations of 2013 (Fig. 3e) was 6.1, with a cv = 38%. This distribution remained similar when the population of K d(550) was increased with estimates from the larger population of K o(PAR) observations (Fig. 3d). The dominance of b p in the denominator of the expression is established by (1) the small contribution of a(550) to c(550) (<10%, on average; Fig. 3f), (2) the large values of the b p(550)/a(550) ratio (Fig. 3g), and (3) by the substantial exceedance of K d(550) by c(550) (Fig. 3e). Moreover, the broad distribution of the b p(550)/a(550) establishes that the variations in these attenuation processes were not always strongly coupled (Fig. 3g).

3.2 Temporal and Spatial Observations: Examples, 2013

Time series of daily average flows at the mouth of Fall Creek (Q; Fig. 4a), in-lake paired measurements of OAC metrics (Fig. 4b–e), the IOP of c(550) (Fig. 4f), and the AOPs of K d(550) (Fig. 4g) and Z SD (Fig. 4h), are presented for the shelf and a pelagic site (nos. 2 and 3, Fig. 2) for the study interval of 2013. Near spatial uniformity was observed among the pelagic sites (not shown). Temporal averages and variability metrics (vertical bar is 1 standard deviation, cv included as adjoining numbers) are presented in the attached bar charts. Values of K d(550) and c(550) for days without direct measurements were estimated with their strong respective surrogates of K o(PAR) (Fig. 3d) and c c(660). The total tributary inflow rate for Fall Creek for the study interval of 2013 ranked 33rd highest for the 89-year record. This consisted of the low Q levels of April and May (15th lowest total of record for those 2 months), combined with high levels (three major runoff events) in July and August (6th highest total of record; Fig. 4a). The short-term perturbations of major runoff events for the shelf were demonstrated by the off-scale observations for the monitoring of July 3. Shelf averages and variability are shown with (2w) and without (2wo) the observations of that day.

Fig. 4
figure 4

Time series for the upper waters of Cayuga Lake for sites 2 and 3 for the April–October interval of 2013. a Fall Creek flow (daily). b PAVm. c [ISPM]. d [Chl]. e [POC]. f c(550). g K d(550). h Z SD, with inset of average and variability (±1 std. elev.) for monitored 10 years. Arrows with corresponding numbers of parameter observations of July 3 (runoff event), for b, c, e, and f. Attached (right side) bar charts present averages for the two sites, ±1 std. dev., and cv values for 2013; x-axis 2w includes all observations for site 2, including July 3, 2wo for site 2 without July 3, and 3 is all observations for site 3. Mean values within arrowheads of 2w bars for PAVm and [ISPM]

All of the metrics of OACs varied substantially over the 2013 study interval, though not as extreme as observed for July 3 for the minerogenic particle metrics that are substantially influenced by runoff events. Conspicuous increases were resolved for PAVm (Fig. 4b) and [ISPM] (Fig. 4c) on the shelf, linked to the runoff event of early July. These metrics were also distinctly higher on the shelf in early spring (Fig. 4a–c). The coupling of pelagic increases to the summer events was delayed and less conspicuous, and temporal variations were diminished compared to the near-shore observations in general (Fig. 4b, c). Even without July 3 conditions, average values of PAVm and [ISPM] were greater on the shelf (Fig. 4). The paired observations of PAVm and [ISPM], with the major runoff event shelf values deleted, were not strongly correlated (r = 0.49). Several [ISPM] samples from the pelagic site with low values had detection limit problems. By comparison, the [Chl] biomass metric demonstrated no clear linkage to the July 3 event on the shelf (Fig. 4d), while a modest (relative to PAVm and [ISPM]), likely terrigenous origin, enrichment was observed for [POC] on the shelf following the summer runoff events (Fig. 4e). The two metrics for the biomass OAC were reasonably correlated (r = 0.69) for the April–June interval but poorly (r = 0.41) for the entire 7 months. The ratio [Chl]/[POC] varied over the interval by a factor of 9 (Peng and Effler 2016a).

Rich variations in the AOPs and c(550) were also observed, with K d(550) (Fig. 4g) and c(550) (Fig. 4f) higher and Z SD (Fig. 4h) lower on the shelf, on average, even for the case of not considering the observations of July 3. Conspicuous responses to the July 3 runoff event were again observed. Relationships between these optical measures and the OAC metrics are described subsequently. The general seasonality and interannual variability in Z SD, presented as the average ± 1 standard deviation for the 10 years of measurements (inset of Fig. 4h), has had certain recurring features of higher average values and interannual variations in spring and fall with minima and reduced variability in August. However, the spring 2013 values were the highest of the 10-year record, with other features of the seasonal Z SD pattern in that year more recurring (Fig. 4h). Recall that tributary flow was unusually low in that spring.

3.3 Empirical Analyses for OAC(S)-Z SD −1 Relationships

The empirical analysis framework(s) adopted here has a level of consistency with theory (Eq. (1)), as inverse dependencies of Z SD (as Z SD −1; e.g., Effler et al. 2008) on OACs have been adopted. Qualitative review of the complete data set in the common scatter plot format for Z SD −1 vs. [Chl] (Fig. 5a) identifies a challenging feature common for many lakes, a weak relationship caused by a relatively small number of very low Z SD observations (here ≤2 m), not associated with extremely high [Chl] values. These extreme Z SD observations (n ∼20; Fig. 5a) instead were associated with very high minerogenic particle (PAVm) levels (Fig. 5b) delivered by the major runoff events, with all but two from the shelf. PAVm explained well these extreme conditions (r 2 = 0.87) that represented only ∼6% of the observations. This small percentage of the observations (Z SD ≤ 2 m) was not included in the linear regression analyses of OAC-Z SD relationships here (Fig. 5c; Table 4) because these extremes would bias the statistical analyses. Instead, the empirical analyses presented here addressed the other 95% of the observations that did not immediately follow major runoff events. The regression analysis results presented here (Table 4) reflect differences in (1) the availability of the various OAC metric measurements, (2) lake position(s) considered, and (3) the application of single or multiple (two OACs) linear regression approaches.

Fig. 5
figure 5

Linear regression analyses of the dependence of Z SD −1 on OAC metrics in Cayuga Lakes. a Single driver of [Chl] for the entire record. b Very high PAVm (≥0.5 m−1) values of long-term studies. c Single driver of [Chl] but with “outlier” cases of high PAVm removed. d Two drivers for 2013 of paired [Chl] and PAVm, from multiple linear regression. e Single driver of POC in 2013. f Two drivers for 2013 of paired POC and PAVm, from multiple linear regression. n number of observation, R 2 description of variability

Table 4 Regression analysis results describing the dependence of Z SD in Cayuga Lake, NY, on phytoplankton biomass and minerogenic particles, according to two metrics of each of the two OACs, observations of Z SD less than 2 m not considered

Weaker dependencies of Z SD −1 on the OAC metrics of [Chl] (Fig. 5c; cases 1, 3, and 5, Table 4) or PAVm (cases 2, 4, and 6, Table 4) prevailed individually, over the long-term program. However, the variations in Z SD −1 were much better explained by considering the combined effects of these two OAC metrics through multiple linear regression (Fig. 5d; cases 15, 16, and 17, Table 4), indicating the noteworthy contribution of minerogenic particles beyond the short-term post major runoff event intervals (Fig. 5b). Graphically, results of multiple linear regression are shown as predictions (on the x-axis) vs. observations (on the y-axis; Fig. 5d, f). The most striking difference for two metrics of the same OAC (e.g., biomass) was the superior performance of [POC] (Fig. 5e), compared to [Chl] (Fig. 5c), manifested in the single metric regressions (cases 7 and 9; Table 4). Note that [POC] alone in 2013 (Fig. 5e) performed better than the combination of [Chl] and [PAVm] for multiple years (Fig. 5d), indicative of the degree of superiority of [POC] compared to [Chl]. However, the benefit of including the effects of minerogenic particles in influencing Z SD dynamics was substantial when combined with the biomass metrics (Fig. 5d, f), even for the case of not including the very high PAVm levels following the major runoff events. Inclusion of PAVm, with a biomass metric, demonstrated improved performance in explaining variations in Z SD −1 in 2013, even with the exclusion of the major runoff events, in the multiple regression format (cases 18–25, Table 4). For example, paired [POC] and PAVm observations explained 65% of the variations in Z SD −1 in 2013 (Fig. 5f).

3.4 Mechanistic Model Performance

3.4.1 Link 1 and Link 2 Components

Performance of the mechanistic model was evaluated based on the extent of closure of calculated values (Table 3) with bulk measurements. This included separately, within the individual two model links (link 1, OACs → IOP; link 2, IOP → Z SD), and together across these coupled links to predict Z SD from OAC values (OACs → IOP → Z SD; Fig. 1). Testing of link 1 focused solely on the extent of closure of the summation of the estimated b p components with the bulk (i.e., instruments (Table 2)) measurements of b p, given the only minor contribution of a(550) to c(550) (Fig. 3f) and the great exceedance of K d(550) by c(550) (Fig. 3e). The performance for closure for b p has been presented in a ratio format ((b m + b o)/b p; Fig. 6a). In related Cayuga Lake studies, such performance presentations have been made at different time steps, such as patterns within individual years, on an annual basis, and for portions of the total multi-year record, and spatially by comparison of pelagic waters to the shelf (Effler and Peng 2014; Peng and Effler 2015, 2016a). The distributions of the ratio presented here (Fig. 6a) have two origins within the overall data set: (1) for the entire record (10 years) of paired [Chl] and PAVm observations and (2) for the [POC] and PAVm combination in 2013. The distribution that utilized [Chl] for b o (Fig. 6a) demonstrated a reasonably good approach toward an average value of 1 (e.g., closure), with a cv = 47%, consistent with the performance for other lakes (Peng and Effler 2015) and earlier less complete analyses for Cayuga Lake (Effler and Peng 2014). The alternate closure case, for the combination of [POC] and PAVm data of 2013, was better (e.g., cv = 35%), providing mechanistic support for [POC] being favored over [Chl] for implementation of the Z SD model.

Fig. 6
figure 6

Performance and an application of link 1, (OAC-IOP) of the mechanistic model. a Distribution of the closure of forward estimates of b p (=b m + b o) with bulk measurements [(b m + b o)/b p], for 10 years of paired [Chl] and PAVm measurements, and 2013 paired measurements of [POC] and PAVm. bd Calculated average seasonality for 10 years for b p components of b te, b ca, and b o, from application of the [Chl]-PAVm version of the model, with bounds of ±1 std. dev. Inset of monthly average Q and variability for Fall Creek presented in b

The tested link 1 of the two-link model was applied to portray the general magnitudes of the components of b p in pelagic waters (Fig. 6b–d) with respect to their average seasonalities and interannual variations (±1 standard deviations), based on calculations (Table 3) for the 10 years with the paired PAVm and [Chl] observations. Three components were considered, b te (Fig. 6b), b ca (Fig. 6c), and b o (Fig. 6d), with b te and b ca being subcomponents of b m (=b te + b ca; Table 3). The monthly average Fall Creek flows, with ±1 standard deviation, were included (inset to Fig. 6b) to qualitatively describe the seasonality of tributary inflow (i.e., external loading-driven increases in b te; Peng and Effler 2015) and its variability. A dependence of b te on Q (Fig. 6b) is indicated by their shared decreases in average values and variability with the approach to mid-summer; the large variability in b te was associated with the effects of major runoff events in the 10-year record (Peng and Effler 2015). In contrast, b ca has been limited to late summer (Fig. 6c). During the peaks of these whiting (calcite precipitation) events, b ca has exceeded b te, though it usually remains less than b o (Fig. 6d; Peng and Effler 2016b), and has usually coincided with the seasonal minimum in Z SD (Fig. 4h, inset). The parameter b o exhibited the least relative interannual variability of the three scattering components. Progressive increases in b o from late April through early June have usually been observed (Fig. 6d), with another increase in mid-summer that has usually preceded the whiting event. The generally recurring mid-summer increase in b o (Fig. 6d), together with a whiting event (Fig. 6c), was responsible for a broad July and August maximum in b p, that coincided with, and caused (Fig. 1), the minimum in Z SD (Fig. 4h, inset).

A testing of link 2 of the mechanistic model (Fig. 1) elucidated the extent to which related bulk instrumentation measurements closed with Z SD observations. It was guided by the elements of the chosen radiative transfer expression (Eq. (1), Fig. 1), integrating a series of empirical analyses that depict the effects of a progression of the inclusion of the elements that lead to the overall theory-based equation. This progression, moving top to bottom in columns of the figure, with the bottom most representative of the theoretical expression (Eq. (1); Fig. 7d–k), satisfies a goal of identifying the relative importance of the elements of the expression. Conveniently, the top row of the multiple columns of Fig. 7, the single dependence of Z SD −1 on K d(550) or K o(PAR), is representative of the alternate radiative transfer expression of Lee et al. (2015). The multiple columns (Fig. 7) correspond to cases of differences in supporting information. On the far left column, the dependencies of Z SD observations are on measurements made at the preferred λ of 550 nm, including K d(550) (Fig. 7a), b p(550) (Fig. 7b), c(550) (Fig. 7c), and [c(550) + K d(550)] (Fig. 7d), based necessarily on the 2013 measurements (Table 2). The Z SD vs. [c(550) + K d(550)] plot (Fig. 7d) represents the most complete test of link 2 of the model (Fig. 1), within the appropriate λ constraint (λ = 550 nm, Fig. 3a). The second column of analyses (Fig. 7e–h) reflects alternate information for the same year focused on λ = 660 nm, the λ of c s measurements, most often used in contemporary lacustrine studies. In this case, K d(550) was replaced by K o(PAR), b p(550) by b p(660), and c(550) by c(660). All b p and c data in these analyses of the second column (Fig. 7) were from the ac-s instrument. The third column (Fig. 7i–k) represents results from longer term monitoring (1998–2006), based only on the yet more often available measurements for K o(PAR) and c s(660) for lake studies.

Fig. 7
figure 7

Performance of link 2 [IOP/K d(550) − Z SD] of the mechanistic model, with presentation layout of vertical column of plots consistent with structure of Preisendorfer (1986) radiative transfer expression, and all plots with Z SD y-axis. a Z SD vs. K d(550)−1, in 2013. b Z SD vs. b p(550)−1, in 2013. c Z SD vs. c(550)−1, 2013. d Z SD vs. [c(550) + K d(550)]−1, in 2013. e Z SD vs. K o(PAR)−1, in 2013. f Z SD vs. b p(660)−1, in 2013. g Z SD vs. c(660)−1, in 2013. h Z SD vs. [c(660) + K o(PAR)]−1. i Z SD vs. K o(PAR)−1, for 1998–2006. j Z SD vs. c(660)−1, for 1998–2006. k Z SD vs. [c(660) + K o(PAR)]−1, for 1998–2006

While relationships between Z SD (y-axis) and the other AOP(s) (diffuse attenuation coefficients; K d(550) and K o(PAR)) were significant (Fig. 7a–i), these were not as strong as those that included the effects of b p (i.e., Fig. 7 all plots except a, e, i). The dependencies of Z SD on b p(550)−1 (Fig. 7b) and c(550)−1 (Fig. 7c) were substantially stronger than for Z SD-K d(550)−1 (Fig. 7a) in 2013, favoring the chosen radiative transfer expression over that of Lee et al. (2015) for that data set. The dominance of b p was indicated by the lack of noteworthy improvements in performance by considering c(550)−1 (Fig. 7c) or the summation c(550)−1 + K d(550)−1 (Fig. 7d) instead. These last two cases are consistent with the observed dominance of c(550) by b p(550) (Fig. 3f), and c(550) being much greater than K d(550) (Fig. 3e), because b p(550) ≫ a(550) prevailed (Fig. 3g). The general comparative performances of the elements of the radiative transfer expression remained similar for λ = 660 nm (Fig. 7e–h vs. Fig. 7a–d) that continued to support the dominant role of b p and its relatively low dependence on λ (Fig. 3c). Similar behavior was also manifested in the earlier (1998–2006), and much larger, c(660) and K o(PAR) data sets, though the strengths were somewhat diminished (Fig. 7i–k).

3.4.2 Overall, OACs → Z SD

The organization of the presentation of the performance for the overall (two links) mechanistic model (Fig. 1) reflects two alternatives in both (1) model complexity and (2) metrics of phytoplankton biomass, [Chl] or [POC]. The simplest of the two model frameworks has Z SD inversely dependent solely on b p(660) predictions, conceptually supported by the independent testing of the second link (Fig. 7). The more complex version has Z SD instead inversely dependent on [c(550) + K d(550)], more consistent with the adopted radiative transfer expression (Eq. (1), Fig. 1). The drivers of both model alternatives are paired predictions of the two particulate components of b p, as calculated from the metrics for phytoplankton biomass (b o) and minerogenic particles (b m). The added needs of the more complex framework, with λ = 550 nm, are paired estimates of a(550) and K d(550), driven empirically here (Fig. 8a) by b p(660) observations, based on the more complete optical measurements of 2013. The paired estimates of [c(550) + K d(550)] developed here were based on relationships with measured b p(660), made in three serial steps (Fig. 8a), estimating (1) b p(550) from b p(660), (2) a(550) from b p(550), and (3) K d(550) and c(550) from a(550). The most variable of these three relationships was between a(550) and b p(550) (Fig. 8a); the ratio a(550)/b p(550) had a cv = 34%.

Fig. 8
figure 8

Protocols for mechanistic Z SD model inputs and model coefficients. a Layout of protocol to specify the more complex Z SD model framework inputs for the radiative transfer expression based on b p(660) inputs, including the estimates of b p(550), a(550), K d(550), and [c(550) + K d(550)], with supporting relationships shown. b Distribution of the less complex model coefficient values of Γ′. c Distribution of the more complex model coefficient values of Γ

The approach adopted for the more complex of the two frameworks (Fig. 8a) enabled the same set of driver inputs for each, the metrics for the two particulate OACs. Distribution plots of the ratios of Z SD observations to b p(660) (Fig. 8b) and Z SD to [c(550) + K d(550)] (Fig. 8c) predictions both demonstrate a supporting centrality in performance. The averages of the normally distributed ratio populations are representations of the model coefficients, Γ′, for the b p(660) version, and Γ (Fig. 1) for the more complex (Fig. 8a) form. Accordingly, the two mechanistic models share a simple type of equation format for link 2 (Fig. 1); for the simple framework alternative, it is

$$ {Z}_{\mathrm{SD}}={\Gamma}^{\prime }/{b}_{\mathrm{p}}(660) $$
(10)

and for the more complex, it is

$$ {Z}_{\mathrm{SD}}=\Gamma /\left[ c(550) + {\mathrm{K}}_{\mathrm{d}}(550)\right] $$
(11)

Differences in the temporal availability of the two phytoplankton metrics were convenient in setting the time intervals of model testing (Chapra 1997), the intervals of calibration (initial testing) and validation (testing the calibrated model for a different set of inputs). Calibration addressed performance in 2013, the year of most intensive monitoring and paired measurements of [Chl] and [POC] that also provides comparative performance of these two biomass metrics for b o. The validation addresses performance over the rich historic [Chl] data set, supporting a challenging long-term (9 years) comparison with calibration based on that metric. The same mean Γ′ and Γ values (Fig. 8b, c) adopted for [Chl]-based calibration were retained for validation testing. Paired PAVm data for the entire testing period (Effler and Peng 2014; Peng and Effler 2016a) evenly supported the effects of b m estimates for the two testing intervals.

The two levels of complexity yielded essentially equal performance because both rely entirely on the predictions of b p(660) (Fig. 8a). The more complex framework was included here because it offers the possible future expansion of direct incorporation of mechanistic a (and thereby c and K d; Kirk 2011) predictions (less important than b p, Fig. 3g), that would be yet more complex and demanding for supporting optical information. A first sensitivity analysis was conducted on the effects of the simple three-step set of serial empirical adjustments conducted to convert the b p(660) predictions to corresponding values of [c(550) + K d(550)], the intermediate level of complexity adopted here (Fig. 8a). For this analysis, the empirically based estimates of the denominator were replaced by the direct measurements of c(550) and K d(550) available for portions of 2013. The analysis resolves the contribution of the empirical b p-driven protocol (Fig. 8a) to the uncertainty of the model predictions of Z SD and serves as an indicator of potential benefit of future, even greater, complexity (e.g., direct simulation of a).

Performances of the two mechanistic model frameworks and the sensitivity analysis are presented in two graphical formats (Fig. 9) and according to multiple summary statistics (Table 5). The predictions of Z SD (Z SD/p) are compared to observations (Z SD/o) in scatter and ratio (observations/predictions) plots according to the designated model frameworks and the sensitivity analysis. The three rows of the scatter plots correspond to the [Chl]-based calibration year (2013; Fig. 9a, d, g), the [POC]-based calibration case (2013; Fig. 9b–h), and the longer term [Chl]-based (1998–2006; Fig. 9c, f) validation case (i.e., no [POC]-based validation), moving from the top downward. The columns, moving from the left, represent the simplest b p(660)-only framework, next the more complex [c(550) + K d(550)] framework (described in Fig. 8a), and, on the right, the sensitivity analysis for added measurements to independently specify inputs for c(550) and K d(550) through measurements rather than the adopted estimates, for the more complex [c(550) + K d(550)] framework. The use of the ratio distribution plot format facilitated the comparisons of model prediction variability relative to Z SD observations, for the different cases of testing for both the simpler b p(660) model (Fig. 9i, j) and the more complex framework, including the sensitivity analysis (Fig. 9k, l). The predictions for the sensitivity case are designated Z SD/p-s (Fig. 9g, h).

Fig. 9
figure 9

Performance of Z SD mechanistic model(s). ac For the simpler b p(660)-driven version, based on a [Chl] in 2013 (calibration), b [POC] in 2013 (calibration), and c [Chl] for 1998–2006 (validation). df For the more complex (Fig. 8a) [c(550) + K d(550)]-driven version, based on d [Chl] in 2013 (calibration), e [POC] in 2013 (calibration), and f [Chl] for 1998–2006 (validation). g, h A form of sensitivity analyses, using the added measurements for inputs of a(550) and K d(550) in 2013, g [Chl] in 2013, and h [POC] in 2013. il Performance comparisons of i [Chl] vs. [POC], j [Chl] calibration vs. validation, k sensitivity based on [Chl], and l sensitivity based on [POC]. Performance plots present observed vs. predicted Z SD for ah and distribution comparisons of il. Paired PAVm data available and used for all model predictions. Z SD performance evaluation, Z SD/o observed Z SD by model, Z SD/p predicted Z SD by model, Z SD/p-s predicted for sensitivity analysis, with complex framework and direct measurements of a(550) and K d(550)

Table 5 Z SD model performance summary

Reasonably good closure between the two link (Fig. 1) predictions of Z SD (Z SD/p) and observations (Z SD/o) were demonstrated for the different model frameworks and the two cases of phytoplankton biomass metrics (Fig. 9, Table 5). Reasonably strong linear relationships in Z SD/o vs. Z SD/p, with approaches toward the value of 1.0 for the ratio Z SD/o/Z SD/p (1.0 to 1.16), and reasonably limited variability of the ratio (cv ≤ 32%) were observed for the various cases (Table 5). The long-term applicability and uniformity in performance was supported by the essential equivalence in calibration (Fig. 9i) and validation (Fig. 9j; limited to [Chl]-based b o and parallel PAVm-based b m values) testing results (Table 5). Moreover, the differences in performance of the two mechanistic frameworks of the model considered were small (Fig. 9k, Table 5), consistent with the linearities of the added optical properties driven by the b p(660)-driven estimates (Fig. 8a) for the more complex option. However, there were two noteworthy differences between the cases tested. The more modest case was the small improvement, as reduced variability in the [POC]-driven distribution of Z SD/p compared to Z SD/p-s (Fig. 9l, Table 5), for the sensitivity analysis case of direct integration of c(550) and K d(550) paired measurements, instead of the b p-based estimator protocol used in our complex framework alternative (Fig. 8a). The more compelling difference, of broader contemporary importance, is the major improvement in model performance with the availability of [POC] as the biomass metric instead of [Chl] (Fig. 9i, Table 5).

3.5 Select Model Applications

The successfully tested model is preliminarily applied to the long-term Cayuga Lake record considered here to make projections in shifts in Z SD that would be expected in response to changes in the b o and b m drivers. These drivers correspond to phytoplankton biomass and minerogenic particles and are of management interest for this lake related to the outcomes of external loading, respectively, of (1) nutrients (and resulting in-lake primary production; Effler et al. 2010) and (2) inorganic sediment (Peng and Effler 2015). Accordingly, these are both of interest for management of contributing watersheds in many lakes, including Cayuga Lake (Effler et al. 2010; Peng and Effler 2015). These demonstrations of model application are broad in character, particularly on a timing basis; e.g., shorter term dynamics, such as seasonal or responses to runoff events, could instead be targets. Responses to scenarios are presented as comparisons of predicted distributions for the hypothetical cases to those obtained in model testing of observations that addressed the observed long-term paired (10 years) records of the optical drivers of [Chl] and PAVm and the Z SD.

Predictions are presented for three cases: (1) a reasonable range of changes in [Chl] and thereby b o; (2) prevailing contributions of minerogenic particles (PAVm and thereby b m), at pelagic vs. shelf sites, vs. their absence; and (3) the late summer contribution (mid-July through August; Peng and Effler 2016b) of calcite. Rather broad distributions in Z SD were maintained for the model scenarios, consistent with the specifications of the drivers for the addressed cases. The shifts in the distributions of the prevailing Z SD (average = 4.4 m) populations for the ±20% changes in [Chl] were similar in character (Fig. 10a), demonstrating ∼10% increase in average Z SD for the 20% decrease in [Chl] and ∼10% decrease for the 20% increase in this biomass metric. This range of [Chl] change is not a formally agreed upon set of management limits but is reasonable, given the prevailing external phosphorus loading conditions of the lake (Prestigiacomo et al. 2016).

Fig. 10
figure 10

Selected applications of tested mechanistic Z SD model for Cayuga Lake shown as comparisons of distributions of predictions for scenarios vs. calibration. a ±20 of observed [Chl]. b Elimination of PAVm in pelagic waters. c Elimination of PAVm on shelf. d Elimination of calcite in late summer

The PAVm-based scenarios are intended to generally inform concerning the prevailing relative role of minerogenic particles in influencing Z SD in pelagic (Fig. 10b) and near-shore (Fig. 10c) waters. Even in the pelagic waters of the lake, the b m of minerogenic particles diminishes clarity substantially; in their absence, the average Z SD would have been more than 1.5 m (∼35%) greater (Fig. 10b). The average value of Z SD would have been ∼2.5 m (∼42%) greater on the shelf (Fig. 10c) because of the higher average PAVm levels compared to the pelagic waters. The extent to which PAVm lake levels have increased (e.g., since before the late 1990s) from anthropogenic effects and are subject to reduction remains unresolved for this lake (Peng and Effler 2015; Gelda et al. 2016). The nearly equal Z SD levels predicted for pelagic and near-shore waters for the no-PAVm scenarios (Fig. 10b, c) reflects their very similar [Chl] levels. The SAX-Mie protocol capabilities enable the resolution of the calcite fraction of PAVm and b p (b ca). In the absence of this internally produced b m component, the average Z SD would have been ∼15% higher on average in late summer in the lake (Fig. 10d). This shifts higher (∼25%) when the average annual maximum b ca is considered. The extent to which this temporally limited (Fig. 6c) calcite precipitation process is influenced by anthropogenic activity is generally not known (Peng and Effler 2016b).

4 Discussion

4.1 Advancements in Mechanistic Z SD Modeling: Supporting Components

Given the great contemporary concern for the clarity/transparency attribute among the features of lacustrine water quality (Davies-Colley et al. 2003; Cooke et al. 2005), theory-based simulation capabilities for Z SD have certainly trailed those for other important parameters (e.g., trophic state metrics) such as nutrients and phytoplankton biomass (Chapra 1997; Arhonditsis et al. 2004; Robson 2014). The primary contributions of this paper have been the advancement, testing, and application of a first theoretically sound mechanistic Z SD model for lacustrine waters (Fig. 1). The advancements in Z SD modeling reported here, dependent in part on recent advancement in both laboratory and field instrumentation, are now consistent with the levels of theoretical representation embedded in mechanistic frameworks for a number of other priority water quality parameters (Chapra 1997; Robson 2014). Recognition of the two serially coupled links that quantify the effects of OACs on IOPs (particularly b p) and, in turn, the dependence of Z SD on the affected IOP (s) was critical for the theoretical framework of the mechanistic model (Fig. 1). Such a conceptual framework was presented qualitatively earlier (Davies-Colley et al. 2003) but had not previously been quantitatively implemented, either independently or within a larger water quality model. The progress in mechanistic Z SD modeling presented here has taken advantage of recent advances in measurements and interpretation, implemented through a robust optical data set for Cayuga Lake. This supported quantification of b p through its two components, b o and b m (Effler and Peng 2014; Peng and Effler 2016a), and its integration into the theory-based predictions of Z SD (Fig. 9).

The compelling case made here for the regulation of Z SD in Cayuga Lake primarily by b p (Figs. 7 and 9) was supported also, albeit less completely, in earlier work for other lakes (Davies-Colley et al. 2003; Effler et al. 2008; Peng and Effler 2011). This case prevails widely (Davies-Colley 2003). Moreover, the two-component representation for b p (=b o + b m) has supported published closure with bulk b p measurements for a number (>20) of lakes (Peng and Effler 2016b), as also demonstrated here for Cayuga Lake (Fig. 6a). These successes for b p (link 1, Fig. 1) enable the meaningful extension (next step, link 2, Fig. 1) to predictions of Z SD with a mechanistic model and resolution of the contributions of various particulate groups to this clarity metric. Representation of only one component/constituent of b p in a Z SD model would rarely be adequate across the range of conditions experienced by most lacustrine systems and would have limited management value (Peng and Effler 2016b). Moreover, this paper has evaluated the important effect of the respective metrics selected to support b o and b m estimates, and thereby the predicted Z SD. The strongly recommended metrics are [POC] and [PAVm], for b o and b m, respectively, though [Chl] is adequate, but apparently accompanied by diminished performance.

The empirical expressions to estimate b o, developed and originally tested for the open oceans based on [Chl] (e.g., Huot et al. 2008) or [POC] (e.g., Cetinic′ et al. 2012), both support reasonable degrees of closure for b p in lakes (Peng et al. 2016a) and Z SD in Cayuga Lake (Fig. 9). However, the superior performance of [POC] compared to [Chl] in Cayuga Lake (Fig. 9, Table 5) is expected to be broadly the case for lacustrine systems. The Chl content of phytoplankton cells is known to vary more than POC in both marine (Fennel and Boss 2003; Stramski et al. 2008) and lacustrine (Reynolds 2006) waters, because of its dependence on community composition as well as ambient conditions (Behrenfield and Boss 2006). Our Z SD model-based recommendation is to include [POC] measurements in monitoring concerned with Z SD (the [Chl] record is too large to discontinue). Beyond providing the preferred [POC], implementation of this recommendation will more completely establish their superior comparative performance. The case of absence of [POC] data, with [Chl] the only available metric, should not be considered fatal for the goal of modeling Z SD but would be limiting with respect to the level of certainty that could be achieved (Fig. 9, Table 5). A morphometric alternative for phytoplankton-dominated b o (such as for PAVm-based b m) is not a reasonable goal, given the complexities of great variations in size, shape, and composition of natural communities. Empirical relationships (e.g., Eqs. (7) and (8), Table 3) will continue to be the basis to estimate b o within (link 1) a Z SD model.

This advancement in Z SD modeling has relied greatly on the recent progress in quantifying b m and the resolution of contributions of different minerals, provided by SAX and the SAX-Mie approach (Peng and Effler 2015). The estimation of b m values from particle characterizations by SAX represents a theory-consistent mechanistic element of the overall Z SD model. Its chemical characterization capability has enabled the partitioning of b m into allochthonous (e.g., b te) and autochthonous (e.g., b ca) components, that provides insights on origins, and their contributions to Z SD. The large variations in b te observed in time and space in Cayuga Lake are linked to effects of land use and runoff events (Figs. 4f and 6a; Effler and Peng 2014), that have noteworthy influences on Z SD patterns (Fig. 4h), and are thereby of management concern. This is the case for many lacustrine systems (Peng and Effler 2015). The b te component of b m influences the entire lake, but particularly the near-shore zone proximate to inflows (Figs. 4h and 10b, c). Long-term systematic increases in b te and decreases in Z SD are expected for many lakes based on predicted regional climate changes of increases in occurrences and severity of runoff (National Oceanic and Atmospheric Administration 2013). The duration of noteworthy b ca levels is presently relatively brief in Cayuga Lake (late summer Fig. 6c), with substantial interannual variations that are largely unexplained (Peng and Effler 2016b). However, the calcite “whiting events” contribute to the relatively low Z SD levels in late summer in Cayuga Lake (Fig. 10d) and other hardwater lakes where the phenomenon occurs (Peng and Effler 2016b). Accordingly, the SAX resolution capabilities for minerogenic particles provide critical input in quantifying their influence on b m and thereby Z SD and supporting related management considerations. Moreover, the credibility of the SAX-Mie estimates of b m has been established through extensive successful closure testing over a decade for more than 20 lacustrine systems of broad type, as recently reviewed (Peng and Effler 2015, 2016b).

The divergence in suitability of [ISPM] from PAVm as the OAC metric for minerogenic particles to support estimates of b m is far greater than [Chl] vs. [POC] to estimate b o. [ISPM] is not a legitimate alternative to support estimates of b m and thereby is an inappropriate basis to simulate Z SD where this b p component is noteworthy. [ISPM] is dismissed here based not only on study-specific shortcomings (e.g., flawed with respect to precision, PAVm relationships, empirical performance (Table 4)) but also those widely recognized in the literature, including (Peng and Effler 2015, 2016b) (1) [ISPM] provides an imperfect partitioning of inorganic vs. organic suspended particulate material; (2) relatively low concentrations in many lakes offer detection limit challenges; (3) variable inclusion of the diatom phytoplankton group (silica-rich frustules) within [ISPM]; (4) omission of smaller particle sizes from [ISPM] that can make noteworthy contributions to b p; (5) mass concentration, compared to surface area, has an inferior relationship with b p; (6) no partitioning of contributions of particle size or chemical classes (e.g., clays vs. calcite) critical to PAVm and b p is available; and (7) [ISPM] underrepresents the effects of calcite (whiting events) on b p in hard water lakes because it occurs as a thin coating of phytoplankton. While [ISPM] data cannot be recommended for Z SD model initiatives, it may have some utility for discontinued, or even ongoing, monitoring programs to identify the potential importance of b m. The following is a first approximation expression for the range of PAVm based on [ISPM], developed from paired observations.

$$ {\mathrm{PAV}}_{\mathrm{m}}\left(\mathrm{range}\right)\sim 0.1\ \mathrm{to}\ 0.3\times \left[\mathrm{ISPM}\right] $$
(12)

The field instrumentation measurements integrated into this study (Table 2) were unusually robust for lacustrine programs and invaluable in supporting development and testing of the model. A broadening of availability is emerging for lacustrine studies. The parameter c, usually for λ = 660 nm, has been measured commonly in marine waters for decades and long recognized as a surrogate of b p(660) (e.g., Babin et al. 2003). These measurements, critical to testing for closure with component-based estimates of b p(660) (=b m(660) + b o(660)), have now become reasonably common in lacustrine waters (Davies-Colley et al. 2003), and thereby available for system-specific testing of features (both links 1 and 2, Fig. 1) of the Z SD model. The more rare measurements of irradiance and other IOPs, made spectrally here (Table 2), enabled yet more complex related characterizations (Fig. 3a–c) to further support system-specific model development and testing (Figs. 7, 8, and 9). A number of the relationships resolved here for optical properties in this lake, and thereby the Z SD model and its coefficients, are expected to be transferable, eliminating the need for spectral measurements and coefficient changes for many lakes. However, it has been important to recognize that contemporary instrumentation for bulk measurements of b p (and c) are false low because of the acceptance angle effect (Boss et al. 2009a), requiring adjustment (Table 3).

This first documentation of testing of an optical theory-based mechanistic Z SD model for lacustrine waters presented here was comprehensive, considering model framework and OAC metric alternatives, and testing the two links and the overall linked model with large paired data sets, some of them extending over a decade. The successes in simulation were achieved with uniform coefficient values; i.e., those established in the single year of calibration supported good performance for the entire record (validation; [POC]-based limited to calibration). The tested model is ready for transfer for further testing and application for other lakes.

4.2 Uncertainties in Z SD Model

While the reasonably good performance of the mechanistic model in predicting Z SD (Fig. 9) is supportive of its general credibility and utility, there are multiple sources of uncertainty that contribute to its imperfect (variable) performance (e.g., sensitivity analysis had diminished variability) that could be targeted for overall improvement. We identify the more important sources and recommend protocols and research to improve overall model performance. It is valuable to consider the addressed items within the conceptual layout for the mechanistic model (Fig. 1). Substantial attention has already been received related to the b m component because of the recent advancing research contributions based on SAX and PAVm (e.g., Effler and Peng 2014; Peng and Effler 2016b) and expanded here based on the advantages over [ISPM]. Sources of uncertainty for the Z SD model expand considerations here including the following: (1) OAC metrics, (2) SAX measurements of light scattering characteristics of minerogenic particles, (3) estimates of b o, (4) bulk measurements of b p, (5) application of Mie theory to calculate b m from SAX results (the SAX-Mie approach), (6) the Z SD measurements, (7) the radiative transfer expression adopted, and (8) model complexity.

The cases for performance preferences of [POC] over [Chl] as the biomass metric, and PAVm over [ISPM] as the minerogenic particle metric, were strongly supported by this Cayuga Lake study (above) for both b p (Fig. 6a) and Z SD (Fig. 9) prediction performance. Moreover, the cases for [POC] preference (Fennel and Boss 2003; Stramski et al. 2008; Peng and Effler 2016a), and particularly that for PAVm superiority (Peng and Effler 2015), are also supported by recent literature. Sources of uncertainty for SAX characterizations include (1) precision of analyses (∼10%; Peng et al. 2009), (2) a lying-flat effect (Jonasz 1987) for clay platelets on related PAVm estimates (∼5% adjustment used, Peng and Effler 2007), and (3) representation of particle aggregation effects (Boss et al. 2009b) on b m. Particle aggregation complicates the light scattering of involved particles (Boss et al. 2009b) and can be a prominent effect in divalent cation-enriched systems, including coastal waters (Bowers et al. 2011). However, the effect is diminished in lacustrine waters and accommodated to some extent by the protocols of the SAX-Mie approach (Effler and Peng 2014; Peng and Effler 2016b).

The general recognition of negative bias of the bulk b p observations from the acceptance angle effect of the field instrumentation has been quantitatively advanced through paired measurements with equipment with different angles, at different sites (Boss et al. 2009a), that guided the adjustment relationships used here (Eqs. (2) and (3), Table 3). These expressions fairly represent the average of the acceptance angle effect integrated here. Unfortunately, there are substantial temporal variations in the magnitude of the effect (and system-to-system differences) associated with variations in composition and particle size distributions of natural particle populations, that are not presently subject to predictive quantification (Boss et al. 2009a). For example, the ratio of c(660) from the two instruments used in 2013 (Tables 2 and 3) had substantial variability (cv = 12%). Accordingly, a noteworthy portion of the apparent variability in the degree of closure of b m + b o estimates with bulk b p is associated with uncertainty in the bulk measurements. This is encountered in the separate testing of link 1 of the model, associated with the use of bulk b p observations, but is not involved in predicting Z SD directly from the scattering estimates for the OACs (b m and b o; OACs → Z SD, combined two links of Fig. 1). Accordingly, the closure testing with bulk b p estimates (Fig. 6a) provides guidance in the success of the representation of the two scattering components, b m and b o, rather than a final rigorous test. Note that the variability in b p (Fig. 6a) closure and Z SD prediction performances, both from bulk b p (Fig. 7) and OACs directly (Fig. 9), were not greatly different despite the added sources of uncertainty for Z SD, in part because of the above uncertainty in adjusted bulk b p (c) measurements that influenced b p closure performance (link 1, only).

The SAX results serve as inputs of light scattering attributes of minerogenic particles to Mie theory calculations of b m (Peng and Effler 2007, 2016a). Natural minerogenic particle populations do not meet the theoretical assumptions of such calculations of sphericity (e.g., clay platelets; Peng and Effler 2011) and chemical homogeneity (e.g., calcite coating of phytoplankton; Peng and Effler 2016b). However, Mie theory remains the practical framework to support forward estimates of b m for natural particle populations (Effler and Peng 2014; Peng and Effler 2016b). Moreover, its use has continued to be supported by closure successes (bulk b p = b o + b m) for an increasing number of lacustrine systems (Peng and Effler 2016b). The average scattering efficiency for minerogenic particles (<Q b,m>) determined specifically for Cayuga Lake (Effler and Peng 2014), and adopted here (Table 3; <Q b,m> = 2.3), is nearly equal to determinations for other lakes (Effler and Peng 2014; Peng and Effler 2016a). Accordingly, this coefficient value to estimate b m (from PAVm), thereby within the overall Z SD model, is appropriate for use in applications of the model elsewhere. Moreover, the credibility of this coefficient is supported by its approach to the theoretical limiting value of 2 for polydispersed particle populations (Bricaud et al. 1983; Jonasz 1987).

There are uncertainties in the Z SD measurement that depends on the properties of the disk, ambient light conditions, and the surrounding light field (Tyler 1968; Preisendorfer 1986; Smith 2001). These include the size and reflectivity of the disk, sunny vs. shaded conditions, the height of viewer above the water surface, water glare and glitter (Smith 2001), altitude of the sun, turbulence of the water surface (i.e., wave action), and visual acuity of the observer. The methods adopted here are widely adopted, with a number of the potential varying influences being held uniform, including the use of the same disk, measurement off the sunny side of the boat, the height of viewing above the surface, and measurements being made over a fixed interval of the day. Influences driven by attendant meteorological conditions, including surface glare and glitter, incident light, and wave action, were variable here. This is common because of the common inclusion of Z SD in broader, often fixed frequency, monitoring programs. Accordingly, efforts to control these later conditions are not consistent with a broad utility Z SD model. Wave action, in particular, tends to diminish Z SD observations (Preisendorfer 1986); Levin and Radomyslskaya (2012) indicated that the effect approached 20%.Typical levels of Z SD precision measurement have been reported to be ≤10% across the temporal variations in influencing ambient conditions (Davies-Colley and Vant 1988; Smith 2001). The fact that this uncertainty level for Z SD measurements is lower than that of the overall model performance (Fig. 9) and contributing steps (Figs. 6 and 7) indicates that improved model performance is a reasonable future target.

The theory-based (Preisendorfer 1986) radiative transfer expression adopted to predict Z SD based on IOPs and K d (Eq. (1)) can fairly be described as having the longest and widest support from the optics community. The recent alternate expression (Lee et al. 2015) received only a limited (e.g., single lake) comparison here with parts of the data set of this study. These and other theory alternatives need to continue to be compared and evaluated in the context of proper optical understanding and modeling of the widely used Z SD metric. The composition of the adopted relationship (Fig. 1) had some performance support here (Fig. 7), but more importantly, its optics dependencies can presently be better represented through contemporary progress in OACs and b p measurements and estimates. Central to this is the dominant role of b p and evolving capabilities to partition it according to the contributions of phytoplankton biomass (b o) and minerogenic particles (b m), as demonstrated here. Within a larger water quality model, Z SD simulations could be supported by parallel predictions of a biomass metric and PAVm (Gelda et al. 2016).

There is a minor role for a in influencing Z SD in this lake, and likely many others (Davies-Colley et al. 2003; Effler et al. 2008) as represented by the radiative transfer expression (Eq. (1)). Given the other sources of uncertainty considered here, rigorous treatment of a components is not presently warranted. This position is supported by the small gain in performance demonstrated in the sensitivity analysis that compared the adopted complex framework (but b p-based; Fig. 8a) vs. direct representation of a(550) (Fig. 9). The suggested improvement in overall uncertainty in Z SD (Fig. 9l) is presently small relative to most of the other sources considered above. Moreover, this would require simulation of the components of a (a x) if such a version of the model was to be integrated into a larger water quality model, that presently is not available for lacustrine waters. However, the existing more complex version of the Z SD model (Fig. 8a) provides a framework for potential expansion of its predictive capabilities to indeed directly represent the secondary effects of a(550) dynamics. Such an expression would be justified when some of the above other sources of uncertainty in Z SD predictions are reduced, and robust independent simulation capabilities for a x are available.

4.3 Applications, Transferability, and Advancing the Model

Water clarity conditions, as measured with Z SD, are the most directly (albeit qualitatively) available to the public of the three trophic state indicators (other total phosphorus and [Chl]). The mechanistic Z SD model developed here can be operated separately (e.g., Fig. 10) or integrated into a larger (more parameters) water quality model. Our example separate applications were reasonable beginnings for the Z SD issue, depicting realistic responses of Cayuga Lake to (1) changes in phytoplankton biomass (±20%, driven primarily by terrigenous/land use inputs of nutrients; Prestigiacomo et al. 2016), (2) the present contributions of minerogenic particles and the lower near-shore Z SD caused by higher PAVm levels (Effler and Peng 2014; Fig. 10b, c), and (3) the noteworthy contribution of the whiting event (calcite precipitation; Peng and Effler 2016b) in late summer (Fig. 10d). Of course, a wider range of scenarios could be pursued for this lake and others for which the model has been tested. The separate version of the Z SD model has great utility (e.g., Fig. 10) where this water quality attribute is a primary focus, giving valuable insights concerning appropriate OAC targets, feasibility of goals, and insights concerning causes of patterns.

An interest in integrating this Z SD model into larger (i.e., multiple metrics, including the other two for trophic state) water quality models will prevail. The primary limiting factors for the addition of this mechanistic Z SD submodel will be the availability of associated required inputs and outputs (b o and b m) from the future overall model. A newly developed, tested, and published PAVm model (driven by PAVm loads), for Cayuga Lake (Gelda et al. 2016), integrated as another submodel, will provide the necessary minerogenic particle information by supporting the b m drivers of the Z SD submodel. Contemporary water quality models can already support the estimate of b o from predictions of [Chl] or [POC] (Table 3). A forthcoming larger water quality model initiative for Cayuga Lake will integrate both the separately tested PAVm (Gelda et al. 2016) and Z SD (here) mechanistic submodels. The uncertainty in Z SD predictions from the larger water quality model configuration will necessarily be greater because the biomass and PAVm drivers will also be predicted, and thereby attended by uncertainties, rather than specified by the measured values used here.

The well-tested strong Z SD model developed here is expected to be highly transferable to many other lakes, without coefficient modification. Wide application of the model for many lakes, covering a wide range of characteristics and driver levels, is recommended to demonstrate the breadth of its applicability. Substantial variations between lakes are most likely for those with the largest deviations in the wavelength of highest penetrations of visible light that may require modest shifts in model coefficients.