Introduction

Estimation of plant biomass in aquatic and marine systems by retrieval of chlorophyll a from remote sensing observations has become routine in Case 1 waters where chlorophyll a is the dominant color producing agent (CPA) present in the water column, and some consensus is being reached with regards to an appropriate universal algorithm (McClain 2009; Arnone et al. 2006; Gordon and Morel 1983; O’Reilly et al. 1998). Many of these relationships make use of the blue-green spectral region (Babin et al. 2003). The situation is more complex in turbid Case 2 waters where multiple CPAs may confound the reflectance signal (Morel and Prieur 1977; Mobley 1994; Wang et al. 2009). Bodies of water that are classified as Case 2 include coastal waters with significant inputs of suspended sediment and terrestrial organic matter, and shallow, highly stratified systems in which the distribution of phytoplankton may include or be dominated by blue-green algae or other algal taxa which contain accessory pigments that alter the ratio of water-leaving radiances at various wavelengths. Many highly productive systems also have high concentrations of colored dissolved organic matter (CDOM), degradation products produced by the senescence and death of phytoplankton or excreted by their predators after feeding. The situation is further complicated by time-dependent changes in the concentrations of these sources of bias in these dynamic environments.

The Western Basin of Lake Erie (WBLE) qualifies as body that exhibits all of these complicating factors. Productivity is among the highest seen in fresh water ecosystems, the maximum depth of the WBLE is only 11 meters, and rivers draining eight major watersheds deliver a mix of sediment, CDOM, and riverine algae to the region. Nutrient controls in Lake Erie led to significant reduction in the biomass density in the 1980s; however, in addition to the historical issues with Lake Erie water quality, in recent years, the WBLE has experienced an increase in harmful blooms of Microcystis aeruginosa, a toxic blue-green alga (Ouellette et al. 2006; NOAA 2011). It is essential to efficiently and accurately monitor changes in water quality; this region of the lake serves as an economic and social resource through the fishing and recreation industries, as well as the contribution of the lake to the drinking water supply.

While the most direct methods of monitoring water quality involve the analysis of water samples, these methods do not provide the spatial and temporal coverage needed to continuously assess water quality on basin-wide scales. Use of satellite remote sensing provides complementary information which has the potential to greatly improve understanding of variations in water quality. Satellite sensors that detect variations in spectral reflectance from the Earth’s surface in carefully selected bands of the visible spectrum have proven promising for water quality assessment in other locations. With these systems, assessment of water quality is obtained by applying algorithms that relate satellite-measured reflectance to the concentrations of specific CPAs (Martin 2004). Ground-truthing is achieved by comparing co-located satellite and in situ observations.

Various empirical and semi-analytical algorithms have been developed for estimating chlorophyll a in coastal waters (Witter et al. 2009; Gitelson et al. 2000; Schalles 2006; Dall’Olmo and Gitelson 2005; Doerffer and Schiller 2007; Moses et al. 2009; Gons 1999; Gower et al. 1999; Doerffer and Fischer 1994; Dekker et al. 1997; Dekker 1993; Gons et al. 2002; Matthews et al. 2010); however, universally applicable remote sensing algorithms for the retrieval of water constituents from Case 2 waters are not known (Arnone et al. 2006). The challenge in algorithm development in these waters comes with the overlapping magnitude of the spectral coefficients from the various CPAs and their non-linear interaction. Moreover, the complex interactions of the physical, chemical and biological process at local environments and the lack of knowledge of the bio-optical properties specific to the region results in inaccuracies and poor performance in the standard algorithms (Woodruff et al. 1999; Whitlock et al. 1981; Wong et al. 2008). Specific algorithms are thus optimized and validated due to the optical convolution observed in turbid waters coupled with complexities of satellite observations (e.g., atmospheric interferences or issues pertaining to spatial and temporal sampling) (Gitelson et al. 2007).

Recent works demonstrated that applying standard bio-optical models to the WBLE has proven challenging due to the difficulty of separating the spectral signatures of the multiple CPAs in the environment (Witter et al. 2009; Becker et al. 2009). In addition to the optical complexity of the WBLE, which creates a challenge to the application of remote sensing models, the poor performance of the standard algorithms in the lake may be attributed to the use of low pixel resolution data that can potentially cause errors related to spectral mixing and/or the use of low spectral resolution sensor data that are not optimized for detecting the CPAs, in this case specifically the chlorophyll a.

Current generation satellite sensors such as the MEdium resolution imaging spectrometer (MERIS) are equipped with radiometric sensitivities optimized for measurements of CPAs in turbid waters. These waters normally have low reflectance due to the high absorption by water itself and its associated constituents such as phytoplankton and CDOM. MERIS has high spectral and temporal resolution (10 nm and 1–3 days, respectively) and therefore, allows for better characterization of subtle variations in spectral signals associated with the various water quality parameters.

The objective of this study was to evaluate the performance of existing published visible/NIR based empirical and semi-empirical models using reflectance data from MERIS sensor and their applicability for monitoring algal biomass in the WBLE. The algorithms were applied using MERIS bands that are sensitive to chlorophyll a attenuation spectral features and include: absorption features near 490 nm, minimum absorption coupled with cellular scattering near 560 nm and the red absorption features near 665 nm, and the chlorophyll a fluorescence peak near 680 nm.

Data acquisition and methodology

Field measurements

Assessment of models that predict concentrations of biogeochemical proxies requires collocated in situ data. In the summers of 2009 and 2010, in situ data and water samples were collected from the Western Basin of Lake Erie aboard the R/V Gibraltar III or R/V Erie Monitor. These cruises were concurrent with MERIS satellite overpasses. A total of 90 observations were made by visiting 18 stations during 5 cruises over a two summer periods. Completion of each transect, which was designed to collect water samples from a variety of environments, required approximately 10 h. Sample locations were stored as waypoints in a marine GPS to verify re-occupation of the same site during each cruise. Stations were located between Sandusky Bay and Pelee Island (Fig. 1). Station locations were pre-selected to be at least 2 km away from terrestrial bodies to avoid contamination of satellite pixels by land. Water quality properties (e.g., Secchi depth, biogeochemical proxies) were measured at each station and the field campaign included collection of water samples using screw capped Nalgene bottles for laboratory analysis. A submersible HACH Hydrolab DS5X equipped with multi-parameter sonde was used to measure the chlorophyll a, and phycocyanin concentrations among other biogeochemical proxies such as water temperature, pH, conductivity, salinity, dissolved oxygen, although here we focus on the plant pigment data only. Absorbance of CDOM, which is indicative of its concentration, was measured in situ using UV CYCLOPS-7 submersible fluorometer. All of the sensor probes were calibrated using standards. The chlorophyll a sensor was calibrated by the manufacturer, HACH. Quinine sulfate was used to calibrate the Cyclops-7 CDOM fluorometer.

Fig. 1
figure 1

Bathymetric map of the Western Basin of Lake Erie and the locations of 18 sampling stations visited during multiple cruises using RV Gibraltar III and Erie Monitor. The lake interacts with terrestrial environment through fluxes of the Maumee, Sandusky, Portage, Toussaint and other rivers such as the Detroit, which drain into the north western side of the basin (not shown). The inset shows, Lake Erie as part of the Great Lakes Basin

Satellite imagery

Five Full Resolution (FR) MERIS Level-1b images were acquired for summer of 2009 and 2010 from the European Space Agency’s (ESA) data gateway. These dates were selected on the basis of the coincident field campaign for in situ data when cloud coverage conditions over the WBLE were minimal. The FR-MERIS sensor is equipped with 15 bands representing the visible/NIR spectral range with 300 m and 10 nm spatial and spectral resolutions, respectively. In this study, only bands with wavelengths between 0.4 and 0.9 μm were used. The Basic Toolbox for ENVISAT (A) ATSR and MERIS (BEAM)-4.6 visualization, analyzing and processing software developed by Brockmann-consult, Germany, was used for converting the images from Level-1b to Level-2b which includes georectification and radiometric corrections.

Digital image processing including atmospheric correction was carried out on the images to account for multiple scattering and absorption due to atmospheric constituents. Removal of atmospheric effects is crucial to reduce effects of path radiance. In turbid waters such as the WBLE, the assumption of negligible reflectance at NIR wavelength becomes invalid due to increased scattering from non-algal components. Therefore, the conventional method for image correction, the black pixel approach does not hold (Schalles 2006; Morel and Prieur 1977). Improved versions of dark pixel atmospheric corrections methods such as the NIR bands procedure (Gordon and Wang 1994), the SWIR bands procedure (Wang and Shi 2005) and the Bright pixel atmospheric correction (Aiken and Moore 2000) atmospheric corrections have been developed and assessed in turbid waters (Moses et al. 2009). These methods account for absorption of water vapor in the NIR and do not make the classical assumption of zero reflectance in the NIR. However, according to Moses et al. (2009) the water-leaving radiance is very low in the NIR region and hence very sensitive to changes in the non-uniform path radiance effects. Therefore, a stable atmospheric correction, resulting in accurate retrievals of NIR reflectance, is required to the success of the models. A more stable recently developed atmospheric correction model for MERIS data is the Case 2 regional (C2R) atmospheric correction. This model is based on a neural network procedure specifically developed for turbid waters (Doerffer and Schiller 2007). It involves a two-step procedure—(a) a forward neural network for the retrieval of water-leaving radiances and, subsequently, remote sensing reflectance from the radiances, and (b) a backward neural network for the retrieval of the inherent optical properties of water and, subsequently, the concentrations of constituents by inverting the remote sensing reflectance. The neural networks are trained based on radiances simulated by radiative transfer solutions and built to parameterize the relationships between the top-of-atmosphere radiances and the water-leaving radiances (for the forward model) and between the remote sensing reflectance and the inherent optical properties (for the backward model). For this work, the recorded radiances at 12 MERIS wavebands (in visible and NIR wavelengths) are used in the neural network based atmospheric correction.

Spectral analysis

Primary objectives of aquatic visible and near-infrared remote sensing include: identifying the optical properties of CPAs and quantifying them remotely. Radiative transfer models show the fundamental relationship that exist between remote sensing reflectance R(λ) and the inherent optical properties of waters—the backscattering coefficient [b b(λ)] and the absorption coefficient [a(λ)] i.e.,

$$R\left( \lambda \right) = \frac{f\left( \lambda \right)}{Q\left( \lambda \right)} \frac{{b_{b} \left( \lambda \right)}}{{a\left( \lambda \right) + b_{b} \left( \lambda \right)}}$$
(1)

The f and Q factor represent the geometry of the light field that affects the R(λ)

The total spectral absorption coefficient [a(λ)] in m−1, within the water body can be described by:

$$a\left( \lambda \right) = a_{\text{w}} \left( \lambda \right) + a_{\text{ph}}^{*} \left( \lambda \right)c_{\text{Chl}} + a_{\text{cdom}} \left( \lambda \right) + a_{\text{SM}}^{*} \left( \lambda \right)c_{\text{SM}}$$
(2)

where \(a_{\text{w}} \left( \lambda \right), a_{\text{ph}} \left( \lambda \right), a_{\text{cdom}} \left( \lambda \right) \,{\text{and}} \,a_{\text{SM}} \left( \lambda \right)\) are the absorption coefficients of pure water, phytoplankton, dissolved organic matter and suspended matter, respectively. \(a_{\text{bph}}^{*} , \left( \lambda \right)\) is chlorophyll-specific absorption coefficient of phytoplankton and \(a_{\text{SM}}^{*} \left( \lambda \right)\) is the specific spectral absorption coefficient of suspended matter. The spectral backscattering coefficient is another inherent optical property that affects the magnitude of the reflectance from water bodies (Gons 1999). The total backscattering coefficient, \(b_{\text{b}} (\lambda )\) with in the water body can be described by:

$$b_{\text{b}} \left( \lambda \right) = 0.5b_{\text{bw}} \left( \lambda \right) + b_{\text{bph}}^{*} \left( \lambda \right)c_{\text{Chl}} + b_{\text{cdom}} \left( \lambda \right) + b_{\text{bSM}}^{*} \left( \lambda \right)c_{\text{SM}}$$
(3)

Where \(b_{\text{bw}} \left( \lambda \right)\) is the scattering coefficient of pure water which accounts for refraction due to path transmittance between medium transitions (air–water), it is assumed that backscattering is always 50 % of total backscattering by water and therefore expressed as \(0.5{\kern 1pt} b_{\text{bw}}\) (Mobley 1994). The \(b_{\text{bph}} \left( \lambda \right)\) represents the backscattering coefficient of phytoplankton and \(b_{\text{bSM}} \left( \lambda \right)\) is the backscattering coefficient of suspended matter. The degree of scattering of dissolved organic matter is negligible and assumed to be null, \(b_{\text{cdom}} \left( \lambda \right) \approx 0\). In Eq. (3), \(b_{\text{bph}}^{*} , \left( \lambda \right)\) is chlorophyll-specific spectral backscattering coefficient of phytoplankton and \(b_{\text{bSM}}^{*} \left( \lambda \right)\) is the specific backscattering coefficient of suspended matter.

Most of the existing bio-optical models that are developed for estimating chlorophyll a in aquatic environments are based on the relationship expressed in Eq. (1) (Gordon et al. 1988; El-Alem et al. 2012). The interaction of chlorophyll a and water absorption provides important insights for the interpretation of maxima and minima in the spectral reflectance curves. In Case 2 waters were multiple CPAs exist the performance of the chlorophyll a models will depend on the extent to which the absorption and backscattering effects from non-algal components (\(a_{\text{w}} \left( \lambda \right), a_{\text{cdom}} \left( \lambda \right) {\text{and}} a_{\text{SM}} \left( \lambda \right)\) and \(b_{\text{bSM}} \left( \lambda \right)\)) are accounted for so that the attenuation (absorption and scattering) induced by the pigment can be isolated independently. In the WBLE, absorption due to CDOM attenuates chlorophyll a signals with absorption effects inversely proportional to wavelength (Kutser et al. 2005). Backscatter from TSM and absorption of water itself also influences the reflectance across the visible-infrared spectrum, thus affecting chlorophyll a induced signals (Bukata et al. 1994). Isolating the spectral windows that correlate with the apparent optical property of chlorophyll a, independent of other water constituents requires a robust spectral analysis technique. Bale et al. (1994) showed that increasing the concentrations of phytoplankton biomass or CDOM have no effect on the spectral shape of the reflectance near the NIR region, thus the optical signatures of materials. The NIR region can thus be used to monitor and account for the contribution due to suspended sediments without interference from plant pigment or their degradation products.

In the WBLE, the optical properties of the multiple CPAs overlap in the visible range, and therefore the differentiation of the signatures requires the selection of optimal bands (Ali et al. 2012). In this study models that use chlorophyll a-sensitive spectral windows were applied using MERIS data (Fig. 2).

Fig. 2
figure 2

Spectral plot of pixels representing 18 stations based on MERIS data after atmospheric correction was applied. Dashed red lines represent spectra of stations from Sandusky Bay (Stations 19 and 20). The arrows show positions of chlorophyll a attenuation features, including one due to phycocyanin at 620 nm, which is the major accessory pigment in the WBLE

This study evaluated two-band algorithms based on a regionally developed blue-green model given by Witter et al. (2009) and a NIR-red model given by Moses et al. (2009) expressed as:

$${\text{chl}}\_a_{{({\text{MERIS}})}} \propto \frac{{R_{{(\lambda_{1} )}}^{ - 1} }}{{R_{{\left( {\lambda_{2} } \right)}}^{ - 1} }}$$
(4)

These models are based on ratios of spectral regions that represent maximum absorption and minimum absorption due to chlorophyll a. The spectral regions are matched with the corresponding MERIS bands.

The second NIR-red based empirical model that was applied in the WBLE is given by Moses et al. (2009) and this algorithm uses three bands and has been employed in several studies (Dall’Olmo and Gitelson 2005; Moses et al. 2009; Gitelson et al. 2008; Gitelson 1992; Gurlin et al. 2011):

$${\text{chl}}\_a_{{({\text{MERIS}})}} \propto \left( {R_{{\left( {\lambda_{1} } \right)}}^{ - 1} - R_{{\left( {\lambda_{2} } \right)}}^{ - 1} } \right) \times R_{{\left( {\lambda_{3} } \right)}}^{ - 1}$$
(5)

The NIR-red based three-band model (Eq. 5) includes a third band \(R_{{\left( {\lambda_{3} } \right)}}^{ - 1}\) as a normalizing multiplication factor to remove effects due to the backscatter. This is assumed to improve the sensitivity of the model to chlorophyll a variability. The multiplying factor is employed assuming that pigment and CDOM absorption is negligible near \(R_{{\left( {\lambda_{3} } \right)}}^{ - 1}\).

Two semi-empirical models implemented in this study are given by Simis et al. (2005) (Eq. 6) and Gons (1999) (Eq. 7):

$${\text{chl}}\_a_{{({\text{MERIS}})}} \propto \left( {\left\{ {\left[ {\frac{{R_{{\lambda_{1} }} }}{{R_{{\lambda_{2} }} }}} \right] \times \left[ {R_{{{\text{w}}_{708} }}^{ - 1} + b_{\text{b}} } \right]} \right\} - b_{\text{b}} - R_{{{\text{w}}\_665}}^{ - 1} \times \delta^{ - 1} } \right)$$
(6)
$${\text{chl}}\_a_{{({\text{MERIS}})}} \propto \left( {\left\{ {\left[ {\frac{{R_{{\lambda_{1} }} + b_{\text{b}} }}{{R_{{\lambda_{2} }} + b_{\text{b}} }}} \right] \times \left[ {R_{{{\text{w}}_{708} }}^{ - 1} + b_{\text{b}} } \right]} \right\} - b_{\text{b}} - R_{{{\text{w}}_{665} }}^{ - 1} } \right)$$
(7)

The semi-empirical models are optimized to MERIS NIR bands that are effective to retrieve the absorption of chlorophyll a in the WBLE. The Gons (1999) algorithm accounts for the effect of backscattering (\(b_{\text{b}}\)) and absorption due to water itself. In the present work, the backscattering coefficient was retrieved using the 775 nm band of MERIS following Gons (1999) procedure. This is a spectral region where absorption by other CPAs is assumed to be very small compared to absorption due to water (Gons 1999; Kirk 1994). In the Simis et al. (2005) algorithm, in addition to backscattering, effects due to CDOM and the accessory pigment, phycocyanin, are accounted for, using a correction factor \(\delta\), which reflects the ratio of retrieved absorption versus measured absorption by chlorophyll a at NIR wavelength. For this work, the value of the correction factor \(\delta\) was taken directly from Simis et al. (2005). These parameters are appropriate for this application because they were devised for use in shallow lakes dominated by blue-green algae similar to the WBLE.

Results and discussions

The spectral plot of the MERIS reflectance represents apparent optical properties of the water in the WBLE and clearly showed evidence of multiple CPAs (Fig. 2). The reflectance values are highly variable in the visible and NIR range. The blue-green spectral range shows reflectance patterns typically observed in turbid waters (Schalles et al. 2002) with significant absorption due to chlorophyll a, CDOM and other associated constituents. A local reflectance maximum is observed near 560 nm, also referred to as the green peak. This is the mainly due to minimum absorption by chlorophyll a combined with scattering effects. The reflectance peak at the red/NIR edge, which is mainly due to fluorescence from phytoplankton, is not strongly apparent in the satellite data. The data indicates local minimum near 620 nm due to phycocyanin, which is the major accessory pigment in blue-green algae, a prominent algae documented in the Lake (Brittain et al. 2000). The local minimum observed near 665 nm is due to chlorophyll a absorption. The absorption troughs in the red region, near 620 and 665 nm and the reflectance peaks at 560 and 680 nm are more noticeable in spectra (dashed lines) from stations closer to Sandusky Bay than in the open waters of WBLE (Figs. 1, 2).

CPAs in the WBLE

The average in situ concentration of chlorophyll a was 5.68 and 8.07 μg/l in June and September of 2009, respectively. In September, the concentrations across the basin varied between 4.96 and 9.61 μg/l with the higher concentration values recorded at stations located in the Sandusky Bay. In summer of 2010, chlorophyll a concentrations in the basin ranged from 2.14 to 10.67 μg/l. The standard deviation of the chlorophyll a concentrations decreased from 2.71 to 0.79 μg/l between July and September, in 2010. The high standard deviation of chlorophyll a concentrations in July is attributed to the relatively higher seasonal algal density in the Sandusky Bay. During the early summer, river discharges are high and significant amounts of terrestrial matter, including nutrients, are dumped into Sandusky Bay. This increases the combined concentrations of allochthonous and autochthonous phytoplankton and consequently higher chlorophyll a concentrations are recorded at stations located in Sandusky Bay (Stations 17, 19 and 20). This condition results in large variability of chlorophyll a between the waters in Sandusky Bay and the central WBLE. By late summer (September), the river discharges decrease and the waters between the two sub basins undergo mixing and the variability of chlorophyll a in the WBLE lowers.

Descriptive statistics of CDOM in the WBLE shows that the absorbance ranged from 0.115 to 1.38 m−1, with an average value of 0.375 m−1. This result is consistent with the findings of Binding et al. (2008) from their filtrate samples collected in Lake Erie. During July of 2010, significantly higher absorbance was recorded at stations 3, 4, 17, 19 and 20 (Fig. 3). Stations 3 and 4 are located close to the discharge zones of the Toussaint and Portage Rivers and stations 17, 19, and 20 are located in the Sandusky Bay, which is a shallow basin heavily influenced by terrestrial matters. This suggests that the optical properties of the waters near discharge zones are heavily influenced by terrestrially derived CDOM. As shown in Fig. 3, the spatial variability of CDOM concentrations across the WBLE decreases during the summer period. This may be attributed to the decreasing stream flow from late spring to summer and at the same time dispersion of in-water constituents into central WBLE.

Fig. 3
figure 3

Spatial and temporal variability of CDOM in the WBLE during summer of 2010 based on data from in situ measurements. Missing values due to instrument errors

The spatial structure of CDOM is similar to the chlorophyll a dynamics, providing strong evidence of the influence of rivers on the algal distributions observed in the WBLE (Fig. 4). Riverine input makes nutrients readily available for primary producers, and this increases the productivity of the lake system. Constituents such as CDOM absorb significant amount radiation in the UV-B region, thus reducing the energy available for primary producers. This may influence the community structure of the lake system by favoring primary producers that can utilize energy in the longer wavelength region. In the WBLE, the cyanobacteria represent the dominant bio-community (Makarewicz 1993). These organisms possess accessory pigments (e.g., phycocyanin and carotene) allowing them to harvest radiant energy beyond the blue spectral region and making their productivity less sensitive to the presence of CDOM.

Fig. 4
figure 4

In-situ measurement of CPAs shows the variability of chlorophyll a and CDOM in the WBLE during summer of 2010

Away from the littoral zone, in the central WBLE, CDOM concentration is relatively low and the influence of rivers is reduced. A different bio-community structure makes up the biomass. Studies by Ali et al. (2012) have indicated that the blue-green algae are generally lower in central WBLE than in Sandusky Bay, suggesting that this assemblage is more important in the Sandusky Bay than in central WBLE. The study has further indicated that central WBLE is dominated by the diatom/green algae community, which is consistent with the concept of CDOM interference.

Chlorophyll a prediction

Empirical models

Band ratios such as R 443/R 565 and R 520/R 565 were commonly used to estimate chlorophyll a in ocean waters (Arnone et al. 2006; Morel and Prieur, 1977). However, O’Reilly et al. (1998) and other studies have clearly indicated that blue-green models did not perform well in Case 2 waters due to the presence of multiple constituents (CDOM, suspended matters) affecting the absorption characteristics of the Soret bands. Witter et al. (2009) developed a regional blue-green chlorophyll a algorithm (Eq. 8) based on co-located satellite observations and field-collected samples from the WBLE.

$${\text{chl}}\_a_{{({\text{MERIS}})}} \propto \frac{{R_{490}^{ - 1} }}{{R_{560}^{ - 1} }}$$
(8)

This algorithm was applied in this study using MERIS data from 2009 and 2010 and the result is illustrated in Fig. 5. The regional algorithm gave R 2 values of 0.46 and indicated that the blue–green algorithm overestimates the chlorophyll a values in the WBLE. Careful examination of the results indicates that for the low chlorophyll a concentrations (<6 μg/l), the predicted values plot closer to the 1:1 line, but with higher chlorophyll a concentrations the model overestimates chlorophyll a values showing exponential relationship with the field measurements. The bias towards high values corresponds to samples collected from stations that are closer to river discharge zone or in areas where Secchi depth was low (Stations 3 4, 17, 19, 20). The overestimation is due to optical interference from other in-water constituents such as CDOM that is not necessarily correlated with pigment concentration.

Fig. 5
figure 5

Regression plot of MERIS based blue-green chlorophyll a model versus in situ chlorophyll a concentrations

MERIS is equipped with chlorophyll a red absorption band (665 nm) and a narrow NIR band (708, 7.5 nm band width) which is sensitive to non-algal absorption. Regression analysis between a two-band red/NIR (Eq. 9) MERIS based model (Moses et al. 2009) and chlorophyll a concentration for the WBLE is shown in Fig. 6a. The MERIS-based two-band model is able to explain approximately 60 % of the chlorophyll a variability in the WBLE. The band ratioing removed effects of other non-algal in-water constituents, and therefore the magnitude of the model increases proportionally with chlorophyll a concentration. Application of band ratios also removes systematic errors from instrumental disturbances and minimizes effects due to path radiance especially when working in smaller area where atmospheric effects can be assumed to be uniform (Binding et al. 2008; Gilerson et al. 2010; Babin et al. 2000; Sokoletsky et al. 2011). Another important strength of the two-band ratio model is its capability to remove back-scattering effects. However, scattering effects can significantly deteriorate the performance of the model. In this case, some of the factors that may have limited the performance of the two-band red/NIR model are absorption effects of accessory pigments, e.g., phycocyanin, and absorption due to the dissolved organic matter which varies as a function of (λ −1).

Fig. 6
figure 6

Regression plot of MERIS based chlorophyll a models versus the chlorophyll a concentrations measured in situ a two-band red/NIR and b three-band red/NIR chlorophyll a model

$${\text{chl}}\_a_{{({\text{MERIS}})}} \propto \frac{{R_{665}^{ - 1} }}{{R_{708}^{ - 1} }}$$
(9)

Regression analysis of a three-band MERIS model based using R 665, R 708 and R 753 versus chlorophyll a gave R 2 value of 0.57, indicating that this model has some potential for explaining chlorophyll a variability in the turbid WBLE. Model performance is illustrated using regression analysis (Fig. 6b). The R 2 values obtained from the MERIS based three-band model (Eq. 10) is relatively low and this can be generally attributed to the absorption effects of water on the longer wavelengths (>720 nm).

$${\text{chl}}\_a_{{({\text{MERIS}})}} \propto \frac{{(R_{665}^{ - 1} - R_{708}^{ - 1} )}}{{R_{753}^{ - 1} }}$$
(10)

In comparison to the two-band model, the performance of the three-band model is lower. This is because firstly, with increasing wavelength ranging from 665 to 708 nm, the absorption of water exponentially increases due to the low energy incident photons being easily attenuated (Babin et al. 2000). This in turn reduces reflectance from scattering effects of suspended particles and therefore subtracting absorption effects between the red and the NIR bands may not effectively remove the effects of backscattering causing the relationship between the model derived chlorophyll a estimate and in situ chlorophyll a to be relatively low. A second potential contribution to errors in the three-band prediction model of chlorophyll a is due to the use of R 753 in the model. This band is a multiplication factor that is usually applied to account for the backscattering from inorganic matters. However, in aquatic bodies, the irradiance of water at this wavelength region produces very low reflectance values and this low values are easily influenced by effects of path radiance and are also more susceptible to stochastic errors (Babin et al. 2000; Sokoletsky et al. 2011).

Semi-analytical models

Semi-analytical models constitute another approach for estimating chlorophyll a from satellite sensors. These models are based on known spectral features that are discovered from the correlation between the apparent optical properties (AOPs) and the inherent optical properties (IOPs) of the water (Gordon and Wang, 1994). In Case 2 waters, most of the semi-analytical algorithms used to retrieve chlorophyll a utilize the red and near-infrared (NIR) spectral region (650–750 nm) (Gons 1999; Simis et al. 2005).

The relationship between the Simis et al. (2005) NIR/red based model (Eq. 11) and in situ chlorophyll a concentration is stronger, with a coefficient of determination of 0.65. Figure 7a shows a plot of the Simis et al. (2005) model versus observed chlorophyll a concentrations. The best fit line is relatively close to the 1:1 line, indicating the potential of the Simis et al. (2005) model in retrieving CPAs. The higher predictive power of the model is attributed to the incorporation of correction factors such as absorption effects due to pure water (a w), backscatter (b b) from suspended matter and interference due to phycocyanin \((\delta^{ - 1} )\). The relatively higher R 2 value clearly indicates the importance of accounting for optical interference of other in-water constituents such as phycocyanin and suspended matter on the chlorophyll a reflectance.

Fig. 7
figure 7

Regression plot of MERIS based chlorophyll a model versus the chlorophyll a concentrations measured in situ a Simis et al. (2005), red/NIR and b Gons (1999), red/NIR chlorophyll a model

$${\text{chl}}\_a_{{({\text{MERIS}})}} \propto \left( {\left\{ {\left[ {\frac{{R_{708} }}{{R_{665} }}} \right] \times \left[ {R_{{{\text{w}}_{708} }}^{ - 1} + b_{\text{b}} } \right]} \right\} - b_{\text{b}} - R_{{{\text{w}}\_665}}^{ - 1} \times \delta^{ - 1} } \right.$$
(11)

Although it is demonstrated that the Simis et al. (2005) model performs well in the retrieval of chlorophyll a in a wide range of eutrophic lakes (Simis et al. 2005; Novoa et al. 2012; Gurlin 2012), the predictive power of the model can further be improved by optimizing correction factors such as δ based on local measurements of pigment specific absorption. This will provide an opportunity to correct for any offset values due to absorption by tripton and CDOM, which are significant water quality parameters in the WBLE (Makarewicz 1993). The Simis et al. (2005) model only corrects for the absorption effects due to phycocyanin using correction factors derived from slope of a line constructed between the measured absorption values of phytoplankton and total absorption at 665 nm. Generally, the model ignores the offsets values due to CDOM and tripton assuming it to be negligible near the red/NIR region and does not subtract it from the computed chlorophyll a absorption.

The second semi- analytical applied to the WBLE is the red/NIR model (Eq. 12) developed by Gons (1999) This model is very similar to the Simis et al. (2005) model, except that the model does not account for absorption by accessory pigments.

$${\text{chl}}\_a_{{({\text{MERIS}})}} \propto \left( {\left\{ {\left[ {\frac{{R_{708} + b_{\text{b}} }}{{R_{665} + b_{\text{b}} }}} \right] \times \left[ {R_{{{\text{w}}_{708} }}^{ - 1} + b_{\text{b}} } \right]} \right\} - b_{\text{b}} - R_{{{\text{w}}_{665} }}^{ - 1} } \right)$$
(12)

Plots of Gons (1999) red/NIR based chlorophyll a estimate versus in situ chlorophyll a concentrations show coefficient of determination of 0.61 (Fig. 7b). Relative to the Simis et al. (2005) algorithm, the Gons (1999) model gave lower prediction accuracy with very flat best fit line. This is mainly due to simplified assumptions taken in the development of the model. These include the assumption of negligible pigment reflectance in the NIR region. In addition, absorption by CDOM and tripton in the red/NIR is not accounted for in the model.

The Gons (1999) model overestimates low to moderate chlorophyll a concentrations, and tends to underestimate the higher chlorophyll a concentrations (Fig. 7b). The simplified assumption of negligible absorption of pigment in the NIR region leads to overestimation by the reflectance in the red region (665 nm) and therefore lower ratio between 708 and 665 nm, causing underestimation of chlorophyll a concentrations. Similarly, the assumption of the zero-absorption effects of CDOM and tripton may cause an underestimation of reflectance in the red region (lower R 708/R 665 ratio), which may not necessarily be correlated with chlorophyll a concentration. This leads to erroneous prediction of higher chlorophyll a.

Calibration-validation of chlorophyll a algorithms

The two-band (Moses et al. 2009) and the semi-empirical algorithms (Gons 1999; Simis et al. 2005 ) have produced the best results among the different empirical and semi-empirical chlorophyll a algorithms, with R 2 of 0.61, 0.61 and 0.65, respectively. The performance the three models were further evaluated using the leave-one-out cross-validation method (Kohavi 1995). All three models indicate statistically significant (p < 0.05) R 2-PRED values which indicates the efficiency of the models to predict responses for new observations. The RMSE of the predicted values are comparable to the error values of the fitting models. The Simis et al. (2005) model with relatively lower RMSE-PRED and higher R 2-PRED values shows slightly greater predictive ability when applied to the WBLE environment as compared to the Gons ( 1999 ) and the two-band Moses et al. (2009) algorithm (Table 1).

Table 1 Summary of calibration-validation of chlorophyll a algorithms based on Simis et al. (2005), Gons (1999) and two-band Moses et al. (2009)

Uncertainties

The algorithms presented in this work illustrate the strength of the red/NIR and blue-green based bio-optical models in detecting the variability of chlorophyll a when applied to high resolution satellite data (MERIS) in turbid waters such as the WBLE. However, studies indicated that the performance of the satellite algorithm comes with certain uncertainties. The models are not stable and calibration work has always been difficult due to various assumptions and physical factors (Moses et al. 2009; Kutser et al. 2005; Gitelson et al. 2008; Simis et al. 2005). Several factors limit the potential of satellite algorithms to estimate the concentrations of in-water constituents.

Assumption of negligible effects due to other components

In Case 2 waters, the red/NIR spectral range is often used to develop algorithms that estimate pigment concentration in the water column. The red-edge of the spectrum is usually invoked in order to minimize the complicating influences of other in-water constituents such as CDOM and inorganic particles. However, assumption of negligible effects of CDOM and inorganic particles in the red/NIR models led to significant bias.

Temporal variability

Data collected using conventional in situ methods cannot sample multiple water points at the same time. However, a satellite has the capability to capture optical properties of large inland water bodies instantly. The time difference between the sampling period and the satellite data may be enough to have significant changes in the optical properties of the water, especially in shallow-turbid waters such as the WBLE. Instant wind action over the waters of WBLE causes mixing, significantly changing the optical and hydrochemical properties of the water column. The small difference in the time of data acquisition can introduces random errors that significantly reduce the correlation factor between the retrieved constituents from the two platforms hence degrading model performance.

Pixel heterogeneity

Although in this study, higher spatial resolution satellite data were used compared to previous studies, in optically complex aquatic systems such as in the WBLE, the spatial variability of in-water constituents may be large. The spatial resolutions (0.09 km2 for MERIS) may be too course to correlate with in situ point measurements. In such cases collocated in situ data may not represent conditions averaged over the satellite pixel. This can reduce the correlation between the in situ measurements and the satellite retrievals.

Path radiance correction

The influence of path radiance due to atmospheric scattering can be substantial and the success of satellite remote sensing in predicting water quality parameters depends on the accuracy of the applied atmospheric correction module. Several atmospheric correction procedures including the NIR bands procedure (Gordon and Wang 1994), the SWIR bands procedure (Wang and Shi 2005), Bright pixel atmospheric correction (Aiken and Moore 2000) and the Case 2 regional processer (Doerffer and Schiller 2007) have been tested to correct for path radiance. The Case 2 waters regional algorithm has been selected as the optimal models. However, without synchronized in situ measurements of water-leaving radiances, it is not possible to assess accuracy of the procedures from the reflectance curves. Implementing accurate corrections due to interference of atmospheric constituents remains a significant challenge within optical remote sensing.

Conclusion

The results have illustrated the potential of the remote sensing algorithms utilizing high resolution MERIS data in red/NIR wavelength region to estimate chlorophyll a concentrations in optically complex productive waters. Among the various models applied, results indicate that the Simis et al. (2005) algorithm provides the best estimates of chlorophyll a in the optically complex waters of the WBLE. The Simis algorithm accounts for back scattering effects and the absorption effects due to major accessory pigment, phycocyanin. These corrections significantly increased its performance relative to others, such as the Gons (1999) model which does not account for effects of phycocyanin. The addition of correction factors for other optically active constituents including suspended sediments and CDOM will improve model performance and the task of formulating the correction factors is a promising direction for future research.

The increased spatial and spectral resolution of the FR-MERIS sensor aboard Envisat along with its chlorophyll a-sensitive bands such as 665 nm significantly increased the effectiveness of satellite-based remote sensing to estimate chlorophyll a distributions and hence the effective monitoring of water quality in large water bodies such as the WBLE. To develop more robust bio-optical models, future efforts must focus on: (a) improving atmospheric correction methods through calibration-validation techniques, (b) designing better field methods to accommodate for the temporal difference and effects of ground resolution between in situ and satellite data, and (c) applying mathematical transformation techniques on full spectral information instead of narrow individual bands signatures. This will enhance retrievals of CPAs from satellite-based sensors and improve model stability.