Introduction

Lake Taihu, third largest freshwater lake in China, is situated in the Yangtze River Delta (Fig. 1). Due to side-effects of economic and social development, algal blooms have frequently occurred in Lake Taihu since the 1980s (Hu et al., 2006). In 2007, algal blooms occurred 1 month earlier than in previous years, causing drinking water sources to be severely polluted and millions of people to be exposed to health hazards, which in turn imperiled the social stability of the region (Yang et al., 2008).

Fig. 1
figure 1

Location map of Lake Taihu in the Yangtze River Delta of China. Annotated numbers show stations used to sample monthly during 2005 by the Taihu Ecosystem Research and Field Observation Station of the Chinese Academy of Sciences. The water body of Lake Taihu is divided into eight water regions: Meiliang Bay, Zhushan Bay, Gonghu Bay, Central Lake, Western Lakeshore, Southern Lakeshore, East Taihu and Eastern Lakeshore

Currently, quantitative understanding of the temporal and spatial variability of chlorophyll a concentration in Lake Taihu is far from adequate. Accordingly, a better understanding could help scientific researchers, environmental managers and policy designers to know when, where, and why algal blooms occur in Lake Taihu. In order to accomplish a better understanding, remote sensing techniques can be used to supplement on-ground observations and laboratory experiments.

During evaluation of water color by remote sensing, two types of waters, Case 1 and Case 2, have been distinguished (Yentsch, 1984). Inland waters such as Lake Taihu belong to Case 2 waters, which are influenced by phytoplankton and associated organic matter, as well as other substances that vary independently of phytoplankton, such as inorganic particles in suspension and chromophoric dissolved organic matter (CDOM). It is well known that the standard algorithms in use for chlorophyll a retrieval from Case 1 waters break down in Case 2 waters (IOCCG, 2000; Dall’Olmo et al., 2005). Accordingly, many studies have been conducted to evaluate methods to derive chlorophyll a in different turbid waters using bio-optical models. All of these models are based on the spectral properties near 700 nm rather than in the blue (c. 440 nm) and green (c. 550 to 570 nm) spectral regions (Dall’Olmo et al., 2005) because of strong absorption in the blue spectral region by dissolved organic matter, tripton, and phytoplankton pigments in turbid waters. Similarly in Lake Taihu, it has been shown that the natural logarithm of chlorophyll a concentration is closely related to the ratios R 705/R 675 (Shu et al., 2000), R 702/R 685 (Yang et al., 2005) and R 706/R 682 (Ma & Dai, 2005a), where R is the reflectance and the number is the wavelength. Jiao et al. (2006) found that R 719/R 670 had the greatest precision for retrieval of the chlorophyll a concentration of Lake Taihu while Wei et al. (2007) found that the chlorophyll a concentration in summer was well correlated with the Normalized Difference Vegetation Index (i.e., NDVI) of R 696 and R 661. A regionally specific model was used by Ma et al. (2006), dividing the lake into several areas, and use bio-optical models for each location. Le et al. (2008) found in seasonal chlorophyll a retrieval models that the band ratios of R 705/R 668 and R 703/R 667 gave better results in spring and autumn, respectively. A four-band algorithm [R −1(662) − R −1(693)]/[R −1(740) − R −1(705)] has also been developed to attempt to remove the absorption effect of suspended solids and associated partial suppression of water absorption in turbid waters (Le et al., 2009b).

It should be noted that all of these results were achieved using specialized optical measurements in the lab/field and not using satellite data directly. Considering the bandwidth of satellite remote sensing and atmospheric interference, it is difficult to extend special theoretical algorithms to satellite measurements directly (Gitelson et al., 2009). The parameterization of bio-optical properties is the main factors hindering the use of bio-optical algorithms using satellite measurements in Case 2 waters. In Lake Taihu, field measurements and radiative transfer simulation of bio-optical properties, including inherent optical properties (IOPs) and apparent optical properties (AOPs) have previously been conducted (Li et al., 2006; Le et al., 2009a; Zhang et al., 2009). These studies have required in situ measurements continued over a long period of time and covering the whole lake.

Recently, several approaches based on a spectral mixing modeling approach have been proposed to derive the chlorophyll a concentration of turbid lakes (Svab et al., 2005; Tyler et al., 2006; Oyama et al., 2009). The spectral mixing modeling approach, coupled with a multivariate (end member) regression analysis can be used to derive accurate estimates of chlorophyll a. The accuracy of these approaches depends on good end member selection, and screening of end members from one image cannot be applied to another image directly, which may affect the application of the approach for time series images.

The usefulness of these empirical algorithms for Case 2 waters could be improved by use of appropriate wave bands (IOCCG, 2000). Some researchers (Gitelson, 1992; Dall’Olmo et al., 2005; Duan et al., 2008) consider that pigment-based algorithms in high chlorophyll Case 2 waters can be improved by using longer wavebands (i.e., red and the near infrared spectral regions) than the typical blue and green bands used in Case 1 waters. Until bio-optical property and hyperspectral datasets become available for Lake Taihu, the empirical algorithms are still practical and effective for application of satellite data for assessment of the temporal–spatial chlorophyll a concentration (Ma & Dai, 2005b).

Our study was conducted to: (1) achieve a common empirical chlorophyll a concentration algorithm that could be used to retrieve the chlorophyll a distribution in Lake Taihu by correlating the in situ chlorophyll a data collected during 2005 with synchronous MODIS surface reflectance data; (2) monitor the synoptic distribution of chlorophyll a in Lake Taihu from 2006 to 2008 and reveal the spatial variability of chlorophyll a; and (3) examine the temporal and spatial variability of remotely sensed chlorophyll a in Lake Taihu from 2006 to 2008 using empirical orthogonal function (EOF) analysis.

Data and methods

Study site

Lake Taihu is located between 30°55′40″ to 31°32′58″N and 119°52′32″ to 120°36′10″E. The length of the lake (from north to south) is 68.5 km and the width (from east to west) is 56 km. The mean depth of the lake is 1.9 m, and the maximum depth is 2.6 m. The lake bottom has a flat terrain with an average topographic gradient of 0°0′19.66″ and an elevation of 1.1 m above sea level. The area of shallow-water with a mean depth <1.5 m is about 452 km2, most of which is found in East Lake Taihu and accounts for 19.3% of the total surface area. The deepest areas (>2.5 m) are in the north and west, occupying 197 km2 or 8.4% of the total lake area (Qin et al., 2007).

In situ data

A total of 28 stations (shown in Fig. 1) was sampled on the entire lake. Field measurements were collected at 13–28 stations once a month during 2005 by the Taihu Ecosystem Research and Field Observation Station of the Chinese Academy of Sciences. Samples for chlorophyll a were filtered on GF/C filters (Whatman) and chlorophyll a was extracted with ethanol (90%) at 80°C and analyzed spectrophotometrically at 750 and 665 nm, with correction for phaeopigments (Chen & Gao, 2000). There were 173 samples collected from the lake during 2005 at times when there was limited or no cloud cover.

Satellite data

MODIS-Terra surface reflectance level 2 products (MOD09) with a daily frequency and with a spatial resolution of 250 and 500 m bands covering the region of interest from 2005 to 2008 were downloaded via the web interface of the Land Processes Distributed Active Archive Center (LP DAAC) of NASA using the Warehouse Inventory Search Tool (WIST). The MODIS 250 and 500 m bands, especially at the visible and near infrared range, provide sufficient sensitivity for water applications, even though they were originally designed as “sharpening” bands for land studies and cloud detection (Hu et al., 2004). Five bands (band 1: 620–670 nm; band 2: 841–876 nm; band 3: 459–479 nm; band 4: 545–565 nm; band 5: 1230–1250 nm) were used to retrieve the chlorophyll a concentration in Lake Taihu in this study. A total of 54 images from 2006 to 2008 that were (quasi-) synchronous with in situ data were selected to analyze the temporal and spatial variability of the algal blooms.

Pre-processing of MODIS images

Each image was re-projected onto the WGS 84 UTM zone 51 North coordinate system using the MODIS Reprojection Tool. The 500 m resolution images were resized to 250 m by the nearest neighbor method during re-projection.

The MOD09 product provides estimation of the surface spectral reflectance for each band as it would have been measured at ground level if there were no atmospheric scattering or absorption (Vermote et al., 2002). The quality of the atmospheric correction algorithm is strongly driven by the aerosol optical thickness (τa). For MODIS products, there is a climatological dataset of τa, and if an accurate τa is available, aerosol climatological data are replaced by an atmospheric correction algorithm, while the climatological data are still used when there are spatial and temporal gaps in τa (Vermote & Vermeulen, 1999; Vermote et al., 2002).

The surface reflectance of Lake Taihu is more accurate when τa is derived from MODIS products. Gaps sometimes exist in τa over Lake Taihu, however, thus some of the images are patchy and the surface reflectance from MOD09 could not be used directly in our research. Some areas therefore had to be re-corrected with τa obtained from climatological data. The quality description file of the MOD09 product (QA) includes aerosol retrieval information, which is important for re-correction. To examine the re-correction in detail, we used an image obtained on day 141 of 2008.

Figure 2 shows the grey-scale image of band 2, from which two different regions in Lake Taihu can be found. The state QA descriptions indicate that the aerosol parameters of region 1 and region 2 are climatology (mean of τa) and high, which means that there exists a gap of τa in region 1. Therefore, in our study, we used region 2 as the reference and adjusted the surface reflectance of region 1 to match that of region 2. To accomplish this, a simple relative normalization technique, Mean–Standard deviation (MS) normalization, was employed. This method is used to reduce the numeric differences between two images caused by disparities in the acquisition conditions such as sensor performance, solar irradiance, and atmospheric effects (Yuan & Elvidge, 1996). Two reference zones were used to achieve the correction parameters. Considering that the water quality of the two reference zones should be uniform, these two reference zones should be conjoint, and the distance between every reference zone and boundary should be limited to no more than one pixel (250 m). If \( R_{{{\text{region}}1}} \,{\text{and}}\,R_{{{\text{region}}2}} \) are the band reflectance of the two regions, \( \bar{R}_{{{\text{zone}}1}} \,{\text{and}}\,\bar{R}_{{{\text{zone}}2}} \) are the reflectance means of the two reference zones, and \( S_{{{\text{zone}}1}} \,{\text{and}}\,S_{{{\text{zone}}2}} \) are the standard deviations, then the MS coefficients are derived as:

$$ m = {\frac{{S_{{{\text{zone}}2}} }}{{S_{{{\text{zone}}1}} }}} \quad b = \bar{R}_{{{\text{zone}}2}} - m\bar{R}_{{{\text{zone}}1}} \quad R_{{{\text{region}}2}} = mR_{{{\text{region}}1}} + b $$
(1)
Fig. 2
figure 2

Grey-scale image of MOD09 product (wavelength: 841–876 nm) showing Lake Taihu on day 141 of 2008. Aerosol optical thickness (τa) of annotated “region 1” is from the climatological dataset, where there exist spatial and temporal gaps of τa in this area; τa of annotated “region 2” is high and derived from MODIS aerosol products directly

For this example, every reference zone was composed of 121 pixels chosen around different boundaries. \( \bar{R}_{{{\text{zone}}1}} \,{\text{and}}\,\bar{R}_{{{\text{zone}}2}} \) have values of 0.0996 and 0.00216, and \( S_{{{\text{zone}}1}} \,{\text{and}}\,S_{{{\text{zone}}2}} \) are 0.00350 and 0.00299, respectively. The slope (m) and intercept (b) of Eq. 1 are 0.854 and −0.0829, respectively. Another 55 pairs of pixels were processed in this way in order to validate the method. The results showed that the mean reflectance of the 55 pixels in region 1 would change from 0.0984 to 0.00109 after MS normalization, which is close to the mean reflectance of the corresponding pixels (0.00495) in region 2. Accordingly, the surface reflectance of region 1 can then be re-corrected to the base of region 2 using these equations. Figure 3A–C show the band 2 surface reflectance histograms of the entire lake, region 1 and region 2, respectively. Figure 3D shows the histogram of surface reflectance for the entire lake after re-correction. These findings indicate that the MS method can remove the correction difference between these two regions caused by inaccurate aerosol parameters.

Fig. 3
figure 3

Illustration of Mean–Standard Deviation method utilized in re-correction transformation. Statistical histograms of pixel numbers (250 m spatial resolution) are shown for the whole lake (A), region 1 (B) and region 2 (C), according to band 2 surface reflectance before re-correction; D is the histogram of pixel numbers in the whole lake after re-correction

EOF analysis

Empirical orthogonal function analysis is a useful technique for compressing the spatial and temporal variability of time series data down to the most energetic statistical modes. This method of data reduction was first applied to geophysical data for the purpose of statistical weather prediction (Yoder et al., 2002; Navarro & Ruiz, 2006). Recently, there have been many examples of EOF analysis for ocean-color data and analysis of chlorophyll a (Yoder et al., 2002; Navarro & Ruiz, 2006).

In this study, the computationally efficient singular value decomposition (SVD) method was applied to conduct the EOF analysis, to obtain the eigenvalues and amplitudes of the spatial and temporal variance (Yoder et al., 2002; Navarro & Ruiz, 2006). The spatial and temporal variance of the dataset (x m (t)), can be partitioned into modes (i), revealing spatial functions (Φ i (m)), with time-varying amplitudes (a i (t)), that are also known as principle components, such that

$$ x_{m} \left( t \right) - \sum\limits_{i = 1}^{N} {\left[ {a_{i} \left( t \right)\Phi_{i} \left( t \right)} \right]} $$
(2)

This means that the time-variation in the chlorophyll a concentration of each pixel is the summation of Φ i (m) with a i (t), indicating how the spatial modes vary with time. Eigenvalues can be considered as the portion of the total variance explained by the EOF, where the sum of the variances in the data equals the sum of variance in the eigenvalues (Yoder et al., 2002).

$$ \sum\limits_{m = 1}^{M} {\left\{ {\frac{1}{N}\sum\limits_{n = 1}^{N} {\left[ {x_{m} \left( {t_{n} } \right)} \right]^{2} } } \right\} = \sum\limits_{j = 1}^{M} {\lambda_{j} } } $$
(3)

The EOF method requires a dataset without any gaps in space or time series. Consequently, MODIS images in which the main portion of Lake Taihu was <5% of cloud pixels were used in the present study, with pixels covered by clouds being replaced by the average of the surrounding pixels prior to performing the EOF analysis.

The errors of the EOF analysis, δλ, can be estimated by (North et al., 1982):

$$ \delta \lambda \approx \lambda \left( {\frac{2}{n}} \right)^{1/2} $$
(4)

where λ is the eigenvalue and n is the number of images used in the EOF analysis. In this study, 54 MODIS images were used, so δλ ≈ 0.1925 λ. The mode of spatial variance is considered significant only if the distance between λ and its nearest eigenvalue is bigger than the error associated with its eigenvalue δλ.

Results

Empirical retrieval algorithm of chlorophyll a

Table 1 shows the bivariate correlation analysis of the surface reflectance of five MODIS bands and the chlorophyll a concentration from 2005. A significant (r = 0.871, n = 145, P < 0.01) relationship, excluding the 28 points observed in Feb 2005, was observed between the chlorophyll a concentration and band 2 reflectance in the near infrared range. However, the reflectance of the visible light range was poorly correlated with the chlorophyll a concentration (r = −0.027 for band 1; r = 0.182 for band 3; r = 0.195 for band 4; r = 0.468 for band 5). And the two-band reflectance ratio was also tested. Table 2 shows the relatively poor correlations (r < 0.445) between the chlorophyll a concentration and two-band ratios.

Table 1 Correlation coefficient values of the chlorophyll a concentration and MODIS band reflectance for five bands at 173 observation points in 2005
Table 2 Correlation coefficient values of the chlorophyll a concentration and reflectance band ratios for 145 observation points in 2005

Accordingly, the reflectance of band 2 was used for the retrieval model. Curve fits revealed that the chlorophyll a concentration showed better correlations with an exponent of three (r 2 = 0.907, n = 145) or two (r 2 = 0.907, n = 145) of band 2 reflectance. A combination of band 2 data and the second exponent of band 2 reflectance was used for the empirical retrieval model of chlorophyll a in Lake Taihu.

Based on the above results, an in situ dataset including 145 pairs of samples was used for the empirical model:

$$ C_{\text{chl}} = 9.8 \times 10^{3} \,R_{\text{b2}}^{2} - 44.0R_{\text{b2}} + 9.168\,\,\;\;r^{2} = 0.907 $$
(5)

where R b2 is the surface reflectance of MODIS band 2.

EOF results

The chlorophyll a distributions of Lake Taihu derived from the MODIS images were initially spatially resized from 250 m to 1 km to avoid areas of high EOF scattering. The eigenvalue and percentage distribution of the normalized variance explained by the top 12 modes of spatial distribution are shown in Fig. 4. The error estimation of EOF indicates that only the first four modes are significant, and these explain 54.0% of the total chlorophyll a variance. Figure 5 shows the chlorophyll a concentration distribution of Lake Taihu from 2006 to 2008 based on the empirical algorithm (Eq. 5) given above. During early spring, the chlorophyll a concentration of the south region increases first (Fig. 5), especially around the mouth of the Tiaoxi River (refer to Fig. 1). High chlorophyll a concentrations corresponding to algal blooms then move from south to north along the southwestern lakeshore, with Zhushan Bay, Meiliang Bay, and Gonghu Bay (Fig. 1) regularly having blooms from May to October. In winter, the region with high chlorophyll a concentrations changes from north to south along the southwestern lakeshore until bloom levels are no longer present.

Fig. 4
figure 4

Eigenvalue and percent of variance explained by the EOF modes

Fig. 5
figure 5

Algal blooms definition in Lake Taihu from 2006 to 2008 derived from MODIS images. Grey area represents missing values covered by cloud

Figure 6 shows the mean distribution of the chlorophyll a concentration derived from EOF analysis. It shows that the west and the north regions of Lake Taihu are characterized by relatively high chlorophyll a concentrations, and that large parts of the eastern region are covered by macrophytes. Figures 7 and 8 show the spatial maps and time series amplitude of the first four modes, respectively, which are useful in evaluating the seasonal and inter-annual chlorophyll a concentration spatial variability across the lake. The peak concentrations of chlorophyll a typically appeared during summer and autumn, while the minimum concentrations were observed during winter and early spring.

Fig. 6
figure 6

Mean chlorophyll a concentration derived from the MODIS data

Fig. 7
figure 7

First four modes of spatial distribution obtained from EOF analysis

Fig. 8
figure 8

Time series amplitude of the first four modes of spatial distribution obtained from EOF analysis

Figure 9 shows a scatter plot of 173 observation points that indicates that the chlorophyll a concentration of the measurement points relates very well to the surface reflectance of band 2, excluding the 28 points observed in Feb 2005. The relationship between the chlorophyll a concentration and band 2 reflectance of the remaining 145 points is much different from that of the 28 points for Feb. For Lake Taihu, as chlorophyll a increases, the reflectance spectrum is increasing dominated by chlorophyll a over TSM and CDOM.

Fig. 9
figure 9

Relationship between band 2 surface reflectance and chlorophyll a concentration of 173 observation points in 2005. Thick line is a quadratic regression only for 145 observation points

Comparison of the coincident satellite and in situ observations (50 samples) from December of 2008 to September of 2009 provided estimates of the accuracy and precision of the empirical algorithm. The locations of the samples were THL 04, 08, 10, 13, 16, 18, 19, 22, and 24 (Fig. 1). Chlorophyll a concentrations of samples were measured using the same methods as those for the 2005 dataset. We calculated the relative error (RE) and root mean square error (RMSE) between the predictions and observations using the following equations:

$$ {\text{RE}} = \left[ {\left| {C_{\text{pred}} /C_{{{\text{in}}\,{\text{situ}}}} \, - 1} \right| \cdot \,100} \right]\% $$
(6)
$$ {\text{RMSE}} = \left[ {\sum {\left( {C_{\text{pred}} - \,C_{{{\text{in}}\,{\text{situ}}}} } \right)^{2} /n} } \right]^{0.5} $$
(7)

where C pred is the concentration predicted from the retrieval algorithm, C in situ represents the concentration measured in the lab, and n is the number of samples.

Figure 10 shows a comparison of the calculations and observations of the 2005 dataset, and the 2008–2009 dataset. The chlorophyll a concentration of the two datasets ranged from 1.39 to 119.9 μg/l and 1.8 to 290.3 μg/l, respectively, while the RMSE for these two datasets was 6.28 and 7.48 μg/l. Figure 11 shows chlorophyll a concentration versus RE for the two datasets. The RE values of the 2005 and 2008–2009 datasets were 36.9 and 61.3%, respectively. When the chlorophyll a concentration was <10 μg/l, the RE values of the modeled and validated data were 62.1 and 83.8%, respectively. When the chlorophyll a concentration was ≥10 μg/l, the RE values were only 25.2 and 30.4%, respectively.

Fig. 10
figure 10

Scatterplot of retrieved and measured chlorophyll a concentration for 2005 dataset (A) and 2008–2009 dataset (B). The thick line is 1:1 relationship

Fig. 11
figure 11

Relationship between relative error (RE) and the scale of the chlorophyll a concentration

Discussion

MODIS bands retrieving chlorophyll a

Due to the high absorption of TSM and CDOM in the blue–green range, the absorption of phytoplankton in the blue range is weak and ability to quantify chlorophyll a is reduced (Yu et al., 2003). Conversely, in the near infrared range, the absorption of TSM and CDOM becomes very weak, and the reflectance in this range is useful for color remote sensing of phytoplankton. Duan et al. (2008) also considered MODIS band 2 to be the best band to distinguish algal blooms and water in Lake Taihu. The reflectance of algal blooms in the near infrared (NIR) range is much different from that of water, and resembles the reflectance of vegetation, especially the steep slope effect of vegetation (Duan et al., 2008).

For the data of Feb 2005, the wind velocity at the observation time was >4.2 m/s, which likely caused a high concentration of total suspended materials (TSM) (≥200 mg/l). The spectral reflectance increases in the visible spectrum in turbid waters, especially in the red portion (ca. 600–700 nm). The common method used to relate remotely sensed reflectance in the red portion to TSM concentration is reasonably robust in coastal and inland waters because scattering from suspended materials frequently dominates the reflectance spectra when compared to pure water and phytoplankton absorption (Miller & McKee, 2004). While the chlorophyll a concentration of Lake Taihu is relatively low during winter (≤10 μg l−1), the reflectance spectrum is dominated by TSM and CDOM (Yu et al., 2003). Moreover, a “red-shift” effect will occur when the TSM concentration is very high (Han & Rundquist, 1994). Therefore, for the 28 points observed in Feb, not only reflectance of bands in the visible range but also band 2 in the near infrared range was correlated well with the TSM concentrations (Table 1). Accordingly, the empirical algorithm (Eq. 5) using band 2 of MODIS could not be applied when the TSM concentration was very high (≥200 mg l−1).

Influence of the bottom on water color

In shallow waters, the reflected radiance from the water can have an additional component: light reflected off the bottom. The maximum depth from which the sensor receives any significant signal varies as a function of wavelength and with the clarity of the water (IOCCG, 2000). The Secchi depths of Lake Taihu are normally low, i.e., about 30–40 cm in the north and 40–50 cm in the south, and underwater irradiance measurements have shown that the irradiance at 1 m is about 10% of that at the surface (Qin et al., 2007). Furthermore, water quality of the eastern lakeshore and East Lake Taihu is relatively clear and water depth is about 1 m. Accordingly, in these areas the bottom can affect the accuracy of the empirical algorithm, and the mean RE of samples in the eastern lakeshore and East Lake Taihu were as high as 61.9%. Nevertheless, the RMSE of the samples located in these regions was only 2.92 μg l−1. The performance of the empirical model in the bottom-influenced area is acceptable when optical properties are unavailable for the region.

EOF analysis of chlorophyll a distribution

Seasonal variability

The first mode of spatial distribution responsible for 22.5% of the normalized variance of spatial distribution showed that there was high variation in chlorophyll a in Meiliang Bay and along the north lakeshore of Gonghu Bay as well as in the west and northwest shore of the lake and low values (≤10 μg l−1) were observed over the center and eastern portion of the lake. The amplitude of the time series indicates that this pattern is modulated by the general seasonal cycle and inter-annual variability. Most of the spatial coefficients of the first mode were positive (Fig. 7), consequently, when they were multiplied by positive temporal amplitudes of the first mode, the entire field of Lake Taihu increased with respect to the chlorophyll a climatology, while negative amplitudes resulted in a decrease in the field (Navarro & Ruiz, 2006). High values occurred in fall (September/October) of 2006, in summer (June/July) of 2007, and in the fall and early winter of 2008, with the 2007 summer peak of chlorophyll a concentrations being the highest observed during the study period.

The second mode of spatial distribution explained 16.4% of the normalized variance (Fig. 7). The spatial distribution of this mode allowed us to distinguish three different zones throughout the lake. The first zone was located in the open lake and eastern portion of Lake Taihu, and was characterized by values of the spatial coefficient near 0, which indicates a high stability of the surface chlorophyll a. The second area included Meiliang Bay and Gonghu Bay and was characterized by negative coefficients, suggesting the existence of a phytoplankton bloom when the temporal amplitude was also negative. The hydrodynamics of Lake Taihu during summer result in a stable southward current along the western coast and an unstable current in Meiliang Bay (Luo & Qin, 2004; Hu et al., 2004). In addition, there is a weak counter-clockwise current in the northern Meiliang Bay, regardless of the season (Qin et al., 2000; Luo et al., 2004a, b), which may explain why this area is favorable for concentrating algal blooms (Qin et al., 2000). In contrast, zone 3, which occupies the western lakeshore, was characterized by positive spatial modes, indicating that there is a dynamic response of wind-induced currents in Lake Taihu.

The characteristics of the third and fourth mode (12.1 and 2.9% of the variance, respectively) are similar to those of the first mode and the second mode, respectively. These two modes capture the high variability during the late spring or early winter period, which is primarily along the southwestern lakeshore. This spatial pattern was relatively weak in 2008, but intensified in 2007. During summer, the prevailing wind from the southeast or southwest can generate a counter-clockwise water current in the center of the lake, and there is a southward current along the western and southern coast (Qin et al., 2000). Accordingly, algal blooms may be move from the north area to the southern lakeshore by this current before the arrival of winter. Conversely, the phytoplankton may be shifted from the south to the north by the opposite current.

Variability among regions

The chlorophyll a spatial distribution of the first two modes corresponds to algal blooms along the western lakeshore, in Meiliang Bay and in Gonghu Bay. These findings indicate that algal blooms always occur in these two bays during summer, whereas chlorophyll a may increase from early summer to late fall along the western lakeshore. The temporal characteristics of the southwestern lakeshore could be derived from the third and fourth modes, which show that algal blooms occur during the early summer and fall, and sometimes during early winter in this area. Furthermore, the remaining regions, including the center of the lake, the eastern lakeshore and East Taihu, exhibit relatively stable chlorophyll a concentrations.

Inter-annual variability

The inter-annual variance analysis shown in Fig. 8 indicates that algal blooms occurred during late summer and fall of 2006. In 2008, the region of high chlorophyll a was similar to that of 2006, but the timing was shifted to summer and early spring. During 2007 the duration of the algal bloom-covered area was from April to December.

Table 3 shows the statistical results for areas of high chlorophyll a concentration (≥100 μg l−1) in Lake Taihu from 2006 to 2008, and Fig. 12 shows the time-series comparison over 3 years of high chlorophyll a concentration areas amongst eight regions of Lake Taihu. The largest area of high chlorophyll a was on day 239 of 2006, and was about 1,100 km2, or half of the entire lake. Ma et al. (2008) also found that since 2003, algal blooms in Lake Taihu occurred in the center of the lake, as well as in bays and near lakeshores, and that they occasionally covered almost the entire non-vegetation area. Blooms occurred in early May of 2006 and 2008, whereas they occurred in late March in 2007. Surprisingly, the duration of water blooms was greatest in 2007, lasting from early spring until winter. Furthermore, algal blooms always disappeared during late June, and this phenomenon was strong in 2006 and 2008 and weak in 2007. Generally, the rainy season begins in late June and lasts for at least several days. During this period solar radiation is low, which affects the growth of algae (Kong & Gao, 2005).

Table 3 Area (km2) of high chlorophyll a concentration (≥100 μg l−1) in Lake Taihu derived from Eq. 5
Fig. 12
figure 12

Time-series comparison of algal bloom area in eight regions of Lake Taihu for 3 years (2006–2008). The location of these eight regions is shown in Fig. 1

Implications for lake management

Due to the complexity of the causes of algal blooms, development of an integrated lake management system requires not only the most robust and effective techniques, but also systems analysis. In Lake Taihu, many engineering projects have been undertaken by the Chinese government to control algal blooms (Wang et al., 2006).

For the restoration projects in the littoral zone, appropriate location selection could improve the effectiveness of the projects. According to the results mentioned above, the west and southwest coast, the north lakeshore of Gonghu Bay and the entire lakeshore of Meiliang Bay could be selected as the preferred sites for regeneration projects such as the establishment of wetlands and riparian zones.

Water transfer from the Yangtze River to Lake Taihu is a large-scale project executed from 2002 to increase water circulation in the lake and rivers. For this project, Yangtze River water is pumped and channeled through the Wangyu River and discharged into Gonghu Bay. But the results of EOF also showed that during the diversion, the concentrated cyanobacterial was flushed into the northern lakeshore of Gonghu Bay, where it entered the Wuxi drinking water plant (Qin et al., 2010). Therefore, the effect of water diversion in such a large lake would likely not be as expected. This project should be viewed as a temporary measure to improve water quality of some regions, and it should be noted that this project may lead to new ecological and environmental problems in the downstream and coastal areas.

The water source of the Wuxi drinking water plant is located near the north lakeshore of Gonghu Bay, which has been demonstrated to be a region of high chlorophyll a and is therefore a high-risk region for algal blooms. Indeed, an algal bloom event resulted in a major crisis that directly affected water supply for approximately 2 million people (Qin et al., 2010). Considering the water quality around Wuxi is unsatisfactory, it is necessary to strengthen the management of the water source protection zone, improve the capability of the emergency response and establish an early warning system to ensure water quality.

The wind direction and velocity are key factors that influence the temporal and spatial distribution of algal blooms (Kong & Gao, 2005). The first four modes of our EOF analysis demonstrate that chlorophyll a is concentrated to the leeward region of the prevailing wind direction. Furthermore, extremely warm weather combined with local wind conditions favor bloom expansion (Qin et al., 2010). Studies have suggested that the Lake Taihu area will experience continued spring (March–May) warming, with increases ranging from 0.21 to 0.89°C occurring over the next decade (2010–2019) when compared to the previous four decades (1960–2000) (Qin et al., 2010). These findings suggest that algal blooms will likely recur earlier than historically observed in this lake (Chen et al., 2003; Zhu, 2008). According to the seasonal variability results from EOF analysis, algal blooms maybe take place along the southwestern lakeshore first. Therefore, during the late spring and early winter the southwestern coast should be the concern of the lake management administration of Lake Taihu, while the western part of the lake will become a high-risk area of algal blooms from summer to fall.

Conclusions

A common empirical retrieval model of the chlorophyll a concentration in Lake Taihu was developed using the results of in situ measurement and synchronous MODIS images from 2005. The synoptic distribution of chlorophyll a in Lake Taihu from 2006 to 2008 was then obtained using this algorithm and the time series MODIS data. EOF analysis was then conducted to obtain the temporal and spatial distributions of chlorophyll a and algal blooms in Lake Taihu.

The time-series chlorophyll a distribution of Lake Taihu from 2006 to 2008 and the mean distribution of the chlorophyll a concentration derived from EOF analysis show that the west and north regions are characterized by relatively high chlorophyll a concentrations, and the eastern region has relatively low chlorophyll a. In addition, the concentration peaks of chlorophyll a typically appeared in summer and autumn while the minima occurred during winter and early spring. The first four modes of EOF explain 54.0% of the total chlorophyll a variance and can be used to represent the spatial extent and temporal variability of the processes in the study area. The amplitude of each mode differed among the 3 years examined.

Different regions of variability indicate that during the summer algal blooms always occur in Meiliang Bay and Gonghu Bay, while they occur around the southwestern lakeshore during summer and fall, and even early winter. The remaining regions, including the center of the lake, east lakeshore and eastern portion of Lake Taihu, exhibit relatively low variance of chlorophyll a concentration at the water surface.

The largest area of high chlorophyll a concentration (≥100 μg l−1) was on day 239 of 2006. The year 2007 was different from 2006 and 2008, because the algal bloom-covered area was larger and the duration extended from April to December.

Although the accuracy and stability of our empirical model is satisfactory, theoretical retrieval algorithms for chlorophyll a are still preferable for Case 2 waters. The temporal and spatial distribution of algal blooms in Lake Taihu should be analyzed together with solar radiation, wind fields, hydrodynamics, and water quality distributions to enable an improved understanding of the environmental factors contributing to the spatial and temporal variations in chlorophyll a in the lake.