Introduction

The degradation of water quality is a key global environmental issue (Palmer et al. 2004; Zhao et al. 2011). With rapid urbanization, urban sprawl leads to water quality degradation. This is due to the increasing presence of impervious surfaces and human activities in the process of urbanization within watersheds (Brett et al. 2005; Schoonover et al. 2005; Zhao et al. 2006; Tu et al. 2007; Du et al. 2010; Pinto et al. 2013; Sun et al. 2013). Moreover, many natural and anthropogenic factors, including precipitation intensity, soil types, topographical and geological characters, urbanization, agricultural activities, and industrial and sewage discharge can affect the water quality (Baker 2003). Water quality has significant correlations with changes in landscape patterns and human activity (Azous and Horner 2001; Allan 2004). Water bodies are an important part of the landscape in urban areas, and water quality is influenced by the composition of land use types and spatial configuration of land use patterns within a watershed or catchments (Sliva and Willams 2001; Donohue et al. 2006; Moreno et al. 2006; Hwang et al. 2007; Jung et al. 2008; Guo et al. 2010; Li et al. 2012). Land use has been one of the key contributing factors for urban water pollution (Changnon and Demissie 1996; Mander et al. 1998; Foley et al. 2005; Grimm et al. 2008). For example, urban and agricultural land uses have a strong positive influence on nitrogen and sediments loads; on the other hand, grassland negatively contributes to water nutrient loads (Basnyat et al. 1999; Wang 2001; Tong and Chen 2002; Moreno et al. 2006).

Previous studies have mainly focused on the impact of land use types on water quality at varying spatial scales using the percentage of land use, such as impervious land, forestland, agriculture land, and wetland and so on (Sliva and Willams 2001; Hwang et al. 2007; Jung et al. 2008; Guo et al. 2010; Li et al. 2012). A number of studies have shown conflicting results on the impact of land use on water quality at different scales (Johnson et al. 1997; Sliva and Willams 2001; Sawyer et al. 2004). We should point out that most previous studies focusing on the relationship between land use and water quality were only carried out in watersheds and subwatersheds. However, due to the complexity and variability in spatial patterns of land uses, the trend of impacts of land uses on water quality has still not been explored explicitly.

From an ecological landscape perspective, the spatial configuration of landscapes may play a critical role in determining natural habitats, hydrological processes, energy flows, and nutrient cycles (Alberti et al. 2007). Recent studies have focused on examining the relationships between the spatial configuration of landscapes and water quality at various scales (Amiri and Nakane 2009; Lee et al. 2009; Shiels 2010; Carey et al. 2011; Jager et al. 2011; Huang et al. 2013; Zhang et al. 2013).

However, the effects of land use types and its spatial structures on water quality are always simultaneous in a watershed. Land use types and its spatial structures should be investigated together in order to examine which factors exert the most important influence. Guo et al. (2010) proposed a dynamic hypothesis that the impact of land use on water quality could vary with the expansion of the buffer size and that there should be an effective buffer zone where the strongest linkage occurs between land use and water quality. They also pointed out when beyond or under the effective buffer zone, the impact of land use on water quality should become weak or insignificant, given that the land use characteristics themselves varied with distance.

The primary statistical methods to analyze relationships between land use and water quality are the conventional statistical methods, such as Pearson’s correlation, simple correlation, single linear model, multiple linear regression model, nonparametric statistical analysis, nonlinear multiple regression, and ordinary least squares regression (OLS) (Seeboonruang 2012). The conventional statistical methods are “global statistics” that analyze the average situation for the whole study area. It is assumed that the relationships are constant over space. Land use parameters and landscape metrics are used as independent variables and water quality parameters obtained from observed statistical data act as dependent variables in the regression. These conventional statistical methods utilize global statistics that analyze the average situation for the whole study area, and usually, they are unable to explore the local variations in the relationships between independent variables and water quality indicators (Tu 2011). In recent years, a simple but powerful statistical method called geographically weighted regression (GWR) has been developed based on the OLS model to explore continuously varying relationships over space (Brunsdon et al. 1998; Fotheringham et al. 2002).The “local statistics” methods, such as GWR, attempt to capture spatial variations by allowing regression model parameters to change over space and can calculate a set of local regression results including local parameter estimates, the values of the t test on local parameter estimates, local R 2 values, and local residuals for each regression point. Given its strong advantage, GWR model has been applied to explore the spatially varying relationship between land use and water quality in several previous studies (Tu 2011; Pratt and Chang 2012; Yu et al. 2013).

In this study, we investigated the spatially varying relationships between topographical variables, land use parameters, landscape metrics, and water quality parameters using OLS and GWR models in Yundang Lake, Maluan, and Xinglin Bay of Xiaman City, China. Our objectives are (1) to determine which of the above-mentioned explanatory variables have the largest impact on water quality; and (2) to analyze scale effects of the relationships between explanatory variables and water quality parameters across an urbanization gradient within watersheds.

Study Area

Xiamen City is situated in the southeast coast of Fujian province, China. Mean annual precipitation is 1,530 mm, with significant spatial and temporal differences. Rainfall distribution in Xiamen City is uneven. The flood season lasts from April to October, accounting for 80 % of total annual rainfall. Precipitation gradually reduces from the northwest (above 2,000 mm) to the southeast (about 1,100 mm) (Lin 2009). Figure 1 shows a location map of three watersheds in Xiamen, including Yundang Lake, Maluan Bay, and Xinglin Bay. There is great spatial difference in the level of urbanization among the three study watersheds. Yundang Lake is located in the center of Xiamen Island, and the water area is up to 1.6 km2. There is no river flowing onto the lake; it only exchanges regularly with sea water through water gates. The areas around the water body are highly urbanized and mainly covered by commercial, residential, and recreational lands. In the early stages, industrial and residential sewage is drained off into water body without treatment, causing the water pollution of Yundang Lake to become increasingly serious. Since 1980s, government had taken some engineering measures to improve water quality of Yundang Lake. In recent years, the water quality has improved significantly as a whole, but parts of waters are still polluted serious. So adjustment of land use patterns at watershed scale should be the key measure to successfully improve urban lake water quality in the next step. Maluan Bay is located at the junction of Haicang district and Jimei district and has a water area of 2.1 km2. A small stream drains into Maluan bay from west to east. Water body exchanges the water through gate during the flux and reflux. The dominant land use types around the water body are industrial and residential lands. It belongs to a less urbanized area. Xinglin Bay is located at the Jimei district and has a water area of 6 km2. The lake was completely separated with outside sea by dam and only releases flood water to sea when extreme perception event happened. The areas around the Xinglin Bay have been constructing in Xiamen, which are expanding from the inside to the outside of the island, and it is at the period of transition from an agricultural landscape to an urban landscape.

Fig. 1
figure 1

Sketch map of Yundang Lake, Maluan Bay and Xinglin Bay

Materials and Methods

Water Quality Parameters

Water quality data were collected and analyzed on a quarterly basis (four times per year) from each of the 30 selected sampling sites. Water sampling was carried out quarterly from March 2011 to January 2012 (spring 2011.3, summer 2011.7, autumn 2011.11, and winter 2012.1). Generally, sampling time is selected in sunny day and tries to avoid any antecedent rain for the representation of water quality data. Thirteen sites for Yundang Lake, ten sites for Xinglin Bay, and seven sites for Maluan Bay were selected based on watershed characteristics. Water temperature (Temp), pH, dissolved oxygen (DO), chlorophyll a (Chl), salinity (Sal), and conductivity (Cond) were measured with Multi-Parameter Water Quality Monitoring Equipment (YSI int.) at three depths of approximately 0.1, 0.5, and 1.0 m. Surface water was collected for measurement of total organic carbon (TOC), total nitrogen (TN), total phosphorus (TP), and seven metal elements (including As, Hg, Pb, Cu, Fe, Zn, and Se). Water quality samples were measured by the standard analytical method for surface water quality. TOC-VCPH Total Organic Carbon Analyzer was used to measure TOC and TN. The ammonium molybdate spectrophotometric method was used to measure TP. The atomic fluorescence spectrometric method was applied to determine As and Hg. The atomic absorption spectrophotometric method was used to measure Pb, Cu, Fe, Zn, and Se. We used annual mean values of water quality parameters as dependent variables.

Land Use Data

Land use information within the watersheds was interpreted from SPOT5 imagery (2011-4-24) using ArcGIS 9.3 Desktop GIS software. This remote imagery covers most parts of Xiamen city, with a spatial resolution of 2.5 m. According to watershed characteristics, land use types are classified into seven broad categories, namely, (1) green land, including forest and shrub land; (2) agricultural land, including paddy field and dry land; (3) urban land, including industrial, residential, and recreational land; (4) road land; (5) water areas; (6) bare land, including bare ground and bare rocks; (7) fish pond areas, which are controlled ponds for breeding fish. The orthorectification procedure was utilized to the remote image in order to correct geometric distortions. Based on Google Earth map and visual interpretation, land use maps of three watersheds were finally derived, and the total classification accuracy was more than 90 %. Land cover polygons representing different land covers were then converted to a 10.0 m raster grid, which allowed for the measurement of land use composition and landscape metrics using FRAGSTATS3.3 software.

Explanatory Variables

Topographic, land use composition, and landscape variables are shown in Table 1. The watersheds were automatically delineated with the use of a 10.0 m digital elevation model (DEM) in ArcSWAT extension of ArcGIS 9.1 software (Winchell et al. 2007). The SWAT model divided the watersheds of Yundang Lake, Maluan Bay, and Xinglin Bay into 8, 3, and 7 subwatersheds, respectively, based on the outlet location of each watershed and upstream rivers (Fig. 2). The riparian areas were created by buffering the lake/bay 50 and 200 m. The land use composition and topographic variables in each of the 18 delineated subwatersheds were calculated using the Spatial Analyst tools in ArcGIS.

Table 1 Explanatory variables used in analysis (McGarigal and Marks 1995)
Fig. 2
figure 2

Location map of monitoring sites and schematic of watershed delineation for Yundang Lake (YD01-YD08), Maluan Bay (ML01-ML03), and Xinglin Bay (XL01-XL07)

In this study, five landscape metrics at the landscape level were calculated using the raster version of FRAGSTATS 3.3 (Table 2) based on 10.0 m-resolution land use data.

Table 2 Landscape metrics used in analysis

Analysis Methodology

Multivariable OLS and GWR models were developed to explore the relationship between independent variables and water quality parameters from global and local perspectives. The regression coefficients from OLS models can help us to find the dominant factors of varying water quality at 50, 200 m and watershed scales. The local regression coefficients at different sampling points from GWR models can reflect the strength of relationships between explanatory variables and water quality parameters across an urbanization gradient. The dependent variable is a factor calculated from the fifteen water quality parameters, and the explanatory variables are the seven land use composition parameters, three topographical factors, and five landscape metrics. The Kolmogorov–Smirnov (K–S) goodness of fit test was used to test for normality of the distribution of the water quality and independent variables. For water quality parameters, the TP and Fe were transformed logarithmically [log(x + 1)], Cond was removed due to unresolved skewedness. For independent variables, the percentages of fish ponds and agricultural land were removed, because they were only present in one watershed. At a watershed scale, the mean subwatersheds elevation, PD, and LPI were transformed logarithmically, and LSI was transformed, respectively. At a 200 m buffer scale, the slope standard deviation was transformed. OLS and GWR were performed using ArcGIS 9.3. An advantage of running an OLS regression in ArcMap is the output from the process which includes the residual for each site, allowing the researcher to more easily test the residuals for spatial autocorrelation (Pratt and Chang 2012).

The OLS and GWR model can be, respectively, expressed as Eqs. (1) and (2):

$$ y = \beta_{0} + \sum\limits_{i = 1}^{p} {\beta_{i} x_{i} + \varepsilon } , $$
(1)

where x i is the independent; y is the dependent variable; P denotes the number of independent variables; β 0 and β i represent intercept and coefficient, respectively; ε represents error.

The GWR model allows local rather than global parameters to be estimated for the location of samples, and the above model can be rewritten as:

$$ y_{j} = \beta_{0} (u_{j} ,v_{j} ) + \sum\limits_{i = 1}^{p} {\beta_{i} } (u_{j} ,v_{j} )x_{ij} + \varepsilon_{j} , $$
(2)

where u j and v j are the coordinates for each location j, β 0 (u j , v j ) is the intercept for location j, β i (u j , v j ) is the local parameter estimate for independent variable x i at location j. The model is calibrated using an exponential distance decay function.

$$ w_{ij} = \exp {{( - d_{ij}^{2} } \mathord{\left/ {\vphantom {{( - d_{ij}^{2} } {b^{2} }}} \right. \kern-0pt} {b^{2} }}). $$
(3)

The weight of site j as it effects site i, W ij , is calculated using the distance (d) between sites i and j with b acting as the kernel bandwidth. The weight decreases rapidly when the kernel is smaller than the distance. Both fixed and adaptive kernel bandwidth can be chosen for GWR. We used adaptive kernel bandwidth here, because the density of sample sites varied across watersheds. The optimal bandwidth was determined by minimizing the corrected Akaike Information Criterion (AICc). Given that multicollinearity existed among the explanatory variables, one explanatory variable acted as the independent variable, and water quality acted as the dependent variable in the regression models (Tu 2011). GWR models produce a set of local regression results, including local residual and R 2 results, as well as a global R 2 by ArcGIS software (Tu and Xia 2008). The Global Moran’s I for the residuals from both OLS and GWR models were calculated to test the model’s accuracy. If significant spatial autocorrelation exists in an OLS model, then the model violates the assumption of randomly distributed and independent residuals in regression models. The efficiency of the model is therefore suspect. In addition, the residuals may contain some geographic information that the model does not include and is therefore lost (Clark 2007). GeoDa 0.9.5-I (Beta) analysis software was used to calculate the Global Moran’s I statistics.

In order to evaluate the variability of water quality data in each watershed, the coefficient of variations (CV) was calculated. The CV-value is defined as

$$ {\text{CV}} = {\text{SD}}/{\text{MV,}} $$
(4)

where SD is the standard deviation, and MV is the mean value.

Results

Spatial Variations in Land Use Composition and Water Quality Parameters

Obvious heterogeneity of land uses and spatial gradient of urbanization exist in the three watersheds (Table 3). The dominant land use types in Yundang Lake watershed are residential and commercial land (total ≈60 %), and the proportion of urban built-up areas can be 96 %, which indicate that the Yundang Lake watershed is represented as a highly urbanized area. In the Maluan Bay watershed, there are larger areas of fish ponds (≈24 %); commercial and industrial lands have been expanding rapidly in recent years. All this indicates that the Maluan watershed is a peri-urbanization area. The proportion of urban built-up area in the Xinglin Bay watershed is about 66 %, and the bare land in the process of development and construction is larger than other watersheds (≈20 %). Therefore, the Xinglin Bay watershed is a rapidly urbanized area. Maluan Bay and Xinglin Bay watersheds are represented as less urbanized watersheds.

Table 3 Land use composition in the watershed of Yundang Lake, Maluan Bay, and Xinglin Bay in 2011

As shown in Table 4, there are significantly differences in topographical factors and landscape metrics between three watersheds. Higher mean slope, mean elevation, and slope deviation are observed in Yundang watershed, and lower values of such topographical factors are found in Xinglin watershed. For landscape metrics at landscape level, the values of LSP and SHDI are higher in Xinglin Bay and lower in Yundang Lake. The result shows obviously differences of landscape pattern in three watersheds and confirms our division of urbanization gradients for these study areas.

Table 4 Topographical factors and Landscape metrics in the watershed of Yundang Lake, Maluan Bay, and Xinglin Bay

The CVs (%) across different sampling sites, which range from 1.15 to 83.95 %, indicate that there are large spatial variations for water quality data (Table 5). In general, the spatial variations of Temp, Chl, TOC, Hg, As, Pb, and Fe in highly urbanized watershed were greater than in less urbanized watersheds. However, spatial variations of other water quality parameters like Cond, Sal, pH, DO, TN, Cu, Zn, and Se in less urbanized watersheds were greater than those in highly urbanized watersheds. In the Yundang Lake watershed, the CVs of Chl, TP, Pb, Cu, and Fe (CV > 50 %) were greater than the other water quality parameters. In the Maluan Bay watershed, the CVs of Chl and Pb were greater than the other water quality parameters. In the Xinglin Bay watershed, the spatial variations of Pb and the CVs of Cu and Zn were greater than the other water quality parameters.

Table 5 Statistical summary of the water quality parameters in the Yundang Lake, Maluan Bay, and Xinglin Bay watersheds

Spatial Analysis of OLS Regression Models

Final models were selected when every variable in the model was significant (P < 0.05), and the adjusted coefficient of determination (Adj. R 2) is larger than 0.6. As shown in Table 6, the correlation of explanatory variables with water quality varied with the spatial scales. The regression coefficients for explanatory variables indicate that landscape metrics (such as SHDI and SHEI) and topographical factors like elevation and slope are the dominated variables for predicting water quality parameters at three spatial scales, while the correlations between percentage of land use types and water quality parameters are extremely weak compared with other explanatory variables (|β i | ≈ 0). Thus, the impact of land use composition on water quality can be neglected and is not discussed in the followed sections.

Table 6 Multivariable regression models for explanatory variables and water quality parameters at the watershed, 50 and 200 m buffer scales

Variation in water quality at the 50 m buffer scale is dominated with topographical variables. Mean subwatershed slope is negatively correlated to the concentrations of Sal, TOC, Hg, As, Cu, Fe, and Se. However, higher standard deviation of the slope corresponds to increasing concentrations of Sal, Hg, As, Cu, and Se. Compared with topographical variables, the impact of landscape metrics on water quality is relatively small because of less number of occurrences and low frequency of the regression coefficients.

At 200 m buffer scale, landscape metrics becomes the main factors for most of the water quality parameters, while topographical variables influence on water quality weakens.

At the watershed scale, SHDI and SHEI are the main indicators, and their impacts on water quality strengthen further. SHEI has a negative correlation with Sal, Hg, As, and Se while being positively related to Temp, DO, Chl, and Fe. SHDI has a completely opposite impact on water quality variation compared with SHEI. However, it should be pointed out that there are significant correlations between TN and TP, and landscape metrics and topographical variables at the watershed scale rather than at 50 and 200 m buffer scales. Specifically, mean subwatershed elevation has a positive correlation with the concentration of TN and TP; PD, LPI, SHDI, and SHEI have negative correlations with the concentration of TN and TP.

The values of Adj. R 2 in OLS models could indicate the ability of explanatory variables to explain water quality parameters. As shown in Table 6, the values of Adj.R 2 at the watershed scale are generally larger than those at the 50 and 200 m buffer scales, which reflect the scale effects of dominant factors and the existence of the effective buffer zone.

Table 7 shows the results of Moran’s I statistics on the residuals from the OLS model. Significant negative spatial autocorrelations are observed, Temp at the 50 m buffer scale (Moran’s I = −0.2, P < 0.05), Sal at the watershed (Moran’s I = −0.21, P < 0.05), DO at the 200 m buffer and watershed scales (Moran’s I = −0.2, P < 0.05), Hg at the 50 m buffer scale (Moran’s I = −0.2, P < 0.05), and Fe at the watershed scale (Moran’s I = 0.19, P < 0.05). The results indicate that these OLS models might miss the variables that explain the variation and are unsuitable for identifying the relationships between explanatory variables and water quality parameters.

Table 7 Global Moran’s I of the residuals from OLS models

Spatial Analysis of Local GWR Models

In the previous sections, the global relationships at three spatial scales have been obtained by means of OLS models, but these results may miss some local relationships and cannot reflect the spatially varying relationships. Therefore, the GWR method was applied to explore spatially varying relationships between explanatory variables and water quality parameters along the gradient of the urbanization level. As shown in Tables 6 and 7, the global coefficient of determination (R 2) and local parameter estimates from GWR models are summarized. The local parameter estimates can be used to reflect the relationships between the independent variable and dependent variable at different sampling sites. The global R 2 values from GWR model reflect the ability of explanatory variables to explain the spatial variation in the water quality parameters at all sampling sites. There are 180 (12 times 15) GWR models in this study. We are not able to present all the results from GWR models due to space limitations. The two key water quality parameters of DO and Chl were selected to examine the spatially varying relationships. DO is the most important indicator of the health of a water body and its capacity to support a balanced aquatic ecosystem. Chlorophyll is a commonly used parameter for monitoring phytoplankton biomass and nutrient status. It can help track and predict deadly algae blooms. Nutrient enrichment leads to increased production of organic matter and resulting in low levels of dissolved oxygen which is killing marine life (Pelley 1998).

As shown in Table 8, the values of global R 2 in GWR models range from 0.003 to 0.998, and the median value is larger than 0.6. The local parameter estimates from GWR models are shown in Table 9. Topographical variables and landscape metrics have a wider range of values. Wider ranges of values imply that explanatory variables might have a larger impact on water quality variations.

Table 8 Statistical summary of global coefficient of determination (R 2) for GWR
Table 9 Statistical summary of local parameter estimates for GWR models

A clear spatially varying relationship exists along the gradient of the urbanization level as shown in Figs. 3 and 4. Figure 3 shows the spatial pattern of local parameter estimates from GWR models for DO and mean subwatershed elevation, mean subwatershed slope and SHDI at three spatial scales. At the 200 m buffer scale, stronger negative correlations between DO and mean subwatershed elevation are found in the less urbanized areas. At the 50 and 200 m buffer scales, positive correlations are observed between DO and mean subwatershed slope in highly urbanizes areas, while negative correlations are shown in less urbanized areas. Similarly, inside the 200 m buffer zones, DO has positive correlations in the highly urbanized areas, while there are negative correlations with SHDI in the less urbanized areas.

Fig. 3
figure 3

Spatial patterns of the local parameter estimates obtained from GWR models of DO and explanatory variables: MCE mean subwatershed elevation, MCS mean subwatershed slope, and SHDI at the watershed, 50 and 200 m buffer scales

Fig. 4
figure 4

Spatial patterns of the local parameter estimates obtained from GWR models of Chl and explanatory variables: MCE mean subwatersheds elevation, MCS mean subwatersheds slope, and SHDI at the watershed, 50 and 200 m buffer scales

Figure 4 shows the spatial pattern of local parameter estimates from GWR models for Chl and mean subwatershed elevation, mean subwatershed slope, and SHDI at three spatial scales. At the 50 m buffer scale, the concentration of Chl is positively related to mean subwatershed elevation in highly urbanized areas, while Chl is negatively related to mean subwatershed elevation in less urbanized areas. At the 50 and 200 m buffer scales, stronger correlations between Chl and the mean subwatershed slope are found in less urbanized areas. Stronger negative correlations are found at 50 m buffer scale for both less and highly urbanized areas. At the watershed scale, Chl is positively related to SHDI in the highly urbanized areas, while Chl is negatively related to SHDI within less urbanized areas. The results from GWR models indicate that the spatially varying relationships between explanatory variables and water quality parameters depend on the spatial analysis scale and the urbanization level of the watershed.

As shown in Table 10, significant positive spatial autocorrelations are found in the GWR model for DO and SHDI (Moran’s I = 0.2045, P < 0.05) and in the GWR model for Chl and SHDI (Moran’s I = 0.4296, P < 0.05). Significant autocorrelations made the two GWR models unsuitable for examining the relationships between SHDI and water quality parameters.

Table 10 Global Moran’s I of the residuals from GWR models

Discussion

Dominant Factors Influencing Water Quality

This study found that the dominant factors varied with the spatial analysis scale. Topographical variables dominated the water quality variations at the 50 m buffer scale. Several other studies also noted the importance of topographical variables. For example, Pratt and Chang (2012) suggested that not only a flat slope may be acting as a sink for total solids (TS) and slowing runoff, but it also may be receiving more pollutants from outside areas and has more residence time to carry more pollutants into the water body. However, Chang (2008) suggested that highly variable slopes could act as sinks for oxygen demand, sediment and nutrients rather than their sources. Our results also showed that mean subwatersheds slope is negatively correlated to the concentrations of Sal, TOC, Hg, As, Cu, Fe, and Se, which indicate highly variable slopes could act as the sink of water pollutants.

Landscape characteristics play an important role in pollutant transport and retention (Chen et al. 2002). Landscape metrics are the mainly influencing factors at the watershed and 200 m buffer scales, especially SHDI and SHEI. In our study area, landscape characteristics appear to be most important in determining water parameters at the 200 m buffer zone or larger scales. However, previous studies generally take landscape metrics as single explanatory variables. Huang et al. (2013) found that SHDI is significantly positively correlated with CODMn. Lee et al. (2009) also reported that degraded water quality was positively associated with SHDI, PD, and ED and negative to LPI.

Our study suggested that the difference of dominant factors in different buffers is due to the urban landscape heterogeneity in the buffer zones. The value of SHDI generally increased with the size of buffer zone (50, 200 m and entire watersheds from the edge of lakes). It indicated that the spatial pattern of land use is singular in the near-lake zones (50 m buffer), and human activity is relatively weak. The topographical variables in the subwatersheds become more important for the pollution export to lakes. With increasing distance from the lakes, the interspersion and diversity of land use types increase, and human activity density strengthens. Thus, the spatial patterns of land use in the entire watershed and 200 m buffer zone dominate the variations in water quality. Lee et al. (2009) suggested that land use patterns affect the hydrological and chemical runoff processes that carry pollutants and nutrients into streams, rivers, reservoirs, and lakes, while the proportions of certain land use types determine the pollutant and nutrient loadings from watersheds. Water quality is likely degraded when different land use types are greatly interspersed and when the number of types increases at the landscape level.

However, our results indicated that the major sources of TN and TP are the non-point pollutions at the watershed rather than at 50 and 200 m buffer scales. This finding is similar to Nielsen et al. (2012). The correlations between landscape metrics and concentrations of TN and TP indicated that diversity and complexity of landscape pattern were negative, which indicated that diversity and complexity of landscape pattern could reduce pollution loading into lakes. Increase of landscape diversity and complexity at the watershed scale could improve the water quality of lakes.

Water quality in Bay/lake is influenced by various factors such as water exchange with ocean/river and water circulation effects, point-source pollutions, and non-point-source pollutions. In our study area, three bays are all isolated from West Sea by sluices and exchanged the water at regular intervals. Furthermore, we did not carried out sampling when the sea flooded in. Thus, the influence of tide to water quality is relatively weak. In addition, the non-point-source pollution has become the major source of pollution with other conditions unchanged. We illustrated that the dominant factors that impact on water quality gradually varied with the expansion of buffer zones, which change from topographical factors to landscape metrics. The impact of human activity on water quality is obvious in a watershed.

Scale Effect on Spatially Varying Relationships

As mentioned in the above discussion section, the scale effect is embodied in the difference of dominant factors at different buffers. On the other hand, the scale effect is also embodied in the differences across effective buffer zones for a given water quality parameter.

Guo et al. (2010) suggested that the impact of land use on water quality could vary along the expansion of the buffer size and that there should be an effective buffer zone where the strongest linkage occurs between land use and water quality. Study would support Guo’s hypothesis. As shown in Fig. 5, the curves of R 2 values from OLS models are fluctuating at different buffers. It can help us to identify the most effective buffer zone for water quality parameters. The effective buffer zone to pH, DO, and Chl is within watersheds, the weakest buffer zone is in the 50 m buffer zone; the effective buffer zone for Sal, TOC, As, Fe, and Se is the same as the watershed, the weakest buffer zone is in the 200 m buffer zone; unlike the other water quality parameters, the effective buffer zone to Hg is within the 50 m buffer zone. The results indicate that the watershed scale is the effective spatial analysis scale for most of the water quality parameters in the study area, which seems to be consistent with the findings of previous studies in some other watersheds (Lee et al. 2009; Pratt and Chang 2012; Trolle et al. 2012). This is because the independent variables at the watershed scale could capture much more information which can offer better explanations for variations in the water quality. However, the studies of Johnson et al. (1997), Sawyer et al. (2004), and Huang et al. (2013) found that land use pattern near water bodies is a better predictor of water quality than in the other buffer zones. Thus, we concluded that the effective buffer zone does exist, but it varies in the different watersheds for given water quality parameters.

Fig. 5
figure 5

Variation of R 2 value obtained from OLS models at three spatial analysis scales

Spatially Varying Relationships Along the Urbanization Gradient

The heterogeneity of land use patterns in different urban regions might contribute to weakness or even offset of relationships between land use pattern and water quality. Tu (2011) found that spatial diversity in the relationships between land use and water quality parameters existed at the watershed scale with different levels of urbanization in eastern Massachusetts, USA.

In this study, the relationships between explanatory variables and water quality parameters at different sampling sites vary along the urbanization gradient, and clear spatial differences in the relationships are observed (Figs. 3, 4). The results obtained from GWR models show that both positive and negative relationships exist in the study area. At 200 m buffer scale, higher mean subwatershed elevation and mean subwatershed slopes are related to a poor water quality condition in highly urbanized areas, while the relationship between mean subwatershed elevation and the water quality condition is not clear in less urbanized areas. At the watershed scale, higher landscape diversity is related to poor water quality conditions in highly urbanized areas, while the relationship between landscape diversity and water quality conditions is not clear in less urbanized areas.

The results from GWR models indicated that the spatially varying relationships between explanatory variables and water quality parameters are more diversified and complex in less urbanized areas than in highly urbanized areas. This is presumably due to various spatial differences for landscape characteristics in each subwatershed of the less urbanized areas. So the less urbanized areas may have complex “delivery” mechanisms for the pollutants. The stronger or even adverse impacts of land use patterns on water quality are usually observed in less urbanized watersheds rather than in highly urbanized watersheds.

Conclusions

In this study, we examined the relationship between 12 explanatory variables and 15 water quality indicators in the three coastal lake watersheds of Xiamen city, China. Both OLS and GWR models were used in order to identify the spatially varying relationships and scale effects along the urbanization gradient. We concluded that (1) topographical features of subwatersheds are identified as key variables in near-lake zones, while landscape metrics are the dominant factors of water quality in the whole watershed and 200 m buffer zones. The impact of landscape metrics on water quality gradually strengthens with the expansion of buffer zones. At the watershed scale, increasing the landscape diversity may improve the water quality of lakes. Land uses are too weak to be considered as a strong influential factor on water quality. (2) Study would support Guo’s hypothesis (Guo et al. 2010), which confirms the existence of effective buffer zones and their variations within different watersheds. (3) Spatially varying relationships between explanatory variables and water quality parameters are more diversified and complex in less urbanized areas than in highly urbanized areas. So we should pay more attention to the relationships between landscape patterns and water quality in less urbanized areas. Integrating more explanatory variables of watershed characteristics can help us to identify the key pollution causes and to establish more reasonable pollutant control policies for urban water bodies.

This study proposed that adjustment of landscape patterns in a watershed is the key measure to improve urban lake water quality, and implementing effective buffer zones are ideal measure to reduce pollutant load into lakes.