Introduction

Agricultural non-point source (NPS) pollution has been a significant threat to water quality and ecosystem of the Miyun Reservoir in recent decades. Runoff leaching from fertilizers and livestock litter contribute to high nutrient levels in some rivers in the agricultural areas, which lead to a wide range of risks associated with eutrophication and toxicity of water bodies and inadequate water supply quality for Beijing (Li et al. 2016; Meissner et al. 2016). Implementation of sustainable land-management practices or conservation practices is becoming an important concern to control NPS pollution in the Miyun Reservoir Watershed (MRW). However, before the selection and placement of suitable management practices in this watershed, it requires a detailed investigation of the extent, distributions and causes of the pollution, and the effectiveness of alternative practices.

The complexity and uncertainties related to the sources, generation, and transport of NPS pollution lead to numerous difficulties in quantifying pollution characteristics and efficiency of best management practices (BMPs) (Arnold et al. 2012b; Qiu et al. 2018b). A mass of research has relied on watershed models or water quality models, such as the Soil and Water Assessment Tool (SWAT) model (Arnold et al. 1998), the Storm Water Management Model (SWMM) (Huber et al. 1975), and the annualized Agricultural Non-Point Source Pollution Model (AnnAGNPS) (Cronshey and Theurer 1998), to thoroughly study NPS pollution characteristics and the impacts of management practices on sediment and nutrient loss at watershed scale. The SWAT model has been applied in numerous studies because of its open code source, its high accuracy (Arnold et al. 1998), and its strengths in performing calibration and validation, and analyzing parameter sensitivity and uncertainty (Abbaspour et al. 2015). Reliable representation of BMPs in models is critical for credible simulation of BMP efficiency. Multiple efforts have proven SWAT’s capability of simulating BMP effects on hydrology and water quality in the watershed processes (Abbaspour et al. 2015; Liu et al. 2018; Volk et al. 2016; Yen et al. 2016). Xie et al. (2015) stated that the SWAT model could simulate more types of BMPs than other models such as HSPF, AGNPS, and AnnAGNPS, including grazing management, grade stabilization structure, and stream channel stabilization. The SWAT model simulates the efficiency of conservation practices by changing the input data or parameters and by operating specific modules. The SWAT-CUP program enables sensitivity analysis, calibration, and uncertainty analysis of SWAT model by linking five optimization procedures, including SUFI2, ParaSol, PSO, GLUE, and MCMC (Abbaspour et al. 2007). In this study, the SUFI2 program was used for parameter sensitivity analysis and model calibration, considering its high effectiveness in calibration and uncertainty analysis (Akhavan et al. 2010; Faramarzi et al. 2009).

BMPs for agricultural NPS pollution have primarily focused on the control of soil erosion and fertilizer amount in China. Traditional practices, including conservation tillage, crop rotation, contour farming, and nutrient management, have been widely implemented in recent years (Jia et al. 2019; Liu and Huang 2013; Smith and Siciliano 2015; Sun et al. 2018). They protect the land surface by limiting soil-disturbing activities, reduce erosion by improving soil structure, and control the source of pollutants by managing the amount of fertilizes (Logan 1993; Prosdocimi et al. 2016; Zhu et al. 2015). Because of the increasing attention on NPS pollution, structural practices including the terraces, constructed wetlands, grassed waterways, and filter strips are widely implemented, especially for stormwater management (Fonseca et al. 2018; Small et al. 2018). These practices are constructed to reduce runoff through increased infiltration (Logan 1993), to absorb nutrients or other pollutants by specific perennial plants and to reduce soil erosion through decreasing the slope. Multiple watershed models are extensively used to evaluate the performance of BMPs. The SWAT model has been demonstrated to be an effective tool to evaluate the effects of alternative management practices on water resources across watershed and regional scales.

The Miyun Reservoir is stressed by an increasing drinking water supply for a large population of 21.75 million in Beijing. Unpredictable changes related climate, land cover, agricultural activities, and pollution sources pose a challenge for freshwater supplies, in view of the high concentration of nitrogen that is near the eutrophication standards (Li et al. 2016; Qiu et al. 2019; Xia et al. 2015; Zheng et al. 2016). The development of integrated water management strategies is necessary to avoid the risk of inadequate water quantity and quality, which thus can ensure freshwater provisioning and improve ecosystem services (Abbaspour et al. 2015; Ferreira et al. 2019; Halbe et al. 2018). The goal of this study was to use SWAT to simulate the hydrological and water quality processes and to assess the efficiencies of several conservation practices at the sub-basin level and monthly time intervals in a large-scale watershed. The objective of this study can be achieved by the following tasks: (1) analyze the parameter sensitivities in the different subwatersheds; (2) identify the spatial-temporal distributions and priority control areas of different pollutants; (3) analyze the causal factors for the spatial-temporal distributions of nutrient loads; (4) assess the effects of conservation practices or BMPs on agricultural NPS pollution. In particular, variations in the efficiency of different practices were analyzed based on surface conditions, pollution characteristics, climate features, and BMP design parameters. The results of this study provide consistent information on the temporal and spatial distributions of the quantity and quality of water resources, highlight the sources of contamination, describe the impact of climate and land use change on water resources, identify the applicability of conservation practices, and eventually provide valuable information for the development of integrated watershed management strategies.

Materials and methods

Watershed description

The Miyun Reservoir is the most important surface drinking water source in China’s capital, Beijing. The MRW (40° 19′~41° 36′ N, 115° 27′~117° 35′ E) is a drained watershed of the Miyun Reservoir, covering two tributaries: the Chao River and the Bai River. This watershed encompasses forests, agricultural areas, pastureland, streams, and industrial and residential sites, with a total area of approximately 14,923.95 km2. The Chao River subwatershed covers an area of 5892.97 km2, and the Bai River subwatershed covers an area of 9030.98 km2 (see Fig. 1). The average annual temperature observed in the period of 1960–2014 is approximately 8.5–12 °C, with extreme low temperatures below − 30 °C and extremely high temperatures 39 °C. The average annual precipitation is 660 mm, and about 80% of the precipitation occurs from June to September. Agriculture and economic forest planting are the main economic activities in the region. The massive application of fertilizers and manures has caused a great release of nitrogen (N) and phosphorus (P) into the streams, aggravating the risk of eutrophication in the Miyun Reservoir. Therefore, it is important to implement appropriate conservation practices to control nutrient losses for ensuring drinking water security in this area.

Fig. 1
figure 1

The location of the Miyun Reservoir Watershed

Model description

The SWAT model was developed to evaluate water quality, NPS pollution, and the impacts of management practices on water resources at watershed scale. It is a conceptual, semi-distributed, continuous time model at daily, monthly, or annual time intervals developed by USDA Agricultural Research Service (USDA-ARS) and Texas A&M AgriLife Research (Arnold et al. 1998; Srinivasan et al. 1998). The input data for the SWAT simulation is shown in Table 1. Digital Elevation Model (DEM) data with a 90-m grid were used to divide the watershed into multiple sub-basins and to extract the information of channel length and average slope in each sub-basin. Sub-basins were then further subdivided into hydrologic response units (HRUs), which are the smallest unique computational units that combine topographical, land covers, crop types, management operations, and soil characteristics with homogeneous hydrologic response for the simulation of flow and pollutant loads (Ahmadzadeh et al. 2016; Molina-Navarro et al. 2016). In this study, land use data (1:100,000) were provided by the Institute of Geographical and Natural Resources Research, China, and were categorized in the map that consisted of forest-mixed land (49.24%), agricultural land (21.54%), grassland (27.33%), open water (1.18%), industrial and residential land (0.55%), and barren land (0.16%). FAO soil (http://www.fao.org/soils-portal/soil-survey/soil-maps-and-databases/en/) was categorized into 14 types, which is shown in Fig. S2 in the Supplementary Material. Local weather data were obtained from weather stations available in this watershed, including daily rainfall, relative humidity, maximum and minimum temperatures, solar radiation, and wind speed. Monthly flow and water quality data (sediment, N and P loads) were provided by local environmental monitoring stations. The calibration and validation of the flows and nutrient loads were performed for 1990–1998 and 1999–2010, respectively, whereas the periods for sediment calibration and validation were 2006–2008 and 2009–2010, respectively, due to limitations of the data. Other input data for the SWAT model, including livestock amounts, crop types, tillage patterns, the types, amount and timing of fertilizer, manure and pesticide application, social economics, and population were collected by field investigations and from statistical yearbooks that were published by the National Bureau of Statistics. Based on the results of on-site questionnaire survey, corn is the main crop usually grown in this area, and 325 kg/ha of fertilizer is applied in cropping land per year, mainly in May and October when approximately 200 kg/ha of fertilizer or manure are applied in the walnut and chestnut forests. Tillage type for corn is ploughing before sowing without mulching film. We assume that irrigation is only available in combination with fertilization, with unlimited volume up to the maximum annual value of 500 mm. Corn, chestnuts, and walnuts are harvested in September. Taking into account the independence and the differences in surface characteristics and water quality between the two subwatersheds, this study calibrated the model in the Chao River subwatershed and the Bai River subwatershed separately.

Table 1 Description of data used in the SWAT model

The model performance indicators were used to assess the agreement between the observed and simulated data, including Nash-Sutcliffe efficiency coefficient (ENS) (Nash and Sutcliffe 1970) and coefficient of determination (R2) (Legates and McCabe Jr 1999). The value of ENS varying from −∞ to 1 is defined in Eq. (1). The value of R2 varying from 0 to 1 is defined in Eq. (2).

$$ {E}_{NS}=1-\frac{\sum \limits_{i=1}^n{\left({P}_i-{O}_i\right)}^2}{\sum \limits_{i=1}^n{\left({O}_i-\overline{O}\right)}^2} $$
(1)
$$ {R}^2={\left[\left(\sum \limits_{i=1}^n\left({O}_i-\overline{O}\right)\left({P}_i-\overline{P}\right)\right)/\left(\sqrt{\sum \limits_{i=1}^n{\left({O}_i-\overline{O}\right)}^2}\sqrt{\sum \limits_{i=1}^n{\left({P}_i-\overline{P}\right)}^2}\right)\right]}^2 $$
(2)

where Oi is the observed data, \( \overline{O} \) is the mean value of the observed data, Pi is the simulated value, and \( \overline{P} \) is the mean value of the simulated data.

BMP scenarios

The assessment of BMP performance is to calculate the changes in water quantity and quality with or without BMP in multi-spatial and multi-temporal scales by changing the model inputs or parameter values in special modules according to a modeling guide for conservation practices (Waidler et al. 2011; Xie et al. 2015). Several conservation practices were evaluated by the well-calibrated model, including converting farmland to forestland (over 15° slope and over 25° slope, CFF15 and CFF25); nutrient management practices such as 20% and 30% fertilizer reduction; no-till (conservation tillage); residue management practice; strip cropping; contour farming; filter strips with 5 m, 10 m, and 15 m (FS5m, FS10m, and FS15m); detention basins; constructed wetlands; and grassed waterways. For land-management practices, such as converting cropland to forestland, we changed the land use input of SWAT model by reforesting the cropping land with slope over 15° and over 25°. The simulation of no-till was conducted by adjusting the mixing efficiency (EFFMIX) and the mixing depth (DEPTIL) in tillage management file (till.dat). In addition, the SCS runoff curve number for soil moisture condition II (CNOP) was changed and the Manning’s roughness coefficient for overland flow (OV_N) was increased, which help to decrease runoff and peak flow rate (Waidler et al. 2011). Contour farming and strip cropping practices were simulated by decreasing CN2 value and USLE practice factor (USLE_P) in the schedule management operation file (.ops). Furthermore, the Manning’s n and USLE cropping factor (USLE_C) values in the .ops file were adjusted for strip cropping operation. For nutrient management practices, the amount of fertilizer applied in cropland was reduced by 20% and 30% in the management file (.mgt). For residue management practice, the CN2 and USLE_C in the .ops file were adjusted to slow surface runoff.

In the SWAT model, FS was evaluated in the VFS routine, where the field area to VFS area ratio (FILTER_RATIO), the fraction of total runoff from a field entering the most concentrated 10% of a VFS (FILTER_CON), and the fraction of flow through the most concentrated 10% of fully channelized VFS (FILTER_CH) were adjusted (Arnold et al. 2012a). The width of filter strip represented by FILTERW was set to 5 m, 10 m, or 15 m to calculate interception efficiency. For grassed waterways, the depth of channel (GWATD), the average width (GWATW), the linear parameter for calculating sediment re-entrained in channel sediment routing (GWATSPCON), and Manning’s n value for overland flow (GWATN) were adjusted to reduce the amount of sediment and nutrients entering rivers. Detention basins was represented by respectively setting the erodibility factor (CH_EROD), the Manning’s roughness (CH_N2), the surface area of ponds when filled to principal spillway (PND_PSA), the fraction of sub-basin area that drains to ponds (PND_FR), and the hydraulic conductivity through bottom of ponds (PND_K) in the pond management file (.pnd). Constructed wetland was simulated by adjusting several wetland parameters in the .pnd file such as the fraction of sub-basin area that drains into the wetland (WET_FR), the initial volume of water in wetlands (WET_VOL), the surface area of the wetland at maximum water level (WET_MXSA), the volume of water stored in wetlands at maximum water level (WET_MXVOL), and the hydraulic conductivity through the bottom of the wetland (WET_K). The description of BMP simulation scenarios is shown in Table 2. More details of BMP scenarios are given in the Supplementary Material.

Table 2 The description of BMP simulation scenarios

In this study, efficiency of each BMP was simulated separately, which was represented as the removal rate (negative value of percent change) of TN and TP loads that was calculated as:

$$ E=-\frac{Load_{BMP}-{Load}_{base}}{Load_{base}}\times 100 $$

where Loadbase is the load with no BMPs, and LoadBMP is the load with a BMP implementation.

Results and discussions

Model calibration and validation

To enhance the accuracy and validity of the model, a parameter sensitivity analysis is required to select the key parameters that have a dominant effect on the hydrological and water quality processes both in the streams and on the sub-basins. In this study, the SWAT-CUP program was used to analyze the sensitivity of the parameters associated with runoff, N and P, which employed the global sensitivity analysis method that regresses the Latin hypercube generated parameters against the objective function values (Abbaspour 2007; Abbaspour et al. 2007). The results are shown in Tables 3 and 4 for the Chao River subwatershed and Bai River subwatershed, respectively. According to the results in Table 3, the ranking of sensitivity of parameters for the runoff simulation is as follows: GWQMN > SOL_BD > REVAPMN > ESCO > ALPHA_BNK > CN2 > SOL_Z. Different from that of other studies (Baker and Miller 2013; Chen et al. 2015; Jang et al. 2017; Leta et al. 2015), the results in this study showed that CN2 was less sensitive. Similar results were reported by other studies (Bai et al. 2017; Zhu et al. 2011). In the SWAT model, a modified Soil Conservation Service curve number method (SCS-CN) is used to predict runoff volume from a rainstorm, and the CN2 value indicates the soil moisture conditions in pervious areas, which has substantial effects on the generation of surface runoff and water balance (SCS 1993). The decreasing precipitation and the low available water capacity of soil layer led to low sensitivity of CN2 in this watershed. In addition, parameters related to groundwater processes (GWQMN and REVAPMN) were sensitive because the water balance system in this watershed is characterized by large contributions of groundwater to streamflow in non-flood season due to the decreasing annual precipitation (Li et al. 2016). SOL_NO3 and SOL_SOLP were identified as the most sensitive parameter for N and P simulations. The initial concentrations of nitrate and soluble P in the soil were in the high degree of sensitivity; these results are consistent with previous studies (Shen et al. 2014; Shrestha et al. 2012). Watershed characteristics and local factors determined the specific parameter sensitivity results that were inapplicable to other watersheds. The sensitive parameters identified in Bai River subwatershed were similar to those in Chao River subwatershed owing to the similar surface characteristics and meteorological conditions within the MRW (see Table 4). The most sensitive parameters were selected for model calibration and validation in the SWAT-CUP program.

Table 3 The sensitivity rank of parameters in the Chao River Watershed
Table 4 The sensitivity rank of parameters in the Bai River Watershed

The R2 values and ENS values were used to evaluate the acceptability of the model performance, which are shown in Table 5. According to the results, the R2 and ENS values were consistent with other application in the literature reports (Ahmadzadeh et al. 2016; Chen et al. 2016; Qiu et al. 2018a; Radcliffe and Mukundan 2017; Shrestha et al. 2016), with a range of 0.53–0.88 and 0.44–0.84, respectively, which indicates that the performance of the SWAT model was satisfactory and applicable to the simulation of flow and NPS pollution in these two subwatersheds. The disagreement between the simulated and observed data may result from the unpredictable agricultural activities and management, the limitation in precipitation representation, the impacts of the regular water withdrawals from the upstream reservoirs, and uncertainties from the model structure and input data (Abbaspour et al. 2015; Leta et al. 2015). This study did not ensure the accuracy of the predicted water storage, and inflow and outflow of these small reservoirs in the simulation process due to the lack of observed data. There are more reservoirs and fewer weather stations in the Bai River subwatershed; this is the important reason that the values of model performance indicators (R2 and ENS) in the Bai River subwatershed were lower than those in the Chao River subwatershed. Additionally, the transferred amount of flow and pollutant from channel to the base flow and groundwater system could not be monitored and validated in the time series. The model accuracy in the flow simulation directly affected the simulation of nutrients and pollutants.

Table 5 The calibration and validation results at the subwatershed outlets

The parameter values differ greatly between these two watersheds. The CN2, SOL_BD, and SOL_K values in the Bai River subwatershed are higher than those in the Chao River subwatershed, which is related to larger surface runoff in the Bai River. However, the GW_DELAY, SOL_AWC, CWQMN, CANMX, REVAPMN, and ESCO values are smaller in the Bai River subwatershed. The proportion of farmland in the Bai River subwatershed is slightly higher than that in the Chao River subwatershed, which resulted in more runoff given its larger CN2. In the Chao River subwatershed, the land use types are dominated by forestland (including economic forest) (50.29%) where the vegetation has important ecological function on water and soil conservation. However, the larger area of barren land with soil erosion contributed to large nutrient loads in this subwatershed. The differences in parameter values account for the discrepancy in the effects of the same conservation practice between these two subwatersheds.

Influencing factors analysis

The relationship between precipitation and temporal distribution of pollutant loads

The temporal distribution of annual and monthly pollutant loads could be used to identify the relationship between precipitation and nutrient losses (see Fig. 2). Precipitation is the driving force for NPS pollution. During rainstorm events, substantial nutrient and sediment losses are caused by surface runoff and leaching, especially in the agricultural areas with extensive use of fertilizers and frequent cultivation (Abbaspour et al. 2015; Li et al. 2017; Prosdocimi et al. 2016). As shown in Fig. 2 a, b, and c, there is a good linear relationship between annual precipitation and runoff (R2 = 0.929, statistical significance identified by p < 0.05) and both TN and TP loads were increasing along with the increase of annual precipitation, which approached a linear relationship (R2 = 0.630 and R2 = 0.636, respectively), e.g., high values of runoff and nutrient loads were observed in typical wet years due to the large precipitation volume (1990 and 1998), whereas the values were low in typical dry years (2002–2005). However, the findings shown in Fig. 2 also imply that precipitation is not the exclusive influencing factor on the nutrient loss from the surface, and other factors, such as fertilizer types, amount and duration of exposure, and even changes in land covers and tillage patterns, have significant impacts on the temporal distribution of nutrient loads (Adimassu et al. 2017; Jakrawatana et al. 2017). This can be explained by the uneven distribution of mean monthly nutrient loads. Figure 2 e and f show that the monthly nutrient loads are not linearly correlated with precipitation, although runoff had a linear correlation with precipitation (Fig. 2d, R2 = 0.756, p < 0.05).

Fig. 2
figure 2

The relationship between precipitation and variables: annual runoff (a), annual TN (b), and annual TP (c); monthly runoff (d), monthly TN (e), and monthly TP (f)

As shown in Fig. 3 a, the rainfall was concentrated in the period from May to September, which contributed to the heavy runoff and nutrient loads for the year. The inter-annual distributions of pollution loads indicated a seasonal variation of a low level in the dry period and a sharp increase from the normal-flow period to the flood period; this result was consistent with previous studies (Du et al. 2014; Ylöstalo et al. 2016). As shown in Fig. 3 b, the high nutrient loads in March revealed that after a dry period there was a “first flush effect,” which refers to rapid changes in water quality after early-season rains (Gupta and Saul 1996). Within one year, the periods for frequent cultivation, fertilization, and harvesting were distributed in different months (from April to October), which were the critical periods for soil erosion and nutrient losses along with increasing precipitation. In the MRW, with the application of base fertilizer, corn was planted in May and harvested in September or October; thus, the high nutrient losses in May were caused by the timing of planting, fertilizer application, and soil disturbance.

Fig. 3
figure 3

Temporal distributions of pollutant loads: annual TN and TP loads (a), monthly TN and TP loads (b)

The spatial distribution of pollutant loads and influencing factors related to land uses

The spatial distributions of the mean annual runoff and NPS pollution loads from 1990 to 2010 are shown in Fig. 4, which demonstrated that NPS pollution had apparent spatial heterogeneity. The spatial distributions of runoff and different pollutants were in agreement with the spatial distributions of land uses. The runoff and nutrient losses were higher in the sub-basins where agricultural land accounted for a large area. These results were consistent with the results of other studies (Cerdà et al. 2016; Ockenden et al. 2017). The land use map is shown in Fig. S1 in the Supplementary Material.

Fig. 4
figure 4

Spatial distribution of average annual precipitation (PREC) (a), runoff (b), sediment (c), nitrate nitrogen (NO3) (d), organic nitrogen (ORGN) (e), total nitrogen (TN) (f), organic phosphorus (ORGP) (g), soluble phosphorus (SOLP) (h), and mineral phosphorus attached to sediment (SEDP) (i) and total phosphorus (TP) (j)

Land use has significant effects on water quality through NPS pollutants within a watershed (Cerdà et al. 2016; Wu et al. 2012). Figure 5 illustrates the nutrient load intensities from different land uses. The nutrient losses were most serious in agricultural land, due to conventional ploughing in sloping terrain, lower vegetation coverage, soil disturbance, the application of pesticides, nitrate and phosphate fertilizers and manures, and soil compaction and sealing on cropping land. Additionally, barren lands were also associated with high load intensities of nutrients, where soil erosion and nutrient loss is prone to occur owing to low vegetation cover and intensive disturbances by composting activities and garbage disposals in this watershed. These results are consistent with previous studies (Bu et al. 2014; Shen et al. 2013). Additionally, forestland and pastureland released relatively high amounts of nutrients, which might result from the use of manure (200 kg/ha set in the SWAT model) in economic forests to enhance the production of dry fruit, such as walnuts and chestnuts, and the high-intensive soil disturbance caused by grazing within pastureland.

Fig. 5
figure 5

The relationship between land uses and nutrient loads. AGRL means agricultural land, FRST means mixed forest, PAST means pasture, BARR means vacant land, UIDU means industrial land, URMD means residential land with medium density, and URML means residential with med/low density

The spatial distribution of pollutant loads and influencing factors related to soil types

As stated in previous studies, the soil characteristics and properties are critical factors that affect when and where runoff and diffuse pollution occurs (Fuka et al. 2016). The spatial variation of soil properties directly determines the spatial distributions of nutrient loss, thereby providing significant information for land-management decisions. The nutrient load intensities on different soils is shown in Fig. 6. The nutrient losses from several types of soil were high, including calcaric Cambisols (CMc), eutric Cambisols (CMe), haplic Greyzems (GRh), and luvic Kastanozems (KSl). This is associated with their properties implicated by hydrologic groups C or D that indicate weak ability of water conservation, and fertility preservation due to small soil porosity, slow infiltration rate, strong penetration, and high potential to runoff generation. These soil types are dominated by Cambisols and Regosols, which are intensively used for agricultural production and grazing land in northern China. Cambisols concentrated in temperate regions are the most productive soils distributed in mountainous terrain along with active erosion association with other mature tropical soils (FAO 2016). Farmers apply large amounts of fertilizers to make the soils fertile in order to produce large crop yields. The Regosols are distributed mainly in eroding lands of mountainous regions with steep slope, particularly in the dry tropics, where the soil erosion is combined with wind erosion and water erosion (FAO 2016). In this regard, a large amount of nutrient leaches into water bodies when Regosols are concentrated in the high-intensive irrigated areas.

Fig. 6
figure 6

The relationship between soil types and nutrient loads. ARc represents calcaric arenosols, CMc represents calcaric Cambisols, CMe represents eutric Cambisols, FLc represents calcaric fluvisols, GRh represents haplic Greyzems, KSl represents luvic Kastanozems, LVg represents gleyic luvisols, LVh represents haplic luvisols, LVk represents calcic luvisols, RGe represents eutric Regosols, and RGc represents calcaric Regosols

Efficiencies of BMPs

The simulation results revealed that nutrient loss in the MRW was serious. Average reduction of at least 85.91% of TN load in the Chao River is necessary to comply with the surface-water quality standard (No. GB38382002) that stipulates the limit of TN concentration which is 0.5 mg/L, and reduction of at least 68.53% is necessary in the Bai River, because of the massive use of fertilizers (high N) and manure and a poor wastewater treatment system. Although TP does not exceed the designated water quality standards (0.1 mg/L) at mounts of the Chao River and Bai River, we included it as an important environmental indicator to assess the BMP efficiencies in the MRW. Considering the impacts of land covers and nutrient losses from soil on water quality, a stricter land use program and other practice for source control and water quality restoration should be implemented for integrated watershed management. The monthly TN and TP loads under different BMP scenarios were simulated in SWAT. Generally, the efficiencies of most structural BMPs in the Bai River subwatershed are slightly lower than those in the Chao River subwatershed. The higher CN2 (higher runoff) and the slightly lower concentration in the Bai River are the main reasons. A rapid runoff on slopes leads to short retention time of runoff flowing over these structures, which is adverse to their effectiveness. This result was consistent with other studies (Fonseca et al. 2018; Jayakody et al. 2014; Liu et al. 2017). However, the land use management practices for controlling nutrient losses in the Bai River subwatershed were more effective than those in the Chao River subwatershed because the proportion of agricultural land that was targeted for re-planning was higher in Bai River subwatershed (Table 6), which contributed to greater changes of land covers and pollution sources. The model also showed that different BMPs had significantly different impacts on the reductions of nutrient loads. Generally, the removal efficiencies of structural BMPs such as filter strips, grassed waterways, and constructed wetlands were better than those of non-structural BMPs.

Table 6 Land use scenarios of converting farmland to forestland (CFF15 and CFF25)

Land use management practices

Integrated watershed management requires the coordination between land uses and water quantity and quality. Implementation of eco-friendly tillage patterns, such as residue management and no-till, resulted in very slight improvements in nutrient loss throughout the watershed (Fig. 7). However, in some sub-basins, they resulted in small increases or no significant change in TN and TP losses. This can be explained by the limited surface runoff and soil erosion and the model limitation that assumes a specific time of tillage for the whole watershed, but this assumption contradicts the actual timing of tillage that varies in different areas within the watershed and depends on local climate conditions and farmer preferences. In addition, the eco-friendly tillage patterns only have the potential to control the small amount of nutrients in the surface and subsurface layers that are available for redistribution by surface runoff. Moreover, increased residues limit soil-disturbing activities, which results in limited incorporation of manures or fertilizers within soils and more nutrient accumulating in topsoil that further increases the amount of nutrient washed out by rainfall and runoff. These results were supported by other studies (Lam et al. 2010; Lam et al. 2011; Taylor et al. 2016; Wallace et al. 2017). The low efficiency can also be attributed to a potential increase in soluble nutrient loads because infiltration and nutrient transport in subsurface flow are increased with more residue cover, and solute nutrients are easily leached through subsurface tiles. It has been found in previous research that solute nutrients are the main components of nutrient losses in the MRW (Qiu et al. 2018b).

Fig. 7
figure 7

The monthly efficiency of BMPs for TN load (a) and TP load (b) among different sub-basins. Point value means the average monthly removal rate of different sub-basins, bar height means the differential between the top quartile and the bottom quartile, and line length means the differential between the highest and the lowest values

The land use change scenarios by CFF15 and CFF25 are shown in Table 6. The croplands with slope over 15° and over 25° slope fields were reforested, respectively. Figure 7 shows that the percent reductions of TN load are similar to those of TP load achieved by CFF practices in this watershed. As previously noted, agricultural lands are the critical sources of water pollution in this watershed. The massive application of fertilizers and manures and frequent activities has caused the release of a large amount of N and P, resulting in the risk of eutrophication in the Miyun Reservoir, particularly from slopping field, where the nutrients are prone to leach into surface runoff accompanied by soil erosion during heavy rain events. The conversion of cropland to forestland on sloped land is part of a national ecological recovery program in China, which has lots of significant ecological impacts, including the minimization of wide-scale soil erosion and vegetation degradation, increases in the soil moisture, reductions in runoff, decreases in the use of fertilizers or manures, and improvements in water quality through the absorption of nutrients or other pollutants by vegetation (Huang et al. 2017; Zhang et al. 2000). The reduction efficiencies of CFF15 for TN and TP loads reached more than 10%, and those of CFF25 were 7–9%. These results are consistent with another study (Liu et al. 2013). Consequently, the small changes between the baseline scenario and the CFF scenarios can be attributed to the low percentage of slopping fields (22% of the slopping area) that are reforested in this watershed. In addition, reforestation with economic forests was represented in the model by setting a small application of fertilizers and manures and specific logging time in the management operation file, which may increase nutrient losses and soil erosion due to low grass coverage in this afforested area, although there are significant reductions in nutrient losses from cropland.

According to Fig. 7, contour farming and strip cropping on high slope fields (over 15° slope) have similar impacts on nutrient loads. Contour farming involves tillage, planting, and other farm operations along the contour lines to increase water infiltration and reduce runoff, soil erosion, and transport of contaminants (Gao and Yang 2015; Sklenicka et al. 2015; USDA-NRCS 2016). In this study, contour farming reduced mean monthly TN and TP losses by 24.09% and 21.15%, respectively. Compared to other studies (Jeon et al. 2018; Liu et al. 2013), the greater reduction of nutrient loads achieved by contour farming is attributed to the larger area of slopping field that is dominated by dry land, which accounts for 45.79% of the total area of cropland in the MRW. The main crop in sloping fields is corn that has a weak ability to maintain soil and water. Moreover, nutrient losses are easier to induce in dry land than in paddy field, which has been reported in previous studies (Shen et al. 2013). Consequently, the greater reductions of nutrient loads were achieved by contour farming in the sloping fields (dry lands) in the MRW. Consequently, the local conditions, such as slope, crop type, tillage pattern, and soil property, have a substantial effect on the efficiency of contour farming.

Strip cropping, a farming method to conserve soil structure and prevent soil erosion on a steep slope, arranges and cultivates a field into equal width strips with a crop rotation system (USDA-NRCS 2016). In this study, strip cropping reduced mean monthly TN and TP losses by 24.04% and 21.10%, respectively. In contrary to other studies (Fan et al. 2015; Jakrawatana et al. 2017), strip cropping tended to be the most effective at reducing TN than TP in the MRW, due to its capability to reduce surface runoff and erosion and the significantly increased N uptake in plant biomass with the application of urea that is the critical source of N loss. In the MRW, the majority of cultivated land is distributed on hillsides, and the implementation of contour farming and strip cropping can help preserve soil fertility by minimizing soil erosion as a result of the roots of natural dams holding the soil on the steep slopes. These results indicated that nutrient-use efficiency and soil structure could be improved by strip cropping, which were supported by other studies (Tauchnitz et al. 2018; Wagena and Easton 2018); therefore, strip cropping should be extensively implemented in the MRW.

Nutrient management practices

Based on the on-site questionnaire survey, the current fertilizer amounts applied in cropping land are 325 kg/ha for corn, which would be adjusted in the nutrient management scenarios. The results obtained from the scenarios showed that small (20%) reductions in fertilizer application for arable land resulted in reductions in nutrient loads at the watershed outlet. The simulated values of the reductions in the average monthly loads for TN and TP are 8.22% and 9.74%, respectively. Similar results were obtained in other studies (Lam et al. 2011; Liu et al. 2013). When the fertilizer amount decreased by 30%, the TN and TP loads decreased by 8.73% and 10.20%, respectively. The high content of N in the soil resulted in milder effects of these scenarios on TN than on TP. Nutrient management practices have not covered the fertilizers and manure used in economic forests; this is the primary reason for the relatively low efficiencies of nutrient management practices. Because of the limited data regarding the amount of manure used in this watershed, the control of the amount of chemical fertilizers was applied as a suboptimal nutrient management practice.

Structural best management practices

As shown in Fig. 7, structural BMPs achieved large reductions in TN and TP losses in the sub-basins. Filter strips were added along channels and the edge of fields. The results indicate that filter strip is an effective option to control nutrient losses in these two subwatersheds. Filter strips of 5-m, 10-m, and 15-m width reduced mean monthly TN losses by 23.89%, 37.23%, and 40.42%, respectively, and reduced mean monthly TP losses by 37.01%, 39.34%, and 41.69% in the sub-basins. A reduction in the total area of cropland, a reduction in the total amount of fertilizer and manure applied to cropland within the sub-basin, and a great nutrient transformation/removal by soil microbial processes due to hydrologic pathway alterations are most likely responsible for the reduction in nutrient losses under buffer strips. These results are consistent with the pollution removal rates of filter strips noted by the Conservation Practices Modeling Guide for SWAT and APEX (Waidler et al. 2011), the values accepted for loading analyses of vegetated filter strips (NHDES 2011), and the findings in other studies (Taylor et al. 2016; Vought et al. 1995). The effectiveness of field strips highly depends on the width of the buffer strips and the extent of the area to which they are applied (Klatt et al. 2017; Stehle et al. 2016). The reductions of nutrient loads increased along with the increase of strip width from 5 to 15 m. As evidenced by our study and the findings of others (Glavan et al. 2012; Klatt et al. 2017), field strips tended to be most effective at reducing TP export than TN. This is due to capability of field strips to reduce surface runoff and surface erosion, which are the primary drivers for sediment associated P export that is the main form of TP losses in the MRW (Qiu et al. 2018b). Furthermore, application of nitrogenous fertilizer (urea) in Cambisols and Regosols soils significantly increases N losses by rainfall-runoff. High TN concentrations in surface runoff can stimulate plant growth in the strips, which results in more P uptake by the plant, and this also partially explains the further reduction in TP by filter strips. These results were supported by other studies (Rousseau et al. 2013; Wagena and Easton 2018).

A constructed wetland is a biological treatment system for sediment retention and nutrient removal that is widely implemented to control streamwater pollution on the urban and watershed scale (USDA-NRCS 2016). As shown in Fig. 7, implementing a constructed wetland within the sub-basin reduced TN and TP losses by 28.03% and 37.83%, respectively. These results have been well documented by previous studies (Ding et al. 2016; Johari et al. 2016; Kuschk et al. 2003), which indicated that the mean removal rates for TN load in a constructed wetland ranged from 10 to 53%, and from 40 to 45% for TP load. The wide ranges of nutrient removal in wetland systems are mainly influenced by the biological process, which depends on plant species, retention times, climatic conditions, soil properties, landscape impact, wastewater types and amount, and system configurations, such as capacity, seasonal storage, and height of embankments (USDA-NRCS 2016; Wu et al. 2015). As previous studies (Ma et al. 2010; Zheng et al. 2016) proposed, the MRW has seen increased precipitation intensity that causes rapid surface runoff, especially on the slope. The residence time is significantly shortened when water flows across constructed wetlands, which is not conducive to the function of constructed wetlands. The increasing precipitation in winter and decreasing precipitation in summer may have substantial seasonal effects on the efficiency of constructed wetlands, which will be further analyzed in the future.

According to Fig. 7, the removal rate of grassed waterways for TN and TP are 32.67% and 35.02%, respectively, which are close to the efficiencies of filter strips. These results are consistent with the pollution removal rates of grassed waterways noted by the Conservation Practices Modeling Guide for SWAT and APEX (Waidler et al. 2011). Grassed waterways are constructed channels with suitable vegetation for the transport of polluted water and are typically located in agricultural fields (USDA-NRCS 2016). The vegetation can effectively reduce surface erosion, trap sediment, and retain nutrient on the farmlands. As noted in other studies (Qi et al. 2016; Shipitalo et al. 2012; Smith et al. 2015), the removal rates of grassed waterways for N and P loads in surface runoff range from 18.7 to 52.2% and from 19.6 to 52%, respectively. The results of this study fell within the ranges, due to the average slope of the MRW is within the slope interval of these studies. Similar to filter strips, the efficiency of grassed waterways is mainly affected by local conditions and design parameters, such as capacity, stability, width, depth, and slide slopes.

A detention basin is constructed with an engineered outlet to detain sediment-laden runoff, waste solids, and other debris at the site or from other areas (USDA-NRCS 2016). In this study, the removal rates of detention basin for TN and TP loads were 38.36% and 32.60% (see Fig. 7). These results are slightly below the reference values (55% and 68%) in the Conservation Practice Modeling Guide (Waidler et al. 2011). The effect of detention basins on nutrient loads is mainly through reductions in surface runoff and a result of sediment trapping. The loads of sediment and sediment-bound nutrients in surface runoff are relatively lower in the MRW (Fig. 4) than other watersheds (Forsee and Ahmad 2011; USEPA 2009). Thus, a smaller amount of nutrients was detained by detention basins in the MRW. As evidenced by the comparison of our study and the findings of others, the effectiveness of detention basins is also dependent on local conditions and watershed characteristics.

Conclusions

Agricultural NPS pollution is the major cause of water quality problems in the Miyun Reservoir, the only drinking water source for Beijing. This study used the SWAT model to assess the spatial and temporal distributions of NPS pollution, the factors that affected NPS pollution, and the efficiencies of several conservation practices or BMPs to control nutrient losses. The model determined that the nutrient losses were higher in agricultural land and barren land and were significantly affected by soil properties. The soils with high nutrient losses included Cambisols and Regosols, which are used intensively for agricultural production in northern China. The temporal distribution of pollutant loads was consistent with that of precipitation. Then, the efficiencies of several conservation practices were evaluated using the well-calibrated model, which would help to recommend the appropriate practices for water quality improvement and ecological restoration. The removal efficiencies of structural BMPs such as filter strips, grassed waterways, and constructed wetlands were found to be better than those of non-structural BMPs, such as residue management, conservation tillage, and nutrient management. The parameter values and the BMP efficiencies differed between the Chao River subwatershed and the Bai River subwatershed owing to slight variances in the land cover, topographic features, and hydrological characteristics.

The findings of this study would be useful to identify the priority control areas and critical control periods of NPS pollution and generate the possible combinations of BMPs to protect drinking water quality in Beijing. Further research is needed to identify the crucial factors that influence the efficiencies of BMPs and extend the potential applications of the BMPs or their combinations that result in the greatest reduction in pollutant loads and requiring the least cost to achieve targeted water quality improvement at watershed scale.