Introduction

In the last three decades, water quality legislation has substantially decreased point source pollution into the streams and rivers of the United States of America (Malina 1996; Mays 1996). While reductions in point source pollutant loads have had remarkable positive impacts on individual river-systems, many others are still not meeting state and national water quality objectives. Indeed, failure to preserve water quality and quantity may significantly impact human quality of life, and may have potentially irreversible effects on ecosystems. These consequences underlie initiation of major federal programs to address basin scale watershed issues, e.g., (1) the USGS National Water Quality Assessment (NAWQA) Program, and (2) the USEPA Total Maximum Daily Load Program (TMDL). To reach the water quality objectives of these programs, scientists and regulators increasingly focus on non-point sources of pollutants in watersheds. Thus, the watershed has been widely adopted as the logical unit for scientific study and management of non-point sources of water pollution.

Wetland restoration is commonly presented as an important strategy for maintaining and enhancing the water quality and ecological capital of watershed scale ecosystems. While numerous studies have assessed ecosystem function of individual wetland restoration projects (Richardson 1985; Mitsch and Reeder 1991; Mitsch et al. 1995; Bruland and Richardson 2005, 2006), fewer studies have assessed the “cumulative impact” of wetlands on watersheds (Whigham 1988; Childers and Gosselink 1990; Johnston et al. 1990; Weller et al. 1996). None of these studies have examined the importance of watershed-level land-use variables as potential sources of water quality impairment that can be addressed through riparian (i.e., integrated wetland and/or stream) restoration.

The key objectives of our study are: (a) to develop a decision support tool for prioritizing wetland restoration sites within the context of watershed-level water quality functions, (b) to apply our decision tool to select restoration sites in the Harrison Creek watershed in eastern North Carolina (Fig. 1), and (c) to quantify the effect of restoring high priority sites on in-stream nutrient concentrations in the Harrison Creek watershed.

Fig. 1
figure 1

Location of study watersheds in eastern North Carolina, USA

Methods

Statistical modeling

We examine the relationships between land-use/land-cover and water quality on the Coastal Plain of North Carolina, an area of low topographical relief and high annual precipitation. The Coastal Plain has historically supported vast areas of wetlands including pocosins and carolina bays on poorly drained interfluvial areas and bottom-land hardwood forests on the floodplains of major river systems. Many of these wetland areas have proven to be prime agricultural land after the installation of extensive drainage systems (Sharitz and Gresham 1998; Richardson et al. 1981; Richardson and McCarthy 1994; Tiner 2003).

Our models were developed using water quality data from 14 watersheds in eastern North Carolina (Fig. 1) that are part of the United States Geological Survey’s National Water Quality Assessment (NAWQA) Program. A subset of stations were selected based on location within the coastal plain of North Carolina, and availability of parameters including instantaneous discharge, total phosphorus (TP), ortho-phosphorus (OP), and Nitrate-Nitrite (NOx). The dataset included over 300 individual samples collected between 1992 and 1997. The land-use and land-cover characteristics of the watersheds of each NAWQA station were summarized using the ArcView® GIS system and publicly available spatial data from the United States Geological Survey, the United States Department of Agriculture (USDA), Natural Resource Conservation Service (NRCS), and the United States Fish and Wildlife Service (USFWS). The database includes area of wetlands from the National Wetlands Inventory (USFWS 19752005) and other land-uses from the 1992 National Land Cover Dataset (Vogelmann et al. 2001), types of wetlands in the watershed, and continuity of riparian buffers. Codes describing watershed variables are presented in Table 1. These codes have a prefix describing landscape position and a numeric suffix describing land-use. A prefix of WS indicates land-use variable as a percentage of the entire watershed area, and BF indicates percent area of a buffer extending 90 m on both sides of a given stream reach. Prior-converted croplands were defined as polygons with hydric soils, current agricultural land-uses, and intersection by drainage ditches.

Table 1 Description of predictor variable codes used in the classification tree models

The first phase of our study was to identify watershed scale sources of water quality impairment having potential for mitigation through stream and wetland management and/or restoration. We developed regression tree models using the ‘‘rpart library’’(Therneau and Atkinson 2005) for the R statistical package (R Development Core Team 2008). Rpart produces a graphical tree model that can be thought of as a decision tool relating watershed variables such as land-use/land-cover, soils, buffer continuity, wetland fragmentation, and hydrologic discharge to water quality variables including ortho-phosphorus, total phosphorus, and nitrate-nitrite concentration. Regression tree models are well suited for exploring and modeling complex data sets, which are nonlinear and often display interactions between variables (Breiman et al. 1984; Ripley 1981; De Ath and Fabricius 2000). Tree-based modeling is an increasingly accepted technique for uncovering structure in data sets because it: (a) is useful for variable screening, (b) can be used for developing prediction rules that are easily evaluated, (c) is invariant to monotone re-expressions of predictor variables, (d) allows for missing values, (e) allows simultaneous use of continuous and categorical variables, (f) is adept at capturing non-additive behavior, and (g) does not have the restrictive assumptions of many other techniques (Qian and Anderson 1999; Qian et al. 2001; Qian 2009).

The model solutions produced by recursive tree procedures are not the only feasible model for a given dependent variable (Breiman et al. 1984). The collinearity of variables creates opportunities for alternative splits that perform nearly as well as the selected split. This is a consequence of the ‘greediness’ of the regression tree algorithms such as rpart (Venables and Ripley 1999). If several predictor variables are very similar in their ability to improve model homogeneity at the first split, it is possible for the model algorithm to select randomly between these competing variables.

The random forest method (Breiman 2001) is an alternative modeling procedure that addresses the shortcomings of rpart. The random forest procedure does not produce a single graphical decision tree, but relies on a “forest” of 500 or more trees developed from bootstrap samples of training data. The product of this procedure is a nutrient concentration prediction for a given combination of predictor variables that is the “consensus” of predictions from the individual trees that make up the forest. Predictions produced by Random Forest models typically demonstrate 30% lower error rates than those of individual tree models (Breiman 2001). The accuracy of model predictions can be summarized with a pseudo-R 2 produced by comparing the nutrient concentration prediction of a single tree to an observed value. The R 2 for the entire forest is then calculated using the equation:

$$ R^{2} = 1 - {\frac{\text{MSE}}{{\text{var} \left( y \right)}}} $$
(1)

where MSE is the mean squared error between predicted and observed nutrient concentrations averaged from all trees making up the forest, and var(y) is variance in the observed data set (Breiman 2001; Parkhurst et al. 2005).

A key feature of random forest is an assessment of the importance of explanatory variables in influencing the predictive accuracy of the forest of individual tree models. Importance values utilize out-of-bag samples (i.e., cases excluded from the bootstrap samples used to develop individual trees) to evaluate error of fit of each tree model as explanatory variables are randomly permuted and error of fit is evaluated. The importance of an individual variable increases as a function of the average increase in MSE when the variable is permuted compared with the average MSE when no explanatory variables are permuted (Liaw and Wiener 2002; Parkhurst et al. 2005).

Site monitoring

We monitored water quality at the Barra Farms Cape Fear River Mitigation Bank located in the headwaters of Harrison Creek (Fig. 2). This monitoring was performed to assess nutrient delivery to the Cape Fear River from prior converted croplands and a restored wetland, and to provide data for validation of our statistical model. The site is located within a Carolina Bay and displays the shallow elliptical basin, organic soils, and broadleaf evergreen plant communities that are characteristic of these wetlands (Bruland et al. 2003). Large areas of this wetland were ditched and converted to agricultural production, but 25% remained undisturbed. In 1997 about 240 hectares of converted agricultural land was restored to wetland, while adjacent agricultural areas remained in production. This spatial arrangement allows for the unique opportunity to monitor areas of Harrison Bay that are largely unimpacted reference areas, areas that remain actively managed for agriculture, and former agricultural areas that have been restored to wetlands (Fig. 2). Flows from these areas were quantifiable because they occur through clearly defined drainage ditches. We delineated the watersheds of the restored, agricultural, and reference areas of Harrison Bay using ArcView® GIS. The land-use variables described in Table 1 were compiled for each of these local watersheds. Water samples were collected monthly from the restored, reference, and adjacent agricultural areas for 2 years beginning in the spring of 1999. Each watershed site was gauged to determine daily streamflow volume between September 1999 and April 2001.

Fig. 2
figure 2

Location of the Barra Farms study site within the Harrison Creek watershed. Letters on the inset aerial photograph indicate: A) restored wetland, B) active agriculture, and C) reference area

Results and discussion

Regression trees

Natural log transformed ortho-phosphorus (OP) concentration was modeled using 34 potential explanatory watershed variables. The rpart modeling procedure selected a small subset of these explanatory variables for their ability to reduce the variance of the response variable. These included the portion of 90 m wide stream buffers in agriculture (BF.AG.ALL), the percentage of basin-wide area of prior converted croplands (PRIOR), the portion of watershed area in transition from agriculture to forest (WS.TRANS) and instantaneous measure of discharge standardized as millimeters per day (DIS.MM) (Fig. 3). The model has a cross-validated r-square of 0.83. Water samples with the highest OP concentrations (lowest water quality) came from watersheds where prior converted croplands account for more than 4.4% of the total watershed area and where agricultural land uses account for more than 16.9% of buffer areas. Watersheds with the lowest OP concentrations have small areas of prior converted agricultural lands and the presence of some areas of transitional land-use (WS.TRANS) that often indicate cleared agricultural lands are reverting to forest or wetland due to low commodity prices, or due to conservation set-asides. Thus in targeting restored wetland sites, primary objectives would be to reduce total area of prior converted cropland and increase buffer continuity by locating restored wetlands within stream buffer areas and/or on prior converted croplands.

Fig. 3
figure 3

Regional regression tree of log natural transformed ortho phosphorus concentration. The text above each split shows the variable that is being split and the test below indicates the threshold value identified by the model (% of watershed or buffer area). The text below the terminal nodes shows the natural log of the predicted constituent concentration. See Table 1 for abbreviations

A regression tree model for natural log transformed total phosphorus (TP) with a cross-validated r-square of 0.71 is presented in Fig. 4. The explanatory variables most able to reduce the variance of the response variable included the proportion of the basin classified as prior converted croplands (PRIOR), wetland fragmentation (WET.PERI.AR), and the portion of stream buffers used for pasture/hay (BF.AG.PAST). The highest TP concentrations occur in watersheds where prior-converted cropland (PRIOR) constitutes more than 4.41% of the basin and substantial portions (>4.78%) of stream buffers are used for pasture/hay (BF.AG.PAST). The lowest total phosphorus concentrations are predicted in watersheds where prior converted cropland constitutes less than 4.41% of the total area, the same as found for ortho-phosphorus model, and wetland fragmentation (WET.PERI.AR) is low (as indicated by perimeter/area ratios of polygons from U.S. Fish and Wildlife Service National Wetland Inventory). The management implications of this model are much the same as those of the ortho-phosphorus model except greater impetus is given to management actions that reduce wetland fragmentation.

Fig. 4
figure 4

Regional regression tree model of log natural transformed total phosphorus concentration. See Fig. 3 caption for further explanation

A regression tree model for in stream nitrate-nitrite (NOx) concentration is presented in Fig. 5. The portion of undeveloped land, i.e., forest and wetland, within the riparian buffer (BF.UNDV) largely explains the variability in stream NOx concentration. The lowest NOx concentrations (highest water quality) are predicted for streams where BF.UNDV accounts for greater than 71.66% of the watershed area, evergreen forests (WS.FOR.EVG) coverage is greater than 10.77%, and samples were collected in low flow conditions (DIS.MM < 1.02 mm/day). The highest NOx concentrations (lowest water quality) occur in watersheds where BF.UNDEV coverage is less than 71.66% and where wetland fragmentation (WT.PERI.AR) is relatively high. The NOx model presented in Fig. 5 has a cross-validated r-square of 0.71. The management implications of this model support the importance of maintaining riparian buffer continuity and reversing wetland fragmentation in reducing NOx concentrations in streams. Both of these management directives are readily addressed through stream and wetland restoration.

Fig. 5
figure 5

Regional regression tree model of NO3–NO2 concentration. See Fig. 3 caption for further explanation

Random forests

The random forest procedure was used to develop regional models relating watershed explanatory variables to total phosphorus, total nitrogen and nitrate-nitrite concentrations at USGS NAWQA program stations located across the coastal plain of North Carolina. These data were used to “train” our model and water quality data collected from the restored, reference, and agricultural subwatersheds of Harrison Creek were used as “test” data sets. Table 2 lists the R 2 values (%) of predicted versus measured values for both regional (training data set) and local watershed (test data sets) predicting NOx, TP, and OP concentrations, and the natural log transformed constituents (lnTP, lnOP, lnNOx). These values ranged from 15 to 94.7%. In our regional models Land-use/Land-cover variables along with discharge explained between 74.81, 75.01, and 83.72 percent of variance in log transformed TP, NOx, and OP concentrations, respectively. These numbers are quite high for regional models and suggest our random forest models can provide useful insights about the relationships between watershed land-use/land-cover variables and water quality. Within the local watersheds of Harrison Creek, the random-forest models performed best in the restored watershed where models explained between 26.2 and 94.7% of variance. Accuracy of model predictions were not as strong in the Agricultural and Reference watersheds suggesting that other factors such as human management (fertilizer application) and other stochastic processes played larger roles in these relatively small watersheds.

Table 2 The R 2 values (%) between Random Forest model predictions and observed water quality constituent values for both the regional training data set and the local test data sets from the Harrison Creek watershed

The random forest procedure generates importance indices describing the predictive value of each potential explanatory variable (Parkhurst et al. 2005; Breiman 2001). While the outputs of the random forest models lack the graphical representation and the threshold values generated by the rpart procedure, they are expected to be a more robust ranking of the strength of relationship between a given explanatory variable and predicted water quality (Breiman 2001). The 10 explanatory variables with the highest importance values derived from the regional models of OP, TP, and NOX are shown in Table 3.

Table 3 Explanatory variable importance values (IMP; see text) produced by Random Forest analyses of regional TP, OP, and NOX datasets

The single most important predictor of phosphorus concentrations in the study watersheds, for both OP and TP, was the portion of the watershed classified as prior converted cropland (PRIOR). Several other studies have identified drained agricultural lands as having substantial impacts on water quality in the coastal plain region of North Carolina (Evans et al. 1991, 1995; Skaggs et al. 1994; Dukes and Evans 2006). This is due in part to the enhanced transport of dissolved and sorbed phosphorus to streams due to the presence of drainage systems and erosion of cropland soils.

It has been demonstrated that various systems of controlled drainage can reduce loads of nitrogen and phosphorus from agricultural lands between 30 and 50% (Skaggs et al. 1994; Evans et al. 1995). It can be logically inferred that the controlled drainage resulting from wetland restoration would reduce these nutrient loads by at least 30–50% due to the re-establishment of wetland hydrology, cessation of fertilizer and manure application, reduced soil erosion, increased P sorption, and enhanced denitrification. These results indicate that potential restoration sites located on prior converted croplands should be given the highest priority for restoration. Other high priority explanatory variables for total phosphorus concentration include several urban land-uses. These results were somewhat unexpected given the largely rural setting of the study watersheds. Low intensity residential land-use intrusion into riparian buffers (BF.RES.LI) represented only about 1.7% of the total watershed area and yet was the second most important predictor of TP concentration (lnTP) suggesting that fertilizer application, septic fields, and disruption of riparian buffer continuity had a disproportionate impact on phosphorus concentration. Where appropriate, stream restoration in these areas might present an opportunity to reestablish riparian buffers and reduce TP loads from these areas.

Emergent wetlands were among the most important predictors of OP concentration (Table 3). Many studies have demonstrated the connection between wetlands and water quality at the watershed scale (Weller et al. 1996; Whigham 1988; Brinson 1988). In our study of watersheds the area of emergent wetlands is strongly related to predicted OP concentrations. It has long been reported that emergent wetlands were very effective in the retention and/or transformation of soluble inorganic phosphorus (Richardson 1985; Kadlec and Knight 1996; Reddy et al. 1999; Vymazal 2007; Mitsch and Gosselink 2007). It is unclear if OP undergoes long-term storage, short-term storage, or transformation in these emergent wetlands (Richardson 1999). However, emergent wetland related explanatory variables (WS.WET.EM, BF.WET.EM) are among the 10 most important predictors of TP concentration (Table 3), suggesting these emergent wetlands are net phosphorus sinks instead of transformers of OP to a organic component of TP (Richardson 1999). A synaptic analysis of GIS coverages suggests that some of these emergent wetland areas may be associated with existing Best Management Practices such as restored wetlands, vegetated swales, and storm water ponds so that wetland restoration practices may already be having a demonstrable affect on water quality in our study area.

The most important predictors of in stream NOx concentrations were explanatory variables that expressed the relative proportion of disturbed versus natural land-cover within a given watershed. The total proportion of undeveloped land-use in the watershed (WS.UNDEV, see Table 3 for calculations) was the most important predictor of NOx concentration. The portion of riparian buffers classified as wetland (BF.NWI) was also an important predictor of stream NOx concentrations. Other studies have linked stream NOx concentration with the proportion of a watershed classified as natural land-cover (i.e. forest and wetlands) and the continuity of riparian wetlands (Johnston et al. 1990; Gilliam 1994; Groffman 1994; Lowrance et al. 1997; Mitsch et al. 2001). Agricultural land-uses were also important explanatory variables of stream NOx concentration. These included the proportion of the riparian buffer occupied by all classes of agricultural lands (BF.AG.ALL), and row crops (BF.AG.ROW); and the proportion of the watershed occupied by all agricultural lands (WS.AG.ALL),and row crops (WS.AG.ROW). These results present several management implications for reducing stream NOx concentrations in the coastal plain of North Carolina. Shifts in watershed land-use away from agricultural to natural land-uses will reduce in stream NOx concentrations. Such shifts might include conservation set-asides and/or large stream and wetland restoration projects. Reducing the area of agricultural land-uses and increasing wetlands in riparian buffers would decrease predicted in stream NOx concentrations, and these management goals could be accomplished through the use of integrated stream and wetland restoration projects in the riparian zone (Evans et al. 2007).

Monitoring

The final component of our proposed framework is monitoring the effectiveness of a high priority wetland restoration site to confirm if restoration will, as predicted by our regression tree models, improve water quality on the landscape. The wetland restoration project undertaken at the Barra Farms site was consistent with the site priorities produced by our regional modeling framework in that: (a) the site is a prior converted agricultural area with a well developed system of drainage ditches and (b) there was extensive disturbance of the riparian buffer. In the spring of 1999, we began monitoring at the Barra Farms Cape Fear Regional Mitigation Bank. Water quality data from both the restored wetland area and from adjacent agricultural areas indicated substantial differences in nutrient concentration. These data suggest that the restored wetland is having a substantial positive impact on the water quality of Harrison Creek, and consequently on the Cape Fear River. For example, boxplots of nutrient concentrations in grab samples collected from the restored wetland, the agricultural area and from a bay forest reference area are shown in Fig. 6. The mean TP, OP and NO3–NO2 concentrations where consistently lowest at the reference site, intermediate at the restored site, and highest at the agricultural site. There is noticeably greater variability in nutrient concentrations from the agricultural sites. Importantly, we found that nutrient runoff concentrations for NO3–NO2 N and total phosphorus data collected during a hurricane related storm-peak were consistently 5–7 times lower in the restored wetland than in agricultural area runoff (Fig. 7). These results provide a basis to validate the watershed-level site prioritization produced by our tree models, and demonstrate the water quality effects of the ensuing restored wetlands. Restoration activities at the Barra Farms Cape Fear River Mitigation Bank resulted in conversion of 250 ha of drained agriculture land to restored wetland (See area A of inset in Fig. 2), mitigating the water quality impacts fertilizer application to the site before restoration (Bruland et al. 2003). Collectively our findings suggest that a model and prioritization approach which helps select important land use types and related water quality provides a basis for restoring wetlands on the landscape to provide a significant improvement in water quality relative to agricultural and other land-uses.

Fig. 6
figure 6

Boxplots of long-term water quality data collected between April, 1999 and April 2001 from the agricultural, reference and restored areas of Harrison Bay (Barra Farms site)

Fig. 7
figure 7

Nitrate-nitrite nitrogen and total phosphorus concentrations in storm samples from taken between 9/24/99 and 9/27/99. Stations are located in the reference (REF), restored (REST) and agricultural (AG) areas of the Barra Farms site. See Fig. 1

Conclusions

Prioritizing restoration sites on the landscape is often a haphazard process when water quality improvement is a primary restoration objective. Priorization is often based on widely held, though often untested, assumptions about relationships between watershed characteristics and water quality. We present a framework to target and prioritize wetland restoration locations using both regional and watershed-level screening models. The regression tree and random forest models presented in this paper identify the variables with the strongest relationships to a given water quality variable, present a clear hierarchy of variable importance, and present approximate thresholds in watershed area where these variables express the greatest impact on water quality. The tree models could be used as “stand alone” decision tools in watersheds where limited restoration opportunities make less robust and less costly decision making approaches more appropriate. While the Random Forest models do not provide the graphical decision tree produced by the regression-tree procedure nor the estimate of thresholds, they do produce much more accurate predictions of water quality based on land-use/land-cover and discharge. Prior converted agricultural lands were identified as important predictors of both ortho and total phosphorus. These sites have poorly-drained hydric soils that require artificial drainage to be suitable for agriculture. These drainage systems become conduits for transporting dissolved and total phosphorus from agricultural field and to area streams and rivers (Skaggs et al. 1994). Fortunately because these areas were historically wetlands, they are often very suitable for wetland restoration. The continuity of stream buffers is identified as another important predictor of water quality. This seems to be especially true with regard to NO3–NO2 concentrations. Both the rpart and random forest models agree that maintaining natural land-cover within the 90 m buffer is a fundamental strategy to maintaining low NO3–NO2 concentrations. Specifically, it is recommended that: (a) agricultural land-uses are excluded from riparian buffers, (b) wetland land-cover is increased and c) wetland fragmentation is reduced (Fig. 5 and Table 3). All of these objectives can be achieved by conversion of selected agricultural land-cover to wetland through stream and/or wetland restoration. Low intensity residential land-use is also important predictor of phosphorus concentration in streams, likely due to septic system failure and fertilizer application in yards. Reestablishment of riparian buffers associated with stream restoration projects in urban and semi-urban residential areas may considerably reduced OP and TP concentration in streams.

Finally, the models present in this paper were not primarily developed to predict in stream nutrient concentrations, but to identify important wetland-related landscape variables that can be used for selection of restoration criteria where appropriate. We have identified several such variables and have presented a graphical tool with an easily communicable format for conveying our results to watershed managers.