Introduction

Impervious surfaces often increase the magnitude, speed, and chemical load of stormwater runoff to freshwater ecosystems within and surrounding cities (Arnold and Gibbons 1996). Over the past 20 years, municipalities have implemented a wide variety of green infrastructure (e.g. rain gardens, green roofs, bioswales) in efforts to control stormwater quantity and quality in and from urban environments by mimicking predevelopment biogeochemical processes (United States Environmental Protection Agency 2012). Green infrastructure can attenuate the flashiness of stormwater flows through detention and retention, reduce pollution loads through detention and filtration, and dampen temperature extremes (urban heat island) through evaporative cooling and dissipation of heat from impervious runoff (Demuzere et al. 2014). Likewise, from a socioeconomic perspective green infrastructure can increase the aesthetics, property value, safety, psychological health, and pride of communities by creating open, active green spaces (Johnston 2005; Dunn 2010). These regulatory and cultural ecosystem services promote green infrastructure as a recurring theme in scenarios of plausible urban futures, which integrate stakeholder visions and biogeochemical modeling to predict long-term ecological responses to climate, land-use, and socioeconomic change (Carpenter et al. 2006; Gill et al. 2007; Carpenter et al. 2015). However, the ecosystem services, potential disservices, and tradeoffs from green infrastructure can be unpredictable and require a more mechanistic understanding from controlled, replicated field studies of biogeochemical processes (Pataki et al. 2011). This study focuses on a prevalent green infrastructure practice that continues to outpace its assessment—residential-sized rain gardens—and systematically explores the detailed biogeochemical processes, ecosystem services, disservices, and tradeoffs associated with vegetation type, a key design variable.

Rain gardens (4–10 m2) are typically constructed by excavating and backfilling an existing topographical depression with an engineered mix of soil, sand, and compost (Thompson et al. 2008); the structures are usually planted with a variety of vegetation types, often native plants, shrubs, and grasses (Prince George’s County 1993) that have been shown to alter soil physical properties (Johnston et al. 2016). The anticipated ecosystem services provided by rain gardens (e.g. maintenance of pre-development hydrology, pollution load reduction, and urban heat island mitigation) are largely anthropocentric and require further characterization as quantifiable exchanges of water, nutrients, and energy. Key measurable exchanges include (1) soil moisture storage and drainage (deep percolation), which retain stormwater runoff and increase groundwater recharge; (2) soil nitrogen (N) retention and plant N uptake, which protect adjacent surface waters by filtering N loads from stormwater runoff; and (3) evapotranspiration (ET), which could mitigate daytime urban heat island effects through evaporative cooling. These exchanges of water, nutrients, and energy occur within the soil-plant-atmospheric continuum and vary with stormwater input, rain garden design, soil properties, and vegetation type. Although soil properties and the overall design of rain gardens have been evaluated (Hsieh and Davis 2005; Davis 2008; Thompson et al. 2008; Carpenter and Hallam 2010), the isolated effect of vegetation type on the transport and cycling of water and N has received less experimental attention.

Given the long understood role of vegetation in the partitioning of stormwater into ET, storage and drainage under different land covers, the selection of vegetation type likely causes a tradeoff between groundwater recharge and evaporative cooling in rain gardens. The design element underlying this tradeoff is the difference in rain garden storage capacity, which can be controlled by altering the drainage boundary condition or vegetation type. Wadzuk et al. (2014) altered the drainage boundary condition to include an internal water storage layer and found that ET rates were twice that of a free-drainage rain garden using weighing lysimeters. Vegetation selection could similarly modify ET and drainage partitioning by changing rain garden storage capacity. Johnston (2011) found that rain gardens planted with prairie and shrub vegetation had significantly lower soil volumetric water content at depths of 0–0.15 and 0.30–0.45 m (3–4 and 10 % lower, respectively) compared to turfgrass prior to storm events, which suggests that vegetation selection may shift the storage capacity of rain gardens during dry periods via ET-induced changes to antecedent water content. Though climatic parameters have been shown to strongly correlate to actual ET from rain gardens (Hickman et al. 2011), these parameters determine the evaporative demand or potential ET of the systems. When all climatic parameters are equal, different vegetation types will vary in their actual ET based on physiological responses to the opposing tension between soil moisture supply, conductance, and external evaporative demand (Tanner 1957).

The drainage rate from rain gardens may also be altered by vegetation type, which can change soil hydraulic conductivity through root growth and morphology. For example, Le Coustumer et al. (2012) conducted a biofiltration column study testing five different species relative to a bare soil control over 72 weeks found that a thick-rooted species, Melaleuca ericifolia, significantly increased soil hydraulic conductivity over the course of the experiment, which was attributed to its high mean root diameter. Although components of the water budget such as drainage and ET have been previously quantified (Davis 2008; Hatt et al. 2009; Li et al. 2009; Selbig and Balster 2010; Hickman et al. 2011; Johnston 2011; Wadzuk et al. 2014), there are limited replicated field studies experimentally isolating the effect of vegetation type on the hydrological budgets of residential-sized rain gardens.

Differences in vegetation type may modify the transport and cycling of N from rain gardens through changes in plant N uptake, soil N retention, and stormwater N detention, especially following dry periods (Mangangka et al. 2015). Different plants have been shown to vary up to 170-fold and 570-fold in nitrate or nitrite (NO3 , NO2 ) and ammonium (NH4 +) drainage concentrations, respectively, in a pot trial of 20 species typically used for rain gardens in Australia (Read et al. 2008). In a follow-up study, Read et al. (2009) found that rooting depth, longest root, total plant mass, and relative growth rate were negatively correlated to NO3 and NH4 + concentrations in semi-synthetic stormwater drained from bioretention cells. These findings suggest that differences in above and below ground structure and biomass may correlate to differences in N transport from rain gardens with different vegetation types. Additionally, studies from tile-drained agricultural ecosystems have shown that plant differences in ET and rooting depth alter the coupled drainage of water and N (Johnsson et al. 1987). These known impacts of vegetation type on hydrology and N transport argue for a greater understanding of vegetation in the performance of rain gardens to inform design and assess functional tradeoffs.

Our primary objective was to evaluate the effect of vegetation type on the hydrological budgets and corresponding ecosystem services of residential-sized rain gardens by specifically investigating changes in ET, drainage, soil moisture storage, N cycling, and N transport. We compared replicated rain gardens planted with prairie, shrub, and turfgrass to bare soil rain gardens to better understand baseline hydrology and N transport with changes in vegetation. We hypothesized that after 1 year of establishment, (1) total drainage following stormwater applications would rank in the following order: bare soil > turfgrass > shrub > prairie, explained by differences in ET and soil moisture ranking in the reverse order and (2) N load reduction following stormwater applications would rank in the following order: prairie > shrub > turfgrass > bare soil, explained by differences in N concentration observed in soil, plant tissue, and drainage.

Materials and methods

Experimental design

We established a 1500 m2 research site in 2005 consisting of 12 field-scale bioretention cells (2.4 × 2.4 × 1.2 m, Fig. 1a) in the urban agglomeration (population: 402,000) of Madison, Wisconsin, which comprises the Yahara Watershed (U.S. Census Bureau 2012). In physical terms, the experimental bioretention cells function as free-drainage lysimeters, where the lower 1.2 m boundary is exposed to atmospheric pressure, thus allowing gravitational drainage and excluding drainage controlled by an actual or imposed matric potential gradient (Abdou and Flury 2004). The bioretention cells were replicated on level ground along a linear path with 1.2 m spacing between each cell. Each bioretention cell was sized according to the Wisconsin Department of Natural Resources (WI-DNR) specifications (depth, surface area, receiving area, soil mixture) recommended for residential rain gardens (Bannerman and Considine 2003; Dussaillant et al. 2004) that are generally applicable to the Midwestern United States.

Fig. 1
figure 1

(a) Photograph of 1-year old bioretention cells annotated to identify turfgrass, shrub, prairie, and bare soil treatments from bottom left to top right corner (b) Cross-sectional schematic (not to scale) of free-drainage bioretention cells (vegetation types not depicted)

Each bioretention cell was constructed with a centered channel drain along the bottom to facilitate free-drainage conditions (Fig. 1b). Because rain gardens are positioned in landscape depressions to receive and not contribute to surface runoff, we intentionally eliminated surface runoff from the bioretention cells by installing a 0.15 m raised rim around each bioretention cell. The inside walls and bottom of each cell were fitted with a 30 mil rubber liner; 0.05 m of wash stone was placed in the bottom of each cell to avoid clogging and anaerobic conditions. Landscaping fabric was placed over the wash stone to prevent excessive sediment loss. Finally, a ring of bentonite clay was positioned along the inside perimeter of each bioretention cell to prevent preferential flow paths along the inside walls.

We chose the most common vegetation types recommended for rain gardens in the Midwestern United States: mesic native prairie plants, woody shrubs, and turfgrass (Bannerman and Considine 2003, Table 1). These three vegetation types were combined with bare soil in a complete randomized block design with three blocks and four treatments (one treatment per three bioretention cells). Per WI-DNR guidelines, all bioretention cells were filled with 6.27 m3 of equal parts topsoil, sand, and compost. This bioretention soil mixture was classified as a loam texture consisting of 43 % sand, 40 % silt, and 17 % clay determined by the hydrometer method (Gee and Or 2002) after pretreatment with hydrogen peroxide (Mikutta et al. 2005). The organic matter content of the mixture was quantified at 6 % by loss on ignition (Heiri et al. 2001). To promote plant establishment, the vegetated bioretention cells were irrigated after planting, but received no irrigation in 2006, other than stormwater inputs during experimental applications. No fertilizer, pesticides, or herbicides were used throughout this study. Three event-based stormwater applications were conducted to quantify drainage, soil moisture storage, ET, and N loads among the treatments in 2006.

Table 1 Species planted in each type of rain garden and initial planting density (plants per m−2)

Stormwater applications and data collection

We conducted three controlled stormwater applications (20 mm hr−1) that would be of a sufficient magnitude to quantify and collect drainage. During each simulation, each bioretention cell received stormwater runoff collected from 35 m2 of the contributing roof surface area (6:1 storm events) in accordance with WI-DNR guidelines that recommend sizing residential stormwater basins at 17 % of the contributing impervious area (Bannerman and Considine 2003). All three stormwater applications were conducted during the growing season of 2006 on July 15th, August 10th, and September 26th (Fig. 2). During rain events preceding stormwater applications, roof runoff was collected in two large rain barrels (5,678 L) from a tin roof (417 m2) on site. For the July 15th and August 10th 6:1 applications, each bioretention cell received 76 mm of stormwater, which is equivalent to a 9–15 mm rain event in Wisconsin. For the September 26th 6:1 application, the bioretention cells received 25 mm of stormwater, which is equivalent to a 3–5 mm rain event in Wisconsin. Before each controlled stormwater application, we measured initial soil volumetric moisture content (θ i ) in each cell at 0.1 m increments from a depth of 0.2–0.6 m (7 depths x 2 replicates x 12 lysimeters) with a neutron hydroprobe (503 DR Hydroprobe, Campbell Pacific Nuclear). Because of the potential errors and dangers associated with using a neutron hydroprobe at the surface, we used a dielectric moisture sensor (TH2O, Dynamax Inc.) for replicated surface measurements (0–0.03 m depth) of θ i .

Fig. 2
figure 2

Precipitation events and 6:1 stormwater applications (mm) using collected roof runoff over the experimental time period, July 1st-October 15th, 2006 at the rain garden field site in Dane County, WI

After determining θ i for each bioretention cell, a metered volume (Sotera 825, Tuthill Corporation) of the collected stormwater runoff was applied equally to each cell through a distribution system of PVC irrigation tubing at a rate of 20 mm hr−1. Drainage was monitored using flow meters (Sotera 825, Tuthill Corporation) to determine the cumulative drainage at 1 hr, 24 hrs, and the designated endpoint of each stormwater application event. The designated endpoint of each application was when drainage had ceased from each cell. At the endpoint of each application, the final soil volumetric water content (θ f ) was measured again at 0.1 m soil layer increments from 0.2 to 0.6 m and at the surface of each bioretention cell.

Water budgets

Because all bioretention cells were essentially closed-system lysimeters without surface runoff, cumulative ET was quantified for the duration of each stormwater application as the residual of measured components of the water balance equation:

$$ SI+P=D+\varDelta S+ET $$
(1)

where SI is stormwater input (mm) from collected roof runoff, P is precipitation (mm), which was nearly negligible during stormwater applications, D is cumulative drainage (mm) from each lysimeter, ΔS is the change in soil water storage (mm), and ET is cumulative evapotranspiration (evaporation for bare soil, mm). We estimated ΔS for each soil layer by interpolating initial and final neutron hydroprobe and dielectric moisture sensor measurements, θ i, and θ f (Brye et al. 2000). After quantifying the SI, D, and ΔS components of the water budget, the residual balance was attributed to ET [1]. Mean daily ET rates (mm d−1) were determined over the duration of each stormwater application for each bioretention cell.

Potential and reference evapotranspiration

We used three potential or reference ET models (i.e. Hamon, Priestley-Taylor, and FAO Penman-Monteith) to assess the evaporative demand (mm d−1) of the environment during the experimental time period from July 1-October 15, 2006 (Hamon 1963; Priestley and Taylor 1972; Allen et al. 1998). Potential ET models (Hamon 1963; Priestley and Taylor 1972) are indices of the evaporative demand of the environment, while the FAO reference ET model (Allen et al. 1998) is an index of the hypothetical ET from a well-watered, uniform 0.12 m grass with a constant albedo of 0.23 and surface resistance of 70 s m−1. We chose these particular models to represent a range of meteorological complexity. Hamon is the least complex and most empirical PET model because it only requires daily temperature data to compute PET, while Priestley-Taylor requires both solar radiation and temperature data (Hamon 1963; Priestley and Taylor 1972; Lu et al. 2005). The FAO Penman-Monteith model is the most complex and least empirical index we used to calculate evaporative demand, as it requires solar radiation, vapor pressure deficit, wind speed, and temperature data to compute reference ET and empirically represents canopy physiology, which is ignored in PET models (Allen et al. 1998). We calculated Hamon and FAO Penman-Monteith estimates using local climatological data from the National Climatic Data Center (NOAA) measured at the Dane County Regional Airport, 24 km from the study site. Priestley-Taylor estimates were made using a satellite-based insolation model that assumes a constant surface albedo of 0.25 and outputs daily PET estimates for Wisconsin and Minnesota (Diak et al. 1998).

Nitrogen retention and transport

In order to quantify soil N retention over the course of the field season, replicated soil cores were collected at 0–0.3 and 0.3–0.6 m depths from each cell once in July 2006 prior to all stormwater applications and a second time in October 2006 following all applications. Molar ratios were used to calculate the change in soil storage of inorganic nitrogen (DIN) by summing nitrogen contributions from NO3 and NH4 + and taking the difference between July and October soil DIN values for each bioretention cell. To quantify plant N uptake over the course of the field season, composite plant tissue samples were collected from each vegetation type in July, August, and October 2006. Finally, we collected stormwater input and drainage samples from each bioretention cell for N analyses at 1 hr, 24 hrs, and the end of each stormwater application. All soil, plant tissue, and water samples were analyzed for total Kjeldahl nitrogen (TKN) using standard methodology (Burt 1996). All soil NO3 and NH4 + concentrations (mg L−1) were determined using a flow-injection analysis system (Lachat QuikChem 8000 Series, Method 12-107-04-1-F).

Measured concentration and volume of NO3 and NH4 + from stormwater input and incremental drainage (1 hr, 24 hr, final) throughout each stormwater application were used to calculate the net mass stormwater input and gross mass output (kg ha−1) of NO3 and NH4 + from each bioretention cell. Net mass export or net mass import (kg ha−1) were calculated as the difference between net mass stormwater input and gross mass output of NO3 and NH4 + (net mass export values are positive indicating overall net N loss from the bioretention cells as drainage, net mass import values are negative indicating overall net N gain into the bioretention cells as storage). Molar ratios were used to calculate the net mass export or net mass import of dissolved inorganic nitrogen (DIN) by summing N contributions from NO3 and NH4 +. Finally, we calculated N removal efficiency (%) for each stormwater application as the ratio of net mass import to net mass stormwater input of DIN for each bioretention cell (negative % efficiency values indicate an overall N contribution from the bioretention cells as drainage, positive % efficiency values indicate an overall N removal by the bioretention cells as storage).

Statistical analyses

All data were analyzed using SAS 9.2 (SAS Institute 2008) by linear mixed models (PROC MIXED) to account for blocking and assess the statistical interaction of vegetation type (treatment) and stormwater application. Because the September stormwater application differed in both the time period and magnitude of its input (25 mm as opposed to 75 mm stormwater input from the July and August stormwater applications), we cannot isolate seasonal effects (i.e. plant phenology) from the magnitude of the stormwater application when considering overall stormwater application effects compared to vegetation type; therefore, applications were treated as event-based analyses. Each treatment consisted of three replicated bioretention cells, thus standard errors were used to evaluate differences among treatments by vegetation type, stormwater application, and the statistical interaction of vegetation type and stormwater application. Post-hoc comparisons were evaluated using Tukey’s significant differences at a 0.05 probability level (HSD0.05) to limit experiment-wise error. Log-transformations were made on all data when necessary based on the Shapiro-Wilk normality test of the residuals (PROC UNIVARIATE). All figures were created using SigmaPlot 13.0 (Systat Software 2014).

Ecosystem service comparison

In order to rank ecosystem services by vegetation type based on the largest observed responses and compare multiple services, we normalized all means for each service analog to the highest mean for each vegetation type (Raudsepp-Hearne et al. 2010; Doherty et al. 2014). We used total drainage, ET, and ΔS as ecosystem service analogs for potential recharge, daytime evaporative cooling, and stormwater retention, respectively. Likewise, we used N removal efficiency, cumulative soil NO3 and NH4 + storage (average of 0–0.3 and 0.3–0.6 m depths), and % TKN in plant tissue as service analogs for N load reduction, soil N retention, and plant N uptake, respectively. To enable graphical comparison across different ecosystem services, we applied linear transformations to ensure that all individual means had positive values before normalization.

Results

Though all bioretention cells had the same internal design structure and soil mixture, there were differences in hydrology, N cycling and transport, and associated ecosystem services between treatments (vegetation type: veg) 1 year after plant establishment. Similarly, there were differences in hydrology, N cycling and transport, and associated ecosystem services corresponding to each stormwater application (app), which differed in the magnitude of stormwater applied and the external environmental conditions (i.e. seasonality). In many cases, differences in hydrology, N cycling and transport, and associated ecosystem services resulted from the combined effect of veg and app, which was not necessarily synergistic and constituted a statistical interaction.

Drainage and potential recharge

We found that app had a statistically significant effect (F = 67.86, p = 0.0001, S.E. = ±4 mm) on 1-hr drainage (Table 2). The September application had a mean 1-hr drainage of 3 mm or 12 % of stormwater input (SI) compared to 22 mm (29 % SI) and 21 mm (28 % SI) for July and August applications, respectively. There was a statistically significant interaction (Table 2, Fig. 3) between veg and app for drainage at 24 h (F = 7.60, p = 0.0006, S.E. = ±3 mm) and total drainage (F = 9.76, p = 0.0001, S.E. = ±3 mm) such that the main effects could not be interpreted separately. The 24-hour drainage trends were generally the same as final drainage trends; differences between vegetation treatments were less pronounced after 24 h.

Table 2 Mean water budget components determined by a statistical mixed model where vegetation type (veg), application (app), and veg x app are designated as fixed effects with each F-ratio’s numerator and denominator reported as df-num/df-den. Letter (abcdef) differences indicate post-hoc means comparisons evaluated by Tukey’s HSD with significant veg x app interactions superseding separate post-hoc means comparisons for veg or app for cumulative drainage (D), change in storage (ΔS), and actual evapotranspiration (AET). Mean values with common letters do not differ significantly at p <0.05. AET values are compared to potential and reference evapotranspiration (PET, RET) values generated by the Priestly-Taylor, Hamon, and FAO-Penman-Monteith models for each application time period referred to as PET-Pri-Tay, PET-Hamon, and RET-FAO, respectively
Fig. 3
figure 3

Statistical interactions between stormwater application (app; time period and magnitude) and vegetation type (veg) for rain garden water budgets including total drainage, change in storage (ΔS), and evapotranspiration (mm). Error bars represent one standard error of the mean

We found a significant statistical interaction between veg and app for final drainage, indicating that differences in the hydrologic budget depended on both vegetation type and the timing and magnitude of stormwater applications (Table 2, Fig. 3). For the July application, prairie drained 19 mm (25 % SI), which offered significantly lower drainage or potential recharge than all other vegetation types (Table 2, Fig. 3). Bare soil, shrub, and turfgrass did not statistically differ from one another in July with total drainage values of 53, 45, and 45 mm (70, 60, and 60 % SI) respectively. Drainage differences between vegetation types were most pronounced during the August stormwater application. Prairie drained 15 mm (20 % SI) in August and was significantly different from all other vegetation types. Shrub drained 37 mm (49 % SI) in August and was significantly different from both prairie and bare soil. Turfgrass drained 48 mm (63 % SI) during the August application and was significantly different from prairie, but not shrub or bare soil. Bare soil drained 58 mm (76 % SI) during the August stormwater application, which was the largest potential recharge event observed during the course of the study. There were no statistically significant differences between vegetation types in potential recharge for the September application, which was 15, 0, 4, and 9 mm (60, 0, 16, and 36 % SI) for bare soil, prairie, shrub, and turfgrass, respectively.

ET

There was a statistically significant interaction (Table 2, Fig. 3) between veg and app for total ET over the application time period (F = 7.52, p = 0.0006, S.E. = ±3 mm) such that differences in ET depended on both vegetation type and the application magnitude and date. The July stormwater application had the highest total ET losses and therefore, the greatest potential for daytime evaporative cooling. Over the course of the July stormwater application, prairie lost 62 mm (82 % SI) as ET, which offered significantly greater evaporative cooling than bare soil, shrub, and turfgrass, which respectively lost 26, 39, and 39 mm (34, 52, and 52 % SI) to ET and were not significantly different from one another. During the August stormwater application, prairie and shrub lost a total of 44 and 33 mm (58 and 43 % SI) as ET, respectively, and were significantly greater than turfgrass and bare soil, which lost 23 and 16 mm (30 and 21 % SI) as ET, respectively. Like potential recharge, there were no statistically significant differences between vegetation types in ET for the September application, where bare soil, prairie, shrub, and turfgrass lost 12, 21, 21, and 19 mm (48, 84, 84, and 76 % SI) as ET, respectively.

Priestley-Taylor, Hamon, and FAO-Penman Monteith models of potential or reference ET followed similar seasonal trends during the stormwater application time period, though their magnitudes differed on a daily basis (Table 2). Average daily ET for shrub and turfgrass was at or near the predicted ET for all three applications (Fig. 4). Prairie had higher average daily ET rates than calculated PET and RET for both the July and August stormwater applications at 9 and 7 mm d−1, respectively, while bare soil evaporation was markedly lower than PET and RET in July and August with rates of 4 and 3 mm d−1 (Fig. 4), respectively. All vegetation types decreased in average daily ET rates as the season progressed except shrub, which maintained nearly the same average daily ET of 6 mm d−1 between July and August stormwater applications.

Fig. 4
figure 4

Average daily evapotranspiration (AET, mm) inferred using lysimetry method for each vegetation treatment following stormwater applications, as well as daily potential or reference evapotranspiration (PET, RET, mm) calculated using Priestley-Taylor, Hamon, and Penman-Monteith models. Error bars represent one standard error of the mean

ΔS and soil moisture

There was a statistically significant interaction (Table 2, Fig. 3) between veg and app for the change in soil moisture storage (ΔS) over the application time period (F = 5.55, p = 0.0028, S.E. = ±2 mm). There were no statistically significant differences in ΔS between vegetation types during the stormwater application in July. During the July application, bare soil, prairie, shrub, and turfgrass had ΔS values of −1, −3, −5, and −6 mm, respectively, indicating that all vegetation types lost soil moisture over the course of the application when seasonal evaporative demand was also at its peak. Prairie had the greatest ΔS during the August stormwater application by storing 19 mm, which was significantly different from bare soil, shrub, and turfgrass, which stored 4, 7, and 7 mm, respectively. There were no statistically significant differences in ΔS during the September stormwater application and ΔS was 0, 6, 1, and −2 mm for bare soil, prairie, shrub, and turfgrass, respectively.

We also found a statistically significant interaction between veg and app for both θ i, and θ f at all measured depths between 0.03 and 0.6 m, with the exception of θ i at 0.03 m (Table 3, Figs. 5, 6 and 7). The θ i of turfgrass at 0.03 m, 0.306 m3 m−3, was significantly greater (F = 75.87, p < 0.0001, S.E. = ±0.013) than bare soil, prairie, and shrub treatments, which were 0.171, 0.180, and 0.184 m3 m−3, respectively. Though there was an interaction between veg and app (F = 3.36, p = 0.0245, S.E. = ±0.01) at 0.03 m, we found that turfgrass still retained a significantly higher θ f at 0.03 m with values of 0.263, 0.332, and 0.281 m3 m−3 for July, August, and September applications compared to the other vegetation types, which ranged from 0.158 to 0.190 m3 m−3. Turfgrass was not significantly different from bare soil in both θ i and θ f at all depths greater than 0.03 m and soil moisture profiles are nearly indistinguishable from between turfgrass and bare soil at depth (Fig. 5). Prairie had a distinctively dry soil moisture profile (Figs. 5, 6 and 7) and was almost always significantly lower than all other vegetation types between depths of 0.2–0.6 m in both antecedent and post-drainage soil moisture (Table 3). The shrub treatment had the most distinctive θ i and θ f profiles (Figs. 5, 6 and 7) during the August and September stormwater applications and often had significantly higher soil moisture than prairie and lower soil moisture than turfgrass or bare soil at 0.2–0.4 m depths both preceding and following stormwater applications (Table 3).

Table 3 Mean antecedent and post-drainage volumetric soil water content (m3 m−3) at 0.03, 0.2, 0.3, 0.4, 0.5, and 0.6 m depths determined by a statistical mixed model where vegetation type (veg), application time period/magnitude (app), and veg x app are designated as fixed effects with degrees of freedom of each F-ratio’s numerator and denominator reported as df-num/df-den. Letters abcdef indicate post-hoc means comparisons evaluated by Tukey’s HSD with significant veg x app interactions superseding separate post-hoc means comparisons for veg or app. Mean values with common letters do not differ significantly at p <0.05
Fig. 5
figure 5

Antecedent and post-drainage soil volumetric water content (θ, m3m−3) of each vegetation treatment at six soil depths 0.03–0.6 m preceding and following each stormwater application. Error bars represent one standard error of the mean

Fig. 6
figure 6

Statistical interactions between stormwater application (app; time period and magnitude) and vegetation type (veg) for antecedent volumetric water content (θ, m3m−3) at six soil depths between 0.03 and 0.6 m. Error bars represent one standard error of the mean

Fig. 7
figure 7

Statistical interactions between stormwater application (app; time period and magnitude) and vegetation type (veg) for post-drainage volumetric water content (θ, m3m−3) at six soil depths between 0.03 and 0.6 m. Error bars represent one standard error of the mean

Soil N retention, plant N uptake, and N transport

We found no statistically significant differences in soil nitrogen (%TKN, NO3 , and NH4 +) pools between vegetation types. However, we found significant differences between the July and October sampling time periods (time) across all bioretention cells for soil NO3 at both the 0–0.3 and 0.3–0.6 m depths and soil NH4 + at the 0.3–0.6 m depth (Table 4). In July, prior to stormwater applications, mean soil NO3 from all bioretention cells was 2.4 and 2.6 mg L−1 for 0–0.3 and 0.3–0.6 m depths, respectively. In October, following all stormwater applications mean soil NO3 from all bioretention cells was 0.3 and 0.4 mg L−1 at 0–0.3 and 0.3–0.6 m depths, respectively. These reductions in soil NO3 content between July and October were statistically significant at both the 0–0.3 m depth (F = 156.48, p < 0.0001, S.E. = ±0.1) and the 0.3–0.6 m depth (F = 90.69, p < 0.0001, S.E. = ±0.2). In contrast, mean soil NH4 + from all bioretention cells at the 0.3–0.6 m depth increased from 0.3 mg L−1 in July to 0.5 mg L−1 in October. This addition of NH4 + between July and October at the 0.3–0.6 m depth was statistically significant (F = 6.77, p = 0.0315, S.E. = ±0.09). We found a statistically significant interaction (Table 4) between veg and app in plant N uptake over the July, August, and October sampling periods (F = 4.81, p = 0.015, S.E. = ±0.02 mm). Turfgrass had 0.4 % TKN during all three sampling periods, which was significantly greater than prairie and shrub (0.2–0.3 % TKN).

Table 4 Mean soil nitrogen (N) content as Total Kjeldahl Nitrogen (TKN), nitrate (NO3 ), and ammonium (NH4 +) at 0–0.3 and 0.3–0.6 m depths and plant N content (TKN) determined by a statistical mixed model where vegetation type (veg), application (app) and veg x app are designated as fixed effects with degrees of freedom of each F-ratio’s numerator and denominator reported as df-num/df-den. Letter differences (abcd) indicate post-hoc means comparisons evaluated by Tukey’s HSD with significant veg x app interactions superseding separate post-hoc means comparisons for veg or app. Mean values with common letters do not differ significantly at p < 0.05. Soil N samples were collected in July and October 2006, while plant N samples were taken in July, August, and October 2006

There were no significant differences in NO3 and NH4 + net mass input across vegetation types, however the September stormwater application received 0.04 kg ha−1 of NO3 and 0.04 kg ha−1 of NH4 + in roof runoff, which was significantly less (F = 184842, p < 0.0001, S.E. = ±0.0001) than the 0.13 kg ha−1 of NO3 and 0.14 kg ha−1 of NH4 + received during the July and August applications (Table 5). Bare soil leached the greatest amount of DIN observed in the study during the July and August applications, losing 0.57 and 0.96 kg ha−1, respectively. Prairie was the only vegetation type that imported (negative values, Table 5) DIN for all three applications. Shrub had a net mass export (or import) of 0.34, 0.06, and −0.03 kg ha−1 for July, August, and September applications, respectively. Turfgrass remained at almost the same overall N content by having a net mass export of 0.00, 0.14, and 0.00 kg ha−1 for July, August, and September applications, respectively. Averaged across all applications, nitrogen removal efficiency was 72, −36, −109, and −500 % for prairie, turfgrass, shrub, and bare soil, respectively, with prairie being the only vegetation type to offer N load reduction as an ecosystem service.

Table 5 Net mass input (kg ha−1), 1-hour drainage concentration (mg L−1), 24-hour drainage concentration (mg L−1), and net mass export (kg ha−1) of NO3 , NH4 +, and Dissolved Inorganic Nitrogen (DIN), determined by a statistical mixed model where vegetation type (veg), application time period/magnitude (app), and veg x app are designated as fixed effects with degrees of freedom each F-ratio’s numerator and denominator reported as df-num/df-den. Letters abcd indicate post-hoc means comparisons evaluated by Tukey’s HSD with significant veg x app interactions superseding separate post-hoc means comparisons for veg or app. Mean values with common letters do not differ significantly at p <0.05

Ecosystem services

All vegetation types differed in their provisioning of the seven ecosystem services evaluated in this study (Fig. 8). Prairie ranked first in runoff reduction, evaporative cooling, N load reduction, soil NH4 + and NO3 retention; second (tied with shrub) for plant N uptake; and last for potential recharge services. Shrub did not rank first for providing any ecosystem service, but ranked second in runoff reduction (tied with bare soil), evaporative cooling, soil NH4 + retention, and plant N uptake (tied with prairie); third in N load reduction and potential recharge; and last in soil NO3 retention. Turfgrass ranked first in plant N uptake; second in potential recharge and N load reduction; third in evaporative cooling, soil NH4 + retention (tied with bare soil), and NO3 retention; and last in runoff reduction. Bare soil ranked first in potential recharge; second (tied with shrub) in runoff reduction and soil NO3 retention, third in soil NH4 + retention (tied with turfgrass); and last in evaporative cooling, N load reduction, and plant N uptake (by default as there were no plants present).

Fig. 8
figure 8

Relative provisioning of seven ecosystem services from four vegetation treatments based on analogous measured parameters. Total drainage, evapotranspiration, and the change in storage were used as ecosystem service analogs for groundwater recharge, daytime evaporative cooling, and stormwater retention, respectively. N removal efficiency, soil NO3 and NH4 + storage, and % total Kjeldahl nitrogen in plant tissue were used as ecosystem service analogs for N removal, soil N retention, and plant N uptake, respectively. All data were normalized to the vegetation treatment that had the maximum mean for each measured parameter. Spatial arrangement of ecosystem services is arbitrary, thus polygon shapes are nonunique to these normalized data. Linear interpolation between and across ecosystem services is inappropriate in this context

Discussion

Vegetation type altered the hydrological and N budgets of residential-sized rain gardens by altering drainage, ET, soil moisture, and the cycling and transport of N. Within individual applications, drainage and ET varied up to 2.5-fold and 4-fold, respectively, across the vegetation types assessed in this study. Additionally, differences in antecedent soil moisture preceding stormwater applications yielded up to a six-fold increase in storage across the vegetation types assessed in this study. Furthermore, rain gardens receiving the same stormwater N-load functioned as either net N exporters (via leaching) or net N importers (via retention) depending solely on their vegetation type. In this study, lower antecedent soil moisture coincided with high levels of storage, ET, and N removal, which we observed as concomitant hydrological processes in residential-sized rain gardens. Therefore, we posit that antecedent soil moisture modified by vegetation type may be an important mechanism underlying rain garden function and corresponding ecosystem services.

In this study, each vegetation type developed a characteristic antecedent soil moisture profile at depth, which contributed to its overall storage, ET, and N transport. Though antecedent soil moisture and ET were not explicitly measured, a bioretention study in Queensland, Australia demonstrated a similar relationship between antecedent dry periods, storage, and N retention where longer antecedent dry periods between storm events resulted in greater N retention (Mangangka et al. 2015). Mangangka et al. (2015) hypothesized that longer antecedent dry periods increase the storage capacity of rain gardens by drying soil pore space and reducing antecedent soil moisture prior to storm events. It has also been suggested that prior rainfall and biophysical conditions (i.e. evaporative demand) control antecedent soil moisture (Davis et al. 2011) and that vegetation type could be maximized as a rain garden design parameter to draw down antecedent moisture and increase storage capacity during dry periods (Mangangka et al. 2015).

We further develop these hypotheses using the differences in ET and antecedent soil moisture observed in this study to propose the following hydrological mechanism: rain gardens lose stored soil moisture to ET during dry periods between storm events. Dry-period ET consistently lowers soil moisture at depths accessible to plant root systems. During and immediately following storm events (when drainage is still occurring), rain gardens with large dry-period ET losses (having depleted water throughout the soil profile) also have a higher storage capacity for incoming stormwater, which leads to increased stormwater storage, N retention, decreased drainage, and large wet-period ET losses. We speculate that different vegetation types will differ in their partitioning of these concomitant hydrological processes based on their ET rates during both dry and wet periods. Though it has been suggested that wet-period ET may not be an important component of the water budget during and immediately following storm events (He and Davis 2010), we found it to be a significant component of the water budgets in this experiment, depending on vegetation type. To the best of our knowledge, this is the first controlled, replicated study to demonstrate that when exposed to the same rainfall patterns and evaporative demand, certain vegetation types, such as prairie, will decrease the antecedent soil moisture and increase the storage capacity of rain gardens compared to other vegetation types, such as shrubs and turfgrass.

In this study, vegetation-induced changes to the antecedent soil moisture of rain gardens drove differences in N retention and transport (primarily as NO3 ), as opposed to biofiltration processes (e.g. plant N uptake, soil N retention, denitrification), which would have been observed as a dilution in the concentration of N in drainage relative to stormwater inputs. We acknowledge that input N concentrations in this study were low, ranging from 0.16 to 0.18 mg L−1 NO3 , compared to the higher range of NO3 (0.23–1.42 mg L−1) used in previous studies investigating stormwater N reduction (Houdeshel et al. 2015; Taylor et al. 2005; Hunt et al. 2006; Dietz and Clausen 2005; Sharkey 2006). Additionally, it has been demonstrated in unsaturated rain gardens that stormwater input N exhibits a positive linear relationship with drainage N concentration and that the slope of this relationship may be modified by vegetation type (Houdeshel et al. 2015). Though adding a saturated zone to promote denitrification may significantly dampen or remove this linearity (Kim et al. 2003; Yang et al. 2013; Gilchrist et al. 2013; Houdeshel et al. 2015), saturated rain gardens have been shown to emit higher levels of methane (McPhillips and Walter 2015) and often drain slower than their unsaturated counterparts (Yang et al. 2013). Therefore, in urban areas receiving high stormwater runoff rates and volumes or in localized hot spots where additional greenhouse gas emissions are a concern, it may be preferable to construct high-infiltration, unsaturated rain gardens planted with vegetation that supports high ET rates. Based on our findings, we contend that N transport from unsaturated rain gardens will always be contingent on storage capacity, which may be modified by vegetation type via antecedent soil moisture.

An increased understanding of the timing and magnitude of plant N uptake and mineralization in different vegetation types may provide better control of biofiltration processes in unsaturated rain gardens where denitrification is not a treatment option. Because of the low stormwater N concentration, as well as the nascent growth of vegetation in this study, observed net mass export of N may be indicative of the leaching potential of rain gardens with different vegetation types. It has been established that plant detritus can be mineralized into NO3 and leached from rain gardens (Li and Davis 2014), which likely explains why drainage N concentration was often greater than input N concentration in this study. Considering these results, stormwater practitioners may expect newly established rain gardens to leach N even when incoming stormwater N concentration is low. Much progress has been made identifying regional plant species that reduce N loads via N uptake processes in rain gardens (Hunt et al. 2015; Read et al. 2008, 2009). For control of N transport from unsaturated rain gardens, we propose that further regional work is needed to identify specific plants or plant communities that maximize the combination of high N uptake, low N leaching, high ET, and thus greater storage capacity.

Though seasonal partitioning of ET and drainage is inevitable because of evaporative demand, our findings suggest that vegetation type is a rain garden design parameter that may be manipulated to diminish or amplify seasonal tradeoffs between ET and drainage. Similar to other studies (Denich and Bradford 2010; Hickman et al. 2011; Wadzuk et al. 2014), we observed the highest ET from rain gardens during peak evaporative demand in July, though July and August stormwater applications had the same stormwater input rate and magnitude. During peak evaporative demand in this study, rain gardens planted with prairie partitioned 80 % of incoming stormwater to ET, while gardens planted with shrub and turfgrass partitioned only 50 % of incoming stormwater to ET indicating that vegetation type can mediate the hydrological tradeoff between ET and drainage and corresponding ecosystem service tradeoff between evaporative cooling and groundwater recharge. Though maximizing ET from rain gardens and other stormwater infrastructure has been suggested as an urban heat island (UHI) mitigation practice (Endreny 2008; Clark et al. 2010), daytime ET cooling may not necessarily mitigate nighttime UHI effects (Gober et al. 2009). In our Yahara Watershed, two studies using a dense array of meteorological sensors found that the Madison UHI is the strongest during the summer months at nighttime and that the continuous high nighttime UHI could increase the potential for adverse human health impacts during heat waves (Schatz and Kucharik 2014, 2015). Though vegetated urban surfaces with high ET positively correlate to nighttime cooling because of reductions in heat capacity; this relationship is nonlinear (Gober et al. 2009) and spatially-dependent (Mitchell et al. 2008), thus we cannot extrapolate the vegetation-mediated differences in ET from this study to translate to nighttime cooling outcomes. Additional research is needed to understand the contribution of rain gardens to nighttime cooling and whether manipulating vegetation type or spatial arrangement could make rain gardens a viable UHI mitigation strategy.

In residential-sized rain gardens, vegetation selection supported concomitant (storage, evapotranspiration, and N transport) and antithetical hydrological processes (drainage). These processes are fundamentally linked to ecosystem services that can occur from green infrastructure in bundles (stormwater retention, daytime evaporative cooling, and N removal) and tradeoffs (potential recharge) (Raudsepp-Hearne et al. 2010). All ecosystem services cannot be maximized from a single vegetation type and stormwater practitioners will have to select rain garden vegetation that addresses site-specific problems. Though this is the first replicated study to identify vegetation-driven bundles and tradeoffs from small-scale rain gardens, previous green infrastructure studies have identified bundles and tradeoffs arising from differences in design parameters. A study comparing four mesoscale (400–1410 m2) saturated and unsaturated bioretention basins in Ithaca, New York identified a tradeoff between N-removal (via denitrification) and greenhouse gas emissions arising from saturated, slow-draining conditions in bioretention basins (McPhillips and Walter 2015). Likewise, a study comparing three mesoscale (1100 m2) bioretention swales in the Midwestern U.S. with different clay layer thicknesses identified a tradeoff between drainage (which resulted in a five-service hydrological bundle) and net primary productivity (Doherty et al. 2014). A consistent idea emerging from our study and others (Doherty et al. 2014; McPhillips and Walter 2015) is that to some degree, ecosystem service bundles and tradeoffs will always result from differences in green infrastructure design such that all services cannot be maximized from a single archetype.

In order to optimize the versatility of ecosystem services in urban ecosystems, we recommend incorporating vegetation type in biogeochemical models of green infrastructure used for predicting water quantity and quality in future climate, land-use, and socioeconomic scenarios. Validated models representing the bundles and tradeoffs associated with vegetation selection are critical to assess how small, spatially-distributed, green infrastructure practices perform across different scales including individual parcels, residential developments, and entire urban landscapes (Potter 2006). Though biogeochemical models have been developed to simulate various flow and soil textural configurations (Dussaillant et al. 2004; He and Davis 2010; Brown et al. 2013), no current models of green infrastructure practices explicitly represent different vegetation types and their associated biogeochemical processes and impacts. Pataki et al. 2011 provides an experimental framework for assessing ecosystem services, disservices, and net efficacy of green infrastructure practices using biogeochemical methods. Although our study used this framework to identify bundles and tradeoffs in ecosystem services from small-scale rain gardens during the growing season, we did not quantify the effects of spatial arrangement and interannual climate variability on these outcomes, as differences should occur across space and time (Raudsepp-Hearne et al. 2010). Additionally, our study quantified the differences between nascent vegetation, the effects of which will likely increase over time via changes in soil structure and hydraulic connectivity (Johnston et al. 2016; Angers and Caron 1998). Therefore, we suggest future work should incorporate observed processes into modeled urban scenarios, while conducting additional controlled, replicated studies to isolate long-term effects and feedbacks between vegetation type, soil texture, and other key design elements that may alter biogeochemical cycles in green infrastructure.