Introduction

Trophic cascades occur when the regulation of a consumer by a predator, either through killing the consumer or changing its behavior, benefits the resources of that consumer (Hairston et al. 1960; Paine 1980). Traditionally defined by the green world hypothesis, this terminology normally refers to the indirect effects of carnivores on plants through regulation of herbivores (Hairston et al. 1960; Estes et al. 2011). Trophic cascades have been detected in a diverse array of ecological systems and are especially well documented in aquatic ecosystems and systems involving insects (Shurin et al. 2002; Borer et al. 2005; Estes et al. 2011). However, manipulative experimental evaluation of top–down impacts generated by large mammals is sparse, and this methodological bias has impeded our ability to predict when and where strong trophic cascades will occur in terrestrial systems (Schmitz et al. 2000; Shurin et al. 2002; Borer et al. 2005). A lack of focus on the mechanisms through which predators are generating cascades has also impeded these predictions (Schmitz et al. 2004).

The best described trophic cascades involving large mammalian carnivores and herbivores in terrestrial systems come from correlative natural experiments in Rocky Mountain national parks (Estes et al. 2011). Restored gray wolves (Canis lupus) are thought to be generating measurable benefits to aspen (Populus tremuloides) at these locations through a combination of lethal and behavioral top–down impacts on elk (Cervus canadensis), a large herbivore which inhibits the growth and recruitment of this tree species (Ripple et al. 2001; Hebblewhite et al. 2005; Painter et al. 2014). Earlier research at another national park colonized by wolves, Isle Royale, also found evidence suggesting a trophic cascade. Here, wolf-induced declines in moose (Alces alces) populations led to an increase in balsam fir (Abies balsamea) growth (McLaren and Peterson 1994), although it should be noted that the effect varied considerably with winter severity and other exogenous random events (Peterson et al. 2014). There is also a long history of exclosure experiments documenting negative ungulate impacts on plant growth in African savannas, as well as of predator removals which led to increases in many African ungulate species (Sinclair et al. 2010). However, to our knowledge, this latter cascade has not been directly tested.

It is also important to note that predators can produce trophic cascades in two different ways. A density-mediated trophic cascade occurs when predators benefit plants via direct regulation of herbivore numbers (i.e., kills) (Hairston et al. 1960; Polis and Strong 1996; Werner and Peacor 2003). Numerous studies have documented density-mediated cascades in various ecosystems (Estes and Palmisan 1974; Paine 1980; Carpenter et al. 2001; Hebblewhite et al. 2005). A trait-mediated cascade can develop when predators induce anti-predator (fear) responses in their prey, including spatial and temporal avoidance, increased movement, and increased vigilance (Abrams 1995; Lima and Dill 1990; Werner and Peacor 2003; Schmitz 2008). While these responses reduce individual predation risk, they come at the cost of foraging opportunities, which indirectly benefits plants (Abrams 1995; Werner and Peacor 2003). Importantly, recent reviews suggest that trait-mediated effects should produce stronger trophic cascades than density-mediated effects, with the rationale being that trait-mediated effects reduce herbivory by the entire group, whereas density-mediated effects only reduce herbivory by killed individuals (Schmitz et al. 2004; Schmitz 2005; Preisser et al. 2005, 2007).

With respect to cascades generated by large mammalian predators, the evidence appears to support density-mediation as the primary driver, not trait-mediation (McLaren and Peterson 1994; Hebblewhite et al. 2005; Hebblewhite and Smith 2010; Kauffman et al. 2010; Sinclair et al. 2010; Painter et al. 2015). For example, Rocky Mountain wolves have substantially reduced elk numbers (Hebblewhite and Smith 2010) and altered elk movements and foraging behavior (Laundré et al. 2001; Childress and Lung 2003; Fortin et al. 2005), but research suggests the trait-mediated effects on movement and foraging are too weak to drive the observed trophic cascades (Creel et al. 2008; Hebblewhite and Smith 2010; Kauffman et al. 2010).

This is not to say that large mammalian predators, such as wolves, cannot produce trait-mediated trophic cascades elsewhere. The strength of trait-mediated trophic cascades is inherently dependent on ecosystem-specific characteristics, such as plant composition and structure (Trussell et al. 2006) as well as species behavior (Schmitz 2005; Preisser et al. 2007). Ungulates in North American and African systems have been shown to increase vigilance in areas of low visibility (e.g., dense forest) or, alternatively, avoid them in favor of areas of high visibility (e.g., open rangeland) (Underwood 1982; Kunkel and Pletscher 2001; Valeix et al. 2009). Wolves also switch behavior based on habitat visibility and prey size (Peterson and Cucci 2003). Wolves in relatively high-visibility Rocky Mountain systems with large elk as prey act as roaming chase predators (Creel et al. 2008), whereas wolves in low-visibility forest systems with smaller deer species as prey act as ambush predators (Kunkel and Pletscher 2001; Peterson and Cucci 2003). Similar to low visibility, the ambush strategy reduces the ability of prey to detect the predator from a safe distance and likewise produces stronger trait-mediated effects (Preisser et al. 2007; Schmitz 2008). Tests of trophic cascades in other ecosystems using similar measures are thus necessary to assess the generality of the Rocky Mountain findings (Pace et al. 1999; Creel et al. 2008).

Great Lakes forests represent a good experimental system for performing such an assessment. White-tailed deer (Odocoileus virginianus) numbers in this region dramatically increased in the absence of gray wolf predation, and the subsequent increased deer herbivory has had significant negative impacts on forest sapling growth and forb biodiversity (Anderson and Loucks 1979; Alverson et al. 1988; Augustine and Frelich 1998; Rooney and Waller 2003; Côte et al. 2004; White 2012). Density-mediated effects following wolf recolonization also appear to be weak or non-existent in these mainland forests (DelGiudice et al. 2009; Rooney and Anderson 2009), marking a stark contrast with the Rocky Mountain, Isle Royale, and African savanna systems.

Based on (1) documented negative impacts of deer on forest plants, (2) observed changes in deer distributions in relation to wolf territories (Mech 1977), and (3) the lack of a substantial decline in Great Lakes deer numbers (DelGiudice et al. 2009), we hypothesized that recolonizing wolves are generating trait-mediated cascades in Great Lakes forests (Rooney and Anderson 2009; Callan et al. 2013). Also, given that trait-mediated effects should be stronger in low-visibility systems with ambush predators (Underwood 1982; Kunkel and Pletscher 2001; Preisser et al. 2007; Schmitz 2008) and that trait-mediated effects are expected to produce stronger trophic cascades than density-mediated effects (Schmitz et al. 2004; Schmitz 2005; Preisser et al. 2005, 2007), we also hypothesized that Great Lakes cascades will be stronger than Rocky Mountain cascades in order to assess any differences between these systems.

The aims of this study were to quantify deer herbivory impacts and determine if these impacts differ in relation to wolf use. We first surveyed and compared sapling growth and forb richness in relation to wolf use, then used a network of deer exclosures in areas of high and low wolf use, respectively, to experimentally isolate and quantify deer impacts. This manipulative experiment is crucial because experimental support for wolf-generated trophic cascades is lacking, leaving previous study conclusions open to question (Kauffman et al. 2010). We also monitored the density and behavior of deer in both areas of high and low wolf use to determine if there is evidence for wolf density-mediated and/or trait-mediated effects at the scale where species interactions occur. Finally, we quantified the size of the effects and compared them with Rocky Mountain cascades.

Study area and methods

Study area

Our study took place at the University of Notre Dame Environmental Research Center (UNDERC), which is an approximately 3200-ha property located along the Wisconsin/Michigan border (46°13′N, 89°31′W). The terrestrial habitat is a complex mosaic of northern mesic forest patches scattered among bogs and conifer swamps. A mix of two deciduous species, sugar maple (Acer saccharum) and red maple (Acer rubrum), dominates the forest canopy and understory. Balsam fir, a species which deer avoid consuming, is the dominant conifer. Maples represent the overwhelming majority of available saplings preferred for consumption by deer, while balsam fir represents the vast majority of non-preferred saplings (>90 and >70 %, respectively, based on unpublished plot data). Forb communities are largely dominated by Canada mayflower (Maianthemum canadense) or Canadian bunchberry (Cornus canadensis), with scattered dense layers of ferns and graminoids. Deer in this region often remove rarer forbs, such as bluebead lily (Clintonia borealis) and Solomon’s seal (Polygnatum spp.) (Rooney and Anderson 2009).

White-tailed deer are effectively the only ungulate in the study area. Current local deer management unit densities are 7–12 km−2 (B.J. Dhuey, unpublished data of the Wisconsin Department of Natural Resources). This is substantially higher than pre-European settlement densities of ≤4 km−2, which were maintained by natural predation, unfavorable dense forest habitat, and hunting by Native Americans. Starting in the 1850s, the creation of favorable deer habitat (opening of forests by logging), removal of predators, and protective hunting laws led to sharp rises in the deer population in this region (Alverson et al. 1988; Ripple et al. 2010). Gray wolves are the natural top predator of Great Lakes deer (Mech and Karns 1977), but targeted hunting activities by humans resulted in their complete disappearance from the UNDERC area by the late 1950s (Thiel 1993), and they remained extinct locally for 40–50 years. Michigan Department of Natural Resources wolf monitoring data and local observations indicate that a new pack established a territory which bisects UNDERC around 2002–2006. Additional packs use surrounding areas, but not UNDERC (B. Roell, unpublished Michigan Department of Natural Resources data; Wydeven et al. 2009; Callan et al. 2013). Recent numbers suggest there are about three to five wolves per 1000 deer in this region (Wydeven et al. 2009).

Isle Royale is less than 200 km from UNDERC, but it is a much different system (i.e., island, wolf–moose–fir) and has been subject to very complex temporal variation over several decades of study, making it difficult to draw direct comparisons to other systems (Peterson et al. 2014).

Wolf use

It should be noted this is not a study of wolf behavior; due to reasons beyond our control, we had only one pack to study at UNDERC. However, we employed the variable use of the area by the pack (high vs. low visitation) as a treatment effect on the deer (risk of predation). We defined the risk of predation to be high in a high wolf use area and to be low in a low wolf use area. High wolf use areas are largely within wolf territories, and low wolf use areas are outside of territories or within avoided buffer zones (Mech 1977). In collaboration with the Michigan Department of Natural Resources, we equipped three members of the pack with GPS/ARGOS collars (Lotek Wireless, Newmarket, ON) in 2010 to examine use of the UNDERC property by wolves. We estimated high wolf use areas to be within a boundary line containing 95 % of GPS collar fixes located on or within 600 m of UNDERC property (excluding outliers). We used this approach instead of a minimum convex polygon because polygons overestimate areas (Powell 2012).

We also carried out scat surveys along all accessible UNDERC gravel roads (>37 km, usually once per week) from May through October 2008 and 2010–2013 (including accumulated overwinter scats preserved by cold) in order to obtain additional evidence in support of these collar-based area designations. Any other evidence of wolf use (scrapes, tracks, sightings, etc.) was collected and pooled these with scats, with each observation treated as a single wolf sign point (Elbroch 2003; Gese 2004). Because we did not remove wolf scats, GPS locations were recorded for individual points and were used (along with sign age/condition) to avoid re-counting. We mapped GPS collar and wolf sign locations manually in Google Earth v. 7.1.1.1888 (Google Inc., Mountain View, CA). All of the wolf work was conducted following the conditions established in University of Notre Dame Institutional Animal Care and Use Committee protocols 12-009, 15-015, and 15-022 and followed the guidelines of the American Society of Mammalogists (Sikes et al. 2011).

Site selection and exclosure construction

We selected ten northern mesic forest patches for exclosure construction in 2009 and included two additional patches as study sites in 2011. These sites (average distance to nearest neighbor >1 km) were distributed evenly among high and low wolf use areas throughout UNDERC and consisted of 10 × 10-m paired control (open) and experimental (enclosed to prevent deer feeding) plots. At each site, the paired exclosure and control plot were adjacent (<10 m apart) so that forest conditions were as similar as possible. We constructed 12 × 12-m exclosures around the experimental plots out of 2.4-m high polypropylene mesh fencing (Deerbusters, Frederick, MD). This allowed for a 1-m buffer around the plot to prevent unwanted fence effects and provide a walkway for researchers. To exclude only deer, we cut 0.2 × 0.2-m holes into the bottom of the fencing to allow entry by smaller herbivores and predators.

Deer responses

We monitored deer activity on five high and five low wolf use plots (controls) using ten motion-activated, infrared-flash camera traps (Bushnell, Overland Park, KS). Cameras were deployed for 3 weeks, spanning June–August 2012, which is an adequate period of time for a seasonal white-tailed deer survey (Jacobson et al. 1997). Cameras had an effective capture and trigger range of 12–15 m, which allowed for clear, differentiable photography of deer behavior within the plots at all hours of the day (color) and night (infrared). Three time-stamped images per trigger (10-s delays between triggers) were taken. We used images of single deer visits as an index of deer density (Augustine and Frelich 1998; Altendorf et al. 2001). As an index of movement we used the duration of individual visits (time elapsed between first and last image of an individual). We considered a deer to be foraging when images showed the head in a down position. The proportion of images indicating foraging (head down) was used as an index of foraging time within the deer activity budget (Altendorf et al. 2001). Images per site were lumped as one summer of data.

Plant responses

We recorded the height of all saplings in each plot annually in August (2009–2012) in order to track changes in growth. A sapling was classified as an individual with a height of 25–240 cm. Those individuals with a height of <25 cm were either new seedlings or unavailable to deer during the previous winter (snow cover), and young trees which exceeded a height of >240 cm (exclosure height) have grown beyond the reach of deer and are part of the subcanopy. We also recorded woody browse damage on all saplings. Damage by deer (rough cuts) was differentiated from that by snowshoe hare (Lepus americanus; angular cuts). We also established 20 random 1 × 1-m subplots in each of the deer experimental and control plots in 2010 to monitor forb community characteristics. We surveyed these subplots each August, and the number of forb species present (pooled per plot) was used to estimate species richness.

Statistical analyses

To confirm that wolf use was statistically different between high and low wolf use areas, we compared the number of collar fixes within 600 m of exclosure sites using a Mann–Whitney U test. As an additional test, we also compared the distribution of all wolf sign data (scat and other use data pooled) between high and low wolf use areas using a chi-square test. The chi-square test was corrected for differences in road area between high (approx. 14.6 km) and low (approx. 22.7 km) wolf use areas. Differences in average frequency of deer use, visit duration (square-root transformed), percentage of activity spent foraging, and the percentage of browsed saplings (logit transformed) between high and low wolf use areas were examined using Student’s t tests.

We quantified deer herbivory impacts on sapling growth and forb species richness using log response ratios [Log (Exc x /Con x ), where Exc x represents the plant value for an exclosure plot x, and Con x represents the value for its adjacent control plot] and then analyzed whether wolf use significantly affected these impacts using a multi-way repeated measures analysis of variance (ANOVA), with wolf use and year as factors. In the case of a significant interaction, we used one-way repeated measures ANOVAs to see if there were significant changes over time in each wolf use category separately. The log response ratio is commonly used in exclosure/cascade studies as a measure of effect size because it reflects the proportional change resulting from the experimental manipulation (Hedges et al. 1999; Schmitz et al. 2000; Hebblewhite et al. 2005; Preisser et al. 2005; Rooney 2009). It is also preferred over the normal response ratio because the log ratio is affected equally by changes in the denominator or numerator, while the normal ratio is affected more by changes in the denominator (Hedges et al. 1999). Positive values indicate higher exclosure values than matched controls, while negative values indicate lower exclosure values.

We also wanted to compare our study results to those of previous wolf trophic cascade studies performed in the much different Rocky Mountain systems. Unfortunately, there were no comparable studies which used a dichotomous high versus low wolf use comparison and exclosures. Most Rocky Mountain studies have used either gradients of wolf use or pre-wolf reintroduction versus post-wolf reintroduction comparisons (Kauffman et al. 2010; Hebblewhite and Smith 2010). Therefore, our selection was limited to two studies which used the comparison of high and low wolf use areas and measures of similar plant characteristics. The first was the study by Ripple et al. (2001) which looked at a wolf–elk–aspen system in Yellowstone National Park (Wyoming, USA), and the second was the study by Hebblewhite et al. (2005) which looked at a wolf–elk–aspen system in Banff National Park (Alberta, Canada). In order to compare our system with the systems of these studies, we performed additional tests to compare average sapling height and the number of 100-cm size class saplings in 2012 between high and low wolf use control plots. Saplings in the 100-cm size class were limited to individuals established during wolf recolonization (diameter at breast height <2.5 cm based on Lessard et al. 2001 and unpublished data). We used Student’s t tests for these comparisons, or Mann–Whitney U tests if residuals were non-normal. We also calculated effect sizes (log response ratios) for these as well as deer density, visit duration, foraging allotment, and percentage saplings browsed for non-statistical side-by-side comparison. Foraging allotment was compared with high versus low wolf use results from a wolf–elk study in Yellowstone National Park (Childress and Lung 2003).

We also compared 2012 maple sapling density (square-root transformed), maple relative abundance, and forb species richness on control plots using Student’s t tests in order to see whether there were substantial differences between high and low wolf use areas. The comparison of forb species richness allowed us to assess whether our data would support the results of a previous study by Callan et al. (2013) which found significant increases in forb species richness upon the re-occupation of sites in Great Lakes forests by wolves for 8–10 years.

Results

Wolf use

Analysis of GPS telemetry and wolf sign data verified that experimental sites differed in terms of wolf use. Based on GPS collar telemetry points, wolves were in the vicinity (600 m) of high wolf sites 24-fold more frequently than in the vicinity of low wolf sites [ ± standard error: 4 ± 1.21 vs. 0.17 ± 0.17 fixes, respectively; U = 3.5, df = 10, P = 0.013] [Electronic Supplementary Material (ESM) 1]. The distribution of 130 wolf sign observations also corroborated the GPS collar location distribution, with 95 % of wolf sign observations (df = 1, χ2 = 169.020, P < 0.001; annual range of 92–98 %) being within high wolf use areas.

Deer responses

Deer exhibited very different behavior in areas of high and low wolf use, respectively. Deer visited high wolf use plots less frequently than they did low wolf use plots (1.00 ± 0.47 vs. 2.60 ± 0.60 visits per week; t = −2.096, df = 8, P = 0.035) (Fig. 1a), and the duration of the visits to high wolf use areas was shorter (36.0 ± 23.0 vs. 197 ± 85.0 s; t = −2.228, df = 8, P = 0.028) (Fig. 1b). Deer also spent a lesser proportion of their time foraging in high wolf use areas (32.7 ± 11.4 vs. 57.0 ± 3.7 %; t = −2.5065, df = 6, P = 0.023) (Fig. 1c).

Fig. 1
figure 1

Impacts of wolf (Canis lupus) activity on white-tailed deer (Odocoileus virginianus) on visit frequency (a), visit duration (b), foraging activity budget (c), and browse damage to maple (Acer spp.) (d). Data were collected from University of Notre Dame Environmental Research Center (UNDERC) control plots in 2012 using camera trap images (ac) or sapling surveys (d) of these plots. Bars: mean ± standard error (SE). n = 10 (a),  10 (b),  8 (c), 12 (d)

Sapling characteristics and responses

In this study, we selected the most common deer-preferred sapling species (maple, Acer spp.; >90 % of preferred saplings) to perform our analyses in order to avoid comparing plant species with different inherent growth rates and deer palatability. Maple sapling density was not significantly different between the areas of high and low wolf use (3.31 ± 1.34 vs. 1.68 ± 1.27 m−2, respectively, square-root transformed; t = 1.1164, df = 10, P = 0.290) (ESM 2a); relative abundance of maple saplings (ESM 2b) was also not significantly different (62.2 ± 13.5 vs. 66.2 ± 15.8 %, respectively; t = −0.1891, df = 10, P = 0.854). Per plot, deer browsed a substantially higher proportion of all maple saplings in low wolf use areas (49.7 ± 12.1 vs. 7.3 ± 3.1 %, logit transformed; t = −2.9075, df = 10, P = 0.008) (Fig. 1d).

We limited analyses of maple sapling growth and forb diversity (see subsection “Forb responses”) using exclosure plot data for exclosures built in 2009 to maintain a constant exclusion time among plots. The impact of deer herbivory on average maple sapling height differed significantly between areas of high and low wolf use (F 1,8 = 13.32, P = 0.006) and years (F 3,24 = 5.48, P = 0.005). The interaction of these two factors was also significant (F 3,24 = 6.18, P = 0.003) (Fig. 2a). In the low wolf use plots, the response ratio increased by twofold from 2009 to 2012, indicating that maple saplings grew substantially taller when protected from deer browse by deer exclosures (one-way ANOVA F 3,12 = 4.08, P = 0.033). However, exclosures did not provide any significant benefit to maple sapling growth in high wolf use areas, where the response ratio remained statistically unchanged (total decline of <3.6 %) from 2009 to 2012 (one-way ANOVA F 3,12 = 1.62, P = 0.236). However, this pattern was not observed for the sapling species that deer do not prefer to consume (balsam fir, Prunus serotina, Picea spp., etc.). Log response ratios for non-preferred saplings were not significantly different based on wolf use (F 1,8 = 0.001, P = 0.976), year (F 3,24 = 0.914, P = 0.449), or the wolf × year interaction (F 3,24 = 0.210, P = 0.888) (Fig. 2b).

Fig. 2
figure 2

Results of experiments comparing deer impacts on areas of high and low wolf use on maple sapling height (a), non-preferred (N-P) sapling height (b), and natural survey design tests (c, d) comparing maple sapling height (2012) (c) and 100-cm size class maple sapling abundance (d). The response ratios (E x /C x ; see section “Statistical analyses” for explanation of Exc x /Con x ratio) and non-transformed data are shown. Values are the mean ± SE (n = 10)

Post-hoc comparisons showed significant differences in maple sapling growth between high and low wolf use areas. Average maple sapling height (Fig. 2c) was approximately 70.6 ± 10.0 cm in high wolf use areas and approximately 30.1 cm ± 1.5 cm in low wolf use areas (t = 6.051, df = 10, P < 0.001). In addition, significantly more saplings were recruited to the 100-cm size class in high wolf use areas than in low wolf use areas (38.2 ± 17.8 vs. 1.3 ± 0.80, respectively; U = 1.5, df = 10, P = 0.005) (Fig. 2d).

Forb responses

The impact of deer herbivory on forb species richness was also significantly affected by wolf use (F 1,8 = 6.636, P = 0.033), but not year (F 2,16 = 1.418, P = 0.271). There was no significant interaction between these two variables (F 2,16 = 0.800, P = 0.466) (Fig. 3a). Average forb species richness increased by 38–110 % in exclosures in low wolf use sites while it was equivalent (SE within a ratio of 1) between exclosures and controls in high wolf use areas. Post-hoc comparisons between high and low wolf use control plots showed that forb species richness (Fig. 3b) was significantly greater in high wolf use areas than low wolf use areas (7.2 ± 0.83 vs. 4.0 ± 0.68; t = 2.939, df = 10, P = 0.007).

Fig. 3
figure 3

Comparison of forb species richness between high- and low-wolf use areas using Exc x /Con x response ratios (E x /C x ; a) and control plot values (2012) (b). Values are the mean ± SE (n = 10)

Discussion

The density and behavior of UNDERC deer in high wolf use areas were significantly different from those of UNDERC deer in low wolf use areas. Deer density, visit duration, and foraging time allocation were lower by 62 %, 82 %, and 43 % on high wolf use plots compared to low wolf use plots. These differences in deer density and behavior result in different impacts on plants. The proportion of saplings browsed in high wolf use areas was also lower—by 85 %. In addition, in high wolf use areas average maple sapling height more than doubled, forb species richness nearly doubled, and recruitment of maples was 24-fold higher. Comparisons of paired experimental and control plots suggested that the presence of deer in low wolf use areas had negative impacts on maple sapling growth and forb biodiversity. Conversely, in high wolf use areas, deer impacts on maple sapling growth and forb diversity were negligible. Taken together, these results suggest that wolves have substantial effects on white-tailed deer and also positively impact maple sapling growth and forb biodiversity as a result.

Our results support previous research which suggested that wolves have density- and/or trait-mediated effects on white-tailed deer (Mech 1977; Mech and Karns 1977). They also provide manipulative experimental support for previous natural experiments which suggest wolves can generate trophic cascades affecting sapling recruitment in various regions (McLaren and Peterson 1994; Ripple et al. 2001; Hebblewhite et al. 2005) and forb species richness in the Great Lakes forests (Callan et al. 2013).

However, there are other possible factors, such as differences in abiotic characteristics, which could lead to similar plant results regardless of top–down interactions (Hebblewhite and Smith 2010; Kauffman et al. 2010). We attempted to address some of these concerns by also analyzing sapling species which deer do not readily consume (i.e., non-preferred). Non-preferred plant species should not be strongly affected by changes in deer herbivory (White 2012), but they should react to confounding factors. Our observed lack of a significant difference in non-preferred sapling growth does not support the alternative hypothesis that factors other than wolf–deer interactions are responsible for the observed differences in maple and thus further supports the wolf–deer trophic cascade hypothesis. Meanwhile, given the short duration of both our experiment and wolf occupancy at UNDERC, the lack of significant differences in maple sapling density and relative abundance should be expected as maple saplings can survive repeated damage due to deer browsing over several years (albeit as short, stunted individuals) (Anderson and Loucks 1979).

Differing escape impediment levels, visibilities, and slopes could also explain our behavioral results (Underwood 1982; Kunkel and Pletscher 2001; Ripple and Beschta 2007; Valeix et al. 2009; Ripple et al. 2010). However, there was little difference in visibility as all of our UNDERC sites are densely forested, escape impediments (i.e., downed logs) were also relatively equivalent at all sites, and the terrain lacks the sharp contrasts in elevation seen in the Rocky Mountain elk behavior studies. Furthermore, a recent study using camera traps to record red deer (Cervus elaphus) behavior in control and experimental areas (where researchers placed fresh wolf scats) suggests that the olfactory cues of wolves are sufficiently strong to greatly increase deer vigilance (Kuijper et al. 2014). These results are in line with our scat distribution and deer behavior observations.

Great Lakes population patterns also suggest that any recent wolf cascade would have to be trait-mediated, not density-mediated (Rooney and Anderson 2009). Based on an average density of 28–36 wolves per 1000 km2 (Beyer et al. 2009; Wydeven et al. 2009) and 7000–12,000 deer per 1000 km2 (B.J. Dhuey, Wisconsin Department of Natural Resources data), as well as a wolf kill rate of 15–19 deer per wolf per year (Mech and Peterson 2003), wolves should be directly killing <10 % of available deer per year in this region. However, our observed reductions of deer use and sapling browse damage in high wolf use areas were multiple fold greater than 10 %, further supporting the potential importance of trait-mediated effects at UNDERC. Furthermore, regional deer numbers have been fairly stable since 1989 (DelGiudice et al. 2009), at which time the combined Michigan and Wisconsin population of wolves was <40 animals (Beyer et al. 2009; Wydeven et al. 2009).

In our study, the combined density-mediated and trait-mediated effects of the carnivore on the herbivore appear to be stronger than the effect of the carnivore on plant biomass (i.e., growth), which is consistent with the results of previous trophic cascade studies (Schmitz et al. 2000; Shurin et al. 2002, Table 1). Effect sizes of the carnivore on herbivore density, plant damage, and plant biomass are also within the respective ranges for terrestrial systems based on literature reviews. However, the effect size on biomass at UNDERC is higher than that found in most terrestrial studies conducted to date (Schmitz et al. 2000; Shurin et al. 2002; Hebblewhite and Smith 2010).

Table 1 Comparison of effect sizes among Yellowstone National Park (Wyoming, USA), Banff National Park (Alberta, Canada), and UNDERC (Wisconsin/Michigan, USA)

The differences in response strengths between wolf–ungulate studies also further highlight the variability of these direct and indirect effects. The effect sizes of wolves on saplings in our study system appear to be much stronger than those reported in comparable Rocky Mountain wolf–ungulate studies (see Table 1). We suspect that this difference is due, at least in part, to differing strengths of density- and trait-mediated effects in each system. Strong trait-mediated effects are expected to generate much stronger cascades than strong density-mediated effects. Trait-mediated effects reduce herbivory by all prey at risk, whereas density-mediated effects only eliminate herbivory by killed individuals (Schmitz et al. 2004; Preisser et al. 2005). Recent evidence suggests that density-mediated effects are quite strong in the Rockies (Kauffman et al. 2010; Hebblewhite and Smith 2010), while trait-mediated effects are weak (Creel et al. 2008; Kauffman et al. 2010) and may only explain plant growth variation within the same areas (Painter et al. 2014, 2015). Conversely, the declines in deer numbers due to wolf predation in the Great Lakes region are not nearly as large as those found for elk in the Rocky Mountain region (DelGiudice et al. 2009). However, our study suggests that wolves generate strong trait-mediated effects on UNDERC deer, and these trait-mediated effects appear to be stronger than those in the Rockies (see Table 1).

We suggest the strengths of the trait-mediated effects may be greater at UNDERC due to differences in behavior. Trait-mediated effect strength varies by predator hunting strategy (Preisser et al. 2007), and wolves switch hunting strategies based on prey size and habitat (Peterson and Cucci 2003). Active (search and chase) predators are expected to produce much weaker responses than ambush predators because active predators are more easily detected from a safe distance (Preisser et al. 2007; Creel et al. 2008; Schmitz 2008). Herding also helps prey detect predators (Dehn 1990). Wolves roam and chase herds of large elk through open ranges and pick out susceptible animals, whereas wolves hunting smaller deer in dense forests sneak up and ambush individuals (Kunkel and Pletscher 2001; Peterson and Cucci 2003). Thus, wolves should be expected to generate stronger trait-mediated effects when hunting deer than when hunting elk.

An alternative hypothesis is that differences in time since wolf recovery could explain stronger cascading effects (Callan et al. 2013). The length of time after wolf recovery was indeed shorter in the Yellowstone studies (4 years in Ripple et al. 2001, 3–6 years in Childress and Lung 2003) than in our study (6–10 years). However, we also saw stronger effects at UNDERC than those observed at Banff National Park, where the recovery of wolves was longer (11–13 years). Comparison with more recent studies conducted at Yellowstone (Painter et al. 2014, 2015; 17 years post-wolf recovery) still show fivefold greater browse intensities by a sharply reduced elk population than that observed in UNDERC deer in high wolf use areas. Therefore, we propose that differences in wolf exposure time cannot fully explain observed differences in effect size between the Rockies and UNDERC.

Researchers have questioned some conclusions of past wolf–ungulate–plant studies in other systems because the use of natural experimental designs makes it difficult to eliminate confounding factors as alternative explanations. With respect to Yellowstone National Park, droughts, increased winter severity, and human harvest could be more important factors affecting elk declines, and varying inherent abiotic conditions (e.g., elevation, water table, etc.) among the sites there could be responsible for differences in aspen growth (Hebblewhite and Smith 2010; Kauffman et al. 2010; Peterson et al. 2014). We did our best to avoid or minimize the influence of these confounding factors. Instead of only comparing plant community characteristics between high and low wolf use areas, we designed a manipulative experimental set-up (deer exclosures) which allowed us to isolate and compare the impact of the ungulate on plants. By using a high versus low wolf use area comparison, we also avoided the confounding factors of winter severity or drought which affect pre-wolf versus post-wolf designs, as both wolf use areas should have been affected equally. Moreover, there were no obvious contrasts in abiotic conditions between high and low wolf use areas. Deer harvest in Great Lakes wolf range has also remained relatively constant since 1989 (DelGiudice et al. 2009), and our documented changes in deer foraging behavior provide evidence for the hypothesized cascade mechanism, further supporting our conclusions.

Our understanding of trophic cascades involving large terrestrial carnivores requires additional research (Shurin et al. 2002; Schmitz et al. 2004), and this work represents an important early step in addressing these hypotheses. Our study provides one of the first manipulative tests to support the presence of trophic cascades in forest ecosystems involving large mammalian carnivores and herbivores. Ideally, future studies will test these hypotheses over a wider region than UNDERC. Our study was limited by the presence of only one wolf territory occupying UNDERC (although there were others just beyond our borders; unpublished data). This inherently limits our ability to generalize our results. However, given the strength and the straightforward nature of our findings, as well as our use of comparisons over time, we do not expect that there would be major qualitative differences in the results of our own work and future replications of this study (Oksanen 2001). This is even further supported by our forb richness results, which are in line with the results of another Great Lakes forest study (Callan et al. 2013).

The ability of predator-free ungulates to become drivers of environmental change in temperate and boreal forests is becoming increasingly recognized as a major conservation issue (Ripple et al. 2010). There is also a rising awareness that predator removal and reintroduction can have dramatic impacts on ecosystems (Estes et al. 2011). The question then is whether restoring these ungulate predators will halt these environmental changes (Rooney and Anderson 2009; Licht et al. 2010). If maples continue to grow towards the canopy in our high wolf use sites and if these results are transferable to other Great Lakes forests, then we propose that the natural regeneration of maples in Great Lakes forests will be restored in areas with wolves in similar fashion to what has occurred with aspen in Yellowstone (Painter et al. 2014, 2015) and Banff National Park (Hebblewhite and Smith 2010). This re-establishment of maple regeneration would allow maples to continue to be the dominant canopy species and possibly allow other preferred tree species to regenerate. We also propose that forb diversity in high wolf use areas at UNDERC will increase further over time as new species colonize from other locations and become established (Callan et al. 2013). We thus suggest that wolves and other large mammalian predators have the potential to be useful tools for forest ecosystem restoration, promoting the regeneration of ungulate-preferred species and/or increased forb diversity through the generation of density-mediated and/or trait-mediated trophic cascades.