Introduction

The successful establishment of non-native insect herbivores depends, in part, upon climatic suitability and the availability of host plants in the introduced range. As poikilotherms, insects are directly affected by temperatures, which can affect overwintering strategies and immature development (Sharpe and DeMichele 1977; Tauber et al. 1986). Not surprisingly, physiological restraints to temperature play an important role in determining the distributional ranges of invading and native insects (Bale et al. 2002; Jepsen et al. 2008; Tobin et al. 2014b). With temperatures projected to increase due to anthropogenic climate change, many species are expected to experience changes in their distributional ranges (Hill et al. 2011; Weed et al. 2013). Past work on the response of individual forest insects to climate change has revealed variable responses that range from intensification in outbreaks and range expansion, to outbreak collapse and range retraction (Esper et al. 2007; Haynes et al. 2014; Logan et al. 2003; Tobin et al. 2014a). It remains integral to understand the effect that climate will have on insects, especially for invading species that cause ecological and economic damage.

The geographic ranges and population density of insect herbivores are also dependent on the availability and quality of host plants upon which they depend (Collinge 2000; Hunter 2002). Immigration to and emigration from habitat patches can also be affected by patch size and their spatial distribution across the landscape (Connor et al. 2000; Cronin 2003). Patches with greater host plant area typically yield higher population densities (Denno et al. 1981), and patch connectivity is a critical factor that affects patch occupancy and the regional dynamics of insect populations (Hanski 1998; Hanski and Gilpin 1991).

Upon successful establishment, non-native insect herbivores spread into new areas, often through stratified diffusion in which new colonies are formed by both short- and long-range dispersal events (Hengeveld 1989; Liebhold and Tobin 2008; Shigesada et al. 1995). Understanding the interplay between climate and host plant availability in an invading species can inform management plans. However, tracking an invading species at the low densities typical in newly established colonies is not trivial due to a lack of sensitive detection tools and a comprehensive monitoring network. The invasion of the European strain of Lymantria dispar (L.) in the United States is an exception in that its invasion along an expanding population front has been extensively monitored under the Slow-the-Spread program (Grayson and Johnson 2018; Tobin and Blackburn 2007).

Since its introduction in 1869, L. dispar has established in the eastern and Midwestern United States (Tobin et al. 2012), or roughly one third of its potential habitat in the United States (Morin et al. 2005), and continues to spread. Its rate of spread varies through time and space, and is believed to fluctuate as a function of variation in host availability, frequency of human transport, federal management, and Allee effects (Bigsby et al. 2011; Contarini et al. 2009; Hajek and Tobin 2009; Johnson et al. 2006; Liebhold et al. 1992; Liebhold and Tobin 2006; Tobin et al. 2007b). Because of the economic and ecological consequences of L. dispar invasion (Doane and McManus 1981; Elkinton and Liebhold 1990; Tobin et al. 2012), many aspects of its population biology and ecology have been studied. This includes the relationship between L. dispar invasion potential and climate (Allen et al. 1993; Gray 2004; Logan et al. 2007; Pitt et al. 2007) and the relationship between L. dispar outbreaks and the density of preferred host plants (Haynes et al. 2009; Herrick and Gansner 1986; Liebhold et al. 1994).

Populations of L. dispar are constrained by suboptimal and supraoptimal temperatures. While exposure to cold temperatures is a requirement to terminate diapause, exposure to temperatures <  − 22 °C for extended periods increases mortality (Gray 2009). Furthermore, the supercooling point in overwintering egg masses has been estimated at − 28 °C (Doane and McManus 1981). Supraoptimal temperatures during immature development can also affect L. dispar populations. For example, Tobin et al. (2014a) reported on range retraction in response to supraoptimal temperatures during spring–summer larval feeding. Thompson et al. (2017) found that extended exposure to supraoptimal temperatures affected both the larval mass and developmental time to 4th instar, and reported a lethal supraoptimal temperature of ~ 32 °C.

Despite the breadth of L. dispar host species (Elkinton and Liebhold 1990; Liebhold et al. 1995), there can be considerable spatial variation in defoliation throughout the eastern United States, which is thought to be a result of L. dispar preferential feeding (Liebhold et al. 1994). Although larvae are polyphagous folivores that can feed on > 300 host plant species in the United States, 146 species are considered as primary host plants (Liebhold et al. 1995), and only a portion of primary host plants are likely to be present in a specific area of invasion. Larval feeding can be divided into two phases: early larval feeding (instars 1 to 3), and late larval feeding (instars 4 to 5/6). Early larval feeding is initiated in the spring following egg hatch, which typically extends over three weeks (Stoyenoff et al. 1994). Early instars have more dietary limitations relative to late instars because the former are not metabolically adapted to host plant defensive compounds (Barbosa and Greenblatt 1979; Barbosa and Krischik 1987), which can reduce survivorship and larval weight, and increase developmental time (Miller and Feeny 1983). Consequently, L. dispar host plants can be broadly divided into primary or secondary hosts. Primary hosts support L. dispar larval development with minimal, if any, costs to fitness, while secondary hosts can generally only be consumed by later instars. Although primary hosts have received more attention in L. dispar population dynamics given that a sufficient quantity of primary hosts is needed to support the development of outbreaks (Herrick and Gansner 1986; Liebhold et al. 1994; Haynes et al. 2009), scant attention has been given to the potential role of secondary host species in supporting expanding, low-density L. dispar populations. Moreover, it is not known if suboptimal and supraoptimal temperatures interact with primary and secondary host plant density and fragmentation to affect nascent L. dispar population growth at a landscape scale. In this study, we used a 16-year spatially-referenced dataset along the L. dispar expanding population front from Minnesota to North Carolina, USA, to quantify the role of temperature, and primary and secondary host plant density and distribution, on L. dispar population growth rates along its leading invasion front.

Materials and Methods

Study region

The spatial scale of this study encompassed the L. dispar invasion front within the United States from 1999 to 2015, which included an area from Minnesota to North Carolina. The invasion front was subset into three regions due to differences in regional range dynamics (Tobin et al. 2007b), temperature regimes (Gray 2004), and host plant availability (Morin et al., 2005). The Northern region consisted of Michigan, Wisconsin, and Minnesota; the Midwestern region consisted of Illinois, Indiana, and Ohio; and the Southern region consisted of North Carolina, Virginia, and West Virginia. To illustrate the differences in L. dispar spread by region, we estimated the year-to-year rate of L. dispar spread using boundary displacement (Sharov et al. 1996; Tobin et al. 2007a), and mean spread rates by region are presented in Table 1. Spread rates have been highest in the Northern region, lowest in the Midwestern region, and intermediate in the Southern region.

Table 1 Mean (± SD) region-specific rates of L. dispar spread, and mean minimum January and mean maximum July temperatures, 1999–2015

Lymantria dispar growth rates

Growth rates were derived from monitoring data collected by the Slow-the-Spread program in which ~ 100,000 georeferenced pheromone-baited traps are deployed annually along and ahead of the L. dispar population front (Tobin et al. 2012). Pheromone-baited traps are generally deployed 2–8 km apart, with a finer resolution deployed in areas farthest from the established L. dispar area (Tobin et al. 2012). We used data from 1999–2015 as these years generally contained a continuous trap grid across the entire invasion front. Because some areas within the Slow-the-Spread program are treated for L. dispar, all traps within 1.5 km of a treated area were excluded; generally, < 2% of the monitoring area is treated each year (Tobin et al. 2012). Because trap locations can shift from year-to-year, trap data (male moths/trap) from each year were interpolated using indicator kriging in GSLIB (Deutsch and Journel 1992) over a network of 5 × 5 km cells to generate a smooth surface (Sharov et al. 1995, 1997a). Within each region, we calculated growth rates for each 5 × 5 km cell. Because the Slow-the-Spread program monitors areas that are both along and ahead of the expanding population front, and our interest on newly establishing populations, we only used growth rates from cells at which there was an initial interpolated value > 0 and < 1 male moths/trap. To ensure growth rates within cells were comparable among cells through time and region, the data were further truncated to include cells that contained at least one value of 0 within the time sequence (1999–2015). We also estimated growth rates in cells with interpolated values from > 0 and up to 10 male moths/trap. We used 10 as a threshold due to our interest in newly establishing L. dispar colonies given that 10 moths/trapping area is a benchmark used in the L. dispar Slow-the-Spread program to denote L. dispar establishment and subsequent incorporation of the trapping area into the L. dispar federal quarantine (United States Department of Agriculture 2019). Growth rates from year t to t + 1 were calculated for each individual 5 × 5 km cell and for each pair of years (e.g., 1999 to 2000, 2000 to 2001, …, 2014 to 2015) according to:

$$Growth\,Rate = log_{e} \left( {\frac{{N_{t + 1} }}{{N_{t} }}} \right),$$
(1)

where Nt and Nt+1 are the interpolated values (male moths/trap) in the current and following year, respectively.

Temperature data

Temperature data were obtained through the PRISM Climate Group (2017). To represent the potential effect of suboptimal and supraoptimal temperatures for L. dispar development (Logan et al. 1991; Tobin et al. 2014a), we used mean daily minimum January and mean daily maximum July temperatures at a resolution of 5 × 5 km for all years from 1999 to 2015 (PRISM Climate Group 2017), which we matched with the growth rate in the cell during the corresponding time period. For example, the growth rate in a cell from 1999 to 2000 was linked to the mean January minimum temperature in 2000 in this cell (to which overwintering eggs are subjected regardless of the region) and the mean July maximum temperature in 2000 in this cell (to which late instars-to-adults are subjected, depending on the region). Although we recognize that different L. dispar stages are present in July depending on the region, we used July maximum temperatures as a proxy for maximum summer heat. The mean minimum January temperature and mean maximum July temperature are summarized by region in Table 1.

Host plant fragmentation

Forest inventory data were obtained from the U.S. Forest Service, Forest Inventory Analysis (2017) as an ArcGIS layer at a scale of 240 × 240 m. These inventories are typically conducted every 5–15 years and include more than 1000 plots within each state on both private and federal lands. From this inventory, we quantified the proportion of basal area (e.g., the mean amount of an area occupied by tree stems, which is based upon stem diameter ~ 1.3 m from the ground) attributed to each tree species within the study region. Tree species were classified as either primary or secondary host plants according to Liebhold et al. (1995). Common primary host tree genera in the study region included aspen (Populus spp.), birch (Betula spp.), hawthorn (Crataegus spp.), larch (Larix spp.), oak (Quercus spp), and willow (Salix spp). Common secondary host tree genera in the study region included beech (Fagus spp.), elm (Ulmus spp.), hickory (Carya spp.), maple (Acer spp.), pine (Pinus spp.), and walnut (Juglans spp.).

We used FRAGSTATS 3.3 (McGarigal et al. 2012) to estimate mean patch cohesion for primary and secondary host plants. Mean patch cohesion was estimated across a network of 240 × 240 m cells that were fully contained into each 5 × 5 km cell, which was linked to L. dispar growth rates. The patch cohesion index quantifies the connectivity of a specific host type patch within the entirety of the landscape. Patch cohesion increases as the host type patch becomes more aggregated within the landscape. This index will approach 0 as a host type becomes more fragmented, and approach 100 as a host type becomes less fragmented. Patch cohesion index, PCI, was estimated according to:

$${\text{PCI}} = \left[ {1 - \frac{{\mathop \sum \nolimits_{j = 1}^{n} p_{ij} }}{{\mathop \sum \nolimits_{j = 1}^{n} p_{ij} \sqrt {a_{ij} } }}} \right]\left[ {1 - \frac{1}{\sqrt A }} \right]^{ - 1} \cdot \left( {100} \right),$$
(2)

where pij is the perimeter of patch ij in terms of the number of cell surfaces, aij is the area of patch in terms of the number of cells, and A is the total number of cells within the landscape.

Statistical analyses

Due to spatial autocorrelation in L. dispar growth rates, which complicates data analysis due to the lack of independence, we sought to account for spatial autocorrelation in growth rates in our analyses. First, we estimated spatial autocorrelation in L. dispar growth rates in each region across the network of 5 × 5 km cells for each pair of years from 1999 to 2015. We used the ncf package (Bjørnstad 2020) in R (Bjørnstad 2020; R Core Team 2018). Confidence intervals were estimated using the 2.5 and 97.5 percentiles of a bootstrapped distribution based on 300 replications (Efron and Tibshirani 1993). We also estimated spatial synchrony in L. dispar growth rates in each region using the ncf package (Bjørnstad 2020). Due to computation challenges given the spatial and temporal extent of the dataset (i.e., ~ 5000 cells for each of the 16 year-to-year pairs of growth rates), we randomly selected a subset of 500 locations through time for each region, from which we estimated spatial synchrony. This was repeated 5000 times to estimate a mean and confidence interval based on the 2.5 and 97.5 percentiles of the bootstrapped distribution. We then accounted for spatial autocorrelation in growth rates by including the distance-weighted mean of the growth rate as a term in our analyses according to Walter et al. (2015). Based on spatial autocorrelation of the growth rates (see results), cells in the Midwestern and Southern regions that were > 100 km apart were assigned a weight of 0. Within the Northern region, cells > 160 km apart were assigned a weight of 0. Otherwise, the weight, wij, was calculated based on the fitted relationship between spatial autocorrelation and distance in each region according to:

$${\text{Northern region}}:w_{ij} = {6}.{7 }{-} \, 0.{54} \times {\text{log}}_{{{1}0}} ({5}.{445} \times {1}0^{{ - {4}}} + d_{ij} ),$$
(3)
$${\text{Midwestern region}}:w_{ij} = - 0.{16 }{-} \, 0.{94} \times {\text{exp}}( - {1}.{9} \times {1}0^{{ - {5}}} + d_{ij} ),$$
(4)
$${\text{Southern region}}:w_{ij} = - 0.0{44 }{-} \, 0.{51} \times {\text{exp}}( - {1}.{6} \times {1}0^{{ - {5}}} + d_{ij} ),$$
(5)

where dij is the straight-line distance between the focal points i and j (Anselin and Bera 1998). The distance-weighted mean growth rate,\(\overline{\lambda }_{m}\), was then calculated as:

$$\overline{\lambda }_{m} = \frac{{\mathop \sum \nolimits_{j = 1}^{n} w_{ij} \lambda_{j} }}{{\mathop \sum \nolimits_{j = 1}^{n} w_{ij} }},$$
(6)

where λj are the mean growth rates within the previously defined spatial autocorrelated neighborhood for each region (Walter et al. 2015).

In subsequent analyses, we use generalized additive models (GAMs). GAMs combine properties of generalized linear models and additive models, allowing the replacement of linear regression coefficients with nonparametric smooth functions such as splines (Hastie and Tibshirani 1987). Using smoothed estimates for covariates is advantageous as it allows for the detection of nonlinear relationships, such as those between environmental covariates and aspects of L. dispar population dynamics (Haynes et al. 2012; Sharov et al. 1997b). In GAMs, smooth functions are penalized for increased nonlinearity to balance model fit and complexity. We used a backward selection protocol, based on Wood and Augustin (2002), to arrive at a parsimonious model for each region beginning with the following full model:

$$\begin{aligned} \lambda_{{\text{m}}} = & s\left( {\overline{\lambda }_{m} } \right) + s\left( {{\text{JanM}}} \right) \, + s\left( {{\text{JulM}}} \right) \, + s\left( {{\text{PrimeCoh}}} \right) \, + s\left( {{\text{SecCoh}}} \right) \, + s\left( {{\text{PrimeBA}}} \right) \\ & + s\left( {{\text{SecBA}}} \right) \, + te\left( {{\text{PrimeCoh}},{\text{ SecCoh}}} \right) \, + te\left( {{\text{SecBA}},{\text{ PrimeBA}}} \right) \, + te\left( {{\text{PrimeCoh}},{\text{ PrimeBA}}} \right) \\ & + te\left( {{\text{SecCoh}},{\text{SecBA}}} \right) \\ \end{aligned}$$
(7)

in which the growth rate, λm, was predicted by mean minimum January temperature (JanM), mean maximum July temperature (JulM), primary (Prime) and secondary (Sec) host cohesion (Coh) and basal area (BA), the distance-weighted mean growth rate (\(\overline{\lambda }_{m}\)), and biologically important interaction effects. Within the full model, s indicates a smooth spline function of the covariates and te indicates a tensor product smooth. Following Wood and Augustin (2002), variables were removed from the model if: (1) the estimated degrees of freedom for that term were close to 1; (2) the confidence region for the smooth function included zero for all values of the independent variable; and (3) the generalized cross-validation (GCV) score for the full model decreases if the term is removed. GAMs were implemented using the mgcv package (Wood 2006) in R (R Core Team 2018).

Results

Summary statistics on the mean L. dispar growth rate for each region are presented in Table 2. Overall mean growth rates (i.e., across all years) were highest in the Northern region and lowest in the Midwestern region. The highest and lowest year-to-year mean growth rate was observed from the Northern region, which was the region most likely to be subjected to colder overwintering conditions (Table 1), including the 2014 North American cold wave during which the lowest year-to-year growth rate was observed.

Table 2 Region-specific L. dispar growth rates, 1999–2015

We observed significant local spatial autocorrelation (i.e., the estimate of spatial autocorrelation as the lag distance approaches 0) in growth rates in all regions (Table 3), and spatial ranges that generally extended beyond 50 km (Metz 2017, Supplemental Information 1–3). We also observed spatial synchrony in growth rates in all regions (Fig. 1). In all regions, significant spatial synchrony (based on 95% confidence intervals) extended to at least 250 km, while synchrony in the northern region was measured to ~ 500 km. Statistical details of the GAM models testing the main and interaction effects on L. dispar growth rates (Eq. 7) for each region are presented in Table 4. The distance weighted mean growth rate, (\(\overline{\lambda }_{m}\)), consistently met retention criteria and was retained in all GAM models, suggesting that its inclusion reduced spatial autocorrelation in the GAM residuals (Walter et al. 2015). This was not surprising given the extent to which growth rates were spatially autocorrelated (Table 3, Supplemental Information 1–3).

Table 3 Estimates (± 95%) bootstrapped confidence intervals) of the local spatial autocorrelation and spatial range (km) in the year-to-year growth rates across the Northern, Midwestern, and Southern regions of the L. dispar invasion front
Fig. 1
figure 1

Estimates of spatial synchrony (Bjørnstad 2020) in L. dispar growth rates from the Northern, Midwestern, and Southern regions, 1999–2015

Table 4 Results of the GAM models predicting L. dispar moth growth rates (Eq. 7) in the Southern, Midwestern, and Northern regions (estimated degrees of freedom for each predictor variable is noted in parentheses). All main effects are shown, but only significant interaction terms are listed

Mean minimum temperatures in January and mean maximum temperatures in July were retained in GAM models for all regions (Table 4). Lymantria dispar growth rates in all regions were highest when mean minimum January temperatures were >  − 10 °C (Fig. 2). In overwintering L. dispar eggs, mortality generally begins at ~ –18 °C (Campbell 1973; Sullivan and Wallace 1972). Although mean January minimum temperatures were < –18 °C for only three years in the Northern region, we did observe reductions in growth rates when temperatures were < –18 °C (Fig. 2A). The optimal temperature for L. dispar larval development is ~ 28 °C (Casagrande et al. 1987; Logan et al. 1991). Although year-to-year growth rates, when summarized by region, are variable, the highest growth rates across all regions tended to be observed at mean maximum July temperatures between 27 and 30 °C (Fig. 2B).

Fig. 2
figure 2

Region-specific mean L. dispar growth rates by mean minimum January A and mean maximum July B temperature for each pair of years (e.g., 1999–2000, …. 2014–2015). Fitted lines are estimated using locally-weighted polynomial regression

Excluding temperature, no other main or interaction effects significantly predicted growth rates in the Midwestern region. Also, the basal area of primary or secondary host species were not significant predictors of L. dispar growth rates in any region (Table 4). However, primary host species cohesion as a main effect, and its interaction with secondary host species cohesion, significantly predicted growth rates in the Southern region. Secondary host species cohesion as a main effect, and its interaction with primary host species cohesion, significantly predicted growth rates in the Northern region (Table 4). In the Northern and Southern regions, the results underscore the importance of host plant cohesion, as opposed to basal area per se, in the L. dispar invasion process.

In Southern region, primary host plants generally have a high degree of cohesion. However, we did observe increases in L. dispar growth rates as primary host plant cohesion increased. Moreover, when considering the significant interaction between primary host species cohesion and secondary host species cohesion (Table 4), estimates of growth rates in the Southern region were highest when primary and secondary host cohesion were > 80 (Fig. 3). In the Northern region, there was more variation in both primary and secondary host plant cohesion. When considering the significant interaction between primary host species cohesion and secondary host species cohesion in the Northern region (Table 4), L. dispar growth rates were highest when primary and secondary host cohesion were > 40. However, we also observed high L. dispar growth rates in areas with low primary host plant cohesion when secondary host plant cohesion, which was significant as a main effect in this region (Table 4), was > 30 (Fig. 3). The predicted overall L. dispar growth rates (1999–2015) when using region-specific GAM models that incorporate significant predictors (Table 4) are shown in Fig. 4 along with the overall observed L. dispar growth rates.

Fig. 3
figure 3

Relationship between primary and secondary host species cohesion, and mean L. dispar growth rates in the Northern A and Southern B regions. Mean growth rates were averaged across all years (1999–2015). The size of the circles is proportional to the growth rate; the largest-sized circles represent growth rates ≥ 2 or ≤  − 2, and the smallest sized circles are 0. Grey circles are positive growth rates and open circles are negative growth rates

Fig. 4
figure 4

Observed A and predicted B spatially-explicit mean L. dispar growth rates, 1999–2015. Predicted growth rates are derived from region-specific GAM models that incorporate temperature, and when applicable to the region, primary and secondary host plant cohesion (Table 4). Observed and predicted growth rates from some states are shown even though they were not included in the analysis (i.e., Iowa, Kentucky, and Tennessee); data from these states were excluded due to limited spatial and temporal extent of L. dispar monitoring. Predictions in Kentucky and Tennessee were based on the Southern region GAM model, and predictions in Iowa were based on the Midwestern region GAM model

Discussion

Across the L. dispar expanding population front, the results reinforce the importance of temperature on L. dispar population growth. Lymantria dispar growth rates were maximized when mean July temperatures were between 27 and 30 °C. This is consistent with prior research in which exposure to supraoptimal temperatures has been found to affect larval development and survival, and spread rates (Thompson et al. 2017; Tobin et al. 2014a). This study also extends our understanding of the effect of cold temperatures on L. dispar population growth rates across a spatially-large and diverse landscape. The highest L. dispar growth rates were observed when mean minimum January temperatures were >  − 10 °C (Fig. 2), which is consistent with previous research on L. dispar egg mass mortality at cold temperatures (Sullivan and Wallace 1972; Summers 1922). There were only three years in which minimum January temperatures were < –18 °C, and all from the Northern region; however, it does demonstrate the constraining effect that overwintering temperatures can have on an invading species, even in a region where spread rates (Table 1) and growth rates (Table 2) are the highest along the invasion front. Given this constraint, warming winter temperatures in the Northern region would likely further increase L. dispar population growth rates and invasion speed. Indeed, recent studies have reported L. dispar invasion success in northern Minnesota (Streifel et al. 2019; Tobin et al. 2016), which was previously considered to be climatically unsuitable to L. dispar, in part due to historical suboptimal winter temperatures (Gray 2004).

The high degree of spatial autocorrelation in L. dispar growth rates (Table 3) is not surprising given that insect populations tend to be highly spatially autocorrelated. It also underscores the importance of considering spatial autocorrelation when assessing the factors that affect L. dispar growth rates, and when developing predictive models of invasion across a landscape. When using the GAM modeling framework that incorporated distance weighted mean growth rates, we determined that L. dispar growth rates in the Midwestern region were driven by temperature only (Table 4). This region contains a low density of L. dispar primary host plants and forest cover, with little variation in the density of each (Morin et al. 2005), which may have reduced the predictive capability of primary host density and cohesion on L. dispar growth rates. Instead, growth rates appear to be mostly affected by both overwintering temperatures and temperatures during larval feeding. Specifically, low temperatures in January below − 10 °C and high temperatures in July above the optimal temperature for immature development (28 °C) would decrease the growth rates in this region.

In contrast to the Midwestern region, the Southern region contains a high density of L. dispar primary host plants and a high density of forest cover, especially in the Appalachian Mountains (Morin et al. 2005). The fact that primary host basal area did not statistically predict L. dispar growth rates is perhaps due to large-scale availability of primary host plants throughout this region. Instead, primary host plant cohesion was a significant predictor of growth rates in this region, with growth rates increasing in areas where primary hosts are less fragmented (Fig. 3). There was also an interaction with secondary host plants in this region (Table 4), with higher predicted growth rates in areas that also have high secondary host plant cohesion (Fig. 3). Temperatures during January and July are also important predictors of L. dispar growth rates in the Southern region, and past research has reported that supraoptimal temperatures during larval development causes range retraction in portions of this region (Tobin et al. 2014a).

The Northern region contains areas of relatively high amounts of L. dispar primary host plants and forest cover, especially in its northern portions, and areas with very low amounts of L. dispar primary host plants and forest cover, especially in its southern portions (Morin et al. 2005). In this region, we detected an effect of secondary host plant cohesion, and its interaction with primary host plants (Table 4). Growth rates in this region were higher when the cohesion of both primary and secondary hosts increased (Fig. 3). However, in the absence of high primary host cohesion, L. dispar growth rates were ≥ 2 provided that secondary host cohesion was > 40. The presence of high secondary host cohesion could reflect the availability of L. dispar host plants that can sustain populations, even if these plants are not classified as preferred host species. It could also reflect higher forest cover and therefore, an increased potential for available host plants. Lastly, the classification of host plants as primary or secondary, according to Liebhold et al. (1995), is useful but is also a broad classification that does not consider variation within primary or secondary classifications. Some plants classified as primary hosts, such as Quercus spp., could be more suitable to L. dispar larvae than others, such as Betula and Populus spp., that are also classified as primary hosts. Similarly, there is variation in plants classified as secondary hosts, with some hosts (Acer spp.) more suitable than others (Carya spp.). The results from the Northern region, where primary hosts are in abundance in some areas and in paucity in others, suggest that secondary host plants can be important in maintaining L. dispar populations when primary hosts are scarce.

The spread of non-native insect species often occurs through stratified dispersal in which colonies are formed ahead of an expanding population front (Shigesada et al. 1995; Liebhold and Tobin 2008). For invading insect herbivores, the presence of suitable host plants is a necessity for successful population establishment and growth in newly-arriving colonies. Not surprisingly, host plant availability is a key ingredient in risk models to predict establishment (Venette 2015). Temperature also poses a barrier to invading insect species, which have species-specific tolerances to extreme hot and cold temperatures; consequently, climate suitability models are also used to predict invasive insect distributions (Venette 2017). Incorporating temperature constraints (Gray 2004; Logan et al. 2007; Pitt et al. 2007) and the distribution of primary host species (Morin et al. 2005) into L. dispar risk models is not new. However, we provide evidence that host plant cohesion, as opposed to host basal area, predicts L. dispar growth rates in new populations along its expanding population front. Including host plant cohesion as a factor in risk models, either in addition to or in place of host plant density, could improve the predictive capability of risk assessment models in other invading herbivorous insect species.