Introduction

Invasive species currently represent one of the biggest threats to the environment, and cause enormous economic losses (e.g., Ehrenfeld 2010; Pyšek et al. 2012; Simberloff et al. 2013). Hence, in recent years, researchers have investigated ways to prevent and reduce the impact that such species might cause (Buckley 2008). Prevention and rapid response have been widely recognized as the most successful management approaches to hamper non-indigenous species in a cost-effective way (Alvarez and Solis 2019). Prevention is usually preferable and represents a priority among management policies (Finnoff et al. 2007; Leung et al. 2002). However, it does not necessarily guarantee success (Vander Zanden et al. 2010), and rapid response remains a critical strategy for management (Wittenberg and Cock 2001), especially when the invader’s density is low and measures like eradication are more feasible and cost-effective (Simberloff et al. 2013; Westbrooks 2004). The importance of the appropriateness of intervention has prompted discussion on how resources should be spent (e.g., Leung et al. 2005; Vander Zanden et al. 2010). Prioritizing instances where management would be most needed is essential to direct limited resources (Lohr et al. 2017). Particularly, promptly identifying species and locations of highest concern would allow stakeholders to respond to new introductions and detections more quickly and effectively (e.g., Vander Zanden and Olden 2008). Yet, it remains a major challenge in invasion ecology (Stewart-Koster et al. 2015).

In this study, we provide insight into rapid response for eradication. We focused on the establishment phase, i.e. the process by which a non-indigenous species in a novel location founds a self-sustaining population, with individuals surviving and successfully reproducing (Lockwood et al. 2013). However, even populations that temporarily survive and reproduce can subsequently die off without anthropic intervention (Blackburn et al. 2011). We thus considered that distinguishing between an initial sub-stage wherein a species is found in a novel location temporarily (henceforth termed casual establishment) and the subsequent step wherein the species exists as an enduring self-sustaining population over time (henceforth termed persistent establishment) would be of practical importance for management. In fact, simple detections of a species in the wild are often treated as establishments or a proxy for it, but they arguably relate more closely to casual establishment, overestimating those species which persist to potentially cause harm. Therefore, when non-indigenous organisms are detected in the wild, distinguishing species that would likely go extinct without human mediation from those that pose a real threat could help determine instances where eradication effort is needed, and provide critical information for prioritizing rapid response resources.

While mathematical models provide rigorous, quantitative means to prioritize management (e.g., Chadès et al. 2011; Kerr et al. 2016), decisions in this field often do not make use of this quantitative evidence, relying instead on expert opinion, and less technical scoring-based approaches (Cook et al. 2010; Leung et al. 2012). Arguably, science should facilitate the usability of technical knowledge for stakeholders (Cassey et al. 2018a), for instance by converting model parameters into rules of thumb. Ideally, such rules of thumb should be easy to apply and should provide useful insights. Similar concepts are often used in medicine to express changes in the odds of an outcome following exposure to a certain factor (e.g., the risk of low weight at birth is 4 times higher in neonates from women exposed to tobacco; Mumbare et al. 2012). Simultaneously, they should still be based on a solid statistical foundation that derives reliable predictions from empirical data. In this manuscript, we also express our models as rules of thumb, in addition to parameter values.

Finally, we consider the fundamental insights from studying the two phases of casual and persistent establishment separately. Currently, few studies have looked into identifying the factors associated with failures at different stages of invasions (Dawson et al. 2009; Marchetti et al. 2004). Despite establishment being the best studied phase in biological invasions (Leung et al. 2012), very few studies specifically concentrated and used information on temporary establishments to more finely characterize this stage (e.g., Essl et al. 2015; Ficetola et al. 2009; Milbau and Stout 2008). For example, Duggan et al. (2006) examined the importance of propagule pressure and other introduction characteristics (e.g., maximum size) on the probability of freshwater fish species from the aquarium trade being introduced (i.e. detected) in the wild and successfully establishing. Here, we analyzed casual and persistent establishment, simultaneously considering propagule pressure, environmental suitability, and species traits (i.e. the main predictors of establishment; Leung et al. 2012), to assess their relative contributions during each sub-stage. Specifically, despite having been widely recognized as a strong determinant of invasion success (Cassey et al. 2018b; Lockwood et al. 2013), it is unclear whether propagule pressure primarily contributes to casual establishment (e.g., Leung et al. 2012) or it remains important for persistence (e.g., Ficetola et al. 2009). Similarly, environmental conditions, and more importantly climate matching, were shown to be consistently relevant factors influencing establishment across taxa (e.g., Duncan et al. 2014; Hayes and Barry 2008; Mahoney et al. 2015). Nonetheless, local conditions might have a stronger effect on the early survival of the propagules reaching a new location (casual establishment; Essl et al. 2015) or could affect long-term population persistence through vulnerable phases like reproduction (Ficetola et al. 2009). Analogously, studies focusing on the role of species traits found them to be important predictors of establishment (Pyšek et al. 2009), either allowing individuals to cope with novel conditions (Blackburn et al. 2009), or having a stronger effect on persistence, for example favouring future reproduction in spite of early population growth (Sol et al. 2012).

In brief, we focused on fish species introduced through the aquarium trade pathway as our study system to address three-fold objectives: (1) to develop a model to help prioritize rapid response, by separating the sub-stages of establishment (casual versus persistent); (2) to provide “rules of thumb”, wherein model parameters are converted into a series of simple multiplicative risk factors; (3) to increase fundamental understanding of the invasion process, by elucidating the relative contributions of propagule pressure, environment, and species traits for each of the two sub-stages of establishment.

Methods

In this study, we focused on aquarium freshwater fish species traded in the United States for which import data were available. The aquarium trade is responsible for importing thousands of individual fish annually and is a significant source of non-indigenous species (Howeth et al. 2016; Rixon et al. 2005). To separate the predictors associated with the early establishment and persistence, we examined spatially referenced records of non-indigenous freshwater fish species in the USA over the past 50 years (United States Geological Survey 2017). We selected all the detection records classified as “aquarium release” and compiled them by state. The USGS categorized each record by status, based on reproduction, persistence and eradication (Table 1). We grouped the species detected in the wild into casually established species (CS) and persistently established species (PS). The CS included all the species by state that were detected at some point in time after 1971 regardless of their present status (59 species and 151 species-location combinations). We chose 1971 as a threshold since previous work from Chapman et al. (1997) and Bradie et al. (2013) had shown that species’ popularity remained relatively consistent in the aquarium fish trade after that year. Two occurrences (Devario malabaricus, NV, and Serrasalmus rhombeus, FL) which were classified as “Eradicated” (Table 1) might have persisted in the absence of human intervention. We found that results were robust to their inclusion and kept these species in the CS group. Since a species that casually establishes can either go extinct or persist, CS included both casual and persistent species. In contrast, the PS group included only those species which avoided extirpation, i.e. species that successfully reproduced and overwintered, and for which multiple life stages were identified in the wild (21 species and 28 species-location combinations, identified as “Established”; Table 1). All other aquarium fish species imported into the USA that have never been detected in the wild were considered unestablished (neither casually nor persistently). The complete list of occurrences for both CS and PS can be found in Appendix S1 (Table S1.1 and S1.2).

Table 1 Aquarium species status categories, as defined by the USGS Non-indigenous Aquatic Species website and as categorized in this study (CS/PS)

We used the PET modeling framework (Della Venezia et al. 2018) to account for species traits, environment, propagule pressure and density-dependent effects simultaneously. The model defined the probability of a species establishing as:

$$P\left( E \right) = 1 - \left( {1 - p} \right)^{{N^{c} }}$$
(1)

where p was the probability of a single propagule establishing, N was the number of propagules introduced, and c was a shape parameter allowing density-dependent effects. p was modeled as a logistic function of species-specific and location-specific predictors:

$$p_{sl} = \frac{1}{{1 + e^{{ - z_{sl} }} }}$$
(2)

where zsl was defined as:

$$z_{sl} = b_{0} + \mathop \sum \limits_{w = 1}^{W} \left( {b_{w1} X_{ws} + b_{w2} X_{ws}^{2} } \right) + \mathop \sum \limits_{m = 1}^{M} \left( {b_{m1} E_{ml} + b_{m2} E_{ml}^{2} } \right) + \mathop \sum \limits_{w = 1}^{W} \mathop \sum \limits_{m = 1}^{M} (b_{mw} X_{ws} E_{ml} )$$
(3)

Each Xws was a trait of species s, for a total of W traits, while each Eml was an environmental condition for location l, for M environmental variables. Both first and second order terms for each of these predictors were included to allow a non-monotonic relation with establishment probability. b denoted the coefficients of the model, for traits (w) and environmental conditions (m), and traits-environment interactions, each described by parameter value bmw, common across all species/location combinations.

We used the PET framework in two different ways. Firstly, we identified the factors that predicted casual establishment (i.e. CS versus unestablished species that were never detected in the wild). We referred to this as the “casual establishment model”. Secondly, to create rules of thumb for rapid response, we identified the factors that predicted PS versus extirpated species (the fraction of CS that later went extinct). We referred to this as the “persistence model”.

Variable choice

The PET establishment model incorporated propagule pressure (P), environment (E) and species traits (T) as predictors (Appendix S2). Building from Della Venezia et al. (2018), species traits originated from the online database FishBase (Froese and Pauly 2018) and were minimum and maximum temperature tolerance, northernmost latitude, trophic level and maximum length. Additional traits were removed to avoid multicollinearity, while missing data were imputed using the methodology described in Della Venezia et al. (2018). The environmental variables were obtained from the Bioclim database (www.worldclim.com; Hijmans et al. 2005). Given that the establishment data were at the state level, we estimated mean (x̄) and variance (s2) of each variable for each state. The final set, after removing the highly correlated ones, included variability of the diurnal range (BIO2 2s ), mean and variability of the minimum temperature of the coldest month (BIO6 and BIO6 2s ), mean temperature of the warmest quarter (BIO10) and mean precipitation of the wettest month (BIO13). All variables were standardized before fitting the models.

Since fish releases from aquarists were virtually impossible to track, as a proxy for propagule pressure we used Canadian import data from Fisheries and Ocean Canada (B. Cudmore and N. Mandrak, unpublished data). Bradie et al. (2013) showed that US aquarium fish imports could accurately be obtained by scaling Canadian imports by population size. The same approach was used to derive geographically explicit estimates of propagule pressure based on population by state. Population size data were available from the United States Census Bureau (https://www.census.gov/en.html).

Model fitting

Our models were fit using maximum likelihood estimation, which allowed finding the best fitting parameters given the observed data. The log-likelihood function (Della Venezia et al. 2018) was defined as:

$$\log \left( L \right) = \mathop \sum \limits_{l = 1}^{L} \left( {\mathop \sum \limits_{{i_{l} = 1}}^{{I_{l} }} \log \left( {1 - (1 - p_{il} )^{{N_{il}^{c} }} } \right) + \mathop \sum \limits_{{u_{l} = 1}}^{{U_{l} }} { \log }(1 - p_{ul} )^{{N_{ul}^{c} }} } \right)$$
(4)

i represented a successful species, i.e. CS or PS depending on the model, and u was a species which failed to establish either temporarily or persistently, for each location l. The sum was iterated over all L states.

We applied a forward selection procedure to identify the most predictive variables (Johnson and Omland 2004). Although issues associated with stepwise variable selection are known (e.g., inflated type I errors) and more robust methods of model selection exist, we opted for a forward selection approach, as other approaches which began with all terms simultaneously in the model resulted in problems of complete separation (Albert and Anderson 1984), given the data available. We selected the Akaike Information Criterion (AIC; Akaike 1974) as an efficient metric to select predictive models (Aho et al. 2017), and each variable was retained in the model when its inclusion decreased AIC by at least 2 units (Anderson and Burnham 2002). Model performance was estimated using the area under the curve (AUC) of the receiver operating characteristic (ROC; Hanley and McNeil 1982).

Finally, to improve our fundamental understanding of the establishment process, we characterized the importance of the three categories of predictors in analysis by comparing the full PET framework to each submodel (i.e. excluding propagule pressure, environment or species traits). We ranked them based on their AIC and the percentage of deviance explained, to evaluate the relative importance of species traits, environment and propagule pressure during each sub-stage of establishment.

Multiplicative risk factors

Once the best model for each sub-stage was identified, we derived “rules of thumb” to easily compare species and prioritize resources. We determined the effect of each important predictor on the likelihood of a species successfully establishing (casually or persistently) versus failing. To do so, we converted the parameters of the fitted models (Eqs. 2, 3) and expressed each factor as a multiplier, increasing or decreasing risk of establishment. This was accomplished by calculating the odds ratios (OR), i.e. the ratio between the odds (probability of successfully establishing versus failing) for varying values of each significant predictor and the odds at a reference value (Appendix S3):

$$OR_{k} = \frac{{Odds_{k} }}{{Odds_{{ref_{k} }} }}$$
(5)

For each significant predictor k, its mean value across species or locations was chosen as reference. Odds ratios are the simplest way of interpreting the results of a logistic model and have been used extensively in epidemiology and medicine (e.g., Bland and Altman 2000; Cummings 2009). Each ORk represented a multiplicative risk factor, i.e. a measure of the relative change in risk of establishing versus failing, relative to the odds of average species and locations. Based on logistic regression, odds ratios had the advantage of being estimated independently for each variable, so that the relative contribution of each predictor could be assessed separately. Also, each ORk mathematically corresponded to a multiplier, so that the cumulative effect of all predictors (ORc) was the product across all ORk.

$$OR_{C} = \mathop \prod \limits_{k = 1}^{K} OR_{k}$$
(6)

For instance, if a species s had a value for trait k = 1 corresponding to ΔOR1 = 2 (Fig. 1a), the likelihood of successfully persisting versus failing for species s would be twice as high as that of a species with a mean value for trait 1. If species s also had OR2 = 1.25 for trait k = 2 (Fig. 1b), then the overall ORc = 2 × 1.25, and, altogether, species s would be 2.5 times more likely to establish versus not establishing than the average species. Odds ratios could be used to compare different species as well, where the ratio ORs=1/ORs=2 represents the relative odds of success of species 1 over species 2.

Fig. 1
figure 1

Illustrative example of rules of thumb, i.e. odds ratios (OR), derived for two species traits. Each dark dot represents the reference point, i.e. the mean trait value across species in the dataset, while each triangular dot corresponds to the species s of interest

Based on logistic regression, probabilities can be obtained from the odds (\(p = \frac{Odds}{1 + Odds}\)). Knowing ORC and the Oddsref when all predictors were set to the reference value, actual probabilities for each species/location combination would correspond to:

$$p = \frac{{Odds_{ref} OR_{C} }}{{1 + Odds_{ref} OR_{C} }}$$
(7)

All data manipulations, model fitting and analyses were conducted in the R statistical programming environment (R Core Team 2018).

Results

Five factors distinguished species likely to become naturally extirpated from those that persisted (AUC = 0.946, ≈ 52% deviance explained; Table 2). The minimum temperature of the coldest month (BIO6 2s ) was among the strongest predictors of persistence, likely determining whether the species could overwinter (Table 3). In particular, its variability (BIO6 2s ) was the best determinant for persistence (Table 3), presumably because strong fluctuations could drive species to extirpation, even those that were able to initially survive the low minima. Persistence also appeared to be favoured at quite high mean temperatures of the warmest quarter (BIO10) and for intermediate species with high maximum temperature tolerances (Table 3). Finally, the probability of persistence appeared to change with interacting minimum temperature of the coldest month and maximum length, with big species being favored in more stable cold environments (Table 3).

Table 2 AIC, AIC difference from the best model (ΔAIC), AUC, fitted \(\hat{c}\) parameter and percentage of deviance explained (%dev.exp) by the full casual establishment and persistence model, and their respective submodels
Table 3 Parameter values for the predictors retained in each sub-stage best model

Multiplicative risk factors

Using the parameters from the fitted models (Table 3), we calculated the odds for varying values of the predictors (Appendix S3), and we compared them against the mean across locations and species as our reference value (Oddsref = 0.002184, corresponding to pref = 0.002180) to obtain our rules of thumb (Fig. 2; Table 4). Some predictors were substantially more important. For example, maximum temperature tolerance appeared more influential than maximum length for persistence, as shown by the range of corresponding ORs (Fig. 2a, b). Temperature tolerance sharply increased persistence, with species with tolerances higher than 35 °C being at least 50 times more likely to successfully persist than failing (Fig. 2a). On the other hand, persistent establishment appeared likely for a relatively narrow range of lengths, with species about 60 cm in length being 4 times more likely to persist than to fail compared to the mean, and very big species being extremely unlikely to cause concern in the long term (Fig. 2b). Similarly, the effect of minimum temperature variability (BIO6 2s ; Fig. 2d) appeared substantially stronger than the mean warmest temperature (BIO10; Fig. 2c), increasing the odds up to ≈ 200 times with respect to average conditions.

Fig. 2
figure 2

Effect of each significant predictor on the likelihood of persistently establishing versus failing, expressed as odds ratio (OR), when gradually varying each predictor. OR equals 1 (dashed line) at each variable’s mean value, reported by the corresponding point. The mean values for the interaction plot (e) correspond to those of the respective main terms (b, d). The triangles indicate the OR of P. managuensis (traits) and Hawaii (environmental conditions), and their interaction (e). Very high values in (e) coincide with areas of extremely low absolute probability values, so that little probability increases determine very high OR

Table 4 Selected multiplicative risk factors to quickly quantify risk of casual and persistent establishment

Odds ratios allowed us to combine the contribution of different predictors by simple multiplication, to obtain their overall expected effect on persistence risk. To illustrate, the species expected to have the highest likelihood of persistent establishment in the USA was the jaguar guapote (Parachromis managuensis; for a list of traits, see Appendix S1). The estimate of OR was ≈ 150 for maximum temperature tolerance, and ≈ 4 for size, respectively (Fig. 2a, b), making this species about (150 × 4 =) 600 times more likely to establish than the average fish in our persistence dataset. Similarly, based on local environmental conditions, the state where persistent establishment was more likely to occur was Hawaii, with OR of about 1 and 55, for mean temperature of the warmest quarter and minimum temperature variance, respectively (Fig. 2c, d), making the insular state ≈ 55 times more suitable to persistent species than the average state. For the jaguar guapote, the calculated overall likelihood of successfully persisting versus failing in Hawaii was more than 60 thousand times higher than the average species/location, including the contribution of the interaction terms (OR interaction corresponding to ≈ 2, Fig. 2e; overall OR = 150 × 4 × 1 × 55 × 2). Thus, while the mean probability of persistence was low across all species and locations (pref = 0.002180), for the jaguar guapote in Hawaii it was 0.9964. Similar estimates can be easily obtained and compared for any species/location combination, using the parameter values from each model, and the information provided in Appendix S1 and Appendix S3.

Re-evaluating “risky” species in terms of persistence

Additionally, we looked at species within our dataset which had been flagged as potentially invasive in the literature. As a case example, we focused on the Great Lakes and looked at species previously identified as potentially invasive in this area, including the European weatherfish (Misgurnus fossilis), the spined loach (Cobitis taenia), the white cloud mountain minnow (Tanichthys albonubes), the clown loach (Chromobotia macracanthus), the silver arowana (Osteoglossum bicirrhosum) and the glass catfish (Kryptopterus bicirrhis). We found that none of the species listed appeared to be likely to persist in the Great Lakes region, although some of them would be troubling in other states, being tens to hundreds of times more likely to succeed than on average. These were the glass catfish in Hawaii, and the white cloud mountain minnow and the clown loach in both Florida and Hawaii (Appendix S4). By comparison, the silver arowana was never able to persist, nor was it predicted to pose a substantial threat across the USA, despite having been casually established in 9 states.

On the other hand, among the species/location combinations for which casual establishment had already occurred, our model predictions for persistence suggested different species of highest concern. These included the red-bellied piranha (Pygocentrus nattereri), the lowland cichlid (Herichthys carpintis), the Rio pearlfish (Nematolebias whitei), the banded leporinus (Leporinus fasciatus), and the climbing perch (Anabas testudineus), particularly in California, Florida, Hawaii, New Mexico and Texas. All had a likelihood of successfully persisting versus failing tens to hundreds of times higher than average species/locations. If these species were detected again in their casual occurrence sites, they should be prioritized for rapid response.

Comparing establishment sub-stages: casual versus persistent

The combination of propagule pressure, environment, and traits (i.e. PET) performed better than any of the submodels for casual establishment, both in goodness of fit (AIC) and prediction accuracy (AUCPET = 0.957, ≈ 36% deviance explained; Table 2; Appendix S5). In contrast, for persistence, the best model included only species traits and environmental conditions (AUCET = 0.946, ≈ 52% deviance explained; Table 2). Particularly, the large difference in AIC between each best model and its alternatives (ΔAIC > 22 for the casual establishment model; ΔAIC > 145 for the persistence model; Table 2) suggested that our results should be robust to specific approaches for model selection (i.e., our use of forward selection).

For casual establishment, species traits were more explanatory than the environment (either with or without propagule pressure; Table 2), and indeed all traits examined were important (Table 3; Appendix S5). More specifically, casual establishment risk was inversely related to trophic level, seemingly favouring herbivorous species. Casual establishment risk also increased with latitude for species whose northernmost distribution limit ranged up to ≈ 23°N, and then decreased at higher latitudes (Table 3; Appendix S5). Some traits were also predictive of persistent species, with maximum length and, to a lesser extent, maximum temperature tolerance being important for both sub-stages. Interestingly, risk was unimodally related to maximum length, with intermediate to big species having an advantage for casual establishment, and relatively smaller ones being favored for persistence. Optimal physiological ranges of temperature were significant both for casual and persistent establishment, with minimum temperature tolerance losing relevance at the latter sub-stage (Table 3; Appendix S5).

In contrast, environmental conditions appeared to be more relevant for persistence, explaining alone ≈ 35% of deviance (Table 2). While precipitations and mean minimum temperatures were determinants of casual establishment (Table 2; Appendix S5), persistence was predicted by mean temperatures of the warmest quarter and variability in minimum temperature of the coldest month (Table 2). Nonetheless, the submodel combining species traits and environment was the best in distinguishing persistent species from those that were subsequently extirpated, while propagule pressure was generally not predictive of persistence for aquarium fish (Table 2). Propagule pressure appeared to favour casual establishment and its inclusion in the model added considerably to the final percentage of explained deviance (Table 2), with values of propagule pressure higher than the median increasing risk (Appendix S2). However, propagule pressure was not important for persistence of aquarium fish species (Table 2; Appendix S2). This was further corroborated by the fitted \(\hat{c}\) value being consistently very close to zero for persistence models (at zero, propagule pressure would have no effect; Table 2).

Discussion

Strides have been made in the field of invasion biology, aiming at identifying priorities for management and guiding strategies for prevention and early response. However, prioritization remains a challenging task, and geographically explicit, multispecies risk assessment frameworks could help make the process more efficient. Here, we focused on establishment and rapid response. Among the casual fish species considered in this study, more than 80% later went extinct without human intervention, and would have resulted in unnecessary allocation of valuable resources, if funds had been spent on their eradication. Instead, by separating establishment into two sub-stages and pinpointing the factors associated with successful casual and persistent establishment, we have provided analyses to target species likely to persist, after they have been detected.

Although we recognize that non-indigenous species could generate local impact even during casual establishment, being able to prioritize species that pose a lasting threat and to redirect (often scarce) resources toward instances where their investment would be necessary, would help maximize the efficacy of management strategies (Jenkins 2013; Keller and Perrings 2011). When detections take place early, the five predictors identified and the multiplicative risk factors derived from the persistence model would allow a quick simultaneous assessment of multiple species and locations. In an exemplificative case, the pirapitinga (Piaractus brachypomus) has managed to casually establish in as many as 44 states, but it has never persisted due to unfavorable local conditions. However, its likelihood of persistence is non-negligible in some states like Florida and Nevada, where measures should be taken if detected. Further, we have listed Pygocentrus nattereri, Herichthys carpintis, Nematolebias whitei, Leporinus fasciatus and Anabas testudineus as the most likely to persist among species that have already casually established. These species are ranked as highest concern and they should be eradicated if detected again, especially in Texas, New Mexico, California, Florida or Hawaii. Instead, when we looked at species that had already been classified as potential threats in the Great Lakes area (e.g., Howeth et al. 2016; Kolar and Lodge 2002; Rixon et al. 2005), our results suggested that even if they were detected, they would have a high likelihood of extirpation without intervention. Although casual establishment might occur for some of these species, persistence was predicted to be very unlikely, and the states bordering the Great Lakes should prioritize the investment of management resources on different pathways of introduction of potentially invasive species or alternative environmental concerns.

On the other hand, species traits and local environmental conditions that are advantageous in the earliest phases of establishment might also represent useful filtering criteria to drive restrictions in the aquarium market and to define targets to invest resources for early detection (Mehta et al. 2007). Although we focused primarily on informing rapid response, preventing casual establishment could be necessary in specific cases, e.g., for species that can cause substantial temporary impact, or for which eradication would hardly be feasible (e.g., Dogliotti et al. 2018; Simberloff 2003). In such cases, a reduction in the number of commercialized individuals should be considered, as it might be sufficient to make establishment risk virtually inexistent. However, targeting propagule pressure after detection would not reduce persistence risk for freshwater aquarium fish, based on our results. Overall, our model and the associated multiplicative risk factors provide quantitative support to decision making that would help target which species might pose a risk and in which locations, and thus where rapid response strategies should be developed. The specific strategies for rapid response (eradication, containment, etc.; Britton et al. 2011), and their feasibilities (i.e., risk management) have not been explicitly examined here as they were beyond the scope of this manuscript, and would require a separate set of models and analyses (e.g., Peterson et al. 2008).

Separating establishment into sub-stages also provided additional fundamental knowledge about this phase of invasions. Generally, our results reflect the importance of distinct predictors during separate phases of the invasion process (e.g., Dawson et al. 2009; Essl et al. 2015; Milbau and Stout 2008). For instance, propagule pressure has been recognized as the most consistent predictor of establishment success across taxa (Cassey et al. 2018b; Lockwood et al. 2013). Looking at sub-stages, our results suggested that propagule pressure was very important for casual establishment, but not for persistence, confirming previous observations on freshwater fish (Marchetti et al. 2004). However, studies on vascular plants had found that continual propagules contribution could enhance establishment success at later stages (e.g., Essl et al. 2015), suggesting that establishment dynamics vary across taxa.

In contrast to propagule pressure, both species-specific and location-specific characteristics played an important role during both sub-stages of establishment. However, species traits were more important for casual establishment, while location-specific variables were more important for persistence. As expected, the relevant predictors differed between stages. For example, trophic level was retained as a predictor of casual establishment, but did not have an effect on persistence. Even for traits that were relevant for both stages, their relationship with risk changed. For example, very large species were generally disadvantaged across stages, in agreement with findings in the literature (Ribeiro et al. 2008; Ruesink 2005). While mid-range species were favored for early establishment, potentially due to an initial survival advantage, only relatively small ones appeared to find a suitable environment for persistence. Such species were often detected in the wild across northern states (e.g., Pygocentrus nattereri; Appendix S1), but coming from tropical or subtropical regions, they could only overwinter in the mild climate of the southernmost USA (Bennett et al. 1997).

The environment, on the other hand, seemed to play a greater role for persistence, similarly to what has been observed for bryophytes (Essl et al. 2015). Our results supported observations from previous studies about the importance of climatic variables at different stages of invasion in fish (Bomford et al. 2010; Howeth et al. 2016) and other vertebrates (Duncan et al. 2001; Forsyth et al. 2004; Mahoney et al. 2015). Expectedly, the minimum temperature of the coldest month was one of the strongest environmental predictors of casual establishment and persistence, both as mean and variability. After overcoming low temperatures during the earlier phase, aquarium species seemed to favor locations that are relatively steady in winter, in accordance with previous studies (Bradie and Leung 2017; Drake and Lodge 2004).

The approach used here can be applied to other suits of organisms across different pathways of introduction, to derive geographically explicit, pathway-specific risk factors. Clearly, when more information is available, the methodology can be extended to accommodate alternative predictors among species traits and environment, e.g., invasion history or native range size (Kolar and Lodge 2002; Peoples and Midway 2018). Analogously, all available information on both successful and failed establishments should be incorporated as it becomes available, while data on escapes and intentional releases of non-indigenous individuals in the wild would represent ideal propagule pressure estimates and help exclude species that were never released. Ideally, density data could be integrated in the analysis and related to casual and persistent establishment, as well as impact. The scale of our study and the types of data available also prevented us from obtaining more finely resolved predictions to inform management with higher geographical resolution. Although data might be particularly limiting for certain pathways, information-rich datasets exist (e.g., trade in birds, reptiles and amphibians; Abellán et al. 2016; Herrel and van der Meijden 2014; Robinson et al. 2015), and advancements are being made to improve pathway-level knowledge compiled from diverse sources (e.g. Saul et al. 2017). Despite these limitations, in the context of invasions by aquarium fish, this work suggests a small number of predictors can differentiate species and locations likely to establish and persist. Moreover, our model suggests different species of most concern, from the perspective of persistence, and thus different targets of rapid response, once detection of a species in the wild has occurred.