Introduction

Biological invasions are a major driver of species loss, contributing to up to 54 % of extinctions with documented causes (Clavero and García-Berthou 2005). The ecological damage often attributed to biological invasions, and the need to prevent and manage invasions, has motivated studies seeking to characterise the invasion process and identify factors which favour biological invasions (Lodge et al. 2006). While anthropogenic activity resulting in elevated alien propagule pressure (Colautti et al. 2006) or habitat disturbance (Davis et al. 2000) is often identified as a driver of biological invasions, some habitats are inherently more susceptible to the successful establishment of alien species than others (Johnson et al. 2008). The lack of biotic resistance, characterised as resistance to introduction of alien species resulting from interactions with native biota, is among widely discussed hypotheses explaining this observation (Moyle and Light 1996). The resistance of native biota against aliens is thought to be pertinent in the initial introduction, as well as at the spread stage of an invasion process (Lodge et al. 2006) through several forms of interaction, including competition (Levine et al. 2004; Dzialowski et al. 2006).

Being an important determinant of competition, trophic interactions and their components are commonly studied in relation to biological invasions (Marks et al. 2011; Cucherousset et al. 2012) in addition to a wide range of human disturbances such as species loss (Dunne et al. 2002a), habitat degradation (Henley et al. 2000), and eutrophication (Hadwen and Bunn 2005). Clear associations between trophic structure and the susceptibility of a habitat to biological invasions are yet to be conclusively determined, with current studies suggesting that invasibility of a habitat in relation to pre-existing communities are often scale dependent (Byers and Noonburg 2003). However, on a small spatial scale, there is some evidence to suggest that the trophic ecology of native biota can influence the success or failure of a biological invasion (Lake and Dowd 1991; Kennedy et al. 2002; Dzialowski et al. 2006).

The complexity of food webs and community level trophic interactions are among the major stumbling blocks in elucidating their association with alien species establishment. Consequently, studies testing the biotic resistance hypothesis tend to be theoretical (Tilman 2004; Romanuk et al. 2009; Baiser et al. 2010), while empirical studies have focused primarily on plant systems (Tilman 1997; Byers and Noonburg 2003). This bias is often compounded by the paucity of substantial field data pertaining to the community of interest, as exemplified by lentic habitats in Southeast Asia where little of the ecology is known despite an extensively studied diversity (Johnson 1973; Kottelat et al. 1993; Rainboth 1996; Rintelen et al. 2007).

To address this gap in knowledge of biological invasions, we applied a novel approach which couples food web modelling with statistical analyses of an extensive data set consisting of 370 fish species from 26 lakes from the Southeast Asian region where empirical food webs are largely unknown. As models of trophic interactions are widely applied (Pace et al. 1999; Williams and Martinez 2000) (see Rossberg et al. (2006) for limitations) and can be useful when community data from the field are lacking, we constructed literature-derived food web models based on algorithms dictated by organism size and diet as a basis for investigating the association between community and species level trophic traits with the successful establishment of alien fish taxa. The latter was done primarily by adopting the information-theoretic approach which is a comparatively robust and informative method when testing ecological questions (Lukacs et al. 2007; Sodhi et al. 2008). Using a combination of these techniques, we attempted to answer the following research questions: (1) is the presence of alien fish taxa associated with community level food web traits; and (2) is there a trend in species level trophic traits of the alien fish found in the habitats we studied?

Materials and methods

Data collection

We examined a total of 26 lentic habitats—20 artificial (i.e., man-made via excavations or river impoundments) and six natural—in the Southeast Asian region (Fig. 1) (Appendix A in ESM). Species lists of fishes of lentic habitats were compiled from various sources for Cambodia (Lim et al. 1999), Indonesia (Thomas 2005), Malaysia (Zakaria-Ismail and Lim 1995; Ali 1996; Mohd et al. 1996; Sim 2002), the Philippines (Aquino et al. 2011; Papa and Mamaril 2011), Singapore (Ng and Tan 2010), and Thailand (Chookajorn et al. 1999; Pholprasith and Sirimongkonthaworn 1999). Species checklists were verified against records on FishBase (Froese and Pauly 2015) and Catalog of Fishes (Eschmeyer 2013) to account for possible misidentifications. Uncertain records were excluded from subsequent analyses, after which, 970 entries of 370 fish species from 71 families remained (Appendix C in ESM).

Fig. 1
figure 1

Map of lakes we analysed on our study. Singapore, marked in a black box on the main map, is represented in the inset

From the collated species list, ecological traits, i.e., dietary habits, maximum recorded size (l max ) and status (native or alien) of the fishes were determined from several published sources (Appendix C in ESM) (Kottelat et al. 1993; Rainboth 1996). When dietary data were unavailable, dietary ecology was inferred from congeners of similar size. Owing to the incongruence of resolution in literature derived dietary data, we categorised food items under the following general groups: detritus, periphyton, plant matter, phytoplankton, zooplankton (microinvertebrates), macroinvertebrates and fishes. Fish taxa were assigned to the following feeding guilds as a function of their dietary components: benthic herbivore (i.e., species feeding on periphyton), detritivore (i.e., species feeding on detritus), herbivore (i.e., species feeding on various combinations of plant matter, periphyton and phytoplankton), macroinvertivore (i.e., species feeding on macroinvertebrates and zooplankton), omnivore/generalist (i.e., species feeding on a combination of various plant and animal food sources), pelagic planktivore (i.e., species feeding on zooplankton, or phytoplankton, or both), generalist predator (i.e., species feeding on combinations of zooplankton, macroinvertebrates and fishes), and piscivore (i.e., species feeding on fish). Trophic levels indicated on FishBase (referred to as FishBase trophic level) (Froese and Pauly 2015) were also recorded to complement dietary data as a general indicator of trophic niche.

Construction of food webs

We constructed hypothetical cumulative food webs (Schoenly and Cohen 1991) of each habitat reviewed in the form of predation matrices (described in Appendix D in ESM). As our study focusses on fish community, and considering the absence of data pertaining to non-fish taxa, our hypothetical food webs consisted primarily of fish species present in the lentic habitats. Other taxa/components included in the food webs were: detritus (i.e., dead organic matter of plant or animal origin), plant matter (i.e., all non-algal producers), phytoplankton (i.e., pelagic producers consisting of all species of phytoplankton), periphyton (i.e., benthic or encrusting producers), zooplankton (i.e., all species of zooplankton), and macroinvertebrates (e.g., decapod crustaceans and aquatic insect larvae). Trophic interactions in the matrices were a function of dietary data based on literature. For all piscivores, a size-constrained model in which a predator–prey size ratio of 4:1 was assumed (Scharf et al. 2000; Turesson et al. 2002). All species in the data set were ascribed with their common length (l com ), the most numerically abundant size class in a population, and this was calculated with the following formula; l com  = 0.65 × Maximum length (Pauly 2000). We analysed the resultant hypothetical food web using the foodweb package in the R environment (Perdomo et al. 2012) to generate associated food web network topology and other descriptors, namely; trophic species richness (w rich ), connectance (w c ), total links (w tl ), link density (w d ), index of omnivory (w o ), proportion of herbivores (w h ), proportion of basal taxa i.e., primary producers (w b ), proportion of intermediate taxa, i.e., primary and secondary consumers (w i ), proportion of top taxa i.e., apex predators (w t ), ratio of prey to predator (w pp ) and number of trophic levels (w l ) (Appendix B in ESM). Using the same package, we also calculated species trophic levels (s l ) from the hypothetical webs by averaging trophic levels of prey-items, where basal food sources (i.e., primary producers) were designated s l  = 1 (Williams and Martinez 2004). Unlike FishBase trophic level (s f ), calculated trophic level of a single species (s l ) can differ between lakes, as a function of the local food web structure, thus offering a more accurate, site-specific reflection of the trophic ecology of a species.

Approximations of propagule pressure and lake sizes

The likelihood of biological invasions is commonly attributed to propagule pressure (Colautti et al. 2006; Simberloff 2009) although data quantifying propagule pressure are often lacking and this necessitates the use of proxy measurements (Rouget and Richardson 2003). Similarly, we were not able to obtain empirical data measuring propagule pressure of all alien fish taxa present at all the habitats surveyed, thus we attempted to statistically control for propagule pressure by estimating the accessibility of the lakes following the (research supported) assumption that propagule pressure is positively correlated with human activity (Marchetti et al. 2004a; Bossenbroek et al. 2007). We approximated accessibility of the lakes by measuring the following two variables: (1) average travel time to population centre of 50,000 people or more (Nelson 2009); and (2) proportion of tree cover in buffer area consisting of riparian zones up to 1 km away from shore. We computed (1) by adopting the GIS raster layer from Nelson (2009), following which, time taken to travel to population centres of 50,000 was averaged across all 1 arc-minute grids occupied by the lakes, respectively. Variable (2) was computed from Global Forest Change (Hansen et al. 2013) tree cover GIS raster layer where tree cover data was first simplified by recoding the original continuous format (i.e., 0–100 % values in each 1 arc-second grid) in a binary form (i.e., 1 or 0) assuming substantial tree cover was only present in 30 m2 grids with average tree cover greater than 20 %. Following the simplification of tree cover data, we approximated the proportion of tree cover in a 1 km buffer area around the lakes using the formula: tree cover (t j ) = (∑Covered grids in buffer area j ) × (∑Number of grids in buffer area j )−1, where a t j value of 1 represents complete tree cover in the buffer area around lake j. In order to prevent over parameterising our models in subsequent analyses, we summarised both variables (1) and (2) using Principal Components Analysis on the Ade4 statistical package (Dray and Dufour 2007) where a single covariate (principal component 1) was used as a proxy for habitat accessibility (accessibility). In addition, we also wanted to account for the potential scale dependence of biotic resistance (Byers and Noonburg 2003), and this was done by approximating lake area (area) based on Google Earth satellite images. All spatial analyses (i.e., construction of polygons approximating lake area, construction of buffer areas measuring 1 km away from lake shores, conversion of tree cover data, and summing of grids positive for tree cover as well as total number of grids present in buffer area) were conducted on the QGIS*2.10 geographic information system (QGIS Development Team 2014).

Statistical analyses

To answer our first research question (i.e., is the presence of alien fish taxa associated with community level food web traits?), we narrowed down the list of variables potentially predictive of the number of native species in a habitat from among the values generated from our hypothetical food webs to the following: w rich , w native , w c , w d , w l , w o , w h , w i , and w t . These variables were selected considering the frequency of use in food web literature and potential ecological importance as proxies of biotic resistance. The relevance of these variables to our analyses are summarised in Table 1.

Table 1 Summary of variables describing the traits and network topology of our food web models

We tested the association between number of alien species present with variables describing food web traits and network topology by fitting multiple generalized linear mixed-effects models (glmm) in an inferential framework, from which the model(s) which best described our data was (were) selected (Burnham and Anderson 2002; Johnson and Omland 2004; Fox and Weisberg 2011). Our models consisted of combinations of variables described in Table 1 and were built according to plausible a priori hypotheses (Appendix E in ESM). The response variable in all the models was total number of alien species present in the community (i.e., total aliens). As our response variable is a count data, we applied poisson error structures in all our models. Habitat type (i.e., artificial or natural) was coded as a random-effect, within which the models were nested. This was to address potential confounders resulting from the distinct natural histories of the two habitat types, which may include, but are not restricted to: age, colonisation history, proximity to human activities, and heterogeneity of riparian zones (Gopal and Ghosh 2009). In addition to that, accessibility was also added to our models as a random slope variable to control for differences in public accessibility of the lakes as an approximation of human activity, which correlates with propagule pressure (Marchetti et al. 2004a). Moreover, we also split our models into two groups, assuming the presence and absence of scale dependence, respectively, where the latter includes area (km2) as a covariate (Appendix E in ESM).

All predictor variables in our models were scaled to avoid potential biases resulting from differences in magnitudes. We also avoided building models with more than three predictor variables to prevent over-parameterising our models, as the co-dependent nature of the variables describing food webs (i.e., proportion of basal, intermediate and top taxa will necessarily sum to 1) means that high levels of multicollinearity exist in the data set. This prevents the inclusion of large numbers of variables (e.g., a saturated model in which all variables are included). We also addressed the issue of multicollinearity by computing variance inflation factors (vif) of equivalent generalized linear models (i.e., same set of predictor variables excluding random nesting effect) using the car package (Fox and Weisberg 2011) ensuring that all predictive variables in the models had a vif value of < 3. One of the more problematic collinearities (should it be significant) was the relationship between area and w rich in view of its potential to confound assessments pertaining to the scale dependence of the association between number of alien species and community level food web traits and network topology. Therefore, in addition to ensuring a vif value of < 3 for all models, the linear relationship between area and w rich was also tested by fitting the following generalised linear model: area ~ w rich ; and we found that the association between the two variables were not statistically significant (p value = 0.201). All our models were built with the lme4 statistical package (Bates et al. 2015).

In order to determine the predictive ability of our parameterised models (i.e., models describing a priori hypotheses associating combinations of native species richness and food web traits and network topology to number of alien species present) relative to the null models (i.e., model describing the null hypothesis which assumes no association between number of alien species present and variables representing native species richness and food web traits and network topology) (Appendix E in ESM), we calculated Kullback–Leibler information loss based on Akaike’s information criteria corrected for small sample sizes (AICc) (Burnham and Anderson 2002). Information loss relative to the null model was defined as Δi = AICci − AICcMin where AICci represents the AICc value of the ith model while AICcMin represents the AICc value of the most parsimonious model (i.e. lowest AICc value) (Lukacs et al. 2007). The probability of the model given the data (Akaike weights) was then computed with the following formula: \(w_{i} = \frac{{\exp \left( { - \frac{1}{2}\Delta_{i} } \right)}}{{\sum\nolimits_{j = i}^{N} {\exp \left( { - \frac{1}{2}\Delta_{i} } \right)} }}\), where exp represents the exponential function (Lukacs et al. 2007). In addition to that, we also calculated evidence ratios (e): \(\frac{{w_{i} }}{{w_{Null} }}\) where w Null represents the Akaike weight of the null model, allowing us to determine the likelihood of the ith model in comparison to the null model given our data (i.e., a value of 10 indicates that the ith model is ten times more likely to be true as compared to the null model). In order to best represent our data, we selected the best models by setting a cut-off point of ΔAICc = 2 (where ΔAICc refers to differences in AICc values) from the model with the lowest AICc value (Burnham and Anderson 2002). Should there be more than one model selected using this criteria, the most parsimonious models were averaged using via the MuMIn package (Barton 2013). Further, we assessed general model fits by calculating marginal and conditional R 2 values for all the models (Nakagawa and Schielzeth 2013). The relationship between predictor variables and the response variable (total aliens) is linear, so a coefficient value of 1 suggests that a one unit increase in the associated predictor variable would result in a concomitant single unit increase of total aliens.

To answer our second research question at a species level, we conducted non-metric dimensional scaling (nMDS) of the literature derived dietary components of 370 fish species using the vegan package (Oksanen et al. 2013) to visualise potential dietary separation between native and alien fishes. Clusters of native and alien fishes in ordinal space were represented by dispersion ellipses calculated based on the standard error (s.e.) of the average scores. We tested for statistical significance in the separation between native and alien fishes using PERMANOVA with the adonis function on the vegan statistical package (Oksanen et al. 2013). Further, we constructed a contour plot derived from smooth surface thin-plate splines from a general additive model (gam) of calculated trophic level (s l ) (Marra and Wood 2011).

Besides testing for potential differences in the dietary component of alien fishes in comparison to native taxa, we also tested if native and alien species had significantly different species-specific trophic traits. We did this by conducting information-theoretic analyses on a data set consisting of 970 data points of 370 fish species (a single species may occur in multiple habitats), using combinations of species specific trophic traits (Table 2) as predictor variables and the status of the fish (i.e., native or alien) as a response variable. All predictor variables were again scaled prior to analysis. As our response variable is binary, we applied binomial error variance structures in all our models (i.e., logistic regression), and controlled for family and locality (i.e., habitat from which the species was recorded) by including them as random nesting factor. This allowed us to minimise potential confounders caused by phylogeny and habitat specific stochastic events (e.g., fish stocking). The list of models we built and tested based on a priori hypotheses are summarised in Appendix F in ESM. The parameterised models (i.e., models describing a priori hypotheses associating combinations of species-specific trophic traits with the status of a fish) were compared to the null model (i.e., model which assumes no association between any species-specific trophic traits with the status of a fish) and quantified by calculating Akaike weights and evidence ratios. Here again, we estimated the general fit of the models. In our species level analyses, the association between predictor variables and the response variable (status) was assessed via logistic regression, thus a coefficient value of 1 suggests that a one unit increase in the associated predictor variable is e 1 times more likely to be true for the non-intercept status (i.e., native), where e represents the mathematical constant, Euler’s number. All statistical analyses were conducted on the R statistical environment * 2.15.1 (R core team 2015).

Table 2 Summary of species specific trophic related traits we included in our information-theoretic analyses

Results

Out of the 34 models we constructed, two models, which cumulatively accounted for nearly 100 % of the Akaike’s posterior weights, best represented our data at the community level. This suggests that total native richness (w native ) was the most informative trait in predicting the number of alien species in the community, with an evidence ratio indicating that the models were 2.48 × 109 and 1.92 × 109 times more likely than the null model (Table 3). The total native richness (w native ) of a community reflected a significant negative relationship (p value < 0.05) to number of alien species (Fig. 2; Table 4). In addition to that, models which were controlled for lake size (area) also generally outperformed their counterparts, suggesting the importance of accounting for spatial scale of the habitats.

Table 3 Summary of the five most parsimonious models describing the relationship between variables describing traits and network topology with the proportion of native species present in 26 lentic habitats
Fig. 2
figure 2

Plots representing coefficients (top) and predicted relationships (bottom) between total aliens and predictor variables in our most parsimonious community level model. Associated errors (95 % confidence intervals) are denoted by whiskers and grey zones respectively

Table 4 Details of the averaged model derived from two models of greatest parsimony describing community level trends

At the species level, the results of our nMDS (stress value = 0.096) revealed distinct, separate clusters of native and alien fishes (Fig. 3), suggesting the potential dietary segregation between the two groups of fishes. We also observe qualitatively that alien fishes appear to feed at lower trophic levels (i.e., overlaps contour line representing trophic level 3.5). The clustering and segregation between the two groups of fishes was statistically significant (p value <0.0005), with detritus (57 %), macroinvertebrates (20 %), and plant matter (9 %), the food items accounting for most of the variance in our data.

Fig. 3
figure 3

Non-metric dimensional scaling (nMDS) plot of the dietary components of 370 fish species from 26 lentic habitats in Southeast Asia. Ellipses on the plots represent standard errors (s.e.) of average ordinal scores while contour lines represent smooth surface thin-plate splines from a general additive model (gam) of calculated species trophic level. Food items of the fishes are represented by bold letterings on the plot

Our qualitative assessments were also supported by the more quantitative information-theoretic approach where results reveal that the model describing associations between fishbase trophic level (s f ) and maximum length with the status of a fish (i.e., native or alien) was the most parsimonious, representing 95 % of the total posterior weights in our data (Table 5) and was 2.58 × 1013 times more likely to be true compared to the null model (SpNull). This suggests that fishbase trophic level (s f ) and maximum length were important variables in predicting the status of a fish. Other predictor variables which made up the most parsimonious models include s l which was our alternative (more flexible) measure of trophic level. We found that the alien fishes were associated with lower trophic levels and greater maximum length (Figs. 4, 5; Table 6). To further explore the differences observed, we calculated the calculated Cohen’s d and obtained medium effect sizes of 0.34 and 0.41 for s f and maximum length, respectively.

Table 5 Summary of the five most parsimonious models describing the relationship between the status of a fish (i.e. native or alien) and species specific trophic related variables in 26 lentic habitats
Fig. 4
figure 4

Boxplots showing the distribution of maximum length and fishbase trophic level (s f ) of native and alien fishes in our data set consisting of 970 data points representing 370 fish species in 26 lentic habitats

Fig. 5
figure 5

Plot of predicted probability of a fish species being native as a function of predictor variables associated with our most parsimonious species level model, namely, fishbase trophic level (s f ) and maximum length

Table 6 Details of the most parsimonious model describing species level trends

Discussion

Community level food web traits

Our findings suggest that at the community level, native species richness appeared to be the most important determinant of the invasiblity of a habitat, where an inverse relationship was observed between the total number of native and alien species. This observation concurs with early studies discussing the relationship between the diversity of native communities and their biotic resistance against invasions (Elton 1958; Moulton and Pimm 1983). The negative relationship between native species richness and the number of alien species we observed can likely be attributed to a lack of competition in less species rich communities resulting from a lower degree of resource utilisation (Tilman 1997, 2004). Communities which have lower species richness are also likely to have vacant or unsaturated niches with unexploited resources which favours the establishment of introduced taxa (Wiens 2011). Moreover, less species rich communities are more vulnerable to stochastic disturbances (Petchey 2000) that could result in disruptions to ecosystem function, which may in turn be exploited by biological invaders (Shea and Chesson 2002). While one may fairly assume that the negative relationship observed between richness of native and alien fish species may be a product, rather than a driver of invasion, this is less likely to be the case in our study as biological invasions have not been shown to cause native extirpations in Southeast Asian habitats (Brook et al. 2003; Sodhi et al. 2004).

Despite the seemingly intuitive nature of the resistance hypothesis suggesting an inverse relationship between the diversity of native species and the probability of biological invasions (Elton 1958; Stachowicz and Tilman 2005), some recent studies disagree with the hypothesis and suggest that at larger scales, the opposite trend, termed the “biotic acceptance” hypothesis, is true (Byers and Noonburg 2003; Jeschke 2014). Others proposed that native species density rather than diversity was a more important determinant of resistance against invaders (Levine et al. 2004; Capers et al. 2007). However, evidence supporting the deterrent properties of biotic resistance (represented by community diversity) against invasion at smaller spatial scales is considerable (Kennedy et al. 2002; Stachowicz and Tilman 2005; Oakley and Knox 2013), suggesting that biotic resistance is not a universal phenomenon and is only observable in studies conducted at certain scales. The data collected in our study appears to lend further support to the scale dependence of the biotic resistance hypothesis as evidenced by how, with all other factors being equal, controlling for lake area resulted in greater model performance (Table 3). Even at relatively restricted range of spatial scale (i.e., all data points were at the local, single habitat level), findings from our study demonstrate the importance of contextualising (e.g., spatial scale or organisation level) findings pertaining to biotic resistance. This may reconcile our findings with previous studies suggesting “biotic acceptance” in animal models, which were surveyed at larger spatial scales (Poessel et al. 2013), considering that disproportionately large increases in resources available relative to the potential increase in interspecific interactions with increasing spatial scales (Byers and Noonburg 2003) could potentially reverse the directionality of relationships between native and alien species richness (i.e., from negative to positive).

In addition, contrasting observations made in studies proposing the “biotic acceptance” hypothesis were largely conducted on plant models (Levine et al. 2004; Capers et al. 2007; Jauni and Hyvo 2012; Lowry et al. 2013), potentially explaining the opposing trends reported. The different criteria for resource utility between plants and animals, which are arguably less narrow in the former, may affect the intensity of competition, which is an important determinant of resistance against biological invasion (Levine et al. 2004). To illustrate, while an autotrophic invader is capable of exploiting surpluses in nutrients (Huenneke et al. 2014) or light (Parendes and Jones 2000), a heterotrophic alien is constrained by its dietary requirements. Consequently, the intensity of competition among heterotrophic taxa is elevated, thus increasing the potency of biotic resistance against invaders. Although this line of reasoning is speculative based on current knowledge, the logic is consistent with diminishing resources (i.e., energy) up the trophic levels (Lindeman 1942; Pauly and Christensen 1995), and with field studies showing that interspecific competition is more commonly observed in marine and freshwater vertebrates as compared to marine, freshwater, and terrestrial plants (Connell 1983). Comparisons with terrestrial plants are especially relevant considering the model systems studied in a large proportion of “biotic acceptance” proponents. This again suggests that the negative association between native diversity and biotic resistance may remain valid, at least in animal models (Dzialowski et al. 2006; Taylor and Duggan 2012), given the appropriate spatial scale, and at the very least, merits further testing.

One limitation in this section of our study was the absence of data directly quantifying anthropogenic activity such as fish stocking, particularly in view of the presence of some evidence suggesting fish stocking activity in several of the habitats we studied (Yang 1989; Goltenboth and Lehmusluoto 2006). We attempted to address this by including habitat type as a random intercept with the assumption that artificial habitats, such as reservoirs, experience a greater level of anthropogenic impacts. In addition, a survey of aquaculture literature revealed that of the 74 species which were recorded as alien in at least one of the 26 habitats we analysed, only 11 (i.e., Barbonymous gonionotus, Clarias batrachus, Cyprinus carpio, Ctenopharyngodon idella, Gambusia affinis, Hypothalmichthys molitrix, Hypothalmichthys nobilis, Labeo rohita, Oreochromis mossambicus, Oreochromis niloticus, and Pangasianadon gigas) were historically stocked in Southeast Asia, though not necessarily in any of the 26 habitats included in our study (De Silva 1987; Balayut 1999; De Silva and Funge-Smith 2005; Arthur et al. 2010). The 11 species mentioned above were also native in some of the habitats we analysed and will not confound our analyses. Moreover, even in habitats where they were not native (e.g., P. gigas is native in Tonle Sap but not in Lake Taal), these species were not always successful in forming established populations (Balayut 1999; De Silva and Funge-Smith 2005), suggesting that fish stocking is not an important driver of biological invasions, at least in lentic water bodies in the region.

Confounding effects resulting from differences in anthropogenic activities were also minimised by our measures of controlling for accessibility of the lakes with the assumption that the level of human activities correlate with propagule pressure (Meyerson and Pys 2013; Roux et al. 2013). Should our control measures fail to completely account for all facets of anthropogenic disturbances, our assertions remain relevant, as a known consequence of habitat disturbance is species loss, which results in reduced ecosystem functioning (Duffy 2003), and consequently favouring biological invasions (Stachowicz and Tilman 2005). Therefore, even when assuming anthropogenic disturbances other than what we attempted to account for to be the ultimate driver of biological invasions, biotic resistance is likely to be one of the more important proximate factors determining habitat invasibility. To further illustrate this point, disturbed habitats are inherently less suitable for native taxa adapted to status quo in their native ranges and are thus likely to suffer impacts to abundance and richness (in cases where extirpations occur). We argue that the “passenger theory” pertaining to biological invasions and habitat change (Didham et al. 2005; MacDougall and Turkington 2005) should not necessarily be viewed as being exclusive of biotic resistance, as an impacted native community would logically be less able to repulse invading aliens via mechanisms such as competition or predation. In short, the “passenger” status of habitat degradation should apply to both alien and native species, albeit with opposite effects (i.e., in favour of the former but not the latter), giving rise to more synergistic/interactive consequences (Didham et al. 2007). Having said that, we believe that the relative importance and exact interactions in the effects of abiotic (habitat disturbance) and biotic resistance in predicting biological invasions is an important avenue for research and warrants future experimental and empirical work.

Finally, the paucity of high-resolution food web data at our study sites necessitated the use of modelling approaches to approximate food web structure. While this approach has been shown to be a relatively reliable (Pace et al. 1999; Williams and Martinez 2000), there remain shortcomings, particularly in its tendency to oversimplify trophic relationships and aggregate species (Polis 1991). Although our most important finding, which is the negative relationship between the richness of native and alien fish present, will not be affected by potential lack of resolution in our food webs, it is important to note that food web traits and network topology may be more important in predicting the invasibility of communities than our findings suggest. Despite the likelihood that these traits were underestimated, our findings suggest that w d was one of the more important predictors of total number of aliens present (Table 3; Com23), and the model parameterised with it was 6.45 × 103 times more likely to be true compared to the null model (i.e., ComNull) and had a negative albeit not statistically significant (p value = 0.14) relationship.

As all our food webs were constructed using identical methods, we also have no reason to believe that increasing food web resolution (i.e., less species aggregation and increase number of links attributed to ontogeny) would change the directionality of relationships between food web traits and alien species richness, thus the central thesis of our community level inferences is likely to remain unchanged. However, we note that the inability to further taxonomically resolve invertebrate taxa may result in the inability to detect potential differences in food web structure in our analyses, and this may be relevant in comparisons between communities that have vastly different assemblages of non-fish taxa.

Species trophic traits

In addition to the association between native species richness and the invasibility of a habitat, our data also suggest that at the species level, alien taxa with certain trophic traits are more likely to become established. In both our nMDS (Fig. 2) and information-theoretic analyses, we found that alien species fed lower down the food web in comparison to native taxa. This observation raises three possible scenarios: (1) native species feeding at lower trophic levels are more likely to be displaced by alien fauna; (2) alien species occupying lower trophic levels are more likely to be released (i.e., introduced), thus increasing propagule pressure; or 3) alien species feeding lower down the food chain are more likely to be established in lentic habitats in the region because of the availability of poorly exploited resource niches. While (1) is possible, there is again little direct evidence to support the occurrence of fish extirpations resulting from biological invasions (Brook et al. 2003; Sodhi et al. 2004), particularly, as a result of competitive exclusion (Sax and Gaines 2008). This leaves the competing hypotheses (2) and (3) as the more likely factors underlying our observations, both with their own merits. While (2) describes a likely scenario in aquaculture (Tacon et al. 2009) attributable to the relative ease in growing low trophic level species, we believe that fish stocking is not an important driver of biological invasions in the lakes we studied due to reasons discussed in preceding paragraphs. We cannot, however, preclude the possibility that species feeding at lower trophic levels are being released at a disproportionately higher rate from the ornamental fish trade, though there is no data currently available to suggest that this is true. Anecdotally, fish occupying high trophic levels are just as likely to be released as unwanted pets (e,g., Scleropages formosus) or as angling species (e.g., Cichla orinocensis), as are low trophic level taxa (e.g., Pterygoplichthys pardalis). Unless further evidence is made available, we argue that (3) is the most well supported explanation as species feeding lower down the food web have been shown to be more likely to overcome resource limitations by utilising low quality food sources with low energy yield (Marchetti et al. 2004b; Gido and Franssen 2007). The association between alien taxa and food items of low energy yield (i.e., detritus and plant matter which collectively accounts for 66 % of the variance in our data) further supports this line of argument (Fig. 3). Our findings are consistent with theoretical (Dunne et al. 2002b; Romanuk et al. 2009) and empirical (Moyle and Light 1996; Gido and Franssen 2007) studies demonstrating a higher likelihood of invasion success among species that are capable of exploiting resources available at lower trophic levels, which suggests that competition and resource utilisation were the primary mechanisms underlying biotic resistance, at least in the fish communities we assessed.

We also observed a discrepancy of sizes between native and alien taxa suggesting that the ability to attain size refuge and consequently achieve predator avoidance is an important factor favouring successful alien establishment. As predation by native taxa is one of the factors contributing to biotic resistance (Harvey et al. 2004), this finding was not unexpected. Alternatively, the differences observed could again be attributed to local natural history as native species in certain areas (e.g., Singapore) are largely small forest stream specialists (Yeo and Lim 2011) and are necessarily more diminutive. Local natural history is another possible mechanism underlying the disparity between alien and native fishes belonging to the pelagic planktivore feeding guild (Table 6) where regions with no large rivers or lakes are not likely to have native planktivorous fish (Goldstein and Meador 2004).

We attempted to minimise potential confounding effects by controlling for locality as a random intercept in our glmm analysis with the assumption that the different habitats will have different propagule pressures. This is especially applicable to reservoirs in Singapore where high connectivity between water bodies and high public accessibility of reservoirs may increase propagule pressure (Yeo and Lim 2011). Further we observed substantially lower marginal R 2 values in comparison with conditional R 2 scores in all the models suggesting that at the species level, much of the variance in the data is associated with location and phylogeny, both random factors we controlled for in all our models.

Conclusions: biotic resistance revisited

Testing of the biotic resistance hypothesis in relation to food web traits or network topology often faces the challenge of inherent complexities in elucidating complex food webs. Consequently, many published studies of this nature use simulated food webs (Byers and Noonburg 2003; Romanuk et al. 2009; Baiser et al. 2010) as model systems. In our study, we applied a combination of (1) modelling food webs using data from literature and (2) statistical analyses to answer two research questions where we found trends in the establishment of alien freshwater fishes at both the community and the species level. At the community level, we found that communities with high native species richness were associated with lower numbers of aliens, while at the species level, we found that alien fishes fed at lower trophic levels compared to the natives. Both these observations corroborate the role of biotic resistance in deterring the establishment of alien species as our findings suggest that empty niches or a surplus of resources, which is likely to be more common in less speciose communities (Stachowicz and Tilman 2005; Jeschke 2014) and in lower trophic levels (Sinclair 1975), favour the establishment of alien species. Moreover, the differences in species level trophic traits between alien and native fishes suggest that the main mechanism underlying biotic resistance in the habitats we studied were likely to be related to resource utilisation. In addition to other known drivers of biological invasion such as propagule pressure (Simberloff 2009), and anthropogenic disturbances (Mack et al. 2000), our study shows that biotic resistance remains an important factor, which should not be overlooked when assessing the successful establishment of alien species in a community.