Introduction

To effectively monitor and protect rare species, we must know the geographic locations of their populations, but many rare species lack precise distribution information. Field surveys are needed to fill these gaps (Peterson et al. 2011), but they are time-consuming and expensive (Lindenmayer et al. 2013). Thus, reliable protocols are needed to help efficiently direct survey efforts.

Species distribution models have recently proliferated as conservation tools (Guisan et al. 2013). By using species occurrence data along with environmental or spatial predictors, the habitat suitability or probability of occurrence for a species can be predicted across an area of interest (Guisan and Zimmerman 2000; Elith et al. 2006). SDMs have also been used to predict current species richness (e.g. Guisan and Theurillat 2000; Newbold et al. 2009; Parvianen et al. 2009), predict future species richness with changing climate (Thuiller et al. 2005), predict shifts in species’ distributions (e.g. Thuiller 2004; Elith et al. 2010; McKenney et al. 2015), assess the efficacy of current protected areas and propose locations for new ones (Loiselle et al. 2003; Koch et al. 2017; Amaral et al. 2017), prescribe management for invasive species (Bennett 2014), and find new occurrences of rare species (e.g. Williams et al. 2009; Rebelo and Jones 2010; Peterson et al. 2011; McCune 2016).

Outputs of SDMs for individual species can be stacked together to create one composite map (Ferrier and Guisan 2006); this technique is often referred to as S-SDM (where the “S” stands for stacking; Dubuis et al. 2011). S-SDMs offer an effective method for highlighting areas of special conservation importance based on predictions of species richness of a taxonomic group of interest (Newbold et al. 2009; Yu et al. 2017), and have been used to predict threatened species richness (Parvianen et al. 2009; Koch et al. 2017). Knowledge of which areas have the highest potential concentration of rare species can point to promising survey sites, thus leading to the discovery of previously unknown rare species locations (Williams et al. 2009). This in turn increases the accuracy of the estimate of the number of rare species populations, which allows for a more accurate assessment of a species’ conservation status and better-informed management.

Two aspects that have begun to receive attention in the S-SDM literature are: (1) accuracy of SDMs, and (2) species’ conservation status. The accuracy of distribution models for individual species (not stacked) has been discussed extensively (e.g. Segurado and Araaujo 2004; Elith and Leathwick 2009). A model that overpredicts suitable habitat and makes commission errors will waste survey resources by sending surveyors to unsuitable areas that were predicted to be suitable, while a model that underpredicts suitable habitat and makes omission errors will cause surveyors to overlook areas where the species is present (Loiselle 2003). Stacking individuals models together in a S-SDM can lead to inaccurate predictions in the form of species richness (usually, but not always, as an overestimation, Guisan and Rahbek 2011), for which correction methods have been proposed (Calabrese et al. 2014; Del Toro et al. 2019; Pouteau et al. 2015). Fewer studies have accounted for individual model accuracy, within the resulting S-SDMs (but see Dunn et al. 2016). When working within an S-SDM framework, there will be variation in the accuracy of the individual models being stacked due to the differing characteristics of the species being modeled (Guisan et al. 2007; Syphard and Franklin 2010), the quality of the data for each species (e.g. Graham et al. 2004; Moudrý and Šímová 2012), and the relative prevalence of the species (Hernandez et al. 2006; Le Lay et al. 2010). Fernandes et al. 2018 examined the effects on the predictive accuracy of S-SDMs for virtual species by varying individual model components known to affect accuracy at the single model level and found that sampling size and modeling technique had the largest impacts. However, we have yet to see individual SDM accuracy for real species quantitatively estimated based on data independent of the records used to build the SDM, rigorously evaluated, and weighted within the S-SDM framework itself. In fact, assessing the accuracy of SDMs using independently collected presence and absence data is rarely done in any context. It is important to take individual model accuracy into account in the stacking process if the goal is to focus on survey locations with the highest probabilities of containing at least one of the species of interest, as would be the case in attempts at rare species discovery.

Secondly, many agencies prioritize management of species based on rarity (e.g. ESA 1973; SARA 2002), so knowing where less prevalent species are may be more important than knowing where more prevalent species are. Some modelers have noted this fact and have weighted individual models in S-SDMs by species’ relative rarity (Albuquerque and Beier 2016; Tukainen et al. 2017; Yu et al. 2017), or have used the habitat specificity of the species (Miličić et al. 2017) to highlight potential areas important for threatened species conservation. However, to our knowledge, no study has tested whether weighting individual model outputs by both rarity and accuracy within an S-SDM framework leads to an increased detection rate of priority threatened and rare species.

To do so, we built SDMs for 22 rare vascular plant species in southern Ontario, Canada and stacked them, weighting each individual model by accuracy, species conservation status, both, or neither. This weighting procedure allowed us to create an index based on the resulting efficiency values, from which we conducted field surveys in areas predicted to be suitable for one or more species to test the effectiveness of these S-SDM maps in leading to the discovery of new rare plant occurrences. We aimed to test (i) How much more likely are surveys to lead to the discovery of at least one rare species in cells with high predicted probability of occurrence for multiple rare species, compared to cells with high predicted probability of occurrence for only one species (i.e. how much do we gain in survey efficiency by stacking SDMs), and (ii) What effect does weighting individual species’ models by model accuracy and/or species rarity have on survey efficiency?

Methods

Study region and species

We focused our study on the forests of Southern Ontario (Fig. 1), predominantly in the Carolinian forest zone of the extreme southwest (Crins et al. 2009), which is characterized by deciduous canopy cover. We also located some study sites in the forests north and east of the Carolinian zone, which are characterized by mixed deciduous-evergreen cover (Crins et al. 2009). Southern Ontario is important for vascular plants at both the provincial and national level, as approximately 72% of Ontario’s and more than 40% of Canada’s plant species occur here (Oldham 2017), despite the dominance of urban development and agricultural land use (Crins et al. 2009). The total study area encompasses approximately 7 million hectares.

Fig. 1
figure 1

Study area showing the range of efficiency index values based on weighting individual model outputs by species rarity and model accuracy with the multi-species cell (MSC) and single species cell (SSC) sites surveyed in 2017 overlaid

We initially selected 27 vascular plant species native to Ontario and growing in woodland habitats, based on their conservation importance, and their relative ease of identification in the field. We defined rare species as those ranked S1, S2, or S3 at the provincial level, corresponding to critically imperiled, imperiled, and vulnerable species (Faber-Langendoen et al. 2012, Table 1). Most of the species included here are at the northern edge of their range within southern Ontario, with large portions of their ranges extending south into the United States. The species represent a wide taxonomic range and include trees, shrubs, herbs and ferns. Nomenclature follows the Ontario Natural Heritage Information Centre (NHIC, https://www.ontario.ca/page/get-natural-heritage-information).

Table 1 Model information for each of the 27 species tested, including the 22 species included in the S-SDM efficiency map

Building and evaluating individual models

We obtained presence-only occurrence records for each species from the NHIC. These records include herbarium records, field surveys by Ministry of Natural Resources (MNRF) biologists, and other confirmed sightings. We used only records that had < 100 m accuracy to build the SDMs, to correspond with the resolution of the environmental variables (see below). The total number of occurrence records we used to build the models ranged from 4 to 1594 with a mean of 83.7 records per species and a median of 17.5 (Table 1). Species with occurrence records on the lower end of this range are potentially subject to overfitting (Merow et al. 2013); however, SDMs can perform well even with sample sizes as low as 5 (Hernandez et al. 2006; Pearson et al. 2007; van Proosdij et al. 2016) and we are specifically studying rare species with limited available records. Additionally, species that have lower prevalence (i.e. percent area occupied within the study region) require fewer records than higher prevalence species to achieve similar model accuracy (van Proosdij et al. 2016). In the case of most of the species we modeled here, they can reasonably be assumed to have low prevalence within our study region given the infrequency of sightings (see results for further validation of this assumption). Records date from 1897 to 2012. We considered high spatial accuracy to be the most important criterion for including occurrence records. However, we explored the effect of record age on the performance of SDMs and found minimal effect on model accuracy when using records with < 100 m spatial accuracy (unpublished data). While we only used the partial range for each species in their respective SDMs (the study region as opposed to their entire extent of occurrence), our study region represents the edge of the range for most species included here, which has been found to produce more accurate model results than including the complete range (Luoto et al. 2005). Because we evaluated each SDM prior to including it in the S-SDM based on independently collected presence and absence records, we are able to exclude SDMs that had low accuracy as a result of few occurrence records, sampling bias in the records, or any of the many other factors that can influence SDM accuracy (see Merow et al. 2013).

We collected data on climatic, topographic, soil, and surficial geology environmental variables to use as predictors in the models (Online Resource 1). Previous research (McCune 2016) suggests that forest type and the amount of forest on the landscape surrounding a site can influence whether a threatened plant species is present. Therefore, we also collected data on forest contiguity (number of 1-hectare cells out of the 9 × 9 cell area immediately surrounding the focal cell that are forested) and land cover type (deciduous forest, mixed forest, swamp, etc.) across the study region. We tested for multicollinearity between environmental predictors and used only those that were not highly correlated (r < 0.7), following Gogol-Prokurat (2011). Where 2 variables were correlated at r > 0.7, the more generalized variable was used (e.g., mean annual temperature and mean temperature of the warmest quarter were correlated so the former was kept). We resampled each variable to a 100 m × 100 m resolution using the Resample function in ArcGIS and the “Majority” resampling technique (see Online Resource 1 for original spatial resolutions for each variable).

Because the occurrence data were presence-only records and were limited in number, we chose to use Maxent (Phillips et al. 2006) to build SDMs. Maxent has been shown to perform very well using a range of performance measures (Elith and Graham 2009), even when few species occurrence records are available (Hernandez et al. 2006; Pearson et al. 2007; van Proosdij et al. 2016). We built 8 different SDMs for each species (Fig. 2). As a beginning foundation, all models for each species contained the climatic, topographic, soil, and surficial geology predictor variables (14 predictors). Additionally, we built models for each species that included either the forest contiguity or land cover variables or both to determine if these less commonly used variables would help improve the accuracy of models for any species. We ran each of these model types twice: once with a regularization multiplier set to 1 and once with regularization set to 0.5. The regularization parameter can correct for model overfitting, with higher values allowing for a more generous inclusion of predicted suitable habitat and lower values providing a more conservative estimate of predicted suitable habitat. Given the desire to narrow down the area for potential survey sites as much as possible, we chose to test a lower regularization parameter value in addition to the default value of 1.Testing different regularization parameters is recommended, in order to arrive at the best-performing model (Merow et al. 2013). We set aside a random subsample of 25% of available records to test each model and fit the model 10 times with different model fitting and test subsamples, making the final result an average of the outputs from the subsamples. We used Maxent’s cumulative output in which each grid cell receives a score from 0 to 100, which can be interpreted as the percentage of cells that have a value equal to or less than that cell’s value (Merow et al. 2013). We set features in Maxent to auto (which includes linear, quadratic, product, threshold, and hinge functions), jackknife to measure variable important was selected, and set the background as the complete study area. We set the random test percentage to twenty-five percent with replicated run type of subsample.

Fig. 2
figure 2

Flowchart showing progression of steps taken to attain efficiency maps from Maxent SDMs

We evaluated the models with independent presence and absence data originating from two sources. First, we obtained independent presences from the NHIC’s central holdings database, which consists of records that have not yet been added to the main database of occurrences. Second, we obtained independent presences and absences from 2014 and 2015 field data in which botanists surveyed 156 100 m × 100 m cells throughout Southern Ontario predicted to be suitable for one or more rare plant species (see McCune 2016; McCune et al. 2017). We excluded any independent presence located within the same grid cell as a record used to build the SDM, using the raster package in R (R Core Team 2016; Hijmans and van Etten 2017). We also excluded absences if the grid cell was surveyed outside the time of year during which the plant is present and identifiable.

We chose one SDM for each species based on three criteria: (1) first, we chose the model with the highest sensitivity (the percent of actual presences correctly classified as presences by the model) based on calculations with the independent data (i.e. the model that predicted the highest number of independent presence records as suitable) using a threshold that achieved 10% omission of training presences for species with at least 15 records, and 0% omission (i.e. threshold = minimum training presence suitability) for species with fewer than 15 records.; (2) if more than one model was tied for the highest sensitivity, we chose the model with the highest AUC (a threshold independent measure of model performance, Fielding and Bell 1997); (3) if there was still a tie between models, we chose the model with the lowest area predicted suitable (Engler et al. 2004). We considered sensitivity to be the most important measure of predictive performance for this study because we wanted to minimize the number of false negatives produced by each model. If an area is falsely characterized as unsuitable, then it will not be surveyed and rare occurrences will be missed as a result of omission error (Liu et al. 2016). We used a 0% omission rate for species with very few occurrences because with fewer known occurrences, it is less likely that the lowest 10% suitability values represent areas unsuitable for the species; this approach is recommended by Pearson et al. (2007). We chose model AUC as a selection factor because of its independence from a threshold, its ubiquitous use by modelers, and its generally effective measure of model discrimination ability (e.g. Pearce and Ferrier 2000). Smallest suitable area was also used as a selection factor because narrowing down the area considered suitable for a species narrows down the potential survey area, an important consideration when time is a limiting factor.

It is important to recognize that the MaxEnt predicted habitat suitability does not necessarily correlate linearly with species probability of occurrence (Gogol-Prokurat 2011; Vaughan and Ormerod 2005). Therefore, we converted MaxEnt suitability outputs from the best SDM for each species (which range from values of 0 to 100) to estimated probabilities of occurrence (0 to 1 value range) based on the independent presence and absence data described above, using generalized linear models (GLMs) with a binomial link function. Gogol-Prokurat (2011) tested for a linear relationship between SDM output and probability of occurrence using a modified Hosmer–Lemeshow deciles of risk test. However, we recognize that these relationships may be significant but not linear, so we used binomial GLMs. We extracted Maxent suitability values for each presence and absence location, using the suitability values as the explanatory factor in the logistic model. We used the resulting GLMs to create new rasters for each species in which each cell represents the probability of occurrence of that species in that cell. This allowed us to estimate the likelihood of species presence in a cell based on independent data not used to build the SDM. To test the usefulness of the GLM for predicting species probability of occurrence, we compared it to an intercept-only model and deemed habitat suitability a significant predictor of probability of occurrence when ΔAIC was less than 2 (Burnham and Anderson, 2002).

S-SDM efficiency maps

We used only the species that had acceptable AUC values and GLM probability of occurrence models that were significantly better than an intercept-only model (ΔAIC > 2; Burnham and Anderson 2002) in the stacking process, to avoid using models that were not useful for predicting species distributions. We defined acceptable AUC values as those ≥ 0.6 since an AUC value of 0.5 is no better than random. Although a common practice is to consider AUC values ≥ 0.7 as adequate (Swets 1988), we adopted 0.6 as the cutoff because the species modeled here are rare, and we assumed that any information that is better than random is potentially important for creating useful models that could be used in management. Additionally, evaluating the models with independently collected data from field surveys (as opposed to data separated from the original dataset) further helped to ensure all models included in the stack produced reasonable results.

To create the efficiency maps, we stacked the estimated probability of occurrence model outputs for these 22 species in four ways: (1) individual estimated probability of occurrence maps added together with no weighting; (2) probabilities of occurrence added together and weighted by accuracy of the SDM (as measured by AUC) and by the S-rank (threat level) of the species, using the following equation:

$$E_{i} = \mathop \sum \limits_{j = 1}^{n} \left( {P_{ij} \cdot R_{j} \cdot A_{j} } \right)$$
(1)

where Ei is the resulting Efficiency Index of cell i, Pij is the estimated probability of occurrence of species j occurring in cell i, Rj is the rarity weight (i.e., provincial S-rank where S1 = 3, S2 = 2, S3 = 1) of species j, and Aj is the accuracy of the SDM for species j as measured by AUC; (3) models weighted by accuracy but not by rarity; and (4) models weighted by rarity but not by accuracy. We used a rarity weighting because rarer species are of greater priority and urgency for locating new populations. We used an accuracy weighting because when field survey time is limited, it is best to visit areas with greater confidence in the model predicting the species’ probability of occurrence.

Field surveys

We selected candidate cells for surveys that were either highly suitable for multiple species or suitable for just one species. We term the former multi-species cells (MSC) and the latter single-species cells (SSC). There has been some debate about the appropriateness of applying a suitability threshold and obtaining binary predicted presence/absence maps for individual species (bS-SDM) compared to using raw estimated probability of occurrence values (pS-SDM) (Guisan and Rahbek 2011; Calabrese et al. 2014; D’Amen et al. 2015); we used both methods in identifying MSC and SSC. Specifically, we defined MSC as being suitable for 2 or more species using a threshold allowing for a 10% omission rate for species with greater than 15 records used to build their SDM, and a 0% omission rate for species with fewer than 15 records (bS-SDM), while also having an efficiency index within the top 5% of all grid cells according to the s-SDM weighted by both accuracy and rarity (pS-SDM) (Parviainen et al. 2009). These two constraints taken together ensured that cells were not chosen for surveys that had low to moderate probability of species presence for many species. We defined SSC as being suitable for only one species (using the same omission rate threshold rules as defined above) independent of the cell’s efficiency value. We attempted to survey the same number of MSC and SSC within each tertiary level watershed (subdivisions of secondary watersheds which are mostly made up of large river systems) to reduce spatial segregation of MSC and SSC. On each survey day, we randomly chose several cells among the possible MSC/SSC within a given watershed and surveyed the first site for which we could obtain landowner permission. Because of the limited amount of time available to conduct these searches, both in our study and commonly in practice by field botanists, we did not survey sites with low habitat suitability for all species. These sites are unlikely to harbor any of our modeled species, and the goal of our study was to test the efficiency of S-SDMs versus single SDMs to direct surveys for rare species. Thus, these would not have been useful to survey.

We surveyed 70 cells (including 33 MSC and 37 SSC) on privately owned sites as well as protected areas (e.g. Nature Conservancy of Canada, Nature Trust, or Provincial Park land), between May and August 2017. We obtained written or verbal permission from the landowners for all privately owned sites, and research permits for protected areas. At each site, we navigated to the center of the 100 m by 100 m grid cell using a handheld GPS unit and used flagging tape and a compass to delineate four quadrants. The field team included one to four people with at least one trained botanist present. We walked the entire square grid cell systematically, recording all vascular plant species observed. Each survey lasted 2.5–5 h.

Comparison of S-SDM methods

We built logistic regression models to test the significance of the relationship between the efficiency values (four versions for each surveyed cell) and the probability of finding any rare plant species at that location based on our field surveys. We also tested for differences between MSC and SSC in the total, native, and exotic species richness recorded, using t-tests assuming unequal variances. We performed all data analysis in R 3.3.1 (R Foundation for Statistical Computing, Vienna, Austria 2016).

Results

The independent AUC value for the best MaxEnt models for each species ranged from 0.442 to 0.999 with a mean of 0.83 (Online Resource 2). Five species were excluded from further analysis due to low model accuracy: three based on low independent AUC values and two based on ΔAIC < 2 (Burnham and Anderson 2002) when comparing their GLMs to an intercept-only model. This left 22 species included in the S-SDM efficiency maps.

Of the 70 cells surveyed, 22 had at least one rare plant species (ranked S1, S2, or S3). Fifteen had one species, five plots had 2, and two plots had 3. We found a total of 30 occurrences of 17 rare plant species. Only 4 out of these 17 species were those we modeled and incorporated into the efficiency maps (Castanea dentata, Celtis tenuifolia, Cornus florida, and Lithospermum latifolium), the rest being incidental discoveries of species not modeled by the SDMs (Online Resource 3).

The probability of finding at least one rare plant species was approximately double in MSC compared to SSC. Including incidental rare plant discoveries, MSC had a 42.4% success rate (presence of at least one rare species) while the SSC had a 21.6% success rate. Not including the incidental species, the MSC had an 18.2% success rate while the SSC had an 8.1% success rate (Fig. 3). Results were qualitatively the same when excluding SSC sites where we had no paired MSC. None of the measures of species richness (total, native, or exotic) were significantly different between the two site types.

Fig. 3
figure 3

a The percent of multi-species cell (MSC) and single species cell (SSC) plots that had at least one rare plant species discovered, either including or excluding incidental species discoveries (species which were not modeled) and b Visualization of logistic regression model showing estimated probability of presence of at least rare plant species across efficiency index values of the S-SDM weighted by both rarity and accuracy with 95% confidence intervals

GLMs relating the estimated probability of occurrence of at least one rare plant species to the efficiency index showed little difference among weighting procedures used to create the S-SDMs. All ΔAIC values were < 5 and percent deviance explained by the full models was similar (Table 2). There was a significant positive relationship in the logistic regression between the efficiency index values of all the S-SDMs and the probability of at least one rare species (target and incidental species included) being present (p = 0.01). Results were similar when only target species were included. There was no significant relationship between the calculated efficiency index values and the field-measured total species richness (p = 0.21, R2 = 0.01) or total species richness and estimated probability of occurrence of a rare species (p = 0.23).

Table 2 Results of the generalized linear models predicting probability of at least one species of conservation concern based on S-SDM score for each of the different weighting procedures

Discussion

The efficiency maps we created by stacking the probability of occurrence model outputs for 22 rare plant species allowed for the discovery of new occurrences of 17 rare species. These discoveries were approximately twice as likely to occur in multi-species (MSC) sites, where multiple species were predicted to have suitable habitat, than in single-species (SSC) sites, where only one species was predicted to have suitable habitat. While we found new occurrences of only 4 out of the 22 species modeled, because we were working with rare species this discovery rate is not unusual (MacDougall and Loo 2002; Williams et al. 2009; McCune 2016). Our results point to the usefulness of pairing modeling with field surveys for uncovering previously unknown rare species occurrences, thereby increasing our knowledge of rare species distributions and habitat occupancy.

We found 11 rare plant species which were not modeled and not incorporated into the efficiency maps. This preponderance of incidental finds suggests that the efficiency index not only helps to find species explicitly modeled, but also those which are not modeled but also rare. Thus, the efficiency index is capturing something shared in the ecological niches of the species modeled as well as in some rare species that were not modeled. We noticed that the incidental rare species that we discovered tend to prefer mesic, moderately shaded, floodplain conditions in older woodlands. Therefore, our efficiency index may be especially helpful at highlighting these areas for surveys.

The most important variables included in the individual models according to percent contribution were surficial geology, land cover, and soil texture. The most important variables according to permutation importance (Online Resource 4) were annual mean temperature, precipitation of the warmest quarter, forest contiguity, and mean temperature of the growing season. It seems that these environmental variables largely dictate the presence or absence of rare species in our study area. The importance of the land cover variable to the performance of the majority of the models is of particular note given the unresolved question of its inclusion as an appropriate and useful predictor in SDMs; of especial concern can be its categorical (rather than continuous) nature and its resolution size in relation to species’ scale of interaction with their environments (Bucklin et al. 2015; Cord et al. 2014; Eskildsen et al. 2013; Wilson et al. 2013). Our results agree with those of Pearson et al. (2004), who found that adding land cover to their distribution models for plant species improved predictive performance Additionally, the results of Luoto et al. (2007) showed increased model accuracy including land cover as a predictor when spatial resolution was fine. These results also agree with those of this study given the high resolution of our variables and resulting cells within the models. It may be possible to even further improve model accuracy with the addition of continuous remote sensing predictors (Cord et al. 2014), however the good to excellent performance of most of our models indicates that a categorical land cover variable may be sufficient, especially for rare species that prefer distinct land cover type(s). We are not surprised that land cover was important for many of our modeled species, as it likely allowed MaxEnt to focus on areas of the proper forest type (coniferous, mixed, or deciduous) and remove areas of agricultural or urban land use.

The efficiency index performed better at predicting rare species presence than a simple measure of total species richness: there was no significant relationship between total species richness and probability of rare species presence in the logistic regression, while there was a significant positive relationship between the efficiency index and rare species probability of presence (Fig. 3b). This is important because, at least for rare plants in our study region, creating SDMs for a subset of rare species may be sufficient if the main goal is to locate new rare species sharing a given habitat type., rather than creating models to predict total species richness, which may require more information.

It must be noted that this efficiency index will not be useful for finding new rare plant species occurrences in every situation. Firstly, the efficiency index did not result in the discovery of any species ranked S1, those which are the most imperiled in the province. This is likely because these species are the least prevalent and thus any new population locations (if they exist) will be the most difficult to find. Le Lay et al. (2010) had similar results when attempting to discover new occurrences of two extremely rare plant species based on ensembles of SDMs.

Secondly, the efficiency index will not aid in the discovery of rare species that have very distinct habitat requirements and thus are not likely to occur in the MSC of our efficiency maps. The MSC represent areas that have especially suitable habitat based on the ecological requirements of many of the species modeled. Species that do not share these ecological requirements will not overlap in distribution. Consequently, new locations of this type of species will be more likely to occur in the SSC areas of the efficiency map. For example, Asplenium scolopendrium, a fern species that can only grow on limestone substrate (Oldham and Brinker 2009), was not found at any MSC locations we searched in 2017. There are contrasting results in the literature concerning whether or not species with distinct habitat requirements are more easily modeled with SDMs than generalists (Elith and Burgman 2002; Hernandez et al. 2006; Le Lay et al. 2010; Grenouillet et al. 2011; McCune 2016; Soultan and Safi 2017; Rhoden et al. 2017). If species with distinct habitats are in fact easily modeled, then their lack of discovery in our efficiency maps is not a major point of concern because any specialized species of particular interest can be separately modeled. The methodology presented here focuses on efficiently finding the most occurrences of rare plant species and not necessarily on finding individual species. If the latter is the desired conservation outcome, an individual SDM should be used.

Of course, our models, like all SDMs, were imperfect. For example, the field surveys that informed the models likely did not sample the full niche space, and the AUC of our models may not perfectly reflect model accuracy, in part due to this measure’s effective lowering of species prevalence (Raes and ter Steege 2007). However, our results clearly indicate that even such imperfect models can be very good at efficiently indicating potential survey sites for rare species.

Like Williams et al. (2009), we found that many surveyed sites with high probability of occurrence lacked our rare species, which is likely a result of factors other than habitat suitability leading to species absence. Dispersal limitation could act as one of these additional distribution determinants. Suitable habitat may exist outside of where the species of interest is currently found, but the species’ physical inability to spread to these areas through seeds or spores restricts its distribution to a smaller area. Previous research has found differences in the amount of predicted suitable habitat when dispersal is or is not accounted for in plant distribution models, with less habitat available when dispersal is restricted (Krause et al. 2015). Given that the ability of plants to disperse is limited and that dispersal can be especially problematic for rare plant species (Primack and Miao 1992), dispersal limitations should be kept in mind when creating models for rare plant species. Another limit to plant species real distributions beyond the abiotic environment could be biotic interactions, including pollinators (Giannini et al. 2013), pathogens (Bueno de Mesquita et al. 2016), and competition from other plant species (Meier et al. 2010; Pellissier et al. 2010). Modelers may also choose to include the distributions of one or more interacting species to create more realistic models.

Our results did not show a clear difference among the weighting systems (species S-rank, model accuracy, both, neither) in predicting the presence of at least one rare plant species using our ΔAIC criterion. However, there was a strong relationship between the efficiency index and the estimated probability of occurrence of rare species. Thus, although the stacking of individual model outputs was useful for discovering new rare plant occurrences, the weighting of model outputs by threat level and model accuracy was unnecessary in this study for the goal of field site survey prioritization. Dunn et al. (2016) also found that weighting SDMs did not change the areas highlighted for conservation very much compared to unweighted SDM stacking. This does not mean that model weighting by either threat level or accuracy should be completely discounted for future S-SDM analysis. It is possible that a similar weighting system used in another study area, with a different combination of species, or with more surveyed cells would have different results.