Skip to main content

Advertisement

Log in

Effects of sample size, data quality, and species response in environmental space on modeling species distributions

  • Research Article
  • Published:
Landscape Ecology Aims and scope Submit manuscript

Abstract

Context

There have been many studies using species distribution models (SDMs) to predict shifts in species distributions due to environmental changes, but few consider effects of data quantity, data quality, or species response shape. Modeling studies using field-sampled data may be impaired to an unknown degree by lack of knowledge on species’ true relationships with environmental changes.

Objectives

Using simulations with known relationships we assess model predictions, and investigate which models are more sensitive to sample size, detection limit, or species response shape issues when different SDMs are used for predicting species distribution shifts under environmental changes.

Methods

We simulated 16 species response relationships to ecological gradients differing in response shape (skewness and kurtosis) using a generalized β-function. Populations were randomly sampled at different sample sizes and detection limits. Linear discriminant analysis (LDA), multiple logistic regression (MLR), generalized additive models (GAM), boosted regression trees (BRT), random forests (RF), artificial neural networks (ANN), and maximum entropy models (MaxEnt) were developed on sampled datasets and compared for predicting species occurrence. We used these SDMs to predict distribution patterns for virtual species with different response shapes across a real landscape of varying heterogeneity in environmental conditions, and compared them with the probability of occurrence generated by the β-function.

Results

GAM and BRT were sensitive to both sample size and detection limit changes; RF was more affected by detection limit; ANN and MaxEnt were more affected by sample size; LDA and MLR were sensitive to species response shape changes.

Conclusions

Overall, if little is known about species response to environmental changes, ANN is recommended especially for large sample size. If a focal species is likely to occur only in a narrow range of environmental conditions, GAM and BRT are preferred for large good-quality datasets, and GAM tends to perform slightly better under varied data conditions; RF is recommended for limited amounts of good-quality data. If a focal species is likely to be present in a wide range of environmental conditions, MaxEnt is preferred but caution should be taken for small sample size. If the goal is to identify potential distributions of invasive or endangered species but data quantity and quality are very limited, LDA and MLR are recommended as they generally provide reasonable model sensitivity.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

  • Aiello-Lammens ME, Boria RA, Radosavljevic A, Vilela B, Anderson RP (2015) spThin: an R package for spatial thinning of species occurrence records for use in ecological niche models. Ecography 38:541–545

    Article  Google Scholar 

  • Araújo MB, Pearson RG, Thuiller W, Erhard M (2005) Validation of species-climate impact models under climate change. Glob Chang Biol 11:1504–1513

    Article  Google Scholar 

  • Austin MP (1976) On non-linear species response models in ordination. Vegetation 33:33–41

    Article  Google Scholar 

  • Austin MP (1985) Continuum concept, ordination methods, and niche theory. Annu Rev Ecol Syst 16:39–61

    Article  Google Scholar 

  • Austin MP, Belbin L, Meyers JA, Doherty MD, Luoto M (2006) Evaluation of statistical models used for predicting plant species distributions: role of artificial data and theory. Ecol Model 199:197–216

    Article  Google Scholar 

  • Bateman BL, VanDerWal J, Williams SE, Johnson CN (2012) Biotic interactions influence the projected distribution of a specialist mammal under climate change. Divers Distrib 18:861–872

    Article  Google Scholar 

  • Begon M, Howarth RW, Townsend CR (2014) Essentials of ecology, 4th edn. Wiley, Chichester

    Google Scholar 

  • Blanchet FG, Legendre P, Borcard D (2008) Modelling directional spatial processes in ecological data. Ecol Model 215:325–336

    Article  Google Scholar 

  • Bolker BM, Brooks ME, Clark CJ et al (2009) Generalized linear mixed models: a practical guide for ecology and evolution. Trends Ecol Evol 24:127–135

    Article  PubMed  Google Scholar 

  • Bouchet PJ, Peterson AT, Zurell D et al (2019) Better model transfers require knowledge of mechanisms. Trends Ecol Evol 34:489–490

    Article  PubMed  Google Scholar 

  • Breiman L (2001) Random forests. Mach Learn 45:5–32

    Article  Google Scholar 

  • Buisson L, Thuiller W, Lek S, Lim P, Grenouillet G (2008) Climate change hastens the turnover of stream fish assemblages. Glob Chang Biol 14:2232–2248

  • Bulluck L, Fleishman E, Betrus C, Blair R (2006) Spatial and temporal variations in species occurrence rate affect the accuracy of occurrence models. Global Ecol Biogeogr 15:27–38

    Article  Google Scholar 

  • Capinha C, Anastacio P (2011) Assessing the environmental requirements of invaders using ensembles of distribution models. Divers Distrib 17:13–24

    Article  Google Scholar 

  • Christin S, Hervet É, Lecomte N (2019) Applications for deep learning in ecology. Methods Ecol Evol 10:1632–1644

    Article  Google Scholar 

  • Coelho MTP, Barreto E, Rangel TF et al (2023) The geography of climate and the global patterns of species diversity. Nature. https://doi.org/10.1038/s41586-023-06577-5

    Article  PubMed  PubMed Central  Google Scholar 

  • Connor T, Hull V, Viña A, Shortridge A, Tang Y, Zhang J, Wang F, Liu J (2018) Effects of grain size and niche breadth on species distribution modeling. Ecography 41:1270–1282

    Article  Google Scholar 

  • De Marco P, Diniz-Filho JAF, Bini LM (2008) Spatial analysis improves species distribution modelling during range expansion. Biol Lett 4:577–580

    Article  PubMed  PubMed Central  Google Scholar 

  • De’ath G, (2007) Boosted trees for ecological modeling and prediction. Ecology 88:243–251

    Article  PubMed  Google Scholar 

  • Dibble KL, Yackulic CB, Kennedy TA, Budy P (2015) Flow management and fish density regulate salmonid recruitment and adult size in tailwaters across western North America. Ecol Appl 25:2168–2179

    Article  PubMed  Google Scholar 

  • Dorazio RM (2012) Predicting the geographic distribution of a species from presence-only data subject to detection errors. Biometrics 68:1303–1312

    Article  PubMed  Google Scholar 

  • Dray S, Chessel D, Thioulouse J (2003) Co-inertia analysis and the linking of ecological data tables. Ecology 84:3078–3089

    Article  Google Scholar 

  • Early R, Sax DF (2014) Climatic niche shifts between species’ native and naturalized ranges raise concern for ecological forecasts during invasions and climate change. Glob Ecol Biogeogr 23:1356–1365

    Article  Google Scholar 

  • Elith J, Graham CH (2009) Do they? How do they? WHY do they differ? On finding reasons for differing performances of species distribution models. Ecography 32:66–77

    Article  Google Scholar 

  • Elith J, Leathwick JR (2009) Species distribution models: ecological explanation and prediction across space and time. Annu Rev Ecol Evol Syst 40:677–697

    Article  Google Scholar 

  • Elith J, Graham CH, Anderson RP et al (2006) Novel methods improve prediction of species’ distributions from occurrence data. Ecography 29:129–151

    Article  Google Scholar 

  • Elith J, Leathwick JR, Hastie T (2008) A working guide to boosted regression trees. J Anim Ecol 77:802–813

    Article  CAS  PubMed  Google Scholar 

  • Elith J, Phillips SJ, Hastie T et al (2011) A statistical explanation of MaxEnt for ecologists. Divers Distrib 17:43–57

    Article  Google Scholar 

  • Engler R, Guisan A, Rechsteiner L (2004) An improved approach for predicting the distribution of rare and endangered species from occurrence and pseudo-absence data. J Appl Ecol 41:263–274

    Article  Google Scholar 

  • Fei S, Yu F (2016) Quality of presence data determines species distribution model performance: a novel index to evaluate data quality. Landsc Ecol 31:31–42

    Article  Google Scholar 

  • Ficetola GF, Cagnetta M, Padoa-Schioppa E et al (2014) Sampling bias inverts ecogeographical relationships in island reptiles. Global Ecol Biogeogr 23:1303–1313

    Article  Google Scholar 

  • Fielding AH, Bell JF (1997) A review of methods for the assessment of prediction errors in conservation presence/absence models. Environ Conserv 24:38–49.

    Article  Google Scholar 

  • Foody GM (2008) GIS: biodiversity applications. Prog Phys Geogr 32:223–235

    Article  Google Scholar 

  • Franklin J (2013) Species distribution models in conservation biogeography: developments and challenges. Divers Distrib 19:1217–1223

    Article  Google Scholar 

  • Gábor L, Jetz W, Lu M, Rocchini D, Cord A, Malavasi M, Zarzo-Arias A, Barták V, Moudrý V (2022) Positional errors in species distribution modelling are not overcome by the coarser grains of analysis. Methods Ecol Evol 13:2289–2302

    Article  Google Scholar 

  • Gauch HG, Whittaker RH (1972a) Coenocline simulation. Ecology 53:446–451

    Article  Google Scholar 

  • Gauch HG, Whittaker RH (1972b) Comparison of ordination techniques. Ecology 53:868–875

    Article  Google Scholar 

  • Golding N, Purse BV (2016) Fast and flexible Bayesian species distribution modelling using Gaussian processes. Methods Ecol Evol 7:598–608

    Article  Google Scholar 

  • Graf R, Bollmann K, Suter W et al (2005) The importance of spatial scale in habitat models: Capercaillie in the Swiss Alps. Landsc Ecol 20:703–717

    Article  Google Scholar 

  • Grüss A, Thorson JT (2019) Developing spatio-temporal models using multiple data types for evaluating population trends and habitat usage. ICES J Mar Sci 76:1748–1761

    Article  Google Scholar 

  • Gu WD, Swihart RK (2004) Absent or undetected? Effects of non-detection of species occurrence on wildlife–habitat models. Biol Conserv 116:195–203

    Article  Google Scholar 

  • Guisan A, Thuiller W (2005) Predicting species distribution: offering more than simple habitat models. Ecol Lett 8:993–1009

    Article  PubMed  Google Scholar 

  • Hanley JA, McNeil BJ (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143:29–36

    Article  CAS  PubMed  Google Scholar 

  • Hastie T, Fithian W (2013) Inference from presence-only data; the ongoing controversy. Ecography 36:864–867

    Article  PubMed  PubMed Central  Google Scholar 

  • Hattab T, Albouy C, Lasram FBR et al (2014) Towards a better understanding of potential impacts of climate change on marine species distribution: a multiscale modelling approach. Glob Ecol Biogeogr 23:1417–1429

    Article  Google Scholar 

  • Hengl T, Sierdsema H, Radović A, Dilo A (2009) Spatial prediction of species’ distributions from occurrence-only records: combining point pattern analysis, ENFA and regression-kriging. Ecol Model 220:3499–3511

    Article  Google Scholar 

  • Hernandez PA, Graham CH, Master LL, Albert DL (2006) The effect of sample size and species characteristics on performance of different species distribution modeling methods. Ecography 29:773–785

    Article  Google Scholar 

  • Hirst CN, Jackson DA (2007) Reconstructing community relationships: the impact of sampling error, ordination approach, and gradient length. Divers Distrib 1:361–371

    Article  Google Scholar 

  • Hirzel AH, Helfer V, Metral F (2001) Assessing habitat-suitability models with a virtual species. Ecol Model 145:111–121

    Article  Google Scholar 

  • Hirzel AH, Hausser J, Chessel D, Perrin N (2002) Ecological-niche factor analysis: how to compute habitat-suitability maps without absence data? Ecology 83:2027–2036

    Article  Google Scholar 

  • Hocking DJ, Thorson JT, O’Neil K, Letcher BH (2018) A geostatistical state-space model of animal densities for stream networks. Ecol Appl 28:1782–1796

    Article  PubMed  Google Scholar 

  • Hui C, Veldtman R, McGeoch MA (2010) Measures, perceptions and scaling patterns of aggregated species distributions. Ecography 33:95–102

    Article  Google Scholar 

  • Hutchinson GE (1957) Concluding remarks. Cold Spring Harb Sym 22:415–427.

    Article  Google Scholar 

  • Jiménez L, Soberón J, Christen JA, Soto D (2019) On the problem of modeling a fundamental niche from occurrence data. Ecol Model 397:74–83

    Article  Google Scholar 

  • Jiménez-Valverde A, Lobo JM, Hortal J (2008) Not as good as they seem: the importance of concepts in species distribution modelling. Divers Distrib 14:885–890

    Article  Google Scholar 

  • Jiménez-Valverde A, Acevedo P, Barbosa AM, Lobo JM, Real R (2013) Discrimination capacity in species distribution models depends on the representativeness of the environmental domain. Glob Ecol Biogeogr 22:508–516

    Article  Google Scholar 

  • Johnson JM, Khoshgoftaar TM (2019) Survey on deep learning with class imbalance. J Big Data 6:27

    Article  Google Scholar 

  • JPL MUR MEaSUREs Project (2010) GHRSST Level 4 MUR Global Foundation Sea Surface Temperature Analysis. Ver. 2. PO.DAAC, CA, USA. https://doi.org/10.5067/ghgmr-4fj01. Accessed 25 July 2022

  • Karp MA, Brodie S, Smith JA et al (2023) Projecting species distributions using fishery-dependent data. Fish Fish 24:71–92

    Article  Google Scholar 

  • Kleisner KM, Fogarty MJ, McGee S, Hare JA, Moret S, Perretti CT, Saba VS (2017) Marine species distribution shifts on the US Northeast Continental Shelf under continued ocean warming. Prog Oceanogr 153:24–36

    Article  Google Scholar 

  • Kozak KH, Graham CH, Wiens JJ (2008) Integrating GIS-based environmental data into evolutionary biology. Trends Ecol Evol 23:141–148

    Article  PubMed  Google Scholar 

  • Krawczyk B (2016) Learning from imbalanced data: open challenges and future directions. Prog Artif Intell 5:221–232

    Article  Google Scholar 

  • Latimer AM, Wu S, Gelfand AE, Silander JA Jr (2006) Building statistical models to analyze species distributions. Ecol Appl 16:33–50

    Article  PubMed  Google Scholar 

  • Leathwick JR, Elith J, Hastie T (2006) Comparative performance of generalized additive models and multivariate adaptive regression splines for statistical modelling of species distributions. Ecol Model 199:188–196

    Article  Google Scholar 

  • Leroy B, Delsol R, Hugueny B et al (2018) Without quality presence–absence data, discrimination metrics such as TSS can be misleading measures of model performance. J Biogeogr 45:1994–2002

    Article  Google Scholar 

  • Lichstein JW, Simons TR, Shriner SA, Franzreb KE (2002) Spatial autocorrelation and autoregressive models in ecology. Ecol Monogr 72:445–463

    Article  Google Scholar 

  • Liu C, White M, Newell G (2011) Measuring and comparing the accuracy of species distribution models with presence-absence data. Ecography 34:232–243

    Article  CAS  Google Scholar 

  • Liu KY, Smith MR, Fear EC, Rangayyan RM (2013) Evaluation and amelioration of computer-aided diagnosis with artificial neural networks utilizing small-sized sample sets. Biomed Signal Proces 8:255–262

    Article  Google Scholar 

  • Liu C, Newell G, White M (2019) The effect of sample size on the accuracy of species distribution models: considering both presences and pseudo-absences or background sites. Ecography 42:535–548

    Article  CAS  Google Scholar 

  • Loke LHL, Chisholm RA (2023) Unveiling the transition from niche to dispersal assembly in ecology. Nature 618:537–542

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Lu M, Jetz W (2023) Scale-sensitivity in the measurement and interpretation of environmental niches. Trends Ecol Evol 38:554–567

    Article  PubMed  Google Scholar 

  • Lu M, Winner K, Jetz W (2021) A unifying framework for quantifying and comparing n-dimensional hypervolumes. Methods Ecol Evol 12:1953–1968

    Article  Google Scholar 

  • MacDougall D, Crummett WB et al (1980) Guidelines for data acquisition and data quality evaluation in environmental chemistry. Anal Chem 52:2242–2249

    Article  CAS  Google Scholar 

  • Manzoor SA, Griffiths G, Lukac M (2018) Species distribution model transferability and model grain size—finer may not always be better. Sci Rep 8:7168

    Article  PubMed  PubMed Central  Google Scholar 

  • McGarigal K, Wan HY, Zeller KA et al (2016) Multi-scale habitat selection modeling: a review and outlook. Landsc Ecol 31:1161–1175

    Article  Google Scholar 

  • Meynard CN, Kaplan DM (2013) Using virtual species to study species distributions and model performance. J Biogeogr 40:1–8

    Article  Google Scholar 

  • Meynard CN, Quinn JF (2007) Predicting species distributions: a critical comparison of the most common statistical models using artificial species. J Biogeogr 34:1455–1469

    Article  Google Scholar 

  • Meynard CN, Leroy B, Kaplan DM (2019) Testing methods in species distribution modelling using virtual species: what have we learnt and what are we missing? Ecography 42:2021–2036

    Article  Google Scholar 

  • Miguet P, Jackson HB, Jackson ND et al (2016) What determines the spatial extent of landscape effects on species? Landsc Ecol 31:1177–1194

    Article  Google Scholar 

  • Miller JA (2014) Virtual species distribution models: using simulated data to evaluate aspects of model performance. Prog Phys Geogr 38:117–128

    Article  Google Scholar 

  • Miller DAW, Brehme CS, Hines JE, Nichols JD, Fisher RN (2012) Joint estimation of habitat dynamics and species interactions: disturbance reduces co-occurrence of nonnative predators with an endangered toad. J Anim Ecol 81:1288–1297

    Article  PubMed  Google Scholar 

  • Minchin PR (1987a) An evaluation of the relative robustness of techniques for ecological ordination. Vegetation 69:89–107

    Article  Google Scholar 

  • Minchin PR (1987b) Simulation of multidimensional community patterns: towards a comprehensive model. Vegetation 71:145–156

    Article  Google Scholar 

  • Moisen GG, Frescino TS (2002) Comparing five modeling techniques for predicting forest characteristics. Ecol Model 157:209–225

    Article  Google Scholar 

  • Ocean Biology Processing Group (2003) MODIS Aqua Level 3 Global Monthly Mapped 4 km Chlorophyll a. Ver. 6. PO.DAAC, CA, USA. http://oceandata.sci.gsfc.nasa.gov/MODISA/Mapped/Monthly/4km/chlor/. Accessed 25 July 2022

  • Olden JD, Jackson DA (2002) Illuminating the “black box”: understanding variable contributions in artificial neural networks. Ecol Model 154:135–150

    Article  Google Scholar 

  • Olden JD, Neff BD (2001) Cross-correlation bias in lag analysis of aquatic time series. Mar Biol 138:1063–1070

    Article  Google Scholar 

  • Olden JD, Jackson DA, Peres-Neto PR (2002) Predictive models of fish species distributions: a note on proper validation and chance predictions. Trans Am Fish Soc 131:329–336

    Article  Google Scholar 

  • Olea PP, Mateo-Tomás P (2011) Spatially explicit estimation of occupancy, detection probability and survey effort needed to inform conservation planning. Divers Distrib 17:714–724

    Article  Google Scholar 

  • Olson CA, Beard KH, Koons DN et al (2012) Detection probabilities of two introduced frogs in Hawaii: implications for assessing non-native species distributions. Biol Invasions 14:889–900

    Article  Google Scholar 

  • Peres-Neto PR, Jackson DA, Somers KM (2005) How many principal components? Stopping rules for determining the number of non-trivial axes revisited. Comput Stat Data an 49:974–997

    Article  Google Scholar 

  • Petit LJ, Petit DR (1996) Factors governing habitat selection by prothonotary warblers: field tests of the Fretwell-Lucas models. Ecol Monogr 66:367–387

    Article  Google Scholar 

  • Phillips SJ, Anderson RP, Schapire RE (2006) Maximum entropy modeling of species geographic distributions. Ecol Model 190:231–259

    Article  Google Scholar 

  • Phillips SJ, Dudík M, Elith J et al (2009) Sample selection bias and presence-only distribution models: implications for background and pseudo-absence data. Ecol Appl 19:181–197

    Article  PubMed  Google Scholar 

  • Pichler M, Hartig F (2023) Machine learning and deep learning – A review for ecologists. Methods Ecol Evol 14:994–1016

    Article  Google Scholar 

  • Ponti R, Sannolo M (2023) The importance of including phenology when modelling species ecological niche. Ecography 2023:e06143

    Article  Google Scholar 

  • Puy A, Beneventano P, Levin SA, Lo Piano S, Portaluri T, Saltelli A (2022) Models with higher effective dimensions tend to produce more uncertain estimates. Sci Adv 8:9450

    Article  Google Scholar 

  • Qiao H, Soberón J, Peterson AT (2015) No silver bullets in correlative ecological niche modeling: insights from testing among many potential algorithms for niche estimation. Methods Ecol Evol 6:1126–1136

    Article  Google Scholar 

  • Quinn GP, Keough MJ (2002) Experimental design and data analysis for biologists. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • R Development Core Team (2022) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.r-project.org/

  • Real R, Barbosa AM, Vargas JM (2006) Obtaining environmental favourability functions from logistic regression. Environ Ecol Stat 13:237–245

    Article  Google Scholar 

  • Reese GC, Wilson KR, Hoeting JA, Flather CH (2005) Factors affecting species distribution predictions: a simulation modeling experiment. Ecol Appl 15:554–564

    Article  Google Scholar 

  • Richter IA, Giacomini HC, de Kerckhove DT, Jackson DA, Jones NE (2022) Correcting for size bias in electrofishing removal samples. Ecol Model 467:109929

    Article  Google Scholar 

  • Riva F, Graco-Roza C, Daskalova GN, Hudgins EJ, Lewthwaite JMM, Newman EA, Ryo M, Mammola S (2023) Toward a cohesive understanding of ecological complexity. Sci Adv 9:eabq4207

    Article  PubMed  PubMed Central  Google Scholar 

  • Royle JA, Nichols JD (2003) Estimating abundance from repeated presence-absence data or point counts. Ecology 84:777–790

    Article  Google Scholar 

  • Royle JA, Chandler RB, Yackulic C, Nichols JD (2012) Likelihood analysis of species occurrence probability from presence-only data for modelling species distributions. Methods Ecol Evol 3:545–554

    Article  Google Scholar 

  • Santika T (2011) Assessing the effect of prevalence on the predictive performance of species distribution models using simulated data. Global Ecol Biogeogr 20:181–192

    Article  Google Scholar 

  • Santika T, Hutchinson MF (2009) The effect of species response form on species distribution model prediction and inference. Ecol Model 220:2365–2379

    Article  Google Scholar 

  • Sax DF, Early R, Bellemare J (2013) Niche syndromes, species extinction risks, and management under climate change. Trends Ecol Evol 28:517–523

    Article  PubMed  Google Scholar 

  • Seoane J, Carrascal LM, Alonso CL, Palomino D (2005) Species-specific traits associated to prediction errors in bird habitat suitability modelling. Ecol Model 185:299–308

    Article  Google Scholar 

  • Skalak DB, Niculescu-Mizil A, Caruana R (2007) Classier loss under metric uncertainty. Machine Learning: ECML 2007, Lecture Notes in Computer Science, 4701, pp 310–322, Springer Berlin Heidelberg.

  • Strebel N, Kéry M, Guélat J, Sattler T (2022) Spatiotemporal modelling of abundance from multiple data sources in an integrated spatial distribution model. J Biogeogr 49:563–575

    Article  Google Scholar 

  • Swan JMA (1970) An examination of some ordination problems by use of simulated vegetation data. Ecology 51:89–102

    Article  Google Scholar 

  • Thibaud E, Petitpierre B, Broennimann O, Davison AC, Guisan A (2014) Measuring the relative effect of factors affecting species distribution model predictions. Methods Ecol Evol 5:947–955

    Article  Google Scholar 

  • Thorson JT (2019a) Forecast skill for predicting distribution shifts: a retrospective experiment for marine fishes in the Eastern Bering Sea. Fish Fish 20:159–173

    Article  Google Scholar 

  • Thorson JT (2019b) Guidance for decisions using the Vector Autoregressive Spatio-Temporal (VAST) package in stock, ecosystem, habitat and climate assessments. Fish Res 210:143–161

    Article  Google Scholar 

  • Uriarte M, Yackulic CB, Lim Y, Arce-Nazario JA (2011) Influence of land use on water quality in a tropical landscape: a multi-scale analysis. Landsc Ecol 26:1151–1164

    Article  PubMed  PubMed Central  Google Scholar 

  • Valladares F, Matesanz S, Guilhaumon F et al (2014) The effects of phenotypic plasticity and local adaptation on forecasts of species range shifts under climate change. Ecol Lett 17:1351–1364

    Article  PubMed  Google Scholar 

  • Venables WN, Ripley BD (2002) Modern applied statistics with S, 4th edn. Springer, New York. https://www.stats.ox.ac.uk/pub/MASS4/

  • Wagner T, Hansen GJA, Schliep EM, Bethke BJ, Honsey AE, Jacobson PC, Kline BC, White SL (2020) Improved understanding and prediction of freshwater fish communities through the use of joint species distribution models. Can J Fish Aquat Sci 77:1540–1551

    Article  Google Scholar 

  • Walker SC, Jackson DA (2011) Random-effects ordination: describing and predicting multivariate correlations and co-occurrences. Ecol Monogr 81:635–663

    Article  Google Scholar 

  • Wang L, Jackson DA (2011) Modeling the establishment of invasive species: habitat and biotic interactions influencing the establishment of Bythotrephes longimanus. Biol Invasions 13:2499–2512

    Article  Google Scholar 

  • Wang L, Jackson DA (2014) Shaping up model transferability and generality of species distribution modeling for predicting invasions: implications from a study on Bythotrephes longimanus. Biol Invasions 16:2079–2103

    Article  Google Scholar 

  • Wang L, Kerr LA, Record NR et al (2018) Modeling marine pelagic fish species spatiotemporal distributions utilizing a maximum entropy approach. Fish Oceanogr 27:571–586

    Article  Google Scholar 

  • Warren DL, Glor RE, Turelli M (2010) ENMTools: a toolbox for comparative studies of environmental niche models. Ecography 33:607–611

    Article  Google Scholar 

  • Wenger SJ, Olden JD (2012) Assessing transferability of ecological models: an underappreciated aspect of statistical validation. Methods Ecol Evol 3:260–267

    Article  Google Scholar 

  • Wintle BA, McCarthy MA, Parris KM, Burgman MA (2004) Precision and bias of methods for estimating point survey detection probabilities. Ecol Appl 14:703–712

    Article  Google Scholar 

  • Wisnoski NI, Andrade R, Castorani MCN, Catano CP, Compagnoni A, Lamy T, Lany NK et al (2023) Diversity-stability relationships across organism groups and ecosystem types become decoupled across spatial scales. Ecology 104(9):e4136

    Article  PubMed  Google Scholar 

  • Wisz MS, Hijmans RJ, Li J et al (2008) Effects of sample size on the performance of species distribution models. Divers Distrib 14:763–773

    Article  Google Scholar 

  • Wood S (2017) Generalized additive models: an introduction with R, 2nd edn. Chapman and Hall/CRC, Boca Raton

    Book  Google Scholar 

  • Xu W-B, Blowes SA, Brambilla V et al (2023) Regional occupancy increases for widespread species but decreases for narrowly distributed species in metacommunity time series. Nat Commun 14:1463

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Yackulic CB, Ginsberg JR (2016) The scaling of geographic ranges: implications for species distribution models. Landsc Ecol 31:1195–1208

    Article  Google Scholar 

  • Yackulic CB, Chandler R, Zipkin EF, Royle JA, Nichols JD, Campbell Grant EH, Veran S (2013) Presence-only modelling using MAXENT: when can we trust the inferences? Methods Ecol Evol 4:236–243

    Article  Google Scholar 

  • Yackulic CB, Reid J, Nichols JD, Hines JE, Davis R, Forsman E (2014) The roles of competition and habitat in the dynamics of populations and species distributions. Ecology 95:265–279

    Article  PubMed  Google Scholar 

  • Yackulic CB, Nichols JD, Reid J, Der R (2015) To predict the niche, model colonization and extinction. Ecology 96:16–23

    Article  Google Scholar 

  • Yates KL, Bouchet PJ, Caley MJ et al (2018) Outstanding challenges in the transferability of ecological models. Trends Ecol Evol 33:790–802

    Article  PubMed  Google Scholar 

  • Yates LA, Aandahl Z, Richards SA, Brook BW (2023) Cross validation for model selection: a review with examples from ecology. Ecol Monogr 93:e1557

    Article  Google Scholar 

  • Young M, Cavanaugh K, Bell T et al (2016) Environmental controls on spatial patterns in the long-term persistence of giant kelp in central California. Ecol Monogr 86:45–60

    Article  Google Scholar 

  • Zhu K, Woodall CW, Ghosh S, Gelfand AE, Clark JS (2014) Dual impacts of climate change: forest migration and turnover through life history. Glob Chang Biol 20:251–264

    Article  PubMed  Google Scholar 

  • Zurell D, Jeltsch F, Dormann CF, Schröder B (2009) Static species distribution models in dynamically changing systems: how good can predictions really be? Ecography 32:733–744

    Article  Google Scholar 

Download references

Acknowledgements

This project was funded by the Natural Sciences and Engineering Research Council of Canada (NSERC) through both the Canadian Aquatic Invasive Species Network (CAISN) and Discovery Grant programs to DAJ and by the University of Toronto. We thank Marie-Josée Fortin, Charles K. Minns, Brian J. Shuter, Steven C. Walker, Norman D. Yan, Nicholas E. Mandrak, and Derrick T. de Kerckhove for discussions and comments on early versions of this manuscript.

Funding

This project was funded by the Natural Sciences and Engineering Research Council of Canada (NSERC) through both the Canadian Aquatic Invasive Species Network (CAISN) and Discovery Grant programs to DAJ and by the University of Toronto.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the study conception and design. Data simulation and analysis were performed by LW. The first draft of the manuscript was written by LW and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Lifei Wang.

Ethics declarations

Competing interests

The authors have no relevant financial or non-financial interests to disclose.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 12738 kb)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, L., Jackson, D.A. Effects of sample size, data quality, and species response in environmental space on modeling species distributions. Landsc Ecol 38, 4009–4031 (2023). https://doi.org/10.1007/s10980-023-01771-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10980-023-01771-2

Keywords

Navigation