Introduction

Individual-based models (IBMs)—also known as agent-based models (ABMs)—have a long history across a wide array of systems for simulating bottom-up mechanistic processes and producing emergent top-down patterns (see Box 1 for definitions of italicized terminology). IBMs are often used when considering individually variable traits, such as behavior and physiology, necessary to produce mechanistic simulations to understand population-level patterns and processes and have been broadly applied in aquatic ecology and management [1]. While IBMs have been widely adopted in academic research focused on aquatic systems, their implementation into plans for conservation and management has lagged (i.e., the research-implementation gap [2]). IBMs have at times been criticized for being a “black box” due to a lack of transparency and approachability. Our primary goals with this review/prospectus are to highlight the utility of IBMs for practical applications while opening the IBM “black box” for practitioners in aquatic systems. First, we introduce the general concepts, history, and terminology related to IBMs and describe their potential for research and management such as those related to aquatic systems, specifically for eco-evolutionary (i.e., “eco-evo”) simulation modeling incorporating demogenetic processes. Second, we provide case studies to highlight the utility and practicality of using IBMs to answer a wide variety of eco-evo questions important for the management and conservation of aquatic ecosystems. We focus on the current utility and future potential of IBMs for eco-evo processes and highlight studies that have used this approach, including (1) the conservation and management of native species in systems impacted by invasive species, (2) life history evolution and implications for fisheries management, (3) predictions of the interaction between changing environments and management decisions, and (4) testing factors that drive system dynamics in order to evaluate consequences of management actions. Third, we discuss the future of IBMs, including ongoing challenges, opportunities, and future considerations. In sum, we hope to provide encouragement and foundational knowledge for researchers to consider supplementing current projects with IBMs where appropriate.

IBM Concepts, History, and Usage

What is an IBM? Definition and History in Aquatic Systems

At their core, IBMs differ from analytical or equation-based models in that they simulate individual organisms, heterogeneity among individuals, and interactions among individuals and between individuals and their environment to consider changes through time and space (modified from [3] and [4]). This definition that we present represents an inclusive definition of both IBMs and ABMs, with ABMs being explicit about resource availability, adaptive behavior, and variation within age classes (for IBM versus ABM, see Box 1). However, the overall process for an IBM is similar to other modeling frameworks and generally may follow an iterative framework or “modeling cycle” that includes the development of research questions, software selection or model creation, and sensitivity analyses among other steps (Section 2.3 of [3]; Fig. 1).

Fig. 1
figure 1

Iterative model process, starting with the co-development of research questions among researchers and managers which can be applied to individual-based models to co-develop research and build toward making management decisions. After determining questions which need to be answered with simulations of individuals, factors within the environment will need to be selected, with an understanding of simplifying assumptions and pieces not captured by reality. Then, researchers should decide either the software or, if developing the model from scratch, the coding language (Table 1). After that, empirical data will need to be collected and the literature reviewed to parameterize the model. Once the model is created, it can be validated and evaluated, and then, researchers should complete a sensitivity analysis of model outcomes to different parameters. This can lead to inferences about model structure and the system, which may result in the need to collect more data and better parameterize the models. Conversely, these inferences may lead to sharing the results with collaborators and then publishing. This process may be implemented within structured decision-making and adaptive management but may also be pursued independently

Similar to various other models, IBMs can be used to forecast or hindcast and are generally appropriate for evaluating diverse scenarios. IBMs may be particularly useful when average values describing population characteristics are insufficient, and variation among individuals’ attributes (e.g., location, size) is critical for the effective modeling of social and ecological processes [5]. The process-based nature of IBMs also makes them useful when considering modeling when future or past conditions are outside of the values initially used to create a model. Modeling outside of the bounds of creation is inappropriate for several other model types. They also are useful for exploring emergent properties of complex systems [6], which arise from the fine-scale interactions occurring at the individual level. Many studies have used IBMs to address ecological questions, with several syntheses available on behavior, such as habitat selection [7, 8] and social interactions [9]. By coupling with geographic information systems (GIS), IBMs can directly incorporate real, spatially explicit environmental factors to answer questions related to spatial ecology and animal movements [5, 10]. This ability to incorporate spatially explicit data points to the particular utility of IBMs for landscape-level research. Evolutionary processes, however, have received less attention in the IBM literature, although the incorporation of evolution into ecological IBMs has been growing including for management purposes [11, 12], driven by research questions requiring the inclusion of individual genotypes (e.g., thermal tolerance and gene flow [13]) and increasing model complexity coupled with faster processing speeds and greater computational power.

Why use IBMs?

The reasons for the current growth of IBM use are similar to the three reasons previously outlined by Judson [14]: the difficulties of generality and modeling in ecology, ecological interactions of individuals are important, and computational resources, including computer power and software, have increased accessibility of modeling. As we learn more about systems, their complexity, or the number of independent and interacting components [15], makes generalized modeling efforts difficult [14]. Because individual organism characteristics and interactions between organisms may drive system patterns, understanding these processes has gained greater interest. Moreover, improvements in computational resources have only continued to expand since the publication of Judson [14].

IBMs are prevalent across scientific disciplines, including many subdisciplines of biology and social science, and as the field has grown and computational resources and methodologies have improved, researchers have been able to simulate individuals with both ecological and evolutionary processes occurring in tandem, moving into the realm of eco-evo simulations. For many research questions and management goals, the interaction of ecological and evolutionary processes is important for understanding overall system dynamics. Furthermore, these eco-evo processes and the emergent patterns of the system are often best addressed by simulating individuals, including population viability [16, 17], hybridization [18], invasive species management [19], and impacts of barriers to movement [20]. These eco-evo models expand the realm of areas where IBMs are able to help explore research questions related to management goals that cannot be effectively examined using empirical approaches alone or other modeling techniques. Because of the utility of IBMs to management where eco-evo models are beneficial, there are many examples in the literatures of their uses. This includes software that is ready to be used immediately, as well as modeling efforts where coding is done from the bottom up. Across these approaches, there are numerous examples in aquatic systems, often within a spatially explicit context (Table 1).

Table 1 Examples of software using individual-based models to run simulations to answer ecological or evolutionary biology questions at the landscape level. Broadly, researchers either use existing published software or established common programming languages to build models themselves. Example usages in bold represent aquatic examples, although non-aquatic example model structure may be applicable to other systems

IBM Case Studies and Management/Conservation Applications for Aquatic Systems

Because IBMs are mechanistic and have the capability of being both broad- and fine-scale, they are particularly useful in the context of management programs. As compared to population-based and stage-structured models of differential equations, they also provide the ability to be more easily transferred to different systems, future scenarios (e.g., climate change), and novel management practices as they are developed due to the ability to better understand the emergent properties of the system while simulating the various mechanistic pieces at the individual level [25]. In addition, the emergence and transferability of mechanistic models make them attractive for their ability to be more robust to novel scenarios and environments than classical modeling approaches [25]. To illustrate these characteristics of IBMs, in each subsection below, we introduce the topic being covered, examples of IBM applications, and some of the reasons why IBMs provide a benefit over other modeling approaches. Although not all examples from the literature provided are eco-evo IBMs in aquatic systems, the topics being covered lend themselves to understanding potential future dynamics in aquatic systems. In addition, many of these aquatic examples are interested in questions at the landscape extent and are spatially explicit because of the necessity to include space when addressing management goals and research questions. However, in some cases, IBMs may be spatially implicit or have a pseudo-geography because they are based on relative distances or positions [26]. However, these may not be directly tied to environmental heterogeneity, and the position of features (landscape configuration) of the environment and may not be appropriate for certain research questions and fields (e.g., landscape genetics).

Models for Understanding and Mitigating Invasive Species Impacts

Management and conservation of native species in systems affected by invasive species present unique challenges to resource managers. Often, goals involve the evaluation of mechanisms underlying successful invasions and strategies to control or eradicate invasive species while simultaneously minimizing additional impacts on native species. IBMs can help model such systems by incorporating individual interactions which may be critical for determining population patterns and informing management decisions. An example of an IBM-informed management program that considers both native and non-native aquatic species is the invasive brook trout (Salvelinus fontinalis) suppression program in Sullivan Creek in northeastern Washington State, USA [27]. This program is focused on eliminating brook trout in Sullivan Creek to benefit the native cutthroat trout (Oncorhynchus clarkii lewisi). In concert with continued mechanical suppression activities in the watershed, a novel approach was introduced in which brook trout with two Y chromosomes (YY-males) are bred in captivity and then released in Sullivan Creek to reproduce with the naturalized brook trout population [28]. The goal of the YY-male program is to shift the sex ratio of the naturalized brook trout population until no females are left and the population is extirpated, because no long-term data sets exist to guide the best management actions, a spatially explicit IBM of the brook trout population to model the performance of released YY-males, which simulated the movement, reproduction, inheritance of chromosomes, and mortality of individual fish [29]. Using an IBM was an appropriate modeling approach, because there were no broad-scale patterns with which to consider potential outcomes. Instead, fine-scale, individual-based interactions parameterized with empirical data were used to ultimately scale up to emergent model outcomes (Fig. 2 [30]). This IBM has been critical for informing the best approach for implementing the YY-male program in several ways. First, simulation outcomes provided annual benchmarks against which the ongoing management was compared and progress toward brook trout extirpation was reevaluated (Fig. 2). Second, the model identified key drivers of brook trout extirpation, informing the optimal use of limited resources and reducing time to achieving management goals. For example, simulations demonstrated a strong effect on the reproductive status of stocked YY fish, supporting the management decision to produce a higher ratio of mature to immature YY fish in the hatchery [30]. Finally, the modeling work identified key gaps in information, guiding empirical work that focused on factors such as YY-male dispersal and fitness because of the significant effect that these factors had on model outcomes [29]. In turn, ongoing data collection informed the accuracy of the model, which was then recalibrated to produce results that were more closely aligned with reality. Thus, management actions and the IBM worked iteratively to understand how to optimize outcomes for this novel management method.

Fig. 2
figure 2

Including invasive species impacts. Emergent simulation outcomes of brook trout abundance before and after introduction of YY males (top) and the proportion of ~ 200 m patches occupied in the stream network of the Sullivan Creek Watershed in Washington State (bottom) where brook trout are invasive. Lines represent averages across 10 simulation replicates and polygons represent 95% confidence bands. Population declines occur due to skewing the sex ratio through the release of captive-raised individuals. Figure reproduced from Day et al. [29]

Considering Life History Evolution Impacts on the Management of Fisheries

Evolutionary impacts have important implications for fisheries management [31, 32]. The potential evolutionary effects of fisheries harvest have been explored through a variety of study approaches including controlled experiments [33], genetic analyses [34], and statistical inquiries of multi-year assessment data [35]. However, taken as a whole, these approaches do not adequately account for variation across time, space, or ecogenetic processes driven by individual interactions, providing a unique opportunity for IBMs. IBMs are useful when considering fisheries-induced evolution given the interaction between ecological and evolutionary dynamics, the multi-generational effects of selection, and the ability to evaluate multiple harvest scenarios, as well as accounting for potential short-term ecological and long-term evolutionary consequences of environmental stressors and targeted management actions, such as harvest [36, 37]. Historically, IBMs allowed for individual phenotypic variation (e.g., in foraging, growth, and reproductive rates) to arise through chance encounters which would be exacerbated through interactions and feedbacks. Ecogenetic IBMs (or demo-genetic agent-based models [16]) incorporate genetic differences among individuals with phenotypic differences arising through a combination of genetic differences, stochastic events, and ecological feedbacks. These have been developed and applied to model fisheries-induced evolution in both generalized theoretical frameworks and to assess effects on specific freshwater and marine fish stocks. Ecogenetic IBMs represent a subset of potential eco-evo IBMs, although many models incorporating evolution may require genetic differences among individuals to be explicitly simulated.

As a specific example, fisheries harvest based on individual size or some other desirable characteristic can eliminate individuals with the highest reproductive output. Such discriminatory harvest exerts an additional selective force that can dramatically alter the combinations of life history traits that are appropriate for maximizing lifetime reproductive success. Due to this increased selection favoring certain sizes or characteristics, harvest can lead to changes in average behaviors, growth rates, maturation schedules, and ultimately the productive capacity of fish populations [38]. Several studies indicate that targeted harvesting of older, larger fish selects for maturation at an earlier age and smaller size [39]. Earlier maturation likely comes at a cost to growth and may exacerbate recruitment [40]. Evolution induced by harvest could decrease the productive capacity of fish stocks and the ability of these stocks to recover during fishery closures [41]. At the same time, detecting evolutionary effects from harvest is not straightforward. In particular, the ecological and evolutionary responses to harvest may actually mask each other. These models allow for fisheries harvest to act simultaneously on ecological and evolutionary dynamics, and patterns can be related to empirical data [42].

Collectively, such IBMs have evaluated the response to the harvest of traits such as maturation schedules, growth, behavior, and reproductive investment [16, 43, 44]. In turn, at a population level, IBMs have helped reveal effects on stock productive potential [45], the ability of stocks to recover during fishery closures [46], and potential economic repercussions of fisheries-induced evolution [47]. As a specific example, Ayllón et al. [48•] used IBMs to simulate the interaction of harvest and life history evolution with climate change. They looked at the response of brown trout (Salmo trutta) across > 1500 combinations of climate changes and harvest scenarios by using artificial neural networks and found that any size selection during the harvest process in conjunction with climate changes could lead to population declines. Even without climate change, conservation goals often were not met except under limited size-selection scenarios. Ultimately, they suggested that banning harvest may be necessary with climatic changes to conserve existing populations.

Incorporating Climate Change and Adaptive Capacity to Guide Management

The application of IBMs in understanding species response to climate change was identified by Shugart et al. [49] 30 years ago. Climate change is an inherently spatial process, including heterogeneous rates of warming across the globe [50]. The adaptive capacity of a species is its ability to accommodate, cope with, or respond to changing conditions, via dispersal, acclimation, or phenotypic plasticity, and evolution [51,52,53,54]. Overall, complex feedbacks exist between the three components of adaptive capacity, and IBMs provide a mechanism for controlling for a single factor while varying the others [11]. For example, evolution may reduce extinction risk through high genetic variance [50]. Because these processes span the interface of ecology and evolution which directly are impacted by how individuals interact with each other and the environment, complex IBMs can be useful for estimating range shifts and persistence. In contrast, simpler modeling approaches—including early species distribution modeling techniques—may not effectively capture important processes [55]. In addition, climate change impacts may have synergistic with landscape change in aquatic systems [56], increasing overall management challenges. The incorporation of multiple processes is challenging, and the ability to model some processes independently, such as plasticity, in a spatially explicit manner is still needing further development [11, 57]. These challenges have limited implementation of IBMs which cover the eco-evo processes influencing adaptive capacity; for example, in marine systems, there are a limited number of studies using IBMs to evaluate adaptive capacity and instead have focused only on genetic adaptation [11, 13]. However challenging, by modeling with the eco-evo processes present across a spatially heterogeneous environment, researchers may over- or underestimate adaptive capacity. For example, a model which only includes local adaptation may underestimate the probability of population persistence in species with high levels of phenotypic plasticity. One important aspect of modeling is that some processes may need to act independently, because local adaptation and plasticity may vary spatially (e.g., the size of response with latitude [58]). However, not all aspects are independent, such that plasticity itself may evolve, or that local adaptation is interconnected with gene flow [51, 59].

Multiple examples of forecasting models covering eco-evo processes and the impacts of climate change across varied taxa exist, with model complexity varying with a priori knowledge, computational needs, and specific aims of the project (see Table 1). Depending on the specific research question and system, these forecasting simulations can be population instead of individual-based [60]. Early IBMs related to climate change were focused on dispersal or other ecological processes [10]. Models now routinely track not just the individual location, phenotype, and movement, but also the underlying genotype. This allows for the investigation of eco-evo process relationships, such as dispersal and local adaptation [61]. From a management standpoint, these simulations may also highlight the impact of the translocation of individuals under an assisted gene flow framework [60, 62, 63].

Testing Factors that Drive System Dynamics in Order to Inform Management Decisions

A crucial step in all model development is to demonstrate that it accomplishes its intended purpose to answer the desired research question and produces outputs that match expectations under controlled test conditions. IBMs are developed using literature and field estimates. However, decision-makers need to know whether a model is a sufficiently accurate representation of a natural system and can answer their research question. In some cases, models may not need to perfectly represent reality if they are able to address the specific question [64]. The “evaludation” framework, which combines validated and evaluation, provides guidelines for understanding the effects of parameters on model processes and outcomes [65]. Without such an understanding, model results are vulnerable to spurious or biased outcomes, whereby model results are not a function of any tested mechanism but of random chance as a result of some particular combination of parameters. Spurious results can be limited by developing and testing mechanistic hypotheses within the model but also by conducting sensitivity analysis to understand relationships among model parameters and model outcomes [66, 67]. Sensitivity analysis becomes more important as uncertainty in model parameterization increases, and the range of uncertainty can be explored. Furthermore, sensitivity analysis should be conducted not only on key parameters of interest, but within the range of uncertainty on all model parameters to ensure that the model functions as intended and continues to behave realistically under extreme parameter values. It is also important to note that the sensitivity of aquatic systems to different factors may vary through time [68]. Which parameters to conduct the sensitivity analysis on and the breadth of parameter space explored, using methods such as Latin hypercube sampling or a fully factorial design, is an important consideration and may be limited based on available computational resources [69, 70]. Conversely, for some researchers, the ability to test an increasing range of parameter values and the overall number of models continues to become easier as access to computing clusters and the resources within those clusters continues to improve. Overall, sensitivity analysis highlights those parameters that have the greatest influence on simulation outcomes.

In IBMs, sensitivity analysis is particularly important because model outcomes and relationships among variables can be difficult to predict due to the emergence of patterns that result from many non-linear functions and fine-scale, including higher-order, interactions [71]. Multiple approaches and methods are available [66], but they can be broadly broken into two categories: global and local. Global sensitivity analysis is useful for parameters of low interest to the researcher and is conducted by simultaneously varying parameter values across hyper-dimensional parameter space and extracting the effects of individual parameters on model outcomes [29]. Local sensitivity analysis is most useful for key parameters of interest to model hypotheses and usually involves holding values for all but one parameter constant and exploring the parameter of interest [72]. In addition, sensitivity to model structure should be considered, such as changes in inference if certain processes are included or not, and the form of the process, such as whether the process is stochastic or not. Structural uncertainty and determining the necessary processes to answer the research question can be challenging [73].

To help ground this process, it is worth considering the example of Baggio et al. [74] which explored the genetic differentiation of fish populations separated by dams. Using a demographic-genetic model, researchers assessed movement within metapopulations before and after dams, with tributary spawning groups acting as subpopulations. Sensitivity analysis was conducted to determine which parameters most influenced population structure. They considered the straying rate, carrying capacity, mutation rates, mortality rate, and permeability of the system through the addition of a dam (see Table 2 of the manuscript). Sensitivity revealed that population structure stabilized around 200 generations, and it was not sensitive to mutation rate but was sensitive to straying rate, carrying capacity, and mortality rates. These relationships were not always linear, and these sensitivity analyses helped to highlight the need to monitor spawning sites and consider the impacts of different connectivity scenarios.

Current Challenges and Future Opportunities

Incorporating Multi-species Interactions

Many IBMs are developed for a single system, where a single species of interest is simulated to understand how individuals interact with one another and the abiotic environment. While valuable for understanding eco-evo mechanisms, single-species IBMs have a narrower scope of research questions they can answer, potentially limiting their ability to contribute to ecological theory or to conserve broader landscapes and functional communities [75]. Furthermore, these models do not capture the complex behaviors or demographics that are expressed as a result of multi-species interactions (e.g., competition, symbiosis).

Research paradigms are moving toward incorporating the effects of multi-species interactions, especially in conservation and connectivity modeling [76,77,78]. These approaches can incorporate the effects of species interactions (e.g., predation, competition, parasitism, mutualism) and community formation on ecological and evolutionary processes. They can also model scenarios with implications for many species or whole communities for broader application for research into conservation, community ecology, and ecological theory. These models can be difficult to generate, given they require parameterization of the generalized effects of landscapes and their environments with interactions across multiple species; however, they show great promise in providing an understanding of ecosystem properties and species demographics [79, 80].

For multi-species IBMs in aquatic systems, there is a robust body of research describing organismal and community ecology, including IBM software targeted toward multi-species models of fish and marine organisms (e.g., PISCATOR [81]). Many of these are focused on community structure [82, 83], food webs [84, 85], and population dynamics [86, 87]. While there are plentiful examples of ecological multi-species models in aquatic systems, those that explore evolutionary processes remain rare. An exciting opportunity is available to explore more evolutionary mechanisms [16], including hybridization, landscape connectivity, local and community adaptation, and how multi-species interactions can lead to co-expressed phenotypes. This expansion of IBMs to incorporate a greater range of research questions then leads to a greater range of applications for resource managers.

Genomics and Large-Scale Data Challenges

The field of evolutionary biology has experienced unprecedented advancements in high-throughput sequencing technologies over the past two decades increasing available genomic data and resources [88]. With these approaches, researchers can now explore molecular diversity across whole genomes to better understand demographics [89] and mechanisms that underpin phenotypes of interest [90] in populations of non-model organisms, with implications for informing an array of management actions. For example, a recent study by Yoshida and Yáñez [91] used dense genotyping to reveal five polymorphisms that explain most of the variation in body weight in rainbow trout (Oncorhynchus mykiss). To better understand ecological and evolutionary processes in wild systems, there is an opportunity to incorporate genomic information into IBMs. The integration of genetics into demographic models was a useful step in understanding structure and predictions for management interventions, such as translocations at the population level [12, 92, 93]. By shifting to simulating genomic data integrated into IBM demographic models, researchers are able to answer questions both about neutral processes determined with increased statistical power compared to genetics, such as population structure and migration rates, as well as adaptive processes, like responses to environmental change or local adaptation [94]. Without considering genomic data, the response to certain management goals, such as climate change adaptation, would not be possible.

We anticipate that this integration can be done in one of two ways. First, genomic-based demographic information (e.g., population structure, gene flow rates, hybridization zones) can be used to guide the development of more ecologically relevant models. Second, evolutionary mechanisms (e.g., trait heritability, plasticity, or the inheritance of genomic variants) can be incorporated into IBMs to simulate the inheritance and expression of genomic-based traits from one generation to the next. The latter approach is of interest to both ecological and evolutionary biologists alike, as it provides the ability to replicate evolutionarily relevant mechanisms (e.g., allele frequencies) and forecast the ecological and evolutionary ramifications of functional variation that are discovered. While there is an appetite in the research community to develop eco-evo IBMs [12, 95, 96], few researchers have incorporated genomic data into IBMs to date (but see also [95, 97]). This may be the result of a research gap between evolutionary biologists and ecological modelers or because it is difficult to model complex and large genomic datasets. The decision between selecting software and models that may be better at simulating genomics than ecology and landscape data is important to acknowledge; however, this division may be reduced either through software advances or the decision of researchers to employ multiple modeling methods [20]. This second approach is particularly useful, because a different modeling structure presents different assumptions. Hence, if two models of the same system give a different inference, the assumptions themselves may provide an important understanding of the system.

The combination of increased genomic, phenotypic, and environmental data availability, size, and complexity all act in conjunction to create data challenges in multiple forms when simulating individuals. For example, depending on the scale, it is computationally expensive to model the complexities of the genome across thousands of individuals. Coupling with the genomic data is the relationship of individual genotypes and phenotypes with the environment. Gene-by-environment interactions are one of the hallmarks of eco-evo IBMs. The growing challenges of big data with geographic information systems (GISs) and other remote sensing technologies have been reviewed previously [98], in addition to the ability of these technologies to enhance understanding of the landscape for the purposes of ecology resources [99]. For some research questions, the environment is also dynamic, and spatiotemporal data can cause additional challenges, such as attribute information, algorithms, and conceptual frameworks [99]. In addition, many ecologically relevant traits may be polygenic, which makes modeling in a mechanistic manner difficult, although this challenge is becoming easier to address due to advances in sequencing and modeling [100•]. If models include many genome-wide markers, high resolution or large extent environment data, and many individuals with long periods of time, it may be too computationally intensive to perform or for future researchers to replicate. There are potential solutions to these big data GIS challenges through cloud computing resources or machine learning algorithms, with particular benefits to parallel computing [101,102,103].

Continued Stakeholder Engagement and Integration with the Social-Ecological System

Previous studies have highlighted the importance of co-developing research questions with researchers and practitioners to bridge the research-implementation gap [2]. The ability to tie spatially explicit, customized IBMs to management questions helps provide context and can increase the sense of understanding and ownership of the data and results. Due to the extensive impact of humans on natural systems, understanding eco-evo processes and patterns is key to effective management [104•]. Early engagement and co-development, even with limited data, can help identify research priorities and data needs for further evaluation [105]. Engaging stakeholders and managers in the development of IBMs can ensure that the most relevant questions are being asked and therefore aid in effective decision-making [106]. Ideally, this is accomplished through a formal framework such as adaptive management [107], where IBMs can model a suite of alternative management actions to evaluate predicted outcomes, describe associated uncertainty, identify critical knowledge gaps, and optimize the allocation of limited resources.

Specifically, adaptive management and structured decision-making frameworks have often relied on modeling efforts to aid in decision-making [108,109,110]. IBMs, when used in conjunction with an iterative modeling framework and the co-production of research (Fig. 2), can be easily integrated within adaptive management and structured decision-making, although they are not obligate processes. IBMs can be particularly useful for adaptive management because management actions can be simulated and evaluated before any decisions are made, thus reducing the chances of surprises [111]. When multiple management decisions surrounding management, such as restoration, occur, it is possible that the different decisions may produce opposing outcomes [112]. In the scientific literature, sometimes the application of IBMs is limited to the discussion and potential for future expansion of the model to create a direct connection to management decisions, instead of the model creation and simulation experiments being intertwined with direct management decisions from the beginning. There are a number of examples where modeling was conducted with the management objective explicitly in mind [29]. Besides IBMs and other bottom-up approaches being better at exploring future novel conditions as previously discussed, they may also lend themselves well from the standpoint of communication among stakeholders and researchers. Although IBMs may encompass many parameters, often these parameters are more directly linked to the empirical data being collected (e.g., individual lengths), and individuals are an intuitive unit for simulating, drawing inference from, and communicating, as opposed to other population approaches (e.g., individual fecundity compared to intrinsic population growth rates) [113]. These two components can help with building trust in model outcomes and integration within management decisions and actions.

Placing model outcomes in a social-ecological system (SES) context by testing scenarios informed by a wide range of stakeholders can provide more relevant insight into the management applications of IBMs. The SES framework considers natural resources within the interactions and interdependence of the users and their governance [114], and integrating social and ecological IBMs can identify important dynamics between social and ecological systems that might otherwise be missed [111, 115]. As one example of the process of integrating IBMs with social-ecological research, Jossie et al. [116] used a spatially explicit IBM approach to explore how stakeholder-informed stream connectivity scenarios determined by semi-structured interviews analyzed with mental modeling software can influence hybridization dynamics of native Yellowstone cutthroat trout (Oncorhynchus clarkii bouvieri) and invasive rainbow trout (Oncorhynchus mykiss). In this example, humans are not directly simulated within the system to create a holistic IBM of the social-ecological system, but their actions defined the parameters and parameter values being explored to understand the tradeoff of managing for large body size, migratory life history variation, and exposure to invasive species. The connectivity scenarios were informed by a diverse array of stakeholders with a variety of interests including fisheries, agriculture, conservation, housing, and commercial development. Using IBMs in this study worked because we were able to simulate the contexts of real streams as described by the participants of the SES. Interestingly, the most important factors for achieving desirable outcomes for Yellowstone cutthroat trout (i.e., increased abundance) were the traits of the fish themselves and their behavior (i.e., large body size, migratory life history, and self-preference mating), rather than different scales and approaches to barrier removal identified by stakeholders. However, there were some differences among connectivity scenarios that may provide some insight into the possible outcomes of the different management approaches associated with each scenario. For example, scenarios with relatively high connectivity resulted in the complete loss of genetically “pure” Yellowstone cutthroat trout when there was no reproductive separation modeled, even when large, migratory Yellowstone cutthroat trout were modeled. Using a spatially explicit IBM offered the ability to further understand how fish behavior (i.e., expressing a migratory life history, reproductive overlap with an invasive species) might interact with physical changes to the landscape (i.e., barrier removal in streams) to achieve desirable outcomes for populations of conservation concern.

Integration with the overall social-ecological system can happen in four ways. First, research on the social components, such as collecting stakeholder information, can be used to inform the IBM of the ecological system, such as determining parameter space [105, 106]. For example, integrating qualitative stakeholder interview data into IBM parameterization can capture the richness of individual decision-making and complex social-ecological dynamics [116, 117]. This allows managers and researchers to understand if model outputs are responsive to conservation needs to inform conservation implications and decisions [105, 106]. Second, IBMs (or more often referenced as ABMs) of the social system can occur to inform the ecological system patterns or responses to management decisions [118]. The third option is to complete IBMs of both the social and ecological components in tandem. These are relatively rare (but see Rammer and Seidl [119] for an example of forest management and succession dynamics) but may be the area for the greatest expansion and future development. Last, IBMs of the ecological system could be done in tandem with non-ABM models of human managers. In the future, developing stakeholder-informed, social-ecological IBMs across all social-ecological system components, including humans, will provide a useful tool for conservation management.

Balancing System Complexity, Development, and Interpretation

Ecologists expect to find principles rather than mere rules of thumb in their study systems, even if these principles are not as concrete as those found in other fields, such as thermodynamics [14]. There has been a long history of specifically pointing to IBMs as one means to illuminate a unifying theory of ecology [120, 121]. Complexity and limited generality are often quoted as the main limitations of individual-based models [122]. This may be exacerbated by increasing knowledge of a system and the potential parameters and their values. The exploration of increasing parameter numbers and space will continue to grow with computational power but may result in increased amounts of time in exploratory steps. For example, the complexity of inheritance processes and models of landscape change may make generality more difficult as research and knowledge growth continues. However, in a modeled system, researchers have explicit knowledge of the rules, as opposed to natural systems. Because IBMs may consist of several thousand lines of code, it can often be just as complicated to understand the model as it is to understand the real system. In general, as the system grows in complexity, the number of parameters that must be estimated also increases. Parameter estimation introduces uncertainty into the system due to experimental error, differences in experimental assays, or error from data fitting techniques. Moreover, when the number of agents is large and their interactions numerous and complicated, it can become difficult to extract or isolate the key processes responsible for a given outcome [123]. IBMs may not always be rigorously described or mathematically represented as equations. Instead, IBM descriptions may be somewhat “fuzzy,” defined by a series of algorithms, and many descriptions of IBMs fail to state their assumptions in depth, making the comparison of models even more difficult [4, 124]. One additional consideration of any model must be how well it can account for data from natural systems [14] and the process of verifying, calibrating, and validating. This can be challenging with the complexity of IBMs.

These challenges related to complexity, development, and interpretation do not have a single solution. The solutions may also be systems specific and difficult to generalize. For example, a system and IBM focused on population structure may have unique complexity, development, and interpretation challenges when compared to an IBM focused on environmental change and evolution. Some projects may look to unify these two research lines, which would add another layer to work through to address the challenges.

Despite these challenges, IBMs still present many advantages. As discussed previously, IBMs may allow researchers to answer novel questions not possible with other modeling approaches, such as when individual mate choice may be important. In addition, they may be more appropriate to address questions and management goals when individual-level processes and the diversity of traits among individuals are important and are likely the drivers of the population- and landscape-level patterns. For example, it may be better to use an IBM approach to understand population dynamics if individual movement may be essential as opposed to modeling in a classic metapopulation approach.

Recommendations

Successful implementation of IBMs to inform specific management decisions and broader adaptive management strategies is easier with early engagement between modelers and managers (Fig. 2). By reaching out to modelers to co-develop potential research questions, the selection process of software, the gathering of existing data, and the planning for new empirical data generation, projects are more likely to be successful and efficient. Implementing IBMs can be challenging, and for managers to use software without guidance from experienced modelers, modelers will need to improve the usability of software and modeling code. One exciting development for those interested in IBMs is the growing availability of generalized modeling software which can facilitate addressing many eco-evo questions, as well as opportunities for researchers to write their own modeling software (Table 1). Out-of-the-box software reduces the barrier to entry to the field and allows researchers to run simulations of their own systems without the need to develop novel model code. In addition, standards of reporting, validating, and evaluating models continue to be developed and help standardize the field and improve accessibility [125••]. It is also important for modelers to create user interfaces and make it easier for managers to implement the software they have developed.

Conclusions

Eco-evo IBMs provide an opportunity to explore the complexities of natural systems and apply ecological and evolutionary processes across systems. While this review has focused primarily on aquatic systems, it is important to note that these approaches are applicable across a diversity of systems. We hope that by providing a conceptual framework, citing a wide variety of examples, and highlighting future challenges and opportunities, researchers interested in questions focused on eco-evo processes will find this paper as a building block from which to develop system-specific IBMs. In addition, although this review has focused on accessibility for practitioners and stakeholders, it is also important that IBM researchers and academics make their models approachable and practical for managers for continued and improved application of IBMs to happen. The complex nature of many of these processes and the complexity of the data were often prohibitive in the past. Because of this, modeling efforts are occurring in an exciting time where IBMs have never been more accessible across research questions and disciplines.