Keywords

1 Introduction

In order to improve our chances to control an epidemic outbreak, we need proper models which describe the spread of a disease. Institutes, governments, and scientists all over the world work intensively on forecasting systems in order to be well prepared if an unknown disease appears.

In recent years a huge amount of theoretical and experimental study has been conducted on this topic. While theoretical analysis provides important and sometimes even counter intuitive insights into the behavior of an epidemic (e.g. [6, 8]), in an experimental study one can take many different settings and parameters [16, 17] into account. These usually cannot be considered simultaneously in a mathematical framework. A specific topology, for example, may have its own attributes that are completely different in other topological settings.

The goal of this paper is to present and empirically analyze a dynamic model for the spread of epidemics in an extended manner. One of our objectives is to find the right parameters, which lead to realistic settings. Therefore, we investigate a general simulation environment, in which the different parameters can easily be adjusted to real world observations. A second objective is to evaluate similarities between countermeasure approaches in our model and the real world. We use empirical data for the comparison. Our tool is agent-based, i.e., the individuals (or groups of such) are modeled by agents interacting with each other. The environment approximates the geography of Germany, in which agents may travel between cities. Within a city the agents interact according to the probabilistic model presented in [8] in a distributed manner. For a detailed description of the algorithmic framework see Sect. 2. In order to describe the problems and the related work, we often utilize the style and wording of [8].

1.1 Related Work

There is plenty of work considering epidemiological processes in different scenarios and on various networks. The simplest model of mathematical disease spreading is the so called SIR model (see e.g. [14, 20]). The population is divided into three categories: susceptible (S), i.e., all individuals which do not have the disease yet but can become infected, infective (I), i.e., the individuals which have the disease and can infect others, and recovered (R), i.e., all individuals that recovered and have permanent immunity (or have been removed from the system). Most papers model the spread of epidemics using a differential equation based on the assumption that any susceptible individual has uniform probability \( \beta \) to become infected from any infective individual. Furthermore, any infected player recovers at some stochastically constant rate \( \gamma \).

This traditional (fully mixed) model can easily be generalized to a network. It has been observed that the corresponding process can be modeled by bond percolation on the underlying graph [13, 19]. Interestingly, for certain graphs with a power law degree distribution, there is no constant threshold for the epidemic outbreak as long as the power law exponent is less than \( 3 \) [20] (which is the case in most real world networks, e.g. [1, 3, 11, 21]). If the network is embedded into a low dimensional space or has high transitivity, then there might exist a non-zero threshold for certain types of correlations between vertices [19]. However, none of the papers above considered the dynamic movement of individuals, which seems to be the main source of the spread of diseases in urban areas [10].

In [10] the physical contact patterns are modeled by a dynamic bipartite graph, which results from movement of individuals between specific locations. The graph is partitioned into two parts. The first part contains the people who carry out their daily activities moving between different locations. The other part represents the various locations in a certain city. There is an edge between two nodes, if the corresponding individual visits a certain location at a given time. Obviously, the graph changes dynamically at every time step.

In [7, 10] the authors analyzed the corresponding network for Portland, Oregon. According to their study, the degrees of the nodes describing different locations follow a power law distribution with exponent around \( 2.8 \).Footnote 1 For many epidemics, transmission occurs between individuals being simultaneously at the same place, and then people’s movement is mainly responsible for the spread of the disease.

The authors of [8] considered a dynamic epidemic process in a certain (idealistic) urban environment incorporating the idea of attractiveness based distributed locations. The epidemic is spread among \( n \) agents, which move from one location to another. In each step, an agent is assigned to a location with probability proportional to its attractiveness. The attractiveness’ of the locations follow a power law distribution [10]. If two agents meet at some location, then a possible infection may be transmitted from an infected agent to an uninfected one. The authors obtained two results. First, if the exponent of the power law distribution is between \( 2 \) and \( 3 \), then at least a small (but still polynomial) number of agents remains uninfected and the epidemic is stopped after a logarithmic number of rounds. Secondly, if the power law exponent is increased to some large constant, then the epidemic will only affect a polylogarithmic number of agents and the disease is stopped after \( ({ \log }\,{ \log }\,n)^{{{\mathcal{O}}(1)}} \) steps. In this case each agent is allowed to spread the disease for a number of time steps, which is bounded by some large constant.

In addition to the theoretical papers described above, plenty of simulation work exists. Two of the most popular directions are the so called agent-based and structured meta-population-based approach, respectively (cf. [2, 15]). Both models have their advantages and weaknesses. The main idea of the meta-population approach is to model whole regions, e.g. georeferenced census areas around airport hubs [5], and connect them by a mobility network. Then, within these regions the spread of epidemics is analyzed by using the well known mean field theory. The agent-based approach models individuals with agents in order to simulate their behavior. In this context, the agents may be defined very precisely, including e.g. race, gender, educational level, etc. [16, 17], and thus provide a huge amount of detailed data conditioned on the agents setting. Furthermore, these kinds of models are also able to integrate different locations like schools, theaters, and so on. Thus, an agent may or may not be infected depending on his own choices and the ones made by agents in his vicinity. The main issue of the agent-based approach is the huge amount of computational capacity needed to simulate huge cities, continents or even the world itself [2]. This limitation can be attenuated by reducing the number of agents, which then entails a decreasing accuracy of the simulation. In the meta-population approach the simulation costs are lower, sacrificing accuracy and some kind of noncollectable data.

A specific field of application of such simulations is the investigation of the impact of (non-)pharmaceutical countermeasures on the behavior of epidemics. Germann et al. [12] investigated the spread of a pandemic strain of the influenza virus through the U.S. population. They used publicly available 2000 U.S. Census data to identify seven so-called mixing groups, in which each individual may interact with any other member. Each class of mixing group is characterized by its own set of age-dependent probabilities for person-to-person transmission of the disease. They considered different combinations of socially targeted antiviral prophylaxis, dynamic mass vaccination, closure of schools and social distancing as countermeasures in use, and simulated them with different basic reproductive numbers \( R_{0} \). It turned out that specific combinations of the countermeasures have a different influence on the spreading process. For example, with \( R_{0} = 1.6 \) social distancing and travel restrictions did not really seem to help, while vaccination limited the number of new symptomatic cases per 10,000 persons from ~100 to ~1. With \( R_{0} = 2.1 \), such a significant impact could only be achieved with the combination of vaccination, school closure, social distancing and travel restrictions.

1.2 Our Results

The results of this paper are two-fold. First, we show that by increasing the number of agents we are able to significantly improve the accuracy of our results in the scenarios we have tested. This is due to different phenomena which are only visible if the amount of agents in use is large enough. For example, if the number of agents exceeds a certain value, then the epidemic manages to keep a specific (low) amount of infected individuals over a long time period. Furthermore, the number of agents has to be above some threshold to allow the epidemic to enter some specific areas/cities in the environment we used. Obviously, a certain amount of agents is also needed to avoid significant fluctuations in our results.

The second main result of this paper is that by setting the parameters properly, one can approximate the effect of some non-pharmaceutical countermeasures, that are usually adopted if an epidemic outbreak occurs. This observation is supported by the empirical study of [18]. Interestingly, the right choices of parameters in our experiments seem to be in line with previous observations in the real world (e.g. the right power law exponent seems to be in the range of \( 2.6 \)\( 2.9 \), cf. [10]). To analyze the effect of the countermeasures mentioned above, we integrate the corresponding mechanisms on a smaller scale, and then verify their impact on a larger scale too.

2 Theoretical Models and Algorithmic Framework

In the following we provide an empirical analysis on a small as well as on a large scale. Hereby, the cities are chosen from a list in descending order of their population size. It is intuitively clear that large (and thus attractive) cities play a major role for the epidemic pace since a higher population density entails a potentially higher infection probability. Excluding such hotspots would of course slow down the infection spread. The problem is, this could only be achieved by putting the whole city into quarantine. However, isolating an entire city is not a trivial task. For example, people living in the suburbs but working in the city might not be willing to risk their job by obeying the quarantine. Therefore, we consider such strategies only on a smaller scale.

In our model the agents may not only move between locations within a city but between cities as well. Furthermore, due to simplicity, the agents are not categorized (i.e., they do not provide further properties like gender etc.). Since we are not interested in the evaluation of such details. In the following, we briefly introduce the model. Due to readability we present our model with respect to the following four categories: (1) The environment on a large scale (inter-city movement), (2) The environment on a small scale (intra-city movement), (3) The epidemic model, and (4) The countermeasure model.

2.1 The Environment on a Large Scale

Let \( G(d) = (V,E) \) be a complete graph with \( m \) nodes \( V : = \{ c_{1}, \ldots ,c_{m} \} \) and parameter \( d : = \{ d_{{c_{1} }}, \ldots, d_{{c_{m} }} \} \), whereas \( c_{i} = G_{{c_{i} }} \) (see below) represents the corresponding city \( c_{i} \). Let further \( p_{{c_{i} }} \) be the population size of \( c_{i} \) and \( p : = \{ p_{{c_{i} }}, \ldots, p_{{c_{m} }} \} \). Then the attractiveness of \( c_{i} \) is given by \( d_{{c_{i} }} = \frac{{p_{{c_{i} }} }}{{\sum\nolimits_{1 \le i \le m} p_{{c_{i} }} }} \). Consequently, \( G \) represents the topology, which contains all cities, on a large scale. In other words, we model the inter-city movement using the complete graph \( G(d) \). In this graph, each \( c_{i} \in V \) corresponds to a city of Germany. However, depending on the size, not every city of Germany is represented by a node in \( V \).Footnote 2 The population is represented by \( n = \sum\nolimits_{1 \le i \le m} {nd_{{c_{i} }} } \) agents, with \( nd_{{c_{i} }} \) being the number of agents assigned to \( c_{i} \). Note that said number is proportional to the cities real world population. Furthermore, each city \( c_{i} \in V \) is assigned an attractiveness \( d_{{c_{i} }} \) proportional to its population size (w.r.t. the whole population). Let \( A_{i,s,t} \) be the event that agent \( i \) travels from city \( s \) to \( t \). Let further \( p^{\prime} \) be the probability that an agent decides to travel at all, and let \( dist(s,t) \) be the Euclidean distance between cities \( s \) and \( t \). Then the probability that event \( A_{i,s,t} \) occurs is given by

$$ Pr(A_{i,s,t} ) = p^{\prime} \cdot \frac{{d_{t} \cdot dist^{ - 1} (s,t)}}{{\sum\limits_{(s,j) \in E} d_{j} \cdot dist^{ - 1} (s,j)}}. $$

Thus, the probability for an agent entering a specific city depends on the distance to said city, its population size as well as the current position of the considered agent.

2.2 The Environment on a Small Scale

Let \( G_{{c_{i} }} (d(v)) = \{ V_{{c_{i} }} ,E_{{c_{i} }} \} \) be a complete graph with \( m_{{c_{i} }} = \left\lceil {\kappa nd_{{c_{i} }} } \right\rceil \) nodes (also called cells), with \( d(c_{i}):= \{d_{{v_{1}}}, \ldots, d_{{v_{{m_{c}}}}} \} \). Here, \( \kappa > 0 \) is a constant, which will be specified in the upcoming experiments. Further, note that \( \kappa \) does not affect the amount of agents but the amount of cells only. Then the attractiveness of cell \( v_{i} \in V_{{c_{i} }} \) within a city \( c_{i} \) is given by \( d_{{v_{i} }} \). Said attractiveness is chosen randomly with probability proportional to \( 1/\beta^{\alpha } \) for a value \( \beta > 1 \), where \( \alpha > 2 \) is a constant depending on the simulation run. In other words, each \( c_{i} \in V \), representing city \( c_{i} \), is a clique of cells on its own, thus incorporating intra-city movement into our model. The cells represent locations within a city an agent can visit. Each cell may contain agents (individuals), depending on the cells so-called attractiveness. If an agent decides to stay within its current city, said agent moves to a randomly chosen cell according to the distribution of the attractiveness’ among the cells. This also holds for the first cell an agent is accommodated in after entering a city.

2.3 Epidemic Model

We use three different states to model the distributed spreading process. These states partition the set of agents into three groups; \( {\mathcal{I}}(j) \) contains the infected agents in step \( j \), \( {\mathcal{U}}(j) \) contains the uninfected (susceptible) agents in step \( j \), and \( {\mathcal{R}}(j) \) contains the resistant agents in step \( j \). Whenever it is clear from the context, we simply write \( {\mathcal{I}} \), \( {\mathcal{U}} \), and \( {\mathcal{R}} \), respectively. If an uninfected agent \( i \) visits a cell (within a city) which also contains agents of \( {\mathcal{I}}(j) \), then \( i \) becomes infected with probability \( 1 - (1 - \gamma )^{{|{\mathcal{I}}^{\prime } (j)|}} \), where \( {\mathcal{I}}^{\prime } (j) \) represents the set of infected agents accommodated in the same cell as \( i \). We refer to the concrete value of \( \gamma \) in the upcoming simulations.

2.4 Countermeasure Models

Our countermeasure models mainly take advantage of the parameters \( \alpha \) and \( \kappa \). That is, high values of these two parameters imply a high level of countermeasures and vice versa. With countermeasures applied, individuals avoid places with a large number of persons more often, waive needless tours, and are more careful when meeting other people. While \( \alpha \) is mostly responsible for a decreasing number of visitors within a cell, and thus for the avoidance of crowded areas for example, \( \kappa \) determines the total space available for all individuals. As pointed out in [18], a single countermeasure alone is most likely not sufficient to stop an epidemic. Therefore, we assume a combination of them to be in place, which then would be able to sufficiently influence the parameters \( \alpha \) and \( \kappa \). Note, our countermeasure models apply to each city \( c_{i} \in V \) individually.

We use two different types of countermeasure-models: a (multi-tier) level based approach considering the amount of infected agents in the current step, and a ratio based approach considering the amount of newly infected agents in the current step compared to the one in the step before. In the following we use \( \alpha_{0} \) and \( \kappa_{0} \) as initial values for \( \alpha \) and \( \kappa \), respectively.

In the level based model we have one or more levels in which a certain pair of parameters \( \alpha \) and \( \kappa \) is used. Let \( LM_{m} \) stand for the level based model with \( m \) levels \( L = \{ l_{1} , \ldots ,l_{m} \} \cup l_{0} \). Further, let \( T = \{ l_{0}^{d} ,l_{0}^{u} ,l_{1}^{d} ,l_{1}^{u} , \ldots ,l_{m}^{d} ,l_{m}^{u} \} \) be the set of transition points for all levels, i.e., \( l_{i}^{d} \) defines the transition point from level \( i \) to \( i - 1 \) whereas \( l_{i}^{u} \) defines the transition point from level \( i \) to \( i + 1 \). Note that, \( l_{i}^{u} = l_{i + 1}^{d} \) does not necessarily hold. Additionally, \( \alpha_{0} ,\alpha_{1} , \ldots ,\alpha_{m} \) and \( \kappa_{0} ,\kappa_{1} , \ldots ,\kappa_{m} \) define the parameters \( \alpha \) and \( \kappa \), which are applied in the corresponding levels \( l_{0} , \ldots ,l_{m} \). Figure 1a depicts an example situation.

Fig. 1
figure 1

Basic examples for the two countermeasure models. a An example demonstrating a possible configuration of LM 2. The broken line depicts the amount of total infections in the city while the scope of activity of the countermeasure levels is represented by the rectangles. b An example demonstrating a possible configuration of RM. The broken line depicts the amount of total infections in the city (on a logarithmic scale) while the scope of activity of the countermeasure levels is represented by the rectangles

In contrast, the ratio based model \( RM \) uses a non static approach. Let the set of newly infected nodes of a city \( c_{i} \) in step \( j \) be denoted by \( {\mathcal{I}}_{{c_{i} }}^{*} (j) \). Furthermore, let \( \alpha_{j} \) and \( \kappa_{j} \) denote the corresponding parameters used in step \( j \). If \( \frac{{|{\mathcal{I}}_{{c_{i} }}^{*} (j)|}}{{|{\mathcal{I}}_{{c_{i} }}^{*} (j - 1)|}} \ge a \), for some constant \( a \), then we set \( \alpha_{j + 1} = \alpha_{j} + z_{1} \) and \( \kappa_{j + 1} = \kappa_{j} + z_{2} \), where \( z_{1} ,z_{2} \) are some small constants which will be specified later. Consequently, we set \( \alpha_{j + 1} = max\{ \alpha_{0} ,\alpha_{j} - z_{1} \} \) and \( \kappa_{j + 1} = max\{ \kappa_{0} ,\kappa_{j} - z_{2} \} \) whenever \( \frac{{|{\mathcal{I}}_{{c_{i} }}^{*} (j)|}}{{|{\mathcal{I}}_{{c_{i} }}^{*} (j - 1)|}} \le 1/a \). The values applied in the various models are specified in Sect. 3.3. Figure 1b depicts an example situation.

3 Experimental Analysis

The environment in our simulations approximates the geography of Germany utilizing 10 million agents and more. Note that, the obtained results remain similar utilizing up to 100 million agents. Depending on the number of agents, our simulations incorporate several hundred cities as visitable areas spread all over the country. Only cities with an initial agent amount of at least one are included in the simulation. Each city is assumed to be reachable from any other city. However, an agent may travel at most 1,000 km within a single round. Each round represents a whole day in the real world. Consequently, an agent moving from one city to another has to wait until its destination is reached before it can interact with other agents at said destination.

3.1 Simulations

In the upcoming sections we present and evaluate our results with the focus on: (1) the impact of the number of agents on the characteristics of the simulated epidemic compared to real world data, and (2) the impact of non-pharmaceutical countermeasures on the behavior of the epidemic (e.g. social distancing, school closures, and isolation [18]).

Furthermore, we also analyze our parameter settings. Although this is only a short part, our settings seem to coincide with the real world observations of [10], and thus provide an additional valuable insight. Note that the figures presented in this section show values based on the real world population size and not on the number of agents.

3.2 Relevance of the Chosen Parameters

Based on real world observations (e.g. [10]), we chose \( \alpha = 2.8 \) and \( \kappa = 1 \) as a starting point for a series of simulations concerning \( \alpha \) and \( \kappa \), respectively. Each plot represents the average value of 50 different simulation runs for each parameter constellation utilizing 10 million agents. The parameter notation is as follows: \( \gamma \) is the probability for an agent \( v \in {\mathcal{U}} \) to become infected independently by each \( w \in {\mathcal{I}} \) occupying the same cell at the same time, \( \xi \) is the amount of rounds an agent \( v \in {\mathcal{I}} \) is infectious, thus being able to infect others, \( City_{init} \) is the initial amount of cities the infection is being placed in, \( Agent_{init} \) is the amount of initially infected agents which are placed in \( City_{init} \) different cities, \( \alpha \) is the power law exponent used to individually compute the attractiveness of the cells within each city, and \( \kappa \) is a multiplicative factor to increase/decrease the amount of cells proportional to the initially assigned amount of agents.

Figure 2a and b show the impact of \( \alpha \) and \( \kappa \) on the behavior of the epidemic, and compare the results to the characteristics of a typical Influenza case reported by the RKI.Footnote 3 To increase the readability, we omit to add the RKI-plot as the 6th one. Instead we refer the reader to Fig. 3. Among all five \( \alpha \)-values \( \alpha = 2.8 \), which has also been obtained in [10] in a different context, represents the best tradeoff between curve similarity and amount of infections. All remaining parameters were set to identical values as in Case 1 (cf. Sect. 3.3). For \( \kappa \) a similar phenomenon can be observed. With increasing \( \kappa \) (including fractional values), the characteristics of the curve (i.e., the amount of infected individuals at the peak vs. total number of infections and total duration of the outbreak) become less and less accurate. Even if we increase the value to \( 2 \), the obtained curve does not follow the characteristics of the real world observations reported by the RKI [see footnote 3] anymore.

Fig. 2
figure 2

A composition of multiple simulation runs concerning varying \( \alpha \) (a) and \( \kappa \) (b) only. All other parameters are identical to Case 1 (cf. Sect. 3.3). Each result represents the average of 50 different simulation runs with 10 million agents for the topology of Germany. a 2.4 ≤ α ≤ 3.2. b 1 ≤ κ ≤ 5

Fig. 3
figure 3

Simulation results for Case 1a) (green) in comparison to real world data (red) provided by the RKI for a varying amount of agents. The abbreviation NII stands for Newly Infected Inhabitants. All results up to and including 10 million agents represent the average values of 50 different simulation runs, whereas the result for 100 million agents averages over 20 different simulation runs. The reliability of the averaged value is indicated by the corresponding confidence interval. a 10.000 agents. b 500.000 agents. c 10.000.000 agents. d 100.000.000 agents

In terms of the probability of infection \( \gamma \) we simply chose a reasonable value low enough to model an Influenza epidemic, but high enough to provoke an outbreak. This seemed reasonable due to the (at least to our knowledge) lack of concrete values deduced from real world observations.

3.3 Case 1—Number of Agents

Before we present the results for this case, we first introduce the relevant sources for comparison. Here we compare our results to real world data provided by the RKI [22] SurvStat system for the year 2007. The parameter values were taken from reference data provided by the RKI [4] where possible, or set to reasonable ones otherwise (cf. Sect. 3.2 for more details).

3.3.1 RKI: Basis of Comparison

In the following we compare our results to the real world data provided by the RKI [see footnote 3]. For this purpose we use two different data sources: the official report of the Influenza epidemiology of Germany for 2010/2011 [4] and an online database containing obligatory reports called SurvStat [22].

Relevance. The data for the SurvStat database and the report of 2010/2011 itself were obtained from more than 1 % of all primary care doctors spread all over Germany. This indicates the significance according to international standards. Unfortunately, there are some drawbacks resulting from the type of data ascertainment itself. Note that not every infected person consults a doctor, which implies that the data of the SurvStat system contains only the serious courses of the disease. Nonetheless, these sources provide a valuable tool to obtain insight into the spread and persistence of the Influenza virus in Germany. Further, due to the data’s significance, it is possible to estimate the number of infections within certain areas as well. Since the spread of infections in the real world is influenced by factors like seasonal fluctuations or the virus’ aggressiveness, the number of total infections may significantly differ from year to year, cf. [22] for different years. However, the course of the curve usually does not change. Consequently, we do not focus on absolute values in our simulations, but on the characteristics of our results. These characteristics remain, up to some scaling factor, identical over the whole data set provided by the RKI.

3.3.2 Case 1

The parameters for this subcase are as follows. We set \( \gamma = 7\,\% \), \( \xi \) to 5 days, the amount of initially infected cities \( City_{init} \) to 1 (namely Berlin), and the amount of initially infected agents \( Agent_{init} \) to 0.0015 % of the overall agents used for these simulations. Furthermore, \( \alpha = 2.8 \) and \( \kappa = 1 \).

Figure 3 shows the results for this case. Here the green curve represents the real-world data provided by the RKI for the year 2007 while the red curve represents our simulation results. Note that both curves vary significantly in terms of absolute numbers. However, this is not our focus here. Due to the level of abstraction in our model and since the RKI data only contains reported cases (see above), the absolute numbers do not coincide. Additionally, as stated above, the data provided by the RKI also differs significantly (in terms of absolute numbers but not the disease characteristics) from year to year (cf. [22]). Therefore, we focus on the course of the disease and the resulting characteristics of the plotted curves.

It is easy to see that the more agents are used, beginning from Fig. 3a up to 3d, the more the curve characteristics converge. Moreover, the accuracy of each simulation run increases as well (cf. the confidence intervals in Fig. 3). With at least \( 500,000 \) agents in use, both curves become similar.

To obtain a more formal evaluation, we define three measures, which are used to compare our results to the data provided by the RKI. These are: the time to peak (TTP), the epidemic duration (ED), and the area of the curve (AC). The time to peak describes the week with the maximum amount of newly infected agents of the corresponding curve. The area of the curve is simply the summation of the area between the origin and the endpoint EP (defined by the epidemic duration). Finally, the endpoint of the epidemic duration is the week in which only a minor amount of new infections occur, and no significant new infections are observed anymore. Minor infections are defined to start in a round \( i \) and last till the last round \( j \) of the simulation while for all rounds \( i \le i^{\prime } \le j \) it holds that the amount of newly infected agents does not exceed 9 % of the maximum value.

In the following we consider the relative values of these measurements compared to the RKI data. That is, the numbers represent the ratio between the value obtained in our simulations and the value provided by the RKI. For example, a value of \( 4.00 \) for TTP in the case of \( 10,000 \) agents implies that the time to peak in our case divided by the time to peak in the real world data is 4. Using the individual deviation measurements, we define a global deviation value by the formula \( \frac{1}{3} \cdot {\text{TTP}} + \frac{1}{3} \cdot {\text{AC}} + \frac{1}{3} \cdot {\text{ED}} \). For simplicity we consider each individual measurement uniformly weighted. The results, which confirm our previous observations, are shown in Fig. 4. All obtained results and previous statements imply a fragile balance between the accuracy, the parameter setting, and the amount of agents in use.

Fig. 4
figure 4

A visual representation of the data computed using the deviation measurements for the experiments conducted in Case 1

3.3.3 Case 2—Non-pharmaceutical Countermeasures

Now we extend our analysis to incorporate non-pharmaceutical countermeasures such as school closures and social distancing. Here we stick to a fictional epidemic simply because it simplifies the presentation, i.e., due to the increase of \( \gamma \) to \( 12\% \), a faster spread is achieved and the impact of the used countermeasures is amplified. Similar to Case 1, we set \( \xi \) to 5 days, the amount of initially infected cities \( City_{init} \) to 1 (namely Berlin), and the amount of initially infected agents \( Agent_{init} \) to \( 0.00075\% \) of the overall agents (to compensate the higher \( \gamma \) in the beginning). All relevant parameters regarding the countermeasure models can be found in the original paper [9].

We assume that non-pharmaceutical countermeasures basically affect the parameters \( \alpha \) and \( \kappa \), since the individuals will most likely avoid places with a large number of persons, waive needless tours, and be more careful when meeting other people. For obvious reasons, we cannot compare our simulation results to current real-world data containing results for different epidemics with varying (or no) countermeasures in use. Therefore, we use the work of Markel et al. [18] for this purpose. Especially Fig. 5 is of particular interest. There, the direct correlation between establishing countermeasures and a decreasing amount of new infections (and vice versa) is well highlighted. We observed an identical effect in our simulations (cf. Fig. 6). Note that different combinations of countermeasures used in [18] entail different kinds of impacts on the death rates. In contrast, we do not focus on specific combinations but on sufficient ones to actually achieve an immediate impact on the epidemic.

Fig. 5
figure 5

Weekly excess death rates from September 8, 1918, through February 22, 1919 [18, Fig. 3]

Fig. 6
figure 6

Example results for BER for Case 2. The green curve represents the amount of infected inhabitants in the corresponding city while the red curve indicates the activated countermeasure level with respect to the countermeasure model in use. The number of agents refers to the total amount of agents used for the simulation. a Model LM 3, 100 million agents. b Model RM, 10 million agent

As already described in Sect. 2, we use two different countermeasure approaches: the level based (\( LM_{1} , LM_{2} \) and \( LM_{3} \)), and the ratio based (\( RM \)), respectively. Both use different mechanisms and parameters to achieve the embankment of the epidemic. Recall that all transition points in the level based model are chosen w.r.t. the ratio between the amount of currently infected individuals and the population size of the city. Figure 7 compares all models to each other on the national level.

Fig. 7
figure 7

Comparison of different countermeasure models on a large scale, i.e., each value represents the situation for the corresponding countermeasure model in all cities combined. a 10.000 agents. b 100.000.000 agents

Although the level based approach is completely different compared to the ratio based approach, the achieved results are similar. However, the overall increase of \( \alpha \) and \( \kappa \) by the ratio based approach may be noticeably higher, especially if a large number of agents is used. That is, while all \( LM \)-models use \( \alpha \le 3.3 \), the \( RM \)-model goes above \( 4 \). This implies that the \( LM \)-models are more cost efficient, since both \( \alpha \) and \( \kappa \) are kept lower and therefore less effort is needed to achieve and maintain said values.

Additionally, to be able to compare our results to the findings in [18], we examine the following city in more detail: Berlin (BER) with a population size of \( \approx 3.5 \) million. Figure 6 represents a composition of some interesting results. Note that the green curve in these figures represents the countermeasure level for the \( LM \) models at the corresponding time, and indicates the number of times \( z_{1} ,z_{2} \) have increased \( \alpha ,\kappa \) in the \( RM \) model. Recall that, in the \( RM \)-model level \( j \) implies \( \alpha_{i} = \alpha_{0} + \sum\nolimits_{k = 1}^{j} z_{1} \) and \( \kappa_{i} = \kappa_{0} + \sum\nolimits_{k = 1}^{j} z_{2} \) for a step \( i \).

Our results confirm the impact of different countermeasures observed in the real world [18]. Compared to Fig. 5, our simulations show a similar behavior (i.e., more than one peak during the epidemic). It is easy to see that the countermeasures presented in [18] directly influence the course of the epidemic. The same property can be observed in our results (cf. Fig. 5). One can see that depending on the countermeasure level (indicated by the activated/used level), the number of infections increases or decreases. Note that although our figures show the number of infected individuals and not the death rate as shown in Fig. 5, a comparison is still possible, since this deviation can be normalized using a scaling factor.

Furthermore, we observe that small adjustments of the two parameters \( \alpha \) and \( \kappa \) entail a significant impact on the number of overall infections. Among others we already proved that if the power law exponent (and \( \kappa \) as well) is assumed to be some large constant, then even a very aggressive epidemic with \( \gamma = 100\,\% \) will affect no more than a polylogarithmic number of the population. Our findings now back up these observations.

In conclusion, we showed the impact of different countermeasures on the behavior of a population w.r.t. our model. Although some complexity of the real world is lacking, the similarities to real-world observations are still present. Starting with settings for the environment, and therefore implicitly the individuals’ behavior, based on real-world observations, relatively low level countermeasures were sufficient to embank or at least significantly suppress an outbreak. Essentially the same properties were already observed in reality (cf. [18]). This underlines the importance of behavioral and environmental models based on power law distributions.

4 Conclusions

Agent based simulators offer various possibilities to perform very detailed experiments. However, the parameters used in these experiments highly influence the results one might obtain. As we have seen, even the number of agents has a significant impact on the quality of the results. This includes the reliability of different simulation runs with an identical parameter setting. By using the right parameter settings and a proper number of agents, it is possible to approximate the course of a disease as observed in the real world. Furthermore, our experiments indicate that the algorithmic framework presented in this paper is able to describe, to some extent, the impact of certain non-pharmaceutical countermeasures on the behavior of an epidemic.