3.1 Introduction

Large-scale terrorist events like 9/11 have led to huge security investments and, in turn, to many modeling efforts to support the efficient allocation of such resources. Parnell et al. [15] provided an in-depth review for the US National Academy of Sciences on bio-terrorism risk assessment, concluding, among other things, that traditional risk analysis tools, like event trees, are not adequate in this application area because they do not account for adversarial intentionality; that the common knowledge assumptions of game theoretic approaches are critical and, in many contexts, doubtful; and, finally, that decision analytic approaches face problems in forecasting adversarial actions. Merrick and Parnell [12] reviewed numerous approaches in this research area, commenting favorably on Adversarial Risk Analysis (ARA), a framework to manage risks derived from the actions of intelligent adversaries, see [20] or [1].

ARA aims at providing one-sided prescriptive support to one of the intervening agents, the Defender (D, she), based on a subjective expected utility model that treats the adversary's decisions as uncertainties. To do so, we model the decision making problem of the adversary (A, Attacker, he) and, assuming that he is an expected utility maximizer, try to assess his probabilities and utilities, so as to forecast his optimal action. Our uncertainty about the adversary's probabilities and utilities propagates to his decision, leading to a random optimal adversary decision, which provides the required distribution over the Attacker's actions. Sometimes such assessments lead to a hierarchy of nested decision problems, as described in [17], similar to the concept of level-k thinking, see [24]. In contrast with game theoretic approaches, we do not assume the standard, but unrealistic, common knowledge hypothesis, see [5], according to which the agents share information about their utilities and probabilities.

A critical issue in ARA is elicitation. As in any subjective Bayesian analysis, one needs personal probabilities over the parameters in the problem. Obtaining them is not easy, and we need to cope with many biases, see e.g. [14]. This is aggravated in our context by the strategic considerations involved. Nau [13] as well as Wang and Bier [26] discuss elicitation in the context of adversarial situations.

The practical difficulty of elicitation raises the question of robustness. One wants an ARA to be robust with respect to the elicited probabilities and utilities, the model entertained and, when available, the data. A good way forward is sensitivity analysis. The above-mentioned review by Parnell et al. [15] recommends it, and von Winterfeldt and O'Sullivan [25] perform a systematic sensitivity analysis with respect to the elicited probabilities in an event tree concerning MANPADS. A different approach is taken by Kardes [9], who considers robust stochastic games.

Robust Bayesian analysis facilitates finding the entire set of posterior distributions for a parameter when the prior lies within a class of distributions. The results are typically expressed in terms of upper and lower bounds on probabilities and expected utilities. Berger et al. [2] review this methodology, which has barely been exploited in ARA. The only direct application is by McLay et al. [11], who point the way towards a principled means of incorporating robustness into ARA. They consider a level-k thinking analysis of the sequential Defend-Attack game in which the Attacker imperfectly observes the decision made by the Defender. The game is modeled through an information structure comprising several signals and, conditional on the defense choice, a specified distribution over the signals, a model initially proposed by Rothschild et al. [22]. Robustification proceeds by setting upper and lower bounds on the parameters for which distributions must be elicited, and then computing the outcome under the worst case combination of upper and lower values.

This chapter provides a complete outline of the role of robust methods in ARA. After introducing basic notions in Bayesian robustness, we describe the robust ARA approach first for sequential games and, then, for simultaneous games. In both cases, we start by computing the game theoretic solution and apply robustness concepts to assess it. If it is not robust, we use the ARA approach to find an alternative solution, which we again criticize through robustness ideas. If the solution is still unstable, we may appeal to conventional robust concepts, such as Γ-maximin. We illustrate the ideas with a simple numerical example concerning routing security.

3.2 Bayesian Robustness

We present here the basic ideas on Bayesian robustness, referring to Ríos Insua and Ruggeri [18] for an in-depth overview. In the Bayesian approach to inference, prediction and decision making, the interest frequently lies in the behavior of the posterior distribution of a parameter θ, obtained by combining the experimental evidence provided by the likelihood and the expert knowledge expressed through the prior distribution, via Bayes' theorem. The posterior is used to compute posterior (and predictive) expectations of functions g(θ), typically set indicators, powers or utility functions, providing, respectively, set probabilities, moments and expected utilities. The robust Bayesian approach stems from the practical difficulty of specifying a unique prior distribution and/or a unique utility function corresponding, respectively, to the expert's beliefs and the decision maker's preferences. Therefore, classes of priors and/or utilities are entertained, and the consequences of the different possible choices of such pairs are evaluated through synthetic indices which determine whether the quantity of interest is subject to small or large variations when changing the prior/utility, i.e. whether there is robustness or not.

In accordance with the content of this chapter, we shall consider utilities u in a class \(\mathcal{U}\) and probability measures p in a class \(\mathcal{P}\) (without distinguishing whether they are priors or posteriors). We suppose that the probability measure p has a density p(s) over the states s, and that the utility function has the form u(d, s), where d is an action (decision) in the feasible set \(\mathcal{D}\). We are interested in computing the expected utilities \(\psi _{up}(d) =\int u(d,s)p(s)\,ds\) for the various alternatives d, as well as the feasible alternative \(d_{up}^{{\ast}}\in \mathcal{D}\) maximizing expected utility, given such a choice of u and p.

In a robust context, the interest would typically be in the ranges that relevant quantities span when p and u vary in the class, e.g. the range of the expected utility for a certain alternative d

$$\displaystyle{\rho _{\psi }(d) =\sup _{p\in \mathcal{P},u\in \mathcal{U}}\psi _{up}(d) -\inf _{p\in \mathcal{P},u\in \mathcal{U}}\psi _{up}(d),}$$

or the distance between the optimal alternative \(d_{up}^{{\ast}}\) and a reference alternative \(d^{{\ast}}\)

$$\displaystyle{\rho _{d} =\sup _{p\in \mathcal{P},u\in \mathcal{U}}e(d_{up}^{{\ast}},d^{{\ast}}),}$$

for some distance e. Looking at \(\rho _{d}\), we claim that there is robustness if its value is small relative to the problem at hand and the decision maker's perception. In this case, essentially any pair (p, u), and the corresponding \(d_{up}^{{\ast}}\), may be used for decision making purposes. Otherwise, efforts are required to narrow the classes until either robustness is achieved or no further refinement is possible.

In the latter case, some criterion could be introduced to choose a pair (p, u) and the corresponding \(d_{up}^{{\ast}}\). A possible choice for a decision could be the minimum regret decision,

$$\displaystyle{\hat{d} = \mbox{ arg min}_{d\in \mathcal{D}}\max _{p\in \mathcal{P},u\in \mathcal{U}}\left [\psi _{up}(d_{up}^{{\ast}}) -\psi _{ up}(d)\right ].}$$

For a related discussion see [19]. In particular, the decision \(\hat{d}\) is conservative in the sense that it protects against the worst loss in expected utility incurred when replacing an optimal decision \(d_{up}^{{\ast}}\) by another one.
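
To fix ideas, the following minimal sketch approximates the range \(\rho _{\psi }(d)\) and the minimum regret decision \(\hat{d}\) by sampling pairs (p, u); the classes (Beta densities with perturbed parameters, exponential utilities with a perturbed risk coefficient) and the toy cost structure are hypothetical choices of ours, purely for illustration:

```python
import numpy as np
from scipy.stats import beta

rng = np.random.default_rng(0)

# Hypothetical discrete setting: three alternatives, state s in [0, 1].
decisions = [0, 1, 2]
s = np.linspace(0.0, 1.0, 201)
ds = s[1] - s[0]

def sample_p():
    """One density p(s) from the class P: here, a Beta with perturbed parameters."""
    return beta(rng.uniform(2, 4), rng.uniform(4, 6)).pdf(s)

def sample_u():
    """One utility u(d, s) from the class U: exponential, perturbed risk coefficient."""
    h = rng.uniform(0.5, 1.5)
    return lambda d: -np.exp(h * (0.2 * d + s))   # toy cost structure in (d, s)

def psi(u, p):
    """psi_up(d) = int u(d, s) p(s) ds for every d, by a simple Riemann sum."""
    return np.array([np.sum(u(d) * p) * ds for d in decisions])

psis = np.array([psi(sample_u(), sample_p()) for _ in range(1000)])  # (1000, |D|)

rho_psi = psis.max(axis=0) - psis.min(axis=0)    # range of expected utility per d
regret = psis.max(axis=1, keepdims=True) - psis  # psi_up(d*_up) - psi_up(d)
d_hat = int(np.argmin(regret.max(axis=0)))       # minimum regret decision
print(rho_psi, d_hat)
```

In practice, the sampling would reflect the elicited classes; brute-force enumeration of this kind is only adequate for small discrete \(\mathcal{D}\).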

3.3 Sequential Games

We start by considering sequential games: one agent first makes her decision and, then, the other agent, having observed it, implements his alternative. As an example, imagine a company that deploys its cybersecurity countermeasures and a hacker who, after observing them, decides whether or not to launch an attack against the company.

Specifically, we consider a Defend-Attack situation in which a Defender chooses a defense \(d \in \mathcal{D}\) and, then, the Attacker, having observed the defense, chooses his attack \(a \in \mathcal{A}\). The corresponding bi-agent influence diagram is shown in Fig. 3.1, where the arc between the decision nodes reflects that the Defender's choice is observed by the Attacker. The consequences for both players depend on the success s of the attack. Each decision maker assesses differently the probability of the result of an attack, which depends on the defense and attack adopted: \(p_{D}(s\vert d,a)\) and \(p_{A}(s\vert d,a)\). The utility function \(u_{D}(d,s)\) of the Defender depends on her chosen defense and the result of the attack. Similarly, the Attacker's utility function is \(u_{A}(a,s)\). We first recall the standard game theoretic approach and check its robustness. We then present the ARA solution and, again, provide a robust analysis.

Fig. 3.1: The two player sequential decision game

3.3.1 Game Theoretic Solution and Robustness

The standard game theoretic solution does not require the Attacker to know the Defender's probabilities and utilities, since he observes the Defender's action. However, the Defender needs to know the Attacker's utilities and probabilities \((u_{A},p_{A})\), an instance of common knowledge. We then proceed as follows. First, we compute the expected utilities of the players at node S in Fig. 3.1:

$$\displaystyle{ \psi _{A}(a,d) =\int u_{A}(a,s)p_{A}(s\vert a,d)ds, }$$
(3.1)
$$\displaystyle{\psi _{D}(a,d) =\int u_{D}(d,s)p_{D}(s\vert a,d)ds.}$$

Then, we compute the Attacker’s best response to the Defender’s action d, which is

$$\displaystyle{ a^{{\ast}}(d) = \mbox{ arg max}_{ a\in \mathcal{A}}\psi _{A}(a,d). }$$

Knowing this, the Defender’s optimal action is, then,

$$\displaystyle{d_{GT}^{{\ast}} = \mbox{ arg max}_{ d\in \mathcal{D}}\psi _{D}(a^{{\ast}}(d),d).}$$

The pair \((d_{GT}^{{\ast}},a^{{\ast}}(d_{GT}^{{\ast}}))\) is a Nash equilibrium and, indeed, a subgame perfect equilibrium, see [5]. We call \(d_{GT}^{{\ast}}\) the Nash defense.
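
For finite sets \(\mathcal{D}\) and \(\mathcal{A}\), this backward induction is immediate once the expected utility matrices in (3.1) have been computed. A minimal sketch in Python, with purely hypothetical numbers:

```python
import numpy as np

# Hypothetical expected-utility matrices: rows index defenses d, columns attacks a.
psi_A = np.array([[0.4, 0.6],    # Attacker's psi_A(a, d) from (3.1)
                  [0.7, 0.2],
                  [0.5, 0.5]])
psi_D = np.array([[-3.0, -1.0],  # Defender's psi_D(a, d)
                  [-6.0, -0.5],
                  [-2.0, -2.0]])

a_star = psi_A.argmax(axis=1)                        # best response a*(d) per defense
d_GT = int(np.argmax(psi_D[np.arange(3), a_star]))   # Nash defense d*_GT
print("a*(d) =", a_star, " d*_GT =", d_GT)
```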

3.3.1.1 Robustness of the Game Theoretic Solution

Since we are supporting the Defender, we could argue that we know \((u_{D},p_{D})\) reasonably well. However, we would contend that knowledge about \((u_{A},p_{A})\) is not that precise, since it would require the Attacker to reveal them (common knowledge). This is questionable in many application areas, including security, cybersecurity and competitive marketing. We may use robust methods to criticize such information and, consequently, to assess the game theoretic solution.

As discussed in Sect. 3.2, from a conceptual point of view, to assess robustness we may consider classes for the Attacker's utilities and probabilities, modeled through \(u \in \mathcal{U}_{A}\), \(p \in \mathcal{P}_{A}\). Then, mimicking the approach above, for each feasible pair (u, p) we could:

  • Compute the expected utilities \((\psi _{A}^{u,p}(d,a),\psi _{D}^{u,p}(d,a))\) at node S in Fig. 3.1.

  • Compute the best response attack \(a_{u,p}^{{\ast}}(d)\) for each d.

  • Compute the optimal defense \(d_{u,p}^{{\ast}}\).

Then, if \(d_{u,p}^{{\ast}}\) remains reasonably stable under the allowed perturbations \(u \in \mathcal{U}_{A}\), \(p \in \mathcal{P}_{A}\), the game theoretic solution seems robust. However, if \(d_{u,p}^{{\ast}}\) is not that stable, we have an issue which questions, at first sight, the relevance of the proposed Nash defense \(d_{GT}^{{\ast}}\). At a deeper level, it also questions the appropriateness of the \((u_{A},p_{A})\) assessment, actually serving to criticize the game theoretic assumptions, especially that of common knowledge, see [16] or [10].

From an operational point of view, the above robustness analysis scheme for the game theoretic approach boils down to two computational issues:

  • Exploring the whole range of perturbations \(u \in \mathcal{U}_{A}\), \(p \in \mathcal{P}_{A}\). In some cases, for classes of probabilities and utilities widely studied in the robust Bayesian literature, see [2], it is possible to identify the extremal elements of \(\mathcal{U}_{A}\) and \(\mathcal{P}_{A}\) and compute upper and lower bounds on the quantities of interest (namely, the optimal decisions \(d_{u,p}^{{\ast}}\) and their expected utilities) through numerical optimization methods. Another possible approach is to randomly sample elements (u, p) from \(\mathcal{U}_{A}\times \mathcal{P}_{A}\) and check for large variations in \(d_{u,p}^{{\ast}}\) (and the corresponding expected utilities).

  • Declaring whether the effects induced over \(d_{u,p}^{{\ast}}\) and the expected utility are sufficiently small. As discussed in Sect. 3.2, a possible criterion is the range spanned by \(d_{u,p}^{{\ast}}\) as the utility and probability vary in the classes \(\mathcal{U}_{A}\) and \(\mathcal{P}_{A}\), respectively. Regarding the effects on the expected utility, a criterion of interest could be based on the regret \(r_{u,p}(d_{GT}^{{\ast}})\), given by the difference in expected utility when considering, for a given pair (u, p), the Nash defense \(d_{GT}^{{\ast}}\) and the optimal defense \(d_{u,p}^{{\ast}}\). A small value of \(\sup _{(u,p)\in \mathcal{U}_{A}\times \mathcal{P}_{A}}r_{u,p}(d_{GT}^{{\ast}})\) would denote robustness with respect to the choice of utility and probability; in that case, any pair (u, p) could be adopted as the view of the Attacker's behavior with no significant change in the consequences. If robustness is not achieved, we could undertake a minimum regret approach as discussed in Sect. 3.2; a sampling sketch of this check appears below.
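
A minimal sketch of such a sampling check, under a hypothetical one-parameter perturbation class standing in for \(\mathcal{U}_{A}\times \mathcal{P}_{A}\) (the parametrization is ours, purely for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
psi_D = np.array([[-3.0, -1.0],    # Defender's psi_D(a, d): rows d, columns a
                  [-6.0, -0.5],
                  [-2.0, -2.0]])

def attacker_matrix(theta):
    """Hypothetical one-parameter class for the Attacker's expected utilities:
    theta perturbs his valuation of attacking (column 0); column 1 is no attack."""
    psi_A = np.array([[0.4, 0.6], [0.7, 0.2], [0.5, 0.5]])
    psi_A[:, 0] += theta
    return psi_A

def nash_defense(psi_A):
    a_star = psi_A.argmax(axis=1)                   # best responses a*(d)
    return int(np.argmax(psi_D[np.arange(3), a_star])), a_star

d_GT, _ = nash_defense(attacker_matrix(0.0))        # nominal Nash defense

sup_regret = 0.0
for _ in range(2000):
    theta = rng.uniform(-0.3, 0.3)                  # perturbation of (u_A, p_A)
    d_up, a_star = nash_defense(attacker_matrix(theta))
    r = psi_D[d_up, a_star[d_up]] - psi_D[d_GT, a_star[d_GT]]  # r_{u,p}(d*_GT)
    sup_regret = max(sup_regret, r)

print("estimated sup regret of d*_GT:", sup_regret)
```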

An alternative would be to move to ARA, as discussed next.

3.3.2 ARA Solution and Robustness

If the game theoretic solution is not robust, we need to address the issue. One way forward is the ARA approach, for which we weaken the common knowledge assumption. In the sequential game, this means that the Defender does not know \((p_{A},u_{A})\). The problem she faces is depicted in Fig. 3.2.

Fig. 3.2: The decision problem as seen by the Defender

To solve her problem, the Defender requires more information than \(p_{D}(s\vert a,d)\) and \(u_{D}(d,s)\), available from our earlier discussion. She also needs \(p_{D}(a\vert d)\), her assessment of the probability that the Attacker will choose attack a after having observed that she has chosen defense d. Once the Defender has completed these assessments, she can solve her problem. Indeed, the expected utility of d would be

$$\displaystyle{\psi _{D}(d) =\int \psi _{D}(a,d)p_{D}(a\vert d)da =\int \left [\int u_{D}(d,s)p_{D}(s\vert a,d)ds\right ]\;p_{D}(a\vert d)da.}$$

Finally, her optimal decision would be \(d_{ARA}^{{\ast}} = \mbox{ arg max}_{d\in \mathcal{D}}\;\psi _{D}(d)\). Note that, in terms of classical game theory, the solution \(d_{ARA}^{{\ast}}\) of our sequential game may not correspond to a Nash equilibrium, see the example in Sect. 3.5.

Eliciting \(p_{D}(a\vert d)\) requires the Defender to analyze the problem from the Attacker's perspective.

First, the Defender puts herself in the Attacker's shoes and thinks about his decision problem. Figure 3.3 represents the Attacker's problem, as seen by the Defender, assuming he is an expected utility maximizer. The Defender should use all the information and judgment that she can obtain about the Attacker's utilities and probabilities. Instead of using point estimates of \((p_{A},u_{A})\) to find the Attacker's optimal decision \(a^{{\ast}}(d)\) for a given d, the Defender's uncertainty about the Attacker's decision should derive from her uncertainty about the Attacker's \((p_{A},u_{A})\), modeled through a distribution F on the space of utilities and probabilities, which we designate random probabilities and utilities. This induces a distribution over the Attacker's expected utility in (3.1), where the random expected utility for A would be

$$\displaystyle{\varPsi _{A}(a,d) =\int U_{A}(a,s)P_{A}(s\vert a,d)ds,}$$

for \((P_{A},U_{A}) \sim F\). Then, the Defender would find

$$\displaystyle{p_{D}(a\vert d) = \mathbb{P}_{F}[a = \mbox{ arg max}_{x\in \mathcal{A}}\varPsi _{A}(x,d)],}$$

in the discrete case and, similarly, in the continuous case. We can use Monte Carlo simulation to approximate \(p_{D}(a\vert d)\), drawing N samples \(\{(P_{A}^{i},U_{A}^{i})\}_{i=1}^{N}\) from F and setting

$$\displaystyle{ \hat{p}_{D}(a\vert d) \approx \frac{\#\{a = \mbox{ arg max}_{x\in \mathcal{A}}\;\varPsi _{A}^{i}(x,d)\}} {N}, }$$
(3.2)

where \(\varPsi _{A}^{i}(a,d) =\int U_{A}^{i}(a,s)P_{A}^{i}(s\vert a,d)\,ds\).
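
A minimal sketch of the estimation (3.2); the particular choice of F below (random risk coefficient, random success probability decreasing with the defense level, toy gains and costs) is ours, purely for illustration:

```python
import numpy as np
from scipy.stats import beta

rng = np.random.default_rng(0)
N = 10_000

def p_hat_D(d):
    """Monte Carlo approximation (3.2) of p_D(a | d), attacks = {attack, no attack}.
    Hypothetical F: random risk coefficient m and random success probability q;
    the Attacker gains 0.2 on success and pays 0.1 for attacking."""
    counts = np.zeros(2)
    for _ in range(N):
        m = rng.uniform(1.0, 5.0)                     # draw of U_A (risk coefficient)
        q = beta(2, 2 + 2 * d).rvs(random_state=rng)  # draw of P_A(s | a, d)
        psi_attack = q * np.exp(m * (0.2 - 0.1)) + (1 - q) * np.exp(-m * 0.1)
        psi = np.array([psi_attack, 1.0])             # no attack: cost 0, utility 1
        counts[int(np.argmax(psi))] += 1
    return counts / N

print(p_hat_D(0), p_hat_D(1))   # stronger defenses should lower the attack probability
```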

Fig. 3.3: Defender's analysis of the Attacker's problem

3.3.2.1 Robust Analysis

The above approach leads to a Bayesian decision analysis problem, with the peculiarity that we have a complex procedure to forecast the adversary's actions: we formulate the adversary's decision making problem and propagate our uncertainty about his judgments to his optimal action.

We could then think of performing a robust Bayesian analysis. The inputs to the Defender's decision analysis are \((u_{D}(d,s),p_{D}(s\vert a,d),p_{D}(a\vert d))\). We focus here on sensitivity to the last component, \(p_{D}(a\vert d)\), surely the most contentious one, obtained through adversarial calculations based on the proposed \(U_{A}(a,s)\) and \(P_{A}(s\vert a,d)\). For that, we define classes \(\mathcal{U}_{A}\), \(\mathcal{P}_{A}\) of random utilities and probabilities. Each pair (U, P) in these classes defines \(p_{D}^{UP}(a\vert d)\) through the ARA approach which, in turn, leads to \(d_{ARA}^{{\ast}UP}\).

Then, it is possible to consider the impact of the imprecision about U and P on three quantities: \(p_{D}^{UP}(a\vert d)\), \(d_{ARA}^{{\ast}UP}\) and \(\psi (d_{ARA}^{{\ast}UP})\). The first quantity requires the comparison of densities (actually, of their Monte Carlo approximations) through indices like the Kullback-Leibler divergence or the Gini index. For the first and second quantities, the interest centers on the variation of the Defender's decision, whereas for the third one the focus is on the expected utility of that decision; this last quantity should be of major interest. In all three cases, we say that robustness holds when the value of interest does not change much; otherwise, additional analysis should be undertaken, as described in Sect. 3.2. In particular, if the distributions \(p_{D}^{UP}(a\vert d)\) do not differ too much, it is possible to choose one of them and use the corresponding \(d_{ARA}^{{\ast}UP}\) directly.
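
For instance, a short sketch comparing two Monte Carlo estimates of \(p_{D}^{UP}(a\vert d)\) through the Kullback-Leibler divergence (the probabilities below are hypothetical):

```python
import numpy as np

def kl(p, q, eps=1e-12):
    """Kullback-Leibler divergence between two discrete attack distributions."""
    p, q = np.asarray(p) + eps, np.asarray(q) + eps
    p, q = p / p.sum(), q / q.sum()
    return float(np.sum(p * np.log(p / q)))

# Hypothetical Monte Carlo estimates of p_D^{UP}(a | d) under two (U, P) pairs:
p1 = [0.41, 0.59]
p2 = [0.47, 0.53]
print(kl(p1, p2))   # a small value suggests the two class members forecast similarly
```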

3.3.3 A Full Robust Solution

If the ARA analysis is not robust, we may opt for gathering additional information so as to reduce the classes \(\mathcal{U}_{A}\) and \(\mathcal{P}_{A}\). Increasing the sample size N in the Monte Carlo estimation of \(\hat{p}_{D}(a\vert d)\) in (3.2) would reduce the variability of the estimate; however, it will typically be ineffective in increasing robustness.

Once all possible sources of information have been exploited in trying to increase robustness about \(d_{ARA}^{{\ast}}\) and \(\psi (d_{ARA}^{{\ast}})\), some extra criterion has to be introduced to make a decision and report a value for the quantity of interest. In any case, such a decision should be reported with a warning about the lack of robustness. As discussed in Sect. 3.2, we could consider the decision \(d_{R}\) minimizing the maximum regret, i.e.

$$\displaystyle{\min _{d}\max _{U\in \mathcal{U}_{A},P\in \mathcal{P}_{A}}\left [\int \psi _{D}(a,d_{ARA}^{{\ast}UP})p_{D}^{UP}(a\vert d_{ARA}^{{\ast}UP})da -\int \psi _{D}(a,d)p_{D}^{UP}(a\vert d)da\right ].}$$

3.4 Simultaneous Games

We discuss now the simultaneous game model: two agents choose their decisions without knowing the action selected by the other. Among others, see [27] for a related discussion within a game theoretic framework. As an example, imagine a case in which the EASA decides whether to place undercover marshals on an airplane that might, or might not, be hijacked by terrorists.

Assume that the adversaries have sets \(\mathcal{D}\) and \(\mathcal{A}\) of feasible defenses and attacks, respectively. The only relevant uncertainty is S, denoting the success s of the attack. Each decision maker assesses differently the probability of the result of the attack, which depends on the defense and attack adopted: \(p_{D}(s\vert d,a)\) and \(p_{A}(s\vert d,a)\). The utility function \(u_{D}(d,s)\) of the Defender depends on her chosen defense and the result of the attack. Similarly, the Attacker's utility function is \(u_{A}(a,s)\), as illustrated in Fig. 3.4.

Fig. 3.4: BAID for the simultaneous Defend-Attack model

3.4.1 Game Theoretic Solution

Under common knowledge, the preferences and beliefs of both the Defender and the Attacker, \((u_{D},p_{D})\) and \((u_{A},p_{A})\) respectively, are disclosed. Therefore, each adversary knows the expected utility that each pair \((d,a) \in \mathcal{D}\times \mathcal{A}\) would provide to both of them, computed through

$$\displaystyle{\psi _{D}(d,a) =\int u_{D}(d,s)p_{D}(s\vert a,d)ds,}$$
$$\displaystyle{\psi _{A}(d,a) =\int u_{A}(a,s)p_{A}(s\vert a,d)ds.}$$

A Nash equilibrium \((d_{GT}^{{\ast}},a_{GT}^{{\ast}})\) for this game would satisfy

$$\displaystyle\begin{array}{rcl} \psi _{D}(d_{GT}^{{\ast}},a_{ GT}^{{\ast}})& \geq & \psi _{ D}(d,a_{GT}^{{\ast}})\,\,\,\forall \,d \in \mathcal{D}\quad \mbox{ and } {}\\ \psi _{A}(d_{GT}^{{\ast}},a_{ GT}^{{\ast}})& \geq & \psi _{ A}(d_{GT}^{{\ast}},a)\,\,\,\forall \,a \in \mathcal{A}. {}\\ \end{array}$$

Finding Nash equilibria may require the use of randomized strategies, see [4]. There could be several equilibria, with no unambiguous criterion to further discern among them, see [16] for a discussion.
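
For finite \(\mathcal{D}\) and \(\mathcal{A}\), pure strategy equilibria may be found by directly enumerating the two conditions above; a minimal sketch, with hypothetical expected utility matrices:

```python
import numpy as np

# Hypothetical expected-utility matrices psi_D(d, a), psi_A(d, a): rows d, columns a.
psi_D = np.array([[-3.0, -1.0], [-6.0, -0.5], [-2.0, -2.0]])
psi_A = np.array([[0.4, 0.6], [0.7, 0.2], [0.5, 0.5]])

equilibria = [(d, a)
              for d in range(psi_D.shape[0])
              for a in range(psi_D.shape[1])
              if psi_D[d, a] >= psi_D[:, a].max()     # no profitable deviation for D
              and psi_A[d, a] >= psi_A[d, :].max()]   # no profitable deviation for A
print(equilibria)   # may be empty: randomized strategies would then be required
```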

If utilities and probabilities are not common knowledge among the adversaries, a game theoretic approach proceeds by modeling the game as one with incomplete information, see [6–8], introducing the notion of player types. Each player is of a certain type, which is known to him but not to his opponent: a player's type represents the private information he may have. Harsanyi proposes the Bayes-Nash equilibrium as a solution concept, still under a strong common knowledge assumption: the adversaries' beliefs about the opponent's types are common knowledge and modeled through a common prior distribution. Moreover, it is assumed that the players' beliefs about other uncertainties in the problem are also common knowledge. Again, randomized strategies might be required to find such equilibria.

3.4.1.1 Robustness of the Game Theoretic Solution

We could argue that we know \((u_{D},p_{D})\) reasonably well, since we are supporting the Defender. However, we would contend that \((u_{A},p_{A})\) is not properly known, since this would require common knowledge, which is questionable. To address this concern, we perform a robust analysis of the Defender's decision at the Nash equilibrium.

For that, we would consider classes for the Attacker's utilities and probabilities, \(u \in \mathcal{U}_{A}\), \(p \in \mathcal{P}_{A}\). Then, for each feasible pair (u, p), we could compute the corresponding Nash equilibrium \((d_{u,p}^{{\ast}},a_{u,p}^{{\ast}})\). If \(d_{u,p}^{{\ast}}\) remains stable under the feasible perturbations of u and p, the game theoretic solution \(d_{GT}^{{\ast}}\) seems robust from the perspective of the Defender. However, if \(d_{u,p}^{{\ast}}\) changes, and especially if the corresponding expected utility does, we have a problem which questions, at first sight, the relevance of the proposed \(d_{GT}^{{\ast}}\) and, at a deeper level, the appropriateness of the \((u_{A},p_{A})\) assessment, actually serving to criticize the game theoretic approach at large and, in particular, the common knowledge assumption. The two computational issues, finding all possible optimal decisions and assessing robustness, are dealt with as in Sect. 3.3.1.1.

Note that we could actually study robustness with respect to \((u_{D},p_{D},u_{A},p_{A})\) and consider changes in \(d_{u_{A},p_{A},u_{D},p_{D}}^{{\ast}}\). In this case, if the Defender's Nash equilibrium decision is sensitive, we might question the Defender's knowledge, besides the game theory postulates.

3.4.2 ARA Solution and Robustness

If the Nash equilibrium is unstable, we may try an ARA approach, weakening the common (prior) knowledge assumptions. As reflected in Fig. 3.5, the Defender has to choose a defense \(d \in \mathcal{D}\), whose consequences depend on the success of an attack \(a \in \mathcal{A}\) simultaneously chosen by the Attacker; the attack is, therefore, uncertain for the Defender at the time she makes her decision.

Fig. 3.5: The Defender's decision analysis

By standard decision theory, the Defender should maximize her expected utility, see [3]. The Defender knows her utility function \(u_{D}(d,s)\) and her probability assessment \(p_{D}\) over S, conditional on (d, a). However, she does not know the Attacker's decision at node A, and she expresses her uncertainty about it through a probability distribution \(p_{D}(a)\). Then, the optimization problem she should solve to find \(d_{ARA}^{{\ast}}\) is

$$\displaystyle{ \begin{array}{r@{}l} \max _{d}\int \psi _{D}(a,d)p_{D}(a)da& =\max _{d}\int \left [\int u_{D}(d,s)p_{D}(s\vert a,d)\,ds\right ]p_{D}(a)\,da \\ & =\max _{d}\int \int u_{D}(d,s)p_{D}(s\vert a,d)p_{D}(a)\,ds\,da.\end{array} }$$
(3.3)
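
In the discrete case, (3.3) reduces to a finite maximization. A minimal sketch, assuming the inner expected utilities \(\psi _{D}(a,d)\) and the forecast \(p_{D}(a)\) have already been computed (the numbers are hypothetical):

```python
import numpy as np

# Hypothetical discrete instance of (3.3): psi_D[d, a] holds the inner expected
# utilities and p_a the Defender's forecast p_D(a) over the Attacker's moves.
psi_D = np.array([[-3.0, -1.0], [-6.0, -0.5], [-2.0, -2.0]])
p_a = np.array([0.4, 0.6])

d_ARA = int(np.argmax(psi_D @ p_a))   # maximizes sum_a psi_D(d, a) p_D(a)
print(d_ARA)
```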

We could then perform a robust analysis based on \(u_{D}\), \(p_{D}(s\vert a,d)\) and \(p_{D}(a)\). However, eliciting this last probability distribution is more difficult. We may use ARA as follows to obtain it.

Suppose the Defender thinks that the Attacker is an expected utility maximizer who tries to solve the decision problem shown in Fig. 3.6. The Attacker would look for the attack \(a \in \mathcal{A}\) providing him maximum expected utility:

$$\displaystyle{a^{{\ast}} =\mathop{ \mathrm{arg\,max}}\nolimits _{ a\in \mathcal{A}}\,\int \int u_{A}(a,s)p_{A}(s\vert a)p_{A}(d)ds\;dd.}$$

In general, the Defender will be uncertain about the Attacker's utility function and probabilities; she would then consider random utilities and probabilities \((U_{A}(a,s),P_{A}(s\vert a),P_{A}(d)) \sim F\) and compute the random optimal alternative

$$\displaystyle{ A^{{\ast}}\vert D =\mathop{ \mathrm{arg\,max}}\nolimits _{ a\in \mathcal{A}}\,\int \int U_{A}(a,s)P_{A}(s\vert a)P_{A}(d)ds\,dd. }$$
(3.4)

Then, we would make

$$\displaystyle{p_{D}(a) = P(A^{{\ast}} = a\vert D)}$$

in the discrete case and, similarly, in the continuous case.

Fig. 3.6: The Attacker's decision analysis, as seen by the Defender

Note that \((U_{A}(a,s),P_{A}(s\vert a))\) would be comparatively easy to elicit from the Defender, see examples in [1]. However, the elicitation of \(P_{A}(d)\) may require further analysis, leading to a next level of recursive thinking: the Defender would need to think about how the Attacker analyzes her problem. This is why, in (3.4), we condition on (the distribution of) D.

In the above, the Defender presumes that the Attacker thinks she is an expected utility maximizer trying to solve a decision problem like that described in Fig. 3.5. Therefore, in order to assess the required distribution, the Defender will elicit \((U_{A},P_{A})\) from her viewpoint and assess \(P_{A}(D)\) through the analysis of her own decision problem, as she believes the Attacker sees it, mimicking the resolution of problem (3.3) from the Attacker's perspective. This reduces the assessment of \(P_{A}(D)\) to computing the distribution

$$\displaystyle{D\mid A^{1} \sim \mathop{\mathrm{arg\,max}}\nolimits _{ d\in \mathcal{D}}\,\int \int U_{D}(d,s)P_{D}(S = s\mid d,a)P_{D}(A^{1} = a)ds\,da,}$$

assuming that the Defender is able to assess \(P_{D}(A^{1})\), where \(A^{1}\) represents the Attacker's decision within the Defender's second level of recursive thinking, in the nested decision model used by the Defender to predict the Attacker's analysis of her decision problem. To complete the assessment, the Defender should elicit \((U_{D},P_{D}) \sim G\), representing her probabilistic knowledge about how the Attacker may estimate her utility function \(u_{D}(d,s)\) and her probability \(p_{D}\) over \(S\vert d,a\), when she analyzes how the Attacker thinks about her decision problem. The elicitation of \(P_{D}(A^{1})\) might require further recursive thinking from the Defender, see our final discussion.

3.4.2.1 Robustness

Performing a robust analysis for the ARA approach to the simultaneous game is similar to what was described earlier. Consider a class for \((U_{A}(a,s),P_{A}(s\vert a),P_{A}(d)) \in (\mathcal{U}_{A},\mathcal{P}_{A},\mathcal{Q}_{A})\); we use (U, P, Q) to simplify the notation for the elements of the classes. Then, for each (U, P, Q) satisfying the constraints, and replicating the approach above, we could compute \(p_{D}^{UPQ}\) and \(d_{ARA}^{{\ast}UPQ}\). If \(d_{ARA}^{{\ast}UPQ}\) remains stable with respect to changes in (U, P, Q), the problem seems robust and we may apply the ARA approach with little concern. Otherwise, we could still use a robust solution concept, like the minimum regret approach mentioned in Sect. 3.2.

3.5 An Example

As an illustration, we consider a sequential Defend-Attack security routing problem. An organization needs to make a trip, either through a safe but costly route, or through a cheaper but more dangerous one. In the latter case, it may invest in security, rendering the route less dangerous. See [23] for a case concerning piracy in Somalia. Table 3.1 displays the consequences, expressed as costs, for the various defend and attack possibilities.

Table 3.1: Loss function in the routing problem

The following parameters are used:

  • \(\theta _{1}\) represents the fraction of assets lost by the organization when attacked but protected.

  • \(\theta _{2}\) represents the fraction of assets lost by the organization when attacked and not protected.

  • c is the cost per unit of assets.

  • K is the protection cost.

  • H is the cost of going through the expensive route.

  • d is the Attacker’s gain per unit of assets lost by the Defender.

  • B is the cost of an attack.

The Defender has beliefs about \(\theta _{i}\), with \(\theta _{i} \sim \beta (a_{i},b_{i})\), i = 1, 2. She is risk averse, and her utility function is strategically equivalent to −exp(hx), where x is her cost and h > 0 is her risk aversion coefficient. The Attacker has different beliefs about \(\theta _{i}\), with \(\theta _{i} \sim \beta (c_{i},e_{i})\), i = 1, 2. He is risk prone, and his utility function is strategically equivalent to exp(−mx), where x is his cost and m > 0 is his risk proneness coefficient. Both agents expect \(\theta _{1}\) to be smaller than \(\theta _{2}\), although this is not imposed. This may be reflected in the choice of the beta parameters, for example with \(a_{1}/(a_{1} + b_{1}) <a_{2}/(a_{2} + b_{2})\) in the case of the Defender. Table 3.2 provides the expected utilities for both agents under the various interaction scenarios.

Table 3.2: Expected utilities in the routing problem

The problem may be viewed through the game tree in Fig. 3.7, where \(d_{1}\) means going through the dangerous route, protected; \(d_{2}\) means going through the dangerous route, unprotected; and \(d_{3}\) means going through the safe route, whereas a means attack and \(\bar{a}\) means no attack.

Fig. 3.7: Game tree for the routing problem

We are supporting the Defender and assess from her the values c = 200,000, K = 50,000, H = 100,000 and h = 3. We also elicit from her the distributions \(\beta (a_{1},b_{1})\), with mean 0.3 and standard deviation 0.07, leading to \(a_{1} = 12.325\), \(b_{1} = 28.76\); and \(\beta (a_{2},b_{2})\), with mean 0.7 and standard deviation 0.18, leading to \(a_{2} = 3.815\), \(b_{2} = 1.635\).

3.5.1 Game Theoretic Approach

Under common knowledge, we assume the Defender knows that d = 30,000, B = 10,000, m = 5, as well as the distributions \(\beta (c_{1},e_{1})\), with mean 0.313 and standard deviation 0.16, leading to \(c_{1} = 2.272\), \(e_{1} = 4.978\); and \(\beta (c_{2},e_{2})\), with mean 0.324 and standard deviation 0.11, leading to \(c_{2} = 5.49\), \(e_{2} = 11.45\). We then proceed as follows:

  • At node \(A_{1}\), compute \(\max \;(\psi _{A}(d_{1},a),\psi _{A}(d_{1},\bar{a}))\) and call the optimal action \(a^{{\ast}}(d_{1})\). In the example, we have max(1.001, 1) = 1.001 and the optimal decision for the Attacker is a.

  • At node \(A_{2}\), compute \(\max \;(\psi _{A}(d_{2},a),\psi _{A}(d_{2},\bar{a}))\) and call the optimal action \(a^{{\ast}}(d_{2})\). We have max(1.002, 1) = 1.002 and the optimal decision for the Attacker is a.

  • At node D, compute \(\max \;(\psi _{D}(d_{1},a^{{\ast}}(d_{1})),\psi _{D}(d_{2},a^{{\ast}}(d_{2})),\psi _{D}(d_{3}))\) and call the optimal action \(d_{GT}^{{\ast}}\). In our case, \(\max \;(-29.67,-106.03,-20.08) = -20.08\) and the Nash defense \(d_{GT}^{{\ast}}\) is \(d_{3}\), that is, choosing the safe route.

3.5.2 Robustness of the Game Theoretic Solution

We consider now the robustness of the game theoretic solution. We simplify by assuming that the attack cost B = 10,000 is reasonably well known. Suppose, however, that d is not that well known, which we express through the constraint d ∈ [10000, 50000]. Similarly, suppose that \(c_{1} \in [0,3]\), \(e_{1} \in [1,6]\), \(c_{2} \in [2,8]\) and \(e_{2} \in [10,14]\). We sample randomly from these intervals 1000 times and repeat the procedure in Sect. 3.5.1.

Given these constraints, any of the three defenses may be the Nash defense. Indeed, based on the above sampling scheme, we estimate the probabilities of the three alternatives being the Nash defense as 0.454, 0.236 and 0.31, respectively, so there is no clear winner. The maximum loss in expected utility when we implement the defense \(d_{GT}^{{\ast}} = d_{3}\) is 19.08. This is deemed large enough, and we therefore turn to an ARA approach.

3.5.3 ARA Approach

The problem faced by the Defender is described in the decision tree in Fig. 3.8.

Fig. 3.8: Decision tree for the Defender in the routing problem

The expected utilities of the first two alternatives have the form

$$\displaystyle{\psi _{D}(d_{i}) = p_{D}(a\vert d_{i})\psi _{D}(d_{i},a) + p_{D}(\bar{a}\vert d_{i})\psi _{D}(d_{i},\bar{a}),i = 1,2.}$$

Thus, we need to assess the attack probabilities \(p(a\vert d_{i})\), given the implemented defense \(d_{i}\).

We illustrate the estimation of \(p_{D}(a\vert d_{1})\). We assume that \(d,c_{1},e_{1},c_{2},e_{2}\) are uniformly distributed over the intervals in Sect. 3.5.2, that is, \(d \sim \mathcal{U}[10000,50000]\), \(c_{1} \sim \mathcal{U}[0,3]\), \(e_{1} \sim \mathcal{U}[1,6]\), \(c_{2} \sim \mathcal{U}[2,8]\) and \(e_{2} \sim \mathcal{U}[10,14]\). Then, we may use Algorithm 1 to estimate the required probability, where \(\psi _{A}^{k}(d_{1},x)\) designates the expected utility attained by the Attacker when the Defender implements \(d_{1}\), he implements attack x, and the sampled parameters are \(d^{k},c_{1}^{k},c_{2}^{k},e_{1}^{k},e_{2}^{k}\).

In our particular case, with N = 10,000, we obtain \(\hat{p}(a\vert d_{1}) = 0.406\) (and, consequently, \(\hat{p}(\bar{a}\vert d_{1}) = 0.594\)). Similarly, \(\hat{p}(a\vert d_{2}) = 0.764\) and \(\hat{p}(\bar{a}\vert d_{2}) = 0.236\). Then, we have \(\psi (d_{1}) = -14.7\), \(\psi (d_{2}) = -81.2\) and \(\psi (d_{3}) = -20.08\), and the optimal ARA defense \(d_{ARA}^{{\ast}}\) is \(d_{1}\), which differs from \(d_{GT}^{{\ast}}\).

Algorithm 1: Estimating \(p(a\vert d_{1})\)
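
The following sketch implements Algorithm 1. It assumes that costs enter the utilities in units of \(10^{5}\), which is consistent with the reported \(\psi _{D}(d_{3}) = -\exp (3) \approx -20.08\) for H = 100,000, and reconstructs the Attacker's cost of attacking \(d_{1}\) as \(B - d\,\theta _{1}\) from the parameters of Table 3.1; these reconstructions are ours and the sketch should be read as an illustration:

```python
import numpy as np
from scipy.special import hyp1f1

rng = np.random.default_rng(0)
B, m, N = 10_000, 5.0, 10_000
SCALE = 1e5   # assumed cost scaling in the utilities, so that, e.g.,
              # psi_D(d_3) = -exp(3 * 100000 / SCALE) = -exp(3) = -20.09

def psi_A_attack(d, c1, e1):
    """Attacker's expected utility of attacking d_1, reconstructed as
    E[exp(-m (B - d * theta_1) / SCALE)] with theta_1 ~ Beta(c1, e1).
    For theta ~ Beta(a, b), E[exp(lam * theta)] = 1F1(a; a + b; lam)."""
    return np.exp(-m * B / SCALE) * hyp1f1(c1, c1 + e1, m * d / SCALE)

attacks = 0
for _ in range(N):
    d = rng.uniform(10_000, 50_000)    # Attacker's gain per unit of lost assets
    c1 = rng.uniform(0, 3)             # parameters of his Beta beliefs on theta_1
    e1 = rng.uniform(1, 6)
    if psi_A_attack(d, c1, e1) > 1.0:  # not attacking costs 0, i.e. utility exp(0) = 1
        attacks += 1

print("p_hat(a | d_1) =", attacks / N)   # the chapter reports approximately 0.406
```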

3.5.4 Robustness of the ARA Solution

We consider now the robustness of the ARA solution. For that, we consider classes of beta distributions with the same supports as the corresponding parameters above. For example, for d we assume that \(d \sim \beta (o_{1},o_{2})\), rescaled to the interval [10000, 50000], with \(o_{1} \in [0.5,1.5]\), \(o_{2} \in [0.5,1.5]\). Similarly, for the other parameters we use beta distributions over the previous intervals, with parameters as in Table 3.3, where the first parameter of each beta distribution is uniform over [LL, LU] and the second parameter is uniform over [UL, UU].

Table 3.3: Upper and lower limits for the parameters of the involved beta distributions

We sample 100 times from such distributions and repeat the procedure in Sect. 3.5.3. The estimated probabilities of each defense being optimal, in the ARA sense, are, respectively, \(\hat{p}(d_{1}) = 1\), \(\hat{p}(d_{2}) = 0\) and \(\hat{p}(d_{3}) = 0\). Therefore, \(d_{1}\) is clearly the most likely alternative to be optimal.

The regrets when implementing the various solutions are, respectively, 0 for \(d_{1}\), 37.91 for \(d_{2}\) and 8.54 for \(d_{3}\). Thus, the minimum regret defense is \(d_{1}\).

3.6 Discussion

Adversarial Risk Analysis is an emergent paradigm for supporting a decision maker who faces adversaries, in problems where the consequences are random and depend on the actions of all participating agents. The prevalent paradigm in this area is Game Theory. In this chapter, we have provided a framework for robustness analysis in ARA.

The approach we have followed is:

  • Under common knowledge assumptions, compute the game theoretic solution and perform a robust analysis of it. If it is stable, the solution may be used with confidence and no further analysis is required.

  • Otherwise, perform an ARA and undertake a robust analysis of the ARA solution. If it is stable, the ARA solution may be used with confidence and the analysis stops. Otherwise, gather more data and/or refine the relevant classes, possibly ending up declaring the ARA solution robust. If this does not suffice, move to the next stage.

  • Adopt a minimum regret (or another robust) solution concept.

We have illustrated the framework with two simple models, the sequential Defend-Attack game and the simultaneous Defend-Attack game, but the ideas extend to more complex ARA models. Similarly, we have assumed that the Attacker maximizes expected utility, but the ideas may be translated to other attacker rationalities, as in [21].

There are many other sensitivity analysis questions relevant to ARA. For example, we mentioned above the recursive assessment required in the simultaneous game, which may be expressed as follows, see [17]:

Algorithm 2: Recursive assessment required in the simultaneous game
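
A schematic sketch of this recursion; the uniform stopping rule and the Gaussian draws below are hypothetical placeholders for the distributions \(F^{i}\) and \(G^{i}\) discussed next:

```python
import numpy as np

rng = np.random.default_rng(0)
N_D, N_A = 3, 2    # hypothetical sizes of the decision sets D and A

def forecast(agent, level, n_mc=1000):
    """Defender's forecast of the decision of `agent` at a given level of the
    recursive hierarchy. Level 0 is the stopping point: a noninformative
    (uniform) distribution. At higher levels, the agent maximizes a randomly
    drawn expected utility against the forecast of the opponent one level down."""
    n = N_A if agent == "A" else N_D
    if level == 0:
        return np.ones(n) / n                       # noninformative stopping rule
    opp = "D" if agent == "A" else "A"
    opp_dist = forecast(opp, level - 1, n_mc)
    counts = np.zeros(n)
    for _ in range(n_mc):
        Psi = rng.normal(size=(n, len(opp_dist)))   # random expected utilities
        counts[np.argmax(Psi @ opp_dist)] += 1      # random optimal action
    return counts / n_mc

print(forecast("A", level=2))   # p_D(a) after two levels of recursive thinking
```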

This hierarchy stops when the Defender lacks the information necessary to assess the distribution \(F^{i}\) or \(G^{i}\) associated with the decision analysis of \(A^{i}\) or \(D^{i}\), respectively. At this point, the Defender would assign an unconditional probability distribution over \(A^{i}\) or \(D^{i}\), respectively, without going deeper into the hierarchy, summarizing all the information she might have through the direct assessment of \(\varPi _{D^{i-1}}(A^{i})\) or \(\varPi _{A^{i}}(D^{i})\), as may correspond. Should she have no additional information, she could assign a noninformative distribution, see [3].

However, climbing up one level in the hierarchy entails considerable effort. We could assess whether it is worthwhile through value of information computations.

Acknowledgements

DRI, CA and JG acknowledge support from the MINECO project MTM2014-56949-C3-1-R and the AXA-ICMAT Chair in Adversarial Risk Analysis. The authors are also grateful for the support of the COST IS1304 Action on Expert Judgement.