1 Introduction

Economic life is rich with uncertainties, most of which look nothing like probabilistic risk. In the subjective expected utility theory of Savage (1954), the decision maker’s beliefs about uncertainty can be quantified probabilistically. All uncertainty reduces to risk. This theory also excludes probabilistic risk attitudes towards this subjective risk. Attitudes such as optimism and pessimism, for which there is considerable empirical support. This paper studies a simple modification of expected utility to allow for optimism and pessimism.

Cohen (1992) argued that the most important departures from expected utility can be explained by the security and potential factors. These are the worst and best possible outcomes of a decision (Lopes 1987). Axiomatic models have been developed by Gilboa (1988), Jaffray (1988), Cohen (1992), Essid (1997), Chateauneuf et al. (2007), and Schmidt and Zimper (2007). The NEO-expected utility (NEO-EU) model of Chateauneuf et al. (2007) evaluates choices using the formula:

$$\begin{aligned} \gamma \text{(worst } \text{ possible } \text{ utility) } + (1-\gamma -\delta )\text{(expected } \text{ utility) } + \delta \text{(best } \text{ possible } \text{ utility) }. \end{aligned}$$

NEO-EU is a special case of Choquet expected utility (Gilboa 1987; Schmeidler 1989; Wakker 1989) with a Non-Extremal-Outcome additive (NEO-additive) capacity. A NEO-additive capacity is a transformation of a probability measure that is linear for non-extreme probabilities, and departs from expected utility if and only if it is discontinuous at zero or one. As such, it retains subjective expected utility’s probabilistic sophistication property (Machina and Schmeidler 1992) whilst allowing for non-expected utility attitudes. Wakker (2001, 2005) gave model-free, behavioural definitions of pessimism and optimism using an uncertainty analogue of the common consequence effect (Allais 1953).Footnote 1 According to these definitions, a NEO-EU maximiser, with \(\gamma \) and \(\delta \) strictly positive, responds to the probabilistic beliefs by exhibiting both pessimism and optimism simultaneously, a condition called ambivalence or inverse-S behaviour. Ambivalence is the most prevalent uncertainty attitude observed in experiments (Wakker 2010: 203–243, 290–292).

NEO-EU has been applied extensively (Abdellaoui et al. 2010; Dominiak et al. 2012; Dominiak and Lefort 2013; Ford et al. 2013; Eichberger et al. 2012; Eichberger and Kelsey 2011, 2014; Ludwig and Zimper 2014; Romm 2014; Teitelbaum 2007; Zimper 2012). Its tractability stems from both: the probabilistic sophistication property, and the simple way such probabilities are used in the evaluation. Naively applied, it is not compatible with the ambiguity aversion examples of Ellsberg (1961). It has been argued, however, that source dependence of uncertainty attitudes is appropriate for explaining Ellsberg’s examples (Heath and Tversky 1991; Fox and Tversky 1995; Chow and Sarin 2001; Wakker 2001; Abdellaoui et al. 2011). For example, a British investor may respond more optimistically to his beliefs about the UK market than his beliefs about foreign markets, even if beliefs are probabilistic in both cases. By allowing for source dependence of uncertainty attitudes, probabilistic sophistication can be derived within each source of uncertainty, without requiring it to hold across each source (Chew and Sagi 2008). Allowing parameters of decision models to be source dependent adds further layers of difficulty for empirical applications. NEO-expected utility’s simple form, however, makes such an approach tractable, as demonstrated by Abdellaoui et al. (2011).

Chateauneuf et al. (2007) gave a preference foundation for NEO-EU in a purely subjective, Savage-style framework, in particular, using simple acts that map an arbitrary state space to a connected and separable outcome set. By deriving a continuous, cardinal utility on this rich outcome space, the subjective mixture techniques of Ghirardato et al. (2003) could be employed to give elegant preference axioms. In certain economic applications, most notably when outcomes are monetary, assuming rich topological structure on the outcome set is appropriate. In other applications, it is not natural to assume such a structure, for example, when the outcomes are: health outcomes, environmental outcomes, durable goods.

In this paper, the original Savage framework with an arbitrary outcome set is considered. The state space is at least countably infinite. There are several problems with NEO-EU in this framework. Because the probability transformation can be discontinuous, the axiomatic foundations are complicated. For risk, which is a special case of the Savage framework, Webb and Zank (2011) provided a preference foundation for NEO-expected utility. Complicated trade-off axioms, consistent optimism and consistent pessimism were used to measure and ensure consistency of the discontinuities, and additional structural assumptions were necessary to derive cardinal utility and unique parameters. A similar approach in the purely subjective setting would be no less complex. For arbitrary outcome sets, the NEO-EU parameters need not be uniquely determined (Webb and Zank 2011, 710, Example 8). For applications based on such a framework, it may be difficult to use NEO-EU to generate behavioural predictions related to comparative optimism or pessimism. Futhermore, applications of NEO-EU to frameworks with infinite states are problematic. Discontinuities in the evaluation formula mean that even simple applications are vulnerable to problems such as empty best responses, leading to non-existence of equilibria.Footnote 2

To resolve both of the problems above, a continuous version of NEO-EU is developed in this paper. We start with NEO-EU on a finite state space, where the above-mentioned problems do not arise and consider the simplest extension to infinite states that has a continuous probability transformation function. The resulting theory is called Piecewise Additive Choquet Expected (PACE) utility. It will be shown that PACE utility admits a simple axiomatisation in the Savage framework and can be applied in cases where NEO-EU fails. If NEO-EU is the smallest departure from expected utility to allow inverse-S behaviour, then PACE utility is the smallest continuous departure from expected utility to allow inverse-S behaviour.

The remainder of the paper is structured as follows: Sect. 2 presents the theoretical background of this paper; Sect. 3 considers the problem of extending NEO-additive capacities to infinite state spaces and introduces PACE utility, and Sect. 4 studies the preference foundations in the Savage framework, with the paper’s main theorem delivering an axiomatic characterisation of those preferences that admit PACE utility representations.

2 Preliminaries

This section outlines the framework for choice under uncertainty and models for ambiguity. Let \(\fancyscript{S}\) a set of states and \(\fancyscript{E}\) be a \(\sigma \)-algebra of events. We allow for the case where \(\fancyscript{E}\) is the power set of \(\fancyscript{S}\). Some richness will later be imposed on \(\fancyscript{S}\) via a solvability condition, which will imply that \(\fancyscript{S}\) contains infinitely many states.Footnote 3 Let \(\fancyscript{X}\) be a set of outcomes. The set of outcomes can be finite or infinite. States and outcomes are the only primitives; from these all other definitions are derived.

An act is a function \(f:\fancyscript{S}\rightarrow \fancyscript{X}\) that is measurable with respect to \(\fancyscript{E}\). It is assumed that acts are simple, that is, they take only finitely many values. Acts will also be written \(f=[A_1,f_2;\ldots ; A_n,f_n]\), denoting the act with outcome \(f_i\) if the state belongs to event \(A_i\). Acts are the objects of choice. By choosing act \(f\), the decision maker receives outcome \(f(s)\) if state \(s\) obtains. The act results in outcome \(x\) if the state belongs to \(f^{-1}(x)\in \fancyscript{E}\). The set of acts is \(\fancyscript{A}\). An act \(f\) is constant if \(f(s)=x\) for all \(s \in \fancyscript{S}\) (we will write \(f=x\)). An act may be defined by its subacts. For \(f,g \in \fancyscript{A}\) and \(A \in \fancyscript{E}\), \(f_A g := \{ h \in \fancyscript{A}: s\in A \Rightarrow h(s)=f(s), s\notin A \Rightarrow h(s)=g(s) \}\). An event \(A\in \fancyscript{E}\) is null if \(f_A h \sim g_A h\) for all \(f,g,h\in \fancyscript{A}\), otherwise it is non-null.

A set function \(\nu :\fancyscript{E}\rightarrow \mathbb {R}\) is normalised if \(\nu (\emptyset ) = 0\) and \(\nu (\fancyscript{S})=1\). It is monotonic if, for all \(A,B\in \fancyscript{E}\), \(A\subseteq B\) implies \(\nu (A)\leqslant \nu (B)\). It is additive if, for all disjoint \(A,B\in \fancyscript{E}\), \(\nu (A\cup B) = \nu (A) + \nu (B)\). A capacity is a real-valued set function \(\nu :\fancyscript{E}\rightarrow \mathbb {R}\) that is normalised and monotonic. A capacity \(\nu :\fancyscript{E}\rightarrow [0,1]\) is convex-valued if for all \(\alpha \in [0,1]\) there exists \(A\in \fancyscript{E}\) such that \(\nu (A)=\alpha \). A probability measure is an additive capacity. A capacity \(\nu :\fancyscript{E}\rightarrow [0,1]\) is a probability transformation if there is a strictly increasing function \(\phi :[0,1]\rightarrow [0,1]\) and a probability measure \(p:\fancyscript{E}\rightarrow [0,1]\) such that \(\nu =\phi \circ p\).

The decision maker exists only to maximise a preference relation \(\succcurlyeq \) defined over \(\fancyscript{A}\). A utility function \(U\) over acts \(\fancyscript{A}\) is a real-valued function that represents preferences such that \(f\succcurlyeq g\) if and only if \(U(f)\geqslant U(g)\). Preferences \(\succcurlyeq \) over acts \(\fancyscript{A}\) conform to subjective expected utility if they are represented by:

$$\begin{aligned} E(p,u)(f) = \sum _{x\in \fancyscript{X}} p(f^{-1}(x))u(x) \end{aligned}$$

where \(p:\fancyscript{E}\rightarrow [0,1]\) is a probability measure and \(u:\fancyscript{X}\rightarrow \mathbb {R}\) is a utility function for outcomes. Additivity of the probability measure ensures that, for all \(x\in \fancyscript{X}\), the following holds:

$$\begin{aligned} p(f^{-1}(x))=p \left( \bigcup _{y\succcurlyeq x} f^{-1}(y)\right) - p \left( \bigcup _{y\succ x} f^{-1}(y)\right) . \end{aligned}$$

The subjective expected utility formula can therefore be written as:

$$\begin{aligned} E(p,u)(f) = \sum _{x\in \fancyscript{X}} \left[ p \left( \bigcup _{y\succcurlyeq x} f^{-1}(y)\right) - p \left( \bigcup _{y\succ x} f^{-1}(y)\right) \right] u(x). \end{aligned}$$

This exercise helps one clearly distinguish between subjective expected utility and the following model, Choquet expected utility (Schmeidler 1989; Wakker 1989), in which the probability measure \(p\) in the above expression is replaced with \(\nu \), a (possibly) non-additive capacity. Preferences \(\succcurlyeq \) over acts \(\fancyscript{A}\) conform to Choquet expected utility if they are represented by:

$$\begin{aligned} E(\nu ,u)(f) = \sum _{x\in \fancyscript{X}}\left[ \nu \left( \bigcup _{y\succcurlyeq x} f^{-1}(y)\right) -\nu \left( \bigcup _{y\succ x} f^{-1}(y)\right) \right] u(x) \end{aligned}$$

where \(\nu :\fancyscript{E}\rightarrow [0,1]\) is a capacity and \(u:\fancyscript{X}\rightarrow \mathbb {R}\) is a utility function for outcomes. Notice that we use the shorthand \(E(\nu ,u)(f)\) for the Choquet expected utility of act \(f\in \fancyscript{A}\) using capacity \(\nu \) and utility for outcomes \(u\).

Choquet expected utility allows for non-neutral attitudes to ambiguity. A capacity \(\nu :\fancyscript{E}\rightarrow [0,1]\) is convex if \(\nu (A\cup B)-\nu (B)\) is non-decreasing as \(B\) \(\supseteq \)-increases. Similarly, a capacity is concave if \(\nu (A\cup B)-\nu (B)\) is non-increasing as \(B\) \(\supseteq \)-increases. Wakker (2001, 2005) gave behavioural definitions of pessimism and optimism and characterised such behaviour in context of Choquet expected utility. Under Wakker’s definitions, a Choquet expected utility maximiser is pessimistic if and only if the capacity is convex, and optimistic if and only if the capacity is concave. The most prevalent attitude found in experiments is ambivalence, a composition of pessimistic and optimistic responses to uncertainty. Because such capacities are initially concave and then convex, ambivalence is often called inverse-S behaviour.

A special case of Choquet expected utility, presented and axiomatised by Chateauneuf et al. (2007), is the NEO-additive capacities. These are discussed further in the next section. The Choquet expected utility of an act \(f\) with respect to a NEO-additive capacity can be shown to be a convex combination of subjective expected utility, the utility of the act’s best outcome, \(u(f^*)\), and utility of the act’s worst outcome, \(u(f_*)\):

$$\begin{aligned} \hbox {NEO}(f) = \gamma u(f_*) + (1-\gamma -\delta )E(p,u)(f) + \delta u(f^*) \end{aligned}$$

with \(\gamma ,\delta \geqslant 0\) and \(\gamma + \delta < 1\). Here, \(\gamma +\delta \) dictates the extent of the departure from expected utility; the degree of ambiguity. It is apparent from the representation that two acts with identical (or indifferent) best and worst outcomes will be ranked according to their expected utilities. Choquet expected utility using NEO-additive capacities is sometimes called NEO-expected utility (NEO-EU).

3 PACE utility

In this section, the problem of extending NEO-additive capacities to Savage’s infinite state space is addressed. Consider a NEO-additive capacity \(\omega \) defined over a finite set of states \(S\subset \fancyscript{S}\), \(S=\{s_1,\ldots , s_n\}\). Let \(\fancyscript{E}_S= \{\emptyset , A_1, \ldots , A_i, \ldots , \fancyscript{S} \}\) denote the set of \(2^n\) events, formed by all subsets of \(S\). Suppose, for the sake of presentational simplicity, that all non-empty events in \(\fancyscript{E}_S\) occur with positive probability, it is known that such NEO-additive capacities have the following form:

$$\begin{aligned} \omega :&\fancyscript{E}_S\rightarrow [0,1], \\&\emptyset \mapsto 0, \\&A_1 \mapsto (1-\gamma -\delta )p(A_1) + \delta , \\&\vdots \\&A_i \mapsto (1-\gamma -\delta )p(A_i) + \delta , \\&\vdots \\&S \mapsto 1, \end{aligned}$$

where \(\gamma ,\delta \geqslant 0\) and \(\gamma +\delta < 1\).

The NEO-expected utility model is tractable for economic applications assuming finitely many states. Consider the following example, which will be used throughout this section:

Example 3.1

(A simple insurance model with finite states) An agent with monetary wealth \(w\) faces a loss of \(l\) with probability \((1-p)\). Full insurance is available at premium \(z\), which is actuarially fair, \(w-z = pw + (1-p)(w-l)\). The agent chooses the probability \(q\) that she receives full insurance.Footnote 4 Suppose there are finitely many options: \(0 = q_1 < \cdots < q_{n-1} < q_n =1\), the problem is choosing \(q\) to make the lottery \((q, w-z; (1-q)p, w; (1-q)(1-p), w-l)\) as preferable as possible. The agent is a CEU maximiser with a strictly increasing and strictly concave utility for money \(u\) and probability transformation \(\omega =\phi \circ p\). If \(\phi \) is the identity (expected utility), then \(u(w-z)> pu(w)+(1-p)u(w-l):= e\), and full insurance is strictly preferred to no insurance. Maximising \(qu(w-z) + (1-q)e\) yields \(q = q_n = 1\) as the unique solution. If \(\phi \) is NEO-additive then, for \(q\in (0,1)\), the utility is given by \( \gamma u(w-l) + (1-\gamma -\delta )[qu(w-z) + (1-q)e] + \delta u(w)\), which is strictly increasing in \(q\). It is possible, however, that \(q=1\) is not optimal. Utility for \(q\in (0,1)\) is bounded below by \( \gamma u(w-l) + (1-\gamma -\delta )e + \delta u(w)\) which is greater than \(u(w-z)\) for \(\delta \) sufficiently close to one. In that case, when the agent has a high degree of optimism, optimality occurs where \(q=q_{n-1}\).

An extension of \(\omega \) to the infinite state space \(\fancyscript{S}\), is a capacity \(\nu :\fancyscript{E}\rightarrow [0,1]\) that coincides with \(\omega \) wherever \(\omega \) is defined, \(\nu |_{\fancyscript{E}_S} = \omega \). One extension of a NEO-additive capacity to consider is the probability transformation such that, for all \(A\in \fancyscript{E}\):

$$\begin{aligned} \nu (A) = \left\{ \begin{array}{ll} 0 &{} \text{ if } p(A) = 0, \\ (1-\gamma -\delta ) p(A) + \delta &{} \text{ if } 0< p(A) < 1, \\ 1 &{} \text{ if } p(A)= 1, \end{array} \right. \end{aligned}$$

with \(\gamma ,\delta \geqslant 0\) and \(\gamma + \delta < 1\). That is, \(\nu =\phi \circ p\) with the transformation \(\phi \) that is strictly increasing everywhere, linear for all probabilities between zero and one, but possibly discontinuous at zero and/or at one. The probability transformation above is certainly the most obvious extension of a NEO-additive capacity to infinite states. Indeed, we call such \(\phi \) a NEO-additive transformation function. There are, however, some problems with the NEO-expected utility model that results. In particular, the discontinuity present in this transformation function presents difficulties for even simple applications. In the following example, there is a continuum (a compact and connected set) of states, which is typical of many economic applications:

Example 3.2

(A simple insurance model with a continuum of states) Consider the model of example above, except the problem now involves choosing \(q\in [0,1]\). If \(\phi \) is the identity, maximising \(qu(w-z) + (1-q)e\) yields \(q = 1\) as the unique solution. If \(\phi \) is continuous, the problem has at least one solution. If \(\phi \) is NEO-additive then, utility is strictly increasing for \(q\in (0,1)\). If \(\delta \) sufficiently close to one, \(q=1\) is not optimal. In that case there is no well-defined solution to the agent’s problem.

These issues do not arise for probability transformations that are continuous. Therefore, we now seek the simplest extension of a NEO-additive capacity that has a continuous transformation function. Recall that the NEO-additive transformation function above linearly transforms all non-extreme probabilities:

$$\begin{aligned} A\mapsto (1-\gamma -\delta ) p(A) + \delta \quad \text{ if } 0< p(A) < 1. \end{aligned}$$

Because of this, a NEO-additive transformation function \(\phi \) is continuous if and only if it is the identity. Consider the following, minor weakening of the above requirement. Let \(\kappa \in [\frac{1}{2},1]\) and consider a capacity \(\nu =\phi \circ p\) that satisfies:

$$\begin{aligned} A\mapsto (1-\gamma -\delta ) p(A) + \delta \quad \text{ if } 1-\kappa < p(A) < \kappa . \end{aligned}$$

By taking \(\kappa \) close to one, this capacity is empirically indistinguishable from a NEO-additive capacity. Under this weaker requirement, however, continuity of \(\phi \) can be retained. To do so, we must specify \(\phi \) on \([0,1-\kappa ]\) and \([\kappa ,1]\). The simplest assumption, and therefore most in keeping with the NEO-EU spirit, is that \(\phi \) is also strictly increasing and linear on these intervals. The only capacity to achieve all of this is the capacity \(\nu \) such that, for all \(A\in \fancyscript{E}\):

$$\begin{aligned} \nu (A) = \left\{ \begin{array}{ll} (1-\gamma - \frac{\delta \kappa }{1-\kappa }) p(A) &{} \text{ if } p(A) \leqslant 1-\kappa \\ (1-\gamma -\delta ) p(A) + \delta &{} \text{ if } 1-\kappa \leqslant p(A) \leqslant \kappa \\ \frac{1-[(1-\gamma -\delta )\kappa +\delta ]}{1-\kappa }p(A) + \frac{(1-\gamma -\delta )\kappa + \delta -\kappa }{1-\kappa } &{} \text{ if } \kappa \leqslant p(A) \end{array} \right. \end{aligned}$$

with \(p:\fancyscript{E}\rightarrow [0,1]\) a probability measure, \(\kappa \in [\frac{1}{2},1]\), \(\kappa \leqslant \frac{1-\gamma }{1-\gamma -\delta }\) and \(\kappa \leqslant \frac{1-\delta }{1-\gamma -\delta }\) and \(\gamma +\delta < 1\). That is, \(\nu =\phi \circ p\) with the transformation \(\phi \) being continuous and strictly increasing everywhere, and linear on \([0,1-\kappa ]\), \([1-\kappa ,\kappa ]\) and \([\kappa ,1]\). We call such capacities piecewise additive. Notice that \(\gamma ,\delta \geqslant 0\) is not required, hence departures from additivity can, but need not, be of the inverse-S variety. Hence, piecewise additive capacities may be considered continuous versions of the generalised NEO-additive capacities of Eichberger et al. (2012).

Definition 3.3

(Piecewise Additive Choquet Expected (PACE) utility) PACE utility holds if preferences are represented by Choquet expected utility with a piecewise additive capacity.

The PACE utility model can be thought of as “expected utility with kinks”, hence the \(\kappa \). In applications, it is easier to deal with “kinks” than to deal with “jumps”. In the insurance example developed above, the decision maker’s best response will always be non-empty if \(\phi \) is continuous. The problem encountered with NEO-EU in Example 2 cannot arise under PACE utility. Also, PACE utility remains tractable enough to obtain a closed-form solution:

Example 3.4

(A simple insurance model with a continuum of states) Suppose, in the example above, that the agent chooses \(q\in [0,1]\) and her capacity is piecewise additive. Utility in this case varies continuously with \(q\in [0,1]\), hence the problem has a well-defined solution. It is possible for the solution to differ from the expected utility case. To see this, let \(\kappa > p\), and let \(\delta \) be close to one. Then, utility increases with \(q\) on \([0,\frac{\kappa -p}{1-p}]\) and decreases with \(q\) on \([\frac{\kappa -p}{1-p},1]\). Optimality occurs where \(q = \frac{\kappa -p}{1-p}\). At this \(q\), the probability of getting at least \(w-z\) is \(\kappa \), and the agent is more sensitive to the unlikely, best outcome \(w\). This agent prefers full insurance to no insurance, but most prefers to gamble on being insured.

A NEO-EU maximiser stratifies events into “impossible”, “uncertain” and “certain”, being probability zero, probability in \((0,1)\), and probability one respectively, and behaves as an expected utility maximiser within each class, but not across the classes. PACE utility is based on a similar trichotomy, buts allows the classes to be subjective. Events are now stratified into “unlikely”, “moderate” and “likely”, corresponding to probability “low enough”, probability “not too low or too high”, and probability “high enough”, respectively. To operationalise this idea, we used a personal parameter, \(\kappa \in [\frac{1}{2},1]\), such that “unlikely”, “moderate” and “likely” correspond to probability not greater than \(1-\kappa \), in \([1-\kappa ,\kappa ]\), and not less than \(\kappa \), respectively. For example, if \(\kappa = 2/3\), then events occurring with probability less than \(1/3\) are dubbed “unlikely” and events occurring with probability 2/3 or greater are dubbed “likely”.Footnote 5 PACE utility reduces to expected utility within the unlikely, moderate, and likely classes of events. It permits departures from expected utility when comparing events from different classes.

The main theorem of this paper, Theorem 4.6, presents a preference foundation for PACE utility under purely subjective uncertainty. The axiomatic foundations of Choquet expected utility with NEO-additive capacities are complicated in the rich state space, arbitrary outcome set framework (Webb and Zank 2011). PACE utility will be derived here, however, from a simple weakening of expected utility’s sure-thing principle.

4 A preference foundation

Here we recall the axioms for subjective expected utility. We assume there are at least three outcomes \(x,y,z\in \fancyscript{X}\) such that \(x\succ y \succ z\).

Axiom 1

(Ordering) Preferences \(\succcurlyeq \) over acts \(\fancyscript{A}\) are a weak order.

Axiom 2

(Monotonicity) For acts \(f,g\in \fancyscript{A}\), \(f(s)\succcurlyeq g(s)\) for all \(s\in \fancyscript{S}\) implies \(f\succcurlyeq g\).

An Archimedean axiom is required. Here, a rank-dependent axiom is used. A simpler axiom could be used at this point. But rank dependence is required later. The benefit of using a slightly more complicated Archimedean axiom is that only one such axiom will be used throughout the paper. For an act \(f\) and event \(A\), the event \(B\) dominates \(A\) under \(f\) if, for all \(\tilde{s}\in B\) and \(s\in A\), \(f(\tilde{s})\succcurlyeq f(s)\). The rank of an event \(A\) under \(f\), denoted \(\mathcal {R}(A,f)\), is the largest event that dominates \(A\) under \(f\). Hence, \(f(\tilde{s})\succcurlyeq f(s)\) holds for all \(\tilde{s}\in \mathcal {R}(A,f)\) and \(s\in A\) holds, and also \(f(\tilde{s})\preccurlyeq f(s)\) holds for all \(\tilde{s}\notin \mathcal {R}(A,f)\) and \(s\in A\) because \(\mathcal {R}(A,f)\) is the \(\supseteq \)-maximal dominating event. For a simple act, \(f=[A_1,f_1; \ldots ; A_n, f_n]\), we may label the outcomes of \(f\) so that \(f_1 \prec \cdots \prec f_n\). Then, the rank of an event \(A_i\) under \(f\) is given by \(\mathcal {R}(A_i, f) = \cup _{j=i}^n A_j\).

Axiom 3

(Archimedeanity) If \(S = A^1, A^2, \ldots ,\) is a sequence of non-null events such that:

$$\begin{aligned} x_{A^1} f \succ x_{A^1} g \;\; \text{ and }\;\; x_{A^i} f \sim x_{A^{i+1}} g \end{aligned}$$

with \(\mathcal {R}(A^i, x_{A^i} f) = \mathcal {R}(A^i, x_{A^i} g)\), for all \(i=1,2,\ldots ,\) then \(S\) is finite.

The following condition, solvability, relates to the structure of \(\fancyscript{E}\), so is separated from the preference conditions above. Solvability holds if, for all acts \(f,g,h\in \fancyscript{A}\) with \(f\succ g \succ h\), there is an event \(A\) such that \(g \sim f_A h\).

Let \(\fancyscript{B}\) denote the set of binary acts, acts taking at most two values. Given \(x\in \fancyscript{X}\), \(A\in \fancyscript{E}\) and \(f\in \fancyscript{B}\), the act \(x_A f\) takes at most three values. The following axiom concerns consistency of a likelihood order revealed using such acts:

Axiom 4

(Comparative likelihood consistency) For all \(f,g,\tilde{f},\tilde{g}\in \fancyscript{B}\), all \(A,B\in \fancyscript{E}\), \(x,y,\tilde{x},\tilde{y}\in \fancyscript{X}\) with \(x\prec y\) and \(\tilde{x}\prec \tilde{y}\), the implication:

$$ \begin{aligned} x_A f \sim x_B g, \;\; \tilde{x}_A \tilde{f} \sim \tilde{x}_B \tilde{g}, \;\; \& \;\; y_A f \succcurlyeq y_B g \;\; \Rightarrow \;\; \tilde{y}_A \tilde{f} \succcurlyeq \tilde{y}_B \tilde{g}, \end{aligned}$$

holds if \(\mathcal {R}(A,j)\) and \(\mathcal {R}(B,k)\) are constant, where \(j = x_Af, \tilde{x}_A\tilde{f}, y_A f, \tilde{y}_A\tilde{f}\) and \(k= x_B g, \tilde{x}_B \tilde{g}, y_B g, \tilde{y}_B \tilde{g}\).

Axiom 4 implies Savage’s axiom P4.Footnote 6 Axiom 4 is the well-known P2* axiom of Gilboa (1987), restricted to ranked, three-outcome acts.Footnote 7 Abdellaoui and Wakker (2005) provides an extensive treatment of such axioms. Define an order \(\succ \) over events, written \(A\succ ^L B\) and read, “\(A\) is, subjectively, more likely than \(B\),” whenever \(x_A y \succ x_B y\) for some (for all, by axiom 4) \(x,y\in \fancyscript{X}\) with \(x\succ y\). Definition 2.1 of Abdellaoui and Wakker (2005) referred to the same condition as revealed more likely in a basic sense. Define \(\prec ^L\), \(\succcurlyeq ^L\), \(\preccurlyeq ^L\), and \(\sim ^L\) in the usual way. It can be shown that \(\succcurlyeq ^L\) is a well-defined weak order over \(\fancyscript{E}\).

Subjective expected utility preferences and PACE utility preferences both satisfy axioms 14. The following axiom, the sure-thing principle, when combined with axioms 14, characterises subjective expected utility:

Axiom 5

(The sure-thing principle) For all events \(A\in \fancyscript{E}\) and acts \(f,\tilde{f},g,h \in \fancyscript{A}\), the following implication holds: \(f_A g \succcurlyeq f_A h \;\Rightarrow \; \tilde{f}_A g \succcurlyeq \tilde{f}_A h\).

The sure-thing principle is will be suitably modified to account for the type of uncertainty attitudes permitted under PACE utility. It is useful to first consider the how PACE utility compares with NEO-EU. Under NEO-EU, a decision maker will conform to expected utility when comparing acts with common best and worst outcomes. Indeed, even if acts do not have common best and worst outcomes, a sure-thing principle holds whenever common outcomes are changed in a way that leaves best and worst outcomes unaffected. For an act \(f\in \fancyscript{A}\), let \(b(f)\) and \(w(f)\) denote the ranks of the best and worst outcomes of \(f\), respectively. Of course, \(w(f)\sim ^L \fancyscript{S}\). Then, NEO-EU necessarily satisfies the following condition:

Definition 4.1

(The NEO-sure-thing principle) For all events \(A\in \fancyscript{E}\) and acts \(f,\tilde{f},g,h \in \fancyscript{A}\), the implication:

$$\begin{aligned} f_A g \succcurlyeq f_A h \;\;\Rightarrow \;\; \tilde{f}_A g\succcurlyeq \tilde{f}_A h \end{aligned}$$

holds if \(w(j) \succ ^L \mathcal {R}(A,j) \succ ^L b(j)\), for all \(j=f_A g , f_A h , \tilde{f}_A g, \tilde{f}_A h\).

The NEO-sure-thing principle is necessary but, when combined with axioms 14, it is not sufficient for NEO-EU. See example 10 of Webb and Zank (2011:711). To pin down NEO-EU, Webb and Zank (2011) employed further axioms, consistent optimism and consistent pessimism. The derivation of PACE utility, however, will require only a weakening of the sure-thing principle.

Consider a PACE utility representation, and let \(K\) be an event with \(p(K)=\kappa \). The key properties of PACE utility will be sure-thing principles, the above implication, that hold whenever:

  1. 1.

    Events are ranked as likely: \(\mathcal {R}(A,j) \succcurlyeq ^L K\) for all \(j\).

  2. 2.

    Events are ranked as unlikely: \(K^{\complement } \succcurlyeq ^L \mathcal {R}(A,j)\) for all \(j\).

  3. 3.

    Events are ranked as moderate: \( K \succcurlyeq ^L \mathcal {R}(A,j) \succcurlyeq ^L K^{\complement }\) for all \(j\).

In each case above, \(j=f_A g , f_A h , \tilde{f}_A g, \tilde{f}_A h\). Sure-thing principles based on the above conditions are necessary for PACE utility. They are not, however, falsifiable axioms because they assume a priori knowledge of an event \(K\) with the required properties. A preference axiom, based on falsifiable conditions, is now developed that will imply both the existence of such a \(K\) and the corresponding behaviour within in each class of events. This axiom will be called the piecewise sure-thing principle. Before stating the piecewise sure-thing principle, it is necessary to formulate various local versions of the sure-thing principle. For a given event, \(A\in \fancyscript{E}\), we define upper, lower, outer and inner sure-thing principles that hold “at \(A\)”.

Definition 4.2

(Upper sure-thing principle at \(A\) ) For \(A\in \fancyscript{E}\), the implication:

$$\begin{aligned} f_B g \succcurlyeq f_B h \;\;\Rightarrow \;\; \tilde{f}_B g\succcurlyeq \tilde{f}_B h \end{aligned}$$

holds if \(\mathcal {R}(B,j) \succcurlyeq ^L A\), for all \(j=f_B g , f_B h , \tilde{f}_B g, \tilde{f}_B h\).

The upper sure-thing principle at \(A\) implies the sure-thing principle holds for outcomes ranked likelier than \(A\). This is a simple and testable condition. Conforming to the standard sure-thing principle is often seen as normatively desirable. One might appeal to a weaker criterion such as this, when violations of the sure-thing principle are permitted. The upper sure-thing principle at \(K\) necessarily holds under PACE utility.

Definition 4.3

(Lower sure-thing principle at \(A\) ) For \(A\in \fancyscript{E}\), the implication:

$$\begin{aligned} f_B g \succcurlyeq f_B h \;\;\Rightarrow \;\; \tilde{f}_B g\succcurlyeq \tilde{f}_B h \end{aligned}$$

holds if \(A \succcurlyeq ^L \mathcal {R}(B,j)\), for all \(j=f_B g , f_B h , \tilde{f}_B g, \tilde{f}_B h\).

The lower sure-thing principle at \(A\) implies the sure-thing principle holds for outcomes ranked less likely than \(A\). This seems to carry the same normative content as the upper sure-thing principle. Under PACE utility, events are considered unlikely only if they are no more likely than \(K^{\complement }\). Then, the lower sure-thing principle at \(K^{\complement }\) necessarily holds under PACE utility. Consider an event \(A\), with \(A\) likelier than its complement \(A^{\complement }\). Given their apparently equivalent normative status, if one conforms to the upper sure-thing principle at \(A\), then conforming to the lower sure-thing principle at \(A^{\complement }\) is reasonable. We call this the outer sure-thing principle:

Definition 4.4

(Outer sure-thing principle at \(A\)) The upper sure-thing principle at \(A\) and the lower sure-thing principle at \(A^{\complement }\) both hold, or the upper sure-thing principle at \(A^{\complement }\) and the lower sure-thing principle at \(A\) both hold.

Under PACE utility, preferences must satisfy the outer sure-thing principle at \(K\). That is, the sure-thing principle holds for likely outcomes and unlikely outcomes. The third class, moderate likelihood, is covered by the following condition:

Definition 4.5

(Inner sure-thing principle at \(A\) ) For \(A\in \fancyscript{E}\), with \(A\succcurlyeq ^L A^{\complement }\), the implication:

$$\begin{aligned} f_B g \succcurlyeq f_B h \;\;\Rightarrow \;\; \tilde{f}_B g\succcurlyeq \tilde{f}_B h \end{aligned}$$

holds if \(A \succcurlyeq ^L \mathcal {R}(B,j) \succcurlyeq ^L A^{\complement }\), for all \(j=f_B g , f_B h , \tilde{f}_B g, \tilde{f}_B h\).

The inner sure-thing principle applies to events with ranks of moderate likelihood. Under PACE utility, the inner sure-thing principle at \(K\) necessarily holds, where \(K\) and \(K^{\complement }\) are labelled so that \(K\succcurlyeq ^L K^{\complement }\). In probability terms, this amounts to expected utility when the outcomes are ranked with probability in the “inner” interval \([1-\kappa , \kappa ]\). The outer sure-thing principle refers to the “outer” intervals \([0,1-\kappa ]\) and \([\kappa ,1]\).

The key axiom for PACE utility can now be stated:

Axiom 5 \(\kappa \) (The Piecewise Sure-Thing Principle): For all \(A\in \fancyscript{E}\), at least one of the inner sure-thing principle at \(A\) or the outer sure-thing principle at \(A\) holds.

The piecewise sure-thing principle takes the content of the above conditions, then adds a simplifying assumption. It forces every event to fall into at least one of three categories: a set of events where the upper sure-thing principle holds, a set of events where the lower sure-thing principle holds, or a set of events where the inner sure-thing principle holds. The following theorem characterises PACE utility:

Theorem 4.6

Let solvability hold. Then, the following statements are equivalent:

  1. 1.

    The preference relation \(\succcurlyeq \) satisfies axioms 1, 2, 3, 4 and 5\(\kappa \) (weak order, monotonicity, Archimedeanity, comparative likelihood consistency, and the piecewise sure-thing principle).

  2. 2.

    There exists a convex-valued, piecewise additive capacity \(\nu _{\kappa }\) over \(\fancyscript{E}\) and a real-valued, strictly \(\succcurlyeq \)-increasing utility function \(u\) over outcomes \(\fancyscript{X}\) such that \(\succcurlyeq \) is represented by \(E(\nu _{\kappa },u)\). That is, PACE utility holds.

In statement 2, the capacity is unique and utility is cardinal.

5 Closing comments

This paper has presented a simple way of integrating optimism and pessimism into subjective expected utility. To get PACE utility, ‘kinks’ were incorporated into expected utility. In rough terms, approximating ‘inverse-S’ with a ‘Z’. An intuitive weakening of the sure-thing principle called the piecewise sure-thing principle was introduced. Theorem 4.6 provided a behavioural foundation for PACE utility.

Chateauneuf et al. (2007) have shown how NEO-expected utility can be applied to resolve well-known phenomena that are difficult to reconcile with expected utility, such as the coexistence of gambling and insurance and the equity premium puzzle. Since, for finite state spaces, NEO-expected utility is obtained as a special case, PACE utility can generate the same results. In applications with rich state spaces, PACE utility is a more tractable alternative.