1 Introduction

Present-day social and economic policies will not only affect the quality of life of future generations but also affect the number of people who exist in these generations. Thus, policy makers face a trade-off between the sheer number of future people and their quality of life. Population ethics is the analysis of such trade-offs using tools from social choice theory and moral philosophy. It arose as a response to the Repugnant Conclusion, an ethical paradox first identified by Parfit (1984). Parfit noted that, under seemingly plausible normative hypotheses, we should prefer a future where a hundred trillion people lead wretched lives that are barely worth living, over a world where a much smaller number (say, ten billion) lead lives of much higher quality. This disturbing observation is not only a reductio ad absurdum of classical utilitarianism: it also afflicts a wide variety of other moral systems, particularly versions of welfarist consequentialism. A variety of solutions have been proposed, but none are entirely satisfactory. Recent surveys of this literature are Arrhenius et al. (2019) and Greaves (2017). For book-length treatments, see Ryberg and Tännsjö (2004), Blackorby et al. (2005), Arrhenius (2018), and Arrhenius and Bykvist (2019).

Cowen (2004) observed that the Repugnant Conclusion has a similar structure to the Saint Petersburg Paradox: in both cases, the paradox arises when a valuable thing is allowed to become minuscule in one “dimension”, as long as it simultaneously grows huge along some other dimensions. Cowen proposed that such paradoxes could be avoided by insisting that the value of any single dimension be bounded. But he did not formalize this idea. Earlier and independently, Sider (1991) had proposed a rule of population ethics he called “geometrism”, which avoids the Repugnant Conclusion through precisely the boundedness strategy suggested by Cowen. But Sider was well aware of geometrism’s shortcomings (in particular, its anti-egalitarianism), and he introduced it only as a counterexample to a conjectured impossibility result, not as a serious alternative. More recently, Asheim and Zuber (2014, 2017) have studied and axiomatically characterized rank-discounted generalized utilitarianism; like Sider’s geometrism, it avoids the Repugnant Conclusion via Cowen’s boundedness strategy, but unlike geometrism, it is also inequality-averse.

In this paper, I will introduce and axiomatically characterize a family of population ethical theories which generalize both Sider (1991) and Asheim and Zuber (2014, 2017). To define this family, I need some terminology. A social outcome specifies both what people exist and what the lifetime utility of each person is. A social welfare order (SWO) is an ordering over social outcomes; it embodies not only ethical judgements about the trade-offs we must make between the lifetime utilities of different people, but also ethical judgements about trade-offs we must make between these lifetime utilities and overall population size. For example, a SWO might judge that it is better to have a relatively small population of relatively happy people, than to have a much larger population of less happy people.Footnote 1

I will assume that lifetime utilities are measured on an absolute, interpersonally comparable scale, where a lifetime utility of zero is the lower limit for a life which is “worth living”. If someone’s lifetime utility is positive, then this means that, for her, it is better to exist than not to exist. But if her lifetime utility is negative, then this means that, for her, it would have been better to not exist at all. Note that the fact that a person’s life is worth living for her does not necessarily imply that it is ethically better that she exist; it may be that adding a particularly unhappy life to an already populous world is not an ethical improvement, even if the person who lives that life still regards it as worth living, on the balance.Footnote 2

I will consider two kinds of SWO in this paper. They differ in the precise information encoded in the social outcome. In a possibilist SWO, a social outcome assigns a lifetime utility to all people who could possibly exist. If someone does not actually exist, then she is simply assigned a lifetime utility of zero in this representation. Thus, possibilist SWOs do not distinguish between an outcome where Alice exists but has a lifetime utility of zero (i.e. a life so wretched that she is indifferent to not existing), and an otherwise identical outcome where Alice simply doesn’t exist at all. In contrast, in an actualist SWO, each social outcome specifies precisely which people exist. Thus, a clear distinction is made between an outcome where Alice exists but has a lifetime utility of zero, and an outcome where she doesn’t exist.

In both the actualist and possibilist frameworks, I will investigate a family of SWOs that I call rank-additive. These are SWOs which admit an additively separable representation, like the classical utilitarian or prioritarian SWOs. Each lifetime utility is transformed by a continuous increasing function before summation. However, people are ranked in order from lowest to highest lifetime utility, and different transformations can be applied to different entries in this ranking. Thus, the person with the highest lifetime utility may have her utility transformed in a different way than a person with a lower lifetime utility, before summation. This generalizes rank-weighted utilitarian (or generalized Gini) social welfare orders (Donaldson and Weymark 1980; Weymark 1981; Yaari 1988; Bossert 1990). But like Ebert (1988) and Zank (2007), it allows different utility transformation functions (as opposed to merely different multiplicative weights) to be applied at different positions in the ranking.Footnote 3

In defining rank-additive SWOs, there is a key difference between the actualist and possibilist frameworks. In an actualist SWO, we only rank, transform, and sum the lifetime utilities of the (finite) set of people who actually exist. By contrast, in a possibilist SWO, we rank, transform, and sum the lifetime utilities of everyone who could possibly exist—this includes a finite collection of nonzero utilities (amongst those who actually exist) and also an infinite collection of zero utilities (of those who do not exist). These zero utilities contribute nothing to the sum itself, but they have implications for how we rank the utilities of the people who do exist. Because of this, rank-additive axiologists have different functional forms in the actualist and possibilist frameworks, and admit different axiomatic characterizations. In particular, possibilist rank-additive SWOs always satisfy the axioms of Positive expansion and Negative expansion, which say that it is always good to add another person whose lifetime utility is above zero, and never good to add another person whose lifetime utility is below zero. This means they evade the Sadistic Conclusion, a paradox which afflicts critical-level utilitarianism, average utilitarianism, and many other proposed solutions to the Repugnant Conclusion (Arrhenius 2000). By contrast, actualist rank-additive SWOs almost never satisfy Positive expansion and Negative expansion, and hence frequently lead to the Sadistic Conclusion. On the other hand, actualist rank-additive SWOs easily reconcile inequality aversion with avoidance of the Repugnant Conclusion, whereas possibilist rank-additive SWOs do not. Sider’s (1991) geometrism is a possibilist rank-additive SWO. Asheim and Zuber’s (2014, 2017) rank-discounted utilitarianism is an actualist rank-additive SWO.

Most of the literature in population ethics adopts the actualist framework (e.g. Blackorby et al. 2005). Perhaps this is because of the suspicion that there is something nonsensical about imputing a utility to someone in a scenario where she does not even exist, or making welfare comparisons between scenarios where she exists and scenarios where she doesn’t. But several authors have argued convincingly that one can make such welfare comparisons, once they are construed in the right way (Holtug 2001; Roberts 2003, §4; Adler 2008, §III.A; Adler 2019, §II.A; Arrhenius and Rabinowicz 2010, 2015; Fleurbaey and Voorhoeve 2015, §3). So possibilism cannot simply be rejected as logically incoherent. The choice between the possibilist and actualist frameworks thus turns on which of them offers more attractive solutions to the central problems of population ethics. As we shall see, each framework has advantages and disadvantages.

The remainder of the paper is organized as follows. Section 2 concerns possibilist SWOs. Section 2.1 introduces the formal framework and key examples. Section 2.2 contains the first main result of the paper: an axiomatic characterization of possibilist rank-additive SWOs. Section 2.3 contains further results, such as necessary and sufficient conditions for these SWOs to be inequality-averse and to evade the Repugnant Conclusion. Section 3 concerns actualist SWOs and has a similar structure: Sect. 3.1 introduces the framework and key examples, while Sect. 3.2 contains the second main result of the paper: an axiomatic characterization of actualist rank-additive SWOs. Section 3.3 contains further results. Section 4 discusses a major problem confronting all rank-additive SWOs—their violation of the axiom of Existence independence—and proposes some ways of mitigating this problem. Finally, Sect. 5 discusses some undesirable properties of rank-additive SWOs.

2 Possibilist social welfare orders

2.1 Definitions and examples

Let \({\mathcal {I}}\) be an infinite set, whose elements represent all the people who could ever exist. Let \({\mathbb {R}}^{\mathcal {I}}\) be the set of all infinite \({\mathcal {I}}\)-indexed profiles \({\mathbf {r}}=(r_i)_{i\in {\mathcal {I}}}\) of real numbers. For all \(i\in {\mathcal {I}}\), interpret \(r_i\) as the lifetime utility of individual i. If \(r_i>0\), then overall, i has a life worth living. If \(r_i<0\), then overall, i has a life not worth living—it would have been better if she had never existed at all. If \(r_i=0\), then i’s life is indifferent to non-existence. This is usually referred to as the neutral level of lifetime utility. We will also set \(r_i=0\) in any scenario where i does not exist; the possibilist framework does not distinguish between non-existence and existence with a neutral lifetime utility.

Let \({\mathcal {X}}\) be the set of all elements of \({\mathbb {R}}^{\mathcal {I}}\) with only finitely many nonzero entries. An element of \({\mathcal {X}}\) represents a complete specification of all the lifetime utilities of all the people who will ever exist. (I assume this number to be finite.) I will refer to elements of \({\mathcal {X}}\) as social outcomes. A possibilist social welfare order is a preference order (i.e. complete, transitive, reflexive binary relation) \(\succeq \) on \({\mathcal {X}}\). Let \(\approx \) denote the symmetric part of \(\succeq \), and let \(\succ \) denote its asymmetric part.

If \(\pi :{\mathcal {I}}{{\longrightarrow }}{\mathcal {I}}\) is any bijection, then define \(\pi ^*:{\mathbb {R}}^{\mathcal {I}}{{\longrightarrow }}{\mathbb {R}}^{\mathcal {I}}\) by setting \(\pi ^*({\mathbf {r}}) := (r_{\pi (i)})_{i\in {\mathcal {I}}}\) for all \({\mathbf {r}}=(r_i)_{i\in {\mathcal {I}}}\) in \({\mathbb {R}}^{\mathcal {I}}\). Clearly, \(\pi ({\mathcal {X}})={\mathcal {X}}\), and \(\pi \) restricted to \({\mathcal {X}}\) defines a bijection from \({\mathcal {X}}\) to itself. We will be interested in SWOs satisfying the following axiom:

  • Anonymity. If \(\pi :{\mathcal {I}}{{\longrightarrow }}{\mathcal {I}}\) is any bijection, and \({\mathbf {x}}\in {\mathcal {X}}\), then \({\mathbf {x}}\approx \pi ^*({\mathbf {x}})\).

This is a standard axiom, which says that the SWO must treat all people the same. Let \({\mathbb {R}}_+:=\{r\in {\mathbb {R}}\); \(r\geqslant 0\}\) and let \({\mathbb {R}}_-:=\{r\in {\mathbb {R}}\); \(r\leqslant 0\}\). Let \({\mathbb {R}}_+^{\infty }\) be the set of all infinite sequences \({\mathbf {r}}=(r_n)_{n=1}^{\infty }\) of nonnegative numbers. Let \({\mathbb {R}}_+^{\propto }\) be the set of all elements of \({\mathbb {R}}_+^{\infty }\) with only finitely many nonzero entries, and let \({\mathbb {R}}_{+}^{\propto _\downarrow }\) be the set of all nonincreasing sequences in \({\mathbb {R}}_+^{\propto }\). Likewise define \({\mathbb {R}}_-^{\infty }\) and \({\mathbb {R}}_-^{\propto }\), and let \({\mathbb {R}}_{-}^{\propto _\uparrow }\) be the set of all nondecreasing sequences in \({\mathbb {R}}_-^{\propto }\). For any \({\mathbf {x}}\in {\mathcal {X}}\), let \(x^+_1\geqslant x^+_2\geqslant x^+_3\geqslant \cdots \geqslant x^+_N>0\) be all the positive entries of \({\mathbf {x}}\), listed in decreasing order with each value appearing as many times in this list as it appears in \({\mathbf {x}}\), and define \({\mathbf {x}}^+:=(x^+_1,x^+_2,x^+_3,\ldots ,x^+_N,0,0,\ldots )\), an element of \({\mathbb {R}}_{+}^{\propto _\downarrow }\). Likewise, let \(x^-_1\leqslant x^-_2\leqslant x^-_3\leqslant \cdots \leqslant x^-_N<0\) be all the negative entries of \({\mathbf {x}}\), listed in increasing order with each value appearing as many times in this list as it appears in \({\mathbf {x}}\), and define \({\mathbf {x}}^-:=(x^-_1,x^-_2,x^-_3,\ldots ,x^-_M,0,0,\ldots )\), an element of \({\mathbb {R}}_{-}^{\propto _\uparrow }\). Now define the function \(\phi :{\mathcal {X}}{{\longrightarrow }}{\mathbb {R}}_{+}^{\propto _\downarrow }\times {\mathbb {R}}_{-}^{\propto _\uparrow }\) by setting \(\phi ({\mathbf {x}}) := ({\mathbf {x}}^+,{\mathbf {x}}^-)\), for any \({\mathbf {x}}\in {\mathcal {X}}\). Clearly, \(\phi \) is a surjection. If \(\succeq _*\) is any preference order on \( {\mathbb {R}}_{+}^{\propto _\downarrow }\times {\mathbb {R}}_{-}^{\propto _\uparrow }\), then we can define an SWO \(\succeq \) on \({\mathcal {X}}\) by the formula:

(2A)

It is easy to see that \(\succeq \) satisfies Anonymity: if \(\pi :{\mathcal {I}}{{\longrightarrow }}{\mathcal {I}}\) is any bijection, and \({\mathbf {x}}'=\pi ^*({\mathbf {x}})\), then \(\phi ({\mathbf {x}}')=\phi ({\mathbf {x}})\), so that \({\mathbf {x}}\approx {\mathbf {x}}'\). Conversely, if \(\succeq \) is an SWO on \({\mathcal {X}}\) satisfying Anonymity, then there is a unique preference order \(\succeq _*\) on \( {\mathbb {R}}_{+}^{\propto _\downarrow }\times {\mathbb {R}}_{-}^{\propto _\uparrow }\) satisfying formula (2A). In other words, there is a natural bijective correspondence between preference orders on \( {\mathbb {R}}_{+}^{\propto _\downarrow }\times {\mathbb {R}}_{-}^{\propto _\uparrow }\) and SWOs on \({\mathcal {X}}\) satisfying Anonymity.

A preference order on \({\mathbb {R}}_{+}^{\propto _\downarrow }\times {\mathbb {R}}_{-}^{\propto _\uparrow }\) enables us to treat a person’s lifetime utility differently depending on how it is ranked relative to the lifetime utilities of other people. There are at least two reasons why this is useful. The first is aversion to inequality, which suggests that a small increase in the utility of a less happy person is more valuable than a similar increase in the utility of a more happy person, ceteris paribus. The second reason is specific to population ethics: the introduction of a new person with a relatively good life is generally regarded as desirable, whereas the introduction of a new person with a relatively mediocre life might be regarded as undesirable in some situations, even if that life is on the balance worth living. Both of these intuitions can be formalized using orderings on \({\mathbb {R}}_{+}^{\propto _\downarrow }\times {\mathbb {R}}_{-}^{\propto _\uparrow }\).

A social welfare functionFootnote 4 (SWF) is a function \(W:{\mathcal {X}}{{\longrightarrow }}{\mathbb {R}}\). It is rank-additive (RA) if there are continuous, increasing functions \(\phi ^+_n:{\mathbb {R}}_+{{\longrightarrow }}{\mathbb {R}}_+\) and \(\phi ^-_n:{\mathbb {R}}_-{{\longrightarrow }}{\mathbb {R}}_-\) with \(\phi ^+_n(0)=0=\phi ^-_n(0)\) for all \(n\in {\mathbb {N}}\), such that for any \({\mathbf {x}}\in {\mathcal {X}}\), we have

$$\begin{aligned} W({\mathbf {x}})= \sum _{n=1}^{\infty }\phi ^+_n(x^+_n) \ + \ \sum _{n=1}^{\infty }\phi ^-_n(x^-_n). \end{aligned}$$
(2B)

(There are only finitely many nonzero summands, by the definition of \({\mathcal {X}}\).) A SWO \(\succeq \) is rank-additive if it is represented by a rank-additive social welfare function. For example:

  • Suppose that \(\phi ^\pm _n(r)=r\) for all \(r\in {\mathbb {R}}_\pm \) and all \(n\in {\mathbb {N}}\).Footnote 5 Then we obtain the classical utilitarian SWF, defined by

    $$\begin{aligned} W({\mathbf {x}})= \sum _{n=1}^{\infty }x^+_n \ + \ \sum _{n=1}^{\infty }x^-_n = \sum _{i\in {\mathcal {I}}} x_i,\qquad \text{ for } \text{ all } {\mathbf {x}}\in {\mathcal {X}}\text{. } \end{aligned}$$
    (2C)
  • Let \(\phi :{\mathbb {R}}{{\longrightarrow }}{\mathbb {R}}\) be a continuous, increasing function with \(\phi (0)=0\). Suppose that \(\phi ^\pm _n(r)=\phi (r)\) for all \(r\in {\mathbb {R}}_\pm \) and all \(n\in {\mathbb {N}}\). Then we obtain the generalized utilitarian SWF, defined by

    $$\begin{aligned} W({\mathbf {x}}) \ = \ \sum _{n=1}^{\infty }\phi (x^+_n) \ + \ \sum _{n=1}^{\infty }\phi (x^-_n) \ = \ \sum _{i\in {\mathcal {I}}} \phi (x_i),\quad \text{ for } \text{ all } {\mathbf {x}}\in {\mathcal {X}}\text{. } \end{aligned}$$
    (2D)

    In particular, if \(\phi \) is strictly concave, then (2D) is called a prioritarian SWF, and exhibits inequality aversion.

  • Let \(\{c^+_n\}_{n=1}^{\infty }\) and \(\{c^-_n\}_{n=1}^{\infty }\) be two sequences of positive constants. Suppose that \(\phi ^\pm _n(r)=c^\pm _n\,r\) for all \(r\in {\mathbb {R}}_\pm \) and all \(n\in {\mathbb {N}}\). Then we obtain the rank-weighted utilitarian SWF (Weymark 1981; Yaari 1988):

    $$\begin{aligned} W({\mathbf {x}})= \sum _{n=1}^{\infty }c^+_n\,x^+_n \ + \ \sum _{n=1}^{\infty }c^-_n\,x^-_n,\qquad \text{ for } \text{ all } {\mathbf {x}}\in {\mathcal {X}}\text{. } \end{aligned}$$
    (2E)
  • In particular, let \(\beta \in (0,1)\), and suppose that \(\phi ^\pm _n(r)=\beta ^n\, r\) for all \(r\in {\mathbb {R}}_\pm \) and all \(n\in {\mathbb {N}}\). Then we obtain the geometric SWF proposed by Sider (1991):

    $$\begin{aligned} W({\mathbf {x}})= \sum _{n=1}^{\infty }\beta ^n\,x^+_n \ + \ \sum _{n=1}^{\infty }\beta ^n\,x^-_n,\qquad \text{ for } \text{ all } {\mathbf {x}}\in {\mathcal {X}}. \end{aligned}$$
    (2F)

The classical utilitarian SWF (2C) arises as a special case of generalized utilitarianism (with \(\phi (x)=x\)) and rank-weighted utilitarianism (with \(c^\pm _n=1\) for all \(n\in {\mathbb {N}}\)). Unfortunately, as is well known, any generalized utilitarian SWF (and in particular, the classical utilitarian SWF ) leads to Parfit’s Repugnant Conclusion. In contrast, the rank-weighted utilitarian SWF (2E) evades the Repugnant Conclusion, as long as the sequence \(\{c^+_n\}_{n=1}^{\infty }\) decays quickly enough that \(\sum _{n=1}^{\infty }c^+_n<{\infty }\) (see Proposition 2.2(a)). However, in this case, the rank-weighted utilitarian SWF is anti-egalitarian amongst all people with positive lifetime utility (see Proposition 2.4). To reconcile inequality aversion with evasion of the Repugnant Conclusion, we will need to consider rank-additive SWOs defined by other choices of functions \(\{\phi ^\pm _n\}_{n=1}^{\infty }\).

Rank-additive SWOs have several attractive properties. For any \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}\) and \({\mathbf {z}}\in {\mathbb {R}}^N\), write “\({\mathbf {y}}= {\mathbf {x}}\uplus {\mathbf {z}}\)” if there exist distinct \(j_1,j_2,\ldots ,j_N\in {\mathcal {I}}\) such that \(x_{j_n}=0\) and \(y_{j_n}=z_n\) for all \(n\in [1\ldots N]\), while \(x_i=y_i\) for all \(i\in {\mathcal {I}}\setminus \{ j_1,j_2,\ldots ,j_N\}\).Footnote 6 In other words, \({\mathbf {y}}\) is obtained by adding to \({\mathbf {x}}\) exactly N new people, whose lifetime utilities are given by \((z_1,\ldots ,z_N)\). For any \(r\in {\mathbb {R}}\), we define \({\mathbf {x}}\uplus r:={\mathbf {x}}\uplus {\mathbf {z}}\), where \({\mathbf {z}}\) is an outcome containing a single individual with lifetime utility r. It is easily verified that any rank-additive possibilist SWO satisfies the next four axioms.

  • Pareto. For all \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}\), if \(x_i\geqslant y_i\) for all \(i\in {\mathcal {I}}\), then \({\mathbf {x}}\succeq {\mathbf {y}}\). If, furthermore, \(x_i> y_i\) for some \(i\in {\mathcal {I}}\), then \({\mathbf {x}}\succ {\mathbf {y}}\).

  • Positive expansion (or Mere Addition). For any \({\mathbf {x}}\in {\mathcal {X}}\) and any \(r>0\), \({\mathbf {x}}\uplus r \succ {\mathbf {x}}\).

  • Negative expansion. For any \({\mathbf {x}}\in {\mathcal {X}}\) and any \(r<0\), \({\mathbf {x}}\uplus r \prec {\mathbf {x}}\).

  • No Sadistic Conclusion. For any \({\mathbf {x}}\in {\mathcal {X}}\), any \(N,M\in {\mathbb {N}}\), and any \({\mathbf {y}}\in {\mathbb {R}}_{++}^N\) and \({\mathbf {z}}\in {\mathbb {R}}_{--}^M\), \({\mathbf {x}}\uplus {\mathbf {y}}\succ {\mathbf {x}}\uplus {\mathbf {z}}\).

Positive expansion says it is always good to add another person whose life is worth living (i.e. whose lifetime utility is positive). Negative expansion says it is always bad to add another person whose life is not worth living (i.e. whose lifetime utility is negative). Both of these are consequences of the Pareto axiom. Meanwhile, No Sadistic Conclusion is a consequence of Positive expansion and Negative expansion; it means that rank-additive possibilist SWOs avoid a well-known problem of average utilitarian and critical-level generalized utilitarian principles first identified by Arrhenius (2000).Footnote 7 For any \({\mathbf {x}}\in {\mathbb {R}}_\pm ^\propto \), let \(|{\mathbf {x}}|\) denote the number of nonzero entries in \({\mathbf {x}}\). Rank-additive possibilist SWOs also satisfy the next axiom.

  • Existence independence of the wretched. For all \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}\) and \(N,M\in {\mathbb {N}}\) such that \(|{\mathbf {x}}^+|=|{\mathbf {y}}^+|=N\) and \(|{\mathbf {x}}^-|=|{\mathbf {y}}^-|=M\), and any \(z\in {\mathbb {R}}\) such that \(\max \{x^-_M,y^-_M\}\leqslant z\leqslant \min \{x^+_N,y^+_N\}\), we have \({\mathbf {x}}\succeq {\mathbf {y}}\) if and only if \({\mathbf {x}}\uplus z \succeq {\mathbf {y}}\uplus z\).

This axiom is similar to the axiom of Existence independence of Blackorby et al. (2005, §5.6), but it only applies the people whose lifetime utilities are close to zero (“the wretched”).Footnote 8

An important feature of RA possibilist SWOs is that individuals with positive lifetime utilities (i.e. lives worth living) are evaluated using the functions \(\{\phi ^+_n\}_{n=1}^{\infty }\), whereas individuals with negative lifetime utilities (i.e. lives not worth living) are evaluated using \(\{\phi ^-_n\}_{n=1}^{\infty }\). This gives us the freedom to treat lives which are not worth living in a completely different way than we treat lives worth living, in accord with many people’s ethical intuitions. For example, if we augment a social outcome by adding a trillion wretched lives that are barely worth living, then a rejection of the Repugnant Conclusion suggests that the marginal gain in social welfare obtained by adding the trillionth such life is less than the marginal gain from adding the first such life. But if we add a trillion lives of terrible suffering that are clearly not worth living, then our ethical intuitions suggest that the addition of the trillionth such life adds just as much evil to the world as the first one. This intuition is sometimes called the asymmetry (McMahan 1981, 2009; Roberts 2011, 2019). Since \(\{\phi ^+_n\}_{n=1}^{\infty }\) and \(\{\phi ^-_n\}_{n=1}^{\infty }\) can have different properties, it is easy to accommodate this intuition.

It seems natural to assume that \(\{\phi ^+_n\}_{n=1}^{\infty }\) and \(\{\phi ^-_n\}_{n=1}^{\infty }\) should both arise as restrictions to \({\mathbb {R}}_+\) and \({\mathbb {R}}_-\) of some common family of utility functions defined on all of \({\mathbb {R}}\), as in the generalized utilitarian SWF in formula (2D). If they didn’t, and we treated negative and positive utilities in a completely different way, then one might worry about creating an “ethical discontinuity” in our treatment of an individual as her lifetime utility changes from positive to negative. But this concern is misconceived. To understand this, let \({\mathbf {x}}\in {\mathcal {X}}\) be a social outcome, and define \({\mathbf {x}}^+=(x^+_1,x^+_2,x^+_3,\ldots ,x^+_N,0,0,\ldots )\) and \({\mathbf {x}}^-=(x^-_1,x^-_2,x^-_3,\ldots ,x^-_M,0,0,\ldots )\) as prior to statement (2A). If we imagine all the coordinates of \({\mathbf {x}}\) of being arranged in decreasing order, then the (infinite) number of zero coordinates will all appear between the coordinates of \({\mathbf {x}}^+\) and those of \({\mathbf {x}}^-\). In other words,

$$\begin{aligned} {\mathbf {x}}\ = \ (x^+_1,x^+_2,x^+_3,\ldots ,x^+_N,0,0,0,\ldots \ldots \ \ldots \ldots ,0,0,0,x^-_M,\ldots ,x^-_3,x^-_2,x^-_1). \end{aligned}$$

Observe that \(\{\phi ^+_n\}_{n=1}^{\infty }\) deal with the coordinates at the left end of this infinite array, while \(\{\phi ^-_n\}_{n=1}^{\infty }\) deal with the coordinates at the right end. There is no reason to believe that these two families of functions should have anything in common with one another. Indeed, suppose we gradually reduce one individual’s lifetime utility, while holding all other utilities constant. As her utility decreases, it is shuffled further and further rightward in the ordering of \(x^+_1,\ldots ,x^+_N\). But when it passes from positive to negative, it jumps an infinite number of positions rightward (leaping over the infinite number of zero coordinates), to become part of \(x^-_M,\ldots ,x^-_1\). If there is an “ethical discontinuity” in our treatment of the person at this moment, it can be attributed to this infinite jump.

2.2 Axiomatic characterization

The first main result of the paper is an axiomatic characterization of rank-additive SWOs in the possibilist framework. This will use the Anonymity and Pareto axioms introduced in Sect. 2.1, along with two other axioms. For any \(N\in {\mathbb {N}}\), let \({\mathbb {R}}_{+}^{N_\downarrow }:=\{{\mathbf {r}}\in {\mathbb {R}}^N\); \(r_1\geqslant r_2 \geqslant \cdots \geqslant r_N\geqslant 0\}\), and let \({\mathbb {R}}_{-}^{N_\uparrow }:=\{{\mathbf {r}}\in {\mathbb {R}}^N\); \(r_1\leqslant r_2 \leqslant \cdots \leqslant r_N\leqslant 0\}\). We can treat \({\mathbb {R}}_{+}^{N_\downarrow }\) as a subset of \({\mathbb {R}}_{+}^{\propto _\downarrow }\) in a natural way, by identifying the N-tuple \((x_1,x_2,\ldots ,x_N)\) with the sequence \((x_1,x_2,\ldots ,x_N,0,0,\ldots )\). Likewise, we can treat \({\mathbb {R}}_{-}^{N_\uparrow }\) as a subset of \({\mathbb {R}}_{-}^{\propto _\uparrow }\). Note that \({\mathbb {R}}_{+}^{2_\downarrow }\subset {\mathbb {R}}_{+}^{3_\downarrow }\subset {\mathbb {R}}_{+}^{4_\downarrow }\subset \cdots \subset {\mathbb {R}}_{+}^{\propto _\downarrow }\) and \({\mathbb {R}}_{-}^{2_\uparrow }\subset {\mathbb {R}}_{-}^{3_\uparrow }\subset {\mathbb {R}}_{-}^{4_\uparrow }\subset \cdots \subset {\mathbb {R}}_{-}^{\propto _\uparrow }\). It follows thatFootnote 9

$$\begin{aligned} {\mathbb {R}}_{+}^{\propto _\downarrow }\times {\mathbb {R}}_{-}^{\propto _\uparrow }= \bigcup _{N=1}^{\infty }\left( {\mathbb {R}}_{+}^{N_\downarrow }\times {\mathbb {R}}_{-}^{N_\uparrow }\right) . \end{aligned}$$
(2G)

For all \(N\in {\mathbb {N}}\), let \(\succeq _N\) be the restriction of \(\succeq _*\) to \({\mathbb {R}}_{+}^{N_\downarrow }\times {\mathbb {R}}_{-}^{N_\uparrow }\). The order \(\succeq _*\) is uniquely determined by this sequence \((\succeq _N)_{N=1}^{\infty }\) of finite-population SWOs. The next two axioms concern these orders. Note that \({\mathbb {R}}_{+}^{N_\downarrow }\times {\mathbb {R}}_{-}^{N_\uparrow }\) is a closed convex subset of \({\mathbb {R}}^N\times {\mathbb {R}}^N={\mathbb {R}}^{2N}\); endow it with the subspace topology it inherits from \({\mathbb {R}}^{2N}\). We need two more axioms.

  • Continuity. For every \(N\in {\mathbb {N}}\), the order \(\succeq _N\) is continuous on \({\mathbb {R}}_{+}^{N_\downarrow }\times {\mathbb {R}}_{-}^{N_\uparrow }\).

  • Separability. For every \(N\in {\mathbb {N}}\), and every pair of subsets \({\mathcal {J}}_+,{\mathcal {J}}_-\subseteq [1\ldots N]\), there is a preference order \(\succeq _{{\mathcal {J}}_\pm }\) defined on \({\mathbb {R}}^{{\mathcal {J}}_+}\times {\mathbb {R}}^{{\mathcal {J}}_-}\) such that, for any \({\mathbf {x}}=({\mathbf {x}}^+,{\mathbf {x}}^-)\) and \({\mathbf {y}}=({\mathbf {y}}^+,{\mathbf {y}}^-)\) in \({\mathbb {R}}_{+}^{N_\downarrow }\times {\mathbb {R}}_{-}^{N_\uparrow }\), if \(x^+_n=y^+_n\) for all \(n\in [1\ldots N]\setminus {\mathcal {J}}_+\), and \(x^-_n=y^-_n\) for all \(n\in [1\ldots N]\setminus {\mathcal {J}}_-\), then \({\mathbf {x}}\succeq _N {\mathbf {y}}\) if and only if \(({\mathbf {x}}^+_{{\mathcal {J}}_+},{\mathbf {x}}^-_{{\mathcal {J}}_-}) \succeq _{{\mathcal {J}}_\pm } ({\mathbf {y}}^+_{{\mathcal {J}}_+},{\mathbf {y}}^-_{{\mathcal {J}}_-})\).Footnote 10

These axioms are somewhat weaker than the familiar axioms with similar names: they only apply to the restriction of \(\succeq \) to a population of fixed, finite size N, and only compare social outcomes that are comonotonic. The Continuity axiom says that a small change in the lifetime utilities of individuals should only cause a small change in the ranking of the social outcome, in comparison with other social outcomes having the same population. Separability says that, if certain people (namely those in \([1\ldots N]\setminus {\mathcal {J}}_+\)) have the same positive lifetime utility in two social outcomes \({\mathbf {x}}\) and \({\mathbf {y}}\), and furthermore occupy the same position in the ranking from best-off to worst-off, then the comparison between \({\mathbf {x}}\) and \({\mathbf {y}}\) should be entirely determined by other people—those whose utilities differ from \({\mathbf {x}}\) to \({\mathbf {y}}\) (namely those in \({\mathcal {J}}_+\)). Likewise, if certain people (namely those in \([1\ldots N]\setminus {\mathcal {J}}_-\)) have the same negative lifetime utility in two social outcomes \({\mathbf {x}}\) and \({\mathbf {y}}\), and furthermore occupy the same position in the ranking from worst-off to best-off, then the comparison between \({\mathbf {x}}\) and \({\mathbf {y}}\) should be entirely determined by other people—those whose utilities differ from \({\mathbf {x}}\) to \({\mathbf {y}}\) (namely those in \({\mathcal {J}}_-\)). Thus, this axiom has the same normative content as the standard Separability axiom—it is just more complicated to state (and in fact, logically weaker), because it must deal separately with people having positive and negative lifetime utilities, and furthermore only deals with rank-ordered vectors of utility. Here is the first main result of the paper.

Theorem 1

Let \(\succeq \) be a possibilist SWO on \({\mathcal {X}}\). Then \(\succeq \) satisfies Anonymity, Continuity, Pareto, and Separability if and only if it is rank-additive. Furthermore, in the representation (2B), the functions \(\{\phi ^\pm _n\}_{n=1}^{\infty }\) are unique up to multiplication by a common positive constant.

Proof sketch For all \(N\in {\mathbb {N}}\) and all \({\mathbf {x}}\in {\mathbb {R}}_{+}^{N_\downarrow }\times {\mathbb {R}}_{-}^{N_\uparrow }\), use the classic representation theorem of Debreu (1960) to obtain a “local” additive representation of \(\succeq _N\) in a neighbourhood of \({\mathbf {x}}\). Then glue together these local representations to obtain a single, global additive representation on all of \({\mathbb {R}}_{+}^{N_\downarrow }\times {\mathbb {R}}_{-}^{N_\uparrow }\), using a result of Chateauneuf and Wakker (1993). Finally, for any \(N<M\), use the natural embedding \({\mathbb {R}}_{+}^{N_\downarrow }\times {\mathbb {R}}_{-}^{N_\uparrow }\subset {\mathbb {R}}_{+}^{M_\downarrow }\times {\mathbb {R}}_{-}^{M_\uparrow }\) together with standard uniqueness results to show that the 2N utility functions involved in the additive representation on \({\mathbb {R}}_{+}^{N_\downarrow }\times {\mathbb {R}}_{-}^{N_\uparrow }\) are actually the first 2N utility functions involved in the additive representation on \({\mathbb {R}}_{+}^{M_\downarrow }\times {\mathbb {R}}_{-}^{M_\uparrow }\). See Appendix A for details.

Independence of the axioms Consider a rank-additive SWF (2B). If some of the functions \(\{\phi ^\pm _n\}_{n=1}^{\infty }\) are not increasing, but all are continuous, then the resulting SWO violates Pareto but satisfies the other three axioms of Theorem 1. If, on the other hand, some of \(\{\phi ^\pm _n\}_{n=1}^{\infty }\) are not continuous, but all are increasing, then the resulting SWO violates Continuity but satisfies the other three axioms.

Next, suppose we replace (2B) with the expression \(\left( \sum _{n=1}^{\infty }x_n^+\right) ^2- \left( \sum _{n=1}^{\infty }|x_n^-|\right) ^2\). The resulting SWO violates Separability but satisfies the other three axioms. Finally, for all \(i\in {\mathcal {I}}\), let \(\phi _i:{\mathbb {R}}{{\longrightarrow }}{\mathbb {R}}\) be a continuous, increasing function with \(\phi _i(0)=0\), and define \(W:{\mathcal {X}}{{\longrightarrow }}{\mathbb {R}}\) by setting \(W({\mathbf {x}}):=\sum _{i\in {\mathcal {I}}}\phi _i(x_i)\) (well defined because only finitely many summands are nonzero). The resulting SWO \(\succeq \) satisfies Pareto, but if the functions \(\{\phi _i\}_{i\in {\mathcal {I}}}\) are distinct, then it violates Anonymity. It is meaningless to ask whether \(\succeq \) satisfies Continuity or Separability, since these axioms are formulated in terms of the order \(\succeq _*\), which is not even well defined if \(\succeq \) violates Anonymity. But for any finite subset \({\mathcal {J}}\subset {\mathcal {I}}\), if we define \(\succeq _{\mathcal {J}}\) to be restriction of \(\succeq \) to the (finite-dimensional) subspace \({\mathbb {R}}^{\mathcal {J}}\times \{0\}^{{\mathcal {I}}\setminus {\mathcal {J}}}\), then it is easy to see that \(\succeq _{\mathcal {J}}\) satisfies axioms analogous to Continuity and Separability.

2.3 Further results

I earlier noted that any rank-additive SWO satisfies the axiom Existence independence of the wretched. We might also consider SWOs that satisfy the following axioms:

  • Top-independence in good worlds. For all \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}\) such that \(x_i\geqslant 0\) and \(y_i\geqslant 0\) for all \(i\in {\mathcal {I}}\), and all \(z\in {\mathbb {R}}\) with \(z>\max \{x^+_1,y^+_1\}\), we have \({\mathbf {x}}\succeq {\mathbf {y}}\) if and only if \({\mathbf {x}}\uplus z \succeq {\mathbf {y}}\uplus z\).

  • Bottom-independence in bad worlds. For all \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}\) such that \(x_i\leqslant 0\) and \(y_i\leqslant 0\) for all \(i\in {\mathcal {I}}\), and all \(z\in {\mathbb {R}}\) with \(z<\max \{x^-_1,y^-_1\}\), we have \({\mathbf {x}}\succeq {\mathbf {y}}\) if and only if \({\mathbf {x}}\uplus z \succeq {\mathbf {y}}\uplus z\).

These axioms are similar to Separability; they say that, when comparing certain kinds of social outcomes, we can disregard the utility—or even the existence—of certain people who are indifferent between these outcomes. The first axiom is like Asheim and Zuber’s (2014) axiom Existence independence of the best off, except that it applies only in “good” worlds, where everyone’s lifetime utility is nonnegative. The second axiom is like Asheim and Zuber’s (2014) Existence independence of the worst off, but it applies only in “bad” worlds.Footnote 11 The next result says that these axioms lead to something resembling Sider’s (1991) “geometric” SWF from formula (2F).

Proposition 2.1

Let \(\succeq \) be a rank-additive possibilist SWO with the SWF (2B).

(a):

\(\succeq \) satisfies Top-independence in good worlds if and only if there is a continuous, increasing function \(\phi ^+:{\mathbb {R}}_+{{\longrightarrow }}{\mathbb {R}}_+\) (unique up to multiplication by a positive constant) and a unique constant \(\beta _+>0\) such that \(\phi ^+_n \ = \ \beta _+^n\,\phi ^+\) for all \(n\in {\mathbb {N}}\).

(b):

\(\succeq \) satisfies Bottom-independence in bad worlds if and only if there is a continuous, increasing function \(\phi ^-:{\mathbb {R}}_-{{\longrightarrow }}{\mathbb {R}}_-\) (unique up to multiplication by a positive constant) and a unique constant \(\beta _->0\) such that \(\phi ^-_n \ = \ \beta _-^n\,\phi ^-\) for all \(n\in {\mathbb {N}}\).

(c):

If \(\succeq \) satisfies the conditions of both (a) and (b), then \(\phi ^+\) and \(\phi ^-\) are unique up to multiplication by the same positive constant.

If \(\succeq \) satisfies both Top-independence in good worlds and Bottom-independence in bad worlds, then Proposition 2.1 yields a variant of Sider’s geometric SWF. But nothing in Proposition 2.1 requires \(\beta _+=\beta _-\), nor are either \(\beta _+\) or \(\beta _-\) required be less than 1.

Repugnant Conclusions For any \(N\in {\mathbb {N}}\), let \(\varvec{1}_N\) refer to an element of \({\mathcal {X}}\) such that exactly N coordinates take the value 1, and all other coordinates are zero. (By Anonymity, it does not matter which coordinates we choose.) For any \(r\in {\mathbb {R}}\), \(r\,\varvec{1}_N\) refers to the corresponding element of \({\mathcal {X}}\) such that exactly N coordinates take the value r, and all other coordinates are zero. Consider the following axioms.

  • No Repugnant Conclusion. There exist \(r_0 >0\) and \({\mathbf {x}}\in {\mathcal {X}}\) such that \({\mathbf {x}}\succ r_0\,\varvec{1}_N\) for all \(N\in {\mathbb {N}}\).

  • No utility monsters. For all \(N\in {\mathbb {N}}\), there exists \({\mathbf {x}}\in {\mathcal {X}}\) such that \({\mathbf {x}}\succ r\,\varvec{1}_N\) for all \(r>0\).

The first axiom rules out Parfit’s Repugnant Conclusion. It says there is a minimum positive utility \(r_0\) (representing a life which is technically worth living, but perhaps not very pleasant) and a social outcome \({\mathbf {x}}\) (e.g. the population of a modern industrialized country) which is better than any population of people with life utilities less than or equal to \(r_0\), no matter how large this population becomes. The second axiom rules out Nozick’s (1974) Utility Monster paradox. It says that for any finite-population size N, there exists a social outcome (presumably involving a larger number of people) which is better than any society which involves only N people, no matter how high their lifetime utilities becomes. Thus, even if the first N people are somehow much more efficient at converting resources into lifetime utility than everyone else, the SWF does not allow them to simply absorb unlimited amounts of resources from the rest of humanity to boost their own utilities.

Proposition 2.2

Let \(\succeq \) be a rank-additive possibilist SWO with the SWF (2B). Let \({\overline{W}}:=\sup \{W({\mathbf {x}})\); \({\mathbf {x}}\in {\mathcal {X}}\}\).

(a):

\(\succeq \) satisfies No Repugnant Conclusion if and only if there exists \(r_0>0\) such that \(\displaystyle \sum _{n=1}^{\infty }\phi ^+_n(r_0)<{\overline{W}}\).

(b):

\(\succeq \) satisfies No utility monsters if and only if \(\displaystyle \lim _{r{\rightarrow }{\infty }} \sum \nolimits _{n=1}^N \phi ^+_n(r)<{\overline{W}}\), for all \(N\in {\mathbb {N}}\).

If \({\overline{W}}<{\infty }\), then both these conditions are satisfied.

Fig. 1
figure 1

Functions \(\phi ^+_n(r)=n\cdot (1-\exp (-r/n))\), for \(n\in \{1\ldots 10\}\)

It is well known that Nozick’s Utility Monster paradox can be evaded by using a generalized utilitarian social welfare like (2D) when the function \(\phi \) is bounded above. In particular, some prioritarian social welfare functions have this form. But Proposition 2.2(b) goes beyond this trite observation, because in an RA SWF, the functions \(\{\phi ^+_n\}_{n=1}^{\infty }\) need not be identical, so they need not have the same upper bound. For example, suppose that \(\phi ^+_n(r):= n\cdot (1-\exp (-r/n))\) for all \(n\in {\mathbb {N}}\) and all \(r\in {\mathbb {R}}_+\); then, the condition of Proposition 2.2(b) is satisfied. However, as shown in Fig. 1, we have \(\sup (\phi ^+_n({\mathbb {R}}_+))=n\) for all \(n\in {\mathbb {N}}\).

The Saint Petersburg paradox The Repugnant Conclusion and the Utility Monster are both ethical paradoxes which arise when a valuable thing is allowed to become extremely small in one “dimension”, as long as it simultaneously grows extremely large along some other dimensions. Perhaps the earliest paradox of this kind is the Saint Petersburg Paradox (Bernoulli, 1738 [1954]). This suggests the next axiom. Let \(W:{\mathcal {X}}{{\longrightarrow }}{\mathbb {R}}\) denote a SWF representing a SWO \(\succeq \).

  • No Saint Petersburg Paradox. There is some \(\epsilon >0\) and some \({\mathbf {x}}\in {\mathcal {X}}\) such that for any \({\mathbf {y}}\in {\mathcal {X}}\) and any \(p<\epsilon \), \(W({\mathbf {x}}) > p \, W({\mathbf {y}})\).

Let \({\mathbf {z}}\) be an outcome such that \(W({\mathbf {z}})=0\) (for example, the “empty world” where \(z_i=0\) for all \(i\in {\mathcal {I}}\)). We can interpret \(p \, W({\mathbf {y}})\) as the expected W value of a lottery which yields \({\mathbf {y}}\) with probability p, and \({\mathbf {z}}\) with probability \((1-p)\). Thus, No Saint Petersburg Paradox says that the sure outcome \({\mathbf {x}}\) is better than any such lottery, no matter how good \({\mathbf {y}}\) is.Footnote 12

Proposition 2.3

Let \(\succeq \) be a rank-additive possibilist SWO with the SWF (2B). Then \(\succeq \) satisfies No Saint Petersburg Paradox if and only if \(\sup \{W({\mathbf {x}})\); \({\mathbf {x}}\in {\mathcal {X}}\}<{\infty }\). In this case, \(\succeq \) automatically satisfies No Repugnant Conclusion and No utility monsters.

Inequality aversion Let \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}\). Say that \({\mathbf {y}}\) is a Pigou–Dalton transform of \({\mathbf {x}}\) if there exist \(j,k\in {\mathcal {I}}\) and \(\epsilon >0\) such that \(y_j=x_j+\epsilon \leqslant y_k=x_k-\epsilon \), while \(y_i=x_i\) for all other \(i\in {\mathcal {I}}\setminus \{j,k\}\). Consider the following axioms.

  • Inequality neutrality. Let \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}\). If \({\mathbf {y}}\) is a Pigou–Dalton transform of \({\mathbf {x}}\), then \({\mathbf {y}}\approx {\mathbf {x}}\).

  • Inequality aversion. Let \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}\). If \({\mathbf {y}}\) is a Pigou–Dalton transform of \({\mathbf {x}}\), then \({\mathbf {y}}\succeq {\mathbf {x}}\).

  • Strict inequality aversion. Let \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}\). If \({\mathbf {y}}\) is a Pigou–Dalton transform of \({\mathbf {x}}\), then \({\mathbf {y}}\succ {\mathbf {x}}\).

Proposition 2.4

Let \(\succeq \) be a rank-additive possibilist SWO with the SWF (2B).

(a):

\(\succeq \) satisfies Inequality neutrality if and only if it is classical utilitarianism.

(b):

\(\succeq \) satisfies Inequality aversion if and only if, for all \(n,m\in {\mathbb {N}}\), \(r,s\in {\mathbb {R}}\) and \(\epsilon >0\):

(i):

If \(r\geqslant 0\geqslant s\), then \(\phi ^+_n(r+\epsilon )-\phi ^+_n(r) \leqslant \phi ^-_m(s)-\phi ^-_m(s-\epsilon )\).

(ii):

If \(n< m\) and \(r\geqslant s\geqslant \epsilon >0\), then \(\phi ^+_n(r+\epsilon )-\phi ^+_n(r) \leqslant \phi ^+_m(s)-\phi ^+_m(s-\epsilon )\).

(iii):

If \(n> m\) and \(s\leqslant r\leqslant -\epsilon <0\), then \(\phi ^-_n(r+\epsilon )-\phi ^-_n(r) \leqslant \phi ^-_m(s)-\phi ^-_m(s-\epsilon )\).

Thus, for all \(q\in {\mathbb {R}}_+\), we have

$$\begin{aligned}&\phi ^+_1(q)\leqslant \phi ^+_2(q)\leqslant \phi ^+_3(q)\leqslant \cdots \cdots \nonumber \\&\quad \cdots \cdots \leqslant -\phi ^-_3(-q)\leqslant -\phi ^-_2(-q)\leqslant -\phi ^-_1(-q). \end{aligned}$$
(2H)

In particular, if \(\{\phi ^+_n\}_{n=1}^{\infty }\) and \(\{\phi ^-_n\}_{n=1}^{\infty }\) are differentiable, then for any positive nonincreasing sequence \(r_1\geqslant r_2 \geqslant r_3 \geqslant \cdots \geqslant 0\) and negative nondecreasing sequence \(s_1 \leqslant s_2 \leqslant s_3 \leqslant \cdots \leqslant 0\),

$$\begin{aligned}&(\phi ^+_1)'(r_1)\leqslant (\phi ^+_2)'(r_2)\leqslant (\phi ^+_3)'(r_3)\leqslant \cdots \cdots \nonumber \\&\quad \cdots \cdots \leqslant (\phi ^-_3)'(s_3)\leqslant (\phi ^-_2)'(s_2)\leqslant (\phi ^-_1)'(s_1). \end{aligned}$$
(2I)
(c):

\(\succeq \) satisfies Strict inequality aversion if and only if all the statements in part (b) hold with strict inequalities.

Example 2.5

The generalized utilitarian SWF (2D) satisfies Inequality aversion if and only if the function \(\phi \) is concave; it satisfies Strict inequality aversion if and only if \(\phi \) is strictly concave. The rank-weighted utilitarian SWF (2E) satisfies Inequality aversion if and only if \(c^+_1\leqslant c^+_2 \leqslant c^+_3 \leqslant \cdots \leqslant c^-_3 \leqslant c^-_2 \leqslant c^-_1\); it satisfies Strict inequality aversion if and only if these inequalities are all strict. Note, however, that in a general rank-additive SWF, we do not need the functions \(\{\phi ^+_n\}_{n=1}^{\infty }\) and \(\{\phi ^-_n\}_{n=1}^{\infty }\) to be concave to ensure inequality aversion, as long as the conditions of Proposition 2.4 are satisfied. \(\square \)

Unfortunately, by comparing Proposition 2.4 with Proposition 2.2(a), one sees that it is impossible to simultaneously satisfy Inequality aversion and No Repugnant Conclusion. If \(\epsilon >0\) is small, then No Repugnant Conclusion requires the sequence \(\{\phi ^+_n(\epsilon )\}_{n=1}^{\infty }\) to be summable, whereas Inequality aversion requires this sequence to be nondecreasing as in (2H)—a contradiction. Thus, to avoid the Repugnant Conclusion, we must somehow weaken Inequality aversion. Let \(\theta >0\). Let \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}\). Say that \({\mathbf {y}}\) is a \(\theta \)-restricted Pigou–Dalton transform of \({\mathbf {x}}\) if there exist \(j,k\in {\mathcal {I}}\) and \(\epsilon >0\) such that \(y_j=x_j+\epsilon \leqslant y_k=x_k-\epsilon \), while \(y_i=x_i\) for all other \(i\in {\mathcal {I}}\setminus \{j,k\}\), and furthermore, none of \(x_j,y_j,x_k,y_k\) is in the interval \([0,\theta ]\). Consider the following axioms.

  • Restricted inequality neutrality. There is some \(\theta >0\) such that, for any \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}\), if \({\mathbf {y}}\) is a \(\theta \)-restricted Pigou–Dalton transform of \({\mathbf {x}}\), then \({\mathbf {y}}\approx {\mathbf {x}}\).

  • Restricted inequality aversion. There is some \(\theta >0\) such that, for any \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}\), if \({\mathbf {y}}\) is a \(\theta \)-restricted Pigou–Dalton transform of \({\mathbf {x}}\), then \({\mathbf {y}}\succeq {\mathbf {x}}\).

  • Restricted strict inequality aversion. There is some \(\theta >0\) such that, for any \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}\), if \({\mathbf {y}}\) is a \(\theta \)-restricted Pigou–Dalton transform of \({\mathbf {x}}\), then \({\mathbf {y}}\succ {\mathbf {x}}\).

This might seem like a rather stingy version of inequality aversion, since it specifically excludes the wretched. But it allows us to avoid the Repugnant Conclusion.

Proposition 2.6

Let \(\succeq \) be a rank-additive possibilist SWO with the SWF (2B).

(a):

\(\succeq \) satisfies Restricted inequality neutrality if and only if there are linear functions \(\phi ^\pm :{\mathbb {R}}_\pm {{\longrightarrow }}{\mathbb {R}}_\pm \) and constants \(\{c_n\}_{n=1}^{\infty }\) such that for all \(n\in {\mathbb {N}}\), \(\phi ^-_n=\phi ^-\) and \(\phi ^+_n(r)=\phi ^+(r)+c_n\) for all \(r\geqslant \theta \).

(b):

\(\succeq \) satisfies Restricted inequality aversion if and only if, for all \(n,m\in {\mathbb {N}}\), all \(r,s\in {\mathbb {R}}\) and all \(\epsilon >0\):

  • If \(r\geqslant \theta >0\geqslant s\), then \(\phi ^+_n(r+\epsilon )-\phi ^+_n(r) \leqslant \phi ^-_m(s)-\phi ^-_m(s-\epsilon )\).

  • If \(n< m\) and \(r> s>\epsilon +\theta >0\), then \(\phi ^+_n(r+\epsilon )-\phi ^+_n(r) \leqslant \phi ^+_m(s)-\phi ^+_m(s-\epsilon )\).

  • If \(n> m\) and \(s<r<-\epsilon <0\), then \(\phi ^-_n(r+\epsilon )-\phi ^-_n(r) \leqslant \phi ^-_m(s)-\phi ^-_m(s-\epsilon )\).

In particular, for all \(r<0\), the sequence \(\{\phi ^-_n(r)\}_{n=1}^{\infty }\) is nonincreasing.

(c):

\(\succeq \) satisfies Restricted strict inequality aversion if and only if all the statements in part (b) hold with strict inequalities. In this case, for all \(r\in {\mathbb {R}}_-\), the sequence \(\{\phi ^-_n(r)\}_{n=1}^{\infty }\) is strictly decreasing.

Note that Proposition 2.6 does not require the sequence \(\{\phi ^+_n(r)\}_{n=1}^{\infty }\) to be nondecreasing for any \(r>0\). In effect, \(\phi ^+_n\) must be inequality-averse for “sufficiently large” lifetime utilities (those above the threshold \(\theta \)), but to block the Repugnant Conclusion, \(\phi ^+_n\) must become increasingly inequality-seeking for “small” positive lifetime utilities (those in \([0,\theta ]\)), as \(n{\rightarrow }{\infty }\). This is because the rank-additive SWF (2B) must assign rapidly decreasing marginal value to adding more wretched people to an already very populated world. But to respect Positive expansion, \(\succeq \) must still regard these wretched new lives as a net improvement, as long as they are lives worth living. The only way to reconcile these two conflicting imperatives is for the slope of \(\phi ^+_n\) near zero to decay to zero as \(n{\rightarrow }{\infty }\).Footnote 13

Fig. 2
figure 2

Functions \(\phi ^+_n(r)\) in Example 2.7, for \(n\in \{1,\ldots , 5\}\)

Example 2.7

For all \(n\in {\mathbb {N}}\), let \(a_n:=\ln \left( 1+\frac{1}{n^2}\right) \), and then define

$$\begin{aligned} \phi ^+_n(r):= \left\{ \begin{array}{l@{\quad }l@{\quad }l} a_n\, r&{}\,\,{\,\mathrm{if }\,}&{} \,\,r\in [0,1]; \\ \ln \left( r+\frac{1}{n^2}\right) &{}\,\,{\,\mathrm{if }\,}&{} \,\,r\geqslant 1. \end{array}\right. \end{aligned}$$

(See Fig. 2.) If n is large, then \(a_n\approx {1}/{n^2}\) (by the Taylor expansion of \(\ln (x)\) around \(x=1\)). Thus, for all \(r\in [0,1]\), we have \(\sum _{n=1}^{\infty }\phi ^+_n(r) \approx r\cdot \sum _{n=1}^{\infty }\frac{1}{n^2}\), which is finite; thus, the hypothesis of Proposition 2.2(a) is satisfied, so the resulting RA SWO satisfies No Repugnant Conclusion (for any \(r_0<1\)). Meanwhile, for all \(n\in {\mathbb {N}}\), we have

$$\begin{aligned} (\phi ^+_n)'(r)=\frac{1}{r+1/n^2}, \quad \text{ for } \text{ all } r\in (1,{\infty }). \end{aligned}$$

Thus, \(\phi ^+_n\) is concave increasing on \([1,{\infty })\), and furthermore, if \(n<m\) and \(r\geqslant s\), then \((\phi ^+_n)'(r)< (\phi ^+_m)'(s)\). Thus, the second condition in Proposition 2.6(c) is satisfied (with \(\theta :=1\)). Observe that \((\phi ^+_n)'(r)<1\) for all \(n\in {\mathbb {N}}\) and \(r\in {\mathbb {R}}_+\). Thus, if \(\phi ^-:{\mathbb {R}}_-{{\longrightarrow }}{\mathbb {R}}_-\) is any concave, increasing, differentiable function such that \(\phi ^-(0)=0\) and \((\phi ^-)'(0) \geqslant 1\), and we define \(\phi ^-_n:=\phi ^-\) for all \(n\in {\mathbb {N}}\), then the other two conditions of Proposition 2.6(c) are also satisfied. Thus, the resulting SWO satisfies Restricted strict inequality aversion. \(\square \)

Related literature  In possibilist population ethics, a lifetime utility of zero plays a special role, because it is the utility assigned to non-existent people. This makes it possible to axiomatize SWOs that treat positive and negative utilities differently—a possibility first recognized by Blackorby and Donaldson (1982) and further developed by Zank (2007). Like the results in this section, Zank axiomatically characterizes rank-dependent SWOs that assign special significance to zero utility, and hence treat positive and negative utilities differently. However, he works with a fixed population and uses different axioms than the ones used here.

3 Actualist social welfare orders

3.1 Definition and examples

As in Sect. 2, let \({\mathcal {I}}\) be an infinite set, whose elements represent all the people who could ever possibly exist. Let \({\mathbb {R}}_*:={\mathbb {R}}\sqcup \{\not \exists \}\), where \(\not \exists \) is a special symbol representing “non-existence”. Let \({\mathbb {R}}_*^{\mathcal {I}}\) be the set of all \({\mathcal {I}}\)-indexed profiles \({\mathbf {r}}=(r_i)_{i\in {\mathcal {I}}}\) of \({\mathbb {R}}_*\). For all \(i\in {\mathcal {I}}\), if \(r_i=\not \exists \), then this means i does not exist. On the other hand, if \(r_i\in {\mathbb {R}}\), then interpret \(r_i\) as the lifetime utility of individual i, with the same interpretation as in Sect. 2: if \(r_i>0\), then i’s life is worth living, if \(r_i<0\), then i’s life is not worth living, and if \(r_i=0\), then i’s life is indifferent (for her) to non-existence.

Let \({\mathcal {X}}_\propto \) be the set of all elements of \({\mathbb {R}}_*^{\mathcal {I}}\) where only finitely many entries are not equal to \(\not \exists \). (Some of these non-\(\not \exists \) entries may be zero.) An element of \({\mathcal {X}}_\propto \) represents a complete specification of all the people who will ever exist (I assume this number to be finite), and the lifetime utilities each of them. I will refer to elements of \({\mathcal {X}}_\propto \) as social outcomes. An actualist social welfare order is a preference order on \({\mathcal {X}}_\propto \).Footnote 14

If \(\pi :{\mathcal {I}}{{\longrightarrow }}{\mathcal {I}}\) is any bijection, then define \(\pi ^*:{\mathbb {R}}_*^{\mathcal {I}}{{\longrightarrow }}{\mathbb {R}}_*^{\mathcal {I}}\) by setting \(\pi ^*({\mathbf {r}}) := (r_{\pi (i)})_{i\in {\mathcal {I}}}\) for all \({\mathbf {r}}=(r_i)_{i\in {\mathcal {I}}}\) in \({\mathbb {R}}_*^{\mathcal {I}}\). Clearly, \(\pi ({\mathcal {X}}_\propto )={\mathcal {X}}_\propto \), and \(\pi \) restricted to \({\mathcal {X}}_\propto \) defines a bijection from \({\mathcal {X}}_\propto \) to itself. We will be interested in SWOs satisfying the following axiom:

  • Anonymity. If \(\pi :{\mathcal {I}}{{\longrightarrow }}{\mathcal {I}}\) is any bijection, and \({\mathbf {x}}\in {\mathcal {X}}_\propto \), then \({\mathbf {x}}\approx \pi ^*({\mathbf {x}})\).

For any \({\mathbf {x}}\in {\mathcal {X}}_\propto \), let \(|{\mathbf {x}}|\) be the number of non-\(\not \exists \) entries in \({\mathbf {x}}\). In particular, let \(\emptyset \) be the empty world: the unique element of \({\mathcal {X}}_\propto \) such that all entries are \(\not \exists \); then, \(|\emptyset |=0\). If \(|{\mathbf {x}}|>0\), then we say \({\mathbf {x}}\) is nonempty. For any \(N\in {\mathbb {N}}\), let \({\mathcal {X}}_N:=\{{\mathbf {x}}\in {\mathbb {R}}_*^{\mathcal {I}}\); \(|{\mathbf {x}}|=N\}\), and let \({\mathbb {R}}^{N_\uparrow }:=\{{\mathbf {r}}\in {\mathbb {R}}^N\); \(r_1\leqslant r_2\leqslant \cdots \leqslant r_N\}\) be the set of all nondecreasing elements of \({\mathbb {R}}^N\). For any \({\mathbf {x}}\in {\mathcal {X}}_N\), let \({\mathbf {x}}^\uparrow :=(x^\uparrow _1,x^\uparrow _2,\ldots ,x^\uparrow _N)\in {\mathbb {R}}^{N_\uparrow }\) be the N-dimensional vector consisting of all non-\(\not \exists \) entries of \({\mathbf {x}}\), listed in nondecreasing order. Let \({\mathbb {R}}^{\propto _\uparrow }:=\bigcup _{N=1}^{\infty }{\mathbb {R}}^{N_\uparrow }\), and let \(\succeq _*\) be a preference order on \({\mathbb {R}}^{\propto _\uparrow }\). Then we can define an SWO \(\succeq \) on \({\mathcal {X}}_\propto \) by the formula:

(3A)

It is easy to see that \(\succeq \) satisfies Anonymity: if \(\pi :{\mathcal {I}}{{\longrightarrow }}{\mathcal {I}}\) is any bijection, and \({\mathbf {y}}=\pi ^*({\mathbf {x}})\), then \({\mathbf {y}}^\uparrow ={\mathbf {x}}^\uparrow \), so that \({\mathbf {x}}\approx {\mathbf {y}}\). Conversely, if \(\succeq \) is an SWO on \({\mathcal {X}}_\propto \) satisfying Anonymity, then there is a unique preference order \(\succeq _*\) on \( {\mathbb {R}}^{\propto _\uparrow }\) satisfying formula (3A). In other words, there is a natural bijective correspondence between preference orders on \( {\mathbb {R}}^{\propto _\uparrow }\) and SWOs on \({\mathcal {X}}_\propto \) satisfying Anonymity. Thus, we can work directly with preference orders on \( {\mathbb {R}}^{\propto _\uparrow }\).

For all \(n\in {\mathbb {N}}\), let \(\phi _n:{\mathbb {R}}{{\longrightarrow }}{\mathbb {R}}\) be a continuous, increasing function. Consider the social welfare function \(W:{\mathcal {X}}_\propto {{\longrightarrow }}{\mathbb {R}}\) defined as follows:

$$\begin{aligned} W(\emptyset ) \ := \ 0, \ \ \text{ and } \ \ W({\mathbf {x}}) \ := \ \sum _{n=1}^{|{\mathbf {x}}|} \phi _n(x^\uparrow _n), \ \text{ for } \text{ all } \text{ nonempty } {\mathbf {x}}\in {\mathcal {X}}_\propto \text{. } \end{aligned}$$
(3B)

This is called an ascending rank-additive (ARA) social welfare function. The social welfare order it represents is an ascending rank-additive SWO. For example:

  • Suppose \(c\in {\mathbb {R}}\), and \(\phi _n(r)=r-c\) for all \(n\in {\mathbb {N}}\) and all \(r\in {\mathbb {R}}\). Then formula (3B) yields the critical-level utilitarian SWF. In particular, if \(c=0\), then we get the classical utilitarian SWF. If \(\phi :{\mathbb {R}}{{\longrightarrow }}{\mathbb {R}}\) is a continuous, increasing function, and \(\phi _n=\phi \) for all \(n\in {\mathbb {N}}\), then (3B) yields a generalized utilitarian SWF:

    $$\begin{aligned} W(\emptyset ) \ := \ 0, \ \ \text{ and } \ \ W({\mathbf {x}}) \ := \ \sum _{n=1}^{|{\mathbf {x}}|} \phi (x^\uparrow _n), \ \text{ for } \text{ all } \text{ nonempty } {\mathbf {x}}\in {\mathcal {X}}_\propto \text{. } \end{aligned}$$
    (3C)
  • Let \(\{a_n\}_{n=1}^{\infty }\) be a decreasing sequence of positive constants, and suppose \(\phi _n(r)=a_n\,r\) for all \(n\in {\mathbb {N}}\) and all \(r\in {\mathbb {R}}\). Then formula (3B) yields an ascending rank-weighted utilitarian SWF.

  • More generally, let \(\phi :{\mathbb {R}}{{\longrightarrow }}{\mathbb {R}}\) be a continuous, increasing function, and suppose \(\phi _n(r)=a_n\,\phi (r)\) for all \(n\in {\mathbb {N}}\) and all \(r\in {\mathbb {R}}\). Then formula (3B) yields an ascending rank-weighted generalized utilitarian SWF:

    $$\begin{aligned} W({\mathbf {x}}):= \sum _{n=1}^{|{\mathbf {x}}|} a_n\, \phi (x^\uparrow _n), \quad \text{ for } \text{ all } {\mathbf {x}}\in {\mathcal {X}}_\propto . \end{aligned}$$
    (3D)

    These have been studied by Asheim and Zuber (2017). In particular, let \(\beta \in (0,1)\), and for all \(n\in {\mathbb {N}}\), let \(\phi _n:=\beta ^n\,\phi \). Then formula (3D) becomes a rank-discounted generalized utilitarian SWF, which was axiomatically characterized by Asheim and Zuber (2014):

    $$\begin{aligned} W({\mathbf {x}}):= \sum _{n=1}^{|{\mathbf {x}}|} \beta ^n\, \phi (x^\uparrow _n), \quad \text{ for } \text{ all } {\mathbf {x}}\in {\mathcal {X}}_\propto . \end{aligned}$$
    (3E)

In these examples, I do not assume that \(\phi _n(0)=0\). In other words, I do not assume that the existence of an individual with a neutral level of lifetime utility is ethically equivalent to her non-existence. In fact, even if \(\phi _n(0)=0\) for all \(n\in {\mathbb {N}}\), this would not be the case: introducing a new person with zero lifetime utility can change the rankings of people who already exist, thereby changing overall social welfare in a complex way. Thus, ARA SWOs are fundamentally different from the possibilist rank-additive SWOs introduced in Sect. 2; they typically do not satisfy either Positive expansion or Negative expansion.

However, as noted by Asheim and Zuber (2014, 2016, 2017), ARA SWOs are attractive because they can avoid the Repugnant Conclusion while exhibiting inequality aversion at all welfare levels. To see this, consider the ascending rank-weighted generalized utilitarian SWF (3D). For simplicity, suppose \(\phi (r)=r\) for all \(r\in {\mathbb {R}}\). If the sequence \(\{a_n\}_{n=1}^{\infty }\) is decreasing, then this SWF is inequality-averse, because it assigns lower marginal social welfare to the lifetime utilities of more fortunate individuals (who appear higher in the ranking). Furthermore, Asheim and Zuber (2017, Proposition 6) show that this SWF avoids the Repugnant Conclusion if and only if \(\sum _{n=1}^{\infty }a_n<{\infty }\). Clearly, this summability condition is compatible with \(\{a_n\}_{n=1}^{\infty }\) being a decreasing sequence—for example, it is satisfied by the rank-discounted generalized utilitarian SWF (3E). I generalize this result in Proposition 3.2.

3.2 Axiomatic characterization

I will characterize ARA SWOs with six axioms. The first one is Anonymity. The next three are quite standard and also appeared in Sect. 2.2. To state these axioms, suppose an SWO \(\succeq \) on \({\mathcal {X}}_\propto \) satisfies Anonymity. Then it can represented by a preference order \(\succeq _*\) on \({\mathbb {R}}^{\propto _\uparrow }\). For any \(N\in {\mathbb {N}}\), recall that \({\mathbb {R}}^{N_\uparrow }\subset {\mathbb {R}}^{\propto _\uparrow }\); let \(\succeq _N\) be the restriction of \(\succeq _*\) to \({\mathbb {R}}^{N_\uparrow }\). Note that \({\mathbb {R}}^{N_\uparrow }\) is a closed, convex subset of \({\mathbb {R}}^N\); endow it with the subspace topology it inherits from \({\mathbb {R}}^{N}\). The next two axioms concern the preference orders \(\{\succeq _N\}_{N=1}^{\infty }\).

  • Continuity. For every \({\mathbf {x}}\in {\mathcal {X}}_\propto \), and every \(N\in {\mathbb {N}}\), the upper contour sets \(\{{\mathbf {y}}^\uparrow \); \({\mathbf {y}}\in {\mathcal {X}}_N\) and \({\mathbf {x}}\preceq {\mathbf {y}}\}\) and the lower contour sets \(\{{\mathbf {y}}^\uparrow \); \({\mathbf {y}}\in {\mathcal {X}}_N\) and \({\mathbf {x}}\succeq {\mathbf {y}}\}\) are closed subsets of \({\mathbb {R}}^{N_\uparrow }\).

  • Pareto. For every \(N\in {\mathbb {N}}\) and for all \({\mathbf {x}},{\mathbf {y}}\in {\mathbb {R}}^{N_\uparrow }\), if \(x_n\geqslant y_n\) for all \(n\in [1\ldots N]\), then \({\mathbf {x}}\succeq _N {\mathbf {y}}\). If, furthermore, \(x_n> y_n\) for some \(n\in [1\ldots N]\), then \({\mathbf {x}}\succ _N {\mathbf {y}}\).

These axioms have the same justification as the corresponding axioms in Sect. 2. Note that Continuity is slightly stronger than requiring the orders \(\succeq _N\) to be continuous: it also requires closure of contour sets determined by elements outside of \({\mathcal {X}}_N\).Footnote 15

To formulate the last axiom, we need some notation. Let \({\mathbf {x}}\in {\mathcal {X}}_\propto \), let \(N:=|{\mathbf {x}}|\), let \(n\in [1\ldots N]\), and let \(b\in {\mathbb {R}}\) be such that \(x^\uparrow _{n-1}\leqslant b\leqslant x^\uparrow _{n+1}\).Footnote 16 Let \(b_{(n)}{\mathbf {x}}\) be the unique element \({\mathbf {y}}\in {\mathcal {X}}\) such that \(|{\mathbf {y}}|=|{\mathbf {x}}|\), \(y^\uparrow _n=b\), and \(y^\uparrow _m=x^\uparrow _m\) for all other \(m\in [1\ldots N]\setminus \{n\}\).Footnote 17

Now let \(n<m\in {\mathbb {N}}\), and let \(a<b<c<d\in {\mathbb {R}}\). I will write \({(a{\mathop {\leadsto }\limits ^{n}} b)} \cong {(c{\mathop {\leadsto }\limits ^{m}} d)}\) if there exists \({\mathbf {x}}\in {\mathcal {X}}_\propto \) such that \(x^\uparrow _n=a\), \(x^\uparrow _m=c\), \(b_{(n)}{\mathbf {x}}\), and \(d_{(m)}{\mathbf {x}}\) are well defined, and \( b_{(n)}{\mathbf {x}}\approx d_{(m)}{\mathbf {x}}\). This means that switching a to b in coordinate n is “ethically equivalent” to switching c to d in coordinate m. If \(\succeq \) is represented by a SWF W, then \({(a{\mathop {\leadsto }\limits ^{n}} b)} \cong {(c{\mathop {\leadsto }\limits ^{m}} d)}\) if the change in W induced by switching a to b in the nth coordinate is exactly equal to the change in W induced by switching c to d in the mth coordinate. If W has an ascending rank-additive representation (3B), then \({(a{\mathop {\leadsto }\limits ^{n}} b)} \cong {(c{\mathop {\leadsto }\limits ^{m}} d)}\) if and only if \(\phi _n(b)-\phi _n(a) = \phi _m(d)-\phi _m(c)\). Here is the last axiom:

  • Trade-off Consistency. For any \(n<m\in {\mathbb {N}}\), and any \(a<b<c<d\in {\mathbb {R}}\) such that \({(a{\mathop {\leadsto }\limits ^{n}} b)} \cong {(c{\mathop {\leadsto }\limits ^{m}} d)}\), and any \({\mathbf {y}},{\mathbf {z}}\in {\mathcal {X}}_\propto \), such that \(y^\uparrow _n=a\), \(y^\uparrow _{n-1}\leqslant b\leqslant y^\uparrow _{n+1}\), \(z^\uparrow _m=c\), and \(z^\uparrow _{m-1}\leqslant d\leqslant z^\uparrow _{m+1}\), if \({\mathbf {y}}\approx {\mathbf {z}}\), then \(b_{(n)}{\mathbf {y}}\approx d_{(m)}{\mathbf {z}}\).

Note that this axiom does not assume that \(|{\mathbf {y}}|=|{\mathbf {z}}|\). It says: if the act of switching a to b in coordinate n is “ethically equivalent” to the act of switching c to d in coordinate m when both switches are applied to the same outcome \({\mathbf {x}}\), then this same ethical equivalence should also be observed when these switches are applied to two different outcomes \({\mathbf {y}}\) and \({\mathbf {z}}\), possibly with different population sizes. Finally, we need the following structural condition.

  • Neutral population growth. For all \(N\in {\mathbb {N}}\), there exists some \({\mathbf {x}}\in {\mathcal {X}}_N\) such that \({\mathbf {x}}\approx \emptyset \).

This condition is natural and easily satisfied. For example, if \(\succeq \) is a rank-weighted generalized utilitarian SWO as in (3D), then it satisfies Neutral population growth if and only if \(\phi (r)=0\) for some \(r\in {\mathbb {R}}\). Meanwhile, if \(\succeq \) is an ARA SWO represented by (3B), then it satisfies Neutral population growth if \(\phi _1\) is unbounded below, while \(\phi _n\) takes at least some positive values for all \(n\geqslant 2\). Here is the second main result of the paper.

Theorem 2

Let \(\succeq \) be an actualist SWO satisfying Neutral population growth on \({\mathcal {X}}_\propto \). Then \(\succeq \) satisfies Anonymity, Continuity, Pareto, and Trade-off Consistency if and only if it is ascending rank additive. In the representation (3B), the functions \(\{\phi _n\}_{n=1}^{\infty }\) are unique up to multiplication by a common scalar.

Proof sketch For all \(N\in {\mathbb {N}}\), use Trade-off Consistency to show that \(\succeq _N\) satisfies a limited separability axiom called Coordinate IndependenceFootnote 18 in a neighbourhood around each point in \({\mathbb {R}}^{N_\uparrow }\). Then use a result of Wakker (1988) to obtain “local” additive representations in a neighbourhood of each point in \({\mathbb {R}}^{N_\uparrow }\). Then, as in the proof of Theorem 1, use a result of Chateauneuf and Wakker (1993) to get a global additive representation on all of \({\mathbb {R}}^{N_\uparrow }\). Next, use Trade-off Consistency to show that, for any \(N<M\), the N utility functions in the additive representation on \({\mathbb {R}}^{N_\uparrow }\) are actually the first N utility functions for the additive representation on \({\mathbb {R}}^{M_\uparrow }\). Finally, use Trade-off Consistency again to show that these additive representations also correctly account for comparisons between outcomes with different population sizes. For details, see Appendix B.

Independence of the axioms One can demonstrate the independence of Anonymity, Continuity, Pareto using examples very similar to those at the end of Sect. 2.2. Meanwhile, one can show the independence of Trade-off Consistency using an example very similar to the example for Separability from Sect. 2.2.

3.3 Further results

Critical levels Let \({\mathbf {x}}\in {\mathcal {X}}_\propto \) and let \(c\in {\mathbb {R}}\). In the terminology of Blackorby et al. (2005), c is a critical level for \({\mathbf {x}}\) if adding a new person with lifetime utility c to \({\mathbf {x}}\) is an ethically neutral act. By the Pareto axiom, such a critical level is unique, if it exists. For example, in the classical utilitarian SWO, \(c=0\) for all \({\mathbf {x}}\in {\mathcal {X}}_\propto \) . In the average utilitarian SWO, c is the average lifetime utility in \({\mathbf {x}}\). The ARA SWOs characterized in Theorem 2 do not necessarily possess such critical levels for every social outcome. In other words, they do not necessarily satisfy the following axiom:

  • Critical levels. For any \({\mathbf {x}}\in {\mathcal {X}}_\propto \), there exists \(c\in {\mathbb {R}}\) (depending on \({\mathbf {x}}\)) with \({\mathbf {x}}\approx {\mathbf {x}}\uplus c\).

This axiom says that there is no outcome \({\mathbf {x}}\) so bad that adding any new person to \({\mathbf {x}}\) is always considered an improvement, or so good that adding any new person to \({\mathbf {x}}\) is always considered a deterioration. Suppose \(\succeq \) is an ARA SWO defined by a collection of functions \({\varvec{\phi }}:=\{\phi _n\}_{n=1}^{\infty }\). To ensure that \(\succeq \) satisfies Critical levels, we must impose some conditions on \({\varvec{\phi }}\). One might think that it is sufficient to require, for all \(n\in {\mathbb {N}}\), the existence of some \(c_n\in {\mathbb {N}}\) with \(\phi _n(c_n)=0\). But this is not quite sufficient, as we will now see. For all \(n\in {\mathbb {N}}\), define the function \(\delta \phi _n:{\mathbb {R}}{{\longrightarrow }}{\mathbb {R}}\) by \(\delta \phi _n(r):=\phi _{n+1}(r)-\phi _n(r)\). Then define

$$\begin{aligned} S({\varvec{\phi }}):= \sup {\left\{ \sum _{n=1}^N \delta \phi _n(x_n) \; ; \; N\in {\mathbb {N}}{\,\,\mathrm{and }\,\,}x_1\leqslant x_2\leqslant \cdots \leqslant x_N \right\} }. \end{aligned}$$
(3F)

Proposition 3.1

Let \(\succeq \) be an ARA SWO on \({\mathcal {X}}_\propto \) with representation (3B), such that for all \(n\in {\mathbb {N}}\), there is some \(c_n\in {\mathbb {N}}\) with \(\phi _n(c_n)=0\). The following statements are equivalent:

(a):

\(\succeq \) satisfies Critical levels.

(b):

\(\inf (\phi _1({\mathbb {R}}))\leqslant -S({\varvec{\phi }})\), and if \(\inf (\phi _1({\mathbb {R}}))= -S({\varvec{\phi }})\), then the supremum in formula (3F) is never obtained.

In most cases, the condition in Proposition 3.1 is easily satisfied. For example, if \(\phi _1\) is unbounded below (so that \(\inf (\phi _1({\mathbb {R}}))=-{\infty }\)), then the condition is automatically true. Meanwhile, in a generalized utilitarian SWO (3C), we have \(S({\varvec{\phi }})=0\), so the condition simply requires that that \(\inf (\phi _1({\mathbb {R}}))<0\).

Suppose that \(\succeq \) is as in Proposition 3.1. Then for any \(N\in {\mathbb {N}}\) and \({\mathbf {x}}\in {\mathcal {X}}_\propto \), if \(|{\mathbf {x}}|=N-1\) and \(\max ({\mathbf {x}})\leqslant c_N\), then \({\mathbf {x}}\approx {\mathbf {x}}\uplus c_N\). In other words, adding a person with lifetime utility \(c_N\) to the world is an ethically neutral act, as long as everyone who already exists has an even lower level of lifetime utility. This is similar to the axiom Existence of a critical level employed by Asheim and Zuber (2014) in their axiomatic characterization of rank-discounted generalized utilitarian SWOs, but weaker: Asheim and Zuber additionally require that \(c_N=c_M\) for all \(N,M\in {\mathbb {N}}\).

Inequality and the Repugnant Conclusion As Asheim and Zuber (2014) noted, an ARA SWO can reconcile inequality aversion with evasion of the Repugnant Conclusion by assigning lower marginal social welfare to the lifetime utility of the better-off individuals in any social outcome. The next result makes this precise. It parallels Propositions 2.2 and 2.4.

Proposition 3.2

Let \(\succeq \) be an ARA SWO with the SWF (3B).

(a):

\(\succeq \) satisfies No Repugnant Conclusion if and only if there exists \(r>0\) such that \(\displaystyle \sum _{n=1}^{\infty }\phi _n(r)<{\infty }\).

(b):

\(\succeq \) satisfies Inequality aversion if and only if for all \(n,m\in {\mathbb {N}}\) with \(n\geqslant m\), all \(r,s\in {\mathbb {R}}\) with \(r\geqslant s\), and all \(\epsilon >0\), we have \(\phi _n(r+\epsilon )-\phi _n(r) \leqslant \phi _m(s)-\phi _m(s-\epsilon )\). In particular, for all \(r\in {\mathbb {R}}_+\), we have \(\phi _1(r)\geqslant \phi _2(r)\geqslant \phi _3(r)\geqslant \cdots \)

Furthermore, if \(\{\phi _n\}_{n=1}^{\infty }\) are differentiable, then for any nondecreasing sequence \(r_1\leqslant r_2 \leqslant r_3 \leqslant \cdots \) of real numbers, we have \( \phi '_1(r_1)\geqslant \phi '_2(r_2)\geqslant \phi '_3(r_3)\geqslant \cdots \)

(c):

\(\succeq \) satisfies Strict inequality aversion if and only if all the statements in part (b) hold with strict inequalities.

There are also versions of Proposition 2.2(b) and 2.3 for ARA SWO (for avoiding utility monsters and the St. Petersburg Paradox), but they are obvious, and are left to the reader.

Example 3.3

Let \(\phi :{\mathbb {R}}{{\longrightarrow }}{\mathbb {R}}\) be a concave increasing function, let \(\{a_n\}_{n=1}^{\infty }\) be a nonincreasing sequence of positive constants, and suppose W is the ascending rank-weighted generalized utilitarian SWF (3D). Then Proposition 3.2(b) says that \(\succeq \) satisfies Inequality aversion. If \(\sum _{n=1}^{\infty }a_n<{\infty }\), then Asheim and Zuber (2017) say that \(\succeq \) is proper. In this case, Proposition 3.2(a) says that \(\succeq \) satisfies No Repugnant Conclusion. In particular, if \(\beta \in (0,1)\), and W is the rank-discounted generalized utilitarian SWF (3E), then \(\succeq \) satisfies both Strict inequality aversion and No Repugnant Conclusion. \(\square \)

Tyrannies of aggregation and nonaggregation For any \({\mathbf {x}}\in {\mathcal {X}}_\propto \), let \({\mathcal {I}}({\mathbf {x}}):=\{i\in {\mathcal {I}}\); \(x_i\ne \not \exists \}\) be the set of people who exist in the outcome \({\mathbf {x}}\). Fleurbaey and Tungodden (2010) have proposed the following axiom:

  • Minimal Aggregation. For some \(N\in {\mathbb {N}}\), and all \({\mathbf {x}}\in {\mathcal {X}}_N\), and all \(i\in {\mathcal {I}}({\mathbf {x}})\), there exists some \(\alpha>\beta >0\) such that, for any other \({\mathbf {y}}\in {\mathcal {X}}_\propto \) with \({\mathcal {I}}({\mathbf {y}})={\mathcal {I}}({\mathbf {x}})\), if \(x_i\geqslant y_i\geqslant x_i-\beta \), while \(y_j\geqslant x_j+\alpha \) for all other \(j\in {\mathcal {I}}({\mathbf {x}})\), then \({\mathbf {y}}\succeq {\mathbf {x}}\).

This says that there is at least one situation where it is considered acceptable for one person (namely i) to make a small sacrifice (at most \(\beta \)) so that everyone else can gain a larger amount (at least \(\alpha \)). This rules out the maximin and leximin SWFs, which give absolute priority to the worst-off. In the terminology of Fleurbaey and Tungodden, it excludes the tyranny of nonaggregation.

For any \({\mathbf {x}}\in {\mathcal {X}}_\propto \), let \({\underline{\mathcal {I}}}({\mathbf {x}}):=\{i\in {\mathcal {I}}({\mathbf {x}})\); \(x_i\leqslant x_j\) for all \(j\in {\mathcal {I}}({\mathbf {x}})\}\); these are the worst-off people in the outcome \({\mathbf {x}}\). Let \({\overline{\mathcal { I}}}({\mathbf {x}}):=\{i\in {\mathcal {I}}({\mathbf {x}})\); \(x_i\geqslant x_j\) for all \(j\in {\mathcal {I}}({\mathbf {x}})\}\); these are the best-off people in \({\mathbf {x}}\). Fleurbaey and Tungodden (2010) also propose the next axiom:

  • Mild Nonaggregation. For all \(r,q\in {\mathbb {R}}\) with \(r>q\), there exists some \(\alpha>\beta >0\) such that, for any \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}_\propto \) with \({\mathcal {I}}({\mathbf {x}})={\mathcal {I}}({\mathbf {y}})\), if there exists \(i\in {\underline{\mathcal {I}}}({\mathbf {y}})\) with \(y_i\leqslant q\) and \(x_i\geqslant y_i+\alpha \), while for all other \(j\in {\mathcal {I}}({\mathbf {x}})\), either \(x_j=y_j\) or \(j\in {\overline{\mathcal { I}}}({\mathbf {x}})\cap {\overline{\mathcal { I}}}({\mathbf {y}})\), \(y_j\geqslant r\), and \(x_j \geqslant y_j-\beta \), then \({\mathbf {x}}\succeq {\mathbf {y}}\).

This is an egalitarian principle which says that, if one of the worst-off people (namely i) has the opportunity to gain a large enough amount (at least \(\alpha \)), then it is acceptable to require all the best-off people to make a small sacrifice (at most \(\beta \)). This rules out principles such as utilitarianism, in which a tiny welfare gain for every member of a large group (say, a billion people) could in principle justify a huge sacrifice (say, a painful death) for one person. In the terminology of Fleurbaey and Tungodden, it excludes the tyranny of aggregation. Note that the axiom allows us to make \(\alpha \) arbitrarily large and \(\beta \) arbitrarily small, depending on the values of q and r; in that sense it is a fairly mild principle. However, \(\alpha \) and \(\beta \) do not depend on the size of \({\mathcal {I}}({\mathbf {x}})\); thus, even if \({\overline{\mathcal { I}}}({\mathbf {x}})\cap {\overline{\mathcal { I}}}({\mathbf {y}})\) has a quadrillion people, their \(\beta \)-sized gains are insufficient to offset i’s \(\alpha \)-sized loss.

For any \({\mathbf {x}}\in {\mathcal {X}}_\propto \) and \(N\in {\mathbb {N}}\), let \({\mathbf {x}}^N\) denote a social outcome with \(|{\mathbf {x}}^N|=N\,|{\mathbf {x}}|\), such that every person in \({\mathbf {x}}\) is “replicated” N times in \({\mathbf {x}}^N\). (If an SWO satisfies Anonymity, it does not matter how we arrange these N replicated copies). An SWO \(\succeq \) is replication-invariant if, for any \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}_\propto \) with \({\mathbf {x}}\succeq {\mathbf {y}}\), we have \({\mathbf {x}}^N\succeq {\mathbf {y}}^N\) for all \(N\in {\mathbb {N}}\). It is easily seen that rank-additive SWOs are not replication-invariant, in general.

Fleurbaey and Tungodden prove that Minimal Aggregation and Mild Nonaggregation are incompatible in any replication-invariant SWO.Footnote 19 But they note that these axioms are compatible if we give up replication invariance; in particular, they are both satisfied by rank-discounted utilitarian SWFs like (3E). The next result makes this precise.

Proposition 3.4

(a):

Any ARA SWO satisfies Minimal Aggregation.

(b):

Let \(\succeq \) be an ARA SWO with the SWF (3B). Suppose \(\{\phi _n\}_{n=1}^{\infty }\) are concave and differentiable, and for all \(r\in {\mathbb {R}}\), there exists \(A\in {\mathbb {R}}_+\) such that \(\sum _{n=N+1}^{\infty }\phi '_n(r)< A\cdot \phi '_m(r)\), for any \(m,N\in {\mathbb {N}}\) with \(m\leqslant N\). Then \(\succeq \) satisfies Mild Nonaggregation.

The condition in Proposition 3.4(b) implies that the sequence \(\{\phi '_n(r)\}_{n=1}^{\infty }\) decays quickly enough that \(\sum _{n=1}^{\infty }\phi '_n(r)<{\infty }\). But it says more: it says that this sequence decays with roughly “exponential” speed. For example, suppose there is some \(b\in (0,1)\) such that \(\phi '_{n+1}(r)\leqslant b\cdot \phi '_{n}(r)\) for all \(n\in {\mathbb {N}}\) (e.g. \(\phi '_{n}(r)=1/b^n\) for all \(n\in {\mathbb {N}}\)); then, the condition in Proposition 3.4(b) is satisfied, with \(A:=b/(1-b)\). Asheim and Zuber (2017, Proposition 5) have proved a result similar to Proposition 3.4 for ascending rank-weighted generalized utilitarian SWFs of type (3D).

Top-independence and bottom-independence For any \({\mathbf {x}}\in {\mathcal {X}}_\propto \), let \(\max ({\mathbf {x}})\) be the maximal lifetime utility of any person in the social outcome \({\mathbf {x}}\). (Equivalently, if \({\mathbf {x}}^\uparrow =(x_1^\uparrow ,\ldots ,x_N^\uparrow )\), then \(\max ({\mathbf {x}})=x_N^\uparrow \).) All ARA SWOs clearly satisfy the next axiom.

  • Top-independence. For all \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}_\propto \) with \(|{\mathbf {x}}|=|{\mathbf {y}}|\) and all \(z\in {\mathbb {R}}\) with \(z\geqslant \max \{\max ({\mathbf {x}}),\max ({\mathbf {y}})\}\), we have \({\mathbf {x}}\succeq {\mathbf {y}}\) if and only if \({\mathbf {x}}\uplus z \succeq {\mathbf {y}}\uplus z\).

This axiom has a similar justification to Separability or the axiom of Top-independence in good worlds from Sect. 2.3: it says that, when comparing two social outcomes, we can disregard the utility—or even the existence—of someone who is indifferent between these outcomes, as long as she would be the happiest person in both outcomes. Unfortunately, ARA SWOs do not, in general, satisfy the analogous property involving unhappy people:

  • Bottom-independence. For all \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}_\propto \) with \(|{\mathbf {x}}|=|{\mathbf {y}}|\) and all \(z\in {\mathbb {R}}\) with \(z\leqslant \min \{\min ({\mathbf {x}}),\min ({\mathbf {y}})\}\), we have \({\mathbf {x}}\succeq {\mathbf {y}}\) if and only if \({\mathbf {x}}\uplus z \succeq {\mathbf {y}}\uplus z\).

One way to obtain SWFs satisfying Bottom-independence is to apply a formula like (3B) to decreasing rather than increasing sequences. Formally, for any \(N\in {\mathbb {N}}\), let \({\mathbb {R}}^{N_\downarrow }:=\{{\mathbf {r}}\in {\mathbb {R}}^N\); \(r_1\geqslant \cdots \geqslant r_N\}\) be the set of all nonincreasing elements of \({\mathbb {R}}^N\). Let \({\mathbb {R}}^{\propto _\downarrow }:=\bigcup _{N=1}^{\infty }{\mathbb {R}}^{N_\downarrow }\). For any \({\mathbf {x}}\in {\mathcal {X}}_N\), let \({\mathbf {x}}^\downarrow :=(x^\downarrow _1,\ldots ,x^\downarrow _N)\in {\mathbb {R}}^{N_\downarrow }\) be the N-dimensional vector of all non-\(\not \exists \) entries of \({\mathbf {x}}\), listed in nonincreasing order. For all \(n\in {\mathbb {N}}\), let \(\phi _n:{\mathbb {R}}{{\longrightarrow }}{\mathbb {R}}\) be continuous and increasing. Define \(W:{\mathcal {X}}_\propto {{\longrightarrow }}{\mathbb {R}}\) by:

$$\begin{aligned} W(\emptyset ) \ \ := \ \ 0,\quad \text{ and }\quad W({\mathbf {x}}) \ \ := \ \ \sum _{n=1}^{|{\mathbf {x}}|} \phi _n(x^\downarrow _n), \quad \text{ for } \text{ all } \text{ nonempty } {\mathbf {x}}\in {\mathcal {X}}_\propto \text{. } \end{aligned}$$

This is called a descending rank-additive (DRA) social welfare function. These are axiomatically characterized by a result very similar to Theorem 2, except that the axioms Pareto, Continuity, and Trade-off Consistency are applied to orderings defined on \({\mathbb {R}}^{N_\downarrow }\) rather than \({\mathbb {R}}^{N_\uparrow }\) (for all \(N\in {\mathbb {N}}\)). However, DRA SWOs are less appealing than ARA SWOs. As observed in Proposition 3.2, ARA SWOs can simultaneously Strict inequality aversion and No Repugnant Conclusion. But DRA SWOs cannot. Indeed, for a DRA SWO to satisfy No Repugnant Conclusion, it must be inequality-promoting, which is much less attractive. Fortunately, there is another way to obtain Bottom-independence, while preserving the other desirable properties of ARA SWOs.

Corollary 3.5

Let \(\succeq \) be an actualist SWO on \({\mathcal {X}}_\propto \). Then \(\succeq \) satisfies the axioms of Theorem 2 and also Bottom-independence if and only if it is rank-discounted generalized utilitarian, as in formula (3E). In this representation, \(\beta \) is unique and \(\phi \) is unique up to multiplication by a positive constant.

This is similar to the main result of Asheim and Zuber (2014), except that they do not require Trade-off Consistency, but instead employ versions of Top-independence and Bottom-independence,Footnote 20 an axiom positing the existence of egalitarian equivalents, and a slightly stronger form of Critical levels.

Related literature Ebert (1988) axiomatizes a class of ascending rank-additive SWOs, but he works with populations of fixed size, and uses different axioms. Bleichrodt et al. (2008) obtain a very general axiomatization of rank-additive utility representations. Although their intended application is descriptive decision theory, they note that their result can also be applied to social welfare. They also use continuity, monotonicity (i.e. Pareto), a richness condition analogous to Neutral population growth, and a “Trade-off Consistency” axiom, but their version of this axiom is different than the one in this paper.

3.4 Actualism versus possibilism

Proposition 3.1 shows that ARAs do not generally satisfy Critical levels. Even when they do, the critical level depends on the social outcome under consideration, and is not typically zero. But suppose that \(\succeq \) was an actualist SWO where the critical level was zero for every social outcome. Thus, for any \({\mathbf {x}}\in {\mathcal {X}}_\propto \), if \({\mathbf {x}}'\) is obtained from \({\mathbf {x}}\) by converting any finite number of \(\not \exists \) components to zeros, then \({\mathbf {x}}\approx {\mathbf {x}}'\). Let \({\mathcal {X}}\) be the space of “possibilist” social outcomes defined in Sect. 2.1, and let \(\varPhi ({\mathbf {x}})\) be the element of \({\mathcal {X}}\) obtained by converting all the \(\not \exists \) components in \({\mathbf {x}}\) to zeros. This defines a surjection \(\varPhi :{\mathcal {X}}_\propto {{\longrightarrow }}{\mathcal {X}}\). We can then define a possibilist SWO \(\succeq '\) on \({\mathcal {X}}\) as follows: for any \({\mathbf {x}}',{\mathbf {y}}'\in {\mathcal {X}}\), stipulate that \({\mathbf {x}}'\succeq '{\mathbf {y}}'\) if and only if \({\mathbf {x}}\succeq {\mathbf {y}}\) for some \({\mathbf {x}},{\mathbf {y}}\in {\mathcal {X}}_\propto \) such that \(\varPhi ({\mathbf {x}})={\mathbf {x}}'\) and \(\varPhi ({\mathbf {y}})={\mathbf {y}}'\). (This is well defined independent of the choice of \({\mathbf {x}}\) and \({\mathbf {y}}\), precisely because the critical level of \(\succeq \) is zero for all outcomes.)

Given this construction, it seems that possibilist SWOs are just a special case of actualist SWOs—namely those with a universal zero critical level. Thus, it seems redundant to introduce the separate framework of Sect. 2. It is also puzzling that the axioms Anonymity, Pareto, Continuity, and Separability were enough to axiomatically characterize rank-additive SWOs in the possibilist framework (Theorem 1), when these four axioms are not enough in the actualist framework. (The axiom of Trade-off Consistency is stronger than Separability.) Furthermore, the class of ARA SWOs we eventually do obtain in Theorem 2 is substantially different in nature.

However, a closer inspection reveals that the possibilist framework is not “just” a special case of the actualist framework. Its additional structure allows for logically weaker versions of key axioms and leads to different conclusions. For example, the possibilist Continuity axiom is not just the actualist Continuity axiom “projected” through the mapping \(\varPhi \)—it is a weaker axiom. To see this, note that the possibilist Continuity axiom makes a rigid distinction between the positive and negative components of a social outcome; it essentially says that the SWO is continuous with respect to small perturbations of individual utility levels, as long as these perturbations do not switch a positive utility to a negative one, or vice versa. The actualist Continuity axiom makes no such distinction. For similar reasons, the possibilist Separability axiom is weaker than the axiom that would be obtained by formulating an analogous Separability axiom in the actualist framework, and then “projecting” this axiom through \(\varPhi \). There is a further difference between the two Continuity axioms (already noted in Sect. 3.2). Both axioms require closed contour sets in the set \({\mathcal {X}}_N\) of outcomes of size N (for all \(N\in {\mathbb {N}}\)). But the possibilist Continuity axiom only considers contour sets determined by elements of \({\mathcal {X}}_N\) itself, whereas the actualist Continuity axiom considers contour sets in \({\mathcal {X}}_N\) determined by all elements of \({\mathcal {X}}_\propto \).

Of course, the possibilist Continuity and Separability axioms could be formulated in the actualist framework and might even appear reasonably natural if we were already fully committed to a universal zero critical level. We could then formulate a version of Theorem 1 in the actualist framework, using these two axioms (together with Anonymity and Pareto), along with an axiom universal zero critical level. However, the fact that these two axioms are substantially weaker than their actualist counterparts, and yet still sufficient to characterize the (possibilist) rank-additive SWOs without the need for Neutral population growth or Trade-off Consistency, tells us that universal zero critical level is in fact a very powerful axiom—powerful enough that it is perhaps more appropriate to think of it as determining an entirely different analytical framework, rather than just a special case of the actualist framework.Footnote 21

4 Existence independence

Rank-additive SWOs violate an axiom which Blackorby et al. (2005, §5.6) call Existence independence. This axiom says that the ethical evaluation of outcomes concerning some collection \({\mathcal {K}}\) of individuals (say, those currently alive on planet Earth) should not depend upon information about the lifetime utilities—or even the existence—of people outside of \({\mathcal {K}}\) (say, people who died long ago, who will be born in the far future, or who live on other planets). As Blackorby et al. (2005, §5.1.1) note, the ethical evaluation of presently existing people should not depend on the utility of some long-dead historical figure, such as Euclid. Likewise, suppose that a colony of humans on another planet has long ago lost all contact with Earth; Blackorby et al. (2005) argue that it would be absurd if the ethical evaluations of the colonists depended upon the utilities of the earthlings (or vice versa).Footnote 22

The generalized utilitarian SWF in formula (2D) satisfies Existence Independence, as does any “critical-level” variant of generalized utilitarianism (with a constant critical level). But it is violated by average utilitarianism, number-dampened utilitarianism, and any other SWF where the critical level depends on the utilities of already existing people. Rank-additive SWOs violate Existence Independence in an even more fundamental way: if \({\mathcal {K}}\) is the collection of individuals under consideration, then we don’t even know how to assign ranks to the members of \({\mathcal {K}}\) until we know the lifetime utilities of all the other people not in \({\mathcal {K}}\). We know almost nothing about the well-being of the vast majority of people who have existed in human history (say, over the last 250,000 years). This creates problems for any SWO whose assessment of present and future social outcomes is sensitive to such historical data.Footnote 23 There are several possible responses:

(A):

Interpret social outcomes in \({\mathcal {X}}\) as specifying only the lifetime utilities of individuals who will be affected by policy decisions; treat everyone else as ethically irrelevant. (In particular, ignore anyone who is already dead.)

(B):

Interpret social outcomes in \({\mathcal {X}}\) as specifying only the lifetime utilities of individuals living in the present or the future. Ignore the past.

(C):

Interpret social outcomes in \({\mathcal {X}}\) as specifying all individuals whose lifetime utilities are already known or can be predicted (including some people in the past). Ignore people about which nothing can be known.

(D):

Interpret social outcomes in \({\mathcal {X}}\) as specifying only the lifetime utilities of individuals living after a fixed date (e.g. January 1, 2018). Ignore everyone before this date.

(E):

Treat the utilities of unobserved individuals as a source of policy uncertainty, and deal with it the same way we deal with any other source of uncertainty: by positing a probability distribution over the unknown variables and then maximizing expected value with respect to this probability distribution.

The problem with (A) is that it is not entirely predictable who will be affected by our decisions in the future. For example, suppose the lost colony world unexpectedly re-establishes contact with Earth, after many centuries of isolation; at this moment, the rankings of everyone on the colony and on Earth would need to be recalculated, possibly leading to large changes in the evaluation of social policies. In particular, if \({\mathbf {x}}\) and \({\mathbf {y}}\) are two social outcomes which concern only the colonists and \({\mathbf {x}}'\) and \({\mathbf {y}}'\) are two social outcomes which concern only earthlings, then we may end up with a perverse situation where \({\mathbf {x}}\succ {\mathbf {y}}\) and \({\mathbf {x}}' \succ {\mathbf {y}}'\), but \({\mathbf {x}}\uplus {\mathbf {x}}' \prec {\mathbf {y}}\uplus {\mathbf {y}}'\).

Option (B) avoids this problem. But an obvious problem with both (A) and (B) is time inconsistency: as time passes, people move from “the future” or “the present” into “the past”, and are removed from the specification of the social outcome. This changes the rankings of the remaining people, and hence, the evaluation of social outcomes. It would seem strange if social outcome \({\mathbf {x}}\) was deemed preferable to outcome \({\mathbf {y}}\) before David Bowie died, but a moment after he dies, we decide that \({\mathbf {y}}'\) is actually better than \({\mathbf {x}}'\) (where \({\mathbf {x}}'\) and \({\mathbf {y}}'\) are obtained by removing Bowie’s lifetime utility from \({\mathbf {x}}\) and \({\mathbf {y}}\), respectively).

Approach (C) avoids time inconsistency. But it can still respond perversely to the arrival of new information. For example, a new and unanticipated archaeological discovery could change our estimate of the lifetime utilities of the citizens of a large ancient civilization (say, the Achaemenid Empire) and thus perturb our evaluation of social outcomes in the present day. Again, this seems absurd.

Approach (D) avoids the problems of (A), (B), and (C), but it is motivated more by pragmatism than by principle; certainly, we must give up any pretentions of moral realism if we allow our ethical evaluations to depend on an arbitrarily stipulated date on a calendar. Furthermore, (D) is still vulnerable to unknown information about the future; since we cannot really predict the lifetime utilities of far future people with any degree of precision, how are we supposed to incorporate them into the social welfare evaluation?

This leaves us with approach (E). Approach (E) does not try to exclude unknown or unknowable lifetime utilities from the specification of the social outcome by some arbitrary criterion. Instead, it “bites the bullet”, acknowledging that these unknowns exist, they are ethically relevant, and they must be taken into account. To formalize approach (E), I will assume the possibilist framework of Sect. 2. (The formalization for actualist SWOs is similar and is left to the reader.)Footnote 24 Let \({\mathcal {I}}={\mathcal {J}}\sqcup {\mathcal {K}}\), where \({\mathcal {J}}\) is an infinite set representing all potential unobserved individuals (living in the distant future, the forgotten past, or on faraway planets), while \({\mathcal {K}}\) is another infinite set representing all observable individuals (e.g. those presently alive on Earth). Let \({\mathcal {Y}}:=\{{\mathbf {y}}\in {\mathbb {R}}^{\mathcal {J}}\); only finitely many coordinates of \({\mathbf {y}}\) are nonzero\(\}\) and let \({\mathcal {Z}}:=\{{\mathbf {z}}\in {\mathbb {R}}^{\mathcal {K}}\); only finitely many coordinates of \({\mathbf {z}}\) are nonzero\(\}\). Then \({\mathcal {X}}= {\mathcal {Y}}\times {\mathcal {Z}}\). Let \(\mu \) be a probability distribution over \({\mathcal {Y}}\), representing our beliefs about the lifetime utilities of all unobserved people. For any social outcome \({\mathbf {z}}\in {\mathcal {Z}}\) (representing the lifetime utilities of observed people), define

$$\begin{aligned} {\widetilde{W}}({\mathbf {z}}) := \int _{\mathcal {Y}}\ W({\mathbf {y}}\uplus {\mathbf {z}}) \ \ \mathrm {d}\mu [{\mathbf {y}}]. \end{aligned}$$
(4A)

This defines a new SWF \({\widetilde{W}}:{\mathcal {Z}}{{\longrightarrow }}{\mathbb {R}}\), and it is this SWF that (E) says we should maximize. How does this work from a practical point of view? Let \({\mathbf {z}}:=({\mathbf {z}}^+,{\mathbf {z}}^-)\in {\mathbb {R}}_{+}^{\propto _\downarrow }\times {\mathbb {R}}_{-}^{\propto _\uparrow }\). For any \(n\in {\mathbb {N}}\), we can define a probability distribution \(\rho ^+_{{\mathbf {z}},n}\) on \([n\ldots {\infty })\) where \(\rho ^+_{{\mathbf {z}},n}(m)\) is the probability (according to \(\mu \)) that the individual in \({\mathcal {K}}\) with lifetime utility \(z^+_n\)actually has rank m amongst all individuals with positive utility, once we take into account all the unobserved individuals in \({\mathcal {J}}\). Likewise, for any \(n\in {\mathbb {N}}\), we define a probability distribution \(\rho ^-_{{\mathbf {z}},n}\) on \([n\ldots {\infty })\), where \(\rho ^-_{{\mathbf {z}},n}(m)\) is the probability that the individual in \({\mathcal {K}}\) with lifetime utility \(z^-_n\) actually has rank m amongst all individuals with negative utilities. (Note that \(\rho ^\pm _{{\mathbf {z}},n}\) are only supported on \([n\ldots {\infty })\), because introducing new individuals to the list can only increase the rank of any existing individual.) We then define the functions \({\widetilde{\phi }}^\pm _n:{\mathbb {R}}_\pm {{\longrightarrow }}{\mathbb {R}}_\pm \) by

$$\begin{aligned} {\widetilde{\phi }}^\pm _n(r):= \sum _{m=n}^{\infty }\rho ^\pm _{{\mathbf {z}},n}(m) \, \phi ^\pm _m(r), \quad \text{ for } \text{ all } r\in {\mathbb {R}}_\pm . \end{aligned}$$
(4B)

The SWF \({\widetilde{W}}\) in formula (4A) is then simply the rank-additive SWF obtained by inserting \(\{{\widetilde{\phi }}^+_n\}_{n=1}^{\infty }\) and \(\{{\widetilde{\phi }}^-_n\}_{n=1}^{\infty }\) into formula (2B).

Of course, approach (E) faces the same question as any decision under uncertainty: How can we construct the probability distribution \(\mu \)? But this question already confronts any social decision problem which concerns people living in the far future. One way to minimize the dependency on \(\mu \) is to minimize the amount of variation between the functions \(\{\phi ^+_n\}_{n=1}^{\infty }\) and \(\{\phi ^-_n\}_{n=1}^{\infty }\)—or more importantly, between their derivatives. If the derivatives \(\{(\phi ^+_n)'\}_{n=1}^{\infty }\) are all very similar to one another, then the derivatives \(\{({\widetilde{\phi }}^+_n)'\}_{n=1}^{\infty }\) of the functions defined in formula (4B) will also be very similar, independent of the precise choice of \(\mu \) (and likewise for \(\{(\phi ^-_n)'\}_{n=1}^{\infty }\) and \(\{({\widetilde{\phi }}^-_n)'\}_{n=1}^{\infty }\)). Proposition 2.2 tells us that the functions \(\{\phi ^+_n\}_{n=1}^{\infty }\) must rapidly decay to zero in a neighbourhood of the neutral utility 0. Hence, in this neighbourhood, we cannot expect them to be similar in this desired sense. But outside of this neighbourhood, nothing prevents us from ensuring that their derivatives are as similar as possible; see Example 2.7. If \((\phi ^+_{100})'\) and \((\phi ^+_{10000})'\) are almost the same, then it doesn’t matter whether a certain individual is ranked 100th or 10,000th—the marginal social welfare contribution of her lifetime utility is almost the same in both cases, so she will treated the same in any policy decision in both cases.

This consideration suggests avoiding the rank-weighted utilitarian SWFs such as (2E) and (3D), where inequality aversion is obtained by systematically increasing the slopes of the functions \(\{\phi ^+_n\}_{n=1}^{\infty }\) as \(n{\rightarrow }{\infty }\) (and systematically decreasing the slopes of the functions \(\{\phi ^-_n\}_{n=1}^{\infty }\) as \(n{\rightarrow }{\infty }\)). Instead, it suggests that we use something like the generalized utilitarian SWF in formula (2D), where the functions \(\{\phi ^+_n\}_{n=1}^{\infty }\) are all as similar as possible, and inequality aversion is obtained by making them sufficiently concave.

5 Excess (in)egalitarianism

As explained in Example 3.3, an ascending rank-weighted generalized utilitarian (ARWGU) SWO (3D) is proper if \(\sum _{n=1}^{\infty }a_n<{\infty }\). If \(\{a_n\}_{n=1}^{\infty }\) is decreasing, then such an SWO satisfies both Inequality aversion and No Repugnant Conclusion—an attractive combination. However, these SWOs have a problem. If there is a sufficiently large number of people with “satisfactory” lives, then a proper ARWGU SWO will prioritize the needs of a small population with slightly worse lives over the creation of an arbitrarily large population with excellent lives. To see this, note that, for any \(\epsilon >0\), there is some \(N(\epsilon )\in {\mathbb {N}}\) such that \(\sum _{k=N(\epsilon )+1}^{\infty }a_k < \epsilon \). Suppose for simplicity that \(\phi \) is the identity. (The same argument works for any choice of \(\phi \)). Consider a population \({\mathbf {x}}\) consisting of a large number N of people with lifetime utility 100 (representing a “satisfactory” life) and a much smaller number M of people with lifetime utility 99. For concreteness, say that \(M=50\). Let \(B:= \sum _{k=1}^M a_k\), let \(\epsilon := B/1{,}000{,}000\), and suppose that \(N>N(\epsilon )\). Now consider the following options:

  • \({\mathbf {y}}\) consists of \(N+M\) people, all having lifetime utility 100.

  • \({\mathbf {z}}= {\mathbf {x}}\uplus {\mathbf {u}}\), where \({\mathbf {u}}\) is a “utopia” containing a trillion people, all having lifetime utility 1,000,000.

It is easily seen that \(W({\mathbf {y}})> W({\mathbf {z}})\). Formally:

$$\begin{aligned} W({\mathbf {y}})= & {} W({\mathbf {x}}) + \sum _{n=1}^M a_k = W({\mathbf {x}}) + B \\= & {} W({\mathbf {x}}) + 1{,}000{,}000\,\epsilon > W({\mathbf {x}}\uplus {\mathbf {u}}) = W({\mathbf {z}}). \end{aligned}$$

In other words, the ARWGU SWO represented by W considers it better to help 50 people slightly improve their lifetime utility from 99 to 100, rather than to create a utopia with a trillion people leading excellent lives, each with a lifetime utility of 1,000,000.

For concreteness, let’s say \(N= 10\) billion, and that 100 represents the lifetime utility of the average middle-class person in a Western European country in the early twenty-first century. So \({\mathbf {x}}\) represents a world somewhat more populous than our own, but with poverty entirely eliminated worldwide. Perhaps it is Earth two hundred years in the future. Now suppose that astronomers discover that this world faces an apocalyptic threat—say, it is about to be struck by a huge asteroid, and the resulting explosion will destroy all life on the planet. However, for a relatively small investment of resources, it would be possible to evacuate some fraction of humanity to a self-sufficient lunar colony. (This is a future where the technological problems of space travel and lunar settlement have been solved.) Let us suppose that this lunar colony will not only survive, but flourish, and give rise to a vast and long-lived interstellar civilization (represented by \({\mathbf {u}}\)) which, over the coming millennia will be home to a trillion inhabitants who all live very long, happy, and fulfilling lives. For the sake of the thought experiment, suppose (implausibly) that this happy outcome is guaranteed in advance, and is known to the inhabitants of Earth. This is outcome \({\mathbf {z}}\).

Alternately, instead of saving human civilization, we could use these same resources to slightly improve the well-being of a small but unfortunate minority, who have slightly subaverage lifetime utility (i.e. 99 instead of 100). Perhaps they need minor cosmetic surgery. This is outcome \({\mathbf {y}}\). Most people’s moral intuitions say that \({\mathbf {z}}\) is better than \({\mathbf {y}}\). But according to the ARWGU social welfare function W says \({\mathbf {y}}\) is better than \({\mathbf {z}}\).Footnote 25

Here is another counterintuitive consequence. For any \(N\in {\mathbb {N}}\), let \({\mathbf {x}}^N\) describe a world containing N million people, where the vast majority (say, 99.9999%) have excellent lives (say, a lifetime utility of 10,000) but a tiny minority (0.00001%) have lives so terrible that they are not even worth living (say, a lifetime utility of \(-1\)). Any “utopia” which one can imagine will have welfare distribution something like this: no matter how perfect the utopia, there will inevitably be some tiny fraction of people who, through simple bad luck, end up with miserable lives—perhaps they suffer from some extremely rare disease, or perhaps they are victims of some incredibly improbable but terrible accident.

One would think that such a utopia is so wonderful that we should make N as large as possible. But according to the SWO W, the larger we make N, the worse\({\mathbf {x}}^N\)becomes.Footnote 26 Indeed, if N is large enough, then total social welfare is negative, meaning that a vast galactic utopia with the above statistical welfare distribution of well-being is ethically worse than a totally lifeless galaxy. For a less stark comparison, let \({\mathbf {y}}\) be a “small, safe, but boring” world, containing only one million people, all of whom have lives which are wretched, but technically worth living (say, a lifetime utility of 1). It is easily verified that, if N is large enough, then \(W({\mathbf {x}}^N)<W({\mathbf {y}})\). Suppose humanity had to choose between two futures: one leading to a galactic utopia (\({\mathbf {x}}^N\), for large N) and the other leading to a wretched but anodyne future (\({\mathbf {y}}\)). The SWO W says that humanity should choose \({\mathbf {y}}\).

Excess egalitarianism in particular affects the rank-discounted utilitarian SWO (3E) characterized by Asheim and Zuber (2014). However, if proper ARWGU SWOs suffer from excess egalitarianism, then possibilist RA SWOs can suffer from an even worse problem: excess inegalitarianism. To see this, recall from Proposition 2.2(a) that a possibilist RA SWO with SWF W as in (2B) satisfies No Repugnant Conclusion if and only if there exists \(r_0>0\) such that \(\displaystyle \sum \nolimits _{n=1}^{\infty }\phi ^+_n(r_0)<{\infty }\). Thus, for any \(\epsilon >0\), there is some \(N(\epsilon )\) such that \(\displaystyle \sum \nolimits _{n=N+1}^{\infty }\phi ^+_n(r_0)<\epsilon \).

For concreteness, suppose that \(r_0=1\), while a lifetime utility level 100 represents, say, a middle-class life in a Western European country. Let \(\epsilon :=0.0001\times (\phi ^+_1)'(100)\); this is roughly the increase in total value that would be obtained if the best-off person in society increased her lifetime utility from 100 to 100.0001. Let \(N:=N(\epsilon )\), and let \(M:= 1{,}000{,}000{,}000\,N\). Let \({\mathbf {x}}\) be a social outcome containing N “well-off” people with lifetime utility 100, and M “miserable” people with a lifetime utility of 0.1 (that is, lives of abject misery, barely worth living). Consider the following possible improvements:

  • In \({\mathbf {y}}\), the best-off person’s lifetime utility is increased from 100 to 100.0002, while the lifetime utility of everyone else stays exactly the same as in \({\mathbf {x}}\).

  • In \({\mathbf {z}}\), the N well-off people remain the same, while the lifetime utilities of the M miserable people are increased from 0.1 to 1.

It is easily verified that \(W({\mathbf {y}})>W({\mathbf {z}})\); in other words, the SWO considers it better to increase the utility of the most fortunate individual by a minuscule amount, rather than significantly boost the utilities of an astronomically vast population of miserable people.

6 Conclusion

Excess egalitarianism and excess inegalitarianism are very unappealing problems, which plague any rank-additive SWO (either actualist or possibilist) that avoids the Repugnant Conclusion via Propositions 2.2(a) and 3.2(a). In the light of this, Theorems 1 and 2 might not seem like positive results, but rather impossibility theorems. Rank-additive SWOs also have other shortcomings: actualist SWOs violate Positive and Negative expansion, while possibilist SWOs violate Inequality aversion. As always in population ethics, there are trade-offs to be made. What is the best way to make them? This is an interesting problem for future research.