1 Introduction

1.1 Re-examining the principle of being forward-looking

This paper is concerned with the following questions. Should a collective decision criterion have the same form of rationality as those of individuals do? If not, what type of dynamic constraint should be imposed? What should be the decision-theoretic content of collective decision criteria?

It is one of the central doctrines of ‘rational’ decision making that an agent should be forward-looking and not be bound by bygones. This presumes that the past and things already known not to have occurred are ‘sunk,’ and have no role to play except purely informational ones to facilitate inference about uncertain or strategic worlds. Throughout this paper, we will call this principle the principle of being forward-looking.Footnote 1

Most economic analyses presume the principle of being forward-looking for collective decision criteria as well. To evaluate streams of social outcomes, they assume that the social welfare function satisfies the discounted utility form which is standard for individual decision making: a stream of social outcomes \((c_{0},c_{1},\ldots )\), where \(c_{\tau }\) denotes the social outcome at period \(\tau \), is evaluated in the form

$$\begin{aligned} \sum _{\tau =0}^{\infty } \beta ^{\tau }u(c_{\tau }), \end{aligned}$$

in which \(\beta \) is the discount factor and the function u evaluates the social outcomes at each period independently of history and time (see Koopmans 1960). Thus, after the passage of \(t-1\) periods of time, the ex post social welfare function induced by the ex ante one evaluates a stream starting at period t, denoted by \((c_{t},c_{t+1},\ldots )\), in the form

$$\begin{aligned} \sum _{\tau =t}^{\infty } \beta ^{\tau -t}u(c_{\tau }), \end{aligned}$$

which is independent of \((c_{0},\ldots ,c_{t-1})\), the history of social outcomes up to \(t-1\), and independent even of when ‘now’ is.

Likewise, to evaluate uncertain social outcomes, these analyses assume that the social welfare function satisfies the expected utility theory which is standard for individual decision making: an uncertain social outcome f given as a mapping from the set of possible states of the world S into the set of social outcomes C is evaluated in the form

$$\begin{aligned} \int _{S}u(f(s))p(ds), \end{aligned}$$

in which p is the prior belief adopted as a ‘consensus’ and the function u evaluates social outcomes independently of which states they are realized at. Thus, after learning that an event \(E\subset S\) has occurred, or, equivalently, that \(S{\setminus } E\) turned out not to have occurred, the ex post social welfare function induced by the ex ante one takes the form of the conditional expectation

$$\begin{aligned} \int _{E}u(f(s))p(ds|E), \end{aligned}$$

where \(p(\cdot |E)\) denotes the posterior conditional on E. Note that this ex post welfare judgment conditional on E is independent of what outcome would have been obtained if event E had not occurred, and when there is no more uncertainty, it even does not matter which event has been realized.

We argue, however, that the principle of being forward-looking is not an appealing principle for collective decision making, even if we accept it for individuals’ evaluations of decisions. First, there are a handful of impossibility results. Zuber (2010) and Jackson and Yariv (2014) show that aggregate preferences over deterministic streams of outcomes can satisfy the Pareto principle, dynamic consistency, and independence of histories and time, which are assumed in the discounted utility form above, only if individual and aggregate preferences are stationary additive, i.e., the discount factors are the same for all individuals, and the aggregation rule is additive.

This means that under the principle of being forward-looking, any dynamically consistent social welfare function excludes (i) intertemporal trading due to differences in time perspectives, and (ii) equity concerns even when individuals have the same time perspective. To understand the impossibility of (i), imagine two individuals, one of whom is more patient and the other is less patient. Then the Pareto principle implies that the first one should get more consumption now and less in the future, and the opposite for the other. This, however, cannot be compatible with the social evaluation of outcomes per period being time invariant. To understand the impossibility of (ii), imagine that the society should equalize the lifetime utilities of the individuals. But once somebody is happier and somebody else is unhappier in some period, we cannot apply the same rule to sequences of social outcomes starting in the next period.

In the setting of uncertainty, Mongin (1995) shows that an aggregate preference over social outcomes (considered as random variables) satisfies the ex ante Pareto principle and the subjective expected utility theory only when individuals have the same beliefs and the aggregation is additive. Even if individuals are assumed to have the same beliefs, in earlier papers, Hammond (1996) shows that a social choice rule satisfies the Pareto principle, dynamic consistency, and is forward-looking only if it follows additive aggregation. More strongly, Zuber (2010) and Mongin and Pivato (2015) show that an aggregate preference can satisfy the Pareto principle, dynamic consistency, and the state-independence of ex post aggregate preferences, which are assumed in the subjective expected utility form above, only if individual and aggregate preferences have the subjective expected utility form, the individuals all have the same beliefs, and the aggregation rule is additive. These strengthen the result of Harsanyi (1955) that individual and aggregate preferences have the expected utility forms and satisfy the Pareto principle only if the aggregation is additive.

This means that under the principle of being forward-looking, any dynamically consistent social welfare function excludes (i) trading across states due to differences in beliefs, and (ii) equity concerns as considered in Diamond (1967), that the society should give equal chances to everybody for the sake of fairness, even when individuals have the same beliefs. To understand the impossibility of (i), imagine two individuals, one of whom believes it will rain tomorrow and the other believes it will be sunny. Then the Pareto principle implies that the first one should get more consumption if it rains and less if sunny, and the opposite for the other. This, however, cannot be compatible with the social evaluation of outcomes per period being state-independent, as was pointed out by Chambers and Hayashi (2009). To understand the impossibility of (ii), notice that a strict preference for randomization leads to a violation of the independence axiom due to von Neumann and Morgenstern, and also that under the principle of being forward-looking, the independence axiom is implied by dynamic consistency: this classic point has been extensively discussed by Machina (1989).

Not only that, we argue through examples below that the past and things already known not to have occurred have intrinsic roles to play. Let us first discuss a classical fable attributed to Aesop and one of its variations.

Example 1

‘The Grasshopper and the Ant’

(This is also known as ‘The Cicada and the Ant’.) The ancient version, Babrius 140, in Perry (1965) runs:

An ant in the winter-time was dragging out of his hole some grain which he had stored up in the summer, in order to air it. A cicada, dying of starvation, begged him to give him some of his food, to keep him alive. “What were you doing last summer?” asked the ant. “I was not loafing,” said the cicada, “I was busy singing all the time.” The ant laughed and barred up his grain, saying: “Dance in the winter, since you piped during the summer.”

A variation due to Disney (from the retelling by Brown 1993) is:

.....When the ants came to the door, they found him there, half frozen. And ten of the kind and busy ants came out and carried the poor grasshopper into their house. They gave him hot corn soup. And they hurried about, making him warm.

Then the Queen of All the Ants came to him. And the grasshopper was afraid, and he begged of her: “Oh, Madam Queen, wisest of ants, please, please give me another chance.”

The Queen of All the Ants looked at the poor, thin frozen grasshopper as he lay shivering there. Then she spoke these words: “With ants, just those who work may stay. So take your fiddle – and PLAY!”

The grasshopper was so happy that his foot began beating out the time in the old way, and he took up his fiddle and sang.....

To translate the above stories into a more formal setting, consider an economy with two periods, summer and winter, in which Ant and Grasshopper each initially own two units of the consumption good and has access to a storage technology. Assume that Ant saves one unit and Grasshopper consumes the entire two units during the summer. Thus in winter there is just one unit left, which was saved by Ant. If we accept the ending of the classic version, it reveals a judgment which ranks giving the entire one unit to Ant over splitting the remaining one unit. If we accept the ending of the modern variation, it reveals a judgment which prescribes splitting the remaining unit, let’s say equally.Footnote 2 \(^{,}\) Footnote 3

Each of the judgments amounts to a problem. The second one leads to an obvious conflict between ex post justice and ex ante justice. The idea of ex post justice is quite prevalent, and it is indeed regarded as a basis for redistribution in many real life situations. In this example, it prescribes splitting the remaining unit equally. However, under the natural requirement of dynamic consistency, the ex ante welfare criterion must prescribe that Ant should receive a two-period consumption profile (1, 0.5) and Grasshopper should receive (2, 0.5), which is unfair in any sense from the ex ante viewpoint.

The first judgment is, surprisingly, not explained by any of the accepted arguments in welfare economics. We should notice that the resource allocation problem they face in winter is an ex post problem, which is different from the ex ante one. The problem here is not about sharing across periods, but about whether we should take some resource from Ant and move it to Grasshopper. The standard welfare economics argument, based on the the principle of being forward-looking, is silent about this.

Let us further illustrate our points with a different situation: risk and uncertainty.

Example 2

‘Machina’s Mom’

From Machina (1989):

Mom has a single indivisible item – a ‘treat’ – which she can give to either daughter Abigail or son Benjamin. Assume that she is indifferent between Abigail getting the treat and Benjamin getting the treat, and strongly prefers either of these outcomes to the case where neither child gets it. However, in a violation of the precepts of expected utility theory, Mom strictly prefers a coin flip over either of these sure outcomes, and in particular, strictly prefers 1/2: 1/2 to any other pair of probabilities. This random allocation procedure would be straightforward, except that Benjie, who cut his teeth on Raiffa’s classic Decision Analysis, behaves as follows:

Before the coin is flipped, he requests a confirmation from Mom that, yes, she does strictly prefer a 50:50 lottery over giving the treat to Abigail. He gets her to put this in writing. Had he won the flip, he would have claimed the treat. As it turns out, he loses the flip. But as Mom is about to give the treat to Abigail, he reminds Mom of her preference for flipping a coin over giving it to Abigail (producing her signed statement), and demands that she flip again.

What would your Mom do if you tried to pull a stunt like this? She would undoubtedly say “You had your chance!” and refuse to flip the coin again. This is precisely what Mom does.

Machina continues, ‘By replying “You had your chance”, Mom is reminding Benjamin of the existence of the snipped-off branch (the original 1/2 probability of B) and that her preferences are not separable, so the fact that nature could have gone down that branch still matters. Mom is rejecting the property of consequentialism—and, in my opinion, rightly so.’ Here the term ‘consequentialism’ is what the principle of being forward-looking is called in the literature of choice under uncertainty, which says, in particular, that the decision should be independent of events or outcomes which turned out not to have occurred (see Hammond 1996, for example).

Again, Mom’s claim amounts to a problem, as does Benjamin’s claim as well. If we accept Benjamin’s claim, in so far as we want our welfare judgment to be dynamically consistent, this implies that the ex ante judgment must support Abigail’s winning the item with probability \(1/2 \times 1/2=1/4\) and Benjamin’s winning with probability \(1/2+1/2 \times 1/2=3/4\), which is unfair in any sense from the ex ante viewpoint. If we accept Mom’s claim, rejecting consequentialism, then we are required to propose an alternative decision-theoretic restriction, since there is no welfare economic theory stepping outside the realm of the principle of being forward-looking, except that Epstein and Segal (1992) address issues arising in non-consequentialist updating of quadratic social welfare functions.

1.2 An alternative dynamic restriction

We have argued above that the principle of being forward-looking is not an appealing principle for collective decision making, even if it is acceptable for individuals’ evaluations of decisions. This leads us to ask what is the alternative dynamic restriction that should be accepted even after rejecting the principle of being forward-looking.

It should be noted that we are not looking for a particular ethical standpoint, just as the principle of being forward-looking is not a particular ethical standpoint. What we are looking for is a form of rationality which should be satisfied by collective decisions.

It is presented in the form a meta axiom, or axiom-selection device in other words, which a reasonable axiom on social decisions is required to satisfy. This contrasts to that the principle of being forward-looking reduces to the form of a particular axiom, such as independence of history and time index.

To motivate our question, let us come back to the Ant and the Grasshopper. One may think of simply applying the welfare criterion which was accepted ex ante. Under the rationality assumption that both Ant and Grasshopper have correct foresight of the consequences of their (non-)saving, one may argue that Ant preferred a two-period consumption profile (1, 1) to (2, 0) ex ante and the opposite for Grasshopper, and deduce that since they have unanimously agreed to a plan, Ant’s receiving (1, 1) and Grasshopper’s receiving (2, 0), it is unreasonable to redistribute consumption in the winter.

However, if we support the execution of a plan merely on the ground that it was accepted in the past as being desirable, where is there any thinking in the present? This is essentially to say,

Grasshopper, you should die. Why? Because we decided that before.

Can’t we say anything other than that? Anybody could say this just by mechanically following the projection of the ex ante decision. There is no autonomy. We believe that ex post decision making must be a deliberate recalibration, which should satisfy a certain level of rationality.

A practical argument may say that the execution of an ex ante agreed-to plan must be enforced (or self-enforcing) even if there is no intrinsic reason to do so ex post, because otherwise such a plan is null and will never be implemented. For example, consider why you should pay your debt. The practical argument is that you should pay your debt because otherwise you would be punished or nobody would finance you in the first place. Our argument is rather that the fulfillment of a plan (repayment of a debt) is not only practically necessary but also should be justifiable by the deliberate recalibration of welfare.Footnote 4

To illustrate further, imagine any situation in which there is no prior contract or agreement explicitly made in the past, and one goes to court requesting compensation for a loss or lack of fairness due to somebody’s wrongdoing; or imagine any situation in which a prior contract or agreement is violated by somebody and one goes to court requesting its fulfillment or compensation for the foregone benefit due to its violation. The judge then needs to make a decision as an independent thinker, and needs to track back the history in order to calculate the right compensation, by investigating what could have been done, what could have occurred, and what had been conceived as agreeable. This is a deliberate process of recalibration. What, then, is the judge maximizing?

Dynamic consistency is of course necessary, since otherwise the welfare judgment is not regarded as credible over time. We consider that it is not sufficient as a reasoning to derive an ex post welfare ranking, however. We would even say that dynamic consistency itself may be trivial unless there is a formal restriction imposed on the process of decision criteria. One can always have the process dynamically consistent trivially, by simply taking the projections of the initial decision criterion.

Consider the following example taken from Hayashi (unpublished). There is a cake and the problem is how much of it to eat over time, and the decision maker has the following process of dynamic decision criteria.

  • Period 1: I would like to eat double today, and would like to eat equal amounts everyday from tomorrow on.

  • Every period after Period 1: I would like to eat equal amounts everyday.

This process is dynamically consistent, but it is totally unnatural and absurd, both descriptively and normatively. When the decision maker says ‘today is special’ it is natural to expect that the same decision maker will say ‘today is special’ in the next period again and in further future periods as well. In the literature of intertemporal choice, stationarity of preferences is usually referred to as a synonym of ‘time consistency’, and its violation is referred to as ‘time inconsistency’. It should be noted, however, that such an equivalence is an implication of the principle of being forward-looking.

To be precise, let \(\succsim _{h^{t-1}}\) denote the ranking over streams of outcomes starting at period t, conditional on the history of the outcomes up to the previous period, denoted by \(h^{t-1}\). Then, dynamic (time) consistency means that

$$\begin{aligned}&(c_{t}, c_{t+1}, c_{t+2},\ldots ) \succsim _{h^{t-1}} (c_{t}, c_{t+1}^{\prime }, c_{t+2}^{\prime },\ldots ) \\&\quad \Longleftrightarrow (c_{t+1}, c_{t+2},\ldots ) \succsim _{(h^{t-1},c_{t})} (c_{t+1}^{\prime }, c_{t+2}^{\prime },\ldots ) , \end{aligned}$$

where \((h^{t-1},c_{t})\) denotes the updated history after having \(c_{t}\). On the other hand, the stationarity of preference means that

$$\begin{aligned} (c_{t}, c_{t+1}, c_{t+2},\ldots ) \succsim (c_{t}, c_{t+1}^{\prime }, c_{t+2}^{\prime },\ldots ) \Longleftrightarrow (c_{t+1}, c_{t+2},\ldots ) \succsim (c_{t+1}^{\prime }, c_{t+2}^{\prime },\ldots ) \end{aligned}$$

where \(\succsim \) is a ranking which is not indexed by history or time. Thus, what underlies the equivalence between the two is the conjunction of (i) independence of history, i.e.,

$$\begin{aligned} \succsim _{h^{t-1}}=\succsim _{\widetilde{h}^{t-1}}\equiv \succsim _{t} \quad \text {for all } h^{t-1}=\widetilde{h}^{t-1} \end{aligned}$$

and (ii) independence of time, i.e.,

$$\begin{aligned} \succsim _{t}=\succsim _{t+1}\equiv \succsim \quad \text {for all } t. \end{aligned}$$

The principle of being forward-looking, put into the context of intertemporal choice, is now understood as the conjunction of the independence of history with the independence of time. Since we are departing from the principle of being forward-looking, we have to give up these properties. However, unless we impose an alternative restriction, the dynamic consistency requirement would be trivial, and such an imposition is exactly what we are going to propose.

This point is important in the normative argument as well. It is unfair to give the special nature or status only to the current (self of the) decision maker. If there is a good reason to say ‘today is special’, there will naturally be a good reason to say ‘today is special’ in the next period again and in further future periods as well.

This point applies to decisions with chances as well. If the decision maker says ‘this is the last chance’, it is natural to expect that the same decision maker will say ‘this is the last chance’ next time, and again in future moments as well. Also, if there is a good reason to say ‘this is the last chance’, there will naturally be a good reason to say ‘this is the last chance’ next time, and again in further future moments as well. This is exactly the conundrum underlying the existing impossibility results.

Our proposal is that the structure of deliberate recalibration itself must be ‘dynamically consistent’. There is a distinction between the structure of a welfare judgment, i.e., the form of thinking, and a particular choice of welfare weights and priorities. We hold that a normatively meaningful structure which appears in an ex ante welfare judgment should appear again in the ex post welfare judgment as its own property, and at the same time in a recurrent manner, while particular weights should depend on histories and vary over time. This is the dynamic constraint we propose, and will be presented in the form of a meta axiom. We call it Recurrence under Consistent Updating.

This meta axiom has been informally discussed by Epstein and Segal (1992), in order to show the merit of their quadratic social welfare function, which they axiomatically characterized in the setting of risk – the non-consequentialist update of a quadratic social welfare function is again quadratic. Also, Gumen and Savochkin (2013) have recently adopted this dynamic restriction (they call it dynamic stability) as the primary condition for characterizing a dynamic decision criterion under ambiguity, in particular, the variational model.

We did not have to be explicit about this meta axiom as long as we were staying with the principle of being forward-looking, since it was implicitly presumed there. It has to be made explicit now, since we are departing from the principle of being forward-looking. We believe that the alternative dynamic restriction still should be a decision-theoretically natural one. The recurrence condition is a natural weakening of the principle of being forward-looking in the sense that it still maintains the ‘stationarity’ property at a meta level, and avoids introducing an ad hoc exceptional feature of the initial decision node.

The meta axiom we impose is a requirement of formal rationality rather than substantive rationality. It is compatible with imposing substantive criteria that are particularly appealing in judgment at the initial stage. What should be recurrent under consistent updating is the structure of the judgment, not the particular choice of welfare weights. The axioms of horizontal equity, for example, are not recurrent under consistent updating and cannot be justified without introducing an ad hoc special role of the initial decision node. We can see this from the previous example: the criterion of equalizing lifetime utilities across individuals can be imposed only at the initial period, because in some period one may be happier and another may be unhappier and in order to be consistent over time we cannot apply the same criterion to sequences of social outcomes starting after such a history. However, such an axiom of horizontal equity is appealing when the initial node is agreed to be indeed ‘special,’ and our argument does not exclude it. We emphasize that being recurrent under consistent updating and being meaningful particularly at the initial stage are logically independent properties.

The fact that the constraint we propose is formal rather than substantive does not mean that it lacks relevance, since the relevance here is about the decision theoretic content of collective decision criteria rather than particular ethical standpoints. What form of rationality should be met by collective decisions is central and intrinsic particularly when time and risk/uncertainty are involved.

1.3 Outline

In this paper we consider a dynamic process of social welfare orderings explicitly. We then propose a dynamic constraint, which is a conjunction of two assertions: (i) the process must be dynamically consistent, which means an ex ante welfare judgment must be respected by ex post ones and there should be no contradiction between them; (ii) the structure of welfare judgment should be recurrent under consistent updating, in the sense that a postulate met by an ex ante welfare judgment is met by any of ex post ones as its own property.

Based on this standpoint, we present a set of axioms for social welfare orderings which are recurrent under consistent updating, and characterize a set of social welfare functions which are closed under updating. With such a class of social welfare functions, we characterize the roles for pasts and things known not to have occurred, which are played in the updating stage.

To give an overview of the results, consider that the objects of choice are streams of utility profiles. Ex-post decision at time t can depend on history of utility profiles up to period \(t-1\). Thus it is conditional on \(\mathbf {u}^{t-1}=(u_{0},\ldots ,u_{t-1})\), a history of utility profiles up to period \(t-1\), where I denotes the set of individuals and \(u_{\tau }=(u_{i\tau })_{i\in I}\) denote the utility profile at period \(\tau \). For simplicity, assume there is a common discount factor \(\beta \), while in the main text we cover the case of heterogenous discounting. Then the characterized social welfare function \(\Phi \) defined over streams of utility profiles starting at period t, typically denoted by \(\mathbf {u}_{t}=(u_{t},u_{t+1},\ldots )\), takes either (i) the exponential form

$$\begin{aligned} \Phi (\mathbf {u}_{t}|\mathbf {u}^{t-1})=-\sum _{i\in I}a_{i}(\mathbf {u}^{t-1}) \exp \left( -\lambda (\mathbf {u}^{t-1})\sum _{\tau =t}^{\infty }\beta ^{\tau -t}u_{i\tau }\right) \end{aligned}$$

in which the welfare weight vector \(a(\mathbf {u}^{t-1})\) and the degree of inequality aversion \(\lambda (\mathbf {u}^{t-1})\) follow the updating rule

$$\begin{aligned} a_{i}(\mathbf {u}^{t-1})= & {} \frac{a_{i}(\emptyset )\exp \left( -\lambda (\emptyset ) \sum _{\tau =0}^{t-1}\beta ^{\tau }u_{i\tau }\right) }{\sum _{j\in I}a_{j}(\emptyset )\exp \left( -\lambda (\emptyset )\sum _{\tau =0}^{t-1}\beta ^{\tau }u_{j\tau }\right) } \\ \lambda (\mathbf {u}^{t-1})= & {} \lambda (\emptyset )\beta ^{t} \end{aligned}$$

or (ii) the additive form

$$\begin{aligned} \Phi (\mathbf {u}_{t}|\mathbf {u}^{t-1})=\sum _{i\in I}a_{i}(\mathbf {u}^{t-1}) \sum _{\tau =t}^{\infty }\beta ^{\tau -t}u_{i\tau } \end{aligned}$$

in which \(a(\mathbf {u}^{t-1})\) follows

$$\begin{aligned} a_{i}(\mathbf {u}^{t-1})= a_{i}(\emptyset ), \end{aligned}$$

where \(\emptyset \) denotes the empty history.

These formulae say that if the social ranking has equity concern by taking the exponential form then it must incorporate past utilities when updating welfare weights, while the equity concern must decrease over time according to the exponential order. There, higher (lower) past utilities must be ‘compensated’ in the way that they decrease (increase) welfare weights in the exponential manner. This provides a bound on how much the society can or should be tolerant of redistribution based on the idea of ex post equity, so that it is compatible with the idea of ex ante equity. And it says that when the social ranking has no equity concern, by taking the additive form, welfare weights must be constant over time, and it does not take past utilities into account.

In the problem of updating under uncertainty the objects of choice are random utility profiles defined over the set of terminal states denoted by \(\Omega \), where information gradually is revealed over time. The ex-post decision at event \(E_{t}\) at period t can depend on utility profiles which would have been obtained if \(E_{t}\) had not occurred. Thus it is conditional on \(\mathbf {u}_{-E_{t}}\), denoting a random utility profile defined over the complement of event \(E_{t}\). For simplicity, let p be the common belief across individuals, while in the main text we cover the case of heterogenous beliefs. Then the characterized social welfare function \(\Phi \) defined over random utility profiles over the realized event \(E_{t}\), typically denoted by \(\mathbf {u}_{E_{t}}\), takes either (i) the exponential form

$$\begin{aligned} \Phi (\mathbf {u}_{E_{t}}|\mathbf {u}_{-E_{t}})=-\sum _{i\in I}a_{i}(\mathbf {u}_{-E_{t}}) \exp \left( -\lambda (\mathbf {u}_{-E_{t}})\int _{E_{t}}u_{i}(s)p(ds|E_{t})\right) \end{aligned}$$

in which the welfare weight vector \(a(\mathbf {u}_{-E_{t}})\) and the degree of inequality aversion \(\lambda (\mathbf {u}_{-E_{t}}) \) follow the updating rule

$$\begin{aligned} a_{i} (\mathbf {u}_{-E_{t}})= & {} \frac{a_{i}(\emptyset ) \exp \left( -\lambda (\emptyset ) \int _{\Omega {\setminus } E_{t}}u_{i}(s)p(ds)\right) }{\sum _{j\in I}a_{j}(\emptyset ) \exp \left( -\lambda (\emptyset ) \int _{\Omega {\setminus } E_{t}}u_{j}(s)p(ds)\right) }\\ \lambda (\mathbf {u}_{-E_{t}})= & {} \lambda (\emptyset )p(E_{t}) \end{aligned}$$

or (ii) the additive form

$$\begin{aligned} \Phi (\mathbf {u}_{E_{t}}|\mathbf {u}_{-E_{t}})=\sum _{i\in I}a_{i}(\mathbf {u}_{-E_{t}}) \int _{E_{t}}u_{i}(s)p(ds|E_{t}) \end{aligned}$$

in which the welfare weight vector \(a(\mathbf {u}_{-E_{t}})\) follows

$$\begin{aligned} a_{i}(\mathbf {u}_{-E_{t}})= a_{i}(\emptyset ) \end{aligned}$$

where \(\emptyset \) denotes the initial history at which nothing is known.

These formulae say that if the social ranking has equity concern, by taking the exponential form, it must incorporate utilities which turned out not to have occurred when updating welfare weights, while the equity concern must decrease as we acquire more and more information about the true state of nature. There, higher (lower) utilities which turned out not to have occurred must be ‘compensated’ in the way that they decrease (increase) welfare weights conditional on the present event. This explains Machina’s Mom reply, “You had your chance!” It also says that when the social ranking has no equity concern, by taking the additive form, welfare weights must be constant, and it does not take utilities into account which turned out not to have occurred.

The paper proceeds as follows. In Sect. 2, we consider the problem of updating in the deterministic setting with intertemporal compensations, and propose a condition that an axiom imposed on social welfare ordering should be recurrent under consistent updating. Then we provide a set of axioms, hence a class of characterized social welfare functions, which is closed under consistent updating, and present the updating rule of welfare weights and equity attitudes, which explains the role of pasts. In Sect. 3, we consider the problem of updating along with resolution of uncertainty, and propose a condition that an axiom imposed on social welfare ordering be recurrent under consistent updating. Then we provide a set of axioms, hence a class of characterized social welfare functions, which is closed under updating, and present the updating rule of welfare weights and equity attitudes, which explains the role of things known not to have occurred. All proofs are relegated to the “Appendix”. Also in the “Appendix” we present an axiomatic characterization of the class of static social welfare functions, which allows for the role of equity concern and is shown to be closed under consistent updating in each of Sects. 2 and 3.

2 Consistent updating with intertemporal compensations

2.1 Consistent updating under homogeneous discounting

In this section we consider the problem of aggregation and updating with intertemporal compensation in a deterministic setting. First we limit attention to the case where discounting is homogeneous across individuals. In a later section, we will extend the argument to the case of heterogeneous discounting.

Consider discrete time with an infinite horizon. Let I be the set of individuals. For technical reasons, we assume \(|I|\ge 3\).Footnote 5

For each t, let \(u_{t}=(u_{it})_{i\in I}\) be the list of utilities received by the individuals at period t. For each t, let \(\mathbf {u}^{t-1}=(u_{0},\ldots ,u_{t-1}) \in \mathbb {R}^{I \times (t-1)}\) be the history of utilities received by the individuals before period t. Denote the initial point, with a null history, by \(\emptyset \). Also, for each t, let \(L^{I \times [t,\infty ]}=\{\mathbf {u}_{t}=(u_{t},u_{t+1},\ldots ):\max _{i} \sup _{\tau \ge t} |u_{i\tau }|<\infty \}\) be the set of bounded sequences starting at period t. For each possible history \(\mathbf {u}^{t-1}\), let \(\succsim _{\mathbf {u}^{t-1}}\) be the social welfare ordering over \(L^{I \times [t,\infty ]}\) conditional on \(\mathbf {u}^{t-1}\). Given \(\mathbf {u}_{t}, \mathbf {v}_{t} \in L^{I \times [t,\infty ]}\), the ranking may be written for example as \(\mathbf {u}_{t} \succsim _{\mathbf {u}^{t-1}} \mathbf {v}_{t}\). Let \(\{\succsim _{\mathbf {u}^{t-1}}:t\in \mathbb {N} \ \text {and} \ \mathbf {u}^{t-1} \in \mathbb {R}^{I\times (t-1)}\}\) be a process of such social welfare orderings. Also, let \(\beta \in (0,1)\) be the discount factor which is assumed to be the same for all individuals.

We will consider two kinds of axioms. One kind describes the properties of the social welfare criteria which are desirable to be satisfied at each time in each possible history, and are investigated as to whether they are recurrent under consistent updating over time. We call such axioms intra-profile axioms. The other kind describes relations between social welfare orderings indexed with different times and histories. We call such axioms inter-profile axioms.

Dynamic consistency is the inter-profile axiom we impose throughout the analysis.

  • Dynamic consistency for all t and \(\mathbf {u}^{t-1} \in \mathbb {R}^{I\times (t-1)}\), for all \(u_{t}\in \mathbb {R}^{I}\) and \(\mathbf {u}_{t+1}, \mathbf {v}_{t+1} \in L^{I \times [t,\infty ]}\),

    $$\begin{aligned} (u_{t},\mathbf {u}_{t+1}) \succsim _{\mathbf {u}^{t-1}} (u_{t},\mathbf {v}_{t+1}) \Longleftrightarrow \mathbf {u}_{t+1} \succsim _{(\mathbf {u}^{t-1},u_{t})} \mathbf {v}_{t+1}. \end{aligned}$$

We define the condition that an intra-profile axiom is recurrent under consistent updating over time.

Definition 1

An intra-profile axiom is said to be recurrent under consistent updating if for any process of social welfare orderings \(\{\succsim _{\mathbf {u}^{t-1}}\}\) satisfying dynamic consistency, for all \(\mathbf {u}^{t-1}\in \mathbb {R}^{I\times (t-1)}\) and \(u_{t}\in \mathbb {R}^{I}\), if it is true for \(\succsim _{\mathbf {u}^{t-1}}\), then it is true for \(\succsim _{\mathbf {u}^{t-1},u_{t}}\).

Example 3

The following intra-profile axioms are not recurrent under consistent updating.

  • Anonymity at a given history \(\mathbf {u^{t-1}}\): for all \(\mathbf {u}_{t}, \mathbf {v}_{t}\in L^{I \times [t,\infty ]}\) and any permutation \(\pi :I\rightarrow I\) it holds that

    $$\begin{aligned} \mathbf {u}_{t} \succsim _{\mathbf {u}^{t-1}} \mathbf {v}_{t} \Longleftrightarrow \mathbf {u}_{t}^{\pi } \succsim _{\mathbf {u}^{t-1}} \mathbf {v}_{t}^{\pi }, \end{aligned}$$

    where \(\mathbf {u}_{t}^{\pi }\) is given by \(u_{i\tau }^{\pi }=u_{\pi ^{-1}(i)\tau }\) for all \(i\in I\) and \(\tau \ge t\).

  • Homogeneity at a given history \(\mathbf {u^{t-1}}\): for all \(\mathbf {u}_{t}, \mathbf {v}_{t}\in L^{I \times [t,\infty ]}\) and for all non-negative numbers c,

    $$\begin{aligned} \mathbf {u}_{t} \succsim _{\mathbf {u}^{t-1}} \mathbf {v}_{t} \Longleftrightarrow c \mathbf {u}_{t} \succsim _{\mathbf {u}^{t-1}} c\mathbf {v}_{t}. \end{aligned}$$

To see that these axioms are not recurrent under updating, consider the social welfare function at history \(\mathbf {u}^{t-1}\) in the form

$$\begin{aligned} \Phi (\mathbf {u}_{t}|\mathbf {u}^{t-1})=\min _{i\in I} \sum _{\tau =t}^{\infty }\beta ^{\tau -t}u_{i\tau }. \end{aligned}$$

This obviously satisfies anonymity and homogeneity at history \(\mathbf {u}^{t-1}\). However, its consistent updating \(\Phi (\mathbf {u}_{t+1}|\mathbf {u}^{t-1},u_{t})\) at history \((\mathbf {u}^{t-1},u_{t})\) must be ordinally equivalent to the function

$$\begin{aligned} \min _{i\in I} \left( u_{it}+ \beta \sum _{\tau =t+1}^{\infty }\beta ^{\tau -(t+1)}u_{i\tau }\right) , \end{aligned}$$

which does not satisfy anonymity or homogeneity as a ranking over \(\mathbf {u}_{t+1}\).

Violating the first of these two axioms means something different from violating the other one.

Anonymity says that the identity of individuals does not matter in evaluating future utilities. From the ex ante viewpoint of horizontal equity, this is a natural and desirable axiom. However, anonymity cannot be maintained over time when we update social welfare orderings consistently over time.

This does not mean that we should abandon anonymity or axioms of horizontal equity in general just because they are not recurrent under consistent updating. It rather means that the axioms of horizontal equity are meaningful only when they are imposed on a judgment at the initial stage.

What the condition requires is that the structure of the judgment, not particular ethical attitudes, should be recurrent under consistent updating. Thus the second case has something more to bite. Homogeneity says that changing the common scale of individual utilities in the future does not matter. However, changing the scale of future utilities changes the impact of past utilities, which in general results in violating dynamic consistency.

To introduce intra-profile axioms, fix an arbitrary time t and a history \(\mathbf {u}^{t-1}\). The first one is self-explanatory.

  • Order for all \(t\in \mathbb {N}\) and for all \(\mathbf {u}_{t-1} \in \mathbb {R}^{I \times (t-1)}\), \(\succsim _{\mathbf {u}^{t-1}}\) is a complete and transitive ordering over \(L^{I \times [t,\infty ]}\).

Next we consider the axiom that the social ranking over utility streams depends only on individuals’ evaluations of discounted utility (DU). This says that each individual is taken to be maintaining ‘unity’ over time, and their utilities at different periods are summarized by themselves, and the social ranking concerns only such summaries by individuals. One may see the non-triviality of this condition by thinking of a situation in which each individual has different ‘selves’ at different periods and there is a conflict among them, and the social choice may concern the resolution of such conflicts as well.

  • DU-Pareto for all \(t\in \mathbb {N}\), \(\mathbf {u}_{t-1} \in \mathbb {R}^{I\times (t-1)}\) and for all \(\mathbf {u}_{t}, \mathbf {v}_{t} \in L^{I \times [t,\infty ]}\), \(\sum _{\tau =t}^{\infty } \beta ^{\tau -t} u_{i\tau } \ge \sum _{\tau =t}^{\infty } \beta ^{\tau -t} v_{i\tau }\) for all \(i\in I\) implies \(\mathbf {u}_{t} \succsim _{\mathbf {u}^{t-1}} \mathbf {v}_{t}\), and the conclusion is strict if \(\sum _{\tau =t}^{\infty } \beta ^{\tau -t} u_{i\tau } > \sum _{\tau =t}^{\infty } \beta ^{\tau -t} v_{i\tau }\) for some \(i\in I\) in addition.

Under DU-Pareto, one can define a social ranking induced over discounted utilities.

Definition 2

Given t and \(\mathbf {u}^{t-1}\), the DU-welfare ordering \(\succsim _{\mathbf {u}^{t-1}}^{*}\) over \(\mathbb {R}^{I}\) induced by \(\succsim _{\mathbf {u}^{t-1}}\) is defined by

$$\begin{aligned} U_{t}\succsim _{\mathbf {u}^{t-1}}^{*} V_{t} \Longleftrightarrow \mathbf {u}_{t} \succsim _{\mathbf {u}^{t-1}} \mathbf {v}_{t} \end{aligned}$$

for \(\mathbf {u}_{t}, \mathbf {v}_{t} \in L^{I \times \infty }\) with \(U_{t}=\left( \sum _{\tau =t}^{\infty } \beta ^{\tau -t} u_{i\tau }\right) _{i\in I}\) and \(V_{t}=\left( \sum _{\tau =t}^{\infty } \beta ^{\tau -t} v_{i\tau }\right) _{i\in I}\) respectively. The DU-Pareto axiom ensures that this is well-defined.

We consider the following four intra-profile axioms on the induced ranking. Although one can write them directly in terms of the original ranking, this helps simplifying the exposition.

  • DU-continuity for all \(t\in \mathbb {N}\) and for all \(\mathbf {u}_{t-1} \in \mathbb {R}^{I \times (t-1)}\), \(\succsim ^{*}_{\mathbf {u}^{t-1}}\) is a closed subset of \(\mathbb {R}^{I} \times \mathbb {R}^{I}\).

  • DU-inequality aversion for all \(t\in \mathbb {N}\) and for all \(\mathbf {u}_{t-1} \in \mathbb {R}^{I \times (t-1)}\), for all \(U_{t},V_{t}\in \mathbb {R}^{I}\) and \(c \in [0,1]\), \(U_{t}\sim ^{*}_{\mathbf {u}^{t-1}} V_{t}\) implies \(c U_{t}+(1-c)V_{t} \succsim _{\mathbf {u}^{t-1}}^{*} U_{t}\).

  • DU-separability for all \(t\in \mathbb {N}\) and for all \(\mathbf {u}_{t-1} \in \mathbb {R}^{I \times (t-1)}\), for all \(J\subset I\), for all \(U_{Jt},V_{Jt}\in \mathbb {R}^{J}\) and \(U_{-Jt},V_{-Jt}\in \mathbb {R}^{I{\setminus } J}\), \((U_{Jt},U_{-Jt})\succsim ^{*}_{\mathbf {u}^{t-1}} (V_{Jt},U_{-Jt})\) holds if and only if \((U_{Jt},V_{-Jt})\succsim ^{*}_{\mathbf {u}^{t-1}} (V_{Jt},V_{-Jt})\).

  • DU-shift covariance for all \(t\in \mathbb {N}\) and for all \(\mathbf {u}_{t-1} \in \mathbb {R}^{I \times (t-1)}\), for all \(U_{t},V_{t}\in \mathbb {R}^{I}\) and \(c\in \mathbb {R}\), \(U_{t}\succsim ^{*}_{\mathbf {u}^{t-1}} V_{t}\) implies \(U_{t}+c \mathbf {1}\succsim _{\mathbf {u}^{t-1}}^{*} V_{t}+c\mathbf {1}\).

This last, DU-shift covariance, needs explanation. In the static setting, the shift covariance axiom says that adding ‘equal utilities’ to everybody does not change the ranking. In the context of updating over time, this means that adding ‘equal lifetime utilities’ does not change the ranking. While Homogeneity, another prominent independence property, is not recurrent under consistent updating, (DU-)shift covariance can be shown to be recurrent under consistent updating. Note, however, that it relies on the assumption that the individual utility functions fall in the class of stationary discounted utility.Footnote 6

Proposition 1

Order, DU-Pareto, DU-continuity, DU-inequality aversion, DU-separability and DU-shift covariance are recurrent under consistent updating.

The proof of this result and most of the other results are in the “Appendix”.

The following characterization result deals with each time and history separately, which has nothing to do with dynamic consistency yet. Let \(\Delta ^{I} = \left\{ a \in \mathbb {R}^{I}_{+}:\sum _{i\in I} a_{i} =1\right\} \) and \(int \Delta ^{I} \) denote the relative interior of \(\Delta ^{I}\).

Proposition 2

Fix a time t and a history \(\mathbf {u}^{t-1}\). The social welfare ordering \(\succsim _{\mathbf {u}^{t-1}}\) satisfies order, DU-Pareto, DU-continuity, DU-inequality aversion, DU-separability and DU-shift covariance if and only if either of the following two cases holds:

  1. (i)

    There exists \(\lambda (\mathbf {u}^{t-1}) >0\) and a vector \(a(\mathbf {u}^{t-1}) \in int \Delta ^{I}\) such that \(\succsim _{\mathbf {u}^{t-1}}\) can be represented in the form

    $$\begin{aligned} \Phi (\mathbf {u}_{t}|\mathbf {u}^{t-1})=-\sum _{i\in I}a_{i}(\mathbf {u}^{t-1}) \exp \left( -\lambda (\mathbf {u}^{t-1})\sum _{\tau =t}^{\infty }\beta ^{\tau -t}u_{i\tau }\right) . \end{aligned}$$

    We call this class of orderings and representations the exponential class.

  2. (ii)

    There exists a vector \(a(\mathbf {u}^{t-1}) \in int \Delta ^{I}\) such that \(\succsim _{\mathbf {u}^{t-1}}\) can be represented in the form

    $$\begin{aligned} \Phi (\mathbf {u}_{t}|\mathbf {u}^{t-1})=\sum _{i\in I}a_{i}(\mathbf {u}^{t-1}) \sum _{\tau =t}^{\infty }\beta ^{\tau -t}u_{i\tau }. \end{aligned}$$

    We call this class of orderings and representations the additive class.

Moreover, for each t and \(\mathbf {u}^{t-1}\), in case (i), \(a(\mathbf {u}^{t-1})\) and \(\lambda (\mathbf {u}^{t-1})\) are unique, and in case (ii), \(a(\mathbf {u}^{t-1})\) is unique.

Theorem 1

The process of social welfare orderings \(\{\succsim _{\mathbf {u}^{t-1}}\}\) satisfies order, DU-Pareto, DU-continuity, DU-inequality aversion, DU-separability and DU-shift covariance at \(\emptyset \) and dynamic consistency if and only if either

  1. (i)

    \(\{\succsim _{\mathbf {u}^{t-1}}\}\) falls in the exponential class and \(\{a(\mathbf {u}^{t-1})\}\) and \(\{\lambda (\mathbf {u}^{t-1})\}\) follow the updating rule

    $$\begin{aligned} a_{i}(\mathbf {u}^{t-1})= & {} \frac{a_{i}(\emptyset )\exp \left( -\lambda (\emptyset ) \sum _{\tau =0}^{t-1}\beta ^{\tau }u_{i\tau }\right) }{\sum _{j\in I}a_{j}(\emptyset )\exp \left( -\lambda (\emptyset )\sum _{\tau =0}^{t-1}\beta ^{\tau }u_{j\tau }\right) } \end{aligned}$$
    (1)
    $$\begin{aligned} \lambda (\mathbf {u}^{t-1})= & {} \lambda (\emptyset )\beta ^{t} \end{aligned}$$
    (2)

    for all \(\mathbf {u}^{t-1}\), or

  2. (ii)

    \(\{\succsim _{\mathbf {u}^{t-1}}\}\) falls in the additive class and \((a_{i}(\mathbf {u}^{t}))_{i\in I}\) follows

    $$\begin{aligned} a_{i}(\mathbf {u}^{t-1})= a_{i}(\emptyset ) \end{aligned}$$
    (3)

    for all \(\mathbf {u}^{t-1}\).

The theorem says that if the social ranking has a concern for equity, then it must incorporate past utilities when updating welfare weights as described by formula (1), while the equity concern must decrease over time according to the exponential order as described by formula (2). There, higher (lower) past utilities must be ‘compensated’ in the way that they decrease (increase) welfare weights in the exponential manner. This is the way how we maintain compatibility between the concept of ex ante equity and the concept of ex post equity. It also says that when the social ranking has no equity concern, welfare weights must be constant over time as in formula (3), and does not take past utilities into account.

This implies that when we require that social rankings should ignore the past, the only possibility is additive aggregation with weights being constant over time.

  • Independence of past utilities for all t and for all \(\mathbf {u}^{t-1}\) and \(\widetilde{\mathbf {u}}^{t-1}\), \(\succsim _{\mathbf {u}^{t-1}}=\succsim _{\widetilde{\mathbf {u}}^{t-1}}\).

Corollary 1

Suppose that the process of social welfare orderings \(\{\succsim _{\mathbf {u}^{t-1}}\}\) satisfies dynamic consistency and falls in the class as characterized in Proposition 2. Then it satisfies the independence of past utilities if and only if it falls in the additive class with \(a(\mathbf {u}^{t-1})= a(\emptyset )\) for all \(\mathbf {u}^{t-1}\).

2.2 Consistent updating under heterogeneous discounting

If we attempt to extend the previous argument on aggregating discounted utility to the case of heterogeneous discounting described by \((\beta _{i})_{i\in I}\), we have

$$\begin{aligned} \Phi (\mathbf {u}_{0}|\emptyset )= & {} -\sum _{i\in I}a_{i}(\emptyset ) \exp \left( -\lambda (\emptyset )\sum _{\tau =0}^{\infty }\beta _{i}^{\tau }u_{i\tau }\right) \\= & {} -\sum _{i\in I}a_{i}(\emptyset )\exp \left( -\lambda (\emptyset )\sum _{\tau =0}^{t-1}\beta _{i}^{\tau }u_{i\tau }\right) \exp \left( -\lambda (\emptyset )\beta _{i}^{t} \sum _{\tau =t}^{\infty }\beta _{i}^{\tau -t}u_{i\tau }\right) , \end{aligned}$$

which in general cannot represent the same ranking over \(\mathbf {u}_{t}\) as \(\Phi (\mathbf {u}_{t}|\mathbf {u}^{t-1})= -\sum _{i\in I}a_{i}(\mathbf {u}^{t-1}) \exp \left( -\lambda (\mathbf {u}^{t-1})\sum _{\tau =t}^{\infty }\beta ^{\tau -t}_{i}u_{i\tau }\right) \) does, because the term \(\lambda (\emptyset )\beta _{i}^{t}\) is not the same for different individuals.

Therefore, the exponential class is not closed under updating under heterogeneous discounting, and we have the following result.

Proposition 3

Consider the class of social welfare functions as obtained in Proposition 2. Under strictly heterogeneous discounting, it satisfies dynamic consistency if and only if the social welfare function is limited to the additive class with \(a(\mathbf {u}^{t-1})= a(\emptyset )\) for all \(\mathbf {u}^{t-1}\).

The reason is that DU-shift covariance is not in general recurrent under consistent updating when discounting is heterogenous. However, a weaker version of DU-shift covariance can be shown to be recurrent under consistent updating.

  • DU-general shift covariance for all \(t\in \mathbb {N}\) and for all \(\mathbf {u}_{t-1} \in \mathbb {R}^{I \times (t-1)}\), there exists a \(W_{\mathbf {u}^{t-1}}\in \mathbb {R}^{I}_{++}\) such that for all \(U_{t},V_{t}\in \mathbb {R}^{I}\) and \(c\in \mathbb {R}\), \(U_{t}\succsim ^{*}_{\mathbf {u}^{t-1}} V_{t}\) implies \(U_{t}+c W_{\mathbf {u}^{t-1}} \succsim _{\mathbf {u}^{t-1}}^{*} V_{t}+cW_{\mathbf {u}^{t-1}}\).

Proposition 4

Under heterogeneous discounting, order, DU-Pareto, DU-continuity, DU-inequality aversion, DU-separability and DU-general shift covariance are recurrent under consistent updating.

Proposition 5

For each fixed time t and history \(\mathbf {u}^{t-1}\), the social welfare ordering \(\succsim _{\mathbf {u}^{t-1}}\) satisfies order, DU-Pareto, DU-continuity, DU-inequality aversion, DU-separability and DU-general shift covariance if and only if either of the following two cases holds:

  1. (i)

    there exists a vector \(\lambda (\mathbf {u}^{t-1}) \in \mathbb {R}_{++}^{I}\) and a vector \(a(\mathbf {u}^{t-1}) \in int \Delta ^{I}\) such that \(\succsim _{\mathbf {u}^{t-1}}\) can be represented in the form

    $$\begin{aligned} \Phi (\mathbf {u}_{t}|\mathbf {u}^{t-1})=-\sum _{i\in I}a_{i}(\mathbf {u}^{t-1}) \exp \left( -\lambda _{i}(\mathbf {u}^{t-1})\sum _{\tau =t}^{\infty }\beta _{i}^{\tau -t}u_{i\tau }\right) . \end{aligned}$$

    We call this class of orderings and representations the generalized exponential class.

  2. (ii)

    there exists a vector \(a(\mathbf {u}^{t-1}) \in int \Delta ^{I}\) such that \(\succsim _{\mathbf {u}^{t-1}}\) can be represented in the form

    $$\begin{aligned} \Phi (\mathbf {u}_{t}|\mathbf {u}^{t-1})=-\sum _{i\in I}a_{i}(\mathbf {u}^{t-1}) \sum _{\tau =t}^{\infty }\beta _{i}^{\tau -t}u_{i\tau }. \end{aligned}$$

    We call this class of orderings and representations the generalized additive class.

Moreover, for each t and \(\mathbf {u}^{t-1}\), in case (i) \(a(\mathbf {u}^{t-1})\) and \(\lambda (\mathbf {u}^{t-1})\) are unique and in case (ii) \(a(\mathbf {u}^{t-1})\) is unique.

Theorem 2

The process of social welfare orderings \(\{\succsim _{\mathbf {u}^{t-1}}\}\) satisfies order, DU-Pareto, DU-continuity, DU-inequality aversion, DU-separability and DU-general shift covariance at \(\emptyset \) and dynamic consistency if and only if either

  1. (i)

    \(\{\succsim _{\mathbf {u}^{t-1}}\}\) falls in the generalized exponential class and \(\{a(\mathbf {u}^{t-1})\}\) and \(\{\lambda (\mathbf {u}^{t-1})\}\) follow the updating rule

    $$\begin{aligned} a_{i}(\mathbf {u}^{t-1})= & {} \frac{a_{i}(\emptyset )\exp \left( -\lambda _{i}(\emptyset ) \sum _{\tau =0}^{t-1}\beta _{i}^{\tau }u_{i\tau }\right) }{\sum _{j\in I}a_{j}(\emptyset )\exp \left( -\lambda _{j}(\emptyset )\sum _{\tau =0}^{t-1}\beta _{j}^{\tau }u_{j\tau }\right) } \end{aligned}$$
    (4)
    $$\begin{aligned} \lambda _{i}(\mathbf {u}^{t-1})= & {} \lambda _{i}(\emptyset )\beta _{i}^{t} \end{aligned}$$
    (5)

    for all \(\mathbf {u}^{t-1}\), or

  2. (ii)

    \(\{\succsim _{\mathbf {u}^{t-1}}\}\) falls in the generalized additive class and \((a_{i}(\mathbf {u}^{t}))_{i\in I}\) follows the updating rule

    $$\begin{aligned} a_{i}(\mathbf {u}^{t-1})= \frac{a_{i}(\emptyset )\beta _{i}^{t}}{\sum _{j\in I} a_{j}(\emptyset )\beta _{j}^{t}} \end{aligned}$$
    (6)

    for all \(\mathbf {u}^{t-1}\).

The theorem says that if the social ranking has equity concern then it must incorporate past utilities when updating welfare weights as described by formula (4), while the equity concern on each individual decreases over time according to her discount factor as in formula (5). There, higher (lower) past utilities must be ‘compensated’ in the way that they decrease (increase) welfare weights in the exponential manner. It also says that when the social ranking has no equity concern welfare weights evolve according to individuals’ discount factors as in formula (6), which depends on the time index but does not take past utilities into account.

Let us come back to the Ant/Grasshopper story here. Since Anonymity or the axiom of horizontal equity cannot be recurrent under consistent updating our ex post welfare judgment cannot support the idea of ex post redistribution in the complete sense. However, as the vector of initial degree of inequality aversion: \(\lambda (\emptyset )\) is larger when the updated ones are larger as well, implying that we are more supportive of the idea of redistribution.

The above result implies that when we require that social rankings should ignore past utilities, the only possibility is additive aggregation.

Corollary 2

Suppose that the process of social welfare orderings \(\{\succsim _{\mathbf {u}^{t-1}}\}\) falls in the class as characterized in Proposition 5. Then it satisfies dynamic consistency and independence of past utilities hold if and only if it falls in the generalized additive class in which the updating of welfare weights follows

$$\begin{aligned} a_{i}(\mathbf {u}^{t-1})= \frac{a_{i}(\emptyset )\beta _{i}^{t}}{\sum _{j\in I} a_{j}(\emptyset )\beta _{j}^{t}}. \end{aligned}$$

for all \(\mathbf {u}^{t-1}\).

Notice that under heterogeneous discounting, even if there is no equity concern, the social welfare function still must depend on histories in the sense that when the history started should make a difference.

  • Independence of past utilities and time for all ts and for all \(\mathbf {u}^{t-1}\) and \(\widetilde{\mathbf {u}}^{s-1}\), \(\succsim _{\mathbf {u}^{t-1}}=\succsim _{\widetilde{\mathbf {u}}^{s-1}}\).

Corollary 3

Under strictly heterogeneous discounting, there is no process of social welfare orderings which satisfies dynamic consistency and independence of past utilities and time.

3 Consistent updating under uncertainty

3.1 Consistent updating under homogeneous beliefs

In this section, we consider the problem of aggregation and updating along with the resolution of uncertainty. Here we limit attention to the case of common beliefs held by all individuals. In a later section, we will extend the argument to the case of heterogeneous beliefs.

As before, let I be the set of individuals with \(|I|\ge 3\). Let \(\Omega \) be the set of states of the world, which is a measure space with a common prior p. Given a measurable subset \(E\subset \Omega \) and a subset of individuals \(J\subset I\), let \(L^{J\times E}\) be the set of integrable functions from E to \(\mathbb {R}^{J}\), which is interpreted as a set of random utility profiles for group J conditional on E.

Let \(\{{\mathcal {E}}_{t}\}\) be a refining sequence of partitions of \(\Omega \), which describes how uncertainties are resolved over time. Assume that \(p(E_{t})>0\) for all t and \(E_{t}\in {\mathcal {E}}_{t}\). Given t and \(E_{t}\in {\mathcal {E}}_{t}\), let \(\mathbf {u}_{-E_{t}} \in L^{I\times (\Omega {\setminus } E_{t})}\) be an integrable random utility profile defined over \(\Omega {\setminus } E_{t}\), which describes what individuals’ utilities would have been if \(E_{t}\) had not occurred. Denote the initial point with no such foregone utilities by \(\emptyset \). For simplicity, we restrict attention to utility values at final states, which can easily be combined with the argument with intertemporal consumption as presented in the previous section. Given \(E_{t}\in {\mathcal {E}}_{t}\), let \(p(\cdot |E_{t})\) be the Bayesian update of p conditional on \(E_{t}\).

For each possible \(\mathbf {u}_{-E_{t}}\), let \(\succsim _{\mathbf {u}_{-E_{t}}}\) be the social welfare ordering over \(L^{I \times E_{t}}\) conditional on \(\mathbf {u}_{-E_{t}}\). Given \(\mathbf {u}_{E_{t}}, \mathbf {v}_{E_{t}} \in L^{I \times E_{t}}\), the ranking may be written for example as \(\mathbf {u}_{E_{t}} \succsim _{\mathbf {u}_{-E_{t}}} \mathbf {v}_{E_{t}}\). Let \(\{\succsim _{\mathbf {u}_{-E_{t}}}:t\in \mathbb {N}, E_{t}\in {\mathcal {E}}_{t}, \text {and} \mathbf {u}_{-E_{t}} \in L^{I \times (\Omega {\setminus } E_{t})}\}\) be a process of such social welfare orderings.

Note that when \(\succsim _{\mathbf {u}_{-E_{t}}}\), the foregone utility profile \(\mathbf {u}_{-E_{t}}\) that would have obtained if event \(E_{t}\) had not occurred is taken to be a commitment plan. To see this, imagine that Benjamin counterargues against Mom’s claim ‘You had your chance!’ as follows: ‘But if I had won the flip Abigail would have claimed the same thing as I did and you would have accepted her claim, so why not flip the coin again for me as well?’ This will lead to a hierarchical argument that the decision at the current event depends on the decision at the counterfactual event, which again depends on the decision at the current event and so on, but we avoid this problem in the current paper.

We will consider two kinds of axioms. One kind describe the properties of social welfare criteria for which it is desirable that they be satisfied at each time and possible event, investigate whether they are recurrent under consistent updating with the passage of time and the realization of uncertainty. We call such axioms intra-profile axioms. The other kind describe the relations between social welfare orderings indexed with different times and possible events. We call such axioms inter-profile axioms.

Dynamic consistency is the inter-profile axiom we impose throughout the analysis. Given t and \(E_{t}\in {\mathcal {E}}_{t}\), let \({\mathcal {E}}_{t+1} \cap E_{t}=\{E_{t+1} \in {\mathcal {E}}_{t+1} :E_{t+1} \subset E_{t}\}\).

  • Dynamic consistency for all t, \(E_{t}\) and \(\mathbf {u}_{-E_{t}}\), for all \(E_{t+1}\in {\mathcal {E}}_{t+1}\cap E_{t}\) and \(\mathbf {u}_{E_{t}{\setminus } E_{t+1}} \in L^{I\times (E_{t}{\setminus } E_{t+1})}\), and for all \(\mathbf {u}_{E_{t+1}}, \mathbf {v}_{E_{t+1}} \in L^{I\times E_{t+1}}\),

    $$\begin{aligned} (\mathbf {u}_{E_{t+1}}, \mathbf {u}_{E_{t}{\setminus } E_{t+1}} ) \succsim _{\mathbf {u}_{-E_{t}}} (\mathbf {v}_{E_{t+1}}, \mathbf {u}_{E_{t}{\setminus } E_{t+1}} ) \Longleftrightarrow \mathbf {u}_{E_{t+1}} \succsim _{\mathbf {u}_{-E_{t}}, \mathbf {u}_{E_{t}{\setminus } E_{t+1}} } \mathbf {v}_{E_{t+1}}. \end{aligned}$$

We define the condition that an intra-profile axiom is recurrent under consistent updating over time.

Definition 3

An intra-profile axiom is said to be recurrent under consistent updating if for any process of social welfare orderings \(\{\succsim _{\mathbf {u}_{-E_{t}}}\}\) satisfying dynamic consistency, for all t, \(E_{t}\) and \(\mathbf {u}_{-E_{t}}\), for all \(E_{t+1}\in {\mathcal {E}}_{t+1}\cap E_{t}\) and \(\mathbf {u}_{E_{t}{\setminus } E_{t+1}} \in L^{I\times (E_{t}{\setminus } E_{t+1})}\), if it holds for \(\succsim _{\mathbf {u}_{-E_{t}}} \), then it holds for \(\succsim _{\mathbf {u}_{-E_{t}}, \mathbf {u}_{E_{t}{\setminus } E_{t+1}} } \).

Example 4

The following intra-profile axioms are not recurrent under consistent updating.

  • Anonymity at given t, \(E_{t}\) and \(\mathbf {u}_{-E_{t}}\): for all \(\mathbf {u}_{E_{t}}, \mathbf {v}_{E_{t}} \in L^{I \times E_{t}}\), and any permutation \(\pi :I\rightarrow I\),

    $$\begin{aligned} \mathbf {u}_{E_{t}} \succsim _{\mathbf {u}_{-E_{t}}} \mathbf {v}_{E_{t}} \Longleftrightarrow \mathbf {u}_{E_{t}}^{\pi } \succsim _{\mathbf {u}_{-E_{t}}} \mathbf {v}_{E_{t}}^{\pi }, \end{aligned}$$

    where \(\mathbf {u}_{E_{t}}^{\pi }\) is given by \(u_{i, E_{t}}^{\pi }=u_{\pi ^{-1}(i), E_{t}}\) for all i.

  • Homogeneity at given t, \(E_{t}\) and \(\mathbf {u}_{-E_{t}}\): for all \(\mathbf {u}_{E_{t}}, \mathbf {v}_{E_{t}} \in L^{I \times E_{t}}\), and for all non-negative number c,

    $$\begin{aligned} \mathbf {u}_{E_{t}} \succsim _{\mathbf {u}_{-E_{t}}} \mathbf {v}_{E_{t}} \Longleftrightarrow c\mathbf {u}_{E_{t}} \succsim _{\mathbf {u}_{-E_{t}}} c\mathbf {v}_{E_{t}}. \end{aligned}$$

To see that these axioms are not recurrent under updating, consider the following social welfare function at \(\mathbf {u}_{-E_{t}}\):

$$\begin{aligned} \Phi (\mathbf {u}_{E_{t}}|\mathbf {u}_{-E_{t}})=\min _{i\in I} \int _{E_{t}}u_{i}(s)p(ds|E_{t}). \end{aligned}$$

This obviously satisfies anonymity and homogeneity at \(\mathbf {u}_{-E_{t}}\). However, its consistent updating \(\Phi (\mathbf {u}_{E_{t+1}}|\mathbf {u}_{-E_{t}}, \mathbf {u}_{E_{t}{\setminus } E_{t+1}} )\) at \((\mathbf {u}_{-E_{t}}, \mathbf {u}_{E_{t}{\setminus } E_{t+1}} )\) must be ordinally equivalent to the function

$$\begin{aligned} \min _{i\in I} \left( \int _{E_{t}{\setminus } E_{t+1}}u_{i}(s)p(ds) +\int _{E_{t+1}}u_{i}(s)p(ds) \right) , \end{aligned}$$

which does not satisfy anonymity or homogeneity as a ranking over \(\mathbf {u}_{E_{t+1}}\).

Violating the first of these two axioms means something different from violating the other one.

Anonymity says that identity of an individual does not matter in evaluating still possible utilities. However, in the presence of events which did not occur but might have delivered different utilities to different individuals, the identity of the individuals is indispensable for consistent updating.

Again, this does not mean that we should abandon anonymity or axioms of horizontal equity in general just because they are not recurrent under consistent updating. It rather means that the axioms of horizontal equity are meaningful only when they are imposed on a judgment at the initial stage.

What the condition requires is that the structure of the judgment, not particular ethical attitudes, should be recurrent under consistent updating. Thus the second case has something more to bite. Homogeneity says that changing the common scale of individual utilities in the future does not matter. However, changing the scale of future utilities changes the impact of past utilities, which in general results in violating dynamic consistency.

To introduce intra-profile axioms, fix an arbitrarily time t, \(E_{t}\) and \(\mathbf {u}_{-E_{t}}\). The first one is self-explanatory.

  • Order for all \(t\in \mathbb {N}\), \(E_{t}\in {\mathcal {E}}_{t}\) and \(\mathbf {u}_{-E_{t}} \in L^{I\times (\Omega {\setminus } E_{t})}\), \(\succsim _{\mathbf {u}_{-E_{t}}}\) is complete and transitive ordering over \(L^{I \times E_{t}}\).

Next we require that the social ranking over utility streams depends only on individuals’ expected utility (henceforth EU) evaluations. This says that each individual is taken to be maintaining ‘unity’ over time, and their utilities at different periods are summarized by themselves, and the social ranking concerns only such summaries. One may see the non-triviality of this condition by thinking for example of treating inequalities across both individuals and states in tandem, which is excluded by the axiom (see the discussions in Ben-Porath et al. 1997).

  • EU-Pareto for all \(t\in \mathbb {N}\), \(E_{t}\in {\mathcal {E}}_{t}\) and \(\mathbf {u}_{-E_{t}} \in L^{I\times (\Omega {\setminus } E_{t})}\), for all \(\mathbf {u}_{E_{t}}, \mathbf {v}_{E_{t}} \in L^{I \times E_{t}}\), \(\int _{E_{t}}u_{i}(s)p(ds|E_{t}) \ge \int _{E_{t}}v_{i}(s)p(ds|E_{t})\) for all \(i\in I\) implies \(\mathbf {u}_{E_{t}} \succsim _{\mathbf {u}_{-E_{t}}} \mathbf {v}_{E_{t}}\), and the conclusion is strict if \(\int _{E_{t}}u_{i}(s)p(ds|E_{t}) > \int _{E_{t}}v_{i}(s)p(ds|E_{t})\) for some \(i\in I\) in addition.

Under EU-Pareto, one can define a social ranking induced over expected utilities.

Definition 4

Given \(t\in \mathbb {N}\), \(E_{t}\in {\mathcal {E}}_{t}\) and \(\mathbf {u}_{-E_{t}} \in L^{I\times (\Omega {\setminus } E_{t})}\), the EU-welfare ordering \(\succsim _{\mathbf {u}_{-E_{t}}}^{*}\) over \(\mathbb {R}^{I}\) induced by \(\succsim _{\mathbf {u}_{-E_{t}}}\) is defined by

$$\begin{aligned} U_{E_{t}}\succsim _{\mathbf {u}_{-E_{t}}}^{*} V_{E_{t}} \Longleftrightarrow \mathbf {u}_{E_{t}} \succsim _{\mathbf {u}_{-E_{t}}} \mathbf {v}_{E_{t}} \end{aligned}$$

for \(\mathbf {u}_{E_{t}}, \mathbf {v}_{E_{t}} \in L^{I \times E_{t}}\) with \(U_{E_{t}}= \big (\int _{E_{t}}u_{i}(s)p(ds|E_{t})\big )_{i\in I}\) and \(V_{E_{t}}= \big (\int _{E_{t}}v_{i}(s)p(ds|E_{t})\big )_{i\in I}\) respectively. The EU-Pareto axiom ensures that this is well-defined.

We impose the following four intra-profile axioms on the induced ranking. Although one can write them down directly in terms of the original ranking, this helps simplifying the exposition.

  • EU-continuity for all \(t\in \mathbb {N}\), \(E_{t}\in {\mathcal {E}}_{t}\) and \(\mathbf {u}_{-E_{t}} \in L^{I\times (\Omega {\setminus } E_{t})}\), \(\succsim ^{*}_{\mathbf {u}_{-E_{t}}}\) is a closed subset of \(\mathbb {R}^{I} \times \mathbb {R}^{I}\).

  • EU-inequality aversion for all \(t\in \mathbb {N}\), \(E_{t}\in {\mathcal {E}}_{t}\) and \(\mathbf {u}_{-E_{t}} \in L^{I\times (\Omega {\setminus } E_{t})}\), for all \(U_{E_{t}},V_{E_{t}}\in \mathbb {R}^{I}\) and \(c \in [0,1]\), \(U_{E_{t}}\sim ^{*}_{\mathbf {u}_{-E_{t}}} V_{E_{t}}\) implies \(c U_{E_{t}}+(1-c)V_{E_{t}} \succsim _{\mathbf {u}_{-E_{t}}}^{*} V_{E_{t}}\).

  • EU-separability for all \(t\in \mathbb {N}\), \(E_{t}\in {\mathcal {E}}_{t}\) and \(\mathbf {u}_{-E_{t}} \in L^{I\times (\Omega {\setminus } E_{t})}\), for all \(J\subset I\), for all \(U_{J,E_{t}},V_{J,E_{t}}\in \mathbb {R}^{J}\) and \(U_{-J,E_{t}},V_{-J,E_{t}}\in \mathbb {R}^{I{\setminus } J}\), \((U_{J,E_{t}},U_{-J,E_{t}})\succsim ^{*}_{\mathbf {u}_{-E_{t}}} (V_{J,E_{t}},U_{-J,E_{t}})\) holds if and only if \((U_{J,E_{t}},V_{-J,E_{t}})\succsim ^{*}_{\mathbf {u}_{-E_{t}}} (V_{J,E_{t}},V_{-J,E_{t}})\).

  • EU-shift covariance for all \(t\in \mathbb {N}\), \(E_{t}\in {\mathcal {E}}_{t}\) and \(\mathbf {u}_{-E_{t}} \in L^{I\times (\Omega {\setminus } E_{t})}\), for all \(U_{E_{t}},V_{E_{t}}\in \mathbb {R}^{I}\) and \(c\in \mathbb {R}\), \(U_{E_{t}}\succsim ^{*}_{\mathbf {u}_{-E_{t}}} V_{E_{t}}\) implies \(U_{E_{t}}+c \mathbf {1}\succsim _{\mathbf {u}_{-E_{t}}}^{*} V_{E_{t}}+c\mathbf {1}\).

This last, EU-shift covariance, needs some explanation. In the static setting, the shift covariance axioms say that adding ‘equal utilities’ to everybody does not change the ranking. In the context of updating under uncertainty, it means that adding ‘equal expected utilities’ does not change the ranking. While Homogeneity, another prominent independence property, is not recurrent under consistent updating, (EU-)shift covariance can be shown to be recurrent under consistent updating. Note, however, that it relies on the assumption that the individual utility functions fall in the class of expected utility.

Proposition 6

Order, EU-Pareto, EU-continuity, EU-inequality aversion, EU-separability and EU-shift covariance are recurrent under consistent updating.

Proposition 7

Fix t, \(E_{t}\) and \(\mathbf {u}_{-E_{t}}\). The social welfare ordering \(\succsim _{\mathbf {u}_{-E_{t}}}\) satisfies order, EU-Pareto, EU-continuity, EU-inequality aversion, EU-separability and EU-shift covariance if and only if either of the following two cases holds:

  1. (i)

    there exists \(\lambda (\mathbf {u}_{-E_{t}}) >0\) and a vector \(a(\mathbf {u}_{-E_{t}}) \in int \Delta ^{I}\) such that \(\succsim _{\mathbf {u}_{-E_{t}}}\) can be represented in the form

    $$\begin{aligned} \Phi (\mathbf {u}_{E_{t}}|\mathbf {u}_{-E_{t}})=-\sum _{i\in I}a_{i}(\mathbf {u}_{-E_{t}}) \exp \left( -\lambda (\mathbf {u}_{-E_{t}})\int _{E_{t}}u_{i}(s)p(ds|E_{t})\right) . \end{aligned}$$

    We call this class of orderings and representations the exponential class.

  2. (ii)

    there exists a vector \(a(\mathbf {u}_{-E_{t}}) \in int \Delta ^{I}\) such that \(\succsim _{\mathbf {u}_{-E_{t}}}\) can be represented in the form

    $$\begin{aligned} \Phi (\mathbf {u}_{E_{t}}|\mathbf {u}_{-E_{t}})=\sum _{i\in I}a_{i}(\mathbf {u}_{-E_{t}}). \int _{E_{t}}u_{i}(s)p(ds|E_{t}) \end{aligned}$$

    We call this class of orderings and representations the additive class.

Moreover, for each t and \(\mathbf {u}_{-E_{t}}\), in case (i) \(a(\mathbf {u}_{-E_{t}})\) and \(\lambda (\mathbf {u}_{-E_{t}})\) are unique and in case (ii) \(a(\mathbf {u}_{-E_{t}})\) is unique.

Theorem 3

The process of social welfare orderings \(\{\succsim _{\mathbf {u}_{-E_{t}}}\}\) satisfies order, EU-Pareto, EU-continuity, EU-inequality aversion, EU-separability and EU-shift covariance at \(\emptyset \) and dynamic consistency if and only if either

  1. (i)

    \(\{\succsim _{\mathbf {u}_{-E_{t}}}\}\) falls in the exponential class and \(\{a(\mathbf {u}_{-E_{t}})\}\) and \(\{\lambda (\mathbf {u}_{-E_{t}})\}\) follow the updating rule

    $$\begin{aligned} a_{i} (\mathbf {u}_{-E_{t}})= & {} \frac{a_{i}(\emptyset ) \exp \left( -\lambda (\emptyset ) \int _{\Omega {\setminus } E_{t}}u_{i}(s)p(ds)\right) }{\sum _{j\in I} a_{j}(\emptyset ) \exp \left( -\lambda (\emptyset ) \int _{\Omega {\setminus } E_{t}}u_{j}(s)p(ds)\right) } \end{aligned}$$
    (7)
    $$\begin{aligned} \lambda (\mathbf {u}_{-E_{t}})= & {} \lambda (\emptyset )p(E_{t}) \end{aligned}$$
    (8)

    for all \(\mathbf {u}_{-E_{t}}\), or

  2. (ii)

    \(\{\succsim _{\mathbf {u}_{-E_{t}}}\}\) falls in the additive class and \(\{a(\mathbf {u}_{-E_{t}})\}\) follows

    $$\begin{aligned} a_{i}(\mathbf {u}_{-E_{t}})= a_{i}(\emptyset ) \end{aligned}$$
    (9)

    for all \(\mathbf {u}_{-E_{t}}\).

The theorem says that if the social ranking has a concern for equity, then it must incorporate past utilities which turned out not to have occurred when updating welfare weights as described by formula (7), while the equity concern must decrease as we acquire more and more information about the true state of nature, according to formula (8). There, higher (lower) utilities which turned out not to have occurred must be ‘compensated’ for in the way that they decrease (increase) welfare weights conditional on the present event. This is the way how we maintain compatibility between the idea of fair chance ex ante and the idea of fair chance ex post. It also says that when the social ranking has no equity concern welfare weights must be constant as in formula (9), and it does not take utilities into account which turned out not to have occurred.

This implies that when we require that social rankings should ignore utilities which turned out not to have occurred, the only possibility is additive aggregation with weights’ being constant.

  • Independence of counterfactual consequences for all t, for all \(E_{t}\), for all \(\mathbf {u}_{-E_{t}}\) and \(\widetilde{\mathbf {u}}_{-E_{t}}\), and for all \(U,V\in \mathbb {R}^{I}\),

    $$\begin{aligned} (U_{i}\mathbf {1}_{E_{t}})_{i\in I} \succsim _{\mathbf {u}_{-E_{t}}} (V_{i}\mathbf {1}_{E_{t}})_{i\in I} \Longleftrightarrow (U_{i}\mathbf {1}_{E_{t}})_{i\in I} \succsim _{\widetilde{\mathbf {u}}_{-E_{t}}} (V_{i}\mathbf {1}_{E_{t}})_{i\in I} . \end{aligned}$$

Corollary 4

Suppose that the process of social welfare orderings \(\{\succsim _{\mathbf {u}_{-E_{t}}}\}\) satisfying Dynamic Consistency falls in the class as characterized in Proposition 7. Then it satisfies independence of counterfactual consequences if and only if it falls in the additive class with \(a(\mathbf {u}_{-E_{t}})=a(\emptyset )\) for all \(\mathbf {u}_{-E_{t}}\).

3.2 Consistent updating under heterogeneous beliefs

Now consider heterogeneous priors \((p_{i})_{i\in I}\), assuming that \(p_{i}(E_{t})>0\) for all i, t and \(E_{t}\in {\mathcal {E}}_{t}\). If we attempt to extend the previous argument on aggregating expected utility to the case of heterogeneous beliefs, for the exponential class we have

$$\begin{aligned} \Phi (\mathbf {u}_{\Omega }|\emptyset )= & {} -\sum _{i\in I}a_{i}(\emptyset ) \exp \left( -\lambda (\emptyset )\int _{\Omega }u_{i}(s)p_{i}(ds)\right) \\= & {} -\sum _{i\in I}a_{i}(\emptyset ) \exp \left( -\lambda (\emptyset ) \int _{\Omega {\setminus } E_{t}}u_{i}(s)p_{i}(ds)\right) \\&\times \exp \left( -\lambda (\emptyset ) p_{i}(E_{t} )\int _{E_{t}}u_{i}(s)p_{i}(ds|E_{t})\right) , \end{aligned}$$

which in general cannot represent the same ranking over \(\mathbf {u}_{E_{t}}\) as

$$\begin{aligned} \Phi (\mathbf {u}_{E_{t}}|\mathbf {u}_{-E_{t}})= -\sum _{i\in I}a_{i}(\mathbf {u}_{-E_{t}}) \exp \left( -\lambda (\mathbf {u}_{-E_{t}})\int _{E_{t}}u_{i}(s)p_{i}(ds|E_{t})\right) \end{aligned}$$

does, because the term \(\lambda (\emptyset ) p_{i}(E_{t})\) is not independent of the individual.

Therefore, the exponential class is not closed under updating with heterogeneous beliefs, and we have the following result.

Proposition 8

Consider the class of social welfare functions as obtained in Proposition 7. Under strictly heterogeneous beliefs, it satisfies dynamic consistency if and only if the social welfare function is limited to the additive class with \(a(\mathbf {u}_{-E_{t}})= a(\emptyset )\) for all \(\mathbf {u}_{-E_{t}}\).

The reason is that EU-shift covariance is not in general recurrent under consistent updating when beliefs are heterogeneous. However, a weaker version of EU-shift covariance is shown to be recurrent under consistent updating.

  • EU-general shift covariance for all \(t\in \mathbb {N}\), for all \(E_{t} \in {\mathcal {E}}_{t}\) and for all \(\mathbf {u}_{-E_{t}} \in L^{I \times (\Omega {\setminus } E_{t})}\), there exists \(W_{\mathbf {u}_{-E_{t}}}\in \mathbb {R}^{I}_{++}\) such that for all \(U_{E_{t}},V_{E_{t}}\in \mathbb {R}^{I}\) and \(c\in \mathbb {R}\), \(U_{E_{t}}\succsim ^{*}_{\mathbf {u}_{-E_{t}}} V_{E_{t}}\) implies \(U_{E_{t}}+c W_{\mathbf {u}_{-E_{t}}} \succsim _{\mathbf {u}_{-E_{t}}}^{*} V_{E_{t}}+c W_{\mathbf {u}_{-E_{t}}}\).

Proposition 9

Under heterogeneous beliefs, order, EU-Pareto, EU-continuity, EU-inequality aversion, EU-separability and EU-general shift covariance are recurrent under consistent updating.

Proposition 10

Fix t, \(E_{t}\) and \(\mathbf {u}_{-E_{t}}\). The social welfare ordering \(\succsim _{\mathbf {u}_{-E_{t}}}\) satisfies order, EU-Pareto, EU-continuity, EU-inequality aversion, EU-separability and EU-general shift covariance if and only if either of the following two cases holds:

  1. (i)

    there exists a vector \(\lambda (\mathbf {u}_{-E_{t}}) \in \mathbb {R}_{++}^{I}\) and a vector \(a(\mathbf {u}_{-E_{t}}) \in int \Delta ^{I}\) such that \(\succsim _{\mathbf {u}_{-E_{t}}}\) can be represented in the form

    $$\begin{aligned} \Phi (\mathbf {u}_{E_{t}}|\mathbf {u}_{-E_{t}})=-\sum _{i\in I}a_{i}(\mathbf {u}_{-E_{t}}) \exp \left( -\lambda _{i}(\mathbf {u}_{-E_{t}})\int _{E_{t}}u_{i}(s)p_{i}(ds|E_{t})\right) . \end{aligned}$$

    We call this class of orderings and representations the generalized exponential class.

  2. (ii)

    there exists a vector \(a(\mathbf {u}_{-E_{t}}) \in int \Delta ^{I}\) such that \(\succsim _{\mathbf {u}_{-E_{t}}}\) can be represented in the form

    $$\begin{aligned} \Phi (\mathbf {u}_{E_{t}}|\mathbf {u}_{-E_{t}})=-\sum _{i\in I}a_{i}(\mathbf {u}_{-E_{t}}) \int _{E_{t}}u_{i}(s)p_{i}(ds|E_{t}). \end{aligned}$$

    We call this class of orderings and representations the generalized additive class.

Moreover, for each t and \(\mathbf {u}_{-E_{t}}\), in case (i) \(a(\mathbf {u}_{-E_{t}})\) and \(\lambda (\mathbf {u}_{-E_{t}})\) are unique and in case (ii) \(a(\mathbf {u}_{-E_{t}})\) is unique.

Theorem 4

The process of social welfare orderings \(\{\succsim _{\mathbf {u}_{-E_{t}}}\}\) satisfies order, EU-Pareto, EU-continuity, EU-inequality aversion, EU-separability and EU-general shift covariance at \(\emptyset \) and dynamic consistency if and only if either

  1. (i)

    \(\{\succsim _{\mathbf {u}_{-E_{t}}}\}\) falls in the generalized exponential class and \(\{a(\mathbf {u}_{-E_{t}})\}\) and \(\{\lambda (\mathbf {u}_{-E_{t}})\}\) follow the updating rule

    $$\begin{aligned} a_{i} (\mathbf {u}_{-E_{t}})= & {} \frac{a_{i}(\emptyset ) \exp \left( -\lambda _{i}(\emptyset ) \int _{\Omega {\setminus } E_{t}}u_{i}(s)p_{i}(ds)\right) }{\sum _{j\in I} a_{j}(\emptyset ) \exp \left( -\lambda _{j}(\emptyset ) \int _{\Omega {\setminus } E_{t}}u_{j}(s)p_{j}(ds)\right) } \end{aligned}$$
    (10)
    $$\begin{aligned} \lambda _{i}(\mathbf {u}_{-E_{t}})= & {} \lambda (\emptyset )p_{i}(E_{t}) \end{aligned}$$
    (11)

    for all \(\mathbf {u}_{-E_{t}}\), or

  2. (ii)

    \(\{\succsim _{\mathbf {u}_{-E_{t}}}\}\) falls in the generalized additive class and \(\{a(\mathbf {u}_{-E_{t}})\}\) follows

    $$\begin{aligned} a_{i}(\mathbf {u}_{-E_{t}})= \frac{a_{i}(\emptyset )p_{i}(E_{t})}{\sum _{j\in I}a_{j}(\emptyset )p_{j}(E_{t})} \end{aligned}$$
    (12)

    for all \(\mathbf {u}_{-E_{t}}\).

The theorem says that if the social ranking has a concern for equity, then it must incorporate utilities which turned out not to have occurred when updating welfare weights as described by formula (10), while the equity concern on each individual decreases when acquiring more and more information about the true state of nature (as interpreted by prior beliefs) as in formula (11). There, higher (lower) utilities which turned out not to have occurred must be ‘compensated’ for in the way that they decrease (increase) welfare weights conditional on the present event. It also says that when the social ranking has no equity concern, welfare weights conditional on an event must be proportional to the agent’s belief in the event as in formula (12), which depends on the event which occurred but does not take into account utilities which turned out not to have occurred.

This implies that when we require that a social rankings should ignore utilities which turned out not to have occurred, the only possibility is additive aggregation.

Corollary 5

Suppose that the process of social welfare orderings \(\{\succsim _{\mathbf {u}_{-E_{t}}}\}\) satisfying Dynamic Consistency falls in the class as characterized in Proposition 10. Then it satisfies independence of counterfactual consequences if and only if it falls in the generalized additive class with

$$\begin{aligned} a_{i}(\mathbf {u}_{-E_{t}})= \frac{a_{i}(\emptyset )p_{i}(E_{t})}{\sum _{j\in I}a_{j}(\emptyset )p_{j}(E_{t})} \end{aligned}$$

for all \(\mathbf {u}_{-E_{t}}\).

Notice that under heterogeneous beliefs, even if there is no equity concern, the social welfare function must depend on history in the sense of depending on what the individuals had believed. This is consistent with the result of Mongin (1995, 1998), which has recently been refined by Chambers and Hayashi (2009) and Zuber (2010), that under heterogeneous beliefs any social ranking respecting the Pareto principle cannot be state-independent.

  • Independence of counterfactual consequences and events for all t, for all \(E_{t}\) and \(\widetilde{E}_{t}\), for all \(\mathbf {u}_{-E_{t}}\) and \(\widetilde{\mathbf {u}}_{-\widetilde{E}_{t}}\), and for all \(U,V\in \mathbb {R}^{I}\),

    $$\begin{aligned} (U_{i}\mathbf {1}_{E_{t}})_{i\in I} \succsim _{\mathbf {u}_{-E_{t}}} (V_{i}\mathbf {1}_{E_{t}})_{i\in I} \Longleftrightarrow (U_{i}\mathbf {1}_{\widetilde{E}_{t}})_{i\in I} \succsim _{\widetilde{\mathbf {u}}_{-\widetilde{E}_{t}}} (V_{i}\mathbf {1}_{\widetilde{E}_{t}})_{i\in I} . \end{aligned}$$

Corollary 6

Under strictly heterogeneous beliefs, there is no process of social welfare orderings which satisfies dynamic consistency and independence of counterfactual consequences and events.

4 Conclusions

The principle of being forward-looking, a central principle in economic decision making that says that a rational agent should be forward-looking and not be bound by bygones, is not necessarily appealing for collective decision making, where bygones may have necessary and substantive roles to play.

Then we considered a dynamic process of social welfare orderings, and proposed a restriction which is acceptable even when rejecting the principle of being forward-looking: the process must be dynamically consistent and any meaningful normative postulate should be recurrent under consistent updating.

Based on this standpoint, we have presented a set of axioms for social welfare orderings which are recurrent under consistent updating, and characterized a set of social welfare functions which are closed under updating. With such a class of social welfare functions, we characterized the roles for pasts and things known not to have occurred, which are played in the updating stage.

Although we do not say that the class of social welfare functions is the only one which is closed under consistent updating, we see it as a sufficiently tight class which enables operational arguments, and at the same a sufficiently general class which allows all the important arguments to be carried out.

Our arguments are limited to the cases where the individual utility functions fall in the class of stationary discounted utilities in the setting of intertemporal compensations, and fall in the class of expected utilities in the setting of uncertainty. The shift covariance axiom in particular makes sense for such classes of individual utility functions, and we view these classes as important benchmarks to start with. However, we do not know what should be the right counterpart of such an axiom when individual utility functions take more general forms. It is reasonable to ask what form of rationality should be imposed on social welfare functions, or what should be the decision theoretic content of social welfare functions, when we go beyond those classes of individual utility functions.