1 Introduction

The literature on income inequality measurement distinguishes two approaches. The ethical approach, initiated by Dalton (1920), aims at assessing inequality as the loss of potential social welfare resulting from the fact that income is unevenly distributed. By contrast, the axiomatic approach aims at characterising measures consistent with properties which seem suitable for inequality indices. This approach usually leads to a unique class of indices, called here the generalised entropies (see Bourguignon 1979; Cowell 1980; Shorrocks 1980, 1984). Unfortunately, neither of these two approaches is free from criticism.

The first limitation of the generalised entropies is the lack of ethical foundations. Indeed, the value given by an index in this class has no particular meaning, and the notion of inequality aversion it captures cannot be tied to the social evaluator’s preferences. The second important limitation is the ‘strange behaviour’ of the generalised entropies, as identified by Shorrocks (1980) for some values of the inequality aversion parameter. As this parameter increases, the index becomes more and more sensitive to progressive income transfers—i.e. transfers from a richer person to a poorer person—at the very top of the distribution, a behaviour which contradicts the principle of weak transfer sensitivity (Kolm 1976; Shorrocks and Foster 1987). A common recommendation for empirical studies is to avoid such parameter values. Nevertheless, what we call in this paper the ‘Shorrocks issue’ has not been rationalised, to date, through compelling arguments.

The ethical approach—through the so-called Atkinson–Kom–Sen and Kolm–Pollak classes of indices—is also, but for different reasons, controversial. It is here assumed that social welfare can be computed from the individual utilities. The same utility function applies to each individual as a measure of social well-being they receive from their incomes, according to the social evaluator’s views. Then concavity of the utility function is supposed to reflect the social evaluator’s inequality aversion. Nevertheless, Hansson (1977) and Sen (1978) have identified serious shortcomings for the strategy which aims at defining an inequality index from a social welfare function. The so-called Hansson–Sen paradox states that, for a given income distribution, a concave transformation of the utility function leads to a decrease in inequality of individual welfare, and hence the value given by the inequality index should also decrease. However, and paradoxically enough, the opposite implication is observed: for a given income distribution, the more concave the utility function, the larger the value given by the inequality index.

This paper proposes a new interpretation of the generalised entropies which builds upon an original representation, both formal and geometric. We argue that the function that generates the generalised entropies can be viewed as the antiderivative of a utility function. We first establish that such an interpretation is admissible if we assume that the utility function is a cardinal representation of the social evaluator’s preferences. In this framework, inequality aversion is related to the convexity of the antiderivative of the utility function or equivalently to the increasing rate of the utility function. This interpretation, obviously non-standard, provides ethical foundations for the generalised entropies.

The interest of this interpretation is twofold. First, the so-called Shorrocks issue becomes crystal-clear: the strange behaviour of the generalised entropies, for some values of the inequality parameter, results from the convexity of the underlying utility function. We emphasise that the index is here consistent with the Pigou–Dalton principle of transfers as long as the utility function is increasing, whatever the sign of its second derivative. The second interest is to provide a solution to escape the Hansson–Sen paradox. After a simple normalisation—again in order to take into account the cardinal measurability of the utility function—a generalised entropy index decreases after a concave transformation of the utility function.

An association between inequality aversion and the increasing rate of the utility function may seem surprising as it is a common practice in normative economics to link inequality aversion to the concavity of the utility function. We take the stance that this standard way of proceeding may be deemed as a conceptual error. Of course, in a utilitarian framework, an uneven income distribution is suboptimal in terms of production of welfare, and the more concave the utility function, the more intense the suboptimality. Nevertheless, an inequality index that is concerned by the effects of income on welfare should be sensitive to any reduction of the utilities between the individuals. Consequently, it makes more sense to relate, loosely speaking, inequality aversion to the slope of the utility function because for a given income distribution, the more the utility increases, the larger the inequality of welfare is.

The rest of the paper is organised as follows. Section 2 provides a quick presentation of the inequality indices obtained in the ethical and axiomatic approaches. We also discuss the meaning of the ‘utility function’ which represents the social evaluator’s preferences in the ethical approach. We then propose in Sect. 3 a new interpretation of the usual generalised entropies as inequality-of-utility measures, with a geometric representation on the graph of this utility function. We argue in Sect. 4 that this representation can explain the strange behaviour of the generalised entropies for some values of the inequality aversion parameter. Section 5 investigates the fundamental problem pointed out by Hansson (1977) and Sen (1978) that affects the usual ethical inequality indices. We establish that a normalised version of the generalised entropies can escape the paradox. Finally, Sect. 6 concludes the paper.

2 The standard framework of income inequality measurement

2.1 Notation and preliminary definitions

An income distribution is a list \(\varvec{x}{{\mathrm{=}}}(x_1,x_2,\ldots ,x_n)\) where \(x_i \in \mathscr {D}\) is the income of individual i, and \(\mathscr {D}{{\mathrm{=}}}[\underline{x},\overline{x}] \subseteq \mathbb {R}\) is the set of admissible incomes. In this paper we mainly focus, without loss of generality, on equal mean distributions (see Appendix A for the general case), hence distributions drawn from the set \(\mathscr {D}^{n}(\mu ) {{\mathrm{=}}}\{\varvec{x}\in \mathscr {D}^{n}| \mu (\varvec{x}) = \mu \}\) where \(\mu (\varvec{x}) {{\mathrm{=}}}\frac{1}{n} {\sum _{i=1}^{n} x_i} / n\). Then we let \(\varvec{1}_n{{\mathrm{=}}}(1,\dots ,1)\) represent a list where 1 is repeated n-times.

The ethical approach of inequality measurement aims at deriving inequality indices from a social welfare function \(W:\mathscr {D}^{n}\longrightarrow \mathbb {R}\). The equally-distributed-equivalent-income \(\Xi (\varvec{x})\) is income which, if received by each individual, results in a distribution socially indifferent to \(\varvec{x}\), so that \(W (\varvec{x}) = W \left( \Xi (\varvec{x}) \varvec{1}_n\right) \). The literature is usually focused on the utilitarian approach, in which social welfare is written as \(W_{U}(\varvec{x}) {{\mathrm{=}}}\frac{1}{n} \sum _{i=1}^n U(x_i)\) and \(\Xi _U(\varvec{x}) = U^{-1}\left( \frac{1}{n} \sum _{i=1}^n U(x_i) \right) \).

Definition 1

For all \(\varvec{x}\in \mathscr {D}^{n}(\mu )\), the class of ethical and utilitarian inequality indices—referred hereafter as ethical indices—can be written as:

$$\begin{aligned} I_{U}(\varvec{x}) {{\mathrm{=}}}\mu - \Xi _{U}(\varvec{x}), \end{aligned}$$
(2.1)

for any real-valued (utility) function U, continuously differentiable and concave, defined up to a transformation \(U_{*}(x) {{\mathrm{=}}}\alpha \,U(x) + \beta \) for all \(x \in \mathscr {D}\), where \(\alpha > 0\) and \(\beta \in \mathbb {R}\).

Alternatively, the axiomatic approach, which consists of identifying suitable properties to be applied to the inequality indices, usually leads to a unique class of indices, called the generalised entropies (see Magdalou and Nock 2011).

Definition 2

For all \(\varvec{x}\in \mathscr {D}^{n}(\mu )\), the class of generalised entropies can be written as:

$$\begin{aligned} G_\phi (\varvec{x}) {{\mathrm{=}}}\frac{1}{n} \sum _{i=1}^{n}\left[ \phi (x_i) - \phi (\mu ) \right] , \end{aligned}$$
(2.2)

for any real-valued function \(\phi \), continuously differentiable and strictly convex, defined up to a transformation \(\phi _{*}(x) {{\mathrm{=}}}\alpha \,\phi (x) + \beta \,x + \gamma \) for all \(x \in \mathscr {D}\), where \(\alpha > 0\) and \(\beta , \gamma \in \mathbb {R}\).Footnote 1

2.2 A critical discussion on the ethical approach

A discussion on the meaning of the function U in the ethical approach is required. Of course, it cannot be viewed as a representation of the individuals’ real preferences and thus, the use of the term ‘utility’ may be ambiguous. Following Dalton (1920), this function is sometimes defined in terms of ‘individual economic welfare’ or ‘personal welfare’, as it can be assessed by a social evaluator. Alternatively, it can be defined only in terms of the evaluator’s view on income inequality:

‘Note that there is nothing inherently utilitarian in this formulation. The social welfare function is the sum of identical concave transformations of individual incomes, but social welfare is expressed in terms of incomes not utilities. Thus, the concavity of the social welfare function may represent the aversion to inequality of the evaluator rather than the degree of relative risk aversion of a utility function identical across all individuals’ (Atkinson and Brandolini 2015, p. 214).

The distinction between the two interpretations is not trivial. First, we recall that the approach initiated by Kolm (1969) and Atkinson (1970) is grounded on the equivalence between the following two statements, for any distributions \(\varvec{x}, \varvec{y}\in \mathscr {D}^{n}(\mu )\) (Hardy et al. 1934):

  1. (a)

    \(\sum _{i=1}^n U(x_i) \ge \sum _{i=1}^n U(y_i)\) for any strictly concave function U defined on \(\mathscr {D}\);

  2. (b)

    \(\varvec{x}\) is obtained from \(\varvec{y}\) by means of a finite sequence of permutations and/or progressive transfers, that is, mean-preserving transfers of income from a richer to a poorer person.

This result can be interpreted in several different ways. If one reduces the notion of ‘income inequality’ as a purely descriptive notion of ‘distance in income’—which is unambiguously decreased as the result of progressive transfers—then the concavity of U captures the aversion to income inequality of the measure \(\sum _{i=1}^n U(x_i)\), in the sense that the higher the value given by the measure, the lower the income inequality. To go a step further, there is no need at all, in that case, to interpret \(\sum _{i=1}^n U(x_i)\) as a social welfare evaluation. Nevertheless, inequality also refers to the normative notion of equity, and the Pigou–Dalton principle of transfers must be seen as an ethical judgment which describes a progressive transfer as a social improvement. Dalton and Pigou take a resolutely utilitarian stance, since it is assumed that a progressive transfer ‘must increase the sum of satisfaction’ (Pigou 1912, p. 24). This argument justifies the interpretation of statement (a) as a social welfare comparison, and thus endows the concavity of the function U with the ethical views of the principle of transfers.

We emphasise that the normative interpretation of the principle of transfers (statement b) cannot be dissociated to the choice of the social welfare function (statement a). In Kolm–Atkinson’s framework, as presented above, a progressive transfer is socially desirable because an uneven income distribution is suboptimal in terms of production of social welfare. Coupled with the fact that the social welfare function is additively separable in the individual situations, it may be difficult not to interpret the concave transformation of individual incomes as a representation of personal welfare, although ‘objectively assessed’ by a social evaluator, with the property that marginal welfare ‘diminishes as income increases’ (Dalton 1920, p. 349).

In the same vein, note that Dasgupta et al. (1973) generalise the Atkinson’s result to any social welfare function \(W = W \left( U(x_1), U(x_2), \dots , U(x_n) \right) \) where U is explicitly interpreted as a utility function and such that W is increasing, symmetric, and quasi-concave in individual utilities. The most important argument, for our purpose, is that quasi-concavity is justified in terms of inequality-of-utility aversion: ‘Quasi-concavity reflects a tendency to prefer equality; given two utility distributions an average of the two must be no worse than the less preferred of the two distributions’ (Dasgupta et al. 1973, p. 181).

Aside from the interpretation of the function U, another question is raised by the justification of the quasi-concavity as proposed by Dasgupta et al. (1973): are we only concerned by income inequality, or does the inequality of utility also matter? One may reasonably consider that the income is not, for an individual, an end in itself but a means to achieve personal welfare. In this respect, Dalton’s line of argument is perfectly transparent: ‘For the economist is primarily interested, not in the distribution of income as such, but in the effects of the distribution of income upon the distribution and total amount of economic welfare’ (Dalton 1920, p. 431).

The problem is, actually, the ambiguous stance of the Kolm–Atkinson’s approach to that question. Indeed, if U is interpreted as a utility function, then the indices \(I_U\) are inequality-of-utility averse even if the underlying social welfare function \(W_U\) is inequality-of-utility neutral. Let us clarify this argument. If one applies the principle of transfers to the utility distribution instead of the income distribution one observes—if in addition we suppose that U is increasing and concave—that a utility transfer which preserves the sum of the utilities (hence \(W_U\) but also \(\Xi _U\)) is equivalent to a mean-reducing income transfer. It follows that such a utility transfer—through an underlying income transfer, the only transferable variable—implies a reduction of any index \(I_U\) and thus its consistency with the ‘principle of utility transfers’. This point can partially explain why this approach has not been much more challenged in the literature.

In this paper we explicitly take the stance—which is of course open to debate—that \(U(x_i)\) refers to personal welfare or utility of individual i, and that utility inequality is a matter of concern, as well as income inequality.Footnote 2 We also point out that it is impossible to separate the problems of income inequality and utility inequality. Formally, the two variables are inextricably linked, as utility is an increasing and concave transformation of income. ‘We have to deal, therefore, not merely with one variable, but with two, or possibly more, between which certain functional relations may be presumed to exist’ (Dalton 1920, p. 431). Mean-preserving progressive income transfers can be justified per se through normative arguments, as well as equalising transformations of the utility distribution.

3 An alternative representation of the indices

Even if the ordinal equivalence between indices \(I_{U}\) and \(G_\phi \) (Definitions 12) is often mentioned in the literature, it is rather surprising that the relationship between functions U and \(\phi \) has never been raised nor investigated. However, for the relative and absolute indices, \(\phi \) is nothing but the antiderivative of U (\(U_{r} = \phi '_{r}\) and \(U_{a} = \phi '_{a}\), see Appendix A). In this section, we propose a theoretical framework which helps rationalize this interpretation. Notice that the informational content of \(\phi \) and U makes this interpretation possible. Indeed, \(\phi \) is defined up to a transformation \(\phi _{*}(x) {{\mathrm{=}}}\alpha \,\phi (x) + \beta \,x + \gamma \), where \(\alpha > 0\) and \(\beta , \gamma \in \mathbb {R}\). If \(U = \phi ^{\prime }\), then U is defined up to a transformation \(U_{*}(x) {{\mathrm{=}}}\alpha \,U(x) + \beta \) with \(\alpha > 0\) and \(\beta \in \mathbb {R}\), as required by the ethical indices: U is a cardinal representation of the social evaluator’s preferences.

3.1 Ethical indices and generalised entropies as inequality-of-utility measures

From now on we assume that a social evaluator assesses the social well-being of each individual according to the utility resulting from the income they receive. This section establishes that the generalised entropies can be interpreted as inequality-of-utility measures if \(\phi \) is defined as the antiderivative of U, which is also the case of the standard ethical indices. Let us first define the notion of ‘Bregman divergence between two distributions’ (see Magdalou and Nock 2011).

Definition 3

For all \(\varvec{x},\varvec{y}\in \mathscr {D}^{n}(\mu )\), a Bregman divergence can be written as:

$$\begin{aligned} D_\varphi (\varvec{x}|\!| \varvec{y}) {{\mathrm{=}}}\frac{1}{n} \sum _{i=1}^{n}\left[ \varphi (x_i) - \varphi (y_i) - (x_i-y_i) \varphi '(y_i)\right] , \end{aligned}$$
(3.1)

for any function \(\varphi \) continuously differentiable and strictly convex.Footnote 3

One remarks that \(D_\varphi \) satisfies ‘anonymity’, in the sense that a permutation of all the pairs \((x_i,y_i)\) does not modify its value. Because \(\varphi \) is convex, we also have \(D_\varphi (\varvec{x}|\!| \varvec{y}) \ge 0\) for all \(\varvec{x},\varvec{y}\in \mathscr {D}^{n}(\mu )\), and \(D_\varphi (\varvec{x}|\!| \varvec{y}) = 0\) if and only if \(\varvec{x}=\varvec{y}\). Moreover one immediately observes that a generalised entropy index is equivalent to a Bregman divergence between distribution \(\varvec{x}\) and the hypothetical situation in which all individuals have mean income \(\mu \). Equivalently, \(G_\phi (\varvec{x}) = D_\phi (\varvec{x}|\!| \mu \varvec{1}_n)\).

Ethical indices are usually described as measures of income loss resulting from inequality. Alternatively the loss of potential social welfare resulting from inequality can be assessed by the difference between the ‘mean of the utilities’, which is simply written \(\mu (U) = W_{U}(\varvec{x})\), and the ‘utility of the mean’, \(U(\mu )\). From Jensen’s inequality and concavity of U, we have \(U(\mu ) \ge \mu (U)\). The following proposition establishes that an ethical index is actually a Bregman divergence between the distribution of the utilities \(\varvec{U}{{\mathrm{=}}}(U(x_1),U(x_2),\ldots ,U(x_n))\) and the hypothetical situation in which all individuals have the mean utility \(\mu (U)\). The proofs are relegated in Appendix B.

Proposition 1

For all \(\varvec{x}\in \mathscr {D}^{n}(\mu )\) and all concave utility functions U, an ethical index (Definition 1) can be written as \(I_{U}(\varvec{x}) = D_\varphi (\varvec{U}|\!| \mu (U) \varvec{1}_n)\), where \(\varphi (t) = U^{-1}(t)\) is a convex function.

Hence an ethical index has an interpretation in terms of utility loss and is thus an inequality-of-utility measure. This remark is in line with the fact that ethical indices are consistent with the ‘principle of utility transfers’. An equivalent interpretation is possible for the generalised entropies. Nevertheless, the individual utilities are not compared to the same reference: It is not the mean utility but the utility of the mean income (\(\mu (U)\) versus \(U(\mu )\)).

Proposition 2

Let \(U = \phi '\). For all \(\varvec{x}\in \mathscr {D}^{n}(\mu )\) and all convex functions \(\phi \), a generalised entropy (Definition 2) can be written as \(G_{\phi }(\varvec{x}) = D_\varphi (U(\mu ) \varvec{1}_n|\!| \varvec{U})\), where \(\varphi (t) = t\,U^{-1}(t) - \phi \left( U^{-1}(t) \right) \) is a convex function.

Propositions 1 and 2 establish that the ethical indices and the generalised entropies can each be interpreted as measures of loss of potential social welfare resulting from inequality. Nevertheless, there is a fundamental difference. While inequality aversion—through consistency with the principle of transfers—is captured by the concavity of the utility function in the standard ethical approach, it is here captured by the slope of the utility function \(U'=\phi '' > 0\) in the generalised entropies (the convexity of \(\phi \)). This point is quite surprising and, in some sense, disturbing for a reader trained in normative economics, because the association between the concavity of U and the social evaluator’s inequality aversion is now well-established. However, it is also well-admitted that this way of defining inequality aversion is not free from criticism, as pointed out by Hansson (1977) and Sen (1978) (see Sect. 5).

3.2 A simple geometric representation of the generalised entropies

Now we provide a geometric representation of \(G_\phi \) when \(\phi \) is interpreted as the antiderivative of the utility function U. It builds on the following result.

Proposition 3

For all \(\varvec{x}\in \mathscr {D}^{n}(\mu )\), a generalised entropy (Definition 2) can be written as \( G_\phi (\varvec{x}) = \frac{1}{n} \sum _{i=1}^{n} {\int _{x_i}^{\mu } \left[ U(\mu ) - U(x) \right] dx}\), where \(U {{\mathrm{=}}}\phi ^{\prime }\) is strictly increasing.

Fig. 1
figure 1

A representation of an index \(G_\phi \) on the graph of \(U = \phi ^{\prime }\)

As an illustration, consider the distribution \(\varvec{x}= (x_1, x_2)\) with \(x_1 < x_2\). It follows from Proposition 3 that \(G_\phi (\varvec{x}) = (A + B)/2\), where \(A = \int _{x_1}^{\mu } \left[ U(\mu ) - U(x) \right] dx\) and \(B = \int _{\mu }^{x_2} \left[ U(x) - U(\mu ) \right] dx\). Because A and B are computed as integrals of U, they can be represented as areas on its graph. A representation of A and B, with a strictly concave function U, is provided in Fig. 1. The farther out individual incomes \(x_1\) and \(x_2\) are from the mean, the larger areas A and B are. Figure 1 is thus a simple manner to demonstrate that the convexity of \(\phi \), or equivalently the fact that its first derivative U is increasing, is sufficient to make the generalised entropies consistent with the principle of transfers. It is also easily recognised that \(G_\phi (\varvec{x}) > 0\) as soon as \(x_1 \ne x_2\) (areas A and B are strictly positive), and that \(G_\phi (\varvec{x}) = 0\) if and only if \(x_1 = x_2 = \mu \). We emphasise that U is not necessarily concave, as it just needs to be strictly increasing. Because \(G_\phi \) is additively separable according to the individual incomes, this example can be easily generalised for a population consisting of \(n>2\) individuals. In that case, \(G_\phi \) is the mean of areas comparable to A for incomes below the mean and areas comparable to B for incomes above.

By analogy with equivalent notions in other sciences (physics, information theory, ...), a generalised entropy can be loosely associated with an ‘amount of disorder’ generated by income inequality. Because \(G_\phi \) is—in our interpretation—utility based, it can be described as a measure of ‘utility change’ due to the fact that individuals 1 and 2 have incomes \(x_1\) and \(x_2\) instead of the mean \(\mu \). More precisely, area A in Fig. 1, which deals with the situation of individual 1 only, aggregates the utility loss \((U(\mu ) - U(x))\) of all potential incomes x, from \(x_1\) to \(\mu \). Hence, area A can be viewed as a (surplus-like) measure of utility wasted by individual 1, taking into account the curvature of the utility function over the whole income interval between \(x_1\) and \(\mu \). The same interpretation is possible for area B, but as a measure of utility gained by individual 2. Nevertheless, both areas A and B contribute positively to the global measure \(G_\phi \).

We now propose to illustrate the impact of a progressive transfer on \(G_\phi \). Precisely, we denote by \(\varvec{x}(i,j;\Delta )\) a distribution obtained from \(\varvec{x}\) by means of a progressive transfer of income \(\Delta > 0\) from individual j to individual i, under the restriction that \(x_i + \Delta \le x_j - \Delta \). One deduces from Proposition 3 that for all distributions \(\varvec{x}\in \mathscr {D}^{n}(\mu )\):

$$\begin{aligned} G_\phi (\varvec{x}) - G_\phi (\varvec{x}(i,j;\Delta )) = \frac{1}{n} \left[ \int _{x_j - \Delta }^{x_j} U(x) dx - \int _{x_i}^{x_i + \Delta } U(x) dx \right] . \end{aligned}$$
(3.2)
Fig. 2
figure 2

The impact of a progressive transfer

We have represented \(C = \int _{x_j - \Delta }^{x_j} U(x) dx\) and \(D = \int _{x_i}^{x_i + \Delta } U(x) dx\) in Fig. 2. It is obvious that \(G_\phi (\varvec{x}) - G_\phi (\varvec{x}(i,j;\Delta )) = C-D >0\) as soon as U is strictly increasing (or \(\phi \) strictly convex), whatever the sign of the second derivative of U. The interpretation of the first derivative of \(\phi \) as a utility function is used in the next section to clarify the real foundations of what we call in this paper the Shorrocks’ issue.

4 A fresh look at the Shorrocks’ issue

4.1 The Shorrocks’ issue

If we denote again by \(\varvec{x}(i,j;\Delta )\) a distribution obtained from distribution \(\varvec{x}\) by means of a progressive transfer of an income amount \(\Delta >0\), we know that \(G_\phi (\varvec{x}) > G_\phi (\varvec{x}(i,j;\Delta ))\) by the strict convexity of \(\phi \). Here we introduce the notion of weak transfer sensitivity (Shorrocks and Foster 1987), related to ‘the position of the transfer’ in the distribution. According to this property, when the amount transferred and the income distance between the individuals involved in the transfer is fixed, the lower down the transfer is in the distribution, the more the inequality should be reduced.Footnote 4

Definition 4

(Shorrocks and Foster 1987) An inequality index I is weakly transfer sensitive if and only if, for all \(\varvec{x}\in \mathscr {D}^{n}(\mu )\), we have \(I(\varvec{x}) - I(\varvec{x}(i,j;\Delta )) > I(\varvec{x}) - I(\varvec{x}(k,l;\Delta ))\) whenever \((x_j-x_i) = (x_l - x_k)\) and \(x_i < x_k\).

It can be shown that \(G_\phi \) is weakly transfer sensitive if and only if \(\phi '''< 0\). Shorrocks was the first to point out the strange behaviour of indices \(G_\phi \) with \(\phi ''' \ge 0\) (Shorrocks 1980, p. 623). It is illustrated by the following example, provided by Shorrocks and Foster (1987). Indices \(G_\phi \) with \(\phi ''' = 0\) have the property that a transfer of £1 between millionaires whose incomes differ by £1 million has the same impact on inequality as a £1 transfer from an individual with an income of £1 million to someone with zero income. Worse still, the first transfer is more inequality-reducing than the second one if \(\phi ''' > 0\). The interpretation of the first derivative of \(\phi \) as a utility function is a very intuitive way to rationalise the strange behaviour of indices \(G_\phi \) with \(\phi ''' \ge 0\), what we call in this paper the Shorrocks’ issue.

4.2 A new interpretation

If one assumes that \(U {{\mathrm{=}}}\phi '\) is a utility function, concavity (\(U'' = \phi ''' < 0\)) is a necessary and sufficient condition for consistency of \(G_\phi \) with the weak transfer sensitivity property. We have depicted in Fig. 3 the impact of the two progressive transfers \(\varvec{x}(i,j;\Delta )\) and \(\varvec{x}(k,l;\Delta )\) as described in Definition 4. We actually distinguish two cases. In Fig. 3a, we consider a concave utility function (\(U'' = \phi ''' <0\)) and in Fig. 3b, we consider a convex one (\(U'' = \phi ''' >0\)). One can observe in both graphs that \(G_\phi (\varvec{x}) - G_\phi (\varvec{x}(i,j;\Delta ))=C-D\) and \(G_\phi (\varvec{x}) - G_\phi (\varvec{x}(k,l;\Delta ))=E-F\). Without ambiguity, \(G_\phi (\varvec{x}) - G_\phi (\varvec{x}(i,j;\Delta )) > G_\phi (\varvec{x}) - G_\phi (\varvec{x}(k,l;\Delta ))\) when the utility function is concave, whereas inequality is reversed when the function is convex. If the utility is linear, one immediately deduces that \(G_\phi (\varvec{x}) - G_\phi (\varvec{x}(i,j;\Delta )) = G_\phi (\varvec{x}) - G_\phi (\varvec{x}(k,l;\Delta ))\). In that case, the impact of a transfer is the same whatever its position on the income scale.

Fig. 3
figure 3

Position of the transfer and inequality reducing impact

We can now discuss the ethical meaning of these different cases. Whatever their positions on the income scale, the two considered transfers have the same inequality reducing impact on incomes: The individuals involved in the transfers are at the same income distance, and the amounts transferred are also the same. Nevertheless, if the inequality measure also appraises the impact in terms of inequality of utility, it seems reasonable to assume that the more the ‘utility distance’ is reduced between the individuals involved in the transfer, the more the overall inequality is reduced. If the utility function is linear, the decrease in inequality of utility is the same regardless of the position of the transfer on the income scale. If utility is strictly concave, the lower down the progressive transfer is in the distribution, the more the utility inequality is reduced between the individuals involved in the transfer. Thus, if we also assume that the overall inequality measure is additively separable according to the situation of the individuals, the lower down the transfer is in the distribution, the more the value of the inequality index should be reduced. The sensitivity to the position of the transfer is of course reversed with a strictly convex utility function.

The previous description of the generalised entropies, with \(U = \phi '\) a utility function, attests that they can be suitably interpreted as measures of income inequality, as well as measures of inequality of utility. This point will be discussed in great detail in the following section.

5 A fresh look at the Hansson–Sen issue

5.1 The Hansson–Sen issue

The Hansson–Sen issue—or paradox—applies to any standard ethical index based on a utilitarian social welfare function \(W_U\), but it can be extended to any index based on a more general but Schur-concave social welfare function (see Sen 1978). At first sight, what is surprising is the discrepancy between, on the one hand, the magnitude of the criticisms raised by Hansson (1977) and Sen (1978) and, on the other hand, the widespread academic support for this approach. Indeed, Sen writes that ‘the idea of measuring inequality on the basis of an overall social welfare function is fundamentally misconceived. It leads to a clearcut answer but to a question different from the one that was posed’ (Sen 1978, page 428). Hansson even considers that the ethical indices ‘are not conceptually linked to inequality at all, even if they accidentally have some relationship to this concept’ (Hansson 1977, page 305).

It is now well-accepted that income inequality must be analysed through its impact on social welfare and that an inequality index should capture the social evaluator’s views on the notion of inequality itself. Whereas most of the usual properties deal with the effects of changes in the income distribution—such that the principle of transfers—Hansson (1977) focuses on changes of the utility function:

‘My proposal is the following: whenever the curve relating welfare to income flattens out so that from some point on the new curve is wholly below the old one, but still increasing and concave, then the measure of inequality shall decrease for the same distribution of nominal income. This principle of reduced welfare from excess income is related to the principle of transfers in the following way: when the curve changes the welfare of some of those who are better off decreases and even if no actual transfer takes place, there is a general shrinking of difference’ (Hansson 1977, page 306).

Figure 4a is an illustration provided by the author to show that indices \(I_U\) violate this property. He considers a society consisting of two individuals with incomes \(x_1\) and \(x_2\), and social welfare is assessed from two utility functions, U and V. Function V is equal to U up to the mean \(\mu \), and more concave afterwards. One immediately observes that the distance from \(\mu \) to \(\Xi _V(\varvec{x})\) is greater than the distance from \(\mu \) to \(\Xi _U(\varvec{x})\). Hence \(I_U\) increases after such a concave transformation of U, whereas the principle of reduced welfare from excess income suggests that it should decrease. This is the problem identified by Hansson.

Fig. 4
figure 4

Hansson–Sen paradox for the standard ethical inequality indices

When discussing Hansson’s criticism, Sen (1978) considers another example, in some sense more general. The author rightly argues that, as the utility function becomes more and more concave, the inequality index increases but, in the same way, the gap between the personal welfare values becomes smaller. The decrease in inequality of personal welfare values is in complete contradiction with the index assessment.Footnote 5 The author illustrates the situation with Fig. 4b (for relative indices), by normalising the utility functions such that \(U(x_1) = V(x_1) = u\), with V more concave than U. Such a normalisation is possible through an affine transformation of the utility function, without affecting the value of the inequality index. Sen (1978) also considers the impact of a progressive transfer on the index. As the utility function becomes more concave, the impact of a transfer on the index is larger and larger, whereas the impact in terms of inequality of personal welfare goes on decreasing.

Nevertheless, the notion of difference in welfare should be accurately defined, which is not the case above. Indeed the utility function is defined up to an affine transformation and it does not make sense to compare differences \((U(x_2) - U(x_1))\) and \((V(x_2) - V(x_1))\) as these quantities are not invariant to such transformations. We propose to appeal to the following classical result in the theory of decision under risk, to clarify the paradox.

Remark 1

(see Pratt 1964, Theorem 1) The following three statements are equivalent:

(a):

\(V \left( U^{-1}(t) \right) \) is a strictly concave function of t.

(b):

\(I_V(\varvec{x}) > I_U(\varvec{x})\) for all \(\varvec{x}\in \mathscr {D}^{n}(\mu )\).

(c):

\(\frac{V(y)-V(x)}{V(w)-V(v)} < \frac{U(y)-U(x)}{U(w)-U(v)}\) for all \(v,w,x,y \in \mathscr {D}\) with \(v< w \le x < y\).

Statement (a) defines the notion ‘V more concave than U’. Hansson–Sen paradox is a consequence of (c), observing that the ratios are invariant to affine transformations of the utilities. Hansson’s criticism can be interpreted as follows, with (c) equivalently written as:

$$\begin{aligned} \frac{V(y)-V(x)}{V'(w)} \le \frac{U(y)-U(x)}{U'(w)},\quad \text {for all}\ w,x,y \in \mathscr {D}\quad \text {with}\quad w \le x < y. \end{aligned}$$
(5.1)

Consider a distribution \(\varvec{x}=(x_1,x_2)\), let \(w=x=x_1\) and \(y=x_2\), and normalise the utility functions—by affine transformations—such that \(U'(x_1)=V'(x_1)\). Equation (5.1) implies that the difference in welfare, relative to the marginal utility of the poorer individual which is assumed constant whatever the utility function, is lower with utility V than with U. As a contradiction, statement (b) states that the inequality is higher with \(I_V\) than with \(I_U\). For the Sen example, two points of the utility function must be normalised. Recalling that the minimum possible income is \(\underline{x}\), let \(U(\underline{x})=V(\underline{x})\) and \(U(x_1)=V(x_1)\). Now from statement (c), let \(v=\underline{x}\), \(w=x=x_1\) and \(y=x_2\). In that case, we reach the same conclusion as in the Hansson example, but the difference in welfare is here relative to the supplement of utility from the minimum the poorer individual beneficiates, which is assumed constant whatever the utility function.

Even if one assumes that only income inequality matters—and not also utility inequality, as assumed in this paper—the Hansson–Sen criticism cannot be entirely ruled out. We recall that the Kolm–Atkinson’s ethical approach draws ‘on the parallel with the formally similar problem of measuring risk in the theory of decision-making’ (Atkinson 1970, p. 244). The Atkinson index is merely the analogue of the proportional risk premium (Pratt 1964). Strictly speaking, by applying this analogy, we emphasise that the (proportional) risk premium is not, in itself, a direct measure of risk as it can be approximated by the product of the (proportional) Arrow–Pratt risk aversion coefficient and an ‘objective’ risk measure—not affected by preferences—which is the variance. Coming back to our context, one can admit that the more concave the utility is, the more the social evaluator is inequality averse. Nevertheless, the statement that also adds ‘the more concave the utility is, the more the income inequality is increased’ is at least ambiguous and, at worst, a misinterpretation. As established in the following section, the generalised entropies are not affected by the Hansson–Sen criticism if they are properly normalised.

5.2 Generalised entropies as a solution

The ethical indices \(I_U\) have an intuitive interpretation as the ‘cost of inequality’ in terms of loss of total income. At the opposite, an interpretation in terms of ‘measure of inequality’ may be viewed as misconceived if we take the stance that U is a function reflecting personal welfare and that inequality-of-utility matters, as assumed in this paper. As illustrated by the so-called Hansson–Sen paradox, these notions are different: It is possible to observe a decrease in the inequality of utility and, at the same time, an increase in the income loss resulting from inequality. In this respect, we argue that an interpretation of the concavity of U as an indicator of inequality aversion is not appropriate. It makes more sense to relate, loosely speaking, inequality aversion to the slope of the utility function (\(U'\)). Indeed, for a given income distribution, the more the utility increases, the larger the inequality of welfare is. This is actually the idea—which will become more precise hereafter—behind the interpretation of the generalised entropies as non-standard ethical inequality indices.

By assuming that U is a strictly increasing function representing the social evaluator’s preferences, let \(G_\phi \) be defined with \(\phi ' = U\). It is worth emphasising that the consistency with the Pigou–Dalton principle of transfers is here ensured by \(U'>0\), not by \(U''<0\). We also recall that indices \(G_\phi \) are defined up to a transformation of \(\phi \) by \(\phi _{*}(x) = \alpha \,\phi (x) + \beta \,x + \gamma \) for all \(x \in \mathscr {D}\), with \(\alpha > 0\) and \(\beta , \gamma \in \mathbb {R}\) or, equivalently, by an affine transformation of the utility function, \(U_{*}(x) = \alpha \,U(x) + \beta \). In order to escape the Hansson–Sen criticism, \(G_\phi \) should satisfy the following property: The more concave the utility function, the lower the value given by the index. Even if an affine transformation of the utility function does not modify the ranking of the distributions provided by the index, a problem is that its value is not invariant to such a transformation (\(G_{\phi _{*}} = \alpha \,G_\phi \) when \(\phi _{*}(x) = \alpha \,\phi (x) + \beta \,x + \gamma \)). Hence, it does not make sense to directly compare, without normalisation, the values of two different indices based on a more or less concave utility function, as illustrated in Fig. 5.

Fig. 5
figure 5

Generalised entropy and concave transformation of the utility function

This figure is based on the geometric interpretation of \(G_\phi \) as provided in Sect. 3.2. We consider a distribution \(\varvec{x}{{\mathrm{=}}}(x_1,x_2)\) and two utility functions \(U {{\mathrm{=}}}\phi '\) and \(V {{\mathrm{=}}}\psi '\) such that V is strictly more concave than U. In Fig. 5a, one observes that \(G_\psi (\varvec{x}) < G_\phi (\varvec{x})\). Indeed, \(G_\phi (\varvec{x}) = G_\psi (\varvec{x}) + (A+B)\), where areas A and B are positive. In that case, a concave transformation of the utility function reduces the inequality. In Fig. 5b, the outcome is precisely the opposite: \(G_\psi (\varvec{x}) > G_\phi (\varvec{x})\), observing that \(G_\phi (\varvec{x}) = G_\psi (\varvec{x}) - (C+D)\) with positive areas C and D. The difference between these two situations can be easily understood: The function V is increasing less than U in Fig. 5a, and increasing more in Fig. 5b.

A simple normalisation can solve the problem. The requirements are: (1) the index becomes invariant to any affine transformation of the utility function, (2) the index remains consistent with the interesting properties of the unnormalised index (principle of transfers, principle of diminishing transfers...) and (3) the index escapes the Hansson–Sen criticism. Several normalisations are actually possible. We propose a normalisation based on the following Lemma.

Lemma 1

Consider two strictly increasing utility functions U and V defined on \(\mathscr {D}\) with values in \(\mathbb {R}\) such that \(V \left( U^{-1}(t) \right) \) is a strictly concave function of t. Moreover assume that there exist \(x,y \in \mathscr {D}\) with \(x<y\) such that \(V'(x) \le U'(x)\) and \(V(y)=U(y)\). It follows that \(V(z) > U(z)\) for all \(z \in [x,y)\), and that \(V(z) < U(z)\) for all \(z > y\).

This Lemma establishes sufficient conditions to be in a situation like in Fig. 5a, where a decrease in inequality follows a concave transformation of the utility function. The fixed-point y in Lemma 1 can be chosen such as \(y=\mu \), the mean income. Then, in order to have V above U up to \(\mu \) and below afterwards, a sufficient condition is to have \(V'(x) \le U'(x)\) where x is the lowest considered income in distribution \(\varvec{x}\). A solution to make this income independent from \(\varvec{x}\) is to let \(x = \underline{x}\), the lower bound of the set of admissible incomes \(\mathscr {D}{{\mathrm{=}}}[\underline{x},\overline{x}]\). We then suggest letting \(V'(\underline{x}) = U'(\underline{x})\). To that end, we propose the following normalisation of the utility function: Let \(U_{*}(x) = \alpha \,U(x) + \beta \) with \(\alpha \) and \(\beta \) such that \({U_{*}}'(\underline{x}) = 1\) and \(U_{*}(\mu )= c\) where c is a constant. Moreover we know that \({U_{*}} = {\phi _{*}}'\) so that \(\phi _{*}(x) = \alpha \,\phi (x) + \beta \,x + \gamma \), and thus \(G_{\phi _{*}} = \alpha \,G_\phi \). It follows that, in our case, \(\alpha = 1/U'(\underline{x})\).

Inequality aversion can be associated with the slope of the utility function but, regarding the cardinal nature of U, this slope can be meaningfully interpreted only in relative terms (see statement (c) in Remark 1). Here, we propose to choose as reference the slope of U at the minimal income. We finally obtain the following class of normalised indices.

Definition 5

Consider \(G_\phi \) as presented in Definition 2. For all \(\varvec{x}\in \mathscr {D}^{n}(\mu )\), the class of the normalised and generalised entropies can be written as:

$$\begin{aligned} I_\phi (\varvec{x}) {{\mathrm{=}}}\frac{1}{U'(\underline{x})} G_\phi (\varvec{x}), \end{aligned}$$
(5.2)

where \(U {{\mathrm{=}}}\phi '\) is the utility function representing the social evaluator’s preferences. The lower bound \(\underline{x}\) of the set of admissible incomes \(\mathscr {D}{{\mathrm{=}}}[\underline{x},\overline{x}]\) can be replaced by the lowest considered income in the distributions under comparison.Footnote 6

This class of inequality indices inherits all the properties of indices \(G_\phi \). A new property is the invariance of \(I_\phi \) to any transformation of \(\phi \) by \(\phi _{*}(x) = \alpha \,\phi (x) + \beta \,x + \gamma \) or, equivalently, by an affine transformation of the utility function (\(I_\phi = I_{\phi _{*}}\)). Finally, as established by Proposition 4 below, this class of indices escapes the Hansson–Sen criticism: The more concave the utility function, the lower the value given by the index. Notice that it makes sense now to compare the values of two different indices, respectively based on a more or less concave utility function, because they are invariant to affine transformations of the utility functions.

Proposition 4

Let \(V {{\mathrm{=}}}\psi '\) and \(U {{\mathrm{=}}}\phi '\). Statement (a) below implies statement (b), but the converse is false:

  1. (a)

    \(V \left( U^{-1}(t) \right) \) is a strictly concave function of t.

  2. (b)

    \(I_\psi (\varvec{x}) < I_\phi (\varvec{x})\) for all \(\varvec{x}\in \mathscr {D}^{n}(\mu )\).

The fact that (b) does not imply (a) is not surprising: In contrast to the standard ethical indices \(I_U\), inequality aversion for the generalised entropies is not directly related to the concavity of the utility function but to its slope. Hence a concave transformation of the utility function is sufficient to imply a decrease of the normalised indices \(I_\phi \), but it is not necessary.

6 Discussion

The aim of this paper is to revisit the well-known class of inequality indices called the generalised entropies, through a new interpretation: We argue that the generating function \(\phi \) can be interpreted as the antiderivative of the utility function representing social evaluator’s preferences. In this framework, these indices are consistent with the Pigou–Dalton principle of transfers if and only if the antiderivative of the utility function is convex, or equivalently if and only if the utility function is increasing. Then, they satisfy the principle of diminishing transfers if and only if the utility function is concave. The originality of this approach is to separate the standard notion of diminishing marginal utility to the degree of inequality aversion of the social evaluator.

This interpretation clarifies the strange behaviour of the generalised entropies, as identified by Shorrocks (1980) for some values of the inequality aversion parameter, and what we call the Shorrocks issue: It appears with convex utility functions. In that case, the higher a progressive transfer is on the income scale, the more the inequality of personal welfare decreases between the individuals involved in the transfer, and thus the more the inequality as measured by the inequality index decreases. This is the reason why, in that case, the principle of diminishing transfers is violated.

This interpretation also provides a solution to escape the Hansson–Sen paradox, which affects ethical inequality indices. This paradox states that a concave transformation of the utility function leads to a decrease in inequality of personal welfare, and thus presumes that it should also imply an inequality reduction as measured by the indices, whereas the opposite implication is observed. We establish that a normalised version of the generalised entropies can escape the paradox. A normalisation is necessary because a generalised entropy index is not invariant to an affine transformation of the utility function. We propose to divide the index by the marginal utility at the lower bound of the set of admissible incomes, or at the lowest income as observed in the distributions under comparison. Nevertheless, the value given by such an index cannot be directly interpreted, which is probably the price to pay to escape the paradox.