1 Introduction

Suppose 100 voters are to elect 10 representatives in a parliament. There are 3 political parties respectively receiving 25, 43 and 32 votes. If we want to achieve proportional representation, the three parties should respectively have 2.5 (i.e., \(10 \times 25/100\)), 4.3 and 3.2 representatives.Footnote 1 This ideal representation is not feasible because it is not integer. Hence, one has to choose between one of the many different ways to obtain an integer number of representatives. Many allocation methods (for instance the Hare quota or the Droop quota) for choosing the right distribution of representatives have been proposed (Balinski and Young 1975). In order to compare these methods, or more precisely to compare the outcomes of these methods, political scientists have proposed several indices to measure the dissimilarity between the perfect proportional (but usually non-integer) distribution and a feasible one: the so-called disproportionality indices. The first one was the Rae index (Rae 1967), soon followed by many others (Karpov 2008). Around the year 2000, there were so many disproportionality indices at hand that political scientists realized that a systematic approach to the analysis of disproportionality indices was necessary.Footnote 2 This led them to proposing some conditions or properties that disproportionality indices should ideally satisfy (Bolun 2012; Karpov 2008; Koppel and Diskin 2009; Taagepera and Grofman 2003), but no characterization of a disproportionality index has been proposed so far in political sciences.Footnote 3

A related problem is the apportionment of seats among electoral districts. There also, some malapportionment indices have been proposed; they are formally equivalent to disproportionality indices and are also discussed by Bolun (2012), Karpov (2008), Koppel and Diskin (2009) and Taagepera and Grofman (2003).

Another related problem is that of comparing two probability distributions. For instance, Cha (2007) reviews more than 60 dissimilarity indices between two probability distributions. If we model a parliament by means of proportions of seats, then dissimilarity indices (between two probability distributions) and disproportionality indices (between a seat distribution and a vote distribution) are formally equivalent. But such a modelling would implicitly assume a homogeneity hypothesis (i.e. the size of a parliament does not matter; only proportions matter). We do not consider this homogeneity hypothesis as compelling and we therefore consider our problem as related but distinct from the measurement of dissimilarity between two probability distributions.

We will not propose and champion a new disproportionality index or support an existing one because we think that a single best universal disproportionality index does not exist. For instance, we probably do not need the same index if we want to measure malapportionment in a small or in a large country. Instead of supporting a specific index, we will provide a formal framework in which it is possible to analyze and compare various indices, using clear and explicit arguments. This is in line with the authors’ previous work in decision theory (e.g. Bouyssou and Pirlot 2004; Bouyssou and Marchant 2007).

2 Definitions, notation and some indices

In this section, we introduce the notation that will be needed. We also define several disproportionality indices.

2.1 Notation and definitions

The set \(N= \{1, 2, \ldots , n\}\), with \(n \ge 2\), represents the set of parties or electoral districts. We will throughout consider that N does not vary. We will consider that the distribution of votes among parties can adequately be described by a vector with rational components.Footnote 4 This distribution will be called the target distribution or just target. The set of non-negative rational numbers is denoted by \(\mathbb {Q}_{+}\). The set of strictly positive (resp. non-negative) real numbers is denoted by \(\mathbb {R}_{++}\) (resp. \(\mathbb {R}_{+}\)). The set of natural numbers is denoted by \(\mathbb {N}\) and \(\mathbb {N}_{0} = \mathbb {N} {\setminus } \{0\}\). The target distribution is \(\pi = (\pi _{1}, \ldots , \pi _{n})\), with \(\pi _{i} \in \mathbb {Q}_{+}\) for all \(i \in N\) and \(\sum _{i \in N} \pi _{i} = 1\). In the context of proportional seat allocation, \(\pi \) is the vector of the relative vote shares among the parties; in the context of apportionment, it is the vector of the relative population shares among the districts.

A parliament is described by the distribution of seats among parties. It is denoted by \(x=(x_{1}, \ldots , x_{n})\), with \(x_{i} \in \mathbb {N}\) for all \(i \in N\) and where \(x_{i}\) represents the number of seats allocated to party or district i. The size of parliament x is denoted by \(s(x) = \sum _{i \in N} x_{i}\). The relative frequency distribution in a parliament x is \(x/s(x) = (x_{1}/s(x), \ldots , x_{n}/s(x))\). The set of all parliaments is \(X= \mathbb {N}^{N}\), i.e. the set of all mappings from N to \(\mathbb {N}\). The set \(\varPi \) of all target distributions is a subset of \(\mathbb {Q}_{+}^{N}\). Formally, \(\varPi = \{\pi \in \mathbb {Q}_{+}^{N} : \sum _{i \in N} \pi _{i} = 1 \}\).

Definition 1

A disproportionality index f is a mapping from \(X \times \varPi \) to \(\mathbb {R}_{+}\) satisfying \(f(x, \pi )=0\) iff \(x/s(x) = \pi \).

A disproportionality index measures how far from the target \(\pi \) the actual seat distribution x in the parliament is. It is equal to zero only when the seat distribution is perfectly proportional.

The reader may have the impression that comparisons of disproportionality indices across different elections are not possible in our setting because the set N of parties is fixed while the set of parties in real elections is almost never the same. This is not completely correct. Indeed, we can consider some very large set N (containing e.g. 100 parties) and, when we want to model a particular election involving a subset \(M \subset N\) of parties, we just ensure that the parties in \(N{\setminus } M\) have 0 votes and 0 seats. We will discuss this in detail in Sect. 4.

2.2 Two disproportionality indices

We begin with an additive index: \(f_{\text{ DD }}(x, \pi ) = \sum _{i \in N} \left| x_{i}/s(x) - \pi _{i} \right|\). For each party i, we compute the deviation \(x_{i}/s(x) - \pi _{i} \) between the proportion in the parliament (\(x_{i}/s(x)\)) and the target proportion (\(\pi _{i}\)). We summarize the n such deviations by means of their sum after taking absolute values in order to avoid compensation between positive and negative deviations. Index \(f_{\text{ DD }}\) is the Duncan and Duncan (1955) (also called Loosemore and Hanby 1971) index.Footnote 5

A different index is \(f_{\text{ L }}(x, \pi ) = \max _{i \in N} \left| x_{i}/s(x) - \pi _{i} \right|\). Here, instead of using the sum as a summary, we focus on the largest absolute deviation. Index \(f_{\text{ L }}\) (or Lijphart index) has been proposed in Lijphart (1994).

Indices \(f_{\text{ DD }}\) and \(f_{\text{ L }}\) correspond to \(L^{p}\) distances (with \(p= 1\) and \(\infty \) resp.) between the vectors \(\pi \) and x / s(x). Many more indices can be defined by just considering other values of p or other distances. If we do not limit ourselves to distances, many other indices can be thought of. For instance, Taagepera and Grofman (2003) discusses nineteen disproportionality indices.

3 Characterizations

As already mentioned in the introduction, we do not promote a particular index. For the same reason, we will not try to justify our properties on normative grounds. Properties need not be compelling: they should be simple properties, easy to understand and as elementary as possible. They allow us to break down an index (possibly defined by a complex formula) into a small number of elementary and simple blocks. And if one of the properties characterizing an index is easy to understand but the social planner does not find it adequate, then he has a good reason not to use that index. For a deep discussion about the utility of this kind of analysis, we refer to Luce et al. (1990, pp. 196–202). See also Bell et al. (1988, Chapter 1) in which various views on using characterizations are described and compared. Our view, in the context of disproportionality measurement, is close to that of Thomson (2001), in the context of game theory and resource allocation.

In this paper, we characterize two simple and popular indices: \(f_{\text{ DD }}\) and \(f_{\text{ L }}\). These two characterizations should be considered as examples of what can be achieved and expected within the framework we propose.

3.1 Characterization of index \(f_{\text{ DD }}\)

3.1.1 Properties

Homogeneity Suppose \(n=2\) and party 1 has 400 seats in parliament x with size \(s(x)= 600\) and 800 seats in parliament y with \(s(y)=1\,200\). In terms of proportions, x and y are identical and this suggests that the disproportionality index (given the same target \(\pi \)) should be equal in the two parliaments. The first property we discuss formalizes this idea: it imposes that the size of a parliament does not matter, in the sense that scaling up or down a parliament by a multiplicative factor does not change its index.

P 1

Homogeneity. For all \(\lambda \in \mathbb {N}_{0}\) and all \(x \in X\), \(f(\lambda x, \pi ) = f(x, \pi )\).

Let us show that index \(f_{\text{ DD }}\) satisfies Homogeneity.

$$\begin{aligned} f_{\text{ DD }}(\lambda x, \pi ) = \sum _{i \in N} \left| \pi _{i} - \frac{\lambda x_{i}}{ s(\lambda x)} \right| = \sum _{i \in N} \left| \pi _{i} - \frac{\lambda x_{i}}{\lambda s(x) }\right| = \sum _{i \in N} \left| \pi _{i} - \frac{x_{i}}{ s(x) }\right| = f_{\text{ DD }}( x, \pi ). \end{aligned}$$

Although Homogeneity seems appealing, if we consider very small and very large parliaments, then it is perhaps no longer so compelling. Suppose \(n=2\), \(s(x)=6\), \(s(y)=600\) and the target is fifty-fifty. Suppose party 1 has 4 seats in parliament x. Then 4 seats for party 1 with respect to a target of 3 seems a very small deviation, not really indicating a representativity problem. On the contrary, 400 seats for party 1 in parliament y with respect to a target of 300 seems problematic, and it seems reasonable to have a higher disproportionality index for y than for x, although they are identical in terms of proportions. Even worse: suppose party 1 has two seats in parliament z with \(s(z)=3\). In terms of proportions, z is identical to x and y. But a perfect proportionality is impossible in a parliament of size three with a fifty-fifty target. It would be unfair to associate the same disproportionality index to parliaments z and x or y. We move to another property.

Radial Linearity Consider a target \(\pi \) and two parliaments x and y with the same size. Then \(s(x)\pi \) is a perfectly proportional parliament with the same size as x. Suppose that x and y lie on the same ray originating in \(s(x)\pi \). Then, \((x+y)/2\) lies between x and y, on the same ray, and a very mild monotonicity requirement is that \(f((x+y)/2,\pi )\) lies between \(f(x,\pi )\) and \(f(y,\pi )\). This monotonicity is satisfied by \(f_{\text{ DD }}\) and we could use it to characterize the ranking induced by \(f_{\text{ DD }}\) (see Sect. 4.2), but we chose to characterize the index \(f_{\text{ DD }}\). We thus need a stronger condition, fixing the exact position of \(f((x+y)/2,\pi )\) between \(f(x,\pi )\) and \(f(y,\pi )\). The simplest thing we can then do is to say that this position is exactly halfway and we can motivate this by a kind of principle of insufficient reason. This is formalized hereafter.

P 2

Radial Linearity. If x and y are two parliaments such that (a) \(s(x)=s(y)\), \((x+y)/2 \in X\) and (b) \(x = \alpha s(x) \pi + (1-\alpha ) y\) for some \(\alpha \in [0,1]\), then

$$\begin{aligned} f\left( \frac{x+y}{2},\pi \right) = \frac{f(x,\pi ) + f(y,\pi )}{2}. \end{aligned}$$

Radial Linearity is a necessary condition for \(f_{\text{ DD }}\). Indeed, \(f_{\text{ DD }}(x+y,\pi )\)

$$\begin{aligned} = \sum _{i \in N} \left| \pi _{i} - \frac{x_{i} + y_{i}}{ 2s(x)} \right| = \frac{1}{2} \sum _{i \in N} \left| 2 \pi _{i} - \frac{x_{i} + y_{i}}{ s(x)} \right| =\frac{1}{2} \sum _{i \in N} \left| \pi _{i} - \frac{x_{i}}{ s(x)} + \pi _{i} - \frac{ y_{i}}{ s(y)} \right| . \end{aligned}$$

Since \(x = \alpha s(x) \pi + (1-\alpha ) y\), we know that \(\{i \in N : x_{i}/s(x) \ge \pi _{i}\} = \{i \in N : y_{i}/s(y) \ge \pi _{i}\}\) and we have \(f_{\text{ DD }}(x+y,\pi )\)

$$\begin{aligned}&= \frac{1}{2} \sum _{i \in N} \left| \pi _{i} - \frac{x_{i}}{ s(x)} + \pi _{i} - \frac{ y_{i}}{ s(y)} \right| \\&= \frac{1}{2} \left( \sum _{i \in N} \left| \pi _{i} - \frac{x_{i}}{ s(x)} \right| + \sum _{i \in N} \left| \pi _{i} - \frac{ y_{i}}{ s(y)} \right| \right) =\frac{f_{\text{ DD }}(x,\pi ) + f_{\text{ DD }}(y,\pi )}{2}. \end{aligned}$$

Radial Linearity may seem a (too) strong condition and characterizing an index with an additive form such as \(f_{\text{ DD }}\) with Radial Linearity may seem too obvious for being interesting. Actually, there are many non-additive indices satisfying Radial Linearity. Index \(f_{\text{ L }}\) is such an index (see Sect. 3.2). The main effect of Radial Linearity is not to make things additive, but rather to force the index to vary linearly when a parliament deviates from perfect proportionality in a given direction. Notice also that, despite an additive structure, the index \(f_{\chi ^{2}}\) defined by Nagel (1984) as

$$\begin{aligned} f_{\chi ^{2}}(x,\pi ) = \sum _{i \in N} \left( \frac{(x_{i}/s(x) - \pi _{i})^{2}}{\pi _{i}} \right) \end{aligned}$$

does not satisfy Radial Linearity.

Substitution Let \(\mathbf {1}^{i}\) be an element of X such that \(\mathbf {1}^{i}_{k} = 0\) for all \(k \in N {\setminus } \{ i \}\) and \(\mathbf {1}^{i}_{i} = 1\). For example, \(\mathbf {1}^{2}= (0,1,0, \ldots , 0)\). If we add \(\mathbf {1}^{i}\) to a parliament, we obtain a new parliament where an extra seat has been allocated to party i.

Suppose \(n=3\), the target \(\pi \) is (1 / 3, 1 / 3, 1 / 3) and consider the parliament \(x=(10,40,50)\). Parties 2 and 3 are overrepresented. We move a seat from party 3 to party 2, yielding \(y=(10,41,49)\). Parties 2 and 3 are still overrepresented. It makes sense to consider that we did not improve the situation: nothing changed for the underrepresented parties; they did not gain a single seat. We may therefore request that \(f(x,\pi ) = f(y,\pi )\). Put differently, we can move a seat in a parliament from a party to another, without changing the index value, as long as both parties are in excess (or both in deficit) before and after the change.

P 3

Substitution. Suppose \([(x_{i}-1)/s(x) \ge \pi _{i}\) and \(x_{j}/s(x) \ge \pi _{j}]\) or \([0 < x_{i}/s(x) \le \pi _{i}\) and \((x_{j}+1)/s(x) \le \pi _{j}]\). Then \(f(x,\pi ) = f(x - \mathbf {1}^{i} + \mathbf {1}^{j},\pi )\).

Notice that this condition is vacuous when \(n=2\). Substitution is satisfied by \(f_{\text{ DD }}\). We only show it for the first part of the condition (corresponding to the case of parties in excess).

$$\begin{aligned} f_{\text{ DD }}(x - \mathbf {1}^{i} + \mathbf {1}^{j}, \pi ) = \left|\frac{x_{i} - 1}{s(x)} - \pi _{i}\right| + \left|\frac{x_{j} + 1}{s(x)} - \pi _{j}\right| + \sum _{k \in N {\setminus } \{i,j\}} \left|\frac{x_{k}}{s(x)} - \pi _{k }\right| . \end{aligned}$$

The dimensions in excess in x and \(x - \mathbf {1}^{i} + \mathbf {1}^{j}\) are the same. The right-hand side of the above equation can therefore be rewritten as

$$\begin{aligned} \left|\frac{x_{i}}{s(x)} - \pi _{i}\right| + \left|\frac{x_{j}}{s(x)} - \pi _{j}\right| + \sum _{k \in N {\setminus } \{i,j\}} \left|\frac{x_{k}}{s(x)} - \pi _{k }\right| = f_{\text{ DD }}(x , \pi ). \end{aligned}$$

Invariance We need an additional condition to characterize \(f_{\text{ DD }}\) when \(\pi \) is not fixed. It says that, if a parliament is perfectly proportional and if we move a seat from a party to another (thus moving away from the target), then the value of the index is the same, whatever the target is.

P 4

Invariance. For any \(i,j \in N\) and any \(\lambda \in \mathbb {N}_{0}\), if \(\lambda \pi , \lambda \pi - \mathbf {1}^{i} + \mathbf {1}^{j}, \lambda \pi ' , \lambda \pi ' - \mathbf {1}^{i} + \mathbf {1}^{j} \in X\) then \(f(\lambda \pi - \mathbf {1}^{i} + \mathbf {1}^{j}, \pi ) = f(\lambda \pi ' - \mathbf {1}^{i} + \mathbf {1}^{j}, \pi ').\)

Index \(f_{\text{ DD }}\) satisfies Invariance. Indeed,

$$\begin{aligned} f_{\text{ DD }}(\lambda \pi - \mathbf {1}^{i} + \mathbf {1}^{j}, \pi ) = \left|\frac{\lambda \pi _{i} - 1}{\lambda } - \pi _{i}\right| + \left|\frac{\lambda \pi _{j} + 1}{\lambda } - \pi _{j}\right| = \frac{2}{\lambda } . \end{aligned}$$

In the same way, \(f_{\text{ DD }}(\lambda \pi ' - \mathbf {1}^{i} + \mathbf {1}^{j}, \pi ') = 2/\lambda \). This proves that \(f_{\text{ DD }}(\lambda \pi - \mathbf {1}^{i} + \mathbf {1}^{j}, \pi ) = f_{\text{ DD }}(\lambda \pi ' - \mathbf {1}^{i} + \mathbf {1}^{j}, \pi ').\)

Invariance is a kind of symmetry condition. It is, to some extent, a desirable property, but it comes at a cost. Suppose indeed \(\pi = (1/3,1/3,1/3)\), \(\pi '=(1/300,149/300,150/300)\), \(x=(99,101,100)\), \(y=(0,150,150)\) and consider the following situations: (a) parliament x combined with target \(\pi \) and (b) parliament y combined with \(\pi '\). In both situations, party one has one seat less than what it deserves. In situation (a), the relative deviation for party 1 is 1/100 (it deserves 100 seats); in situation (b), the relative deviation for party 1 is 1/1. One could therefore argue that \(f(x,\pi )\) should be smaller than \(f(y,\pi ')\). Yet Invariance (with \(\lambda =300\), \(i=1\) and \(j=2\)) implies \(f(x,\pi ) = f(y, \pi ')\).

3.1.2 Results

We can now state a first characterization theorem.Footnote 6

Theorem 1

A disproportionality index f satisfies Homogeneity, Radial Linearity and Substitution iff, for all \(\pi \in \varPi \), there exists \(\gamma _{\pi } \in \mathbb {R}_{++}\) such that

$$\begin{aligned} f(x,\pi ) = \gamma _{\pi } \sum _{i \in N} \left| \frac{x_{i}}{s(x)} - \pi _{i} \right| = \gamma _{\pi } f_{\text{ DD }}(x,\pi ). \end{aligned}$$
(1)

If Invariance is added to the list of necessary and sufficient conditions, then \(\gamma _{\pi } = \gamma _{\pi '}\) for all \(\pi , \pi ' \in \varPi \). The properties are logically independent, but Substitution can be omitted when \(n=2\).

The first part of this result (without Invariance) is useful when \(\pi \) is not allowed to vary. The second part allows us to make comparisons of indices across parliaments in different countries or across time, where \(\pi \) is not constant.

The idea of the proofFootnote 7 is as follows. We first assume \(\pi \) is fixed and we show that f depends only on the relative distribution x / s(x), thanks to Homogeneity. Any parliament can then be described by its position (x / s(x)) in the simplex \(\varPi \). We then partition the set \(\varPi \) into several domains. Each domain is characterized by a set of parties in excess (i.e., with a proportion \(x_{i}/s(x)\) larger than \(\pi _{i}\)). Using Radial Linearity, we show that f varies linearly on each line segment in \(\varPi \) having \(\pi \) as an endpoint. Notice that such a line segment always belongs to a single domain. We then show that, within a domain, the set of all points with equal value of the index is a hyperplane, by Substitution. It follows that f is an affine function in each domain and it can therefore be represented by (1).

3.2 Characterization of index \(f_{\text{ L }}\)

3.2.1 Properties

It is easy to check that \(f_{\text{ L }}\) does not satisfy Substitution. Does this imply that \(f_{\text{ L }}\) is a bad index? We do not think so, because it is not that obvious that Substitution is a compelling condition in all contexts, as illustrated in the following example. Suppose \(\pi =(1/4, 1/4, 1/4, 1/4)\); then, according to Substitution, \(x=(0, 20, 25, 55)\) and \(y=(0, 20, 40, 40)\) must have the same disproportionality. Yet people may have a different opinion and reason as follows: parliaments x and y do an equally bad job with respect to parties 1 and 2, but x does also discriminate between parties 3 and 4, while y does not. So, one could argue that the disproportionality index of y should be lower than that of x. In such a case, Substitution could be questioned. We will therefore propose a slightly different condition, after introducing a new piece of notation.

For every \(i \in N\), \(\pi \) being fixed, define \(P(i) = \{ x \in X : \frac{x_{i}}{s(x)} - \pi _{i} \ge \left| \frac{x_{j}}{s(x)} - \pi _{j} \right| \ \forall j \in N \}\). In words, P(i) is the set of all parliaments such that party i is in excess and has the maximal deviation in absolute value. Similarly, \(M(i) = \{ x \in X : \pi _{i} - \frac{x_{i}}{s(x)} \ge \left| \frac{x_{j}}{s(x)} - \pi _{j} \right| \ \forall j \in N \}\). The set M(i) interprets similarly. For every i, P(i) and M(i) cannot be both empty.

Suppose \(n=3\), the target \(\pi \) is (1 / 3, 1 / 3, 1 / 3) and consider the parliament \(x=(30,30,40)\). The largest deviation w.r.t. the target occurs for party 3. We move a seat from party 1 to party 2, yielding \(y=(29,31,40)\). The largest deviation w.r.t. the target still occurs for party 3. It makes sense to consider that we did not improve the situation: nothing changed for the party with the maximal deviation; it did not gain or loose a single seat. We may therefore request that \(f(x,\pi ) = f(y,\pi )\). Put differently, we can move a seat in a parliament from a party to another, without changing the index value, as long as the parties with maximal deviations remain so after the change. Formally,

P 5

Substitution*. Suppose that x and \(x - \mathbf {1}^{i} + \mathbf {1}^{j} \in P(k)\) or x and \(x - \mathbf {1}^{i} + \mathbf {1}^{j} \in M(k)\), with \(k \notin \{i,j\}\). Then \(f(x, \pi ) = f(x - \mathbf {1}^{i} + \mathbf {1}^{j}, \pi ).\)

Notice that this condition is vacuous when \(n=2\). Substitution* is satisfied by \(f_{\text{ L }}\). We show it only for the first part of the condition, the other part being similar. Since x and \(x - \mathbf {1}^{i} + \mathbf {1}^{j} \in P(k)\), with \(k \notin \{i,j\}\), we have

$$\begin{aligned} f_{\text{ L }}(x - \mathbf {1}^{i} + \mathbf {1}^{j}, \pi ) = \left|\frac{x_{k}}{s(x)} - \pi _{k}\right| = f_{\text{ L }}(x , \pi ). \end{aligned}$$

3.2.2 Results

Proposition 1

(Particular case) If \(n=2\) or 3, then indices \(f_{\text{ DD }}\) and \(f_{\text{ L }}\) are identical, up to a multiplicative constant. More precisely, \(f_{\text{ L }}=f_{\text{ DD }}/2\).

Proposition 1 no longer holds when \(n \ge 4\). Indeed, let \(x = (30,35,20,15)\), \(y=(32, 37, 19, 12)\) and \(\pi = (1/4,1/4,1/4,1/4)\). Then \(f_{\text{ L }}(x,\pi ) = 0.1\) and \(f_{\text{ L }}(y,\pi ) = 0.13\) while \(f_{\text{ DD }}(x,\pi ) = 0.3\) and \(f_{\text{ DD }}(y,\pi ) = 0.38\). So, there is no \(\lambda \) such that \(f_{\text{ L }} = \lambda f_{\text{ DD }}\). We now turn to the general case.

Theorem 2

A disproportionality index f satisfies Homogeneity, Radial Linearity and Substitution* iff, \(\forall \pi \in \varPi \), \(\exists \gamma _{\pi } \in \mathbb {R}_{++}\) s.t. \( f(x,\pi ) = \gamma _{\pi } \max _{i \in N} \left| \frac{x_{i}}{s(x)} - \pi _{i} \right| = \gamma _{\pi } \, f_{\text{ L }}(x,\pi ). \) If Invariance is added to the list of necessary and sufficient conditions, then \(\gamma _{\pi } = \gamma _{\pi '}\) for all \(\pi , \pi ' \in \varPi \). The properties are logically independent, but Substitution* can be omitted when \(n=2\).

4 Discussion

We have shown that \(f_{\text{ L }}\) and \(f_{\text{ DD }}\) both satisfy Homogeneity, Radial Linearity and Invariance, while they differ only by the kind of substitution they satisfy.

4.1 Properties for the measurement of disproportionality in the literature

We mentioned in Sect. 1 the existence of some papers presenting properties for indices of disproportionality. In this discussion, we will focus on the eight properties presented in Koppel and Diskin (2009) because they largely overlap with properties in other papers and they are more precisely stated than in the other papers (notice though that they are stated for a proportionality index and not a disproportionality index, but it is easy to adapt them). For space reasons, we will not discuss the eight properties individually. Let us just make two remarks.

The eight properties are not strong enough to obtain a characterization. Indeed, the cosine measure discussed in Koppel and Diskin (2009) satisfies all eight properties, but the square or the square root of the cosine measure also satisfies all eight properties. So, we definitely need to define some new ones in order to characterize some index. But, of course, we use some of the properties proposed in Koppel and Diskin (2009) (their Scale Invariance is our Homogeneity) and some of our new properties, taken individually or jointly, imply or are implied by some properties in Koppel and Diskin (2009). For instance, their Dalton’s principle of transfers is implied by our Radial Linearity.

Second, we find some of the properties used in Koppel and Diskin (2009) irrelevant for characterizing an index. In particular, that the maximal value of the index should be 1. A maximal value equal to 100 (or any positive number) would be equally fine. It is just a matter of convenience, without any substantial consequence.

4.2 Index versus ranking

There are cases where we need a ranking of the parliaments and not an index. Since many indices induce the same ranking,Footnote 8 we need stronger properties for characterizing an index than for characterizing a ranking. In particular, the ranking induced by \(f_{\text{ DD }}\) (resp. \(f_{\text{ L }}\)) is characterized by Homogeneity, Substitution (resp. Substitution*), Invariance (all restated in terms of rankings) and a monotonicity condition much weaker than Radial Linearity.

4.3 Comparisons across elections

Our setting assumes that the set N of parties does not vary. Yet, if we want to compare the disproportionality of two parliaments after two different elections (in the same country at time \(t_{1}\) and \(t_{2}\) or in different countries), we will probably face different sets of parties. We saw in Sect. 2 that this problem can be tackled by adopting a large set N of parties so that any realistic situation can be modelled by imposing \(\pi _{i}=0 = x_{i}\) for some i’s. This raises some questions.

Is an election with 4 parties equivalent to an election with the same 4 plus 96 other parties having zero vote? If we want to analyze indices like the Rae index,Footnote 9 where the number of parties plays a role in the definition of the index, then our setting is not adequate, because the number of parties is some large number without any real meaning and it does not vary. Yet, for the indices characterized in this paper and for most popular indices, adding dummies does not impact the value of the disproportionality index. So, although not perfect, our setting seems fine for analyzing many interesting indices.

Is there a better setting? There is at least a more general one. We can consider a universal set \(\varOmega \) of parties, for instance \(\varOmega = \mathbb {N}\), and, for each particular election, a specific subset N of \(\varOmega \). The index f is then a function with three arguments: Nx and \(\pi \). We can then compare \(f(N,x,\pi )\) with \(f(N',x',\pi ')\). In such a setting, it would be tempting to impose the following condition: \(f(N,x,\pi ) = f(N',x',\pi ')\) whenever \(i \notin N\), \(N' = N \cup \{i \}\), \(x'_{i}=0=\pi '_{i}\), \(x'_{j}=x_{j}\) and \(\pi '_{j}=\pi _{j}\) for all \(j \in N\). This amounts to saying that adding a party without votes nor seats to an election does not modify the disproportionality index. This is of course not the case for the Rae index, but it holds for most popular indices. If we impose this property, then the remaining set of indices is exactly the set we can analyze in our setting with N fixed. This justifies the use of our setting. If we do not impose this property, then we can analyze a larger set of indices. We can also state some interesting properties saying, for instance, what should happen when several parties merge into a bigger one, inheriting all votes issued in favor of the original parties (as in Independence from split, Karpov 2008). This is not possible in our setting with N fixed. This shows the superiority of the more general setting, but it comes at a price: a heavier formalism.

In conclusion, no setting is perfect. Thanks to the simplicity of our setting, we can use a simple notation and focus on some simple ideas underlying the Duncan and Duncan index as well as the Lijphart index.

5 Proofs

Lemma 1

Suppose \(\pi \) is fixed. If a diversity index f satisfies Homogeneity, then \(f(x, \pi )= F(x/s(x))\), for some mapping \(F: \varPi \rightarrow \mathbb {R}_{+}\).

Proof

Since \(\pi \) is fixed, we can define a mapping \(g: X \rightarrow \mathbb {R}_{+}\) such that \(f(x, \pi )= g(x)\). Define the mapping \(F: \varPi \rightarrow \mathbb {R}_{+}\) as follows. For any \(p \in \varPi \), \(F(p) = g(x)\) if there is \(x \in X\) such that \(p = x/s(x)\). The mapping F is defined everywhere because p has rational components and, hence, there is always \(x \in X\) such that \(p = x/s(x)\). The mapping F is well defined. Indeed, suppose there are xy such that \(p = x/s(x)\) and \(p = y/s(y)\). By Homogeneity, \(f(x, \pi ) = f(y, \pi )\). Therefore, \(F(p)=g(x)=g(y)\). \(\square \)

Thanks to this lemma, when \(\pi \) is fixed, we can consider f as a mapping defined on the \((n-1)\) dimensional simplex \(\varPi \) (more precisely, the points in the simplex with rational coordinates) and we can therefore use a graphical representation when there are three categories, as in Fig. 1.

Lemma 2

Suppose pq are two elements of \(\varPi \) such that \(p = \alpha \pi + (1-\alpha ) q\) for some \(\alpha \in [0,1]\). If f satisfies Homogeneity and Radial Linearity, then

$$\begin{aligned} F\left( \frac{p + q}{2}\right) = \frac{F(p) + F(q)}{2}, \end{aligned}$$
(2)

with F defined as in Lemma 1.

Proof

Clearly, \((p+q)/2\) belongs to \(\varPi \). Since F is defined everywhere, there are two parliaments xy such that \(x/s(x) = p\) and \(y/s(y) = q\). Hence \(F(p)=f(x,\pi )\) and \(F(q)=f(y,\pi )\). By Homogeneity, \(F(p)=f(x,\pi ) = f(2s(y)x,\pi )\) and \(F(q)=f(y,\pi ) = f(2s(x)y,\pi )\). The two parliaments s(y)x and s(x)y have the same size, i.e., s(x)s(y). Moreover, \(s(y)x = \alpha s(s(y)x) \pi + (1-\alpha ) s(x)y\) for some \(\alpha \in [0,1]\) and \((2s(y)x + 2s(x)y)/2 \in X\). Hence we can apply Radial Linearity and we find

$$\begin{aligned} \frac{F(p) + F(q)}{2} = \frac{f(2s(y)x, \pi ) + f(2s(x)y,\pi )}{2} {\mathop {=}\limits ^{\text{ Rad.Lin. }}} f \big (s(y)x + s(x)y, \pi \big ). \end{aligned}$$

By definition of F,

$$\begin{aligned} f(s(y)x + s(x)y, \pi ) = F\left( \frac{s(y)x + s(x)y}{2s(x)s(y)}\right) = F\left( \frac{p + q}{2}\right) , \end{aligned}$$

from which we easily infer (2). \(\square \)

Let us consider a non-empty strict subset \(J \subset N\) and define

$$\begin{aligned} \varPi _{J} = \{p \in \varPi : p_{i} \ge \pi _{i} \ \forall i \in J \text { and } p_{i} \le \pi _{i} \ \forall i \notin J\}. \end{aligned}$$

All these subsets are represented in Fig. 1 left (in case \(n=3\)). Notice that the union of all such sets (letting J vary) covers \(\varPi \); formally, \(\bigcup _{\emptyset \ne J \subsetneq N} \varPi _{J} = \varPi .\) But these sets do not partition \(\varPi \) because they intersect (when some \(p_{i}=\pi _{i}\)). In particular, \(\forall J \subset N, J \ne \emptyset \), \(\pi \in \varPi _{J}\). Notice also that \(\varPi _{J}\) contains only one point in some cases. More precisely, \(\varPi _{J} = \{ \pi \}\) iff \(\pi _{i} = 0\) for all \(i \notin J\).

Fig. 1
figure 1

Left: the domains \(\varPi _{J}\). The set \(\varPi \) consists of all points in the triangle with rational coordinates. Right: an affine subspace P(pJ)

Lemma 3

Suppose \(\pi \) is fixed. For every non-empty \(J \subset N\), the set \(\varPi _{J}\) is rational convex, i.e., \(\forall p,q \in \varPi _{J}\), \(\forall \alpha \in [0,1] \cap \mathbb {Q}\), \(\alpha p + (1-\alpha )q \in \varPi _{J}\).

Proof

This results from the fact that \(\varPi _{J}\) is the intersection of two rational convex sets: (1) the polyhedral cone with vertex \(\pi \), defined by the inequalities \(p_{i} \ge \pi _{i} \ \forall i \in J\) and \(p_{i} \le \pi _{i} \ \forall i \notin J\) and (2) the set \(\varPi \). \(\square \)

Given a point \(p \in \varPi \), there can be several non-empty sets \(J \subset N\) such that \(p \in \varPi _{J}\) (this is the case when \(p_{i} = \pi _{i}\) for some \(i \in N\)). For every \(p \in \varPi \), if \(p \in \varPi _{J}\), we define the set \(P(p,J) = \left\{ q \in \varPi _{J} : \sum _{j \in J} q_{j} = \sum _{j \in J} p_{j}\right\} .\) The set P(pJ) is the intersection of the convex set \(\varPi _{J}\), located in the hyperplane \(\{q \in \mathbb {Q}^{N} : \sum _{j \in N} q_{j} = 1 \}\), with the hyperplane \(\{q \in \mathbb {Q}^{N} : \sum _{j \in J} q_{j} = \sum _{j \in J} p_{j}\}\). Whenever \(\varPi _{J}\) is not reduced to \(\{\pi \}\), these two hyperplanes are distinct and, hence, P(pJ) is located in an affine subspace of dimension \(n-2\) in \(\mathbb {Q}^{N}\). A set P(pJ) is depicted in Fig. 1 right (with \(n=3\)). The sets P(pJ) with p varying in \(\varPi _{J}\) form a partition of \(\varPi _{J}\) because, for all \(p, p' \in \varPi _{J}\), we have \(P(p,J) \ne P(p',J)\) iff \(\sum _{j \in J} p_{j} \ne \sum _{j \in J} p'_{j}\).

Lemma 4

Suppose \(\pi \) is fixed. If \(\varPi _{J} \ne \{\pi \}\), then

$$\begin{aligned} \min _{p \in \varPi _{J}} \sum _{j \in J} p_{j} = \sum _{j \in J} \pi _{j} < \max _{p \in \varPi _{J}} \sum _{j \in J} p_{j} = 1. \end{aligned}$$

The maximum is attained by all \(p \in \varPi _{J}\) such that \(p_{j}=0\) for all \(j \notin J\).

Proof

If \(\varPi _{J} \ne \{\pi \}\), there is \(p \in \varPi _{J}\) such that \(p \ne \pi \). Hence, for some \(j \in J\), \(p_{j} > \pi _{j}\) and, consequently, \(\sum _{j \in J} p_{j} > \sum _{j \in J} \pi _{j}\). Since \(\pi \in \varPi _{J}\), this establishes that \(\sum _{j \in J} \pi _{j} = \min _{p \in \varPi _{J}} \sum _{j \in J} p_{j} < \max _{p \in \varPi _{J}} \sum _{j \in J} p_{j}\).

Clearly, if p is such that \(p_{j}=0\) for all \(j \notin J\), then \(\sum _{j \in J} p_{j} = 1\) and this sum is maximal. Such a p always exists, since \(J \ne N\). \(\square \)

Lemma 5

Suppose \(\pi \) is fixed and f satisfies Homogeneity and Substitution. For every non-empty \(J \subset N\), \(\forall p \in \varPi _{J}\), \(\forall q \in P(p,J)\), we have \(F(p) = F(q)\).

Proof

Consider some pq as in the statement of the lemma. By definition of F, there are xy such that \(p = x/s(x)\) and \(q = y/s(y)\). By Homogeneity, \(f(x,\pi ) = f(s(y)x, \pi )\) and \(f(y,\pi ) = f(s(x)y, \pi )\). The two parliaments s(y)x and s(x)y have the same size. Moreover \(\sum _{j \in J} q_{j} = \sum _{j \in J} p_{j}\) and \(\sum _{j \notin J} q_{j} = \sum _{j \notin J} p_{j}\). We therefore have \(\sum _{j \in J} s(x)y_{j} = \sum _{j \in J} s(y)x_{j}\) and \(\sum _{j \notin J} s(x)y_{j} = \sum _{j \notin J} s(y)x_{j}\). This means that we can ‘move’ from s(x)y to s(y)x by a series of successive exchanges between pairs of dimensions in excess or between pairs of dimension in deficit, as in the statement of Substitution. Therefore, \(f(s(x)y,\pi ) = f(s(y)x,\pi )\) and \(F(p) = F(q)\). \(\square \)

Lemma 6

Suppose \(\pi \) is fixed and f satisfies Homogeneity and Radial Linearity. Let J be a non-empty subset of N, and \(\overline{p}\) be an element of \(\varPi _{J}\) such that \(\sum _{j \in J} \overline{p}_{j} = 1\). Then F varies linearly along the rational segment \(r_{J}\) joining \(\pi \) to \(\overline{p}\). More precisely, for \(p = \alpha \overline{p}+ (1-\alpha ) \pi \), with \(\alpha \in [0,1] \cap \mathbb {Q}\), \(F(p) = \alpha F(\overline{p})\). Furthermore, we have \(F(p) = \gamma ^{J}_{\pi } \sum _{i \in N} \left| p_{i} - \pi _{i} \right|\) for some real number \(\gamma ^{J}_{\pi }\).

Proof

If \(\varPi _{J} = \{\pi \}\), there is nothing to prove. Otherwise, using Lemma 4, there is \(\overline{p}\in \varPi _{J}\) such that \(\sum _{j \in J} \overline{p}_{j}=1 = \max _{p \in \varPi _{J}} \sum _{j \in J} p_{j}\). Let \(p = \alpha \overline{p}+ (1-\alpha ) \pi \), for some \(\alpha \in [0,1] \cap \mathbb {Q}\). Such a point belongs to \(\varPi _{J}\) since \(\varPi _{J}\) is rational convex. We define the function \(G: [0,1] \cap \mathbb {Q}\rightarrow \mathbb {R}_{+}\) by \(G(\alpha ) = F(p)\). Consider another point \(q = \beta \overline{p}+ (1-\beta ) \pi \), \(\beta \in [0,1] \cap \mathbb {Q}\). We have \((p+q)/2 \in \varPi _{J}\) and

$$\begin{aligned} \frac{p+q}{2} = \frac{\alpha + \beta }{2} \ \overline{p}+ \left( 1- \frac{\alpha +\beta }{2} \right) \pi . \end{aligned}$$

Hence (2) can be rewritten as \(G\left( \frac{\alpha + \beta }{2}\right) = \frac{G(\alpha ) + G(\beta )}{2}. \) This is Jensen’s equation and it holds for all rational \(\alpha , \beta \in [0,1]\). Its unique solution is \(G(\alpha ) = a \alpha + b\) (Aczél 1966, p. 43). Hence \(F(\alpha \overline{p}+ (1-\alpha ) \pi )= \alpha a + b\).

Letting \(\alpha = 0\), we obtain \(F(\pi )=0=b\). Letting \(\alpha = 1\), \(F(\overline{p}) = a\) obtains. Hence, \(F(\alpha \overline{p}+ (1-\alpha ) \pi )= \alpha F(\overline{p})\). This proves that F varies linearly along the rational segment \(r_{J}\) joining \(\pi \) to \(\overline{p}\).

If \(p = \alpha \overline{p}+ (1-\alpha ) \pi \), then

$$\begin{aligned} \sum _{j \in J} p_{j} = \sum _{j \in J} (\alpha \overline{p}_{j} + (1-\alpha ) \pi _{j}) = \alpha \sum _{j \in J} \overline{p}_{j} + (1-\alpha ) \sum _{j \in J} \pi _{j} \!=\! \sum _{j \in J} \pi _{j} + \alpha \left( \! 1- \sum _{j \in J} \pi _{j} \!\right) . \end{aligned}$$

Hence \(\alpha = \frac{\sum _{j \in J} (p_{j} - \pi _{j})}{ 1- \sum _{j \in J} \pi _{j} }.\) Since \(\sum _{j \in J} (p_{j}-\pi _{j}) + \sum _{j \notin J} (p_{j}-\pi _{j}) = 0\), we get

$$\begin{aligned} \alpha= & {} \frac{\sum _{j \in J} (p_{j} - \pi _{j}) - \sum _{j \notin J} (p_{j}-\pi _{j})}{ 2 \left( 1- \sum _{j \in J} \pi _{j} \right) } = \frac{\sum _{j \in J} (p_{j} - \pi _{j}) + \sum _{j \notin J} \left|p_{j}-\pi _{j}\right|}{ 2 \left( 1- \sum _{j \in J} \pi _{j} \right) } \\= & {} \frac{\sum _{j \in N} \left|p_{j} - \pi _{j}\right|}{ 2 \left( 1- \sum _{j \in J} \pi _{j} \right) } \end{aligned}$$

and

$$\begin{aligned} F(\alpha \overline{p}+ (1-\alpha ) \pi ) = \frac{\sum _{j \in N} \left|p_{j} - \pi _{j}\right|}{ 2 \left( 1- \sum _{j \in J} \pi _{j} \right) } F(\overline{p}). \end{aligned}$$

Letting \(\gamma ^{J}_{\pi } = \frac{ F(\overline{p}) }{ 2 \left( 1- \sum _{j \in J} \pi _{j} \right) },\) we obtain \(F(\alpha \overline{p}+ (1-\alpha ) \pi ) = \gamma ^{J}_{\pi } \sum _{j \in N} \left|p_{j} - \pi _{j}\right|.\)\(\square \)

Lemma 7

\(\forall J \subset N, J \ne \emptyset \), \(\forall q \in \varPi _{J}\), \(\exists p = \alpha \overline{p}+ (1-\alpha ) \pi \) s.t. \(q \in P(p, \pi )\).

Proof

Let \(\alpha = \frac{ \sum _{j \in J} (q_{j} - \pi _{j} ) }{ \sum _{j \notin J} \pi _{j} }.\) For \(p = \alpha \overline{p}+ (1-\alpha ) \pi \), straightforward calculations show that \(\sum _{j \in J} p_{j} = \sum _{j \in J} q_{j}\) and therefore \(q \in P(p, \pi )\). \(\square \)

Lemma 8

Suppose \(\pi \) is fixed and f satisfies Homogeneity, Substitution and Radial Linearity. For every \(J \subset N, J \ne \emptyset \), \(\forall p \in \varPi _{J}\), \(F(p) = \gamma ^{J}_{\pi } \sum _{i \in N} \left| p_{i} - \pi _{i} \right|\) for some real number \(\gamma ^{J}_{\pi }\).

Proof

Fix J. If \(\varPi _{J} = \{\pi \}\), there is nothing to prove. We suppose henceforth that \(\varPi _{J} \ne \{\pi \}\). For any \(p \in \varPi _{J}\), by Lemma 7, there is \(q = \alpha \overline{p}+ (1-\alpha ) \pi \) such that \(p \in P(q, \pi )\) and, by Lemma 5, \(F(p)=F(q)\). By Lemma 6, \(F(p)=F(q) = \gamma ^{J}_{\pi } \sum _{i \in N} \left|q_{i} - \pi _{i}\right|\).

Since p and q both belong to \(\varPi _{J}\) and \( \sum _{j \in J} q_{j} = \sum _{j \in J} p_{j}\) (see proof of Lemma 7), we have \(\sum _{i \in N} \left|q_{i} - \pi _{i}\right| = \sum _{i \in N} \left|p_{i} - \pi _{i}\right|\). Therefore, \(F(p)=F(q) = \gamma ^{J}_{\pi } \sum _{i \in N} \left|q_{i} - \pi _{i}\right| = \gamma ^{J}_{\pi } \sum _{i \in N} \left|p_{i} - \pi _{i}\right|\). \(\square \)

Lemma 9

Suppose \(\pi \) is fixed and f satisfies Homogeneity, Substitution and Radial Linearity. There is some \(\gamma _{\pi } \in \mathbb {R}_{++}\) such that, \(\forall J \subset N, J \ne \emptyset \) and for every \(p \in \varPi _{J}\), we have \(F(p) = \gamma _{\pi } \sum _{i \in N} \left| p_{i} - \pi _{i} \right|\).

Proof

Let \(I \subseteq N\) be such that \(\vert I \vert \ge 2\) and \(j \in I\). Define \(J= I {\setminus } \{j\}\). If \(\varPi _{J} = \{\pi \}\), then we can choose \(\gamma ^{J}_{\pi }\) equal to \(\gamma ^{I}_{\pi }\) without any consequence. Otherwise, there exists \(p \in \varPi _{I} \cap \varPi _{J}\), with \(p \ne \pi \). It is such that \(p_{j} = \pi _{j}\). Because \(p \in \varPi _{I} \cap \varPi _{J}\), we can write (by Lemma 8)

$$\begin{aligned} F(p) = \gamma ^{I}_{\pi } \sum _{i \in N} \left| p_{i} - \pi _{i} \right| = \gamma ^{J}_{\pi } \sum _{i \in N} \left| p_{i} - \pi _{i} \right| . \end{aligned}$$

Since \(\sum _{i \in N} \left| p_{i} - \pi _{i} \right| \ne 0\) (because \(p \ne \pi \)), we obtain \(\gamma ^{I}_{\pi } = \gamma ^{J}_{\pi }\).

Consider now a set \(I \subseteq N\) such that \(\vert I \vert \le n-2\) and \(i \notin I\). Define \(J= I \cup \{i\}\). The same reasoning as above shows that \(\gamma ^{I}_{\pi } = \gamma ^{J}_{\pi }\).

Let I and K be two arbitrary non-empty subsets such that \(I \ne N \ne K\). We want to show that \(\gamma ^{I}_{\pi } = \gamma ^{K}_{\pi }\). We consider two exclusive cases.

  1. 1.

    \(I \cap K \ne \emptyset \). We rename the elements of \(I {\setminus } K\) so that \(I {\setminus } K = \{ y_{1}, \ldots , y_{k} \}\). Using the reasoning in the first part of this proof, we find \(\gamma ^{I}_{\pi } = \gamma ^{I {\setminus } \{y_{1}\}}_{\pi } = \gamma ^{I {\setminus } \{y_{1}, y_{2}\}}_{\pi } = \cdots = \gamma ^{I {\setminus } (I \setminus K)}_{\pi } = \gamma ^{I \cap K}_{\pi }\). We rename the elements of \(K {\setminus } I\) so that \(K {\setminus } I = \{ z_{1}, \ldots , z_{l} \}\). Using the reasoning in the first part of this proof, we find \(\gamma ^{I}_{\pi } = \gamma ^{I \cap K}_{\pi } = \gamma ^{(I \cap K) \cup \{z_{1}\}}_{\pi } = \gamma ^{(I \cap K) \cup \{z_{1}, z_{2}\}}_{\pi } = \cdots = \gamma ^{(I \cap K) \cup (K {\setminus } I)}_{\pi }\). Since \((I \cap K) \cup (K {\setminus } I) = K\), we have proven that \(\gamma ^{I}_{\pi } = \gamma ^{K}_{\pi }\).

  2. 2.

    \(I \cap K = \emptyset \). We rename the elements of I so that \(I = \{ y_{1}, \ldots , y_{k} \}\). Using the reasoning in the first part of this proof , we find \(\gamma ^{I}_{\pi } = \gamma ^{I {\setminus } \{y_{1}\}}_{\pi } = \gamma ^{I {\setminus } \{y_{1}, y_{2}\}}_{\pi } = \cdots = \gamma ^{I {\setminus } \{y_{1}, \ldots , y_{k-1}\}}_{\pi } = \gamma ^{\{y_{k}\}}_{\pi }\). Notice we have not removed \(y_{k}\) from I to avoid considering an empty set. We will remove \(y_{k}\) later.

    We rename the elements of K so that \(K= \{ z_{1}, \ldots , z_{l} \}\). Using again the reasoning in the first part of the proof of this lemma, we find \(\gamma ^{I}_{\pi } = \gamma ^{\{y_{k}\}}_{\pi } = \gamma ^{\{y_{k}\} \cup \{z_{1}\}}_{\pi } = \gamma ^{\{y_{k}\} \cup \{z_{1}, z_{2}\}}_{\pi } = \gamma ^{\{y_{k}\} \cup \{z_{1}, \ldots , z_{l}\}}_{\pi } = \gamma ^{ \{z_{1}, \ldots , z_{l}\}} = \gamma ^{K}_{\pi }\).

This proves that \(\gamma ^{I}_{\pi } = \gamma ^{J}_{\pi }\) for all non-empty subsets IJ of N. Finally, \(\gamma _{\pi } \in \mathbb {R}_{++}\) because we have imposed in our definition of the diversity index that \(f(x, \pi ) \ge 0\) for all \(x \in X\) and \(f(x, \pi ) = 0\) iff \(x/s(x) = \pi \). \(\square \)

Proof of Theorem 1

Necessity has already been proved. We first assume \(\pi \) is fixed. According to Lemma 9 and to the definition of F and g (see Lemma 1), we can write \(f(x,\pi ) = g(x) = \gamma _{\pi } \sum _{i \in N} \left| x_{i}/s(x) - \pi _{i} \right|\).

Suppose now \(\pi \) is allowed to vary. By the first part of Theorem 1, we have \(f(x,\pi ) = \gamma _{\pi } \sum _{i \in N} \left| \frac{x_{i}}{s(x)} - \pi _{i} \right|\), for every \(\pi \in \varPi \). Let \(\pi ^{*} \in \varPi \) be a target such that \(\pi _{i}^{*} > 0\) for all \(i \in N\). We will show that, for any \(\pi \in \varPi \), we have \(\gamma _{\pi } = \gamma _{\pi ^{*}}\).

Let \(\lambda \) be a natural number such that \(\lambda \pi , \lambda \pi ^{*} \in \mathbb {N}^{N}\) and \(\lambda \pi _{j} , \lambda \pi ^{*}_{j} \ge 1\) for some \(j \in N\). By Invariance,

$$\begin{aligned} f(\lambda \pi + \mathbf {1}^{i} - \mathbf {1}^{j},\pi ) = 2 \gamma _{\pi }/\lambda = f(\lambda \pi ^{*} + \mathbf {1}^{i} - \mathbf {1}^{j},\pi ^{*}) = 2 \gamma _{\pi ^{*}}/\lambda . \end{aligned}$$

Hence \( \gamma _{\pi } = \gamma _{\pi ^{*}}\). This proves that \(\gamma _{\pi }\) does actually not depend on \(\pi \).

For the logical independence of the properties in Theorem 1, we provide four examples violating one and only one property.

  • Invariance\(f(x,\pi ) = \gamma _{\pi } \sum _{i \in N} \left| \frac{x_{i}}{s(x)} - \pi _{i} \right|\), with \(\gamma _{\pi }\) non-constant and positive. For instance, we could choose \(\gamma _{\pi }\) in such a way that f be normalized between 0 and 1, i.e., such that \(\max _{x \in X} f(x, \pi ) = 1\).

  • Homogeneity\(f(x,\pi ) = \sum _{i \in N} \left| x_{i} - s(x) \pi _{i} \right|.\) This index is similar to \(f_{\text{ DD }}\) but is based on deviations between absolute frequencies instead of deviations between relative frequencies.

  • Substitution (when \(n \ge 3\)): \(f(x,\pi ) = f_{\text{ L }}(x,\pi ).\)

  • Radial Linearity\(f(x,\pi ) = \left( \sum _{i \in N} \left| \frac{x_{i}}{s(x)} - \pi _{i} \right| \right) ^{2}.\)

\(\square \)

Proof of Proposition 1

Case \(n=2\). Suppose \(x \in \varPi _{J}\), with \(J \subset N\) and \(J \ne \emptyset \). We assume w.l.o.g. \(J = \{ 1 \}\). Then \(\left| \frac{x_{1}}{s(x)} - \pi _{1} \right| = \left| \frac{x_{2}}{s(x)} - \pi _{2} \right|= \max \left( \left| \frac{x_{1}}{s(x)} - \pi _{1} \right| , \left| \frac{x_{2}}{s(x)} - \pi _{2} \right| \right) \). Hence, \( \sum _{i \in N} \left| \frac{x_{i}}{s(x)} - \pi _{i} \right| = 2 \max _{i \in N} \left| \frac{x_{i}}{s(x)} - \pi _{i} \right|. \)

Case \(n=3\). Suppose \(x \in \varPi _{J}\), with \(J \subset N\) and J non-empty. Two cases can arise: (1) \(J = \{ i \}\) or (2) \(J = \{ i, j \}\). In case (1), we assume w.l.o.g. \(J = \{ 1 \}\). Then \(\left| \frac{x_{1}}{s(x)} - \pi _{1} \right| = \left| \frac{x_{2}}{s(x)} - \pi _{2} \right| + \left| \frac{x_{3}}{s(x)} - \pi _{3} \right| \) and \(\left| \frac{x_{1}}{s(x)} - \pi _{1} \right| \ge \max \left( \left| \frac{x_{2}}{s(x)} - \pi _{2} \right| , \left| \frac{x_{3}}{s(x)} - \pi _{3} \right| \right) \). Hence,

$$\begin{aligned} \sum _{i \in N} \left| \frac{x_{i}}{s(x)} - \pi _{i} \right|= & {} \left| \frac{x_{1}}{s(x)} - \pi _{1} \right| + \left| \frac{x_{2}}{s(x)} - \pi _{2} \right| + \left| \frac{x_{3}}{s(x)} - \pi _{3} \right| \\= & {} 2 \, \left| \frac{x_{1}}{s(x)} - \pi _{1} \right| = 2 \, \max _{i \in N} \left| \frac{x_{i}}{s(x)} - \pi _{i} \right|. \end{aligned}$$

In case (2), the reasoning is similar. \(\square \)

Lemma 10

(necessity of some conditions for \(\mathrm{f}_\mathrm{L}\)) Index \(f_{\text{ L }}\) satisfies Homogeneity, Radial Linearity, Substitution* and Invariance.

Proof

We already proved that Substitution* is necessary and Homogeneity obviously holds. Let us show the necessity of Radial Linearity. We have

$$\begin{aligned} f_{\text{ L }}\left( \frac{x+y}{2},\pi \right)= & {} \max _{i \in N} \left| \pi _{i} - \frac{x_{i} + y_{i}}{ 2s(x)} \right| = \frac{1}{2} \max _{i \in N} \left| 2 \pi _{i} - \frac{x_{i} + y_{i}}{ s(x)} \right| \\= & {} \frac{1}{2} \max _{i \in N} \left| \pi _{i} - \frac{x_{i}}{ s(x)} + \pi _{i} - \frac{ y_{i}}{ s(y)} \right|. \end{aligned}$$

Using \(x = \alpha s(x) \pi + (1-\alpha ) y\) and the fact that \(\pi _{i}-x_{i}/s(x) \ge 0\) iff \(\pi _{i}-y_{i}/s(y) \ge 0\), we have \(f_{\text{ L }}\left( \frac{x+y}{2},\pi \right) \)

$$\begin{aligned}= & {} \frac{1}{2} \max _{i \in N} \left| \pi _{i} - \frac{x_{i}}{ s(x)} + \pi _{i} - \frac{ y_{i}}{ s(y)} \right| = \frac{1}{2} \max _{i \in N} \left( \left| \pi _{i} - \frac{x_{i}}{ s(x)}\right| + \left| \pi _{i} - \frac{ y_{i}}{ s(y)} \right| \right) \\= & {} \frac{1}{2} \left( \max _{i \in N} \left| \pi _{i} - \frac{x_{i}}{ s(x)} \right| + \max _{i \in N} \left| \pi _{i} - \frac{ y_{i}}{ s(y)} \right| \right) = \frac{f_{\text{ L }}(x,\pi ) + f_{\text{ L }}(y,\pi )}{2}. \end{aligned}$$

Necessity of Invariance: \( f_{\text{ L }}(\lambda \pi + \mathbf {1}^{i} - \mathbf {1}^{j}, \pi ) = \left| \frac{\lambda \pi _{i} + 1}{\lambda } - \pi _{i}\right| = 1/\lambda .\) In the same way, \(f_{\text{ L }}(\lambda \pi ' + \mathbf {1}^{i} - \mathbf {1}^{j}, \pi ') = 1/\lambda \). This proves that \(f_{\text{ L }}(\lambda \pi + \mathbf {1}^{i} - \mathbf {1}^{j}, \pi ) = f_{\text{ L }}(\lambda \pi ' + \mathbf {1}^{i} - \mathbf {1}^{j}, \pi ').\)\(\square \)

For every \(i \in N\), \(\pi \) being fixed, define \(\varPi ^{+}_{i} = \{p \in \varPi : p_{i} - \pi _{i} \ge \left| p_{j} - \pi _{j} \right| \ \forall j \in N {\setminus } \{i\} \}\) and \(\varPi ^{-}_{i} = \{p \in \varPi : \pi _{i} - p_{i} \ge \left| p_{j} - \pi _{j} \right| \ \forall j \in N {\setminus } \{i\} \}.\) If \(n=2\) or 3, then, \(\forall i \in N\), \(\varPi ^{+}_{i} = \varPi _{\{i\}}\) and \(\varPi ^{-}_{i} = \varPi _{N {\setminus } \{i\}}\) (\(\varPi _{\{i\}}\) has been defined just after the proof of Lemma 2). But these identities are no longer true when \(n > 3\). Notice that \(\bigcup _{i \in N} (\varPi ^{+}_{i} \cup \varPi ^{-}_{i} ) = \varPi .\) These sets do not partition \(\varPi \) because they intersect. In particular, \(\forall i \in N\), the point \(\pi \) belongs to \(\varPi ^{+}_{i}\) and \(\varPi ^{-}_{i}\). Notice also that \(\varPi ^{+}_{i}\) and \(\varPi ^{-}_{i}\) contain only one point in some cases. More precisely, \(\varPi ^{+}_{i} = \{\pi \}\) if \(\pi _{i} = 1\). Similarly, \(\varPi ^{-}_{i} = \{\pi \}\) if \(\pi _{i} = 0\).

Lemma 11

Suppose \(\pi \) is fixed. For \(i \in N\), the set \(\varPi ^{+}_{i}\) is rational convex, i.e., \(\forall p,q \in \varPi ^{+}_{i}\), \(\forall \alpha \in [0,1] \cap \mathbb {Q}\), \(\alpha p + (1-\alpha )q \in \varPi ^{+}_{i}\). \(\varPi ^{-}_{i}\) is also rational convex.

Proof

This results from the fact that \(\varPi ^{+}_{i}\) is the intersection of n rational convex sets: (1) \(n-1\) sets corresponding to the inequalities \(p_{i} - \pi _{i} \ge \left| p_{j} - \pi _{j} \right| \ \forall j \in N {\setminus } \{i\}\) and (2) the set \(\varPi \). The proof for \(\varPi _i^-\) is similar \(\square \)

Given a point \(p \in \varPi \), there can be several \(i \in N\) such that \(p \in \varPi ^{+}_{i}\) or \(p \in \varPi ^{-}_{i}\). For every \(p \in \varPi \), if \(p \in \varPi ^{+}_{i}\), we define the set \(P^{+}(p,i) = \left\{ q \in \varPi ^{+}_{i} : q_{i} = p_{i}\right\} \) and, if \(p \in \varPi ^{-}_{i}\), we define the set \(P^{-}(p,i) = \left\{ q \in \varPi ^{-}_{i} : q_{i} = p_{i}\right\} .\) Notice that \(P^{+}(p,i)\) and \(P^{-}(p,i)\) are included in an affine subspace of dimension \(n-2\) containing p. This affine subspace is the intersection of the hyperplane \(\sum _{i \in N} p_{i} = 1\) with the hyperplane defined by \( q_{i} = p_{i}\).

Lemma 12

Suppose \(\pi \) is fixed and the diversity index f satisfies Homogeneity and Substitution*. For every \(i \in N\), for every \(p \in \varPi ^{+}_{i}\) (resp. \(\varPi ^{-}_{i}\)),

  1. 1.

    \(\forall q \in P^{+}(p,i)\) (resp. \(P^{-}(p,i)\)), we have \(F(p) = F(q)\), with F as in Lemma 1;

  2. 2.

    F(p) depends only on \(p_{i}\).

Proof

We prove it only for \(p \in \varPi ^{+}_{i}\) and \(q \in P^{+}(p,i)\).

(1) Consider some pq as in the statement of the lemma. By definition of F, there are xy such that \(p = x/s(x)\) and \(q = y/s(y)\). By Homogeneity, \(f(x,\pi ) = f(s(y)x, \pi )\) and \(f(y,\pi ) = f(s(x)y, \pi )\). The two parliaments s(y)x and s(x)y have the same size. Moreover \(q_{i} = p_{i}\). We therefore have \( s(x)y_{i} = s(y) x_{i}\). This means that we can ‘move’ from s(x)y to s(y)x by a series of successive exchanges between pairs of dimensions different from i, as in the statement of Substitution*. Therefore, \(f(s(x)y,\pi ) = f(s(y)x,\pi )\) and \(F(p) = F(q)\).

(2) For every \(q \in P^{+}(p,i)\), \(F(q)=F(p)\). Besides, \(q \in P^{+}(p,i)\) iff \(q_{i}= p_{i}\). Therefore, F(q) only depends on \(q_{i}\) and F(p) only depends on \(p_{i}\). \(\square \)

Lemma 13

Suppose \(\pi \) is fixed. Define the line segment \(r^i\) (resp. \(s^{i}\)) with endpoints \(\pi \) and \(\mathbf {1}^{i}\) (resp. \(\pi \) and \((\pi -(\pi _{i}\mathbf {1}^{i}))/(1-\pi _{i})\)). For every \(p \in \varPi ^{+}_{i}\), the set \(P^{+}(p,i)\) intersects \(r^{i}\). For every \(p \in \varPi ^{-}_{i}\), the set \(P^{-}(p,i)\) intersects \(s^{i}\).

Proof

By construction, \(r^{i} \in \varPi ^{+}_{i}\) and \(s^{i} \in \varPi ^{-}_{i}\). For every \(p \in \varPi ^{+}_{i}\), we have \(\pi _{i} \le p_{i} \le 1\). By construction, there is \(q \in r^{i}\) such that \(q_{i}= p_{i}\). Hence \(q \in P^{+}(p,i)\). For every \(p \in \varPi ^{-}_{i}\), we have \(0 \le p_{i} \le \pi _{i}\). By construction, there is \(q \in s^{i}\) such that \(q_{i}= p_{i}\). Hence \(q \in P^{-}(p,i)\). \(\square \)

Lemma 14

Suppose \(\pi \) is fixed and f satisfies Homogeneity and Radial Linearity. For each \(i \in N\), F varies linearly along \(r^{i}\) in \(\varPi ^{+}_{i}\) (resp. \(s^{i}\) in \(\varPi ^{-}_{i}\)).

Proof

We prove it only for \(\varPi ^{+}_{i}\). If \(\varPi ^{+}_{i} = \{\pi \}\), there is nothing to prove. We suppose henceforth that \(\varPi ^{+}_{i} \ne \{\pi \}\). Every point of \(r^{i}\) can be written as \(\alpha \pi + (1-\alpha ) \overline{p}\), with \(\alpha \in [0,1] \cap \mathbb {Q}\). Define the mapping \(G : [0,1] \cap \mathbb {Q} \rightarrow \mathbb {R}_{+}\) by \(G(\alpha ) = F(p)\) if \(p = \alpha \pi + (1-\alpha ) \overline{p}\). G is clearly well-defined. Consider any two points \(p,q \in r^{i}\), the position of which on \(r^{i}\) is characterized by \(\alpha \) and \(\beta \) respectively. Clearly, \((p+q)/2 \in r^{i}\) and \(\frac{p+q}{2} = \frac{\alpha + \beta }{2} \pi + \left( 1 - \frac{\alpha + \beta }{2} \right) \overline{p}\) with \((\alpha + \beta )/2 \in [0,1] \cap \mathbb {Q}\). Hence (2) can be written as \( G\left( \frac{\alpha + \beta }{2}\right) = \frac{G(\alpha ) + G(\beta )}{2}. \) This is Jensen’s equation and it holds for all \(\alpha , \beta \in [0,1] \cap \mathbb {Q}\). Its unique solution is \(G(\alpha ) = a \alpha + b\) (Aczél 1966, p. 43). Hence, F varies linearly along the line segment \(r^{i}\). \(\square \)

Lemma 15

Suppose \(\pi \) is fixed and the diversity index f satisfies Homogeneity, Substitution* and Radial Linearity. For every \(i \in N\), for every \(p \in \varPi ^{+}_{i}\), \(F(p) = \gamma ^{+}_{i,\pi } \max _{i \in N} \left| p_{i} - \pi _{i} \right|\) for some positive real number \(\gamma ^{+}_{i,\pi }\). Similarly, for every \(i \in N\), for every \(p \in \varPi ^{-}_{i}\), \(F(p) = \gamma ^{-}_{i,\pi } \max _{i \in N} \left| p_{i} - \pi _{i} \right|\) for some positive real number \(\gamma ^{-}_{i,\pi }\).

Proof

We prove it only for \(\varPi ^{+}_{i}\). Fix i. If \(\varPi ^{+}_{i} = \{\pi \}\), there is nothing to prove. We suppose henceforth that \(\varPi ^{+}_{i} \ne \{\pi \}\). We know from Lemma 12 that F is constant in each set \(P^{+}(p,i)\), for all \(p \in \varPi ^{+}_{i}\), and depends only on \(p_{i}\). By Lemma 13, we know that each set \(P^{+}(p,i)\) intersects the line segment \(r^{i}\). Finally, Lemma 14 tells us that F varies linearly along \(r^{i}\).

In conclusion, \(F(p) = \gamma ^{+}_{i,\pi } p_{i} + \delta ^{+}_{i} \) for some real numbers \(\gamma ^{+}_{i,\pi }, \delta ^{+}_{i}\). Notice that these real numbers are indexed by i because they can be different in every set \(\varPi ^{+}_{i}\). The definition of a diversity index imposes that \(F(\pi ) = \gamma ^{+}_{i,\pi } \pi _{i} + \delta ^{+}_{i} = 0\). Hence, \(\delta ^{+}_{i} = - \gamma ^{+}_{i,\pi } \pi _{i} \) and \(F(p) = \gamma ^{+}_{i,\pi } (p_{i}-\pi _{i}) = \gamma ^{+}_{i,\pi } \max _{j \in N} \left| p_{j} - \pi _{j} \right|\). \(\square \)

Lemma 16

Suppose \(n \ge 3\), \(\pi \) is fixed and the diversity index f satisfies Homogeneity, Substitution* and Radial Linearity. There exists \(\gamma _{\pi } \in \mathbb {R}_{++}\) such that, for every \(i \in N\) and for every \(p \in \varPi ^{+}_{i}\), \(F(p) = \gamma _{\pi } \max _{i \in N} \left| p_{i} - \pi _{i} \right|\).

Proof

Choose some \(i \in N\) with \(\varPi ^{+}_{i} \ne \{\pi \}\). Choose some other \(j \in N\). If \(\varPi ^{-}_{j} = \{\pi \}\), then we can choose \(\gamma ^{-}_{j,\pi }\) equal to \(\gamma ^{+}_{i,\pi }\) without any consequence. Otherwise, define p by \(p_{i} = \pi _{i} + \epsilon , p_{j} = \pi _{j} - \epsilon \) and \(p_{k}=\pi _{k}\) for every \(k \ne i,j\). If we choose \(\epsilon \) small enough, we are sure that \(p \in \varPi ^{+}_{i} \cap \varPi ^{-}_{j}\), with \(p \ne \pi \). Because \(p \in \varPi ^{+}_{i} \cap \varPi ^{-}_{j}\), we have (by Lemma 15) \(F(p) = \gamma ^{+}_{i,\pi } \max _{i \in N} \left| p_{i} - \pi _{i} \right| = \gamma ^{-}_{j,\pi } \max _{i \in N} \left| p_{i} - \pi _{i} \right| .\) Since \(\max _{i \in N} \left| p_{i} - \pi _{i} \right| \ne 0\) (because \(p \ne \pi \)), we obtain \(\gamma ^{+}_{i,\pi } = \gamma ^{-}_{j,\pi }\). We can repeat the same reasoning and we then find for instance \(\gamma ^{+}_{i,\pi } = \gamma ^{-}_{j,\pi } = \gamma ^{+}_{k,\pi } = \gamma ^{-}_{i,\pi } = \cdots \) This proves that \(\gamma ^{+}_{i,\pi } = \gamma ^{+}_{j,\pi } = \gamma ^{-}_{i,\pi } = \gamma ^{-}_{j,\pi }\) for all \(i, j \in N\). Finally, \(\gamma _{\pi } \in \mathbb {R}_{++}\) because, by definition of the diversity index, \(f(x, \pi ) \ge 0\) for all \(x \in X\) and \(f(x, \pi ) = 0\) iff \(x/s(x) = \pi \). \(\square \)

Proof of Theorem 2

Necessity has been proved in Lemma 10. If \(n=2\), Substitution and Substitution* are vacuous and, hence, the properties of Theorems 1 and 2 are equivalent. We therefore know that \(f= \gamma _{\pi } f_{\text{ DD }}\) for some \(\gamma _{\pi }\). By Proposition 1, we also know that \(f_{\text{ DD }} = 2 f_{\text{ L }}\). In conclusion, \(f = 2 \gamma _{\pi } f_{\text{ L }}\).

We now assume \(\pi \) is fixed and \(n \ge 3\). Thanks to Lemma 16 and the definition of F and g (Lemma 1), we find \(f(x,\pi ) = g(x) = \gamma _{\pi } \max _{i \in N} \left| x_{i}/s(x) - \pi _{i} \right|\).

We turn to the case where \(\pi \) is allowed to vary. By Theorem 2, we know that, for every \(\pi \in \varPi \), \(f(x,\pi ) = \gamma _{\pi } \max _{i \in N} \left| \frac{x_{i}}{s(x)} - \pi _{i} \right|\). Let \(\pi ^{*} \in \varPi \) be a target such that \(\pi _{i}^{*} > 0\) for all \(i \in N\). We will show that, \(\forall \pi \in \varPi \), we have \(\gamma _{\pi } = \gamma _{\pi ^{*}}\). Let \(\lambda \) be a natural number such that \(\lambda \pi , \lambda \pi ^{*} \in \mathbb {N}^{N}\) and \(\lambda \pi _{j} , \lambda \pi ^{*}_{j} \ge 1\) for some \(j \in N\). By Invariance, \(f(\lambda \pi + \mathbf {1}^{i} - \mathbf {1}^{j},\pi ) = \gamma _{\pi }/\lambda = f(\lambda \pi ^{*} + \mathbf {1}^{i} - \mathbf {1}^{j},\pi ^{*}) = \gamma _{\pi ^{*}} /\lambda .\) Hence \( \gamma _{\pi } = \gamma _{\pi ^{*}}\). This proves that \(\gamma _{\pi }\) does actually not depend on \(\pi \) and that f can be written as in the statement of Theorem 2.

For the logical independence of the properties in Theorem 2, we provide four examples violating one and only one property.

  • Invariance\(f(x,\pi ) = \gamma _{\pi } \max _{i \in N} \left| \frac{x_{i}}{s(x)} - \pi _{i} \right|, \) with \(\gamma _{\pi } >0 \) and non-constant.

  • Homogeneity\(f(x,\pi ) = \max _{i \in N} \left| x_{i} - s(x) \pi _{i} \right|.\)

  • Substitution*: \(f(x,\pi ) = f_{\text{ DD }}(x, \pi )\).

  • Radial Linearity\(f(x,\pi ) = \left( \max _{i \in N} \left| \frac{x_{i}}{s(x)} - \pi _{i} \right| \right) ^{2}. \)\(\square \)