
1 Introduction

In this paper we deal with voting procedures, maybe the most intuitively appealing examples of social choice functions, which are meant to determine the winner of an election as a function of the individual votes—cf., for a comprehensive exposition, in particular Pitt et al. [1], but also Pitt et al. [2, 3], Arrow, Sen and Suzumura [4], Kelly [5], Plott [6], Schwartz [7], etc.

Basically, we consider the following problem: we have n, \(n\ge 2\), individuals who present their testimonies over a set of m, \(m\ge 2\), options. The testimonies are exemplified here by individual preference relations, i.e. binary relations over the set of options—orderings of the options. We look for a social choice function, or—to be more specific—a voting procedure, that would select, as a function of the individual preference relations, a set of options that best reflects the opinion of the whole group.

A traditional line of research here has been whether, and to what extent, particular voting procedures do or do not satisfy some plausible and reasonable axioms and conditions, maybe best exemplified by the famous Arrow's theorem and the many paradoxes of voting. We will not deal with this here; for details, cf. Arrow [8], Gibbard [9], Kelly [10], May [11], Nurmi [12], Riker [13], Satterthwaite [14], etc.

We will deal with an equally important, and practically perhaps even more important, problem of how similar or dissimilar particular voting procedures are. This was discussed in Nurmi's [12] book, cf. also Baigent [15], Elkind, Faliszewski and Slinko [16], McCabe-Dansted and Slinko [17], Richelson [18], etc.

In this paper we will deal with the above mentioned problem of how to measure the similarity and dissimilarity of voting procedures. First, we will take into account only a subset of well known voting procedures. Then, we will employ the idea of a qualitative similarity (and its related dissimilarity) analysis of voting procedures proposed by Fedrizzi, Kacprzyk and Nurmi [19] in which Pawlak’s rough sets (cf. Pawlak [20, 21], cf. also Pawlak and Skowron [22]), have been used. Then, we will use the idea of the recent approach proposed by Kacprzyk, Nurmi and Zadrożny [23] in which the above mentioned more qualitative rough sets based analysis has been extended with a quantitative analysis by using the Hamming and Jaccard-Needham similarity indexes.

This paper is a further extension of Kacprzyk, Nurmi and Zadrożny [23]. Basically, we consider some other more popular similarity (and their related dissimilarity) measures:

  • Jaccard-Needham (to repeat, for completeness, the results already obtained for this measure in [23]),

  • Dice,

  • correlation,

  • Yule,

  • Russell–Rao,

  • Sokal–Michener,

  • Rogers–Tanimoto, and

  • Kulczyński—cf. Tubbs [24] for details.

Notice that these measures are just a small subset of the multitude of similarity measures known in the literature, cf. Choi, Cha and Tappert [25]. Moreover, in this paper we limit our attention to those similarity measures which, first of all, take values in [0, 1], and whose corresponding dissimilarity measures are dual in the sense that their values add up to 1, which is not the case for all measures.

Notice that this approach is different both conceptually and technically from the approach by Kacprzyk and Zadrożny [26, 27] in which some distinct classes of voting procedures are determined using the concept of Yager’s [28] ordered weighted averaging (OWA) aggregation operator (cf. Yager and Kacprzyk [29], Yager, Kacprzyk and Beliakov [30]), and the change of the order of variables to be aggregated and the type of weights (i.e. the aggregation behavior) determines various classes of voting procedures.

2 Foundations of the Theory of Rough Sets

Rough sets were proposed in the early 1980s by Pawlak [20], and then extensively developed by Pawlak [21], Polkowski (e.g., [31]), Skowron (e.g., [22, 32, 33]), Słowiński (e.g., [34]), etc., and their collaborators. Rough sets theory is a conceptually simple and intuitively appealing tool for the representation and processing of imprecise knowledge, when the classes into which the objects are to be classified are imprecise but can be approximated, from above and from below, by precise sets.

Here we will just briefly recall some basic concepts and properties of rough sets theory which may be useful for our purpose, and for more detail, cf. Pawlak [20, 21], Polkowski (e.g., [31]), Skowron (e.g., [22], Pawlak and Skowron [32, 35], Pawlak et al. [33]), and Greco et al. (e.g., [34]) etc. to just list a few.

Let \(U=\{u\}\) be a universe of discourse. It can usually be partitioned in various ways into a family R of partitionings, or equivalence relations defined on U. A knowledge base, denoted by K, is the pair \(K=(U, \mathbf{R})\). Let now P be a non-empty subset of R, \(\mathbf{P} \subset \mathbf{R}, \mathbf{P}\ne \emptyset \). Then, the intersection of all equivalence relations (or partitionings) in P, which is also an equivalence relation, is called an indiscernibility relation over P and is denoted by \(IND(\mathbf{P})\).

The family of its equivalence classes is termed the P-basic knowledge about U in K and it represents all that can be said about the elements of U under P. Therefore, one cannot classify the elements of U any deeper than to the equivalence classes of \(IND(\mathbf{P})\). For instance, if for some U, \(\mathbf{P}=\{R_1, R_2\}\) such that \(R_1\) partitions the objects into the classes labeled “heavy” and “lightweight”, and \(R_2\) partitions into the classes labeled “black” and “white”, then all that can be said about any element of U is that it belongs to one of: “heavy-and-black”, “heavy-and-white”,“lightweight-and-black”, “lightweight-and-white”.

Equivalence classes of \(IND(\mathbf{P})\) are called the basic categories (concepts) of knowledge P. If \(Q \in \mathbf{R}\), that is, Q is an equivalence relation on U, then its equivalence classes are called the Q-elementary categories (concepts) of knowledge R.

If \(X \subset U\), and R is an equivalence relation on U, then X is called R-definable or R-exact if it is a union of some R-elementary categories (R-basic categories); otherwise, it is called R-rough.

Rough sets can be approximately defined by associating with any \(X \subset U\) and any equivalence relation R on U the following two sets (U / R denotes the set of all equivalence classes of R):

  • a lower approximation of X:

    $$\begin{aligned} R_LX = \bigcup \{Y \in U/R \mid Y \subset X\} \end{aligned}$$
    (1)
  • an upper approximation of X:

    $$\begin{aligned} R_UX = \bigcup \{Y \in U/R \mid Y \cap X \ne \emptyset \} \end{aligned}$$
    (2)

and a rough set is defined as the pair \((R_LX, R_UX)\).

The lower approximation yields those classes of R which are subsets of X, i.e. it contains those elements of U which are necessarily also elements of X, while the upper approximation yields those classes of R which have at least one common element with X.
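To make the definitions (1) and (2) concrete, here is a minimal Python sketch; the function names and the toy partition are ours, not part of the formalism:

```python
# Minimal sketch of the lower (1) and upper (2) approximations.
# The partition U/R is given directly as a list of equivalence classes (sets).

def lower_approx(partition, X):
    """Union of the classes Y in U/R with Y a subset of X."""
    result = set()
    for Y in partition:
        if Y <= X:
            result |= Y
    return result

def upper_approx(partition, X):
    """Union of the classes Y in U/R having an element in common with X."""
    result = set()
    for Y in partition:
        if Y & X:
            result |= Y
    return result

partition = [{1, 2}, {3, 4}, {5, 6}]   # U/R for U = {1, ..., 6}
X = {2, 3, 4}                          # R-rough: not a union of classes

print(sorted(lower_approx(partition, X)))   # → [3, 4]
print(sorted(upper_approx(partition, X)))   # → [1, 2, 3, 4]
```

As expected, X lies between its two approximations: the lower one keeps only the class {3, 4} entirely inside X, while the upper one also picks up {1, 2}, which merely intersects X.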

For our purposes two concepts related to the reduction of knowledge are crucial. First, for a family of equivalence relations R on U, one of its elements, Z, is called dispensable in R if

$$\begin{aligned} IND(\mathbf{R}) = IND(\mathbf{R} \setminus \{Z\}) \end{aligned}$$
(3)

and otherwise it is called indispensable. If each Z in R is indispensable, then R is called independent.

For a family of equivalence relations, R, and its subfamily, \(\mathbf{Q} \subset \mathbf{R}\), if:

  • \(\mathbf{Q}\) is independent, and

  • \(IND(\mathbf{Q}) = IND(\mathbf{R})\),

then \(\mathbf{Q}\) is called a reduct of R; clearly, it need not be unique.

The core of R is the set of all indispensable equivalence relations in R, and is the intersection of all reducts of R—cf. Pawlak [20].

From the point of view of knowledge reduction, the core consists of those classifications (equivalence relations) which are the most essential in the knowledge available in that no equivalence relation that belongs to the core can be discarded in the knowledge reduction process without distorting the knowledge itself. A reduct yields a set of equivalence relations which is sufficient for the characterization of knowledge available without losing anything relevant.
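The notions of dispensability (3), reducts and the core can be sketched in a few lines of Python. The representation of each equivalence relation as a labeling of the elements of U, and the reuse of the "heavy/light", "black/white" example from above, are our illustrative choices:

```python
from itertools import combinations

# Sketch of dispensability (3), reducts and the core.
# Each equivalence relation is a labeling of U's elements; IND of a family
# is the partition induced by the tuples of labels.

def ind(family, U):
    blocks = {}
    for u in U:
        blocks.setdefault(tuple(r[u] for r in family), set()).add(u)
    return frozenset(frozenset(b) for b in blocks.values())

def core_and_reducts(family, U):
    full = ind(family, U)
    # indispensable relations: removing them changes IND
    core = [r for r in family
            if ind([s for s in family if s is not r], U) != full]
    # reducts: independent subfamilies with the same IND
    reducts = []
    for k in range(1, len(family) + 1):
        for sub in map(list, combinations(family, k)):
            if ind(sub, U) == full and all(
                    ind([s for s in sub if s is not r], U) != full for r in sub):
                reducts.append(sub)
    return core, reducts

U = [1, 2, 3, 4]
R1 = {1: "heavy", 2: "heavy", 3: "light", 4: "light"}
R2 = {1: "black", 2: "white", 3: "black", 4: "white"}
R3 = {1: "h", 2: "h", 3: "l", 4: "l"}      # same partition as R1
core, reducts = core_and_reducts([R1, R2, R3], U)
print(len(core), len(reducts))             # → 1 2
```

Here R1 and R3 induce the same partition, so each is dispensable; the core is {R2}, the reducts are {R1, R2} and {R2, R3}, and the core is indeed their intersection.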

In this paper our analysis is in terms of indiscernibility relations; for the concept of a discernibility relation, cf. Yao and Zhao [36].

3 A Comparison of Voting Procedures Using Rough Sets

The problem of comparison and evaluation of voting procedures (social choice functions) is very important and has been widely studied in the literature, cf. Richelson [18], Straffin [37], Nurmi [12], to name a few.

A simple, intuitively appealing, rough set based approach, was proposed by Fedrizzi, Kacprzyk and Nurmi [19]. It was more qualitative, and was extended to include more quantitative aspects by Kacprzyk, Nurmi and Zadrożny [23]. We will now briefly recall this approach since it will provide a point of departure for this paper.

We consider the following 13 popular voting procedures:

  1. Amendment: proposals (options) are paired (compared) with the status quo. If a variation on the proposal is introduced, it is paired with the proposal and voted on as an amendment prior to the final vote. If the amendment succeeds, the amended proposal is eventually paired with the status quo in the final vote; otherwise, the amendment is eliminated prior to the final vote.

  2. Copeland: selects the option with the largest so-called Copeland score, i.e. the number of times an option beats other options minus the number of times it loses to other options, both in pairwise comparisons.

  3. Dodgson: each voter gives a rank-ordered list of all options, from the best to the worst, and the winner is the option for which the minimum number of pairwise exchanges (summed over all voters) is needed for it to become the Condorcet winner, i.e. to defeat all other options in pairwise comparisons with a majority of votes.

  4. Schwartz: selects the set of options over which the collective majority preferences are cyclic and which, as an entire cycle, is preferred over the other options; this set consists of a single element if there is a Condorcet winner, and otherwise of several options.

  5. Max-min: selects the option for which the minimal support in all pairwise comparisons is the largest.

  6. Plurality: each voter selects one option (or none, in the case of abstention), and the option(s) with the most votes win(s).

  7. Borda: each voter provides a linear ordering of the options, which are assigned a score (the so-called Borda score) as follows: if there are m options, \(m-1\) points are given to the first-ranked option, \(m-2\) to the second-ranked, etc.; these numbers are summed up for each option to obtain its Borda score, and the option(s) with the highest Borda score win(s).

  8. Approval: each voter selects a subset of the options, and the option(s) with the most votes win(s).

  9. Black: selects the Condorcet winner, i.e. an option that beats or ties with all others in pairwise comparisons, when it exists, and the Borda count winner (as described above) otherwise.

  10. Runoff: the option ranked first by more than a half of the voters is chosen, if one exists. Otherwise, the two options ranked first by more voters than any other option are compared with each other, and the winner is the one ranked first (among these two) by more voters.

  11. Nanson: the Borda count is used iteratively, at each step dropping the option with the smallest Borda score; in fact, this is sometimes called a modified version of the Nanson rule, cf. Fishburn [38].

  12. Hare: the ballots are linear orders over the set of options, and the options which receive the lowest number of first places are repeatedly deleted; the option(s) that remain(s) is/are declared the winner(s).

  13. Coombs: each voter rank-orders all of the options; if one option is ranked first by an absolute majority of the voters, it is the winner. Otherwise, the option ranked last by the largest number of voters is eliminated, and the procedure is repeated.
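As a small illustration of one of these rules, here is a minimal sketch of the Borda count (item 7); the function name and ballot format are our assumptions:

```python
from collections import Counter

# Sketch of the Borda count (item 7): each ballot is a linear order of the
# m options, best first; the i-th ranked option scores m - 1 - i points.

def borda_winners(ballots):
    m = len(ballots[0])
    score = Counter()
    for ballot in ballots:
        for i, option in enumerate(ballot):
            score[option] += m - 1 - i
    best = max(score.values())
    return sorted(o for o in score if score[o] == best), dict(score)

ballots = [["a", "b", "c"], ["a", "c", "b"], ["b", "c", "a"]]
winners, scores = borda_winners(ballots)
print(winners, scores)   # → ['a'] {'a': 4, 'b': 3, 'c': 2}
```

With three options, the first, second and third places score 2, 1 and 0 points; option "a" collects 2 + 2 + 0 = 4 points and wins.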

As to the criteria against which the above mentioned voting procedures are compared, we use some basic and popular ones presented in Nurmi's classic book [12]. More specifically, we will consider 7 criteria the voting procedures are to satisfy:

  1. A—Condorcet winner,

  2. B—Condorcet loser,

  3. C—majority winner,

  4. D—monotonicity,

  5. E—weak Pareto winner,

  6. F—consistency, and

  7. G—heritage,

the essence of which can be summarized as:

  1. Condorcet winner: if an option beats each other option in pairwise comparisons, it should always win.

  2. Condorcet loser: if an option loses to each other option in pairwise comparisons, it should always lose.

  3. Majority winner: if there exists a majority (at least a half of the voters) that ranks a single option first, higher than all other options, that option should win.

  4. Monotonicity: it should be impossible to cause a winning option to lose by ranking it higher, or to cause a losing option to win by ranking it lower.

  5. Weak Pareto winner: whenever all voters rank an option higher than another option, the latter should never be chosen.

  6. Consistency: if the electorate is divided in two and an option wins in both parts, it should also win overall.

  7. Heritage: if an option is chosen from the entire set of options using a particular voting procedure, it should also be chosen, under the same preferences and the same voting procedure, from every subset of options to which it belongs.
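To make the first of these criteria concrete, here is a minimal sketch of a Condorcet winner test on ranked ballots; the ballot format and function name are our assumptions:

```python
# Sketch of the Condorcet winner criterion: an option that beats every
# other option in pairwise comparisons by a majority of votes.
# Ballots are linear orders over the options, best first.

def condorcet_winner(ballots):
    options = ballots[0]
    n = len(ballots)
    for a in options:
        # a must be ranked above every other option o on a majority of ballots
        if all(sum(b.index(a) < b.index(o) for b in ballots) > n / 2
               for o in options if o != a):
            return a
    return None   # no Condorcet winner exists (e.g. a majority cycle)

ballots = [["a", "b", "c"], ["a", "c", "b"], ["b", "a", "c"]]
print(condorcet_winner(ballots))   # → a
```

Here "a" beats "b" on two of the three ballots and "c" on all three, so it is the Condorcet winner; with a cyclic profile the function returns None.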

We start with an illustrative account of which voting procedure satisfies which criteria (“0” stands for “does not satisfy” and “1” stands for “satisfies”), presented in Table 1; the 13 voting procedures correspond to the rows, while the 7 criteria correspond to the columns, here and in the next tables.

Table 1 Satisfaction of 7 criteria by 13 voting procedures

Though the data shown in Table 1 can immediately be used for the comparison of the 13 voting procedures against the 7 criteria by a simple pairwise comparison of rows, a natural first step is to check whether, under the information available in that table, all the 13 voting procedures are really different, and whether all the 7 criteria are really needed for a meaningful comparison.

Quite a natural, simple and intuitively appealing approach was proposed in this respect by Fedrizzi et al. [19] using rough sets. We will present below its essence.

4 Simplification of Information on the Voting Procedures and Criteria to Be Fulfilled

We will now show the essence of the approach of Fedrizzi et al. [19], based on the application of some elements of rough sets theory briefly presented in Sect. 2, to simplify the information in the source Table 1. We will basically look for the crucial properties (attributes) of the voting procedures that make it possible to merge them into one class of voting procedures under the natural condition that they satisfy the same criteria.

First, one can see that the amendment procedure and Schwartz's choice function have identical properties in Table 1, so one of them can be deleted; the same holds for Copeland's and Black's choice functions, and for the runoff, Hare's and Coombs' choice functions. We therefore obtain Table 2.

Table 2 Satisfaction of 7 criteria by 9 equivalent (classes of) voting procedures

Thus, we have 9 “really different” (classes of) voting procedures:

  1. Amendment (which now stands for Amendment and Schwartz),

  2. Copeland (which now stands for Copeland and Black),

  3. Dodgson,

  4. Max-min,

  5. Plurality,

  6. Borda,

  7. Approval,

  8. Runoff (which now stands for Runoff, Hare and Coombs), and

  9. Nanson.

Now, we look for the indispensable criteria (cf. Sect. 2). Each attribute (criterion) generates an equivalence relation: the voting procedures that fulfill the criterion belong to one class, and those which do not to another. We can thus eliminate the criteria one by one and check whether the voting procedures can still be discerned from each other in terms of the remaining criteria.
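The elimination test just described—drop a criterion (column) and check whether any two voting procedures (rows) become indistinguishable—can be sketched as follows; since Table 2 is not reproduced here, the 0/1 matrix below is purely illustrative:

```python
# Sketch of the criterion-elimination test: a criterion (column) is
# dispensable iff all rows remain pairwise distinct after it is dropped.
# The matrix below is illustrative only, NOT the actual Table 2.

table = {
    "Amendment": (1, 1, 1, 0),
    "Copeland":  (1, 1, 1, 1),
    "Plurality": (0, 0, 1, 1),
    "Borda":     (0, 1, 0, 1),
}
criteria = ["A", "B", "D", "E"]

def dispensable(table, j):
    reduced = [tuple(v for k, v in enumerate(row) if k != j)
               for row in table.values()]
    return len(set(reduced)) == len(reduced)

for j, c in enumerate(criteria):
    print(c, "dispensable" if dispensable(table, j) else "indispensable")
```

In this toy matrix only E is indispensable: without that column, the Amendment and Copeland rows coincide.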

Therefore, if we start from Table 2, by eliminating criterion A we get Table 3.

Table 3 Elimination of criterion A from Table 2
Table 4 Elimination of criterion B from Table 2
Table 5 Elimination of criterion C from Table 2

The last two rows of Table 3, i.e. Runoff and Nanson, are identical; criterion A is thus necessary to distinguish them, i.e. criterion A is indispensable.

And, analogously, we delete criterion B and obtain Table 4.

The Copeland and Max-Min procedures become indistinguishable so that criterion B is indispensable.

Next, the elimination of criterion C leads to Table 5.

All rows in Table 5 are different, so criterion C is not needed to differentiate between the voting procedures, and we can conclude that C is dispensable.

Further, we delete criterion D and obtain Table 6. The Copeland and Nanson choice functions are now indistinguishable which means that criterion D is indispensable.

Table 6 Elimination of criterion D from Table 2

Now, we eliminate criterion E and get Table 7.

Table 7 Elimination of criterion E from Table 2

The two uppermost rows are now identical, so criterion E is needed, i.e. it is indispensable.

Next, criterion F is eliminated as shown in Table 8 in which no pair of rows is identical so that criterion F is dispensable.

Table 8 Elimination of criterion F from Table 2

Finally, criterion G is eliminated which is shown in Table 9. We can see that all rows are different so that we can conclude that criterion G is dispensable.

Table 9 Elimination of criterion G from Table 2

It is easy to notice that the core, i.e. the set of indispensable criteria, is {A, B, D, E}, and that the reduct is in this case unique and also equal to {A, B, D, E}. That is, we need just this set of criteria to distinguish the particular voting procedures from each other (naturally, under the set of criteria assumed).

Table 10 Satisfaction of the criteria belonging to the core by the particular voting procedures
Table 11 An economical characterization of the voting procedures shown in Table 10

We can then consider the reduct (or core). In Table 10 we show which criteria are indispensable in the sense that if we do not take them into account, two or more rows (corresponding to the respective voting procedures) become indistinguishable. For example, without criterion E, Amendment and Copeland would be indistinguishable; without D, Copeland and Nanson would be indistinguishable; without B, Copeland and Max-min would be indistinguishable, etc.

Table 10 expresses the most crucial properties or criteria of the voting procedures in the sense that the information it conveys would be sufficient to restore all information given in the source Table 2. Therefore, for an “economical” characterization of the voting procedures, we can use the values of the criteria given in Table 10 and present the results as in Table 11 where the subscripts of the particular criteria stand for the values they take on, for instance, to most economically characterize Amendment, the A, B and D should be 1 and E should be 0, etc.

This is, however, not yet the most economical characterization, but this issue will not be dealt with here; we refer the interested reader to Fedrizzi, Kacprzyk and Nurmi [19] or Kacprzyk, Nurmi and Zadrożny [23], where it is shown that the minimal (most economical) characterization of the voting procedures in terms of the information given in Table 2 can be portrayed as in Table 12.

Table 12 The minimal (most economical) characterization of the voting procedures shown in Table 10

This very compact representation demonstrates the power of rough sets theory.

5 Similarity and Dissimilarity of the Voting Procedures: A Quantitative Approach Based on Similarity and Dissimilarity Measures for Binary Patterns

As can be seen in the previous section, the rough sets based analysis has made it possible to find a smaller set of choice functions by merging those which can be deemed similar. This rather qualitative result is clearly only the first step. Further steps towards a more quantitative analysis can be made, using elements of rough sets theory, by means of some indiscernibility analyses. This was proposed by Fedrizzi, Kacprzyk and Nurmi [19], and then extended by Kacprzyk, Nurmi and Zadrożny [23]. We will not deal with this approach here and refer the interested reader to the above mentioned papers.

In this paper we will approach the problem of measuring the similarity and dissimilarity in a more quantitative way, using some similarity and dissimilarity measures, but going beyond the classic Hamming and Jaccard–Needham measures used in Kacprzyk, Nurmi and Zadrożny [23].

We take again as the point of departure the characterization of the voting procedures as shown in Table 2, that is, just after the reduction of identical rows in Table 1, but—to better show the generality of our approach—without all further reductions (or a representation size reduction) as proposed later on and presented in Tables 3–10.

The data sets involved are in fact binary patterns, and though there is a multitude of similarity/dissimilarity measures for binary patterns, we will concentrate here on the measures given by Tubbs [24], which are useful in matching binary patterns in pattern recognition. We will follow Tubbs' notation to a large extent.

A binary vector Z of dimension N is defined as:

$$\begin{aligned} Z = (z_1, z_2, \ldots , z_N) \end{aligned}$$
(4)

where \(z_i \in \{0,1\}\), \(\forall i \in \{1,2, \ldots , N\}\).

The set of all N-dimensional binary vectors is denoted by \(\varOmega \), the unit binary vector, \(I \in \varOmega \), is a binary vector such that \(z_i = 1, \forall i\in \{1,2, \ldots , N\}\), and the complement of a binary vector \(Z \in \varOmega \) is \(\overline{Z} = I - Z\).

The magnitude of a binary vector \(Z \in \varOmega \) is

$$\begin{aligned} \mid Z \mid = \sum _{i=1}^N z_i \end{aligned}$$
(5)

that is, the number of elements which are equal to 1.

If we have two binary vectors, \(X, Y \in \varOmega \), then we denote by \(S_{i,j}(X,Y)\) the number of matches of i in vector X and j in vector Y, \(i,j \in \{0,1\}\). That is, if we have two vectors:

$$ X = [0, 1, 1, 0, 1, 0, 0, 1, 1, 0] $$
$$ Y = [1, 1, 0, 0, 1, 1, 0, 0, 1, 0] $$

then we have:

$$\begin{aligned} S_{00}(X,Y)&= 3\\ S_{01}(X,Y)&= 2\\ S_{10}(X,Y)&= 2\\ S_{11}(X,Y)&= 3 \end{aligned}$$

Formally, these counts can be defined as follows. First, for vectors \(X=(x_1,x_2,\ldots ,x_N)\) and \(Y=(y_1,y_2,\ldots ,y_N)\):

$$\begin{aligned} v_{ij} = \left\{ \begin{array}{ll} 1 &{} \text { if } x_i = y_j\\ 0 &{} \text { otherwise} \end{array} \right. \end{aligned}$$
(6)
$$\begin{aligned} v_{ij}^k(X,Y) = \left\{ \begin{array}{ll} 1 &{} \text { if } x_k = i \text{ and } y_k = j\\ 0 &{} \text { otherwise} \end{array} \right. \end{aligned}$$
(7)

then

$$\begin{aligned} S_{ij}(X,Y) = \sum _{k=1}^N v_{ij}^k(X,Y) \end{aligned}$$
(8)

One can easily notice that

$$\begin{aligned} S_{00}(X,Y) = \overline{X} \times \overline{Y}^T \end{aligned}$$
(9)
$$\begin{aligned} S_{11}(X,Y) = X \times Y^T \end{aligned}$$
(10)

where “\(\times \)” denotes the matrix product, the binary vectors being treated as \(1 \times N\) matrices.
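For the worked example above, the counts \(S_{ij}\) can be computed directly from (7)–(8); a minimal sketch:

```python
# Direct computation of the matching counts S_ij of (7)-(8) for the
# example vectors X and Y given above.

def s(i, j, X, Y):
    # number of positions k with x_k = i and y_k = j
    return sum(1 for x, y in zip(X, Y) if (x, y) == (i, j))

X = [0, 1, 1, 0, 1, 0, 0, 1, 1, 0]
Y = [1, 1, 0, 0, 1, 1, 0, 0, 1, 0]

print([s(0, 0, X, Y), s(0, 1, X, Y), s(1, 0, X, Y), s(1, 1, X, Y)])
# → [3, 2, 2, 3], matching the worked example above
```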

Following the notation of Tubbs [24], the \(S_{ij}\)’s, \(i,j \in \{0,1\}\), can be used to define many well known measures of similarity and dissimilarity, and we will consider here the following ones (we follow here the source terminology from that paper but in the literature sometimes slightly different names are used):

  • Jaccard-Needham,

  • Dice,

  • correlation,

  • Yule,

  • Russell–Rao,

  • Sokal–Michener,

  • Rogers–Tanimoto, and

  • Kulczyński.

These measures, both of similarity, \(S_{.}(X, Y)\), and their corresponding measures of dissimilarity, \(D_{.}(X,Y)\), are defined in terms of \(S_{ij}(X,Y)\) as follows (we omit the arguments \((X,Y)\), for brevity):

  • Jaccard–Needham:

    $$\begin{aligned} S_{J-N} = \frac{S_{11}}{S_{11} + S_{10} + S_{01}} \end{aligned}$$
    (11)
    $$\begin{aligned} D_{J-N} = \frac{S_{10} + S_{01}}{S_{11} + S_{10} + S_{01}} \end{aligned}$$
    (12)
  • Dice

    $$\begin{aligned} S_{D}=\frac{2S_{11}}{2S_{11} + S_{10} + S_{01}} \end{aligned}$$
    (13)
    $$\begin{aligned} D_D = \frac{S_{10} + S_{01}}{2S_{11} + S_{10} + S_{01}} \end{aligned}$$
    (14)
  • Correlation

    $$\begin{aligned} S_C = \frac{1}{\sigma } (S_{11} S_{00} - S_{10} S_{01}) \end{aligned}$$
    (15)
    $$\begin{aligned} D_C = \frac{1}{2} - \frac{1}{2 \sigma } (S_{11}S_{00} - S_{10}S_{01}) \end{aligned}$$
    (16)

    where

    $$\begin{aligned} \sigma = \sqrt{(S_{10} + S_{11})(S_{01} + S_{00})(S_{11} + S_{01})(S_{00} + S_{10})}; \end{aligned}$$
    (17)
  • Yule

    $$\begin{aligned} S_Y = \frac{S_{11}S_{00} - S_{10}S_{01}}{S_{11} S_{00} + S_{10} S_{01}} \end{aligned}$$
    (18)
    $$\begin{aligned} D_Y = \frac{S_{10} S_{01}}{S_{11} S_{00} + S_{10} S_{01}} \end{aligned}$$
    (19)
  • Russell–Rao

    $$\begin{aligned} S_{R-R} = \frac{S_{11}}{N} \end{aligned}$$
    (20)
    $$\begin{aligned} D_{R-R} = \frac{N - S_{11}}{N} \end{aligned}$$
    (21)
  • Sokal–Michener

    $$\begin{aligned} S_{S-M} = \frac{S_{11} + S_{00}}{N} \end{aligned}$$
    (22)
    $$\begin{aligned} D_{S-M} = \frac{S_{10} + S_{01}}{N} \end{aligned}$$
    (23)
  • Rogers–Tanimoto

    $$\begin{aligned} S_{R-T} = \frac{S_{11} + S_{00}}{S_{11} + S_{00} + 2 S_{10} + 2 S_{01}} \end{aligned}$$
    (24)
    $$\begin{aligned} D_{R-T} = \frac{2 S_{10} + 2 S_{01}}{S_{11} + S_{00} + 2 S_{10} + 2 S_{01}} \end{aligned}$$
    (25)
  • Kulczyński

    $$\begin{aligned} S_K = \frac{S_{11}}{S_{10} + S_{01}} \end{aligned}$$
    (26)
    $$\begin{aligned} D_K = \frac{S_{10} + S_{01} - S_{11} + N}{S_{10} + S_{01} + N} \end{aligned}$$
    (27)

Notice that though not all the similarity measures employed are normalized, their respective dissimilarity measures are all normalized to the unit interval [0, 1], which is usually welcome in applications, also in our context. On the other hand, not all the measures exhibit the metric property, but this will not be discussed in this paper as the practical importance of this property is not clear.
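A minimal sketch computing two of the measure pairs, Jaccard–Needham (11)–(12) and Sokal–Michener (22)–(23), for the example vectors above; it confirms that each similarity and its dual dissimilarity add up to 1:

```python
# Sketch of the Jaccard-Needham (11)-(12) and Sokal-Michener (22)-(23)
# measure pairs for the example vectors X and Y used above.

def s(i, j, X, Y):
    return sum(1 for x, y in zip(X, Y) if (x, y) == (i, j))

X = [0, 1, 1, 0, 1, 0, 0, 1, 1, 0]
Y = [1, 1, 0, 0, 1, 1, 0, 0, 1, 0]
N = len(X)
s00, s01, s10, s11 = s(0, 0, X, Y), s(0, 1, X, Y), s(1, 0, X, Y), s(1, 1, X, Y)

S_jn = s11 / (s11 + s10 + s01)            # Jaccard-Needham similarity (11)
D_jn = (s10 + s01) / (s11 + s10 + s01)    # ... and dissimilarity (12)
S_sm = (s11 + s00) / N                    # Sokal-Michener similarity (22)
D_sm = (s10 + s01) / N                    # ... and dissimilarity (23)

print(round(S_jn + D_jn, 10), round(S_sm + D_sm, 10))   # → 1.0 1.0
```

With \(S_{00}=3\), \(S_{01}=2\), \(S_{10}=2\), \(S_{11}=3\) this gives \(S_{J\text{-}N}=3/7\), \(D_{J\text{-}N}=4/7\) and \(S_{S\text{-}M}=0.6\), \(D_{S\text{-}M}=0.4\).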

Now, we will use these measures to evaluate the similarity and dissimilarity of the voting procedures considered in this paper.

We will use as the point of departure the binary matrix given in Table 2, which shows the satisfaction (\(=1\)) or lack of satisfaction (\(=0\)) of criteria A–G by the 9 (classes of) voting procedures; this table is repeated for convenience as Table 13.

Table 13 Satisfaction of 7 criteria by 9 equivalent (classes of) voting procedures, cf. Table 2

Now, we will calculate \(S_{ij}\), \(i,j \in \{0,1\}\), according to (6)–(8), for the particular pairs of the 9 voting procedures; the results are presented in Table 14, whose entries are given as \([S_{00}, S_{01}, S_{10}, S_{11}]\) for each pair.

Table 14 Values of \([S_{00}, S_{01}, S_{10}, S_{11}]\) for the particular pairs of the voting procedures

Following (11)–(27) and taking as the point of departure the values of \([S_{00}, S_{01}, S_{10}, S_{11}]\) shown in Table 14, we can calculate the values of the particular similarity and dissimilarity indexes calculated using the methods of:

  • Jaccard-Needham,

  • Dice,

  • Correlation,

  • Yule,

  • Russell–Rao,

  • Sokal–Michener,

  • Rogers–Tanimoto, and

  • Kulczyński,

which are shown in the consecutive Tables 15, 16, 17, 18, 19, 20, 21 and 22.

Table 15 Values of [degree of similarity, degree of dissimilarity] for the particular pairs of voting procedures using the Jaccard–Needham measures of similarity (11) and dissimilarity (12)
Table 16 Values of [degree of similarity, degree of dissimilarity] for the particular pairs of voting procedures using the Dice measures of similarity (13) and dissimilarity (14)
Table 17 Values of [degree of similarity, degree of dissimilarity] for the particular pairs of voting procedures using the correlation measures of similarity (15) and dissimilarity (16)
Table 18 Values of [degree of similarity, degree of dissimilarity] for the particular pairs of voting procedures using the Yule measures of similarity (18) and dissimilarity (19)
Table 19 Values of the [degree of similarity, degree of dissimilarity] for the particular pairs of voting procedures using the Russell–Rao measures of similarity (20) and dissimilarity (21)
Table 20 Values of [degree of similarity, degree of dissimilarity] for the particular pairs of voting procedures using the Sokal–Michener measures of similarity (22) and dissimilarity (23)
Table 21 Values of [degree of similarity, degree of dissimilarity] for the particular pairs of voting procedures using the Rogers–Tanimoto measures of similarity (24) and dissimilarity (25)
Table 22 Values of [degree of similarity, degree of dissimilarity] for the particular pairs of voting procedures using the Kulczyński measures of similarity (26) and dissimilarity (27)

The results concerning the similarity and dissimilarity of the voting procedures with respect to the 7 widely accepted criteria, obtained by using a set of popular and highly recommended similarity and dissimilarity measures for binary patterns and presented in Tables 15–22, provide much insight that can be very useful for social choice and voting theorists. They can also be of relevance for people involved in the more practical task of choosing, or even developing, a proper voting system for a particular situation. Such an analysis would, however, be too specific for the purpose of this paper, in which a new method is proposed.

To briefly summarize the results obtained, we can say that the quantitative analysis of similarity and dissimilarity via the measures employed in this section, i.e. (11)–(27), does confirm the very essence of results obtained by employing the more qualitative approach proposed in Sect. 3.

Namely, one can notice again that, not surprisingly, Copeland, Max-min, Dodgson and Nanson form a group of voting procedures of high mutual similarity and low dissimilarity. Quite closely related to this group are Runoff and Amendment. Incidentally, except for Runoff, all these procedures are Condorcet extensions, i.e. they result in the choice of the Condorcet winner whenever it exists. The so-called positional methods, that is, Plurality, Borda and Approval, seem to be rather far away from the rest of the procedures; this holds particularly for Approval. It can also be noticed that it does not matter much which particular similarity and dissimilarity measure is actually used: the values obtained can be different, but the order and proportions are maintained.

6 Concluding Remarks

We have presented a more comprehensive approach to a quantitative analysis of the similarity and dissimilarity of voting procedures. We assumed a set of well known voting procedures and criteria which they should satisfy, known from political science (cf. Nurmi's [12] book). More specifically, we have considered the amendment, Copeland, Dodgson, max-min, plurality, Borda, approval, runoff and Nanson voting procedures, and the Condorcet winner, Condorcet loser, majority winner, monotonicity, weak Pareto winner, consistency and heritage criteria. The satisfaction or dissatisfaction of the particular criteria by the particular voting procedures is represented as binary vectors. We first used rough sets to obtain a smaller number of voting procedures (9 instead of 13), following Fedrizzi, Kacprzyk and Nurmi [19], and then used the idea of Kacprzyk, Nurmi and Zadrożny [23], in which the use of some measures of similarity and dissimilarity for binary patterns was proposed and the Jaccard–Needham measures were used. In this paper we have extended that approach by additionally using the similarity and dissimilarity measures of Dice, correlation, Yule, Russell–Rao, Sokal–Michener, Rogers–Tanimoto, and Kulczyński.