1 Introduction

A large strand of literature in experimental social psychology and experimental economics has demonstrated that informal sanctions can solve social dilemmas.Footnote 1 The upshot of this literature is that, when people are able to punish others dependent on their contributions, they do so in a way that free riding is no longer profitable and even selfish subjects find it worthwhile to contribute. The public goods game is then no longer a social dilemma and the groups manage to avoid the tragedy of the commons (Hardin 1968). This is very remarkable, because the punishment of free riders is per construction also plagued by free rider incentives: in the best world I would have others educate the free riders to become contributors and enjoy their contributions without engaging in costly punishment myself.

Recent experimental evidence, however, puts the universality of these results into question. Gächter et al. (2005) began investigating public goods games with punishment in different cultures and found that the punishment option is hardly effective in enhancing cooperation in some of their subject pools. They presented preliminary evidence that the variation in cooperation across their subject pools is connected to the use of the punishment option. Interestingly, the differences are not in the way subjects treat free riders, but in the way they treat cooperative subjects. For punishment targeted to subjects with an equal or a higher contribution than the punisher Herrmann et al. (2008) coined the term ‘antisocial punishment’.Footnote 2 They investigate 16 culturally diverse subject pools and show that there is a clear-cut connection between the prevalence of antisocial punishment and the effectiveness of the punishment option in fostering cooperation. In subject pools where antisocial punishment is frequent subjects do not profit from the punishment option and earn lower profits than without the punishment option.

Given that antisocial punishment is a major obstacle for cooperation it is important to understand the motives behind antisocial punishment. Fehr and Gächter (2000, p. 990) devote a footnote to the causes of antisocial punishment. They mention random error, improvement of the relative position (status preferences), and revenge for anticipated or past punishment. Herrmann et al. (2008) provide a more extensive account for the causes of antisocial punishment, adding ‘do-gooder derogation’ (Monin 2007), and a desire to punish non-conformists to the list. In the data they find evidence for the revenge explanation but their experiment is not designed to differentiate between various motives behind antisocial punishment. To date there is no experimental study that systematically explores the causes of antisocial punishment.

This paper brings an additional, maybe surprising reason for antisocial punishment into the discussion. In the next section I use the model of Fehr and Schmidt (1999) to show that inequality aversion predicts antisocial punishment in many cases. Consider a situation where (i) cooperative players are faced with a free rider and (ii) not all cooperators are willing to punish the free rider. Inequality aversion predicts that those who punish do not only punish the free rider, but also the cooperative players who do not punish. This happens for purely material reason: it ensures that the punishing players do not fall behind the players who free ride on their punishment expenditures.

Suspecting inequality aversion as a motivation for punishment acts is quite natural. For punishment of free riders Fehr and Fischbacher (2004) and Fowler et al. (2005)Footnote 3 provide evidence that egalitarian motives drive punishment decisions. In Sect. 3, I use data from three experimental studies on public goods games with punishment to investigate whether the same is true for antisocial punishment. The answer is no—a majority of antisocial punishment acts occur in situations which do not meet the conditions for antisocial punishment as explained by inequality aversion.

2 Theory

On pages 836–843 Fehr and Schmidt (1999) derive equilibria in public goods games with and without punishment. They show that in both games cooperative equilibria are possible, provided sufficiently many players are sufficiently inequality averse. In the public goods game without punishment selfish players always contribute zero. In the game with punishment even selfish players contribute as long as there is a subgroup of inequality averse players ready to punish deviators. Fehr and Schmidt’s characterization of the off-equilibrium path in the game with punishment is, however, incomplete and thus fails to account for antisocial punishment. In the following I show that inequality averse players might not only punish deviant group members, but also players who contributed more than themselves.

2.1 The game

There are \(n\geqslant 3\) players with an endowment of \(y\) monetary units. The game consists of two stages. In the first stage players simultaneously contribute \(g_i\in [0,y]\) to a public good. After the contribution decision all players learn all contributions and their stage 1 earnings \(w_i\), calculated as

$$\begin{aligned} w_i(\varvec{g})= y - g_i + a \sum _{j=1}^{n} g_j, \end{aligned}$$
(1)

where \(\varvec{g}\) is the vector of all contributions and \(a\) is the marginal per capita return of the public good with \(\frac{1}{n}<a<1\). In the second stage players can punish each other bilaterally by assigning punishment points \(p_{ij} \geqslant 0\), which reduce the monetary payoff of \(j\) by one unit at a cost of \(c\) units to the punisher \(i\), and \(0<c<1\). Antisocial punishment occurs when a player \(i\) punishes another player \(j\) who chose a weakly higher contribution \((g_j \geqslant g_i)\). Punishment of a player \(j\) with a lower contribution \((g_j < g_i)\) will be labeled as free-rider punishment, irrespective of the absolute level of \(g_j\). The final monetary payoff for player \(i\) is

$$\begin{aligned} x_i(\varvec{g},\varvec{P})= w_i(\varvec{g}) - \sum _{j=1}^{n} p_{ji} - c \sum _{j=1}^{n} p_{ij}, \end{aligned}$$
(2)

with \(\varvec{P}\) being an \(n\times n\) matrix with imposed zeros on the main diagonal (masochistic punishment is usually not allowed in these experiments). Players have Fehr–Schmidt utility functions

$$\begin{aligned} u_i(\varvec{x})=x_i-\frac{\alpha _i}{n-1} \sum _{j\ne i}\max \left[ x_j-x_i,0\right] - \frac{\beta _i}{n-1} \sum _{j\ne i}\max \left[ x_i-x_j,0\right] , \end{aligned}$$
(3)

where every unit of payoff difference in bilateral payoff comparisons reduces utility by \({\alpha _i}/({n-1})\) in case the other player earns more and \({\beta _i}/({n-1})\) in case the other player earns less than player \(i\). Fehr and Schmidt put restrictions on the preference parameters: both kinds of inequality hurt or are neutral \((\alpha _i, \beta _i \geqslant 0)\), having less than others hurts weakly more than having more \((\alpha _i \geqslant \beta _i)\), and having more than others must not hurt so badly that an individual would want to burn his own money to restore equality \((\beta _i <1)\).

Fehr and Schmidt (1999, p. 841) show that, if sufficiently many players are inequality averse, subgame perfect equilibria with non-minimal contributions \(g_i=g \in (0,y]\; \forall \, i\) exist. In these equilibria also selfish players contribute \(g\), because they face a credible threat of punishment by the inequality averse players. In equilibrium no punishment occurs. To explain any kind of punishment, we need to assume that at least one player deviates from the equilibrium strategy and chooses \(g_i<g\). In the following we restrict our attention to the subgame where \(p_{ij}\) is chosen; I refer to this subgame as the ‘punishment subgame’.Footnote 4 I present three simple numerical examples to illustrate how players with Fehr–Schmidt preferences punish other players. A general solution of the punishment subgame is provided in the Appendix. The first two examples focus on a situation in which only one player is willing to punish other players.

2.2 The lone enforcer

Example 1

Consider the game with \(\{n,y,a,c\}=\left\{ 3,20,\frac{1}{2},\frac{1}{3}\right\} \). Assume the group consists of two players, 1 and 2, with selfish preferences, i.e. \(\{\alpha _i,\beta _i\}=\{0,0\},\,i=1,2\), while the third player \(E\) shall be inequality averse with \(\{\alpha _E,\beta _E\}=\{4,.6\}\), henceforth called the enforcer.

Consider a situation where \(E\) and player 1 contributed 20, while player 2 contributed 0.Footnote 5 The first part of Table 1 shows the utilities that result from this situation if no punishment is exerted. How should players punish each other? The two selfish players would certainly not punish because punishment is costly and yields no benefit to them. Player \(E\), however, might want to punish because she does not like the fact that player 2 earns more than she does. Her utility without punishment amounts to \(-20\), because she suffers from disadvantageous inequality towards player 2 of 20 monetary units. According to the Fehr–Schmidt utility function this difference is weighted by \({\alpha _E}/{2}\) and subtracted from her monetary income. What relief can \(E\) give herself by punishing others? According to Fehr and Schmidt’s Proposition 5, \(E\) should reduce player 2’s income by \(p_{E2}=30\) units (see intermediate columns in Table 1). This would equalize player 2’s and \(E\)’s monetary income and increase \(E\)’s utility to \(-10\).

Table 1 Antisocial punishment in a three player public goods game with punishment

However, this is not the equilibrium of the punishment subgame. In fact, \(E\) is not playing best response if she just punishes the deviator (player 2). The last three columns of Table 1 show the optimal punishment strategy. \(E\) maximizes her utility if she punishes both other players. She does so in a way to equalize all three monetary payoffs, which allows her to enjoy a utility of zero. Thus, the Fehr–Schmidt utility function does indeed predict antisocial punishment, i.e. punishment of players who are equally or more cooperative than the punisher.

Table 1 shows also that the punishment for the free rider becomes harsher once we take antisocial punishment into account. For the existence of an equilibrium with non-minimal contributions punishment has to be sufficiently strong to make deviations unprofitable. This is clearly the case in the example: had player 2 contributed 20 there would have been no punishment and all players would have earned 30. With the deviation player 2 earns 10 (according to Fehr–Schmidt) or 0 for optimal punishment. Thus, if the parameter constellation derived in Fehr–Schmidt Prop. 5 allows for a cooperative equilibrium it will also do so when we take the possibility of antisocial punishment into account.Footnote 6

Example 2

In a next step let us consider a richer example. Table 2 depicts the situation of a public goods game with \(\{n,y,a,c\}= \left\{ 4,20,\frac{2}{5},\frac{1}{3} \right\} \). To make things interesting I assume that the enforcer chose a contribution of 9, and the other players in the group chose 0, 11 and 20. This produces a set of stage 1 earnings as depicted in the third column of Table 2. Let us look at two types of enforcers, a ‘weakly inequality averse’ player with \(\{\alpha _E,\beta _E\}=\{1,.6\}\), and a ‘strongly inequality averse’ player with \(\{\alpha _E,\beta _E\}=\{4,.6\}\). These two types will have different optimal punishment patterns. The three columns in the middle of Table 2 show the optimal punishment strategy of the weakly inequality averse player. Such a player maximizes his utility by punishing the richest among the other players (the free rider). However, punishment is only utility maximizing as long as \(E\)’s income does not drop below another player’s income. Consequently, as soon as player \(E\)’s income is as low as the next lower income (in this case player 2’s income), he does not exert more punishment, despite the fact that player 1 still earns more than he does.

A strongly inequality averse player does not stop here. Her much higher \(\alpha _E\) means that the income difference towards player 1 still weighs heavily on her well-being. A strongly inequality averse player prefers to further decrease player 1’s income. However, in order to avoid falling behind player 2, she punishes both other players with weakly higher incomes. Player \(E\)’s utility is maximized when the incomes of player 1 and 2 are equal to her own income.

Table 2 Optimal punishment for a weakly or strongly inequality averse enforcer in a four player public goods game with punishment

What does this mean for antisocial punishment? The weakly inequality averse player in our example does not engage in antisocial punishment, whereas the strongly inequality averse player does mete out punishment to a player who was more cooperative than herself. This example demonstrates the necessary conditions for antisocial punishment: (i) player \(E\) must be sufficiently inequality averse to be willing to punish more than one other player, (ii) there must be at least one player with a strictly higher income than \(E\) and (iii) in the process of punishing the strictly richer player, \(E\)’s income must undershoot one of the other (weakly poorer) player’s incomes.

2.3 Teaming up

Example 3

What if more than one player is inequality averse enough to punish other players? Let there be a subgroup of \(n^{\prime }=2\) homogeneous enforcersFootnote 7 and \(\{n,y,a,c\}= \left\{ 4,20,\frac{2}{5},\frac{1}{3} \right\} \). Table 3 shows the situation of two enforcers with \(\{\alpha _E,\beta _E\}=\{4,.6\}\) facing one cooperative player and one free rider. According to Proposition 5 in Fehr and Schmidt the two enforcers should punish player 2 (the free rider) such that this player’s income is equal to their income. However, similar to the case of the lone enforcer, the two enforcers can improve their situation if they also punish player 1. In the example they maximize their utility by equalizing the monetary payoff of all four players. However, unlike in the case of the lone enforcer, only punishing the deviant player is also an equilibrium of the punishment subgame. If \(E1\) punishes only player 2 by 12 units, then \(E2\)’s best response is to punish only player 2 by 12 units as well and vice versa. However, the equilibrium including antisocial punishment towards the fully contributing non-enforcer is clearly more efficient for the two enforcers.

Table 3 Optimal punishment in a four player public goods game with punishment with two enforcers

2.4 Conditions for antisocial punishment

The three examples show that antisocial punishment motivated by Fehr–Schmidt inequality aversion happens only in order to prevent falling behind other players who do not punish.Footnote 8 The Appendix provides a general solution to the punishment subgame. A decisive prerequisite is the existence of other players who do not engage in punishment. Thus, the decision whether to engage in antisocial punishment or not depends on players’ beliefs about other players’ punishment behavior. This dependency on beliefs makes it difficult to identify sufficient conditions for antisocial punishment and an empirical investigation would require information about subjects’ beliefs about all other punishments. However, for an empirical test of the predictions it is easy to derive two necessary conditions for the occurrence of antisocial punishment in public goods games with punishment, which hold for any belief about other player’s \(\alpha \) and \(\beta \). If antisocial punishment is motivated by Fehr–Schmidt like inequality aversion, then it must hold that:

  1. C1:

    A maximum earner never punishes, i.e. antisocial punishment can only occur if the punisher’s stage 1 income is below the maximum stage 1 income in the group.

  2. C2:

    The only reason for an enforcer to punish a weakly poorer player is to avoid falling behind this player while punishing free riders. Consequently, antisocial punishment can only occur in combination with punishment of a richer player.Footnote 9

Proofs are provided in the Appendix. In the next step I confront these two conditions with data on antisocial punishment.

3 Empirics

In this Section I use the data of three studies on public goods games with punishment. First I look at the study by Fehr and Gächter (2002) who report data from 120 subjects playing six rounds of the public goods game with punishment with \(\{n,y,a,c\}=\Big \{4,20,\frac{2}{5},\frac{1}{3}\Big \}\).Footnote 10 They use a perfect stranger matching protocol, which ensures that, during the six rounds, a subject will not meet another subject in the session more than once. Thus, from a game-theoretic perspective we observe the subjects playing six consecutive one-shot games.

Before addressing the necessary conditions for antisocial punishment I investigate the frequency of the prototypical situations for observing antisocial punishment. Recall that the situations typically require the presence of a free rider and non-punishing contributors. The overall frequency of free-rider punishment (strictly positive amounts of punishment targeted to subjects with a lower contribution) in the sample is 46.9 %, i.e. in slightly more than half of the cases subjects do not punish others with a lower contribution.Footnote 11

In a next step I investigate whether observed acts of antisocial punishment are in line with the predictions of the Fehr–Schmidt model. The first result addresses C1.

Result 1

: Maximum earners punish

Support According to C1 all antisocial punishment must come from subjects who earn less than the maximum stage 1 income in a group. In Fehr and Gächter’s data top earners punish weakly poorer subjects by 1.60 punishment points on average.Footnote 12 Antisocial punishment meted out by top earners is obviously not zero, but is that number small or large? Subjects who are not among the richest in their group mete out .43 punishment points to weakly poorer subjects.Footnote 13 Thus, not only do the richest subjects in the experiments by Fehr and Gächter (2002) punish other weakly poorer players—they do so even stronger than subjects who are not among the richest in the group. This difference is significant by a two-sample Wilcoxon rank-sum test (\(z = -2.61,\,p = 0.008\), two-sided exact \(p\)-value, test based on independent session averages).

Instead of just focusing on whether a subject is richest or not it is also possible to consider a more fine grained measure for a subject’s position in the income hierarchy before punishment. If we assign rank one to the richest subject(s) and subsequent ranks for poorer subjects (standard competition ranking) then antisocial punishment is decreasing in ranks starting with 1.60 for the richest to .48 (.32) for the second (third) richest. The poorest cannot punish antisocially by definition. Interestingly, free-rider punishment is also decreasing in ranks. The second (third, fourth) richest mete out 4.48 (3.34, 2.25) units of free-rider punishment. Thus strongest punishment of free riders does not stem from the highest contributors but from the intermediate contributors. This fits very nicely to the concept of inequality aversion because it suggests that intermediate contributors are willing to pay a larger part of the punishment costs than the poorer high contributors in their group. Thus, while free-rider punishment seems to be accessible to an inequality aversion explanation, antisocial punishment is not.

Result 2

: Subjects frequently mete out only antisocial punishment

Support According to C2 every act of antisocial punishment should be accompanied by punishment of a subject with a lower contribution than the punisher. The upper part of Table 4 classifies all 720 individual punishment vectors. In about 43 % of the cases no subject gets punished. In about 42 % of the cases only free riders (i.e. subjects with a lower contribution than the punisher) receive punishment. About 15 % (105) of the cases involve antisocial punishment. In 75 % of these cases antisocial punishment is not accompanied by free-rider punishment. Thus, contrary to the theoretical prediction, antisocial punishment is most frequently meted out without accompanying free-rider punishment.

Table 4 No. of cases with no, antisocial, free rider, or both kinds of punishment, with row and column totals

The evidence presented clearly indicates that the two necessary conditions for antisocial punishment as explained by the Fehr–Schmidt model are very frequently not met for actual acts of antisocial punishment. The analysis could, however, be influenced by the fact that subjects in the experiments by Fehr and Gächter (2002) play six consecutive games. Despite the use of the perfect stranger matching we cannot be sure that the game reflects a true one-shot game. The fact that the authors observe a time trend in average contributions even suggests that the six rounds are not identical. A recent study by Gächter and Herrmann (2009) provides data from true one-shot games with \(\{n,y,a,c\}= \left\{ 3,20,\frac{1}{2},\frac{1}{3} \right\} \). In a next step I apply the same analysis to this dataset. The data contain observations from 265 subjects in four locations in Russia and Switzerland.Footnote 14 For the moment I ignore the fact that motivations for antisocial punishment might differ across subject pools. This point will be addressed below.

The results from the first and second dataset are qualitatively very similar. In the data by Gächter and Herrmann (2009) average antisocial punishment meted out by top earners is 2.24 compared to .95 by all other subjects. The difference is significant (\(z = -3.04,\,p = 0.002\), test based on individual observations). The lower part of Table 4 also shows that in this dataset the majority of antisocial punishment acts (72 %) is not accompanied by free-rider punishment.

Cheung (2012) provides further evidence on one-shot games using the same parameters as Gächter and Herrmann, but with the punishment decision elicited by the strategy method. Subjects choose punishments for all possible combinations of other subjects’ contributions. This allows for a much more detailed analysis of punishment behavior.Footnote 15 The paper does not provide exact numbers on C1 but from the figures it becomes clear that there is a considerable amount of punishment meted out by the maximum earners. For C2 Cheung (2012, p. 24) provides a detailed analysis showing that 21 out of 280 cases potentially fit to inequality aversion. However, the majority of the subjects involved does also punish antisocially when being among the maximum earners, which is clearly incompatible with the prediction.

Herrmann et al. (2008) report data from 16 subject pools from various cultural backgrounds. They observe punishment decisions from experiments with ten times repeated public goods game with punishment and \(\{n,y,a,c\}= \left\{ 4,20,\frac{2}{5},\frac{1}{3} \right\} \). Their data contain observations from 1,120 subjects in 16 cities around the globe. The 16 subject pools differ markedly in the frequency and strength of antisocial punishment. Thus, even if inequality aversion fails to explain antisocial punishment in the three datasets discussed so far it is possible that this explanation works well in some of the subject pools observed by Herrmann et al. (2008).Footnote 16

Table 5 shows the two measures discussed above for each of the 16 subject pools separately. The first column shows the average punishment meted out by subjects who are maximum earners in the specific period. This is the number that should be zero according to the prediction. Obviously there is a huge variation in the strength of antisocial punishment. Average punishment differs by more than a factor ten when comparing the extreme cases. The second column shows the average antisocial punishment by subjects who are not among the richest in their group. In 14 out of the 16 subject pools subjects choose more antisocial punishment when they are top earners compared to when there are others in the group with higher stage 1 incomes.

Table 5 Patterns of antisocial punishment across subject pools

The third column in Table 5 shows the fraction of antisocial punishment acts not accompanied by free-rider punishment. Everywhere, in at least 50 % of the cases with antisocial punishment subjects mete out only antisocial punishment. To conclude, despite the huge differences in the level of antisocial punishment across subject pools the pattern of antisocial punishment acts does not differ substantially. In none of the subject pools antisocial punishment seems to be motivated by inequality aversion.

4 Conclusion

To test whether observed acts of antisocial punishment can be explained by inequality aversion as formalized by Fehr and Schmidt (1999) I derived two conditions: (i) the punishing subject must not be among the richest in the group and (ii) antisocial punishment requires accompanying acts of free-rider punishment. Empirical evidence on antisocial punishment shows that a majority of the punishment acts occur in situations in which these two criteria are not met. Thus, inequality aversion is not a key determinant of antisocial punishment. This paper contributes to the literature on motives behind the punishment acts. Fowler et al. (2005) and Fehr and Fischbacher (2004) argue that egalitarian motives drive punishment decisions. This might be the case for free-rider punishment but most likely not for antisocial punishment. The results presented here are in line with Falk et al. (2005), who compare the frequently used one-to-three punishment technology to a one-to-one punishment technology \((c=1)\). In the latter regime no Fehr–Schmidt player would use punishment. However, in their experiments with Prisoners’ Dilemma games cooperators punished defectors also in the treatments with \(c=1\), despite the fact that payoff differences could not be reduced.Footnote 17 Defectors’ punishment of cooperators was observed if \(c=\frac{1}{3}\) but vanished in the treatments with the one-to-one punishment, indicating that relative payoff concerns may be an important determinant of antisocial punishment.Footnote 18 However, the relative payoff maximization hypothesis has a serious flaw when applied to public goods experiments with four players and one-to-three punishment mechanism (like used e.g. in Fehr and Gächter 2002 or Herrmann et al. 2008): Given these parameters no player can improve her relative position towards the others by punishment. Due to \(c=\frac{1}{3}\) every unit invested in the punishment of another player \(j\) increases the relative position towards \(j\) by two units (\(j\) loses three, punisher loses one). However, at the same time the relative position towards the other two players decreases by one unit. Thus, if a relative payoff maximizer weighs all bilateral comparison equally, then punishment does not improve the relative position at all.Footnote 19

Understanding the determinants of antisocial punishment is crucial, given that the efficiency of sanctioning mechanisms strongly depends on the frequency and strength of antisocial punishment, leading to strikingly different macro results across different cultures (Gächter et al. 2010). The contribution of this paper is to show that interdependent preferences as formalized in the Fehr–Schmidt model provides a rationale for antisocial punishment. Interestingly, antisocial punishment as rationalized by Fehr–Schmidt preferences is kind of a higher order punishment, targeted to players who are unwilling to bear their share of the punishment costs necessary to discipline the free riders (see e.g. Denant-Boemont et al. 2007). Seen from this angle, antisocial punishment might not be that ‘perverse’ after all.

The data, however, suggest that other forces must be at work. A likely candidate is revenge. Subjects might engage in antisocial punishment to take revenge for punishment received in past periods or for anticipated punishment in the current period. Herrmann et al. (2008) provide evidence for this, showing that antisocial punishment is stronger when a subject was punished in the previous period. A second likely candidate is that some subjects might simply enjoy to destroy others’ property. There is experimental evidence that, in some settings, subjects reduce other subjects’ incomes, even if this comes at a cost to themselves (see e.g. Abbink and Herrmann 2011; Abbink and Sadrieh 2009; Zizzo 2003; Zizzo and Oswald 2001). For public goods games with punishment under uncertainty Grechenig et al. (2010) show that, contrary to the expectation, being in doubt about other subjects’ actions does not discourage from using the punishment option. Thus, a pure ‘appetite for destruction’ might account for some of the antisocial punishment acts. This would be compatible with the observation that antisocial punishment is, to a large extent, meted out by subjects with a high payoff. These are the subjects who can afford to spend money on their pleasure to reduce the income of others without falling behind others. An experimental analysis that cleanly separates these causes is yet to be conducted.