1 Introduction

Relative performance incentives are a common feature of the workplace. An interesting property of relative pay is that a worker’s performance also affects his or her co-workers’ compensation; in particular, it imposes a negative externality. An increase in one’s own performance will not only increase one’s own compensation, but inevitably also decrease a co-worker’s expected pay. How this externality affects the incentives of a worker will crucially depend on whether a worker incorporates this reduction in her own effort decision. However, it will also depend on other features of the workplace environment, including how easily communication can foster coordination, the likelihood of someone leading such coordination, and the expectation of future interactions.

In this paper, we consider how these forces might blunt the effectiveness of relative performance incentives. In particular, we use a controlled laboratory environment to examine two primary channels through which agents may reduce effort under relative incentives. The first one is “other-regarding” concerns: some agents may incorporate other agents’ payoffs into their own effort choice. Thus, other-regarding agents should respond differently to relative incentives compared to “selfish” agents. Even though the fact that individuals have heterogeneous degrees of other-regardingness (e.g., see Andreoni and Miller 2002; Fisman et al. 2007) is well-documented in the literature, we know little about the effect of other-regarding concerns on the effectiveness of relative performance pay.

The second channel is indefinitely repeated interaction. Workplace interaction usually takes place for an indefinite period of time, so the “shadow of the future” may also affect agent’s behavior (e.g., see Dal Bó 2005). We consider this channel because the social preference composition of the group affects the severity of future punishment, hence the extent to which coordination can be sustained.

Important to exploring the channel of indefinitely repeated incentives is accounting for the role of communication and leadership. Coordinating on low efforts seems likely to be driven by the ease with which communication can happen (e.g., see Cooper et al. 1992). In addition, the potential for coordination may stimulate leadership emergence (e.g., see, Hermalin 2012; Kreps 1986). Although leaders make mutually beneficial outcomes focal in simple coordination games, we know little about their effect on agents’ behavior in indefinitely repeated interactions within a relative performance incentive structure. Leaders in our setting are important since they can direct individuals towards low effort outcomes and their emergence may well be linked to social preferences.

Our experimental results indicate that overall groups with more other-regarding players tend to depress total efforts. When communication is not possible, outcomes in which all group members choose minimal efforts rarely occur and the average effort level is close to the stage game equilibrium prediction. Nonetheless, other-regarding group members depress their effort by around 15% relative to selfish group members. Thus, our results are consistent with other-regarding individuals internalizing the externality they impose without engaging in long-term strategic behavior.

With communication, a coordinating leader may emerge. In our particular setting, we label “leader” as any individual who suggests that the group should coordinate on minimal effort—which is the Pareto optimal outcome from the agents’ viewpoint. Controlling for the emergence of this sort of leadership, we find that other-regarding subjects depress their effort relative to selfish ones by about 50% before a leader emerges. We also find that selfish individuals are 2.7 times more likely than other-regarding individuals to successfully lead their groups to the minimal effort outcome.

This implies that the effect of social preferences on work performance under relative incentives is complex. On the one hand, other-regarding workers have a tendency to depress effort, apparently through the internalizing of their efforts’ negative externality. On the other hand, with the availability of communication, selfish workers seem more likely to help direct the group to the lowest of efforts.

We see the contributions of this paper as threefold. First, we document for the first time how individual social preferences affect behavior when facing relative performance incentives in indefinitely repeated settings. Second, we explore how the composition of a group in terms of individual social preferences affects outcomes. Third, we identify how communication and endogenous leadership mediate these effects as well as how social preferences relate to the emergence of coordinating leaders.

2 Literature

The significant body of literature that documents different degrees of social preferences (for example Andreoni and Miller 2002; Fisman et al. 2007; DellaVigna 2009) has led researchers to investigate their effects on public good contributions and other pro-social behaviors (e.g. Loch and Wu 2008; Dreber et al. 2014; Bowles and Polania-Reyes 2012; Kőszegi 2014). Moreover, Fehr and Fischbacher (2002) point out that when scholars disregard social preferences, they fail to understand the determinants and consequences of incentives. In our paper, we explore the effects of social preferences on productivity in the setting of relative performance incentives (e.g. see Kidd et al. 2013; Erkal et al. 2011; Rey-Biel et al. 2012; Riyanto and Zhang 2013). Similar to Gächter and Thöni (2005) and Fischbacher and Gächter (2010) we use one game (a dictator game as in Andreoni and Miller 2002) to predict other-regarding concerns and relate those predictions to behavior in the relative performance game. Although our relative performance game is similar to the dilemmas used in those papers (i.e., players are better off if they “cooperate” in low efforts), an important difference is that the interactions in our game are indefinitely repeated—which is a common feature of many important settings, such as the workplace. For indefinitely repeated settings it is not clear a priori whether other-regarding concerns will depress efforts due to internalizing the negative externality imposed on others or will instead increase efforts due to more lenient punishment in the case of a deviation, which makes sustaining a collusive outcome harder. Consequently, the effects of social preferences seen in indefinitely repeated games could be quite different from those captured through the other types of settings commonly found in the extant literature.

The importance of group composition in a dimension other than the degree of other-regardingness has been previously explored. Casas-Arce and Martínez-Jerez (2009) for example, find that relative performance incentives (tournaments in their setting) are less effective than piece rates when participants have heterogeneous abilities. A similar result is found by Backes-Gellner and Pull (2013) in a sales contest within a German insurance firm. To our knowledge, the effect of group composition in terms of other-regardingness on efforts has not been explored, and yet there have been studies that show that individual other-regardingness is important. For example, Bandiera et al. (2005) allude to the role of social preferences in indefinitely repeated (or at least long-term) interactions. Although the core of Bandiera et al. (2005) is to compare workers’ productivity under piece rate and relative incentives, they also document two results that are related to this paper. First, Bandiera et al. (2005) compare fruit pickers with the aforementioned incentive schemes in two different settings: one that allows peer monitoring and another one that does not. They find that relative compensation leads to lower productivity only when monitoring is allowed. They conclude that monitoring, not social preferences, drives down effort in their setting. The authors keep their monitoring technology and relative incentives constant throughout their study; they also do not exogenously vary their subjects’ exposure to altruism. Second, Bandiera et al. (2005) find that workers with social ties depress effort. Social ties could capture social preferences; but they could also capture the salience of punishment should one “defect” from low efforts. As a result, although this study clearly showed that social ties can reduce efforts, it is unclear whether social preferences can do the same. Our paper complements this work by directly measuring participants’ social preferences (à la Andreoni and Miller 2002) and randomly forming groups whose members have varying degrees of social preferences to identify the link between social preferences and behavior, both as a function of individual preferences and group composition.

Indefinitely repeated settings have been another important area of research: Pareto improvements over the one-shot Nash equilibrium can be obtained as equilibrium outcomes if the value of the future is high enough.Footnote 1 However, the fact that cooperation (or “collusion” in the context of competition) can be an equilibrium outcome does not guarantee that subjects will cooperate (Dal Bó and Fréchette 2011, 2014).Footnote 2 In fact, it has been documented that the majority of the time individuals do not achieve the Pareto-optimal outcomes (e.g., Palfrey and Rosenthal 1994 find cooperation rates from 29 to 40% in public goods games, and Dal Bó (2005) found cooperation rates of 38% in indefinitely repeated prisoner’s dilemmas). Further, there has been a great variety of outcomes in this literature, some of which deviate from standard economic models (see Fudenberg et al. 2012). Our paper complements this work by documenting the role of individual and group social preferences on outcomes in indefinitely repeated games.

Although theoretically cheap talk communication does not rule out equilibria, empirically it has been found to facilitate coordination in indefinitely repeated games (Fonseca and Normann 2012; Embrey et al. 2013). One channel through which communication helps equilibrium selection in games of coordination is through a leader, as argued by Kreps (1986) and Hermalin (2012). The theoretical economics literature on leadership has focused on how pre-imposed self-regarding leaders coordinate (e.g. Bolton et al. 2013), motivate (e.g. Rotemberg and Saloner 1993, 2000), and signal information through their actions (e.g. Hermalin 1998). The role of leaders in these studies is to overcome individuals’ incentives to act against the interest of the group. Meanwhile, the experimental literature has focused on whether leaders foster cooperation in social dilemmas, mostly from Hermalin (1998) leading-by-example perspective. These studies have found that leaders indeed spur cooperation, often through reciprocity from followers.Footnote 3 To our knowledge one study, Koukoumelis et al. (2012), explores leadership through communication in a social dilemma. In their study, the authors exogenously assign the role of “communicator” to one group member in a finitely repeated voluntary contribution game. They find that this one-way “free-form” communication has a large positive effect on contributions. A growing experimental literature studies leaders without pre-imposed salience or authority in finitely repeated interactions (see e.g. Bruttel and Fischbacher 2010; Gächter et al. 2012; Kocher et al. 2013; Arbak and Villeval 2013). Also focusing on social dilemmas, this literature has documented that emergent leaders are motivated by efficiency concerns, social image or generosity, and generally contribute more than non-leaders. Our work complements this literature in that we explore the endogenous emergence of leaders in indefinitely repeated settings, and how this phenomenon relates to social preferences. In addition, whereas we primarily study leadership through communication, most of the other papers study leadership influence through actions and authority.

Finally, our work also contributes to the literature on communication in games with multiple equilibria (e.g. Cooper et al. 1992; Ledyard 1994; Seelya et al. 2007; Cooper and Kühn 2014); while the extant literature is concerned about the effect of communication on the frequency of Pareto-optimal outcomes, we instead explore how a group’s social preference composition leads to patterns of communication (e.g., leadership emergence) that result in players coordinating on their Pareto-optimal outcome.

3 Experimental design

In total, we conducted 7 experimental sessions with 147 subjects. Participants were students from UC Berkeley, enrolled in the X-lab subject pool. Sessions lasted approximately 60 min from reading instructions to subject payment, which averaged approximately $16 per subject. Participants were not allowed to take part in more than one session. The treatments were programmed and conducted using z-Tree developed by Fischbacher (2007).

We had the dual purpose of identifying subjects’ social preferences and measuring their choices when facing a relative performance incentive scheme. In order to achieve this, the experiment was divided into three stages. In the beginning of the first stage, we randomly matched subjects into anonymous groups of three individuals and they remained in the same group for the remainder of this stage. Participants were then given 100 tokens for each of 9 periods and played a dictator game with their group members (including themselves). In each period, participants faced different “prices” or token exchange rates of giving to each group member. Prices varied such that we could both identify individuals’ willingness to give to others and individuals’ willingness to give between others when facing different prices of giving.Footnote 4 We use these 9 periods to classify our subjects in terms of social preferences. In periods 10 and 11 we conducted allocation decisions with upwards-sloping budget sets as in Andreoni and Miller (2002) where subjects are given an allocation and decide on the overall exchange rate. In contrast to the previous dictator menus, here there is no possibility to distribute value between oneself and the other group members. The only choice a subject has is on the overall value of the endowment, not on how it is split up. We will use these decisions to test whether aversion to disadvantageous inequality matters in addition to other-regardingness in responding to relative incentives. These results are reported in the online appendix for this paper. Finally, since we follow the categorization of Andreoni and Miller (2002), we are thus exploring unconditional rather than conditional social preferences.

Subjects did not learn their other group members’ choices to avoid uncontrolled learning. Participants were told that for 5 out of a total of 11 allocation decisions one of the group members’ choices would be randomly selected to compute payoffs.

We use this first stage, in particular decisions in rounds 1–9, to classify participants as “Selfish” or “Other-Regarding”, consistent with our intended meaning used in Sect. 4. An archetypal Selfish type is only interested in his own monetary payoff and thus should never allocate any tokens to his or her group members. Thus we classify as Selfish all subjects that throughout rounds 1–9 do not allocate any tokens to another group member. The remainder of subjects are classified as Other-Regarding. We consider various other possible classifications in the analysis found in our online appendix; however, they provide little additional insight to this simple classification.

For the second stage, participants were again randomly matched with two other players for the remainder of the experiment.Footnote 5 The purpose of this stage was to give players the possibility to collude by jointly providing low levels of effort.Footnote 6 Thus, we implemented an indefinitely repeated game with continuation probability of \(\delta =95\%\). In order to gain consistency across treatments, we randomly drew the number of periods before running the sessions as in Fudenberg et al. (2012). In particular, our random draw resulted in 29 periods of relative-performance-pay play, which was then fixed for all subjects, in all treatments.

A subject′s per period payoff during this stage was calculated as follows:

$$ \pi _{i}=12+\frac{x_{i}}{\overline{x}}15-x_{i} $$

where \(\overline{x}=\frac{\Sigma x_{j}}{3}\) is the average effort across \(i^{\prime }s\) group and i chooses effort \(x_{i}\in [1,12]\).Footnote 7 Hence, each participant’s effort is discounted by the average effort, so a higher average effort will reduce payoffs, ceteris paribus. This is the relative performance evaluation similar to the contract used by Bandiera et al. (2005).Footnote 8 Note these figures are in Berkeley Bucks $, converted at $66.6 Berkeley Bucks to 1 US$, which is how it was presented to subjects.Footnote 9 Each participant received an endowment of $12 (Berkeley Bucks $) each period from which they could choose costly effort. Effort costs $1 for each unit of effort. Subjects were paid the sum of their earnings over all periods for this stage.Footnote 10

The one-shot Nash equilibrium for homogeneous and Selfish players is to play \(x_{i}=10\) for all i which is below 12 (the upper bound of the action space). Coordinating on \(x_{i}=1\) under grim-trigger strategies is sustained by a continuation probability \(\overline{\delta }>60\%\) (optimal one-shot deviation from the Pareto-dominant outcome is to play \(x_{i}\simeq 7.5\)). Therefore, our \(\delta =95\%\) should guarantee the feasibility of coordinating on low efforts for utility maximizing rational Selfish agents.

For the final stage, subjects completed a risk aversion test as in Holt and Laury (2002), and a basic demographic questionnaire.

We also varied factors considered important for creating and sustaining low levels of effort. In particular, in the first treatment (“Chat”) we allowed chat via computer terminals during each period and observability of choices and payoffs after every period. We recorded the chat messages in order to identify coordination leaders and their social preferences. In the second treatment (“No Chat”) we did not allow for chat but continued with observability after each period.

If we were able to mechanically switch on and off subject’s social preferences, we could directly identify the effect of social preferences on effort. Unfortunately, this is not generally possible. However, we conducted a final treatment where we approximate this idea. Instead of facing human subjects, a subject played against their computer, which simulated the play of past subjects’ decisions (“Robot” treatment). This treatment attempted to “switch off” social preferences by making it clear to subjects that even though they faced the same consequences for their choices as if playing human subjects, their effort decisions no longer affected any person’s payoffs. We report the results of this treatment in the online appendix.

4 Hypotheses

Before turning to our results, we develop several hypotheses to guide our ensuing analysis. To ease exposition, we use the label Selfish to mean those individuals that only value their own payoff. In addition, we use the label Other-Regarding to denote those individuals that value both their own payoff and some fraction of their partners’ payoffs.Footnote 11

Let us start by studying the stage game. Recall payoffs for individual i are \(\pi _{i}=\frac{x_{i}}{\bar{x}}\times 15-x_{i}+12\) in each round. The utility of subject i is a combination of her payoffs and the payoffs of the other subjects in her group:

$$ u_{i}=\rho _{s}\pi _{i}+\rho _{o}\sum _{k\ne i}\pi _{k}, $$

where \(\rho _{s}\) is the weight placed on her own payoff and \(\rho _{o}\) is the weight placed on the payoffs of others. We assume the following:

  1. 1.

    \(\rho _{s},\rho _{o}\in [0,1]\), and \(\rho _{s}+2\rho _{o}=1\).Footnote 12 Note \(\rho _{s}=1\) means subject i is Selfish and \(u_{i}=\pi _{i}\); \(\rho _{o}>0\) means subject i is Other-Regarding.

  2. 2.

    \(\rho _{s}>2\rho _{o}\). (To focus on unique interior solutions in the analysis of the stage game.) Note that assumptions 1 and 2 imply \(1/2<(\rho _{s}-\rho _{o})/\rho _{s}\le 1\). This means that Other-Regarding subjects care more about their own payoff than the payoffs of the two other subjects combined. To ease notation, let us define \(\Delta \equiv (\rho _{s}-\rho _{o})/\rho _{s}\).Footnote 13

  3. 3.

    Other-Regarding subjects have identical preferences. That is, \(\rho _{o}\) is the same across Other-Regarding subjects.

  4. 4.

    \(\rho _{s}\) and \(\rho _{o}\) are common knowledge. We assume complete information and perfect monitoring. This simplification allows us to build our hypotheses drawing upon the theory of infinitely repeated games of complete information (e.g., Abreu 1988).Footnote 14 In particular, we center our analysis on the fact that coordination can only be supported if players know deviations will be punished by switching from a high-payoff to a low-payoff continuation equilibrium.

Table 1 shows the stage-game Nash effort choices (the details of the calculations are in the online appendix). The first row shows the effort choices of Other-Regarding subjects for each group composition, the second row shows the effort choices of Selfish subjects for each group composition, and the third row shows the average effort across subjects for each group composition. Simple algebra confirms that Other-Regarding subjects’ effort choices are decreasing in the number of Selfish individuals in the group. Let us call \(x_{0s}^{o}\) the highest of these efforts—the effort choice of an Other-Regarding subject in a group with no Selfish individuals. It is also direct to check that Selfish subjects choose lowest effort when all the group members are Selfish. Let us call \(x_{3s}^{s}\) this effort level. Note that \(x_{3s}^{s}>x_{0s}^{o}\). As a result, Other-Regarding subjects choose lower efforts than Selfish subjects across groups when in each group the stage-game Nash efforts are played. This logic leads to our first hypothesis which compares individual behavior across social preferences.

Table 1 Nash equilibrium effort choices in the stage game by Selfish and Other-Regarding subjects across group’s social preference composition

Hypothesis 1

Given that the stage game is played, Selfish subjects exert more effort than Other-Regarding subjects.

Group composition also affects average effort choices. From the third row of Table 1 it follows that average efforts in a group are increasing in the number of Selfish subjects in the group. As a result, we hypothesize to observe the following:

Hypothesis 2

Given that the stage game is played, average group effort choices are increasing in the number of Selfish group members.

In indefinitely repeated games, achieving Pareto-dominant outcomes is a well-known theoretical possibility—provided fixed partners and not too much discounting. It proves difficult, however, to sustain the Pareto-dominant outcome experimentally absent communication (see, e.g., Fonseca and Normann 2012). Evidence suggests that subjects usually either play always the stage-game Nash efforts or revert to stage-game Nash efforts after a few attempts to coordinate (Dal Bó and Fréchette 2014). Thus we expect Hypotheses 1 and 2 to apply when subject cannot communicate, i.e. in the No Chat treatment. Communication, however, is an important aspect in the real world and may serve as a mechanism to facilitate coordination on minimal (cooperative) efforts. In fact, Cooper and Kühn (2014) find that communication increases coordination in a related experiment. Therefore, we should expect that, with communication, chosen efforts are on average lower than chosen efforts when communication is not possible.

Hypothesis 3

Absent communication, average group effort choices are consistent with the stage game equilibrium. When communication is possible, group effort choices are reduced as subjects attempt to cooperate.

The effect of social preferences on the likelihood of cooperation can take different forms. First, we study how social preferences of group members affect the minimum continuation probability necessary to sustain the cooperative outcome. The minimum continuation probability can be used as a measure of the likelihood of cooperation as a lower number implies more “cases” where cooperation is sustainable in equilibrium. Second we discuss how social preferences may affect communication itself.

To study the possibility of cooperation, let us consider grim-trigger strategies in which each subject chooses the minimal effort in round 1 and continues cooperating until there is a defection, then each subject chooses the stage-game Nash effort forever.Footnote 15 We focus on this strategy profile not because we believe that subjects necessarily behave as this strategy prescribes, but rather to use it as a benchmark to state the next hypotheses of this paper.Footnote 16 A subject’s decision to coordinate using a grim-trigger strategy depends on his own social preferences and the number of Selfish subjects in the group. Let us denote \(C_{Ns}^{k}, D_{Ns}^{k}\), and \(P_{Ns}^{k}\) the per-period utilities from coordination on minimal efforts, best unilateral deviation and (stage-game Nash) punishment, respectively, for a subject k = {Other-Regarding, Selfish} in a group with Ns Selfish subjects. Note that \(C_{Ns}^{k}\) does not depend on k or Ns, hence \(C_{Ns}^{k}=C.\) Similarly, \(D_{Ns}^{k}\) does not depend on the number of Selfish individuals, only on the social preferences of the subject, hence \(D_{Ns}^{k}=D^{k}\) (the analytical expressions for \(D^{k},C\) and \(P_{Ns}^{k}\) are given in the online appendix). Provided a continuation probability of \(\delta \) this is an equilibrium in a group with Ns Selfish subjects if and only if

$$\delta \ge \delta ^{\min }=\max _{k}\left\{ \frac{D^{k}-C}{D^{k}-P_{Ns}^{k}}\right\} $$
(1)

Inequality (1) makes explicit the core idea of this paper. For a fixed k, the harsher the punishment (i.e. the lower the per-period utility from the punishment phase, \(P_{Ns}^{k}\)) the smaller is the right-hand side of (1), which means that each k subject requires a lower continuation probability to cooperate. Given our previous analysis, one would be tempted to think that the right-hand side of (1) is decreasing in the number of Selfish subjects as they choose higher efforts in the punishment phase. As it turns out, this relationship is non-monotonic. Figure 1 plots the minimum continuation probability needed to sustain cooperation for a given value \(\rho _{s}\), the Other-Regarding player’s weight of another’s payoff, for each of the four possible group configurations.

Fig. 1
figure 1

Minimum continuation probability \(\delta ^{\min }\) needed to sustain cooperation

For example, fix the weight that an Other-Regarding subject puts on own payoffs at \(\rho _{s}=0.75\). The minimum required continuation probability is lowest (0.52) if there are no Selfish members in the group. This value, however, is highest (0.69) when there is only one Selfish member in the group. Note this is the case for any value of \(\rho _{s}\).Footnote 17 Intuitively, the lone Selfish faces weak punishment in a group with two Other-Regarding and thus has a strong incentive to deviate. Therefore, we should expect that the cooperative outcome is harder to sustain when there is only one Selfish subject in the group.

Hypothesis 4a

Coordination on minimal effort is least likely for a group with only one Selfish player.

In stating Hypothesis  4a we are equating the success of communication with the size of the minimum continuation probability. Communication will be more likely to produce cooperation in groups with a lower \(\delta ^{\min }.\) But social preferences may affect the success of communication and thus cooperation also through different channels more directly. Although theoretical details about the role of communication and social preferences on behavior remain elusive in settings like ours (see, e.g., Fonseca and Normann 2012), experimental evidence suggests that the effect of communication may depend on the group’s social preference composition. In our setting, theory prescribes that both Other-Regarding and Selfish subjects could use the chat to communicate that they will be using grim-trigger strategies. Coordination will occur (provided it is an equilibrium) because such messages create incentives to the sender to fulfill them, if others consider the message credible (e.g., such messages are “self-committing”, see Aumann 1990; Farrell and Rabin 1996, p. 114). However, although coordination yields utility C each period for everyone and forever, the utility each period from playing always stage-game Nash efforts, \(P_{Ns}^{k},\) depends on the social preferences of the subject (k) for each group composition (Ns). Specifically, \(P_{Ns}^{s}>P_{Ns}^{o}\) for Ns = 1, 2, i.e., Selfish subjects fare better than Other-Regarding subjects in the punishment phase.Footnote 18 In repeated games, successful communication should specify arguments in favor of coordination that require agreements not just on a single action but on entire contingent plans (see, e. g., Farrell and Rabin 1996; Cooper and Kühn 2014). Selfish subjects therefore may use the chat to convey they indeed have the least to lose if the outcome is stage-game Nash forever so their messages should be more credible. Credible threats of punishment have been found to be the most effective type of communication as they are associated with significantly more coordination (Cooper and Kühn 2014). In our setting, every player can use the chat to communicate, but promises of punishment made by Selfish subjects should be more credible than the promises made by Other-Regarding subjects, everything else equal. This logic, however, leads to a different prediction: Cooperation should be easier to sustain in groups with at least one Selfish subject. Thus, by means of initiating messages advocating for cooperation, Selfish subjects may lead others towards coordination more easily. This leads to our alternative hypothesis regarding the relationship of social preferences and likelihood of cooperation.

Hypothesis 4b

Coordination on minimal effort is more likely for a group with at least one Selfish player than a group with no Selfish players.

The broad results on communication in infinitely repeated settings preclude us from providing a clear prediction as to which of the two forces dominates. Thus, ultimately, we treat the question of which group composition is most frequently associated with coordination as an empirical one when communication is allowed.

5 Experimental results

We begin by classifying subjects by their social preferences derived from their giving behavior. We then use this classification to study the relationship of individual social preferences as well as group composition of social preferences and effort choices. We study the likelihood of observing a “collusive” outcome and how it relates to social preferences in Sect. 5.4. In Sect. 5.5 we present a short summary of the robustness checks we conducted.

5.1 Categorizing social preference types from giving menus

Table 2 summarizes the mean choices of our subjects under all 9 price vectors in the Chat and No Chat treatments. Vector (a, b, c) represents the price a of giving to one’s self, the price b of giving to player 1, and the price c of giving to player 2.

Table 2 Giving rates

We see that regardless of the price of giving, subjects keep on average just above 70% of their endowment. Using these choices, we sort our subjects into Selfish and Other-Regarding. A subject is categorized as Selfish if he or she does not allocate any tokens to the other group members in any of the nine periods. All subjects who at some point allocated tokens to their group members are categorized as Other-Regarding. In Sect. 5.5 we report the results of an alternative classification, where a subject is classified as Selfish if he or she kept at least 90% of the endowment on average over the 9 rounds. Furthermore, we explore other, more detailed, categorizations of social preferences. Details for all of these can be found in the online appendix. Using our Baseline categorization, and taking together the two treatments (Chat and No Chat) most of the participants (81%) are categorized as Other-Regarding. The remaining subjects (19%) are categorized as Selfish.Footnote 19

As described in Sect. 3 subjects were randomly allocated into groups without regard to their social preference type. Table 3 shows the distribution of Selfish subjects across treatments. We observe both Selfish and Other-Regarding in each treatment. Table 4 shows the distribution of groups with different numbers of Selfish group members in the second stage. Since subjects were allocated randomly and Selfish subjects are relatively rare we do not observe groups with only Selfish group members in the Chat and No Chat treatments. Otherwise, we do observe random variations across groups in the number of Selfish subjects which we will use to identify the effect of group composition in the next sections.

Table 3 Distribution of Selfish and Other-Regarding individuals by treatment
Table 4 Distribution of groups by number of Selfish for each treatment

5.2 Social preferences and effort

Figure 2 provides a summary of effort choices over time by treatment. In both the Chat and the No Chat treatment we observe average effort of around 8 units at the beginning of the relative incentives stage. As hypothesized in Sect. 4, there is a strong tendency to coordinate on lower efforts over time when subjects are able to communicate in the Chat treatment (dashed line). When communication was absent (No Chat treatment), average effort stays close to the one-shot Nash equilibrium prediction (i.e., 10) for the Selfish type (dotted line).

Fig. 2
figure 2

Average effort by treatment over time

How do individual social preferences and group composition relate to efforts? To find an answer to this question we exploit the random allocation of subjects into groups. We compare behavior of groups with different numbers of Selfish and Other-Regarding individuals in each of the two treatments.Footnote 20

Fig. 3
figure 3

Overview of effects of social preferences on effort

Figure 3 gives a first overview of our findings. Consider first panel a). We compare the average effort of subjects categorized as Selfish with the average effort of subjects categorized as Other-Regarding. We see that for both treatments, average effort is higher for subjects categorized as Selfish, although a t test rejects equality only for the No Chat treatment (p values: p < 0.60 in Chat and p < 0.01 in No Chat).Footnote 21 Thus, we find that in the No Chat treatment, average efforts are similar to the stage game Nash equilibrium efforts (i.e., efforts of 10 for Selfish individuals) rather than to a cooperative outcome of (1, 1, 1) and Other-Regarding subjects provide lower efforts on average.

In panel (b) we consider average group effort as a function of the number of Selfish players within a group. When communication was not possible, we observe that each additional Selfish group member modestly increases average group effort though none of these increases reach statistical significance. When communication is possible, there is a pronounced increase in average effort when comparing a group with two Selfish group members versus those with fewer Selfish members; however, likely due to only one group with two Selfish members in the data, the difference does not reach statistical significance. Meanwhile, groups with only one Selfish member generate the lowest average effort.

We further explore differences in group average effort choices as a function of the number of Selfish subjects through regression analysis in Table 5. We use as the dependent variable the group effort averaged over all rounds of play (at stage 2, our relative performance stage) in columns 1, 2 and 3, and averaged over the final periods, periods 30–40 in columns 4, 5 and 6. Pooling both treatments together (columns 1 and 4) we find no effect of social preferences on average group effort over all periods, while there is a positive, marginally significant effect of the number of Selfish group members on group effort in the last periods. Each Selfish group member increases average group effort by about 1.4 units. Consistent with the literature on non-binding communication as a coordination device (see e.g. Cooper and Kühn 2014), the possibility to communicate significantly reduces average group effort by about 6.2 units relative to the No Chat treatment over all periods and 6.5 units in late periods only. This result supports Hypothesis  3. Splitting the data by treatment, we do not find a significant effect of Selfish group members in the Chat treatment (columns 2 and 5). This is likely the result of greater effort from a group with two Selfish members cancelling out the reduced effort of the groups with only one Selfish member. In contrast, when communication is not possible (No Chat treatment, column 3), each Selfish group member increases average group effort by approximately 0.9 units, which equals a 9% increase over our baseline mean effort of roughly 9.7 per period. This effect is slightly larger in the last periods (column 6).

Table 5 Effect of group’s social preference composition on group effort

To disentangle the effect of an individual’s social preferences from group composition effects, we estimate a random effects model, clustering standard errors at the group level.Footnote 22 The dependent variable is individual effort in a given period and the explanatory variables are: Selfish and the number of other Selfish individuals in each group (# Other Selfish). We control for # Other Selfish since, as given in Sect. 4, we expect Selfish and Other-Regarding group members to influence efforts differently.Footnote 23 This means that we are exploring the effect of individual social preferences conditional on how many other Selfish players are in one’s group.

Table 6 reports our results. For the Chat treatment, all social preference variables remain insignificant (columns 1 and 4). On the other hand, we find further evidence that Other-Regarding subjects choose significantly less effort in the No Chat treatment both overall and in the last periods only (columns 2 and 5). Controlling for group composition, these subjects choose 1.5 fewer units of effort on average and 1.9 fewer units in the last periods. The group composition effect on the other hand, is positive but insignificant. Thus, absent communication, Other-Regarding subjects depress efforts relative to Selfish subjects, but only through the channel of individual social preferences. Finally, when pooling both treatments (columns 3 and 6), we again observe that the possibility of communication significantly reduces individual effort by about 6–6.5 units for both Selfish and Other-Regarding. Also, consistent with the analysis by treatment, Selfish individuals provide significantly more effort in the No Chat treatment than Other-Regarding ones, while this difference does not reach statistical significance in the Chat treatment. Testing for joint significance of the coefficients of Selfish plus Chat * Selfish the p values equal p = 0.443 for all periods and p = 0.208 for periods 30–40.

Table 6 Effect of own and others social preferences on own effort

Overall, these results suggest that, absent communication, average efforts are consistent with stage game Nash equilibrium efforts, as hypothesized in Section 4, Hypothesis  3. This provides our first two primary results, which are consistent with our first two hypotheses:

Result 1

Absent communication, Other-Regarding subjects depress efforts relative to Selfish subjects.

Result 2

Absent communication, average group effort increases in the number of Selfish group members

When communication is introduced, however, efforts seem to more closely follow the cooperative outcome and results are somewhat surprising: The presence of one Selfish individual leads to lowest aggregate efforts. This is due to Selfish individuals being more likely to lead by suggesting coordination on low efforts, as we describe it in the next section.

5.3 Effort choices, chat and leadership

In the Chat treatment, a subject can take the initiative through chat, asking the group members to jointly exert low effort. This coordinating leader can then overcome the equilibrium selection problem. From the content of chat messages we label a “Min-Effort Leader” as a subject that is the first to propose coordinating on the minimum effort case (i.e., for all group members to provide effort of 1).Footnote 24 Thus a group can at most possess one Min-Effort Leader. We identify 13 Min-Effort Leaders (21%) among the 63 subjects (21 groups) in the Chat treatment and thus 13 out of 21 groups exhibit such a leader.Footnote 25

How is leadership related to social preferences? We find that 7 of the 13 Min-Effort Leaders are Other-Regarding while 6 are Selfish. That means that 15% of all Other-Regarding are Min-Effort Leaders while 40% of all Selfish are Min-Effort Leaders. Thus, considering the likelihood of a given individual becoming a leader, Selfish individuals are more likely to be Min-Effort Leaders than Other-Regarding ones. A Pearson chi–squared test confirms that this difference is significant at the 5% level (p = 0.03).

Do social preferences affect outcomes in the Chat treatment beyond the likelihood of a Selfish subject emerging as a coordinating leader? To answer this question, we again perform group and individual level regressions, now controlling for the emergence of a Min-Effort Leader. Table 7 presents the results of the group level analysis. Group average effort, averaged over all periods (columns 1–2) and periods 30–40 (columns 3–4) is regressed on the number of Selfish in a group, whether the group exhibited a Min-Effort Leader and their interaction. Groups in which a Min-Effort Leader emerged have significantly lower effort levels. Furthermore, overall, an additional Selfish group member increases efforts by about 2 units, significantly so in the last 11 periods. Controlling for whether a Min-Effort Leader emerged in a group (columns 2 and 4) we find that an additional Selfish group member increases average effort only when no Min-Effort leader emerged in a group. In groups with a Min-Effort Leader the number of Selfish group members does not affect average effort. In these groups, the sum of coefficients on number of Selfish and Min-Effort Leader in Group equals −0.441 for all periods and −0.356 for the later periods and p values are 0.657 for all periods and 0.506 for late periods, respectively.

Table 7 Effect of social preferences on group effort controlling for emergence of a Min-Effort Leader (Chat treatment)

To complement the group-level analysis, Table 8 reports the results of a random effects model exploring individual effort choices. Column 1 shows a regression without considering leader emergence, analogous to the results reported in Table 6. In column 2 we add a control for whether a Min-Effort Leader has emerged and whether the subject herself is a Min-Effort Leader. Notice that the coefficients of own social preference as well as group members’ social preferences are highly significant and larger in magnitude once controlling for leadership in this way. This means that after controlling for the effect of social preferences influencing leadership emergence, social preferences lead to significantly lower group efforts. The effect is slightly larger in magnitude than in the No Chat treatment. Precisely, a Selfish subject puts in 2 units effort more per period than an Other-Regarding subject, after controlling for the emergence of a coordination leader. Furthermore, the presence of an additional Selfish group member increases a subject’s own effort by 2 units per period also controlling for leader emergence.

Table 8 Effect of social preferences on individual effort controlling for leadership (Chat treatment)

Column 3 includes interactions of social preference measures and the emergence of a leader. We find that social preferences depress efforts when a Min-Effort Leader has not emerged in a group. At the beginning of the game Selfish subjects expend an additional 2.8 units of effort relative to Other-Regarding subjects. Also, having an additional Selfish group member increases individual effort by about the same amount. Once a leader emerges there is no difference between Selfish and Other-Regarding choices. Formally, testing the joint significance of the coefficient on Selfish and the coefficient on the interaction MELeaderE * Selfish gives a p value of 0.87 and a joint coefficient of negligible magnitude of 0.068. Testing the joint significance of the coefficient on # Other Selfish and MELeaderE * # OthSelf gives a p value of 0.87 as well. Finally, note that the coefficient of Min-Effort Leader is insignificant. Thus, Min-Effort Leaders do not lead also by good example: i.e., they only lead through suggesting low effort by chat message and not through actually initiating lower effort themselves. Column 4 reports estimates from only the last 11 periods of play and finds results similar to those reported in column 3.

We conclude that social preferences are an important determinant of group effort also in the Chat treatment, though in a more nuanced way. On the one hand, subjects use communication to coordinate the group on a cooperative outcome, consistent with Hypothesis  3. Such a “leader” tends to be a Selfish individual. This explains why the presence of only one Selfish individual is associated with lower efforts in the Chat treatment (see Fig. 3b). On the other hand, controlling for the existence of a leader in the group, Other-Regarding subjects have a tendency to put in lower effort than their Selfish counterparts before a coordination leader emerges, exactly as in the No Chat treatment, suggesting these individuals internalize the externality their effort inflicts on their group members. From a principal’s perspective these results suggest that in a work environment where communication is possible a heterogeneous social-preference group leads to the lowest work effort: adding a Selfish subject to an otherwise Other-Regarding group of workers could promote the emergence of a leader to coordinate on low efforts.Footnote 26 Finally, once a coordination leader emerges and is successful, both Selfish and Other-Regarding workers are providing the same minimal effort, which means that there is no longer a difference between their efforts. Thus, our analysis yields two additional results:

Result 3

Consistent with Hypothesis  4b, Selfish subjects are more likely to lead others to coordinate on low efforts.

Result 4

Without/before the emergence of a coordination leader, Other-Regarding subjects depress efforts relative to Selfish subjects, consistent with Hypothesis  1. When a leader emerges, there are no differences in effort choices between Other-Regarding and Selfish subjects

5.4 Propensity to cooperate

Thus far we have been focusing on the relationship between social preferences and depressed efforts. Depressed efforts can of course also be a consequence of coordination (or collusion if we interpret our design as a competitive setting). While we are naturally unable to observe our subjects’ strategies directly, we take an indirect approach and measure the frequency of “collusive” outcomes consistent with coordination on minimum efforts: That is, all three players coordinate on efforts of 1 [i.e., efforts of (1, 1, 1)]. We additionally include as “collusive outcome” the setting where all three players coordinate on the outcome of two players choosing effort of 1 while a third player chooses maximal (payoff) effort of 12, and then the players alternate the player who gets the maximal payoff. This latter form of coordinating on low efforts is only witnessed in the Chat treatment.Footnote 27

Figure 4 depicts the dynamics of groups achieving the “collusive” outcome in the Chat treatment. Here, we separate groups by the number of Selfish members (groups with 0, 1, or 2 Selfish members). Similar to our results on efforts from Sect. 5.3, when chat is available, groups with 1 Selfish member are more likely to exhibit collusive outcomes than groups with no Selfish members. When we expand the definition of “collusion” to include the case of the group cycling efforts of (1, 1, 12) across players, we again find groups with 1 Selfish member are more successful at achieving the cooperative outcome than groups with no Selfish members. Note though that the fraction of groups choosing the turn-taking strategy (1, 1, 12) is similar for groups with one or no Selfish group member, which means that this outcome does not seem to be related to social preferences.

Fig. 4
figure 4

Fraction of groups achieving (1, 1, 1) for 3 successive rounds of play (hollow symbols) and (1, 1, 1) or alternating (1, 1, 12) for 3 successive rounds (solid symbols) by number of Selfish group members for the Chat treatment

Comparing the results in Fig. 4 with the results in Fig. 3 leads to an interesting observation. Even though groups with one Selfish member are more likely to cooperate, average effort is quantitatively not very different from a group with no Selfish (3.2 vs. 4.1). As already explained in Sect. 5.3 the reason for this is that in the “pre-collusion phase” groups with no Selfish members put in lower efforts than groups with one Selfish member (average effort is 5.4 in a group of only Other-Regarding vs. 7.5 in a group with one Selfish prior to the emergence of a Min-Effort Leader). This further corroborates our result that social preferences seem to matter in complex ways when communication is possible: Selfish individuals play an important role in facilitating coordination while Other-Regarding have a tendency to put in lower efforts even without coordination (Hypothesis  1). Thus, we summarize our final primary result, which is consistent with Hypothesis  4b, but contradicts Hypothesis  4a:

Result 5

With communication, coordination on minimal effort is more likely for a group with only one Selfish player.

For the No Chat treatment, coordinating on a “collusive outcome” was more difficult, since subjects were not able to chat. As shown in Table 9, we find for this setting that only 1 out of 21 groups end up with minimum efforts in the last 3 periods and only if the group has no Selfish members. One other group with no Selfish group members managed to sustain (1, 1, 1) for 3 periods during the course of the game, but then reverted back to higher effort. If we expand the definition of “collusive” outcome to include two subjective cases of “collusion” (we report their behavior in the online appendix), then we find one additional group with no Selfish members and one additional group with 1 Selfish member successfully “collude” by the end of the game. Again, it seems that collusion is not a main driver of behavior in this treatment and results seem more consistent with the predictions of the stage game.

Table 9 Propensity to “collude” by # of Selfish in the No Chat treatment

5.5 Robustness checks

We performed a number of robustness checks on which we report in more detail in the online appendix. First, our regression results using our baseline classification are robust to clustering standard errors at the individual level instead of the group level in our individual-level analysis. Second, we relaxed the definition of Selfish to include all individuals who kept 90% or more on average. Using this classification, we find support for Result 1 and  4, while we do not find contradictory evidence to Result  2. Regarding Result  3 we still find that Selfish are more likely to be Min-Effort leaders (18% of Other-Regarding vs. 24% of Selfish are categorized as Min-Effort Leaders), though this difference is now smaller and ceases to be significant. Finally, regarding Result  5, we again observe that groups with one Selfish group member are most likely to coordinate on minimum effort though also this difference is attenuated.

We also explored an additional continuous social preference measure. In particular, we conducted individual-level regressions using the average endowment keep in rounds 1–9 directly in our regressions. Our results are qualitatively unchanged. In addition, since effort choices are constrained to be between 1 and 12, we re-run our analysis using a Tobit panel model. We find these results are qualitatively the same. We also conducted our individual level analysis controlling for gender, education major, and risk preferences, and find the results qualitatively unchanged. Furthermore, none of these additional controls show consistent patterns throughout the analysis. Finally, in order to create a starker contrast between groups, we compare average efforts of Selfish to Other-Regarding individuals who on average kept less than 50% of their endowment. To do this we ran a random effects regression of individual effort on Period and an indicator whether that individual is Selfish for the Chat and No Chat treatment, dropping all subjects that kept between 50 and 100% of their endowment. Consistent with our baseline results in the Chat treatment there is no significant effect (not controlling for leadership) while in the No Chat treatment Selfish individuals expend around 1.7 units more effort than Other-Regarding individuals.

Since the environment we study is dynamic with fixed matching, subjects can respond to past effort choices of their group members. Controlling for the social preferences of the group members can account for some of this path dependence in our analysis, though it is clearly imperfect. Thus, we finally conduct our analysis including lagged effort choices of all group members. Both own and others’ lagged effort are significant and important predictors of individual effort choices. Nonetheless, our previous social preference parameters are still significant, although attenuated since we are now controlling for past choices.

6 Conclusion

We studied how an important dimension of worker heterogeneity affects the performance of those subject to relative performance incentives. In particular, we found that a basic form of social preferences, the degree of other-regardingness, is substantially linked to reduced effort choices, but in a complex way. First, subjects categorized as Selfish are more likely to coordinate their group members to exercise minimal efforts, when communication is available. Second, before the emergence of such leaders, subjects categorized as Other-Regarding exert lower levels of effort—an average of over 30% lower effort. Thus, when communication is available, a group that is heterogeneous in social preferences can most successfully create and sustain very low efforts over those groups with no Selfish members. Finally, when communication is not available, groups of Other-Regarding subjects produce the lowest levels of effort. Since we find little evidence of collusive outcomes, this is again consistent with the idea that Other-Regarding individuals internalize their efforts’ negative externality imposed on other people’s payoffs.

Our findings suggest that for organizations attracting more other-regarding workers (e.g., firms engaged in corporate social responsibility or non-profit firms), relative performance incentives are unlikely to be as effective as they are for other organizations. For firms using relative incentive pay, screening workers for particular positions according to their social preferences could improve performance. Human resource departments often provide potential workers with psychological-based exams. These could readily incorporate explicit measures of other-regardingness. Similarly, information obtained from resumes, such as a potential worker’s involvement in philanthropic activities, could shed light on a worker’s degree of other-regardingness.

We note that we did not consider the case where workers might value their firm’s payoff. Thus, our results can be seen as applying to settings where ownership is dispersed or the worker is removed from the top of the hierarchy. Finally, our measure of leadership is endogenous to the effort exerted in each group. It is an interesting challenge to design an experiment in which leadership varies with incentives and analyze how it relates to social preferences.

Although our setting only allows for the possibility of valuing negative externalities, to the extent that workers also value their positive externalities, other-regarding preferences could mitigate the free rider problem amongst teams. That is, a team of workers with Other-Regarding preferences that receive a share of the common output are more likely to provide higher outputs, as they further value their effort’s positive effects on their team members. We leave these topics for future research.