1 Introduction

How is responsibility attributed when an outcome results from a chain of actions? First, consider a disaster (i.e., sinking of large ships) as an example of a bad outcome. Disasters are often a result of a chain of unfortunate circumstances, decisions, and actions. Whittingham (2004) presents several examples of disasters and discusses the responsibility of the different people involved. Typically, someone makes a mistake which is not detected or appropriately fixed. This mistake causes/adds to further problems until a disaster is unavoidable. Similarly, good outcomes are often the result of the (sequential) interaction of people. For instance, joint production is an example of a positive decision chain. If a firm releases a product, research and development, production, and marketing sequentially contribute to the success. In our study, we experimentally investigate how people attribute responsibility in decision chains by allowing people to allocate blame and praise to others in the form of punishment and reward.Footnote 1

The general question of responsibility attribution has been addressed from different angles. A normative point of view has been taken from a philosophical (Feinberg, 1970) as well as from a legal perspective (Hart and Gardner, 2008). More recently, the question has also attracted the interest of psychologists (Ross and Nisbett, 1991; Weiner, 1995; Gerstenberg et al., 2011) political scientists (Iyengar, 1994) and economists, both from an empirical (Charness, 2000; Bartling and Fischbacher, 2012; Bartling et al., 2015; Duch et al., 2015) and theoretical perspective (Besley, 2006; Bartling and Fischbacher, 2012; Engl, 2022). Our study has an empirical point of view as we investigate in a lab experiment how people assign responsibility using punishment and reward. Understanding the empirical patterns of responsibility attribution is important because it has consequences for how we setup liability rules and how we distribute the benefits of joint ventures. Our experiment shows what rules of responsibility attribution people spontaneously apply, and the analysis of the heterogeneity provides insights on how well people agree with respect to these rules.

In real life decision chains, the decision makers and the actions differ in many dimensions. In our experiment, we study the impact of the sequence in isolation. For this reason, we investigate decisions in a sequential voting game, in which symmetric voters decide with majority over a good or bad outcome for other people. The subjects in our experiment are matched into groups of five voters and five recipients. The five voters choose sequentially between two options of how to allocate points between voters and recipients, while the outcome is determined by simple majority rule. The two allocations differ in their fairness: the unfair allocation favors the voters, while the fair allocation results in similar payoffs for the voters and recipients. The recipients receive full information about the voting sequence and then have the possibility to sanction the voters. Our treatments differ in the sanctioning options. There is a treatment with only punishment, one with only reward, and one with both. Finally, we use process measures and record the response times for all participants and use eye-tracking for the recipients. We make several contributions to the existing literature. We study different outcomes (good and bad outcomes), compare different sanctioning options (punishment and reward), explore more motives (outcome, choice, intention, initiation, pivotality, causal responsibility based on models), and add process measures (response time and eye-tracking). We are investigating the following research questions:

First, we study how responsibility is attributed to different roles in the decision chain. We investigate how the position in the decision chain affects responsibility attribution, and we use two theoretical measures of responsibility as predictors (Bartling and Fischbacher, 2012; Engl, 2022). Two voters are in a particular focus, the first voter in favor of the resulting outcome, which we call initiator and the third voter in favor of the resulting outcome, which we call the pivotal voter.Footnote 2 Bartling et al. (2015) show that in sequential decisions pivotal voters are blamed the most for unfair outcomes. Duch et al. (2015) find that in simultaneous collective decisions, proposal power plays an important role for responsibility attribution. In our design with five voters, we can compare the initiator and the pivotal player with a majority voter who is neither. Further, we can distinguish between majority voters who still had a say and those who had no more influence on the outcome. We call the former the intentional voters and the latter the non-intentional voters.Footnote 3 On the aggregate, our experiment shows that punishment targets people who vote for the unfair outcome. People who potentially had an impact on the outcome, and in this respect are intentionally unkind, are punished more, and the pivotal players even more within this group. Analogously, choosing the kind option leads to rewards, which are higher if the choice can be considered as intentional and even higher if the choice is pivotal. We do not find that higher reward or punishment is assigned to the initiator of a good or bad outcome. The theoretical measures of responsibility (Bartling and Fischbacher, 2012; Engl, 2022) are correlated with the empirical responsibility attribution but the correlations are not very high and these measures do not outperform simpler measures.

Second, we study whether responsibility attribution differs for good and bad outcomes in comparable situations. We do so, by making use of our treatments with reward and punishment options, as well as the combination, in which both reward and punishment are available. As mentioned above, the evaluation of responsibility is consistent between reward and punishment. Subjects reward others for good outcomes very similarly to how they punish others for bad outcomes – both on the aggregate level as well as in the individual analysis introduced below. The environment in which both reward and punishment are available shows that people tend to prefer to use punishment over reward. However, responsibility attribution is less differentiated compared to the environments in which only one option is available.

Third, as the explanatory power of the general model is not very high, we explore individual patterns in responsibility attribution. Studying different patterns in social interactions has been the object of several studies.Footnote 4 We find that the individual behavior can be classified into three main groups. These groups are analogous in the reward and the punishment condition. There is a group of subjects who barely rewards or punishes. Another group of subjects particularly targets the pivotal voter, and a third group of subjects mostly attributes responsibility according to the choices of the voters.

Fourth, we investigate how the voters respond to the incentives created by the option of punishment and reward. We find evidence that voters are (at least partially) aware of how responsibility is attributed. In particular, they are aware that pivotality matters and partially use delegation in order to avoid blame for unkind decisions or seek responsibility for kind decisions in order to gain credit.

Fifth, we study the underlying decision process of responsibility attribution. We analyze the response time patterns of voters and find that they have longer response times when they are potentially pivotal, i.e., if their decision can finalize the outcome. Further, we use eye tracking during the sanctioning decisions. Our results show that the gaze analysis does not confirm the behavioral focus on the pivotal player. If any player is more in the focus, then it is the initiator.

The remainder of this paper is structured as follows: Sect. 2 explains the experimental design used in this study while Sect. 3 outlines the different motives used to study responsibility attribution and lists our predictions. Our results are presented in Sect. 4 and Sect. 5 concludes.

2 Experimental design

We build on Bartling et al. (2015) who investigated a sequential voting task with punishment. We add a treatment with reward in order to directly compare responsibility attribution for good and bad outcomes, and we increase the number of voters as it allows investigating more roles in this sequential decision process. In our experiment, we randomly assign the role of voter and recipient to subjects. Five voters and five recipients form a group and keep their roles throughout the experiment. The five voters sequentially decide between two allocations in order to distribute 50 points among all ten group members. There are two sets of allocations. In one set, voters can choose between a fair allocation, in which all group members receive 5 points, and an unfair allocation, in which the voters receive 9 points each and the recipients receive 1 point each. In the second set, the voters can choose between a fairer allocation, in which the voters receive 6 points and the recipients receive 4 points, and an unfair allocation, in which the voters receive 8 points each and the recipients receive 2 points each. We chose the two sets in such a way that the alternatives create a similar trade-off, and, thus, the two sets can be treated equally.

The position of the voters in the voting sequence is randomly determined. Each voter is informed about the decisions of all previous voters in the sequential process before choosing an allocation. A majority rule is applied, which means that the allocation that is chosen by at least three of the five voters is implemented.

The recipients are informed about the individual voting decisions and thus also about the voting outcome. One randomly determined recipient receives an extra point and has the option to sanction the voters. We vary the sanction option across three treatments: Recipients can only punish voters (Punishment), they can only reward voters (Reward), and they can reward and punish voters (Both). In all the treatments, the recipient first has to decide whether to sanction the voters at the cost of the extra point by clicking a button. In the second decision, the recipient can then assign 0 to 7 reward and/or punishment points to each voter individually. Figure 1 illustrates an exemplary decision screen of a selected recipient who decided to sanction the voters.

The payoff of each voter is determined by the resulting voting outcome and the reward or punishment points the voter receives from the recipient. Each recipient gets a payoff according to the chosen allocation. The selected recipient can additionally keep the extra point if she decides not to sanction.

Fig. 1
figure 1

Exemplary decision screen of a recipient in the treatment Both. Note: Voters are denoted as A1 - A5 and their decisions are indicated by a check at the selected allocation. The positioning of the allocations in the top or bottom row was randomly determined. The outcome in this example is the unfair allocation (9 points for each voter and 1 point for each recipient) and the recipient attributes three reward points to voter 2 and deducts four punishment points from voter 4. We added the respective allocation on both sides of the screen to minimize subject’s gaze being biased towards one side of the screen. The font size in the figure was enlarged for better readability

The game is played as a one-shot game and we use the strategy method for both voters and recipients, that is, each voter and each recipient makes choices for all possible scenarios. Each voter chooses between the fair and unfair allocation in every voter position for every possible combination of previous voter choices. This results in 31 binary choices for each of the two allocation sets which we display in random order. Additionally, the voters play one round of a dictator game for each allocation set resulting in two additional decisions. All recipients act as if they were chosen to be the recipient to sanction. For every possible voting sequence, the recipients decide whether they want to sanction any of the voters and if yes, by how much they want to sanction each voter. The scenarios differ in the decision constellations of the voters and the allocation sets.

As process measures, we collect response time data for both voters and recipients. In addition, we use eye-tracking to record the gaze pattern of the recipients to evaluate the information recipients use when attributing responsibility to the voters.

2.1 Procedural details

The experiment was programmed using the software “z-Tree" (Fischbacher, 2007). Participants were students who were recruited by the data-system ORSEE (Greiner, 2015). In total, nine sessions were conducted in February 2019, three sessions for each treatment. The experiment was carried out at the experimental laboratory of the University of Konstanz (Lakelab) in Germany. Each session consisted of two groups, 10 voters and 10 recipients, such that there were 30 voters and 30 recipients in each of the three treatments (Punishment, Reward, Both). One subject was excluded from the analysis due to insufficient attention during the session.

The average age of our subjects was 22 years (min: 18, max: 33) and 55.56% of the subjects were female. The subjects earned on average 22.58 EUR (about 25.40 USD at that time) which included a show-up fee and an extra compensation for the usage of eye-tracking. The sessions lasted 90 min on average.

We used Tobii EyeX eye-trackers with a sampling rate of 60Hz to record gaze data. The subjects used chin rests to improve data quality and the seating distance to the screen was approximately 58 cm. The screens were 22 inch color monitors with a resolution of 1920x1080 pixels. The calibration of the subjects to the eye-tracking system was done at the beginning of the experiment via a seven-point calibration. Two additional subjects have been excluded from the eye-tracking analysis because of technical issues and poor quality of the gaze data.

Fixations are identified with the help of the DBSCAN-algorithm (Ester et al., 1996). We create ten non-overlapping areas of interest (AOI), each with a radius of 90 pixels. Each AOI covers a box on the decision screen indicating whether the voter voted in favor or against the specific allocation (see Fig. 1). Therefore, for each voter there are two AOIs. The horizontal distance between the centers of two AOIs was 320 pixels and the font size of the cues was set to 20.

3 Criteria and theoretical predictions

In this section, we present criteria according to which people could attribute responsibility. It includes motives like intention-based reciprocity and measures of responsibility. We will assess the relevance of these criteria by using them as predictors for reward and punishment. The measures that we will present first have not been intended to be used as measures of responsibility. However, these measures have been suggested to explain reward and punishment. Even when they do not capture all facets of responsibility, they capture some facets, which has also been discussed in the theoretical literature on responsibility. For example, Shaver (1985) mentions dimensions of responsibility, among them, intentions.

Outcome. Outcome-oriented models such as Fehr and Schmidt (1999) and Bolton and Ockenfels (2000) are quite popular. In our voting game, it means that reward and punishment is not directed to a specific voter. The assessment only depends on the outcome. The unfair alternative is considered as bad, and the fair alternative as kind or neutral.Footnote 5 The models predict equal punishment for all voters in case of an unfair outcome and if at all, equal reward in case of a fair outcome.

Choice. This motive assumes that a vote for the fair allocation is perceived as kind and a vote for the unfair allocation as unkind, independent of whether the vote was relevant for the outcome or not. It can be considered as a naive notion of intention. If the voter would not take the behavior of the other voters into account and would believe that their own vote is decisive, then their own vote would correspond to their preference and the vote would express their intention.

Intention. This motive captures preferences as suggested in the reciprocity models of Rabin (1993) and Dufwenberg and Kirchsteiger (2004). Since there are only two options, either both are neutral, or one option is kind and the other is unkind. Theoretically, whether voting for fair or unfair is kinder depends on the belief of what the other voters do.Footnote 6 However, in the experiment, voting for the fair allocation always results in a higher probability of getting the fair outcome than voting for the unfair allocation. Thus, as long as a majority for one of the allocations is not reached, votes are impactful and voting for the fair allocation can be considered as kind and voting for the unfair allocation as unkind. As soon as the decision is made and the outcome can no longer be changed, votes are no longer impactful, no intentions can be inferred, and the vote is considered as neutral.

Initiation. This motive is motivated from Duch et al. (2015) who showed that proposal power is an important aspect in how subjects attribute responsibility for collective decisions.Footnote 7 Applied to our experiment, it assumes that the first voter who votes for the resulting outcome has a special responsibility for the outcome.

Pivotality. This motive is motivated from Bartling et al. (2015) who found that the pivotal voter is punished more than the non-pivotal voters for voting for the unfair outcome. In our setup, the pivotal voter is the third voter who votes in favor of one of the two allocations. After this choice, the outcome is determined and can no longer be changed.

Bartling and Fischbacher (2012) Responsibility Measure. In the responsibility measure formalized by Bartling and Fischbacher (2012) (from now on called BF Responsibility), the responsibility of the different voters for a certain outcome is assigned proportional to how much their vote contributes to an increase in the probability that this outcome results. The measure depends on the belief about the voter’s decisions. In order to keep the following explanations simpler, we present the case where the outcome is unfair. The measure works exactly in the same way for the fair outcome. It is calculated as follows: First, we calculate for every decision node the probability that the unfair outcome results. Next, each action (e.g. each vote) gets a raw responsibility, which is the difference between the probability before and after the action. Finally, in order to get the responsibility measure, the raw responsibility is normalized. This means specifically: The responsibility of an action that does not increase the probability of an unfair outcome is set to zero. The responsibility of an action that increases the probability of an unfair outcome is the raw responsibility divided by the sum of all positive raw responsibilities along the decision path from the start to the final outcome. This measure lies between zero and one. As mentioned above, it depends on a belief about the voters’ decisions. Practically, we use the empirical distribution of voters’ decisions as their belief. Note that the responsibility measure refers to the outcome. We expect that responsibility for the unfair outcome triggers punishment and responsibility for the fair outcome triggers reward.

Engl (2022) Responsibility Measures. Another notion of responsibility has been suggested by Engl (2022). We consider a simple variant of the model and explain what it predicts in our case. Engl (2022) distinguishes between ex-ante and ex-post causal responsibility. For the calculation of ex-post causal responsibility, the outcome is considered as given. The idea of this measure is that the attributed responsibility of an action increases in proportion to the pivotality of this action. An action is considered as pivotal if it causes the outcome and not choosing it would result in the counter-factual outcome. With respect to the ex-post responsibility, the action has the outcome as consequence and, therefore, the responsibility of the action is just the probability that not choosing this action causes the other outcome.Footnote 8 Note that the ex-post responsibility is defined for an action within a decision path. The ex-ante responsibility of a vote is calculated as the expected value of the ex-post responsibility of all paths following this vote. Thus, the ex-ante responsibility fixes the target vote as well as the preceding votes and calculates the expected value of the ex-post responsibilities over all paths that follow the target vote.

Let us illustrate the idea of ex-post responsibility in a few examples where F refers to a vote for the fair allocation, U refers to a vote for the unfair allocation, and the order of F and U refers to the sequence of votes. So, FUFUU represents a situation, in which the first and the third voters vote for fair and the other voters vote for unfair. The fourth and the fifth vote in this sequence are fully responsible for the unfair outcome because choosing fair would result in the fair outcome. In the sequence UFFFF, the fair outcome results. The last voter in this sequence is not responsible for the fair outcome at all because a change of the action does not change the outcome. The ex-post responsibility of the fourth voter in the sequence UFFFF depends on the probability p that the last voter chooses U. If the fourth voter would vote U, then the unfair outcome would result with a probability of p. Thus, the ex-post responsibility of voter four for the fair outcome equals p. Note that different from the BF responsibility measure, the Engl responsibility for the fair outcome is not always zero when the unfair outcome results and vice versa. For example, the first unfair vote (U) in UFFFF does not have zero responsibility for the fair result, and the first voter who votes for F bears some positive responsibility for the unfair outcome. Therefore, we consider the difference between the responsibility for the unfair and the fair outcome as predictors. We use the variables Ex-ante Engl Difference (U-F) and Ex-post Engl Difference (U-F) as predictors for the Punishment treatment and Ex-ante Engl Difference (F-U) and Ex-post Engl Difference (F-U) as predictors for the Reward treatment.

Table 1 shows a summary of the presented criteria and their theoretical predictions, which we use to analyze how people attribute responsibility. The third column of the table illustrates the theoretical predictions of how responsibility is assigned for the unfair outcome in the voting sequence FUFUU. The fourth column of the table provides the theoretical predictions of the responsibility assignment for the fair outcome in the voting sequence UFFFF. The tables in Section A.3 of the appendix display the responsibility predictions for each voter in each voting sequence according to the models by Bartling and Fischbacher (2012) and Engl (2022).

Table 1 Overview of responsibility measures

In the exemplary situation of the voting sequence FUFUU, the unfair outcome results. Therefore, the outcome-based models would predict equal punishment for all voters. If Choice is used as a criterion to assign responsibility, unfair choices will be punished equally (second, fourth and fifth voter) and fair choices will be rewarded equally (first and third voter). The intention-based models would predict equal responsibility for the second, fourth and fifth voter for the unfair outcome since a majority of votes was not reached before. With respect to initiation, the second voter would be solely responsible for the unfair outcome since this voter was the first to vote for the unfair allocation. The fifth voter in the sequence is the pivotal voter and would be fully responsible according to this criterion. BF Responsibility would predict zero responsibility for the voters who voted for the fair allocation (first and third position voters) and a responsibility between 0 and 1 for the second, fourth and fifth position voter. As outlined above, the ex-post Engl Responsibility model predicts that the fourth and fifth voters are fully responsible. All the other voters are partially responsible. The ex-ante Engl Responsibility is non-zero for all voters because in all decisions, the unfair outcome is possible. It equals 1 for the last voter, because for the last voter, the ex-ante and the ex-post measures coincide. For voter 4, the ex-ante measure is the expected ex-post measure of the two sequences FUFUF and FUFUU. For voter 3, it is the expected ex-post measure of four sequences, and so on.

4 Results

Our main research focus is how recipients use punishment and reward (attribute responsibility) for sequential collective decisions, which are either fair or unfair. To do so, we first test the criteria and theoretical predictions stated in Sect. 3, followed by an integrative model, and a short analysis of individual heterogeneity and process data. In the analysis of the voters’ behavior, we study voting patterns, strategic voting and delegation, and response times. We do not distinguish between the two possible allocation settings because the results are, as expected, quite similar.Footnote 9Footnote 10

4.1 Sanctioning Behavior

For the analysis, we separate the decisions according to whether the outcome was fair or unfair, and classify each voter into the majority group (those who voted for the resulting outcome) or into the minority group (those who voted against the resulting outcome). In each voting sequence there are between three and five majority voters and between zero and two minority voters. Among the majority voters, we distinguish between voters who vote before a majority is reached and who are therefore impactful for the final outcome (first three majority voters) and non-impactful voters (the possible fourth and fifth majority voter). Finally, we separate the impactful voters (first three majority voters) into the initiator, i.e., first majority voter, the second majority voter and the pivotal voter, who is the third majority voter. The impactful voters are named according to their roles in the sequential decision.

Outcome. Fig. 2 shows the average punishment and reward points for fair and unfair outcomes across treatments. Note that in the Both treatment, recipients could use the seven points for both punishment and reward. Thus, the Both treatment is included in both sub-figures.

Fig. 2
figure 2

Average punishment and reward points for fair and unfair outcomes across treatments. Note: Standard error bars are shown in black

In the Punishment treatment, recipients punish more in unfair outcomes than in fair outcomes (4.04 and 1.40 points, Wilcoxon signed-rank test, \(p\) < 0.001).Footnote 11 In the Reward treatment, more reward points are assigned on average in fair outcomes compared to unfair outcomes (3.63 and 1.59 points, \(p\) < 0.001). In the Both treatment, where subjects can both punish and reward voters, more punishment points are used on average for unfair outcomes (2.84) than for fair outcomes (1.07, \(p\) < 0.001). In contrast, recipients do not reward fair and unfair outcomes differently (0.91 and 0.83 points, \(p\) = 0.975). This shows that punishment is used more frequently and in a more differentiated way than reward when both options are available.

Our results show that recipients indeed punish unfair outcomes more than fair outcomes. However, our outcome-based prediction cannot be supported insofar as recipients also punish when the outcome is a fair allocation and use reward points for both outcomes.

Choices. Fig. 3 shows the average sanction points for different voter roles for unfair and fair outcomes across treatments. Table A2 in the appendix lists the corresponding average sanction points in more detail by also taking the voter position into account. It is important to note that not every voter role is equally represented across all possible scenarios due to the natural composition of all possible voting choice constellations. Table A3 shows the average sanction points for each voter position in each scenario faced by the recipients.

Fig. 3
figure 3

Average sanction points for different voter roles across treatments. Note: The bars show the average sanction points for different sanction motives separated by outcome (fair vs. unfair) and treatment. Depending on the voting sequence the following voter roles are possible: Minority 1 represents the first voter voting against the final outcome. Minority 2 represents the second minority voter. Majority 1 (Initiator) is the first voter to vote for the final outcome. Majority 2 is the second majority voter. Majority 3 (Pivotal) is the third voter to vote for the final outcome. Majority 4 and Majority 5 are the fourth and fifth majority voter.

Figure 3 shows that choice clearly matters for responsibility attribution. In all treatments, voting for the fair allocation is rewarded more and/or punished less than voting for the unfair allocation (all \(p\)-\(values\) \(< 0.02\)). This is illustrated by comparing the minority voters (yellow bars) with the respective majority voters (blue bars) in Fig. 3 for each outcome. Our choice-based prediction can thus be confirmed by the data and shows that subjects attribute responsibility according to choices. In particular, there is virtually no punishment for fair votes and, vice versa, hardly any reward for unfair votes. Nevertheless, reward points for subjects voting for the unfair allocation as well as punishment points for subjects voting for the fair allocation are significantly different from zero for almost all voter roles (all \(p\)-\(values\) < 0.05, except for one case).

Intentions. We now disentangle who is held more responsible among the voters choosing the same allocation. We call the first three majority voters of a voting sequence intentional voters, while the majority voters four and five are non-impactful voters as their vote can no longer change the outcome. As shown in Fig. 3, recipients punish intentional voters for unfair outcomes and reward them for fair outcomes more than non-impactful voters (all \(p\)-\(values\) < 0.05). In the Both treatment, the same results hold except for one case.Footnote 12

Our results show that the intention-based prediction can be confirmed. Thus, when attributing responsibility for an outcome, recipients take the impact of the votes on the final outcome into account.

Initiation. We analyze whether there are differences in responsibility attribution among intentional voters. We first test whether the initiator is sanctioned more than the second majority voter. Across all treatments, we do not find evidence for an initiator effect (average sanction points for initiator vs. second majority voter: Punishment \(-\)1.18 vs. \(-\)0.99; Reward 0.94 vs. 0.97; Both fair outcomes 0.23 vs. 0.22; Both unfair outcomes \(-\)1.02 vs. \(-\)0.72). Recipients do not seem to punish and/or reward the initiator differently than the second majority voter (all \(p\)-\(values\) > 0.1). Therefore, the initiation-based prediction does not hold.

Pivotality. We expect recipients to attribute the highest responsibility to the pivotal voter (e.g., Bartling et al. (2015)). In the Punishment treatment the pivotal voter is punished more than both other intentional voters when the outcome is unfair (\(-\)1.52 vs. \(-\)1.18 / \(-\)0.99, both \(p\)-\(values\) < 0.02). In the Reward treatment the pivotal voter is rewarded the most for fair outcomes (1.19 points on average) which is more than the other two intentional voters (both \(p\)-\(values\)  < 0.08). However, in the Both treatment the pivotal voter is not treated differently compared to other intentional voters (all \(p\)-\(values\) > 0.1). The pivotality-based prediction can partially be confirmed. Pivotality plays an important role when attributing responsibility for cases where reward and punishment are separately available.

Taken together, our analysis allows us to answer the first two research questions on how responsibility is attributed to different roles in a decision chain and between different outcomes. Our results show that people attribute responsibility differently depending on the outcome of the sequential decision. Generally, subjects are held responsible according to their choices. Furthermore, impactful voters are perceived to be more responsible than non-impactful voters. Last, the pivotal voter bears the highest responsibility while the initiator of a sequential voting sequence is not treated differently than the second impactful voter. In addition, we find that the different criteria of responsibility attribution are the same whether people praise others for good outcomes or blame them for bad outcomes.

4.1.1 Econometric comparison of sanctioning motives

In this section, we provide an econometric comparison between the different motives and measures of responsibility attribution. As the correlation tables in Section A.4 in the appendix show, there are high correlations between the different motives and measures. Therefore, in order to compare the explanatory power of the different motives, we first study them in isolation.

Figure 4 shows the \(R^2\) of individual OLS regressions with reward and/or punishment points as dependent variables and the criteria of responsibility attribution as presented in Sect. 3 as independent variables.Footnote 13 The figure shows that the criterion Choice has the highest explanatory power across all treatments, followed by the criterion Intention, further followed by the theories of responsibility attribution (BF, Ex-ante Engl Diff, Ex-post Engl Diff), which have similar explanatory power but among them the Ex-ante Engl Diff measure has the highest explanatory power in all situations. The criteria Pivotality and Initiation are amongst the criteria with the least explanatory power.Footnote 14

Fig. 4
figure 4

Comparison of \(R^2\) for different responsibility measures

So far, each motive of how responsibility is attributed has been tested separately. Outcome-based models predict when people use punishment or reward, but they do not predict who is held responsible. Models based on reciprocity and intentions can explain who is perceived responsible but not when. We now test which motives have explanatory power when considering all motives simultaneously and, thereby, compare the importance of the different motives. Importantly, even though the outcome is the same in many scenarios in our experiment, the number of votes for and against the outcome differs and also which voter position was associated with which motive (i.e., whether the third, fourth, or fifth voter is pivotal; whether the first, second or third voter is the initiator). Table 2 shows the corresponding OLS regression outputs. On the left side of the table we regress the punishment points on each sanctioning motive simultaneously for the treatments Punishment and Both, while the right side shows the respective regressions for the reward points in the treatments Reward and Both.Footnote 15 Since the measures are correlated, we also report regressions with only the positional variables (choice, outcome, intention, initiator and pivotal), with only the responsibility measures and with single regressors in Appendix A.5. We control for the size of the majority voters but do not include the voter position.Footnote 16

Table 2 Joint OLS regressions to compare the impact of the criteria on the usage of punishment and reward points

The regression output in Table 2 shows that choices have an explanatory power on top of all other motives. Unfair choices predict the punishment patterns seen in the treatments Punishment and Both. In the Punishment treatment, voters who choose the unfair allocation get 0.778 more punishment points than voters choosing the fair allocation, and this increase equals 0.466 in the Both treatment. On the other side, fair choices are a good predictor for how people reward collective decisions in the treatments Reward and Both. In these treatments, voters are rewarded 0.455 / 0.335 more reward points when choosing the fair allocation compared to the unfair allocation. In the treatments Punishment and Reward pivotality has predictive power for the perceived responsibility when considering all motives. Being pivotal for the unfair outcome leads to 0.565 more punishment points and being pivotal for the fair outcome leads to 0.213 more reward points compared to other intentional voters. Looking at the responsibility measures by Bartling and Fischbacher (2012) and Engl (2022) one can see that the criterion Ex-post Engl Difference for fair and unfair outcomes helps in explaining the punishment behavior in this joint regression. The remaining responsibility measures BF Responsibility and Ex-ante Engl Difference do not help much in explaining who is held responsible when combining all measures. This is because the responsibility measures encompass various individual sanction motives, which are included in this regression by other variables.Footnote 17 The results of the econometric comparison indicate that in sequential decisions, subjects mainly focus on the choices and the pivotal decision-maker when attributing responsibility.

Since the criterion Choice seems to be a crucial factor, we will zoom in on its role. First, we investigate what explains reward and punishment on top of Choice. So, we separately investigate what explains sanctions for fair and unfair choices. The regressions are presented in Table A26 in the appendix. On the one hand, they show that after a fair choice, there is almost no punishment and subjects do not differentiate between the different voters who voted for the fair outcome when punishing. The same is true for reward in case of an unfair choice. This result can also be seen in Fig. 3. On the other hand, the regressions show that among the voters with an unfair choice, the punishment is significantly explained by Pivotality. Again, the same is true for the reward among the voters with a fair choice. This suggests that the choice is a necessary condition for sanctioning. If the choice is fair, then there is no punishment and when the choice is unfair, there is no reward. Otherwise, more sophisticated criteria come into play.Footnote 18 Second, we explore whether subjects shift their strategies over time, especially since there are many decisions to take. For example, Choice is a relatively easy criterion and could become more pronounced towards the end of the experiment. This simpler (heuristic) strategy might come at the cost of other criteria, in particular Pivotality. The Tables A35 and A36 in the appendix show the regression tables for decisions 1-21, 22-42, and 43-64. Choice does not lose predictive power over time and Pivotality becomes even more important over time.

4.1.2 Heterogeneity

While the different theoretical models and motives have shown to be important in explaining the responsibility attribution pattern on average, the explanatory power of these measures is still quite low (see Fig. 4). A potential reason for the low explanatory power of the joint analysis is heterogeneity in individual behavior. We perform finite mixture models to test if the overall data can be better explained by a mixture of different subgroups.Footnote 19 Using a bootstrap likelihood ratio test with 100 bootstrap replicates, we test if a model without subgroups (one component of coefficients) is a better fit to the data than a model with subgroups (more than one component of coefficients). In each treatment, the tests suggest that heterogeneity in individual behavior exists as the data can be better explained by a mixture of more than one subgroup (\(p\)< 0.02 for all treatments).

Although there is only a limited number of subjects per treatment, we explore the individual patterns in each treatment. We identify the optimal number of clusters in each treatment by testing the best goodness of fit. Three different punishment and three different reward patterns are present in our experiment.

Figure 5 shows the average punishment and reward points for the different voter categories used by recipients categorized within the same cluster. Considering only punishment points, 30% of the subjects in the Punishment treatment use no or little punishment (Cluster 1).Footnote 20 However, most of the subjects in the Punishment treatment punish intentional voters and especially the pivotal voter for unfair outcomes and are categorized in Cluster 2. Subjects in Cluster 3 focus on punishing unfair choices. The cluster analysis in the Reward treatment shows that 25% in our experiment only use little reward (Cluster 1). Subjects in Cluster 2 focus on rewarding intentional voters for fair outcomes (especially the pivotal voter), while subjects categorized by Cluster 3 show a tendency to reward fair choices.

Overall, the patterns shown in the treatments Punishment and Reward are very similar as these clusters can be described by: no or little punishment/reward, focus on the pivotal voter, and focus on choices.

Fig. 5
figure 5

Cluster analysis: Punishment and reward patterns in Punishment and Reward treatment based on finite mixture models. Note: The figure shows the average punishment and reward points in absolute terms used in each cluster for fair and unfair outcomes across treatments. Hereby, the punishing patterns in the Punishment treatment are presented in the upper part of the figure, while the reward patterns in the Reward treatment are presented in the lower part of the figure. The number of subjects contained in each cluster per treatment are indicated in the titles of each sub-figure

The finite mixture models in the Both treatment are performed separately for punishment and reward points and the results are shown in Fig. 6.Footnote 21 The punishing as well as the rewarding behavior in the Both treatment can be described by three patterns: no punishment/reward, little sanctioning and sanctioning choices. These patterns are different than the ones observed in the Punishment and Reward treatment and suggest a less differentiated approach.

Fig. 6
figure 6

Cluster analysis: Punishment and reward patterns in the Both treatment based on finite mixture models. Note: The figure shows the average punishment and reward points in absolute terms used in each cluster for fair and unfair outcomes in the Both treatment. Hereby, the punishing patterns are presented in the upper part of the figure, while the reward patterns are presented in the lower part of the figure. The number of subjects contained in each cluster per treatment are indicated in titles of each sub-figure.

In sum, the presented exploratory cluster analysis indicates that there exist different types of responsibility attribution patterns. These results should be taken with care as there are only roughly 30 subjects per treatment and some clusters are rather small. Nonetheless, the patterns are interesting and suggest different types which are in line with different motives. We identify very similar punishment and reward patterns in our experiment. The expressed patterns are: little sanctioning, sanctioning according to intentions / pivotality and sanctioning according to choices. We find heterogeneity in individual behavior which can be an explanation of the low predictive power of the theoretical models and motives in Table 2.

After having analyzed how recipients on an aggregate and individual level attribute responsibility we complement the behavioral analysis by looking at the behavior of the voters.

4.2 Voting behavior

We now turn to the voting behavior and analyze how subjects vote in collective decisions under the prospect of reward and punishment. Pivotality is an important aspect in the process of responsibility attribution as we have shown in the analysis of the recipients’ behavior. Accordingly, strategic voters might take this into account and prevent (favor) being the target of punishment (reward) linked to pivotality.

We examine this strategic behavior by studying the voters’ behavior in potentially pivotal decisions.Footnote 22 Table 3 offers a general overview of the share of decisions in which voters choose the unfair allocation depending on their position and the previous votes. We separate the results by our treatments (columns 3-5) and by the decisions of the voters in the two dictator decisions (columns 6-8). Importantly, we use the dictator game to elicit the preference of the voters for the fair or unfair allocation when neither a collective decision nor punishment or reward are implemented. 63 voters show a preference for the unfair allocation, while 14 voters show a fair preference. The remaining 13 voters have mixed preferences depending on the two allocation sets we offer them. Bold sequences in column 2 indicate situations in which voters face a potentially pivotal decision.

Table 3 Voting Behavior - Share of unfair choices

Voters who face a potentially pivotal decision are influenced by the choices of the previous voters. Across all treatments, the share of unfair choices for potentially pivotal voters is higher when the majority of previous voters voted unfair in comparison to a fair majority of previous votes (columns 3-5).Footnote 23 The biggest discrepancy results in the treatment Punishment for potentially pivotal voters on voting position three. 50% of these voters’ decisions are unfair when the first two voters voted unfair, while only 28% of the decisions are unfair when the first two voters voted fair.

The sequential decision design allows voters to use strategic (non-)delegation in order to avoid (seek) pivotality. Here, we focus on the potentially pivotal voters on either position three or four. In these cases the voters can ensure being pivotal by following the majority of previous votes. But the voters can also vote against the majority of previous voters and can therefore delegate the notion of being pivotal to the next voter. Subjects showing a preference for the fair allocation in the dictator game mostly choose the fair allocation when being potentially pivotal (column 6 in Table 3). In contrast, voters expressing a preference for the unfair allocation in the dictator game often behave against their true preference in potentially pivotal situations of the collective decision (column 8). On voting positions three and four, potentially pivotal voters with an unfair preference ensure the unfair outcome by being pivotal in only 51% and 59% of all decisions respectively. In 62% and 44% of the cases where the majority of previous voters chose the fair allocation, potentially pivotal voters on positions three and four voted against their true preference and decided on being pivotal for the fair outcome. On voting position 5 where no strategic (non-)delegation is possible voters followed their true preference in only 65% of the cases.

Taken together, voters showing a preference for the unfair outcome often vote against their preference when being potentially pivotal. They avoid (seek) being pivotal for unfair (fair) outcomes by strategic (non-)delegation. But surprisingly many subjects do not try to appear fair when the outcome can no longer be changed, in particular when the outcome of the decision is unfair.

This completes our behavioral results. We now turn to the processing data we collected for recipients and voters to complement the behavioral analyses.

4.3 Process measures

For the voters, we collected response time as a process measure. We hypothesize that response times inform us about the decision-making process, the difficulty of the decision and strategic decision-making (Konovalov and Ruff, 2022; Konovalov and Krajbich, 2019; Hausfeld et al., 2020; Spiliopoulos and Ortmann, 2018). A decision has to be considered as difficult if the voter strategically votes against the outcome preferences when being potentially pivotal. These decisions are characterized by a higher internal conflict, and should be accompanied by longer response times (Rubinstein, 2007). In Sect. 4.2, we showed that voters often decide against their true preference when being potentially pivotal. Potentially pivotal means that exactly two previous voters voted for the same allocation and that the outcome can now be determined by the respective voter. This behavior suggests an internal conflict of being potentially pivotal.

Table 4 shows how the response time of voters is affected by being potentially pivotal, the voter position and the choice of the voters. When accounting for the voter position, we find that voters take significantly more time in choosing an allocation when they are potentially pivotal (actual effect size is around 590 ms).Footnote 24 Another conflict shows when separating the voters by the true preference shown in the dictator game (last three columns of Table 4). First, being potentially pivotal still leads to higher response times for all types of voters. However, if voters vote against their true preferences, we find response times to be similarly affected. We find that fair types of voters take significantly more time when choosing the unfair allocation. In contrast, mixed and unfair types of voters spend less time when choosing the unfair allocation. Together with the results presented in Sect. 4.2, we can answer our fourth and fifth research question on how voters respond to the incentives created by responsibility attribution. Voters are aware of the responsibility that is linked to pivotality as they strategically use delegation to avoid punishment or non-delegation to gain reward even if it means that they vote against their true preference. This behavior is accompanied by a higher response time.

Table 4 Response Time Analyses Voters

For the recipients, we collected gaze data as a process measure. We tracked the gaze of participants in order to analyze whether their sanctioning behavior is reflected in their information search. The gaze data reveals interesting insights about the importance of saliency, positioning of voters and the impact of sanctioning motives on the share of fixations. We present and discuss these analyses in Section A.11 of the appendix.

5 Conclusion

In our study, we use reward and punishment to investigate how people attribute responsibility in decision chains. In our experiment, five voters choose sequentially between two options of how to allocate points between voters and recipients. One option is fair, the other is unfair. The recipients can reward and/or punish the voters, which we take as our measure of responsibility attribution. We test the relevance of different motives and measures and find that the actual choice, i.e., fair or unfair, plays a dominant role. It is clearly the most important determinant for the decision on whether to punish or not. Nevertheless, in line with Bartling et al. (2015), we find that the pivotal voter is assigned more punishment if the voters vote for an unfair allocation. We extend this result, showing that pivotality also matters when the voters vote for the fair allocation and the recipients can reward. Even though people have rather sophisticated responsibility attribution patterns, the conceptual models of Bartling and Fischbacher (2012) and Engl (2022) explained surprisingly little in comparison to, and in particular on top of the simple and mechanistic idea that it is the deed that determines responsibility. Overall, the explanatory power of the measures is not very high. This is partly due to heterogeneity. We show, using a finite mixture model, that three classes provide a better fit. Even though the sample size is too low to make strong statements about the distribution of types, in our sample there are subjects who sanction little, subjects who focus on choice and subjects who sanction the pivotal more than the others. Our results also show that the voters are aware of how responsibility is attributed and are particularly concerned when they are pivotal, which is also visible in longer response times.

Of course, specific features of the experiment could be relevant for the outcome, for example when investigating the role of the initiator. In our experiment, the two options were chosen with similar frequency. It is possible that when an action is rarely chosen, the initiator is assigned a higher level of responsibility. This could in particular be relevant in the case of fatalities. Another feature of our design could have reduced the relevance of the pivotal voter. Voters still had to vote, even when the decision was already made. Finishing the procedure when the result is determined could make the pivotal voter even more focal. Such variants could reveal the sensitivity of responsibility attribution to the specific situation. In addition, investigating costless reward and punishment could reveal whether selfish people apply different patterns of responsibility attribution. However, one can argue that also outside of the lab, reward and punishment bear some cost and therefore it is more important to know the responsibility attribution of people who are willing to bear such cost.

The responsibility attribution in our voting game captures situations, in which people take sequential decisions or actions that jointly generate an outcome. What does this imply in the real world, for example in the case of a disaster? First, any bad action is attributed some responsibility. Second, the pivotal person, i.e., the person after whose action the disaster was unavoidable, is generally assigned the highest responsibility. In his book, Whittingham (2004) observes that often the institutional environment is an important reason for disasters. Translated to our setting, he would consider the initiator as particularly responsible. However, we find few subjects who agree with this view. Of course, there are important differences to our lab experiments. First, in such disasters it is more difficult to identify the sequence. In particular, it is not easy to identify the pivotal agent - it is difficult to find out when the disaster was no longer avoidable. Further, the different agents are less symmetric, both with respect to their contribution to the disaster and with respect to their formal responsibility. There are also many institutional details that could matter. Further research will allow investigating such variations, and our experiment provides a framework to do so.