1 Introduction

Incentives to free ride are pervasive in team production, creating a tension for individuals between serving the team by providing full effort and serving themselves by shirking. Standard voluntary contribution mechanism (VCM) public good games capture this tension very well. However, they do not reflect that in real-world team production, teams often have some ability to change the productivity of effort. By investing in better workspaces, computers, equipment, tools, support staff, etc., the productivity of effort can be increased, potentially affecting the subsequent provision of effort. Furthermore, being involved in determining the productivity of effort might affect provision decisions.

To study any such effects, we conduct a public good game experiment in which contribution productivity is endogenous. Subjects collectively decide (by voting) how much to invest in augmenting the technology used for producing the public good, and subsequently make individual voluntary contributions to provision, as in standard games (Marwell and Ames 1979; Isaac et al. 1984). We compare contributions to a control treatment in which contribution productivity is on average the same, but imposed exogenously. Our aim is determining if having subjects choose contribution productivity reduces the free riding problem typical in the voluntary provision of public goods (Ledyard 1995; Chaudhuri 2011).

We find that while subjects in the experimental treatment invest considerable amounts in contribution productivity, their contributions to provision of the public good are not significantly different from in the control treatment. Our main contribution to the literature is thus a null result on how endogenizing contribution productivity affects contributions. We also contribute to the broader literature on how the endogenous selection of parameters and/or institutions affects behavior in cooperative games. Specifically, our results are consistent with those of Kingsley and Brown (2016), who fail to find that endogenous institutional choice promotes contributions, but we contrast Dal Bo et al. (2010) and Sutter et al. (2010), who report that pro-social behavior increases when subjects choose their institutions.

Previous papers on endogenous institutions in public good games focus mainly on the implementation and subsequent effects of punishment mechanisms (Gurerk et al. 2006; Tyran and Feld 2006; Ertan et al. 2009; Kosfeld et al. 2009; Putterman et al. 2011; Markussen et al. 2014). Related to this, Sutter et al. (2010) find that choosing whether to have reward or punishment mechanisms increases contributions compared to when the same institutions are imposed exogenously. Kingsley and Brown (2016) have a central authority that detects self-interested behavior only with some probability. One would think that endogenously choosing the detection probability might increase contributions, but Kingsley and Brown (2016) fail to find evidence of this. On the topic of choosing the benefits of cooperation, Dal Bo et al. (2010) report that when subjects vote to change the payoffs in a prisoner’s dilemma game in a way that promotes cooperation, subjects are more cooperative than when the same payoffs are implemented exogenously.

Norton and Isaac (2010) also study the endogenous determination of the benefits of cooperation. Each group’s manager chooses contribution productivity and others make voluntary contributions. Managers typically choose high productivity, and when the manager’s decision history is provided to the others, contribution decay is often avoided. Under the right conditions, endogenous institutional choice, therefore, promotes contributions, which is different from what we find. Finally, in Isaac and Norton (2013), subjects vote over taxes that serve as preliminary contributions that can later be supplemented by individual contributions. Total provision is higher than in a baseline treatment that does not allow taxation. Endogenous institutional choice, therefore, increases provision, once again contrasting our results.

2 The experiment

Contribution productivity is endogenous in the experimental treatment and exogenous in the control. In both treatments, subjects are assigned to groups of four that are fixed for ten rounds. They receive endowments of ten lab dollars (LD; later converted to USD at a rate of 1 LD \(=\) 0.1 USD) at the start of each round.

2.1 The experimental treatment

Each round has investment and contribution stages. In the investment stage, subjects vote over how many LD each subject will invest in contribution productivity (each subject invests the same amount). Specifically, each subject submits a number between 0 and 10 (inclusive; up to one digit after the decimal point allowed). Investment is the median of the four votes (mean of the two middle votes).Footnote 1 Note that it is not incentive compatible to reveal true preferences. Subjects have an incentive to exaggerate their preferences to move the outcome toward their true preferences. An alternative voting scheme, such as majority voting between two investment amounts, would make it weakly dominant to truthfully reveal. However, it would also restrict the potential outcomes. We opt to instead elicit a continuum of investment outcomes, so that when we later examine the relationship between contributions and investment, both are continuous variables.Footnote 2

Each subject has the investment amount deducted from her endowment of ten LD and contribution productivity is:

$$\begin{aligned} M=\sqrt{0.1\cdot I} \end{aligned}$$
(1)

where I is investment and M stands for “multiplier” (the amount by which the sum of contributions is later multiplied to determine each subject’s return from the public good). The returns to investment are positive (\(M_{\mathrm{I}}>0\)), but diminishing (\(M_{\mathrm{II}}<0\)), which incorporates empirical realism, and moves the social optimum of the game away from the center of the action space, where it would be if the relationship between I and M was linear.Footnote 3 , Footnote 4

After the voting, subjects learn M, and proceed to the contribution stage, which is like conventional games. However, instead of having their full endowments of ten LD, subjects have a budget of \(10-I\) LD. In addition, contribution productivity is the M that was chosen in the investment stage. As such, the payoffs are:

$$\begin{aligned} \pi _{\mathrm{s}}=10-I-c_{\mathrm{s}}+M(I)\cdot \sum _{t=1}^{4}c_{t} \end{aligned}$$
(2)

where \(c_{\mathrm{s}}\) is the contribution of subject s, whose four group members are indexed by t.

Having subjects complete the contribution stage with a budget of \(10-I\) LD allows us to later analyze how they make unconstrained allocations between investing, contributing, and keeping money for themselves. An alternative design would give subjects separate budgets for investing and contributing, but this would restrict substitution between investment and contributions. In addition, specifying payoffs as linear in contributions is consistent with the bulk of the previous literature on VCM public good games (Ledyard 1995; Chaudhuri 2011). Again, we want for our primary departure from the previous literature to be the addition of the investment stage, and not changes to the VCM.

At the end of each round, subjects are told the sum of contributions to the group account and their payoffs from the round. At the end of the ten rounds, each subject’s ten payoffs are converted to USD and added to a $5 show-up fee.

2.1.1 Subgame perfect Nash equilibrium

Assuming individual wealth maximization, a subgame perfect Nash equilibrium to the game is for every round to have:

  1. 1.

    Votes \((v_{1},v_{2},v_{3},v_{4})\) such that \(v_{1}=v_{2}=v_{3}=0\le v_{4}\), and

  2. 2.

    Contributions \(c_{1}=c_{2}=c_{3}=c_{4}=0\).

The votes achieve \(I=0\), and since there are no contributions, the payoffs are ten LD for each subject in each round (subjects simply keep their endowments).Footnote 5

2.1.2 Social optimum

The social optimum, in contrast, involves all subjects fully expending their endowments. The critical thing is finding the optimal balance between investment and contributions. The social optimum is for every round to have:

  1. 1.

    A set of votes \(v_{1}\le v_{2}\le v_{3}\le v_{4}\) such that \(\frac{(v_{2}+v_{3})}{2}=\frac{10}{3}\), and

  2. 2.

    Contributions \(c_{1}=c_{2}=c_{3}=c_{4}=\frac{20}{3}\).

    The votes imply \(I=3.33\rightarrow M=0.58\), and when subjects subsequently contribute all that remains (6.67 LD each), their payoffs are 15.40 LD.

2.2 Control treatment

The control treatment is a conventional repeated public good game with parameters chosen to on average replicate the incentives of the contribution stage in the experimental treatment. This creates a baseline for identifying the effect on contributions of having subjects choose M. Using the data from the experimental treatment, we calculate the average M and associated remaining budget (\(10-I\)) in each round. Subjects in the control treatment play a game in which the M in each round is the average M from the same round in the experimental treatment, and the endowments in each round are the average remaining budgets (after investment) from the same round in the experimental treatment.

An alternative design would match each group in the control to a group in the experimental treatment, and have each group in the control face the sequence of Ms chosen by its experimental treatment counterpart. The problem with this type of matched pairs design is that without being able to match groups based on similarity of observable characteristics (which is impractical given the need to conduct many sessions one at a time), variations in behavior due to group characteristics would create noise that would bias the comparison of the treatments toward failing to reject the null hypothesis of no difference.

To create parallelism between the treatments, the control treatment also has two stages: the information and contribution stages. In the information stage, subjects are told their endowments and M in the round. The contribution stage then proceeds as in the experimental treatment. In this game, the Nash equilibrium is contributing nothing, and the social optimum is contributing everything.

3 Results

The experiment was programmed and conducted with the experiment software z-Tree (Fischbacher 2007). Student subjects were recruited from introductory economics classes at our school. A total of 136 participated. There were 88 subjects in the experimental treatment (11 sessions of eight) and 48 in the control (six sessions of eight). Experimental and control sessions lasted about 45 and 35 min. Average total earnings were $15.31 and $15.58 (including the $5 show-up fee). Summary statistics are given in Table 1.

Table 1 Summary statistics

3.1 The trends

In Fig. 1, we plot the trends of average investment and contributions in the experimental treatment, and average contributions in the control.Footnote 6 In the experimental treatment, investment begins considerably higher than the socially optimal 3.33 LD, but declines toward 3.33 LD as the rounds proceed. Using a Wilcoxon signed rank test, and treating each group as an independent observation, investment is significantly different from 3.33 LD in each of the first seven rounds (\(p\le 0.02\); \(n=22\) for each test), but not significantly different from 3.33 LD in each of the final three rounds (\(p\ge 0.19\); \(n=22\) for each test). Contributions, on the other hand, are consistently well short of the socially optimal 6.67 LD, and appear to be quite similar between treatments.

Fig. 1
figure 1

Trends of investment and contributions

We present statistical evidence on the significance of the trends in Table 2. While the tendencies for votes and investment in the experimental treatment to decline over time are highly significant, the evidence on the decline of contributions in the two treatments is at most weakly significant. Payoffs are also fairly stable over time.

Table 2 Random effects regressions of key variables on a linear time trend

Now treating group level averages as the independent units of observation, we have 22 observations from the experimental treatment and 12 from the control. The difference in average contributions between the treatments (\(2.99-2.94=0.05\) LD) is not significant (Wilcoxon ranksum \(p=0.77\)), failing to provide evidence that having subjects endogenously determine contribution productivity promotes cooperation. A power analysis indicates that to reject the null hypothesis of no difference at the 5% level of significance 80% of the time, a difference of 0.69 LD is required. Increasing the sample size of the control treatment to 22 independent groups would only reduce the required difference to 0.61 LD. Furthermore, for our observed difference of 0.05 LD to be significant at the 5% level 80% of the time, we would need a sample of 3190 groups in each treatment.Footnote 7

3.2 Differences between groups

In addition to the basic summary statistics that we present in Table 1, we also report statistics on group level heterogeneity. Specifically, for each treatment, we partition the full set of groups based on group level average cumulative earnings. For the top and bottom halves of the groups, we report summary statistics on a variety of variables, where the individual values of the variables are the group level averages. While votes, investment and Ms in the experimental treatment all tend to be slightly higher for the bottom half of groups than for the top half, none of the differences are statistically significant (Wilcoxon ranksum \(p\ge 0.41\) in all three cases). However, for both treatments, the differences in average contributions between the two halves are highly significant (Wilcoxon \(p<0.01\) in both cases).Footnote 8 The nonparametric analysis, therefore, suggests that it is not differences in voting and investment (and thus M) between groups that underlie differences in cumulative earnings. Rather, cumulative earnings appear to be most closely related to contributions.

We continue our examination of differences between groups by regressing average payoffs at the group level (in each round) on M and average contributions (see Table 3). M has a negative effect on average payoffs in the experimental treatment, but the effect of M is not significant in the control. In contrast, the effect of average contributions is highly significant in both treatments, emphasizing the important role of contributions in determining payoffs.

Table 3 Random effects regressions of average payoffs at the group level

3.3 Analysis of individual votes and contributions

We now examine the correlates of individual votes and contributions (see Table 4). We present the results of fixed effects models in light of consistent evidence (in the models that we present in Table 4) that the corresponding random effects models are biased (see the Hausman p values at the bottom of the table). Regressing votes and contributions on variables capturing previous votes and contributions introduce multiple potential sources of endogeneity, including omitted variable biases and simultaneity. Fixed effects control for the effects of any time invariant omitted variables (such as unobservable individual factors) that might simultaneously cause votes and contributions.Footnote 9

Table 4 Fixed effect regressions of individual votes and contributions

In model (1) (experimental treatment), contributions in the previous round have a positive effect on votes, suggesting that there is some “momentum” or “inertia” associated with pro-social behavior such that when one makes a high contribution, she then has an increased inclination to vote high in the next round, perhaps as a means of incentivizing others to contribute (due to a higher M). In model (2), the round is the only significant correlate of contributions. In model (3), to include how the subject’s vote deviated from the median, we have to drop votes from the model, since vote deviations are a linear combination of votes and M; vote deviations do not have a significant effect.

Model (4) has two vote deviation terms: one for the amount by which the subject’s vote exceeded the median (takes on a value of zero if the subject’s vote was less than the median), and another for the amount by which the subject’s vote was less than the median (takes on a value of zero if the subject’s vote was higher than the median). Neither vote deviation variable is significant, suggesting that voting differently from other group members is not associated with higher or lower contributions. The most important result from models (2)–(4), however, is that the effect of M is never significant, failing to provide evidence that a higher M creates sufficient incentive for subjects to contribute more.Footnote 10

In model (5)(control treatment), contributions are positively associated with contributions in the previous round and negatively associated with the subject’s deviation from the average contribution of others in the previous round. The latter effect is consistent with previous literature (Ashley et al. 2010; Smith 2015). The effect of M is once again not significant.

4 Discussion

Our two main results are that: (1) compared to the social optimum, people over-invest and under-contribute, and (2) endogenously determining contribution productivity does not have a significant effect on contributions. Both findings merit further discussion. We designed our experiment to measure the levels of investment and contributions when contribution productivity is endogenously determined by subjects. Definitively determining why the levels are what they are is beyond the scope of the study.

A possible explanation is that the early round over-investment may reflect an attempt to encourage others to cooperate. Alternatively, it could be the result of the incentives of our voting mechanism to exaggerate preferences over I. In either case, it seems plausible that as subjects learn that a higher M does not lead to higher contributions, they reduce their investment until in the final rounds, it converges to the social optimum. The under-contributions are similar to that in previous experiments in which subjects start off by contributing about 50% (Ledyard 1995; Chaudhuri 2011). A potential reason for why contributions do not unravel over time in our experiment is that it is a reaction to the falling M in both treatments (and increasing amounts of money, or “remaining budgets,” from which to make contributions).

Our null result on the effect of endogenizing contribution productivity, which contrasts previous literature on endogenous institutions (Dal Bo et al. 2010; Norton and Isaac 2010; Sutter et al. 2010; Isaac and Norton 2013), is potentially attributable to a variety of factors. First, votes and contributions are chosen from continuous action spaces. Sutter et al. (2010), who have subjects choose one of three institutions, speak directly on the importance of the size of the action space, explaining that they “wanted to keep the design simple so that subjects fully understood the available institutions in the endogenous treatments. Any more complicated reward or punishment technology would have made the choice task of the participants more difficult” (Sutter et al. 2010, p. 1544).

Dal Bo et al. (2010) prisoner’s dilemma of course has a binary action space, and rates of cooperation are known to be higher with smaller action sets (Gangadharan and Nikiforakis 2009), creating the possibility of observing different treatment effects for no other reason than two games having action sets of different sizes. Dal Bo et al. (2010) experiment is also, to some extent, a different kind of game from ours because when subjects elect to change the payoffs in the prisoner’s dilemma game, the game transforms into a coordination game, with high- and low-payoff Nash equilibria, making whether the game is a coordination game potentially relevant as well.

Finally, our voting mechanism could be part of the explanation behind our null result. Since some subjects, under certain circumstances, have an incentive to exaggerate their preferences, our voting mechanism does not provide as strong a signaling opportunity as it could. However, it is not clear that signaling opportunities are better in other environments. In Dal Bo et al. (2010), subjects vote in groups of four over changing prisoner’s dilemma payoffs. They learn something about the distribution of votes from the implemented outcome (chosen by majority), but they are never informed of the distribution of votes, nor whether the computer had to break a tie (which happens a lot). In Sutter et al. (2010), unanimity is required for an institution to be implemented. Each of four subjects in the group has to “accept” a particular institution for it to be implemented, and while this indicates something about each subject’s preference for a particular institution, it says nothing about any individual’s preference ordering over the three institutions.

In our experiment, a different decision rule, such as majority voting between two investment levels, or defining the investment level as the lower (or higher) of the two middle votes, might lead to different investment outcomes, and different contribution behavior. We can only speculate about how a different voting mechanism might affect our findings. Our null result should thus be taken with caution. What is clear is that we provide contrasting results to previous studies reporting that choosing institutions improves outcomes (Dal Bo et al. 2010; Sutter et al. 2010), suggesting that more research is required to determine the conditions under which the “democracy premium” occurs.