In this paper, we consider a simple chance improving decision model under risk, and experimentally investigate the effect of a background risk on subjects’ decision behaviour. We attempt to address two different but entangled issues. First, we experimentally test a theoretical prediction on the effect of the independent and exogenous background risk to initial wealth. While there have been many theoretical studies on the effect of the background risk on risky choices (Pratt and Zeckhauser 1987; Kimball 1993; Eeckhoudt, Gollier and Schlesinger 1996; Gollier and Pratt 1996), only a few empirical studies exist (Guiso and Paiella 1999). Moreover, these empirical studies crucially depend on respondents’ self-report on their background risk. This is due to the fact that the background risk in field data is hardly measurable. Hence, an experimental approach to this issue has an advantage in the sense that such a background risk can be manipulated. Furthermore, there has been no empirical or experimental study on the effect of the background risk in the situation where the winning probability of the gain is positively related to the giving-up amount among the potential gains.Footnote 1 Our study attempts to fill the gap.

There are many decision problems where the background risk would matter: portfolio building, insurance, auctions, pricing, mergers and acquisitions decisions, self-protection, and so on. Whatever the specific decision problem is, the main question is, “What is the impact of the background risk to initial wealth on a risk-averse person’s decision?” To concentrate on the basic question, we in this paper take a very simple and restrictive decision making model where the winning chances are positively related to a giving-up amount among potential gains and there is no loss of initial wealth.Footnote 2 We call this the simple chance improving decision model. In this model, a subject can choose how much he or she gives up among the potential gains in order to increase the winning probability of the gain. If he or she is rational and knows the probability distribution of the winning chance as an increasing function of the giving-up amount, he or she will choose the giving-up amount which maximises his or her expected utility. It would not be difficult to predict that the choice will depend on his or her risk attitudes. Thus, the choice could be also affected by whether a background risk exists to his or her initial wealth or not. We in this paper derive some basic theoretical predictions on the effect of the independent and exogenous background risk from this model and experimentally test the predictions. The experiment has been carefully designed to directly represent the decision situation in the model. In the experiment, subjects have to choose the giving-up amount (though the context is abstract) and we explicitly set the background risk as a treatment variable. In this way, we can directly test the theoretical prediction on the effect of the background risk on subjects’ behaviour.

Second, we also contribute to an experimental methodology issue. It has been an issue of debate whether a wealth effect exists in a laboratory experiment where a modest stake is used. The wealth effect exists if the accumulated wealth through rounds of an experiment affects subjects’ decision making.Footnote 3 In theory, subjects should be approximately risk neutral in the case (e.g., see Friedman and Cassar 2004), but there has been much evidence to suggest that most subjects are risk averse even for such a modest stake (Holt and Laury 2002; Laury 2005; Harrison et al. 2005; Binswanger 1980). If subjects are risk averse and do not show constant absolute risk aversion (CARA), an experimenter would need to control the possible wealth effect. In fact, whether the wealth effect exists or not directly decides which payoff mechanism has to be used in an experiment where repetition is used. In the repetition environment, there are two widely used payoff mechanisms which we could call the “accumulated payoff mechanism (APM)” and the “random round payoff mechanism (RRPM),” respectively. Under APM, subjects’ monetary rewards in the experiment are simply the sum of their monetary rewards of each round through the whole experiment. Most economics experiments have traditionally used APM. However, the wealth effect would exist and cannot be controlled under APM if subjects’ preferences are not risk neutral or CARA. By this reason, some experimenters have used RRPM. Under RRPM, subjects are given their monetary rewards based on one or only a few randomly chosen round(s) after a whole experiment has been completed. RRPM is also called “random lottery incentive mechanism” in the literature. Since the monetary earnings are not accumulated and not deterministic under RRPM, the wealth effect could be mitigated.

Grether and Plott (1979), which experimentally investigated the preference reversal phenomenon, was an example of the early use of RRPM and invoked the debate on the validity of RRPM in experiments related to the individual decision making context. While Holt (1986) raised a theoretical question that RRPM could not elicit subjects’ true preference if they are not expected utility maximisers (in the sense of violation of the independence axiom), the responding experimental studies reported that subjects tended to separate each task under RRPM and thus it could elicit subjects’ true preference (Hey and Lee 2005a, b; Cubitt, Starmer and Sugden 1998; Starmer and Sugden 1991). As a result, many recent experiments in an individual decision making context tend to use RRPM instead of APM.

However, a puzzling finding is that APM is still widely used in most experiments related to market and game theories. This seems to be due to the tendency that many game theorists and economists still presume that subjects are risk neutral in the experiment where a moderate stake is typically used. However, there is quite robust evidence that subjects are risk averse as we noted above. Thus, experimenters in such contexts may also need to carefully consider using RRPM instead of APM. This will depend on the answer to the more fundamental question of whether subjects show CARA preferences and RRPM could control the wealth effect in the case that subjects’ preferences are not CARA. While the previous studies on RRPM have focused on the validity of eliciting subjects’ true preference (implicitly assuming that RRPM can control the wealth effect), we directly address the fundamental question. The results from our experiment suggest that risk-averse subjects tend to follow decreasing absolute risk aversion (DARA) under APM, and that RRPM could control the wealth effect.

The next section presents the theoretical results and hypotheses to be tested. In this section, we introduce the simple chance improving decision model with more details. Assuming that the probabilities on outcomes are uniformly distributed, we theoretically show that DARA implies the decrease of the giving-up amount as initial wealth increases and that a risk-averse subject would behave more risk aversely if there is an independent and exogenous background risk to the initial wealth.

The experimental design to test these theoretical predictions is introduced in Section 2. In the experiment, subjects faced two different tasks in each round. One was the simple chance improving task which was introduced in the previous section (Task 1), and the other was a two-person game task (Task 2). Subjects were divided into two groups by the background risk treatment. We articulated the background risk treatment by using a different treatment on the earnings from Task 2. That is, the earnings from Task 2 were decided according to RRPM for one group while the earnings from Task 2 were accumulated through 10 rounds (APM) for the other. Hence, the former faced an independent and exogenous background risk at initial wealth in each round while the latter did not.

We find that most subjects are risk averse for the small stake and the risk-averse subjects behave more risk aversely when the background risk at their initial wealth exists. This confirms our theoretical prediction: that is, risk-averse subjects give up more potential gains to increase the winning probability under the background risk. Moreover, we also find that the risk-averse subjects tend to become less risk averse as their initial wealth is accumulated (that is, the wealth effect exists) but RRPM could control it. These results are presented in Section 3.

In Section 4, we summarise this study and discuss the implications of the results from our experiment. We conclude that the experimental method could be a good tool for testing the theoretical predictions on the effect of the background risk to wealth on subjects’ behaviour while we deal with a quite restrictive decision situation. Moreover, we argue in terms of experimental methodology that our results suggest that experimenters would need to control the wealth effect and RRPM could work well for this purpose.

1 Simple chance improving decision model

In this section, we introduce the simple chance improving decision model and derive some theoretical predictions. The hypotheses, which are based on the theoretical predictions and will be experimentally tested, are also presented.

Suppose a subject whose preference follows the expected utility has the initial wealth of w 0 . Let us denote the maximum amount of potential gain by M, the giving-up amount by x∈(0,M) and the probability distribution of winning the potential gain by p(x). Note that the probability distribution is a function of the giving-up amount. Assume that p(x) > 0 and p(x) ≤ 0: that is, the winning probability increases as the subject gives up more among the potential gains with a decreasing rate. Then the risk-averse subject whose preference is represented by von Neuman–Morgenstern utility function u(.) solves the following expected utility maximisation problem:

$$ {\mathop {{\text{Max}}EU}\limits_{{\left\{ x \right\}}} } = p{\left( x \right)} \cdot u{\left( {w_{0} + M - x} \right)} + {\left( {1 - p{\left( x \right)}} \right)} \cdot u{\left( {w_{0} } \right)} $$
(1)

where u(.) > 0, u(.) > 0, and u(.) < 0. We find that the solution mainly depends on the property of p(.), and p(x) ≤ 0 is sufficient for the maximisation problem. To get a sharp and testable implication, assume that p(x) is uniformly distributed for x∈(0,M): thus, p(x) = x/M, p(x) = 1/M, and p(x) = 0. This assumption, while strongly restrictive, has merits in the experimental investigation: that is, the implication of the uniform distribution is clear and hence it is easy to make subjects understand it. Moreover, it is easy to execute the distribution by using a simple random device. Furthermore, the prediction is sharp and clear.

The optimisation problem is a fairly standard one. So, we present the basic results without much explanation.

Proposition 1)

A risk-neutral subject chooses x N  = M/2, and a risk-averse subject chooses x A  > M/2 at every initial wealth.

Proposition 2)

A more risk-averse subject chooses a higher x A : that is, he or she gives up more.

Proposition 3)

If a risk-averse subject has a DARA preference (i.e., u‴ > 0), then the optimal giving-up amount x A decreases as his or her initial wealth w 0 increases.

Proposition 4)

Suppose a mean-zero independent and exogenous background risk to the initial wealth w 0 : that is, \( w_{\varepsilon } = w_{0} + \varepsilon \) where E(ɛ) = 0. So ɛ can be regarded as a mean-preserving spread parameter. If u‴ > 0 and u″″ < 0, then at every initial wealth level, a risk-averse subject whose initial wealth is under the background risk will give up more than one whose initial wealth does not have any risk. That is, \( x_{\varepsilon } > x_{A} \) for every wealth level where x ɛ denotes the optimal x under the background risk and x A denotes the optimal x in the case that there is no background risk.

Proofs for these propositions are in Appendix A. Proposition 1 and 2 imply that a risk-averse (a more risk-averse) subject would be willing to give up more than a risk-neutral (a less risk-averse) subject does in order to increase the winning chance. This is intuitive given the definition of risk aversion. As it will become clear later, we manipulate our experimental design so that both propositions should hold rather than be tested. Proposition 3 confirms our intuition on the effect of the initial wealth on the choice of x. This proposition shows that every subject with DARA would decrease the giving-up amount as their initial wealth increases. This gives us an experimentally testable implication: DARA is consistent with the decrease of the giving-up amount x A as the initial wealth is accumulated. So, if a risk-averse subject has a preference of DARA, the effect of an independent and exogenous background risk at any given wealth level is clear with the additional assumption as shown in Proposition 4. The additional assumption u″″ < 0 implies a decreasing absolute prudence. Note that u‴ > 0 is a minimum requirement for Proposition 4. The theoretical prediction of Proposition 4 is our main hypothesis to be tested. We also empirically test whether subjects show DARA or not. This would make the test result more robust. In fact, \( x_{\varepsilon } > x_{A} \) is compatible only with DARA preference. Moreover, whether subjects’ risk aversion is DARA with a small stake which is typically used in laboratory experiments is a controversial issue in experimental economics methodology. While subjects would be risk neutral in theory, much empirical evidence shows that most subjects are risk averse even for small stakes. Thus, if an experimenter uses repetition, controlling the wealth effect through rounds arises as a methodological issue. If most subjects are risk neutral or have CARA, the experimenter may not need to worry about the wealth effect. However, if most subjects have DARA or IARA (increasing absolute risk aversion), the experimenter should control the wealth effect since it would bias the experimental results. This is ultimately an empirical question. Our second hypothesis deals with this.

We summarise our hypotheses to be tested:

Hypothesis 1)

Subjects behave more risk aversely if there is an independent and exogenous background risk to initial wealth. That is, they give up more potential gain under the background risk.

Hypothesis 2)

Subjects show DARA as their initial wealth is accumulated, but RRPM could control the wealth effect by DARA preference.

2 Experimental design

This experiment was implemented in a computerised environment using the z-Tree software (Fischbacher 1999) in the EXEC laboratory at the University of York. The participants were undergraduates and postgraduates of the university. In this experiment, there were 48 subjects. Each subject had to choose a number in each of two similar but different tasks at each round through 10 rounds. The first task was an individual decision making problem and the second was a two-person game. We call the former Task 1 and the latter Task 2. The design of the experiment is summarised in Table 1. Using a 2 by 2 treatment design, there were 4 groups with 12 subjects each. The average duration of the experiment for each group was 40 minutes, and subjects’ average earnings were £7.72 for the experiment as a whole.

Table 1 Experimental design

The basic payoff structure for both tasks follows:

$$ p = \left\{ {\begin{array}{*{20}c} {{c \cdot {\left( {m - x} \right)}}} & {{if\;\;x > y}} \\ {{\frac{c} {2} \cdot {\left( {m - x} \right)}}} & {{if\;\;x = y}} \\ {0} & {{if\;\;x < y}} \\ \end{array} }\right.$$
(2)

where p denotes the payoff, m is the reference number, x∈[1,10] is the integer number which the subject decides and y∈[1,10] is the random integer number from the uniform distribution (Task 1) or the integer number which the partner chooses (Task 2). The c is a conversion rate. Suppose we set m = 12 and c = £1. Following the logic of our basic model in Section 2, we can easily derive the implications from this set-up: In Task 1, a risk-neutral subject will choose 6 while a risk-averse [risk-loving] subject will choose a number higher [lower] than 6. And, a more risk-averse subject would choose a higher number. Of course, the implications of Proposition 3 and 4 are also valid. In Task 2, the unique Nash equilibrium is that both players choose 10. It can be shown that this equilibrium is dominant solvable by iteratively eliminating weakly dominated strategies. Since subjects’ behaviour in Task 1 is of our main concern, we do not present the results in Task 2 which are dealt with in more detail in Lee (2006). In our actual experiment, we used slightly different number sets in order to avoid the flat payoff problem (Harrison 1989). However, the interpretation of the data is not affected by this, and hence we proceed in this paper with the number sets introduced above.Footnote 4

For Task 2, the accumulated payoff mechanism (APM) has been used for half of the subjects and the random round payoff mechanism (RRPM) for the other half.Footnote 5 We call the former the APM group and the latter the RRPM group. For Task 1, all subjects faced an identical treatment: RRPM.Footnote 6 Hence, in terms of Task 1, there were only two comparable groups of the APM group and the RRPM group. The only difference between these two groups was the payoff mechanism used for Task 2.

Subjects’ total earnings from the experiment were the sum of their cash earnings in Task 1 and Task 2. Note that the earnings in the randomly chosen round are multiplied by 10 when RRPM is used for Task 1 and Task 2. The APM group faced a decision problem for Task 1 in each round with a certain amount of initial wealth from Task 2 which was accumulated through rounds. However, the RRPM group faced Task 1 in each round with a risky initial wealth since RRPM implies that the initial wealth from Task 2 is risky. Since we equated the expected earnings of APM and RRPM for any identical decision, it can be easily seen that RRPM induces a mean preserving spread payoff distribution of APM (Lee 2006). Thus, the RRPM group faced an independent and exogenous background risk to initial wealth while the APM group did not.

The implication of this design is straightforward. In each round, the RRPM group faces Task 1 with the background risk at the earnings from Task 2 while the APM group faces Task 1 with a certain amount of accumulated earnings from Task 2. Moreover, the effect of wealth accumulation from Task 2 would be diluted under RRPM. Hence, the experimental version of the hypotheses in Section 1 follows:

Hypothesis 1.1)

Choice numbers of the RRPM group are higher than those of the APM group.

Hypothesis 2.1)

The wealth effect exists under APM (DARA under APM) while it would be diluted under RRPM.

3 Experimental results

3.1 Categorising risk attitudes

Since the theoretical predictions critically depend on subjects’ risk attitude and our main concern is with risk-averse subjects’ behaviour, we first need to categorise subjects’ risk attitudes. For an illustration, we calculate the interval of the constant absolute risk aversion coefficients for each decision number in Task 1. This is shown in Table 2. It shows that a risk-neutral subject would choose 6 while a risk-averse [risk-loving] subject would choose a number higher [lower] than 6 and that a more risk-averse subject would choose a higher number. This implies that our Propositions 1 and 2 naturally hold as a result of our experimental design.

Table 2 Relationship between decision numbers and risk attitudes

In principle, we can categorise and get an interval estimate of subjects’ risk attitudes directly from the above if subjects’ preference is CARA and it is consistent. However, our data shows that subjects’ decisions tend to be inconsistent through rounds: that is, most subjects change the choices through rounds.Footnote 7 Of course, it is not surprising for a subject in the APM group to change a decision through rounds because there may be a wealth effect if his or her preference is not CARA. But our data shows that there is a more problematic inconsistency that subjects’ preferences oscillate between risk loving and risk aversion. This cannot be explained by the wealth effect. If RRPM is used for Task 2 and a subject’s preference is EU, then the independence axiom implies that he or she should make an identical choice through rounds (that is, RRPM controls the wealth effect). Though it is not impossible that the subject makes different decisions at each round under RRPM if his or her preference is not EU and he or she does not separate each decision task, the previous experimental studies do not support this possibility (Hey and Lee 2005a, b; Cubitt, Starmer and Sugden 1998; Starmer and Sugden 1991). Thus, we conjecture that there is a noise when subjects make a decision, and need to incorporate it in categorising subjects’ risk attitudes (Hey and Orme 1994; Harless and Camerer 1994; Loomes, Moffatt and Sugden 2002).

One way to incorporate the noise is to fit a flexible utility function to our data. Suppose that subjects’ utility function follows an expo-power type (Saha 1993):

$$ u{\left( w \right)} = \frac{{1 - \exp {\left({-\alpha w^{{1 - \gamma }} } \right)}}} {\alpha } $$
(3)

where w includes the accumulated initial wealth until the previous round (w 0 ) and the possible payoffs at the current round for the option i (p i ), i.e. \( w = w_{0} +p_{i}\). It can be easily shown that this utility function implies CARA as γ goes to 0 while it implies CRRA as α goes to 0. Moreover, this utility function can represent various types of risk aversion according to the relationship of the signs of α and γ. For example, the utility functional implies DARA and IRRA (increasing relative risk aversion) if both α and γ are positive while it implies IARA (increasing absolute risk aversion) and IRRA if α is positive and γ is negative. It is possible to estimate the parameters incorporating a noise by using the error response model suggested by Holt and Laury (2002). This model implies that the probability of choosing option i among our 10 possible options is

$$ \Pr {\left( {{\text{choosing}}\,\,option\,\,i} \right)} = \frac{{EU{\left( {w_{0} + p_{i} } \right)}^{{1/\mu }} }} {{{\sum\limits_{j = 1}^{10} {EU{\left( {w_{0} + p_{j} } \right)}^{{1/\mu }} } }}}$$
(4)

where i, j = 1, 2, ..., 10 and the parameter μ represents a noise: the choices will be random among 10 options as μ goes to infinity while an option with the highest expected utility is chosen as μ goes to 0.Footnote 8 We estimated three parameters for each subject by applying maximum likelihood estimation subject by subject. However, our data (i.e., 10 observations for each subject) was insufficient to get a robust result. This implies that categorising subjects’ risk attitudes only by this method would not be convincing enough.

The second way to incorporate the noise is to run interval estimation for the risk aversion coefficients shown in Table 1 assuming that subjects’ preference is CARA. This is possible since each decision number can represent the interval of the CARA coefficient. Thus if we denote the exact value of the CARA coefficient by α*, the probability that α* lies in the interval (a j , a j+1 ] is derived as follows:

$$ \Pr {\left( {a_{j} < \alpha * < a_{{j + 1}} } \right)} = F*{\left( {a_{{j + 1}} } \right)} - F*{\left( {a_{j} } \right)}$$
(5)

where F(.) denotes the cumulative distribution function of α*, j = 0, 1, ...,8. Note that a 0  = −∞ and a 8  = ∞. Assuming that the errors are normally distributed and excluding the stochastically dominated decision (x = 1), we can derive the log-likelihood function and get the estimation results for each subject by using the maximum likelihood method. However, as the case of the estimation of the expo-power utility function, the data is not enough and thus using only this method would not provide a credible categorisation. While this method is more directive in categorising risk attitudes, we cannot directly estimate whether subjects’ preference is CARA or not in both the individual level and the aggregate level since we presume CARA.

We add a supplementary third categorisation method which we call a modified mode method since we do not have enough data. By this method, we categorise each subject’s risk attitude by the modified mode. That is, through 10 rounds, a subject who chooses the number 6 most frequently is categorised as risk neutral, and a subject who chooses a number greater [less] than that number most frequently is categorised as risk averse [risk loving]. Some would argue that this method would be biased since only the number 6 allows risk neutrality while any integer number between 7 and 10 allows risk aversion. However, it should be noted that a risk-neutral preference has a strong robustness because it is only concerned with the first moment while the degree of risk aversion (or risk loving) is easily affected by the change of the higher moments: DARA preference is an example. It is also possible that a subject has preferences with a random variation (Loomes, Moffatt and Sugden 2002). In that case, a risk-neutral subject is not affected by such a preference variation, but a risk-averse [risk-loving] subject is affected by it so that his or her degree of risk aversion [risk loving] could change. We believe that this would be sufficient to justify the validity of the modified mode for the categorisation.

As a result, we categorised subjects’ risk attitudes by the following two steps. First, we estimated each subject’s risk aversion by the expo-power utility estimation or the interval estimation. If the estimation results were significant, then we categorised his or her risk attitude following the results. Then, we used the modified mode method for subjects on which the estimation did not provide a significant result. We believe that this strategy would lead to a satisfying categorisation of risk attitudes though it is difficult to estimate each individual’s risk aversion types (i.e. whether each subject’s preference is CARA or not). Table 3 summarises the categorisation results. We in this paper use the categorisation using Method I (the expo-power utility estimation supplemented by the modified method). Note that using the other categorisation methods presented in Table 3 hardly changes the qualitative results. By Method I, there are 29 risk-averse subjects and 16 risk-neutral subjects.Footnote 9

Table 3 Categorisation of subjects’ risk attitudes

Before we tested the hypotheses, we ran an estimation of the expo-power utility parameters using the aggregate data of risk-averse subjects. The estimation results are shown in Table 4. Recall from equations (3) and (4) that the value of μ decides the degree of noise and the values of α and γ decide the type of risk aversion. The second column shows the estimation results for all the rounds. We can find that there is a significant noise (μ = 0.205) and subjects show IARA and IRRA preference since α is positive and γ is negative. These results are contrary to our intuition. We suspect that the noise crucially affects these results. Thus, we use subjects’ answers on the post-experiment questionnaire: the average round in which the learning process was completed was 2.292 according to their report. Accordingly, we estimated the parameters by excluding the data up to the second round. The results are presented in the third column. Most of all, the noise parameter (μ = 0.073) is remarkably reduced as consistent with the subjects’ reports. The estimation results are also changed. Since both α and γ are positive, we find that risk-averse subjects show DARA and IRRA consistent with the results of Holt and Laury (2002).

Table 4 The expo-power utility estimation results for risk-averse subjects

3.2 Analysis on hypothesis 1

Our first hypothesis indicates that risk-averse subjects would choose a higher number under RRPM than under APM and that risk-neutral subjects’ behaviour would not be different between RRPM and APM. The number of risk-averse subjects is 14 in the RRPM group and 15 in the APM group. Their choices for Task 1 in each round are shown in Table 5 and Figure 1. For most rounds, there is a clear difference of chosen numbers between the RRPM group and the APM group: the mean chosen numbers of the RRPM group are clearly higher than those of the APM group. This difference is significant by the Mann–Whitney test (p = 0.022). The result is qualitatively the same even if we exclude the decisions in the sixth round which looks like an outlier.Footnote 10 Hence, this result supports our first hypothesis: risk-averse subjects in the RRPM group give up more (i.e., behave more risk aversely) than those in the APM group do. Recalling that the difference of payoff mechanisms is only in Task 2 (i.e., RRPM has been used for Task 1 in all sessions), this result implies that the existence of an exogenous risk at initial wealth imposed by RRPM could make people behave more risk aversely.

Table 5 Risk-averse subjects’ average chosen numbers for Task 1
Fig. 1
figure 1

Risk-averse subjects’ choice for Task 1. Note: APM and RRPM represent the accumulated payoff mechanism and the random round payoff mechanism, respectively. The decision for Task 1 is an integer x∈ [1, 10]

For purposes of comparison, the numbers chosen by risk-neutral subjects can be compared between the RRPM group (n = 8) and the APM group (n = 8). This is shown in Figure 2. Since the riskier environment induced by RRPM is not relevant to risk-neutral subjects, it is predicted that there is no difference between the RRPM group and the APM group. As predicted, the null hypothesis that there is no difference between them cannot be rejected (p = 0.797).

Fig. 2
figure 2

Risk-neutral subjects’ choice for Task 1. Note: APM and RRPM represent the accumulated payoff mechanism and the random round payoff mechanism, respectively. The decision for Task 1 is an integer x∈ [1, 10]

The implication of this result seems to be clear. Risk-averse people tend to behave more risk aversely if their exogenous initial wealth faces a risk. For example, a risk-averse company, which is to merge with another firm and take its debt, would try to take more debt to increase the chances of a successful merger if it were to face a background risk to its wealth. A risk-averse job seeker who confronts a background risk to his or her wealth would reduce his or her willingness-to-accept wage. An investor who has real estate as initial wealth would behave more conservatively in a risky stock market if the price of his real estate were at risk. Of course, it should be noted that this result is subject to our strong restriction on p(x). In fact, McGuire, Pratt and Zeckhauser (1991) show that a p(x), which would lead to a contrary result to ours, could exist. Still, we believe that our results contribute to the literature in that we first (to our knowledge) suggest the evidence supporting the theoretical prediction on the relationship between a chance improving behaviour and a background risk.

3.3 Analysis on hypothesis 2

Now, we test our second hypothesis using a simple regression analysis. That is, we test that there could exist a wealth effect under APM (especially, DARA) while RRPM could control or mitigate it.

The relationship between subjects’ risk attitudes and an initial wealth effect has been one of the topics of debate in experimental economics literature. Do we need to control for the wealth effect in a laboratory experiment where a modest payoff is typically used? Many experimenters argue no, since subjects must be approximately risk neutral for such a modest payoff (e.g., see Friedman and Cassar 2004). However, many experimental studies report that a high proportion of subjects are risk averse even for such low stakes (Holt and Laury 2002; Laury 2005; Harrison et al. 2005; Binswanger 1980). Of course, if subjects’ risk aversion is constant with initial wealth, then we still do not need to control the wealth effect.Footnote 11 However, this is an empirical question: that is, we do not know whether subjects’ preference is increasing or decreasing with initial wealth. If we allow the possibility that a subject’s risk aversion is decreasing with initial wealth for low stakes, then the wealth effect could exist for such low stakes, and hence we need to control it.

Our experiment could give us a clue to the answer to this question. We already found that more than half of the subjects in our experiment could be categorised as risk averse for the small payoffs used in our experiment. We also found evidence suggesting that the risk-averse subjects as a whole would show DARA preference. In this section, what we want to know is whether these risk-averse subjects’ behaviour is affected by accumulated wealth during an experiment under a more articulate definition of the wealth effect. That is, we want to isolate the pure effect of the accumulated wealth up to a particular round from the effects of the concern with the future opportunities for accumulating wealth and of a learning process. This basically can be done by investigating the relationship between subjects’ choices for Task 1 in each round and the accumulated earnings from Task 2 up to that round. Hence, APM for Task 2 represents a situation where the wealth effect is not controlled (i.e. the accumulation of certain amounts of initial wealth through rounds) while RRPM for Task 2 represents a situation where it is controlled (i.e. no accumulation of certain amounts of initial wealth through rounds).

Our prediction is that under the APM treatment for Task 2, if the wealth effect exists, then subjects’ chosen numbers for Task 1 will decrease as their accumulated initial wealth from Task 2 increases, implying that their preferences show DARA for an initial wealth level. However, the RRPM treatment for Task 2 could control such a wealth effect, and hence subjects’ choices for Task 1 will not be affected by the increase of initial wealth from Task 2.Footnote 12

We can construct a linear regression model to test the wealth effect as below:

$$ z_{i} = \beta _{0} + \beta _{1} r_{i} + \beta _{2} w_{i} + \beta _{3} p_{i} + \beta _{4} g_{i} + \beta _{5} d_{i} + u_{i}$$
(6)

where z i is the subject i’s choice number in each round for Task 1, r i is the round in which the subject chooses the number for Task 1, w i is subject i’s pounds value of accumulated earnings (from Task 2) up to that round, and p i , g i and d i are dummy variables for experience (experience=1), gender (male=1) and major (economics and PEP=1), respectively. Here, the variable r i is a proxy variable representing the round effect including the effects of learning and of subjects’ perception on future earning opportunities.

We cannot completely exclude a learning process through rounds though Task 1 is, we believe, quite easy to understand and there is no feedback (e.g. Loomes and Sugden 1998). We call this the learning effect. Moreover, a subject’s perception at each round on future earning opportunities could affect her current decision, and we call this the opportunity effect (Lee and Lima 2004). That is, making decisions round after round would imply the decrease of remaining earning opportunities. Consider a subject in the APM group. The subject could earn deterministic wealth from Task 2 as rounds proceed. However, at the same time, the fact that rounds proceed also implies that his or her remaining opportunities to increase wealth decrease. Suppose that the subject is to make a decision for Task 1 in the first round. He or she does not have any initial wealth at the moment but has 10 opportunities for future earnings from Task 2 by which he or she could increase wealth.Footnote 13 Hence, if his or her risk aversion is decreasing with wealth, the zero initial wealth may lead him or her to behave more risk aversely while the ten future earning opportunities decrease his or her risk aversion toward any risky choice. Given that RRPM has been used for Task 1 regardless of the payoff treatments used for Task 2, the choice behaviour for Task 1 must represent subjects’ risk attitudes after subjects contemplate both the initial wealth and the chance for future earnings from Task 2. Related arguments in terms of experimental methodology can be found in Holt and Davis (1993, pp.84–85), and in Gollier (1996) in terms of theoretical economic applications. But, the effect of the perception related to the future opportunities must be diluted under RRPM since only the choice(s) in the randomly chosen round(s) would decide a subject’s final payoff. Thus, RRPM would control the opportunity effect as well as the wealth effect.

This argument suggests that we need to consider both the opportunity effect and the wealth effect. Moreover, we need to isolate the opportunity effect from a learning effect in the round effect. In order to do this, we used subjects’ answers for the questionnaire on when they learned the structure of Task 1, which was given after they completed the experiment. We used the average number of rounds in which subjects answered that they completed learning about Task 1. According to their reports, this average was 2.292. Hence, we conjecture that learning for Task 1 was completed after the second round on average, and as a result the learning effect would not exist from the third round. If we run a regression for the samples only from the third to the final round, then the round effect would represent only the opportunity effect excluding the learning effect. Therefore, if our hypothesis is right, then there will be a significant positive round effect under APM while the round effect will not be significant under RRPM.

There were 15 risk-averse subjects in the APM group and 14 in the RRPM group. We separately ran a simple linear regression for each group. The regression results are summarised in Table 6. We also include the regression results for risk-neutral subjects in the APM group and the RRPM group for the purpose of comparison.

Table 6 Regression results for Task 1 (from third round)

In the APM group, risk-averse subjects’ choices for Task 1 significantly decrease as accumulated earnings increase, which implies that there is a significant wealth effect. As accumulated earnings increase by £1, the number chosen for Task 1 decreases by 0.442 and it is significant. Recalling Proposition 3, we can conclude that subjects’ choice behaviour becomes less risk averse as their initial wealth is accumulated. The round effect is also significant, indicating that the number for Task 1 increases by 0.211 as rounds proceed which means that subjects become more risk averse as the number of remaining rounds decreases. Because we isolated the opportunity effect from the learning effect in the round effect, this seems to support that the negative opportunity effect exists under APM.

In the RRPM group, risk-averse subjects’ choices for Task 1 also decrease as accumulated wealth increases, but it is not significant implying that we cannot reject the null hypothesis that the wealth effect does not exist.Footnote 14 Moreover, we find that for risk-averse subjects, the round effect is not significant under RRPM. Since we can regard this round effect as the opportunity effect, we may conclude that there is the opportunity effect under APM, but RRPM controls it. This result is in accordance with our hypothesis and supports the idea that subjects may separate each task under RRPM: at least, subjects may not consider future earning opportunities when they face a current decision under RRPM. The results also partially support Gollier (1996)’s theoretical prediction that a risk-averse EU maximiser will behave less risk aversely if there are remaining opportunities for identical and independent risky gambles.Footnote 15

The accumulated wealth effect is insignificant for risk-neutral subjects regardless of the payoff mechanisms. Moreover, our prediction that the opportunity effect would be insignificant for risk-neutral subjects is supported.

Hence, we conclude that only risk-averse subjects under APM sessions are affected by initial wealth accumulation through rounds (specifically, they show DARA), and that RRPM could control the initial wealth effect for risk-averse subjects. Thus, our Hypothesis 2 is supported.

4 Conclusion

In this paper, we attempted to fill the gap between many theoretical studies and the little empirical evidence on the effect of the independent and exogenous background risk to wealth. Moreover, we tackled an important issue in the experimental economics methodology: whether subjects show DARA in laboratory experiments and whether RRPM can control the wealth effect.

We found that the background risk makes subjects behave more risk aversely in our very simple chance improving decision situation. While our set-up of the probability distribution of outcomes may be quite restrictive, the test result is clear: it unambiguously supports the theoretical prediction on the effect of the background risk given the particular probability distribution. This result becomes more robust with findings that subjects tend to show DARA as their initial wealth is accumulated. It is a little striking in the theoretical sense that subjects show DARA in a laboratory experiment where a moderate stake is used. However, it was already striking that many experimental studies have found that most subjects were risk averse in the same environment. In fact, this issue is ultimately an empirical one which could be settled after evidence is accumulated. In the meantime, it would be a better strategy for an experimenter to control the possible wealth effect. We in this paper found that RRPM would work well for this purpose. Our result suggests with the previous experimental studies on the effect of RRPM (Hey and Lee 2005a, b; Cubitt, Starmer and Sugden 1998; Starmer and Sugden 1991) that RRPM could be a valid and better payoff mechanism under repetition in individual decision making experiments.