1 Introduction

Supply chains facing asymmetrically distributed information basically have two opposite means executing their operations. On the one hand, they can choose to operate in a cooperative mode in which private information is communicated truthfully and the benefits are shared. On the other hand, they can choose a non-cooperative form of interaction and align their incentives via complex contract schemes without communicating private information. In the cooperative mode, supply chain efficiency can be achieved, but high levels of trust and trustworthiness are required. In the non-cooperative mode, the contract mechanism guarantees a second best supply chain performance, but only if all parties choose their equilibrium strategies without trembles. However, basically all of the experimental literature on behavior in supply chains shows that neither a perfectly cooperative nor a perfectly non-cooperative outcome can be achieved. Cooperation is frequently shown to be disrupted by deceptive signals or the lack of trust in truthful signals (see Özer et al. 2011; Inderfurth et al. 2013). The non-cooperative equilibrium and profit maximization often fail due to some persistent degree of out-of-equilibrium behavior that may be due to misperceptions, bounded rationality, or social preferences (see Schweitzer and Cachon 2000; Lim and Ho 2007; Bolton and Katok 2008; Katok and Wu 2009; Wu 2013; Kremer et al. 2010; Becker-Peth et al. 2013). In the context of asymmetric information, we define strategic risk as the risk that the supplier faces either concerning the consistency of the buyers’ shared information and contract choice behavior or concerning the buyers adherence to payoff maximizing behavior.

In this paper we present an experimental study in which we compare a baseline experimental treatment in supply chain interaction with two other treatments involving reduced strategic risk. In one of the two treatment variations strategic risk is lower due to the introduction of a punishment option that allows suppliers to punish apparently uncooperative buyers. In the other treatment variation, the strategic risk of the supplier is lower than in the baseline due to the fact that buyers’ profit-maximization is enforced in screening contracts.

We consider an adverse selection problem in a simple serial supply chain in which the buyer (she) holds private information. The supplier (he) offers either a simple coordinating contract (in the following referred to as “simple contract”) or a screening contract (also referred to as “a menu of contracts”). The simple contract is first best, only if the buyer (she) shares her private information truthfully. However, if the buyer strategically misrepresents her private information, the supplier does best by ignoring the signals and offering the screening contract (i.e. the screening contract is inefficient, but second best). While a number of studies have analyzed the effectiveness of different contracts formats in supply chains with asymmetric information (see Özer et al. 2011, 2014; Inderfurth et al. 2013; Spiliotopoulou et al. 2015), this study is the first that examines the endogenous choice of these contract types. This endogenous choice of the contract type allows us to identify the role of strategic risk in the different operational modes. The implications for our research provide valuable insights for supply chain managers concerning contract type decisions.

Knowing that the predictability of the buyer’s behavior is a key driver when suppliers choose between contract types, we systematically investigate how the reduction of each type of strategic risk affects the subtle interactions between contract offers, information sharing, and contract choices. First, we provide the supplier with a punishment mechanism that enables him to punish apparently uncooperative behavior and, thus, to substantially reduce the incentives for deception.Footnote 1 Second, we enforce the buyers’ self-selection once a screening contract was offered by the supplier. This enforcement eliminates the strategic risk that the buyer destroys supplier’s payoffs by contract choices that entail a small loss for the buyer but a substantial loss for the supplier.

We find that reducing strategic risk improves supply chain performance in both the cooperative and the non-cooperative mode of operation. With the suppliers’ punishment mechanism, buyers’ send more truthful signals leading to a better alignment of trust and trustworthiness. The enforcement of self-selection leads to a significantly greater number of screening contract offers which reduces efficiency losses from a misalignment of trust and trustworthiness. Thus, strategic risk management is profitable for the supply chain no matter if using cooperative or non-cooperative contract design.

Our results are highly relevant for the supply chain coordination and contracting literature. Supply chain managers should seek to reduce the variability and increase the predictability of the supply chain partners’ behavior in adverse selection environments. Punishment mechanisms—such as terms of contracts that penalize hostile deviations from “business as usual” between supply chain partners—can increase the trustworthiness of communication and can boost performance of supply chains operating under relatively simple contract formats. However, in business interactions lacking punishment options, tailoring contracts to shared information may cause huge losses. In these cases, suppliers should offer screening contracts, but set the incentives for the buyers high enough to guarantee the desired self-selection and to reduce the strategic risk of buyers’ choices deviating from profit maximization. Therefore, suppliers may consider adding a slack that increases the cost of deviation for buyers and facilitates behaviorally robust self-selection (see Sect. 3 for a more formal argument that is in line with Laffont and Martimort 2002, Chapter 9.8.1 and Voigt 2015).

2 Literature review

The present study is most closely related to game-theoretic and behavioral work in the field of supply chain coordination via contracts. We review both game-theoretic and behavioral work on contracting under full and asymmetric information.

2.1 Game theoretic models on contracting in supply chains

In non-cooperative supply chains, there is a large body of work showing that the less informed supply chain party maximizes his expected profit by offering a sophisticated menu of contracts, i.e., screening contract (see, for example, Corbett et al. 2004, and the references therein). These screening contracts align the incentives of the supply chain members in a way that the holder of the private information reveals her information by the contract choice. Nonetheless, the outcome is inefficient from a supply chain perspective. In contrast, cooperative supply chains may engage in truthful information sharing and trusting information processing. In such a cooperative operating mode, simple coordinating contracts (e.g., two part tariffs) are tailored to the truthfully shared information, resulting in supply chain efficient outcomes (see e.g. Goyal 1977). This cooperative view stresses that communication (e.g. Cachon and Fisher 2000) and trust (Moore 1998; Zaheer et al. 1998) are necessary for successful supply chain management.

2.2 Behavioral studies in full information context

There are numerous studies investigating supply chains that face uncertain demand (e.g. in the newsvendor context) and operate under simple wholesale price contracts. In most of these studies, the wholesale price is an exogenous parameter and is used as a focus variable for identifying decision biases for high- and low-profit situations (see, e.g., Schweitzer and Cachon 2000; Katok and Wu 2009; Kremer et al. 2010). In contrast, Keser and Paleologo (2004) investigate the supply chain behavior in the newsvendor context when the wholesale price is not exogenously determined, but set by the supplier who has full information regarding the distribution of stochastic demand. Becker-Peth et al. (2013) show that wholesale prices in combination with a buy-back component (buy-back contract) can be systematically manipulated in order to account for decision biases of the buyer. Lim and Ho (2007) experimentally investigate the effect of contract design on the inefficiencies resulting from double marginalization in a deterministic demand setting. Their primary focus lies on how the number of price blocks in a quantity discount scheme under full information impacts the supply chain performance.

Note that some of the papers mentioned above study decisions in supply chains without interaction. In these games strategic risk obviously plays no role. However, these studies establish that there is considerable behavioral variance even in the individual decision making setting. In this study, we take the literature a step further by focusing on the question how behavioral variance plays out in a truly interactive setting.

2.3 Behavioral studies in asymmetric information context

Inderfurth et al. (2013) study the impact of information sharing on the supplier’s screening contract offers in a dyadic supplier–buyer supply chain. They do not allow for an endogenous choice of the contract type. Kalkanci et al. (2011) present an experimental analysis of the impact of contract complexity under asymmetric demand information in a dyadic supplier–buyer supply chain. They analyze how the supplier sets the price-breaks for an all-unit quantity discount. In contrast to our experimental setup, however, buyers’ decisions are automated and strategic risk in the interaction is not considered. Özer et al. (2011, 2014) investigate the interaction of supply chain members given a simple wholesale price contract and asymmetric information. They find that there is partial truth-telling and trust although theory predicts that all communication accounts to no more than cheap talk. In contrast to our experimental setup, the supplier in Özer et al. (2011, 2014) was limited to only offering wholesale price contracts, while we allow an endogenous choice of the contract type.

3 Outline of the model

We briefly review a strategic lot sizing model and refer the interested reader for a broader motivation and more concise derivation of the game theoretic benchmarks to Voigt and Inderfurth (2012) and Voigt (2011).

We assume a serial supply chain consisting of a supplier delivering a product to a buyer. We assume that the buyer’s demand is deterministic and constant over time and, without loss of generality, it is standardized to one unit per period. Hence, unit profits equal period profits. The product is sold at an exogenously determined price to the end-customer, i.e., the impact of ordering and pricing decisions on the buyer-customer interface are not considered in the underlying model. The supplier incurs fixed cost, f, per replenishment and therefore prefers high order sizes. In contrast, the buyer prefers low order sizes, because she incurs holding cost h for every unit stored per period. The holding cost may vary from period to period depending on a number of parameters (e.g., cost of capital, handling and storage cost, etc.). Instead of modeling the holding cost explicitly, we assume that the holding cost is a random variable. The distribution of the holding cost is known to both parties, but the actual realization is only known to the buyer. We assume that all external customer demand must be immediately fulfilled by the buyer, i.e., backlogging is not allowed.

The model captures the basic conflict of interest in supply chain management that buyers prefer low order sizes, while suppliers prefer high order sizes (see Corbett and de Groote 2000; Sucky 2006). In case of uncoordinated actions, the buyer’s profit maximizing order size is arbitrarily close to zero (i.e., no inventory holding due to just-in-time delivery) and the supplier’s profit maximizing order size is infinite (i.e., arbitrarily small fixed cost per period). Thus, both individual optimal order policies are contrary and some form of negotiation has to take place to agree upon acceptable order sizes for both parties.

We assume that the buyer negotiates the terms of delivery and asks the supplier to ship in smaller lots in order to reduce holding cost. The supplier, in turn, tries to induce a higher order size to lower his average cost per unit. The supplier has to take into account the buyer’s outside option, i.e., sourcing from an alternative supplier at cost R. The supplier has fixed revenues of \(Y_{s}\). The buyer has fixed revenues of \(Y_{b}^{'} = Y_{b} + Y_{s}\), i.e., the buyers revenues are a markup on the suppliers fixed revenues.Footnote 2

Note that our research is not strictly limited to this specific operations management environment, but can be generalized to other adverse selection problems. The first essential assumption is that the supplier has convex cost in the contract variables. The second is that the buyer has a linear cost function that is private information. As an example, an analogous model setup follows when the buyer’s demand rate (which we standardized to one in our setup) is private information. In this case, the high demand buyer would try to mimic the low demand buyer in order to claim a higher compensation for larger lot sizes.Footnote 3

3.1 Full information and simple coordinating contracts

Under full information, the supplier has knowledge of the buyer’s realization of holding cost (h). Hence, the supplier can offer the buyer an optimal price per unit, w, to promote a higher order size, q. This leads to the contract \(\left\langle {w,q} \right\rangle\) as an outcome of the following optimization problem:

$$\begin{aligned} {\text{Problem}}\;{\text{FI}}{:}\,\,\hbox{max} \;\pi_{s} = w - \frac{f}{q} \hfill \\ s.t.\;\pi_{b} = Y_{b}^{'} - w - \frac{h}{2} \cdot q \ge Y_{b}^{'} - R. \hfill \\ \end{aligned}$$

Since demand is standardized to one unit per period, \(\pi_{s}\) and \(\pi_{b}\), denote the supplier’s and the buyer’s unit profit margins, respectively. The supplier’s objective function maximizes his unit profit. In the optimal solution to problem FI, the buyer’s participation constraint is binding. Solving the participation constraint for \(w\), we insert \(w\) in the objective function and optimize optimal order quantity, and get \(q^{FI} = \sqrt {\frac{2f}{h}}\). The optimal order size, thus, resembles the classical economic order quantity with demand per period standardized to one unit. The optimal unit price, \(w^{FI}\), is set to satisfy the buyer’s participation constraint, ensuring that the buyer’s profits are not smaller than in her outside option, i.e., \(w^{FI} = R - \frac{h}{2} \cdot q^{FI}\). The optimal contract parameters under full information \(\left\langle {w^{FI} ,q^{FI} } \right\rangle\) not only optimize the supplier’s profit, but also the overall supply chain performance.

3.2 Asymmetric information and screening contracts

Under asymmetric information, the holding cost realization is only known to the buyer, but not to the supplier. The supplier only has the information on the probability distribution \(p_{i} ,i = 1, \ldots ,n\) over possible values of the buyer’s holding cost \(h_{i} ,i = 1, \ldots ,n;\;h_{1} > \cdots > h_{n}\).

The basic screening idea is that the profit maximizing buyer reveals her private information with her contract choice. Let \(\pi_{b,i} \left( {q_{j} } \right) = Y_{b}^{'} - w_{j} - h_{i} /2 \cdot q_{j}\) denote the unit profit margin of the buyer facing holding cost h i , and choosing the contract \(\left\langle {w_{j} ,q_{j} } \right\rangle\). Information revelation is ensured by the incentive constraint \(\pi_{b,i} (q_{i} ) \ge \pi_{b,i} (q_{j} ),\;\forall i \ne j;\;i,j = 1, \ldots ,n\). The buyer facing holding costs h i will always choose the offer \(\left\langle {w_{i} ,q_{i} } \right\rangle\) as any other contract \(\left\langle {w_{j} ,q_{j} } \right\rangle\) will result in a lower unit profit margin. The participation constraint \(\pi_{b,i} \left( {q_{i} } \right) \ge Y_{b}^{'} - R,\;\forall i = 1, \ldots , n\) ensures that the buyer will not benefit from choosing the alternative supplier.

Let \(\pi_{s,j} = w_{j} - f/q_{j}\) denote the supplier’s unit profit margin if the buyer chooses the contract q j . Due to the incentive constraint the supplier knows that the buyer will choose the contract \(\left\langle {w_{i} ,q_{i} } \right\rangle\) with probability p i . Hence, the supplier can maximize his expected unit profit margin with the following optimization problemFootnote 4:

$$\begin{aligned} {\text{Problem}}\;{\text{AI:}} \,\,\hbox{max} \;E\left[ {\pi_{s} } \right] = \sum\limits_{i = 1}^{n} {p_{i} \cdot \pi_{s,i} } \hfill \\ s.t.\quad \pi_{b,i} \left( {q_{i} } \right) \ge \pi_{b,i} \left( {q_{j} } \right),\quad \forall i \ne j;\;i,j = 1, \ldots ,n \hfill \\ \pi_{b,i} \left( {q_{i} } \right) \ge Y_{b}^{'} - R,\quad \forall i = 1, \ldots ,n \hfill \\ \end{aligned}$$

The following notation is used to refer to the supplier’s optimal menu of contracts (screening contract) \(A = \left( {A_{i} |i = 1, \ldots ,n} \right)\) where \(A_{i} = \left\langle {w_{i}^{AI} ,q_{i}^{AI} } \right\rangle ,\;\forall i = 1, \ldots ,n\). Furthermore, \(F_{i} = \left\langle {w_{i}^{FI} ,q_{i}^{FI} } \right\rangle ,\;\forall i = 1, \ldots ,n\) denotes the supply chain’s optimal contract when the buyer faces holding costs h i . We refer to Voigt (2011) for a derivation of the optimal menu of contracts.

One important feature of the screening contracts is that order sizes are increasing with decreasing holding cost levels, i.e., \(q_{i}^{AI} \ge q_{i + 1}^{AI}\) and the unit prices are decreasing with decreasing holding cost levels, i.e. \(w_{i} \le w_{i + 1} ,\;\forall i = 1, \ldots ,n\). The menu of contracts can, therefore, be interpreted as a quantity discount that is inefficient, since all order sizes except \(q_{1}^{AI}\) are downward distorted. If the supplier would have full information, he could offer \(q_{i}^{FI}\) instead of letting the buyer self-select the distorted order size \(q_{i}^{AI}\) and, thus, enhance the supply chain performance. A numerical example follows in the next section on the basis of the parameters we use in our experiment.

We derive the theoretical benchmark along with our research hypotheses in Sect. 4.2. Note, that our model only captures those elements that are relevant from a game-theoretic perspective. For our experiment, however, we add a few game-theoretically irrelevant features (punishments and rewards, additional incentives for self-selection) to identify behavioral effects. In the discussion of the hypotheses, we explain why those features have no impact from a game-theoretic perspective (Hypotheses 1a–d, game-theoretic benchmark), but may have a behavioral impact (Hypotheses 2a–c, punishment treatment, and Hypothesis 3, enforced self-selection treatment).

Finally, note that the supplier might want to limit himself by offering only one contract even under asymmetric information. If he wants to make offers that leaves all cost types at least the reservation profit, then the contract \(F_{H}\) would be optimal. This can be easily confirmed by solving problem AI with \(\left\langle {w_{i} ,q_{i} } \right\rangle = \left\langle {w,q} \right\rangle ,\;\forall i = 1, \ldots ,n\). In turn, the assumption that all buyers receive an acceptable offer may be lifted (frequently referred to as a “cut-off” policy). Under these assumptions, the contract FM turns out to be the supplier’s optimal simple contract offer with expected profits of 47.22 followed by \(F_{L}\) with 46.76 and \(F_{H}\) with 36.9.

4 Experimental design, implementation, and research hypotheses

The experimental software was implemented with the toolbox z-Tree (Fischbacher 2007). Participants were recruited online using ORSEE (Greiner 2015) and randomly distributed over the treatments. The subject pool contains graduate and undergraduate students of a mid-scale university in Germany.

Upon arrival, each participant received written instructions that were read out aloud (see online-supplement). The instructions for both roles (supplier/buyer) were identical. All remaining questions were answered privately in the subject’s cubical at the experimental laboratory. Subjects were paid according to their performance in the experiment.

Subject’s payments were the total cumulative points earning during the experiment multiplied with a conversion rate of 0.01, i.e. every experimental monetary unit exchanges to 1 cent. The average earnings were 9.7 EUR (Max: 18.5 EUR/Min: 7 EUR). Sessions lasted no longer than 90 min. The experiment consisted of three treatments with a total of 72 subjects. Subject were recruited using a standard recruiting software and were randomly assigned to treatments. From a subject pool of almost 2000, most subjects were undergraduates in a mixture of all study programs. About 47 % of the subjects were female. Each subject participated in only one treatment (between subjects design). All subjects played 30 rounds.

We consider a serial supply chain consisting of one buyer and one supplier. The subjects were randomly assigned the role of buyer or supplier after the instructions were read. The matching of buyers and suppliers remains unchanged over time.

In the following, we present the experimental design for our three treatments (see Table 1). We compare the baseline treatment to two behavioral variations with reduced strategic risk. 24 subjects participated in each treatment (12 buyers and 12 suppliers), leaving twelve independent observations per treatment.

Table 1 Treatment summary

4.1 Treatment design

4.1.1 Baseline treatment

4.1.1.1 Parameters

There are three holding cost realizations \(h_{L} = 1,\;h_{M} = 5\) and \(h_{H} = 9\) that occur with the corresponding a priori probabilities \(p_{L} = 0.4,\;p_{M} = 0.3\) and \(p_{H} = 0.3\). The holding costs are drawn independently in every round according to the distribution function which is common knowledge. Thus, frequencies of holding cost realizations might differ across treatments. The supplier’s total fixed cost is f = 800 and the buyer’s unit cost of sourcing from the alternative supplier is R = 157. The buyer’s revenue is fixed at \(Y_{b}^{'} = 160\) per round. The supplier’s revenue is fixed at \(Y_{s} = 155\) per round.Footnote 5

4.1.1.2 Decision sequence

(t = 1) At the beginning of each round, the buyer sends a signal S that may communicate her holding cost to the supplier, where \(S \in \left[ S_{L} = h_{L} ,S_{M} = h_{M} ,S_{H} = h_{H} ,S_{No}\, \doteq \,No\;Signal \right]\).Footnote 6

(t = 2) After the buyer sends her signal, the supplier offers a contract. Contract offers are restricted to (i) \(F_{L} = \left\langle {w_{L}^{FI} ,q_{L}^{FI} } \right\rangle\), (ii) \(F_{M} = \left\langle {w_{M}^{FI} ,q_{M}^{FI} } \right\rangle\), (iii) \(F_{H} = \left\langle {w_{H}^{FI} ,q_{H}^{FI} } \right\rangle\) and (iv) \(A = \left( {A_{L} ,A_{M} ,A_{H} } \right)\) where \(A_{i} = \left\langle {w_{i}^{AI} ,\,q_{i}^{AI} } \right\rangle\), \(i \in L,M,H\). Thus, the supplier may either offer one of the three tailored contracts F i (i.e. one of the contracts that are optimal under full information, see Problem FI) or the screening contract A that is optimal (but second best) under asymmetric information (see Problem AI).

(t = 3) If the supplier offers a tailored contract in t = 2, then the buyer can either accept or reject this offer. If the supplier offers the screening contract A in t = 2, the buyer chooses \(A_{L}\), \(A_{M}\), \(A_{H}\), or rejection. If the buyer rejects the offer, she sources from the alternative supplier at the cost R, the supplier realizes zero profits, and the next round starts.

(t = 4) The supplier has the option to give the buyer a reward. This is, because the truth-telling buyer will only receive her reservation profit under a tailored \(F_{i}\)—contract. The reward allows the supplier to compensate the buyer for the profit difference between the screening contract and the reservation profit. Because of the efficiency gap between screening contracts and tailored contracts, the reward can be set such that both parties are better off. The buyer, thus, faces a situation similar to the well-known trust game (see, e.g., Berg et al. 1995): the buyer is vulnerable if she sends a truthful signal and has to trust that she receives a sufficient reward from the supplier that compensates for accepting the efficiency enhancing \(F_{i}\)—contract.

The maximum size of the reward is limited to the supplier’s earnings in the respective round. This limit on rewards still gives sufficient leeway for cooperative play resulting in win–win outcomes. In particular, the highest reward that is necessary to implement win–win cooperation occurs when the buyer signals low holding cost and accepts the efficient contract accordingly. In this case, the minimum reward that leaves the buyer indifferent between reporting truthfully and accepting and the respective profit in the screening contract is 43.77.

Finally, note that the true holding cost realization is never revealed to the supplier. The supplier can only check consistency (contract choice in the screening contract matches the signaled holding cost level) or the credibility of the signal by statistical interference.

(t = 5) A new holding cost parameter is drawn in every round. Thus, the supplier can neither infer the buyer’s holding cost parameter of the next round through the buyer’s signal nor through the buyer’s action in the current period.

Table 2 depicts the parameter values and the resulting payoffs for our experiment. For example, if the buyer has low holding costs of \(h_{L}\) and accepts the contract \(F_{M}\), she realizes a profit of \(\pi_{b} = 38.88\) (net of the additional reward in t = 4) and the supplier realizes \(\pi_{s} = 67.46\) (net of the additional reward in t = 4).

Table 2 Contracts and corresponding profits

Note that the participation constraint makes the buyer of type i (with holding cost \(h_{i}\)) indifferent between her outside option and the contract \(F_{i}\). To avoid indifference in the experiment, we add 0.1 to the buyer’s profit when accepting the contract \(F_{i}\). Similarly, we break the tie between accepting the self-selection option of the screening contract for the true holding cost versus the next higher holding cost, by adding 0.1 to the true option.Footnote 7 For example, if a buyer with \(h_{M}\) chooses the contract \(A_{H}\), she earns 0.1 less than she earns by choosing the self-selection option \(A_{M}\). In this case, we refer to \(A_{H}\) as the indifference contract and to \(A_{M}\) as the self-selection contract.

The following extensive game form in Fig. 1 illustrates the effects of trust and mistrust in our experimental design. The buyer’s decision node is denoted with ‘B’ and the supplier’s with ‘S’, respectively. Additionally, ‘N’ denotes the so-called nature’s decision node, i.e. the nature decides with respect to the a priori probabilities which holding cost realization is assigned to the buyer. The buyer gives a signal \(S \in \left( {S_{L} ,S_{M} ,S_{H} ,S_{No} } \right)\). The supplier cannot distinguish whether a report is truthful or not. This fact is captured by the shaded boxes. Each of these boxes contains the same signal, e.g. \(S_{H}\), but this signal is not necessarily identical to the holding cost realization, e.g. \(h_{H}\).

Fig. 1
figure 1

Payoff consequences of trust and mistrust (adapted from Voigt 2011, p. 29)

The supplier’s decision to trust (i.e. tailor a \(F_{i}\) contract to the signal) is denoted with ‘T’ and to mistrust (i.e. offer a screening contract) with ‘MT’ respectively. For the sake of clarity, Fig. 1 only shows the payoff consequences for the cases in which the buyer reports truthfully, overstates his holding costs or refuse to give a signal. Understatements of holding cost realizations are indicated by dotted circles. All payoffs that result for a specific action sequence do directly follow from Table 2 and \(Z_{R}\) denotes the amount of reward from the supplier to the buyer. For the sake of clarity, we assume that no reward is paid if the supplier mistrusts (i.e. \(Z_{R} = 0\)). Furthermore, the game-tree does not capture the buyer’s choice to choose the alternative supplier.

Obviously, cooperative behavior (i.e. report truthfully and trust) can lead to Pareto improvements compared to mistrust, as long as the additional side-payment is sufficiently high. For example, if the buyer faces cost \(h_{M}\), reports truthfully \(S_{M}\), and the supplier trusts, then the total supply chain profit is 67.46 + 3.1 = 70.56.Footnote 8 Yet, if the supplier mistrusts the signal and offers the menu of contracts, the profits under self-selection are 42.72 + 21.88 = 64.6. The efficiency loss is therefore 70.56 − 64.6 = 5.96. The side-payment resulting in a win–win outcome can be calculated from \(67.46 - Z_{R} \ge 42.72\) and \(3.1 + Z_{R} \ge 21.88\), i.e. \(18.78 \le Z_{R} \le 24.74\).

In turn, consider a buyer who faces holding costs \(h_{L}\) and reports \(S_{H}\) (see bolded line in Fig. 1). If the supplier trusts and offers \(F_{H}\) than the supplier yields profits of \(\pi_{s} = 36.9 - Z_{R}\) and the buyer \(\pi_{b,l} = 56.43 + Z_{R}\), respectively. The supply chain profits result from \(36.9 + 56.43 = 93.33\). Yet, if the supplier mistrusts (i.e. offering the menu of contracts) the supply chain profits would amount to \(73.13 + 46.87 = 120\) (see dotted line in Fig. 1). Hence, trust in comparison to mistrust leads to an efficiency loss of \(120 - 93.33 = 26.67\).

Finally, note that communication can only be an effective coordination instrument as long as the supplier does not simply ignore all signals. In this case, the menu of contracts would be offered regardless of the buyer’s signal. The interrupted lines in Fig. 1 highlights this for the holding cost realization \(h_{M}\). Obviously, the outcome is independent of the buyer’s signal and communication is not effective.

4.1.2 Punishment treatment

In the punishment treatment, the supplier has the option to punish the buyer or to reward the supplier as in the baseline treatment. The punishment option provides the buyer a means to retaliate uncooperative behavior. Yet, since the true holding cost level is never revealed to the buyer, the supplier’s intent to play cooperative or uncooperative can only be inferred from contract choice behavior (e.g., inconsistencies between the signaled cost and the selected contract option indicate uncooperative behavior) or the signaling strategy (e.g., too frequent high cost signals indicate uncooperative behavior).

The supplier may punish the buyer by arbitrarily reducing her profits in t = 4. Punishment, however, is costly. Every unit of punishment costs the supplier 0.2. The maximum punishment per round is limited to 40. We conducted this treatment since punishments are a realistic threat in supply chain relationships (see Cui et al. 2007 for a discussion on punishments in supply chain relationships). As Kahneman et al. (1986) put it: “Even profit-maximizing firms […] are willing to […] punish unfair firms at some cost […]” (see Kahneman et al. 1986, p. 285). In practice, such a punishment may be operationalized by terms of contract that penalize deviations from business-as-usual.

4.1.3 Enforced self-selection treatment

The only difference between the baseline treatment and this treatment is that the buyers in this treatment are forced to choose the profit-maximizing self-selection contract in t = 3 whenever offered a menu of contracts. In particular, indifference contract choices are not possible which means that the self-selection mechanism of the screening contract works perfectly, substantially reducing suppliers’ strategic risk.

In practice, self-selection can be enhanced by adding slack into the incentive constraint (see footnote 4). Although our treatment cannot be used to assess the optimal threshold for the buyers’ self-selection incentives, it provides a clear benchmark on the effects of sufficiently high slack in incentive constraints.

4.2 Research hypotheses

This section provides the game theoretic benchmark as well as the behavioral research hypotheses for our three treatments. Note, that the game-theoretic benchmark is identical in all three treatments. This is, because we only test behavioral features that are irrelevant on the equilibrium path. We present the game-theoretic benchmark in Hypotheses 1a–1e, the behavioral research hypotheses for the punishment treatment in Hypotheses 2a–2c, and the behavioral research hypothesis for the enforced self-selection treatment in Hypothesis 3.

Our game-theoretic benchmark is derived from a one-shot game. This approach is in line with a number of other experimental papers (see Boles et al. 2000; Croson et al. 2003) that study repeated interaction games in which the one-shot game equilibrium is also the sequentially rational equilibrium of the finitely repeated game.Footnote 9

Hypothesis 1a: Communication is uninformative Since the preferences in the underlying game are perfectly opposed (i.e., the buyer always tries to convince the supplier that she has high holding cost), and since private information is independently distributed between periods, credible information sharing cannot take place in the non-cooperative equilibrium (see Crawford 1998; Fudenberg et al. 1990).

Hypothesis 1b: Suppliers ignore signals As the fully rational supplier anticipates that all signals are uninformative (see Hypothesis 1a), we will not observe any correlation between the buyer’s signals and the supplier’s behavior.

Hypothesis 1c: Suppliers only offer screening contracts Without informative signals (see Hypothesis 1a), the screening contract maximizes the supplier’s expected profits. Hence, we will only observe screening contracts.

Hypothesis 1d: Buyers choose the self-selection option of the screening contract The optimal choice of profit maximizing buyers, who are offered a screening contract (see Hypothesis 1c), is the self-selection option.

Hypothesis 1e: Punishments and rewards are not observed Since suppliers offer screening contracts and buyers self-select into their profit maximizing option (Hypotheses 1a–1d), we only observe equilibrium play, in which—by definition of a game-theoretic equilibrium—payoffs cannot be increased using the reward and punishment options (see, e.g., Fudenberg and Tirole 1992).

There is ample evidence in experimental economics that punishments options have a positive effect on cooperation in social dilemmas such as ultimatum bargaining, trust games or voluntary contribution to public goods. Instead of reviewing this extensive literature here, we refer interested readers to a recent meta-analysis on the effects of punishment and rewards on cooperation in social dilemmas by Balliet et al. (2011). They report that a punishment option is particularly effective when punishment is costly and administered in repeated interaction. Since both conditions hold in our punishment treatment, we expect a cooperation enhancing effect of the punishment option. Cooperation in our setup is characterized by supply chain profit maximizing actions (see Hypotheses 2a, 2b, and 2c, below) resulting from contracts that are tailored to truthfully shared information.

Hypothesis 2a: The frequency of truthful signals is higher in the punishment than in the baseline treatment Our experimental design entails a cheap talk game, in which a sender (the buyer) gives a signal based on her private information about her type (i.e. the holding cost), with the receiver (the supplier) reacting to this signal. We refer to Crawford (1998), for a comprehensive survey on cheap talk theory. Brandts and Charness (2003) report an experimental laboratory study in which the receiver observes ex-post whether the signal was deceptive or honest. They report that receivers show a strong and significant tendency to punish deceptive senders. Sánchez-Pagés and Vorsatz (2007) replicate the result that and show that senders are significantly less deceptive with punishment than without and tend to give no signals (similar to our notion of \(S_{No}\)) more frequently. Peeters et al. (2013) report significantly higher levels of truthfulness and less deception when a punishment option is available.

In contrast to above mentioned experimental studies, our experimental design does not allow to verify ex-post if a signal was honest or deceptive when a simple contract is chosen. However, supplier can observe signal consistency with screening contracts and can discover statistically excessive high cost reporting. Hence, we hypothesize that buyers send truthful signals more often in the punishment treatment compared to the baseline, because they fear to be punished more frequently for apparently uncooperative behavior when suppliers have a punishment option. We define buyers’ behavior as apparently uncooperative if the high cost signal is send significantly more often as statistically expected or if there is an inconsistency between the signaled cost and the selected contract option.

Hypothesis 2b: The supplier offers the contract tailored to the signal more frequently (i.e., he shows a higher level of trust) in the punishment treatment than in the baseline treatment Sánchez-Pagés and Vorsatz (2007) and Peeters et al. (2013) report a significantly higher level of receiver’s trust in the sender’s signal when a punishment option is available. We therefore expect the supplier to show a higher level of trust in the punishment treatments, in which the threat of punishment enhances truthful and consistent signals. A higher level of trust is characterized by a higher frequency of simple contracts that are tailored to the respective signal.

Hypothesis 2c: The supply chain performance is higher in the punishment treatment than in the baseline treatment We have two reasons to believe that the punishment option may enhance supply chain performance. First, we expect the punishment option to increase truthful signaling (2a) and trust (2b). Thus, supply chain actions are better coordinated, because simple contracts fit to the actual holding cost level. Second, we expect the punishment option to decrease the frequency of inconsistent indifference contract choices.

Hypothesis 3: Screening contracts are offered more frequently in the enforced self-selection treatment than in the baseline treatment Consider a buyer giving a deceptive signal, e.g., she signals high holding costs although she only incurs medium holding costs. In case the supplier offers the menu of contracts, the buyer can easily pretend to be honest by choosing the high cost contract that fits to her signal (but not to her holding cost realization). If the supplier anticipates that the buyer may choose an indifference contract in order to cover up her deceptive signal, then offering a simple tailored contract might be profit maximizing. In these cases, offering a simple contract does not result from trust in the signal, but from mistrust towards the buyers’ self-selection behavior. Thus, eliminating the strategic risk of not choosing the self-selection contract allows us to disentangle trust from strategic risk avoidance in this treatment.

5 Experimental results

We present the results of the experiments in the sequence of the decisions taken, i.e., the signal, the contract offers, the contract choices, and the cooperation facilitating role of punishments and rewards. We conclude the analysis with analyzing the impact of observed behavior on supply chain performance.

5.1 Buyers’ signaling behavior

Table 3 summarizes the frequencies of signals across treatments. If all buyers’ reported truthfully, we would expect a frequency of signals according the a priori distribution, i.e., 40, 30 and 30 % for the low cost, medium cost and high cost level respectively. We observe a clear shift to the strategically most relevant high cost signals in all treatments, however, the shift is more pronounced in the baseline treatment than in the two control treatment. Omitting all \(S_{No}\) observations, we find a highly significant correlation between holding cost levels and signals (Spearman, p < 0.01, ρ = 0.4). Thus, buyers give significantly higher cost signals if holding cost increase and vice versa. We therefore reject hypothesis 1a (“Communication is uninformative”).

Table 3 Frequencies of signals in treatments

The following Fig. 2 shows the development of truthfulness over time. The horizontal line indicates the average level of truthfulness across all treatments, i.e., 53 %. A visual inspection of the graphs tends to support our hypothesis that truthfulness is higher in punishment treatment than in the baseline treatment.

Fig. 2
figure 2

Average truthfulness by period and treatment

In order to further assess if the shift in signals is due to a higher level of truthful signals, we run the following random effects probit regression estimating the buyer’s tendency to signal truthfully or not:

\({\text{Probit}}\text{(}Y_{it} = 1\text{) = }\beta_{0} + \beta_{1} \times {\text{Punish}} + \beta_{2} \times {\text{Enforced}} + u_{i} + \varepsilon_{it}\) where \(Y = 1\) if the buyer reports truthfully (i.e., Si = hi) and \(Y = 0\) otherwise. The variables “Enforced” and “Punish” are the treatment dummies. We run separate regressions for each holding cost level. The marginal effects and respective statistics of the probit regression are summarized in Table 4.

Table 4 Probit regression of treatment effects on truthfulness

Overall, the regression results support directionally but not significantly hypothesis 2a, i.e. we observe a higher propensity to report truthfully in the punishment treatment. Taking all holding cost realization into account (column “All”) we see a higher chance to observe truthful behavior in the punishment treatment. A closer look on the separate regressions (columns “hlow, hmed, hhigh” denote the respective selection of included holding cost realizations) shows that this effect is mainly driven by the subjects’ tendency to report more truthfully in the punishment treatment when facing the holding realization \(h_{M}\). Taking the average truthfulness given \(h_{M}\) per subject as an independent observation, the latter observation is weakly significant (Mann–Whitney U (MWU), p < 0.1, one sided). Thus, buyers with holding cost \(h_{M}\) tend to report more truthfully instead of giving the deceptive high cost signal in the punishment treatment.

5.2 Suppliers contract offers

The previous section shows that buyers share their cost information to some extent in all treatments. Yet, even if all buyers truthfully report their holding cost, there would be no effect of communication if the signals are ignored by the suppliers. Table 5 summarizes the frequency of contract offers by types. The high degree of variance in the suppliers’ contract offer behavior shows that we have no empirical support for hypothesis 1c (only screening contracts are offered). A one side binomial test with the alternative hypothesis that the frequency of screening contract is smaller than one is highly significant (p < 0.01) for all subjects in all treatments.

Table 5 Frequency of \(F_{i}\)- and screening-contract offers

Simply comparing the ratio of \(F_{i}\)—contracts to screening-contracts across treatments is not a good measure for the suppliers’ trust in the signal, since a \(F_{i}\)—contract offer is not necessarily related to the respective signal. Furthermore, note that the frequency of signal consistent \(F_{i}\)—contracts does not necessarily measure the suppliers’ trust in the buyers’ signals either. For suppliers to offer signal tailored \(F_{i}\)—contracts, it suffices that the suppliers trust in the buyers’ consistent signal-choice behavior, even if the signals are not truthful. Hence, we must cautiously distinguish between trust in the truthfulness of the signal, trust in signal consistent contract choices, and the lack of strategic risk resulting from indifference contract choices. The first concept plays no role if suppliers offer screening contracts. It is, however, essential for coordinating the supply chain based on signal tailored contracts. The second concept is essential for suppliers’ payoffs, because suppliers’ payoffs only depend on the signal-choice consistency, but not on the truthfulness of the signal. The third concept mainly plays a role when suppliers offer screening contracts, because with a screening contract suppliers’ payoffs can be substantially affected by buyers’ out-of-equilibrium play at minor costs for the buyers.Footnote 10 Comparing the two contract types, we can conjecture that an increase in (either form of) trust or an increase in strategic risk (in the screening contract) will lead to an increased use of tailored contracts (see also hypothesis 3).

The following Fig. 3 shows the development signal tailored contracts over time. The horizontal line indicates the average level of truthfulness across all treatments (33 %). A visual inspection of the graphs clearly highlights fewer signal tailored contracts in the enforced self-selection contract. In turn, hypothesis 2b stating that we observe a higher level of signal tailored contracts in the punishment treatment is not supported.

Fig. 3
figure 3

Frequency of tailored \(F_{i}\) contract offers

In order to further assess the supplier’s contract offer behavior, we run the following random effects probit regression that estimates the supplier’s tendency to tailor the contract to the signal or not. We also run a separate regression for SL, SM, and SH. The regression results and respective statistics are summarized in Table 6.

Table 6 Probit regression of treatment effects on tailored contracts

Overall, we do not find support for our hypothesis that the punishment option (hypothesis 2b) increases trust in signals. However, the above regression indicates that tailored contracts are less likely to be observed in the enforced self-selection than in the baseline treatment. A closer look on the frequency of contract offers (see Table 7) indicates that the lower level of tailored contracts is compensated by a higher level of screening contract offers. A visual inspection of the frequency of screening contract offers by period and treatment in Fig. 4 supports the hypothesis of more screening contract offers in the enforced self-selection treatment. The horizontal line depicts the average level of screening contract offers across all treatments (52.5 %).

Table 7 Probit regression of treatment effects on screening contract offers
Fig. 4
figure 4

Frequency of screening contract offers by period and treatment

We run a random effects probit regression to test for differences in screening contracts offers across treatments (i.e., \(Y = 1\) if the supplier offered the screening contract and \(Y = 0\) otherwise). The regression results are significantly in favor of hypothesis 3, i.e., the suppliers in the enforced self-selection treatment show a significantly higher propensity to offer screening contracts in the enforced self-selection treatment than in the baseline treatment.

We finally test over all treatments for a dependency of signals to contract offers with random effect probit regressions by contract offers (see Table 8). The results show that contract offers significantly depend on the respective signals, i.e., the likelihoods of observing an \(F_{i}\)—contract are significantly higher if the buyer sends the respective signal \(S_{i}\). The likelihood of observing screening contract, in turn, is significantly higher when observing the signal \(S_{No}\) (i.e. the marginal effects for \(S_{M}\) and \(S_{H}\) are significantly negative and for \(S_{L}\) not significantly different from zero). Thus, Hypothesis 1b (“Suppliers ignore signals”) is rejected.

Table 8 Probit regression of signaled holding cost on contract offers

5.3 Buyers contract choice behavior

Table 9 gives an overview of buyers’ contract choices. We divide the buyers’ contract choices into five categories: (1) the buyer chooses the profit maximizing contract, (2) the buyer chooses the indifference-contract, i.e. she chooses \(A_{M}\) \(\left( {A_{H} } \right)\) although she faces holding costs of \(h_{L} \left( {h_{M} } \right)\), (3) the buyer chooses the alternative supplier when (almost) indifferent between the offered contract and the outside option, i.e., whenever she faces holding costs \(h_{H}\) or is offered the tailored contract that fits her actual holding cost \(F_{i} = h_{i} ,i \in \left( {L,M} \right)\), (4) the buyer chooses the alternative supplier because the participation constraint is not satisfied, which is only possible if the supplier offers \(F_{L}\) or \(F_{M}\), and (5) the buyer has a loss greater than 0.1. The values in Table 9 indicate the percentages of the cases in each treatment.

Table 9 Buyers’ contract choices

First note that only 3 % of all choices fall in category 5 and therefore 97 % of all observations are optimal or nearly optimal contract choices. However, we find overall a substantial amount of only nearly profit maximizing behavior (i.e., categories (2) and (3)). Overall, the results only partly confirm hypothesis 1d (i.e. profit maximization of buyers), leaving a substantial number of non-profit maximizing choices (≈23 %; cases 2–5). In most of these cases, however, the buyer incurs only a marginal loss (≈20 %, cases 2–4). A one sided binomial test with the null hypothesis that only self-selection contracts are chosen once the screening was offered is rejected for 26 of 36 subjects (p < 0.01, one-sided). On an aggregate level (i.e. over all subjects), the null hypothesis is rejected. Thus, while we observe that some buyers always self-select (10 of 36), we have to reject hypothesis 1d (“Buyers self-select”).

In order to test for treatment effects, we compared the average contract choice frequencies of the buyers in the respective treatments with a Mann–Whitney U test. We did not find any significant differences in the contract choice behavior except the following. There are significantly less rejections in case of violated participations constraints (category 4) in the enforced self-selection treatment than in the baseline treatment (MWU, \(\alpha < 0.1\), two-sided). This is due to the fact that violations of the participation constraints are only possible in case of (mis) tailored contracts. Because tailored contracts are offered significantly less often in the enforced-self-selection treatment (see Table 5), we observe such instances less often.

We further analyze the buyer’s contract choice behavior by analyzing the interaction between consistency and profit maximizing behavior by contract offer, i.e., \({\text{F}}_{\text{i}}\) contracts (Table 10) and menu of contracts (Table 11). We skip \(S_{No}\) observations (151 observations) from this analysis. Clearly, inconsistencies are rather low under \({\text{F}}_{\text{i}}\) contracts and only observed along with profit maximizing contract choices. In these cases, the buyer rejects \({\text{F}}_{\text{i}}\) offers although their reported holding cost level suggests otherwise. In turn, inconsistencies are frequently observed under menus of contracts and highest in the enforced self-selection treatment, highlighting that inconsistencies are more likely to occur when deceptive signals cannot be covered up.

Table 10 Buyers’ contract choices by consistency and (non) profit maximization for \(F_{i}\) contracts
Table 11 Buyers’ contract choices by consistency and (non) profit maximization for menu of contracts

Figure 5 shows signal consistent contract choices by treatment and period. The horizontal line depicts the average level of signal consistent contract choice, i.e., 79 %. A visual inspection confirms the overall lower level of inconsistency in the enforced self-selection treatment and shows a negative trend as time goes by.

Fig. 5
figure 5

Signal consistent contract choice

We finally present random effects probit regressions by signal to provide further insights on the consistency of signals (see Table 12). The analysis confirms that consistency in enforced self-selection treatment is significantly lower than in the baseline treatment. However, consistency between the baseline and the punishment treatment is not significantly different. Furthermore, the analysis shows that inconsistency in the enforced self-selection treatment is strongly driven by the strategically most relevant high cost signal.Footnote 11

Table 12 Probit regression of treatment effects on consistency

5.4 Cooperation, contract rejection, punishments, and rewards

Table 13 summarizes the number of contract offers by categories and treatments, how many of these contracts are tailored to the signal, the rejection rate, the average reward and the absolute number of rewards given in the respective contract offer state.

Table 13 Contract offers, rejection rates and awards by treatments and contract type

The surprising finding is that the frequency of simple contracts is rather low for the low and medium signals even though buyers cannot use these signals strategically. Since a large fraction of the low and medium signals are actually truthful, we would expect suppliers to offer more tailored contracts after low/medium signals than after high signals. Note, however, that the rejection rates of the tailored contracts for low/medium signals are extremely high. This indicates that buyers who truthfully signal low/medium cost prefer screening contracts to tailored contracts, because with screening they do not pass the entire cost advantage to the supplier. The high rejection rates expose the suppliers to a high degree of strategic risk, which they avoid by offering screening contracts after receiving a low/medium signal.

An alternative to using screening contracts for profit sharing in the case of low and medium cost would be a tailored contract combined with a sufficiently large reward payment. As we can see in Table 13 there are two reasons why this mechanism does not work in our experiment. First, the tailored contracts are frequently rejected making it impossible for suppliers to share profits by providing rewards. Second, even in those cases in which tailored contracts are implemented, the suppliers often provide rewards that are too low to establish win–win outcomes. In particular, given the contract choice FL the minimum win–win reward is 43.77 and given the contract choice FM the minimum win–win reward is 18.78.Footnote 12 As shown in Table 13, we see that even in those rounds in which rewards were provided, only 50 % of the rewards lead to a win–win outcome for the supply chain parties. Even though we must reject the strictly payoff maximizing hypothesis 1e (binomial test, p < 0.01, one-sided), because we observe a substantial number of non-zero rewards, we also cannot detect a cooperation sustaining behavioral effect of rewards.

While we reject hypothesis 1e that punishments are not observed (binomial test, p < 0.01, one-sided), it is worth mentioning that the punishment option is only used rarely in. The punishment option was only used in about 6 % of the rounds (21 of 360) with an average punishment size of 25. Only 6 of 12 subjects in the punishment treatment used this option. Nevertheless, the observed differences between the punishment and the baseline treatment indicate that the mere presence of the punishment option has an impact on supply chain behavior.

5.5 Supply chain performance

We compare the observed supply chain performance to three benchmarks in Table 14. The first benchmark is the supply chain efficient outcome (first best). The second benchmark (Screening) is the equilibrium performance based on self-selection in screening contracts (second best). The third benchmark (F high ) captures the situation in which the supplier always offers FH. This benchmark is meaningful because for this contract type the participation constraint of all buyers is satisfied and the variance of the supplier’s payoff is zero. Moreover, this contract yields the highest expected supply chain profits among the \(F_{i}\) contracts.

Table 14 Average deviations from the respective benchmarks

We have calculated each benchmark given the actual realization of the holding cost. Conditioning the benchmarks on the actual realization gives more exact measures than the expected benchmark, because the effect of stochastic cost variations is neutralized. We exclude all observations in which the alternative supplier is chosen, since additional assumptions are required regarding the welfare effects of these choices.

Table 14 shows the average benchmark deviation in percentage of the supply chain optimal solution. The significance levels based on a Wilcoxon test show that we have a significantly lower performance than the supply chain optimum in all three treatments. Yet, the supply chain performance is not significantly lower than the second best benchmark in the enforced self-selection contract. Moreover, when comparing the observed performance to the Fhigh benchmark we see a significant improvement in both the punishment treatment and the enforced self-selection treatment. Thus, while we do not see a significant difference in the baseline treatment to the counterfactual benchmark in which the tailored contract is offered in every period, we see such an improvement in our control treatments.

We additionally performed OLS regression to test treatment effects on the deviation from the screening benchmark (see Table 15). We included dummies for medium and high holding cost realizations. As expected, we observe that the average deviation of observed efficiency from the screening benchmark is significantly negative (constant = −12.27) in all treatments. Since tailored contracts are only efficiency enhancing for low and medium contracts, we observe significantly positive effects for these cost realizations. Most importantly, we observe a highly significant effect of enforcing self-selection, i.e., the supply chain performance is on average 4.3 points closer to the screening benchmark. We also observe a weakly significant effect of the punishment option, i.e., the performance is on average 1.3 points closer to the screening benchmark. Note, however, that the effect of the punishment option is on the border of significance. Future research—perhaps on variants of the game—will show how robust this effect actually is.

Table 15 OLS Regression–deviation from screening benchmark

The analysis highlights, that there are basically two means for enhancing supply chain performance. First, if a punishment mechanism can be installed, then the buyer tend to send more trustworthy signals leading to a better alignment of trust and trustworthiness. Second, if self-selection is enforced, the supply chain performance can be enhanced because the effectiveness of the screening contract is improved due to the obliteration of indifference contract choices. Without indifference contract choices, the number of screening contract offers increases. Correspondingly, we observe significantly more screening contracts (enforced self-selection: 68 %; baseline: 45 %) and a significantly better performance in the enforced self-selection treatment than in the baseline treatment. Note that even if no buyer in the baseline treatment had chosen an indifference contract after being offered a screening contract, the performance would still be lower than the screening benchmark. The important issue here is that suppliers offer too few screening contracts in the baseline treatment due to strategic risk of indifference contract choices.

6 Discussion and managerial insights

The main message of our study is that supply chain environments with a high degree of strategic risk, such as the one we study in the baseline treatment, are detrimental to supply chain performance and should be avoided as much as possible. We examine two possible ways to reduce the degree of strategic risk. In our punishment treatment, the supplier has the possibility to punish buyers who exhibit apparently uncooperative behavior, i.e., if the high cost signal is send significantly too frequently or if there is an inconsistency between the signaled cost and the selected contract option. In our enforced self-selection treatment, we technically limit strategic risk by ruling out the buyers’ possibility to choose indifference contracts, i.e., contracts that entail a small loss for the buyer but a substantial loss for the supplier.

Our results show that a reliable second best supply chain outcome can only be achieved with screening contracts if the incentives for self-selection are sufficiently high to reduce the behavioral variance in contract choices and, thus, the strategic risk that the suppliers face. Clearly, our experimental design, which provides the most extreme form of incentives for self-selection (i.e. the buyers have no other option to earn income, except by choosing the self-selection contract), cannot be used to assess the optimal threshold for the buyers’ self-selection incentives. While theoretical approaches to robust contracts are already available (see Laffont and Martimort 2002, Chapter 9.8.1), the assessment of optimal incentive thresholds in the field will depend on specific market parameters and characteristics.

However, when managers can only choose between classical screening contracts with marginal incentives and tailored contracts, we observe that a punishment option increases trustworthiness of buyers’ signals particularly for medium holding cost levels. In this environment, suppliers can tailor contracts to achieve a supply chain performance that is enhanced compared to a baseline without punishment.

In essence, our study shows that contract design should factor in behavioral mechanisms that reduce the strategic risk inherent in many supply chain interactions. On the one hand, allocation mechanisms that seem ineffective from a game-theoretic perspective (such as our punishment option) can reduce the strategic risk resulting from buyers’ deceptive signals. The reduction of strategic risk leads to more coordinated supply chain outcomes. On the other hand, contractible incentives that are excessive from a game-theoretic perspective can also reduce behavioral variance, increasing suppliers’ willingness to offer screening contracts and, thus, enhancing supply chain coordination. Hence, it seems that from a behavioral perspective the reduction of strategic risk is essential in a supply chain setting no matter which type of contract is considered.

There are several promising directions for future research. It seems interesting to further understand the variance in buyer’s contract choice behavior. Theoretical explanations range from bounded rationality (see Basov 2009) to social preferences (see Voigt 2015). A better understanding of the underlying behavioral concepts might help to identify other forms of behavioral mechanisms that can reduce strategic risk. As an example, while we observe that a punishment option seems to be relevant for reducing strategic risk, it seems that the reward mechanism as it was provided in the present study (i.e., voluntary payment after contract choice) is not appropriate for incentivizing cooperative play, since we hardly identify rounds in which win–win communication took place. Thus, different forms of reward mechanisms might be analyzed in future work (e.g., discounts are given before the buyer chooses a contract). Finally, we should point out that the communication technology used in our experiment is rather crude. More elaborate communication, e.g. face-to-face negotiations, may enhance the supply chain outcomes, especially since there is evidence that the means of communication can affect outcomes in other settings (Valley et al. 1998; Brosig et al. 2003).