The “Win-Continue, Lose-Reverse” Rule in Cournot Oligopolies: Robustness of Collusive Outcomes

Izquierdo, Segismundo S.; Izquierdo, Luis R.

doi:10.1007/978-3-319-09578-3_3

Segismundo S. Izquierdo¹⁵ &
Luis R. Izquierdo¹⁶

Part of the book series: Lecture Notes in Economics and Mathematical Systems ((LNE,volume 676))

1023 Accesses
4 Citations
1 Altmetric

Abstract

The so-called “Win-Continue, Lose-Reverse” (WCLR) rule is a simple procedure that can be used to choose a value for any numeric variable (e.g. setting a production level to maximise profit). The rule dictates that one should evaluate the consequences of the last adjustment made to the value (e.g. an increase or a decrease in production), and keep on changing the value in the same direction if the adjustment led to an improvement (e.g. if it led to greater profits), or reverse the direction of change otherwise. Somewhat surprisingly, this simple rule has been shown to lead to collusive outcomes in Cournot oligopolies, even though its application requires no information whatsoever about the choices made by any competing firms or about their results. Firms applying the WCLR rule need only know whether the last change in their own production turned out to be profitable or not; thus, there is no room for explicit coordination or collusion. In this paper we show that the convergence of the WCLR rule towards collusive outcomes can be very sensitive to small independent perturbations in the cost functions and in the income functions of the firms. These perturbations typically push the process towards the Cournot–Nash equilibrium of the one-shot game. Importantly, the destabilizing power of the independent perturbations is mainly due to the fact that they create miscoordination among the firms. In fact, if there is correlation between the perturbations, their impact on the dynamics of the model is not so dramatic.

Access provided by Autonomous University of Puebla. Download conference paper PDF

On the existence and stability of equilibria in N-firm Cournot–Bertrand oligopolies

Article 17 December 2019

Collusion in mixed oligopolies and the coordinated effects of privatization

Article 20 August 2017

Sustaining collusion in markets with entry driven by balanced growth

Article 21 November 2015

Keywords

1 Introduction and Motivation

It is generally recognised that the actual decision-making processes followed by real-world firms when they have to set prices or production levels have often little to do with those assumed in the idealized analytical framework of perfect information.^{Footnote 1} In practice, the use of simple revisable strategies, imitation tactics and rules of thumb seems to be a key ingredient of many decision processes. Thus, when analysing a market and its expected behaviour, it seems valuable to go beyond the perfect-information analysis, and also consider other decision procedures that enjoy greater empirical support and which may be deemed plausible for the context at hand. This point is particularly relevant in markets potentially subject to regulation (e.g. oligopolies) and in situations where the perfect-information theoretical analysis of the social interaction reveals the presence of multiple possible equilibria—as is often the case in indefinitely repeated strategic interactions, including oligopolies in particular. Consequently, several different rules for setting prices or production levels in oligopolies have been analyzed. Bigoni and Fort (2013) provide a recent review of the theoretical and experimental literature on learning in oligopolies.

In this paper we analyse Cournot oligopolies in which some firms provide a homogeneous good or service and have to choose their production level q _i. We consider that the market process advances in discrete time steps and at every time step the companies have to simultaneously choose whether to increase or decrease the value of their decision variable (q _i). The decision rule considered here can be simply stated as: repeat your last action (i.e. an increase or a decrease in production) if your profits have grown; otherwise, choose the opposite action. This simple rule has been named “Win-Continue, Lose-Reverse” (WCLR) by Huck et al. (2003),^{Footnote 2} who conducted a thorough study of its convergence properties in symmetric Cournot oligopolies.

The WCLR rule adjusts the level of the decision variable in the direction that is expected to make profits grow, according to the observed effect on profits of the last increment/decrement. Note that this gradual adjustment strategy can be considered a type of reinforcement learning rule: an action (i.e. an increase or decrease in production) is deemed satisfactory—and therefore repeated—if it provides a profit boost, and it is considered unsatisfactory—and therefore avoided—otherwise.

Mathematically, the WCLR strategy presents some similarities with a gradient ascent optimization method. In fact, if the profits of a company were to depend only on its own price or level of production (as in a monopoly with stable demand and costs), this rule would be a gradient ascent method and, under conditions that are well known in the optimization literature (Snyman 2005), it would lead to the vicinity of a local optimum. In a duopoly, however, the profits of a company depend also on its competitor’s price or output level, and the application of the WCLR rule by each of the companies independently does not constitute a gradient ascent method for the joint profit of the two companies. Thus, it is interesting to study to which reference point of the strategic game (e.g. collusive outcome, competitive outcome, or one-shot Nash equilibrium) such a simple strategy converges, if it does converge to any at all.

For a Cournot duopoly in which companies vary their production levels q _i by a predefined amount δ (step size), Huck et al. (2003) show that, under rather general conditions, for small values of δ, the quantities q _i converge to a small area around the cooperative (collusive) solution. In this paper, we show that the convergence of the WCLR rule to collusive outcomes is not robust to small independent perturbations in the profit functions of the firms (e.g., small independent variations in the cost functions, or small differences on the price received by each company). The existence of such small independent perturbations tends to push the process towards the Nash equilibrium of the one-shot game.

The structure of the remaining of the paper is very simple: in Sect. 2 we present the results for the Cournot model, and then we end with the conclusions.

2 Competition in Quantities: Cournot Model

In this section we analyse a Cournot duopoly in which at every time step t (t = 0, 1, …) each company i (i = 1, 2) chooses a production level or quantity [q _i]_t. The market price [p]_t is the same for both companies and it depends on the total quantity produced by both firms. The amount [q _i]_t is produced on period t with a cost function C(q). The profit for each company on period t is [π _i]_t = [p]_t [q _i]_t − C([q _i]_t). Incremental values are naturally defined as [∆π _i]_t := [π _i]_t − [π _i]_t−1, for t > 0, and initial values at time step 0 are [∆π _i]₀ = 0, and [∆q _i]₀ = 0.

Let us also define [s _i]_t := sign ([∆q _i]_t [∆π _i]_t). Note that [s _i]_t is equal to +1 if the last changes in [q _i]_t and [π _i]_t took place in the same direction, and [s _i]_t is equal to −1 if such changes went in opposite directions.

For each company i, the production levels are calculated as [q _i]_t+1 = max([q _i]_t +[∆q _i]_t+1 , 0), starting with some initial positive production level [q _i]₀ at time step 0. The decision rule WCLR used to calculate the production increments [∆q _i]_t+1 is implemented as follows:WCLR Rule:

If t = 0 or [s _i]_t = 0, then [∆q _i]_t+1 takes one random value out of the set {−δ_i, 0, δ_i}, where δ_i is the step size.
Otherwise, [∆q _i]_t+1 = δ_i [s _i]_t.

It is also assumed that the process includes some “noise” such that, with a small probability ε for each company in every period, the company will deviate from the value prescribed above for [∆q _i]_t+1 and will take a random choice out of the set {−δ_i, 0, δ_i}. This “decision noise” can represent occasional mistakes or experimentation.

Huck et al. (2003) prove that, with δ _i = δ, under rather general conditions, if the step size δ and the noise level ε are sufficiently small (but strictly positive), in the long run the process [q ₁, q ₂]_t will spend most of the time in a small neighbourhood around the collusive outcome, and their simulations show a quick convergence to that situation. The remaining of this section is devoted to show that this convergence can be very sensitive to small independent perturbations in the profit functions of the firms. The reader can run all the simulations reported here using the online model at http://luis.izqui.org/models/wc-lr-cournot/. The computer model has been implemented in NetLogo (Wilensky 1999).

2.1 The WCLR Rule in the Cournot Model with Noise

For illustrative purposes we consider a linear inverse demand function: p = max(100 − (q ₁ + q ₂), 0) and a quadratic cost function: C(q) = 10q + 0.1q ². In this situation, the collusive value for the production of each company, characterized by the first-order conditions $ \dfrac{\partial \left({\pi}_1+{\pi}_2\right)}{\partial {q}_i}=0 $, is q _i = 21.43, which corresponds to a price level p = 57.14. The Cournot equilibrium, characterized by the equations $ \dfrac{\partial {\pi}_i}{\partial {q}_i}=0 $, is q _i = 28.13, corresponding to a price level p = 43.75.

We also set δ_i = 0.1 and ε = 0.01. Initial levels of production [q _i]₀ are set randomly in the range [0, 50], but note that the model is ergodic (since ε > 0); thus, its long-run behaviour does not depend on initial conditions.

Departing from the baseline scenario above, we study the sensitivity of the model to three types of noise:

1.
“Decision noise”, characterised by the parameter ε, as described above.
2.
“Cost noise”, characterised by the parameter ε _cost, and implemented by altering each firm’s base cost according to the following formula:
$$ {c}_i=\left(10{q}_i + 0.1{q_i}^2\right)\ \left(1 + {\varepsilon}_{\it cost}{\mathrm{U}}_i\left[-1,1\right]\right) $$
where U_i[−1,1] denotes a continuous uniform random variable with range [−1,1].
3.
“Price noise”, characterised by the parameter ε _price, and implemented by giving each firm i a price p _i according to the following formula:
$$ {p}_i = p\left(1 + {\varepsilon}_{\it price}{\mathrm{U}}_i\left[-1,1\right]\right) $$
where p is the price that corresponds to the total level of output using the inverse demand function. This modified model represents small differences in the price that each company gets for its products, which can be due to a number of different reasons, such as random deviations in the quality of the products of a company with respect to the average quality, different times of arrival at the market (which would allow for some variability in demand), different intermediaries with variable commissions, existence of local markets (which would allow for some variability in price), etc.

Figure 1 below shows a representative run for each of the three types of noise.^{Footnote 3} It is clear that in the absence of cost noise or price noise, the WCLR rule leads to the collusive outcome, as shown by Huck et al. (2003). In stark contrast, small independent perturbations in the cost function or in the price function seem to destabilise the collusive outcome and push the simulation towards the Cournot equilibrium. The sensitivity of the model to perturbations in price seems to be greater than the sensitivity to perturbations in cost.

To study this effect rigorously, we conducted a computational experiment where we explored different values of ε, ε _cost, and ε _price. For each value of these variables we conducted 100 simulation runs, and for each of the runs we computed the average price in the simulation (taken over 10⁵ time steps, and neglecting the first 10⁴ time steps). Figures 2, 3 and 4 below show the results obtained.

Figure 2 shows that the WCLR rule leads to collusive outcomes even if the probability of a random decision is fairly high. Figure 3, in contrast, shows that small perturbations in the cost functions of the firms destabilise the collusive outcome and push the process towards the Cournot–Nash equilibrium of the one-shot game. In the same spirit, Fig. 4 shows that the sensitivity of the model to small perturbations in prices is even higher, and the collusive outcome is completely destabilised in favour of the Cournot–Nash equilibrium for values of the price noise as low as 1 %.

Why is the WCLR rule so robust to “decision noise”, but so sensitive to “cost noise” and “price noise”? Note that the stability of the collusive outcome induced by the WCLR rule relies on coordinated moves. When WCLR firms move in the same direction (either increasing or decreasing production levels), they receive signals that make them move towards the collusive outcome and linger around it. Alternatively, an uncoordinated move in the vicinity of the collusive equilibrium (possibly due to a perturbation) will make both firms move towards the Cournot–Nash equilibrium in the following time step—assuming no more deviations from the WCLR rule occur. Note, however, that this move towards Nash is itself coordinated, so at the following time step, both firms will simultaneously decrease production and they will keep doing so until they return to the neighbourhood of the collusive outcome. This explains why the collusive outcome is so robust to “decision noise”. Decision mistakes have an impact only on the decision at the time step at which they occur, and the process goes back towards collusion automatically in two time-steps.

By contrast, the effects of “cost noise” and “price noise” are more profound, as they do not only affect the decision at the time step they occur, but they also have a direct impact on subsequent decisions. This is because these perturbations effectively change the profit landscape and, in that way, they alter the relation between [∆q _i]_t and [∆π _i]_t. This deeper type of perturbation, which transcends the time step at which it occurs, is a greater source of miscoordination and, as explained above, uncoordinated moves push the process towards the Cournot–Nash equilibrium.

One final question remains to be answered: why does price variability have a greater impact than cost variability? The answer relates to the different strength with which these two sources of miscoordination affect the profit landscape. It turns out that, given the parameter values used in the illustrations above, profits for both firms in the region of interest are always positive and quite sizable, i.e. income is significantly greater than cost for both firms. In such a situation, a certain percentage change x in prices (and, therefore, in income) induces a greater change in profit than the same percentage change x in costs. Greater changes in profit mean higher chances of altering the sign of [∆π _i]_t := [π _i]_t − [π _i]_t−1, and hence, greater impact on the dynamics of the model. Thus, under such favourable circumstances, it is natural that price variability constitutes a greater source of miscoordination than cost variability. If income and cost were closer in magnitude, the sensitivity of the model to these two types of noise—“price noise” and “cost noise”—would also be more alike. This point can be checked adding a fixed cost equal to 900, i.e. the new cost function reads C(q) = 900 + 10q + 0.1q ². This change makes income and cost similar in the region of interest. In these conditions, the observed impact of cost variability was similar to that of price variability.

2.2 Other Noise Distributions

In this section we show that our results are robust to changes in the noise distribution considered for the price or the cost perturbations. To illustrate this fact, we focus here on a normal distribution with the same mean and standard deviation as the uniform distribution U[−1,1], i.e. the normal distribution N[0, 1/3] with mean 0 and variance 1/3.

First, we show in Fig. 5 below a representative run for each of the three types of noise.^{Footnote 4} Figure 5, which uses the noise distribution N[0, 1/3] for the price and the cost perturbations, is analogous to Fig. 1, which used the noise distribution U[−1,1].

To study the robustness to changes in the noise distribution, we conducted a computational experiment where we explored different values of ε _cost and ε _price using the noise distribution N[0, 1/3], in the same spirit as the experiments shown in Figs. 3 and 4 for noise distribution U[−1,1]. Figure 6 below presents the results obtained for ε _cost, which are very similar to those obtained in Fig. 3. The same similarity was obtained for price perturbations (ε _price), showing that the sensitivity of the model to cost and price noise does not depend on whether the noise distribution is a uniform distribution or a normal distribution.

2.3 Correlated Perturbations

In this section, we show that the destabilizing factor of the variability in cost or price is not so much the existence of the perturbations, but the fact that they are somewhat independent or uncorrelated between the firms. To illustrate this, here we consider the effect of correlated perturbations. Correlations would be observed in the real world if there were variations in costs or in the demand function that affected both companies in a similar way (for instance, seasonal demand variability). To study such situations, we model a price perturbation for each firm which is composed of both a common factor α U[−1,1]—with weight α—and an independent factor (1 − α) U_i[−1,1]—with weight (1 − α)—, according to the formula:

$$ {p}_i = p\left(1 + {\varepsilon}_{\it price}{{\mathrm{R}}^{\upalpha}}_i\right) $$

where

$$ {{\mathrm{R}}^{\upalpha}}_i=\upalpha\, \mathrm{U}\left[-1,1\right] + \left(1\ \hbox{--} \upalpha \right)\ {\mathrm{U}}_i\left[-1,1\right]. $$

Thus, parameter α is a measure of the correlation between the perturbations of each firm. Extreme value α = 0 represents completely uncorrelated perturbations (as analyzed above), and extreme value α = 1 represents full correlation (where the perturbations for each firm are exactly the same). Figure 7 below shows that the more correlated perturbations are, the less impact they have on destabilising the collusive outcome. As explained before, perturbations affect the dynamics of the model mainly through the generation of miscoordination between the firms; thus, it is natural that the impact of correlated noise, which does not cause so much miscoordination, is less acute than the effect of uncorrelated perturbations.

2.4 More than Two Competing Firms

The simulation results of Huck et al. (2003) in symmetric oligopolies with more than two competing firms (up to ten) and some small decision noise also showed convergence of the WCLR rule to collusive outcomes. We show in Fig. 8 below that, as in the duopoly case, the existence of small independent perturbations in the price that each company obtains also destabilises the collusive outcome and pushes the process towards the Nash equilibrium of the one-shot game.

Uncorrelated perturbations in cost have the same qualitative effect, so they are not shown here.

It should also be noted that, as the number of competing firms increase, the one-shot Cournot–Nash equilibrium gets closer to the outcome predicted under the assumption of perfect competition, so, as the number of firms increase, the WCLR rule with independent cost or price perturbations leads to market prices and production levels which approach those predicted by the perfect competition theory. Figure 9 below shows the effect of uncorrelated 2 % price perturbations in oligopolies with different number of firms. The results also show an increasing difference between the simulated price and the Cournot price as the number of firms in the market increases, which can be due to the decreasing marginal importance of one firm in the market as the number of firms in the market increases.

Conclusions

The results obtained by Huck et al. (2003) indicate that the simple, individual, “sensible” and not forward-looking decision rule WCLR (“Win-Continue, Lose-Reverse”) can lead to collusion-like outcomes in Cournot oligopolies, even though each company is independently trying to maximize its own profit, and is acting based only on its own past information. Similar results were obtained by Waltman and Kaymak (2008) considering a more involved learning algorithm (Q-learning). In principle, these results could raise important concerns about the fairness of fining firms in oligopolies for apparently carrying out collusive practices, since one could always allege that observed collusion-like outcomes could just be the unintended result of using this type of independent (and thus legitimate) decision rule.

However, this paper has shown that small independent variations in the cost functions, or small uncorrelated perturbations in the price obtained by each firm, can all destabilize the convergence of the WCLR rule to collusive outcomes, pushing the outcomes towards the Nash solution of the one-shot game. Previous simulation results (Keen and Standish 2006) had also indicated that introducing variability in the step sizes used by each company in each period could also push the process towards the Cournot–Nash solution in markets where firms compete in quantities. Consequently, in markets where there is some independent variability over time in the profit functions of the competing firms (which can be due, for instance, to spatially local effects), our results throw doubts on the validity of arguments that try to justify collusive-like outcomes as the unintended result of this kind of “innocent” decision rules.

Notes

1.
This statement does not necessarily imply that market predictions made using the perfect-information model are irrelevant in real life; the famous “as if” theory of Friedman (Friedman 1953) proves sufficiently accurate and useful in many contexts.
2.
The same authors use the name “trial and error” in (Huck et al. 2004), where they also present and discuss this learning rule in a discrete-time setup, though the analysis in that paper is focused on a continuous version of the process.
3.
Note that the simulation runs with “cost noise” or “price noise” have ε = 0.01 too, as prescribed in the baseline scenario.
4.
Note that the simulation runs with “cost noise” or “price noise” have ε = 0.01 too, as prescribed in the baseline scenario.

References

Bigoni M, Fort M (2013) Information and learning in oligopoly: an experiment. Games Econ Behav 81(1):192–214
Article Google Scholar
Friedman M (1953) Essays in positive economics. Part I – the methodology of positive economics. University of Chicago Press, Chicago, pp 3–43
Google Scholar
Huck S, Normann HT, Oechssler J (2003) Zero-knowledge cooperation in dilemma games. J Theor Biol 220(1):47–54
Article Google Scholar
Huck S, Normann HT, Oechssler J (2004) Through trial and error to collusion. Int Econ Rev 45(1):205–224
Article Google Scholar
Keen S, Standish R (2006) Profit maximization, industry structure, and competition: a critique of neoclassical theory. Physica A 370(1):81–85
Article Google Scholar
Snyman JA (2005) Practical mathematical optimization: an introduction to basic optimization theory and classical and new gradient-based algorithms. Springer, Heidelberg
Google Scholar
Waltman L, Kaymak U (2008) Q-learning agents in a Cournot oligopoly model. J Econ Dyn Control 32(10):3275–3293
Article Google Scholar
Wilensky U (1999) NetLogo. Center for Connected Learning and Computer-Based Modeling, Northwestern University, Evanston. http://ccl.northwestern.edu/netlogo/

Download references

Acknowledgments

We thank three anonymous reviewers for their useful comments. This work has received financial support from the Spanish Ministry of Science and Innovation (CSD2010-00034, SIMULPAST).

Author information

Authors and Affiliations

Department of Industrial Organization, Universidad de Valladolid, 47011, Valladolid, Spain
Segismundo S. Izquierdo
Department of Civil Engineering, Universidad de Burgos, 09001, Burgos, Spain
Luis R. Izquierdo

Authors

Segismundo S. Izquierdo
View author publications
You can also search for this author in PubMed Google Scholar
Luis R. Izquierdo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Segismundo S. Izquierdo .

Editor information

Editors and Affiliations

University Toulouse 1 Capitole, Toulouse, France
Frédéric Amblard
Laboratory for Socio-Historical Dynamics (LSDS), Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
Francisco J. Miguel
University Toulouse 1 Capitole, Toulouse, France
Adrien Blanchet
University Toulouse 1 Capitole, Toulouse, France
Benoit Gaudou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Izquierdo, S.S., Izquierdo, L.R. (2015). The “Win-Continue, Lose-Reverse” Rule in Cournot Oligopolies: Robustness of Collusive Outcomes. In: Amblard, F., Miguel, F., Blanchet, A., Gaudou, B. (eds) Advances in Artificial Economics. Lecture Notes in Economics and Mathematical Systems, vol 676. Springer, Cham. https://doi.org/10.1007/978-3-319-09578-3_3

Download citation

DOI: https://doi.org/10.1007/978-3-319-09578-3_3
Published: 17 October 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09577-6
Online ISBN: 978-3-319-09578-3
eBook Packages: Business and EconomicsEconomics and Finance (R0)

Publish with us

Policies and ethics