1 Introduction

Game theory is concerned with how rational individuals make decisions when they are mutually interdependent. This theory finds many applications in various branches of economics and areas such as the social sciences, biology and engineering [1,2,3]. With the recent interest in quantum computing and quantum information theory, there has been much effort in trying to recast classical game theory using quantum probability amplitudes, and hence study the effect of quantum superposition, interference and entanglement on the agents’ optimal strategies [4, 5]. The seminal work on quantum game theory by Meyer [6] studied a simple coin tossing game and showed how a player utilizing quantum superposition could win with certainty against a classical player. Since then, many interesting aspects on quantum games with discrete strategies have been studied theoretically. For example, a general protocol for “two-player-two-strategy” quantum games with entanglement was developed by Eisert et al. [7] using the well-known prisoners’ dilemma as an example, and this was extended to multi-player games by Benjamin and Hayden [8]. Besides the games with discrete strategies, Li et al. [9] also investigated the quantization of games with continuous strategic space. They gave a “minimal” quantum structure of Cournot duopoly and showed that the players can escape the frustrating dilemma-like situation, i.e. the unique Nash equilibrium is inferior to the Pareto optimal as in the prisoners’ dilemma, if the structure involves a maximally entangled state. The essence of their “minimal” extension of the classical Cournot duopoly into the quantum domain is that the strategic space is maintained unexpanded and only the initial state is extended to be an entangled state. This extension has the advantage that if the game of nonzero entanglement exhibits any features not seen in the classical game, these features can be completely attributed to the quantum entanglement.

Later, Lo and Kiang [10] generalized Li et al.’s [9] “minimal” quantization rules to both Cournot model and Bertrand model of oligopoly. Inspired by the generalized multimode SU(1,1) displacement operator introduced by Lo and Liu [11], they proposed a unitary entanglement operation of the form

$$\begin{aligned} {\hat{J}}\left( \gamma \right) \equiv \exp \left\{ -\frac{1}{2} \sum _{i,j=1}^{N}\beta _{ij}\left( {\hat{a}}_{i}^{\dag }{\hat{a}}_{j}^{\dag }- {\hat{a}}_{i}{\hat{a}}_{j}\right) \right\} , \end{aligned}$$
(1)

where \(\beta _{ij}=2\gamma \left( 1/N-\delta _{ij}\right) \), and the operators \({\hat{a}}_{i}\) and \({\hat{a}}_{i}^{\dag }\) are the annihilation and creation operators of the ith field mode, respectively, satisfying the commutation relations: \(\big [ {\hat{a}}_{i},{\hat{a}}_{j}^{\dag } \big ] \) \(=\delta _{ij}\) and \(\big [ {\hat{a}}_{i},{\hat{a}}_{j}\big ] =0\). It should be noted that the entanglement operation \({\hat{J}}\left( \gamma \right) \) is symmetric with respect to the interchange of any pair of field modes, and that the matrix \({\varvec{\beta }}\) has been chosen such that in the limit of \(N\rightarrow \) \(\infty \) there is no entanglement in order to guarantee a perfectly competitive market. Following Li et al. [9], they assumed the starting state to be the tensor product of N single-mode vacuum states which are subsequently entangled via the unitary operation \({\hat{J}} ( \gamma ) \) described in Eq. (1). The resultant state \(|\psi _{i}\rangle \equiv {\hat{J}}( \gamma ) |\)vac\(\rangle _{1}\otimes |\) vac\(\rangle _{2}\otimes |\)vac\(\rangle _{3}\cdots \otimes |\)vac\(\rangle _{N}\) is a generalized multimode squeezed vacuum state introduced by Lo and Sollie [12]. Then, all the N firms execute their strategic moves via the unitary operations \(\big \{ {\hat{D}}_{i}( x_{i}) \equiv \exp \big \{ x_{i}\big ( {\hat{a}}_{i}^{\dag }-{\hat{a}}_{i}\big ) /\sqrt{2}\big \};\;i=1,2,3,\ldots ,N\big \} \), respectively. Having executed their moves, the firms forward their electromagnetic field modes to the final measurement, prior to which a disentanglement operation \({\hat{J}}( \gamma ) ^{\dag }\) is performed. Thus, the final state prior to the measurement is given by \(|\psi _{f}\rangle \equiv {\hat{J}}( \gamma ) ^{\dag } \big [ \prod _{i=1}^{N}{\hat{D}}_{i}( x_{i}) \big ] {\hat{J}}( \gamma ) |\)vac\(\rangle _{1}\otimes |\)vac\(\rangle _{2}\otimes |\)vac\( \rangle _{3}\cdots \otimes |\)vac\(\rangle _{N}\). The final measurement is made corresponding to the observables \(\{ {\hat{X}}_{i}\equiv ( {\hat{a}}_{i}^{\dag }+{\hat{a}}_{i}) /\sqrt{2};\;i=1,2,3,\ldots ,N\} \). It is obvious that without quantum entanglement, namely \(\gamma =0\), this quantum structure can be easily shown to yield a faithful representation of the classical game, and the final measurement gives the original classical results: \(q_{i}\equiv \langle \psi _{f}|{\hat{X}}_{i}|\psi _{f}\rangle =x_{i}\) for \(i=1\), 2, 3,..., N. On the other hand, for any case of nonzero quantum entanglement, it is straightforward to show that the final measurement yields the following respective quantities of the firms:

$$\begin{aligned} q_{i}=\sum _{j=1}^{N}\left( \exp \left( {\varvec{\beta }}\right) \right) _{ij}x_{j}=x_{i}\exp \left( -2\gamma \right) +\frac{1}{N}\left[ 1-\exp \left( -2\gamma \right) \right] \sum _{j=1}^{N}x_{j}. \end{aligned}$$
(2)

In the Cournot model the profits of the firms can then be evaluated as:

$$\begin{aligned} u_{i}=q_{i}\left( k-\sum _{j=1}^{N}q_{j}\right) \quad ;\quad j=1,2,3,\ldots ,N. \end{aligned}$$
(3)

Solving for the Nash equilibrium yields the unique solution

$$\begin{aligned} q_{i}^{*}=\frac{k}{N+1}\left\{ \frac{1-\left[ 1-\exp \left( 2\gamma \right) \right] /N}{1-2\left[ 1-\exp \left( 2\gamma \right) \right] /\left( N+1\right) }\right\} \end{aligned}$$
(4)

for \(i=1,2,3,\ldots ,N\). The corresponding profits of the firms at this equilibrium are given by

$$\begin{aligned} u_{i}^{*}=\frac{k^{2}}{\left( N+1\right) ^{2}}\left\{ \frac{\left\{ 1- \left[ 1-\exp \left( 2\gamma \right) \right] /N\right\} \exp \left( 2\gamma \right) }{\left\{ 1-2\left[ 1-\exp \left( 2\gamma \right) \right] /\left( N+1\right) \right\} ^{2}}\right\} ;\quad i=1,2,3,\ldots ,N. \end{aligned}$$
(5)

As a result, a “minimal” quantum extension of the classical Cournot N-opoly is obtained.

Furthermore, Zhou et al. [13] extended the quantum Cournot duopoly to three firms with different pair-wise entanglement parameters. They found that the sum of the firms’ profit increases monotonically to a limit as the three entanglement parameters tend to infinity. Indeed, the sum of the three firms’ profit will attain the same asymptotic limit even if there are only two pair-wise entanglement parameters (that is, one of the entanglement parameter is zero), provided these two parameters tend to infinity. They also considered the case for an arbitrary number, N, of firms, but then restricted themselves in this case to a common entanglement parameter.Footnote 1 Such a situation had been discussed by Lo and Kiang [10], though a different form for the entanglement parameter was used.

In the Cournot oligopoly, the firms assume a symmetric role, and the presence of different pair-wise entanglement parameters seems artificial. In contrast, there is an inherent asymmetry in the Stackelberg duopoly, where there is a first mover and a follower. Unlike the Cournot duopoly in which both firms make their strategic moves at the same time and thus have to guess what the action of their opponents would be, the Stackelberg duopoly allows one of the firms to move first. Since the other firm can now observe its opponent’s move before making its own decision, the game can no longer be modelled as static. In such a dynamic game, there exists an interesting result: there is a clear advantage to moving first. This first-mover advantage in the Stackelberg duopoly is due to the fact that being able to make its strategic decision known, the first mover does not need to guess what the follower will do because the follower is assumed to behave optimally. We therefore believe that it is more natural to have differential entanglement in a multi-player quantum Stackelberg game than in a Cournot one. In this paper we study this scenario and, in particular, we examine how the differential entanglement affects the first-mover advantage. Our analysis indicates that for a small number of firms the leaders could lose the first-mover advantage and might experience a disadvantage instead. This surprising feature is certainly not expected in the classical game. The general scheme of this paper is as follows: in the next section the model of quantum Stackelberg oligopoly with differential pair-wise entanglement is proposed, and its solution is discussed in detail in Sect. 3. The last section concludes our findings.

2 Quantum Stackelberg oligopoly

In the Stackelberg model of oligopoly some of the firms, namely the leaders, are allowed to move first while the remaining firms, namely the followers, can observe the leaders’ moves before making their own choices. To find the solution to this oligopoly problem, the concept of subgame perfect equilibrium or backwards induction is needed [1,2,3]. This means we first solve for the followers: each of the followers reacts optimally to the moves of both the leaders and other followers, by making his/her own choice which maximizes his/her profits. Then, each of the leaders takes into account the optimal response of the followers and set his/her own move in such a way as to maximize his/her own profits against the followers and other leaders. That is, the leaders substitute the followers’ reaction functions into their own profit functions and then optimize their moves accordingly. An appropriate candidate of the entanglement operator used in quantizing the Stackelberg oligopoly is given by

$$\begin{aligned} {\hat{J}}\left( \gamma ,\lambda ,\xi \right)= & {} \exp \left\{ -\frac{1}{2} \sum _{i,j=1}^{N}\beta _{ij}\left( {\hat{a}}_{i}^{\dag }{\hat{a}}_{j}^{\dag }- {\hat{a}}_{i}{\hat{a}}_{j}\right) \right\} \exp \left\{ -\frac{1}{2} \sum _{i,j=1}^{M}\alpha _{ij}\left( {\hat{b}}_{i}^{\dag }{\hat{b}}_{j}^{\dag }- {\hat{b}}_{i}{\hat{b}}_{j}\right) \right\} \nonumber \\&\times \exp \left\{ -\frac{\xi }{NM}\sum _{i=1}^{N}\sum _{j=1}^{M}\left( {\hat{a}}_{i}^{\dag }{\hat{b}}_{j}^{\dag }-{\hat{a}}_{i}{\hat{b}}_{j}\right) \right\} \end{aligned}$$
(6)

for \(\beta _{ij}=2\gamma ( 1/N-\delta _{ij}) \) and \(\alpha _{ij}=2\lambda ( 1/M-\delta _{ij}) \). The operators \({\hat{a}} _{i} \) and \({\hat{a}}_{i}^{\dag }\) represent the annihilation and creation operators of the followers, while the operators \({\hat{b}}_{i}\) and \({\hat{b}} _{i}^{\dag }\) are associated with the leaders. These operators satisfy the commutation relations: \(\big [ {\hat{a}}_{i},{\hat{a}}_{j}^{\dag }\big ] =\big [ {\hat{b}}_{i},{\hat{b}}_{j}^{\dag }\big ] =\delta _{ij}\) and \(\big [ {\hat{a}} _{i},{\hat{a}}_{j}\big ] =\big [ {\hat{b}}_{i},{\hat{b}}_{j}\big ] =\big [ \hat{a }_{i},{\hat{b}}_{j}\big ] =\big [ \hat{a}_{i},\hat{b}_{j}^{\dag }\big ] =0\). Unlike the entanglement operator used in the Cournot model, this operator takes into account the inherent asymmetry in the Stackelberg model. Furthermore, for \(M=N=1\) and \(\gamma =\lambda =0\), this model is reduced to the quantum Stackelberg duopoly model studied by Lo and Kiang [14].

For simplicity, in the following we shall concentrate on two special cases, namely the case of a single leader with N followers and the case of a single follower with M leaders, in order to demonstrate the effect of quantum entanglement on the outcomes of the Stackelberg oligopoly. It is obvious that the former case is obtained by setting \(M=1\) and \(\lambda =0\), while requiring \(N=1\) and \(\gamma =0\) yields the latter case.

2.1 Special case of a single leader with N followers

In the special case of a single leader with N followers, there can be one entanglement parameter among the fellow followers, and a different one between the first mover and the followers. Thus, the entanglement operator becomes

$$\begin{aligned} \hat{J}\left( \gamma ,\xi \right) =\exp \left\{ -\frac{1}{2} \sum _{i,j=1}^{N}\beta _{ij}\left( \hat{a}_{i}^{\dag }\hat{a}_{j}^{\dag }- \hat{a}_{i}\hat{a}_{j}\right) \right\} \exp \left\{ -\frac{\xi }{N} \sum _{i=1}^{N}\left( \hat{a}_{i}^{\dag }\hat{b}^{\dag }-\hat{a}_{i}\hat{b} \right) \right\} . \end{aligned}$$
(7)

Our subsequent analysis begins with introducing the initial state \(|\psi _{i}\rangle \) of the system, which is the tensor product of \(( N+1) \) single-mode vacuum states subjected to the entanglement operation given in Eq. (7). The leader’s strategic move is executed via the unitary operation \(\hat{D}( x_{0}) \equiv \exp \{ x_{0}( \hat{b}^{\dag }-\hat{b}) /\sqrt{2}\} \), while the followers’ are represented by the unitary operations \(\{ \hat{D} _{i}( x_{i}) \equiv \exp \{ x_{i}( \hat{a}_{i}^{\dag }- \hat{a}_{i}) /\sqrt{2}\};\;i=1,2,3,\ldots ,N\} \). Having executed their moves, the final state after the disentanglement operation is given by \(|\psi _{f}\rangle \equiv \hat{J}( \gamma ,\xi ) ^{\dag } \big [ \prod _{i=1}^{N}\hat{D}_{i}( x_{i}) \,\hat{D}( x_{0}) \big ] \hat{J}( \gamma ,\xi ) |\psi _{i}\rangle \). Then, the final measurement gives the quantities of products produced by the firms:

$$\begin{aligned} q_{0}= & {} x_{0}\cosh \left( \frac{\xi }{\sqrt{N}}\right) +\frac{1}{\sqrt{N}} \sinh \left( \frac{\xi }{\sqrt{N}}\right) \sum _{j=1}^{N}x_{j} \nonumber \\ q_{i}= & {} x_{i}\exp \left( -2\gamma \right) +\frac{1}{N}\left[ \cosh \left( \frac{\xi }{\sqrt{N}}\right) -\exp \left( -2\gamma \right) \right] \sum _{j=1}^{N}x_{j} \nonumber \\&+\frac{1}{\sqrt{N}}\sinh \left( \frac{\xi }{\sqrt{N}}\right) x_{0}\quad ;\quad i=1,2,3,\ldots ,N. \end{aligned}$$
(8)

Here, the subscript 0 refers to the single leader. As a result, the profits of the leader and the followers are given by

$$\begin{aligned} u_{0}= & {} q_{0}\left( k-q_{0}-\sum _{j=1}^{N}q_{j}\right) \nonumber \\ u_{i}= & {} q_{i}\left( k-q_{0}-\sum _{j=1}^{N}q_{j}\right) \quad ;\quad i=1,2,3,\ldots ,N. \end{aligned}$$
(9)

In accordance with the backwards induction analysis, the followers know the leader’s strategic move \(x_{0}\), and thus, they choose their moves \( \left\{ x_{i}\right\} \) by optimizing their profits with respect to this \( x_{0}\):

$$\begin{aligned} \frac{\partial u_{i}}{\partial x_{i}}= & {} \left\{ \left( 1-\frac{1}{N}\right) \exp \left( -2\gamma \right) +\frac{1}{N}\cosh \left( \frac{\xi }{\sqrt{N}} \right) \right\} \left( k-q_{0}-\sum _{j=1}^{N}q_{j}\right) \nonumber \\&-\,q_{i}\left[ \cosh \left( \frac{\xi }{\sqrt{N}}\right) +\frac{1}{\sqrt{N}} \sinh \left( \frac{\xi }{\sqrt{N}}\right) \right] \nonumber \\= & {} 0. \end{aligned}$$
(10)

Since all followers are identical, they will all arrive at the same optimal value \(x^{*}\), and hence, the above equation can be simplified to yield

$$\begin{aligned} x^{*}=\frac{f\left( \xi \right) k-g\left( \xi \right) x_{0}}{N\left[ \cosh \left( \frac{\xi }{\sqrt{N}}\right) +\frac{1}{\sqrt{N}}\sinh \left( \frac{\xi }{\sqrt{N}}\right) \right] } \end{aligned}$$
(11)

where

$$\begin{aligned} f\left( \xi \right)= & {} \frac{\left( N-1\right) \exp \left( -2\gamma \right) +\cosh \left( \frac{\xi }{\sqrt{N}}\right) }{\left( N-1\right) \exp \left( -2\gamma \right) +2\cosh \left( \frac{\xi }{\sqrt{N}}\right) } \nonumber \\ g\left( \xi \right)= & {} \cosh \left( \frac{\xi }{\sqrt{N}}\right) +\sqrt{N} \sinh \left( \frac{\xi }{\sqrt{N}}\right) -\frac{1}{\left( N-1\right) \exp \left( -2\gamma \right) +2\cosh \left( \frac{\xi }{\sqrt{N}}\right) }\ .\quad \nonumber \\ \end{aligned}$$
(12)

Their identical profit \(u^{*}\) is then given by

$$\begin{aligned} u^{*}=\frac{\left[ k\cosh \left( \frac{\xi }{\sqrt{N}}\right) -x_{0} \right] \left[ x^{*}\cosh \left( \frac{\xi }{\sqrt{N}}\right) +\frac{1}{ \sqrt{N}}\sinh \left( \frac{\xi }{\sqrt{N}}\right) x_{0}\right] }{\left( N-1\right) \exp \left( -2\gamma \right) +2\cosh \left( \frac{\xi }{\sqrt{N}} \right) }\ . \end{aligned}$$
(13)

Next, we substitute the optimal value \(x^{*}\) into the profit or utility function of the leader given in Eq. (9). Optimizing it with respect to \(x_{0}\) , one obtains

$$\begin{aligned} x_{0}^{*}=\frac{k}{2}\left\{ \cosh \left( \frac{\xi }{\sqrt{N}}\right) - \frac{\left[ \left( N-1\right) \exp \left( -2\gamma \right) +\cosh \left( \frac{\xi }{\sqrt{N}}\right) \right] \sinh \left( \frac{\xi }{\sqrt{N}} \right) }{\sqrt{N}\left[ \left( N-1\right) \exp \left( -2\gamma \right) +2\cosh \left( \frac{\xi }{\sqrt{N}}\right) \right] +\sinh \left( \frac{\xi }{\sqrt{N}}\right) }\right\} \nonumber \\ \end{aligned}$$
(14)

and

$$\begin{aligned} u_{0}^{*}= & {} \frac{k^{2}}{4\left[ \left( N-1\right) \exp \left( -2\gamma \right) +2\cosh \left( \frac{\xi }{\sqrt{N}}\right) \right] }\left\{ \cosh \left( \frac{\xi }{\sqrt{N}}\right) \right. \nonumber \\&+\left. \frac{\left[ \left( N-1\right) \exp \left( -2\gamma \right) +\cosh \left( \frac{\xi }{\sqrt{N}}\right) \right] \sinh \left( \frac{\xi }{\sqrt{N} }\right) }{\sqrt{N}\left[ \left( N-1\right) \exp \left( -2\gamma \right) +2\cosh \left( \frac{\xi }{\sqrt{N}}\right) \right] +\sinh \left( \frac{\xi }{\sqrt{N}}\right) }\right\} \ . \end{aligned}$$
(15)

It is not difficult to show that both \(x_{0}^{*}\) and \(u_{0}^{*}\) are nonnegative definite. In terms of the optimal value \(x_{0}^{*}\) given in Eq. (14), we could also determine the optimal values \(x^{*}\) in Eq. (11) and \(u^{*}\) in Eq. (13). For \(\gamma =\xi =0\), it is apparent that the above results are reduced to the well-known results of the classical Stackelberg oligopoly model, namely [1,2,3]

$$\begin{aligned} x_{0}^{*}= & {} \frac{k}{2},\qquad x^{*}=\frac{k}{2\left( N+1\right) } \nonumber \\ u_{0}^{*}= & {} \frac{k^{2}}{4\left( N+1\right) },\qquad u^{*}= \frac{k^{2}}{4\left( N+1\right) ^{2}} \nonumber \\ \Delta u^{*}\equiv & {} u_{0}^{*}-u^{*}=\frac{k^{2}N}{4\left( N+1\right) ^{2}}, \end{aligned}$$
(16)

which explicitly demonstrates the first-mover advantage in the model for \( \Delta u^{*}>0\).

2.2 Special case of a single follower with M leaders

In the special case of a single follower with M leaders, the entanglement operator is given by

$$\begin{aligned} \hat{J}\left( \lambda ,\xi \right) =\exp \left\{ -\frac{1}{2} \sum _{i,j=1}^{M}\alpha _{ij}\left( \hat{b}_{i}^{\dag }\hat{b}_{j}^{\dag }- \hat{b}_{i}\hat{b}_{j}\right) \right\} \exp \left\{ -\frac{\xi }{M} \sum _{i=1}^{M}\left( \hat{a}^{\dag }\hat{b}_{i}^{\dag }-\hat{a}\hat{b} _{i}\right) \right\} .\nonumber \\ \end{aligned}$$
(17)

and the initial state \(|\psi _{i}\rangle \) of the system is the tensor product of \(( M+1) \) single-mode vacuum states subjected to the entanglement operation given in Eq. (17). The strategic moves of the follower and leaders are executed via the unitary operations: \(\hat{D}( x_{0}) \equiv \exp \{ x_{0}( \hat{a}^{\dag }-\hat{a}) / \sqrt{2}\} \) and \(\{ \hat{D}_{i}( x_{i}) \equiv \exp \{ x_{i}( \hat{b}_{i}^{\dag }-\hat{b}_{i}) /\sqrt{2}\} ;\;i=1,2,3,\ldots ,M\} \), respectively. After executing their moves and the disentanglement operation, the final state is given by \(|\psi _{f}\rangle \equiv \hat{J}( \lambda ,\xi ) ^{\dag }\big [ \prod _{i=1}^{N}\hat{D}_{i}( x_{i}) \,\hat{D}( x_{0}) \big ] \hat{J}( \lambda ,\xi ) |\psi _{i}\rangle \). The final measurement generates the quantities of products produced by the firms:

$$\begin{aligned} q_{0}= & {} x_{0}\cosh \left( \frac{\xi }{\sqrt{M}}\right) +\frac{1}{\sqrt{M}} \sinh \left( \frac{\xi }{\sqrt{M}}\right) \sum _{j=1}^{M}x_{j} \nonumber \\ q_{i}= & {} x_{i}\exp \left( -2\lambda \right) +\frac{1}{M}\left[ \cosh \left( \frac{\xi }{\sqrt{M}}\right) -\exp \left( -2\lambda \right) \right] \sum _{j=1}^{M}x_{j} \nonumber \\&+\frac{1}{\sqrt{M}}\sinh \left( \frac{\xi }{\sqrt{M}}\right) x_{0} ;\quad i=1,2,3,\ldots ,M. \end{aligned}$$
(18)

The corresponding profits are then given by

$$\begin{aligned} u_{0}= & {} q_{0}\left( k-q_{0}-\sum _{j=1}^{M}q_{j}\right) \nonumber \\ u_{i}= & {} q_{i}\left( k-q_{0}-\sum _{j=1}^{M}q_{j}\right) \quad ;\quad i=1,2,3,\ldots ,M. \end{aligned}$$
(19)

It should be noted that in this special case the subscript 0 refers to the single follower instead.

To perform the backwards induction analysis, we first solve the equation

$$\begin{aligned} \frac{\partial u_{0}}{\partial x_{0}}= & {} k\cosh \left( \frac{\xi }{\sqrt{M}} \right) -2\cosh \left( \frac{\xi }{\sqrt{M}}\right) \left[ \cosh \left( \frac{\xi }{\sqrt{M}}\right) +\sqrt{M}\sinh \left( \frac{\xi }{\sqrt{M}} \right) \right] \nonumber \\&-H\left( \xi \right) \sum _{j=1}^{M}x_{j} \nonumber \\= & {} 0 \end{aligned}$$
(20)

for the optimal move \(x_{0}^{*}\) of the follower and obtain

$$\begin{aligned} x_{0}^{*}=\frac{k\cosh \left( \frac{\xi }{\sqrt{M}}\right) -H\left( \xi \right) \sum _{j=1}^{M}x_{j}}{2\cosh \left( \frac{\xi }{\sqrt{M}}\right) \left[ \cosh \left( \frac{\xi }{\sqrt{M}}\right) +\sqrt{M}\sinh \left( \frac{ \xi }{\sqrt{M}}\right) \right] }\ . \end{aligned}$$
(21)

The function \(H\left( \xi \right) \) is defined by

$$\begin{aligned} H\left( \xi \right) =\cosh \left( \frac{\xi }{\sqrt{M}}\right) ^{2}+\frac{2}{ \sqrt{M}}\cosh \left( \frac{\xi }{\sqrt{M}}\right) \sinh \left( \frac{\xi }{ \sqrt{M}}\right) +\sinh \left( \frac{\xi }{\sqrt{M}}\right) ^{2}. \end{aligned}$$
(22)

Then, by substituting this optimal value \(x_{0}^{*}\) into the profit function of the leaders given in Eq. (19) and optimizing it with respect to \( x_{i}\), the optimal move and profit of each leader are determined as follows:

$$\begin{aligned} x^{*}= & {} \frac{k}{M}\cosh \left( \frac{\xi }{\sqrt{M}}\right) \left\{ \frac{\left( M-1\right) \exp \left( -2\lambda \right) +F\left( \xi \right) }{ \left( M-1\right) \exp \left( -2\lambda \right) +G\left( \xi \right) } \right\} \end{aligned}$$
(23)
$$\begin{aligned} u^{*}= & {} \frac{k^{2}}{2M\cosh \left( \frac{\xi }{\sqrt{M}}\right) } \left\{ \frac{\left( M-1\right) \exp \left( -2\lambda \right) +\frac{1}{2} G\left( \xi \right) }{\left[ \left( M-1\right) \exp \left( -2\lambda \right) +G\left( \xi \right) \right] ^{2}}\right\} \end{aligned}$$
(24)

where

$$\begin{aligned} F\left( \xi \right)= & {} \frac{1}{\cosh \left( \frac{\xi }{\sqrt{M}}\right) + \sqrt{M}\sinh \left( \frac{\xi }{\sqrt{M}}\right) } \nonumber \\ G\left( \xi \right)= & {} \frac{2\cosh \left( \frac{\xi }{\sqrt{M}}\right) + \sqrt{M}\sinh \left( \frac{\xi }{\sqrt{M}}\right) }{\cosh \left( \frac{\xi }{ \sqrt{M}}\right) \left[ \cosh \left( \frac{\xi }{\sqrt{M}}\right) +\sqrt{M} \sinh \left( \frac{\xi }{\sqrt{M}}\right) \right] }\ . \end{aligned}$$
(25)

Here, the derivations have made use of the fact that all leaders are identical and they will all arrive at the same optimal value of \(x^{*}\). Accordingly, the corresponding optimal move and profit of the follower are given by

$$\begin{aligned} x_{0}^{*}= & {} \frac{F\left( \xi \right) }{2\cosh \left( \frac{\xi }{\sqrt{ M}}\right) }\left\{ k\cosh \left( \frac{\xi }{\sqrt{M}}\right) -MH\left( \xi \right) x^{*}\right\} \end{aligned}$$
(26)
$$\begin{aligned} u_{0}^{*}= & {} \frac{k}{2\cosh \left( \frac{\xi }{\sqrt{M}}\right) } \left\{ \frac{\cosh \left( \frac{\xi }{\sqrt{M}}\right) x_{0}^{*}+\sqrt{M }\sinh \left( \frac{\xi }{\sqrt{M}}\right) x^{*}}{\left( M-1\right) \exp \left( -2\lambda \right) +G\left( \xi \right) }\right\} . \end{aligned}$$
(27)

It should be noted that, unlike the case of a single leader with N followers, the leaders’ optimal moves and profits in this case are no longer nonnegative definite. For \(\lambda =\xi =0\), the well-known results of the classical Stackelberg oligopoly model are recovered, namely [1,2,3]

$$\begin{aligned} x^{*}= & {} \frac{k}{M+1},\qquad x_{0}^{*}=\frac{k}{2\left( M+1\right) } \nonumber \\ u^{*}= & {} \frac{k^{2}}{2\left( M+1\right) ^{2}},\qquad u_{0}^{*}=\frac{k^{2}}{4\left( M+1\right) ^{2}} \nonumber \\ \Delta u^{*}\equiv & {} u^{*}-u_{0}^{*}=\frac{k^{2}}{4\left( M+1\right) ^{2}}, \end{aligned}$$
(28)

which clearly indicate the first-mover advantage in the model for \(\Delta u^{*}>0\)

Fig. 1
figure 1

a The leader’s profit \(u_{0}^{*}\) (in units of the classical value) is plotted versus the entanglement parameter \(\gamma \) for different values of the entanglement parameter \(\xi \) in the case of one leader with two followers, i.e. \(N=2\), b the profit of each follower \(u^{*}\) (in units of the classical value) is plotted versus \( \gamma \) for different values of \(\xi \) in the same case, and c the profit difference \(\Delta u^{*}\equiv u_{0}^{*}-u^{*}\) (in units of the classical value) is plotted versus \(\gamma \) for different values of \(\xi \) in the same case. It should be noted that \(\xi \) must be less than 0.46561 in order to have nonnegative values of \(x^{*}\)

3 Illustrative numerical results

In Fig. 1 the profits of each follower \(u^{*}\) and the leader \( u_{0}^{*}\) of the special case of a single leader with two followers (i.e. \(N=2\)) are plotted versus the entanglement parameter \(\gamma \) (between the two followers) for different values of the entanglement parameter \(\xi \) (between each follower and the leader). Our analysis indicates that if \(\xi \) is less than 0.46561, then \(x^{*}\) is non-negative for all values of \(\gamma \). Of course, nonnegative values of \( x^{*}\) can also be found for other values of \(\xi \) provided that \( \gamma \) is chosen appropriately. As shown in the figure, for \(\gamma \lessapprox -\,2\) the leader’s profit \(u_{0}^{*}\) (in units of the classical value) is infinitesimal. Then, it starts to increase rapidly with the entanglement parameter \(\gamma \) and saturates around \(\gamma =1\) for the given values of the entanglement parameter \(\xi \). The profit of each follower \(u^{*}\) (in units of the classical value) also exhibits similar behaviour. In addition, tuning the parameter \(\xi \) enables us to enhance or suppress their profits against the classical values.

Moreover, the difference between the leader’s profit and the profit of each follower \(\Delta u^{*}\equiv u_{0}^{*}-u^{*}\) (in units of the classical value) is plotted against \(\gamma \) for different values of \(\xi \). It is apparent that the leader’s profit is always more than the followers’ for the given values of \(\xi \). In other words, the leader always has an advantage over the followers. However, significant first-mover advantage occurs only for \(\gamma \gtrapprox 1\). In Fig. 2 we examine how the number of followers affects the aforementioned first-mover advantage. The results indicate that the same pattern of the first-mover advantage also exists for \( N=5\), 10 and 30, as well as that the sole effect of increasing the number of followers is simply the enhancement of the first-mover advantage. In all these cases the parameter \(\xi \) plays the role of fine-tuning the first-mover advantage only. Besides, it should be noted that in each of these three cases the value of \(\xi \) is capped by an upper bound to ensure the nonnegative definiteness of \(x^{*}\) for all values of \(\gamma \).

In Fig. 3 the profits of each leader \(u^{*}\) and the follower \( u_{0}^{*}\) of the special case of a single follower with two leaders (i.e. \(M=2\)) are plotted versus the entanglement parameter \(\lambda \) (between the two leaders) for different values of the entanglement parameter \(\xi \) (between each leader and the follower). In order to ensure the values of \(x_{0}^{*}\), \(u_{0}^{*}\) and \(u^{*}\) to be nonnegative definite for all values of \(\lambda \), we must require \(-\,1.24645<\xi <0\). Other values of \(\xi \) can also be allowed if \(\lambda \) is chosen appropriately. For \(\lambda \lessapprox -\,2\) the profits of the follower \( u_{0}^{*}\) (in units of the classical value) and each leader \(u^{*}\) (in units of the classical value) are very small. Then, they start to grow rapidly with the entanglement parameter \(\lambda \) and come to a plateau around \(\lambda =1\) for the given values of the entanglement parameter \(\xi \) . By varying the parameter \(\xi \), we are able to monitor the profits against the classical values, too. Likewise, it is found that the first-mover advantage is essentially zero for \(\lambda <-\,3\) and then it begins to rise rapidly. After hitting the maximum near the classical value at \(\lambda \approx -\,1\), the first-mover advantage starts to diminish, and an unexpected loss of the first-mover advantage appears at \(\lambda \approx 0 \) (except for \(\xi =0\)). The loss deteriorates with the entanglement parameter \(\lambda \) until it levels off at \(\lambda \approx 2\). This feature is certainly not expected in the classical model of Stackelberg oligopoly. In Fig. 4 the loss of the first-mover advantage is further examined for \(M=5\), 10 and 30. The same pattern of the loss exists for \( M=5\), 10 and 30, too; increasing the number of leaders simply results in aggravation of the loss. Furthermore, in all these cases the parameter \(\xi \) plays the role of fine-tuning the first-mover advantage only, and its value is confined within a specific range.

Fig. 2
figure 2

The profit difference \(\Delta u^{*}\equiv u_{0}^{*}-u^{*}\) (in units of the classical value) is plotted against \(\gamma \) for different values of \(\xi \) in the case of a 5 followers (i.e. \( N=5)\), b 10 followers (i.e. \(N=10\)), and c 30 followers (i.e. \(N=30\)). It should be noted that in each of these three cases the value of \( \xi \) is capped by an upper bound, namely \(\xi =0.48467\), 0.49202 and 0.49726 for \(N=5\), 10 and 30, respectively

Fig. 3
figure 3

a The follower’s profit \(u_{0}^{*}\) (in units of the classical value) is plotted versus the entanglement parameter \(\lambda \) for different values of the entanglement parameter \(\xi \) in the case of one follower with two leaders, i.e. \(M=2\), b the profit of each leader \( u^{*}\) (in units of the classical value) is plotted versus \(\lambda \) for different values of \(\xi \) in the same case, and c the profit difference \(\Delta u^{*}\equiv u^{*}-u_{0}^{*}\) (in units of the classical value) is plotted versus \(\lambda \) for different values of \(\xi \) in the same case. It should be noted that \(\xi \) must be confined within the specific range \(\left[ -\,1.24645,0\right] \) in order to ensure the values of \( x_{0}^{*}\), \(u_{0}^{*}\) and \(u^{*}\) to be nonnegative definite

Fig. 4
figure 4

The profit difference \(\Delta u^{*}\equiv u^{*}-u_{0}^{*}\) (in units of the classical value) is plotted against \( \gamma \) for different values of \(\xi \) in the case of a 5 leaders (i.e. \(M=5)\), b 10 leaders (i.e. \(M=10\)), and c 30 leaders (i.e. \(M=30\)). It should be noted that in each of these three cases the value of \(\xi \) is confined within a specific range, namely \(\left[ -\,1.07602,0\right] \), \(\left[ -\,1.03549,0\right] \) and \(\left[ -\,1.01134,0 \right] \) for \(M=5\), 10 and 30, respectively

4 Conclusion

We have investigated the quantization of the multi-player Stackelberg game by proposing an asymmetric quantum entanglement operation given in Eq. (7). This quantization scheme is different from the one introduced in the quantization of the Cournot and Bertrand oligopolies where a symmetric quantum entanglement operation given in Eq. (1) is used instead. We believe that due to the informational asymmetry between the leaders and followers in the Stackelberg model, it is more natural to have differential quantum entanglement in a quantum Stackelberg oligopoly. For simplicity, two special cases of the multi-player Stackelberg model, namely the case of a single leader with N followers as well as the case of a single follower with M leaders, have been examined in detail to demonstrate the effect of quantum entanglement on the outcomes of the Stackelberg oligopoly. Our analysis indicates that in the former case the quantum entanglement simply monitors the first-mover advantage against the classical value, whereas in the latter case an unexpected loss in the first-mover advantage could be caused by the quantum entanglement. This surprising feature is certainly not expected in the classical Stackelberg model and raises an important question: “To move first or not to move first ?”. Furthermore, we believe that the general case of M leaders with N followers may contain more complex patterns of the players’ profits and a thorough investigation is desirable.

As a final remark, it should be noted that using a different quantization scheme Iqbal and Toor [15] also performed an analysis of the quantum Stackelberg duopoly. Their study indicates that the backward induction outcome in the quantum form of Stackelberg duopoly becomes the same as the classical equilibrium of Cournot duopoly. That is, the classical first-mover advantage is avoided by quantization. On the other hand, using the minimal quantization rule, Lo and Kiang [14] has found that while positive entanglement enhances the first-mover advantage beyond the classical limit, the advantage is dramatically suppressed by negative entanglement, leading to its disappearance asymptotically. Hence, despite that both quantization schemes predict suppression of the first-mover advantage, the two schemes show significant differences, too.