Abstract
In this paper, cooperative network games with pairwise interactions are considered. The cooperative version of games is investigated. For a particular type of networks, a simplified formula for the Shapley value based on a constructed characteristic function is derived. The time inconsistency of the Shapley value is shown.
Access provided by Autonomous University of Puebla. Download chapter PDF
Similar content being viewed by others
Keywords
- Shapley Value
- Network Games
- Time Inconsistency
- Cooperative Version
- Imputation Distribution Procedure (IDP)
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
1 Introduction
Network games is a new and important part of modern game theory. Networks illustrate the interaction of both individuals and groups. For the first time in the literature, a non-cooperative form of pairwise interaction in a network was considered in [3] meaning direct interactions between network neighbors. Finding an equilibrium in online gaming as an example of a Designer–Adversary game was described in [4]. Pairwise interaction was exposed in [1] on the example of the dissemination of information and misinformation in social networks. The efficiency and stability of networks depending on external factors such as marginal costs were examined in [6]. An approach for finding optimal behavior in multistage games was considered in [9]. Cooperation in network games and a model of interaction between coalitions were considered in [5].
When cooperative behavior is investigated, it is important that players follow a cooperative agreement during the whole game. If a solution of the cooperative game is time-consistent, players have no reason to deviate from the accepted agreement. An imputation distribution procedure (IDP) which is a payment scheme that provides the implementation of the solution was introduced in [8] to prevent players from deviating from the cooperative agreement. The conditions for the time consistency of the core for two-stage games with pairwise interactions were established in [2]. The dynamic properties of cooperative solutions in multicriteria games were considered in [7]. In this paper, we provide analytic expressions for characteristic functions in a two-stage game with pairwise interactions. Further, similar to [11], we provide conditions for the time consistency of the Shapley value in this game. Moreover we simplify the formula of the Shapley value for a network of a special type—a star.
2 Description of the Model
Let N be a finite set of players who make decisions in two stages, \(|N|=n\geqslant 2\). At the first stage z 1 each player i ∈ N chooses his behavior \(b_i^1=(b_{i1}^1,\ldots ,b_{in}^1)\)—a profile of offers to establish connections with other players:
with
Here M i ⊆ N ∖{i} is a given set of players whom player i can offer connections, \(b^1_{ii}=0\) for i ∈ N; a i ∈{0, …, n − 1} represents the maximum number of connections for player i. If M i = N ∖{i}, player i can offer a connection to any player; in particular, if a i = n − 1, player i can have any number of connections. The result of the first stage is a network \(g(b_1^1 , \ldots , b_n^1)\) consisting of links (connections) ij such that \(b_{ij}^1 = b_{ji}^1 =1\). For brevity, denote \(g(b_1^1 , \ldots , b_n^1)\) by g. Define the neighbors of player i in network g as elements of the set N i(g) = {j ∈ N ∖{i} : ij ∈ g} or simply N i. After the network formation stage z 1, players proceed to the second stage z 2.
At second stage z 2(g) which depends upon a network chosen at the first stage, network neighbors play pairwise simultaneous bimatrix games {γ ij}. Namely, let i ∈ N, j ∈ N i, then at the second stage, player i plays with his neighbor j a bimatrix game γ ij with non-negative payoff matrices \(A_{ij}=[a^{ij}_{p\ell }]_{p=1,\ldots ,m;\;\ell =1,\ldots ,k}\) and \(B_{ij}=[b^{ij}_{p\ell }]_{p=1,\ldots ,m;\;\ell =1,\ldots ,k}\) for players i and j, respectively.
After receiving payoffs in these bimatrix games, the game ends. In other words, we have a two-stage game Γ which is a special case of a multistage non-zero-sum game. Adapting the definition of a strategy to this case, a strategy of player i ∈ N will be a rule which assigns a set of his neighbors at first stage \(b_i^1\), and a behavior \(b_i^2\) in each of the bimatrix games at the second stage of the game taking into account a network formed at the first stage. Denote the strategy of player i ∈ N in two-stage game Γ by \(u_i=(b_i^1, b_i^2)\). Let (z 1, z 2) be a trajectory realized under the strategy profile u = (u 1(⋅), …, u n(⋅)) in Γ. Define the payoff of player i as h i(z 2) which is the sum of player i’s payoffs in all bimatrix games with his neighbors when \(b_i^2\), \(b_j^2\), j ∈ N i are chosen. Then player i’s payoff function in Γ starting at z 1 is defined as K i(z 1;u i(⋅), …, u n(⋅)) = h i(z 2).
The rest of the paper will be devoted to a cooperative version of the two-stage game Γ.
2.1 Cooperation at the Second Stage of the Game
A game \(\varGamma _{z_2}\) denoting a subgame of game Γ which starts at the second stage z 2 can be considered in cooperative form. In this case, we define characteristic function v(z 2;S) for any subset (coalition) S ⊂ N as the maxmin value of a two-person zero-sum game between coalition S and its complement N ∖ S constructed with the use of game \(\varGamma _{z_2}\). The superadditivity of the characteristic function follows from its definition. Denote the maxmin value of player i (j) in game γ ij with his neighbor j (i) as
Following [2], for any S ⊆ N, the characteristic function v(z 2;S) is given by:
2.2 Cooperation at Both Stages of the Game
Consider a cooperative form of two-stage game Γ. Suppose that all players choose strategies \(\bar {u}=(\bar {u}_1,\ldots , \bar {u}_n)\) which maximize their joint payoff in game Γ, i.e.,
The strategy profile \(\bar {u}=(\bar {u}_1,\ldots , \bar {u}_n)\) is called the cooperative strategy profile, and the corresponding trajectory \((\bar {z}_1, \bar {z}_2)\) is the cooperative trajectory.
As before for coalition S ⊆ N, we define characteristic function \(v(\bar {z}_1; S)\) as the maxmin value of a two-person two-stage zero-sum game between coalition S and its complement, where the payoff of S is the sum of players’ payoffs from S, and the strategy of S is an element of the Cartesian product of sets of players’ strategies belonging to S. Since players’ payoffs are non-negative, for player N ∖ S, the best behavior to follow is to have no connections with S. Hence we get
2.3 The Shapley Value and Time Consistency
Given a characteristic function \(v(\bar {z}_t;\cdot )\), t = 1, 2, we define an imputation as a vector \(\xi [v(\bar {z}_t)] = (\xi _1[v(\bar {z}_t)], \ldots , \xi _n[v(\bar {z}_t)])\) which is (i) efficient, i.e., \(\sum _{i\in N} \xi _i[v(\bar {z}_t)] = v(\bar {z}_t;N)\) and (ii) individually rational, i.e., \(\xi _i[v(\bar {z}_t)]\geqslant v(\bar {z}_t;\{i\})\) for all i ∈ N. Denote the set of all imputations (an imputation set) in game Γ by \(I(v(\bar {z}_t))\). As an imputation we consider the Shapley value [12] denoted by \(\varphi [v(\bar {z}_t)]= (\varphi _1[v(\bar {z}_t)], \ldots , \varphi _n[v(\bar {z}_t)])\) where
Before the start of game Γ, players agree on choosing cooperative trajectory (\(\bar {z}_1, \bar {z}_2\)), i.e., the trajectory that yields the maximum joint payoff \(v(\bar {z}_1;N)\), and we suppose that players allocate this payoff according to the Shapley value. This means that in Γ each player i ∈ N expects his payoff to be equal to \(\varphi _i[v(\bar {z}_1)]\). If players recalculate the Shapley value after the network formation stage (at the second stage), it turns out that the recalculated Shapley value \(\varphi [v(\bar {z}_2)]\) differs from the previous one. This may lead to a violation of the cooperative agreement because some players may refuse to use their cooperative strategies. We say that the Shapley value as an allocation in the two-stage game is time consistent if \(\varphi [v(\bar {z}_1)]=\varphi [v(\bar {z}_2)]\) (as players do not receive payoffs at the network formation stage), otherwise we call the Shapley value time inconsistent. In the former case, players follow the cooperative agreement not expecting that someone violate it. In the latter case, to prevent players from violating the cooperative agreement, we use an imputation distribution procedure (IDP) \(\beta =\{\beta _i^1,\beta _i^2\}_{i \in N}\) (first introduced in [8]) for the Shapley value \(\varphi [v(\bar {z}_1)]\) which decomposes it over two stages of the game Γ: \(\varphi _i[v(\bar {z}_1)] = \beta _i^1+\beta _i^2\) for each i ∈ N. Here \(\beta _i^1\) can be interpreted as a stage payment to player i at the network formation stage, and \(\beta _i^2\) is his payment at the second stage of the game under the cooperative agreement. We say that the IDP β of the Shapley value \(\varphi [v(\bar {z}_1)]\) is a time-consistent IDP [10, 11] when it is given by:
Introducing the time-consistent IDP β (2.4) of the Shapley value \(\varphi [v(\bar {z}_1)]\), players can be sure that no one violates the cooperative agreement, hence it will be realized in the game and player i ∈ N gets \(\varphi _i[v(\bar {z}_1)]\) as his cooperative payoff.
3 The Shapley Value for a Star
Since the calculation of the Shapley value \(\varphi [v(\bar {z}_t)]\), t = 1, 2, is a difficult task for a large number of players in an arbitrary network, we simplify formula (2.3) for a network of a special type—a star. Within this section we suppose the following. Let M i = N ∖{i} for i ∈ N and a 1 = n − 1, a i = 1, i ≠ 1. Further let maxj ∈ N w ij = w i1. Then in order to maximize the joint payoff, players should choose the following behaviors at the first stage of the game: \(b^1_1=(0, 1, \ldots , 1)\) for player 1, and \(b^1_i=(1,0, \ldots , 0)\) for player i ≠ 1. These behaviors form a star-network at this stage (see Fig. 2.1). In the star-network, |N 1| = n − 1 and |N i| = 1, i ≠ 1.
For a star-network, the characteristic function is calculated using a specific structure of the network. The network has central symmetry which suggests that formula (2.3) can be simplified. Let \(m_{ij}= \max _{p,\ell } (a^{ij}_{p\ell } + b^{ji}_{p\ell })\). Substituting the adopted notation, as well as (2.1), (2.2), into (2.3), we obtain the following expression for the components of the Shapley value:
3.1 Two Examples
Two examples below demonstrate that the Shapley value being an allocation in a cooperative two-stage game with pairwise interactions can be both time consistent and time inconsistent. The first example shows the time consistency of the Shapley value.
Example 1 (Prisoner’s Dilemma)
Consider the case, when n players play the same game γ with their neighbors, i.e., A ij = A, B ij = B for all i ∈ N, j ∈ N i where
To find the Shapley value \(\varphi [v(\bar {z}_2)]\), we first determine characteristic function \(v(\bar {z}_2;S)\) for all S ⊆ N. Following (2.1), we obtain
Using the formula for the Shapley value (2.5) adapted to a star and noting that the Shapley value is an efficient allocation satisfying the property of symmetry and that m 1j = 2b for any j ∈ N 1, we obtain
Similarly, to find the Shapley value \(\varphi [v(\bar {z}_1)]\), we determine characteristic function \(v(\bar {z}_1;S)\) for all S ⊆ N. Following (2.2), we have
Again, using the formula for the Shapley value (2.5) adapted to a star, the Shapley value \(\varphi [v(\bar {z}_1)]\) is given by
Comparing \(\varphi [v(\bar {z}_1)]\) and \(\varphi [v(\bar {z}_2)]\), we note they coincide and hence the Shapley value is time consistent.
In the next example we demonstrate the time inconsistency of the Shapley value.
Example 2
Consider a numerical example with N = {1, 2, 3, 4} in which players form a star-network under a cooperative agreement (see Fig. 2.2). Let simultaneous bimatrix games γ 12, γ 13, and γ 14 be defined by means of the following payoff matrices of players:
To compute the Shapley values \(\varphi [v(\bar {z}_1)]\) and \(\varphi [v(\bar {z}_2)]\), we use the corresponding formulas (2.1), (2.2) for characteristic functions \(v(\bar {z}_2;\cdot )\) and \(v(\bar {z}_1;\cdot )\), respectively, and the simplified formula (2.5). Hence we get
and therefore
Thus the Shapley values are given by
We observe that the Shapley value \(\varphi [v(\bar {z}_1)]\) in the two-stage game differs from the Shapley value \(\varphi [v(\bar {z}_2)]\) in the one-stage game starting at the second stage. This means the time inconsistency of the Shapley value. Since \(\varphi _2[v(\bar {z}_2)] = 5/2 < \varphi _2[v(\bar {z}_1)] = 3\), player 2 can break the cooperative agreement as his payoff can get less (here we recall that players do not receive payoffs at the network formation stage). Similar holds for player 3: \(\varphi _3[v(\bar {z}_2)] = 7/2 < \varphi _3[v(\bar {z}_1)] = 4\) and player 4: \(\varphi _4[v(\bar {z}_2)] = 11/2 < \varphi _4[v(\bar {z}_1)] = 6\). However introducing the time-consistent IDP of the Shapley value \(\varphi [v(\bar {z}_1)]\) over two stages determined by formula (2.4), we obtain
and therefore cooperation will be sustainable. Thus receiving \(\beta _i^1\) at the first stage and \(\beta _i^2\) at the second stage, player i ∈ N will get \(\varphi _i[v(\bar {z}_1)]\) in two stages which is exactly player i’s cooperative payoff prescribed by the Shapley value \(\varphi [v(\bar {z}_1)]\).
4 Conclusion
In this paper, we studied a two-stage network game for a special type of pairwise interactions between players. This gave us the possibility of getting analytic expressions for characteristic functions in this game. As a solution of the game under consideration, we took the Shapley value and found its analytic form for a star-network. The special structure of the network game gives us the possibility of the implementation of other cooperative solutions what enriches the scope of application. The time inconsistency of the Shapley value in the two-stage game with pairwise interactions was demonstrated, and time-consistent IDP-based payoffs were introduced to deal with time inconsistency.
References
Acemoglu, D., Ozdaglar, A., ParandehGheibi, A.: A Spread of (mis)information in social networks. Game Econ. Behav. 70(2), 194–227 (2010)
Bulgakova, M.A., Petrosyan, L.A.: About strongly time-consistency of core in the network game with pairwise interactions. In: Proceedings of 2016 International Conference “Stability and Oscillations of Nonlinear Control Systems”, pp. 157–160 (2016)
Dyer, M., Mohanaraj, V.: Pairwise-interaction games. In: Aceto, L., Henzinger, M., Sgall, J. (eds.) Automata, Languages and Programming. ICALP 2011. Lecture Notes in Computer Science, vol. 6755, pp. 159–170 (2011)
Dziubiński, M., Goyal, S.: Network design and defence. Game Econ. Behav. 79, 30–43 (2013)
Hernández, P., Muñoz-Herrera, M., Sánchez, Á.: Heterogeneous network games: conflicting preference. Game Econ. Behav. 79, 56–66 (2013)
König, M.D., Battiston, S., Napoletano, M., Schweitzer, F.: The efficiency and stability of R&D networks. Game Econ. Behav. 75(2), 694–713 (2012)
Kuzyutin, D., Nikitina, M.: Time consistent cooperative solutions for multistage games with vector payoffs. Oper. Res. Lett. 45(3), 269–274 (2017)
Petrosyan, L.A., Danilov, N.N.: Stability of solutions of non-zero-sum game with transferable payoffs. Vestn. Leningr. Univ. Ser 1. Mat. Mekh. Astron. 19, 52–59 (1979)
Petrosyan, L.A., Sedakov, A.A.: Multistage network games with perfect information. Autom. Remote Control 75(8), 1532–1540 (2014)
Petrosyan, L.A., Sedakov, A.A.: The subgame-consistent Shapley value for dynamic network games with shock. Dyn. Games Appl. 6(4), 520–537 (2016)
Petrosyan, L.A., Sedakov, A.A., Bochkarev, A.O.: Two-stage network games. Autom. Remote Control 77(10), 1855–1866 (2016)
Shapley, L.: A value for N-person games. In: Kuhn, H.W., Tucker, A.W. (eds). Contributions to the Theory of Games II, pp. 307–317. Princeton University Press, Princeton (1953)
Acknowledgements
This research was supported by the Russian Science Foundation (grant No. 17-11-01079).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Petrosyan, L., Bulgakova, M., Sedakov, A. (2019). The Time-Consistent Shapley Value for Two-Stage Network Games with Pairwise Interactions. In: Song, J., Li, H., Coupechoux, M. (eds) Game Theory for Networking Applications. EAI/Springer Innovations in Communication and Computing. Springer, Cham. https://doi.org/10.1007/978-3-319-93058-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-93058-9_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-93057-2
Online ISBN: 978-3-319-93058-9
eBook Packages: EngineeringEngineering (R0)