Abstract
In the paper the definition of S-strongly time-consistency in differential games is introduced. The approach of the construction of S-strong time-consistent subcore of the classical core on the base of characteristic function obtained by normalization of classical characteristic function is formulated. Its relation to another characteristic function obtained by an integral extension of the original characteristic function is studied.
Access provided by CONRICYT-eBooks. Download chapter PDF
Similar content being viewed by others
Keywords
- Cooperative Differential Games
- Strong Time Consistency
- Classical Characteristic Function
- Core Classes
- Integral Extension
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
12.1 Introduction
Dynamic games theory has many applications in different areas (see [1, 2, 11, 13]). Particularly important are cooperative differential games that are widely used for modeling the situations of joint decision taking by many agents. When considering such problems, the realizability of cooperative solution in time turns out to be one of the central issues.
As it was mentioned earlier, [9, 13], an attempt to transfer the optimality principles (cooperative solution) from the static cooperative game theory to n-persons differential games leads to dynamically unstable (time inconsistent) optimality principles that renders meaningless their use in differential games. Hence, the notion of time consistent cooperative solution and an approach to determining such cooperative solution was proposed in [9].
A strong time-consistent optimality principle has even more attractive property. Namely, strong time consistency of the core considered as a cooperative solution implies that a single deviation from the chosen imputation taken from the core in favor of another imputation from the core does not lead to non-realizability of the cooperative agreement (the core) defined for the whole duration of the game, [7]. This implies that the overall payment for players will also be contained in the core.
In this paper, a cooperative differential game with the set of players N is studied in general setting on the finite time horizon. The work is of fundamental character, but may potentially have a big practical impact because it proposes a constructive approach to the definition of a new cooperative solution which satisfies the condition of strong time-consistency.
In the paper, we study different approaches to constructing a strongly time-consistent cooperative solution, which are based on the use of additional procedures for the imputation distribution on the time interval [t 0, T] (IDP) for classical cooperative solution, i.e., the core, and on the transformations of the classical characteristic function V (S, ⋅), S ⊆ N. Furthermore, we present results illustrating the relationship between the introduced concepts.
In [7, 8], it was shown that it is possible to define a new type of characteristic function \(\bar V(S, \cdot )\) on the base of integral transformation of the classical characteristic function V (S, ⋅) such that the resulting optimality principles are strongly time-consistent.
In [10], another approach to the construction of the characteristic function \(\hat V(S, \cdot )\) on the base of normalizing transformation of V (S, ⋅) had been suggested and it was shown that the core constructed on the base of the new \(\hat V(S, \cdot )\) belongs to the classical core.
In this contribution we track the connection between the optimality principles constructed on the basis of classical characteristic function and the constructions resulted from the new types of characteristic function. We study the property of strong-time consistency for all constructed optimality principles and suggest a modification of the notion of strong time-consistency as described below.
The notion of S-strong time-consistency can be considered as a weakening of the strong time-consistency and means the following: after a single deviation from the chosen imputation from the optimality principle \(\hat M(x_0, t_0)\) in favor of another imputation from the same optimality principle \(\hat M(x^*(t), t)\) the resulting imputation will belong to a larger set \(M(x_0, t_0)\supset \hat M(x_0, t_0)\) even if the resulting solution does not belong to the initial set \(\hat M(x_0, t_0)\). Note that S-strong time-consistency of the cooperative solution is considered with respect to another (bigger) set, hence the prefix S-.
The construction of a S-strongly dynamically stable subcore on the base of all described approaches is presented.
12.2 List of Key Notations
- x :
-
trajectory of the system
- u :
-
control vector u = {u 1, …, u n}
- K i (x, t, u):
-
payoff of the player i in a subgame starting at t from x
- N :
-
set of players (the grand-coalition)
- S :
-
subset of players (a coalition), S ⊆ N
- V (S, x, t):
-
basic characteristic function (c.f.)
- \(\bar V(S,x,t)\) :
-
an integral extension of the c.f. V
- \(\hat V(S,x,t)\) :
-
a normalized c.f. V
- L(x, t):
-
set of imputations associated with V
- \(\bar L(x,t)\) :
-
set of imputations associated with \(\bar V\)
- C(x, t):
-
core associated with V
- \(\bar C(x,t)\) :
-
core associated with \(\bar V\)
- \(\hat C(x,t)\) :
-
core associated with \(\hat V\)
12.3 Basic Game
Consider the differential game Γ(x 0, t 0) starting from the initial position x 0 and evolving on time interval [t 0, T]. The equations of the system’s dynamics have the form
The players’ payoffs are
where x(t) is the solution of system (12.1) with controls u 1, …, u n. The non-negativeness of the utility function h i(⋅) is an important assumption of the model.
It is furthermore assumed that the system (12.1) satisfies all the conditions guaranteeing the existence and uniqueness of solution x(t) on the time interval [t 0, T] for all admissible measurable open loop controls u 1(t), …, u n(t), t ∈ [t 0, T]. Let there exist a set of controls
such that
The solution x ∗(t) of the system (12.1) corresponding to u ∗(t), is called the cooperative trajectory.
In cooperative game theory, [6], it is assumed that the players initially agree upon the use of the controls \(u^*(t)=\{u^*_1(t), \ldots , u^*_n(t)\}\) and hence, in the cooperative formulation the differential game Γ(x 0, t 0) always develops along the cooperative trajectory x ∗(t).
Let N = {1, …, i, …, n} be the set of all players. Let S ⊆ N and denote by V (S;x 0, t 0) the characteristic function of the game Γ(x 0, t 0), [6]. Note that V (N;x 0, t 0) is calculated by the formula (12.2). Let V (S;x ∗(t), t), S ⊆ N, t ∈ [t 0, T] be a (superadditive) characteristic function of the subgame Γ(x 0, t 0) constructed by any relevant method [5].
So, we state the following properties for characteristic function:
For the sake of definiteness we can assume that the characteristic function V (S;x 0, t 0) is constructed as the value of a zero-sum differential game based on the game Γ(x 0, t 0) and played between the coalition S (the first maximizing player) and the coalition N ∖ S (the second minimizing player), and in each situation the payoff of coalition S is assumed to be the sum of players’ payoffs from this coalition.
Consider the family of subgames Γ(x ∗(t), t) of game Γ(x 0, t 0) along the cooperative trajectory x ∗(t), i.e. a family of cooperative differential games from the initial state x ∗(t) defined on the interval [t, T], t ∈ [t 0, T] and the payoff functions
where x(τ) is a solution of (12.1) from initial position x ∗(t) with controls u 1, …, u n.
Let V (S;x ∗(t), t), S ⊆ N, t ∈ [t 0, T] be the (superadditive) characteristic function of subgame Γ(x ∗(t), t), s.t. the properties (12.3) hold. For V (N;x ∗(t), t), the Bellman optimality condition along x ∗(t) holds, i.e.
12.4 Construction of a Core with a New Characteristic Function
Define the new characteristic function \(\bar {V}(S; x_0,t_0)\), S ⊆ N, similar to [7, 8], by the formula
Similarly, we define for t ∈ [t 0, T]
One can readily see that the function \(\bar {V}(S; x_0,t_0)\) has the all properties (12.3) of the characteristic function of the game Γ(x 0, t 0). Indeed,
for S 1, S 2 ⊂ N, S 1 ∩ S 2 = ∅ (here we use the superadditivity of function V (S;x 0, t 0)). The similar statement is true also for function \(\bar {V} (S; x^*(t), t) \) which is defined as the characteristic function of Γ(x ∗(t), t).
Let L(x 0, t 0) be the set of imputations in Γ(x 0, t 0) determined by characteristic function of V (S;x 0, t 0), S ⊆ N, i.e.
Similarly, we define the set of imputations L(x ∗(t), t), t ∈ [t 0, T] in the subgame Γ(x ∗(t), t):
We denote the set of imputations defined by characteristic functions \(\bar {V}(S;x_0,t_0)\) and \(\bar {V}(S;x^*(t),t)\) by \(\bar {L}(x_0,t_0)\) and \(\bar {L}(x^*(t),t)\), respectively. These imputations are defined in the same way as (12.6), (12.7).
Let ξ(t) = {ξ i(t)}∈ L(x ∗(t), t) be the integrable selector [9], t ∈ [t 0, T], define
where t ∈ [t, T] and i = 1, …, n.
One can see that
Moreover, we have
and similarly
i.e. the vectors \(\bar {\xi }= \{\bar {\xi _i}\}\) and \(\bar {\xi ^t}= \{\bar {\xi _i^t}\}\) are imputations in the games Γ(x 0, t 0) and Γ(x ∗(t), t), t ∈ [t 0, T], respectively, if the functions \(\bar {V}(S;x_0,t_0)\) and \(\bar {V}(S;x^*(t),t)\) are used as characteristic functions.
We have that \(\bar {\xi }\in \bar {L}(x_0,t_0) \) and \(\bar {\xi ^t} \in \bar {L}(x^*(t),t)\).
Denote by C(x 0, t 0) ⊂ L(x 0, t 0), C(x ∗(t), t) ⊂ L(x ∗(t), t), t ∈ [t 0, T], the core of the game Γ(x 0, t 0) and of the subgame Γ(x ∗(t), t), respectively (it is assumed that the sets C(x ∗(t), t), t ∈ [t 0, T], are not empty along the cooperative trajectory x ∗(t)). For an application of the core in differential games see also [3].
So, we have
Let further \(\tilde {C}(x_0,t_0)\) and \(\tilde {C}(x^*(t),t)\), t ∈ [t 0, T] be the core of the game Γ(x 0, t 0) and of Γ(x ∗(t), t), constructed using the characteristic function \(\bar {V}(S;x,t_0)\), defined by the formulas (12.4) and (12.5). Thus, \(\tilde {C}(x_0,t_0)\) is the set of imputations \(\{\tilde \xi _i \}\) such that
and \(\tilde {C}(x^*(t),t)\) is the set of imputations \(\{\tilde {\xi }_i^t\}\), s.t.
Let in the formulas (12.8) and (12.9) ξ(t) be an integrable selector, ξ(t) ∈ C(x ∗(t), t 0), t ∈ [t 0, T]. Define the set
Similarly, we define
We have the following lemma.
Lemma 12.1
Proof
To prove this lemma, we use the necessary and sufficient conditions for imputations from the core (12.10).
We have \( \forall \bar \xi \in \bar C(x_0)\):
For imputations from the (basic) core C(x ∗(t), t) we have
Hence,
and \(\bar C(x_0)\subseteq \tilde C(x_0)\).
The inclusion \(\bar C(x^*(t), t) \subseteq \tilde C(x^*(t), t), \,\,\forall t \in [t_0, T]\) can be proved in a similar way. □
Moreover, we also have the converse result.
Lemma 12.2
Proof
We show that for each imputation \(\tilde {\xi }_i\in \tilde {C}(x_0,t_0)\), \(\tilde {\xi }_i^t \in \tilde {C}(x^*(t), t)\) there exists an integrable selector ξ(t) ∈ C(x ∗(t), t), t ∈ [t 0, T] such that
Since \(\tilde {\xi }^t\) is an imputation, we have
Moreover, by summing up we get
The non-negativeness of the utility functions h i(⋅) implies that there exist α i ≥ 0, i = 1, …, n such that
and
Obviously, that ξ(τ) = {ξ i(τ) = α i(τ) + V ({i};x ∗(τ), τ))} is an imputation in the game with the characteristic function V (S;x ∗(τ), τ)). But we can also prove that ξ(τ) = {ξ i(τ) = α i(τ) + V ({i};x ∗(τ), τ))} belongs to the core C(x ∗(τ), τ). For \(\tilde {\xi }^t \in \tilde C(x^*(t), t)\) we have
and hence we get
The lemma is proved. □
The preceding results imply that
It means, that the core \(\tilde C(x_0, t_0)\) constructed by using characteristic function \(\bar V\) coincides with the set of imputations \(\bar C(x_0, t_0)\) constructed by formula (12.8) for any imputation ξ(t) from the initial core C(x ∗(t), t). Later on we will use the unified notation \(\bar C(x_0, t_0)\) for both sets.
12.5 Strong Time-Consistency
The property of strong dynamic stability (strong time consistency) coincides with the property of dynamic stability (time consistency) for scalar-valued principles of optimality such as the Shapley value [8] or the “proportional solution”. However, for set-valued principles of optimality it has significant and non-trivial sense, which is that any optimal behavior in the subgame with the initial conditions along the cooperative trajectory computed at some intermediate time t ∈ [t 0, T], together with optimal behavior on the time interval [t, T] is optimal in the problem with the initial condition t 0. This property is almost never fulfilled for such set-valued principles of optimality as the core or the NM-solution.
Let us formulate the definition of strong time-consistency for an arbitrary optimality principle M(x 0, t 0) based on previous results, [9]. A slightly different definition was given in [4].
Introduce the subset M(x 0, t 0) of the imputation set L(x 0, t 0) as the optimality principle in the cooperative game Γ(x 0, t 0). M(x 0, t 0) can be a core, a NM-solution, a Shapley value or another one. Similarly, we define this set for all subgames Γ(x ∗(t), t) along the cooperative trajectory x ∗(t).
Definition 12.1
The solution (optimality principle) M(x 0, t 0) is said to be strongly time-consistent in the game Γ(x 0, t 0) if
-
1.
M(x ∗(t), t) ≠ ∅, t ∈ [t 0, T].
-
2.
for any ξ ∈ M(x 0, t 0) there exists a vector-function β(τ) ≥ 0 such that
$$\displaystyle \begin{aligned}M(x_0, t_0) \supset \int_{t_0}^{ t} \beta(\tau)d\tau \oplus {M}(x^*(t),t),\end{aligned} $$∀t ∈ [t 0, T], \(\int _{t_0}^T \beta (t) dt= \xi \in M(x_0, t_0) \).
Here symbol ⊕ is defined as follows. Let a ∈ R n, B ⊂ R n, then
Let us consider the core \(\bar C(x_0, t_0)\) as the set M(x 0, t 0). Thus we have the following lemma.
Lemma 12.3
\(\bar C(x_0, t_0)\) is a strongly time-consistent optimality principle.
Proof
From the definition of the set \(\bar C(x_0, t_0)\) we have that any imputation \(\bar \xi \in \bar C(x_0, t_0)\) has the form (12.8). Then for any \(\bar \xi \in \bar C(x_0, t_0)\) there exists
such that \(\bar \xi = \int _{t_0}^T \bar \beta (t) dt \in \bar C(x_0, t_0)\).
Let us take another imputation \(\hat \xi ^t\) from the core \(\bar C(x^*(t), t)\). Then according to the definition of the set \(\bar C(x^*(t), t)\) we have that there exists a selector \(\hat \xi (t) \) from the initial basic core C(x ∗(t), t) , i.e. \(\hat \xi (t) \in C(x^*(t), t)\) such that
such that \(\hat \xi ^t = \int _{t}^T \hat \beta (t) dt \in \bar C(x^*(t), t) \).
Let us consider the vector-function
It is obvious that \(\check {\xi }(\tau ) \in C(x^*(\tau ), \tau )\), ∀τ ∈ [t 0, T]. Then we have a new vector
where \(\check {\xi }(\tau ) \in C(x^*(\tau ), \tau )\), ∀τ ∈ [t 0, T].
From the definition of the set \(\bar C(x_0, t_0)\) we have that new vector \(\check {\xi } \in \bar C(x_0, t_0)\). The vector \(\hat \xi ^t\) had been taken from the core \(\bar C(x^*(t), t)\) arbitrarily.
So, we have shown that
t ∈ [t 0, T].
The lemma is proved. □
The value
is interpreted as the rate at which the ith player’s component of the imputation, i.e., \(\bar {\xi }_i\), is distributed over the time interval [t 0, T].
12.6 S-strongly Time-Consistency
As above we consider the subset M(x 0, t 0) of the imputation set L(x 0, t 0) as the optimality principle in the cooperative game Γ(x 0, t 0) which can be a core, a NM-solution, a Shapley value or another one. Similarly, we define this set for all subgames Γ(x ∗(t), t) along the cooperative trajectory x ∗(t).
Suppose we have two different optimality principles (cooperative solutions) M(x 0, t 0) and \(\hat M(x_0, t_0)\) such that
∀t ∈ [t 0, T]. Again, we assume that these sets are non-empty during the whole game.
Definition 12.2
The cooperative solution \(\hat M(x_0, t_0)\) is S-strongly time-consistent (dynamically stable) with respect to the set M(x 0, t 0) if for any imputation \(\xi \in \hat M(x_0, t_0)\) there exists β(τ) ≥ 0 such that
∀t ∈ [t 0, T], \(\int _{t_0}^T \beta (t) dt= \xi \in \hat M(x_0, t_0) \).
Here we introduce the definition of strong time-consistency of the optimality principle with respect to another (bigger) set, hence the prefix S-.
This definition means the following: even if the resulting solution will not belong to the initial set \(\hat M(x_0, t_0)\) it will stay within the set M(x 0, t 0) which includes \(\hat M(x_0, t_0)\).
From Definition 12.2 we have the following proposition.
Lemma 12.4
Let the optimality principle M(x 0, t 0) such that M(x ∗(t), t) ≠ ∅, ∀t ∈ [t 0, T] be strongly time-consistent. Then any subset \(\hat M(x_0, t_0)\) , \(\hat M(x_0, t_0) \subseteq M(x_0, t_0)\) such that \(\hat M(x^*(t), t) \neq \emptyset ,\, \hat M(x^*(t), t) \subseteq M(x^*(t), t), \forall t \in [t_0,T]\) , is S-strongly time-consistent with respect to M(x 0, t 0).
12.7 The Construction of a S-strongly Dynamically Stable Subcore
In the following we identify a subset \(\hat {C}(x_0,t_0)\) of the imputations in the set \(\bar {C}(x_0,t_0)\), which would belong to the core C(x 0, t 0), defined on the basis of the classical characteristic function V (S;x 0, t 0).
Consider the value
then the following inequality holds
We introduce a new characteristic function
Similarly, for t ∈ [t 0, T] define the respective characteristic function \(\hat {V}(S;x^*(t),t)\) as
where
From (12.12), (12.13), (12.15) and (12.16) we get
Notice that
In addition, for all S 1, S 2, S 1 ⊂ S 2
Unfortunately, the property of superadditivity for the function \(\hat {V}(S; x^*(t), t)\), t ∈ [t 0, T] does not hold in general. One can write
The preceding inequality leads to the following lemma.
Lemma 12.5
The following inequality holds true:
Denote by \(\hat {C}(x_0,t_0)\) the set of imputations ξ = (ξ 1, …ξ n) such that
Assume that the set \(\hat {C} (x^*(t), t)\) is not empty when t ∈ [t 0, T]. It is easy to see that it is analogous to the core C(x 0, t 0), if the function \(\hat {V}(S;x^*(t), t)\) is chosen as the characteristic function.
Thereby we have the statement.
Theorem 12.1 ([10])
The following inclusion takes place:
We can also formulate the following Theorem (see Fig. 12.1 for an illustration).
Theorem 12.2
The subcore \(\hat {C} (x_0, t_0) \subset C (x_0, t_0)\) is S-strongly time-consistent with respect to the set \(\bar {C} (x_0, t_0)\).
Proof
From Theorem 12.1 we have that \(\hat {C} (x_0, t_0) \subset C (x_0, t_0)\cap \bar {C} (x_0, t_0)\), and hence \(\hat {C} (x_0, t_0) \subset \bar C (x_0, t_0)\). Lemma 12.3 implies that \( \bar C (x_0, t_0)\) is strong-time consistent optimality principle.
Finally, the requested result follows from Lemma 12.4. □
The preceding theorem shows that using the new characteristic function (12.14) we constructed a subset of the classical core C(x 0, t 0) (subcore) in the game Γ(x 0, t 0) which is S-time-consistent with respect to \(\bar C(x_0, t_0)\).
This gives an interesting practical interpretation of the subcore \(\hat {C}(x_0, t_0)\). Selecting the imputation ξ from the subcore as a solution, we guarantee that if the players—when evolving along the cooperative trajectory in subgames—change their mind by switching to another imputation within the current subcore \(\hat {C} (x^*(\tau ), \tau )\), the resulting imputation will not leave the set \(\bar C(x_0, t_0)\) which is also a core in Γ(x 0, t 0), but with the characteristic function of the form \(\bar V(S,\cdot ) \) (12.3) obtained by an integral transformation of classical characteristic function V (S, x ∗(τ), τ) in the games Γ(x ∗(τ), τ).
From Theorem 12.1 it follows that the imputations of type \(\hat {C}(x^*(t),t)\) belong to the classical core of the game Γ(x ∗(t), t) for all t ∈ [t 0, T]. In this sense, Theorem 12.1 establishes a new principle of optimality (cooperative solution).
12.8 Conclusion
In the paper we introduced the definition of S-strong time-consistency in differential games. The approach to the construction of an S-strong time-consistent subcore of the classical core is based on the use of normalized initial characteristic function. We also considered its relation to another characteristic function obtained by an integral extension of the original characteristic function.
We shown that the computed subset of the classical core can be considered as a new optimality principle (cooperative solution) in differential games.
In the future we plan to study the relationship of proposed approach with another constructive approach [12] which allows to identify another subset of the core which is strongly time-consistent.
References
Basar, T., Olsder, G.J.: Dynamic Noncooperative Game Theory. Academic, London (1982)
Dockner, E.J., Jorgensen, S., Long, N.V., Sorger, G.: Differential Games in Economics and Management Science. Cambridge University Press, Cambridge (2000)
Gromova, E.: The Shapley value as a sustainable cooperative solution in differential games of three players. In: Recent Advances in Game Theory and Applications. Springer, Berlin (2016)
Gromova, E.V., Petrosyan, L.A.: Strongly time-consistent cooperative solution for a differential game of pollution control. UBS 55, 140–159 (2015)
Gromova, E.V., Petrosyan, L.A.: On an approach to constructing a characteristic function in cooperative differential games. Autom. Remote Control 78(1680), (2017). https://doi.org/10.1134/S0005117917090120
Neumann, J., Morgenstern, O.: Theory of Games and Economic Behavior. Princeton University Press, Princeton (1947)
Petrosyan, L.A.: Strongly dynamically stable differential optimality principles. Vestnik SPb. Univ. Vyp. 1: Mat. Mekh. Astronom. 4, 40–46 (1993)
Petrosyan, L.A.: Characteristic function in cooperative differential games. Vestnik SPb. Univ. Vyp. 1: Mat. Mekh. Astronom. 1, 48–52 (1995)
Petrosyan, L.A., Danilov, N.N.: Stability of the solutions in nonantagonistic differential games with transferable payoffs. Vestnik Leningrad. Univ. Vyp. 1: Mat. Mekh. Astronom. 1, 52–59 (1979)
Petrosyan, L.A., Pankratova, Y.B.: Construction of strongly time-consistent subcores in differential games with prescribed duration. Trudy Inst. Mat. i Mekh. UrO RAN 23(1), 219–227 (2017)
Petrosyan, L.A., Zenkevich, N.A.: Game Theory. World Scientific, Singapore (2016)
Petrosyan, O.L., Gromova, E.V., Pogozhev, S.V.: Strong time-consistent subset of core in cooperative differential games with finite time horizon. Mat. Teor. Igr Pril. 8(4), 79–106 (2016)
Yeung, D.W.K., Petrosyan, L.A.: Subgame Consistent Economic Optimization, p. 395. Birkhauser, New York (2012)
Acknowledgements
The work has been supported by the grant RSF 17-11-01079.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Petrosyan, L.A., Gromova, E.V. (2018). S-strongly Time-Consistency in Differential Games. In: Petrosyan, L., Mazalov, V., Zenkevich, N. (eds) Frontiers of Dynamic Games. Static & Dynamic Game Theory: Foundations & Applications. Birkhäuser, Cham. https://doi.org/10.1007/978-3-319-92988-0_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-92988-0_12
Published:
Publisher Name: Birkhäuser, Cham
Print ISBN: 978-3-319-92987-3
Online ISBN: 978-3-319-92988-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)