A link between sequential semi-anonymous nonatomic games and their large finite counterparts

Yang, Jian

doi:10.1007/s00182-016-0539-5

A link between sequential semi-anonymous nonatomic games and their large finite counterparts

Original Paper
Published: 01 June 2016

Volume 46, pages 383–433, (2017)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

International Journal of Game Theory Aims and scope Submit manuscript

A link between sequential semi-anonymous nonatomic games and their large finite counterparts

Download PDF

Jian Yang¹

234 Accesses
7 Citations
3 Altmetric
Explore all metrics

Abstract

We show that obtainable equilibria of a multi-period nonatomic game can be used by players in its large finite counterparts to achieve near-equilibrium payoffs. Such equilibria in the form of random state-to-action rules are parsimonious in form and easy to execute, as they are both oblivious of past history and blind to other players’ present states. Our transient results can be extended to a stationary case, where the finite multi-period games are special discounted stochastic games. In both nonatomic and finite games, players’ states influence their payoffs along with actions they take; also, the random evolution of one particular player’s state is driven by all players’ states as well as actions. The finite games can model diverse situations such as dynamic price competition. But they are notoriously difficult to analyze. Our results thus suggest ways to tackle these problems approximately.

Analysis of Markovian Competitive Situations Using Nonatomic Games

Article 03 May 2020

Determinacy in Stochastic Games with Unbounded Payoff Functions

Subgame Optimal Strategies in Finite Concurrent Games with Prefix-Independent Objectives

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

We show that an equilibrium of a random multi-period game involving a continuum of players can be used to achieve asymptotically equilibrium results for its large finite counterparts. The latter finite games can model competitive situations involving random and action-dependent evolution of players’ states which in turn influence period-wise payoffs. Their complex natures make equilibria difficult to locate. In contrast, those for the former continuum-player game are simple in form and relatively easy to obtain. Therefore, a bridge between the two types of games can have broad practical implications.

The former continuum-player game can be termed more formally as a sequential semi-anonymous nonatomic game (SSNG). In it, a continuum of players interact with one another in multiple periods; also, each player’s one-time payoff and random one-period state transition are both swayed by his own state and action, as well as the joint distribution of other players’ states and actions. This is indeed the anonymous sequential game studied by Jovanovic and Rosenthal (1988). We use the name SSNG just to be consistent with the single-period nonatomic-game (NG) literature, where anonymity has been reserved for a more special case. An SSNG’s finite counterpart is almost the same except that only a finite number of players are involved. This more realistic situation is much more difficult to handle.

In a few steps, we demonstrate the usefulness of an SSNG equilibrium in finite multi-period games. First, in a precise language, Theorem 1 describes the gradual retreat of randomness in finite games as the number n of players tends to $+\infty $. This paves way for Theorem 2, which states that an SSNG’s conditional equilibria, in terms of random state-to-action rules, can be used by its large finite counterparts to reach asymptotically equilibrium payoffs on average. A further refinement of this result is achieved in Theorem 3. The above transient results can be extended to the stationary case involving discounted payoffs and infinite time horizons; see Theorem 4. The conditional equilibria that facilitate our study are similar to well-understood distributional equilibria. Their existence is also directly verifiable.

One practical situation to which our results can be applied concerns dynamic price competition. Here, players may be firms producing one identical product type, states may be combinations of the firms’ inventory levels and other static or dynamic characteristics such as unit costs, and actions may be unit prices the firms charge for the product. In every period, the random demand arriving to a firm is dependent on not only its own price but also prices charged by other competitors. The actual sales is further constrained by available inventory. So the player’s one-time payoff is a function of both its own state (inventory and probably also cost) and action (price) and of the distribution of others’ actions (prices). Moreover, the firm’s next-period inventory level depends on its current level, the random demand, and potentially an exogenously given production schedule. So the random single-period state transition is potentially a function of the same factors involved in the payoff.

It is a difficult task to predict or prescribe what inventory-dependent prices the firms will or should charge over a finite time horizon. This can be further complicated by diverse scenarios where firms have different degrees of knowledge on their competitors’ inventory levels and/or costs. Our results, on the other hand, will reveal that the nonatomic counterpart SSNG is easier to tackle. Its equilibria can be plugged back to the actual finite-player situations, without regard to the particularities of the scenarios, and still make reasonably good predictions/prescriptions when the number of players is large enough. We are not equipped to answer how large is “large enough”. But computational study done in a related pricing setting hinted that player numbers “in the tens” seem large enough; see, Yang and Xia (2013).

In the remainder of the paper, Sect. 2 surveys the relevant literature. We then spend Sects. 3 and 4 on essentials of SSNGs and finite games, respectively. In Sect. 5, we demonstrate the key result that state evolutions in large finite games will not veer too far away from their NG counterparts. Section 6 is devoted to the main transient result and Sect. 7 its detailed interpretation. This result is extended to the stationary case in Sect. 8. Implications of these results and existence of our kind of equilibria for SSNGs are shown in Sect. 9. We conclude the paper in Sect. 10.

2 Literature survey

From early on, NGs have been used as easier-to-analyze proxies of real finite-player situations, such as in the study of perfect competition. Systematic research on NG started with Schmeidler (1973). He formulated a single-period semi-anonymous NG, wherein the joint distribution of other players’ identities and actions may affect any given player’s payoff. When the action space is finite, Schmeidler established the existence of pure equilibria when the game becomes anonymous, so that other players’ influence on a game’s outcome is channeled through the marginal action distribution alone. Mas-Colell (1984) showed the existence of distributional equilibria in anonymous NGs with compact metric action spaces. The latter result was extended by Khan and Sun (1990) to a case where players differ on how their preferences over actions are influenced by external action distributions. A survey of related works up until the early 2000s was provided by Khan and Sun (2002).

Much attention has been paid to the topic of pure-equilibrium existence. Khan and Sun (1995) developed a purification scheme involving a countable compact metric action space. Khan and Sun (1999) used non-standard measures on identity spaces and generalized Schmeidler’s pure-equilibrium existence result for more general action spaces. Balder (2002) established pure- and mixed-equilibrium existence results that may be regarded as generalizations of Schmeidler’s corresponding results. Other notable works still include Yu and Zhang (2007) and Balder (2008). On the other hand, Khan et al. (1997) identified a certain limit to which Schmeidler’s result can be extended. Recently, Khan et al. (2013) took players’ diverse bio-social traits into consideration and pinpointed saturation of the player-identity distribution as the key to existence of pure equilibria.

Links between NGs and their finite counterparts were covered in Green (1984), Housman (1988), Carmona (2004), Kalai (2004), Al-Najjar (2008), and Yang (2011). For multi-period games without changing states, Green (1980), Sabourian (1990), and Al-Najjar and Smorodinsky (2001) showed that equilibria for large games are nearly myopic.

SSNGs are both challenging and rewarding to analyze because in them, very realistically, individual states are subject to sways of players’ own actions as well as their opponents’ states and actions. Jovanovic and Rosenthal (1988) established the existence of distributional equilibria for such games. This result was generalized by Bergin and Bernhardt (1995) to cases involving aggregate shocks. In SSNGs’ finite-player counterparts, however, randomness in state-distribution evolution will not go away. Besides, a player’s ability to observe other players’ states and actions might also affect his decision. Presented with these difficulties, it is not surprising that known results on sequential finite-player games are restricted to the stationary setting, where they appear as discounted stochastic games first introduced by Shapley (1953). According to Mertens and Parthasarathy (1987), Duffie et al. (1994), and Solan (1998), for instance, equilibria known to exist for these games come in quite complicated forms that for real implementation, demand a high degree of coordination among players.

It is therefore natural to ask whether sequential finite-player games can be approximated by their NG counterparts. This question has so far been answered by two unpublished articles. For a case unconcerned with the copula between marginal state and action distributions, Bodoh-Creed (2012) provided an affirmative answer, and went on to show for certain cases that limits of large-game equilibria of a myopic form, when in existence, are NG equilibria. Also, Yang (2015) verified for the approximability when both state transitions and action plans are driven by exogenously generated idiosyncratic shocks. Our current study attempts with the most general possible setting, without unduly restricting ways in which a player’s payoff can be influenced by other players’ states and actions or ways in which the game can evolve randomly. To achieve results of the same spirit, we have to overcome technical challenges posed by the new phenomenon of sampling from non-product joint probabilities.

Some authors went on to pursue stationary equilibria (SE), which stressed the long-run steady-state nature of individual action plans and system-wide multi-states; see, e.g., Hopenhayn (1992) and Adlakha and Johari (2013). The oblivious equilibrium (OE) concept as proposed by Weintraub et al. (2008), in order to account for impacts of large players, took the same stationary approach by letting participants beware of only long-run average system states. Weintraub et al. (2011) showed links between equilibria of infinite-player games and their finite-player brethren for a setting where the long-run average system state could be defined. Though applicable to many situations, we caution that the implicit stationarity of SE or OE is incompatible with applications that are transient by nature; for instance, the dynamic pricing game mentioned in Sect. 1.

3 The nonatomic game

The SSNG is a game in which a continuum of players interact with one another over multiple periods. A realistic and yet complicating feature is that players possess individual states which influence their payoffs along with all players’ actions. The random evolutions of these states, meanwhile, are affected by players’ actions. Furthermore, the semi-anonymous nature of the game means that not only what was done, but also who did what to the extent at which states partially reveal player identities, figure large in both payoff formation and state evolution. We now provide a detailed account of the game.

3.1 Game primitives

For some natural number $\bar{t}\in \mathbb {N}$, we let periods $1,2,\ldots ,\bar{t}$ serve as regular periods and period $\bar{t}+1$ as the terminal period. For all periods, we let players’ individual states and actions form, respectively, separable metric spaces S and X. We further require that both spaces be discrete. In this paper, such a space always stands for a separable metric space with countably many elements and the additional feature that the minimum of the distances between any two points remains strictly positive. The discreteness requirement will be useful at one occasion. But most of our derivations will work if the spaces were merely separable metric. Given any separable metric space A, we use $\mathcal{B}(A)$ for its Borel $\sigma $-field and $\mathcal{P}(A)$ for the set of all probability measures on the measurable space $(A,\mathcal{B}(A))$.

To each player, other players’ states and actions are immediately felt in a semi-anonymous fashion, so that what really matters is the joint distribution of other players’ states and actions. This distribution, which we dub “in-action environment”, is a member of the joint state-action distribution space $\mathcal{P}(S\times X)$. In any period $t=1,2,\ldots ,\bar{t}$, a player’s state $s\in S$, his action $x\in X$, and the in-action environment $\tau \in \mathcal{P}(S\times X)$ he faces, together determine his payoff in that period. In particular, there is a function

$$\begin{aligned} \tilde{f}_t:S\times X\times \mathcal{P}(S\times X)\rightarrow [-\bar{f}_t,\bar{f}_t], \end{aligned}$$

(1)

where $\bar{f}_t$ is some positive constant on the real line $\mathbb {R}$. It is required that $\tilde{f}_t(\cdot ,\cdot ,\tau )$ be a measurable map from $S\times X$ to $[-\bar{f}_t,\bar{f}_t]$ for every $\tau \in \mathcal{P}(S\times X)$. For the terminal period $\bar{t}+1$, we let the payoff be 0 in all circumstances.

Now we describe individual players’ random state transitions. Given separable metric spaces A and B, we use $\mathcal{K}(A,B)$ to represent the space of all kernels from A to B. Each member $\kappa \in \mathcal{K}(A,B)\subseteq (\mathcal{P}(B))^A$ satisfies that

(i)
$\kappa (a)$ is a member of $\mathcal{P}(B)$ for each $a\in A$, and
(ii)
for each $B'\in \mathcal{B}(B)$, the real-valued function $\kappa (\cdot |B')$ is measurable.

Note that we have used $\kappa (a|B')$ rather than the more conventional $\kappa (B'|a)$ to denote the conditional probability for $B'\in \mathcal{B}(B)$ when given $a\in A$. The current notation allows us to always read a formula from left to right. Now in period $1,2,\ldots ,\bar{t}$, let there be a function

$$\begin{aligned} \tilde{g}_t:S\times X\times \mathcal{P}(S\times X)\rightarrow \mathcal{P}(S), \end{aligned}$$

(2)

so that $\tilde{g}_t(\cdot ,\cdot ,\tau )$ is a member of $\mathcal{K}(S\times X,S)$ for each $\tau \in \mathcal{P}(S\times X)$. For convenience, we use $\mathcal{G}(S,X)$ to denote the space of all such functions, or what we shall call “state transition kernels”. In period t, when a player is in individual state $s\in S$, takes action $x\in X$, and faces in-action environment $\tau \in \mathcal{P}(S\times X)$, there will be a $\tilde{g}_t(s,x,\tau |S')$ chance for his state in period $t+1$ to be in any $S'\in \mathcal{B}(S)$.

This setup is versatile enough to embrace different player characteristics. For instance, each $s\in S$ may comprise two components $\theta $ and $\omega $, with the $\tilde{g}_t$’s defined through (2) dictating that $\theta $ stays static over time to serve as a player’s innate type. Certainly, the $\tilde{f}_t$’s defined through (1) can have all kinds of trends over $\theta $ to reflect players’ varying payoff structures.

3.2 Evolution of the environments

In any period $1,2,\ldots ,\bar{t},\bar{t}+1$, by “pre-action environment” we mean the state distribution $\sigma \in \mathcal{P}(S)$ of all players. With $\bar{t}$, S, X, $(\tilde{f}_t|t=1,2,\ldots ,\bar{t})$, and $(\tilde{g}_t|t=1,2,\ldots ,\bar{t})$ all given in the background, we use $\Gamma (\sigma _1)$ to denote an (SS)NG with $\sigma _1\in \mathcal{P}(S)$ as its initial period-1 pre-action environment. For this NG, we can use $\chi _{[1\bar{t}]}=(\chi _t\mid t=1,\ldots ,\bar{t})\in (\mathcal{K}(S,X))^{\bar{t}}$ to denote a policy profile. Here, each $\chi _t\in \mathcal{K}(S,X)$ is a map from a player’s state to the player’s random action choice. Together with the given initial environment $\sigma _1$, this policy profile will help to generate a deterministic pre-action environment trajectory $\sigma _{[1,\bar{t}+1]}=(\sigma _t\mid t=1,2,\ldots ,\bar{t},\bar{t}+1)\in (\mathcal{P}(S))^{\bar{t}+1}$ in an iterative fashion. This process is also intertwined with the formation of in-action environments $\tau _1,\tau _2,\ldots ,\tau _{\bar{t}}$ faced by all players in periods $1,2,\ldots ,\bar{t}$.

More notation is needed to precisely describe this evolution. Given distribution $p\in \mathcal{P}(A)$ and kernel $\kappa \in \mathcal{K}(A,B)$ for separable metric spaces A and B, there is a natural product $p\otimes \kappa \in \mathcal{P}(A\times B)$, such that

$$\begin{aligned} (p\otimes \kappa )(A'\times B')=\int _{A'}p(da)\cdot \kappa (a|B'),\quad \forall A'\in \mathcal{B}(A),B'\in \mathcal{B}(B). \end{aligned}$$

(3)

Here, $p\otimes \kappa $ is essentially the joint distribution generated by the marginal p and conditional distribution $\kappa $. Obviously, $(p\otimes \kappa )|_A$, the marginal of $p\otimes \kappa $ on A, is p. At the same time, we use $p\odot \kappa $ to denote the marginal $(p\otimes \kappa )|_B$, which satisfies

$$\begin{aligned} (p\odot \kappa )(B')= & {} (p\otimes \kappa )|_B(B')=(p\otimes \kappa )(A\times B')\nonumber \\= & {} \int _A p(da)\cdot \kappa (a|B'),\quad \forall B'\in \mathcal{B}(B). \end{aligned}$$

(4)

Suppose pre-action environment $\sigma _t\in \mathcal{P}(S)$ has been given for some period $t=1,\ldots ,\bar{t}$. Then, for every player with starting state $s_t$ in the period, his random action will be sampled from the distribution $\chi _t(s_t|\cdot )$ where as noted before, $\chi _t\in \mathcal{K}(S,X)$ is every player’s behavioral guide. Thus, all players will together form the commonly felt in-action environment

$$\begin{aligned} \tau _t=\sigma _t\otimes \chi _t. \end{aligned}$$

(5)

For each individual player with state $s_t$ and realized action $x_t$, his state $s_{t+1}$ in period $t+1$ will, by (2), be distributed according to $\tilde{g}_t(s_t,x_t,\tau _t|\cdot )$. Thus, it will be reasonable for the pre-action environment in period $t+1$ to follow $\sigma _{t+1}=\tau _t\odot \tilde{g}_t(\cdot ,\cdot ,\tau _t)$, with

$$\begin{aligned}{}[\tau _t\odot \tilde{g}_t(\cdot ,\cdot ,\tau _t)](S')=\int _{S\times X}\tau _t(ds\times dx)\cdot \tilde{g}_t(s,x,\tau _t|S'),\quad \forall S'\in \mathcal{B}(S). \end{aligned}$$

(6)

Although (6) has been intuitively reasoned from (2), we caution that logically it is part of the NG’s definition rather than something derivable from the latter.

The transition from $\sigma _t$ to $\sigma _{t+1}$ through random action plan $\chi _t$ is best expressed by an operator. For any kernel $\chi \in \mathcal{K}(S,X)$, define operator $T_t(\chi )$ on the space $\mathcal{P}(S)$, so that

$$\begin{aligned} T_t(\chi )\circ \sigma =(\sigma \otimes \chi )\odot \tilde{g}_t(\cdot ,\cdot ,\sigma \otimes \chi )=\sigma \odot \chi \odot \tilde{g}_t(\cdot ,\cdot ,\sigma \otimes \chi ),\quad \forall \sigma \in \mathcal{P}(S).\nonumber \\ \end{aligned}$$

(7)

Basically, state distribution $\sigma $ and random state-dependent action plan $\chi $ first fuse to form the joint state-action distribution $\sigma \otimes \chi $ to be felt by all players. The latter’s random state transitions are then guided by the kernel $\tilde{g}_t(\cdot ,\cdot ,\sigma _t\otimes \chi )$. Subsequently, after “averaging out” impacts of actions, the next-period state distribution will become $\sigma \odot \chi \odot \tilde{g}_t(\cdot ,\cdot ,\sigma \otimes \chi )$. The one-period pre-action environment transition is now representable by

$$\begin{aligned} \sigma _{t+1}=T_t(\chi _t)\circ \sigma _t=\sigma _t\odot \chi _t\odot \tilde{g}_t(\cdot ,\cdot ,\sigma _t\otimes \chi _t). \end{aligned}$$

(8)

For periods t and $t'$ with $t\le t'$, as well as sequence $\chi _{[tt']}=(\chi _{t''}|t''=t,\ldots ,t')$ of action plans, we can iteratively define $T_{[tt']}(\chi _{[tt']})$, so that

$$\begin{aligned} T_{[tt']}(\chi _{[tt']})\circ \sigma _t=T_{t'}(\chi _{t'})\circ (T_{[t,t'-1]}(\chi _{[t,t'-1]})\circ \sigma _t),\quad \forall \sigma _t\in \mathcal{P}(S). \end{aligned}$$

(9)

The left-hand side will be players’ state distribution in period $t'+1$ when they start period t with the distribution $\sigma _t$ and adopt the action sequence $\chi _{[tt']}$ in the interim. Note that $T_{[tt]}(\chi _{[tt]})$ is nothing but $T_t(\chi _t)$. As a default, we let $T_{[t,t-1]}$ stand for the identity operator on $\mathcal{P}(S)$. The environment trajectory $\sigma _{[1,\bar{t}+1]}$ satisfies

$$\begin{aligned} \sigma _{[1,\bar{t}+1]}=(T_{[1,t-1]}(\chi _{[1,t-1]})\circ \sigma _1\mid t=1,2,\ldots ,\bar{t},\bar{t}+1). \end{aligned}$$

(10)

It is deterministic by definition.

4 The n-player game

Let the same $\bar{t}$, S, X, $(\tilde{f}_t|t=1,2,\ldots ,\bar{t})$, and $(\tilde{g}_t|t=1,2,\ldots ,\bar{t})$ remain in the background. For some $n\in \mathbb {N}\setminus \{1\}$ and initial multi-state $s_1=(s_{11},s_{12},\ldots ,s_{1n})\in S^n$, we can define an n-player game $\Gamma _n(s_1)$, in which each $s_{1m}\in S$ is player m’s initial state. The game’s payoffs and state evolutions are still described by the $\tilde{f}_t$’s and $\tilde{g}_t$’s, respectively. However, details are messier as outside environments vary from player to player and their evolutions are random.

For $a\in A$, where A is again a separable metric space, we use $\delta _a$ to denote the singleton Dirac measure with $\delta _a(\{a\})=1$. For $a=(a_1,\ldots ,a_n)\in A^n$ where $n\in \mathbb {N}$, we use $\varepsilon _a$ for $\sum _{m=1}^n \delta _{a_m}/n$, the empirical distribution generated by the vector a. We also use $\mathcal{P}_n(A)$ to denote the space of probability measures of the type $\varepsilon _a$ for $a\in A^n$, i.e., the space of empirical distributions generated from n samples. Now back at the game $\Gamma _n(s_1)$, suppose in period $t=1,2,\ldots ,\bar{t}$, each player $m=1,2,\ldots ,n$ is in state $s_{tm}$ and takes action $x_{tm}$. Then, the in-action environment experienced by player 1 will be $\varepsilon _{s_{t,-1}x_{t,-1}}=\varepsilon _{((s_{t2},x_{t2}),\ldots ,(s_{tn},x_{tn}))}$. Thus, this player will receive payoff $\tilde{f}_t(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}})$ in the period, and his period-$(t+1)$ state $s_{t+1,1}$ will be sampled from the distribution $\tilde{g}_t(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}}|\cdot )$.

Suppose $\chi _{[1\bar{t}]}=(\chi _t\mid t=1,\ldots ,\bar{t})\in (\mathcal{K}(S,X))^{\bar{t}}$ again describes the policy adopted by all n players. Unlike in an NG, this time $\chi _{[1\bar{t}]}$ will help to generate a stochastic as opposed to deterministic environment trajectory. To describe each one-period transition in this complex process, we rely on the kernel $\chi _t^{\;n}\odot \tilde{g}_t^{\;n}\in \mathcal{K}(S^n,S^n)$ defined by

$$\begin{aligned} (\chi _t^{\;n}\odot \tilde{g}_t^{\;n})(s|S')=\int _{X^n}\chi _t^{\;n}(s|dx)\cdot \tilde{g}_t^{\;n}(s,x|S'),\quad \forall s\in S^n, S'\in \mathcal{B}(S^n), \end{aligned}$$

(11)

where $\chi _t^{\;n}$ is a member of $\mathcal{K}(S^n,X^n)$ that satisfies

$$\begin{aligned} \chi _t^{\;n}(s|X'_1\times \cdots \times X'_n)=\Pi _{m=1}^n\chi _t(s_m|X'_m),\quad \forall s\in S^n,X'_1,\ldots ,X'_n\in \mathcal{B}(X),\qquad \end{aligned}$$

(12)

and $\tilde{g}_t^{\;n}$ is a member of $\mathcal{K}(S^n\times X^n,S^n)$ that satisfies

$$\begin{aligned} \tilde{g}_t^{\;n}(s,x|S'_1\times \cdots \times S'_n)= & {} \Pi _{l=1}^n\tilde{g}_t(s_l,x_l,\varepsilon _{s_{-l}x_{-l}}|S'_l), \nonumber \\&\forall (s,x)\in S^n\times X^n,S'_1,\ldots ,S'_n\in \mathcal{B}(S). \end{aligned}$$

(13)

In combination, (11) can be spelled out as

$$\begin{aligned} (\chi _t^{\;n}\odot \tilde{g}_t^{\;n})(s|S'_1\times \cdots \times S'_n)=\int _{X^n}\Pi _{m=1}^n \chi _t(s_m|dx_m)\cdot \Pi _{l=1}^n \tilde{g}_t(s_l,x_l,\varepsilon _{s_{-l}x_{-l}}|S'_l).\nonumber \\ \end{aligned}$$

(14)

The above reflects that, each player m samples his action $x_m$ from the distribution $\chi _t(s_m|\cdot )$; once all players’ actions $x=(x_1,\ldots ,x_n)$ have been determined, each player l will face his unique in-action environment $\varepsilon _{s_{-l}x_{-l}}$; thus, this player’s period-$(t+1)$ state will be sampled from the distribution $\tilde{g}_t(s_l,x_l,\varepsilon _{s_{-l}x_{-l}}|\cdot )$.

When the n players start period t with a random multi-state with distribution $\pi _{nt}\in \mathcal{P}(S^n)$ and they act according to random rule $\chi _t\in \mathcal{K}(S,X)$ in the period, they will generate the joint distribution $\mu _{nt}\in \mathcal{P}(S^n\times X^n)$ of period-t multi-state and -action satisfying

$$\begin{aligned} \mu _{nt}=\pi _{nt}\otimes \chi _t^{\;n}. \end{aligned}$$

(15)

According to (3) and (12), the above means that, for any $S'\in \mathcal{B}(S^n)$ and $X'_1,\ldots ,X'_n\in \mathcal{B}(X)$,

$$\begin{aligned} \mu _{nt}(S'\times X'_1\times \cdots \times X'_n)= & {} \int _{S'}\pi _{nt}(ds)\cdot \chi _t^{\;n}(s|X'_1\times \cdots \times X'_n)\nonumber \\= & {} \int _{S'}\pi _{nt}(ds)\cdot \Pi _{m=1}^n \chi _t(s_m|X'_m). \end{aligned}$$

(16)

Clearly, (15) corresponds to (5) in the NG situation.

By (11), the period-$(t+1)$ multi-state distribution $\mu _{nt}\odot \tilde{g}_t^{\;n}\in \mathcal{P}(S^n)$ will follow

$$\begin{aligned} (\mu _{nt}\odot \tilde{g}_t^{\;n})(S')=\int _{S^n\times X^n}\mu _{nt}(ds\times dx)\cdot \tilde{g}_t^{\;n}(s,x|S'),\quad \forall S'\in \mathcal{B}(S^n). \end{aligned}$$

(17)

Combining (15) and (17), we can see that the one-period transition between multi-states is

$$\begin{aligned} \pi _{n,t+1}=(\pi _{nt}\otimes \chi _t^{\;n})\odot \tilde{g}_t^{\;n}=\pi _{nt}\odot \chi _t^{\;n}\odot \tilde{g}_t^{\;n}. \end{aligned}$$

(18)

Note (18) is the n-player game’s answer to the NG’s (8). Similar to (9), for $t\le t'$, the distribution $\pi _{nt'}$ of period-$t'$ multi-state $s_{t'}$ is given by

$$\begin{aligned} \pi _{nt'}=\pi _{nt}\odot \Pi _{t''=t}^{t'-1}(\chi _{t''}^{\;n}\odot \tilde{g}_{t''}^{\;n}). \end{aligned}$$

(19)

When the initial multi-state $s_1$ is randomly drawn from distribution $\pi _{n1}$, the entire trajectory $\pi _{n,[1,\bar{t}+1]}=(\pi _{nt}|t=1,2,\ldots ,\bar{t},\bar{t}+1)$ of the n-player game’s multi-state distributions can be written as

$$\begin{aligned} \pi _{n,[1,\bar{t}+1]}=\left( \pi _{n1}\odot \Pi _{t'=1}^{t-1}(\chi _{t'}^{\;n}\odot \tilde{g}_{t'}^{\;n})|t=1,2,..,\bar{t},\bar{t}+1\right) . \end{aligned}$$

(20)

When all players’ states are sampled from some $\sigma _1\in \mathcal{P}(S)$, we still have (20) as the trajectory for multi-state distributions, but with $\pi _{n1}=\sigma _1^{\;n}$. When recognizing $\pi _{n1}=\delta _{s_1}$, the Dirac measure in $\mathcal{P}(S^n)$ that assigns the full weight to $s_1$, (20) will help describe the evolution of the multi-state distribution for the n-player game $\Gamma _n(s_1)$, much like (10) did for $\Gamma (\sigma _1)$.

5 Convergence of aggregate environments

Even before touching upon notions like cumulative payoffs and equilibria, we can already introduce an interesting link between finite games and NGs. It is in terms of an asymptotic relationship between a sequence $\pi _{n,[t,\bar{t}+1]}=(\pi _{nt'}|t'=t,t+1,\ldots ,\bar{t}+1)$ of multi-state distributions in n-player games and a sequence $\sigma _{[t,\bar{t}+1]}=(\sigma _{t'}|t'=t,t+1,\ldots ,\bar{t}+1)$ of state distributions in their NG counterparts. The message is that, when starting from similar environments in period t and adopting the same action plan from that period on, stochastic environment paths experienced by large finite games will not drift too much away from the NG’s deterministic environment trajectory. We refrain from using the word convergence because the $\pi _{nt'}$’s reside in different spaces for different n’s.

First, we propose the concept asymptotic resemblance in order to precisely describe the way in which members in a sequence of probability measures increasingly resemble the products of a given measure. For a separable metric space A, the space $\mathcal{P}(A)$ is metrized by the Prohorov metric $\rho _A$, which induces the weak topology on it. At fixed $n\in \mathbb {N}$, the map $\varepsilon _{(\cdot )}$ from $A^n$ to $\mathcal{P}_n(A)\subseteq \mathcal{P}(A)$ is continuous. Therefore, for any $p\in \mathcal{P}(A)$ and $\epsilon >0$, the set $\{a\in A^n|\rho _A(\varepsilon _a,p)<\epsilon \}$ is an open subset of $A^n$ and thus a member of $\mathcal{B}(A^n)$.

Definition 1

For a separable metric space A, suppose $p\in \mathcal{P}(A)$ and for each $n\in \mathbb {N}$, $q_n\in \mathcal{P}(A^n)$. We say that sequence $q_n$ asymptotically resembles the sequence $p^n$ made up of p’s n-th order products $p\times \cdots \times p$, if for any $\epsilon >0$ and n that is large enough,

$$\begin{aligned} q_n(\{a\in A^n|\rho _A(\varepsilon _a,p)<\epsilon \})>1-\epsilon . \end{aligned}$$

Definition 1 says that sequence $q_n$ will asymptotically resemble the sequence $p^n$ of product measures when the empirical distribution $\varepsilon _a$ of a random vector $a=(a_1,\ldots ,a_n)$, sampled from $q_n$, is highly likely to be close to p as n approaches $+\infty $. This resemblance notion is consistent with Prohorov’s theorem (Parthasarathy 2005, Theorem II.7.1), whose weak version is presented as Lemma 2 in Appendix 1. Due to it, any sequence $(p')^n$ will asymptotically resemble the sequence $p^n$ if and only if $p'=p$.

Some results related to the resemblance concept have been placed in Appendix 1. Lemma 3 stems from Dvoretzky, Kiefer and Wolfolwitz’s (1956) inequality and makes the convergence in Lemma 2 uniform in the chosen probability p. According to Lemma 4, the tampering of one component within any n-long vector $a\in A^n$ would not much alter $\varepsilon _a$. It is therefore natural for Lemma 5 to state that the resemblance of $q_n$ to $p^n$ would lead to that of the $A^{n-1}$-marginal $q_n|_{A^{n-1}}$ to $p^{n-1}$. Lemma 6 says that the above would also lead to the asymptotic resemblance of $p'\times q_{n-1}$ to $p^n$ for any $p'$. So in general there can be nothing substantial regarding the relationship between the A-marginals $q_n|_A$ and p. Finally, Lemma 7 shows that asymptotic resemblance is preserved under the projection of $A\times B$ into A.

The following one-step result states that asymptotic resemblance concerning pre-action environments is translatable into that concerning in-action environments; also, the same resemblance is preserved after undergoing one single step in a game.

Proposition 1

Let state distribution $\sigma \in \mathcal{P}(S)$, random state-dependent action plan $\chi \in \mathcal{K}(S,X)$, and state-transition kernel $g\in \mathcal{G}(S,X)$, with the latter enjoying the continuity of $g(s,x,\tau )$ in the joint state-action distribution $\tau $ at an (s, x)-independent rate. Also, multi-state distribution $\pi _n\in \mathcal{P}(S^n)$ for each $n\in \mathbb {N}$. Suppose further that the sequence $\pi _n$ asymptotically resembles the sequence $\sigma ^n$. Then,

(i)
the sequence $\pi _n\otimes \chi ^n$ will asymptotically resemble the sequence $(\sigma \otimes \chi )^n$, and
(ii)
the sequence $\pi _n\odot \chi ^n\odot g^n$ will asymptotically resemble the sequence $(\sigma \odot \chi \odot g(\cdot ,\cdot ,\sigma \otimes \chi ))^n$.

Indeed, (ii) remains valid under mild contamination. That is, for any $(s,x)\in S\times X$,
(iii)
the sequence $(\delta _{sx}\times (\pi _{n-1}\otimes \chi ^{n-1}))\odot g^n$ will asymptotically resemble the sequence $(\sigma \odot \chi \odot g(\cdot ,\cdot ,\sigma \otimes \chi ))^n$ at a rate independent of the chosen (s, x).

Proposition 1 is one of our two most technical results. Its proof invokes both Prohorov’s theorem (Parthasarathy 2005, Theorem II.7.1) on the convergence of empirical distributions and for parts (ii) and (iii), Dvoretzky, Kiefer and Wolfolwitz’s (1956) inequality which provides the uniformity of such convergence. In the proposition, part (i) stresses the passibility from convergence of pre-action environments to that of same-period in-action environments, see (5) and (15); part (ii) further points out that convergence in next-period pre-action environments will follow suit, see (8) and (18); also, part (iii) will be useful when we take the view point from one single player.

To take advantage of Proposition 1, we now assume the equi-continuity of the state transitions with respect to in-action environments.

Assumption 1

Each transition kernel $\tilde{g}_t(s,x,\tau )$ is continuous in $\tau $ at an (s, x)-independent rate. That is, for any in-action environment $\tau \in \mathcal{P}(S\times X)$ and $\epsilon >0$, there is $\delta >0$, such that for any $\tau '\in \mathcal{P}(S\times X)$ satisfying $\rho _{S\times X}(\tau ,\tau ')<\delta $ and any $(s,x)\in S\times X$,

$$\begin{aligned} \rho _S(\tilde{g}_t(s,x,\tau ),\tilde{g}_t(s,x,\tau '))<\epsilon . \end{aligned}$$

We are in a position to derive this section’s main result. It states that, when an NG and its finite counterparts evolve under the same action plan, environment pathways of large finite games, though stochastic, will resemble the deterministic pathway of the NG.

Theorem 1

Let a policy profile $\chi _{[t\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t+1}$ for periods $t,t+1,\ldots ,\bar{t}$ be given. When $s_t=(s_{t1},\ldots ,s_{tn})$ has a distribution $\pi _{nt}$ that asymptotically resembles $\sigma _t^{\;n}$, the series $(\pi _{nt}\odot \Pi _{t''=t}^{t'-1}(\chi _{t''}^{\;n}\odot \tilde{g}_{t''}^{\;n})\mid t'=t,t+1,\ldots ,\bar{t},\bar{t}+1)$ will asymptotically resemble $((T_{[t,t'-1]}(\chi _{[t,t'-1]})\circ \sigma _t)^n\mid t'=t,t+1,\ldots ,\bar{t},\bar{t}+1)$ as well. That is, for any $\epsilon >0$ and any n that is large enough,

$$\begin{aligned}{}[\pi _{nt}\odot \Pi _{t''=t}^{t'-1}(\chi _{t''}^{\;n}\odot \tilde{g}_{t''}^{\;n})](\tilde{A}_{nt'}(\epsilon ))>1-\epsilon ,\quad \forall t'=t,t+1,\ldots ,\bar{t}+1, \end{aligned}$$

where for each $t'$, the set of multi-states $\tilde{A}_{nt'}(\epsilon )\in \mathcal{B}(S^n)$ is such that,

$$\begin{aligned} \rho _S(\varepsilon _{s_{t'}},T_{[t,t'-1]}(\chi _{[t,t'-1]})\circ \sigma _t)<\epsilon , \quad \forall s_{t'}\in \tilde{A}_{nt'}(\epsilon ). \end{aligned}$$

Suppose an NG starts period t with pre-action environment $\sigma _t$ and a slew of finite games start the period with pre-action environments that are ever nearly sampled from $\sigma _t$. Let the evolution of both types of games be guided by players acting according to the same policy profile $\chi _{[t\bar{t}]}$. Then, as the numbers of players n involved in finite games grow indefinitely, Theorem 1 predicts for ever less chances for the finite games’ period-$t'$ environments $\varepsilon _{s_{t'}}$ to be even slightly away from the NG’s deterministic period-$t'$ environment $T_{[t,t'-1]}(\chi _{[t,t'-1]})\circ \sigma _t$. For some fixed $\sigma _1\in \mathcal{P}(S)$, we can plug $t=1$ and $\pi _{n1}=\sigma _1^{\;n}$ into Theorem 1. Then, we will obtain the proximity between $\sigma ^{\;n}_{[1,\bar{t}+1]}=(\sigma ^{\;n}_t|t=1,2,\ldots ,\bar{t},\bar{t}+1)$ and $\pi _{n,[1,\bar{t}+1]}=(\pi _{nt}|t=1,2,\ldots ,\bar{t},\bar{t}+1)$ for large n’s, where every $\sigma _t=T_{[1,t-1]}(\chi _{[1,t-1]})\circ \sigma _1$ and every $\pi _{nt}=\sigma _1^{\;n}\odot \Pi _{t'=1}^{t-1}(\chi _{t'}^{\;n}\odot \tilde{g}_{t'}^{\;n})$. In view of (10) and (20), this means that when large games sample their initial states from an NG’s starting distribution $\sigma _1$, the former games’ state-distribution trajectories will remain close to that of the latter game.

Our confinement so far to discrete spaces S and X arises mainly from the need to deal with non-product joint probabilities of the form $p\otimes \kappa $; see (3). In Yang (2015), where random state transitions and random action plans were modeled through independently generated shocks, only results pertaining to product-form probabilities $p\times q$, where q is an ordinary rather than conditional probability, were needed. Because of this, known properties like Propositions III.4.4 and III.4.6 of Ethier and Kurtz (1986) could be put to good use. Results there could thus be based on complete state and shock spaces. In contrast, if we were to consider more general spaces here, we would face the presently unsurmountable challenge of passing the closeness between measures p and $p_i$ for $i=1,2,\ldots ,n$ onto that between $p^n$ and $\prod _{i=1}^n p_i$ when n itself tends to infinity.

6 NG and finite-game equilibria

We present this paper’s main result that an NG equilibrium, though oblivious of past history and blind to other players’ states, will generate minimal regrets when adopted by players in large finite games. First, we introduce equilibrium concepts used in both types of games.

6.1 Equilibria in NG

In defining the NG $\Gamma (\sigma _1)$’s equilibria, we subject a candidate policy profile to one-time deviation of a single player, who is by default infinitesimal in influence. Note the deviation will not alter the environment trajectory corresponding to the candidate profile. With this understanding, we define $v_t(s_t,\xi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]})$ as the total expected payoff a player can receive from period t to $\bar{t}$, when he starts with state $s_t\in S$ and adopts action plan $\xi _{[t\bar{t}]}=(\xi _t,\ldots ,\xi _{\bar{t}})\in (\mathcal{K}(S,X))^{\bar{t}-t+1}$ throughout, while other players form initial pre-action environment $\sigma _t\in \mathcal{P}(S)$ and adopt policy profile $\chi _{[t\bar{t}]}=(\chi _t,\ldots ,\chi _{\bar{t}})\in (\mathcal{K}(S,X))^{\bar{t}-t+1}$ throughout. As a terminal condition, we certainly have

$$\begin{aligned} v_{\bar{t}+1}(s_{\bar{t}+1},\sigma _{\bar{t}+1})=0. \end{aligned}$$

(21)

For $t=\bar{t},\bar{t}-1,\ldots ,1$, we have the recursive relationship

$$\begin{aligned}&v_t(s_t,\xi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]}) =\int _X \xi _t(s_t|dx_t)\cdot \bigg [\tilde{f}_t(s_t,x_t,\sigma _t\otimes \chi _t)\nonumber \\&\quad +\int _S \tilde{g}_t(s_t,x_t,\sigma _t\otimes \chi _t|ds_{t+1})\cdot v_{t+1}(s_{t+1},\xi _{[t+1,\bar{t}]},T_t(\chi _t)\circ \sigma _t,\chi _{[t+1,\bar{t}]})\bigg ].\qquad \end{aligned}$$

(22)

This is because the player’s action is guided in a random fashion by $\xi _t$, its payoff is determined by $\tilde{f}_t$, its state evolution is governed by $\tilde{g}_t$, and its future payoff is supplied by $v_{t+1}$; also, after undergoing the commonly adopted action plan $\chi _t$, the period-$(t+1)$ pre-action environment $\sigma _{t+1}$ will be $T_t(\chi _t)\circ \sigma _t$ as shown in (8). The choice of $\xi _t$ affects the current player’s period-t action $x_t$, his period-$(t+1)$ state $s_{t+1}$, and his future state-action trajectory. However, the change at this negligible player does not alter the period-t in-action environment $\sigma _t\otimes \chi _t$ as listed in (5) or any environment in the future. This is the main reason why NGs are easier to handle than their finite-player counterparts.

Now, we deem policy $\chi _{[1\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}}$ a Markov equilibrium for the game $\Gamma (\sigma _1)$ when, for every $t=1,2,\ldots ,\bar{t}$ and $\xi _t\in \mathcal{K}(S,X)$,

$$\begin{aligned} v_t(s_t,\chi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]})\ge v_t(s_t,(\xi _t,\chi _{[t+1,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]}),\quad \forall s_t\in S, \end{aligned}$$

(23)

where

$$\begin{aligned} \sigma _t=T_{[1,t-1]}(\chi _{[1,t-1]})\circ \sigma _1. \end{aligned}$$

(24)

That is, policy $\chi _{[1\bar{t}]}$ will be regarded an equilibrium when no player can be better off by unilaterally deviating to any alternative plan $\xi _t\in \mathcal{K}(S,X)$ in any single period t. The definition of $\sigma _t$ in (24) underscores the evolution of the deterministic environment trajectory following the adoption of action plan $\chi _{[1\bar{t}]}$ by almost all players.

6.2 $\epsilon $-equilibria in n-player games

For an n-player game, let $v_{nt}(s_{t1},\xi _{[t\bar{t}]},\varepsilon _{s_{t,-1}},\chi _{[t\bar{t}]})$ be the total expected payoff player 1 can receive from period t to $\bar{t}$, when he starts with state $s_{t1}\in S$ and adopts action plan $\xi _{[t\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t+1}$ throughout, while other players form initial empirical state distribution $\varepsilon _{s_{t,-1}}=\varepsilon _{(s_{t2},\ldots ,s_{tn})}\in \mathcal{P}_{n-1}(S)$ and adopt action plan $\chi _{[t\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t+1}$ throughout. As a terminal condition, we have

$$\begin{aligned} v_{n,\bar{t}+1}(s_{\bar{t}+1,1},\varepsilon _{s_{\bar{t}+1,-1}})=0. \end{aligned}$$

(25)

For $t=\bar{t},\bar{t}-1,\ldots ,1$, we have the recursive relationship

$$\begin{aligned} v_{nt}(s_{t1},\xi _{[t\bar{t}]},\varepsilon _{s_{t,-1}},\chi _{[t\bar{t}]})= & {} \int _X\xi _t(s_{t1}|dx_{t1})\cdot \int _{X^{n-1}}\chi _t^{\;n-1}(s_{t,-1}|dx_{t,-1})\nonumber \\&\times \,\bigg [\tilde{f}_t(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}}) +\int _{S^n}\tilde{g}_t^{\;n}(s_t,x_t|ds_{t+1})\nonumber \\&\cdot \, v_{n,t+1}(s_{t+1,1},\xi _{[t+1,\bar{t}]},\varepsilon _{s_{t+1},-1},\chi _{[t+1,\bar{t}]})\bigg ], \end{aligned}$$

(26)

where the meaning of $\chi _t^{\;n-1}(s_{t,-1}|dx_{t,-1})$ follows from (12) and that of $\tilde{g}_t^{\;n}(s_t,x_t|ds_{t+1})$ follows from (13). Note (26) differs substantially from its NG counterpart (22). With only a finite number of players, player 1’s one-time choice $\xi _t$ not only affects his own future actions and states as before, but differently, starting from the altered in-action environment $\varepsilon _{s_tx_t}$, it also impacts the entire future trajectory of all other players. Note $\varepsilon _{s_tx_t}$ impacts the generation of $s_{t+1}=(s_{t+1,1},\ldots ,s_{t+1,n})$ in its projections to n different $(n-1)$-dimensional spaces, as according to (13), $\int _{S^n}\tilde{g}_t^{\;n}(s_t,x_t|ds_{t+1})$ amounts to $\Pi _{m=1}^n\int _S \tilde{g}_t(s_{tm},x_{tm},\varepsilon _{s_{t,-m}x_{t,-m}}|ds_{t+1,m})$.

For each $n\in \mathbb {N}\setminus \{1\}$, let $\hat{\pi }_{n-1,[1\bar{t}]}=(\hat{\pi }_{n-1,t}\mid t=1,\ldots ,\bar{t})\in (\mathcal{P}(S^{n-1}))^{\bar{t}}$ be a series of other-player multi-state distributions. For $\epsilon \ge 0$, we deem $\chi _{[1\bar{t}]}=(\chi _t\mid t=1,\ldots ,\bar{t})\in (\mathcal{K}(S,X))^{\bar{t}}$ an $\epsilon $-Markov equilibrium for the game family $(\Gamma _n(s_1)\mid s_1\in S^n)$ in the sense of $\hat{\pi }_{n-1,[1\bar{t}]}$ when, for every $t=1,\ldots ,\bar{t}$, $\xi _{[t\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t+1}$, and $s_{t1}\in S$,

$$\begin{aligned}&\int _{S^{n-1}}\hat{\pi }_{n-1,t}(ds_{t,-1})\cdot v_{nt}(s_{t1},\chi _{[t\bar{t}]},\varepsilon _{s_{t,-1}},\chi _{[t\bar{t}]})\nonumber \\&\quad \ge \int _{S^{n-1}}\hat{\pi }_{n-1,t}(ds_{t,-1})\cdot v_{nt}(s_{t1},\xi _{[t\bar{t}]},\varepsilon _{s_{t,-1}},\chi _{[t\bar{t}]})-\epsilon . \end{aligned}$$

(27)

That is, action plan $\chi _{[1\bar{t}]}$ will be an $\epsilon $-Markov equilibrium in the sense of $\hat{\pi }_{n-1,[1\bar{t}]}$ when under the plan’s guidance, the average payoff from any period t and player-1 state $s_{t1}$ on cannot be improved by more than $\epsilon $ through any unilateral deviation, where the “average” is based on other players’ multi-state $s_{t,-1}$ being sampled from the distribution $\hat{\pi }_{n-1,t}$. Note (27) differs from (23) also in that its unilateral deviation need not be one-time.

6.3 Main transient result

Before moving on, we need the single-period payoff functions $\tilde{f}_t$ to be continuous.

Assumption 2

Each payoff function $\tilde{f}_t(s,x,\tau )$ is continuous in the in-action environment $\tau $ at an (s, x)-independent rate. That is, for any $\tau \in \mathcal{P}(S\times X)$ and $\epsilon >0$, there is $\delta >0$, such that for any $\tau '\in \mathcal{P}(S\times X)$ satisfying $\rho _{S\times X}(\tau ,\tau ')<\delta $ and any $(s,x)\in S\times X$,

$$\begin{aligned} \mid \tilde{f}_t(s,x,\tau )-\tilde{f}_t(s,x,\tau ')\mid <\epsilon . \end{aligned}$$

Now we show the convergence of finite-game value functions to their NG counterpart, the proof of which is quite technical as well, and calls upon parts (i) and (iii) of Proposition 1.

Proposition 2

For any $t=1,2,\ldots ,\bar{t}+1$, let $\sigma _t\in \mathcal{P}(S)$ and $\hat{\pi }_{n-1,t}\in \mathcal{P}(S^{n-1})$ for each $n\in \mathbb {N}$. Suppose the sequence $\hat{\pi }_{n-1,t}$ asymptotically resembles the sequence $\sigma _t^{\;n-1}$. Then for any $\chi _{[t\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t+1}$, the sequence $\int _{S^{n-1}}\hat{\pi }_{n-1,t}(ds_{t,-1})\cdot v_{nt}(s_{t1},\xi _{[t\bar{t}]},\varepsilon _{s_{t,-1}},\chi _{[t\bar{t}]})$ will converge to $v_t(s_{t1},\xi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]})$ at a rate that is independent of both $s_{t1}\in S$ and $\xi _{[t\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t+1}$.

Combining (23) and (27), as well as Proposition 2, we can come to the main result.

Theorem 2

For some $\sigma _1\in \mathcal{P}(S)$, suppose $\chi _{[1\bar{t}]}=(\chi _t\mid t=1,2,\ldots ,\bar{t})\in (\mathcal{K}(S,X))^{\bar{t}}$ is a Markov equilibrium of NG $\Gamma (\sigma _1)$. Also, suppose $\hat{\pi }_{n-1,[1\bar{t}]}=(\hat{\pi }_{n-1,t}|t=1,2,\ldots ,\bar{t})\in (\mathcal{P}(S^{n-1}))^{\bar{t}}$ is such that the sequence $\hat{\pi }_{n-1,t}$ asymptotically resembles the sequence $\sigma _t^{\;n-1}$ for each t, where $\sigma _t=T_{[1,t-1]}(\chi _{[1,t-1]})\circ \sigma _1$. Then, for $\epsilon >0$ and large enough $n\in \mathbb {N}$, the given $\chi _{[1\bar{t}]}$ is also an $\epsilon $-Markov equilibrium for the game family $(\Gamma _n(s_1)\mid s_1\in S^n)$ in the sense of $\hat{\pi }_{n-1,[1\bar{t}]}$.

The theorem says that players in a large finite game can agree on an NG equilibrium and expect to lose little on average, as long as the other-player multi-state distribution $\hat{\pi }_{n-1,t}$ on which “average” is based is similar to the product form $\sigma _t^{\;n-1}$, where $\sigma _t=T_{[1,t-1]}(\chi _{[1,t-1]})\circ \sigma _1$ is the corresponding NG’s predictable equilibrium state distribution for the same period. As to whether reasonable $\hat{\pi }_{n-1,[1\bar{t}]}=(\hat{\pi }_{n-1,t}|t=1,2,\ldots ,\bar{t})$ exists to satisfy this condition, the answer is affirmative. The next section is dedicated to this point.

7 The condition in Theorem 2

We now present examples where the key condition in Theorem 2 can be true. In all of them, we let the initial other-player multi-state distribution $\hat{\pi }_{n-1,1}=\sigma _1^{\;n-1}=\sigma _1^{\;n}|_{S^{n-1}}$. That is, we let players’ initial states in n-player games be randomly drawn from the NG’s initial state distribution $\sigma _1$. Now we discuss what can happen in periods $t=2,3,\ldots ,\bar{t}$.

7.1 Two possibilities

First, we can let each $\hat{\pi }_{n-1,t}=\sigma _t^{\;n-1}$. It has been discussed right after Definition 1 that the sequence $\sigma _t^{\;n-1}$ asymptotically resembles itself. So this choice satisfies the condition in Theorem 2. This would correspond to the case where players in large finite games take the “lazy” approach of using independent draws on the NG state distribution to assess their opponents’ states. Note this is reasonable due to the common initial condition for both types of games and Theorem 1.

Second, we can let each $\hat{\pi }_{n-1,t}=\pi _{nt}|_{S^{n-1}}$, where

$$\begin{aligned} \pi _{nt}=\sigma _1^{\;n}\odot \Pi _{t'=1}^{t-1}(\chi _{t'}^{\;n}\odot \tilde{g}_{t'}^{\;n}). \end{aligned}$$

(28)

According to (19), $\pi _{nt}$ stands for players’ multi-state distribution in period t in an n-player game when their initial states are randomly drawn from the distribution $\sigma _1$ and then from period 1 onward players all follow through with the NG equilibrium $\chi _{[1\bar{t}]}$. Since the sequence $\sigma _1^{\;n}$ asymptotically resembles itself, Theorem 1 will ascertain the asymptotic resemblance of $\pi _{nt}$ to $\sigma _t^{\;n}$. Then, Lemma 5 in Appendix 1 will lead to the asymptotic resemblance of $\hat{\pi }_{n-1,t}$ to $\sigma _t^{\;n-1}$. So this choice would satisfy Theorem 2’s condition as well. Also, its meaning is clear—here players in large finite games use precise assessments on what other players’ states might be had they followed the NG equilibrium all along.

7.2 Refinement and a third choice

Note that $\hat{\pi }_{n-1,t}$ has not countenanced the possibility in which a player involves his own state $s_{t1}$ in the estimation of the other-player multi-state $s_{t,-1}$. We now show that this is possible at least when the state space S is finite. In that case, we can upgrade the $\hat{\pi }_{n-1,t}\in \mathcal{P}(S^{n-1})$ in Proposition 2 to $\hat{\pi }_{n-1,t}(\cdot )=(\hat{\pi }_{n-1,t}(s_{t1}|\cdot )|s_{t1}\in S)\in (\mathcal{P}(S^{n-1}))^S$ and obtain the convergence of $\int _{S^{n-1}}\hat{\pi }_{n-1,t}(s_{t1}|ds_{t,-1})\cdot v_{nt}(s_{t1},\xi _{[t\bar{t}]},\varepsilon _{s_{t,-1}},\chi _{[t\bar{t}]})$ to $v_t(s_{t1},\xi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]})$ at an $s_{t1}$-independent rate. This will lead us to the following extended version of Theorem 2.

Theorem 3

Suppose $\sigma _{[1,\bar{t}+1]}$ and $\chi _{[1\bar{t}]}$ are all the same as in Theorem 2. Also, suppose $\hat{\pi }_{n-1,[1\bar{t}]}(\cdot )=(\hat{\pi }_{n-1,t}(s_{t1}|\cdot )|t=1,2,\ldots ,\bar{t},\;s_{t1}\in S)\in ((\mathcal{P}(S^{n-1}))^S)^{\bar{t}}$ is such that the sequence $\hat{\pi }_{n-1,t}(s_{t1}|\cdot )$ asymptotically resembles the sequence $\sigma _t^{\;n-1}$ for each t and $s_{t1}$. Then, for $\epsilon >0$ and large enough $n\in \mathbb {N}$, for every $t=1,\ldots ,\bar{t}$, $\xi _{[t\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t+1}$, and $s_{t1}\in S$,

$$\begin{aligned}&\int _{S^{n-1}}\hat{\pi }_{n-1,t}(s_{t1}|ds_{t,-1})\cdot v_{nt}(s_{t1},\chi _{[t\bar{t}]},\varepsilon _{s_{t,-1}},\chi _{[t\bar{t}]}) \\&\quad \ge \int _{S^{n-1}}\hat{\pi }_{n-1,t}(s_{t1}|ds_{t,-1})\cdot v_{nt}(s_{t1},\xi _{[t\bar{t}]},\varepsilon _{s_{t,-1}},\chi _{[t\bar{t}]})-\epsilon . \end{aligned}$$

For it to satisfy the condition in Theorem 3, we can still let $\hat{\pi }_{n-1,[1\bar{t}]}(\cdot )$ be the same as in the aforementioned two examples, in which the newly added $s_{t1}$-dependence is mute. But a third choice would allow each player a full-fledged Bayesian update on other players’ states.

In this third choice, we still use (28) to define $\pi _{nt}$. Then, as long as $\sigma _t(s_{t1})>0$, we let

$$\begin{aligned} \hat{\pi }_{n-1,t}(s_{t1}|\cdot )=\pi _{nt,S}|_{S^{n-1}}(s_{t1}|\cdot ), \end{aligned}$$

(29)

the other-player multi-state distribution derivable from $\pi _{nt}$ when conditioned on the current player’s state $s_{t1}$; otherwise, we simply let $\hat{\pi }_{n-1,t}=\pi _{nt}|_{S^{n-1}}$ just as in the second example. Note the marginal $\pi _{nt}|_S$ is defined by

$$\begin{aligned} \pi _{nt}|_S(\{s_{t1}\})=\pi _{nt}(\{s_{t1}\}\times S^{n-1}),\quad \forall s_{t1}\in S, \end{aligned}$$

(30)

and each conditional distribution $\pi _{nt,S}|_{S^{n-1}}(s_{t1}|\cdot )$ is defined by

$$\begin{aligned} \pi _{nt,S}|_{S^{n-1}}(s_{t1}|S')=\frac{\pi _{nt}(\{s_{t1}\}\times S')}{\pi _{nt}|_S(\{s_{t1}\})}=\frac{\pi _{nt}(\{s_{t1}\}\times S')}{\pi _{nt}(\{s_{t1}\}\times S^{n-1})},\quad \forall S'\in \mathcal{B}(S^{n-1}),\qquad \end{aligned}$$

(31)

when the denominator is strictly positive and an arbitrary value otherwise.

7.3 Symmetry makes it work

The lone fact that $\pi _{nt}$ asymptotically resembles $\sigma _t^{\;n}$ is actually quite far from being able to dictate the asymptotic resemblance of the thus defined $\hat{\pi }_{n-1,t}(s_{t1}|\cdot )$ to $\sigma _t^{\;n-1}$. Note that for a general $q_n$ resembling some $p^n$, Lemma 6 in Appendix 1 has all but ruled out the convergence of $\pi _n|_A$ to p, let alone the asymptotic resemblance of $q_{n,A}|_{A^{n-1}}$ to $p^{n-1}$. Fortunately, $\pi _{nt}$ still enjoys the additional feature of being symmetric.

For any $n\in \mathbb {N}$, let $\Psi _n$ be the set of all n-dimensional permutations. That is, each $\psi \in \Psi _n$ makes $(\psi (1),\ldots ,\psi (n))$ a permutation of $(1,\ldots ,n)$. For a given $\psi \in \Psi _n$, let us suppose $\psi a=(a_{\psi (1)},\ldots ,a_{\psi (n)})$ for any $a=(a_1,\ldots ,a_n)\in A^n$, and then $\psi A'=\{\psi a|a\in A'\}$ for any $A'\subseteq A^n$. Note that, due to its innately symmetric definition, $\mathcal{B}(A^n)$ is automatically symmetric in the sense that $\mathcal{B}(A^n)=\{\psi A'|A'\in \mathcal{B}(A^n)\}$ for any $\psi \in \Psi _n$.

Definition 2

For $n\in \mathbb {N}$ and separable metric space A, we say $q_n\in \mathcal{P}(A^n)$ symmetric if

$$\begin{aligned} q_n(A')=q_n(\psi A'),\quad \forall \psi \in \Psi _n,\;A'\in \mathcal{B}(A^n).\end{aligned}$$

We have the much needed result that asymptotic resemblance of $q_n$ to $p^n$ does lead to the convergence of $q_n|_A$ to p when $q_n$ is symmetric. This is in stark contrast with Lemma 6.

Proposition 3

Let A be a discrete metric space and $q_n\in \mathcal{P}(A^n)$ for every $n\in \mathbb {N}$ be symmetric. Suppose the sequence $q_n$ asymptotically resembles the sequence $p^n$. Then, the sequence $q_n|_A$ will converge to p, namely, $\lim _{n\rightarrow +\infty }q_n|_A(\{a\})=p(\{a\})$ for every $a\in A$.

This then results in the resemblance of $q_{n,A}|_{A^{n-1}}$ to $p^{n-1}$.

Proposition 4

Let A be a discrete metric space and $q_n\in \mathcal{P}(A^n)$ for every $n\in \mathbb {N}$ be symmetric. Suppose the sequence $q_n$ asymptotically resembles the sequence $p^n$. Then, the sequence $q_{n,A}|_{A^{n-1}}(a|\cdot )$ will asymptotically resemble the sequence $p^{n-1}$ for any $a\in A$ with $p(\{a\})>0$.

Note that $\pi _{n1}$, being equal to $\sigma _1^{\;n}$, is symmetric. As suggested by (28), the operation it has to go through to arrive to $\pi _{nt}$ is also symmetric. Hence, $\pi _{nt}$ is symmetric. Therefore, by Proposition 3, the marginal probability $\pi _{nt}|_S$ as defined in (30) would converge to the NG state distribution $\sigma _t$; thus, the conditional distribution $\pi _{nt,S}|_{S^{n-1}}(s_{t1}|\cdot )$ as defined in (31) would be well defined when $\sigma _t(s_{t1})>0$. Then, Proposition 4 can guarantee that $\hat{\pi }_{n-1,t}(s_{t1}|\cdot )$ as defined in (29) would asymptotically resemble $\sigma _t^{\;n-1}$ and hence help to facilitate the condition needed for Theorem 3. The above suggests that, even when players exercise the most accurate Bayesian updates on other players’ states using their own state information, they will not discern much regret on average by adhering to the NG equilibrium.

8 A stationary situation

Now we study an infinite-horizon model with stationary features. To this end, we keep S and X, but let there be a discount factor $\bar{\alpha }\in [0,1)$. There is a payoff function $\tilde{f}$ which meets the basic measurability and boundedness requirements, so that $\tilde{f}_t=\bar{\alpha }^{t-1}\cdot \tilde{f}$ for $ t=1,2,\ldots $. Let us use $\bar{f}$ for the bound $\bar{f}_1$ that appeared in (1). In addition, there is a state transition kernel $\tilde{g}\in \mathcal{G}(S,X)$, so that $\tilde{g}_t=\tilde{g}$ for $t=1,2,\ldots $. For $\chi \in \mathcal{K}(S,X)$, denote by $T(\chi )$ the operator on $\mathcal{P}(S)$, so that for any $\sigma \in \mathcal{P}(S)$,

$$\begin{aligned} T(\chi )\circ \sigma =\sigma \odot \chi \odot \tilde{g}(\cdot ,\cdot ,\sigma \otimes \chi ). \end{aligned}$$

(32)

Thus, state transition has been made stationary by the stationarity of $\tilde{g}$.

Denote the stationary nonatomic game formed from the above S, X, $\bar{\alpha }$, $\tilde{f}$, and $\tilde{g}$ by $\Gamma ^\infty $. It helps to first study the corresponding games $\Gamma ^t$ that terminate in periods $t+1$, for $t=0,1,\ldots $. Now let $v^t(s,\xi _{[1t]},\sigma ,\chi _{[1t]})$ be the total expected payoff a player can receive in game $\Gamma ^t$, when he starts at state $s\in S$ in period 1 and adopts action plan $\xi _{[1t]}\in (\mathcal{K}(S,X))^t$ from period 1 to t, while all other players form state distribution $\sigma \in \mathcal{P}(S)$ in the beginning and act according to $\chi _{[1t]}\in (\mathcal{K}(S,X))^t$ from period 1 to t. As a terminal condition, we have $v^0(s,\sigma )=0$. Also, for $t=1,2,\ldots $,

$$\begin{aligned} v^t(s,\xi _{[1t]},\sigma ,\chi _{[1t]})= & {} \int _X \xi _1(s|dx)\cdot \left[ \phantom {\int _{S^n}}\tilde{f}(s,x,\sigma \otimes \chi _1)\nonumber \right. \\&\left. +\,\bar{\alpha }\cdot \int _S \tilde{g}(s,x,\sigma \otimes \chi _1|ds')\cdot v^{t-1}(s',\xi _{[2t]},T(\chi _1)\circ \sigma ,\chi _{[2t]})\right] .\nonumber \\ \end{aligned}$$

(33)

Using the terminal condition and (33), we can inductively show that

$$\begin{aligned} \mid v^{t+1}(s,\xi _{[1,t+1]},\sigma ,\chi _{[1,t+1]})-v^t(s,\xi _{[1t]},\sigma ,\chi _{[1t]})\mid \le \bar{\alpha }^t\cdot \bar{f}. \end{aligned}$$

(34)

Given $s\in S$, $\xi _{[1\infty ]}=(\xi _1,\xi _2,\ldots )\in (\mathcal{K}(S,X))^\infty $, $\sigma \in \mathcal{P}(S)$, and $\chi _{[1\infty ]}=(\chi _1,\chi _2,\ldots )\in (\mathcal{K}(S,X))^\infty $, the sequence $\{v^t(s,\xi _{[1t]},\sigma ,\chi _{[1t]})\mid t=0,1,\ldots \}$ is thus Cauchy and has a limit point $v^\infty (s,\xi _{[1\infty ]},\sigma ,\chi _{[1\infty ]})$. The latter is the total discounted expected payoff a player can obtain in the game $\Gamma ^\infty $, when he starts at state s and adopts action plan $\xi _{[1\infty ]}$, while all other players form initial pre-action environment $\sigma $ and act according to $\chi _{[1\infty ]}$.

A pre-action environment $\sigma \in \mathcal{P}(S)$ is said to be associated with $\chi \in \mathcal{K}(S,X)$ when

$$\begin{aligned} \sigma =T(\chi )\circ \sigma . \end{aligned}$$

(35)

That is, we let environment $\sigma $ be associated with action plan $\chi $ when the former is invariant under the one-period transition when all players adhere to the latter. For $\chi \in \mathcal{K}(S,X)$, we use $\chi ^\infty $ to represent the stationary policy profile $(\chi ,\chi ,\ldots )\in (\mathcal{K}(S,X))^\infty $ that players are to adopt in all periods $t=1,2,\ldots $.

We deem one-time action plan $\chi \in \mathcal{K}(S,X)$ a stationary Markov equilibrium for the nonatomic game $\Gamma ^\infty $, when there exists a $\sigma \in \mathcal{P}(S)$ that is associated with the given $\chi $, so that for every one-time unilateral deviation $\xi \in \mathcal{K}(S,X)$,

$$\begin{aligned} v^\infty (s,\chi ^\infty ,\sigma ,\chi ^\infty ) \ge v^\infty (s,(\xi ,\chi ^\infty ),\sigma ,\chi ^\infty ),\quad \forall s\in S. \end{aligned}$$

(36)

Therefore, a policy will be considered an equilibrium when it induces an invariant environment under whose sway the policy turns out to be a best response in the long run.

Now we move on to the n-player game $\Gamma ^\infty _n$ made out of the same S, X, $\bar{\alpha }$, $\tilde{f}$, and $\tilde{g}$. Similarly to the above, we let $\Gamma ^t_n$ be its n-player counterpart that terminates in period $t+1$. Now let $v^t_n(s_1,\xi _{[1t]},\varepsilon _{s_{-1}},\chi _{[1t]})$ be the total expected payoff player 1 can receive in game $\Gamma ^n_t$, when he starts with state $s_1\in S$ and adopts action plan $\xi _{[1t]}\in (\mathcal{K}(S,X))^t$ from period 1 to t, while other players form initial empirical distribution $\varepsilon _{s_{-1}}=\varepsilon _{(s_2,\ldots ,s_n)}\in \mathcal{P}_{n-1}(S)$ and adopt policy $\chi _{[1t]}\in (\mathcal{K}(S,X))^t$ from 1 to t. As a terminal condition, we have $v^0_n(s_1,\varepsilon _{s_{-1}})=0$. For $t=1,2,\ldots $, it follows that

$$\begin{aligned} v^t_n(s_1,\xi _{[1t]},\varepsilon _{s_{-1}},\chi _{[1t]})= & {} \int _X\xi _1(s_1|dx_1){\cdot } \int _{X^{n-1}}\chi _1^{\; n-1}(s_{-1}|dx_{-1})\cdot \left[ \phantom {\int _{S^n}}\tilde{f}(s_1,x_1,\varepsilon _{s_{-1}x_{-1}})\right. \nonumber \\&\left. +\,\bar{\alpha }\cdot \int _{S^n}\tilde{g}^n(s,x|ds')\cdot v^{t-1}_n(s'_1,\xi _{[2t]},\varepsilon _{s'_{-1}},\chi _{[2t]})\right] . \end{aligned}$$

(37)

Using the terminal condition and (37), we can inductively show that

$$\begin{aligned} \mid v^{t+1}_n(s_1,\xi _{[1,t+1]},\varepsilon _{s_{-1}},\chi _{[1,t+1]})-v^t_n(s_1,\xi _{[1t]},\varepsilon _{s_{-1}},\chi _{[1t]})\mid \le \bar{\alpha }^t\cdot \bar{f}. \end{aligned}$$

(38)

Given $s_1\in S$, $\xi _{[1\infty ]}\in (\mathcal{K}(S,X))^\infty $, $\varepsilon _{s_{-1}}\in \mathcal{P}_{n-1}(S)$, and $\chi _{[1\infty ]}\in (\mathcal{K}(S,X))^\infty $, the sequence $\{v^n_t(s_1,\xi _{[1t]},\varepsilon _{s_{-1}},\chi _{[1t]})\mid t=0,1,\ldots \}$ is Cauchy and has a limit point $v^\infty _n(s_1,\xi _{[1\infty ]},\varepsilon _{s_{-1}},\chi _{[1\infty ]})$. The latter is the total discounted expected payoff a player can obtain in $\Gamma ^\infty _n$, when he starts at state s and adopts action plan $\xi _{[1\infty ]}$, while all other players form the initial pre-action environment $\varepsilon _{s_{-1}}$ and act according to $\chi _{[1\infty ]}$.

For the current setting, it should be noted that Assumptions 1 and 2 translate into the continuity in $\tau $ at an (s, x)-independent rate of, respectively, the transition kernel $\tilde{g}(s,x,\tau )$ and payoff function $\tilde{f}(s,x,\tau )$. We now present the main result for the stationary case.

Theorem 4

Suppose $\chi \in \mathcal{K}(S,X)$ is a stationary Markov equilibrium for the stationary nonatomic game $\Gamma ^\infty $. Let $\hat{\pi }_{n-1}\in \mathcal{P}(S^{n-1})$ for each $n\in \mathbb {N}\setminus \{1\}$. Also suppose the sequence $\hat{\pi }_{n-1}$ asymptotically resembles the sequence $\sigma ^{n-1}$, where $\sigma $ is associated with $\chi $ in the equilibrium definitions (35) and (36). Then, $\chi ^\infty $ would be asymptotically equilibrium for games $\Gamma ^\infty _n$ in an average sense. More specifically, for any $\epsilon >0$ and large enough $n\in \mathbb {N}$,

$$\begin{aligned}&\int _{S^{n-1}}\hat{\pi }_{n-1}(ds_{-1})\cdot v^\infty _n(s_1,\chi ^\infty ,\varepsilon _{s_{-1}},\chi ^\infty )\\&\quad \ge \int _{S^{n-1}}\hat{\pi }_{n-1}(ds_{-1})\cdot v^\infty _n(s_1,\xi _{[1\infty ]},\varepsilon _{s_{-1}},\chi ^\infty )-\epsilon , \end{aligned}$$

for any $s_1\in S$ and $\xi _{[1\infty ]}\in (\mathcal{K}(S,X))^\infty $.

Theorem 4 says that, players in a large finite stationary game will not regret much by adopting a stationary equilibrium for a correspondent stationary nonatomic game. The regret can be measured in an average sense, so long as the underlying other-player multi-state distribution $\hat{\pi }_{n-1}$ is close to an invariant $\sigma $ associated with the NG equilibrium. Just as in Sect. 7, we can let $\hat{\pi }_{n-1}=\sigma ^{n-1}$, indicating that players take a “lazy” approach in assessing other players’ states. We leave discussion of other possibilities to Appendix 5.

9 Implications of main results

9.1 Observation, remembrance, and coordination

Regarding Theorems 2 and 3, we note the following for $\bar{t}$-period games. A prominent feature of an NG equilibrium $\chi _{[1\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}}$ is its insensitivity, at any period t, to a player’s personal history $(s_{t'},x_{t'}|t'=1,2,\ldots ,t-1)$, historical data regarding other players, and the present information about other players’ states. Independence of the first two factors has much to do with the Markovian setup of the game—neither $\tilde{f}_t$ nor $\tilde{g}_t$ depends on past history. But the more interesting independence of the latter two factors stems from players’ common knowledge about the evolution of their environments. The $(\sigma _{t'}\otimes \chi _{t'}|t'=1,2,\ldots ,t-1)$ portion of the history and the present information $\sigma _t$, both about other players, are determinable by (10) before the game is even played out.

For finite semi-anonymous games, however, information is gradually revealed and its perfection is not guaranteed. We can define space $O_S$ and map $\tilde{o}_S:\mathcal{P}(S)\rightarrow O_S$ to represent a player’s observatory power over his present pre-action environment immediately before actual play. Similarly, we can define space $O_{SX}$ and map $\tilde{o}_{SX}:\mathcal{P}(S\times X)\rightarrow O_{SX}$ to represent his observatory power over the in-action environment just experienced. So that new information does not contradict old information and no information gets lost, we suppose function $\tilde{o}_S^{\;SX}:O_{SX}\rightarrow O_S$ exists, with $\tilde{o}_S^{\;SX}(\tilde{o}_{SX}(\tau ))=\tilde{o}_S(\tau |_S)$ for any $\tau \in \mathcal{P}(S\times X)$.

With these definitions, a player’s decision in period t can be denoted by a map $\hat{\chi }_t: (S\times X\times O_{SX})^{t-1}\times O_S\times S\rightarrow \mathcal{P}(X)$. In the period, player 1’s random decision rule can be written as $\hat{\chi }_t(\tilde{h}_t,\tilde{o}_S(\varepsilon _{s_{t,-1}}),s_{t1}|\cdot )$, where the history $\tilde{h}_t$ is expressible as

$$\begin{aligned} \tilde{h}_t=(s_{t'1},x_{t'1},\tilde{o}_{SX}(\varepsilon _{s_{t',-1}x_{t',-1}})|t'=1,2,\ldots ,t-1), \end{aligned}$$

(39)

$\tilde{o}_S(\varepsilon _{s_{t,-1}})$ is his observation of other players’ status, and $s_{t1}$ represents the player’s own state. There is a whole spectrum in which $O_S$ and $\tilde{o}_S$ can reside. When $O_S=\{0\}$ and $\tilde{o}_S(\cdot )=0$, players are ignorant of others’ states; when $O_S=\mathcal{P}(S)$ and $\tilde{o}_S$ is the identity map, every player is fully aware of his surrounding. Similarly, there are varieties of $O_{SX}$, $\tilde{o}_{SX}$, and $\tilde{o}_S^{\;SX}$.

Theorems 2 and 3, however, nullify the need to delve into the $(O_S,\tilde{o}_S,O_{SX},\tilde{o}_{SX},\tilde{o}_S^{\;SX})$-related details about finite games. They state that an equilibrium of the NG counterpart, which is necessarily both oblivious of the past history $\tilde{h}_t$ and blind to the present observation $\tilde{o}_S(\varepsilon _{s_{t,-1}})$, serves as a good approximate equilibrium for games with enough players. The absence of $\tilde{h}_t$ again has a Markovian explanation. On the other hand, the ability to shake off $\tilde{o}_S(\varepsilon _{s_{t,-1}})$’s influence is very important, since this saves players the efforts to gather information about their surroundings.

Regarding Theorem 4, we note the following. Each of our finite stationary games is a discounted stochastic game. For an n-player version of the latter game in which players have full knowledge of others’ states, equilibria are hard to compute and for their implementation, require high degrees of coordination among players; see Solan (1998). These equilibria come from the space $(2^{\mathbb {R}^n})^{S^n}\times ((\mathbb {R}^n)^{X^n\times S^n})^{S^n\times \mathbb {R}^n}$; whereas, our NG equilibria come from $\mathbb {R}^{S\times X}$. Meanwhile, the discounted stochastic game one faces in real life is often semi-anonymous; see, e.g., examples listed in Jovanovic and Rosenthal (1988). For such a game, Theorem 4 has shown that a much easier path can be taken in order to coordinate player behavior under an $\epsilon $-sized compromise. If players all agree to exercise a corresponding NG equilibrium, the typical player 1 has only to respond to his own state $s_{t1}$ without giving up too much.

9.2 Sources of NG equilibria

To further buttress the claim that studying the idealistic NGs can help with the understanding and execution of messier finite games faced in real life, we demonstrate that NG equilibria, meeting criteria (23) and (24) for the transient case and (35) and (36) for the stationary case, can be obtained relatively easily.

First, we concentrate on the transient case studied in Sects. 3 to 6. From (22),

$$\begin{aligned} v_t(s_t,(\xi _t,\chi _{[t+1,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]})=\int _X \xi _t(dy)\cdot v_t(s_t,(\delta _y,\chi _{[t+1,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]}). \end{aligned}$$

(40)

Hence,

$$\begin{aligned} \sup \limits _{\xi _t\in \mathcal{K}(S,X)}v_t(s_t,(\xi _t,\chi _{[t+1,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]})=\sup \limits _{y\in X}v_t(s_t,(\delta _y,\chi _{[t+1,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]}).\quad \end{aligned}$$

(41)

So the equilibrium criterion (23) conveniently used by us for the $\bar{t}$-period case is equivalent to, for every $t=1,2,\ldots ,\bar{t}$,

$$\begin{aligned} \chi _t(s_t|\tilde{X}_t(s_t,\sigma _t,\chi _{[t\bar{t}]}))=1,\quad \forall s_t\in S, \end{aligned}$$

(42)

where

$$\begin{aligned} \tilde{X}_t(s_t,\sigma _t,\chi _{[t\bar{t}]})= & {} \{x\in X|v_t(s_t,(\delta _x,\chi _{[t+1,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]})\nonumber \\&\quad =\sup \limits _{y\in X}v_t(s_t,(\delta _y,\chi _{[t+1,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]})\}, \end{aligned}$$

(43)

and $\sigma _t$ is defined through (24).

The form consisting of (42) and (43) is fairly close to the distributional-equilibrium concept used in NG literature, such as Mas-Colell (1984) and Jovanovic and Rosenthal (1988). A distributional equilibrium is an in-action environment sequence $\tau _{[1\bar{t}]}=(\tau _t|t=1,2,\ldots ,\bar{t})\in (\mathcal{P}(S\times X))^{\bar{t}}$ which satisfies $\tau _t(\tilde{U}_t(\tau _{[t\bar{t}]}))=1$ for each $t=1,2,\ldots ,\bar{t}$. Here, $\tilde{U}_t(\tau _{[t\bar{t}]})= \{(s,x)\in S\times X|v'_t(s,x,\tau _{[t\bar{t}]})=\sup \nolimits _{y\in X}v'_t(s,y,\tau _{[t\bar{t}]})\}$, and $v'_t(s,y,\tau _{[t\bar{t}]})$ is a player’s payoff when he starts period t with state s and action y, but other players in all periods and he himself in later periods act according to $\tau _{[t\bar{t}]}$; corresponding to (24), the distributional equilibrium also satisfies $\tau _1|_S=\sigma _1$ and $\tau _t|_S=\tau _{t-1}\odot \tilde{g}_{t-1}(\cdot ,\cdot ,\tau _{t-1})$ for $t=2,3,\ldots ,\bar{t}$. According to Jovanovic and Rosenthal (1988, Theorem 1), such an equilibrium $\tau _{[1\bar{t}]}$ would exist when S and X are compact, each payoff $\tilde{f}_t$ is bounded and continuous in all arguments, and each transition kernel $\tilde{g}_t$ is continuous in all arguments.

When an equilibrium $\chi _{[1\bar{t}]}$ in our conditional sense exists, we can construct a distributional equilibrium $\tau _{[1\bar{t}]}$ by resorting iteratively to $\tau _t=\sigma _t\otimes \chi _t$ and $\sigma _{t+1}=T_t(\chi _t)\circ \sigma _t$ for $t=1,2,\ldots ,\bar{t}$. Conversely, when the latter distributional equilibrium $\tau _{[1\bar{t}]}$ is available, we can nearly get a conditional equilibrium $\chi _{[1\bar{t}]}$ back. For each $t=1,2,\ldots ,\bar{t}$, according to Duffie et al. (1994) (p. 751), we can identify a $\chi _t\in \mathcal{K}(S,X)$, which also passes as a measurable map from S to $\mathcal{P}(X)$, that satisfies $\tau _t=\tau _t|_S\otimes \chi _t$. Thus, we will be able to construct $\chi _{[1\bar{t}]}$ consecutively from $\chi _1$ up to $\chi _{\bar{t}}$. But even then, $\chi _{[t\bar{t}]}$ along with $\sigma _t=\tau _t|_S$ would satisfy (42) only for $\tau _t|_S$-almost every $s_t$, but not necessarily every $s_t\in S$. For instance, we can suppose $S=\{\bar{s}_1,\bar{s}_2,\ldots \}$. At each t, the constructed $\chi _{[1\bar{t}]}$ could guarantee (42) for those $\bar{s}_i$’s with $(\tau _t|_S)(\bar{s}_i)>0$ but not those with $(\tau _t|_S)(\bar{s}_i)=0$. On the other hand, a conditional equilibrium $\chi _{[1\bar{t}]}$ can be obtained directly; see section “The transient case” in Appendix 6 for details.

When it comes to the stationary case examined in Sect. 8, we make parallel developments. Here the property corresponding to (36) is

$$\begin{aligned} \chi (s|\tilde{X}_\infty (s,\sigma ,\chi ))=1,\quad \forall s\in S, \end{aligned}$$

(44)

where

$$\begin{aligned} \tilde{X}_\infty (s,\sigma ,\chi )=\left\{ x\in X|v^\infty (s,(\delta _x,\chi ^\infty ),\sigma ,\chi ^\infty )=\sup \limits _{y\in X}v^\infty (s,(\delta _y,\chi ^\infty ),\sigma ,\chi ^\infty )\right\} ,\nonumber \\ \end{aligned}$$

(45)

and $\sigma $ satisfies (35). Again, the existence of a related distributional equilibrium $\tau \in \mathcal{P}(S\times X)$ is known under quite general conditions; see, e.g., Jovanovic and Rosenthal (1988, Theorem 2). However, an equilibrium $\tau $ does not exactly lead to a conditional equilibrium $\chi $. So once more we focus on a direct approach for the stationary case; see section “The stationary case” in Appendix 6.

10 Concluding remarks

Under a common action plan, we have shown that environments faced by players in multi-period large finite games would stay close to those of their NG counterparts. For transient and stationary settings, our results reveal that an NG equilibrium, necessarily both oblivious of past history and blind to present status of other players, could serve as a good approximate equilibrium in large finite games. We reckon that the discreteness requirement on both the state and action spaces can be frustrating in some circumstances. Besides the relaxation of the aforementioned restriction, future research can also look into the issue of converge rate.

References

Adlakha S, Johari B (2013) Mean field equilibrium in dynamic games with complementarities. Oper Res 61:971–989
Article Google Scholar
Al-Najjar NI (2008) Large games and the law of large numbers. Games Econ Behav 64:1–34
Article Google Scholar
Al-Najjar NI, Smorodinsky R (2001) Large nonanonymous repeated games. Games Econ Behav 37:26–39
Article Google Scholar
Balder EJ (2002) A unifying pair of Cournot–Nash equilibrium existence results. J Econ Theory 102:437–470
Article Google Scholar
Balder EJ (2008) Comments on purification in continuous games. Int J Game Theory 37:73–92
Article Google Scholar
Bergin J, Bernhardt D (1995) Anonymous sequential games: existence and characterization of equilibria. Econ Theory 5:461–489
Article Google Scholar
Bodoh-Creed AL (2012) Approximation of large dynamic games. Working Paper, Cornell University
Carmona G (2004) Nash equilibria of games with a continuum of players. Working Paper, Universidade Nova de Lisboa
Duffie D, Geanakoplos J, Mas-Colell A, McLennan A (1994) Stationary Markov equilibria. Econometrica 62:745–781
Article Google Scholar
Dvoretzky A, Kiefer J, Wolfolwitz J (1956) Asymptotic minimax character of the sample distribution function and of the classical multinomial estimator. Ann Math Stat 27:642–669
Article Google Scholar
Ethier SN, Kurtz TG (1986) Markov processes: characterization and convergence. Wiley, New York
Book Google Scholar
Green EJ (1980) Non-cooperative price taking in large dynamic markets. J Econ Theory 22:155–182
Article Google Scholar
Green EJ (1984) Continuum and finite-player noncooperative models of competition. Econometrica 52:975–993
Article Google Scholar
Hopenhayn HA (1992) Entry, exit, and firm dynamics in long run equilibrium. Econometrica 60:1127–1150
Article Google Scholar
Housman D (1988) Infinite player noncooperative games and the continuity of the Nash equilibrium correspondence. Math Oper Res 13:488–496
Article Google Scholar
Jovanovic B, Rosenthal RW (1988) Anonymous sequential games. J Math Econ 17:77–88
Article Google Scholar
Kalai E (2004) Large robust games. Econometrica 72:1631–1665
Article Google Scholar
Khan MA, Rath KP, Sun YN (1997) On the existence of pure strategy equilibrium in games with a continuum of players. J Econ Theory 76:13–46
Article Google Scholar
Khan MA, Rath KP, Sun YN, Yu H (2013) Large games with a bio-social typology. J Econ Theory 148:1122–1149
Article Google Scholar
Khan MA, Sun YN (1990) On a reformulation of Cournot–Nash equilibria. J Math Anal Appl 146:442–460
Article Google Scholar
Khan MA, Sun YN (1995) Pure strategies in games with private information. J Math Econ 24:633–653
Article Google Scholar
Khan MA, Sun YN (1999) Non-cooperative games on hyperfinite Loeb space. J Math Econ 31:455–492
Article Google Scholar
Khan MA, Sun YN (2002) Non-cooperative games with many players. In: Aumann RJ, Hart S (eds) Handbook of game theory with economic applications, vol 3. Elsevier Science, Amsterdam, pp 1761–1808
Google Scholar
Mas-Colell A (1984) On a theorem of Schmeidler. J Math Econ 13:201–206
Article Google Scholar
Mertens JF, Parthasarathy T (1987) Equalibria for discounted stochastic games. CORE Discussion Paper No. 8750
Parthasarathy KR (2005) Probability measures on metric spaces. AMS Chelsea Publishing, Providence
Book Google Scholar
Sabourian H (1990) Anonymous repeated games with a large number of players and random outcomes. J Econ Theory 51:92–110
Article Google Scholar
Schmeidler D (1973) Equilibrium points of nonatomic games. J Stat Phys 7:295–300
Article Google Scholar
Shapley LS (1953) Stochastic games. Proc Natl Acad Sci 39:1095–1100
Article Google Scholar
Solan E (1998) Discounted stochastic games. Math Oper Res 23:1010–1021
Article Google Scholar
Weintraub GY, Benkard CL, van Roy B (2008) Markov perfect industry dynamics with many firms. Econometrica 76:1375–1411
Article Google Scholar
Weintraub GY, Benkard CL, van Roy B (2011) Industry dynamics: foundations for models with an infinite number of firms. J Econ Theory 146:504–527
Article Google Scholar
Yang J (2015) Analysis of Markovian competitive situations using nonatomic games—the shock-driven case. Working Paper, Rutgers University
Yang J (2011) Asymptotic interpretations for equilibria of nonatomic games. J Math Econ 47:491–499
Article Google Scholar
Yang J, Xia Y (2013) A nonatomic-game approach to dynamic pricing under competition. Prod Oper Manag 22:88–103
Article Google Scholar
Yu H, Zhang Z (2007) Pure strategy equilibria in games with countable actions. J Math Econ 43:192–200
Article Google Scholar

Download references

Acknowledgments

This research was supported in part by National Science Foundation Grant CMMI-0854803, as well as National Natural Science Foundation of China Grants 11371273 and 71502015.

Author information

Authors and Affiliations

Department of Management Science and Information Systems, Business School, Rutgers University, Newark, NJ, 07102, USA
Jian Yang

Authors

Jian Yang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jian Yang.

Appendices

Appendix 1: Concepts and rudimentary lemmas

Given separable metric space A, the Prohorov metric $\rho _A$ is such that, for any distributions $p,p'\in \mathcal{P}(A)$,

$$\begin{aligned} \rho _A (p,p')=\inf (\epsilon >0 \mid p'((A')^\epsilon )+\epsilon \ge p(A'),\; \text{ for } \text{ all } A'\in \mathcal{B}(A)), \end{aligned}$$

(46)

where

$$\begin{aligned} (A')^ \epsilon =\{a\in A\mid d_A(a,a') < \epsilon \text{ for } \text{ some } a'\in A'\}. \end{aligned}$$

(47)

The metric $\rho _A$ is known to generate the weak topology for $\mathcal{P}(A)$.

Lemma 1

Let A be a separable metric space. Then, for any $n\in \mathbb {N}$ and $a,a'\in A^n$,

$$\begin{aligned} \rho _A(\varepsilon _a,\varepsilon _{a'})\le \max _{m=1}^n d_A(a_m,a'_m). \end{aligned}$$

Proof

Let $\epsilon =\max _{m=1}^n d_A(a_m,a'_m)$. For any $A'\in \mathcal{B}(A)$, the key observation is that

$$\begin{aligned} \delta _{a'_m}((A')^\epsilon )\ge \delta _{a_m}(A'). \end{aligned}$$

(48)

Then,

$$\begin{aligned} \varepsilon _{a'}((A')^\epsilon )=\frac{\sum _{m=1}^n\delta _{a'_m}((A')^\epsilon )}{n}\ge \frac{\sum _{m=1}^n\delta _{a_m}(A')}{n}=\varepsilon _a(A'). \end{aligned}$$

(49)

Thus, $\rho _A(\varepsilon _a,\varepsilon _{a'})\le \epsilon $. $\square $

According to Parthasarathy (2005, Theorem II.7.1), the strong law of large numbers applies to the empirical distribution under the weak topology, and hence under the Prohorov metric. In the following, we state its weak version.

Lemma 2

Let separable metric space A and distribution $p\in \mathcal{P}(A)$ be given. Then, for any $\epsilon >0$, as long as n is large enough,

$$\begin{aligned} p^n(\{a\in A^n\mid \rho _A(\varepsilon _a ,p )<\epsilon \})>1-\epsilon . \end{aligned}$$

Due to the inequality of Dvoretzky, Kiefer and Wolfolwitz’s (1956), the above convergence is uniform for certain A’s. The inequality infers that, when A is $\mathbb {R}$ or countable,

$$\begin{aligned} p^n(\{a\in A^n\mid \rho _A(\varepsilon _a,p)\le \epsilon \})>1-2e^{-2n\epsilon ^2},\quad \forall \epsilon >0. \end{aligned}$$

When n is greater than $\ln (3/\epsilon )/(2\epsilon ^2)$, a number independent of $p\in \mathcal{P}(A)$, the above would entail the inequality in Lemma 2. Thus, we have the following.

Lemma 3

When A is the real line $\mathbb {R}$ or countable, the convergence expressed in Lemma 2 is uniform. Namely, a lower bound could be identified so that every n above it would realize the inequality in the lemma for every $p\in \mathcal{P}(A)$.

For separable metric space A, point $a\in A$, and the $(n-1)$-point empirical distribution $p\in \mathcal{P}_{n-1}(A)$, we use $(a,p)_n$ to represent the member of $\mathcal{P}_n(A)$ that has an additional 1 / n weight on the point a, but with probability masses in p being reduced to $(n-1)/n$ times of their original values. For $a\in A^n$ and $m=1,\ldots ,n$, we have $(a_m,\varepsilon _{a_{-m}})_n=\varepsilon _a$. Concerning the Prohorov metric, we have also a simple but useful observation.

Lemma 4

Let A be a separable metric space. Then, for any $n\in \mathbb {N}\setminus \{1\}$, $a\in A$, and $p\in \mathcal{P}_{n-1}(A)$,

$$\begin{aligned} \rho _A((a,p)_n,p)\le \frac{1}{n}. \end{aligned}$$

Proof

Let $A'\in \mathcal{B}(A)$ be chosen. Then $p(A')=(m-1)/(n-1)$ for some $m=1,2,\ldots ,n$. If $a\notin A'$, then $(a,p)_n(A')=(m-1)/n$ and hence

$$\begin{aligned} (a,p)_n(A')\le p(A')\le (a,p)_n(A')+\frac{1}{n}. \end{aligned}$$

(50)

If $a\in A'$, then $(a,p)_n(A')=m/n$ and hence

$$\begin{aligned} (a,p)_n(A')-\frac{1}{n}\le p(A')\le (a,p)_n(A'). \end{aligned}$$

(51)

Therefore, it is always true that

$$\begin{aligned} \mid (a,p)_n(A')-p(A')\mid \le \frac{1}{n}. \end{aligned}$$

(52)

Due to the nature of the Prohorov metric, we have

$$\begin{aligned} \rho _A((a,p)_n,p)\le \frac{1}{n}. \end{aligned}$$

(53)

We have thus completed the proof. $\square $

For the notion of asymptotic resemblance introduced in Definition 1, we have that it is preserved under certain projections and expansions.

Lemma 5

Let A be a separable metric space. Also, $q_n\in \mathcal{P}(A^n)$ for every $n\in \mathbb {N}$ and $p\in \mathcal{P}(A)$. Suppose the sequence $q_n$ asymptotically resembles the sequence $p^n$. Then, the sequence $q_n|_{A^{n-1}}$ will asymptotically resemble the sequence $p^{n-1}$.

Proof

For any $\epsilon >0$, due to the asymptotic resemblance of the sequence $q_n$ to the sequence $p^n$, we have, for n large enough,

$$\begin{aligned} q_n(A'_n)>1-\epsilon , \end{aligned}$$

(54)

where

$$\begin{aligned} A'_n=\{a\in A^n|\rho _A(\varepsilon _a,p)<\epsilon \}. \end{aligned}$$

(55)

By Lemma 4, we have

$$\begin{aligned} \rho _A(\varepsilon _a,\varepsilon _{a_{-1}})\le \frac{1}{n},\quad \forall a\in A^n. \end{aligned}$$

(56)

Hence, for large enough n,

$$\begin{aligned} A'_n\subseteq A\times A''_{n-1}, \end{aligned}$$

(57)

where

$$\begin{aligned} A''_{n-1}=\{a_{-1}\in A^{n-1}|\rho _A(\varepsilon _{a_{-1}},p)<2\epsilon \}. \end{aligned}$$

(58)

But by (54), this means that

$$\begin{aligned} (q_n|_{A^{n-1}})(A''_{n-1})=q_n(A\times A''_{n-1})\ge q_n(A'_n)>1-\epsilon . \end{aligned}$$

(59)

That is, $q_n|_{A^{n-1}}$ asymptotically resembles $p^{n-1}$. $\square $

Lemma 6

Let A be a separable metric space. Also, $q_n\in \mathcal{P}(A^n)$ for every $n\in \mathbb {N}$ and $p,p'\in \mathcal{P}(A)$. Suppose the sequence $q_n$ asymptotically resembles the sequence $p^n$. Then, the sequence $p'\times q_{n-1}$ will asymptotically resemble the sequence $p^n$ as well.

Proof

For any $\epsilon >0$, due to the asymptotic resemblance of the sequence $q_n$ to the sequence $p^n$, we have, for n large enough,

$$\begin{aligned} q_{n-1}(A'_{n-1})>1-\epsilon , \end{aligned}$$

(60)

where

$$\begin{aligned} A'_{n-1}=\{a\in A^{n-1}|\rho _A(\varepsilon _a,p)<\epsilon \}. \end{aligned}$$

(61)

By Lemma 4, we have

$$\begin{aligned} \rho _A(\varepsilon _{(a_1,a)},\varepsilon _a)\le \frac{1}{n},\quad \forall a_1\in A,\;a\in A^{n-1}. \end{aligned}$$

(62)

Hence, for large enough n,

$$\begin{aligned} A\times A'_{n-1}\subseteq A''_n, \end{aligned}$$

(63)

where

$$\begin{aligned} A''_n=\{a\in A^n|\rho _A(\varepsilon _a,p)<2\epsilon \}. \end{aligned}$$

(64)

But by (60), this means that

$$\begin{aligned} (p'\times q_{n-1})(A''_n)\ge (p'\times q_{n-1})(A\times A'_{n-1})=p'(A)\times q_{n-1}(A'_{n-1})>1-\epsilon .\nonumber \\ \end{aligned}$$

(65)

That is, $p'\times q_{n-1}$ asymptotically resembles $p^n$. $\square $

Lemma 7

Let A and B be separable metric spaces. Also, $q_n\in \mathcal{P}(A^n\times B^n)$ for every $n\in \mathbb {N}$ and $p\in \mathcal{P}(A\times B)$. Suppose the sequence $q_n$ asymptotically resembles the sequence $p^n$. Then, the sequence $q_n|_{A^n}$ will asymptotically resemble the sequence $(p|_A)^n$.

Proof

For any $\epsilon >0$, due to the asymptotic resemblance of the sequence $q_n$ to the sequence $p^n$, we have, for n large enough,

$$\begin{aligned} q_n(C'_n)>1-\epsilon , \end{aligned}$$

(66)

where

$$\begin{aligned} C'_n=\{c=(a,b)\in A^n\times B^n|\rho _{A\times B}(\varepsilon _c,p)<\epsilon \}. \end{aligned}$$

(67)

But by (87) of Yang (2011),

$$\begin{aligned} \rho _A(\varepsilon _a,p|_A)=\rho _A(\varepsilon _c|_A,p|_A)\le \rho _{A\times B}(\varepsilon _c,p),\quad \forall c=(a,b)\in C'_n. \end{aligned}$$

(68)

Hence,

$$\begin{aligned} C'_n\subseteq A'_n\times B^n, \end{aligned}$$

(69)

where

$$\begin{aligned} A'_n=\{a\in A^n|\rho _A(\varepsilon _a,p|_A)<\epsilon \}. \end{aligned}$$

(70)

Combining (66) and (69), we can obtain

$$\begin{aligned} (q_n|_{A^n})(A'_n)=q_n(A'_n\times B^n)\ge q_n(C'_n)>1-\epsilon . \end{aligned}$$

(71)

This indicates that $q_n|_{A^n}$ asymptotically resembles $(p|_A)^n$. $\square $

Appendix 2: Proofs of Sect. 5

Proof of Proposition 1

We first prove (i). Fix some $\epsilon \in (0,1)$. Due to the countability of S , we can identify some I of its points $\bar{s}_1,\bar{s}_2,\ldots ,\bar{s}_I$, so that each $\sigma (\{\bar{s}_i\})>0$ and

$$\begin{aligned} \sum _{i=1}^I\sigma (\{\bar{s}_i\})>1-\epsilon . \end{aligned}$$

(72)

For convenience, let $\bar{S}'=\{\bar{s}_1,\bar{s}_2,\ldots ,\bar{s}_I\}$ and $\bar{S}''=S\setminus \bar{S}'$.

Since S is discrete, the distance $d_S(\bar{S}',\bar{S}'')=\inf _{s'\in \bar{S}',s''\in \bar{S}''}d_S(s',s'')>0$. For $i,j=1,2,\ldots ,I$, let us use $d_{ij}$ for $d_S(\bar{s}_i,\bar{s}_j)$ and $\sigma _i$ for $\sigma (\{\bar{s}_i\})$. Now define

$$\begin{aligned} \delta =\frac{\epsilon }{I}\wedge d_S(\bar{S}',\bar{S}'')\wedge \left( \min _{i\ne j}d_{ij}\right) \wedge \left( \min _i\frac{\sigma _i}{2}\right) , \end{aligned}$$

(73)

which is still strictly positive. In this paper, we use $a\wedge b$ to stand for $\min \{a,b\}$ and $a\vee b$ to stand for $\max \{a,b\}$.

For any $n\in \mathbb {N}$, define $S'_n\in \mathcal{B}(S^n)$ so that

$$\begin{aligned} S'_n=\{s\in S^n|\rho _S(\varepsilon _s,\sigma )<\delta \}. \end{aligned}$$

(74)

By the hypothesis that $\pi _n$ asymptotically resembles $\sigma ^n$, we can ensure

$$\begin{aligned} \pi _n(S'_n)>1-\frac{\epsilon }{2}, \end{aligned}$$

(75)

by making n large enough.

Consider any such n, as well as any $s=(s_1,s_2,\ldots ,s_n)\in S'_n$ and $i=1,2,\ldots ,I$. It follows from $\delta \le d_S(\bar{S}',\bar{S}'')\wedge (\min _{i\ne j}d_{ij})$ that $(\{\bar{s}_i\})^\delta $, whose meaning comes from (46, 47), is still $\{\bar{s}_i\}$ itself. Now by (74),

$$\begin{aligned} \varepsilon _s(\{\bar{s}_i\})<\sigma ((\{\bar{s}_i\})^\delta )+\delta =\sigma _i+\delta , \end{aligned}$$

(76)

and

$$\begin{aligned} \varepsilon _s(\{\bar{s}_i\})= & {} 1-\varepsilon _s(\{\bar{s}_j|j\ne i\}\cup \bar{S}'')>1-\sigma ((\{\bar{s}_j|j\ne i\}\cup \bar{S}'')^\delta )-\delta \nonumber \\= & {} 1-\sigma (\{\bar{s}_j|j\ne i\}\cup \bar{S}'')-\delta =\sigma _i-\delta , \end{aligned}$$

(77)

which is still above $\delta >0$ by the fact that $\delta \le \min _i\sigma _i/2$. For convenience, let $n_i(s)=n\cdot \varepsilon _s(\{\bar{s}_i\})$, the number of components $s_m$ of s that happen to be $\bar{s}_i$. Now we know that $n_i(s)$ is above $n\delta $ for every $s\in S'_n$ and $i=1,2,\ldots ,I$.

On the other hand, by Lemma 2, there exists some $\underline{n}_i$ for each $i=1,2,\ldots ,I$, so that when $n_i>\underline{n}_i$,

$$\begin{aligned} (\chi (\bar{s}_i))^{n_i}(X'_{in_i})>1-\frac{\epsilon }{2I}, \end{aligned}$$

(78)

where

$$\begin{aligned} X'_{in_i}=\{x\in X^{n_i}|\rho _X(\varepsilon _x,\chi (\bar{s}_i))<\delta \}. \end{aligned}$$

(79)

Since $\delta >0$, we can ensure that $n\delta $ and hence $n_i(s)$ is above $\underline{n}_i$ for every $i=1,2,\ldots ,I$ by letting n be large enough.

Fix a big n that facilitates both (75) and (78). For any $(s,x)\in S^n\times X^n$, let $\tilde{x}_i(s,x)$ be the $n_i(s)$-long vector of $x_m$’s whose corresponding $s_m$’s happen to be $\bar{s}_i$:

$$\begin{aligned} \tilde{x}_i(s,x)=(x_m|m=1,2,\ldots ,n \text{ but } \text{ with } s_m=\bar{s}_i)\in X^{n_i(s)}. \end{aligned}$$

(80)

Define $U'_n\in \mathcal{B}(S^n\times X^n)$, so that

$$\begin{aligned} U'_n=\{(s,x)\in S^n\times X^n|s\in S'_n \text{ and } \tilde{x}_i(s,x)\in X'_{in_i(s)} \text{ for } \text{ each } i=1,2,\ldots ,I\}.\nonumber \\ \end{aligned}$$

(81)

By (16), (75), and (78), we have

(82)

For any (s, x) in $U'_n$, let us examine how close $\varepsilon _{sx}=\varepsilon _{((s_1,x_1),\ldots ,(s_n,x_n))}$ is to $\sigma \otimes \chi $. Recall that $S=\{\bar{s}_1,\bar{s}_2,\ldots ,\bar{s}_I\}\cup \bar{S}''$. So for any $U'\in \mathcal{B}(S\times X)$,

$$\begin{aligned} U'=\left( \bigcup _{i=1}^I\{\bar{s}_i\}\times X'_i\right) \bigcup U'', \end{aligned}$$

(83)

where $X'_i\in \mathcal{B}(X)$ for $i=1,2,\ldots ,I$, while $U''$ is such that $s''\in \bar{S}''$ for any $(s'',x'')\in U''$. Note again that $\delta \le d_S(\bar{S}',\bar{S}'')\wedge \min _{i\ne j}d_{ij}$. When we take $d_{S\times X}$ to mean $d_{S\times X}((s',x'),(s'',x''))=d_S(s',s'')\vee d_X(x',x'')$, (83) would lead to

$$\begin{aligned} \bigcup _{i=1}^I\{\bar{s}_i\}\times (X'_i)^\delta \subseteq (U')^\delta . \end{aligned}$$

(84)

Now from (76) and (79),

$$\begin{aligned} \varepsilon _{sx}(\{\bar{s}_i\}\times X'_i)= & {} \varepsilon _s(\{\bar{s}_i\})\cdot \varepsilon _{\tilde{x}_i(s,x)}(X'_i)<(\sigma _i+\delta )\cdot [\chi (\bar{s}_i|(X'_i)^\delta )+\delta ] \nonumber \\\le & {} (\sigma \otimes \chi )(\{\bar{s}_i\}\times (X'_i)^\delta )+2\delta +\delta ^2<(\sigma \otimes \chi )(\{\bar{s}_i\}\times (X'_i)^\delta )+3\delta ,\nonumber \\ \end{aligned}$$

(85)

where the last inequality is due to our choice that $\delta \le \epsilon /I<1$. Meanwhile,

$$\begin{aligned} \varepsilon _{sx}(U'')\le \varepsilon _{sx}(\bar{S}''\times X)=\varepsilon _s(\bar{S}''){=}1{-}\sum _{i=1}^I\varepsilon _s(\{\bar{s}_i\})<1-\sum _{i=1}^I\sigma _i+I\delta <\epsilon +I\delta ,\nonumber \\ \end{aligned}$$

(86)

where the second-to-last inequality is due to (77) and the last one is due to (72). Combine (83) to (86), and we can obtain

$$\begin{aligned} \varepsilon _{sx}(U')<(\sigma \otimes \chi )((U')^\delta )+\epsilon +4I\delta . \end{aligned}$$

(87)

Thus,

$$\begin{aligned} \rho _{S\times X}(\varepsilon _{sx},\sigma \otimes \chi )<\epsilon +4I\delta \le 5\epsilon , \end{aligned}$$

(88)

where the last inequality comes from our choice that $\delta \le \epsilon /I$. Since (82) and (88) are to occur at any n that is large enough, we see that (i) is true.

We then prove (ii). For convenience, we denote $S\times X$ by U, $\sigma \otimes \chi $ by $\tau $, and for each $n\in \mathbb {N}$, $\pi _n\otimes \chi ^n$ by $\nu _n$. From (i), we have the sequence $\nu _n$ asymptotically resembling the sequence $\tau ^n$.

Fix some $\epsilon \in (0,1)$. Due to the countability of S and X , and hence that of U, we can identify some J points $\bar{u}_1,\bar{u}_2,\ldots ,\bar{u}_J$, so that each $\tau (\{\bar{u}_j\})>0$ and

$$\begin{aligned} \sum _{j=1}^J\tau (\{\bar{u}_j\})>1-\epsilon . \end{aligned}$$

(89)

For convenience, let $\bar{U}'=\{\bar{u}_1,\bar{u}_2,\ldots ,\bar{u}_J\}$ and $\bar{U}''=U\setminus \bar{U}'$.

As S and X are both discrete, so U is discrete as well. Thence, the distance $d_U(\bar{U}',\bar{U}'')=\inf _{u'\in \bar{U}',u''\in \bar{U}''}d_U(u',u'')>0$. For $j,k=1,2,\ldots ,J$, let us use $d'_{jk}$ for $d_U(\bar{u}_j,\bar{u}_k)$ and $\tau _j$ for $\tau (\{\bar{u}_j\})$. Now define

$$\begin{aligned} \delta =\frac{\epsilon }{J}\wedge d_U(\bar{U}',\bar{U}'')\wedge \left( \min _{j\ne k}d'_{jk}\right) \wedge \left( \min _j\frac{\tau _j}{2}\right) , \end{aligned}$$

(90)

which is still strictly positive.

For any $n\in \mathbb {N}$, define $U'_n\in \mathcal{B}(U^n)$ so that

$$\begin{aligned} U'_n{=}\left\{ u\in U^n|\rho _U(\varepsilon _u,\tau )\bigvee \left[ 2\cdot \sup \nolimits _{u'\in U}\max _{m=1}^n \rho _S(g(u',\varepsilon _{u_{-m}}),g(u',\tau ))\right] {<}\delta \right\} .\quad \quad \end{aligned}$$

(91)

By (i) that $\nu _n$ asymptotically resembles $\tau ^n$, the hypothesis that $g(u,\cdot )$ is continuous at a u-independent rate, and Lemma 4, we can ensure

$$\begin{aligned} \nu _n(U'_n)>1-\frac{\epsilon }{2}, \end{aligned}$$

(92)

by making n large enough,

Consider any such n, as well as any $u=(u_1,u_2,\ldots ,u_n)\in U'_n$ and $j=1,2,\ldots ,J$. It follows from $\delta \le d_U(\bar{U}',\bar{U}'')\wedge (\min _{j\ne k}d'_{jk})$ that $(\{\bar{u}_j\})^\delta $ is still $\{\bar{u}_j\}$ itself. Now by (91),

$$\begin{aligned} \varepsilon _u(\{\bar{u}_j\})<\tau ((\{\bar{u}_j\})^\delta )+\delta =\tau _j+\delta , \end{aligned}$$

(93)

and

$$\begin{aligned} \varepsilon _u(\{\bar{u}_j\})>1-\tau ((\{\bar{u}_k|k\ne j\}\cup \bar{U}'')^\delta )-\delta =\tau _j-\delta , \end{aligned}$$

(94)

which is still above $\delta >0$ by the fact that $\delta \le \min _j\tau _j/2$. For convenience, let $n'_j(u)=n\cdot \varepsilon _u(\{\bar{u}_j\})$. Now we know that $n'_j(u)$ is above $\lfloor n\delta \rfloor $ for every $j=1,2,\ldots ,J$.

Due to the countability of U and Lemma 3 on the uniform Glivenko–Cantelli property, there exists some $\underline{n}'$, independent of both j and u, such that when every $n'_j(u)>\underline{n}'$,

$$\begin{aligned} (g(\bar{u}_j,\varepsilon _{u\setminus \bar{u}_j}))^{n'_j(u)}(S''_{jn'_j(u)}(u))>1-\frac{\epsilon }{2J},\quad \forall j=1,2,\ldots ,J, \end{aligned}$$

(95)

where every $u\setminus \bar{u}_j$ is the $(n-1)$-long vector that is almost identical to u but with only $n'_j(u)-1$ components equal to $\bar{u}_j$, and

$$\begin{aligned} S''_{jn'}(u')=\left\{ s\in S^{n'}|\rho _S(\varepsilon _s,g(\bar{u}_j,\varepsilon _{u'\setminus \bar{u}_j}))<\frac{\delta }{2}\right\} . \end{aligned}$$

(96)

But in light of (91), we can really guarantee that

$$\begin{aligned} (g(\bar{u}_j,\varepsilon _{u\setminus \bar{u}_j}))^{n'_j(u)}(S'_{jn'_j(u)})>1-\frac{\epsilon }{2J},\quad \forall j=1,2,\ldots ,J, \end{aligned}$$

(97)

where

$$\begin{aligned} S'_{jn'}=\{s\in S^{n'}|\rho _S(\varepsilon _s,g(\bar{u}_j,\tau ))<\delta \}. \end{aligned}$$

(98)

Since $\delta >0$, we can ensure that $\lfloor n\delta \rfloor $ and hence $n'_j(u)$ is above $\underline{n}'$ for every $j=1,2,\ldots ,J$ by letting n be large enough.

Fix a big n that facilitates both (92) and (97). For any $(u,s)\in U^n\times S^n$, let $\tilde{s}_j(u,s)$ be the $n'_j(u)$-long vector of $s_m$’s whose corresponding $u_m$’s happen to be $\bar{u}_j$:

$$\begin{aligned} \tilde{s}_j(u,s)=(s_m|m=1,2,\ldots ,n \text{ but } \text{ with } u_m=\bar{u}_j)\in S^{n'_j(u)}. \end{aligned}$$

(99)

Define $V'_n\in \mathcal{B}(U^n\times S^n)$, so that

$$\begin{aligned} V'_n=\{(u,s)\in U^n\times S^n|u\in U'_n \text{ and } \tilde{s}_j(u,s)\in S'_{jn'_j(u)} \text{ for } \text{ each } j=1,2,\ldots ,J\}.\nonumber \\ \end{aligned}$$

(100)

Let us follow the same logic as used from (82) to (88) in the proof of (i), with appropriate substitutions, such as J for I, U for S, S for X, $\nu _n$ for $\pi _n$, $\tau $ for $\sigma $, $g(\cdot ,\cdot ,\tau )$ for $\chi $, $g^n$ for $\chi ^n$, $V'_n$ for $U'_n$, (92) for (75), and (97) for (78). We can then derive that

$$\begin{aligned} (\nu _n\otimes g^n)(V'_n)>1-\epsilon , \end{aligned}$$

(101)

whereas, for any (u, s) in $V'_n$,

$$\begin{aligned} \rho _{U\times S}(\varepsilon _{us},\tau \otimes g(\cdot ,\cdot ,\tau ))<5\epsilon . \end{aligned}$$

(102)

Since (101) and (102) are to occur at any n that is large enough, we see that $\nu _n\otimes g^n$ would asymptotically resemble $(\tau \otimes g(\cdot ,\cdot ,\tau ))^n$. Lemma 7 will then lead to the asymptotic resemblance of the sequence $\nu _n\odot g^n=(\nu _n\otimes g^n)|_{S^n}$ to the sequence $(\tau \odot g(\cdot ,\cdot ,\tau ))^n=((\tau \otimes g(\cdot ,\cdot ,\tau ))|_S)^n$. Thus (ii) is true.

For (iii), denote the given (s, x) by $u_1$. By Lemma 4, we can make $\varepsilon _u=(u_1,\varepsilon _{u_{-1}})_n$ arbitrarily close to $\varepsilon _{u_{-1}}$ for any $u_{-1}=(u_2,u_3,\ldots ,u_n)\in U^{n-1}$ by letting n be large enough. Hence, we can follow the proof of (ii) almost verbatim, with its (91) replaced by

$$\begin{aligned} U'_{n-1}=\left\{ u_{-1}\in U^{n-1}|\rho _U(\varepsilon _u,\tau )\bigvee \left[ 2\cdot \sup \nolimits _{u'\in U}\max _{m=1}^n \rho _S(g(u',\varepsilon _{u_{-m}}),g(u',\tau ))\right] <\delta \right\} ,\nonumber \\ \end{aligned}$$

(103)

its (92) replaced by

$$\begin{aligned} (\delta _{u_1}\times \nu _{n-1})(\{u_1\}\times U'_{n-1})=\nu _{n-1}(U'_{n-1})>1-\frac{\epsilon }{2}, \end{aligned}$$

(104)

any choice of $u\in U^n$ replaced by $u_{-1}\in U^{n-1}$, and any choice of $u\in U'_n$ replaced by $u_{-1}\in U'_{n-1}$. $\square $

Proof of Theorem 1

We prove by induction on $t'$.

First, note that $T_{[t,t-1]}\circ \sigma _t$ is merely $\sigma _t$ itself. Hence, the claim is true for $t'=t$ because by the hypothesis, we do have $\pi _{nt}$ asymptotically resembling $(T_{[t,t-1]}\circ \sigma _t)^n=\sigma _t^{\;n}$. Then, for some $t'=t,t+1,\ldots ,\bar{t}$, suppose the claim is true, that $\pi _{nt'}=\pi _{nt}\odot \Pi _{t''=t}^{t'-1}(\chi _{t''}^{\;n}\odot \tilde{g}_{t''}^{\;n})$ asymptotically resembles $\sigma _{t'}^{\;n}=(T_{[t,t'-1]}(\chi _{[t,t'-1]})\circ \sigma _t)^n$.

Assumption 1 on $\tilde{g}_{t'}(s,x,\tau )$’s equi-continuity in $\tau $ allows us to use part (ii) of Proposition 1. By it, we would have $\pi _{nt'}\odot \chi _{t'}^{\;n}\odot \tilde{g}_{t'}^{\;n}$ asymptotically resembling $(\sigma _{t'}\odot \chi _{t'}\odot \tilde{g}_t(\cdot ,\cdot ,\sigma _{t'}\otimes \chi _{t'}))^n$. Since the former is merely $\pi _{n,t'+1}=\pi _{nt}\odot \Pi _{t''=t}^{t'}(\chi _{t''}^{\;n}\odot \tilde{g}_{t''}^{\;n})$ and the latter is $\sigma _{t'+1}^{\;\;\;n}=(T_{[tt']}(\chi _{[tt']})\circ \sigma _t)^n$, we have thus proved the claim for $t'+1$.

The induction process is now complete. $\square $

Appendix 3: Proofs of Sect. 6

Proof of Proposition 2

Let us prove by induction on t. By (21) and (25), the desired result is true for $t=\bar{t}+1$.

At some $t=\bar{t},\bar{t}-1,\ldots ,1$, suppose for any $\sigma _{t+1}$ and any sequence $\hat{\pi }_{n-1,t+1}$ that asymptotically resembles $\sigma _{t+1}^{n-1}$, the sequence $\int _{S^{n-1}}\hat{\pi }_{n-1,t+1}(ds_{t+1,-1})\cdot v_{n,t+1}(s_{t+1,1},\xi _{[t+1,\bar{t}]},,\varepsilon _{s_{t+1,-1}},\chi _{[t+1,\bar{t}]})$ converges to $v_{t+1}(s_{t+1,1},\xi _{[t+1,\bar{t}]},\sigma _{t+1},\chi _{[t+1,\bar{t}]})$ at a rate independent of both $s_{t+1,1}$ and $\xi _{[t+1,\bar{t}]}$.

Now, given the sequence $\hat{\pi }_{n-1,t}$ that is known to asymptotically resemble $\sigma _t^{\;n-1}$, we are to show that $\int _{S^{n-1}}\hat{\pi }_{n-1,t}(ds_{t,-1})\cdot v_{nt}(s_{t1},\xi _{[t\bar{t}]},\varepsilon _{s_{t,-1}},\chi _{[t\bar{t}]})$ will converge to $v_t(s_{t1},\xi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]})$ at a rate independent of both $s_{t1}$ and $\xi _{[t\bar{t}]}$. For convenience, let $\sigma _{t+1}=T_t(\chi _t)\circ \sigma _t$.

Note that, by (22) and (26),

$$\begin{aligned}&\sup \nolimits _{s_{t1}\in S,\xi _{[t\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t+1}}\left| \phantom {\int _{S^{n-1}}} v_t(s_{t1},\xi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]})\right. \nonumber \\&\quad \left. -\int _{S^{n-1}}\hat{\pi }_{n-1,t}(ds_{t,-1})\cdot v_{nt}(s_{t1},\xi _{[t\bar{t}]},\varepsilon _{s_{t,-1}},\chi _{[t\bar{t}]})\right| \le M_{n1}+M_{n2}+M_{n3},\qquad \qquad \end{aligned}$$

(105)

where

$$\begin{aligned} M_{n1}= & {} \sup \nolimits _{(s_{t1},x_{t1})\in S\times X}\int _{S^{n-1}\times X^{n-1}}(\hat{\pi }_{n-1,t}\otimes \chi _t^{\;n-1})(ds_{t,-1}\times dx_{t,-1}) \nonumber \\&\times \mid \tilde{f}_t(s_{t1},x_{t1},\sigma _t\otimes \chi _t)-\tilde{f}_t(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}})\mid , \end{aligned}$$

(106)

$$\begin{aligned} M_{n2}= & {} \sup \nolimits _{(s_{t1},x_{t1})\in S\times X,\xi _{[t+1,\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t}}\int _S\tilde{g}_t(s_{t1},x_{t1},\sigma _t\otimes \chi _t|ds_{t+1,1})\nonumber \\&\times \, \Big | v_{t+1}(s_{t+1,1},\xi _{[t+1,\bar{t}]},\sigma _{t+1},\chi _{[t+1,\bar{t}]})\nonumber \\&-\int _{S^{n-1}\times X^{n-1}}(\hat{\pi }_{n-1,t}\otimes \chi _t^{\;n-1})(ds_{t,-1}\times dx_{t,-1})\nonumber \\&\times \,\Pi _{m=2}^n\int _S \tilde{g}_t(s_{tm},x_{tm},\varepsilon _{s_{t,-m}x_{t,-m}}|ds_{t+1,m})\nonumber \\&\times \, v_{n,t+1}(s_{t+1,1},\xi _{[t+1,\bar{t}]},\varepsilon _{s_{t+1,-1}},\chi _{[t+1,\bar{t}]})\Big |, \end{aligned}$$

(107)

and

$$\begin{aligned} M_{n3}= & {} \sup \nolimits _{(s_{t1},x_{t1})\in S\times X,\xi _{[t+1,\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t}}\nonumber \\&\times \int _{S^{n-1}\times X^{n-1}}(\hat{\pi }_{n-1,t}\otimes \chi _t^{\;n-1})(ds_{t,-1}\times dx_{t,-1}) \nonumber \\&\times \left| \left[ \int _S\tilde{g}_t(s_{t1},x_{t1},\sigma _t\otimes \chi _t|ds_{t+1,1})-\int _S\tilde{g}_t(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}}|ds_{t+1,1})\right] \right. \nonumber \\&\times \,\Pi _{m=2}^n\int _S\tilde{g}_t(s_{tm},x_{tm},\varepsilon _{s_{t,-m}x_{t,-m}}|ds_{t+1,m})\nonumber \\&\left. \times \, v_{n,t+1}(s_{t+1,1},\xi _{[t+1,\bar{t}]},\varepsilon _{s_{t+1,-1}},\chi _{[t+1,\bar{t}]})\phantom {\left[ \int _S\tilde{g}_t(s_{t1},x_{t1},\sigma _t\otimes \chi _t|ds_{t+1,1})-\int _S\tilde{g}_t(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}}|ds_{t+1,1})\right] }\right| . \end{aligned}$$

(108)

We now show that each of the above three terms can be made arbitrarily small by letting n be large enough.

For $M_{n1}$, define $\tilde{U}_{n-1}(\delta )\in \mathcal{B}(S^{n-1}\times X^{n-1})$ for every $\delta >0$, so that

$$\begin{aligned} \tilde{U}_{n-1}(\delta )=\{(s_{t,-1},x_{t,-1})\in S^{n-1}\times X^{n-1}|\rho _{S\times X}(\varepsilon _{s_{t,-1}x_{t,-1}},\sigma _t\otimes \chi _t)<\delta \}.\qquad \end{aligned}$$

(109)

From (106), we know $M_{n1}\le M_{n11}(\delta )+M_{n12}(\delta )$ for any $\delta >0$, where

(110)

and

$$\begin{aligned} M_{n12}(\delta )= & {} \sup \nolimits _{(s_{t1},x_{t1})\in S\times X}\int _{(S^{n-1}\times X^{n-1})\setminus \tilde{U}_{n-1}(\delta )} (\hat{\pi }_{n-1,t}\otimes \chi _t^{\;n-1})(ds_{t,-1}\times dx_{t,-1}) \nonumber \\&\times \,[\mid \tilde{f}_t(s_{t1},x_{t1},\sigma _t\otimes \chi _t)\mid +\mid \tilde{f}_t(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}})\mid ]. \end{aligned}$$

(111)

Because Assumption 2 says that $\tilde{f}_t(s,x,\tau )$ is continuous in $\tau $ at an (s, x)-independent rate, we can make $M_{n11}(\delta )$ arbitrarily small by letting $\delta $ be small enough. Meanwhile, by the asymptotic resemblance of the sequence $\hat{\pi }_{n-1,t}$ to the sequence $\sigma _t^{\;n-1}$ and part (i) of Proposition 1, we know that the sequence $\hat{\pi }_{n-1,t}\otimes \chi _t^{\;n-1}$ asymptotically resembles the sequence $(\sigma _t\otimes \chi _t)^{n-1}$. So the measure $(\hat{\pi }_{n-1,t}\otimes \chi _t^{\;n-1})((S^{n-1}\times X^{n-1})\setminus \tilde{U}_{n-1}(\delta ))$ can be made arbitrarily small at any $\delta $ by letting n be large enough. Since $\tilde{f}_t$ is bounded, this means that $M_{n12}(\delta )$ can be made arbitrarily small as well.

For $M_{n2}$, note the second integral in (107) can be understood as $\hat{\pi }_{n-1,t+1}(s_{t1},x_{t1}|ds_{t+1,-1}){=}\int _{S^{n-1}}\{[(\delta _{s_{t1}x_{t1}}\times (\sigma _t^{\;n-1}\otimes \chi _t^{\;n-1}))\odot \tilde{g}_t^{\;n}]|_{S^{n-1}}\}(ds_{t+1,-1})$. So we have

$$\begin{aligned} M_{n2}\le & {} \sup \nolimits _{(s_{t1},x_{t1})\in S\times X,s_{t+1,1}\in S,\xi _{[t+1,\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t}}\Big | v_{t+1}(s_{t+1,1},\xi _{[t+1,\bar{t}]},\sigma _{t+1},\chi _{[t+1,\bar{t}]}) \nonumber \\&{-}\int _{S^{n-1}}\hat{\pi }_{n-1,t+1}(s_{t1},x_{t1}|ds_{t+1,-1})\cdot v_{n,t+1}(s_{t+1,1},\xi _{[t+1,\bar{t}]},\varepsilon _{s_{t+1,-1}},\chi _{[t+1,\bar{t}]})\Big |.\nonumber \\ \end{aligned}$$

(112)

Meanwhile, Assumption 1 allows us to use part (iii) of Proposition 1. By the asymptotic resemblance of the sequence $\hat{\pi }_{n-1,t}$ to the sequence $\sigma _t^{\;n-1}$, part (iii) of Proposition 1, and Lemma 5, we know that the sequence $\hat{\pi }_{n-1,t+1}(s_{t1},x_{t1})$ asymptotically resembles the sequence $\sigma _{t+1}^{n-1}$ at an $(s_{t1},x_{t1})$-independent rate. Then by the induction hypothesis where the convergence rate is also $(s_{t+1,1},\xi _{[t+1,\bar{t}]})$-independent, we can conclude that $M_{n2}$ can be made arbitrarily small by letting n be large enough.

For $M_{n3}$, define $V_n(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}},\xi _{[t+1,\bar{t}]})$ so that

$$\begin{aligned} V_n(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}},\xi _{[t+1,\bar{t}]})= & {} \left| \left[ \int _S\tilde{g}_t(s_{t1},x_{t1},\sigma _t\otimes \chi _t|ds_{t+1,1})\right. \right. \nonumber \\&\left. -\int _S\tilde{g}_t(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}}|ds_{t+1,1})\right] \nonumber \\&\times \, \Pi _{m=2}^n\tilde{g}_t(s_{tm},x_{tm},\varepsilon _{s_{t,-m}x_{t,-m}}|ds_{t+1,m}) \nonumber \\&\left. \times \, v_{n,t+1}(s_{t+1,1},\xi _{[t+1,\bar{t}]},\varepsilon _{s_{t+1,-1}},\chi _{[t+1,\bar{t}]})\phantom {\int _S}\right| .\qquad \qquad \end{aligned}$$

(113)

Then, (108) can be written as

$$\begin{aligned} M_{n3}= & {} \sup \nolimits _{(s_{t1},x_{t1})\in S\times X,\xi _{[t+1,\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t}}\int _{S^{n-1}\times X^{n-1}} \nonumber \\&\,(\hat{\pi }_{n-1,t}\otimes \chi _t^{\;n-1})(ds_{t,-1}\times dx_{t,-1})\nonumber \\&\times \, V_n(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}},\xi _{[t+1,\bar{t}]}). \end{aligned}$$

(114)

Noting the definition of $\tilde{U}_{n-1}(\delta )$ in (109) for any $\delta >0$, we see that $M_{n3}\le M_{n31}(\delta )+M_{n32}(\delta )$, where

$$\begin{aligned} M_{n31}(\delta )= & {} \sup \nolimits _{(s_{t1},x_{t1})\in S\times X,(s_{t,-1},x_{t,-1})\in \tilde{U}_{n-1}(\delta ),\xi _{[t+1,\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t}}\nonumber \\&\quad \quad \quad \quad \quad \quad V_n(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}},\xi _{[t+1,\bar{t}]}), \end{aligned}$$

(115)

and

$$\begin{aligned} M_{n32}(\delta )= & {} \sup \nolimits _{(s_{t1},x_{t1})\in S\times X,\xi _{[t+1,\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t}}\int _{(s_{t,-1},x_{t,-1})\in (S^{n-1}\times X^{n-1})\setminus \tilde{U}_{n-1}(\delta )}\nonumber \\&(\hat{\pi }_{n-1,t}\otimes \chi _t^{\;n-1})(ds_{t,-1}\times dx_{t,-1})\cdot V_n(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}},\xi _{[t+1,\bar{t}]}).\nonumber \\ \end{aligned}$$

(116)

We argue that $M_{n31}(\delta )$ can be made arbitrarily small as $\delta $ approaches $0^+$. Due to Assumption 1 that $\tilde{g}_t(s,x,\tau )$ is continuous in $\tau $ at an (s, x)-independent rate, we can make $\tilde{g}_t(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}})$ for any $(s_{t,-1},x_{t,-1})\in \tilde{U}_{n-1}(\delta )$ arbitrarily close to $\tilde{g}_t(s_{t1},x_{t1},\sigma _t\otimes \chi _t)$ by rendering $\delta $ small enough, without respect to $(s_{t1},x_{t1})$. Due to its countability, we can write $S=\{\bar{s}_1,\bar{s}_2,\ldots \}$. Under known $s_{t1}$, $x_{t1}$, $\varepsilon _{s_{t,-1}x_{t,-1}}$, and $\xi _{[t+1,\bar{t}]}$, let us use the simplified notation

$$\begin{aligned} \gamma _i= & {} \tilde{g}_t(s_{t1},x_{t1},\sigma _t\otimes \chi _t|\{\bar{s}_i\}), \end{aligned}$$

(117)

$$\begin{aligned} \gamma '_i= & {} \tilde{g}_t(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}}|\{\bar{s}_i\}), \end{aligned}$$

(118)

and

$$\begin{aligned} v_i= & {} \Pi _{m=2}^n\int _S \tilde{g}_t(s_{tm},x_{tm},\varepsilon _{s_{t,-m}x_{t,-m}}|ds_{t+1,m})\nonumber \\&\quad \quad \quad \quad \quad \quad \quad \quad \times v_{n,t+1}(s_{t+1,1},\xi _{[t+1,\bar{t}]},\varepsilon _{s_{t+1,-1}},\chi _{[t+1,\bar{t}]}). \end{aligned}$$

(119)

Then, (113) can be expressed as

$$\begin{aligned} V_n(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}},\xi _{[t+1,\bar{t}]})=\left| \sum _i \gamma _i\cdot v_i-\sum _i\gamma '_i\cdot v_i\right| . \end{aligned}$$

(120)

Note the $\mid v_i\mid $’s are uniformly bounded, say by $\overline{v}$, due to the boundedness of the $\tilde{f}_{t'}$’s and the finiteness of $\bar{t}$. Let I be the set of i’s such that $\gamma _i\ge \gamma '_i$. Then, from (120), we have

$$\begin{aligned} V_n(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}},\xi _{[t+1,\bar{t}]})\le 2\overline{v}\cdot \sum _{i\in I}(\gamma _i-\gamma '_i). \end{aligned}$$

(121)

Let $\delta $ be below $\inf _{s\ne s'}d_S(s,s')>0$. But then, $(s_{t,-1},x_{t,-1})\in \tilde{U}_{n-1}(\delta )$ would entail

$$\begin{aligned} \sum _{i\in I}(\gamma _i-\gamma '_i)= & {} \tilde{g}_t(s_{t1},x_{t1},\sigma _t\otimes \chi _t|\{s_i|i\in I\})-\tilde{g}_t(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}}|\{s_i|i\in I\})\nonumber \\= & {} \tilde{g}_t(s_{t1},x_{t1},\sigma _t\otimes \chi _t|\{s_i|i\in I\})-\tilde{g}_t(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}}|(\{s_i|i\in I\})^\delta )<\delta .\qquad \nonumber \\ \end{aligned}$$

(122)

In view of (121), $V_n(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}},\xi _{[t+1,\bar{t}]})$ with $(s_{t,-1},x_{t,-1})\in \tilde{U}_{n-1}(\delta )$ can be made arbitrarily small by decreasing $\delta $ at a rate independent of $(s_{t1},x_{t1},\xi _{[t+1,\bar{t}]})$. In view of (115), we see that $M_{n31}(\delta )$ can be made arbitrarily small by rendering $\delta $ small enough.

As noted earlier, the probability $(\hat{\pi }_{n-1,t}\otimes \chi _t^{\;n-1})((S^{n-1}\times X^{n-1})\setminus \tilde{U}_{n-1}(\delta ))$ can be made arbitrarily small at any $\delta $ when n is made large enough. But since $V_n(s_{t1},x_{t1},\varepsilon _{s_{t,-1}x_{t,-1}},\xi _{[t+1,\bar{t}]})$ is uniformly bounded, this means that $M_{n12}(\delta )$ can be made arbitrarily small as well.

Hence, all three terms can be made arbitrarily small by letting n be large enough. We have thus completed the induction process. $\square $

Proof of Theorem 2

Given (23) for every $t=1,2,\ldots ,\bar{t}$ and $\xi _t\in \mathcal{K}(S,X)$, we are to verify (27) for every $t=1,2,\ldots ,\bar{t}$, $\epsilon >0$, large enough n, $s_{t1}\in S$, and $\xi _{[t\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t+1}$.

First, we show that the one-time formulation of (23) would already imply the futility of any multi-period unilateral deviation. Another way to write the condition is, at $t'=0$, for any $t=1,2,\ldots ,\bar{t}-t'$ and $\xi _{[t,t+t']}\in (\mathcal{K}(S,X))^{t'+1}$,

$$\begin{aligned} v_t(s_t,(\xi _{[t,t+t'-1]},\chi _{[t+t',\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]})\ge v_t(s_t,(\xi _{[t,t+t']},\chi _{[t+t'+1,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]}).\qquad \quad \end{aligned}$$

(123)

Now suppose (123) is true for some $t'=0,1,\ldots ,\bar{t}-1$. We are to show its validity at $t'+1$. But by (22), for any $t=1,2,\ldots ,\bar{t}-t'$,

$$\begin{aligned}&v_t(s_t,(\xi _{[t,t+t']},\chi _{[t+t'+1,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]})-v_t(s_t,(\xi _{[t,t+t'+1]},\chi _{[t+t'+2,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]}) \nonumber \\&\quad =\int _X\xi _t(s_t|dx_t)\cdot \int _S\tilde{g}_t(s_t,x_t,\sigma _t\otimes \chi _t|ds_{t+1}) \nonumber \\&\qquad \times \, [v_{t+1}(s_{t+1},(\xi _{[t+1,t+t']},\chi _{[t+t'+1,\bar{t}]}),T_t(\chi _t)\circ \sigma _t,\chi _{[t+1,\bar{t}]})\nonumber \\&\qquad -\,v_{t+1}(s_{t+1},(\xi _{[t+1,t+t'+1]},\chi _{[t+t'+2,\bar{t}]}),T_t(\chi _t)\circ \sigma _t,\chi _{[t+1,\bar{t}]})], \end{aligned}$$

(124)

which, by the induction hypothesis (123), is positive. Therefore, (123) is true for any $t=1,2,\ldots ,\bar{t}$, $t'=0,1,\ldots ,\bar{t}-t$, and $\xi _{[t,t+t']}\in (\mathcal{K}(S,X))^{t'+1}$.

Using (123) multiple times, we can derive, for any $t=1,2,\ldots ,\bar{t}$ and $\xi _{[t\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t+1}$,

$$\begin{aligned} v_t(s_t,\chi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]})\ge & {} v_t(s_t,(\xi _t,\chi _{[t+1,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]})\nonumber \\\ge & {} v_t(s_t,(\xi _{[t,t+1]},\chi _{[t+2,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]})\nonumber \\\ge & {} \cdots \ge v_t(s_t,(\xi _{[t,\bar{t}-1]},\chi _{\bar{t}}),\sigma _t,\chi _{[t\bar{t}]})\nonumber \\\ge & {} v_t(s_t,\xi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]}). \end{aligned}$$

(125)

In view of (125), we would have (27) if for any $\epsilon $ and large enough n,

$$\begin{aligned} \int _{S^{n-1}}\hat{\pi }_{n-1,t}(ds_{t,-1})\cdot v_{nt}(s_{t1},\chi _{[t\bar{t}]},\varepsilon _{s_{t,-1}},\chi _{[t\bar{t}]})>v_t(s_{t1},\chi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]})-\frac{\epsilon }{2}, \end{aligned}$$

(126)

and for any $\xi _{[t\bar{t}]}\in (\mathcal{K}(S,X))^{\bar{t}-t+1}$,

$$\begin{aligned} \int _{S^{n-1}}\hat{\pi }_{n-1,t}(ds_{t,-1})\cdot v_{nt}(s_{t1},\xi _{[t\bar{t}]},\varepsilon _{s_{t,-1}},\chi _{[t\bar{t}]})<v_t(s_{t1},\xi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]})+\frac{\epsilon }{2}. \end{aligned}$$

(127)

Both (126) and (127) would be true if

$$\begin{aligned} \int _{S^{n-1}}\hat{\pi }_{n-1,t}(ds_{t,-1})\cdot v_{nt}(s_{t1},\xi _{[t\bar{t}]},\varepsilon _{s_{t,-1}},\chi _{[t\bar{t}]})\longrightarrow _{n\rightarrow +\infty } v_t(s_{t1},\xi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]}),\qquad \end{aligned}$$

(128)

at an $(s_{t1},\xi _{[t\bar{t}]})$-independent convergence rate. But this was provided by Proposition 2. $\square $

Appendix 4: Proofs of Sect. 7

Proof of Proposition 3

Since A is discrete, we can denote it by either $\{\bar{a}_1,\bar{a}_2,\ldots \}$ or $\{\bar{a}_1,\ldots ,\bar{a}_I\}$ for some finite I. We work with the former only, as the latter is similarly treatable.

For any $n\in \mathbb {N}$, define

$$\begin{aligned} N_n=\left\{ (n_1,n_2,\ldots )|n_i=0,1,\ldots ,n \text{ for } \text{ each } i=1,2,\ldots ,\; \text{ and } \sum _{i=1}^{+\infty }n_i=n\right\} .\qquad \end{aligned}$$

(129)

For each $(n_1,n_2,\ldots )\in N_n$, define $A^n_{n_1n_2\cdots }$ so that

$$\begin{aligned} A^n_{n_1n_2\cdots }=\left\{ a\in A^n|\varepsilon _a(\{\bar{a}_i\})=\frac{n_i}{n} \text{ for } \text{ any } i=1,2,\ldots \right\} . \end{aligned}$$

(130)

Note that every $A^n_{n_1n_2\cdots }$ is symmetric, different $A^n_{n_1n_2\cdots }$’s are non-overlapping, and

$$\begin{aligned} A^n=\bigcup _{(n_1n_2\cdots )\in N_n}A^n_{n_1n_2\cdots }. \end{aligned}$$

(131)

Due to the above decomposition, each $a\in A^n$ belongs to its own $A^n_{n\cdot \varepsilon _a(\{\bar{a}_1\}),n\cdot \varepsilon _a(\{\bar{a}_2\}),\cdots }$.

For any $(n_1,n_2,\cdots )\in N_n$, the set $A^n_{n_1n_2\cdots }$ contains $n!/(\prod _{i=1}^{+\infty }n_i!)$ distinct members of $A^n$, say $a_1,\ldots ,a_{n!/(\prod _{i=1}^{+\infty }n_i!)}$. In addition, every $a_k$ is of the form $\psi a_1$ for some $\psi \in \Psi _n$. Thus, due to $q_n$’s symmetry, for $k=1,2,\ldots ,n!/(\prod _{i=1}^{+\infty }n_i!)$,

$$\begin{aligned} q_n(\{a_k\})=\frac{\prod _{i=1}^{+\infty }n_i!}{n!}\cdot q_n(A^n_{n_1n_2\cdots }). \end{aligned}$$

(132)

Suppose $n_i\ge 1$ for some $i=1,2,\ldots $. Then, exactly $(n-1)!/((n_i-1)!\cdot \prod _{j\ne i}n_j!)$ of the $a_k$’s will have $a_{k1}=\bar{a}_i$. Therefore, for any such $a_k$,

$$\begin{aligned} q_n((\{\bar{a}_i\}\times A^{n-1})\cap A^n_{n_1n_2\cdots })=\frac{(n_i-1)!\cdot \prod _{j\ne i}n_j!}{(n-1)!}\cdot q_n(\{a_k\})=\frac{n_i}{n}\cdot q_n(A^n_{n_1n_2\cdots }),\nonumber \\ \end{aligned}$$

(133)

where the second equality stems from (132). The above left- and right-hand sides are certainly equated as well when $n_i=0$. Combine (131) and (133), and we can obtain

$$\begin{aligned} q_n(\{\bar{a}_i\}\times A^{n-1})=\sum _{(n_1,n_2,\ldots )\in N_n}\frac{n_i}{n}\cdot q_n(A^n_{n_1n_2\cdots }). \end{aligned}$$

(134)

On the other hand, we have $\min _{i\ne j}d_A(\bar{a}_i,\bar{a}_j)>0$ due to A’s discreteness. Suppose $\epsilon >0$ is small enough to be strictly below this constant. Then by the nature of the Prohorov metric, $a\in A^n$ would satisfy $\rho _A(\varepsilon _a,p)<\epsilon $ if and only if

$$\begin{aligned} \sum _{i=1}^{+\infty }\mid \varepsilon _a(\{\bar{a}_i\})-p(\{\bar{a}_i\})\mid <2\epsilon , \end{aligned}$$

(135)

and hence only if

$$\begin{aligned} \max _{i=1}^{+\infty }\mid \varepsilon _a(\{\bar{a}_i\})-p(\{\bar{a}_i\})\mid <\epsilon . \end{aligned}$$

(136)

Since the sequence $q_n$ asymptotically resembles p, for any $\epsilon >0$ that is strictly below $\min _{i\ne j}d_A(\bar{a}_i,\bar{a}_j)>0$, we can pick n large enough so that (54) and (55) in the proof of Lemma 5 are true. Define $N'_n\subseteq N_n$ so that for any $(n_1,n_2,\ldots )\in N'_n$,

$$\begin{aligned} \sum _{i=1}^{+\infty }\Big | \frac{n_i}{n}-p(\{\bar{a}_i\})\Big |<2\epsilon ,\; \text{ and } \text{ hence } \;\max _{i=1}^{+\infty }\Big | \frac{n_i}{n}-p(\{\bar{a}_i\})\Big |<\epsilon . \end{aligned}$$

(137)

Due to (130), (131), and (135), we have the following for the $A'_n$ defined in (55):

$$\begin{aligned} A'_n=\bigcup _{(n_1,n_2,\ldots )\in N'_n}A^n_{n_1n_2\cdots }. \end{aligned}$$

(138)

Now for any $i=1,2,\ldots $, we have

$$\begin{aligned}&\mid q_n|_A(\{\bar{a}_i\})-p(\{\bar{a}_i\})\mid =\mid q_n(\{\bar{a}_i\}\times A^{n-1})-p(\{\bar{a}_i\})\mid \nonumber \\&\quad =\left| \left( \sum _{(n_1,n_2,\ldots )\in N'_n}+\sum _{(n_1,n_2,\ldots )\in N_n\setminus N'_n}\right) n_i\cdot q_n(A^n_{n_1n_2\cdots })/n-p(\{\bar{a}_i\})\right| \nonumber \\&\quad \le q_n(A'_n)\cdot \mid n_i/n-p(\{\bar{a}_i\})\mid +q_n(A^n\setminus A'_n)<2\epsilon . \end{aligned}$$

(139)

Here, the first equality comes from the definition of marginal probability, the second equality comes from (134), the first inequality can be attributed to (138), and the last inequality is due to (54) and (136). Thus, for every $a\in A$, we have $\lim _{n\rightarrow +\infty }q_n|_A(\{a\})=p(\{a\})$. $\square $

Proof Proposition 4

For the time being, it does not matter whether $A=\{\bar{a}_1,\bar{a}_2,\ldots \}$ or $\{\bar{a}_1,\ldots ,\bar{a}_I\}$ for some finite I. The first few steps are the same as those in the proof of Lemma 5. For any $\epsilon >0$ and n large enough, we can have (54) to (58) as in that proof.

Fix $i=1,2,\ldots $ with $p(\{\bar{a}_i\})>0$. Due to (57),

$$\begin{aligned} A'_n\cap (\{\bar{a}_i\}\times A^{n-1})\subseteq (A\times A''_{n-1})\cap (\{\bar{a}_i\}\times A^{n-1})=\{\bar{a}_i\}\times A''_{n-1}, \end{aligned}$$

(140)

where $A'_n$ is defined in (55) and $A''_{n-1}$ is defined in (58). Thus,

$$\begin{aligned} q_n(\{\bar{a}_i\}\times A''_{n-1})\ge & {} q_n(A'_n\cap (\{\bar{a}_i\}\times A^{n-1}))=q_n((\{\bar{a}_i\}\times A^{n-1})\setminus (A^n\setminus A'_n))\nonumber \\\ge & {} q_n(\{\bar{a}_i\}\times A^{n-1}){-}q_n(A^n\setminus A'_n){>} q_n(\{\bar{a}_i\}\times A^{n-1})-\epsilon ,\qquad \quad \end{aligned}$$

(141)

where the last inequality is due to (54).

Since $q_n$ is symmetric, we know from Proposition 3 that, when n is large enough,

$$\begin{aligned} q_n(\{\bar{a}_i\}\times A^{n-1})=q_n|_A(\{\bar{a}_i\})>\frac{p(\{\bar{a}_i\})}{2}>0. \end{aligned}$$

(142)

Combining (141) and (142), we can obtain

$$\begin{aligned} q_{n,A}|_{A^{n-1}}(\bar{a}_i|A''_{n-1})=\frac{q_n(\{\bar{a}_i\}\times A''_{n-1})}{q_n(\{\bar{a}_i\}\times A^{n-1})}>1-\frac{\epsilon }{q_n(\{\bar{a}_i\}\times A^{n-1})}>1-\frac{2\epsilon }{p(\{\bar{a}_i\})}.\nonumber \\ \end{aligned}$$

(143)

With $A''_{n-1}$’s definition in (58), we get $q_{n,A}|_{A^{n-1}}(\bar{a}_i|\cdot )$’s asymptotic resemblance to $p^{n-1}$. $\square $

Appendix 5: Developments in Sect. 8

Proof of Theorem 4

Let $\epsilon >0$ be fixed. Given $t=1,2,\ldots $ and $\chi \in \mathcal{K}(S,X)$, we use $\chi ^t$ to denote $(\chi ,\chi ,\ldots ,\chi )\in (\mathcal{K}(S,X))^t$. From (38), we know

$$\begin{aligned} \mid v^\infty _n(s_1,\xi _{[1\infty ]},\varepsilon _{s_{-1}},\chi ^\infty )-v^t_n(s_1,\xi _{[1t]},\varepsilon _{s_{-1}},\chi ^t)\mid \le \frac{\bar{\alpha }^t\cdot \bar{f}}{1-\bar{\alpha }}. \end{aligned}$$

(144)

Hence, when $t\ge \ln (6\bar{f}/(\epsilon \cdot (1-\bar{\alpha })))/\ln (1/\bar{\alpha })+1$,

$$\begin{aligned} v^\infty _n(s_1,\chi ^\infty ,\varepsilon _{s_{-1}},\chi ^\infty )>v^t_n(s_1,\chi ^t,\varepsilon _{s_{-1}},\chi ^t)-\frac{\epsilon }{6}, \end{aligned}$$

(145)

and

$$\begin{aligned} v^\infty _n(s_1,\xi _{[1\infty ]},\varepsilon _{s_{-1}},\chi ^\infty )<v^t_n(s_1,\xi _{[1t]},\varepsilon _{s_{-1}},\chi ^t)+\frac{\epsilon }{6}, \end{aligned}$$

(146)

for every $s_1\in S$, $s_{-1}\in S^{n-1}$, and $\xi _{[1\infty ]}\in (\mathcal{K}(S,X))^\infty $. Therefore, we need merely to select such a large t and show that, when n is large enough,

$$\begin{aligned}&\int _{S^{n-1}}\hat{\pi }_{n-1}(ds_{-1})\cdot v^t_n(s_1,\chi ^t,\varepsilon _{s_{-1}},\chi ^t)\nonumber \\&\quad \ge \int _{S^{n-1}}\hat{\pi }_{n-1}(ds_{-1})\cdot v^t_n(s_1,\xi _{[1t]},\varepsilon _{s_{-1}},\chi ^t)-\frac{2\epsilon }{3}, \end{aligned}$$

(147)

for every $s_1\in S$ and $\xi _{[1t]}\in (\mathcal{K}(S,X))^t$.

Since $(\chi ,\sigma )$ poses as an equilibrium for $\Gamma $, we know (36) is true. Another way to write the condition is, at $t'=0$, for any $\xi _{[1,t'+1]}\in (\mathcal{K}(S,X))^{t'+1}$,

$$\begin{aligned} v^\infty (s,(\xi _{[1t']},\chi ^\infty ),\sigma ,\chi ^\infty )\ge v^\infty (s,(\xi _{[1,t'+1]},\chi ^\infty ),\sigma ,\chi ^\infty ). \end{aligned}$$

(148)

Now suppose (148) is true for some $t'=0,1,\ldots $. We are to show its validity at $t'+1$. By (33), (35), and the uniform convergence of $v^t(s,\xi _{[1t]},\sigma ,\chi ^t)$ to $v^\infty (s,\xi _{[1\infty ]},\sigma ,\chi ^\infty )$, we have

$$\begin{aligned} v^\infty (s,\xi _{[1\infty ]},\sigma ,\chi ^\infty )= & {} \int _X \xi _1(s|dx)\cdot [\tilde{f}(s,x,\sigma \otimes \chi ) \nonumber \\&+\bar{\alpha }\cdot \int _S \tilde{g}(s,x,\sigma \otimes \chi |ds')\cdot v^\infty (s',\xi _{[2\infty ]},\sigma ,\chi ^\infty )].\qquad \quad \end{aligned}$$

(149)

Therefore,

$$\begin{aligned}&v^\infty (s,(\xi _{[1,t'+1]},\chi ^\infty ),\sigma ,\chi ^\infty )-v^\infty (s,(\xi _{[1,t'+2]},\chi ^\infty ),\sigma ,\chi ^\infty ) \nonumber \\&\quad =\int _X\xi _1(s|dx)\cdot \int _S\tilde{g}(s,x,\sigma \otimes \chi |ds') \nonumber \\&\qquad \times \, [v^\infty (s',(\xi _{[2,t'+1]},\chi ^\infty ),\sigma ,\chi ^\infty )-v^\infty (s',(\xi _{[2,t'+2]},\chi ^\infty ),\sigma ,\chi ^\infty )],\qquad \quad \end{aligned}$$

(150)

which, by the induction hypothesis (148), is positive. Therefore, (148) is true for $t'=0,1,\ldots $.

By using (148) multiple times, we can derive that, for any $\xi _{[1t]}\in (\mathcal{K}(S,X))^t$,

$$\begin{aligned}&v^\infty (s,\chi ^\infty ,\sigma ,\chi ^\infty )\ge v^\infty (s,(\xi _1,\chi ^\infty ),\sigma ,\chi ^\infty )\ge v^\infty (s,(\xi _{[12]},\chi ^\infty ),\sigma ,\chi ^\infty )\nonumber \\&\quad \ge \cdots \ge v^\infty (s,(\xi _{[1,t-1]},\chi ^\infty ),\sigma _t,\chi ^\infty )\ge v^\infty (s,(\xi _{[1t]},\chi ^\infty ),\sigma ,\chi ^\infty ). \end{aligned}$$

(151)

Also, we know from (34) that

$$\begin{aligned} \mid v^\infty (s,\zeta _{[1\infty ]},\sigma ,\chi ^\infty )-v^t(s,\zeta _{[1t]},\sigma ,\chi ^t)\mid \le \frac{\bar{\alpha }^t\cdot \bar{f}}{1-\bar{\alpha }}, \end{aligned}$$

(152)

regardless of the $\zeta _{[1\infty ]}\in (\mathcal{K}(S,X))^\infty $ chosen. However, (151) and (152) would together lead to

$$\begin{aligned} v^t(s,\chi ^t,\sigma ,\chi ^t)-v^t(s,\xi _{[1t]},\sigma ,\chi ^t)\ge -\frac{2\bar{\alpha }^{t-1}\cdot \bar{f}}{1-\bar{\alpha }}\ge -\frac{\epsilon }{3}, \end{aligned}$$

(153)

for any $s\in S$ and $\xi _{[1t]}\in (\mathcal{K}(S,X))^t$.

In the presence of Assumptions 1 and 2 for the corresponding t-period games, Proposition 2 applies. Plus, it has been hypothesized that the sequence $\hat{\pi }_{n-1}$ asymptotically resembles the sequence $\sigma ^{n-1}$. Therefore, for n large enough,

$$\begin{aligned} \int _{S^{n-1}}\hat{\pi }_{n-1}(ds_{-1})\cdot v^t_n(s_1,\chi ^t,\varepsilon _{s_{-1}},\chi ^t)>v^t(s_1,\chi ^t,\sigma ,\chi ^t)-\frac{\epsilon }{6}, \end{aligned}$$

(154)

regardless of the choice on $s_1\in S$, and

$$\begin{aligned} \int _{S^{n-1}}\hat{\pi }_{n-1}(ds_{-1})\cdot v^t_n(s_1,\xi _{[1t]},\varepsilon _{s_{-1}},\chi ^t)<v^t(s_1,\xi _{[1t]},\sigma ,\chi ^t)+\frac{\epsilon }{6}, \end{aligned}$$

(155)

regardless of the choices on $s_1\in S$ and $\xi _{[1t]}\in (\mathcal{K}(S,X))^t$. Put (153) to (155) together, and we would obtain (147). $\square $

For something akin to the second example in Sect. 7, we need to consider the following invariant equation involving $\pi _n\in \mathcal{P}(S^n)$, which is inspirable from its finite-t version (28):

$$\begin{aligned} \pi _n=\pi _n\odot \chi ^n\odot \tilde{g}^n. \end{aligned}$$

(156)

Suppose (156) has a solution that asymptotically resembles $\sigma ^n$, then we can let $\hat{\pi }_{n-1}=\pi _n|_{S^{n-1}}$. By Lemma 5, this choice would satisfy the condition in Theorem 4. Its meaning is also clear—let players update their estimates on other players’ states most precisely without using their own state information.

When the state space S is finite, we again have an extended version much like Theorem 3. If we succeed in finding a satisfactory $\pi _n$, we would be able to make the third choice of letting each $\hat{\pi }_{n-1}(s_1|\cdot )$ in the extended version be the conditional probability $\pi _{n,S}|_{S^{n-1}}(s_1|\cdot )$. Propositions 3 and 4 would then lead to the satisfaction of the corresponding condition in the extended version. The third choice here again means that players update other players’ states in the most accurate Bayesian fashion.

The above second and third choices are premised on the following conjecture.

Conjecture 1

Suppose $\chi \in \mathcal{K}(S,X)$, $\tilde{g}\in \mathcal{G}(S,X)$ enjoys the continuity of $\tilde{g}(s,x,\tau )$ in $\tau $ at an (s, x)-independent rate, and $\sigma \in \mathcal{P}(S)$ is an solution to the invariant equation $\sigma =\sigma \odot \chi \odot \tilde{g}(\cdot ,\cdot ,\sigma \otimes \chi )$ as defined by (32) and (35). Then, there would exist a sequence $\pi _n$ so that for each $n\in \mathbb {N}$, $\pi _n$ as a member of $\mathcal{P}(S^n)$ satisfies the invariant equation $\pi _n=\pi _n\odot \chi ^n\odot \tilde{g}^n$ as indicated by (156), and yet the sequence asymptotically resembles the sequence $\sigma ^n$.

To tackle this conjecture, one may be tempted to show that (i) iteratively applying $\sigma _{t+1}=\sigma _t\odot \chi \odot \tilde{g}(\cdot ,\cdot ,\sigma _t\otimes \chi )$ leads to the convergence of $\sigma _t$ to an invariant $\sigma $, (ii) iteratively applying $\pi _{n,t+1}=\pi _{nt}\odot \chi ^n\odot \tilde{g}^n$ leads to the convergence of $\pi _{nt}$ to an invariant $\pi _n$ for each n, and (iii) these convergence results along with the asymptotic resemblance of each $\pi _{nt}$ to $\sigma _t^{\;n}$ would lead to that of $\pi _n$ to $\sigma ^n$. So far, (i) and (ii) still elude us. On the other hand, something slightly weaker than (iii) can be achieved.

Proposition 5

Let A be a separable metric space, and $p_i$ for $i\in \mathbb {N}$ and p be members of $\mathcal{P}(A)$. Also, for each $n\in \mathbb {N}$, let $q_{ni}$ for $i\in \mathbb {N}$ and $q_n$ be members of $\mathcal{P}(A^n)$. Suppose $p_i$ converges to p, $q_{ni}$ converges to $q_n$ for each $n\in \mathbb {N}$, and $q_{ni}$ asymptotically resembles $p_i^{\;n}$. Then, in either situation (a) where the convergence of $q_{ni}$ to $q_n$ is at an n-independent rate or situation (b) where the asymptotic resemblance of $q_{ni}$ to $p_i^{\;n}$ is at an i-independent rate, the sequence $q_n$ would asymptotically resemble the sequence $p^n$.

Proof of Proposition 5

Let $\epsilon >0$ be given. Since $p_i$ converges to p, we have

$$\begin{aligned} \rho _A(p,p_i)<\frac{\epsilon }{2}, \end{aligned}$$

(157)

as long as i is large enough.

Suppose situation (a) is true. By the equi-n convergence of $q_{ni}$ to $q_n$, we can pick i large enough to ensure both (157) and for any $n\in \mathbb {N}$,

$$\begin{aligned} q_n((A'_n)^{\epsilon /4})>q_{ni}(A')-\frac{\epsilon }{2},\quad \forall A'_n\in \mathcal{B}(A^n). \end{aligned}$$

(158)

At such a fixed $i\in \mathbb {N}$, due to the asymptotic resemblance of $q_{ni}$ to $p_i^{\;n}$, we can let n be large enough so that

$$\begin{aligned} q_{ni}\left( \left\{ a\in A^n|\rho _A(\varepsilon _a,p_i)<\frac{\epsilon }{4}\right\} \right) >1-\frac{\epsilon }{2}. \end{aligned}$$

(159)

Suppose situation (b) is true. Due to the equi-i asymptotic resemblance of $q_{ni}$ to $p_i^{\;n}$, we can pick n large enough to ensure (159) for any $i\in \mathbb {N}$. By the convergence of $q_{ni}$ to $q_n$, we can then pick i large enough to ensure (157), as well as (158) for the current $n\in \mathbb {N}$.

Either way, without loss of generality, we can suppose $d_{A^n}(a,a')\ge \max _{m=1}^n d_A(a_m,a'_m)$. Then, due to Lemma 1,

$$\begin{aligned} \left( \left\{ a\in A^n|\rho _A(\varepsilon _a,p_i)<\frac{\epsilon }{4}\right\} \right) ^{\epsilon /4}\subseteq \left\{ a\in A^n|\rho _A(\varepsilon _a,p_i)<\frac{\epsilon }{2}\right\} . \end{aligned}$$

(160)

Now we can deduce that

$$\begin{aligned}&q_n(\{a\in A^n|\rho _A(\varepsilon _a,p)<\epsilon \})>q_n(\{a\in A^n|\rho _A(\varepsilon _a,p_i)<\epsilon /2\}) \nonumber \\&\quad>q_n((\{a\in A^n|\rho _A(\varepsilon _a,p_i)<\epsilon /4\})^{\epsilon /4})\nonumber \\&\quad>q_{ni}(\{a\in A^n|\rho _A(\varepsilon _a,p_i)<\epsilon /4\})-\epsilon /2>1-\epsilon , \end{aligned}$$

(161)

where the first inequality is due to (157), the second inequality is due to (160), the third inequality is due to (158), and the last inequality is due to (159). Therefore, the sequence $q_n$ asymptotically resembles the sequence $p^n$. $\square $

Like Propositions 3 and 4, Proposition 5 also helps to bolster the legitimacy of the asymptotic resemblance concept.

Appendix 6: Developments in Sect. 9

1.1 The transient case

By the discreteness of S, every $\chi _t(s|X')$ is automatically continuous and hence measurable in s, and hence $\mathcal{K}(S,X)$ is not only a member of $(\mathcal{P}(X))^S$, but also the latter itself. Denote the space $(\mathcal{P}(S))^{\bar{t}-1}$ by $\mathcal{S}$ and the space $((\mathcal{P}(X))^S)^{\bar{t}}=(\mathcal{K}(S,X))^{\bar{t}}$ by $\mathcal{X}$. Let $\mathcal{U}=\mathcal{S}\times \mathcal{X}$. Define a correspondence $H:\mathcal{U}\Rightarrow \mathcal{U}$, so that for any $\sigma _{[2\bar{t}]}\in \mathcal{S}$ and $\chi _{[1\bar{t}]}\in \mathcal{X}$,

$$\begin{aligned} H(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]})=H^S(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]})\times H^X(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]}), \end{aligned}$$

(162)

where

$$\begin{aligned} H^S(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]})=\left\{ \sigma '_{[2\bar{t}]}\in \mathcal{S}|\sigma '_t=T_{t-1}(\chi _{t-1})\circ \sigma _{t-1},\;\forall t=2,3,\ldots ,\bar{t}\right\} ,\qquad \end{aligned}$$

(163)

and

$$\begin{aligned} H^X(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]})=\left\{ \chi '_{[1\bar{t}]}\in \mathcal{X}|\chi '_t(s_t|\tilde{X}_t(s_t,\sigma _t,\chi _{[t\bar{t}]}))=1,\;\forall t=1,2,\ldots ,\bar{t},s_t\in S\right\} .\nonumber \\ \end{aligned}$$

(164)

A fixed point $(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]})$ for H would provide a Markov equilibrium $\chi _{[1\bar{t}]}$ for $\Gamma (\sigma _1)$ in the sense of (42), with $\sigma _{[2\bar{t}]}$ supplying the deterministic pre-action environment pathway from period 2 to $\bar{t}$ that is generated from all players adopting policy $\chi _{[1\bar{t}]}$. We are to use Kakutani–Fan–Glicksberg fixed point theorem to prove the existence of a fixed point for H. But first let us work out a couple of useful continuity results.

Proposition 6

(i)
$\sigma \otimes \chi $ is continuous in both $\sigma \in \mathcal{P}(S)$ and $\chi \in (\mathcal{P}(X))^S$.

When $g\in \mathcal{G}(S,X)$ satisfies that $g(s,x,\tau )$ is continuous in $\tau $ at an (s, x)-independent rate,
(ii)
$\sigma \odot \chi \odot g(\cdot ,\cdot ,\sigma \otimes \chi )$ is continuous in both $\sigma \in \mathcal{P}(S)$ and $\chi \in (\mathcal{P}(X))^S$.

Proof of Proposition 6

We first prove (i) by showing that, for any two sequences $\sigma _m$ and $\chi _m$ that converge to $\sigma $ and $\chi $, respectively, the sequence $\sigma _m\otimes \chi _m$ would converge to $\sigma \otimes \chi $. In the following, we omit detailed reasonings behind some of the steps, as they have appeared in the proof of Proposition 1.

Fix some $\epsilon \in (0,1)$. We can identify some I of its points $\bar{s}_1,\bar{s}_2,\ldots ,\bar{s}_I$, so that (72) is true. For convenience, let $\bar{S}'=\{\bar{s}_1,\bar{s}_2,\ldots ,\bar{s}_I\}$ and $\bar{S}''=S\setminus \bar{S}'$. It is known that the distance $d_S(\bar{S}',\bar{S}'')=\inf _{s'\in \bar{S}',s''\in \bar{S}''}d_S(s',s'')>0$. For $i,j=1,2,\ldots ,I$, use $d_{ij}$ for $d_S(\bar{s}_i,\bar{s}_j)$ and $\sigma _i$ for $\sigma (\{\bar{s}_i\})$. Again, define $\delta $ through (73), whose strict positivity is guaranteed.

As $\sigma _m$ and $\chi _m$ converge to $\sigma $ and $\chi $, respectively, for large enough m, we have

$$\begin{aligned} \rho _S(\sigma ,\sigma _m)<\delta , \end{aligned}$$

(165)

and

$$\begin{aligned} \rho _X(\chi (\bar{s}_i),\chi _m(\bar{s}_i))<\delta ,\quad \forall i=1,2,\ldots ,I. \end{aligned}$$

(166)

Together with the fact that $\delta \le d_S(\bar{S}',\bar{S}'')\wedge (\min _{i\ne j}d_{ij})$, (165) would result with

$$\begin{aligned} \sigma _i-\delta<\sigma _m(\{\bar{s}_i\})<\sigma _i+\delta . \end{aligned}$$

(167)

Meanwhile, (166) would lead to

$$\begin{aligned} \chi _m(\bar{s}_i|X')<\chi (\bar{s}_i|(X')^\delta )+\delta ,\quad \forall i=1,2,\ldots ,I. \end{aligned}$$

(168)

Any $U'\in \mathcal{B}(S\times X)$ still enjoys the decomposition provided in (83), that $U'=(\bigcup _{i=1}^I\{\bar{s}_i\}\times X'_i)\bigcup U''$, where $X'_i\in \mathcal{B}(X)$ for $i=1,2,\ldots ,I$, while $U''$ is such that $s''\in \bar{S}''$ for any $(s'',x'')\in U''$. This would result in the same (84). On the other hand, from the right half of (167) and (168),

$$\begin{aligned} (\sigma _m\otimes \chi _m)(\{\bar{s}_i\}\times X'_i)= & {} \sigma _m(\{\bar{s}_i\})\cdot \chi _m(\bar{s}_i|X'_i)<(\sigma _i+\delta )\cdot [\chi (\bar{s}_i|(X'_i)^\delta )+\delta ]\nonumber \\\le & {} (\sigma \otimes \chi )(\{\bar{s}_i\}\times (X'_i)^\delta )+2\delta +\delta ^2<(\sigma \otimes \chi )(\{\bar{s}_i\}\times (X'_i)^\delta )+3\delta ,\nonumber \\ \end{aligned}$$

(169)

where the last inequality is due to our choice that $\delta \le \epsilon /I<1$. Meanwhile,

$$\begin{aligned} (\sigma _m\otimes \chi _m)(U'')\le & {} (\sigma _m\otimes \chi _m)(\bar{S}''\times X)=\sigma _m(\bar{S}'')=1-\sum _{i=1}^I \sigma _m(\{\bar{s}_i\})\nonumber \\<&1-\sum _{i=1}^I\sigma _i+I\delta <\epsilon +I\delta , \end{aligned}$$

(170)

where the second-to-last inequality is due to the left half of (167) and the last one is due to (72). By combining (83), (84), (169), and (170), we can obtain

$$\begin{aligned} (\sigma _m\otimes \chi _m)(U')<(\sigma \otimes \chi )((U')^\delta )+\epsilon +4I\delta . \end{aligned}$$

(171)

Thus,

$$\begin{aligned} \rho _{S\times X}(\sigma _m\otimes \chi _m,\sigma \otimes \chi )<\epsilon +4I\delta \le 5\epsilon . \end{aligned}$$

(172)

Since (172) is to occur at any m that is large enough, we see that (i) is true.

We then prove (ii). Again, suppose two sequences $\sigma _m$ and $\chi _m$ converge to $\sigma $ and $\chi $, respectively. From (i), we know $\sigma _m\otimes \chi _m$ converges to $\sigma \otimes \chi $ too. According to (87) of Yang (2011), for any m,

$$\begin{aligned} \rho _X(\sigma _m\odot \chi _m,\sigma \odot \chi )=\rho _X((\sigma _m\otimes \chi _m)|_X,(\sigma \otimes \chi )|_X)\le \rho _{S\times X}(\sigma _m\otimes \chi _m,\sigma \otimes \chi ).\nonumber \\ \end{aligned}$$

(173)

Hence, there is also the convergence of $\sigma _m\odot \chi _m$ to $\sigma \odot \chi $.

On the other hand, the discrete property of $S\times X$ means $g(\cdot ,\cdot ,\tau )$ is a member of $(\mathcal{P}(S))^{S\times X}$ for any fixed $\tau \in \mathcal{P}(S\times X)$. Now (i) and the fact that $g(s,x,\tau )$ is continuous in $\tau $ at an (s, x)-independent rate would together mean that, the sequence $g(\cdot ,\cdot ,\sigma _m\otimes \chi _m)$ in $(\mathcal{P}(S))^{S\times X}$ converges to $g(\cdot ,\cdot ,\sigma \otimes \chi )$.

Let us use the convergence of $\sigma _m\odot \chi _m$ to $\sigma \odot \chi $ under proper substitutions. As $S\times X$ has been noted to be discrete, we can treat it as S in the convergence result. Also, let us treat $\sigma _m\otimes \chi _m$ as $\sigma _m$, $\sigma \otimes \chi $ as $\sigma $, S as X, $g(\cdot ,\cdot ,\sigma _m\otimes \chi _m)$ as $\chi _m$, and $g(\cdot ,\cdot ,\sigma \otimes \chi )$ as $\chi $.

From (i) on the convergence of $\sigma _m\otimes \chi _m$ to $\sigma \otimes \chi $, now viewed as that of $\sigma _m$ to $\sigma $, as well as the convergence of $g(\cdot ,\cdot ,\sigma _m\otimes \chi _m)$ to $g(\cdot ,\cdot ,\sigma \otimes \chi )$, now viewed as that of $\chi _m$ to $\chi $, we can conclude that $(\sigma _m\otimes \chi _m)\odot g(\cdot ,\cdot ,\sigma _m\otimes \chi _m)=\sigma _m\odot \chi _m\odot g(\cdot ,\cdot ,\sigma _m\otimes \chi _m)$ would converge to $(\sigma \otimes \chi )\odot g(\cdot ,\cdot ,\sigma \otimes \chi )=\sigma \odot \chi \odot g(\cdot ,\cdot ,\sigma \otimes \chi )$. Thus, (ii) is true as well. $\square $

Proposition 7

For each $t=1,2,\ldots ,\bar{t}+1$, the value $v_t(s_t,\xi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]})$ defined in (22) is continuous in $\sigma _t\in \mathcal{S}$ and $\chi _{[t\bar{t}]}\in \mathcal{X}$ at an $(s_t,\xi _{[t\bar{t}]})$-independent rate.

Proof of Proposition 7

We use induction on t. By (21), our claim is certainly true for $t=\bar{t}+1$. Suppose for some $t=\bar{t},\bar{t}-1,\ldots ,1$, the function $v_{t+1}(s_{t+1},\xi _{[t+1,\bar{t}]},\sigma _{t+1},\chi _{[t+1,\bar{t}]})$ is continuous in $\sigma _{t+1}$ and $\chi _{[t+1,\bar{t}]}$ at a rate independent of $s_{t+1}$ and $\xi _{[t+1,\bar{t}]}$.

Now we prove the continuity in $\sigma _t$ and $\chi _{[t\bar{t}]}$ at time t. From (22), we have

$$\begin{aligned}&\sup \limits _{s_t\in S,\xi _{[t\bar{t}]}\in ((\mathcal{P}(X))^S)^{\bar{t}-t+1}}\mid v_t(s_t,\xi _{[t\bar{t}]},\sigma _t,\chi _{[t\bar{t}]})\nonumber \\&\quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad -v_t(s_t,\xi _{[t\bar{t}]},\sigma '_t,\chi '_{[t\bar{t}]})\mid \le M_1+M_2+M_3, \end{aligned}$$

(174)

where

$$\begin{aligned} M_1= & {} \sup \limits _{(s_t,x_t)\in S\times X}\mid \tilde{f}_t(s_t,x_t,\sigma _t\otimes \chi _t)-\tilde{f}_t(s_t,x_t,\sigma '_t\otimes \chi '_t)\mid , \end{aligned}$$

(175)

$$\begin{aligned} M_2= & {} \sup \limits _{(s_t,x_t)\in S\times X,\;\xi _{[t+1,\bar{t}]}\in ((\mathcal{P}(X))^S)^{\bar{t}-t} }\left| \left[ \int _S\tilde{g}_t(s_t,x_t,\sigma _t\otimes \chi _t|ds_{t+1})\right. \right. \nonumber \\&\left. \left. -\int _S\tilde{g}_t(s_t,x_t,\sigma '_t\otimes \chi '_t|ds_{t+1})\right] \cdot v_{t+1}(s_{t+1},\xi _{[t+1,\bar{t}]},T_t(\chi _t)\circ \sigma _t,\chi _{[t+1,\bar{t}]})\right| , \nonumber \\ \end{aligned}$$

(176)

and

$$\begin{aligned} M_3= & {} \sup \limits _{(s_t,x_t)\in S\times X,\;\xi _{[t+1,\bar{t}]}\in ((\mathcal{P}(X))^S)^{\bar{t}-t}}\int _S\tilde{g}_t(s_t,x_t,\sigma '_t\otimes \chi '_t|ds_{t+1}) \nonumber \\&\times \mid v_{t+1}(s_{t+1},\xi _{[t+1,\bar{t}]},T_t(\chi _t)\circ \sigma _t,\chi _{[t+1,\bar{t}]})\nonumber \\&-v_{t+1}(s_{t+1},\xi _{[t+1,\bar{t}]},T_t(\chi '_t)\circ \sigma '_t,\chi '_{[t+1,\bar{t}]})\mid . \end{aligned}$$

(177)

By part (i) of Proposition 6, $\sigma '_t\otimes \chi '_t$ can be made arbitrarily close to $\sigma _t\otimes \chi _t$ by letting $(\sigma '_t,\chi '_t)$ be close enough to $(\sigma _t,\chi _t)$. Then due to Assumption 2, $M_1$ can be made arbitrarily small by doing the same.

Again, suppose $S=\{\bar{s}_1,\bar{s}_2,\ldots \}$. We use the simplified notation that

$$\begin{aligned} \gamma ^{(\prime )}_i(s_t,x_t)=g_t(s_t,x_t,\sigma ^{(\prime )}_t\otimes \chi ^{(\prime )}_t|\{\bar{s}_i\}), \end{aligned}$$

(178)

and

$$\begin{aligned} v_i(\xi _{[t+1,\bar{t}]})=v_{t+1}(\bar{s}_i,\xi _{[t+1,\bar{t}]},T_t(\chi _t)\circ \sigma _t,\chi _{[t+1,\bar{t}]}). \end{aligned}$$

(179)

Then, (176) can be expressed as $M_2$ equaling

$$\begin{aligned}&\sup \limits _{(s_t,x_t)\in S\times X,\;\xi _{[t+1,\bar{t}]}\in ((\mathcal{P}(X))^S)^{\bar{t}-t}} \left| \sum _i \gamma _i(s_t,x_t)\cdot v_i(\xi _{[t+1,\bar{t}]})\right. \nonumber \\&\quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \left. \quad -\sum _i\gamma '_i(s_t,x_t)\cdot v_i(\xi _{[t+1,\bar{t}]})\right| . \end{aligned}$$

(180)

Let $I(s_t,x_t)$ be the set of i’s that induce $\gamma _i(s_t,x_t)\ge \gamma '_i(s_t,x_t)$. Note the $\mid v_i(\xi _{[t+1,\bar{t}]})\mid $’s are bounded, say by $\overline{v}$, due to the boundedness of the $\tilde{f}_{t'}$’s and the finiteness of $\bar{t}$. Then, (180) would lead to

$$\begin{aligned} M_2\le 2\overline{v}\cdot \sup \limits _{(s_t,x_t)\in S\times X} \sum _{i\in I(s_t,x_t)} (\gamma _i(s_t,x_t)-\gamma '_i(s_t,x_t)). \end{aligned}$$

(181)

For $\delta $ below $\inf _{s\ne s'}d_S(s,s')$, the event $\rho _S(\tilde{g}_t(s_t,x_t,\sigma _t\otimes \chi _t),\tilde{g}_t(s_t,x_t,\sigma '_t\otimes \chi '_t))<\delta $ would trigger

$$\begin{aligned} \sum _{i\in I(s_t,x_t)} (\gamma _i(s_t,x_t)-\gamma '_i(s_t,x_t))<\delta , \end{aligned}$$

(182)

for every $(s_t,x_t)\in S\times X$; consult (122) in the proof of Proposition 2. But due to Assumption 1, the convergence of $\sigma '_t\otimes \chi _t'$ to $\sigma _t\otimes \chi _t$ means that we can make $\tilde{g}_t(s_t,x_t,\sigma '_t\otimes \chi '_t)$ arbitrarily close to $\tilde{g}_t(s_t,x_t,\sigma _t\otimes \chi _t)$, at a rate that is independent of $(s_t,x_t)$. Hence, by (181), $M_2$ can be made arbitrarily small by letting $(\sigma '_t,\chi '_t)$ get close enough to $(\sigma _t,\chi _t)$.

From (177), we can get

$$\begin{aligned} M_3\le & {} \sup \limits _{s_{t+1}\in S,\;\xi _{[t+1,\bar{t}]}\in ((\mathcal{P}(X))^S)^{\bar{t}-t}}\left| v_{t+1}(s_{t+1},\xi _{[t+1,\bar{t}]},T_t(\chi _t)\circ \sigma _t,\chi _{[t+1,\bar{t}]})\right. \nonumber \\&\quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad \left. -v_{t+1}(s_{t+1},\xi _{[t+1,\bar{t}]},T_t(\chi '_t)\circ \sigma '_t,\chi '_{[t+1,\bar{t}]})\right| .\quad \quad \quad \end{aligned}$$

(183)

By part (ii) of Proposition 6, $T_t(\chi '_t)\circ \sigma '_t=\sigma '_t\odot \chi '_t\odot \tilde{g}_t(\cdot ,\cdot ,\sigma '_t\otimes \chi '_t)$ can be made arbitrarily close to $T_t(\chi _t)\circ \sigma _t=\sigma _t\odot \chi _t\odot \tilde{g}_t(\cdot ,\cdot ,\sigma _t\otimes \chi _t)$ by letting $(\sigma '_t,\chi '_t)$ be close enough to $(\sigma _t,\chi _t)$. By the induction hypothesis, $M_3$ can be made arbitrarily small by doing the same.

We have thus completed the induction process. $\square $

Here comes the conditional-equilibrium existence result for the transient case.

Theorem 5

The correspondence H allows for a fixed point $(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]})$, which supplies the game $\Gamma (\sigma _1)$ with an conditional equilibrium $\chi _{[1\bar{t}]}$.

Proof of Theorem 5

Due to S’s discreteness, $\mathcal{P}(S)$ is the simplex in $\mathbb {R}^{\mid S\mid }$, whether $\mid S\mid $ be finite or infinite, and hence is compact; the same applies to $\mathcal{P}(X)$. Thus, $\mathcal{U}$ is a compact subset of the vector space $\mathbb {R}^{\mid S\mid ^{\bar{t}-1}+\mid X\mid ^{\mid S\mid \cdot \bar{t}}}$, understood as $\mathbb {R}^\infty $ if either S or X is infinite.

For any finite-dimensional $\mathbb {R}^k$, we can take the norm $\mid \mid \cdot \mid \mid $ so that $\mid \mid r\mid \mid =\sum _{l=1}^k \mid r_l\mid /k$ for each $r=(r_l|l=1,\ldots ,k)\in \mathbb {R}^k$, whereas for the infinite-dimensional $\mathbb {R}^\infty $, we can let $\mid \mid r\mid \mid =\sum _{l=1}^{+\infty } \mid r_l\mid /2^l$ for each $r=(r_l|l=1,2,\ldots )\in \mathbb {R}^\infty $. A norm thus defined would provide the same convergence as does the weak convergence under Prohorov metric. Since the convex combination of two probabilities is still a probability, $\mathcal{U}$ is also convex.

For any $(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]})\in \mathcal{U}$, the set $H(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]})$ is certainly non-empty, for we can construct some $(\sigma '_{[2\bar{t}]},\chi '_{[1\bar{t}]})$ belonging to it. First, for $t=2,3,\ldots ,\bar{t}$, we simply let $\sigma '_t=T_{t-1}(\chi _{t-1})\circ \sigma _{t-1}$. Then, for $t=1,2,\ldots ,\bar{t}$ and $s\in S$, let $\chi '_t(s)$ be any measure that assigns its full weight to the set of x’s that attain the maximum value $\sup \nolimits _{y\in X}v_t(s,(\delta _y,\chi _{[t+1,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]})$.

Now we show that $H^S:\mathcal{U}\Rightarrow \mathcal{S}$ and $H^X:\mathcal{U}\Rightarrow \mathcal{X}$ are closed- and convex-valued, as well as upper hemi-continuous. These would lead to the same properties for H. According to (163), each $H^S(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]})$ contains exactly one point, and hence is automatically closed and convex. For the upper hemi-continuity property, we need only to show that the value contained in $H^S(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]})$ moves continuously with both $\sigma _{[2\bar{t}]}$ and $\chi _{[1\bar{t}]}$. But this has been guaranteed by part (ii) of Proposition 6.

According to (164), each $H^X(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]})$ is a set of probability vectors, with each component probability assigning the full measure to a particular measurable set. This set of probability vectors is certainly convex. To show that it is closed, suppose $\chi '_{m,[1\bar{t}]}$ for $m=1,2,\ldots $ form a sequence in $H^X(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]})$ that converges to a given $\chi '_{[1\bar{t}]}$. We are to show that

$$\begin{aligned} \chi '_{[1\bar{t}]}\in H^X(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]}). \end{aligned}$$

(184)

Now for any $t=1,2,\ldots ,\bar{t}$, $s\in S$, and $\epsilon >0$, as long as m is large enough,

$$\begin{aligned} \chi '_t(s|(\tilde{X}_t(s,\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]}))^\epsilon )\ge \chi '_{mt}(s|\tilde{X}_t(s,\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]}))-\epsilon =1-\epsilon . \end{aligned}$$

(185)

Due to the arbitrariness of $\epsilon $, this means $\chi '_t(s|\tilde{X}_t(s,\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]}))=1$, and hence (184) is true.

We now show that $H^X$ is upper hemi-continuous. Let $\sigma _{m,[2\bar{t}]}$ be a sequence in $\mathcal{S}$ that converges to a given $\sigma _{[2\bar{t}]}$, $\chi _{m,[1\bar{t}]}$ a sequence in $\mathcal{X}$ that converges to a given $\chi _{[1\bar{t}]}$, and $\chi '_{m,[1\bar{t}]}$ another sequence in $\mathcal{X}$ that converges to a given $\chi '_{[1\bar{t}]}$. Suppose for each $m=1,2,\ldots $,

$$\begin{aligned} \chi '_{m,[1\bar{t}]}\in H^X(\sigma _{m,[2\bar{t}]},\chi _{m,[1\bar{t}]}), \end{aligned}$$

(186)

we are to show that

$$\begin{aligned} \chi '_{[1\bar{t}]}\in H^X(\sigma _{[2\bar{t}]},\chi _{[1\bar{t}]}). \end{aligned}$$

(187)

By (164), we see that (186) for each m indicates that, for each $t=1,2,\ldots ,\bar{t}$ and $s\in S$,

$$\begin{aligned} \chi '_{mt}(s|\tilde{X}_t(s,\sigma _{mt},\chi _{m,[t\bar{t}]}))=1; \end{aligned}$$

(188)

whereas, (187) boils down to that, for each $t=1,2,\ldots ,\bar{t}$ and $s\in S$,

$$\begin{aligned} \chi '_t(s|\tilde{X}_t(s,\sigma _t,\chi _{[t\bar{t}]}))=1. \end{aligned}$$

(189)

We fix some t and s. Let $\epsilon >0$ be small enough, so that there is no need to distinguish between $(X')^\epsilon $ and $X'$ for any $X'\subseteq X$. Now since $\chi '_{mt}$ converges to $\chi '_t$, for m large enough,

$$\begin{aligned} \chi '_t(s|\tilde{X}_t(s,\sigma _t,\chi _{[t\bar{t}]}))=\chi '_t(s|(\tilde{X}_t(s,\sigma _t,\chi _{[t\bar{t}]}))^\epsilon )\ge \chi '_{mt}(s|\tilde{X}_t(s,\sigma _t,\chi _{[t\bar{t}]}))-\epsilon . \end{aligned}$$

(190)

For the time being, suppose $\tilde{X}_t(s,\sigma _t,\chi _{[t\bar{t}]})$ is known to be upper hemi-continuous in $(\sigma _t,\chi _{[t\bar{t}]})$. By also noting the hypothesis on the convergence of $\sigma _{mt}$ to $\sigma _t$ and that of $\chi _{m,[t\bar{t}]}$ to $\chi _{[t\bar{t}]}$, we can obtain, for m large enough,

$$\begin{aligned} \tilde{X}_t(s,\sigma _{mt},\chi _{m,[t\bar{t}]})\subseteq (\tilde{X}_t(s,\sigma _t,\chi _{[t\bar{t}]}))^\epsilon =\tilde{X}_t(s,\sigma _t,\chi _{[t\bar{t}]}). \end{aligned}$$

(191)

Thus, for m large enough,

$$\begin{aligned} \chi '_t(s|\tilde{X}_t(s,\sigma _t,\chi _{[t\bar{t}]}))\ge \chi '_{mt}(s|\tilde{X}_t(s,\sigma _t,\chi _{[t\bar{t}]}))-\epsilon \ge \chi '_{mt}(s|\tilde{X}_t(s,\sigma _{mt},\chi _{m,[t\bar{t}]}))-\epsilon ,\qquad \end{aligned}$$

(192)

which, according to (188), is above $1-\epsilon $. In view of the arbitrariness of $\epsilon $, we can achieve (189).

We now come back to the upper hemi-continuity of $\tilde{X}_t(s,\cdot )$ as a correspondence from $\mathcal{P}(S)\times ((\mathcal{P}(X))^S)^{\bar{t}-t+1}$ to X. Suppose $\sigma _{mt}$ converges to $\sigma _t$, $\chi _{m,[t\bar{t}]}$ converges to $\chi _{[t\bar{t}]}$, and $x_m$ converges to x. For every $m=1,2,\ldots $, suppose $x_m\in \tilde{X}_t(s,\sigma _{mt},\chi _{m,[t\bar{t}]})$, which, by (43), means

$$\begin{aligned} v_t(s,(\delta _{x_m},\chi _{m,[t+1,\bar{t}]}),\sigma _{mt},\chi _{m,[t\bar{t}]})\ge v_t(s,(\delta _y,\chi _{m,[t+1,\bar{t}]}),\sigma _{mt},\chi _{m,[t\bar{t}]}),\quad \forall y\in X.\qquad \nonumber \\ \end{aligned}$$

(193)

By X’s discreteness, $x_m$ would be x for sufficiently large m. This, combined with Proposition 7 and the hypothesis on the convergence of $\sigma _{mt}$ to $\sigma _t$ and that of $\chi _{m,[t\bar{t}]}$ to $\chi _{[t\bar{t}]}$, would entail that, for any $\epsilon >0$, as long as m is large enough,

$$\begin{aligned}&v_t(s,(\delta _x,\chi _{[t+1,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]})\ge v_t(s,(\delta _x,\chi _{m,[t+1,\bar{t}]}),\sigma _{mt},\chi _{m,[t\bar{t}]})-\epsilon \nonumber \\&\quad =v_t(s,(\delta _{x_m},\chi _{m,[t+1,\bar{t}]}),\sigma _{mt},\chi _{m,[t\bar{t}]})-\epsilon \nonumber \\&\quad \ge v_t(s,(\delta _y,\chi _{m,[t+1,\bar{t}]}),\sigma _{mt},\chi _{m,[t\bar{t}]})-\epsilon \ge v_t(s,(\delta _y,\chi _{[t+1,\bar{t}]}),\sigma _t,\chi _{[t\bar{t}]})-2\epsilon ,\nonumber \\ \end{aligned}$$

(194)

for any $y\in X$. Since $\epsilon $ can be arbitrarily small, we see from (43) that $x\in \tilde{X}_t(s,\sigma _t,\chi _{[t\bar{t}]})$. Thus we have the upper hemi-continuity of $\tilde{X}_t(s,\cdot )$.

In summary, H is a non-empty, closed- and convex-valued, as well as upper hemi-continuous correspondence on the compact and convex subset $\mathcal{U}$ that is embedded in a normed linear topological space. We can therefore apply the Kakutani-Fan-Glicksberg fixed point theorem to verify that H has a fixed point. $\square $

1.2 The stationary case

Denote the space $\mathcal{P}(S)$ by $\mathcal{S}_\infty $ and the space $(\mathcal{P}(X))^S$ by $\mathcal{X}_\infty $. Let $\mathcal{U}_\infty =\mathcal{S}_\infty \times \mathcal{X}_\infty $. Define a correspondence $H_\infty :\mathcal{U}_\infty \Rightarrow \mathcal{U}_\infty $, so that for any $\sigma \in \mathcal{S}_\infty $ and $\chi \in \mathcal{X}_\infty $,

$$\begin{aligned} H_\infty (\sigma ,\chi )=H^S_\infty (\sigma ,\chi )\times H^X_\infty (\sigma ,\chi ), \end{aligned}$$

(195)

where

$$\begin{aligned} H^S_\infty (\sigma ,\chi )=\{\sigma '\in \mathcal{S}_\infty |\sigma '=T(\chi )\circ \sigma \}, \end{aligned}$$

(196)

and

$$\begin{aligned} H^X_\infty (\sigma ,\chi )=\{\chi '\in \mathcal{X}_\infty |\chi '(s|\tilde{X}_\infty (s,\sigma ,\chi ))=1,\;\forall s\in S\}. \end{aligned}$$

(197)

A fixed point $(\sigma ,\chi )$ for $H_\infty $ would provide a stationary Markov equilibrium $\chi $ for the stationary nonatomic game $\Gamma ^\infty $ in the sense of (44), with $\sigma $ supplying the invariant deterministic environment that is generated from all players adopting policy $\chi $. To show that such an equilibrium exists, we first need the following consequence of Proposition 7.

Proposition 8

The value $v^\infty (s,\xi _{[1\infty ]},\sigma ,\chi ^\infty )$ defined in (33) is continuous in $\sigma \in \mathcal{S}_\infty $ and $\chi \in \mathcal{X}_\infty $ at an $(s,\xi _{[1\infty ]})$-independent rate.

Proof of Proposition 8

From (34), we see that

$$\begin{aligned} \left| v^\infty (s,\xi _{[1\infty ]},\sigma ,\chi ^\infty )-v^t(s,\xi _{[1t]},\sigma ,\chi ^t)\right| \le \frac{\bar{\alpha }^t\cdot \overline{f}}{1-\bar{\alpha }}. \end{aligned}$$

(198)

Thus, for any $\epsilon >0$, by fixing at a large enough t, we can ensure

$$\begin{aligned} \left| v^\infty (s,\xi _{[1\infty ]},\sigma '',(\chi '')^\infty )-v^t(s,\xi _{[1t]},\sigma '',(\chi '')^t)\right| <\frac{\epsilon }{3}, \end{aligned}$$

(199)

for any s, $\xi _{[1\infty ]}$, $\sigma ''$, and $\chi ''$. At the same time, Proposition 7 means that, for $(\sigma ',\chi ')$ close enough to any given $(\sigma ,\chi )$, we can guarantee

$$\begin{aligned} \left| v^t(s,\xi _{[1t]},\sigma ,\chi ^t)-v^t(s,\xi _{[1t]},\sigma ',(\chi ')^t)\right| <\frac{\epsilon }{3}, \end{aligned}$$

(200)

for any s and $\xi _{[1t]}$. Then,

$$\begin{aligned}&\left| v^\infty (s,\xi _{[1\infty ]},\sigma ,\chi ^\infty )-v^\infty (s,\xi _{[1\infty ]},\sigma ',(\chi ')^\infty )\right| \le \left| v^\infty (s,\xi _{[1\infty ]},\sigma ,\chi ^\infty )\right. \nonumber \\&\left. \quad -v^t(s,\xi _{[1t]},\sigma ,\chi ^t)\right| +\left| v^t(s,\xi _{[1t]},\sigma ,\chi ^t)-v^t(s,\xi _{[1t]},\sigma ',(\chi ')^t)\right| \nonumber \\&\quad +\left| v^t(s,\xi _{[1t]},\sigma ',(\chi ')^t)-v^\infty (s,\xi _{[1\infty ]},\sigma ',(\chi ')^\infty )\right| <\epsilon . \end{aligned}$$

(201)

Thus, $v^\infty (s,\xi _{[1\infty ]},\sigma ,\chi ^\infty )$ is continuous in $(\sigma ,\chi )$ at an $(s,\xi _{[1\infty ]})$-independent rate. $\square $

We can then have the desired conditional-equilibrium existence result by using the Kakutani-Fan-Glicksberg fixed point theorem.

Theorem 6

The correspondence $H_\infty $ allows for a fixed point $(\sigma ,\chi )$, which supplies the game $\Gamma ^\infty $ with an equilibrium $\chi $.

Proof of Theorem 6

Due to the discreteness of S and X, $\mathcal{U}_\infty $ is a compact subset of the vector space $\mathbb {R}^{\mid S\mid +\mid X\mid ^{\mid S\mid }}$, understood as $\mathbb {R}^\infty $ if either S or X is infinite. Regardless of whether the space is finite- or infinite-dimensional, we can take the norm adopted in the proof of Theorem 5. Since the convex combination of two probabilities is still a probability, $\mathcal{U}_\infty $ is convex.

Using virtually the same corresponding arguments in the proof of Theorem 5, we can show that $H_\infty (\sigma ,\chi )$ at any $(\sigma ,\chi )\in \mathcal{U}_\infty $ is non-empty, closed, and convex. We separate the upper hemi-continuity of $H_\infty $ into that for $H^S_\infty $ and that for $H^X_\infty $.

The upper hemi-continuity of $H^S_\infty $ again comes from Proposition 6. Furthermore, we can use almost the same arguments from (193) to (194), this time relying on Proposition 8 instead of Proposition 7, to show that $\tilde{X}_\infty (s,\cdot )$ as a correspondence from $\mathcal{P}(S)\times (\mathcal{P}(X))^S$ to X is upper hemi-continuous. Then, using almost the same arguments from (186) to (192), we can verify that $H^X_\infty $ is upper hemi-continuous.

With all these properties, we can apply the Kakutani-Fan-Glicksberg fixed point theorem to verify that $H_\infty $ has a fixed point. $\square $

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yang, J. A link between sequential semi-anonymous nonatomic games and their large finite counterparts. Int J Game Theory 46, 383–433 (2017). https://doi.org/10.1007/s00182-016-0539-5

Download citation

Accepted: 19 May 2016
Published: 01 June 2016
Issue Date: May 2017
DOI: https://doi.org/10.1007/s00182-016-0539-5

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A link between sequential semi-anonymous nonatomic games and their large finite counterparts

Abstract

Similar content being viewed by others

Analysis of Markovian Competitive Situations Using Nonatomic Games

Determinacy in Stochastic Games with Unbounded Payoff Functions

Subgame Optimal Strategies in Finite Concurrent Games with Prefix-Independent Objectives

1 Introduction

2 Literature survey

3 The nonatomic game

3.1 Game primitives

3.2 Evolution of the environments

4 The n-player game

5 Convergence of aggregate environments

Definition 1

Proposition 1

Assumption 1

Theorem 1

6 NG and finite-game equilibria

6.1 Equilibria in NG

6.2 \(\epsilon \)-equilibria in n-player games

6.3 Main transient result

Assumption 2

Proposition 2

Theorem 2

7 The condition in Theorem 2

7.1 Two possibilities

7.2 Refinement and a third choice

Theorem 3

7.3 Symmetry makes it work

Definition 2

Proposition 3

Proposition 4

8 A stationary situation

Theorem 4

9 Implications of main results

9.1 Observation, remembrance, and coordination

9.2 Sources of NG equilibria

10 Concluding remarks

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendices

Appendix 1: Concepts and rudimentary lemmas

Lemma 1

Proof

Lemma 2

Lemma 3

Lemma 4

Proof

Lemma 5

Proof

Lemma 6

Proof

Lemma 7

Proof

Appendix 2: Proofs of Sect. 5

Proof of Proposition 1

Proof of Theorem 1

Appendix 3: Proofs of Sect. 6

Proof of Proposition 2

Proof of Theorem 2

Appendix 4: Proofs of Sect. 7

Proof of Proposition 3

Proof Proposition 4

Appendix 5: Developments in Sect. 8

Proof of Theorem 4

Conjecture 1

Proposition 5

Proof of Proposition 5

Appendix 6: Developments in Sect. 9

1.1 The transient case

Proposition 6

Proof of Proposition 6

Proposition 7

Proof of Proposition 7

Theorem 5

Proof of Theorem 5