Abstract
We introduce a higher-dimensional version of the nonlinear thermodynamic formalism introduced by Buzzi and Leplaideur, in which a potential is replaced by a family of potentials. In particular, we establish a corresponding variational principle and we discuss the existence, characterization, and number of equilibrium measures for this higher-dimensional version.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Recently, Buzzi and Leplaideur [7] introduced a variation of the thermodynamic formalism, which they called nonlinear thermodynamic formalism. Roughly speaking, this amounts to compute the topological pressure replacing Birkhoff sums by images of them under a given function (that may be nonlinear and thus the name). Our main aim is twofold:
-
(1)
to introduce a higher-dimensional version of their notion of topological pressure, replacing a potential by a family of potentials, and to establish a corresponding variational principle;
-
(2)
to discuss the existence, characterization, and number of equilibrium measures, with special attention to the new phenomena that occur in this higher-dimensional version.
We also give a characterization of the nonlinear pressure as a Carathéodory dimension, which allows us to extend the notion to noncompact sets.
The most basic notion of the mathematical thermodynamic formalism is topological pressure. It was introduced by Ruelle [24] for expansive maps and by Walters [28] in the general case. For a continuous map \(T :X\rightarrow X\) on a compact metric space, the topological pressure of a continuous function \(\varphi :X\rightarrow {\mathbb {R}}\) is defined by
with the supremum taken over all \((n,\varepsilon )\)-separated sets E and where \(S_n\varphi = \sum _{k=0}^{n-1}\varphi \circ T^k\). An important relation between the topological pressure and the Kolmogorov–Sinai entropy is given by the variational principle
with the supremum taken over all T-invariant probability measures \(\mu \) on X and where \(h_\mu (T)\) denotes the entropy with respect to \(\mu \). This was established by Ruelle [24] for expansive maps and by Walters [28] in the general case. The theory is now a broad and active independent field of study with many connections to other areas of mathematics. We refer the reader to the books [2, 6, 14, 15, 20, 21, 25, 29] for many developments.
Building on work on the Curie–Weiss mean-field theory in [17], the nonlinear topological pressure was introduced in [7] as a generalization of (1) as follows (more precisely, we give an equivalent formulation using separated sets instead of covers). Given a continuous function \(F:{\mathbb {R}}\rightarrow {\mathbb {R}}\), the nonlinear topological pressure of a continuous function \(\varphi :X \rightarrow {\mathbb {R}}\) is given by
with the supremum taken over all \((n,\varepsilon )\)-separated sets E. For \(F(x)=x\) we recover the classical topological pressure. Buzzi and Leplaideur also established a version of the variational principle in (2). Namely, assuming that the pair \((T, \Phi )\) has an abundance of ergodic measures (see Sect. 2.1 for the definition), they proved that
with the supremum taken over all T-invariant probability measures \(\mu \) on X. In addition, they characterized the equilibrium measures of this thermodynamic formalism, that is, the invariant probability measures at which the supremum in (4) is attained, and they showed that a new type of phase transition can occur. Namely, one may have more than one equilibrium measure, although we still have a central limit theorem (see also [16, 26]).
As described above, our main aim in the paper is to understand whether and how the results in [7] extend to the higher-dimensional case. This corresponds to replace the functions F and \(\varphi \) in (3), respectively, by a continuous function \(F:{\mathbb {R}}^d \rightarrow {\mathbb {R}}\) and by a family \(\Phi = \{\varphi _1, \ldots , \varphi _d\}\) of continuous functions \(\varphi _i:X \rightarrow {\mathbb {R}}\) for \(i=1,\ldots ,d\). The nonlinear topological pressure of \(\Phi \) is then defined by
with the supremum taken over all \((n,\varepsilon )\)-separated sets E. Whenever possible, we follow a similar approach to obtain a variational principle and to discuss the existence, characterization, and number of equilibrium measures.
In particular, assuming that the pair \((T, \Phi )\) has an abundance of ergodic measures, we establish the variational principle
with the supremum taken over all T-invariant probability measures \(\mu \) on X. As in [7], for a certain class of pairs \((T,\Phi )\) we also characterize the equilibrium measures, that is, the invariant probability measures at which the supremum in (5) is attained. Consider the sets
and
We reduce the problem of finding equilibrium measures to the problem of finding maximizers of the function \(E:L(\Phi ) \rightarrow {\mathbb {R}}\) defined by
where
We note that the function E first appeared in [17]. It turns out that h(z) coincides with the topological entropy of the map T on the set
(see (6)). In general \(C_{z}(\Phi )\) need not be compact and so here we need the notion of topological entropy for noncompact sets (see Sect. 2.4 for the definition). In fact, we show that for each \(z \in {{\,\mathrm{int}\,}}L(\Phi )\) maximizing E there exists a unique equilibrium measure \(\nu _z\). This is actually a classical equilibrium measure for a certain function \(\psi _z\) that depends on the family of functions \(\Phi \). In addition, we give conditions for the uniqueness of the equilibrium measures, both for \(d=1\) and for \(d>1\) (see Theorems 10 and 11).
Before proceeding, we highlight the main elements and difficulties of passing to the higher-dimensional case. To the possible extent, our streamlined proof of the variational principle follows arguments in [7] for a single function, considering covers by balls instead of covers by intervals. Our main result (Theorem 7) gives a characterization of equilibrium measures and uses in an essential way the higher-dimensional multifractal analysis developed in [3] (see the following paragraph for further details). It was crucial to make sure that all was prepared so that we could apply this higher-dimensional theory, which allows us to give a description of the equilibrium measures for the nonlinear topological pressure as equilibrium measures of certain functions in \({{\,\mathrm{span}\,}}\{\varphi _1,\ldots ,\varphi _d,1\}\). In addition, in Sect. 5 we describe criteria for the uniqueness of equilibrium measures both for \(d=1\) and \(d>1\), and we give conditions for the coincidence of equilibrium measures for two systems in terms of the notion of cohomology.
As noted above, to a relevant extent we use in the proofs the higher-dimensional multifractal analysis developed in [3]. This gives once more a connection between the thermodynamic formalism and multifractal analysis, which is a principal characteristic of our work. In particular, that other work includes a conditional variational principle, which shows for example that the topological entropy of the level sets of pointwise dimensions, local entropies, and Lyapunov exponents can be approximated simultaneously by the entropy of ergodic measures. More precisely, for a continuous map \(T :X\rightarrow X\) on a compact metric space with upper semicontinuous entropy, it is shown in [3] that if \(\Phi = \{\varphi _1, \ldots , \varphi _d\}\) is composed of continuous functions such that each element of \({{\,\mathrm{span}\,}}\{\varphi _1,\ldots ,\varphi _d,1 \}\) has unique equilibrium measure (for the classical topological pressure), then for each \(z\in {{\,\mathrm{int}\,}}L(\Phi )\) the set \(C_z(\Phi )\) is nonempty and has topological entropy
In addition, there exists an ergodic equilibrium measure \(\mu _z\in \mathcal {M}(z)\) with
We note that some phenomena absent in classical multifractal analysis for a single potential may occur in a higher-dimensional multifractal spectrum. For example, the domain of the spectrum may not be convex and its interior may be empty or have more than one connected component.
Finally, we also detail further the motivation for introducing the nonlinear thermodynamical formalism and for our own work with a higher-dimensional version of this formalism. In statistical mechanics, particularly in the study of magnetic systems, the Curie–Weiss–Potts model is generally seen as an extension of the Curie–Weiss model, which can be considered as a mean-field version of the Ising model (see for example [11,12,13] for detailed discussions). Leplaideur and Watbled traced a parallel between statistical mechanics and ergodic theory for general spin spaces, introducing a generalized Curie–Weiss model in [17] and a generalized Curie–Weiss–Potts model in [18] (the latter model can be seen as a higher-dimensional generalized Curie–Weiss model). When \(X = \{-1,1\}^{{\mathbb {N}}}\), T is the shift map, \(\varphi \) is a Hölder continuous function and \(F:{\mathbb {R}}\rightarrow {\mathbb {R}}\) is given by
we recover the generalized Curie–Weiss model. Again for the shift map T on X, when \(\Phi = \{\varphi _1,\ldots ,\varphi _d\}\) is a family of Hölder continuous functions and \(F:{\mathbb {R}}^{d} \rightarrow {\mathbb {R}}\) is given by
and \(\Vert \cdot \Vert \) is a given norm on \({\mathbb {R}}^d\), we recover the generalized Curie–Weiss–Potts model. In this sense, while [7] and more recently [8] extend the study of the generalized Curie–Weiss model for any continuous function \(F:{\mathbb {R}}\rightarrow {\mathbb {R}}\) and any map T, analogously our work extends the study of the generalized Curie–Weiss–Potts model to include any continuous function \(F:{\mathbb {R}}^{d} \rightarrow {\mathbb {R}}\) and any map T, both under suitable assumptions.
We note that our results and the higher-dimensional results in [8] are mostly overlapping, but there are some differences. Namely, in [8] the authors use tools from convex analysis, for example to characterize the nonlinear equilibrium measures. They also discuss phase transitions in the nonlinear context, study equidistribution of Gibbs ensembles, and obtain a generic result on the uniqueness of nonlinear equilibrium measures. On the other hand, we characterize the nonlinear equilibrium measures using a conditional variational principle coming from higher-dimensional multifractal analysis. For instance, in our characterization we are able to specify the support of the nonlinear equilibrium measure. Moreover, we study in more detail the cohomology relations in the nonlinear context, and we obtain a simple criteria for uniqueness of nonlinear equilibrium measures.
2 Nonlinear Topological Pressure
2.1 Basic Notions
We first recall the notion of nonlinear topological pressure introduced by Buzzi and Leplaideur in [7] as an extension of the classical topological pressure. Let \(T:X \rightarrow X\) be a continuous map on a compact metric space \(X=(X,d)\). For each \(n\in {\mathbb {N}}\) we consider the distance
Take \(n\in {\mathbb {N}}\) and \(\varepsilon >0\). A set \(C\subset X\) is said to be an \((n,\varepsilon )\)-cover of X if \(\bigcup _{x \in C}B_n(x,\varepsilon ) = X\), where
(usually the set \(B_n(x,\varepsilon )\) is called a Bowen ball). Given a continuous function \(F:{\mathbb {R}}\rightarrow {\mathbb {R}}\), the nonlinear topological pressure of a continuous function \(\varphi :X \rightarrow {\mathbb {R}}\) is defined by
where \(S_n\varphi = \sum _{k=0}^{n-1}\varphi \circ T^k\), with the infimum taken over all \((n,\varepsilon )\)-covers C.
Let \(\mathcal {M}\) be the set of T-invariant probability measures on X. Following [7], we say that the pair \((T,\varphi )\) has an abundance of ergodic measures if for each \(\mu \in \mathcal {M}\), \(h<h_\mu (T)\) and \(\varepsilon >0\) there exists an ergodic measure \(\nu \in \mathcal {M}\) such that
Assuming that \((T,\varphi )\) has an abundance of ergodic measures, they obtained the variational principle
They also established (7) when F is a convex function (without assuming that the pair \((T,\varphi )\) has an abundance of ergodic measures). We say that \(\nu \in \mathcal {M}\) is an equilibrium measure for \((F,\varphi )\) with respect to T if
2.2 Higher-Dimensional Version
In this paper we consider a higher-dimensional generalization of the nonlinear topological pressure.
Given \(n\in {\mathbb {N}}\) and \(\varepsilon > 0\), a set \(E \subset X\) is said to be \((n,\varepsilon )\)-separated if \(d_{n}(x,y) > \varepsilon \) for every \(x, y \in E\) with \(x \ne y\). Since X is compact, any \((n,\varepsilon )\)-separated set has finite cardinality. Let \(F:{\mathbb {R}}^d \rightarrow {\mathbb {R}}\) be a continuous function and let \(\Phi = \{\varphi _1, \ldots , \varphi _d\}\) be a family of continuous functions \(\varphi _i:X \rightarrow {\mathbb {R}}\) for \(i=1,\ldots , d\). The nonlinear topological pressure of the family \(\Phi \) is defined by
with the supremum taken over all \((n,\varepsilon )\)-separated sets E. One can easily verify that the function
is nondecreasing and so \(P_F(\Phi )\) is well defined. Notice that we only need to consider F on the compact set
We also describe briefly a characterization of the nonlinear topological pressure using \((n,\varepsilon )\)-covers. Let
Following closely arguments in [2], one can show that
with the infimum taken over all \((n,\varepsilon )\)-covers C of X.
2.3 Dependence on the Potentials
In this section we discuss briefly how the nonlinear topological pressure depends on the potentials. Given a family of continuous functions \(\Phi = \{\varphi _1,\ldots ,\varphi _d\}\), we define the norm
Recall that F is said to be Hölder continuous with constants \(C, \alpha > 0\) if
Proposition 1
Let \(\Phi \) and \(\Psi \) be families of continuous functions and let \(F:{\mathbb {R}}^{d} \rightarrow {\mathbb {R}}\) be a continuous function. Then the following properties hold:
-
(1)
the map \(\Phi \mapsto P_F(\Phi )\) is continuous;
-
(2)
if F is Hölder continuous with constants \(C,\alpha > 0\), then
$$\begin{aligned} |P_F(\Phi ) - P_{F}(\Phi )| \le C\Vert \Phi - \Psi \Vert ^{\alpha }; \end{aligned}$$(10)in particular, if F is Lipschitz, then \(\Phi \mapsto P_{F}(\Phi )\) is also Lipschitz with the same Lipschitz constant.
Proof
We first prove property (1). By the uniform continuity of F, given \(\varepsilon > 0\), there exists \(\delta > 0\) such that \(|F(x)-F(y)| < \varepsilon \) whenever \(\Vert x-y\Vert _{\infty } < \delta \). Consider a family of continuous functions \(\Psi \) such that \(\Vert \Psi - \Phi \Vert < \delta \). Then
for any \(n \in {\mathbb {N}}\) and \(x \in X\). By the uniform continuity of F, we obtain
for any \(n \in {\mathbb {N}}\) and \(x \in X\). It follows from the definition of the topological pressure in (8) that \(P_F(\Psi ) - P_{F}(\Phi )< \varepsilon \). One can show in the same manner that \(P_F(\Phi ) - P_{F}(\Psi )< \varepsilon \). Therefore,
which establishes the first property in the proposition.
Now assume that F is Hölder continuous with constants \(C, \alpha > 0\). Then for any families of continuous functions \(\Phi \) and \(\Psi \) we have
for any \(n \in {\mathbb {N}}\) and \(x \in X\). Proceeding as in the proof of property (1), we readily obtain inequality (10). \(\square \)
2.4 Extension to Noncompact Sets
Based on work of Pesin and Pitskel’ in [22], we give a characterization of the nonlinear topological pressure as a Carathéodory dimension. In particular, this allows us to extend the notion to noncompact sets. We expect that this extension plays an important role in an appropriate version of multifractal analysis associated with the nonlinear topological pressure.
We continue to consider a continuous map \(T:X \rightarrow X\) on a compact metric space. Given a finite open cover \(\mathcal {U}\) of X, for each \(n \in {\mathbb {N}}\) let \(\mathcal {X}_n\) be the set of strings \(U = (U_1,\ldots , U_n)\) with \(U_i \in \mathcal {U}\) for \(i=1,\ldots ,n\). We write \(l(U) = n\) and we define
We say that \(\Gamma \subset \bigcup _{n \in {\mathbb {N}}}\mathcal {X}_n\) covers a set \(Z \subset X\) if \(Z \subset \bigcup _{U \in \Gamma }X(U)\).
Given a family of continuous functions \(\Phi = \{\varphi _1,\ldots , \varphi _d\}\), for each \(n \in {\mathbb {N}}\) we define \(S_n\Phi = (S_n\varphi _1,\ldots , S_n\varphi _d)\). Moreover, given a function \(F:{\mathbb {R}}^d \rightarrow {\mathbb {R}}\), for each \(U \in \mathcal {X}_n\) let
Finally, given a set \(Z \subset X\) and a number \(\alpha \in {\mathbb {R}}\), we define
with the infimum taken over all \(\Gamma \subset \bigcup _{k\ge n}\mathcal {X}_k\) covering Z and with the convention that \(\exp (-\infty ) = 0\). One can easily verify that the map \(\alpha \mapsto M_Z(\alpha , \Phi , \mathcal {U})\) goes from \(+\infty \) to zero at a unique \(\alpha \in {\mathbb {R}}\) and so one can define
One can proceed as in the proof of Theorem 2.2.1 in [2] to show that the limit
exists. One could also introduce the number \(P_F(Z,\Phi )\) using \((n,\varepsilon )\)-separated sets or \((n,\varepsilon )\)-covers (covers by Bowen balls), in a similar manner to that, for example, in Appendix D in [1].
When \(Z=X\) we recover the notion of nonlinear topological pressure for any convex function F.
Theorem 2
If the function F is convex, then \(P_F(\Phi ) = P_F(X,\Phi )\).
Proof
The proof is obtained modifying arguments in Sect. 4.2.3 of [2] and so we only give a brief sketch. Given a finite open cover \(\mathcal {U}\) of X, we define
with the infimum taken over all \(\Gamma \subset \mathcal {X}_n\) covering X. Given \(\Gamma _1 \subset \mathcal {X}_{n_1}\) and \(\Gamma _2 \subset \mathcal {X}_{n_2}\), let
Note that if \(\Gamma _1\) and \(\Gamma _2\) cover X, then \(\Gamma '\) also covers X. Moreover, since F is convex, it follows readily from the identity
that
for each \(UV \in \Gamma '\). We have
and so
Therefore, one can define
Finally, it follows as in Lemmas 2.2.5 and 2.2.6 in [2] that
and
This yields the desired result. \(\square \)
Taking \(F=\mathrm{id}\) and \(\Phi =0\) we recover the notion of topological entropy
of T on the set Z introduced by Pesin and Pitskel’ in [22]. It coincides with the notion of topological entropy for noncompact sets introduced earlier by Bowen in [4]. We emphasize that Z need not be compact nor T-invariant. When \(Z=X\) we recover the usual notion of topological entropy.
3 Variational Principle
In this section we establish a variational principle for the nonlinear topological pressure.
Let \(T:X \rightarrow X\) be a continuous map on a compact metric space and let \(\Phi = \{\varphi _1,\ldots ,\varphi _d\}\) be a family of continuous functions. We say that the pair \((T,\Phi )\) has an abundance of ergodic measures if for each \(\mu \in \mathcal {M}\), \(h<h_\mu (T)\) and \(\varepsilon >0\) there exists an ergodic measure \(\nu \in \mathcal {M}\) such that \(h_{\nu }(T) > h\) and
Moreover, we say that T has entropy density of ergodic measures if for every \(\mu \in \mathcal {M}\) there exist ergodic measures \(\nu _n\in \mathcal {M}\) for \(n\in {\mathbb {N}}\) such that \(\nu _n \rightarrow \mu \) in the \(\hbox {weak}^*\) topology and \(h_{\nu _n}(T) \rightarrow h_\mu (T)\) when \(n\rightarrow \infty \). Note that if T has entropy density of ergodic measures, then the pair \((T,\Phi )\) has an abundance of ergodic measures for any family of continuous functions \(\Phi \).
In order to give examples of pairs with an abundance of ergodic measures we first recall a few notions. Given \(\delta >0\), we say that T has weak specification at scale \(\delta \) if there exists \(\tau \in {\mathbb {N}}\) such that for every \((x_1,n_1),\ldots ,(x_k,n_k) \in X \times {\mathbb {N}}\) there are \(y \in X\) and times \(\tau _1,\ldots , \tau _{k-1} \in {\mathbb {N}}\) such that \(\tau _i \le \tau \) and
where \(s_i = \sum _{i=1}^{i}n_i + \sum _{i=1}^{i-1}\tau _i\) with \(n_0 = \tau _0 = 0\). When one can take \(\tau _i = \tau \) for \(i =1,\ldots ,k-1\), we say that T has specification at scale \(\delta \). Finally, we say that T has weak specification if it has weak specification at every scale \(\delta \) and, analogously, we say that T has specification if it has specification at every scale \(\delta \).
It was shown earlier in [10, Theorem B] and [23, Theorem 2.1] that mixing subshifts of finite type and mixing locally maximal hyperbolic sets have entropy density of ergodic measures. More recently, it was shown in [9] that a continuous map \(T:X \rightarrow X\) on a compact metric space with the weak specification property such that the entropy map \(\mu \mapsto h_\mu (T)\) is upper semicontinuous, has entropy density of ergodic measures. In particular, this implies that the pair \((T,\Phi )\) has an abundance of ergodic measures for any family of continuous functions \(\Phi \). Some examples of maps with an abundance of ergodic measures include expansive maps with specification or with weak specification, topologically transitive locally maximal hyperbolic sets for diffeomorphisms, and transitive topological Markov chains.
The following theorem establishes a variational principle for the nonlinear topological pressure.
Theorem 3
Let \(T:X \rightarrow X\) be a continuous map on a compact metric space and let \(\Phi =\{\varphi _1,\ldots , \varphi _d\}\) be a family of continuous functions. Given a continuous function \(F:{\mathbb {R}}^d \rightarrow {\mathbb {R}}\), if the pair \((T,\Phi )\) has an abundance of ergodic measures, then
where \(\int _X\Phi \,d\mu = \big (\int _X\varphi _1 \,d\mu ,\ldots , \int _X\varphi _{d} \,d\mu \big )\).
Proof
To the possible extent we follow arguments in [7] for a single function. We divide the proof into two lemmas.
Lemma 1
Let \(T:X \rightarrow X\) be a continuous map on a compact metric space and let \(\Phi \) be a family of continuous functions. Then:
-
(1)
$$\begin{aligned} P_F(\Phi ) \ge \sup _{\mu \in \mathcal {M}_{\mathrm{erg}}}\biggl \{h_\mu (T) + F\biggl (\int _X\Phi \,d\mu \biggr )\biggr \}; \end{aligned}$$
-
(2)
if, in addition, the pair \((T,\Phi )\) has an abundance of ergodic measures, then
$$\begin{aligned} P_F(\Phi ) \ge \sup _{\mu \in \mathcal {M}}\biggl \{h_\mu (T) + F\biggl (\int _X\Phi \,d\mu \biggr )\biggr \}. \end{aligned}$$
Proof of the lemma
Given \(r>0\), since X is compact there exist \(\delta ,\varepsilon > 0\) such that
for \(i=1,\ldots ,d\) and
For definiteness we shall take the \(\ell ^\infty \) norm on \({\mathbb {R}}^d\). Now let \(\mu \in \mathcal {M}\) be an ergodic measure. By Birkhoff’s ergodic theorem and the Brin–Katok local entropy formula, together with Egorov’s theorem, there exist a set \(A\subset X\) of measure \(\mu (A) > 1- r\) and \(N \in {\mathbb {N}}\) such that
for all \(i=1,\ldots ,d\) and
for \(x \in A\) and \(n > N\).
Now let C be an arbitrary \((n,\varepsilon )\)-cover and let \(D \subset C\) be a minimal \((n,\varepsilon )\)-cover of A. For each \(x \in D\), the ball \(B_n(x,\varepsilon )\) intersects A at some point y (otherwise one could discard the point x in D). Note that
Hence, it follows from (12) that
for \(i =1,\ldots ,d\) and so
Moreover, \(B_n(x,\varepsilon ) \subset B_n(y,2\varepsilon )\) and so it follows from (13) that
where |D| denotes the cardinality of D. Therefore,
for any sufficiently large \(n \in {\mathbb {N}}\). It follows from (9) that
Finally, by the arbitrariness of \(r > 0\) we obtain
This yields the first property in the lemma.
Now we consider an arbitrary measure \(\nu \in \mathcal {M}\). If \((T,\Phi )\) has an abundance of ergodic measures, then for each \(h<h_{\nu }(T)\) and \(\varepsilon > 0\) there exists an ergodic measure \(\mu \in \mathcal {M}\) such that
(since F is continuous). By (14) we obtain
and it follows from the arbitrariness of h and \(\varepsilon \) that
This yields the second property in the lemma. \(\square \)
Now we obtain the reverse inequality, without requiring that there are an abundance of ergodic measures.
Lemma 2
Let \(T:X \rightarrow X\) be a continuous map on a compact metric space and let \(\Phi \) be a family of continuous functions. Then
Proof of the lemma
Given \(p < P_{F}(\Phi )\), take \(\varepsilon > 0\) such that
with the infimum taken over all \((n,\varepsilon )\)-covers C. Since each maximal \((n,\varepsilon )\)-separated set \(E_n\) is an \((n,\varepsilon )\)-cover, we have
and given \(r > 0\), there exists a diverging subsequence \((n_k)_{k \in {\mathbb {N}}}\) such that
We cover the compact set \(\Phi (X)\) by balls \(B(z_i, r_i)\) for \(i=1,\ldots , L\) such that \(|F(z)-F(z_i)|<r\) for all \(z\in B(z_i, r_i)\) and \(i=1,\ldots , L\). Now let
Note that
and so it follows from (16) that
This implies that
for any sufficiently large k. Proceeding as in the proof of the variational principle in [19], we also consider the measures
Without loss of generality, one can assume that \(\nu ^i_k\) converges to a T-invariant measure \(\mu ^i\) in the \(\hbox {weak}^*\) topology satisfying
By the definition of \(\nu ^i_k\) we have
Hence, by (17) and (18) we obtain
The desired result follows from the arbitrariness of r and p. \(\square \)
Lemmas 1 and 2 establish the statement in the theorem.\(\square \)
For a general continuous map T, we obtain a variational principle for an arbitrary convex function F.
Theorem 4
Let \(T:X \rightarrow X\) be a continuous map on a compact metric space and let \(\Phi =\{\varphi _1,\ldots , \varphi _d\}\) be a family of continuous functions. If \(F:{\mathbb {R}}^d \rightarrow {\mathbb {R}}\) is a convex continuous function, then identity (11) holds.
Proof
It follows from the first property in Lemma 1 that
for every ergodic measure \(\mu \in \mathcal {M}\). Now let \(\nu \in \mathcal {M}\) be an arbitrary measure and consider its ergodic decomposition with respect to T. It is described by a probability measure \(\tau \) on \(\mathcal {M}\) that is concentrated on the subset of ergodic measures \(\mathcal {M}_{\mathrm{erg}}\). We recall that for every bounded measurable function \(\psi :X \rightarrow {\mathbb {R}}\) we have
For a convex function F one can use Jensen’s inequality to obtain
Moreover, we also have
(see for example Theorem 9.6.2 in [27]). Hence,
The desired result follows now readily from Lemma 2. \(\square \)
We also obtain a variational principle over the ergodic measures.
Corollary 5
Let \(T:X \rightarrow X\) be a continuous map on a compact metric space, let \(\Phi =\{\varphi _1,\ldots , \varphi _d\}\) be a family of continuous functions, and let \(F:{\mathbb {R}}^d \rightarrow {\mathbb {R}}\) be a continuous function. If the pair \((T,\Phi )\) has an abundance of ergodic measures or F is convex, then
Proof
Since \(\mathcal {M}_{\mathrm{erg}}\subset \mathcal {M}\), we have
Now we assume that the pair \((T,\Phi )\) has an abundance of ergodic measures and we establish the reverse inequality. It follows from (15) that for each \(\nu \in \mathcal {M}\), \(h<h_\nu (T)\) and \(\varepsilon >0\), there exists an ergodic measure \(\mu \in \mathcal {M}\) such that
Since h and \(\varepsilon \) are arbitrary, this readily implies that
Finally, it follows from Theorem 3 that identity (20) holds.
Now assume that F is convex. It follows from (19) that
for each \(\nu \in \mathcal {M}\). Therefore,
and applying Theorem 4 we also obtain identity (20). \(\square \)
Remark
Without the assumptions of abundance of ergodic measures or convexity of the function F we are only able to show that
In fact, if we drop both assumptions, then the variational principle may fail (see Example 2.5 in [7]).
4 Equilibrium Measures: Existence and Characterization
In this section we consider the problem of characterizing the equilibrium measures of the nonlinear topological pressure.
4.1 Existence of Equilibrium Measures
In view of Theorem 3, we say that \(\mu \in \mathcal {M}\) is an equilibrium measure for \((F,\Phi )\) with respect to T if
We first formulate a result on the existence of equilibrium measures.
Theorem 6
Let \(T:X \rightarrow X\) be a continuous map on a compact metric space such that the map \(\mu \mapsto h_\mu (T)\) is upper semicontinuous, let \(\Phi =\{\varphi _1,\ldots , \varphi _d\}\) be a family of continuous functions, and let \(F:{\mathbb {R}}^d \rightarrow {\mathbb {R}}\) be a continuous function. If the pair \((T,\Phi )\) has an abundance of ergodic measures or F is convex, then there exists at least one equilibrium measure for \((F,\Phi )\).
Proof
Since the map \(\mu \mapsto h_\mu (T)\) is upper semicontinuous, F is continuous and the map \(\mu \mapsto \int _X\psi \,d\mu \) is continuous for each continuous function \(\psi :X \rightarrow {\mathbb {R}}\), we conclude that \(\mu \mapsto h_\mu (T) + F(\int _X\Phi \,d\mu )\) is upper semicontinuous. Together with the compactness of \(\mathcal {M}\), this guarantees that there exists a measure \(\mu _{\Phi } \in \mathcal {M}\) such that
Hence, it follows from the variational principles in Theorems 3 and 4 that \(\mu _{\Phi }\) is an equilibrium measure for \((F,\Phi )\). \(\square \)
In some cases one can pass to the one-dimensional setting of the nonlinear thermodynamic formalism.
Example 1
Consider the function \(F:{\mathbb {R}}^d \rightarrow {\mathbb {R}}\) defined by
where \(f:{\mathbb {R}}\rightarrow {\mathbb {R}}\) is a continuous function and \(\alpha _j \in {\mathbb {R}}\) for \(j =1,\ldots ,d\). Then
for every \(\mu \in \mathcal {M}\), where
Moreover, \(P_F(\Phi ) = P_f(\varphi )\) and this implies that \((F,\Phi )\) and \((f,\varphi )\) have the same equilibrium measures. In other words, for a function F as above the study of equilibrium measures can be reduced to the case when \(d=1\).
Of course, in general the continuous function F can be much more complicated. For instance, as mentioned in the introduction, the Curie–Weiss–Potts model involves the study of the topological pressure for the function
where \(\beta \ge 0\) is a physical parameter.
For the following example, we recall the notion of cohomology. We say that a function \(\varphi :X \rightarrow {\mathbb {R}}\) is cohomologous to a function \(\psi :X \rightarrow {\mathbb {R}}\) (with respect to T) if there exists a measurable bounded function \(q:X \rightarrow {\mathbb {R}}\) such that
Example 2
(Reduction of dimension via cohomology) Let T be a continuous map on a compact metric space and let \(\Phi =\{\varphi _1,\ldots , \varphi _d\}\) be a family of continuous functions such that the pair \((T,\Phi )\) has an abundance of ergodic measures. Let \(F:{\mathbb {R}}^d \rightarrow {\mathbb {R}}\) be a continuous function and assume that \(\varphi _1\) is cohomologous to \(\varphi _d\). This implies that \(\int _X\varphi _1 \,d\mu = \int _X\varphi _{d} \,d\mu \) for every \(\mu \in \mathcal {M}\). Therefore,
for every \(\mu \in \mathcal {M}\), where
for each \((z_1,\ldots ,z_d) \in {\mathbb {R}}^d\). The cohomology assumption also implies that
Together with the continuity of F, this implies that \(P_{F}(\Phi ) = P_{G}(\Psi )\), where \(\Psi = \{\varphi _1,\ldots ,\varphi _{d-1}\}\). Hence, the pairs \((F,\Phi )\) and \((G,\Psi )\) have the same equilibrium measures. More generally, in order to further reduce the dimension of the problem, one could consider additional cohomology relations between any two functions in \(\Phi \). For instance, if \(\varphi _1\) is cohomologous to all functions \(\varphi _j \in \Phi \), then the problem reduces to the one-dimensional case.
Example 3
(Reduction to the classical case via cohomology) Let T be a continuous map on a compact metric space and let \(\Phi = \{\varphi _1, \varphi _2\}\) be a pair of continuous functions such that \((T,\Phi )\) has an abundance of ergodic measures. Moreover, assume that \(\varphi _1\) is cohomologous to \(\varphi _2\) and consider the function \(F:{\mathbb {R}}^2 \rightarrow {\mathbb {R}}\) given by \(F(z_1,z_2) = (z_1^3+ z_2^3)^{1/3}\). This implies that \(\int _X\varphi _1 \,d\mu = \int _X\varphi _2 \,d\mu \) for every \(\mu \in \mathcal {M}\) and so
for every \(\mu \in \mathcal {M}\). Letting \(\psi = 2^{1/3}\varphi _1\), it follows from the definitions that \(P_F(\Phi ) = P(\psi )\), where P denotes the classical topological pressure. Hence, \(\nu \) is an equilibrium measure for \((F,\Phi )\) if and only if \(\nu \) is an equilibrium measure for \(\psi \).
Recall that a continuous function \(\varphi :X \rightarrow {\mathbb {R}}\) is said to have the Bowen property if there exist \(K > 0\) and \(\varepsilon > 0\) such that whenever
we have
If \(T:X \rightarrow X\) is an expansive map with specification and \(\varphi _1\) (or \(\varphi _2\)) is a continuous function with the Bowen property, then there exists a unique equilibrium measure \(\mu _\psi \) for \(\psi \) (see [5]). Therefore, \(\mu _\psi \) is also the unique equilibrium measure for \((F,\Phi )\).
We observe that this example can be easily generalized to the case when
where \(H_n\) is a homogeneous polynomial of degree n, assuming additional cohomology relations between some pairs of functions in \(\Phi \).
4.2 Characterization of Equilibrium Measures
Now we consider the problem of characterizing the equilibrium measures. Given a pair \((T,\Phi )\), we consider the set
Since the map \(\mu \mapsto \int _X\psi \,d\mu \) is continuous for each continuous function \(\psi :X\rightarrow {\mathbb {R}}\) and \(\mathcal {M}\) is compact and connected, the set \(L(\Phi )\) is a compact and connected subset of \({\mathbb {R}}^d\). For each \(z \in {\mathbb {R}}^d\), we also consider the level sets
and
Following closely [7], we say that the pair \((T,\Phi )\) is \(C^r\) regular (for some \(2 \le r \le \omega \), where \(\omega \) refers to the analytic case) if the following holds:
-
(1)
each function in \({{\,\mathrm{span}\,}}\{\varphi _1,\ldots ,\varphi _d,1 \}\) has a unique equilibrium measure for the classical topological pressure and \({{\,\mathrm{int}\,}}L(\Phi ) \ne \emptyset \);
-
(2)
for each \(z \in {{\,\mathrm{int}\,}}L(\Phi )\) the map \(q \mapsto P(\langle q, \Phi - z\rangle )\), where P is the classical topological pressure and \(\langle \cdot , \cdot \rangle \) is the usual inner product, takes only finite values, is of class \(C^r\), is strictly convex, and its second derivative is a positive definite bilinear form for each \(q \in {\mathbb {R}}^d\);
-
(3)
the entropy map \(\mu \mapsto h_\mu (T)\) is upper semicontinuous and bounded.
Examples of \(C^r\) regular pairs \((T,\Phi )\) include topologically mixing subshifts of finite type, \(C^{1+\varepsilon }\) expanding maps, and \(C^{1+\varepsilon }\) diffeomorphisms with a locally maximal hyperbolic set, with \(\Phi \) composed of Hölder continuous functions. Finally, we say that the family of functions \(\Phi = \{\varphi _1,\ldots ,\varphi _d\}\) is cohomologous to a constant \(c = (c_1,\ldots ,c_d)\) if \(\varphi _{i}\) is cohomologous to \(c_i\) for \(i=1,\ldots ,d\). Then \(L(\Phi ) = \{c\}\) and so \({{\,\mathrm{int}\,}}L(\Phi ) = \emptyset \).
The following theorem is our main result. Given a function \(F:{\mathbb {R}}^d \rightarrow {\mathbb {R}}\), we consider the set
We also consider the function \(h:L(\Phi ) \rightarrow {\mathbb {R}}\) defined by
Theorem 7
Let \(T:X \rightarrow X\) be a continuous map on a compact metric space and let \(\Phi =\{\varphi _1,\ldots , \varphi _d\}\) be a family of continuous functions such that the pair \((T,\Phi )\) is \(C^1\) regular. For each continuous function \(F:{\mathbb {R}}^d \rightarrow {\mathbb {R}}\), the following properties hold:
-
(1)
\(K(F,\Phi )\) is a nonempty compact set;
-
(2)
\(K(F,\Phi )\) is the set of maximizers of the function \(z \mapsto h(z) + F(z)\);
-
(3)
if \(K(F,\Phi ) \subset {{\,\mathrm{int}\,}}L(\Phi )\), then the equilibrium measures for \((F,\Phi )\) are the elements of \(\{\nu _{z}: z \in K(F,\Phi )\}\), where each \(\nu _z\in \mathcal {M}\) satisfies:
-
\(\nu _z\) is ergodic;
-
\(\nu _z\) is the unique invariant measure in \(\mathcal {M}(z)\) supported on the level set \(C_z(\Phi )\) such that \(h_{\nu _z}(T) = h(z)\);
-
\(\nu _z\) is the unique equilibrium measure for a function
$$\begin{aligned} \psi _z= \langle q(z), \Phi - z\rangle \end{aligned}$$(23)in \({{\,\mathrm{span}\,}}\{\varphi _1,\ldots ,\varphi _d,1\}\), for some \(q(z) \in {\mathbb {R}}^{d}\).
-
Proof
We divide the proof into steps.
Lemma 3
\(K(F,\Phi )\) is a nonempty compact subset of \(L(\Phi )\).
Proof of the lemma
Let \((z_n)_{n\in {\mathbb {N}}}\) be a sequence in \(K(F,\Phi )\) converging to a point \(z \in L(\Phi )\). For each \(n \in {\mathbb {N}}\) there exists an equilibrium measure \(\mu _n \in \mathcal {M}\) for \((F,\Phi )\) such that \(z_n= \int _X\Phi \,d\mu _n\). Passing eventually to a subsequence, we may assume that there exists \(\mu \in \mathcal {M}\) such that \(\mu _n \rightarrow \mu \) when \(n\rightarrow \infty \) in the \(\hbox {weak}^*\) topology. Since the map \(\mu \mapsto h_\mu (T)\) is upper semicontinuous, we obtain
which implies that \(\mu \) is an equilibrium measure for \((F,\Phi )\). Since \(z = \int _X\Phi \,d\mu \), we conclude that \(z \in K(F,\Phi )\). Hence, \(K(F,\Phi )\) is closed. Moreover, since
the set \(K(F,\Phi )\) is also bounded. By Theorem 6 it is nonempty. \(\square \)
Lemma 4
For each \(z \in {{\,\mathrm{int}\,}}L(\Phi )\) there exists an ergodic measure \(\nu _{z} \in \mathcal {M}\) such that \(\int _X\Phi \,d\nu _z = z\). In fact, \(\nu _z\) is the unique equilibrium measure for the function \(\psi _z\) given by (23).
Proof of the lemma
For each \(z \in L(\Phi )\) we consider the function
where P is the classical topological pressure and \(h(T|_{C_z(\Phi )})\) is the topological entropy of T on the set \(C_z(\Phi )\) (see Sect. 2.4 for the definition). By Lemmas 1 and 2 in [3] we have
and there exists at least one point \(q(z) \in {\mathbb {R}}^{d}\) such that \(\Delta _z(q(z)) = 0\). Since the map \(q \mapsto \Delta _z(q)\) is of class \(C^{1}\) and \(\Delta _z\) has a minimum at q(z), we conclude that \(\partial _q \Delta _z(q(z)) = 0\). Now let \(\nu _z\) be the equilibrium measure of the function \(\psi _z\) in (23). One can proceed as in the proof of Theorem 8 in [3] to verify that \(\nu _z\) is ergodic with
Moreover, since \(\psi _z \in {{\,\mathrm{span}\,}}\{\varphi _1,\ldots ,\varphi _{d},1\}\), it follows from the notion of \(C^1\) regular pair that \(\nu _z\) is the unique equilibrium measure for \(\psi _z\). \(\square \)
Lemma 5
For each \(z \in L(\Phi )\) there exists \(\mu \in \mathcal {M}(z)\) with \(h(z) = h_\mu (T)\). Moreover, when \(z \in {{\,\mathrm{int}\,}}L(\Phi )\) this measure is unique and coincides with \(\nu _z\).
Proof of the lemma
Take \(z \in L(\Phi )\). By the definition of \(L(\Phi )\), there exists \(\mu \in \mathcal {M}\) such that \(\int _X\Phi \, d\mu = z\), that is, \(\mathcal {M}(z) \ne \emptyset \). By the compactness of \(\mathcal {M}(z)\) and the upper semicontinuity of the map \(\mu \mapsto h_\mu (T)\), there exists \(\mu \in \mathcal {M}(z)\) maximizing the metric entropy.
Now take \(z \in {{\,\mathrm{int}\,}}L(\Phi )\). By Lemma 4, there exists a measure \(\nu _z \in \mathcal {M}\) such that \(\int _X\Phi \,d\nu _z = z\), where \(\nu _z\) is the unique equilibrium measure for the function \(\psi _z\) in (23). Let \(\mu \in \mathcal {M}(z)\) be a measure maximizing the metric entropy. Since \(\int _X\Phi \,d\mu = \int _X\Phi \,d\nu _z\), it follows readily from (23) that
Therefore,
which implies that \(\mu \) is also an equilibrium measure for \(\psi _z\) (for the classical topological pressure). Since \(\psi _z\) has a unique equilibrium measure, we conclude that \(\mu = \nu _z\). \(\square \)
Now consider the function \(E:L(\Phi ) \rightarrow {\mathbb {R}}\) defined by \(E(z) = h(z) + F(z)\).
Lemma 6
\(z \in K(F,\Phi )\) if and only if z maximizes the function E.
Proof of the lemma
First assume that \(z \in L(\Phi )\) maximizes the function E. By Lemma 5, there exists \(\mu \in \mathcal {M}(z)\) such that \(h(z) = h_\mu (T)\) and so
This implies that \(\mu \) is an equilibrium measure for \((F,\Phi )\) and so \(z \in K(F,\Phi )\).
Now assume that \(z \in K(F,\Phi )\). Then there exists an equilibrium measure \(\mu \) for \((F,\Phi )\) such that \(z = \int _X\Phi \,d\mu \) and so
This shows that z maximizes E. \(\square \)
Lemmas 3 and 6 give items (1) and (2) in the theorem. Now we establish item (3). For each \(z \in K(F,\Phi )\) there exists an equilibrium measure \(\mu \) for \((F,\Phi )\) such that \(\int _X\Phi \,d\mu = z\). When \(K(F,\Phi ) \subset {{\,\mathrm{int}\,}}L(\Phi )\), it follows from Lemmas 4 and 5 that \(\mu \) is the unique measure with \(\int _X\Phi \,d\mu = z\) and that \(\mu = \nu _z\), where \(\nu _z\) is ergodic and is the unique equilibrium measure for some function \(\psi _z\). \(\square \)
It is shown in [8] that the condition \(K(F,\Phi ) \subset {{\,\mathrm{int}\,}}L(\Phi )\) in the last property of Theorem 7 holds for a certain class of pairs \((T,\Phi )\) that they call \(C^r\) Legendre (we refer to that paper for the definition).
Remark
In the proof of Lemma 4, for each \(z \in {{\,\mathrm{int}\,}}L(\Phi )\) the point q(z) minimizing \(\Delta _z(q)\) might not be unique. Therefore, one may have more than one function \(\psi _z\) as in (23). On the other hand, Lemma 5 guarantees that all possible functions \(\psi _z\) have the same equilibrium measure \(\nu _z\).
5 Number of Equilibrium Measures
In this section we consider the problem of how many equilibrium measures a \(C^r\) regular system has.
5.1 Preliminary Results
We start with some auxiliary results about the function h in (22). Note that
Proposition 8
For a \(C^1\) regular pair \((T, \Phi )\) the function \(h:L(\Phi ) \rightarrow {\mathbb {R}}\) is upper semicontinuous, concave and finite.
Proof
Take \(z \in L(\Phi )\) and consider a sequence \((z_n)_{n \in {\mathbb {N}}}\) in \(L(\Phi )\) such that \(z_n \rightarrow z\) when \(n\rightarrow \infty \). By Lemma 5, eventually passing to a subsequence one can assume that for each \(n \in {\mathbb {N}}\) there exists \(\mu _n \in \mathcal {M}(z_n)\) such that \(h(z_n) = h_{\mu _n}(T)\) and \(\mu _n \rightarrow \mu \) when \(n\rightarrow \infty \) for some \(\mu \in \mathcal {M}\) in the \(\hbox {weak}^*\) topology. We also have
and so \(\mu \in \mathcal {M}(z)\). Moreover, since \(\mu \mapsto h_\mu (T)\) is upper semicontinuous, we obtain
and so h is upper semicontinuous on \(L(\Phi )\).
Now we prove the concavity property. Take \(z_1, z_2 \in L(\Phi )\) and \(\mu _1 \in \mathcal {M}(z_1)\), \(\mu _2 \in \mathcal {M}(z_2)\) such that \(h(z_1) = h_{\mu _1}(T)\) and \(h(z_2) = h_{\mu _2}(T)\). Since the entropy map is affine, for each \(t \in [0,1]\) we have
The upper semicontinuity of h on \(L(\Phi )\) together with the compactness of \(L(\Phi )\) and the fact that \(\mathcal {M}(z) \ne \emptyset \) for each \(z \in L(\Phi )\), guarantee that h is finite on \(L(\Phi )\). \(\square \)
As pointed out in the recent work [30], in strong contrast to what happens for \(d=1\), the function \(z \mapsto h(z)\) need not be continuous on \(L(\Phi )\).
Proposition 9
If the pair \((T,\Phi )\) is \(C^r\) regular, then the function \(h|_{{{\,\mathrm{int}\,}}L(\Phi )}\) is \(C^{r-1}\). Moreover, if \((T,\Phi )\) is \(C^{\omega }\) regular, then \(h|_{{{\,\mathrm{int}\,}}L(\Phi )}\) is analytic.
Proof
It follows from Theorem 12 in [3] that if \((T,\Phi )\) is \(C^r\) regular, then the map \({{\,\mathrm{int}\,}}L(\Phi ) \ni z \mapsto h(T|_{C_z(\Phi )})\) (the topological entropy of T on \(C_z(\Phi )\)) is of class \(C^{r-1}\), and that if the pair is \(C^{\omega }\) regular, then this map is analytic. Since \(h(z) = h(T|_{C_z(\Phi )})\) for \(z \in {{\,\mathrm{int}\,}}L(\Phi )\), we obtain the desired statement. \(\square \)
For \(d=1\), Corollary 1.11 in [7] says that if the pair \((T,\Phi )\) is \(C^\omega \) and F is analytic on \({{\,\mathrm{int}\,}}L(\Phi )\), then the set \(K(F,\Phi )\) is finite. In particular, there exist finitely many equilibrium measures.
5.2 Equilibrium Measures I
For \(d=1\), it was shown in [7] that no point on \(\partial L(\Phi )\) maximizes the function \(E = h + F\). By Lemma 6, this implies that \(K(F,\Phi ) \subset {{\,\mathrm{int}\,}}L(\Phi )\). It is also shown that \(h''(z) < 0\) for every \(z \in {{\,\mathrm{int}\,}}L(\Phi )\) and so \(h:L(\Phi ) \rightarrow {\mathbb {R}}\) is a strictly concave function. Note that for \(d = 1\) we have \(L(\varphi ) = [A, B]\), where \(A = \inf _{\mu \in \mathcal {M}}\int \varphi \,d\mu \) and \(B = \sup _{\mu \in \mathcal {M}}\int \varphi \,d\mu \).
The next result is a criterion for uniqueness of equilibrium measures.
Theorem 10
Let \((T,\varphi )\) be a \(C^r\) regular pair and let \(F:{\mathbb {R}}\rightarrow {\mathbb {R}}\) be a \(C^r\) function that is concave on [A, B]. Then there exists a unique equilibrium measure for \((F,\varphi )\). Moreover, the equilibrium measure is ergodic.
Proof
Since F is concave and h is strictly concave, the function \(E = h + F\) is strictly concave. This implies that E has at most one maximizer in (A, B). Since there is no maximizer of E on \(\partial L(\varphi ) = \{A,B\}\) and \(K(F,\varphi ) \ne \emptyset \), we conclude that there exists a unique point \(z^* \in (A,B)\) maximizing E. Hence, it follows from Lemma 6 that \(K(F,\varphi ) = \{z^*\}\). By Theorem 4.3 in [7] together with Lemma 4, we conclude that there exists a unique equilibrium measure for \((F,\varphi )\) and that this measure is ergodic. \(\square \)
The following example illustrates various possibilities.
Example 4
Let \(\Sigma = \{-1,1\}\) and let \(T:\Sigma ^{{\mathbb {Z}}} \rightarrow \Sigma ^{{\mathbb {Z}}}\) be the two-sided shift. We consider the function \(\varphi :\Sigma \rightarrow {\mathbb {R}}\) defined by \(\varphi (\cdots \omega _{-1} \omega _0 \omega _1 \cdots ) = \omega _0\). Then \(L(\varphi ) = [-1,1]\) and the entropy function \(h:L(\varphi ) \rightarrow {\mathbb {R}}\) is given by
For the function \(F:L(\Phi ) \rightarrow {\mathbb {R}}\) defined by \(F(z) = \alpha /(z^2 - 2)\), where \(\alpha \in {\mathbb {R}}\), we have
Notice that for \(\alpha > 0\) we have \(F''< 0\) on \({{\,\mathrm{int}\,}}L(\varphi )\). Since \(F \equiv 0\) for \(\alpha = 0\), the function F is concave on \({{\,\mathrm{int}\,}}L(\varphi )\) whenever \(\alpha \ge 0\). Hence, by Theorem 10 there exists a unique equilibrium measure \(\nu _{z^*}\) for \((F, \varphi )\), where \(z^* = 0\) (see Fig. 1). For \(\alpha < 0\), the number of equilibrium measures may vary and is the number of absolute maximizers of E on \((-1,1)\). For instance, for \(\alpha = -1\) there is one equilibrium measure, while for \(\alpha = -2.3\) there are two equilibrium measures (see also Fig. 1).
Theorem 10 also shows that in order to have finitely many equilibrium measures it is not necessary that the pair \((T, \varphi )\) is \(C^\omega \) and that the function F is analytic. We give an example in the nonanalytic \(C^{\infty }\) case.
Example 5
Consider the pair \((T,\varphi )\) in Example 4 and let \(F:{\mathbb {R}}\rightarrow {\mathbb {R}}\) be the function given by
One can show that F is \(C^\infty \) but not analytic. For \(-1 \le z \le 0\), we have
It follows from (24) that E has a local maximum \(y_1 = 1\) at \(z_1^{*} = 0\). For \(0 < z \le 1\), one can verify that E has a local maximum \(y_2 \approx 1.33\) at \(z_2^{*} \approx 0.75\). Since \(y_1 < y_2\), the function E has a unique global maximum at \(z_2^{*} \in (0, 1) \subset {{\,\mathrm{int}\,}}L(\varphi )\) (see Fig. 2). By Theorem 7, we conclude that \(\nu _{z_2^*}\) is the unique equilibrium measure for \((F,\varphi )\).
5.3 Equilibrium Measures II
As in the one-dimensional case, for \(d > 1\) no point in \(\partial {L(\Phi )}\) can maximize the function E, that is, \(K(F,\Phi ) \subset {{\,\mathrm{int}\,}}L(\Phi )\) (see the Claim in the proof of Theorem 4.15 in [8]). This is possible because \(C^{r}\) regular pairs are \(C^{r}\) Legendre (see Proposition 4.10 in [8]).
The following statement is a version of the uniqueness result in Theorem 10 for \(d>1\).
Theorem 11
Let \((T,\Phi )\) be a \(C^r\) regular pair and let \(F:{\mathbb {R}}^d \rightarrow {\mathbb {R}}\) be a \(C^r\) function that is strictly concave on \(L(\Phi )\). Then there exists a unique equilibrium measure for \((F,\Phi )\). Moreover, the equilibrium measure is ergodic.
Proof
By Proposition 8, the map \(z \mapsto h(z)\) is upper semicontinuous on \(L(\Phi )\). Since F is \(C^r\) on \(L(\Phi )\), we conclude that \(z \mapsto E(z)\) is upper semicontinuous on \(L(\Phi )\). Together with the compactness of \(L(\Phi )\), this guarantees the existence of at least one point in \(L(\Phi )\) maximizing the function E. On the other hand, by Propositions 8 and 9 and the strict concavity of F, the function E is strictly concave on \(L(\Phi )\) and \(C^{r-1}\) on \({{\,\mathrm{int}\,}}L(\Phi )\). The concavity property of E implies that there exists at most one maximizer in \(L(\Phi )\). Since there are no maximizers of E in \(\partial L(\Phi )\), the unique point \(z^*\) maximizing E must be in \({{\,\mathrm{int}\,}}L(\Phi )\). It follows now from Theorem 7 that \(K(F,\Phi ) = \{z^*\}\), that is, \(\nu _{z^*}\) is the unique equilibrium measure for \((F,\Phi )\). Moreover, by Lemma 4, \(\nu _{z^*}\) is an ergodic measure. \(\square \)
In Example 4, we have \(h|_{\partial L(\varphi )} \equiv 0\). It turns out that this behavior at the boundary of \(L(\varphi )\) is typical for some \(C^r\) regular systems, even for \(d>1\). Let \(\mathcal {H}_{\theta }\) be the space of Hölder continuous functions with Hölder exponent \(\theta > 0\). The following result is a particular case of Theorem 14 in [3].
Theorem 12
Let T be a subshift of finite type, a \(C^{1+\varepsilon }\) diffeomorphism with a hyperbolic set, or a \(C^{1+\varepsilon }\) map with a repeller, that is assumed to be topologically mixing. Then there exists a residual set \({\mathcal {O}} \subset (\mathcal {H}_{\theta })^{d}\) such that for each \(\Phi \in {\mathcal {O}}\) we have
We also note that in Example 4 with \(\alpha > 0\), the function F satisfies
where \(F_{{{\,\mathrm{int}\,}}L(\varphi )}\) is the restriction to \({{\,\mathrm{int}\,}}L(\varphi )\). In fact, condition (26) together with the continuity of F implies that F must actually be constant on \(\partial L(\Phi )\), as it happens in Example 4. This scenario is a more general situation in which \(E = h + F\) attains its maximum on \({{\,\mathrm{int}\,}}L(\Phi )\):
Note that this condition may depend not only on F, but also on the family of functions \(\Phi \).
A similar idea works for typical \(C^r\) regular systems in the sense that they belong to the residual set \({\mathcal {O}}\) in Theorem 12. Let \((T, \Phi )\) be a \(C^r\) regular pair satisfying (25). In particular, \({{\,\mathrm{int}\,}}L(\Phi ) \ne \emptyset \). Now let \(F:{\mathbb {R}}^d \rightarrow {\mathbb {R}}\) be a function satisfying (26) with \(\varphi \) replaced by \(\Phi \). Since \(h \ge 0\), we have
which implies that property (27) holds. Therefore, \(K(F, \Phi ) \subset {{\,\mathrm{int}\,}}L(\Phi )\) and so one can apply item (3) of Theorem 7.
It was shown recently in [8] that condition (27) is satisfied for \(C^r\) Legendre pairs. This implies that we always have \(K(F, \Phi ) \subset {{\,\mathrm{int}\,}}L(\Phi )\) in our setup.
For \(d=1\) and \(C^r\) regular systems, the function h in (22) is strictly concave. The next example (which should be compared with the Curie–Weiss–Potts model for 3 colors) illustrates that this may still happen for \(d>1\), but unfortunately we are not able to describe for which \(C^r\) regular pairs the function h is strictly concave.
Example 6
Let \(T:X \rightarrow X\) be the two-sided shift with \(X = \{1,2,3\}^{\mathbb {Z}}\) and let \(\varphi _1 = \chi _{C_1}\) and \(\varphi _2 = \chi _{C_3}\), where \(C_i\) is the set of all sequences
with \(\omega _0 = i\). Since \(\int _X \varphi _1 d\mu = \mu (C_1)\) and \(\int _X \varphi _2 d\mu = \mu (C_3)\) for each \(\mu \in \mathcal {M}\), we have
By Theorem 8 in [3], we obtain
On the other hand, since \(\mu (C_1) + \mu (C_2) + \mu (C_3) = 1\) for each \(\mu \in \mathcal {M}\), we have
and
Note that \({{\,\mathrm{int}\,}}L(\Phi ) \ne \emptyset \) and that \(\partial L(\Phi )\) is the set
For \((z_1,z_2) = (1/2,0)\), (0, 1/2), \((1/2,1/2) \in \partial L(\Phi )\) we have \(h(z_1,z_2) = 1 > 0\) and so the system is not typical. On the other hand, one can easily verify that the map \((z_1,z_2) \mapsto h(z_1,z_2)\) is still strictly concave on \(L(\Phi )\).
Now consider the function \(F(z_1, z_2) = \beta (z_1^2+z_2^2)/2\) with \(\beta \in {\mathbb {R}}\). One can verify that the determinant of the Hessian matrix of \(E = h + F\) is given by
Since \(\det H_{E}(z_1,z_2) > 0\) for \((z_1,z_2) \in {{\,\mathrm{int}\,}}L(\Phi )\) and \(\beta \ge 0\), every critical point of E is nondegenerate for all \(\beta \ge 0\). Hence, for each \(\beta \ge 0\), the function E has at most finitely many critical points. In addition, it was shown in [8] that condition (27) always holds. So E attains its maximal value only at critical points. It follows from Theorem 7 that the pair \((F,\Phi )\) has finitely many equilibrium measures.
On the other hand, for \(\beta < 0\) the function F is strictly concave and one can use Theorem 11 to conclude that \((F,\Phi )\) has a unique equilibrium measure.
Remark
In Example 6 the parameter \(\beta \) is related to the absolute temperature and the model has physical meaning only when \(\beta \ge 0\). However, the general concave case (with \(\beta < 0\)) might be useful for possible applications in other contexts.
5.4 Coincidence of Equilibrium Measures
The following result gives a sufficient condition so that two systems share equilibrium measures. We say that \(\Phi _1 = \{\varphi _{1,1},\ldots ,\varphi _{1,d}\}\) is cohomologous to \(\Phi _2 = \{\varphi _{2,1},\ldots ,\varphi _{2,d}\}\) if \(\varphi _{1,i}\) is cohomologous to \(\varphi _{2,i}\) for \(i =1,\ldots ,d\). Then
which readily implies that \(L(\Phi _1) = L(\Phi _2)\).
Proposition 13
Let \((T, \Phi _1)\) and \((T,\Phi _2)\) be \(C^r\) regular pairs such that \(\Phi _1\) is cohomologous to \(\Phi _2\) and let \(F_1:L(\Phi _1) \rightarrow {\mathbb {R}}\) and \(F_2:L(\Phi _2) \rightarrow {\mathbb {R}}\) be continuous functions. If a point \(z \in {{\,\mathrm{int}\,}}L(\Phi _1) \cap {{\,\mathrm{int}\,}}L(\Phi _2)\) is simultaneously a maximizer for the functions \(E_1= h_1 + F_1\) and \(E_2= h_2 + F_2\), then \(\nu _{z}\) is an equilibrium measure for \((F_1, \Phi _1)\) and \((F_2,\Phi _2)\).
Proof
Since \(\Phi _1\) is cohomologous to \(\Phi _2\), we have \(L:= L(\Phi _1) = L(\Phi _2)\). Now take \(z \in {{\,\mathrm{int}\,}}L\) and consider the functions
where P denotes the classical topological pressure and where each \(h_i\) is the corresponding entropy function (see (22)). By the cohomology assumption, we have
and so \(C_{z}(\Phi _1) = C_{z}(\Phi _2)\) for all \(z \in {\mathbb {R}}^d\) (see (21)). In particular, this implies that \(h := h_1 = h_2\). Therefore,
for \(q \in {\mathbb {R}}^d\). Again since \(\Phi _1\) is cohomologous to \(\Phi _2\), we conclude that
On the other hand, by the proof of Theorem 8 in [3] the function \(q\mapsto \Delta _1(q)\) attains its minimum at a point \(q_1(z)\) and \(\nu _{1,z}\) is the unique equilibrium measure for the function \(\langle q_1(z), \Phi _1 - z\rangle - h(z)\). Similarly, \(q \mapsto \Delta _2(q)\) attains its minimum at a point \(q_2(z)\) and \(\nu _{2,z}\) is the unique equilibrium measure for the function \(\langle q_2(z), \Phi _2 - z\rangle - h(z)\). By (28), one can take \(q_1(z) = q_2(z)\) and so \(\nu _z:= \nu _{1,z} = \nu _{2,z}\). The desired result follows now from Theorem 7. \(\square \)
A direct consequence of Proposition 13 is that if \(\Phi _1\) is cohomologous to \(\Phi _2\) and the functions \(E_1\) and \(E_2\) attain maximal values at the same points, then \((F_1, \Phi _1)\) and \((F_2, \Phi _2)\) have the same equilibrium measures (in particular, this happens when \(F_1 = F_2\)). For the converse to hold we need stronger conditions so that the coincidence of two equilibrium measures yields a cohomology relation.
Theorem 14
Let X be a topologically mixing locally maximal hyperbolic set for a diffeomorphism T and let \(\Phi _1\) and \(\Phi _2\) be families of Hölder continuous functions. Moreover, let \(F_1\) and \(F_2\) be continuous functions. If \((F_1,\Phi _1)\) and \((F_2,\Phi _2)\) have the same equilibrium measures, then for each \(z_1\) and \(z_2\) maximizing \(E_1\) and \(E_2\), respectively, there exist \(q_1, q_2\in {\mathbb {R}}^d\) such that \(\langle q_1, \Phi _1 - z_1\rangle \) is cohomologous to \(\langle q_2, \Phi _2-z_2\rangle \).
Proof
By Theorem 7, each equilibrium measure for \((F_i,\Phi _i)\) is a measure \(\nu _{z_i}\) with \(z_i \in K(F_i,\Phi _i)\) that is the unique equilibrium measure for
where \(q_i(z_i)\) is a minimizer of the function
Since by hypotheses \(\nu _{z_1} = \nu _{z_2}\), the function \(\psi _1-\psi _2\) is cohomologous to \(P(\psi _1) - P(\psi _2) \in {\mathbb {R}}\). But since
(see Lemma 2 in [3]), we have \(P(\psi _1) =P(\psi _2)\). So there exists a continuous function \(S= S(z_1,z_2): X \rightarrow {\mathbb {R}}\) such that \(\psi _1- \psi _2 = S\circ T - S\), that is,
Again since \(\nu _{z_1} = \nu _{z_2}\), by Lemma 5 we have
Hence, for each \(z_1\) and \(z_2\) maximizing \(E_1\) and \(E_2\), respectively, there exist points \(q_1=q_1(z_1), q_2=q_2(z_2) \in {\mathbb {R}}^d\) as in the statement of the theorem. \(\square \)
Data Availability
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
References
Barreira, L.: A non-additive thermodynamic formalism and applications to dimension theory of hyperbolic dynamical systems. Ergodic Theory Dyn. Syst. 16, 871–927 (1996)
Barreira, L.: Thermodynamic Formalism and Applications to Dimension Theory, Progress in Mathematics, vol. 294. Birkhäuser, Basel (2011)
Barreira, L., Saussol, B., Schmeling, J.: Higher-dimensional multifractal analysis. J. Math. Pures Appl. 81, 67–91 (2002)
Bowen, R.: Topological entropy for noncompact sets. Trans. Am. Math. Soc. 184, 125–136 (1973)
Bowen, R.: Some systems with unique equilibrium states. Math. Systems Theory 8, 193–202 (1974/75)
Bowen, R.: Equilibrium States and Ergodic Theory of Anosov Diffeomorphisms. Lecture Notes in Mathematics, vol. 470. Springer, Berlin (1975)
Buzzi, J., Leplaideur, R.: Nonlinear thermodynamical formalism (2020). arXiv:2002.00576v2
Buzzi, J., Kloeckner, B., Leplaideur, R.: Nonlinear thermodynamical formalism (2021). arXiv:2002.00576v3
Constantine, D., Lafont, J.-F., Thompson, D.: The weak specification property for geodesic flows on CAT(-1) spaces. Groups Geom. Dyn. 14, 297–336 (2020)
Eizenberg, A., Kifer, Y., Weiss, B.: Large deviations for \(Z^{d}\)-actions. Commun. Math. Phys. 164, 433–454 (1994)
Ellis, R.: Entropy, Large Deviations, and Statistical Mechanics, Grundlehren der mathematischen Wissenschaften, vol. 271. Springer, New York (1985)
Ellis, R., Wang, K.: Limit theorems for the empirical vector of the Curie-Weiss-Potts model. Stoch. Process. Appl. 35, 59–79 (1990)
Friedli, S., Velenik, Y.: Statistical Mechanics of Lattice Systems. A Concrete Mathematical Introduction, Cambridge University Press, Cambridge (2018)
Katok, A., Hasselblatt, B.: Introduction to the Modern Theory of Dynamical Systems, Encyclopedia of Mathematics and Its Applications, vol. 54. Cambridge University Press, Cambridge (1995)
Keller, G.: Equilibrium States in Ergodic Theory, London Mathematical Society Student Texts, vol. 42. Cambridge University Press, Cambridge (1998)
Leplaideur, R.: Chaos: butterflies also generate phase transitions. J. Stat. Phys. 161, 151–170 (2015)
Leplaideur, R., Watbled, F.: Generalized Curie-Weiss model and quadratic pressure in ergodic theory. Bull. Soc. Math. France 147, 197–219 (2019)
Leplaideur, R., Watbled, F.: Generalized Curie-Weiss-Potts model and quadratic pressure in ergodic theory. J. Stat. Phys. 181, 263–292 (2020)
Misiurewicz, M.: A short proof of the variational principle for a \(\mathbb{Z}_{+}^{N}\) action on a compact space. In: International Conference on Dynamical Systems in Mathematical Physics (Rennes, 1975), Astérisque, No. 40, Soc. Math. France, Paris, pp. 147–157 (1976)
Parry, W., Pollicott, M.: Zeta functions and the periodic orbit structure of hyperbolic dynamics. Astérisque 187–188 (1990)
Pesin, Ya.: Dimension Theory in Dynamical Systems. Contemporary Views and Applications. Chicago Lectures in Mathematics, University of Chicago Press, Chicago (1997)
Pesin, Ya., Pitskel’, B.: Topological pressure and the variational principle for noncompact sets, Functional. Anal. Appl. 18, 307–318 (1984)
Pfister, C.-E., Sullivan, W.: Large deviations estimates for dynamical systems without the specification property. Applications to the \(\beta \)-shifts. Nonlinearity 22, 237–261 (2005)
Ruelle, D.: Statistical mechanics on a compact set with \({{\mathbb{Z}}}^\nu \) action satisfying expansiveness and specification. Trans. Am. Math. Soc. 185, 237–251 (1973)
Ruelle, D.: Thermodynamic Formalism, Encyclopedia of Mathematics and Its Applications, vol. 5. Addison-Wesley Publishing Co., Reading (1978)
Thaler, M., Maximilian: Estimates of the invariant densities of endomorphisms with indifferent fixed points. Isr. J. Math. 37, 303–314 (1980)
Viana, M., Oliveira, K.: Foundations of Ergodic Theory, Cambridge Studies in Advanced Mathematics, vol. 151. Cambridge University Press, Cambridge (2016)
Walters, P.: A variational principle for the pressure of continuous transformations. Am. J. Math. 97, 937–971 (1976)
Walters, P.: An Introduction to Ergodic Theory, Graduate Texts in Mathematics, vol. 79. Springer, New York (1982)
Wolf, C.: A shift map with a discontinuous entropy function. Discret. Contin. Dyn. Syst. 40, 319–329 (2020)
Acknowledgements
We thank the referees for the carefully reading of our work and for the constructive and helpful comments.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Peter Balint.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Partially supported by FCT/Portugal through CAMGSD, IST-ID, Projects UIDB/04459/2020 and UIDP/04459/2020. C.H. was supported by FCT/Portugal through the grant PD/BD/135523/2018.
Rights and permissions
About this article
Cite this article
Barreira, L., Holanda, C. Higher-Dimensional Nonlinear Thermodynamic Formalism. J Stat Phys 187, 18 (2022). https://doi.org/10.1007/s10955-022-02916-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10955-022-02916-4