Abstract
We extend results on quadratic pressure and convergence of Gibbs measures from Leplaideur and Watbled (Bull Soc Math France 147(2):197–219, 2019) to some general models for spin spaces. We define the notion of equilibrium state for the quadratic pressure and show that under some conditions on the maxima for some auxiliary function, the Gibbs measure converges to a convex combination of eigen-measures for the Transfer Operator. This extension works for dynamical systems defined by infinite-to-one maps. As an example, we compute the equilibrium for the mean-field XY model as the number of particles goes to \(+\infty \).
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
1.1 Background, Main Motivations, Open Questions
Thermodynamic formalism has been introduced in Dynamical Systems in the 70’s mainly by Sinai, Ruelle and Bowen (see [2, 23, 24]). The main motivation was to exhibit special invariant measures representing systems at equilibrium. Along the years, vocabulary from Physics has been used but for notions that may differ from the original ones in Physics. This is in particular the case for the notion of phase transition.
The present paper is the continuation of a recent work [20] where a dictionary between some phase transitions in Statistical Mechanics and their interpretation in Ergodic Theory was given.
In this paper we shall call Probabilistic Gibbs Measures (PGM for short) the Gibbs measures used in Physics or in Probability Theory and usually defined on finite lattices. On the other hand, Dynamical Gibbs Measures (DGM) will denote invariant measures usually considered in Ergodic Theory for Dynamical Systems, thus defined on an infinite system.
With this distinction, authors in [20] introduced the notion of quadratic pressure for dynamical systems and showed that for a generalization of the Curie–Weiss model, the PGM converge as the number of sites goes to \(+\infty \) to a convex combination of measures associated to the DGM which maximize the quadratic pressure.
The number of these DGM is the number of maxima for a special auxiliary function \(\varphi _{\beta }:{\mathbb {R}}\rightarrow {\mathbb {R}}\) and it was shown that a phase transition occurs at \(\beta _{c}\) when this number changes at \(\beta _{c}\).
The Curie–Weiss–Potts model is usually considered as an extension of the Curie–Weiss model, although phase transitions are quite different (see below). We refer to [9, 12] for a complete description of Ising, Curie–Weiss, and Potts models, and to [10] for the Curie–Weiss–Potts model. Even if for the Curie–Weiss–Potts model a similar result to Theorem 3 (in the present paper) was given in [20], a total description of the phenomenon with respect to quadratic pressure has not yet been done.
It was thus a natural question to investigate if a dictionary between Curie–Weiss general models for spin spaces and Ergodic Theory could be defined. In the present paper, we consider more general interactions between sites for mean field interactions and show that the approach of [20] can actually be adapted to these generalized spin spaces. These are the purposes of Theorems 2 and 3.
In view to give an application to the XY-model (see Sect. 5), the statement is done for dynamical systems which are not necessarily finite-to-one.
For that goal, the notion of entropy needs to be made precise. This notion has already been investigated and we mention e.g.a series of works [1, 6, 7, 21] and more recently [13]. We re-employ here the idea to define the entropy as the Fenchel-Legendre transform of the pressure function and to link it to a min–max problem. We remind that pressure here has to be understood within ergodic viewpoint and is the logarithm of the spectral radius of the transfer operator.
Some version of the transfer operator with infinite-to-one map has already been studied in [21]. We point out that in our case we have a more flexible operator as the transition depends on the two first coordinates (see below). Actually, we believe that it can easily be extended to the case where transitions only depend on finitely many coordinates, which is what should be a natural extension of the notion of subshift of finite type with infinite (and uncountable) alphabet. In other words, the transfer operator in [21] is the one for a full-shift of finite type whereas our is the one for more general irreducible subshift of finite type.
The main purpose of Theorem 1 is thus to show the existence of the spectral gap for the transfer operator (one of the main assumption to use [13]) and also to study the regularity of the spectral radius, in particular for the multi-dimensional case.
Theorem 3 is where we actually make links between Dynamical Gibbs Measures and Probabilistic Gibbs Measures. It deals with convergence of the PGM to a convex combination of eigen-measures for the Transfer Operator. One of the key points is the Laplace method. This is where the auxiliary function \(\varphi _{\beta }\) is involved.
We remind that the Laplace method deals with integrals of the form \(\int _{I} f(t)e^{n\phi (t)}dt\) and gives an equivalent of this quantity as n goes to \(+\infty \). This equivalent only involves values \(c_{i}\) where \(\phi \) is maximal. For each \(c_{i}\) one gets an expression \(\gamma (c_{i},n)\) depending on how flat \(\phi \) is close to \(c_{i}\) and also on how f behaves close to \(c_{i}\).
If I is some subset of \({\mathbb {R}}^{d}\) with non-empty interior, we emphasize that there is a big difference between \(d=1\) and the higher-dimensional case. For \(d=1\) we can, in general, compare all the \(\gamma (c_{i},n)\), up to the condition that they are finitely many, and whatever the flatness at each \(c_{i}\) is (as long as it is not totally degenerate). This does not hold for \(d>1\), and we only get case-by-case results, unless all the maxima have a non-degenerate Hessian. The consequence in our problem is that we can (in general) precisely determine what is the convex combination for the limit of the PGM, only if all the maxima are non-degenerate.
Once we link a phase transition with the change of maxima for an auxiliary function, it becomes easy to understand (and classify) different kinds of phase transitions. In spirit, a change in the number of the maxima may occur in two different ways. Either one maximum splits and produces several maxima. This is what happens in the Curie–Weiss model and also for the XY-model (see Sect. 5). Or, one local maxima far from the global ones becomes a global one. This is what happens in the Curie–Weiss–Potts model.
Now that we have at our disposal a kind of dictionary between the mean field theory and ergodic theory, we can formulate and translate some problems from one area to the other one, and expect to use tools from the latter to solve the problem of the former. We address here two questions of that kind.
Non-ergodic interacting particle systems. Some examples have quite recently been given or discussed (see [17, 22]). It is clear from the mathematical point of view that even if the underlying dynamical system is uniquely ergodic, the conformal measure (with respect to some potential that has to be made precise) may be different from the unique invariant measure as it is not necessarily invariant. From [20] and Theorem 3 we have shown that thermodynamical limits for PGM are the conformal measures and not the DGM. This could thus be a way to exhibit new examples of these “non-ergodic” uniquely invariant systems.
Conservation of Gibbsianness under local transformations (see [19]) and non-linear pressure. In [5] the concept of quadratic pressure is extended to define a non-linear thermodynamic formalism. An equivalent to Theorem 2 is stated and also the convergence (at logarithmic scale) of the partition function is proved. Nevertheless at that stage, there is no equivalent to the PGM extending the quadratic pressure to general non-linear pressure. Following [26], it appears reasonable to expect that results from [19] could be used to define the good notion of PGM in non-linear thermodynamic formalism.
1.2 Settings and Results
1.2.1 Shift with (Possibly) Infinite Alphabet
Let (E, d) be a compact metric space, let \(\rho \) be a Borel probability measure on E with full support. We assume that \(\rho \) satisfies the following assumption
We consider a map \(A:E\times E\rightarrow [0,1]\) called the transition function, which satisfies the following properties:
- (A1):
-
A is continuous with values in \(\{0,1\}\).
- (A2):
-
A is Lipschitz continuous with respect to the second variable with Lipschitz constant \(\mathrm{Lip}(A)\).
- (A3):
-
A generates some mixing in the following sense.
$$\begin{aligned}&\exists N\in {\mathbb {N}},\, \forall \,n\ge N,\ \forall \, a,b\in E,\, \exists z_{1},\ldots ,z_{n-1}\in E\nonumber \\&\qquad \text { such that } A(a,z_{1})A(z_{1},z_{2})\cdots A(z_{n-1},b)=1. \end{aligned}$$(1)
Remark 1
The assumption (A1) yields that A is constant with value 0 or 1 on each connected component of \(E\times E\). The assumption (A3) implies in particular that A is not identically null. \(\square \)
We define \(\Omega \subset E^{{\mathbb {N}}}\) in the following way:
The shift map \(\sigma :\Omega \rightarrow \Omega \) is defined by
Note that if E is connected, e.g.\(E=[0,1]\), then \(A\equiv 1\). If E is a finite set \(\{1,\ldots k\}\), then \(\Omega \) is the subshift of finite type with transition matrix having entries A(i, j).
For \(n\ge 1\), let \(\Omega _{n}\) be the set of words \(z_{1}\cdots z_{n}\) with \(\displaystyle \prod _{i=1}^{n-1} A(z_{i},z_{i+1})=1\). For a and b in E, let \(\Omega _{n-1}(a,b)\) be the set of words \(z_{1}\cdots z_{n-1}\) in \(\Omega _{n-1}\) with \(A(a,z_1)=A(z_{n-1},b)=1\). Assumption (A3) on A means that for every a, b in E, for every \(n\ge N\), \(\Omega _{n-1}(a,b)\ne \emptyset \). It implies in particular that for every a in E, there always exist u, v in E such that \(A(a,u)=1\) and \(A(v,a)=1\). We denote by \(\Omega _{n}(b)\) the set of words \(z_{0}\cdots z_{n-1}\) in \(\Omega _{n}\) with \(A(z_{n-1},b)=1\).
We set \({\mathbb {P}}=\rho ^{\otimes {\mathbb {N}}}\). The distance on \(\Omega \) is defined by
We notice that for any a in \(\Omega _n\),
and that
We denote by \({\mathcal {C}}^{0}(\Omega )\), respectively \({\mathcal {C}}^{+1}(\Omega )\), the set of continuous, respectively Lipschitz continuous, functions from \(\Omega \) to \({\mathbb {R}}\), equipped respectively with the norms
where \(\mathrm{Lip}(\phi )\) stands for the Lipschitz constant of \(\phi \). We recall that the spaces \(({\mathcal {C}}^{0}(\Omega ),\Vert \cdot \Vert _\infty )\) and \(({\mathcal {C}}^{+1}(\Omega ),\Vert \cdot \Vert _L)\) are Banach spaces. We set \(M(\Omega )\) the space of probability measures on \(\Omega \) and recall that by the Riesz representation theorem, the map \(\mu \mapsto (f\mapsto \int f\,d\mu )\) is a bijection between \(M(\Omega )\) and
where \(*\) means the topological dual space and \(mathbb{1}\) is the constant function \(\equiv 1\).
A measure \(\mu \) is \(\sigma \)-invariant if \(\mu (\sigma ^{-1}(B))=\mu (B)\) for all Borel sets B. The set \(M_\sigma (\Omega )\) is the space of \(\sigma \)-invariant probability measures on \(\Omega \). Both \(M(\Omega )\) and \(M_\sigma (\Omega )\) are convex and compact for the weak star topology.
The transfer operator associated to \(\phi :\Omega \rightarrow {\mathbb {R}}\) (Lipschitz continuous) is the linear operator defined by
Theorem 1 states several properties on the spectrum of the transfer operator. To properly state the theorem we need to introduce some more quantities.
The operator \({\mathcal {L}}_{\phi }\) acts on \({\mathcal {C}}^{0}(\Omega )\) and on \({\mathcal {C}}^{+1}(\Omega )\). The spectral radius of \({\mathcal {L}}_{\phi }\) on \({\mathcal {C}}^{0}(\Omega )\), denoted by \(r_{\phi }\), is a simple eigenvalue of the adjoint operator \({\mathcal {L}}_{\phi }^*\) acting on the space of Radon measures on \(\Omega \), and the conformal measure \(\nu _\phi \) is the unique probability eigen-measure associated to the eigenvalue \(r_\phi \). It is also a simple eigenvalue of \({\mathcal {L}}_{\phi }\) acting on \({\mathcal {C}}^{+1}(\Omega )\), with a positive eigenfunction \(G_\phi \) such that the measure \(\mu _\phi =G_\phi \nu _\phi \) is a probability measure. We call \(\mu _\phi \) the dynamical Gibbs measure (DGM for short) associated to \(\phi \).
If \({\textit{\textbf{z}}}\) belongs to \({\mathbb {R}}^{q}\) and \(\psi _{i}\), \(i=1,\ldots q\) are in \({\mathcal {C}}^{+1}(\Omega )\) one sets \(\overrightarrow{ \varvec{\psi }}:=(\psi _{1},\ldots ,\psi _{q})\) and \({\textit{\textbf{z}}}\cdot \overrightarrow{ \varvec{\psi }}:=\sum _{i=1}^{q}z_{i}\psi _{i}\). We denote by \(||{\textit{\textbf{z}}}||\) the Euclidean norm of \({\textit{\textbf{z}}}\)
Definition 1.1
For fixed \(\overrightarrow{ \varvec{\psi }}\) and \({\textit{\textbf{z}}}\in {\mathbb {R}}^{q}\), one sets
where \(r_{{\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}\) is the spectral radius for \({\mathcal {L}}_{{\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}\), and
Note that \(I(\overrightarrow{ \varvec{\psi }})\) is a closed convex subset of \({\mathbb {R}}^{q}\). Moreover, \({\textit{\textbf{z}}}\mapsto {\mathcal {H}}({\textit{\textbf{z}}},\overrightarrow{ \varvec{\psi }})\) is upper semi-continuous, as it is an infimum of affine functions.
Theorem 1
For any \(\phi \in {\mathcal {C}}^{+1}(\Omega )\), \(r_{\phi }\)is a simple single dominating eigenvalue. Moreover, for any \(\overrightarrow{ \varvec{\psi }}\)with \(\psi _{i}\in {\mathcal {C}}^{+1}(\Omega )\), the map \({\mathcal {P}}:{\textit{\textbf{t}}}\mapsto \log r_{{\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}\)is infinitely differentiable.
Furthermore,
-
(1)
\({\mathcal {H}}({\textit{\textbf{z}}},\overrightarrow{ \varvec{\psi }})=-\infty \)if \({\textit{\textbf{z}}}\notin I(\overrightarrow{ \varvec{\psi }})\),
-
(2)
\({\mathcal {H}}({\textit{\textbf{z}}},\overrightarrow{ \varvec{\psi }})\)is finite if \({\textit{\textbf{z}}}= \nabla {\mathcal {P}}({\textit{\textbf{t}}})\)for some \( {\textit{\textbf{t}}}\in {\mathbb {R}}^q\).
We call pressure function for \(\overrightarrow{ \varvec{\psi }}\) the map \({\textit{\textbf{t}}}\mapsto {\mathcal {P}}({\textit{\textbf{t}}})\). In the following \(\nabla {\mathcal {P}}( {\mathbb {R}}^q)\) will denote the set of \({\textit{\textbf{z}}}\) such that \({\textit{\textbf{z}}}= \nabla {\mathcal {P}}({\textit{\textbf{t}}})\) for some \( {\textit{\textbf{t}}}\in {\mathbb {R}}^q\).
1.2.2 Quadratic Pressure
For fixed \(\overrightarrow{ \varvec{\psi }}\in {\mathcal {C}}^{+1}(\Omega )^{q}\) and for \(\beta \ge 0\), \({\textit{\textbf{t}}}\), \({\textit{\textbf{z}}}\) in \({\mathbb {R}}^q\), we set
Notation 1
We set \({\mathcal {H}}_{top}:=\log r_{\mathbf {0}}\).
Definition 1.2
For \(\mu \) in \(M_\sigma (\Omega )\), the entropy is the quantity
Let \(\overrightarrow{ \varvec{\psi }}\in {\mathcal {C}}^{+1}(\Omega )^{q}\) be fixed. The quantity
is referred to as the quadratic pressure function for \(\overrightarrow{ \varvec{\psi }}\).
Remark 2
In [13] the authors define the entropy by setting
and the pressure by setting
where \({\mathcal {X}}(\Omega )\) is a suitable space of potentials \(\Omega \rightarrow {\mathbb {R}}\). We notice that our definition of entropy is the same as theirs with \({\mathcal {X}}(\Omega )={\mathcal {C}}^{+1}(\Omega )\), whereas our definition of pressure is linked to theirs by \({\mathcal {P}}({\textit{\textbf{t}}})=\mathrm{Pr}({\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }})\), where \(\overrightarrow{ \varvec{\psi }}\) is fixed in \({\mathcal {C}}^{+1}(\Omega )^{q}\). We point out that \(\mu \mapsto \widehat{{\mathcal {H}}}(\mu )\) is upper semi-continuous as the infimum over a family of upper semi-continuous functions. \(\square \)
From Theorem 1 we can use the work of Giulietti et al. [13, th.F]. We emphasize that the key point in their work is the spectral decomposition of the transfer operator, which is stated in Theorem 1. Then, we deduce the existence of the DGM which is the unique equilibrium state for \(\phi \).
Stated with our settings, we get that for any \(\overrightarrow{ \varvec{\psi }}\in {\mathcal {C}}^{+1}(\Omega )^{q}\) and for any \(\textit{\textbf{t}}\) there is a unique invariant measure \(\mu _{{\textit{\textbf{t}}}.\overrightarrow{ \varvec{\psi }}}\) which maximizes \(\displaystyle \widehat{{\mathcal {H}}}(\mu )+\int {\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}\,d\mu \). Moreover,
Convexity for the multi-dimensional pressure function \({\textit{\textbf{t}}}\mapsto {\mathcal {P}}({\textit{\textbf{t}}})=\sup _{\mu }\{\widehat{{\mathcal {H}}}(\mu )+\int {\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}\,d\mu \}\) and differentiability obtained from Theorem 1 yield that for every \({\textit{\textbf{t}}}\) and every i, \(\displaystyle \frac{\partial \log r_{{\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}}{\partial t_{i}}=\int \psi _{i}\,d \mu _{{\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}\).
Theorem 2
Equilibrium states for the quadratic pressure For any \(\overrightarrow{ \varvec{\psi }}\in {\mathcal {C}}^{+1}(\Omega )^{q}\)and for any \(\beta \ge 0\)the invariant probability measures which maximize \({\mathcal {P}}_{2}(\beta )\)are the dynamical Gibbs measures \(\mu _{\beta \textit{\textbf{t}}\cdot \overrightarrow{ \varvec{\psi }}}\)where the \(\textit{\textbf{t}}\)’s are the maxima for \(\varphi _{\beta }\).
This result goes in the same direction as the ones from [5, 20]. However, we point out some interesting difference here: in the higher-dimensional case, there may be infinitely many measures which maximize the quadratic pressure. This is actually the case for XY-model (see below Remark 6).
1.2.3 Generalized Curie–Weiss Hamiltonian
For \(\phi \in {\mathcal {C}}^{+1}(\Omega )\), we remind that \(S_{n}(\phi )\) stands for \(\phi +\phi \circ \sigma +\cdots +\phi \circ \sigma ^{n-1}=\sum _{k=0}^{n-1} \phi \circ \sigma ^{k}\). With previous notations, \(S_{n}(\overrightarrow{ \varvec{\psi }})\) is the vector with coordinates \(S_{n}(\psi _{i})\). Then, the Generalized Curie–Weiss Hamiltonian is defined for \(\omega \in \Omega \) by
We define the probabilistic Gibbs measure (PGM for short) \(\mu _{n,\beta }\) on \(\Omega \) by
where \(Z_{n,\beta }\) is the suitable normalization factor.
If \(P_n\), P are probability measures in \(M(\Omega )\), we say that \(P_n\) converges weakly to P if \(\int _\Omega f\,dP_n\rightarrow \int _\Omega f\, dP\) for each f in \({\mathcal {C}}^{0}(\Omega )\). As \({\mathcal {C}}^{+1}(\Omega )\) is dense in \({\mathcal {C}}^{0}(\Omega )\), this is equivalent to \(\int _\Omega f\,dP_n\rightarrow \int _\Omega f\, dP\) for each f in \({\mathcal {C}}^{+1}(\Omega )\).
Theorem 3
Generalized Curie–Weiss model One-dimensional case: if \(q=1\), then the PGM \(\mu _{n,\beta }\)converges weakly to a convex combination of the conformal measures \(\nu _{\beta t\psi }\)associated to the \(\mu _{\beta t\psi }\)’s from Theorem 2as n goes to \(+\infty \).
Higher-dimensional case: for \(q>1\), if \(\varphi _{\beta }\)attains its maximum only on non-degenerate points (i.e., \(d^{2}\varphi _{\beta }\)is invertible at those points), then they are finitely many and the PGM \(\mu _{n,\beta }\)converges weakly to a convex combination of the conformal measures \(\nu _{\beta {\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}\)associated to the \(\mu _{\beta {\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}\)’s, where the \({\textit{\textbf{t}}}\)’s are the maxima for \(\varphi _{\beta }\).
Remark 3
The classical Curie–Weiss–Potts model (see Theorem 2.1 of [10]) is obtained by taking \(E=\{1,\ldots q\}\), \(\psi _{i}=mathbb{1}_{[i]}\) and \(A(i,j)=1\) for every pair (i, j). \(\square \)
We point out that the mean-field XY model (see Sect. 5) is an example where \(\varphi _{\beta }\) attains its maximum on an infinite set of points, and for all of them the Hessian is degenerate.
1.3 Plan of the Paper
In Sect. 2 we prove Theorem 1. As we said above, the main ingredient is to define and study the spectrum of the Transfer Operator. We prove that this operator has a spectral gap and that the spectral radius is a simple isolated dominating eigenvalue. This allows to define the notion of conformal measure.
In Sect. 3 we prove Theorem 2. The main ingredient is to define two auxiliary functions, \(\varphi _{\beta }\) and \(\overline{\varphi }_{\beta }\), to show that one is always bigger than the other one, but both have the same maxima (arising for the same points). The maximal value for \(\overline{\varphi }_{\beta }\) equals the quadratic pressure but maxima for \(\varphi _{\beta }\) are easier to detect. Part of the difficulty in this section comes from our way to define the entropy for measure, as we want to deal with possibly infinite-to-one maps.
In Sect. 4 we prove Theorem 3. The main trick is the Hubbard–Stratonovich formula and then the Laplace method, as in [20].
In Sect. 5 we discuss an application to the mean-field XY model.
2 Proof of Theorem 1
2.1 Properties for the Transfer Operator with Infinite Alphabet
2.1.1 First Spectral Properties: \({\mathcal {L}}_{\phi }\) is Quasi-compact
The function A is continuous thus uniformly continuous (since \(E\times E\) is compact) and with values in \(\{0,1\}\). Therefore, there exists \(\varepsilon _{A}\in \, ]0,1[\) such that for any u, \(u'\), t and t’ in E satisfying \(d(u,u')<\varepsilon _{A}\) and \(d(t,t')<\varepsilon _{A}\),
Lemma 2.1
\({\mathcal {L}}_{\phi }\)acts on \({\mathcal {C}}^{0}(\Omega )\)and on \({\mathcal {C}}^{+1}(\Omega )\).
Proof
Let f be in \({\mathcal {C}}^{0}(\Omega )\). The function \(x\mapsto e^{\phi (tx)}A(t,x_{0})f(tx)\) is continuous on \(\Omega \) and
thus by the dominated convergence theorem \({\mathcal {L}}_{\phi }(f)\) is continuous on \(\Omega \). Moreover \({\mathcal {L}}_{\phi }\) acts continuously on \({\mathcal {C}}^{0}(\Omega )\) with operator norm
Now let f be in \({\mathcal {C}}^{+1}(\Omega )\). Notice that for any t in E and x, y in \(\Omega \),
so that the function \(f_t:x\mapsto f(tx)\) is Lipschitz with \(\mathrm{Lip}(f_t)\le \frac{1}{2}\mathrm{Lip}(f)\). It is easy to show that if \(\phi \) is Lipschitz, then \(e^\phi \) is Lipschitz with
Notice also that for any t in E, the map \(A_t:x\mapsto A(t,x_0)\) is Lipschitz with
As the product of two Lipschitz functions f and g is Lipschitz with
we easily deduce that \({\mathcal {L}}_\phi (f)\) is Lipschitz and that \({\mathcal {L}}_{\phi }\) acts continuously on \({\mathcal {C}}^{+1}(\Omega )\). \(\square \)
Lemma 2.2
The spectral radius on \({\mathcal {C}}^{0}(\Omega )\)satisfies \(\log r_{\phi }=\displaystyle \lim _{{n\rightarrow +\infty }}\frac{1}{n}\log ||{\mathcal {L}}_{\phi }^{n}(mathbb{1})||_{\infty }\).
Proof
We recall that \(r_{\phi }:=\displaystyle \lim _{n\rightarrow +\infty }\Vert {\mathcal {L}}^{n}_{\phi }\Vert _\infty ^{1/n} =\inf _{n\ge 1}\Vert {\mathcal {L}}^{n}_{\phi }\Vert _\infty ^{1/n}\). For any f in \({\mathcal {C}}^{0}(\Omega )\), \(x\in \Omega \),
hence \(\Vert {\mathcal {L}}_{\phi }\Vert _\infty = \Vert {\mathcal {L}}_{\phi }(mathbb{1})\Vert _\infty \). For \(n\in {\mathbb {N}}\),
where t inside the integral stands for \(t=t_0\cdots t_{n-1}\) and \(\rho ^{\otimes n}(dt)=\prod _{i=0}^{n-1}\rho (dt_i)\). Then \(\Vert {\mathcal {L}}_{\phi }^n\Vert _\infty = \Vert {\mathcal {L}}_{\phi }^n(mathbb{1})\Vert _\infty \) for the same reason as for \(n=1\), hence \(r_{\phi }=\displaystyle \lim _{n\rightarrow +\infty }\Vert {\mathcal {L}}^{n}_{\phi }(mathbb{1})\Vert _\infty ^{1/n}\). \(\square \)
As the measure \(\rho \) is of full support and A satisfies the hypothesis (A3) it is easy to show that for any x in \(\Omega \), \({\mathcal {L}}_{\phi }(mathbb{1})(x)\) is strictly positive. One then has that
is a probability measure for any \(\mu \in M(\Omega )\). The map \(\displaystyle \widehat{{\mathcal {L}}}^{*}_{\phi }:M(\Omega )\rightarrow M(\Omega )\) is continuous on the compact (for the weak*-topology) convex space \(M(\Omega )\) therefore by the Schauder-Tychonoff theorem, there exists a probability measure \(\nu _{\phi }\) such that
This measure is either called the conformal measure or the eigen-measure. In the following we set \(\lambda _{\phi }:={\mathcal {L}}^{*}_{\phi }(\nu _\phi )(mathbb{1}) =\displaystyle \int {\mathcal {L}}_{\phi }(mathbb{1})\,d\nu _{\phi }\).
Proposition 2.3
With previous notations \(\displaystyle r_{\phi }=\lambda _{\phi }\). Moreover for any \(x\in \Omega \),
Proof
Lipschitz regularity for \(\phi \) yields that the Bowen condition holds: for any n, for any x and y satisfying \(x_{i}=y_{i}\) for \(i=0,\ldots n-1\),
where \(D=\mathrm{Diam}(\Omega )=\max \{{d_{\Omega }}(x,y); x,y\in \Omega \}\). Indeed if \(x_{i}=y_{i}\) for \(i=0,\ldots n-1\) then \({d_{\Omega }}(\sigma ^k x,\sigma ^k y)=2^k {d_{\Omega }}(x,y)\) for \(0\le k\le n\), so that
Lemma 2.4
For any x, y in \(\Omega \)such that \({d_{\Omega }}(x,y)<\frac{\varepsilon _{A}}{2}\), for any \(n\ge 2\),
Proof
If \({d_{\Omega }}(x,y)<\frac{\varepsilon _{A}}{2}\) then \(d(x_0,y_0)<\varepsilon _A\) so that \(\Omega _n(x_0)=\Omega _n(y_0)\). For any \(a\in \Omega _n(x_0)\), (7) implies that
therefore
and by integrating over \(\Omega _n(x_0)\) one gets
\(\square \)
We pick \(N_{1}\) sufficiently big such that Assumption (A3) holds, that is
Then, we choose \(N_{2}\) sufficiently big such that \(2^{-N_{2}}<\displaystyle \frac{\varepsilon _{A}}{2\mathrm{Diam}(\Omega )}\).
Claim 1
There exists \(C=C(\phi )>0\)such that for every \(n>N_{1}+N_{2}\), for every x and y in \(\Omega \),
Proof of the Claim
We pick x and y in \(\Omega \). We denote by \(\mathbf {t}\) an element \((t_{1},\ldots , t_{N_{2}})\) of \(\Omega _{N_{2}}\) and by \(\mathbf {u}\) and \(\mathbf {v}\) some elements of \(\Omega _{N_{1}}\). We set \(N:=N_{2}+N_{1}\) and \(m:=n-N\).
where we used the identity
Let us set \(m_A=\inf _{t\in E}\rho \left( {B(t,\varepsilon _A)}\right) \), which is positive thanks to hypotheses (H). As \(\Omega _{N_1-1}(u_1,y_0)\ne \emptyset \) we can pick \(\mathbf {u}\) in this set. If \(\mathbf {v}\in \Omega _{N_1}\) is such that \(d(v_1,u_1)<\varepsilon _A\) and \(d(v_i,u_{i-1})<\varepsilon _A\) for every \(2\le i\le N_1\), then \(A(v_{N_1},y_0)=1\), that is \(\mathbf {v}\in \Omega _{N_1}(y_0)\). We deduce that
Therefore
From (7) we deduce that
As \(d_\Omega (\mathbf {t}\mathbf {u} x,\mathbf {t}\mathbf {v} y)=2^{-N_2}d_\Omega (\mathbf {u} x,\mathbf {v} y)<\dfrac{\varepsilon _A}{2}\) we know from (8) that
As \(e^{S_{N_1}(\phi )(\mathbf {u} x)}\le e^{2N_1\Vert \phi \Vert _\infty } e^{S_{N_1}(\phi )(\mathbf {v} y)}\) we deduce eventually that
Exchanging x and y we get the reverse inequality. \(\square \)
We can now finish the proof of Proposition 2.3. First we recall that according to Lemma 2.2, \(\log r_{\phi }=\displaystyle \lim _{{n\rightarrow +\infty }}\frac{1}{n}\log || {\mathcal {L}}_{\phi }^{n}(mathbb{1})||_{\infty }\). As \(\Omega \) is compact there exists \(x_n\in \Omega \) such that \(\Vert {\mathcal {L}}_{\phi }^{n}(mathbb{1})||_{\infty } ={\mathcal {L}}_{\phi }^{n}(mathbb{1})(x_n)\). For any x in \(\Omega \) and \(n>N_1+N_2\) we get from (9) that
therefore
and taking the limit we get (6). Now integrating (10) we get
and since \(\lambda ^{n}_{\phi }=\displaystyle \int {\mathcal {L}}^{n}_{\phi }(mathbb{1})(x)\,d\nu _{\phi }(x)\) we get \(\log \lambda _{\phi }=\log r_{\phi }\). \(\square \)
We claim that we can apply the Ionescu-Tulcea & Marinescu TheoremFootnote 1 (see [16], see also [4], Theorem 4.2 or [11], Theorem 2.1) to get a spectral decomposition of the operator \(\displaystyle \widetilde{{\mathcal {L}}}_{\phi }:=\frac{1}{r_{\phi }}{\mathcal {L}}_{\phi }\). Indeed the spaces \({\mathcal {C}}^{0}(\Omega )\), \({\mathcal {C}}^{+1}(\Omega )\) satisfy the first hypotheses of the ITM Theorem, which is
-
(1)
if \(f_n\in {\mathcal {C}}^{+1}(\Omega )\), \(f\in {\mathcal {C}}^{0}(\Omega )\), \(\lim \limits _{{n}\rightarrow {\infty }}\Vert f_n-f\Vert _\infty =0\), and \(\Vert f_n\Vert _L\le C\) for all n, then \(f\in {\mathcal {C}}^{+1}(\Omega )\) and \(\Vert f\Vert _L\le C\),
and \(\displaystyle \widetilde{{\mathcal {L}}}_{\phi }\) satisfies the three following hypotheses:
-
(2)
\(\sup _{n\in {\mathbb {N}}} \{\Vert \displaystyle \widetilde{{\mathcal {L}}}_{\phi }^n(f)\Vert _\infty ,\,f\in {\mathcal {C}}^{+1}(\Omega ),\,\Vert f\Vert _L\le 1\}<+\infty \),
-
(3)
there exists \(a\in ]0,1[\), \(b>0\) and \(n_{0}\ge 1\) such that for any \(f\in {\mathcal {C}}^{+1}(\Omega )\),
$$\begin{aligned} \Vert \displaystyle \widetilde{{\mathcal {L}}}_{\phi }^{n_0}(f)\Vert _L\le a\Vert f\Vert _L+b\Vert f\Vert _\infty , \end{aligned}$$ -
(4)
if V is a bounded subset of \(({\mathcal {C}}^{+1}(\Omega ),\Vert \cdot \Vert _L)\), then \(\displaystyle \widetilde{{\mathcal {L}}}_{\phi }^{n_0}(V)\) has compact closure in \(({\mathcal {C}}^{0}(\Omega ),\Vert \cdot \Vert _\infty )\).
We sketch the proof of (3) and let the reader check the other conditions.
Proof of (3). A direct computation yields that for f Lipschitz continuous
From (9) we know that for any \(n>N_1+N_2\), for any x in \(\Omega \),
hence as \(r_\phi =\lambda _\phi \) we have
Therefore
where \(A_n=\dfrac{e^C}{2^n}\) and \(B_{n}=\dfrac{e^{n\Vert \phi \Vert _\infty }}{r_\phi ^n}(\mathrm{Lip}(\phi )+2\mathrm{Lip}(A)+1)\). Picking any a in ]0, 1[ and adjusting n such that \(2^{-n}e^{C}<a\) one gets the result. \(\square \)
In particular, and considering \({\mathcal {L}}_{\phi }\) as an operator on \({\mathcal {C}}^{+1}(\Omega )\), the proof of the ITM Theorem shows (see [4, Lem. 4.7], or [11, Lemma 2.4]) that \(r_{\phi }\) is an eigenvalue for \({\mathcal {L}}_{\phi }\) associated to the function in \({\mathcal {C}}^{+1}(\Omega )\) defined by
Furthermore, we have the following decomposition
where the \(\Pi _{j}\)’s are (finitely many) projectors with finite rank, the \(\theta _{j}\)’s are real numbers and \(\Psi \) has spectral radius strictly smaller than 1. Moreover,
2.1.2 Second Decomposition of the Spectrum: \(r_{\phi }\) is the Unique Eigenvalue with Maximal Modulus and Its Eigenspace is of Dimension One
For simplicity we set \(\theta _{0}=0\). We shall see that mixing yields more precise results on the spectral decomposition of \({\mathcal {L}}_{\phi }\).
Lemma 2.5
For any \(x\in \Omega \), \(\displaystyle \bigcup _{n\ge 0}\sigma ^{-n}(\{x\})\)is dense in \(\Omega \).
Proof
Let x and y be in \(\Omega \), let \(\varepsilon >0\). Let \(n\in {\mathbb {N}}\) be such that \(2^{-n}\mathrm{Diam}(\Omega )<\varepsilon \), and let \(z_i=y_i\) for every i in \(\{{0},\ldots ,{n-1}\}\), so that \(d_\Omega (y,z)\le \varepsilon \). According to assumption (A3) there exist \(u_n,\ldots ,u_{n+N-2}\) in E such that \(z:=y_{0}\cdots y_{n-1}u_{n}\cdots u_{n+N-2}x\) belongs to \(\Omega \). Then z belongs to \(\sigma ^{-(n+N-1)}(\{x\})\cap B(y,\varepsilon )\).
Remark 4
Actually, we have proved a better result: for any \(\varepsilon \), there exists \(N'=N'(\varepsilon )\) such that for any y and x, \(B(y,\varepsilon )\cap \sigma ^{-N'}(\{x\})\ne \emptyset \). \(\square \)
Proposition 2.6
The spectral radius \(r_{\phi }\)is a simple single dominating eigenvalue. The rest of the spectrum for \({\mathcal {L}}_{\phi }\)is a compact set strictly inside the disk \({\mathbb {D}}(0,r_{\phi })\).
Proof
Because of the first result on the spectrum of \({\mathcal {L}}_{\phi }\), it remains to prove that \(r_{\phi }\) is simple and that any other eigenvalue has modulus strictly lower than \(r_{\phi }\). For that we use spectral properties of positive operators exposed in [18, chap. 1& 2]. We claim that the set K of non-negative Lipschitz functions is a solid and reproducing cone. Solid means it has non-empty interior and reproducing means
It is easy to see that any positive Lipschitz function is in \({\mathop {{K}}\limits ^{\circ }}\).
\(\bullet \)Step one We prove that for any \(f\not \equiv 0\in K\), there exists p such that \({\mathcal {L}}_{\phi }^{p}(f)\) belongs to \({\mathop {{K}}\limits ^{\circ }}\).
Let y be such that \(f(y)>0\). Let \(\varepsilon >0\) be such that \(d_\Omega (y,y')\le \varepsilon \Longrightarrow f(y')>0\). According to Remark 4, there exists \(p\in {\mathbb {N}}\) such that for any \(x\in \Omega \), \(B(y,\varepsilon )\cap \sigma ^{-p}(\{x\})\ne \emptyset \). Then \({\mathcal {L}}_{\phi }^{p}(f)\) is positive. Indeed let \(x\in \Omega \), and z in \(B(y,\varepsilon )\cap \sigma ^{-p}(\{x\})\). Let \(\eta :=\min (\varepsilon _A,\frac{\varepsilon }{2})\). We remind that \(\varepsilon _{A}\) has been fixed just above formula (4). The definition of \(\varepsilon _{A}\) yields that every t in \(\displaystyle \prod _{i=0}^{p-1}B(z_i, \eta )\) belongs to \(\Omega _p(x_0)\), therefore
But if \(t\in \displaystyle \prod _{i=0}^{p-1}B(z_i,\eta )\) then
hence \(f(tx)>0\). As any non-empty ball in E has positive \(\rho \)-measure we deduce that \({\mathcal {L}}_{\phi }^{p}(f)(x)>0\).
\(\bullet \)Step two End of the proof. We deduce from step one that \({\mathcal {L}}_{\phi }\) is strongly positive (see [18, Definitions 2.1.1]). Therefore it is u-positive for any \(u\in {\mathop {{K}}\limits ^{\circ }}\). From Th. 2.10, 2.11 and 2.13 we deduce that \(r_{\phi }\) is a simple eigenvalue and that every other eigenvalue \(\lambda \) of \({\mathcal {L}}_{\phi }\) satisfies the inequality \(|\lambda |<r_\phi \). \(\square \)
To re-employ notation from above, there is only one \(\Pi _{0}\), no other \(\Pi _{i}\)’s. Furthermore, using the fact that \(\nu _{\phi }\) is an eigenmeasure, one easily gets that for any \(f\in {\mathcal {C}}^{+1}(\Omega )\),
Because \(\Pi _{0}\Psi =\Psi \Pi _{0}=0\), we also immediately get
2.2 Gibbs Measure and Ergodic Properties
2.2.1 The Gibbs Measure and Its Main Properties
Let \(\mu _{\phi }\) be the measure defined by \(d\mu _{\phi }:=G_{\phi }d\nu _{\phi }\). We emphasize that by construction \(\mu _{\phi }\) is a probability measure. We shall use the following fact: for every \(n\in {\mathbb {N}}\),
Lemma 2.7
The measure \(\mu _{\phi }\)is \(\sigma \)-invariant. It is called the Dynamical Gibbs Measure (DGM in short) associated to \(\phi \).
Proof
For f continuous
where we used \({\mathcal {L}}_{\phi }^{*}(\nu _{\phi })=r_{\phi }.\nu _{\phi }\) to get the second equality and (13) to get the third. \(\square \)
Proposition 2.8
The measure \(\mu _{\phi }\)is mixing thus ergodic.
Proof
Let f and g be two functions in \({\mathcal {C}}^{+1}(\Omega )\). Then (12) yields
We have seen that the spectral radius of \(\Psi \) is strictly lower than 1. Therefore \(\displaystyle \Psi ^{n}(f.G_{\phi })\) goes to 0 for the Lipschitz norm, thus for the continuous norm. This yields
and the proposition is proved. \(\square \)
2.2.2 Further Properties
Lemma 2.9
There exists \(C(\phi )\)such that for every x, \(e^{-C(\phi )}\le G_{\phi }(x)\le e^{C(\phi )}\).
Proof
By definition, \(G_{\phi }\ge 0\). Let us prove by contradiction it is positive. Assume that \(G_{\phi }(x)=0\). Then, \({\mathcal {L}}_{\phi }(G_{\phi })=r_{\phi }G_{\phi }\) shows that \(G_{\phi }(tx)=0\) for \(\rho \)-a.e. t in E such that \(A(t,x_{0})>0\). As A and \(G_\phi \) are continuous and \(\rho \) has full support, this yields that for every t such that \(A(t,x_{0})>0\ G_{\phi }(tx)=0\). In other words, for every y in \(\sigma ^{-1}(\{x\})\), \(G_{\phi }(y)=0\). By induction we deduce that for every \(n\in {\mathbb {N}}\), for every z in \(\sigma ^{-n}(\{x\})\), \(G_{\phi }(z)=0\). Now, the set \(\displaystyle \cup _{n\ge 0}\sigma ^{-n}(\{x\})\) is dense, and \(G_{\phi }\) is continuous everywhere and null on a dense set. It is thus null everywhere which is impossible because \(\int G_{\phi }\,d\nu _{\phi }=1\). This shows that \(G_{\phi }\) is positive, thus bounded from below by some constant of the form \(e^{-C(\phi )}\). Furthermore, \(\Omega \) is compact and then \(G_{\phi }\) is bounded from above. \(\square \)
Lemma 2.9 immediately yields
Corollary 2.10
Both measures \(\mu _{\phi }\)and \(\nu _{\phi }\)are equivalent.
2.2.3 Regularity of the Spectral Radius
Proposition 2.11
The map \({ P}:\phi \mapsto \log r_{\phi }\)is convex on \({\mathcal {C}}^{+1}(\Omega )\).
Proof
Let us pick \(\phi _1\), \(\phi _2\) in \({\mathcal {C}}^{+1}(\Omega )\), and \(\alpha \in [0,1]\). Set \(\phi :=\alpha \phi _1+(1-\alpha )\phi _2\). For \(n\in {\mathbb {N}}\), \(x\in \Omega \),
therefore
We deduce from (6) that
which proves the convexity of P. \(\square \)
Let \(\overrightarrow{ \varvec{\psi }}=(\psi _{1},\ldots ,\psi _{q})\in {\mathcal {C}}^{+1}(\Omega )^{q}\). We recall the definition
By definition \(I(\overrightarrow{ \varvec{\psi }})\) is a convex and closed set.
Proposition 2.12
The map \({{\mathcal {P}}}:{\textit{\textbf{t}}}\mapsto \log r_{{\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}\)is convex and infinitely differentiable on \({\mathbb {R}}^q\)with
For any \({\textit{\textbf{z}}}=\nabla {\mathcal {P}}({\textit{\textbf{t}}})\)in \(\nabla {\mathcal {P}}({\mathbb {R}}^q)\), \({\mathcal {H}}(z,\overrightarrow{ \varvec{\psi }})\)is finite with
If \({\textit{\textbf{z}}}\)does not belong to the closure \(\overline{\nabla {\mathcal {P}}({\mathbb {R}}^q)}\)of \(\nabla {\mathcal {P}}({\mathbb {R}}^q)\), in particular when \(z\notin I(\overrightarrow{ \varvec{\psi }})\), then \({\mathcal {H}}({\textit{\textbf{z}}},\overrightarrow{ \varvec{\psi }})=-\infty \).
Proof
The convexity of \({\mathcal {P}}\) follows from Proposition 2.11. The map Q with values in \({\mathcal {L}}({\mathcal {C}}^{+1}(\Omega ))\) defined on \({\mathbb {R}}^q\) by
is infinitely differentiable with
Adapting the proof of Thm. III.8 and Corollary III.11. of [14] we see that the map \({\textit{\textbf{t}}}\mapsto r_{{\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}\) is infinitely differentiable with
from which we deduce (14).
The conjugate function \({\mathcal {P}}^*\) of \({\mathcal {P}}\), defined by
is convex on \({\mathbb {R}}^q\) with values in \(]-\infty ,+\infty ]\). We refer for instance to [25], section 26, for the theory of conjugates of convex functions. In particular it is known that
where \(\mathrm{dom}{\mathcal {P}}^*=\{{\textit{\textbf{z}}}\in {\mathbb {R}}^q,\,{\mathcal {P}}^*({\textit{\textbf{z}}})<+\infty \}\), with
As \({\mathcal {H}}(\cdot ,\overrightarrow{ \varvec{\psi }})=-{\mathcal {P}}^*\) the proof is finished. \(\square \)
3 Proof of Theorem 2
3.1 Auxiliary Functions \(\varphi \) and \(\overline{\varphi }_{\beta }\)
We recall the definitions of \(\varphi _{\beta }\) and \(\overline{\varphi }_{\beta }\) defined on \({\mathbb {R}}^{q}\):
3.1.1 The Function \(\varphi _{\beta }\)
We remind the notation \({\mathcal {H}}_{top}:=\log r_{\mathbf {0}}\).
Lemma 3.1
For every \(\beta >0\)and every \({\textit{\textbf{t}}}\)satisfying \(||{\textit{\textbf{t}}}||>4\Vert \overrightarrow{ \varvec{\psi }}\Vert _\infty \),
Proof
Let us set \(g(x):=\log r_{x\beta {\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}\) with \(x\in [0,1]\). It is differentiable and Prop. 2.12 yields that for every x,
Then, we use the mean value theorem. There exists \(\theta \in ]0,1[\) such that
This yields
Now
and we get the result. \(\square \)
We emphasize an immediate consequence of Lemma 3.1: all the maxima for \(\varphi _{\beta }\) are reached at critical points and inside the hypercube \([-K,K]^{q}\) if K is chosen greater than \(4\Vert \overrightarrow{ \varvec{\psi }}\Vert _\infty \). Indeed \(\varphi _{\beta }(\mathbf{0} )={\mathcal {H}}_{top}\) and if \({\textit{\textbf{t}}}\) is outside the hypercube \([-K,K]^{q}\) with \(K\ge 4\Vert \overrightarrow{ \varvec{\psi }}\Vert _\infty \) then \(||{\textit{\textbf{t}}}||>4\Vert \overrightarrow{ \varvec{\psi }}\Vert _\infty \), which implies \(\varphi _{\beta }({\textit{\textbf{t}}})<{\mathcal {H}}_{top}\).
3.1.2 The Function \(\overline{\varphi }_{\beta }\)
We also recall the definition
From the theory of conjugate functions we know that \({\mathcal {H}}(\cdot ,\overrightarrow{ \varvec{\psi }})\) is concave and upper semi-continous, with values in \([-\infty ,+\infty [\).
We emphasize that Proposition 2.12 yields that \(\overline{\varphi }_{\beta }=-\infty \) outside \({\mathcal {I}}(\overrightarrow{ \varvec{\psi }})\), and \(\overline{\varphi }_{\beta }\) is finite on \(\nabla {\mathcal {P}}({\mathbb {R}}^q)\). Consequently all the maxima for \(\overline{\varphi }_{\beta }\) are reached inside the hypercube \([-\Vert \overrightarrow{ \varvec{\psi }}\Vert _\infty ,\Vert \overrightarrow{ \varvec{\psi }}\Vert _\infty ]^{q}\), which contains \({\mathcal {I}}(\overrightarrow{ \varvec{\psi }})\).
Let us set
Then,
Lemma 3.2
The function \(\widetilde{{\mathcal {H}}}\)is upper semi-continuous on \({\mathbb {R}}^q\).
Proof
Let \({\textit{\textbf{z}}}\) be fixed in \({\mathbb {R}}^q\), let \(({\textit{\textbf{z}}}_{n})\) be a sequence in \({\mathbb {R}}^q\) converging to \({\textit{\textbf{z}}}\). If \({\textit{\textbf{z}}}\) is not in \({\mathcal {I}}(\overrightarrow{ \varvec{\psi }})\) then neither is \({\textit{\textbf{z}}}_n\) for n big enough hence
Let us thus assume that \({\textit{\textbf{z}}}\) is in \({\mathcal {I}}(\overrightarrow{ \varvec{\psi }})\). If only a finite number of \({\textit{\textbf{z}}}_n's\) belong to \({\mathcal {I}}(\overrightarrow{ \varvec{\psi }})\) then
If an infinite number of \({\textit{\textbf{z}}}_n's\) belong to \({\mathcal {I}}(\overrightarrow{ \varvec{\psi }})\) then to compute the limsup we can assume without loss of generality that every \({\textit{\textbf{z}}}_n\) is in \({\mathcal {I}}(\overrightarrow{ \varvec{\psi }})\). Let \(\mu _{n}\) be an invariant measure such that \(\displaystyle \int \overrightarrow{ \varvec{\psi }}\,d\mu _{n}={\textit{\textbf{z}}}_{n}\) and
Let \(\mu \) be any accumulation point for \((\mu _{n})\) for the weak* topology. For simplicity we shall write \(\mu =\lim _{{n\rightarrow +\infty }}\mu _{n}\).
Then, \(\displaystyle \int \overrightarrow{ \varvec{\psi }}\,d\mu =\lim _{{n\rightarrow +\infty }}\int \overrightarrow{ \varvec{\psi }}\,d\mu _{n} =\lim _{{n\rightarrow +\infty }}{\textit{\textbf{z}}}_{n}={\textit{\textbf{z}}}\) and as the metric entropy is upper semi-continuous we get
\(\square \)
Proposition 3.3
For every \({\textit{\textbf{z}}}\)in \({\mathbb {R}}^q\), \(\widetilde{{\mathcal {H}}}({\textit{\textbf{z}}})={\mathcal {H}}({\textit{\textbf{z}}},\overrightarrow{ \varvec{\psi }})\).
Proof
For any \({\textit{\textbf{t}}}\in {\mathbb {R}}^q\) we have
In other words \((-\widetilde{{\mathcal {H}}})^{*}={\mathcal {P}}\). It is easily seen that \(\widetilde{{\mathcal {H}}}\) is concave. By Theorem 12.2 of [25] the biconjugate \((-\widetilde{{\mathcal {H}}})^{**}\) of \(-\widetilde{{\mathcal {H}}}\) is equal to its closure. Then, Lemma 3.2 shows that \(-\widetilde{{\mathcal {H}}}\) is closed convex therefore \(-\widetilde{{\mathcal {H}}}={\mathcal {P}}^*\). As by definition \({\mathcal {P}}^*=-{\mathcal {H}}(\cdot ,\overrightarrow{ \varvec{\psi }})\), we deduce that \(\widetilde{{\mathcal {H}}}={\mathcal {H}}(\cdot ,\overrightarrow{ \varvec{\psi }})\). \(\square \)
Furthermore, note that the definition of \({\mathcal {P}}({\textit{\textbf{t}}})\) and Proposition 3.3 yield
Finally we have:
Corollary 3.4
For every \({\textit{\textbf{z}}}\in {\mathbb {R}}^q\), \(\overline{\varphi }_{\beta }({\textit{\textbf{z}}})=\widetilde{{\mathcal {H}}}({\textit{\textbf{z}}})+\frac{\beta }{2}||z||^{2}\).
3.1.3 Maxima for \(\varphi _{\beta }\) and \(\overline{\varphi }_{\beta }\)
The main result of this Subsection is
Proposition 3.5
Inequality \(\varphi _{\beta }({\textit{\textbf{z}}})\ge \overline{\varphi }_{\beta }({\textit{\textbf{z}}})\)holds for any \({\textit{\textbf{z}}}\)in \({\mathbb {R}}^q\). Moreover \(\varphi _{\beta }({\textit{\textbf{z}}})\)is maximum if and only if \(\overline{\varphi }_{\beta }({\textit{\textbf{z}}})\)is maximum. Furthermore, if \(\varphi _{\beta }({\textit{\textbf{z}}})\)is maximum then \(\varphi _{\beta }({\textit{\textbf{z}}})=\overline{\varphi }_{\beta }({\textit{\textbf{z}}})\).
Proof
-
Step 1. \(\varphi _{\beta }\ge \overline{\varphi }_{\beta }\). We use Equality (17) with \({\textit{\textbf{t}}}=\beta {\textit{\textbf{z}}}\). This yields
$$\begin{aligned} \overline{\varphi }_{\beta }({\textit{\textbf{z}}}){=}\displaystyle {\mathcal {H}}({\textit{\textbf{z}}},\overrightarrow{ \varvec{\psi }})+\frac{\beta }{2}\Vert {\textit{\textbf{z}}}\Vert ^{2} \le \log r_{{\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}-{\textit{\textbf{t}}}.{\textit{\textbf{z}}}+\frac{\beta }{2}||{\textit{\textbf{z}}}||^{2} {=}\log r_{\beta {\textit{\textbf{z}}}\cdot \overrightarrow{ \varvec{\psi }}}-\frac{\beta }{2}||{\textit{\textbf{z}}}||^{2}{=}\varphi _{\beta }({\textit{\textbf{z}}}). \end{aligned}$$ -
Step 2. \(\overline{\varphi }_{\beta }({\textit{\textbf{z}}})\) is maximal if and only if \(\varphi _{\beta }({\textit{\textbf{z}}})\) is maximal and maximal values do coincide.
Let \({\textit{\textbf{z}}}\) be a maximum for \(\varphi _{\beta }\). Then, it is a critical point for \(\varphi _{\beta }\). As
this yields \({\textit{\textbf{z}}}=\nabla {\mathcal {P}}(\beta {\textit{\textbf{z}}})\). Using (15) we get
therefore
Using step 1 and this last equality we get
which shows that \(\overline{\varphi }_{\beta }({\textit{\textbf{z}}})=\varphi _{\beta }({\textit{\textbf{z}}})\).
On the other hand for any \({\textit{\textbf{z}}}'\),
which shows that \({\textit{\textbf{z}}}\) is also a maximum for \(\overline{\varphi }_{\beta }\).
Conversely, if \({\textit{\textbf{z}}}\) is a maximum for \(\overline{\varphi }_{\beta }\), let \({\textit{\textbf{z}}}'\) be any maximum for \(\varphi _{\beta }\). We get
This shows that \({\textit{\textbf{z}}}\) is also a maximum for \(\varphi _{\beta }\), which finishes the proof. \(\square \)
Corollary 3.6
Maxima for \(\overline{\varphi }_{\beta }\)are reached on \(\nabla {\mathcal {P}}({\mathbb {R}}^{q})\).
Proof
Proposition 3.5 states that maxima for \(\overline{\varphi }_{\beta }\) are maxima for \(\varphi _{\beta }\). We have seen after Lemma 3.1 that all the maxima for \(\varphi _{\beta }\) are reached at critical points.
Now, \({\textit{\textbf{t}}}\) is a critical point for \(\varphi _{\beta }({\textit{\textbf{t}}})=-\frac{\beta }{2}\Vert {\textit{\textbf{t}}}\Vert ^2+\log r_{\beta \textit{\textbf{t}}\cdot \overrightarrow{ \varvec{\psi }}}\) means
\(\square \)
3.2 Measures Maximizing Quadratic Pressure
3.2.1 Another Expression for \({\mathcal {P}}_{2}(\beta )\)
We remind that the metric entropy \(\mu \mapsto \widehat{{\mathcal {H}}}(\mu )\) is upper semi-continuous (see Remark 2).
Therefore the function
is upper semi-continuous hence attains it supremum on the compact set \(M_\sigma (\Omega )\).
where the last equality comes from Proposition 3.5 and the fourth equality comes from Corollaries 3.4 and 3.6.
3.2.2 Good DGM Maximize Quadratic Pressure
We note
Let \({\textit{\textbf{z}}}\in M\). We saw in the proof of Proposition 3.5 that \({\textit{\textbf{z}}}\) is then a critical point for \(\varphi _\beta \) hence \({\textit{\textbf{z}}}=\nabla {\mathcal {P}}(\beta {\textit{\textbf{z}}})=\displaystyle \int \overrightarrow{ \varvec{\psi }}\, d\mu _{\beta {\textit{\textbf{z}}}\cdot \overrightarrow{ \varvec{\psi }}}\). From (2) we know that
hence from (15) we deduce that \({\mathcal {H}}({\textit{\textbf{z}}},\overrightarrow{ \varvec{\psi }})=\widehat{{\mathcal {H}}}(\mu _{\beta {\textit{\textbf{z}}}\cdot \overrightarrow{ \varvec{\psi }}}),\) thus
Let \(\mu \) be any measure, \({\textit{\textbf{z}}}':=\int \overrightarrow{ \varvec{\psi }}\, d\mu \). Then
Therefore, \(\mu _{\beta {\textit{\textbf{z}}}\cdot \overrightarrow{ \varvec{\psi }}}\) maximizes F.
3.2.3 Maxima for Quadratic Pressure are Realized only by Good DGM
Conversely let \(\mu \) maximizing the function F. Set \({\textit{\textbf{z}}}:=\displaystyle \int \overrightarrow{ \varvec{\psi }}\,d\mu \). Then \({\textit{\textbf{z}}}\) is in M hence satisfies
and \(\widehat{{\mathcal {H}}}(\mu )=\widetilde{{\mathcal {H}}}({\textit{\textbf{z}}})\). Now using (2) we can write
which means that \(\mu \) is equal to \(\mu _{\beta {\textit{\textbf{z}}}\cdot \overrightarrow{ \varvec{\psi }}}\) by uniqueness of the (linear) equilibrium state.
4 Proof of Theorem 3
4.1 A Useful Computation
Let \(f:\Omega \rightarrow {\mathbb {R}}\) be continuous. We want to evaluate the limit of \(\displaystyle \int f(\omega )d\mu _{n,\beta }(\omega )\) as \(n\rightarrow +\infty \). In the first step we do the computation without the normalizing term \(Z_{n,\beta }\) and estimate it in the second step. We recall the identity
Then we have
where we made the change of variable \(\beta {\textit{\textbf{z}}}=\sqrt{\frac{\beta }{n}}{\textit{\textbf{t}}}\) to get the last equality.
We claim and prove just below that for fixed \(\beta \), the part of the integral in \({\textit{\textbf{z}}}\) outside the hypercube \([-K,K]^{q}\) is negligible with respect to the other part.
On the other hand, we remind (see Inequality (5)) \(\Vert {\mathcal {L}}^{n}_{\beta {\textit{\textbf{z}}}\cdot \overrightarrow{ \varvec{\psi }}}\Vert _\infty \le e^{n\beta \Vert {\textit{\textbf{z}}}\Vert \Vert \overrightarrow{ \varvec{\psi }}\Vert _\infty }\), which yields
for \(\Vert {\textit{\textbf{z}}}\Vert >4\Vert \overrightarrow{ \varvec{\psi }}\Vert _\infty \).
Now, reporting this inequality in the right hand side term of (21) and assuming \(K>4||\overrightarrow{ \varvec{\psi }}||_{\infty }\), one gets
and
Returning to (21) we get
if K is greater than \(4\Vert \overrightarrow{ \varvec{\psi }}\Vert _\infty \).
Now, we recall that
and that if f belongs to \({\mathcal {C}}^{+1}(\Omega )\) then
where the operator norm of \(\Psi _{\beta {\textit{\textbf{z}}}\cdot \overrightarrow{ \varvec{\psi }}}\) acting on \({\mathcal {C}}^{+1}(\Omega )\) is strictly less than one. We write \(\Psi _{\beta {\textit{\textbf{z}}}\cdot \overrightarrow{ \varvec{\psi }}}=e^{-\varepsilon (\beta ,{\textit{\textbf{z}}})}T(\beta ,{\textit{\textbf{z}}})\) where \(\varepsilon (\beta ,{\textit{\textbf{z}}})\) is the spectral gap of the operator \({\mathcal {L}}_{\beta {\textit{\textbf{z}}}\cdot \overrightarrow{ \varvec{\psi }}}\) and \(||T(\beta ,{\textit{\textbf{z}}})||_{L}= 1\). Then
The spectral gap \(\varepsilon (\beta ,{\textit{\textbf{z}}})\) is lower semi-continuous in \({\textit{\textbf{z}}}\) hence, on the compact set \([-K,K]^{q}\), it attains its infimum \(m(\beta )\), which is strictly positive. We set
and notice that for any \({\textit{\textbf{z}}}\) in \([-K,K]^{q}\),
Eventually we get
The normalization term \(Z_{n,\beta }\) is obtained taking \(f\equiv 1\). This yields
where \(\alpha (n,{\textit{\textbf{z}}},f)\) and \(\alpha (n,{\textit{\textbf{z}}},mathbb{1})\) converge uniformly to 0 with respect to \({\textit{\textbf{z}}}\) when n tends to infinity.
4.2 The Case \(q=1\)
In this case the function \(\varphi _\beta \) is analytic hence admits only finitely many maxima, and we can argue as in [20].
4.3 The Higher-Dimensional Case
We assume that all the maxima for \({\varphi _{\beta }}\) are non-degenerate.
Lemma 4.1
The function \(\varphi _{\beta }\)admits only finitely many maxima.
Proof
The proof is done by contradiction. Let us consider a sequence \(({\textit{\textbf{z}}}_{n})\) of maxima for \(\varphi _{\beta }\). We have seen that all the maxima are critical points and are in some compact set \([-K,K]^{q}\) (see Lemma 3.1 and discussion after).
Therefore, we may consider some accumulation point \({\textit{\textbf{z}}}\) for the \({\textit{\textbf{z}}}_{n}\)’s. For simplicity we set \({\textit{\textbf{z}}}=\lim _{{n\rightarrow +\infty }}{\textit{\textbf{z}}}_{n}\) and we assume that \({\textit{\textbf{z}}}_{n}\ne {\textit{\textbf{z}}}_{n+1}\) holds for every n. Note that by continuity, \({\textit{\textbf{z}}}\) is also a critical point for \({\varphi _{\beta }}\) and \({\varphi _{\beta }}\) is maximal at \({\textit{\textbf{z}}}\).
We remind that \(\varphi _{\beta }\) is \({\mathcal {C}}^{\infty }\). If we consider the restriction \({\varphi _{\beta }}_{n}\) of \(\varphi _{\beta }\) to each segment \([z_{n},z_{n+1}]\), then \({\varphi _{\beta }}_{n}\) is \({\mathcal {C}}^{\infty }\) and \({\varphi _{\beta }}'_{n}({\textit{\textbf{z}}}_{n})={\varphi _{\beta }}'_{n}({\textit{\textbf{z}}}_{n+1})=0\). Hence, Rolle’s theorem shows that there exists \({\textit{\textbf{z}}}'_{n}\in [{\textit{\textbf{z}}}_{n},{\textit{\textbf{z}}}_{n+1}]\) such that
Set \(\mathbf {u}_{n}:={\textit{\textbf{z}}}_{n}-{\textit{\textbf{z}}}_{n+1}\) and consider any accumulation point \(\mathbf {u}\) for \(\displaystyle \frac{{\textit{\textbf{z}}}_{n}-{\textit{\textbf{z}}}_{n+1}}{||{\textit{\textbf{z}}}_{n}-{\textit{\textbf{z}}}_{n+1}||}\). Equality (26) can be rewritten under the form
which yields as \({n\rightarrow +\infty }\ d^{2}{\varphi _{\beta }}({\textit{\textbf{z}}})(\mathbf {u},\mathbf {u})=0\). This means that \({\textit{\textbf{z}}}\) is a degenerate maximal point for \({\varphi _{\beta }}\), which is in contradiction with our assumption. \(\square \)
Let \({\textit{\textbf{z}}}_1,\ldots ,{\textit{\textbf{z}}}_k\) be the points where \(\varphi _\beta \) attains its maximum. We recall that the Laplace method (see [27, Ch.IX Th.3]) states
provided that \(\varphi _{\beta }\) admits no other critical point than \({\textit{\textbf{z}}}_{1}\) in an open set O of \({\mathbb {R}}^q\), that \(g({\textit{\textbf{z}}}_1)\ne 0\) and that the Hessian matrix \(d^{2}\varphi _{\beta }({\textit{\textbf{z}}}_1)\) is negative definite (which holds by our assumption).
Remark 5
We emphasize the assumption \(g({\textit{\textbf{z}}}_{1})\ne 0\). \(\square \)
We choose K such that \(\varphi _\beta ({\textit{\textbf{z}}}_{1})+\beta \frac{K^2}{4}>0\), and letting \(n\rightarrow +\infty \) in (25), we get that for every f in \({\mathcal {C}}^{+1}(\Omega )\),
which finishes the proof of Theorem 3.
5 Application to the Mean-Field XY Model
5.1 The Cosine Potential
The mean-field XY model is a system of n globally coupled planar spins (or alternatively of n globally interacting particles constrained on a ring), with Hamiltonian
where \(p_i\in [0,2\pi [\). We can interpret it as a generalized Curie–Weiss model by setting \(E={\mathbb {T}}=\{z\in {\mathbb {R}}^2,\Vert z\Vert =1\}\), \(\Omega ={\mathbb {T}}^{{\mathbb {N}}}\), and \(\overrightarrow{ \varvec{\psi }}(\omega )=\omega _0\). Indeed every \(\omega _k\) in the word \(\omega =\omega _0\omega _1\cdots \) of \(\Omega \) is uniquely expressed as \(\omega _k=(\cos \theta _k,\sin \theta _k)\) with \(\theta _k\) in \([-\pi ,\pi [\), and then
We endow \({\mathbb {T}}\) with the usual distance on \({\mathbb {R}}^2\), and the Haar measure \(\rho \) given by
As \(\overrightarrow{ \varvec{\psi }}\) only depends on the first coordinate, we see that for any \({\textit{\textbf{t}}}\) in \({\mathbb {R}}^2\) and any f in \({\mathcal {C}}^{0}(\Omega )\),
so that the spectral radius of \({\mathcal {L}}_{\beta {\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}\) is
with eigenfunction \(G_{\beta {\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}=mathbb{1}\), and \(\nu _{\beta {\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}=\mu _{\beta {\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}\). We notice that \(r_0=1\). If \({\textit{\textbf{t}}}\ne 0\), we denote by \(|{\textit{\textbf{t}}}|\) its Euclidean norm and by \(\theta _{\textit{\textbf{t}}}\) the unique element of \([-\pi ,\pi [\) such that \({\textit{\textbf{t}}}=|{\textit{\textbf{t}}}| \mathbf {u}_{\theta _{{\textit{\textbf{t}}}}}\). Then
because the integral does not depend on the interval of length \(2\pi \) where we compute it. Eventually we have
where \(I_0\) is the modified Bessel function of order zero, and we get
This shows that \(\varphi _{\beta }({\textit{\textbf{t}}})\) is constant on all the circles centered in 0.
Remark 6
Unless \(\varphi _{\beta }\) is maximal only at 0, which does not hold for every \(\beta \) as we will see below, we have here an example where all the maxima of the auxiliary function are degenerate. \(\square \)
We set
The equality (25) becomes
where we replaced for convenience the square \([-K,K]^2\) by the disk B(0, K). We are thus led to study the asymptotic behaviour of the integral
In polar coordinates we write \({\textit{\textbf{z}}}=r\mathbf {u}_\theta \) and we get
For \(x \in {\mathbb {R}}_+\), we denote by \(\eta _{x}\) the mean value of DGM’s defined by
for any bounded measurable h, so that
which is then a one-dimensional Laplace integral. We study the maximum of the function \(\phi _\beta \) on \({\mathbb {R}}_+\). First we notice that \(0\le I_0(\beta x)\le \beta x\) and \(I_0(0)=1\), hence
from which we deduce that \(\underset{{\mathbb {R}}_+}{\max } \,\phi _\beta =\underset{[0,2[}{\max }\,\phi _\beta \). Next we look for the critical points of \(\phi _\beta \) on [0, 2[. We compute the first and second derivatives
We notice that \(\phi _\beta '(x)\le \beta (1-x)\), from which we deduce that \(\underset{{\mathbb {R}}_+}{\max }\,\phi _\beta =\underset{[0,1]}{\max }\,\phi _\beta \). As \(I_0'(0)=0\) we know that \(\phi _\beta '(0)=0\). We compute
We shall thus consider three cases: \(\beta >2\), \(\beta =2\), and \(\beta <2\). First we take a closer look at the critical points of \(\phi _\beta \). We recall that the Bessel function \(I_0\) satisfies the differential equation (we refer for instance to [3] for information about Bessel functions)
so that
Replacing in (30) we get that for every x,
Now from (29) we know that r is a critical point of \(\phi _\beta \) if and only if
If r is such a point then replacing in (32) we get
Case \(\beta >2\): In this case \(\phi _\beta ''(0)>0\) hence 0 is not a maximum point. We claim that \(\phi \) has a unique maximum and that it belongs to \(\,]\sqrt{\frac{\beta -2}{\beta }},1]\).
We denote by \(r_1<\cdots <r_m\) the m points of ]0, 1] where \(\phi _\beta \) attains its maximum M on \({\mathbb {R}}_+\). Then every \(r_k\) satisfies \(\phi _\beta '(r_k)=0\) and \(\phi _\beta ''(r_k)\le 0\). Remember that every critical point r satisfies (33) which we rewrite
As \(\phi _\beta ''(r_k)\le 0\) we deduce that \(r_1\ge \sqrt{\frac{\beta -2}{\beta }}\). We observe that any critical point r strictly bigger than \(r_{1}\) satisfies \(\phi _\beta ''(r)<0\), which means r is a local maximum for \(\phi _\beta \). Now, if \(m\ge 2\) then between \(r_1\) and \(r_2\) there must be a local minimum which is also a critical point. This yields a contradiction. Therefore, \(m=1\) and \(r_{1}\) is the unique critical point for \(\phi \).
Let us show that \(r_1> \sqrt{\frac{\beta -2}{\beta }}\). Indeed if \(r_1=\sqrt{\frac{\beta -2}{\beta }}\) then \(\phi _\beta '(r_1)=\phi _\beta ''(r_1)=0\). From (29) we know that
Replacing in (32) we get that
which is a differential equation satisfied by \(\phi _\beta \). When we differentiate this equality we get that
We deduce that \(\phi _\beta '''(r_1) =-2\beta ^2 r_1\) is strictly negative, therefore \(r_1\) can not be a maximum, which yields a contradiction. Hence \(r_1 >\sqrt{\frac{\beta -2}{\beta }}\) holds and this finishes to prove the claim.
Now we are ready to conclude the case \(\beta >2\). We apply the Laplace method to the integral I(n, f) and we find that
hence
Case \(\beta <2\): In this case \(\phi '_\beta (0)=0\) and \(\phi _\beta ''(0)<0\) hence 0 is a local maximum of \(\phi _\beta \). The equation (33) tells us that every critical point r satisfies
which is strictly negative, therefore every critical point is a local maximum. The same argument than above shows that \(\phi _\beta \) attains its maximum at 0 and only at 0.
As the maximum is reached at 0, we cannot directly apply the Laplace method as it is emphasized in Remark 5. We then use the following lemma. It is a special version of the Laplace method and can be found in [8].
Lemma 5.1
Let \(\alpha \) and \(\gamma \) be two positive real numbers. Then, for any sequence \((b_{n})\) such that \(nb_{n}^{\alpha }\rightarrow +\infty \)
Proof
Just set \(u=nx^{\alpha }\). \(\square \)
We apply Lemma 5.1 for \(\displaystyle \int _0^{b_{n}}e^{n\phi _{\beta }(r)}\left( {\int _{\Omega } f\,d\eta _{\beta r}}\right) \,r\,dr\), with \(b_{n}=1/\root 4 \of {n}\). Because \(b_{n}\rightarrow 0\), we can get \(\phi _{\beta }(r)=-\phi _{\beta }''(0)r^{2} +O(r^{3})\) on \([0,b_{n}]\). Note that for \(r\in [0,b_{n}]\), by continuity for the eigen-measures \(\nu _{\beta {\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}\) for operators \({\mathcal {L}}_{{\textit{\textbf{t}}}\cdot \overrightarrow{ \varvec{\psi }}}\) we also have.
A computation shows that \(\displaystyle \int _{b_{n}}^{b}e^{n\phi _{\beta }(r)}\left( {\int _{\Omega } f\,d\eta _{\beta r}}\right) \,r\,dr\) is of order less than \(e^{-nb_{n}^{2}/2}\) if b is chosen small but positive such that \(\phi _{\beta }(r)\le -\phi ''_{\beta }(0)\frac{r^{2}}{2}\) on [0, b]. Then, \(nb_{n}^{2}\rightarrow +\infty \) yields that this quantity is exponentially small (in n). Because 0 is the unique maximum for \(\phi _{\beta }\), \(\displaystyle \int _{b_{n}}^{b}e^{n\phi _{\beta }(r)} \left( {\int _{\Omega } f\,d\eta _{\beta r}}\right) \,r\,dr\) is of order less than \(e^{-n\varepsilon (b)}\) with \(\varepsilon (b)>0\).
Hence we get
hence
Case \(\beta =2\): In this case \(\phi '_\beta (0)=\phi _\beta ''(0)=0\) and every critical point \(r\ne 0\) is a local maximum since it satisfies
which is strictly negative. We deduce, as above, that there exists only one maximum, which may be 0 or not. If it is 0, then we conclude as before but we have to pick an higher order for the derivative (and use Lemma 5.1 with \(\alpha \ge 4\)). If it is not 0, then we conclude the computation as above.
We conclude that for every \(\beta >0\), the sequence of measures \((\mu _{n,\beta })_{n\in {\mathbb {N}}}\) weakly converges to \(\eta _{\beta r}\) where r is the unique point at which \(\phi _\beta \) reaches its maximum on \({\mathbb {R}}^+\).
Notes
ITM Theorem in short.
References
Baraviera, A.T., Cioletti, L.M., Lopes, A.O., Mohr, J., Souza, R.R.: On the general one-dimensional \(XY\) model: positive and zero temperature, selection and non-selection. Rev. Math. Phys. 23(10), 1063–1113 (2011)
Bowen, R.: Equilibrium States and the Ergodic Theory of Anosov. Diffeomorphisms Lecture Notes in Mathematics, vol. 470. Springer, Berlin (1975)
Bowman, F.: Introduction to Bessel Functions. Dover Publications, New York (1958)
Broise, A.: Transformations dilatantes de l’intervalle et théorèmes limites. Astérisque, (238) 1–109 (1996). Études spectrales d’opérateurs de transfert et applications
Buzzi, J., Leplaideur, R.: Nonlinear thermodynamical formalism (2020)
Cioletti, L., Lopes, A.O.: Interactions, Specifications, DLR probabilities and the Ruelle Operator in the One-Dimensional Lattice. ArXiv e-prints (2014)
Cioletti, L., Lopes, A.O.: Phase transitions in one-dimensional translation invariant systems: a Ruelle operator approach. J. Stat. Phys. 159(6), 1424–1455 (2015)
Dieudonné, J.: Calcul infinitésimal 2e Edition. Herman (1980). ISBN: 2-7056-5907-2
Ellis, R.S.: Entropy, Large Deviations, and Statistical Mechanics. Classics in Mathematics. Springer, Berlin (2006). Reprint of the 1985 original
Ellis, R.S., Wang, K.: Limit theorems for the empirical vector of the Curie-Weiss-Potts model. Stoch. Process. Appl. 35(1), 59–79 (1990)
Frank Norman, M.: Markov Processes and Learning Models. Mathematics in Science and Engineering, vol. 84. Academic Press, New York (1972)
Friedli, S., Velenik, Y.: Statistical Mechanics of Lattice Systems. A Concrete Mathematical Introduction. Cambridge University Press, Cambridge (2018)
Giulietti, P., Kloeckner, B.R., Lopes, A.O., Marcon, D.: The calculus of thermodynamical formalism. J. Eur. Math. Soc. 20, 2357–2412 (2018)
Hennion, Hubert: Hervé, Loïc: Limit Theorems for Markov Chains and Stochastic Properties of Dynamical Systems by Quasi-compactness. Lecture Notes in Mathematics, vol. 1766. Springer, Berlin (2001)
Hiriart-Urruty, J.B., Lemaréchal, C.: Fundamentals of Convex Analysis. Springer, Berlin (2001)
Ionescu Tulcea, C.T., Marinescu, G.: Théorie ergodique pour des classes d’opérations non complètement continues. Ann. Math. (2) 52, 140–147 (1950)
Jahnel, B., Külske, C.: A class of nonergodic interacting particle systems with unique invariant measure. Ann. Appl. Probab. 24(6), 2595–2643 (2014)
Krasnosel’skiĭ, M.A.: Positive solutions of operator equations. Translated from the Russian by Richard E. Flaherty; Boron, Leo F., (ed.) P. Noordhoff Ltd. Groningen (1964)
Külske, C., Opoku, A.A.: The posterior metric and Goodness of Gibbsianness for transforms of Gibbs measures. Electron. J. Probab. 13, 1307–1344 (2008)
Leplaideur, R., Watbled, F.: Generalized Curie-Weiss model and quadratic pressure in ergodic theory. Bull. Soc. Math. France 147(2), 197–219 (2019)
Lopes, A.O., Mengue, J.K., Mohr, J., Souza, R.R.: Entropy and variational principle for one-dimensional lattice systems with a general a priori probability: positive and zero temperature. Ergodic Theory Dyn. Syst. 35(6), 1925–1961 (2015)
Maes, C., Shlosman, S: Rotating states in driven clock- and XY-models J. Stat. Phys. 144, Article number 1238 (2011)
Ruelle, D.: Thermodynamic Formalism. Cambridge Mathematical Library. The Mathematical Structures of Equilibrium Statistical Mechanics, 2nd edn. Cambridge University Press, Cambridge (2004)
Sinaĭ, Y.: Theory of Phase Transitions: Rigorous Results, volume 108 of International Series in Natural Philosophy. Pergamon Press, Oxford, (1982). Translated from the Russian by J. Fritz, A. Krámli, P. Major and D. Szász
Tyrrell Rockafellar, R.: Convex Analysis. Princeton Landmarks in Mathematics. Princeton University Press, Princeton, NJ (1997). Reprint of the 1970 original, Princeton Paperbacks
van Enter, A.C.D., Külske, C., Opoku, A.A.: Discrete approximations to vector spin models. J. Phys. A 44, 47 (2011)
Wong, R.: Asymptotic Approximations of Integrals, volume 34 of Classics in Applied Mathematics. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA (2001). Corrected reprint of the 1989 original
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Aernout van Enter.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
F. Watbled thanks the IRMAR, CNRS UMR 6625, University of Rennes 1, for its hospitality.
The authors thank the Centre Henri Lebesgue ANR-11-LABX-0020-01 for creating an attractive mathematical environment.
Rights and permissions
About this article
Cite this article
Leplaideur, R., Watbled, F. Curie–Weiss Type Models for General Spin Spaces and Quadratic Pressure in Ergodic Theory. J Stat Phys 181, 263–292 (2020). https://doi.org/10.1007/s10955-020-02579-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10955-020-02579-z
Keywords
- Thermodynamic formalism
- Equilibrium states
- Curie–Weiss model
- Spin spaces
- Curie–Weiss–Potts model
- Gibbs measure
- Phase transition
- XY model