1 Introduction

1.1 General Background

In this paper, we deal with the notion of phase transitions. More precisely, adopting the Dynamical System viewpoint, we study the shape of the graph for the pressure function and (non-)equivalence between analyticity and presence of several equilibrium states.

It is noteworthy that depending on the viewpoint (Statistical Mechanics, Probability Theory or Dynamical Systems), the settings, the questions and the interests concerning phase transitions are different. Furthermore, and this may be source of confusion, the several viewpoints share the same terminology and vocabulary for sensibly different objects or notions.

In Statistical Mechanics and in Probability Theory, one usually considers lattices with interaction energy between the sites. Often, the geometry of the lattice and the decay of correlation of interactions are the main issues. In Dynamical Systems, one usually consider one-dimensional lattice \(\Gamma ^{{\mathbb Z}}\) (with natural \({\mathbb Z}\)-actions) and the main issue is the regularity of the potential considered as a function on \(\Gamma ^{{\mathbb Z}}\) (see e.g. Sect. 2.4.4 in [25] for a discussion on that point).

The definition of phase transition also naturally depends on the viewpoint. The Ising model (see e.g. [10, 13]) and the Potts model are studied in Probability Theory, either in mean fields case (see [1, 4, 11]), or via percolation theory (see [3, 15]). In both cases, a phase transition means the co-existence of several probability measures resolving or resulting from some optimization.

In Physics (see e.g. [25] Sect. 2.6.5), a phase transition can either mean a singularity of some thermodynamic quantity (Ehrenfest classification) or a change in the number of macro states (Gibbs classification). A more recent definition involves the Gâteaux differentiability of the pressure function (see e.g. [8, 9, 14, 20, 22]).

The topic is actually relatively new in Dynamical Systems (compared to Statistical mechanics and Probability Theory), and one usually considers that a phase transition occurs when the pressure function stops to be analytic (see [5, 7, 16, 19]).

Our main result (Theorem B) is that both definitions (regularity of the pressure vs co-existence of several equilibria) are not equivalent. Of course, a first-order phase transition, that is when the pressure function is not \({\mathcal C}^{1}\), yields co-existence of several equilibria. But we show here that the converse is not true: there are mixing systems such that the pressure function is analytic on some interval despite the existence of several equilibria simultaneously.

Theorem A deals with the possible shape of the graph of the pressure function when a phase transition occurs. It turns out that most of the known examples of phase transitions in Dynamical Systems are “freezing” phase transitions, that is that the pressure function is eventually affine. Such transitions are known in Statistical Mechanics as the Fisher–Felderhof models (see e.g. [12] for a one-dimensional lattice case). Physically, this means that for some positive temperature, the system reaches its/one ground state and then stops to change.

It was thus natural to inquire about the possibility to get phase transitions in Dynamical Systems which are not freezing, that is, that after the transition the pressure is non-flat. This is the purpose of our Theorem A.

It is not clear whether there exist dictionaries between the different viewpoints to study phase transition. We hope that beyond the mathematical problems we solve here, the present paper could help clarify similarities and differences between the viewpoints.

On the other hand, we believe that several viewpoints is also source of fruitful transfer of knowledge. The thermodynamic formalism was introduced in Dynamical Systems during the 1970s, mostly by Sinai, Ruelle and Bowen (see [2, 2224]), initially for hyperbolic systems. Over the years, mathematicians became more interested in developing this formalism for less regular systems, also motivated by its applications to dimension theory and multifractal analysis. As a result, the natural questions in mathematics became further remote from physicists’ interests. Due to the development of the ergodic optimization around 2000, mathematicians gradually rediscovered notions already known to physicists, such as e.g. the ground states.

We believe that the mathematical tools already developed in the thermodynamic formalism by mathematicians can be useful to physicists. Our way to prove existence of phase transition here is based on the study of some inducing scheme and on operator theory. This theory, which includes conformal measures, eigen-functions and subactions (for the zero temperature case) could be used in physics.

1.2 General Settings

We consider a subshift of finite type \(\Sigma \) on a finite alphabet \({\mathcal A}\). An element of \({\mathcal A}\) can either be called a letter, or a digit or a symbols. Different alphabets will be considered depending on the theorem.

A point x in \(\Sigma \) is a sequence \(x_{0},x_{1},\ldots \) (also called an infinite word) where \(x_{i}\) are letters of \({\mathcal A}\). Admissible transitions are given by an oriented graph. Two cases will be considered (see Figs. 17). Most of the times we shall use the notation \(x=x_{0}x_{1}x_{2}\ldots \).

The distance between two points \(x=x_{0}x_{1}\ldots \) and \(y=y_{0}y_{1}\ldots \) is given by

$$\begin{aligned} d(x,y)=\frac{1}{2^{\min \{n,\ x_{n}\ne y_{n}\}}}. \end{aligned}$$

A finite string of symbols \(x_{0}\ldots x_{n-1}\) is also called a word, of length n. For a word w, its length is |w|. A cylinder (of length n) is denoted by \([x_{0}\ldots x_{n-1}]\). It is the set of points y such that \(y_{i}=x_{i}\) for \(i=0,\ldots n-1\).

Fig. 1
figure 1

Dynamics for Theorem A

If i is a digit in \({\mathcal A}\), \(x=i^{n}*\) means that \(\displaystyle x=\underbrace{i\ldots i}_{n\text { digits}}j\) where j is any digit \(\ne i\) such that ij is admissible in \(\Sigma \). In other words, this means that \(x\in [\underbrace{i\ldots i}_{n\text { digits}}]{\setminus }[\underbrace{i\ldots i}_{n+1\text { digits}}]\).

The alphabet will depend on some integer parameter L. It will be either

$$\begin{aligned} \{1_{0},1_{1},1_{2},\ldots , 1_{L},2,3,4\}\text { or }\{1_{0},1_{1},1_{2},\ldots , 1_{L},2,3,4, 3',4'\}, \end{aligned}$$

and L may be equal to 0. For simplicity we set \(1_{0}=:1\).

Let us consider positive real numbers \(\alpha \), \(\gamma \), \(\delta \) and \(\varepsilon \) considered as fixed parameters. The potential \(\phi \) is defined by

$$\begin{aligned} \phi (x)={\left\{ \begin{array}{ll} -\alpha <0\text { if }x_{0}\in \{1,1_{1},1_{2},\ldots , 1_{L}\}\\ -\log \left( \frac{n+1}{n}\right) \text { if } x=2^{n}*,\ 1\leqslant n\leqslant +\infty \\ \gamma -\varepsilon \log \left( \frac{n+1}{n}\right) \text { if } x_{0}=3\text { or } 3' \text { and } n=\min \{j\geqslant 1,\ x_{j}=2\},\\ \gamma +\delta -\varepsilon \log \left( \frac{n+1}{n}\right) \text { if } x_{0}=4\text { or }4' \text { and } n=\min \{j\geqslant 1,\ x_{j}=2\}. \end{array}\right. } \end{aligned}$$

Hence, if \(x=2*\), \(\phi (x)=-\log 2\). If \(x=3\displaystyle \underbrace{x_{1}\ldots x_{n-2}}_{n-2\text { digits} =3,4}32\ldots \), then \(\phi (x)=\displaystyle \gamma -\varepsilon \log \left( \frac{n+1}{n}\right) \). If \(x=4\displaystyle \underbrace{x_{1}\ldots x_{n-2}}_{n-2\text { digits} =3,4}32\ldots \), then \(\phi (x)=\displaystyle \gamma +\delta -\varepsilon \log \left( \frac{n+1}{n}\right) \). By convention \(\displaystyle \frac{+\infty +1}{+\infty }=1\), which defines \(\phi (x)\) for x in e.g. [3] with no digit 2. Note that \(\phi \) is continuous.

We recall that for \(\beta \geqslant 0\), the pressure function \({\mathcal P}(\beta )\) is defined by

$$\begin{aligned} {\mathcal P}(\beta )=\max _{\mu \ T-inv}\left\{ h_{\mu }+\beta \int \phi \,d\mu \right\} , \end{aligned}$$

where \(h_{\mu }\) is the Kolmogorov entropy of the measure \(\mu \). We refer the reader to [2] for classical results on thermodynamic formalism of the shift. A measure for which the maximum is attained in the above equality is called an equilibrium state for \(\beta .\phi \). Existence of an equilibrium state simply follows from the continuity of \(\phi \) and the upper semi-continuity of \(\mu \mapsto h_{\mu }\).

Definition 1

We say that \({\mathcal P}(\beta )\) (or equivalently that the potential \(\phi \)) has a phase transition (at \(\beta _{c}\)) if \(\beta \mapsto {\mathcal P}(\beta )\) is not analytic at \(\beta =\beta _{c}\).

1.3 Results

We first consider the case \(\displaystyle {\mathcal A}=\{1,1_{1},1_{2},\ldots , 1_{L},2,3,4\}\). The transitions are given by Fig. 1. This gives a “butterfly” with two wings, each one tending to be autonomous. Both wings \(\{1,1_{1},\ldots , 1_{L}\}^{{\mathbb N}}\) and \(\{3,4\}^{{\mathbb N}}\) are full shifts of finite type. The unique exit letters from the wings are respectively 1 and 3, and any transition from one wing to the other must pass through 2. Digits 1 and 3 are also the unique entrance digits into the wings.

We emphasize that the system is irreducible but has several subsystems. In particular we shall consider \(\Sigma _{34}:=\{3,4\}^{{\mathbb N}}\subset \Sigma \) and \(\Sigma _{234}:=\{2,3,4\}^{{\mathbb N}}\cap \Sigma \), the restriction of \(\Sigma \) to the invariant set of infinite words containing only the letters 2, 3 or 4. For the same potential \(\phi \), we shall consider the associated pressure functions, \({\mathcal P}_{34}(\beta )\) and \({\mathcal P}_{234}(\beta )\). We leave it to the reader to check that \({\mathcal P}_{34}(\beta )=\gamma .\beta +\log (1+e^{\delta .\beta })\).

Theorem A

There exist two positive real numbers \(\beta _{1}< \beta _{c}\) such that \({\mathcal P}(\beta )\) and \({\mathcal P}_{234}(\beta )\) have a phase transition at \(\beta _{c}\) and \(\beta _{1}\) respectively. More precisely (see Fig. 2),

  1. (1)

    the pressure function \({\mathcal P}_{34}(\beta )\) is analytic and strictly convex,

  2. (2)

    for \(\beta <\beta _{c}\), the pressure function \({\mathcal P}(\beta )\) is analytic and satisfies \({\mathcal P}(\beta )>{\mathcal P}_{34}(\beta )\),

  3. (3)

    for \(\beta \geqslant \beta _{c}\), the pressure function \({\mathcal P}(\beta )\) satisfies \({\mathcal P}(\beta )={\mathcal P}_{34}(\beta )\),

  4. (4)

    for \(\beta <\beta _{1}\), the pressure function \({\mathcal P}_{234}(\beta )\) is analytic and satisfies \({\mathcal P}_{234}(\beta )>{\mathcal P}_{34}(\beta )\),

  5. (5)

    for \(\beta \geqslant \beta _{1}\), the pressure function \({\mathcal P}_{234}(\beta )\) satisfies \({\mathcal P}_{234}(\beta )={\mathcal P}_{34}(\beta )\).

For \(\beta \leqslant \beta _{1}\), there is a unique equilibrium state for \(\Sigma _{234}\) and it has full support in \(\Sigma _{234}\).

For \(\beta <\beta _{c}\), there is a unique equilibrium state for \(\Sigma \) and it has full support. For \(\beta >\beta _{c}\) there is a unique equilibrium state and it has support in \(\Sigma _{34}\).

For \(\beta =\beta _{c}\), there are two ergodic equilibrium states for \(\Sigma \) if and only if \(\varepsilon \beta _{c}>2\). This inequality can be realized (depending of the parameters \(\alpha ,\varepsilon ,\gamma \) and \(\delta \)) or non-realized.

At each phase transition (\(\beta =\beta _{1}\) or \(\beta =\beta _{c}\)) the entropy is positive.

The pressure is differentiable at the critical value if and only if \(\varepsilon .\beta _{c}\leqslant 2\).

Fig. 2
figure 2

Different graphes

As it was said above, the main motivation for Theorem A was to build phase transitions with a non-flat pressure function after the transition. Actually such an example was already known in [6]. However, in that case, the map is a skew product over a Horseshoe and the potential, the logarithm of the derivative in the central direction (equivalent to the direction of fibers). Then, when it is projected onto the Horseshoe, the potential is not a function anymore. Hence, that phase transition cannot be realized as a continuous potential defined on a subshift of finite type.

To be complete concerning the shape of the pressure, we also have to mention [16]. There, they also show that the pressure may be non-flat after the transition. However, and this is the main difference, in their construction they always need some interval where the pressure is flat. In their words, for this interval the system is transient, which means that it has no conformal measure. In our example, at any \(\beta \) there exists a conformal measure and the pressure is always strictly convex.

Let us now present the main result of the paper. As we said above, the regularity of the pressure function is an indicator of the uniqueness of the equilibrium state. This regularity can be understood in two different ways.

First, regularity of the map \(\beta \mapsto {\mathcal P}(\beta )\) is studied. Conversely, non-regularity yields different order of phase transitions as was seen above.

Another point is the Gâteaux-differentiability of the functional \({\mathcal P}(\phi +.)\) on \({\mathcal C}(\Sigma )\). In [26, Cor.2], it is shown that Gâteaux-differentiability at \(\phi \) is equivalent to uniqueness of the equilibrium state for \(\phi \). Of course the latter point is more subtle than the regularity of \({\mathcal P}(\beta )\). It was usually expected that the analyticity for \({\mathcal P}(\beta )\) would insure the uniqueness of the equilibrium state. We prove here that this is not the case.

Theorem B

There exist irreducible subshifts of finite type and continuous potentials such that their pressure function is analytic on some interval \(]\beta _{c}',+\infty [\) but there coexist several equilibrium states.

There exist an infinite-dimensional space of functions \(\varphi \) such that for every \(\beta >\beta _{c}'\), \(\varphi \mapsto {\mathcal P}(\phi +\varphi )\) is Gâteaux-differentiable in the \(\varphi \)-direction.

We recall that a function f is Gâteaux-differentiable at x in the direction y if

$$\begin{aligned} \lim _{t\rightarrow 0}\frac{f(x+ty)-f(x)}{t} \end{aligned}$$

exists.

1.4 Overview of the Paper. Main Tool. Heuristic Explanation of the Phase Transitions

1.4.1 Overview of the Paper

The main tool is the notion of local equilibrium state as it was introduced in [17] and developed in further works by the author. We briefly recall below the principal points of that theory.

In Sect. 2 we prove most of the results concerning Theorem A and the phase transition for \({\mathcal P}(\beta )\). In Sect. 3 we prove the “small” phase transition for \(\Sigma _{234}\) and finish the proof of Theorem A. In Sect. 4, we prove Theorem B. To do so, we add a new wing \(\{3',4'\}^{{\mathbb N}}\) copying the wing \(\{3,4\}^{{\mathbb N}}\). We show that the effect of this addition is just to move \(\beta _{c}\) but does not change that fact that \({\mathcal P}(\beta )\) eventually equals \({\mathcal P}_{34}(\beta )\). But \({\mathcal P}_{34}(\beta )={\mathcal P}_{3'4'}(\beta )\) and then two equilibria coexist, one in \(\Sigma _{34}\), the other in \(\Sigma _{3'4'}\).

1.4.2 Local Equilibrium

We refer to [18] for a synthesis concerning the notion of local equilibrium. The principal points are the following.

Consider a cylinder J and denote by \(\tau _{J}(x)\) the first return time into the cylinder J. Let g denote the first return map into J, i.e., \(g(x):=\sigma ^{\tau _{J}(x)}(x)\). Note that it is not defined everywhere but the inverse branches are well-defined. We recall \(\phi +\ldots +\phi \circ \sigma ^{n-1}\) is denoted by \(S_{n}(\phi )\). Given \(Z \in {\mathbb R}\), the Induced Transfer Operator on the cylinder J for the first return time map and for the potential

$$\begin{aligned} \beta \phi (x)+\cdots +\beta \phi \circ \sigma ^{\tau _{J}(x)-1}(x)-\tau _{J}(x)Z, \end{aligned}$$

is defined by

$$\begin{aligned} {\mathcal L}_{Z,\beta ,J}(\psi )(x):=\sum _{y: \, g(y)=x}e^{S_{\tau _{J}(y)}(\beta .\phi )(y)-Z \tau _{J}(y)}\psi (y), \end{aligned}$$

where \(\psi :J\rightarrow {\mathbb R}\) is continuous.

The main point is that for every \(\beta \), there exists \(Z_{c}(\beta )\) such that \({\mathcal L}_{Z,\beta ,J}\) is well-defined for \(Z>Z_{c}(\beta )\) (and sometimes also for \(Z=Z_{c}(\beta )\)) and \(\displaystyle {\mathcal L}_{Z,\beta ,J}({1}\mathrm{l}_{J})(x)\) diverges for every x and \(Z<Z_{c}(\beta )\). For these Z’s, if \(\lambda _{Z,\beta ,J}\) denotes the spectral radius of \({\mathcal L}_{Z,\beta ,J}\), then, the existence of a global equilibrium state is related to the value \(\displaystyle \lim _{Z\downarrow Z_{c}(\beta )}\lambda _{Z,\beta ,J}=:\lambda _{c}(\beta )\):

  1. (1)

    If \(\lambda _{c}(\beta )>1\), there exists a unique equilibrium state for \(\beta .\phi \) and it has full support.

  2. (2)

    If \(\lambda _{c}(\beta )<1\), no equilibrium state for \(\beta .\phi \) gives positive weight to J.

  3. (3)

    The case \(\lambda _{c}=1\) is the critical one. There exists an equilibrium state with full support if and only if \({\mathcal L}_{Z_{c}(\beta ),\beta ,J}(\tau _{J})<+\infty \).

This machinery works if \(\phi \) is such that the induced potential \(\displaystyle \beta \phi (x)+\cdots +\beta \phi \circ \sigma ^{\tau _{J}(x)-1}(x)-\tau _{J}(x)Z\) has bounded variations with respect to the induced map g. We shall check this point. More precisely, we shall also choose J properly such that the computation of \(\lambda _{Z,\beta ,J}\) is easy. In particular we will consider J satisfying \({\mathcal L}_{Z,\beta ,J}({1}\mathrm{l}_{J})(x)=\lambda _{Z,\beta ,J}\) for every x in J.

1.4.3 Why Do Phase Transitions Arise

Continuing with the same notations, we can show that \(Z_{c}(\beta )\leqslant {\mathcal P}(\beta )\) (see [18, Prop.2.2]). Then, roughly speaking, there exists an equilibrium state \(\mu _{\beta }\) for \(\beta .\phi \) satisfying \(\mu _{\beta }(J)>0\) if and only if \(\lambda _{{\mathcal P}(\beta ),\beta ,J}=1\). If this later condition does not hold, then an equilibrium state has support in the dotted system, that is, the set of trajectories which avoid the cylinder J.

In our case, we emphasize that \(\phi \) is very negative on the left wing, lightly negative on [2] and extremely positive on the right wing. For \(\beta =0\), the entropy makes the difference, but as \(\beta \) increases, entropy is not sufficient to balance the sign of \(\beta .\phi \). At some point, the potential is too positive in the dotted system and too negative on the left wing to compensate the difference of entropy. This gives the phase transition at \(\beta _{c}\) where the pressure of subsystem \(\Sigma _{234}\) equals the global pressure.

Choosing J properly, we show that this also happens for \(\Sigma _{34}\) with respect to \(\Sigma _{234}\) at \(\beta _{1}\).

Remark 1

This is e.g. exactly what happens for the Manneville–Pomeau map and the Hofbauer potential. \(\square \)

2 The Phase Transition for \({\mathcal P}(\beta )\)

2.1 The Induced Operator on [1]

We study the induced transfer operator on the cylinder [1]. Denote by \(\tau _{[1]}(x)\) the first return time into the cylinder [1] and by g the first return map. We set

$$\begin{aligned} {\mathcal L}_{Z,\beta ,[1]}(\psi )(x):=\sum _{y: \, g(y)=x}e^{S_{\tau _{[1]}(y)}(\beta .\phi )(y)-Z \tau _{[1]}(y)}\psi (y). \end{aligned}$$

Trajectories returning to [1] after having left it are of the following form:

  1. (1)

    They leave [1], visit \(\{1_{1},\ldots , 1_{L} \}\) for a while and then come back into [1].

  2. (2)

    They leave [1], visit [2] for a while and then come back into [1].

  3. (3)

    They leave [1], visit [2] for some time, visit \(\{3,4\}\), and more generally, alternate visit to [2] and to the right hand side wing, and eventually come back to [1] after a last visit to [2].

A counting argument yields

$$\begin{aligned} {\mathcal L}_{Z,\beta ,[1]}({1}\mathrm{l}_{[1]})(x)= & {} \sum _{n=1}^{+\infty }e^{-n\beta \alpha -nZ+(n-1)\log L}+ \sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\beta }e^{-nZ}\cdot e^{-\alpha \beta -Z}\nonumber \\&\times \left[ \sum _{k=0}^{+\infty }\left( \sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\beta }e^{-nZ}\,\cdot \sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\varepsilon \beta }L_{n}(\beta ,[3])e^{-nZ}\right) ^{k}\right] ,\nonumber \\ \end{aligned}$$
(1)

where

$$\begin{aligned} L_{n}(\beta ,[3])=\sum _{w\in \{3,4\}^{n},w_{0}=w_{n-1}=3}e^{\beta S_{n}(\phi )(w2)}. \end{aligned}$$
(2)

The first summand corresponds to the first kind of trajectories. They are of the form \(x=1x_2x_3 \dots x_{n-1} 1\) with \(x_i \in \{1_{1},\ldots , 1_{L}\}\). For those points we have that \(\beta \phi (x)+\cdots +\beta \phi \circ \sigma ^{\tau _{[1]}(x)-1}(x)-\tau _{[1]}(x)Z= -n\beta \alpha -nZ\). Moreover, since the system restricted to \(\{1,1_{1},\ldots , 1_{L}\}\) is a full-shift on L symbols, for each n there are exactly \(L^{n-1}\) different words of that form. Hence the factor \((n-1)\log L\).

The second term concerns points of the form \(x=12^{n}1\dots \). In this case \(e^{-\beta \alpha -Z }\) takes into account the potential at x and the contribution of the string of 2’s yields the term \(\displaystyle \left( \frac{1}{n+1}\right) ^{\beta }\).

Finally, trajectories of the third kind correspond to words of the form

$$\begin{aligned} 1(\text {string of 2's})\left( \text {intermittence of eligible strings of 3's or 4's and strings of 2's}\right) 1\ldots \end{aligned}$$

Note that an eligible string of 3’s and 4’s starts and finishes by 3, and such a string is necessarily followed by a string of 2’s before returning to 1. Moreover, we emphasize that \(\phi \) is constant on cylinders of the form \([\omega 2]\) where \(\omega \) is an eligible word of finite length of 3’s and 4’s. The sum over k in (1) is for k visits (and possibly \(k=0\) for trajectories of the second form) to the right hand side wing.

Now, if \(\omega =\omega _{0}\ldots \omega _{n-1}\) is a word of length n with k letters equal to 4 such that \(\omega 2\) is eligible, then \(S_{n}(\phi )(\omega 2)\) is equal to \(\displaystyle n\beta +k\delta \) and \(k\leqslant n-2\) because \(\omega _{0}=\omega _{n-1}=3\). Therefore, the computation for trajectories in the right hand side wing gives

$$\begin{aligned} L_{n}(\beta ,[3])= & {} e^{n\beta \gamma }\sum _{k=0}^{n-2}e^{k\beta \delta }{n-2\atopwithdelims ()k}\nonumber \\= & {} e^{n\beta \gamma }(1+e^{\beta \delta })^{n-2}=\frac{\left( e^{\gamma \beta }+e^{(\gamma +\delta )\beta }\right) ^{n}}{(1+e^{\delta \beta })^{2}}= \frac{1}{(1+e^{\delta \beta })^{2}}e^{n{\mathcal P}_{34}(\beta )}. \end{aligned}$$
(3)

Let us set for simplicity

$$\begin{aligned} \Sigma _{1}=\Sigma _{1}(Z,\beta ):= & {} \sum _{n=1}^{+\infty }e^{-n\beta \alpha -nZ+(n-1)\log L}\\ \Sigma _{2}=\Sigma _{2}(Z,\beta ):= & {} \sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\beta }e^{-nZ}\\ \Sigma _{3}=\Sigma _{3}(Z,\beta ):= & {} \frac{1}{(1+e^{\delta \beta })^{2}}\sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\varepsilon \beta }e^{n({\mathcal P}_{34}(\beta )-Z)}. \end{aligned}$$

With these notations we get that for every x in [1]

$$\begin{aligned} {\mathcal L}_{{Z,\beta ,[1]}}({1}\mathrm{l}_{[1]})(x)= & {} \Sigma _{1}+\Sigma _{2}e^{-\alpha \beta -Z}\sum _{k=0}^{+\infty }\big (\Sigma _{2}\Sigma _{3}\big )^{k}\\= & {} \Sigma _{1}+\frac{\Sigma _{2}e^{-\alpha \beta -Z}}{1-\Sigma _{2}\Sigma _{3}}. \end{aligned}$$

2.2 Critical Value \(Z_{c}(\beta )\) for \({\mathcal L}_{{Z,\beta ,[1]}}\)

The quantity \({\mathcal L}_{{Z,\beta ,[1]}}({1}\mathrm{l}_{[1]})\) is well-defined if and only if

$$\begin{aligned} \Sigma _{1}<+\infty ,\end{aligned}$$
(4a)
$$\begin{aligned} \Sigma _{2}\Sigma _{3}<1. \end{aligned}$$
(4b)

Condition (4a) is satisfied if only if \(Z>\log L-\alpha \beta \).

Let us now study Condition (4b). First, we emphasize that a necessary condition is \(Z\geqslant {\mathcal P}_{34}(\beta )\). Now we have:

Lemma 2

There exists a maximal \(\beta _{1}>0\) such that the implicit equation

$$\begin{aligned} \Sigma _{2}\Sigma _{3}=1 \text { with constraint } Z\geqslant {\mathcal P}_{34}(\beta ) \end{aligned}$$

admits a (unique) solution \(\widetilde{Z}_{c}(\beta )\) for every \(0\leqslant \beta \leqslant \beta _{1}\) (See Fig. 3).

Fig. 3
figure 3

Existence of critical \(\beta _{1}\)

Proof

Both \(\Sigma _{2}\) and \(\Sigma _{3}\) are decreasing in Z for fixed \(\beta \), which yields uniqueness (if it exists) of \(\widetilde{Z}_{c}(\beta )\) satisfying

$$\begin{aligned} \Sigma _{2}(Z,\beta )\Sigma _{3}(Z,\beta )=1. \end{aligned}$$

Note that both \(\Sigma _{2}\) and \(\Sigma _{3}\) go to 0 if Z goes to \(+\infty \). Then, the existence of \(\widetilde{Z}_{c}(\beta )\) follows from the value of \(\Sigma _{2}({\mathcal P}_{34}(\beta ),\beta )\Sigma _{3}({\mathcal P}_{34}(\beta ),\beta )\). If it is larger than 1 then \(\widetilde{Z}_{c}(\beta )\) exits, if it is smaller than 1, then \(\widetilde{Z}_{c}(\beta )\) does not exist. Now,

$$\begin{aligned} \Sigma _{2}({\mathcal P}_{34}(\beta ),\beta )\Sigma _{3}({\mathcal P}_{34}(\beta ),\beta )=\sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\beta }e^{-n{\mathcal P}_{34}(\beta )}\frac{\zeta (\varepsilon \beta )-1}{(1+e^{\delta \beta })^{2}}, \end{aligned}$$

\(\beta \mapsto {\mathcal P}_{34}(\beta )\) increases in \(\beta \), thus \(\beta \mapsto \Sigma _{2}({\mathcal P}_{34}(\beta ),\beta )\Sigma _{3}({\mathcal P}_{34}(\beta ),\beta )\) decreases in \(\beta \). It goes to \(+\infty \) if \(\beta \) goes to 0 and to 0 if \(\beta \) goes to \(+\infty \).

Therefore, there exists a unique \(\beta _{1}\) such that

$$\begin{aligned} \Sigma _{2}({\mathcal P}_{34}(\beta _1),\beta _1)\Sigma _{3}({\mathcal P}_{34}(\beta _1),\beta _1)=\sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\beta _1}e^{-n{\mathcal P}_{34}(\beta _1)}\frac{\zeta (\varepsilon \beta _1)-1}{(1+e^{\delta \beta _1})^{2}}=1 \end{aligned}$$
(5)

Remark 2

We point out that \(\varepsilon \beta _{1}>1\) because the Zeta function in (5) converges. \(\square \)

Note that for every \(\beta <\beta _{1}\), \(\widetilde{Z}_{c}(\beta )>{\mathcal P}_{34}(\beta )\) and it is given by the following implicit formula

$$\begin{aligned} \sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\beta }e^{-n\widetilde{Z}_{c}(\beta )}\frac{1}{(1+e^{\delta \beta })^{2}}\sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\varepsilon \beta }e^{-n(\widetilde{Z}_{c}(\beta )-{\mathcal P}_{34}(\beta ))}=1. \end{aligned}$$

It shows that \(\widetilde{Z}_{c}(\beta )\) is analytic in \(\beta \) for \(\beta <\beta _{1}\).

As a consequence of the proof of Lemma 2, it follows that:

  1. (1)

    for every \(0\leqslant \beta <\beta _{1}\)

    1. (a)

      \(\widetilde{Z}_{c}(\beta )> {\mathcal P}_{34}(\beta )\),

    2. (b)

      \(\Sigma _{2}\Sigma _{3}<1\) for \(Z>\widetilde{Z}_{c}(\beta )\),

    3. (c)

      \(\Sigma _{2}\Sigma _{3}>1\) for \({\mathcal P}_{34}(\beta )\leqslant Z<\widetilde{Z}_{c}(\beta )\),

    4. (d)

      \(\Sigma _{2}\Sigma _{3}=1\) for \(Z=\widetilde{Z}_{c}(\beta )\).

  2. (2)

    For every \(\beta >\beta _{1}\), Condition (4b) holds for every \(Z\geqslant {\mathcal P}_{34}(\beta )\).

  3. (3)

    For \(\beta =\beta _{1}\) (4b) holds for every \(Z> {\mathcal P}_{34}(\beta _{1})\).

Remark 3

We will see below that \(\widetilde{Z}_{c}(\beta )={\mathcal P}_{234}(\beta )\). See also [18, Th 1] \(\square \)

The function \(\beta \mapsto \log L-\alpha \beta \) is decreasing and goes to \(-\infty \) if \(\beta \rightarrow +\infty \), whereas \({\mathcal P}_{34}(\beta )\) increases to \(+\infty \). Therefore, there eventually exists \(\beta _{2}\geqslant 0\) such that \({\mathcal P}_{34}(\beta )\geqslant \log L-\alpha \beta \) if and only if \(\beta \geqslant \beta _{2}\). Consequently (see Fig. 6, p. 16),

  1. (1)

    if \(\beta _{2}\geqslant \beta _{1}\), then \(Z_{c}(\beta )=\log L-\alpha .\beta \) for \(\beta \leqslant \beta _{2}\) and \(Z_{c}(\beta )={\mathcal P}_{34}(\beta )\) for \(\beta >\beta _{2}\).

  2. (2)

    if \(\beta _{2}< \beta _{1}\) then \(Z_{c}(\beta )=\log L-\alpha \beta \) for some (possibly empty) interval \([0,\widetilde{\beta }]\), then \(Z_{c}(\beta )=\widetilde{Z}_{c}(\beta )\) for \(\widetilde{\beta }\leqslant \beta \leqslant \beta _{1}\) and \(Z_{c}(\beta )= {\mathcal P}_{34}(\beta )\) for \(\beta \geqslant \beta _{1}\).

2.3 Spectral Radius of \({\mathcal L}_{{Z,\beta ,[1]}}\)

We emphasize that the induced potential is constant on cylinders for the induced map on [1]. Consequently it satisfies the Bowen condition (H2) of [18]. Moreover, for every x in [1], \({\mathcal L}_{Z,\beta ,[1]}({1}\mathrm{l}_{[1]})(x)\) is equal to the spectral radius \(\lambda _{Z,\beta ,[1]}\):

$$\begin{aligned} \lambda _{{Z,\beta ,[1]}}={\mathcal L}_{{Z,\beta ,[1]}}({1}\mathrm{l}_{[1]})(x)=\Sigma _{1}+\frac{\Sigma _{2}e^{-\alpha \beta -Z}}{1-\Sigma _{2}\Sigma _{3}}. \end{aligned}$$
(6)

We are interested by the level curve \(\lambda _{{Z,\beta ,[1]}}=1\) because it partially determines the implicit function \(Z={\mathcal P}(\beta )\).

Lemma 3

There exists \(\beta _{c}>\max (\beta _{1},\beta _{2})\) such that for every \(\beta <\beta _{c}\), there exists a unique \(Z_{c}(\beta )<Z=Z(\beta )\) such that \(\lambda _{{Z(\beta ),\beta ,[1]}}=1\) (see Fig. 4).

Moreover, for every \(\beta >\beta _{c}\) and for every \(Z\geqslant Z_{c}(\beta )={\mathcal P}_{34}(\beta )\), \(\lambda _{{Z,\beta ,[1]}}<1\).

Fig. 4
figure 4

Implicit curve for spectral radius equal to 1, with \(L=0\)

Proof

We emphasize that \(\Sigma _{1}\), \(\Sigma _{2}\) and \(\Sigma _{3}\) are decreasing in Z for fixed \(\beta \). They go to 0 as Z goes to \(+\infty \).

Therefore, the existence of a Z such that \(\lambda _{Z,\beta ,[1]}=1\) follows from the value of

$$\begin{aligned} F(\beta ):=\lim _{Z\downarrow Z_{c}(\beta )}\lambda _{Z,\beta ,[1]}. \end{aligned}$$

\(\bullet \) Let us first consider the case \(L=0\). Then, \(Z_{c}(\beta )=\widetilde{Z}_{c}(\beta )\) for \(\beta \leqslant \beta _{1}\) and \(Z_{c}(\beta )={\mathcal P}_{34}(\beta )\) for \(\beta \geqslant \beta _{1}\). Let us first consider the case \(\beta \leqslant \beta _{1}\). For a fixed \(\beta \), if \(Z\downarrow \widetilde{Z}_{c}(\beta )\), \(\lambda _{{Z,\beta ,[1]}}\) increases to \(+\infty \) because \(\Sigma _{2}\Sigma _{3}\) goes to 1. On the other hand, if Z goes to \(+\infty \), \(\lambda _{{Z,\beta ,[1]}}\) goes to 0.

Therefore, there exists a unique Z such that \(\lambda _{{Z,\beta ,[1]}}=1\).

Let us now assume \(\beta >\beta _{1}\). Then,

$$\begin{aligned} F(\beta ):=e^{-\alpha \beta -{\mathcal P}_{34}(\beta )}+\frac{\sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\beta }e^{-n{\mathcal P}_{34}(\beta )}e^{-\alpha \beta -{\mathcal P}_{34}(\beta )}}{1-\frac{1}{(1+e^{\delta \beta })^{2}}\sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\beta }e^{-n{\mathcal P}_{34}(\beta )}\sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\varepsilon \beta }} \end{aligned}$$
(7)

and the existence of a solution for \(\lambda _{{Z,\beta ,[1]}}=1\) is thus a consequence of the inequality \(F(\beta )\geqslant 1\). As \(\beta \mapsto {\mathcal P}_{34}(\beta )\) increases, we claim that \(F(\beta )\) decreases in \(\beta \). If \(\beta \downarrow \beta _{1}\), \(\Sigma _{2}\Sigma _{3}\) goes to 1 and then \(F(\beta )\) goes to \(+\infty \). If \(\beta \rightarrow +\infty \), \(F(\beta )\rightarrow 0\).

Consequently, there exists a unique \(\beta _{c}\), such that \(F(\beta _{c})=1\) (see Fig. 5). We have \(\beta _{c}>\beta _{1}\) because \(\lim _{\beta \downarrow \beta _{1}}F(\beta )=+\infty \).

For \(\beta <\beta _{c}\), the implicit equation \(\lambda _{{Z,\beta ,[1]}}=1\) has a unique solution, \(Z={\mathcal P}(\beta )\). For \(\beta >\beta _{c}\) there are no solutions.

\(\bullet \) The case \(L\geqslant 1\). In that case \(\Sigma _{1}\) is equal to

$$\begin{aligned} \Sigma _{1}:=\sum _{n=1}^{+\infty }e^{-n\beta .\alpha -nZ+(n-1)\log L}=\frac{e^{-\beta .\alpha -Z}}{1-e^{-\beta .\alpha -Z+\log L}}, \end{aligned}$$
(8)

and it decreases in \(\beta \) (for fixed Z), decreases in Z (for fixed \(\beta \)) and diverges if \(Z=\log L-\beta .\alpha \). The shape of \(F(\beta )\) is the same as in the previous case, whatever the relative positions of \(\beta _{1}\) and \(\beta _{2}\) are : \(F(\beta )=+\infty \) for every \(\beta \leqslant \max (\beta _{1},\beta _{2})\) and then \(F(\beta )\) decreases for \(\beta \geqslant \max (\beta _{1},\beta _{2})\) because for these \(\beta \)’s \(Z_{c}(\beta )={\mathcal P}_{34}(\beta )\). Therefore, there exists a unique \(\beta _{c}\) such that \(F(\beta _{c})=1\). \(\square \)

Fig. 5
figure 5

The graph of \(F(\beta )\)

2.4 Thermodynamic Formalism for \(\beta <\beta _{c}\)

If \(\beta <\beta _{c}\) we use Th. 4, Th. 2 and Remark 3 in [18]. The pressure function \({\mathcal P}(\beta )\) is given by an implicit function inside the interior of the domain of analyticity in both variables Z and \(\beta \) for \(\lambda _{Z,\beta ,[1]}\). Analyticity for \(\beta \mapsto {\mathcal P}(\beta )\) for \(\beta <\beta _{c}\) thus follows from analyticity of implicit function (see e.g. [21]).

By construction, \({\mathcal P}(\beta )>Z_{c}(\beta )\geqslant {\mathcal P}_{34}(\beta )\) (see Fig. 6).

Fig. 6
figure 6

Principal curves

We point out that increasing L does not affect the existence of \(\beta _{c}\) but may only shift the level curve \(\lambda _{Z,\beta ,[1]}=1\) up, and therefore increase \(\beta _{c}\) (see Fig. 6, p. 16). As a by-product, this shows that \(\varepsilon \beta _{c}\) can be made as big as wanted if L is increased.

Lemma 4

\(\displaystyle \lim _{\beta \rightarrow \beta _{c}^{-}}{\mathcal P}(\beta )={\mathcal P}_{34}(\beta _{c})\).

Proof

Remember that \({\mathcal P}(\beta )\) is given by the implicit formula \(\lambda _{{Z,\beta ,[1]}}=1\). On the other hand, for every \(\beta _{1}<\beta <\beta _{c}\) and \(Z={\mathcal P}_{34}(\beta )\) we set

$$\begin{aligned} \lambda _{{Z,\beta ,[1]}}=F(\beta ). \end{aligned}$$

Furthermore, \(F(\beta )>1\) for \(\beta <\beta _{c}\) and it goes to 1 if \(\beta \rightarrow \beta _{c}\) (by definition of \(\beta _{c}\)).

As for any fixed \(\beta \), \(Z\mapsto \lambda _{{Z,\beta ,[1]}}\) decreases, for \(\beta =\beta _{c}\), \(Z={\mathcal P}_{34}(\beta )\) is the unique solution for

$$\begin{aligned} \lambda _{{Z,\beta ,[1]}}=1 \quad \text { (see Fig. 5)}, \end{aligned}$$

thus \({\mathcal P}(\beta _{c})= {\mathcal P}_{34}(\beta _{c})\). \(\square \)

2.5 Thermodynamic Formalism for \(\beta >\beta _{c}\) and Number of Equilibrium States at \(\beta _{c}\)

Due to [18, Th. 4] no equilibrium state can give a positive weight to [1] for \(\beta >\beta _{c}\).

The next lemma shows that an equilibrium cannot give a positive weight to the left hand side wing without giving weight to [1].

Lemma 5

If \(\mu \) is an equilibrium state for \(\beta .\phi \) and \(\mu ([1])=0\), then \(\mu ([1_{1}])=\ldots =\mu ([1_{L}])=0\).

Proof

Assume \(\mu \) is an equilibrium state for \(\beta .\phi \) and \(\mu ([1])=0\). By considering an ergodic component of \(\mu \), we can assume that \(\mu \) is ergodic.

Then, if \(\mu ([1_{1}]\cup \ldots \cup [1_{L}])>0\), by ergodicity, \(\mu ([1_{1}]\cup \ldots \cup [1_{L}])=1\). As the potential \(\phi \) is constant on \([1_{1}]\cup \ldots \cup [1_{L}]\), \(\mu \) is the measure with maximal entropy supported in \(\{1_{1},\ldots ,1_{L}\}^{{\mathbb N}}\) and \({\mathcal P}(\beta )=\log L-\beta \alpha \).

In that case, the measure of maximal entropy supported in \(\{1,1_{1},\ldots ,1_{L}\}^{{\mathbb N}}\) has a pressure \(\log (L+1)-\beta \alpha >{\mathcal P}(\beta )\) which is in contradiction with the definition of the pressure. \(\square \)

Consequently, for \(\beta >\beta _{c}\), any equilibrium state has its support in \(\{2,3,4\}^{{\mathbb N}}\cap \Sigma \). The thermodynamic formalism for \(\beta >\beta _{c}\) is thus a consequence of the results in Sect. 3.

At the transition, \(\beta =\beta _{c}\), \({\mathcal P}(\beta _{c})={\mathcal P}_{34}(\beta _{c})\) follows from Lemma 4, which yields that the unique equilibrium state in \(\Sigma _{34}\) for \(\beta .\phi \) is an equilibrium state for the global system. Then, existence of an equilibrium state giving positive weight to [1] is related to the condition \(\displaystyle {\mathcal L}_{{\mathcal P}_{34}(\beta _{c}),\beta _{c},[1]}(\tau _{[1]})<+\infty \) (see again [18, Th. 4]). A simple computation shows

$$\begin{aligned} {\mathcal L}_{{\mathcal P}_{34}(\beta _{c}),\beta _{c},[1]}(\tau _{[1]})=\displaystyle \left| \frac{\partial {\mathcal L}_{{Z,\beta ,[1]}}({1}\mathrm{l}_{[1]})(x)}{\partial Z}_{|Z={\mathcal P}_{34}(\beta _{c})}\right| . \end{aligned}$$

Hence we have

Proposition 6

If \(\varepsilon \beta _{c}>2\), then there are at least two ergodic equilibrium states for \(\beta =\beta _{c}\) and only one of them has full support. If \(\varepsilon \beta _{c}\leqslant 2\), then no equilibrium state for \(\beta =\beta _{c}\) gives positive weight to [1].

Proof

We recall that for every x in [1]

$$\begin{aligned} \lambda _{{Z,\beta ,[1]}}={\mathcal L}_{{Z,\beta ,[1]}}({1}\mathrm{l}_{[1]})(x)=\Sigma _{1}(Z,\beta )+\frac{\Sigma _{2}(Z,\beta )e^{-\alpha \beta -Z}}{1-\Sigma _{2}(Z,\beta )\Sigma _{3}(Z,\beta )}. \end{aligned}$$

Therefore we have to compute \(\displaystyle \frac{\partial \lambda _{{Z,\beta ,[1]}}}{\partial Z}{}_{|Z={\mathcal P}_{34}(\beta )}\). Using the chain rule for computation of the derivative of product function, \(\displaystyle \frac{\partial \lambda _{{Z,\beta ,[1]}}}{\partial Z}{}_{|Z={\mathcal P}_{34}(\beta )}\) involves \(\Sigma _{1}\) \(\Sigma _{2}\), \(\Sigma _{3}\), \(\displaystyle \frac{\partial \Sigma _{1}}{\partial Z}\), \(\displaystyle \frac{\partial \Sigma _{2}}{\partial Z}\) and \(\displaystyle \frac{\partial \Sigma _{3}}{\partial Z}\).

All these terms are series, and more precisely power series in \(e^{-Z}\). Therefore, \(\displaystyle \frac{\partial \lambda _{{Z,\beta ,[1]}}}{\partial Z}{}_{|Z={\mathcal P}_{34}(\beta )}\) converges if and only if all these series converge.

We have already seen that at the transition, \(Z_{c}(\beta )={\mathcal P}_{34}(\beta _{c})>\log L-\beta _{c}.\alpha \) which yields the convergence of \(\Sigma _{1}\) and \(\displaystyle \frac{\partial \Sigma _{1}}{\partial Z}{}_{|Z={\mathcal P}_{34}(\beta )}\). We have also seen that for \(\beta >\beta _{c}>\beta _{1}\), \(\Sigma _{2}\Sigma _{3}<1\) which yields convergences for both \(\Sigma _{2}\) and \(\Sigma _{3}\). We also claim that \(\displaystyle \frac{\partial \Sigma _{2}}{\partial Z}{}_{|Z={\mathcal P}_{34}(\beta _{c})}\) converges because \({\mathcal P}_{34}(\beta _{c})>0\).

Therefore, the global convergence is equivalent to the convergence of \(\displaystyle \frac{\partial \Sigma _{3}}{\partial Z}{}_{|Z={\mathcal P}_{34}(\beta _{c})}\), that is

$$\begin{aligned} \sum _{n}\left( \frac{1}{n+1}\right) ^{\varepsilon \beta }n<+\infty . \end{aligned}$$
(9)

This holds if and only if \(\varepsilon \beta _{c}>2\). \(\square \)

Remark 4

As was said above, if \(\varepsilon \beta _{c}>2\), by [18, Th.4] there exists a unique equilibrium state which gives a positive weight to [1]. It is also the unique equilibrium with full support. \(\square \)

Remark 5

Existence of at most 2 equilibrium states will follow from the uniqueness of the equilibrium state in \(\{2,3,4\}^{{\mathbb N}}\cap \Sigma \) for \(\beta >\beta _{1}\). \(\square \)

3 Phase Transition for \({\mathcal P}_{234}(\beta )\). End of the Proof of Theorem A

3.1 First Inequalities and Preliminary Results

One of the main difficulties is that we do not know, at that stage, whether the pressure for \(\beta >\beta _{c}\) is strictly bigger than \({\mathcal P}_{34}(\beta )\) or not.

Let \({\mathcal P}_{234}(\beta )\) be the pressure for the sub-system \(\{2,3,4\}^{{\mathbb N}}\cap \Sigma \) of points in \(\Sigma \) with no symbols in \(\{1,1_{1},\ldots ,1_{L}\}\) and for the potential \(\beta .\phi \). As it is a subsystem of the global one, \({\mathcal P}_{234}(\beta )\leqslant {\mathcal P}(\beta )\). Conversely, \(\{3,4\}^{{\mathbb N}}\) is a subsystem of \(\{2,3,4\}^{{\mathbb N}}\cap \Sigma \) and then \({\mathcal P}_{234}(\beta )\geqslant {\mathcal P}_{34}(\beta )\). Therefore Lemma 4 yields that

$$\begin{aligned} {\mathcal P}_{234}(\beta _{c})={\mathcal P}(\beta _{c})={\mathcal P}_{34}(\beta _{c}) \end{aligned}$$

at the transition. The main question is to know whether for \(\beta > \beta _{c}\) an equilibrium state (for \(\Sigma \)) gives a positive weight to the cylinder [2] or not.

Since we only have to show that for \(\beta >\beta _{c}\), \({\mathcal P}_{234}(\beta )={\mathcal P}_{34}(\beta )\) holds, from now on untill the end of this section, equilibrium states are with respect to the system \(\Sigma _{234}\). We recall that \(\phi \) is continuous and the entropy is upper semi-continuous. Thus, there exist at least one equilibrium state, say \(\widehat{\mu }_{\beta }\), in \(\Sigma _{234}\).

Lemma 7

For every \(\beta \), for every equilibrium state \(\widehat{\mu }_{\beta }\), \(\widehat{\mu }_{\beta }([3])>0\).

Proof

If \(\widehat{\mu }_{\beta }([3])=0\), then any ergodic component of \(\widehat{\mu }_{\beta }\) is either \(\delta _{2^{\infty }}\) or \(\delta _{4^{\infty }}\). In the first case the pressure is 0, in the second case it is \(\beta .(\delta +\gamma )\). In both cases, the value is strictly lower than \({\mathcal P}_{34}(\beta )\leqslant {\mathcal P}_{234}(\beta )\). \(\square \)

Lemma 8

There exists an equilibrium state, \(\widehat{\mu }_{\beta }\), for \(\beta .\phi \) satisfying \(\widehat{\mu }_{\beta }([32])=0\) if and only if \({\mathcal P}_{234}(\beta )={\mathcal P}_{34}(\beta )\).

Proof

Let \(\widehat{\mu }_{\beta }\) be an equilibrium state such that \(\widehat{\mu }_{\beta }([32])=0\). Then \(\sigma \)-invariance immediately yields that every cylinder of the form \([i_{0}i_{1}\ldots i_{n-1}32]\) has null \(\widehat{\mu }_{\beta }\)-measure. As \(\widehat{\mu }_{\beta }([3])>0\), this shows that \(\widehat{\mu }_{\beta }(\Sigma _{34})=1\), thus \({\mathcal P}_{234}(\beta )\leqslant {\mathcal P}_{34}(\beta )\). The converse inequality holds as was seen above: the unique equilibrium state in \(\Sigma _{34}\) (for \(\beta .\phi \)) is an equilibrium state for \(\Sigma _{234}\) and it gives no weight to [32]. \(\square \)

For our purpose we will thus induce on the cylinder [32]. To avoid heavy notations, the first return time will simply be denoted by \(\tau \) and the first return map by T.

3.2 Induced Operator in [32]

Consider a point in [32] say \(x:=32x_{2}x_{3}\ldots \). Any y satisfying \(T(y)=x\) is of the form

$$\begin{aligned} y=3\underbrace{2\ldots 2}_{\text {at least one }2}3\omega 32x_{2}x_{3}\ldots , \end{aligned}$$

where \(\omega \) is a word in 3 and 4.

If \(x':=32x'_{2}x'_{3}\ldots \) \(y:=32^{n}3\omega 32x_{2}x_{3}\ldots \) and \(y':=32^{n}3\omega 32x'_{2}x'_{3}\ldots \), we emphasize that

$$\begin{aligned} S_{n+1+|\omega |+1}(\phi )(y')=S_{n+1+|\omega |+1}(\phi )(y) \end{aligned}$$
(10)

holds. In other words, if we induce on [32], the induced potential satisfies the Bowen condition (H2) of [18] and we can apply the same machinery that was used for the cylinder [1].

In that case, the family of transfer operators for T and \(\beta .\phi \) is defined by

$$\begin{aligned} {\mathcal L}_{{Z,\beta ,[32]}}(\psi )(x):= & {} \frac{1}{2^{\varepsilon \beta }}e^{\gamma \beta -Z}\sum _{n=1}^{+\infty }\sum _{\omega }\left( \frac{1}{n+1}\right) ^{\beta }\nonumber \\&\times \, e^{-nZ}\left( \frac{2}{|\omega |+2}\right) ^{\varepsilon \beta }e^{|\omega |\gamma \beta +(\#4\in \omega ).\delta \beta -|\omega |Z}\psi (32^{n}\omega x), \end{aligned}$$
(11)

where \(\psi \) belongs to \({\mathcal C}^{0}([32])\), x starts with 32..., \(\omega \) is a (possibly empty) word with digits 3 and 4 and starting with 3 and \(\#4\in \omega \) is the number of 4’s in \(\omega \). The computation givesFootnote 1

$$\begin{aligned} {\mathcal L}_{{Z,\beta ,[32]}}({1}\mathrm{l}_{[32]})=\frac{1}{(1+e^{\delta \beta })^{2}}\sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\beta }e^{-nZ}\sum _{m=1}^{+\infty }\left( \frac{1}{m+1}\right) ^{\varepsilon \beta }e^{m({\mathcal P}_{34}(\beta )-Z)}. \end{aligned}$$

Note that the induced potential is constant on cylinders associated to T (in [32]), hence \(\lambda _{{Z,\beta ,[32]}}={\mathcal L}_{{Z,\beta ,[32]}}({1}\mathrm{l}_{[32]})(x)\) for every x in [32]. We emphasize that \({\mathcal L}_{{Z,\beta ,[32]}}({1}\mathrm{l}_{[32]})(x)\) is actually equal to \(\Sigma _{2}\Sigma _{3}\). Therefore, the implicit equation

$$\begin{aligned} \lambda _{{Z,\beta ,[32]}}=1, \end{aligned}$$

is exactly verified for \(Z=\widetilde{Z}_{c}(\beta )\) (see Lemma 2) and holds if and only if \(\beta \leqslant \beta _{1}\). By definition of \(\beta _{1}\), for every \(\beta >\beta _{1}\), \(\Sigma _{2}\Sigma _{3}<1.\) Therefore, from [18] we get:

  • \({\mathcal P}_{234}(\beta )=\widetilde{Z}_{c}(\beta )\) for \(\beta <\beta _{1}\),

  • there is a unique equilibrium state for \(\beta <\beta _{1}\), and it is fully supported in \(\Sigma _{234}\),

  • there is a unique equilibrium state for \(\beta >\beta _{1}\) and it is the one in \(\Sigma _{34}\),

  • there are two ergodic equilibrium states for \(\beta =\beta _{1}\) if and only if \(\varepsilon \beta _{1}>2\) (to get convergence for \(\frac{\partial \Sigma _{3}}{\partial Z}\)).

At that point, all the results stated in Theorem A are proved except \(\varepsilon .\beta _{1}<2\), the fact that conditions \(\varepsilon .\beta _{c}\lesseqgtr 2\) can be realized and (non-)differentiability for \({\mathcal P}(\beta )\) at the transition \(\beta =\beta _{c}\)

3.3 End of the Proof of Theorem A

3.3.1 Proof that \(\varepsilon \beta _{1}<2\)

We recall that \(\beta _{1}\) is defined by the implicit formula (5):

$$\begin{aligned} \sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\beta _1}e^{-n{\mathcal P}_{34}(\beta _1)}\frac{\zeta (\varepsilon \beta _1)-1}{(1+e^{\delta \beta _1})^{2}}=1, \end{aligned}$$

with \({\mathcal P}_{34}(\beta )=\gamma \beta +\log (1+e^{\delta \beta })\). Note that \({\mathcal P}_{34}(\beta )\) is always bigger than \(\log 2\), thus for every choices of the parameters

$$\begin{aligned} \zeta (\varepsilon \beta _{1})\geqslant (1+e^{\delta \beta _{1}})^{2}+1>5. \end{aligned}$$

Now, \(\zeta (2)=\displaystyle \frac{\pi ^{2}}{6}\), which shows \(\varepsilon \beta _{1}<2\).

3.3.2 Values for \(\varepsilon \beta _{c}\)

We remind that \(\beta _{c}\) is given by the implicit formula derived from Equality (7):

$$\begin{aligned} \sum _{n=1}^{+\infty }e^{-n\alpha \beta _{c}-n{\mathcal P}_{34}(\beta _{c})+(n-1)\log L}+\frac{\sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\beta _{c}}e^{-n{\mathcal P}_{34}(\beta _{c})}e^{-\alpha \beta _{c}-{\mathcal P}_{34}(\beta _{c})}}{1-\frac{1}{(1+e^{\delta \beta _{c}})^{2}}\sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\beta _{c}}e^{-n{\mathcal P}_{34}(\beta _{c})}\sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\varepsilon \beta _{c}}}=1. \end{aligned}$$
(12)

We have already seen that increasing L is a simple way to force \(\varepsilon \beta _{c}>2\) to hold.

Remind that \(\beta _{c}>\beta _{1}\) and \(\varepsilon .\beta _{1}>1\) and \(\displaystyle {\mathcal P}_{34}(\beta )=\beta .\gamma +\log (1+e^{\beta .\delta })\). Then, assume that \(\delta \rightarrow +\infty \), \(\varepsilon \) being fixed, this yields \(\delta \beta _{c}\rightarrow +\infty \). Substituting this in (12), the first summand and the numerator of the fraction go to 0 if \(\delta \rightarrow +\infty \). Consequently, the denominator must also tend to 0, and as \(\delta \beta _{c}\) tends to \(+\infty \), we must have

$$\begin{aligned} \varepsilon \beta _{c}\rightarrow 1. \end{aligned}$$

It is thus lower than 2 if \(\delta \) is sufficiently large.

3.3.3 Non-differentibality at \(\beta _{c}\) for \(\varepsilon .\beta _{c}>2\)

The right-derivative for \({\mathcal P}(\beta )\) at the transition \(\beta _{c}\) is equal to \({\mathcal P}_{34}'(\beta )=\gamma +\delta .\frac{e^{\delta .\beta }}{1+e^{\delta .\beta }}\). This is a positive real number.

On the other hand, for \(\beta \leqslant \beta _{c}\), \({\mathcal P}(\beta )\) is given by the implicit formula

$$\begin{aligned} {\mathcal P}(\beta )=Z\text { and }\lambda _{Z,\beta ,[1]}=\Sigma _{1}(Z,\beta )+\frac{\Sigma _{2}(Z,\beta )e^{-\alpha \beta -Z}}{1-\Sigma _{2}(Z,\beta )\Sigma _{3}(Z,\beta )}=1. \end{aligned}$$

Therefore, the left-derivative for \({\mathcal P}(\beta )\) is given by

$$\begin{aligned} {\mathcal P}'_{l}(\beta )=-\frac{\partial \lambda _{Z,\beta ,[1]}}{\partial \beta }.\frac{1}{\displaystyle \frac{\partial \lambda _{Z,\beta ,[1]}}{\partial Z}}. \end{aligned}$$
(13)

We emphasize that the partial derivatives on the right-hand side are well-defined for \(\beta <\beta _{c}\) (see Sect. 3.3.4) but also for \(\beta =\beta _{c}\) because \(\varepsilon .\beta _{c}>2\). We remind equalities

$$\begin{aligned} \Sigma _{1}=\Sigma _{1}(Z,\beta ):= & {} \sum _{n=1}^{+\infty }e^{-n\beta \alpha -nZ+(n-1)\log L}\\ \Sigma _{2}=\Sigma _{2}(Z,\beta ):= & {} \sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\beta }e^{-nZ}\\ \Sigma _{3}=\Sigma _{3}(Z,\beta ):= & {} \frac{1}{(1+e^{\delta \beta })^{2}}\sum _{n=1}^{+\infty }\left( \frac{1}{n+1}\right) ^{\varepsilon \beta }e^{n({\mathcal P}_{34}(\beta )-Z)}, \end{aligned}$$

which easily show that \(Z\mapsto \lambda _{Z,\beta ,[1]}\), \(\beta \mapsto \Sigma _{1}(Z,\beta )\) and \(\beta \mapsto \Sigma _{2}(Z,\beta )\) decrease (thus have negative derivative).

Computing terms in the right-hand side of (13), we claim (and let the reader check) that \(\displaystyle \frac{\partial \lambda _{Z,\beta ,[1]}}{\partial \beta }\) is of the form \(A+{\mathcal P}'_{34}(\beta )S\) and \(\displaystyle \frac{\partial \lambda _{Z,\beta ,[1]}}{\partial Z}\) is of the form \(B-S\), where A and B are negative and S is positive. This yields

$$\begin{aligned} {\mathcal P}'_{l}(\beta )=-\frac{A+{\mathcal P}'_{34}(\beta )S}{B-S}={\mathcal P}'_{34}(\beta )+\frac{{\mathcal P}'_{34}(\beta )B+A}{S-B}<{\mathcal P}'_{34}(\beta ). \end{aligned}$$
(14)

3.3.4 Differentiability at \(\beta _{c}\) if \(\varepsilon .\beta _{c}\leqslant 2\)

We claim that equality (13) still holds for \(\beta <\beta _{c}\) because the diverging term for \(\beta =\beta _{c}\) is

$$\begin{aligned} S=\frac{1}{(1+e^{\delta .\beta })^{2}}\sum _{n=1}^{+\infty }n\left( \frac{1}{n+1}\right) ^{\varepsilon .\beta }e^{n({\mathcal P}_{34}(\beta )-{\mathcal P}(\beta ))}. \end{aligned}$$

This term diverges for \(\beta =\beta _{c}\) but converges for \(\beta <\beta _{c}\) because \({\mathcal P}(\beta )>{\mathcal P}_{34}(\beta )\). Then,

$$\begin{aligned} \displaystyle {\mathcal P}'_{l}(\beta )=-\frac{A+{\mathcal P}'_{34}(\beta )S}{B-S}={\mathcal P}'_{34}(\beta )+\frac{{\mathcal P}'_{34}(\beta )B+A}{S-B} \end{aligned}$$

still holdsFootnote 2 for \(\beta <\beta _{c}\). Therefore, if \(\beta \uparrow \beta _{c}\), S goes to \(+\infty \). We let the reader check that A and B are bounded thus, \({\mathcal P}'_{l}(\beta )\) goes to \({\mathcal P}'_{34}(\beta _{c})\) as \(\beta \uparrow \beta _{c}\).

4 Proof of Theorem B

For proving Theorem B we consider the next subshift of finite type (Fig. 7):

Fig. 7
figure 7

Dynamics for Theorem B

The right hand side wing has been replaced by two copies of itself. The left hand side wing is a full shift with \(L+1\) symbols, each wing at the right hand side is also a full shift with two symbols. The unique exit digits of the wings are also the unique entrance digits and are the symbols 1, 3 and \(3'\). To go from one of these symbols to another one, one must pass through 2.

4.1 Phase Transition for \({\mathcal P}(\beta )\)

We now explain how to adapt the results from the proof of Theorem A to this new case. Inducing in [1], orbits of the third form (see the enumeration before Equality (1)) leave [1], visit [2] for a while and then can either visit \(\Sigma _{34}\) or visit \(\Sigma _{3'4'}\). More precisely, after a string of 2’s we can either get a word of the form \(3\omega 3\) where \(\omega \) is a word with digits 3 or 4, or a word of the form \(3'\omega '3'\) with \(\omega '\) a word with digits 3\({}^\prime \) and 4\({}^\prime \).

By symmetry of the potential, any sum \(\Sigma _{3}\) has thus to be replaced by \(2\Sigma _{3}=\Sigma _{3}+\Sigma _{3}\), one for a string in 3 and 4 and one for a string in 3\({}^\prime \) and 4\({}^\prime \).

Consequently

$$\begin{aligned} {\mathcal L}_{Z,\beta ,[1]}({1}\mathrm{l}_{[1]})=\Sigma _{1}+\Sigma _{2}\sum _{k=0}^{+\infty }(2\Sigma _{3}\Sigma _{2})^{k}, \end{aligned}$$

and Condition (4b) has to be replaced by

$$\begin{aligned} \Sigma _{2}\Sigma _{3}<\frac{1}{2}. \end{aligned}$$
(15)

The implicit formula (similar to (5)) we have to consider is

$$\begin{aligned} \Sigma _{2}({\mathcal P}_{34}(\beta _{3}),\beta _{3})\Sigma _{3}({\mathcal P}_{34}(\beta _{3}),\beta _{3})=\frac{1}{2}, \end{aligned}$$

where \(\beta _{3}\) replaces \(\beta _{1}\): for \(\beta <\beta _{3}\), \(\widetilde{Z}_{c}(\beta )\) is strictly larger than \({\mathcal P}_{34}(\beta )\). For \(\beta \geqslant \beta _{3}\), \(\widetilde{Z}_{c}(\beta )= {\mathcal P}_{34}(\beta )\).

Similarly, the new value for the spectral radius \(\lambda _{{Z,\beta ,[1]}}\) is

$$\begin{aligned} \Sigma _{1}+\frac{\Sigma _{2}e^{-\alpha \beta -Z}}{1-2\Sigma _{2}\Sigma _{3}}, \end{aligned}$$

and there exists \(\beta '_{c}>\max (\beta _{2},\beta _{3})\) such that for every \(\beta >\beta '_{c}\), \(\lambda _{{Z,\beta ,[1]}}<1\). Then, for \(\beta >\beta _{c}'\), \({\mathcal P}(\beta )={\mathcal P}_{2343'4'}(\beta )\), and any equilibrium state has support in \(\Sigma _{2343'4'}\) which is \(\Sigma \) restricted to words without digits \(1,1_{1},\ldots 1_{L}\).

4.2 Phase Transition for \({\mathcal P}_{2343'4'}(\beta )\)

Again, we shall induce on the cylinder [32]. In that new case, the orbits leaving [32] and then returning back to [32] have the form: 32(string of 2’s) (intermittence of stringsFootnote 3 of 3\(^\prime \)’s or 4\(^\prime \)’s and strings of 2’s) (3 and one string of 3’s or 4’s)32. This yields that the new equation to consider for \(\lambda _{{Z,\beta ,[32]}}\) is

$$\begin{aligned} \lambda _{{Z,\beta ,[32]}}= & {} \Sigma _{2}(Z,\beta )\left( \sum _{n=0}^{+\infty }(\Sigma _{2}(Z,\beta )\Sigma _{3}(Z,\beta ))^{n}\right) \Sigma _{3}(Z,\beta )\\= & {} \frac{\Sigma _{2}(Z,\beta )\Sigma _{3}(Z,\beta )}{1-\Sigma _{2}(Z,\beta )\Sigma _{3}(Z,\beta )}, \end{aligned}$$

where we took into account the symmetry of \(\Sigma _{34}\) and \(\Sigma _{3'4'}\).

Note that \(z\mapsto \displaystyle \frac{z}{1-z}\) increases for \(z<1\), and \(\displaystyle \frac{z}{1-z}<1\iff z<\frac{1}{2}\). Then,

$$\begin{aligned} \lambda _{{Z,\beta ,[32]}}<1\iff \Sigma _{2}(Z,\beta )\Sigma _{3}(Z,\beta )<\frac{1}{2}. \end{aligned}$$

This shows that \(\beta _{3}\) is a transition parameter for \(\Sigma _{2343'4'}\): for \(\beta <\beta _{3}\) there exists a unique equilibrium state in \(\Sigma _{2343'4'}\) and it is fully supported. For \(\beta >\beta _{3}\), no equilibrium state gives weight to [32]. For symmetry reasons, no equilibrium state gives a positive weight to \([3'2]\), and thus, there are two equilibrium states which are the ones in \(\Sigma _{34}\) and in \(\Sigma _{3'4'}\).

For \(\beta >\beta _{c}'\), there is no more a single global equilibrium state (for \(\Sigma \) and \({\mathcal P}(\beta )\)) but there are two “smaller” equilibria in \(\Sigma _{34}\) and in \(\Sigma _{3'4'}\). Nevertheless, \({\mathcal P}(\beta )={\mathcal P}_{34}(\beta )\) is analytic for \(\beta >\beta '_{c}\).

4.3 Gâteaux-Differentiability in Other Directions

To use the vocabulary from [16], the potential \(\phi \) is a kind of grid function: it is constant on cylinders of the form [1], \([1_{i}]\), \([2^{n}*]\), \([3\omega 32]\) with \(\omega \in \{3,4\}^{n}\) for some n and \([3'\omega 3'2]\) with \(\omega \in \{3',4'\}^{n}\) for some n.

Let \({\mathcal V}\) be the set of such functions, which are in addition Hölder continuous and totally symmetric in \(3\leftrightarrow 3'\) and \(4\leftrightarrow 4'\). \({\mathcal V}\) is infinite-dimension in \({\mathcal C}(\Sigma )\).

It we pick some \(\varphi \) in \({\mathcal V}\), for \(\beta >\beta _{c}'\), and for \(t\in (-\eta ,\eta )\) with \(\eta \approx 0^{+}\), the induced transfer operators for \(\beta .\phi +t.\varphi \) and \(\beta .\phi \) have close spectrum.

As things are totally symmetric in \(\Sigma _{34}\) or \(\Sigma _{3'4'}\), there will still be two equilibrium states and the pressure is differentiable in direction \(\varphi \) because we are into the domains with spectral radiuses \(<\)1.

Remark 6

It is actually highly likely that \(P(\beta .\phi +.)\) is Gâteaux differentiable in the direction of any \(\varphi \) which is Hölder and totally symmetric in \(3\leftrightarrow 3'\) and \(4\leftrightarrow 4'\) (and not necessarily a grid function). \(\square \)