Keywords

1 Introduction

We use mean-field theory to approximate a challenging problem and to study a many-body problem by converting it into a one-body problem. We survey a number of results obtained recently using the theory of large deviations along with Stein’s method-type limit theorems to describe the asymptotic behavior of the O(N) spin models such as the N = 1 Curie-Weiss model, the N = 2 model called the XY model, the N = 3 Heisenberg model, and the N = 4 Toy model of the Higgs sector [5, 12, 13]. We present these results mostly without proofs. In this section, we describe the mean-field XY model and the history, including the 2D XY model (which is currently intractable). In the next section we describe the asymptotic behavior of the XY model; in the last section, the behavior of its generalizations to N-dimensional spins.

The XY model on a complete graph K n with n vertices in the absence of an external field is defined as follows: there is a circular spin \(\sigma _{i} \in \mathbb{S}^{1}\) at each site i ∈ 1, 2, , n. The configuration space of the XY model is \(\Omega _{n} = (\mathbb{S}^{1})^{n}\) where each microstate is σ = (σ 1, σ 2, , σ n ). For the higher O(N) spin models, we simply replace \(\mathbb{S}^{1}\) by \(\mathbb{S}^{N-1}\), and in all cases the Hamiltonian energy is defined by

$$\displaystyle{H_{n}(\sigma ) = -\sum _{i,j}J_{i,j}\left \langle \sigma _{i},\sigma _{j}\right \rangle.}$$

 The mean-field interaction for the XY and O(N) models on the complete graph is defined by \(J_{i,j} = \frac{1} {2n}\) for all i, j.

The simplest spin model is the Ising model, with one-dimensional ± 1 spins, a model that is used extensively in statistical mechanics, invented by Ernest Ising while working with his advisor Wilhelm Lenz [4, 11]. The one-dimensional Ising model has no phase transition, but there is a phase transition on an infinite two-dimensional lattice. The mean-field Ising model, or Curie-Weiss model, describes the Ising model well for higher dimensions, and the magnetization (average spin) in this model has a Gaussian law away from the critical temperature and a non-Gaussian law at the critical temperature [7]. Recently, Chatterjee and Shao [5] proved that the total spin in the Curie-Weiss model at the critical temperature satisfies a Berry–Esseen type error bound in this non-Gaussian limit.

The XY model, with two-dimensional circular spins, models superconductors and is interesting but challenging to study its phase transition rigorously [3]. On a lattice of two spatial dimensions, such a continuous circular symmetry cannot be broken at any finite temperature [16]. Thus the 2D XY model cannot have an ordered phase at low temperature quite like the Ising model, and it has a phase transition that is quite different from the Ising model [17, 18]. Instead, the 2D XY model exhibits the peculiar Kosterlitz-Thouless (KT) transition, a phase transition of infinite order and the subject of a Nobel prize. Above the transition temperature T KT correlations between spins decay exponentially. At low temperatures, the system does not have any long-range order as the ground state is unstable, but there is a low-temperature quasi-ordered phase with a correlation function that decreases with the distance like a power, which depends on the temperature [14].

Because the 2D XY model is so challenging, we study the mean-field classical XY model instead, which can be viewed as the large-dimensional (d) limit of the nearest-neighbor model on \(\mathbb{Z}^{d}\), with spins in \(\mathbb{S}^{1}\), and with critical inverse temperature β c = 2 [1]. Furthermore, the large-dimensional limit approximates high-dimensional models nicely since below the critical temperature, the average spin goes to zero for all d, and above the critical temperature, the total spin has a non-zero limit as d.

In the next section we will examine the XY model in detail, while Sect. 3 deals with extensions to higher spin dimensions.

2 The Mean-Field XY Model and Asymptotic Results

We consider the isotropic mean-field classical XY model on a finite complete graph K n with n vertices. That is, at each site iK n of the graph is a spin living in \(\Omega = \mathbb{S}^{1}\), so the state space is \(\Omega _{n} = (\mathbb{S}^{1})^{n}\). See Fig. 1 for a picture of the XY model on 5 vertices. The corresponding mean-field Hamiltonian energy \(H_{n}: \Omega _{n} \rightarrow \mathbb{R}\) is given by:

$$\displaystyle{H_{n}(\sigma ):= - \frac{1} {2n}\sum _{i,\,j=1}^{n}\left \langle \sigma _{ i},\sigma _{j}\right \rangle = - \frac{1} {2n}\sum _{i,\,j}\cos (\theta _{i} -\theta _{j}),}$$

where θ i is the angle that the i-th spin makes with respect to some axis. The corresponding Gibbs measure is the probability measure P n, β on \(\Omega _{n}\) with density function:

$$\displaystyle{ f(\sigma ):= \frac{1} {Z(\beta )}e^{-\beta H_{n}(\sigma )}. }$$
(1)

where Z is the normalizing constant, also known as the partition function, which encodes the statistical properties of the model such as free energy and magnetization. Note that Gibbs measure here is a normalization of the Boltzmann distribution, and that the inverse temperature β is equal to (k B T)−1, where k B is the Boltzmann constant and T is the temperature of the system. We can understand the structural behavior of the spin vectors’ distribution by studying extreme cases for the inverse temperature β as follows:

  • At high temperature, from Eq. (1) we can predict that the Gibbs measure is uniform.

  • At low temperature, again from Eq. (1) we can predict that the Gibbs measure decays quickly, and the spin vector distribution prefers the lowest-energy ground state.

Fig. 1
figure 1

Left: The classical mean-field XY Model on the complete graph K 5 with five sample spin vectors. Right: The projection of the same spin vectors from K 5 onto \(\mathbb{S}^{1}\)

The most likely physical system states corresponding to the Gibbs measure are called the canonical macrostates. We will consider the random measure of the spins {σ i }, defined by \(\mu _{n,\sigma }:= \frac{1} {n}\sum _{i=1}^{n}\delta _{ \sigma _{i}}\) on \(\mathbb{S}^{1}\) and study the total empirical spin, defined by

$$\displaystyle{S_{n}(\sigma ):=\sum _{ i=1}^{n}\sigma _{ i}.}$$

The relative entropy of a probability measure ν on \(\mathbb{S}^{1}\), with respect to the uniform probability measure μ is defined by

$$\displaystyle{ H(\nu \mid \mu ):= \left \{\begin{array}{@{}l@{\quad }l@{}} \int _{\mathbb{S}^{1}}f\log (\,f)d\mu \quad &if\,\,f:= \frac{d\nu } {d\mu }\,\,exists;\\ \infty \quad &otherwise. \end{array} \right. }$$
(2)

2.1 LDPs, Free Energy, and Macrostates for the XY Model

Let \(M_{1}(\mathbb{S}^{1})\) represent the probability measures on \(\mathbb{S}^{1}\) with the weak-∗ topology. We are interested in analyzing the total spin, S n : = i = 1 n σ i , as a function of the inverse temperature β in the Gibbs measures. This leads us to consider large deviation principles (LDPs) for the μ n, σ , and then we can rewrite the free energy more explicitly to describe the phase transition at β = 2. Part of Theorem 1 (β = 0) is a special case of Sanov’s theorem, while the other cases (β > 0) follow from an abstract result of Ellis, Haven, and Turkington [10, Theorem 2.5].

Theorem 1

If P n is the n-fold product of uniform measure on \(\mathbb{S}^{1}\) and μ n, σ is the random measure as defined above. For \(\Gamma \subset M_{1}(\mathbb{S}^{1})\) , the μ n, σ satisfy an LDP with rate function

$$\displaystyle{ I_{\beta }(\nu ):= H(\nu \mid \mu ) - \frac{\beta } {2}\left \vert \int _{\mathbb{S}^{1}}xd\nu (x)\right \vert ^{2} -\varphi (\beta ), }$$
(3)

where the free energy is given by

$$\displaystyle{ \varphi (\beta ) =\inf _{\nu \in M_{1}(\mathbb{S}^{1})}\left [H(\nu \mid \mu ) - \frac{\beta } {2}\left \vert \int _{\mathbb{S}^{1}}xd\nu (x)\right \vert ^{2}\right ]. }$$
(4)

For fixed β ≥ 0, every subsequence of \(P_{n,\beta }\left [\mu _{n,\sigma } \in \cdot \right ]\) converges weakly to a probability measure on \(M_{1}(\mathbb{S}^{1})\) concentrated on the canonical macrostates \(\mathcal{E}_{\beta }:=\{\nu: I_{\beta }(\nu ) = 0\}\) , i.e., the zeros of the rate function.

For β = 0, the relative entropy H(⋅ ∣μ) achieves its minima of 0 only for the uniform measure μ, implying that the canonical macrostate is disordered. For β > 0, canonical macrostates are defined abstractly through zeros of the rate function (3), and later Theorem 5 will describe the macrostates explicitly.

The free energy given by (4) can be transformed into the following more explicit form.

Theorem 2 (Kirkpatrick-Nawaz [13])

The free energy φ has the formula:

$$\displaystyle{\varphi (\beta ) = \left \{\begin{array}{@{}l@{\quad }l@{}} 0,\quad \quad \quad \quad \;\mathit{\text{ if }}\beta <2, \quad \\ \Phi _{\beta }(g^{-1}(\beta )),\mathit{\text{ if }}\beta \geq 2,\quad \end{array} \right.}$$

where I i below are modified Bessel functions of first kind and \(\Phi _{\beta }\) is the functional defined by:

$$\displaystyle{ \Phi _{\beta }(r):= r\frac{I_{1}(r)} {I_{0}(r)} -\log \left [I_{0}(r)\right ] - \frac{\beta } {2}\left (\frac{I_{1}(r)} {I_{0}(r)}\right )^{2}, }$$
(5)

and

$$\displaystyle{g(r):= r\frac{I_{0}(r)} {I_{1}(r)}.}$$

Here the phase transition is continuous as the function φ and its derivative φ′ are continuous at the critical threshold β = β c = 2.

The magnetization for the classical XY model can be obtained by differentiating the partition function:

$$\displaystyle{\vert m\vert = \left \vert \mathbb{E}\left [ \frac{1} {n}\sum _{i}\sigma _{i}\right ]\right \vert = \left \vert \mathbb{E}\left [ \frac{1} {n}S_{n}\right ]\right \vert = \frac{I_{1}(r)} {I_{0}(r)}}$$

From the free energy we can precisely explain the phase transition at β = 2. For 0 ≤ β ≤ 2, we have a unique global minimum for the free energy at the origin with a zero magnetization. For β ≥ 2, we have a unique global minimum for a positive radius.

Let {σ i } i = 1 n be i.i.d. uniform random points on \(\mathbb{S}^{1} \subseteq \mathbb{R}^{2}\). We have the following Cramér-type LDP for the average spin.

Theorem 3 (Kirkpatrick-Nawaz [13])

Let P n, β be the Gibbs measure defined above  (1). Then for β ≥ 0, the average spin \(M_{n} = M_{n}(\sigma ):= \frac{1} {n}\sum _{i=1}^{n}\sigma _{ i}\) satisfies an LDP with rate function \(I_{\beta }(x) = \Phi _{\beta }(r)\) :

$$\displaystyle{P_{n,\beta }\left (M_{n} \simeq x\right ) \simeq e^{-n\Phi _{\beta }(r)},}$$

where \(\Phi _{\beta }\) is given by ( 5 ) and r = | x | .

For an explicit representation of \(\mathcal{E}_{\beta }\), we note from (2) that the relative entropy depends only on the distribution of f. By Fubini’s theorem

$$\displaystyle{\int f\log (\,f)d\mu =\int _{ 0}^{\infty }\mu \big[\,f\log (\,f)> t\big]dt -\int _{ 0}^{\infty }\mu \big[\,f\log (\,f) <-t\big]dt.}$$

This implies that for a fixed f, the quantity \(\left \vert \int xd\nu (x)\right \vert\) is maximized for corresponding densities which are symmetric about a fixed pole and decreasing as the distance from the pole increases. Using this reasoning, consider a density f that is symmetric about the north pole and decreasing away from the pole i.e., ν f is the measure with density f(x, y) = f(y) which is increasing in y. Then

$$\displaystyle{ \begin{array}{rl} H(\nu _{f}\mid \mu )& = \frac{1} {2\pi }\int _{\mathbb{S}^{1}}f(x,y)\log [\,f(x,y)]dxdy \\ & = \frac{1} {\pi } \int _{0}^{2\pi }\int _{ 0}^{\pi }f(\cos (\theta ))\log [\,f(\cos (\theta ))]d\theta d\varphi \\ & = \frac{1} {\pi } \int _{-1}^{1}\frac{f(y)\log [\,f(y)]} {\sqrt{1-y^{2}}} dy. \end{array} }$$

Similarly,

$$\displaystyle{\int _{\mathbb{S}^{1}}xd\nu _{f}(x) = \frac{1} {\pi } \left [\begin{array}{*{10}c} 0\\ 1 \end{array} \right ]\int _{-1}^{1} \frac{yf(y)} {\sqrt{1 - y^{2}}}dy.}$$

Therefore, our minimization problem is reduced to minimizing the following functional

$$\displaystyle{\frac{1} {\pi } \int _{-1}^{1}\frac{f(y)\log [\,f(y)]} {\sqrt{1 - y^{2}}} dy - \frac{\beta } {2}\left (\frac{1} {\pi } \int _{-1}^{1} \frac{yf(y)} {\sqrt{1 - y^{2}}}dy\right )^{2}}$$

over \(f: [-1,1] \rightarrow \mathbb{R}_{+}\) such that f is increasing and \(\frac{1} {\pi } \int _{-1}^{1} \frac{f(y)} {\sqrt{1-y^{2}}} dy = 1\). We can rewrite the first term of the last expression to see that it involves the usual entropy S( f) = ∫flog( f):

$$\displaystyle{\frac{1} {\pi } \int _{-1}^{1}\frac{f(y)\log [\,f(y)]} {\sqrt{1 - y^{2}}} dy = \frac{1} {\pi } \int _{-1}^{1} \frac{f(y)} {\sqrt{1 - y^{2}}}\log \left [\frac{f(y)} {\pi } \right ]dy +\log (\pi ) = -S\left (\frac{f} {\pi } \right ) +\log (\pi ).}$$

Now for \(\left \vert \int xd\nu (x)\right \vert = c\) ∈ [0, 1], using constrained entropy maximization (see Theorem 12.1.1 from [6]), we will minimize \(\frac{1} {\pi } \int _{-1}^{1} \frac{yf(y)} {\sqrt{1-y^{2}}} dy\), that is, maximize S( fπ), over the \(\nu \in M_{1}(\mathbb{S}^{1})\) corresponding to this c.

Proposition 4 (Kirkpatrick-Nawaz [13])

Consider a set of functions \(f: [-1,1] \rightarrow \mathbb{R}_{+}\) , with weight function \(w(y) = \frac{1} {\sqrt{1-y^{2}}}\) , such that ∫ −1 1 f(y)w(y)dy = 1, and \(\left \vert \int _{-1}^{1}yf(y)w(y)dy\right \vert = c.\) i.e., weighted integral of f is 1 while first weighted moment is bounded. Then the exponential function f (y) = πae by uniquely maximizes S( fπ) over the densities satisfying these conditions.

For c ∈ [0, 1], observe that f increasing gives all b ∈ [0, ). Now for b ∈ [0, ), our functional minimization reduces to the following one dimensional function:

$$\displaystyle{ \begin{array}{rl} \frac{1} {\pi } \int _{-1}^{1}\frac{f(y)\log [\,f(y)]} {\sqrt{1-y^{2}}} dy - \frac{\beta } {2}c^{2} & = b\frac{I_{1}(b)} {I_{0}(b)} -\log \left [I_{0}(b)\right ] - \frac{\beta } {2}\left (\frac{I_{1}(b)} {I_{0}(b)}\right )^{2} \\ & =: \Phi _{\beta }(b).\end{array} }$$
(6)

The following theorem, a special case proved using the calculus of variations in [13], describes the canonical macrostates:

Theorem 5 (Kirkpatrick-Nawaz [13])

  1. (a)

    For β ≤ 2, \(\inf _{b\geq 0}\left \{\Phi _{\beta }(b)\right \} = 0\) is achieved for b = 0 and the corresponding a = 1, so that the minimizing function f = 1 and therefore the only canonical macrostate is the uniform distribution μ.

  2. (b)

    For β > 2, \(\inf _{b\geq 0}\left \{\Phi _{\beta }(b)\right \} = \Phi _{\beta }(g^{-1}(\beta ))\) , where b = g −1(β) is the unique strictly positive solution to g(b) = β where

    $$\displaystyle{g(b) = b\frac{I_{0}(b)} {I_{1}(b)},}$$

    \(a = \frac{1} {\pi I_{0}(b)}\) and \(\lim _{\beta \downarrow 2}\inf _{b\geq 0}\left \{\Phi _{\beta }(b)\right \} = 0\) . In this case, the canonical macrostates are given by \(\mathcal{E}_{\beta } =\{\nu _{f,x}\}_{x\in \mathbb{S}^{1}},\) where ν f, x is the measure that is the rotation of ν f from north pole to x-direction, which is symmetric about the north pole with density \(f: [-1,1] \rightarrow \mathbb{R}\) given by f(y) = πae by with a and b as above.

We can also visualize the Gibbs measure corresponding to subcritical or supercritical cases as shown in Fig. 2.

Fig. 2
figure 2

Cross-sections of two canonical macrostates: For β ≤ 2 (the disordered regime), we have the uniform distribution f(y) = 1 as the dotted line; for β = 5 > β c = 2 (the ordered regime), we have plotted the cross-section of the distribution ν f , given by \(f(x,y) = f(y) = \frac{e^{by}} {I_{0}(y)}\), showing that the spins point predominantly near the north-pole direction

2.2 Limit Theorems for the Total Spin in the XY Model

Next we understand the asymptotics for the total spin of the mean-field XY model, in different regimes across the phase transition, describing the central and non-central limit theorems for each phase.

In the high temperature regime \(\left (0 \leq \beta <2\right )\), the average spin (magnetization) of the system goes to zero with increasing number of spins n, and we have a multivariate central limit theorem with a rate of convergence in Theorem 6. The main idea is to use Stein’s method [12, 15, 19] with the exchangeable pair (W n , W n ) from the Gibbs sampling approach: our random variable representing the rescaled total spin of the original configuration is

$$\displaystyle{W_{n}:= \sqrt{\frac{2-\beta } {n}} \sum _{i=1}^{n}\sigma _{ i},}$$

while the random variable representing the rescaled total spin of the new configuration, with I ∈ {1, , n} chosen uniformly at random, is

$$\displaystyle{W_{n}^{{\prime}}:= W_{ n}(\sigma ^{{\prime}}) = W_{ n} -\sqrt{\frac{2-\beta } {n}} \sigma _{I} + \sqrt{\frac{2-\beta } {n}} \sigma _{I}^{{\prime}}.}$$

Theorem 6 (Kirkpatrick-Nawaz [13])

In the high temperature regime 0 < β  < 2, if W n is defined as above, Z is a standard normal random variable in \(\mathbb{R}^{2}\) , c β is a function depending on β only, L(g) is the modulus of uniform continuity of g, and M(g) is the maximum operator norm of the Hessian of g, then we have:

$$\displaystyle{\sup _{g:L(g),M(g)\leq 1}\vert \mathbb{E}g(W_{n}) - \mathbb{E}g(Z)\vert \leq \frac{c_{\beta }} {\sqrt{n}}}$$

The proof of Theorem 6 proceeds in several steps, as a special case of [13]: first we use the fact that the density of the Gibbs measure is rotationally invariant to conclude that each spin has a uniform marginal distribution. We obtain the complete asymptotic behavior of the total spin using the rotational invariance of the total spin, a strategy adapted from [12]. We calculate the variance of the total spin to arrive at the proper scaling for defining the exchangeable pair and use the pair to derive expressions and bounds for the linear factor \(\Lambda\) appearing in the conditional expectation and the remainder terms R and R′ [12, 13, 15]. The rest follows from a theorem of Meckes [15].

As the temperature decreases to zero, the spins start aligning. For smaller values of β > 2, the spins vectors are aligned weakly, while for larger β, this alignment is strong. For any β > 2, because of the large deviation principle in Theorem 3, we have that | ∑σ j | is close to bnβ with high probability, if b is the minimizer in \(\Phi _{\beta }\). And due to the circular symmetry, all points on the circle of radius bnβ are equally likely. With this reasoning, similar to [12], it is natural to consider the random variable representing the fluctuations of squared-length of total spin, i.e.,

$$\displaystyle{ W_{n}:= \sqrt{n}\left [ \frac{\beta ^{2}} {n^{2}b^{2}}\left \vert \sum _{j=1}^{n}\sigma _{ j}\right \vert ^{2} - 1\right ]. }$$
(7)

Our multivariate central limit theorem in the low temperature (ordered) regime is as follows:

Theorem 7 (Kirkpatrick-Nawaz [13])

If β > 2 and b is the solution of \(b =\beta f(b):=\beta \frac{I_{1}(b)} {I_{0}(b)},\) and W n is as defined above in  (7), and if Z is a centered normal random variable with variance V, where

$$\displaystyle{V = \frac{4\beta ^{2}} {\left (1 -\beta f'(b)\right )b^{2}}\left [1 -\frac{1} {b} \frac{I_{1}(b)} {I_{0}(b)} -\left (\frac{I_{1}(b)} {I_{0}(b)}\right )^{2}\right ],}$$

then there exists c β , depending only on β, such that then

$$\displaystyle{d_{BL}(W_{n},Z) \leq c_{\beta }\left (\frac{\log (n)} {n} \right )^{1/4}.}$$

where d BL (X, Y ) is the bounded Lipschitz distance between random variables X and Y.

Again the proof of Theorem 7 follows from a univariate analogue of the abstract normal approximation of Stein [19], and relies on conditional moment bounds. The fact that the variance is positive was proved by Amos [2] while deriving the improved bounds on the ratio of Bessel functions.

At the critical temperature β c = 2, we will consider the random variable

$$\displaystyle{ W_{n}:= \frac{c} {n^{3/2}}\sum _{i,j=1}^{n}\left \langle \sigma _{ i},\sigma _{j}\right \rangle, }$$
(8)

and make an exchangeable pair (W n , W n ) using Glauber dynamics. Using symmetry of the total spin and Stein’s method similar to [5, 12], we will obtain critical limiting density function p as defined below.

Theorem 8 (Kirkpatrick-Nawaz [13])

For the critical inverse temperature β = 2, if W n is as defined above in  (8), and X is the random variable with the density

$$\displaystyle{p(t) = \left \{\begin{array}{@{}l@{\quad }l@{}} \frac{1} {Z}e^{-t^{2}/64 }\quad &t \geq 0, \\ 0 \quad &t <0, \end{array} \right.}$$

where Z is normalizing constant, then there exists a universal constant C such that

$$\displaystyle{\sup _{\begin{array}{c}\|h\|_{\infty }\leq 1,\,\|h'\|_{\infty }\leq 1 \\ \|h''\|_{\infty }\leq 1 \end{array}}\big\vert \mathbb{E}h(W_{n}) - \mathbb{E}h(X)\big\vert \leq \frac{C\log (n)} {\sqrt{n}}.}$$

The proof of the limit theorem for the critical temperature is essentially via the “density approach” to Stein’s method introduced by Stein, Diaconis, Holmes, and Reinert [20]. Recently, also Chatterjee and Shao [5] have applied this approach to the total spin of the mean-field Ising model, i.e., the Curie-Weiss model.

We note that these limit theorems with explicit rates of convergence can be generalized to high-dimensional spins, but we will omit those technicalities in the following section.

3 High-Dimensional Spin O(N) Models

We can use similar methods to extend our results for two-dimensional spin classical XY model to classical O(N) models, or N-vector models. In this general case, with spins in \(\mathbb{S}^{N-1} \subset \mathbb{R}^{N}\), the critical inverse temperature is β c = N [1, 13]. The N-vector models on a complete graph K n have the Hamiltonian:

$$\displaystyle{ H_{n}(\sigma ):= - \frac{1} {2n}\sum _{i,j=1}^{n}\left \langle \sigma _{ i},\sigma _{j}\right \rangle. }$$
(9)

We present results about the magnetization, free energy, and critical behavior in the O(N) models. It is important to note that we divide our asymptotic analysis into two cases: if N an even positive integer, we have modified Bessel functions of first kind with order ν = N∕2 and ν − 1, while for N odd, we have hyperbolic functions arising from the half-integer order Bessel functions.

3.1 The Magnetization in O(N) Models

Similar to the classical XY model, we can calculate the magnetization of the classical N-vector unit hyperspherical model using the conditional density, from the conditional expectations, and it turns out to be a ratio of modified Bessel function of first kind:

Theorem 9 (Kirkpatrick-Nawaz [13])

Consider the O(N) model with the above Hamiltonian ( 9 ), with N representing the dimension of the spin \(\sigma _{i} \in \mathbb{S}^{N-1}\) . Then on the complete graph K n the O(N) magnetization M N, n = i = 1 n σ i has the following mean-field limit:

$$\displaystyle{\left \vert M_{N}\right \vert = \frac{I_{\frac{N} {2} }(r)} {I_{\frac{N} {2} -1}(r)}}$$

From Fig. 3, we can observe that low-dimensional spin models can be magnetized easier in some sense, and as the spin gets higher dimensional, it takes more energy to magnetize the physical system.

Fig. 3
figure 3

Graph of magnetization limits | M N | for N-vector models, 1 ≤ N ≤ 4. For the mean-field Ising model, M 1 = tanh(x), for the mean-field XY model \(\left \vert M_{2}\right \vert = \frac{I_{1}(r)} {I_{0}(r)}\), for the mean-field Heisenberg model \(\left \vert M_{3}\right \vert = coth(r) -\frac{1} {r}\), and for the mean-field Toy model of the Higgs sector, \(\left \vert M_{4}\right \vert = \frac{I_{2}(r)} {I_{1}(r)}\). Here r and β are related by the formula \(g_{N}(r):= r\frac{I_{\frac{N} {2} -1}(r)} {I_{\frac{N} {2} }(r)} =\beta\)

3.2 The Rate Function and Free Energy in O(N) Models

Next we will present rate functions for large deviation principles similar to Theorems 1 and 3, the first of which is the relative entropy for the N-vector model given by an abstract formula similar to before:

$$\displaystyle{I_{\beta,N}(\nu ):= H(\nu \mid \mu ) - \frac{\beta } {2}\left \vert \int _{\mathbb{S}^{N-1}}xd\nu (x)\right \vert ^{2} -\varphi (\beta )}$$

where H(νμ) is the relative entropy (2) and φ N is the free energy defined abstractly as before:

$$\displaystyle{ \varphi _{N}(\beta ) =\inf _{\nu \in M_{1}(\mathbb{S}^{N-1})}\left [H(\nu \mid \mu ) - \frac{\beta } {2}\left \vert \int _{\mathbb{S}^{N-1}}xd\nu (x)\right \vert ^{2}\right ]. }$$
(10)

We can calculate the minima in the expression of this rate function and verify that in the subcritical regime (β < N) there is a unique minimum, while in the supercritical regime there is a family of minima parametrized by \(\mathbb{S}^{N-1}\). The free energy given by (10) can be written in the following more explicit form using a method like the one in the previous section. In particular, we have a Cramér-type LDP for the average spin \(M_{n}:= \frac{1} {n}\sum _{i=1}^{n}\sigma _{ i} \in \mathbb{R}^{N}\), with rate function \(I_{\beta,N}(x) = \Phi _{\beta,N}(r)\), defined below for β ≥ 0 and r = | x | (Fig. 4).

Fig. 4
figure 4

Graph of the rate function \(I_{\beta,N}(x) = \Phi _{\beta,N}(r)\) in the supercritical regime (β = N + 1) for 2 ≤ N ≤ 4, which has minimum at radius g N −1(β) = r

Theorem 10 (Kirkpatrick-Nawaz [13])

For dimension N, the free energy φ has the formula:

$$\displaystyle{\varphi _{N}(\beta ) = \left \{\begin{array}{@{}l@{\quad }l@{}} 0,\quad \quad \quad \quad \quad \quad \quad \quad \mathit{\text{ if }}\beta <N, \quad \\ \Phi _{\beta,N}(g^{-1}(\beta )),\quad \quad \quad \;\,\mathit{\text{ if }}\beta \geq N,\quad \end{array} \right.}$$

where g −1(β) = r with

$$\displaystyle{g(r) = g_{N}(r):= r\frac{I_{\frac{N} {2} -1}(r)} {I_{\frac{N} {2} }(r)},}$$

and

$$\displaystyle{\Phi _{\beta,N}(r) = r \frac{I_{\frac{N} {2} }(r)} {I_{\frac{N} {2} -1}(r)} +\log \left [ \frac{A_{N}} {A_{N-1}} \frac{r^{\frac{N} {2} -1}} {B_{N}\pi I_{\frac{N} {2} -1}(r)}\right ] - \frac{\beta } {2}\left ( \frac{I_{\frac{N} {2} }(r)} {I_{\frac{N} {2} -1}(r)}\right )^{2},}$$

with

$$\displaystyle{A_{N}:= \frac{2\pi ^{\frac{N} {2} }} {\Gamma \left (\frac{N} {2} \right )}}$$

and

$$\displaystyle{B_{N} = \left \{\begin{array}{@{}l@{\quad }l@{}} \prod _{k=0}^{\frac{N} {2} -1}\vert 2k - 1\vert,\quad \quad \quad \quad \quad \quad \quad \quad \mathit{\text{ if }}N\mathit{\text{ even}},\quad \\ \quad \\ \frac{2^{\frac{N} {2} -1}\Gamma \left (\frac{N-1} {2} \right )} {\sqrt{\pi }},\quad \quad \quad \quad \quad \ \quad \quad \quad \;\,\mathit{\text{ if }}N\mathit{\text{ odd}}. \quad \end{array} \right.}$$

In particular, φ and φ′ are continuous at the critical threshold β = N, implying that the phase transition is second-order or continuous.

3.3 The Critical Density Function in O(N) Models

The limiting density for the critical case uses the (hyper-)spherical symmetry of the total spin for O(N) models, giving the following non-normal limit theorem (Fig. 5).

Fig. 5
figure 5

Mean-field critical density functions p N for 2 ≤ N ≤ 4 and t ≥ 0. For the XY model \(p_{2}(t) = \frac{e^{-t^{2}/64}} {4\sqrt{\pi }}\), for the Heisenberg model \(p_{3}(t) = \frac{\sqrt{t}e^{-t^{2}/180}} {5^{3/4}\sqrt{54}\Gamma [3/4]}\), and for the Toy model of the Higgs sector, \(p_{4}(t) = \frac{te^{-t^{2}/384}} {192}\)

Theorem 11 (Kirkpatrick-Nawaz [13])

At the critical temperature β = N, the random variable \(W_{n} = \frac{c_{N}\vert S_{n}\vert ^{2}} {n^{3/2}}\) has as its limit as n∞ the random variable X with density

$$\displaystyle{p_{N}(t) = \left \{\begin{array}{@{}l@{\quad }l@{}} \frac{1} {Z}t^{\frac{N-2} {2} }\ e^{-kt^{2} },\mathit{\text{ if }}t \geq 0,\quad \\ 0,\ \quad \quad \quad \quad \;\mathit{\text{ if }}t <0, \quad \end{array} \right.}$$

where \(k = \frac{1} {N^{2}(4N+8)}\) and Z is the normalizing constant. To be precise about the rate of convergence, there exists a universal constant C such that

$$\displaystyle{\sup _{\begin{array}{c}\|h\|_{\infty }\leq 1,\,\|h'\|_{\infty }\leq 1 \\ \|h''\|_{\infty }\leq 1 \end{array}}\big\vert \mathbb{E}h(W_{n}) - \mathbb{E}h(X)\big\vert \leq \frac{C\log (n)} {\sqrt{n}}.}$$

The proof of this theorem is in [13] and includes methods from [8, 9, 12].