4.1 The Fundamental Integral Equation

Although Bethe had given the wave functions in 1931 they were in a rather difficult formal form, involving sums over permutations. The first really useful result based on Bethe’s work was by Hulthén in 1938 [1].

As we saw for two spins, if J is −ve, then the ground state is ferromagnetic and completely aligned. This state is also very simple from a quantum mechanical point of view as the eigenstate is one of the basis states. However, for J +ve the ground state is antiferromagnetic and even for two spins is more complicated.

When we considered states with two deviation from the aligned state in the N atom chain we found two types of states

  1. a.

    2 ‘free’ deviations

  2. b.

    bound pair.

If J is positive the bound states lie above the free ones in energy, so the lowest state is the class C type for which k 1, k 2 and \(\phi _{12}\) are all real. These are states for which \(|\lambda _1-\lambda _2|\geq 2\). For r deviations the same result applies. The lowest states are those for which all the k i and \(\phi_{ij}\) are real. Again these are class C type and have \(|\lambda_i - \lambda_j| \geq 2\) for all i, j.

The classical antiferromagnetic ground state is (choosing the direction of alignment of the spins to be parallel and antiparallel to the z-axis)

$$\ldots+ - + - + - + - + - \ldots$$

which is a state with \(\frac{N}{2}\) atoms reversed. The corresponding quantum mechanical state, known as the Néel state,

$$|+ - + - \dots \rangle$$

is a basis state but is not an eigenstate. Nevertheless the true ground state will not be orthogonal to this state and since it is a state with \(S_T^Z = 0\), i.e. \(r = \frac{N}{2}\), then the ground state will have \(S_T^z = 0\) and \(r = \frac{N}{2}\) also.

The equations we need to solve are

$$k_i = \lambda_i \frac{2\pi}{N} + \frac{1}{N} \sum_{j\neq i} \phi_{ij} (\lambda_i \mathrm{integer})$$
((4.1))

and

$$2 \, \mathrm{cot} \frac{\phi_{ij}}{2} = \mathrm{cot} \frac{k_i}{2} - \mathrm{cot} \frac{k_j}{2}$$
((4.2))

Clearly \(\lambda_i + N\) is equivalent to λ i (since adding N to λ i merely increases k i by 2π). Hence we can restrict λ i to be an integer in the range \(0 \leq \lambda_i \leq N-1\) without loss of generality. The antiferromagnetic ground state has \(r = \frac{N}{2}\), i.e. \(\frac{N}{2}\) values of λ i , and is class C so we require \(|\lambda_i - \lambda_j| \geq 2\) for all i, j. There are two possible choices:

$$\begin{array}{lll} \left\{\lambda_i\right\} & = & \left\{0, 2, 4, \ldots , N - 2 \right\} \\ \mathrm{OR} \left\{\lambda_i\right\} & = & \left\{1, 3, 5, \ldots , N-1\right\} \end{array}$$

The first of these has \(\lambda_1 = 0\) and it can be shown that this gives a state in which S T = 1 with \(S_T^Z = 0\), while the second choice gives a state with \(S_T = 0\) and \(S_T^Z = 0\). The second of these is the correct one since the first is degenerate with other states with \(S_T = 1\) and \(S_T^Z = \pm 1\) which lie higher in energy.

Now because the λ i are uniformly spaced, we can introduce a new variable

$$\ x_i = \dfrac{2i-1}{N} = \dfrac{\lambda_i}{N},$$

which becomes a continuous variable in the limit \(N \rightarrow \infty\), running from 0 to 1. Likewise we can regard the k i as forming a continuous set, i.e.

$$x_i \equiv \frac{\lambda_i}{N} \, \raisebox{-1.5ex}{$\stackrel{{\displaystyle \longrightarrow}}{{\scriptstyle N\rightarrow\infty}}$} \, x \qquad \qquad k_i \, \raisebox{-1.5ex}{$\stackrel{{\displaystyle \longrightarrow}}{{\scriptstyle N\rightarrow\infty}}$} \, k(x) \qquad \qquad 0 \leq x \leq 1 \,.$$

In Eq. (4.2) we have k j as well as k i . k j is associated with λ j and we introduce

$$y_j \equiv \frac{\lambda_j}{N} \, \raisebox{-1.5ex}{$\stackrel{{\displaystyle \longrightarrow}}{{\scriptstyle N\rightarrow\infty}}$} \, y \qquad \qquad k_j \, \raisebox{-1.5ex}{$\stackrel{{\displaystyle \longrightarrow}}{{\scriptstyle N\rightarrow\infty}}$} \, k(y) \qquad \qquad 0 \leq y \leq 1,$$

so that Eq. (4.2) becomes

$$2\cot \frac 12 \, \phi (x,y)=\cot \frac{k(x)}2-\cot \frac{k(y)}2$$
((4.3))

and we shall restrict φ to lie in the range \(-\pi \leq \phi \leq \pi \). Note that the y here is not the same as the y in the previous chapter.

Equation (4.1) becomes

$$k(x)=2\pi x+\frac 12\int_0^1\phi (x,y)dy$$
((4.4))

(the factor \(\frac 12\) comes because the λ j are separated by 2). Note that the range of \(k(x)\) is \(0\leq k(x)\leq 2\pi \), unlike that of φ. It can be shown that if \(x <y\) then \(k(x) <k(y)\) as one would expect from Eq. (4.4) provided the integral is well-behaved.

These are the integral equations solved by Huthén. The treatment given here is based on that of Mattis [2] with important extra details. First make the substitutions

$$\begin{array}{lll} \cot \frac{k(x)}{2} & = & \xi(x) \\ \cot \frac{k(y)}{2} & = & \eta(y), \end{array}$$

so that \(2\cot \frac{1}{2} \, \phi(x,y) = \xi(x) - \eta(y)\).

When \(x=y\), \(\cot \frac 12\phi =0\), so \(\phi =\pm \pi \) and at this point the value of φ jumps from −π to π. For this reason we divide the integral in (4.4) into two sections:

$$k(x)=2\pi x+\frac 12\int_0^x\phi (x,y)dy+\frac 12\int_x^1\phi (x,y)dy$$
((4.5))

In the first integral \(y<x\) therefore \(k(y)<k(x)\). This implies, as shown in Fig. 4.1, and recalling that \(0\leq k(x)\), \(k(y)\leq 2\pi \), that \(\cot \frac{k(y)}{2} > \cot \frac{k(x)}{2}\) and therefore

$$\cot \frac \phi 2 = \frac 12\left[ \cot \frac{k(x)}2- \cot \frac{k(y)}2\right] < 0.$$
Fig. 4.1
figure 1

(a) Plot of \(\cot \frac{k}{2}\) versus k showing that if \(k(y)<k(x)\) then \(\cot \frac{k(y)}{2} > \cot \frac{k(x)}{2}\). (b) Plot of \(\cot \frac{\phi}{2}\) versus φ showing that \(\cot \frac{\phi}{2}\) has the same sign as φ

As shown in Fig. 4.1, if \(\cot \frac{\phi}{2} < 0\) and \(-\pi \leq \phi \leq \pi\) we must have \(\phi < 0\). Similarly in the second integral we have \(\phi > 0\).

Now differentiate (4.5) with respect to x

$$\begin{array}{lll} \frac{dk}{dx} & = & 2\pi + \frac{1}{2} \phi_1(x,x) + \frac{1}{2} \int^x_0 \frac{\partial \phi}{\partial x} dy \\ & & -\frac{1}{2} \phi_2 (x,x) + \frac{1}{2} \int^1_x \frac{\partial \phi }{\partial x} dy \end{array}$$
$$\begin{array}{lll} \mathrm{where } \phi_1(x,x) & = & \lim_{y\rightarrow x-} \phi(x,y) \mathrm{in the first integral in Eq. (<InternalRef RefID="Equ5">4.5</InternalRef>)} \\ & = & -\pi \\ \mathrm{and} \phi_2(x,x) & = & \lim_{y\rightarrow x+} \phi(x,y) \mathrm{in the second integral in Eq. (<InternalRef RefID="Equ5">4.5</InternalRef>)} \\ & = & + \pi \end{array}$$

so

$$\begin{array}{lll} \frac{dk}{dx} & = & 2\pi + \frac{1}{2} (-\pi) - \frac{1}{2}(\pi) + \frac{1}{2} \int^x_0 \frac{\partial \phi}{\partial x} \, dy + \frac{1}{2} \int^1_x \frac{\partial \phi}{\partial x} \, dy \\ & = & \pi + \frac{1}{2} \int^1_0 \frac{\partial \phi}{\partial x} \, dy. \end{array}$$

Next we introduce the functions

$$f(\xi )=-\frac{dx}{d\xi }; f(\eta )=-\frac{dy}{d\eta }$$

then

$$\frac{\partial \phi }{\partial x} = \frac{\partial \phi }{\partial \xi }\frac{d\xi }{dx} = -\frac 1{f(\xi )}\frac{\partial \phi }{\partial \xi }.$$
$$\begin{array}{lll} \mathrm{But } \phi &=&2\cot ^{-1}\left[ \frac 12(\xi -\eta )\right], \\ \mathrm{therefore} \frac{\partial \phi }{\partial \xi } &=&2 \, \frac{-1}{1+\frac14(\xi -\eta )^2} \, \frac 12. \\ \mathrm{Also} dy &=&\frac{dy}{d\eta } \, d\eta =-f(\eta ) \, d\eta, \\ \mathrm{therefore} \frac{dk}{dx} &=&\pi +\frac 12\int \left( -\frac 1{f(\xi )}\right) \frac{(-1)}{1+\frac 14(\xi -\eta )^2} \, \left( -f(\eta ) \, d\eta \right) \\ &=&\pi -\frac 1{2f(\xi )}\int \frac{f(\eta )d\eta }{1+\frac 14(\xi -\eta )^2}. \end{array}$$

This is now an integral over η instead of y. As y goes from 0 to 1, k(y) goes from 0 to 2π (not proved here but can be shown from (4.4)) so \(\cot \frac{k(y)}{2}\) goes from +∞ to −∞. Therefore

$$\frac{dk}{dx} = \pi + \frac{1}{2f(\xi)} \int^\infty_{-\infty} \frac{f(\eta) \, d\eta}{1+\frac{1}{4} (\xi-\eta)^2} \, .$$

Finally \(\dfrac{dk}{dx} = \dfrac{dk}{d\xi } \dfrac{d\xi }{dx} = -\dfrac 1{f(\xi )} \dfrac{dk}{d\xi }\) and since \(\xi = \cot \dfrac{k(x)}2\), \(k(x) = 2\cot^{-1}\xi\) and \(\dfrac{dk}{d\xi } = \dfrac{-2}{1+\xi ^2}\). Therefore

$$\frac 2{1+\xi ^2}\frac 1{f(\xi )} = \pi +\frac 1{2f(\xi )}\int_{-\infty }^\infty \frac{f(\eta )d\eta }{1+\frac 14(\xi -\eta )^2} \mathrm{{i.e. }} f(\xi ) = \frac 2{\pi (1+\xi ^2)}-\frac 1{2\pi }\int_{-\infty }^\infty \frac{f(\eta )d\eta }{1+\frac 14(\xi -\eta )^2}.$$
((4.6))

This is the fundamental integral equation for f(ξ) (see, Mattis [2] 5.168).

4.2 Solution of the Fundamental Integral Equation

Linear integral equations in which the kernel is a function of \((\xi -\eta )\) only, can be solved by the Fourier Transform. We define

$$\ F(q)=\int_{-\infty }^\infty f(\theta )e^{iq\theta }d\theta$$

with inverse

$$\ f(\theta )=\frac 1{2\pi }\int_{-\infty }^\infty F(q)e^{-iq\theta }dq.$$

Substituting in Eq. (4.6) gives

$$\begin{array}{lll} \frac 1{2\pi }\int_{-\infty }^\infty F(q)e^{-iq\xi }dq &=&\frac 2{\pi (1+\xi ^2)} \\ &&-\frac 1{2\pi }\int_{-\infty }^\infty \left(\frac 1{2\pi }\int_{-\infty }^\infty F(q)e^{-iq\eta }dq \right) \frac{d\eta }{1+\frac 14(\xi -\eta)^2} \,. \end{array}$$

Now

$$\int_{-\infty }^\infty \frac{e^{-iq\eta }d\eta }{1+\frac 14(\xi -\eta )^2} = e^{-iq\xi }\int_{-\infty }^\infty \frac{e^{iq(\xi -\eta )}d\eta }{1+\frac 14(\xi -\eta )^2} \, .$$

so substituting \(z = \dfrac{1}{2}(\xi-\eta)\), with \(d\eta = -2dz\), gives

$$\begin{array}{lll} &=&2e^{-iq\xi }\int_{-\infty }^\infty \frac{e^{2iqz}dz}{1+z^2} \\ &=&2e^{-iq\xi }\pi e^{-2q}. \end{array}$$

Therefore

$$\begin{array}{lll} \frac 1{2\pi }\int_{-\infty }^\infty F(q)e^{-iq\xi }dq &=&\frac 2{\pi (1+\xi ^2)} -\frac 1{2\pi }\int_{-\infty }^\infty F(q)e^{iq\xi }e^{-2q}dq. \end{array}$$

Multiply both sides by \(e^{iq^{\prime }\xi }\) and integrate \(\int_{-\infty }^\infty d\xi \)

$$\begin{array}{lll} F(q^{\prime }) &=& \frac 2\pi \int_{-\infty }^\infty \frac{e^{iq^{\prime }\xi }d\xi }{(1+\xi ^2)} - e^{-2q^{\prime }}F(q^{\prime }) \\ (1+e^{-2q^{\prime }})F(q^{\prime }) &=&\frac 2\pi \pi e^{-q^{\prime }} \\ \mathrm{therefore} F(q^{\prime }) &=&\frac 1{\mathrm{cosh} q^{\prime }} \end{array}$$
$$\mathrm{i.e.} F(q)={\operatorname{sech}}(q).$$

Finally

$$\ f(\xi )=\frac 1{2\pi }\int_{-\infty }^\infty {\operatorname{sech}}(q)e^{-iq\xi }dq = \frac{1}{2} {\operatorname{sech}} \left( \frac{\pi \xi}{2} \right).$$

4.3 The Ground State Energy

We can now calculate the ground state energy of the infinite chain. For a finite chain with N atoms we have seen that the ground state energy is given by

$$\varepsilon =J\sum_i(\cos k_i-1) \,.$$

In the limit \(N \rightarrow \infty\) we again make the changes

$$k_i \rightarrow k(x) \mathrm{ and } \sum_{i=1}^{N/2} \longrightarrow \frac N2 \int_0^1dx$$

so that

$$\varepsilon = J \,\frac N2\int_0^1 \left[ \cos k(x)-1 \right] dx.$$
$$\begin{array}{lll} \mathrm{Also} \cos k(x)-1 &=&-2\sin ^2\frac k2 \\ &=& -\frac 2{\mathrm{cosec}^2\frac k2} = \frac{-2}{1+\cot ^2\frac k2} = \frac{-2}{1+\xi ^2} \\[0.1in] \mathrm{while} dx &=& \frac{dx}{d\xi } \, d\xi = -f(\xi )d\xi. \end{array}$$

Therefore

$$\begin{array}{lll} \varepsilon &=&-\frac{JN}2\int_{-\infty }^\infty \frac{2f(\xi )d\xi }{1+\xi ^2} \\ &=&-JN\int_{-\infty }^\infty \left(\frac 1{2\pi} \int_{-\infty }^\infty F(q)e^{-iq\xi }dq \right)\frac{d\xi }{(1+\xi ^2)} \end{array}$$

Doing the ξ integral

$$\begin{array}{lll} &=&-JN\frac 1{2\pi }\int_{-\infty }^\infty F(q)\pi e^{-|q|}dq \\ &=&-\frac{JN}2\int_{-\infty }^\infty \frac 2{(e^q+e^{-q})}e^{-|q|}dq \\ &=&-JN2\int_0^\infty \frac{e^{-q}}{e^q+e^{-q}}dq \\ &=&-JN \ln 2 \qquad (\mathrm{using the substitution } z=e^{-q}.) \end{array}$$

Therefore

$$E = \varepsilon + E_A \quad = -JN \ln 2 + \frac {JN}4$$
((4.7))
$$\quad = -0.443147J \mathrm{ to six decimal places.}$$
((4.8))

This is a very famous result due to Hulthén and Bethe. It is one of the outstanding achievements of modern theoretical physics.

As well as this exact result for the ground state energy of the spin-\(\frac{1}{2}\) chain with isotropic Heisenberg exchange, the Bethe Ansatz [3] has been used to obtain many other results for spin-\(\frac{1}{2}\) chains. Some of these are as follows:

  1. a.

    It may be extended to the case of anisotropic exchange. Firstly to the XXZ-model in which S x S x and S y S y terms are equal but the S z S z terms are different, by Orbach [4] and Walker [5] and later to the more general XYZ-model in which all three terms are different, by Baxter [6].

  2. b.

    For the XXZ-model the correlation functions behave as

    $$\left|\Big\langle S_i^z S_{i+R}^z \Big\rangle\right| = \mathrm{const.} \mathrm{as} R \rightarrow \infty$$

    for the anisotropic model (\(\varDelta>1\)) and

    $$\left|\Big\langle S_i^z S_{i+R}^z \Big\rangle\right| \propto \frac{1}{R} {\textrm as} R \rightarrow \infty$$

    for the isotropic model (\(\varDelta=1\)) showing that there is no long range order for \(\varDelta = 1\). These and other important results for the correlation function of this model were obtained by a completely different method, the inverse scattering method, by Bogoliubov et al. [7]

  3. c.

    It was extended to include the effects of a magnetic field by Griffiths [8]

  4. d.

    Results for the elementary excitations (antiferromagnetic magnons) can be found as described in the next chapter.

  5. e.

    The thermodynamics, i.e. the properties at non-zero temperature, have been studied by Yang and Yang [911], Takahashi and Suzuki [12], Gaudin [13] and Klümper [14]