1 Introduction

The Higgs bundle equations and affine Toda field equations lie at the intersection of some intensively studied topics in integrable systems. Namely they are examples of harmonic maps from a surface to a homogeneous space and they are also realized as certain dimensional reductions of the Yang–Mills self-duality equations. It has been observed by Aldrovandi and Falqui [1] that a certain real form of the affine Toda equations yield solutions to the Higgs bundle equations of a specific form, which in this paper are called cyclic. In this paper we will provide a proof of the converse: any cyclic Higgs bundle arises from a solution to the affine Toda equations. It is worth noting that in our approach we are able to derive the Toda equations from the Higgs bundle equations by imposing only a few natural assumptions.

There are at least two points of view in which one can see the relationship between the Higgs and Toda equations. On the one hand, such equations give rise to flat connections and representations of the fundamental group of a surface, on the other hand they are both related to harmonic maps into certain homogeneous spaces. For the most part we will fix our attention on the flat connection side of the story, leaving some observations about the corresponding harmonic maps to the final section of the paper.

The affine Toda equations and the Higgs bundle equations are two particular cases of harmonic mappings from a Riemann surface into a homogeneous space. Our objective here is to establish the exact relation between these two instances. The Toda equations arise as harmonic maps into a quotient \(G/T\) of a simple Lie group \(G\) modulo a maximal abelian subgroup \(T\). Usually \(G\) is taken to be the compact real form and \(T\) a maximal torus, however for our investigations it is necessary to consider the case where \(G\) is the split real form of a simple Lie group and \(T\) is generated by a certain Cartan subalgebra. While it is true that all maximal tori in a compact simple Lie group are conjugate, for other real forms there generally exists several inequivalent classes of maximal abelian subgroups modulo conjugation. In terms of the Toda equations this translates to the existence of several inequivalent real forms of the equations. The particular real form of the equations of interest here is governed by the connection to Higgs bundles.

The Higgs bundle equations are related to harmonic maps into a quotient \(G/K\) where \(K\) is the maximal compact subgroup of \(G\). In attempting to relate the Toda equations to Higgs bundles, we are asking when this harmonic map lifts to a harmonic map into \(G/(K \cap T)\) where \(T\) is a maximal abelian subgroup. In the case where \(G\) is the split real form of a simple Lie group, Hitchin [9] identified a component of the moduli space of Higgs bundles which we call the Hitchin component. For Higgs bundles in the Hitchin component, we are able to give a simple characterization of the Higgs bundles arising from the affine Toda equations. A Higgs bundle in the Hitchin component corresponds to a solution of the Toda equations if and only if it is cyclic, where a cyclic Higgs bundle is one in which the Higgs field is cyclic as an element of the Lie algebra in the sense of Kostant [10], except at finitely many points where it is allowed to vanish. Alternatively the cyclic Higgs bundles are characterized by the property that they correspond to fixed points of a certain finite cyclic subgroup of the \(\mathbb {C}^*\)-action on the moduli space of Higgs bundles.

In Sect. 2 we begin with a review of some structure theory of simple Lie algebras; the principal three dimensional subgroups and cyclic elements. Following this, Sect. 3 presents a quick review of Higgs bundles leading to the construction of Higgs bundles for the Hitchin component. In Sect. 4 we introduce cyclic Higgs bundles and prove that they reduce to certain real forms of the affine Toda equations. We present a review of the affine Toda equations in Sect. 5 and their description in terms of flat connections. We show that under the appropriate reality conditions, the affine Toda equations provide a solution to the Higgs bundle equations. A small modification of the Toda equations puts them in a form applicable to any Riemann surface and we show that the appropriate real form of these equations is equivalent to a cyclic Higgs bundle. Finally, in Sect. 6 we take a look at the Higgs–Toda relationship on the level of harmonic maps.

2 Algebraic preliminaries

We review some results in the theory of complex simple Lie algebras. We will also take this opportunity to fix some notation that will be used throughout.

2.1 Principal three-dimensional subalgebras

Let \(\mathfrak {g}\) be a complex simple Lie algebra of rank \(l\). We let \(G\) be the adjoint form of \(\mathfrak {g}\), that is \(G = \mathrm{Aut}(\mathfrak {g})_0\) is the identity component of the automorphism group of \(\mathfrak {g}\). It is well known that the Lie algebra of \(G\) is \(\mathfrak {g}\) and that the center of \(G\) is trivial. We choose a Cartan subalgebra \(\mathfrak {h}\), let \(\Delta \subset \mathfrak {h}^*\) be the roots, choose a system of positive roots \(\Delta ^+\) and corresponding set of simple roots \(\varPi \). Fix a corresponding basis \(\{h_\beta , e_\alpha ,e_{-\alpha } | \; \beta \in \varPi , \; \alpha \in \Delta ^+\}\). Here we use the convention that if \(\alpha \) is a root then \(h_\alpha \in \mathfrak {h}\) is the corresponding coroot defined by

$$\begin{aligned} \beta (h_\alpha ) = 2\frac{(\alpha , \beta )}{(\alpha , \alpha )}. \end{aligned}$$

Kostant [10] defines a subalgebra (unique up to conjugacy) called the principal 3-dimensional subalgebra of \(\mathfrak {g}\). We construct the principal \(3\)-dimensional subalgebra as follows. Let

$$\begin{aligned} x = \dfrac{1}{2}\sum _{\alpha \in \Delta ^+}h_\alpha , \end{aligned}$$
(1)

then \(x = \sum _{\alpha \in \varPi }r_\alpha h_\alpha \) for some positive half-integers \(r_\alpha \). We use these to further define

$$\begin{aligned} e = \sum _{\alpha \in \varPi }\sqrt{r_\alpha }e_\alpha , \; \; \; \tilde{e} = \sum _{\alpha \in \varPi }\sqrt{r_\alpha }e_{-\alpha }. \end{aligned}$$

We define \(\mathfrak {s}\) as the linear span of \(\{x,e,\tilde{e}\}\). We must verify that \(\mathfrak {s}\) is a subalgebra:

Lemma 1

[16] For any \(\beta \in \varPi \) we have \(\beta (x)=1\)

Proof

Let \(R_\beta \) denote the reflection in the hyperplane in \(\mathfrak {h}^*\) orthogonal to \(\beta \), namely

$$\begin{aligned} R_\beta (\alpha ) = \alpha - 2\frac{(\alpha ,\beta )}{(\beta ,\beta )}\beta . \end{aligned}$$

The dual action of \(R_\beta \) on \(\mathfrak {h}\) is

$$\begin{aligned} R_\beta ^t (h) = h-\beta (h)h_\beta . \end{aligned}$$
(2)

From this one verifies the relation \(R_\beta ^t(h_\alpha ) = h_{R_\beta (\alpha )}\). We then have that

$$\begin{aligned} R_\beta ^t(x) = \dfrac{1}{2}\sum _{\alpha \in \Delta ^+} h_{R_\beta (\alpha )}. \end{aligned}$$

However we also have that for \(\alpha \in \Delta ^+\), \(R_\beta (\alpha ) \in \Delta ^+\) except for \(\alpha = \beta \) in which case \(R_\beta (\beta ) = -\beta \). It follows that \(R_\beta ^t(x) = x - h_\beta \) and by (2), \(\beta (x)=1\).\(\square \)

Given a root \(\lambda \in \Delta \), let \(\lambda = \sum _{\alpha \in \varPi }n_\alpha \alpha \). We define the \(\varPi \) -height of \(\lambda \) to be the integer \(\mathrm{height}(\lambda ) = \sum _{\alpha \in \varPi }n_{\alpha }\). The above lemma shows that for \(y \in \mathfrak {g}_\alpha \) we have \([x,y] = \mathrm{height}(\alpha )y\), so \(x\) is the grading element corresponding to the gradation of \(\mathfrak {g}\) by height. We now deduce the following commutation relations for \(\mathfrak {s}\):

$$\begin{aligned}{}[x,e] = e, \; \; \; [x,\tilde{e}] = -\tilde{e}, \; \; \; [e,\tilde{e}] = x. \end{aligned}$$

Thus \(\mathfrak {s}\) is a copy of \(\mathfrak {sl}(2,\mathbb {C})\).

The element \(e\) is regular, meaning it has an \(l\)-dimensional centralizer spanned by elements \(e_1,\ldots , e_l\). Moreover on restriction to \(\mathfrak {s}\) the adjoint representation decomposes into irreducible subspaces:

$$\begin{aligned} \mathfrak {g} = \bigoplus _{i=1}^l V_i. \end{aligned}$$
(3)

We can take \(e_1,\ldots , e_l\) as highest weight elements of \(V_1, \ldots , V_l\) which shows there are indeed \(l\) summands. Since \(\mathfrak {s}\) itself must appear as one of the \(V_i\), we take it to be \(V_1\), so we may take \(e_1 = e\). Let \(m_1,\ldots , m_l\) denote the exponents of \(\mathfrak {g}\). These can be described as follows [15]; arrange the positive roots of \(\mathfrak {g}\) into an array with the \(k\)th row consisting of all roots of height \(k\), filling in rows from right to left. The lengths of the columns from left to right are the exponents. In particular we see that if the highest root \(\delta \) has height \(M\) then \(m_l = M\) is the largest exponent.

Returning to the decomposition of \(\mathfrak {g}\) given by (3), we have that the dimensions of \(V_1,\ldots , V_l\) are \((2m_1+1),\ldots , (2m_l + 1)\), from which we may write

$$\begin{aligned} \mathfrak {g} = \bigoplus _{i=1}^l \mathrm{S}^{2m_i}(V), \end{aligned}$$
(4)

where \(V\) is the \(2\)-dimensional fundamental representation for \(\mathfrak {s}\). Observe that \([x,e_l] = Me_l\) and thus \(e_l\) is a highest weight vector. We may also decompose \(\mathfrak {g}\) according to the action of \(x\), i.e. by height:

$$\begin{aligned} \mathfrak {g} = \bigoplus _{m= -M}^M \mathfrak {g}_m. \end{aligned}$$

2.2 Coxeter and cyclic elements

The key algebraic feature underpinning much of what follows is the presence of a certain distinguished element of \(G\):

$$\begin{aligned} g = \mathrm{exp}(2\pi i x / h) \end{aligned}$$

where \(h = M+1\) is the Coxeter number. We have that \(g^h = 1\) and the eigenvalues of \(\mathrm{Ad}_g\) are the \(h\)th roots of unity. The highest weight vectors \(e_1,\ldots , e_l\) for the representation of the principal three dimensional subgroup on \(\mathfrak {g}\) span a Cartan subalgebra. Kostant [10] shows that \(g\) acts on this Cartan subalgebra as a Coxeter element for the Weyl group. As such, we will call \(g\) a Coxeter element of \(G\). Conversely every lift to \(G\) of a Coxeter element of a Cartan subalgebra is of this form.

We can decompose \(\mathfrak {g}\) into the eigenspaces of \(\mathrm{Ad}_g\):

$$\begin{aligned} \mathfrak {g} = \bigoplus _{m=0}^M \mathfrak {g}^g_m, \end{aligned}$$

where \(\mathfrak {g}^{g}_m\) is the \(e^{2\pi i m/ h}\)-eigenspace of \(\mathrm{Ad}_g\). In particular \(\mathfrak {g}^g_0 = \mathfrak {h}\) is the Cartan subalgebra and \(\mathfrak {g}^g_1\) is the sum of the root spaces of the simple roots together with the root space of the lowest root \(-\delta \), that is

$$\begin{aligned} \mathfrak {g}^g_1 = \bigoplus _{\alpha \in \varPi \cup \{-\delta \}} \mathfrak {g}_\alpha . \end{aligned}$$

It is clear that \(g\) treats the simple roots and the lowest root on an equal footing. We let \(\overline{\varPi } = \varPi \cup \{-\delta \}\).

Let \(X \in \mathfrak {g}_1^g\). We say that \(X\) is cyclic if \(X = \sum _{\alpha \in \overline{\varPi }} c_\alpha e_\alpha \) where the constants \(c_\alpha \) are non-zero for all \(\alpha \in \overline{\varPi }\). More generally, we say that \(Y \in \mathfrak {g}\) is cyclic if it is \(Y\) conjugate to such an \(X\) under the adjoint action of \(G\). Cyclic elements have a simple description in terms of the invariant polynomials on \(\mathfrak {g}\) as we now describe. Recall that an invariant polynomial on \(\mathfrak {g}\) is an element of the polynomial algebra \(S^*(\mathfrak {g}^*) = \bigoplus _{k=0}^\infty S^k(\mathfrak {g}^*)\) that is invariant under the natural action of \(G\). There are homogeneous generators \(p_1, \ldots , p_l\) for the invariant polynomials on \(\mathfrak {g}\) such that for all elements \(f \in \mathfrak {g}\) of the form

$$\begin{aligned} f = \tilde{e} + \alpha _1 e_1 + \cdots + \alpha _l e_l, \end{aligned}$$

we have

$$\begin{aligned} p_i(f) = \alpha _i. \end{aligned}$$
(5)

We also have that \(p_i\) has degree \(m_i+1\). It turns out that an element \(X \in \mathfrak {g}\) is cyclic if and only if \(p_1(X) = p_2(X) = \cdots = p_{l-1}(X) = 0\), and \(p_l(X) \ne 0\) [10]. Put another way, \(X\) is cyclic if every invariant polynomial of degree less than \(h\) without constant term vanishes on \(X\), but there is an invariant polynomial without constant term that is non-vanishing on \(X\).

Let \(X,Y \in \mathfrak {g}^g_1\) be cyclic. There exists an element \(t \in T\) such that \(\mathrm{Ad}_t X = \lambda Y\) for some non zero constant \(\lambda \), where \(T\) is the connected subgroup of \(G\) generated by the Cartan subalgebra \(\mathfrak {h}\). We see that \(X\) and \(Y\) are conjugate under \(T\) if and only if \(p_l(X) = p_l(Y)\).

3 Higgs bundles and the Hitchin component

3.1 Higgs bundles

Higgs bundles were introduced by Hitchin [8] after considering dimensional reduction of the Yang–Mills self-duality equations to a surface. A Higgs bundle over a Riemann surface \(\varSigma \) is a pair \((E,\varPhi )\) where \(E\) is a holomorphic vector bundle on \(\varSigma \) and \(\varPhi \) is a holomorphic section of \(\mathrm{End}(E) \otimes K\), where \(K\) is the canonical bundle on \(\varSigma \). We call \(\varPhi \) a Higgs field. The slope of a Higgs bundle \((E,\varPhi )\) is the slope of the bundle \(E\), namely \(\mu (E) = \mathrm{deg}(E)/\mathrm{rank}(E)\). There is a \(\mathbb {C}^*\)-action on Higgs bundles which acts by multiplication of the Higgs field by a non-zero complex number.

Closely related to Higgs bundles are the Higgs bundle equations for a pair \((\nabla _A , \varPhi )\) on a Hermitian vector bundle \(E\) over a Riemann surface. Here \(\nabla _A\) is a unitary connection on \(E\) and \(\varPhi \) is a section of \(\mathrm{End}(E) \otimes K\). The Higgs bundle equations are:

$$\begin{aligned} F_A + \left[ \varPhi , \varPhi ^* \right]&= 0 \end{aligned}$$
(6)
$$\begin{aligned} \nabla _A^{0,1}\varPhi&= 0 \end{aligned}$$
(7)

where \(F_A\) is the curvature of \(\nabla _A\) and \(\nabla _A^{0,1}\) is the \((0,1)\) part of \(\nabla _A\).

The relation between Higgs bundles and the Higgs bundle equations is as follows: if \((\nabla _A , \varPhi )\) is a solution of the Higgs bundle equations then \(\nabla _A^{0,1}\) defines a holomorphic structure on \(E\) for which \(\varPhi \) is holomorphic and this defines a Higgs bundle pair \((E,\varPhi )\). Conversely suppose \((E,\varPhi )\) is a Higgs bundle. A choice of Hermitian form \(h\) on \(E\) induces a Chern connection \(\nabla _A\), the unique connection such that \(\nabla _A(h)=0\) and for which \(\nabla _A^{0,1}\) defines the holomorphic structure on \(E\). We seek to find a Hermitian form \(h\) such that \((\nabla _A , \varPhi )\) solves the Higgs bundle equations. Even with the obvious restriction that \(E\) must have degree \(0\), it is not always possible to find such a metric. The result of Hitchin [8] and Simpson [17] is that such a metric exists if and only if the Higgs bundle \((E,\varPhi )\) is polystable. In this case the unitary connection \(\nabla _A\) is unique. We now recall the polystable condition.

If \((E,\varPhi )\) is a Higgs bundle, a subbundle \(F \subset E\) is called \(\varPhi \) -invariant if \(\varPhi (F) \subseteq F \otimes K\). The Higgs bundle \((E,\varPhi )\) is said to be stable if for every proper, non-zero \(\varPhi \)-invariant holomorphic subbundle \(F \subset E\) satisfies the inequality \(\mu (F) < \mu (E)\). If this condition is relaxed to the non-strict inequality \(\mu (F) \le \mu (E)\), we say \(E\) is semistable. We also say that \((E,\varPhi )\) is polystable if it is a direct sum of stable Higgs bundles of the same slope.

The above discussion generalizes to the case of principal \(G\)-bundles where \(G\) is a complex Lie group with Lie algebra \(\mathfrak {g}\). A \(G\)-Higgs bundle is a pair \((P,\varPhi )\) where \(P\) is a principal \(G\)-bundle over \(\varSigma \) and \(\varPhi \) is a holomorphic section of \(\mathfrak {g}^P \otimes K\), where \(\mathfrak {g}^P\) is the adjoint bundle associated to \(P\). Similarly, the Higgs bundle equations have a generalization to principal \(G\)-bundles. We will assume that \(G\) is semisimple. Then for a principal \(G\)-bundle, the equivalent of a Hermitian form is a reduction of structure to the maximal compact subgroup \(G^\rho \). The adjoint bundle \(\mathfrak {g}^\rho \) is a real subbundle of the complex adjoint bundle \(\mathfrak {g}^P\) and defines an antilinear involution \(\rho : \mathfrak {g}^P \rightarrow \mathfrak {g}^P\). If \(x \otimes y \mapsto k(x,y)\) is the Killing form on \(\mathfrak {g}^P\), then corresponding to \(\rho \) is the Hermitian form \(x \otimes y \mapsto -k(x , \rho (y))\). The Hermitian adjoint of \(\mathrm{ad}_x\) is \(-\mathrm{ad}_{\rho (x)}\), hence we will denote \(-\rho (x)\) by \(x^*\). Given an antilinear involution \(\rho : \mathfrak {g}^P \rightarrow \mathfrak {g}^P\), we extend \(\rho \) to an antilinear involution on form-valued sections of \(\mathfrak {g}^P\) by letting it act by complex conjugation on the form part of the section.

It is now straightforward to generalize the Higgs bundle equations. Suppose \(P\) is a principal \(G\)-bundle with a reduction of structure \(P_H \subset P\) to a principal \(H\)-subbundle, with \(H = G^\rho \) the maximal compact subgroup. Then the Higgs bundle equations for a connection \(\nabla _A\) on \(P_H\) and a section \(\varPhi \) of \(\mathfrak {g}^P \otimes K\) are given by Eqs. (6) and (7).

Note also that if \((\nabla _A , \varPhi )\) is a solution of the Higgs bundle equations for a principal \(G\)-bundle, then \(\nabla = \nabla _A + \varPhi + \varPhi ^*\) is a flat \(G\)-connection and is determined by its monodromy representation \(\pi _1(\varSigma ) \rightarrow G\). In this way, Higgs bundles on \(\varSigma \) are closely related to representations of the fundamental group of \(\varSigma \).

3.2 The Hitchin component

Higgs bundles provide a geometric interpretation for representations of the fundamental group of a surface into a simple Lie group. A particular explicit construction of Higgs bundles by Hitchin [9] identifies a component of the space of representations into the split real form of a Lie group. This component is often called the Hitchin component. We consider within the Hitchin component those representations for which only the highest holomorphic differential is non-vanishing and show in this case the Higgs bundle equation reduces to a set of Toda equations.

We review the construction of Higgs bundles in [9]. Throughout we will assume \(\varSigma \) has genus \(g > 1\). Recall that if \(\mathbf{M}\) is the moduli space of Higgs bundles \((E,\varPhi )\) for the group \(G\) then there is a map

$$\begin{aligned} p : \mathbf{M} \rightarrow \bigoplus _{i = i}^l H^0(\varSigma , K^{m_i+1}) \end{aligned}$$

obtained by applying the invariant polynomials \(p_1, \ldots , p_l\) to \(\varPhi \). We will construct a section of this map.

Consider the bundle

$$\begin{aligned} E = \bigoplus _{m=-M}^M \mathfrak {g}_m \otimes K^m \end{aligned}$$
(8)

where \(\mathfrak {g}_m\) is the \(m\)-eigenspace of the grading element \(x\). This is the adjoint bundle associated to the principal \(G\)-bundle \(P = P_1 \times _i G\) where \(P_1\) is the holomorphic principal \(\mathrm{SL}(2,\mathbb {C})\)-bundle associated to \(K^{1/2}\oplus K^{-1/2}\) (for a choice of \(K^{1/2}\)) and \(i: \mathrm{SL}(2,\mathbb {C}) \rightarrow G\) is the inclusion corresponding to the principal \(3\)-dimensional subalgebra \(\mathfrak {s}\). We see from (4) that \(E\) is independent of choice of \(K^{1/2}\) and defines for us a holomorphic vector bundle with fibre \(\mathfrak {g}\).

Next we construct a Higgs field \(\varPhi \). Let \(q_1, \ldots , q_l\) be holomorphic differentials of degrees \(m_1+1, \ldots , m_l +1\). We may define \(\varPhi \in H^0(\varSigma , E \otimes K)\) as follows:

$$\begin{aligned} \varPhi = \tilde{e} + q_1 e_1 + \cdots + q_l e_l. \end{aligned}$$

Here since \(\tilde{e} \in \mathfrak {g}_{-1}\), we regard \(\tilde{e}\) as a section of \((\mathfrak {g}_{-1} \otimes K^{-1}) \otimes K\) and similarly \(q_ie_i\) can be considered as a section of \((\mathfrak {g}_{m_i} \otimes K^{m_i} ) \otimes K\) so that \(\varPhi \) is a well-defined holomorphic section of \(E \otimes K\). Let us recall that \(E\) is the adjoint bundle associated to the principal \(G\)-bundle \(P = P_1 \times _i G\). Hence \(\varPhi \) is a holomorphic \(K\)-valued section of the adjoint bundle. That \(\varPhi \) is holomorphic follows since Eq. (8) is an equality of holomorphic vector bundles.

The Higgs bundles \((E,\varPhi )\) constructed here are polystable and in fact correspond to smooth points of the moduli space \(\mathbf{M}\) [9]. We have from (5) that \(p_i(\varPhi ) = q_i\) and it follows that \((q_1, \ldots , q_l ) \mapsto (E,\varPhi )\) is our desired section.

So far we have constructed Higgs bundles with holonomy in \(G\). We show that in fact the Higgs bundles constructed have holonomy in the split real form of \(G\). We define an involutive automorphism \(\sigma \) on \(\mathfrak {g}\) as follows:

$$\begin{aligned} \sigma (e_i) = -e_i, \; \; \; \sigma (\tilde{e}) = -\tilde{e}. \end{aligned}$$
(9)

Since \(\mathfrak {g}\) is obtained from \(e_1, \ldots , e_l\) by repeated application of \(\mathrm{ad}_{\tilde{e}}\), we see that such a \(\sigma \) must be unique. From [9], \(\sigma \) exists and defines a Cartan involution corresponding to the split real form of \(\mathfrak {g}\). Thus if \(\rho \) is a compact anti-involution defining a Hermitian metric solving the Higgs bundle equations then in fact \(\rho \) and \(\sigma \) commute and \(\lambda = \rho \sigma \) is an anti-involution for the split real form of \(\mathfrak {g}\). Clearly our Higgs field satisfies \(\sigma \varPhi = -\varPhi \). Recall that the Higgs bundle equations are invariant under the \(U(1)\)-action \(\varPhi \mapsto a\varPhi \), for \(a \in U(1)\). If \((\nabla _A , \varPhi )\) satisfies the Higgs bundle equations then so does \((\sigma ^*\nabla _A , \sigma ^*\varPhi )\) and hence, using the \(U(1)\)-action to send \(\varPhi \) to \(-\varPhi \), so does \((\sigma ^*\nabla _A , \varPhi )\). Uniqueness of \(\nabla _A\) now shows that \(\sigma \) is preserved by \(\nabla _A\). Note also that \(\lambda \varPhi = - \rho (\varPhi ) = \varPhi ^*\). It follows that the flat connection \(\nabla _A + \varPhi + \varPhi ^*\) has holonomy in the split real form of \(G\). Therefore we have constructed representations of the fundamental group of \(\varSigma \) into the split real form.

Our section \(s : \bigoplus _{i = i}^l H^0(\varSigma , K^{m_i+1}) \rightarrow \mathbf{M}\) takes values in the smooth points of \(\mathbf{M}\) and identifies the vector space \(V = \bigoplus _{i = i}^l H^0(\varSigma , K^{m_i+1})\) with a submanifold of \(\mathbf{M}\). Moreover the map \(s\) actually maps \(V\) into the moduli space \(\mathbf{M}^\lambda \) of flat \(G^\lambda \)-connections where \(G^\lambda \) is the split real form of \(G\). Around the smooth points \(\mathbf{M}^\lambda \) is a manifold of dimension \((2g-2) \dim (G^\lambda )\) and from the Riemann–Roch theorem we have \(\dim (V) = (2g-2)\dim (G^\lambda )\) as well. Since the image \(s(V) \subset \mathbf{M}^\lambda \) is open, closed and connected it defines a smooth component of the space of representations of the fundamental group in \(G^\lambda \). We call this the Hitchin component.

4 Cyclic Higgs bundles

We now consider a special case of this construction in which all but the highest differential are set to zero. We will call such a Higgs bundle cyclic. Thus our Higgs field is of the form

$$\begin{aligned} \varPhi = \tilde{e} + qe_l. \end{aligned}$$
(10)

Here \(q \in \mathrm{H}^0(\varSigma ,K^{M+1})\) is a holomorphic \((M+1)\)-differential, where \(M = m_l\). Thus our Higgs field is cyclic as an element of \(\mathfrak {g}\) at all points where \(q \ne 0\).

Solutions of the form (10) have the property that they are fixed points in the moduli space of Higgs bundles of the action of certain roots of unity in \(\mathbb {C}^*\). From the properties of the element \(x\) we have

$$\begin{aligned}{}[x,\varPhi ] = -\tilde{e} + Mqe_l. \end{aligned}$$

Let \(g = \mathrm{exp}(2\pi i x/(M+1))\) be the Coxeter element, as constructed in Sect. 2.2. Note that \(g\) commutes with the action of \(x\) so that \(g\) gives a well-defined global gauge transformation of \(E\). It follows that

$$\begin{aligned} \mathrm{Ad}_g \varPhi = \mathrm{exp}(2\pi i M/(M+1))\varPhi = \omega \varPhi \end{aligned}$$

where \(\omega \) is an \((M+1)\)th root of unity. If \((\nabla _A,\varPhi )\) is the solution to the Higgs bundle equation corresponding to \(\varPhi \), we gauge transform this solution by \(g\) to get the equivalent solution \((g^*\nabla _A, \omega \varPhi )\), we then use the action of \(U(1) \subset \mathbb {C}^*\) to obtain another solution \((g^*\nabla _A,\varPhi )\). The uniqueness theorem for solutions to the Higgs bundle equations now implies that \(A\) is gauge invariant under \(g\), which in turn is equivalent to covariant constancy of \(g\) with respect to the connection \(\nabla _A\). Thus the action of \(g\) and \(\nabla _A\) on sections of \(E\) commutes. Observe that the Cartan subalgebra \(\mathfrak {h}\) is the unity eigenspace of \(\mathrm{Ad}_g\) from which it follows that \(\nabla _A\) preserves the subbundle of \(E\) corresponding to \(\mathfrak {h}\) and thus the connection form \(A\) is \(\mathfrak {h}\)-valued.

We now seek an explicit form of the Higgs bundle equations for the special class of solutions (10). For this we adopt the following point of view: given the holomorphic data \((E,\varPhi )\), we seek a Hermitian metric \(H\) on \(E\) such that the Chern connection \(A = H^{-1}\partial H\) satisfies the Higgs bundle equations:

$$\begin{aligned} F_A + [\varPhi ,\varPhi ^*] = 0, \end{aligned}$$

where \(F_A = \overline{\partial }(H^{-1}\partial H)\) is the curvature of \(\nabla _A\). In our case, a Hermitian metric is a reduction of the structure group of \(E\) to the maximal compact subgroup \(G^\rho \) of \(G\). This is equivalent to finding an anti-linear involution \(\rho : E \rightarrow E\) preserving the Lie algebra structure of the fibres, i.e. it is a choice of compact real form of \(\mathfrak {g}\) on each fibre of \(E\). The associated Hermitian form is then

$$\begin{aligned} H(u,v) = h_\rho (u,v) = -k(u,\rho (v)), \end{aligned}$$

where \(k\) is the Killing form on \(\mathfrak {g}\). Recall that for \(u \in \mathfrak {g}\), the adjoint of \(u\) thought of as the endomorphism \(\mathrm{ad}_u\) is given by \(u^* = -\rho (u)\). Since \(G\) is the adjoint form of \(\mathfrak {g}\) we can think of \(G\) as a group of linear transformations of \(\mathfrak {g}\). Given a real form \(\hat{\rho }\) on the Lie algebra \(\mathfrak {g}\), we may define a map \(\hat{\rho }_1 : G \rightarrow G\) by \(\hat{\rho }_1(g) = (g^*)^{-1}\). Here the adjoint is with respect to the Hermitian form \(h_{\hat{\rho }}\) on \(\mathfrak {g}\) induced by \(\hat{\rho }\). Note that the differential of \(\hat{\rho }_1\) at the identity is the anti-involution \(\hat{\rho }\).

In order to write down an arbitrary anti-involution \(\rho \) on the adjoint bundle \(E\), we first define the following fixed choice of anti-involution \(\hat{\rho }\) of \(\mathfrak {g}\):

$$\begin{aligned} \hat{\rho }(h_\alpha ) = -h_\alpha , \; \; \; \hat{\rho }(e_\alpha ) = -e_{-\alpha }, \; \; \; \hat{\rho }(e_{-\alpha }) = -e_\alpha . \end{aligned}$$
(11)

The fixed set of \(\hat{\rho }\) defines a compact real form of \(\mathfrak {g}\) and moreover any other compact real form is conjugate to \(\hat{\rho }\) by an inner automorphism.

We emphasize that \(\rho \) is an anti-involution on the adjoint bundle \(E\) while \(\hat{\rho }\) is a fixed anti-involution on the Lie algebra \(\mathfrak {g}\). Consider now a trivialisation of \(E\) over an open subset \(U\) which identifies the fibres of \(E\) with \(\mathfrak {g}\) preserving the Lie algebra structure. The anti-involution \(\rho : E \rightarrow E\) defines on each fibre \(E_x\) an anti-involution \(\rho _x : E_x \rightarrow E_x\) and upon identifying \(E_x\) with \(\mathfrak {g}\), \(\rho _x\) defines an anti-involution on \(\mathfrak {g}\). Hence for each \(x \in U\) there exists a \(p(x) \in G\) such that \(\rho _x = \alpha _x \hat{\rho } \alpha _x^{-1}\) where \(\alpha _x = \mathrm{Ad}_{p(x)}\). Allowing \(x\) to vary over \(U\) we have a map \(p : U \rightarrow G\) which can be chosen smoothly such that in the trivialisation over \(U\), \(\rho = \mathrm{Ad}_p \circ \hat{\rho } \circ \mathrm{Ad}_{p}^{-1}\). Thus

$$\begin{aligned} \rho = \mathrm{Ad}_h \circ \hat{\rho } \end{aligned}$$
(12)

where \(h : U \rightarrow G\) is given by \(h = p\hat{\rho }_1(p)^{-1}\). Equation (12) is a local formula relating \(\rho \) to \(\hat{\rho }\). Note also that \(\mathrm{Ad}_h\) is a positive definite Hermitian operator with respect to \(\hat{\rho }\). The following lemma will show that we have \(h = \mathrm{exp}(2\varOmega )\) for some \(\varOmega \in \mathfrak {g}\) satisfying \(\hat{\rho }(\varOmega ) = -\varOmega \).

Lemma 2

Let \(\mathfrak {g}\) be a complex Lie algebra with Hermitian form. If \(h \in \mathrm{Aut}(\mathfrak {g})\) is Hermitian and positive definite, then \(h = e^H\) where \(H\) is Hermitian and \(H \in \mathrm{der}(\mathfrak {g})\) is a derivation of \(\mathfrak {g}\)

Proof

We may decompose \(\mathfrak {g}\) into the eigenspaces of \(h\)

$$\begin{aligned} \mathfrak {g} = \bigoplus _\lambda \mathfrak {g}_\lambda . \end{aligned}$$

Since \(h\) is an automorphism we have \(\left[ \mathfrak {g}_{\lambda _1} , \mathfrak {g}_{\lambda _2} \right] \subseteq \mathfrak {g}_{\lambda _1 \lambda _2}\). Now we may write \(h = e^H\) where \(H = \mathrm{log}(h)\) is Hermitian. Further \(H\) acts on \(\mathfrak {g}_\lambda \) with eigenvalue \(\mathrm{log}(\lambda )\). The fact that \(H\) is a derivation follows immediately since \(\mathrm{log}(\lambda _1\lambda _2) = \mathrm{log}(\lambda _1) + \mathrm{log}(\lambda _2)\).\(\square \)

Consider how to find the Chern connection associated to the Hermitian metric \(h_{\rho }\). In a local trivialisation where \(\rho = \mathrm{Ad}_h \circ \hat{\rho }\) we have:

$$\begin{aligned} h_{\rho }(u,v)&= -k(u,\mathrm{Ad}_{h}(\hat{\rho }(v))) \\&= h_{\hat{\rho }}(u,\mathrm{Ad}_{h^{-1}} v). \end{aligned}$$

We can regard \(h_{\hat{\rho }}\) as a fixed Hermitian form on \(\mathfrak {g}\), hence we can think of the metric \(h_\rho \) as being locally associated to the matrix \(h^{-1}\). The connection form in this trivialisation is then \(A = h \partial h^{-1}\).

We will need the following:

Lemma 3

The anti-involution \(\rho \) solving the Higgs bundle equations is unique

Proof

Any other solution is related by a \(G\)-valued gauge transformation \(L: E \rightarrow E\) such that \(L\) preserves the holomorphic structure and \(\varPhi \). We will argue that \(L\) is the identity. It is a general fact [8] that a gauge transformation that is holomorphic and commutes with \(\varPhi \) is in fact covariantly constant with respect to the unitary connection \(\nabla _A\). Now as in (8) we may write

$$\begin{aligned} E = \bigoplus _{m=-M}^M \mathfrak {g}_m \otimes K^m. \end{aligned}$$

We have seen that \(\nabla _A\) is \(\mathfrak {h} = \mathfrak {g}_0\)-valued, hence \(\nabla _A\) preserves the subbundles \(E_m = \mathfrak {g}_m \otimes K^m\). If we decompose \(L\) into endomorphisms \(L_{ij} : E_i \rightarrow E_j\) then each \(L_{ij}\) must be covariantly constant. Since \(E_i \otimes E_j^* = \mathfrak {g}_i \otimes \mathfrak {g}_j^* \otimes K^{i-j}\) there can only be a non-vanishing covariantly constant section if \(i=j\) (as \(\varSigma \) is taken to have genus \(g>1\)). Therefore \(L\) preserves \(\mathfrak {g}_i\) for all \(i\). Since \(L\) preserves the Cartan subalgebra \(\mathfrak {h} = \mathfrak {g}_0\) then \(L\) is valued in the normaliser \(N(T)\) of the maximal torus \(T\). But \(L\) also preserves \(\mathfrak {g}_1 = \bigoplus _{\alpha \in \varPi } \mathfrak {g}_{\alpha }\). Therefore the induced action of \(L\) on the roots of \(\mathfrak {g}\) preserves the choice of simple roots, hence \(L\) is valued in the maximal torus \(T\). We also have that \(L\) commutes with \(\varPhi = \tilde{e} + qe_l\), in particular \(\mathrm{Ad}_L \tilde{e} = \tilde{e}\). It follows that \(L\) is the identity.\(\square \)

We can now prove the following:

Proposition 1

The anti-involution \(\rho \) locally has the form \(\rho = \mathrm{Ad}_h \circ \hat{\rho }\), where \(h\) is a locally defined \(G\)-valued function. Moreover \(h\) can be written in the form \(h = e^{2\varOmega }\), for some locally defined \(\mathfrak {h}\)-valued function \(\varOmega \) satisfying \(\hat{\rho }(\varOmega ) = -\varOmega \)

Proof

Recall that the Coxeter element \(g \in G\) gives a well-defined gauge transformation of \(E\). We have seen that the connection \(A\) is valued in \(\mathfrak {h}\) and that \(\nabla _A\) preserves the element \(g\). Consider a new globally defined anti-involution given by \(\mu = \mathrm{Ad}_g \circ \rho \circ \mathrm{Ad}_{g^{-1}} = \mathrm{Ad}_q \circ \rho \) where \(q = g\rho _1(g)^{-1}\).

Now since both \(g\) and \(H\) are covariantly constant we see that \(\mu \) yields a metric compatible with \(A\). Thus the Chern connection corresponding to \(\mu \) is also \(A\). Now if we take the Higgs bundle equation for \(\rho \):

$$\begin{aligned} F_A + [\varPhi , -\rho (\varPhi )] = 0 \end{aligned}$$

and apply \(\mathrm{Ad}_g\), noting that \(g\) commutes with elements of \(\mathfrak {h}\) we get

$$\begin{aligned} F_A + [\omega \varPhi , -\mathrm{Ad}_g \rho (\varPhi )] = 0. \end{aligned}$$

However, since \(\omega \mathrm{Ad}_g \circ \rho (\varPhi ) = \mathrm{Ad}_g \circ \rho \circ \mathrm{Ad}_{g^{-1}} \varPhi = \mu (\varPhi )\) we get

$$\begin{aligned} F_A + [\varPhi ,-\mu (\varPhi )] = 0. \end{aligned}$$

By uniqueness of solutions to the Higgs bundle equations we have that \(\mu = \rho \), and so \(\mathrm{Ad}_q\) is the identity. Thus \(q\) lies in the centre of \(G\) which is trivial since \(G\) is the adjoint form, hence \(\rho _1(g) = g\).

Locally we write \(\rho = \mathrm{Ad}_h \circ \hat{\rho }\), for a locally defined \(G\)-valued function \(h\). We have seen that \(Ad_h\) is a positive definite Hermitian operator, so by Lemma 2 we may write \(h = e^{2\varOmega }\) for some locally defined \(\mathfrak {g}\)-valued Hermitian function \(\varOmega \). Here Hermitian means \(\varOmega = \varOmega ^* = -\hat{\rho }(\varOmega )\). In other words, \(\hat{\rho }(\varOmega ) = -\varOmega \). Now \(g = \rho _1(g) = \mathrm{Ad}_h (\hat{\rho }_1(g)) = \mathrm{Ad}_h(g)\). Thus \(g\) and \(h\) commute. This implies that \(h\) preserves the eigenspaces of \(g\), in particular \(h\) preserves the Cartan subalgebra \(\mathfrak {h}\). Thus \(h\) is valued in the normaliser \(N(T)\). We will further argue that \(h\) is valued in the maximal torus \(T\).

Since \(N(T)/T\) is a finite group (the Weyl group) there is some positive integer \(a\) such that \(h^a = e^{2a\varOmega }\) is valued in \(T\), say \(h^a = e^{X}\) where \(X\) is valued in \(\mathfrak {h}\). However \(h^a\) is a positive definite Hermitian operator in the sense that \(h^a = h^{a/2}\hat{\rho }_1(h^{-a/2}) = h^{a/2}(h^{a/2})^*\). Therefore \(h^{2a} = e^{X - \hat{\rho }(X)}\) is also Hermitian and positive. So \(h^{2a}\) has a unique positive Hermitian \(2a\)th root which is \(h\). Taking this root yields

$$\begin{aligned} h = e^{(X - \hat{\rho }(X))/2a}. \end{aligned}$$

Replacing \(2\varOmega \) by \((X - \hat{\rho }(X))/2a\) if necessary we have that \(h = e^{2\varOmega }\) is valued in \(T\) and \(\varOmega \) is valued in \(\mathfrak {h}\) with \(\hat{\rho }(\varOmega ) = -\varOmega \).\(\square \)

Having established that \(\varOmega \) is \(\mathfrak {h}\)-valued, we have that \(\varOmega \) and \(\partial \varOmega \) commute and the local connection form is simply \(A = -2\partial \varOmega \). We have in turn that the curvature is \(F_A = -2\overline{\partial }\partial \varOmega \). Next we determine \(\varPhi ^* = -\rho (\varPhi )\). Recall that \(\varPhi = \tilde{e} + qe_l\), so \(\rho (\varPhi ) = -Ad_{h}(e + \overline{q}\tilde{e_l})\), where \(\tilde{e_l}\) is a lowest weight vector for \(\mathfrak {g}\). Thus

$$\begin{aligned} \varPhi ^*&= \mathrm{Ad}_h (e+ \overline{q}\tilde{e_l}) \\&= \sum _{\alpha \in \varPi } \sqrt{r_\alpha }e^{2\alpha (\varOmega )}e_\alpha +\overline{q}e^{-2\delta (\varOmega )}\tilde{e_l}. \end{aligned}$$

We can now determine the commutator \([\varPhi ,\varPhi ^*]\):

$$\begin{aligned}{}[\varPhi ,\varPhi ^*]&= \left[ \sum _{\alpha \in \varPi }\sqrt{r_\alpha }e_{-\alpha } + qe_l ,\sum _{\alpha \in \varPi } \sqrt{r_\alpha }e^{2\alpha (\varOmega )}e_\alpha +\overline{q}e^{-2\delta (\varOmega )}\tilde{e_l} \right] , \\&= -\sum _{\alpha \in \varPi }r_\alpha e^{2\alpha (\varOmega )}h_\alpha -q\overline{q}e^{-2\delta (\varOmega )}h_{-\delta }. \end{aligned}$$

Although it has been suppressed, this should really be multiplied by a \(dz \wedge d\overline{z}\) term. The Higgs bundle equation for \(\varOmega \) now becomes:

$$\begin{aligned} -2\varOmega _{z\overline{z}} + \sum _{\alpha \in \varPi }r_\alpha e^{2\alpha (\varOmega )}h_\alpha +q\overline{q}e^{-2\delta (\varOmega )}h_{-\delta } = 0. \end{aligned}$$
(13)

If \(\mathfrak {g}\) is a simple Lie algebra of type \(L_n\), then these are a version of the affine Toda field equations [13] for the Affine Dynkin diagram \(L_n^{(1)}\) which we introduce more generally in Sect. 5. Note that since \(\varOmega \) is \(\mathfrak {h}\)-valued, we have \(\rho (\varOmega ) = \hat{\rho }(\varOmega ) = -\varOmega \) and similarly for \(h_\alpha , h_{-\delta }\). Thus \(\varOmega \) is a real linear combination of the \(h_\alpha \). However, there is an additional constraint on \(\varOmega \) that we have yet to consider. By examining this condition, we will find that in some cases the above equations will reduce to affine Toda field equations for a smaller affine Dynkin diagram.

4.1 Symmetry conditions

The additional constraint on \(\varOmega \) that we have yet to consider relates to a symmetry property of \(\varOmega \). Recall that we define an involutive automorphism \(\sigma \) on \(\mathfrak {g}\) by the properties

$$\begin{aligned} \sigma (e_i) = -e_i, \; \; \; \sigma (\tilde{e}) = -\tilde{e}. \end{aligned}$$

We also note that \(\sigma \) commutes with \(\hat{\rho }\) and \(\hat{\lambda } = \sigma \hat{\rho } = \hat{\rho }\sigma \) is an anti-involution corresponding to the split real form of \(\mathfrak {g}\). As before, we have \(\sigma (\varPhi ) = -\varPhi \) and \(\sigma \) is covariantly constant with respect to \(\nabla _A\).

We shall argue that in fact \(\sigma (\varOmega ) = \varOmega \). First note that since \(\sigma A = A\) we have \(\sigma (\varOmega _{z\overline{z}}) = \varOmega _{z\overline{z}}\). Thus, the remaining terms of the Toda equations (13) must also be fixed by \(\sigma \). We also know [9] that \(\sigma = \phi \circ \hat{\nu }\) where \(\phi \) is the inner automorphism corresponding to a rotation by \(\pi \) in the principal \(3\)-dimensional subgroup and \(\hat{\nu }\) is a lift of an automorphism \(\nu \) of the Dynkin diagram for \(\varPi \). In fact, \(\nu \) is trivial except for the simple Lie algebras of type \(A_n,D_{2n+1},E_6\) for which it has order \(2\). In any case, \(\phi \) is conjugation by an element of \(\mathfrak {h}\), so that \(\sigma \) agrees with \(\hat{\nu }\) on \(\mathfrak {h}\) and we have

$$\begin{aligned} \sigma ({h_\alpha }) = h_{\nu (\alpha )}, \; \; \; \sigma (h_{-\delta }) = h_{-\delta }. \end{aligned}$$

We also have that \(\sigma (x) = x\) from which it follows that \(r_{\nu (\alpha )} = r_{\alpha }\). By invariance of the right hand side of (13) under \(\sigma \) we see that \(e^{2\alpha (\varOmega )} = e^{2\nu (\alpha )(\varOmega )}\), now since \(\varOmega \) is real with respect to the basis \(\{h_\alpha \}\), we may take logarithms to conclude that \(\alpha (\varOmega ) = \nu (\alpha )(\varOmega )\) for all \(\alpha \in \varPi \), that is \(\sigma (\varOmega ) = \varOmega \).

From the relation \(\sigma (\varOmega ) = \varOmega \) we have that \(\rho \) and \(\sigma \) commute and thus \(\lambda = \sigma \rho = \rho \sigma \) is a real structure on \(E\) conjugate to \(\hat{\lambda }\), hence \(\lambda \) defines a reduction of structure to the split real form \(G^\lambda \) of \(G\).

Since \(A\) is invariant under \(\rho \) and \(\sigma \), it is invariant under \(\lambda \) as well. Therefore \(\nabla _A\) has holonomy in \(K = G^\rho \cap G^\lambda \), the maximal compact subgroup of \(G^\lambda \). In fact, the holonomy reduces to a maximal torus \(T\) of \(K\). Let \(\mathfrak {t}\) denote the Lie algebra of \(T\), that is \(\mathfrak {t} = \mathfrak {h}^\lambda \). The inclusion \(i: \mathfrak {t} \rightarrow \mathfrak {h}\) induces the restriction map \(r: \mathfrak {h}^* \rightarrow \mathfrak {t}^*\). Since \(\varOmega \) is \(\mathfrak {t}\)-valued, we wish to re-interpret the Toda field equations in terms of the restricted roots on \(\mathfrak {t}^*\). Intuitively speaking, we are dividing out the action of \(\nu \).

We can realize \(\mathfrak {t}^*\) as a subspace of \(\mathfrak {h}^*\). In fact the kernel of the restriction map is the \(-1\)-eigenspace of \(\hat{\nu }^t\) so that \(\mathfrak {t}^*\) identifies with the \(+1\) eigenspace. Under this identification the restriction map becomes the orthogonal projection map

$$\begin{aligned} r(\alpha ) = \tfrac{1}{2}(\alpha + \hat{\nu }^t(\alpha )). \end{aligned}$$

For any \(\beta \in \mathfrak {t}^*\), define the dual vector \(\tilde{h}_\beta \) by the usual formula

$$\begin{aligned} \alpha (\tilde{h}_\beta ) = 2\frac{(\alpha ,\beta )}{(\beta ,\beta )} \end{aligned}$$

for all \(\alpha \in \mathfrak {t}^*\). Then, for possibly different (non-zero) constants \(\tilde{r}_\beta \) the Toda equation (13) reduces to:

$$\begin{aligned} -2\varOmega _{z\overline{z}} + \sum _{\beta \in r(\varPi )}\tilde{r}_\beta e^{2\beta (\varOmega )}\tilde{h}_\beta + q\overline{q}e^{-2\delta (\varOmega )}\tilde{h}_{-\delta } = 0. \end{aligned}$$
(14)

The elements of \(r(\varPi )\) together with \(-\delta = -r(\delta )\) correspond to an affine Dynkin diagram as explained in Sect. 5. That is we have a system of vectors in a Euclidean vector space \(\mathfrak {t}^*\) whose Dynkin diagram is an affine Dynkin diagram. Equation (20) is then the affine Toda field equation for the corresponding affine Dynkin diagram.

All that remains is to actually determine which affine root system to which they correspond. If \(\nu \) is trivial, then \(\mathfrak {t} = \mathfrak {h}\) and the affine diagram is just the extended Dynkin diagram \(L_n^{(1)}\) obtained by adding to the root system \(L_n\) the lowest weight \(-\delta \). The remaining cases follow from a straightforward calculation and are as follows: for \(\mathfrak {g}\) of type \(A_{2n},A_{2n-1},D_{2n+1},E_6\), the corresponding affine diagrams are \(A_{2n}^{(2)},C_n^{(1)},B_{2n}^{(1)},F_4^{(1)}\) respectively. Here we are using the Kac names for the affine Dynkin diagrams [4].

5 Geometry of the affine Toda equations

5.1 The affine Toda equations

The Toda equations have been extensively studied from the point of view of integrable systems [1114] as well as their relation to minimal surfaces and harmonic maps [2, 3, 57]. This should not be surprising given the link between Higgs bundles and harmonic maps. However our treatment of the affine Toda equations allows for more general real forms of the equations than are usually considered.

Let \(A\) be an \(n \times n\) indecomposable generalized Cartan matrix of affine type. We label the rows and columns by \(0,1,\ldots , l\), where \(l = n-1\). We arrange so that the \(l \times l\) matrix \(A^0\) formed by removing row \(0\) and column \(0\) is the Cartan matrix for the root system of a complex simple Lie algebra. So there is a real \(l\)-dimensional Euclidean vector space \((\mathfrak {h}_{\mathbb {R}},\langle \, , \, \rangle )\) with basis \(\{h_i\}\), \(i = 1,\ldots , l\) such that if we define a corresponding basis \(\{\alpha _i \}\), \(i = 1,\ldots , l\) of \(\mathfrak {h}_{\mathbb {R}}^*\) by \(\alpha _i = 2h_i/\langle h_i, h_i \rangle \) then \(A_{ij} = \alpha _j(h_i)\) for \(i,j = 1,\ldots , l\). Moreover, [4] there exists positive integers \(a_0,a_1, \ldots , a_l\) with no common factor such that \(A (a_0,\ldots , a_l )^t = 0\). Similarly, there exists positive integers \(c_0, c_1, \ldots , c_l\) with no common factor such that \((c_0, \ldots , c_l)A = 0\). Let us define \(h_0 \in \mathfrak {h}_{\mathbb {R}}\) and \(\alpha _0 \in \mathfrak {h}_{\mathbb {R}}^*\) by

$$\begin{aligned} h_0&= -\sum _{i=1}^l c_i h_i, \\ \alpha _0&= -\sum _{i=1}^l a_i \alpha _i. \end{aligned}$$

Then it follows that

$$\begin{aligned} \alpha _i (h_j) = A_{ji}, \, i,j = 0,1,\ldots , l. \end{aligned}$$

Now we may define the affine Toda equations

Definition 1

Given an \(n \times n\) indecomposable generalized Cartan matrix of affine type, let \(\mathfrak {h}_{\mathbb {R}}\), \(\{h_i\}\), \(\{\alpha _i\}\) be as above. Let \(U\) be an open subset of \(\mathbb {C}\). A map \(\varOmega : U \rightarrow \mathfrak {h} = \mathfrak {h}_{\mathbb {R}} \otimes \mathbb {C}\) is said to satisfy the affine Toda equations if it is a solution to

$$\begin{aligned} 2\varOmega _{z\overline{z}} = \sum _{i=0}^l k_i e^{2\alpha _i(\varOmega )}h_i, \end{aligned}$$
(15)

where \(k_i \in \mathbb {C}\) are constants, \(k_i \ne 0, \, i = 0, \ldots , l\).

5.2 From affine Toda to Higgs bundles

It is well known that the affine Toda equations can be re-written as a zero curvature condition for a certain associated connection. The fact that such a formulation exists is intimately connected to their integrability. Alternatively, the affine Toda equations admit a description in terms of a class of harmonic maps from a surface into a homogeneous space of the form \(G/T\) where \(T\) is a maximal torus or more generally the group corresponding to a Cartan subalgebra [2, 7]. In turn, both of these descriptions are closely related to the Higgs bundle equations.

We will first re-write the Toda equations as the equations for a flat connection. After imposing appropriate reality conditions we will see that the flat connection decomposes into a unitary connection and Higgs field satisfying the Higgs bundle equations. We note that if different reality conditions are imposed one can arrive at similar equations except involving connections that preserve an indefinite Hermitian form.

For now we restrict to the case of an untwisted affine Dynkin diagram. In fact, the twisted cases all occur as special cases of the untwisted equations [14] and we have already seen in Sect. 4.1 how this works in some cases. Thus, we take a complex simple Lie algebra \(\mathfrak {g}\) of rank \(l\), we let \(\mathfrak {h}\) be the real subspace of a Cartan subalgebra spanned by the coroots \(h_1, \ldots , h_l\) corresponding to the simple roots \(\alpha _1, \ldots , \alpha _l\). The untwisted affine root system corresponding to \(\mathfrak {g}\) amounts to letting \(\alpha _0 = -\delta \) be the lowest root and \(h_0\) the corresponding coroot. Define \(x \in \mathfrak {h}\) by (1).

For each root \(\alpha _i\), \(i=0,1,\ldots , l\), let \(e_{\alpha _i}\), be a basis element for the spaces of \(\alpha _i\). We can choose a basis such that

$$\begin{aligned}{}[ e_{\alpha _i} , e_{-\alpha _j} ] = \delta _{ij} h_i. \end{aligned}$$

Consider the connection \(\nabla = \nabla ^{1,0} + \nabla ^{0,1}\) on the trivial bundle \(\mathfrak {g} \times U \rightarrow U\), where

$$\begin{aligned} \nabla ^{1,0}&= (\partial _{z} - \varOmega _z)dz + \varPhi \end{aligned}$$
(16)
$$\begin{aligned} \nabla ^{0,1}&= (\partial _{\overline{z}} +\varOmega _{\overline{z}})d\overline{z} + \varPsi \end{aligned}$$
(17)

and \(\varPhi \), \(\varPsi \) are defined by

$$\begin{aligned} \varPhi&= \mathrm{Ad}_{e^{-\varOmega }}\left( \sum _{i=0}^l c_i e_{-\alpha _i} \right) dz \end{aligned}$$
(18)
$$\begin{aligned} \varPsi&= \mathrm{Ad}_{e^{+\varOmega }}\left( \sum _{i=0}^l d_i e_{+\alpha _i} \right) d\overline{z} \end{aligned}$$
(19)

where the \(c_i,d_i\) are constants such that \(c_i d_i = k_i\). The different choices of the constants \(c_i,d_i\) actually define a \((\mathbb {C}^*)^{l+1}\)-family of flat connections. We see that the Toda equations (15) are equivalent to the vanishing of the curvature of \(\nabla \). We will call this gauge in which \(\nabla \) is written the Toda gauge.

One can write down a whole family of gauge equivalent connections. For any smooth map \(H : U \rightarrow \mathfrak {h}\), the gauge transformation by \(\mathrm{Ad}_{e^H}\) changes \(\nabla \) to \(\nabla '^{1,0} + \nabla '^{0,1}\) where

$$\begin{aligned} \nabla '^{1,0}&= (\partial _{z} - \varOmega _z-H_z)dz + \varPhi ' \\ \nabla '^{0,1}&= (\partial _{\overline{z}} +\varOmega _{\overline{z}}-H_{\overline{z}})d\overline{z} + \varPsi ' \end{aligned}$$

with \(\varPhi '\), \(\varPsi '\) given by

$$\begin{aligned} \varPhi '&= \mathrm{Ad}_{e^{-\varOmega +H}}\left( \sum _{i=0}^l c_i e_{-\alpha _i} \right) dz \\ \varPsi '&= \mathrm{Ad}_{e^{\varOmega +H}}\left( \sum _{i=0}^l d_i e_{\alpha _i} \right) d\overline{z}. \end{aligned}$$

Choosing \(H\) to be constant accounts for a \((\mathbb {C}^*)^l\) subgroup of the aforementioned \((\mathbb {C}^*)^{l+1}\)-family of flat connections. Modulo this gauge symmetry There is a remaining \(\mathbb {C}^*\)-action which amounts to transformations \(\varPhi \mapsto \zeta \varPhi \), \(\varPsi \mapsto \zeta ^{-1} \varPsi \) for \(\zeta \in \mathbb {C}^*\). The choice of gauge transformation with \(H = \varOmega \) will be called the Higgs gauge for reasons that will soon become clear.

Having arrived at the desired zero curvature form of the equations, the next step is to impose a reality condition. Let \(\hat{\rho }\) be the anti-linear involution of \(\mathfrak {g}\) defined as in (11). We extend \(\hat{\rho }\) to differential form-valued sections of \(U \times \mathfrak {g}\) by letting \(\hat{\rho }\) acts as complex conjugation on the differential form part of the section. Note that \(\hat{\rho }\) defines the compact form of \(\mathfrak {g}\) and that \(\hat{\rho }\) preserves \(\mathfrak {h}\). We impose the reality condition \(\hat{\rho }(\varOmega ) = -\varOmega \), or equivalently \(\varOmega \) is valued in \(\mathfrak {h}_{\mathbb {R}}\), the \(-1\) eigenspace of \(\hat{\rho }\) on \(\mathfrak {h}\). Now to consider a particular real form of the Toda equations we also fix the values of the coefficients \(k_i\), \(i = 0,\ldots , l\) to the following convenient values: \(k_i = r_i\) for \(i = 1,\ldots , l\) where the \(r_i\) are defined by \(x = \sum _{i=1}^l r_i h_i\) and set \(k_0 = 1\). Let us further take \(c_i = d_i = \sqrt{r_i}\) for \(i = 0,\ldots , l\).

Under these conditions the connection \(\nabla _A = (\partial _{z} - \varOmega _z) + (\partial _{\overline{z}} +\varOmega _{\overline{z}})\) is a unitary connection on the trivial bundle \(\mathfrak {g} \times U \rightarrow U\) with respect to the Hermitian form \(h_{\hat{\rho }}(x,y) = -k(x, \hat{\rho }(y))\), where \(k\) is the Killing form on \(\mathfrak {g}\). Moreover we observe that \(\hat{\rho }(\varPhi ) = -\varPsi \), or simply \(\varPsi = \varPhi ^*\). It follows easily that:

Proposition 2

The pair \((\nabla _A , \varPhi )\) is a solution of the Higgs bundle equations if and only if \(\varOmega \) is a solution of the affine Toda equations

The above proposition is also proved in [1, Proposition 3.1].

Note that in the Higgs gauge \(\nabla _A^{0,1} = \overline{\partial }\), that is the Higgs gauge is a holomorphic trivialization of the holomorphic structure determined by \(\nabla _A\). Let us also note that in the Higgs gauge, the Hermitian form preserved by \(\nabla _A\) becomes

$$\begin{aligned} H(x,y)&= -k(Ad_{e^{-\varOmega }} (x) , \hat{\rho }(Ad_{e^{-\varOmega }}(y))) \\&= -k( x , Ad_{e^{\varOmega }} \hat{\rho }( Ad_{e^{-\varOmega }} (y))) \\&= -k (x , Ad_{e^{2\varOmega }} \hat{\rho }(y)) \\&= -k( x , \rho (y)) \end{aligned}$$

where \(\rho = \mathrm{Ad}_{e^{2\varOmega }} \circ \hat{\rho }\) and we have use Ad-invariance of the Killing form and the fact that \(\hat{\rho } \circ Ad_{e^{-\varOmega }} = Ad_{e^{\varOmega }} \circ \hat{\rho }\). Observe that this is the same relation as in Proposition 1.

5.3 Globalization of the affine Toda equations

We are interested in putting the affine Toda equations into more geometric terms, applicable to any Riemann surface. Since we have already written the affine Toda equations in terms of the Higgs bundle equations, the remaining step is to introduce transition maps to get a connection on a non-trivial bundle. To achieve this we will also need the data of a holomorphic differential of degree \(h = M+1\). This will ultimately lead us back to the class of cyclic Higgs bundles.

Let \(\varSigma \) be a Riemann surface and \(\{ U_i \}\) and open cover by coordinate charts with local holomorphic coordinate \(z_i\). On the intersection \(U_{ij} = U_i \cap U_j\), let \(dz_i = g_{ij}dz_j\) where \(g_{ij}\) is a non-vanishing holomorphic function on \(U_{ij}\). Further define real valued functions \(f_{ij}\) by \(g_{ij}\overline{g_{ij}} = e^{2f_{ij}}\).

Suppose we have a collection of functions \(\varOmega _i : U_i \rightarrow \mathfrak {h}\) which are related on the overlap \(U_{ij}\) by

$$\begin{aligned} \varOmega _i = \varOmega _j + f_{ij}x. \end{aligned}$$

So the \(\varOmega _i\) define a section \(\varOmega \) of a certain affine bundle over \(\varSigma \). We call such a section \(\varOmega \) a Toda field. The point of this definition is that for each simple root \(\alpha \), we have \(\alpha (x)=1\) and hence the \(e^{2\alpha (\varOmega _i)}dz_i \overline{dz_i}\) define a section of \(K \overline{K}\), where \(K\) is the canonical bundle of \(\varSigma \). On the other hand, for the lowest root \(-\delta \) we have that the \(e^{-2\delta (\varOmega _i)}dz_i \overline{dz_i}\) define a section of \(K^{-M}\overline{K}^{-M}\). To make sense of the affine Toda equation we introduce a holomorphic degree \((M+1)\)-differential \(q \in H^0(\varSigma , K^{M+1})\). We may then say that the pair \((\varOmega ,q)\) consisting of a Toda field and holomorphic differential satisfy the affine Toda equations, if

$$\begin{aligned} 2\varOmega _{z\overline{z}} = \sum _{i=1}^l k_i e^{2\alpha _i(\varOmega )}h_i + q \overline{q} e^{-2 \delta (\varOmega )} h_0. \end{aligned}$$
(20)

Note that both sides of this equation define sections of \(K\overline{K} \otimes \mathfrak {h}\) so this is a well-defined equation on \(\varSigma \). If \(q = 0\), the equation is locally equivalent to the non-affine Toda equations. On the other hand, in a neighborhood of a point where \(q \ne 0\) we can find a local coordinate such that \(q = (dz)^{M+1}\) and the equation reduces to the affine Toda equations as in (15).

On each open subset \(U_i\), we have a trivial bundle \(\mathfrak {g} \times U_i\) and a pair \((\nabla _{A,i},\varPhi _i)\) which in the Higgs gauge have the form

$$\begin{aligned} \nabla _{A,i}^{1,0}&= (\partial _{z_i} - 2{(\varOmega _i)}_{z_i})dz_i, \\ \nabla _{A,i}^{0,1}&= \partial _{\overline{z_i}} d\overline{z_i}, \end{aligned}$$

and

$$\begin{aligned} \varPhi _i&= \left( \sum _{i=1}^l c_i e_{-\alpha _i} + q_i e_\delta \right) dz_i, \\ \varPhi _i^*&= \mathrm{Ad}_{e^{2(\varOmega )_i}}\left( \sum _{i=1}^l d_i e_{\alpha _i} + \overline{q_i} e_{-\delta } \right) d\overline{z_i}, \end{aligned}$$

where \(q = q_i (dz_i)^{M+1}\) in the local coordinate \(z_i\).

Next, we claim that on the overlaps \(\,U_{ij}\,\) the pairs \((\nabla _{A,i},\varPhi _i)\) and \((\nabla _{A,j},\varPhi _j)\) are gauge equivalent by some gauge transformation \(\phi _{ij}\). The \(\phi _{ij}\) will satisfy a cocyle condition so we can patch together to get a single Higgs bundle \((\nabla _A , \varPhi )\) defined on \(\varSigma \).

Let us take transition maps that preserve the root decomposition \(\mathfrak {g} = \mathfrak {h} \oplus \bigoplus _{\alpha \in \Delta } \mathfrak {g}_\alpha \). More specifically, we take transitions \(\phi _{ij}\) that act trivially on \(\mathfrak {h}\) and on a given root space \(\mathfrak {g}_\alpha \) acts as multiplication by \(g_{ij}^{\alpha (x)}\). Thus a section \(X_i\) of \(\mathfrak {g}_\alpha \times U_i\) is identified with a section \(X_j\) of \(\mathfrak {g}_\alpha \times U_j\) over \(U_{ij}\) by the relation \(X_i = g_{ij}^{\alpha (x)} X_j\).

It follows that the bundles \(\mathfrak {g} \times U_i\) patch together to form precisely the bundle \(E\) given by (8). Moreover, one easily checks that the Higgs bundles \((\nabla _{A,i},\varPhi _i)\) patch together under these transitions to form a Higgs bundle \((\nabla _A , \varPhi )\) on \(E\). It is immediate from the construction that the pair \((\nabla _A , \varPhi )\) is a cyclic Higgs bundle. If we now assume \(\varSigma \) is compact of genus \(g>1\), then as an immediate corollary we find that \(\sigma (\varOmega ) = \varOmega \), where \(\sigma \) is the Cartan involution as defined in (9). Combined with the results in Sect. 4 we have thus obtained:

Theorem 1

Let \(\varSigma \) be a compact Riemann surface of genus \(g>1\). Then any pair \((\varOmega ,q)\) consisting of a Toda field and holomorphic differential \(q\) of degree \((M+1)\) satisfying the affine Toda equations (20) subject to \(k_i = r_i\) for \(i=1,\ldots , l\), \(k_0 = 1\) and \(\hat{\rho }(\varOmega ) = -\varOmega \) corresponds to a cyclic Higgs bundle. Conversely any cyclic Higgs bundle gives such a solution of the Toda field equations

Moreover, we know that the cyclic Higgs bundles on a compact Riemann surface of genus \(g>1\) are parametrized by the corresponding holomorphic differential \(q \in H^0(\varSigma , K^{M+1})\) so we also have:

Theorem 2

Let \(\varSigma \) be a compact Riemann surface of genus \(g>1\). For any holomorphic differential \(q \in H^0(\varSigma , K^{M+1})\) there is a unique Toda field \(\varOmega \) such that \(\hat{\rho }(\varOmega ) = -\varOmega \) and \((\varOmega ,q)\) satisfies the affine Toda equations (with coefficients \(k_i = r_i\) for \(i=1, \ldots , l\), \(k_0 = 1\))

The uniqueness follows from Lemma 3.

6 Remark on harmonic maps

We have mentioned that the Higgs bundle and Toda equations are related not only to flat connections, but also to harmonic maps. We demonstrated the relation between Higgs and Toda using flat connection descriptions. Here we remark on the relation in terms of harmonic maps.

Suppose \(P_G \rightarrow \varSigma \) is a principal \(G\)-bundle with flat connection \(\omega \). Let \(\tilde{\varSigma }\) be the universal cover of \(\varSigma \), which we view as a right principal \(\pi _1(\varSigma )\)-bundle over \(\varSigma \). The pull-back of \(P_G\) to a principal bundle over \(\tilde{\varSigma }\) admits global sections in which \(\omega \) is trivial. Any two such trivializing sections differ by the right action of \(G\). A choice of such section \(s\) determines a monodromy representation \(\theta : \pi _1(\varSigma ) \rightarrow G\) of \(\omega \) by the relation \(s(p\gamma ) = s(p) \theta (\gamma )\) for all \(\gamma \in \pi _1(\varSigma )\). If we change the section \(s\) then \(\theta \) changes by conjugation.

Now suppose in addition we are given a reduction of structure to a principal \(H\)-subbundle \(P_H \rightarrow P_G\) where \(H\) is a subgroup of \(G\). Note that we are not assuming the connection \(\omega \) reduces to a connection on \(P_H\). With respect to a trivializing section \(s\) we can find a map \(\tau : \tilde{\varSigma } \rightarrow G/H\) such that the pull-back of \(P_H\) to \(\tilde{\varSigma }\) is given by \(\{ s(p)g \, | \, p \in \tilde{\varSigma }, \; g \in \tau (p) \, \}\). If we change the section \(s\), to \(sg\) for some \(g \in G\) then we must change \(\tau \) to \(g^{-1}\tau \). We also observe that the map \(\tau : \tilde{\varSigma } \rightarrow G/H\) is \(\theta \)-equivariant meaning \(\tau (p\gamma ) = \theta (\gamma )^{-1} \tau (p)\) for all \(\gamma \in \pi _1(\varSigma )\).

Conversely a representation \(\theta : \pi _1(\varSigma ) \rightarrow G\) determines a flat \(G\)-connection and a \(\theta \)-equivariant map \(\tau : \tilde{\varSigma } \rightarrow G/H\) corresponds precisely to a reduction of structure to \(H\) of the associated flat principal \(G\)-bundle.

In the case of Higgs bundles, we have a flat \(G\)-connection, where we assume \(G\) is simple and a reduction of structure to the maximal compact subgroup \(K \subseteq G\). If \(\nabla \) is any flat \(G\)-connection and we are given a reduction of structure \(\tau : \tilde{\varSigma } \rightarrow G/K\) to the maximal compact, then we may uniquely decompose \(\nabla = \nabla _A + \varPhi + \varPhi ^*\) where \(\nabla _A\) is a unitary connection, \(\varPhi \) is a \((1,0)\)-valued section of the adjoint bundle and \(\varPhi ^*\) is the adjoint with respect to the Hermitian form determined by the reduction of structure. A key observation is that the pair \((\nabla _A , \varPhi )\) satisfies the Higgs bundle equations if and only if the corresponding map \(\tau : \tilde{\varSigma } \rightarrow G/K\) is harmonic. This is the link between harmonic maps and Higgs bundles.

Similarly, in the case of the affine Toda equations we wrote the equations in terms of a flat \(G\)-connection for a complex Lie group and a reduction of structure to the subgroup \(T\) corresponding to a Cartan subalgebra. So there is a reduction of structure map \(\tau : \tilde{\varSigma } \rightarrow G/T\). When we have a solution of the Toda equations, this map is harmonic. There is a partial converse—a special class of harmonic maps \(\tilde{\varSigma } \rightarrow G/T\) defined in [2] called \(\tau \)-primitive correspond to solutions of the affine Toda equations.

Now, consider the case of cyclic Higgs bundles. Let \(G\) be a complex simple Lie group in adjoint form, \(G^{\hat{\rho }}\) the maximal compact, \(G^{\hat{\lambda }}\) the split real form, and \(K = G^{\hat{\lambda }} \cap G^{\hat{\rho }}\) the maximal compact subgroup of \(G^{\hat{\lambda }}\). A cyclic Higgs bundle determines a harmonic map \(\tau _{\mathrm{Higgs}} : \tilde{\varSigma } \rightarrow G^{\hat{\lambda }}/K\). Now let \(T\) be the subgroup generated by a Cartan subalgebra \(\mathfrak {h}\). The compact and split real forms have corresponding anti-involutions \(\hat{\rho }\) and \(\hat{\lambda }\) which according to the choices we use preserve \(\mathfrak {h}\). So there are corresponding abelian subgroups \(T^{\hat{\rho }} \subset G^{\hat{\rho }}\), \(T^{\hat{\lambda }} \subset G^{\hat{\lambda }}\) and \(T_K = T^{\hat{\rho }} \cap T^{\hat{\lambda }} \subset K\). The Toda equations imply that the reduction of structure map \(\tau _{\mathrm{Toda}} : \tilde{\varSigma } \rightarrow G^{\hat{\lambda }}/ T^{\hat{\lambda }}\) is harmonic. Moreover, the reductions of structure \(\tau _{\mathrm{Higgs}},\tau _{\mathrm{Toda}}\) are compatible in the sense that we have a commutative diagram

It is a straightforward computation to verify that the map \(\tau \) is also harmonic. Thus, in terms of harmonic maps the relationship between the Higgs bundle and affine Toda equations is the existence of a common lift of harmonic maps which is also harmonic.