Abstract
We study the regular surface defect in the \(\Omega \)-deformed four-dimensional supersymmetric gauge theory with gauge group SU(N) with 2N hypermultiplets in fundamental representation. We prove its vacuum expectation value obeys the Knizhnik–Zamolodchikov equation for the 4-point conformal block of the \(\widehat{{\mathfrak {sl}}}_{N}\)-current algebra, originally introduced in the context of two-dimensional conformal field theory. The level and the vertex operators are determined by the parameters of the \(\Omega \)-background and the masses of the hypermultiplets; the cross-ratio of the 4 points is determined by the complexified gauge coupling. We clarify that in a somewhat subtle way the branching rule is parametrized by the Coulomb moduli. This is an example of the BPS/CFT relation.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The rich mathematics of quantum field theory has a remarkable feature of admitting, to some extent, an analytic continuation in various parameters, such as momenta, spins etc. This feature is best studied in the examples of two-dimensional conformal field theories, where one can observe almost with a naked eye that the building blocks of the correlation functions are analytic in the parameters, such as the central charges, conformal dimensions, weights, spins and so on cf. [53]. Some formulae admit analytic continuation in the level k of the current algebra, cf. [22]. The analytic continuation offers some glimpses of the Langlands duality [12] \((k+h^{\vee })(k^{\vee }+h) = 1\), which suggests an identification of the quantum group parameter q with the modular parameter \({\exp } (2\pi \mathrm{i}\tau )\) of some elliptic curve [6]. These observations solidified as soon as the connection between the S-duality of four-dimensional supersymmetric theories and the modular invariance of two-dimensional conformal field theories was observed [47]. Localization computations in supersymmetric gauge theories [24, 26, 27, 32,33,34,35,36,37] showed that the correlation functions of selected observables coincide with conformal blocks of some two-dimensional conformal field theories, or, more generally, are given by the matrix elements of representations of some infinite-dimensional algebras, such as Kac–Moody, Virasoro, or their q-deformations, albeit extended to the complex domain of parameters, typically quantized in the two-dimensional setup. In [26], this phenomenon was attributed to the chiral nature of the tensor field propagating on the worldvolume of the fivebranes. The fivebranes (M5 branes in M-theory and NS5 branes in the IIA string theory) were used in [21, 49] to engineer, in string theory setup, the supersymmetric systems whose low energy is described by \({{\mathcal {N}}}=2\) supersymmetric gauge theories in four dimensions. This construction was extended and generalized in [11]. This correspondence, named the BPS/CFT correspondence in [29, 30], has been supported by a large class of very detailed examples in [1, 2, 32,33,34,35,36,37], and more recently in [14, 15, 18, 20].
Finally, in [50, 51], the relation of the quantum group parameter q with the elliptic curves has been brought into the familiar context of the relation of the \({{\mathcal {N}}}=4\) super-Yang–Mills theory to elliptic curves. Hopefully, with this understanding of the analytically continued Chern–Simons theory, the (quasi)-modularity conjectures of [31] could be tested.
In this paper, we shall be studying a particular corner of that theoretical landscape: the SU(N) gauge theory with 2N fundamental hypermultiplets. In the BPS/CFT correspondence, it is associated with a zoo of two-dimensional conformal theories living on a 4-punctured sphere, all related to the \({\widehat{\mathfrak {sl}}}_{N}\) current algebra, either directly, or through the Drinfeld–Sokolov reduction, producing the \(W_{N}\)-algebra [52], depending on the supersymmetric observables one uses to probe the four-dimensional theory. Two observables are of interest for us. First, the supersymmetric partition function \(\mathbf{Z} = \mathbf{Z}( {{\mathbf{a }}}, {{\mathbf{m }}}, {{\varepsilon }}_{1}, {{\varepsilon }}_{2}; {{\mathfrak {q}}})\) on \({{{\mathbb {R}}}}^4\), which is a function of the vacuum expectation value \({{\mathbf{a }}}= {\mathrm{diag}} \left( a_{1}, \ldots , a_{N} \right) \) of the scalar in the vector multiplet, the masses \({{\mathbf{m }}} = \left\{ m_{1}, m_{2}, \ldots , m_{2N-1}, m_{2N} \right\} \), the exponentiated complexified gauge coupling \({{\mathfrak {q}}} = {\exp } (2\pi \mathrm{i}\tau )\),
and the parameters \({{\varepsilon }}=\left( {{\varepsilon }}_{1}, {{\varepsilon }}_{2} \right) \) of the \(\Omega \)-deformation. The latter are the equivariant parameters of the maximal torus \(U(1) \times U(1)\) of the Euclidean rotation group Spin(4). In the complex coordinates \((z_{1}, z_{2})\) on \({{{\mathbb {C}}}}^2 \approx {{{\mathbb {R}}}}^4\), the rotational symmetry acts by \((z_{1}, z_{2}) \mapsto ( e^{{\mathrm{i}}t{{\varepsilon }}_{1}} \, z_{1}\, , \, e^{{\mathrm{i}}t{{\varepsilon }}_{2}} \, z_{2} )\). Exchanging \({{\varepsilon }}_{1} \leftrightarrow {{\varepsilon }}_{2}\) is part of the Spin(4) Weyl group; hence, it is a symmetry of \({\mathbf{Z}}\). The second observable is the partition function \({{\varvec{\Psi }}}\) of the regular surface defect which breaks the gauge group down to its maximal torus \(U(1)^{N-1}\) along the surface, which we shall take to be the \(z_2=0\) plane. This partition function depends on all the parameters \({{\mathbf{a }}}, {{\mathbf{m }}}, {{\varepsilon }}, {{\mathfrak {q}}}\) that the bulk partition function \(\mathbf{Z}\) depends on and, in addition, it depends on the parameters
of a two-dimensional theory the defect supports. The physics and mathematics setup of the problem are explained in [36, 37], which the reader may consult for motivations and orientation. However, our exposition is self-contained as a well-posed mathematical problem, which we introduce presently.
Our main result is the proof of a particular case of the BPS/CFT conjecture [29, 30]: the vacuum expectation value \(\langle \, {{\mathcal {S}}} \, \rangle \) of the surface defect obeys the Knizhnik–Zamolodchikov equation [22], specifically the equation obeyed by the \(\left( {\widehat{\mathfrak {sl}}}_{N} \right) _{k}\) current algebra conformal block
with the vertex operators corresponding to irreducible infinite-dimensional representations of \(\mathfrak {sl}_{N}\). More specifically, the vertex operators at 0 and \(\infty \) correspond to the generic lowest weight \({{\mathcal {V}}}_{\vec \nu }\) and highest weight \({{\tilde{{\mathcal {V}}}}}_{\vec {{{\tilde{\nu }}}}}\) Verma modules, while the vertex operators at \({\mathfrak {q}}\) and 1 correspond to the so-called twisted HW-modules \({{\mathcal {H}}}_{\mathrm{m}}^{\vec \mu }, {{\tilde{{\mathcal {H}}}}}_{{\tilde{\mathrm{m}}}}^{\vec {{{\tilde{\mu }}}}}\). The subscripts \({\vec \nu },{\vec {{{\tilde{\nu }}}}} \in {{{\mathbb {C}}}}^{N-1}\) and \({\mathrm{m}}, {\tilde{\mathrm{m}}} \in {{{\mathbb {C}}}}\) determine the values of the Casimir operators, in correspondence with the 2N masses \({\mathbf{m }}\) and one of the \(\Omega \)-background parameters \({\varepsilon }_1\). The superscripts \({\vec \mu }, \vec {{{\tilde{\mu }}}} \in {{{\mathbb {C}}}}^{N-1}\) determine the so-called twists of the HW-modules, all defined below, which we express via \({\mathbf{m }}, {\varepsilon }_1\), and the Coulomb parameters \({\mathbf{a }}\). In other words, the Coulomb parameters determine the analogue of the “intermediate spin”, which we indicate by placing a superscript \({\mathbf{a }}\) in (1) to label the specific fusion channel. We define these representations and the Knizhnik–Zamolodchikov equation [22].
The appearance of the twisted representations is a curious fact not visible in the rational conformal field theories.
2 Basic setup in four dimensions
First we introduce the setup of the four-dimensional gauge theory calculation.
2.1 Notations
We start by reviewing our notations. The reader is invited to consult [32,33,34,35,36,37] for the general orientation.
-
The parameters of the \(\Omega \)-deformation: \({\varepsilon }_1, {\varepsilon }_2\) – two complex parameters, generating the equivariant cohomology \(\text {H}^{\bullet }_{{{\mathbb {C}}}^\times \times {{\mathbb {C}}}^\times }(\text {pt})\). The twist part of the \(\Omega \)-deformation is \({\varepsilon }={\varepsilon }_1+{\varepsilon }_2\). The torus \({{{\mathbb {C}}}^\times \times {{\mathbb {C}}}^\times }\) is the complexification of the maximal torus of the spin cover of the rotation group Spin(4). We also define
$$\begin{aligned} {\kappa } = \frac{{{\varepsilon }}_{2}}{{{\varepsilon }}_{1}} \, . \end{aligned}$$(2) -
The Coulomb moduli:
$$\begin{aligned} {\mathbf{a }}= ( a_{b} )_{b =1}^{N} \equiv \left( a_1,\ldots ,a_N \right) \in {{{\mathbb {C}}}}^{N} \end{aligned}$$(3)–the equivariant parameters of the color group, in other words these are the generators of \(\text {H}^{\bullet }_{({{\mathbb {C}}}^\times )^N}(\text {pt})\), on which the symmetric group S(N) acts by permutations.
-
The masses:
$$\begin{aligned} {\mathbf{m }}= (m_f)_{f=1}^{2N} \equiv \left( m_1,\ldots ,m_{2N} \right) \in {{{\mathbb {C}}}}^{2N} \end{aligned}$$(4)–the equivariant parameters of the flavor group. The symmetric group S(2N) acts on them by permutations. The S(2N)-invariants are encoded via the polynomial
$$\begin{aligned} P(x)=\prod \limits _{f=1}^{2N} (x-m_{f})\, . \end{aligned}$$(5) -
The splitting of the set of masses into the N “fundamental” and N “anti-fundamental” ones:
$$\begin{aligned} P(x) = P^{+}(x) P^{-}(x) \, , \qquad P^{{\pm }}(x) = \prod _{f=1}^{N} \left( x-m_{f}^{{\pm }}\right) . \end{aligned}$$(6) -
The lattice of equivariant weights \({\Lambda } \subset {{{\mathbb {C}}}}\) is defined by:
$$\begin{aligned} {\Lambda } := {{\mathbb {Z}}}\cdot {\varepsilon }_1\oplus {{\mathbb {Z}}}\cdot {\varepsilon }_2\oplus \bigoplus _{{b}=1}^N {{\mathbb {Z}}}\cdot a_{b} \oplus \bigoplus _{f=1}^{2N} {{{\mathbb {Z}}}}\cdot m_{f}\, . \end{aligned}$$(7)We assume that all the parameters \({{\varepsilon }}_{1,2}, {{\mathbf{a }}}, {{\mathbf{m }}}\) are generic, up to the overall translation \(a_{b} \mapsto a_{b} + s,\ m_{f} \mapsto m_{f} + s\), for \(s \in {{{\mathbb {C}}}}\). Thus, the rank of \(\Lambda \) is at least \(3N +1\).
Recall that the bulk theory (subject to noncommutative deformation, leading to instanton moduli space being partially compactified to the moduli space \({{\mathfrak {M}}}_{k,N}\) of charge k rank N framed torsion-free sheaves on \({\mathbb {CP}}^2\)) is invariant under the nonabelian symmetry group U(2) of rotations, preserving the complex structure of \({{{\mathbb {C}}}}^2 \approx {{{\mathbb {R}}}}^4\). The group U(N) of constant gauge transformations acts on \({{\mathfrak {M}}}_{k,N}\) by changing the asymptotics of instantons at infinity. The Coulomb parameters \({\mathbf{a }}\) represent the maximal torus of U(N); they can be viewed as local coordinates on the \(\mathrm{Spec}\, \text {H}^{\bullet }_{U(N)} (\text {pt}) = {{{\mathbb {C}}}} [{{\mathbf{a }}}]^{S(N)}\), with S(N) being the Weyl group. Likewise, the parameters \(({{\varepsilon }}_{1}, {{\varepsilon }}_{2})\) are acted by the Weyl group \({{{\mathbb {Z}}}}_{2}\) which acts by permuting \({{\varepsilon }}_{1} \leftrightarrow {{\varepsilon }}_{2}\). The physical theory has a larger rotation symmetry group Spin(4), whose Weyl group is \({{{\mathbb {Z}}}}_{2} \times {{{\mathbb {Z}}}}_{2}\) but we don’t see the full symmetry in the \(\mathbf{Z}\)-function. The full symmetry is present once \(\mathbf{Z}\) is divided by the so-called U(1)-factor, having to do with decoupling of the U(1)-part of gauge group [1].
Finally, the masses represent the equivariant parameters of the flavor group SU(2N) (the physical theory has a larger flavor symmetry group, which we don’t see either); hence, the Weyl group S(2N) symmetry making the polynomial P(x) of (5) the good parameter.
The surface defect we are going to study in this paper breaks both the gauge group U(N) to its maximal torus \(U(1)^N\) and the flavor group to its maximal torus \(U(1)^{2N}\). The group \(S(N) \times S(2N)\) acts, therefore, on the space of surface defects. In describing the specific bases in the vector space of surface defects, we keep track of the ordering of the Coulomb and mass parameters.
-
The set of vertices of the Young graph \({{\mathcal {P}}}\)–the set of all Young diagrams (\(=\) partitions of nonnegative integers) \(\{\lambda \}\). Then
$$\begin{aligned} {{{\mathcal {P}}}}^N = \left\{ \, {\overline{\lambda }} = (\lambda ^{(1)},\ldots ,\lambda ^{(N)})\, \Big | \, {\lambda }^{(b)} \in {{\mathcal {P}}}\, \ \text {for}\, \ 1\le b \le N \, \right\} \, . \end{aligned}$$ -
For a box \(\square =(i,j)\), define its content \(c(\square )\) by:
$$\begin{aligned} c(\square ):=(i-1){\varepsilon }_1+(j-1){\varepsilon }_2\, . \end{aligned}$$(8) -
For \(\lambda \in {{\mathcal {P}}}\), define:
$$\begin{aligned} \chi _\lambda :=\sum _{\square \in \lambda } e^{c(\square )} \qquad \text {and} \qquad \chi ^*_\lambda :=\sum _{\square \in \lambda } e^{-c(\square )}\, . \end{aligned}$$(9) -
For \({\overline{\lambda }}\in {{\mathcal {P}}}^N\), define the multiset, i.e., its elements may have multiplicities, of tangent weights, \(\{ \mathrm{w}_t\}_{t\in T_{{\overline{\lambda }}}} \subset \Lambda \) by the character
$$\begin{aligned} \sum _{t\in T_{{\overline{\lambda }}}}e^{\mathrm{w}_{t}} := \sum _{b,c=1}^N \ e^{a_{b} - a_{c}}\, \Big ( \, \chi ^{*}_{{\lambda }^{({c})}}+e^{{\varepsilon }}\cdot \chi _{\lambda ^{({b})}}- (1-e^{{\varepsilon }_1})(1-e^{{\varepsilon }_2})\cdot \chi _{\lambda ^{({b})}} {\chi }^{*}_{{\lambda }^{({c})}}\, \Big )\, . \end{aligned}$$(10)
Remark 2.1
The duality: \(\{ \mathrm{w}_{t} \}_{t \in T_{{\overline{\lambda }}}} = \{ {\varepsilon }- \mathrm{w}_{t} \}_{t\in T_{{\overline{\lambda }}}}\) is related to the symplectic structure on the instanton moduli space and its completion \({{\mathfrak {M}}}_{k,N}\).
-
The pseudo-measure \(\mu ={\mu }({{\mathbf{a }}}, {{\mathbf{m }}}, {{\varepsilon }}_{1}, {{\varepsilon }}_{2}; {{\mathfrak {q}}}):{{{\mathcal {P}}}}^{N} \rightarrow {{{\mathbb {C}}}}\) on \({{\mathcal {P}}}^N\) is defined via:
$$\begin{aligned} \begin{aligned}&{\mu }({{\mathbf{a }}}, {{\mathbf{m }}}, {{\varepsilon }}_{1}, {{\varepsilon }}_{2}; {{\mathfrak {q}}})\, \vert _{{\overline{\lambda }}} := \, \frac{1}{\mathbf{Z}^{\text {inst}}}\cdot \left( (-1)^N {{\mathfrak {q}}}\right) ^{|{{{\overline{\lambda }}}}|}\cdot \frac{\prod \nolimits _{f=1}^{2N}\prod \nolimits _{b=1}^N\prod \nolimits _{\square \in \lambda ^{({b})}}\, \left( a_{b} + c(\square ) -m_f \right) }{\prod \nolimits _{t\in T_{{\overline{\lambda }}}} \mathrm{w}_{t}} \, \\&\quad = \frac{1}{\mathbf{Z}^{\text {inst}}}\cdot {{\mathfrak {q}}}^{|{{{\overline{\lambda }}}}|}\cdot \frac{\prod \nolimits _{b=1}^N\prod \nolimits _{\square \in \lambda ^{({b})}}\, \, \Big ( - P^{-}\left( a_{b} + c({\square }) \right) P^{+}\left( a_{b}+c(\square ) \right) \Big )}{\prod \nolimits _{t\in T_{{\overline{\lambda }}}} \mathrm{w}_{t}}\, , \end{aligned} \end{aligned}$$(11)
where \(|{{{\overline{\lambda }}}}| = \sum \nolimits _{b=1}^N |\lambda ^{({b})}|\) with
denoting the total number of boxes in \(\lambda ^{({b})}\), and \(\mathbf{Z}^{\text {inst}}=\mathbf{Z}^{\text {inst}}( {{\mathbf{a }}},{{\mathbf{m }}}, {{\varepsilon }}_{1},{{\varepsilon }}_{2}; {{\mathfrak {q}}})\) is the Taylor series in \({\mathfrak {q}}\) uniquely determined by the normalization
Remark 2.2
The restriction \(\mathrm{deg}\, P(x) = 2N\) comes from the convergence of \(\mathbf{Z}^{\text {inst}}\) for generic \({{\mathbf{a }}}, {{\mathbf{m }}}, {{\varepsilon }}_{1,2}\), cf. [9]. When working over the ring \({{{\mathbb {C}}}}[[{{\mathfrak {q}}}]]\) of formal power series in \({\mathfrak {q}}\), the restriction on the degree of P(x), i.e., the number of masses, can be dropped.
-
For \(\lambda \in {{\mathcal {P}}}\), we call \(\blacksquare \in \lambda \) a corner box if \(\lambda \backslash \blacksquare \in {{\mathcal {P}}}\) and we call \(\square \notin \lambda \) a growth box if \(\lambda \sqcup \square \in {{\mathcal {P}}}\). We denote by \({\partial }_{+}{\lambda }\) the set of all growth boxes of \(\lambda \), and by \({\partial }_{-}{\lambda }\) the set of all corner boxes of \({\lambda }\). It is easy to check that:
$$\begin{aligned} {\#} {\partial }_{+}{\lambda } \, -\, {\#}{\partial }_{-}{\lambda } \, = \, 1\, . \end{aligned}$$ -
For \(x\in {{{\mathbb {C}}}}\), we define the function Y(x) on \({{{\mathcal {P}}}}^{N}\) as follows: its value \(Y(x)\, \vert _{{\overline{\lambda }}}\) on \({{{\bar{\lambda }}}} \in {{{\mathcal {P}}}}^{N}\) is equal to
$$\begin{aligned} \begin{aligned} Y(x)\, \vert _{{\overline{\lambda }}}\,&:=\, \prod _{b=1}^N \left( (x-a_{b})\prod _{\square \in \lambda ^{(b)}} \frac{(x-a_{b}-c(\square )-{\varepsilon }_1)(x-a_{b}-c(\square )-{\varepsilon }_2)}{(x-a_{b}-c(\square ))(x-a_{b}-c(\square )-{\varepsilon })}\right) \\&= \, \prod _{b=1}^N \frac{\prod \nolimits _{{\square } \in {\partial }_{+}{\lambda }^{({b})}} \left( x - a_{b} - c ({\square }) \right) }{\prod \nolimits _{{\blacksquare } \in {\partial }_{-}{\lambda }^{({b})}} \left( x - a_{b} - {{\varepsilon }} - c ({\blacksquare }) \right) }\, , \end{aligned} \end{aligned}$$(12)the second line being obtained from the first one by the simple inspection of the cancelling common factors.
-
For \(x \in {{{\mathbb {C}}}}\), we define the function \({{\mathcal {X}}}(x)\) on \({{{\mathcal {P}}}}^N\), called the fundamental qq-character, by specifying its value \({{\mathcal {X}}}(x)\, \vert _{{\overline{\lambda }}}\) on \({{{\bar{\lambda }}}} \in {{{\mathcal {P}}}}^{N}\) as follows:
$$\begin{aligned} {{\mathcal {X}}} (x)\, \vert _{{\overline{\lambda }}}\, :=\ Y (x+{{\varepsilon }})\, \vert _{{\overline{\lambda }}}\, +\, {{\mathfrak {q}}}\, \frac{P(x)}{Y(x)\, \vert _{{\overline{\lambda }}}}\, . \end{aligned}$$(13) -
For a pseudo-measure \({\widetilde{\mu }}:{{\mathcal {P}}}^N\rightarrow {{\mathbb {C}}}\) and a function \(g:{{\mathcal {P}}}^N\rightarrow {{\mathbb {C}}}(x)\), the average \(\langle \, g(x)\, \rangle _{{\widetilde{\mu }}}\) is defined via:
$$\begin{aligned} \Big \langle g(x) \Big \rangle _{{\widetilde{\mu }}}\, := \sum _{{\overline{\lambda }}\in {{\mathcal {P}}}^N} {\widetilde{\mu }}\,\vert _{{\overline{\lambda }}}\cdot g(x)\,\vert _{{\overline{\lambda }}}\, . \end{aligned}$$(14)
2.2 Dyson–Schwinger equation
The following is the key property of \({{\mathcal {X}}} :{{{\mathcal {P}}}}^{N}\rightarrow {{{\mathbb {C}}}}(x)\) of (13):
Proposition 2.1
The average \(\langle \, {{\mathcal {X}}}(x)\, \rangle _{\mu }\) is a regular function of x.
This is the simplest case of the general result on the qq-characters as established in [32,33,34,35,36,37]. For completeness of our exposition, an elementary proof is presented in Appendix A.
2.3 An orbifold version
As explained in [35], there is a very important \({{\mathbb {Z}}}_N\)-equivariant counterpart of the above story. It is defined in several steps.
First, we change the notations:
Next, we introduce the \({{\mathbb {Z}}}_N\)-grading \({\lambda }\mapsto {\mathfrak S}_{\lambda }\in {{{\mathbb {Z}}}}_{N}\) of the lattice \(\Lambda \) via:
for some partitions
of the sets of the Coulomb moduli and the fundamental/anti-fundamental masses. Such \({{\mathbb {Z}}}_N\)-grading is also often called an N-coloring. We define:
The following depends on a choice of a section \({{{\mathbb {Z}}}}_{N}\rightarrow {{{\mathbb {Z}}}}\). We send
thus, identifying \({{\mathbb {Z}}}_N\) with \(\{0,\ldots ,N-1\}\), as a set. An N-coloring is called regular iff
For a regular N-coloring, the \(\omega \)-colored masses are packaged into a degree two polynomial
Also, for a regular N-coloring, assuming (16), we set:
where
We shall also need a few more new notations.
-
For every \({\omega }\in {{{\mathbb {Z}}}}_N\), define the observable \(k_{\omega }:{{{\mathcal {P}}}}^{N} \rightarrow {{\mathbb {Z}}}_{\ge 0}\) by:
$$\begin{aligned} k_{\omega }\,\vert _{{\overline{\lambda }}} \, := \sum _{b=1}^N \sum _{\square \in \lambda ^{(b)}} \delta _{{{\mathfrak {S}}}_{{\tilde{a}}_{b}+ {{{\tilde{c}}}}(\square )}}^{\omega }\, , \end{aligned}$$(20)where \({{{\tilde{c}}}}(i,j) := (i-1){{\varepsilon }}_{1} + (j-1){{{\tilde{{\varepsilon }}}}}_{2}\), cf. (8), and \(\delta _i^j \equiv {\delta }_{i,j}\) is the Kronecker delta.
-
The fractional couplings:
$$\begin{aligned} {\overline{{\mathfrak {q}}}} = ({\mathfrak {q}}_{\omega })_{\omega \in {{\mathbb {Z}}}_N} \equiv ({\mathfrak {q}}_0,{\mathfrak {q}}_1,\ldots ,{\mathfrak {q}}_{N-1})\in {{{\mathbb {C}}}}^{N}\, . \end{aligned}$$(21) -
Given \({\overline{{\mathfrak {q}}}}\) of (21), define the observable , called the fractional instanton factor, as follows:
(22) -
The pseudo-measure \(\mu ^{\mathrm{orb}} = \mu ^{\mathrm{orb}}({{{\tilde{{\mathbf{a }}}}}},{{{\tilde{{\mathbf{m }}}}}},{{\varepsilon }}_{1},{{{\tilde{{\varepsilon }}}}}_{2};{{\overline{{\mathfrak {q}}}}}) :{{{\mathcal {P}}}}^{N} \rightarrow {{{\mathbb {C}}}}\) on \({{{\mathcal {P}}}}^{N}\) is defined via:
(23)where the tangent weights \(\{\mathrm{w}_{t}\}_{t\in T_{{\overline{\lambda }}}}\) are defined via (10) with the substitution \(a_{b} \mapsto {{{\tilde{a}}}}_{b}\), \({\varepsilon }_2 \mapsto {{{\tilde{{\varepsilon }}}}}_{2}\), and the partition function \({{\varvec{\Psi }}}^{\text {inst}} = {{\varvec{\Psi }}}^{\text {inst}} ( {{{\tilde{{\mathbf{a }}}}}}, {{{\tilde{{\mathbf{m }}}}}}, {{\varepsilon }}_{1}, {{{\tilde{{\varepsilon }}}}}_{2}; {\overline{{\mathfrak {q}}}})\) is the formal power seriesFootnote 1 in \(\left( {{\mathfrak {q}}}_{0}, \ldots , {{\mathfrak {q}}}_{N-1} \right) \) uniquely determined by the normalization
$$\begin{aligned} \sum _{{\overline{\lambda }}\in {{\mathcal {P}}}^N}\, \mu ^{\mathrm{orb}}\, \vert _{{\overline{\lambda }}} \, = \, 1\, . \end{aligned}$$(24) -
For every \({\omega }\in {{\mathbb {Z}}}_N\), define the \({{{\mathbb {C}}}}(x)\)-valued observable \(Y_{\omega }:{{{\mathcal {P}}}}^{N} \rightarrow {{{\mathbb {C}}}}(x)\) via:
$$\begin{aligned} \begin{aligned} Y_{\omega }(x)\, \vert _{{\overline{\lambda }}}\,&:= \prod \limits _{b=1}^N \, \left( \left( x - {{{\tilde{a}}}}_{b} \right) ^{{\delta }_{{{\mathfrak {S}}}_{{{{\tilde{a}}}}_{b}}}^{\omega }} \right. \\&\quad \times \left. \prod \limits _{\square \in \lambda ^{(b)}} \left( \frac{x-{{{\tilde{a}}}}_{b}-{{{\tilde{c}}}}({\square })-{{\varepsilon }}_{1}}{x-{{{\tilde{a}}}}_b-{{{\tilde{c}}}}({\square })}\right) ^ {\delta _{{\mathfrak S}_{{{{\tilde{a}}}}_{b}+{{{\tilde{c}}}}({\square })}}^{{\omega }}} \left( \frac{x-{{{\tilde{a}}}}_{b}-{{{\tilde{c}}}}({\square })-{{{\tilde{{\varepsilon }}}}}_{2}}{x-{{{\tilde{a}}}}_b-{{{\tilde{c}}}}({\square })-{{{\tilde{{\varepsilon }}}}}}\right) ^ {\delta _{{{\mathfrak {S}}}_{{{{\tilde{a}}}}_{b}+{\tilde{c}}({\square })}}^{{\omega }-1}}\right) . \end{aligned} \end{aligned}$$(25) -
For every \({\omega } \in {{{\mathbb {Z}}}}_{N}\), define the \({{{\mathbb {C}}}}(x)\)-valued observable \({{\mathcal {X}}}_{\omega }:{{{\mathcal {P}}}}^N \rightarrow {{{\mathbb {C}}}}(x)\) via:
$$\begin{aligned} {{\mathcal {X}}}_{\omega }(x)\,\vert _{{\overline{\lambda }}}\, := \, Y_{{\omega }+1}(x+{{{\tilde{{\varepsilon }}}}})\,\vert _{{\overline{\lambda }}}\, +\, {{\mathfrak {q}}}_{\omega }\, \frac{P_{\omega }(x)}{Y_{{\omega }}(x)\,\vert _{{\overline{\lambda }}}}\, . \end{aligned}$$(26)
2.4 Surface defects
Consider a map
defined via
with
The geometric origin of \(\pi _N\) is explained in [32,33,34,35,36,37]. Note that \({\pi }_{1} = \text {Id}_{{{\mathcal {P}}}}\).
Following [32,33,34,35,36,37], let us now pass from \({\overline{{\mathfrak {q}}}} = ({{\mathfrak {q}}}_{0}, \dots , {{\mathfrak {q}}}_{N-1})\) of (21) to another set of variables, namely \({{\mathbf{w }}}=(w_{0} : w_{1} : \dots : w_{N-1})\) and \({{\mathfrak {q}}}\) via:
where the bulk coupling \({\mathfrak {q}}\) is recovered by:
The variables \({\mathbf{w }}\) are redundant, in the sense that correlation functions are invariant under the simultaneous rescaling of all w’s. However, just as the bulk coupling \({\mathfrak {q}}\) is identified below with the cross-ratio of four points on a sphere, thus revealing a connection to the 4-point function in conformal field theory, the variables w’s are identified with the coordinates of N particles, whose dynamics is described by the partition function \({{\varvec{\Psi }}}^{\text {inst}}\).
In terms of the \(({{\mathbf{w }}}, {{\mathfrak {q}}})\)-variables, the instanton factor looks as follows (recall that \(k_{-1} = k_{N-1}\)):
Evoking (16), we also have an obvious equality
Using the aforementioned map \(\pi _N\), we define the Surface defect observable \({{\mathcal {S}}}({{\mathbf{a }}},{{\mathbf{m }}},{{\varepsilon }}_{1},{{\varepsilon }}_{2};{{\mathbf{w }}},{{\mathfrak {q}}})\) in the statistical model defined by the pseudo-measure \(\mu \) of (11) via:
where, again with (16) understood,
Note that
evoking the notations of (17). The shifts (35) are motivated by the relation between the sheaves on the orbifold \({{{\mathbb {C}}}} \times {{{\mathbb {C}}}}/{{{\mathbb {Z}}}}_{N}\) and the covering space \({{{\mathbb {C}}}} \times {{{\mathbb {C}}}}\), see [23, 38]. In what follows, we shall not be using the observable (34). Instead, we shall work directly with the pseudo-measure \({\mu }^{\mathrm{orb}}\).
2.5 The key property of \({\mathcal {X}}_{{\omega }}\)
The following result [32,33,34,35,36,37] (whose proof is presented in Appendix A for completeness of our exposition) is a simple consequence of Proposition 2.1:
Proposition 2.2
The average \(\left\langle \, {\mathcal {X}}_{\omega }(x)\, \right\rangle _{\mu ^{\mathrm{orb}}}\) is a regular function of x for every \({\omega }\in {{\mathbb {Z}}}_N\).
For a power series \(F(x)=\sum _{\ell =-\infty }^\infty F_{\ell } x^{-\ell }\) and \(k\in {{\mathbb {Z}}}\), let \(\left[ x^{-k}\right] F(x)\) denote the coefficient \(F_k\). The regularity property of Proposition 2.2 implies the following result:
The is that the \(k=1\) case of Eq. (37) implies a second-order differential equation on the partition function \({\varvec{\Psi }}^{\text {inst}}\), viewed as a function of \({{\mathfrak {q}}}_0,\ldots ,{{\mathfrak {q}}}_{N-1}\). This differential equation is the subject of the following subsection.
2.6 The differential operator \({{\mathcal {D}}}^{\mathrm{BPS}}\)
To apply (37) for \(k=1\), we shall first explicitly compute \([x^{-1}]{\mathcal {X}}_{\omega }(x)\,\vert _{{\overline{\lambda }}}\). For every \({\omega }\in {{{\mathbb {Z}}}}_{N}\), define the observable \(c_{\omega , {\mathbf{a }}}:{{{\mathcal {P}}}}^{N} \rightarrow {{{\mathbb {C}}}}\) via:
Recalling (18, 35), so that in particular \(a_b={\varepsilon }_1 \alpha _{{{\mathfrak {S}}}_{{{{\tilde{a}}}}_{b}}}\) and \({{{\tilde{\kappa }}}}={{{\tilde{{\varepsilon }}}}}_2 / {{\varepsilon }_1}\), we get:
which implies:
Lemma 2.3
The large x expansion of the observable \(Y_{\omega }(x)\) has x as a leading term, while the next two coefficients are the observables \({{{\mathcal {P}}}}^{N}\rightarrow {{{\mathbb {C}}}}\) given explicitly by:
As an immediate corollary, using notations (2, 17, 18), we obtain:
Proposition 2.4
The observable \(\left[ x^{-1}\right] {\mathcal {X}}_{\omega }(x) :{{{\mathcal {P}}}}^{N} \rightarrow {{{\mathbb {C}}}}\) is explicitly given by:
To get rid of the observables \(c_{\omega , {\mathbf{a }}}\)’s (38) in the right-hand side of (39), we introduce, following [32,33,34,35,36,37], the functions \(\{ U_{\omega }\}_{\omega \in {{\mathbb {Z}}}_N}\) via:
with the conventions \(U_{\omega + N} = U_{\omega }\) being used. They provide a (unique up to a common factor) solution of the following linear system:
We also note that
Due to the key property (41) of \(U_\omega \)’s, the coefficient of \(x^{-1}\) in the observable \(\sum _{\omega \in {{\mathbb {Z}}}_N} U_{\omega }\, {\mathcal {X}}_{\omega }(x)\) is a degree-two polynomial in the instanton charges \(\{k_{\omega }\}_{\omega \in {{\mathbb {Z}}}_N}\). Therefore,
with \(D^{\text {inst}}\), a second-order differential operator in \({{\mathfrak {q}}}_{\omega }\)’s, naturally arising from the equality
due to (23, 24). We can further express \(D^\text {inst}\) as a differential operator in \({\mathfrak {q}}\) and \(w_\omega \)’s by using
It is convenient to introduce the normalized partition function \({{\varvec{\Psi }}}\) via:
where
Combining Propositions 2.2, 2.4 with formulae (41) and (42), we get (cf. [32, 37]):
Theorem 2.5
The normalized partition function \({{\varvec{\Psi }}} = {{\varvec{\Psi }}}({\tilde{\mathbf{a }}}, {{{\tilde{{\mathbf{m }}}}}}, {\varepsilon }_1, {{{\tilde{{\varepsilon }}}}}_2; {\mathbf{w }},{\mathfrak {q}})\) of (44) satisfies the equation
with the second-order differential operator \({{\mathcal {D}}}^{\mathrm{BPS}}\) explicitly given by [cf. (2)]
where \({{{\hat{H}}}}_{0}, {{{\hat{H}}}}_{1}\) are the second-order differential operators in \(w_\omega \)’s, \({\mathfrak {q}}\) and \(\alpha _\omega \)’s:
with
Remark 2.3
Note that \({{\varvec{\Psi }}}^{\mathrm{inst}}\) is a single-valued homogeneous function of \(w_\omega \)’s. If we wrote the differential equation obeyed by \({{\varvec{\Psi }}}^{\mathrm{inst}}\) in the original variables \({{\mathfrak {q}}}_{0}, \ldots , {{\mathfrak {q}}}_{N-2}, {{\mathfrak {q}}}_{N-1}\), it would not contain any ambiguity due to the redundant nature of the variables \(w_0, \ldots , w_{N-1}\). However, the equations written in the invariant variables, such as the variables \(v_i\) introduced below, look more complicated. Conversely, by introducing more degrees of freedom with additional symmetries, modifying accordingly the prefactor \({{\varvec{\Psi }}}^{\mathrm{tree}}\), one arrives at a very simple form of the operators \({{{\hat{H}}}}_{0}, {{{\hat{H}}}}_{1}\), cf. Theorem 3.1. This is known as the projection method in the theory of many-body systems [43].
Remark 2.4
The normalized partition function \({\varvec{\Psi }}\) obeys:
The operators \({{{\hat{H}}}}_{0}, {{{\hat{H}}}}_{1}\) in (47) are therefore defined up to addition of the second-order differential operators of the form
with a first-order differential operator \({\mathfrak {D}}_{1}\). The choice (47) is uniquely characterized by its \(\alpha _\omega \)-independence, for any \(\omega \).
2.7 One more coordinate change
For the purpose of the next section, it will be convenient to use the coordinates
and the associated quantities
with
Define the \({{{\mathbb {C}}}}[[v_{1}^{{\pm } 1}, v_{2}^{{\pm } 1}, \ldots , v_{N-1}^{{\pm } 1} ]]\)-valued power series in \({\mathfrak {q}}\) by:
where we intentionally omit the parameters \({{{\tilde{{\mathbf{a }}}}}}, {\tilde{\mathbf{m }}}, {\varepsilon }_1, {{{\tilde{{\varepsilon }}}}}_2\) in the right-hand side and note that
The following is a straightforward reformulation of Theorem 2.5 in the present setting:
Theorem 2.6
The function \(\psi ={\psi }(v_{1}, v_{2} , \ldots , v_{N-1} ; {{\mathfrak {q}}} )\) satisfies the equation
with
with the residues of the meromorphic connection \(\nabla ^{\mathrm{bps}}\) at \({\mathfrak {q}}= 0\) and \({\mathfrak {q}}= 1\) having the decomposition:
with the kinetic, magnetic, and potential terms given by:
where we defined
and
Remark 2.5
The operator \({\nabla }^{\mathrm{bps}}\) of (56) depends, explicitly, on \({\vec \mu }, {\delta }{\vec \mu }, {\vec \alpha }\). However, Theorem 2.5 shows that the \({\vec \alpha }\) dependence is a pure gauge:
where [cf. (45)]
3 The CFT side, or the projection method
The operator \({{{\hat{h}}}}^{\mathrm{bps}}_{0}/{{\mathfrak {q}}} + {{{\hat{h}}}}^{\mathrm{bps}}_{1}/({{\mathfrak {q}}}-1)\) of (56) can be viewed as a time-dependent Hamiltonian of a quantum mechanical system with \(N-1\) degrees of freedom \(v_1, \ldots , v_{N-1}\). The parameters \(\vec {\mu }=(\mu _0,\dots ,\mu _{N-1}), \delta \vec {\mu }=(\delta \mu _0,\dots ,\delta \mu _{N-1})\) play the rôle of the coupling constants, while the parameters \(\vec {\alpha }=(\alpha _0,\dots ,\alpha _{N-1})\) play the rôle of the spectral parameters, such as the asymptotic momenta of N particles, in the center-of-mass frame, where the interactions between the particles can be neglected.
The BPS/CFT correspondence [29, 30] suggests to look for the representation-theoretic realization of the operators \({{{\hat{h}}}}^{\mathrm{bps}}_{0}\) and \({{{\hat{h}}}}^{\mathrm{bps}}_{1}\).
We present such a realization below.
3.1 Flags, co-flags, lines, and co-lines
Let \(W \approx {{{\mathbb {C}}}}^{N}\) be the complex vector space of dimension N, and let \(W^*\) denote its dual. Let \(F(W),\, F(W^{*}),\, {{{\mathbb {P}}}}(W),\, {{{\mathbb {P}}}}(W^*)\) denote the space of complete flags in W, the space of complete flags in \(W^{*}\), the projective space of lines in W, and the projective space of lines in \(W^*\), respectively. The natural action of the general linear group GL(W) on W and \(W^*\) gives rise to canonical actions of GL(W) on those four projective varieties. Let \(J^{a}_{b},\, {{{\tilde{J}}}}^{a}_{b},\, V^{a}_{b},\, {{{\tilde{V}}}}^{a}_{b}\), with \(a, b = 1, \ldots , N\), denote the vector fields on \(F(W),\, F(W^{*}),\, {{{\mathbb {P}}}}(W),\, {{{\mathbb {P}}}}(W^{*})\), respectively, representing those actions. Here, to define those vector fields, we need to choose some basis \(\{e_{a}\}_{a = 1}^{N}\) in W, with the dual basis in \(W^*\) denoted by \(\{{{{\tilde{e}}}}^{b}\}_{b = 1}^{N}\), so that the operators
represent the action of the Lie algebra of GL(W) on W. They obey the \(\mathfrak {gl}_N\) commutation relations:
to which we shall refer in what follows.
We define the second-order differential operators \({{{\hat{h}}}}_{0}, {{{\hat{h}}}}_{1}\) on the product
by
These operators are independent of the choice of the basis in W and are globally well defined on \({{\mathcal {X}}}\). Furthermore, they commute with the diagonal action of GL(W) on \({\mathcal {X}}\):
Note that the center of GL(W) acts trivially on \({\mathcal {X}}\), hence a natural action of PGL(W) on \({\mathcal {X}}\).
3.2 The v-coordinates
Let us now endow W with the volume form \(\varpi \in {\Lambda }^{N} W^{*}\). Denote
Let \(H = SL(W, {\varpi }) \approx SL(N, {{{\mathbb {C}}}})\) denote the group of linear transformations of W preserving \(\varpi \). The center \(Z(H)\simeq {{\mathbb {Z}}}_N\) of \(H\subset GL(W)\) is finite and acts trivially on \({\mathcal {X}}\). There is an H-invariant open subset \({{\mathcal {X}}}^{\circ }\) (described in (78)) of \({\mathcal {X}}\), on which the action of H/Z(H) is free. The corresponding quotient \({{\mathcal {X}}}^{\circ }/H\) can be coordinatized by the values of \(N-1\) functions \(v_{1}, \ldots , v_{N-1}\), defined as follows:
where
is the collection consisting of a pair
of flags in W and \(W^{*}\), respectively, and another pair
of lines in W and \(W^{*}\); and finally,
are the corresponding i-polyvector and the i-form on W, both defined up to a scalar multiplier. Note that these scalar factor ambiguities cancel out in (69).
We can also view \(v_i\)’s as meromorphic functions on \({{\mathcal {X}}}/H\). To this end, we promote \(\pi _i, {{{\tilde{\pi }}}}^i, {\mathrm{z}}, {{\tilde{\mathrm{z}}}}\) to global objects, the canonical holomorphic sections of the corresponding vector bundles:
and
and define
We also note that while (69, 75) can be extended to \(i = N\), the corresponding quantity \(v_N\) satisfies
due to the Desnanot–Jacobi–Dodgson–Sylvester theorem, which states that
The open set \({\mathcal {X}}^{\circ }\subset {\mathcal {X}}\) has the following description: there exists a basis \(e_a\) in W such that
We note that the aforementioned equality (76) is obvious in this basis, since
Remark 3.1
The flag varieties F(W) and \(F(W^{*})\) are isomorphic. For example, the assignment \(W_{i} = {{{\tilde{W}}}}^{\perp }_{N-i}\) gives rise to an isomorphism \(F(W^{*}){{\mathop {\longrightarrow }\limits ^{\sim }}}F(W)\). Alternatively, fixing the volume form \(\varpi \in {\Lambda }^{N}W^{*}\), we have an SL(W)-equivariant isomorphism \(F(W){{\mathop {\longrightarrow }\limits ^{\sim }}}F(W^{*})\) given by:
Remark 3.2
In the \(N=2\) case, we have \(F(W) \simeq F(W^*) \simeq {{{\mathbb {P}}}}(W) \simeq {{{\mathbb {P}}}}(W^*)\), and the only nontrivial coordinate \(v_1\) of (69) is determined by the usual cross-ratio of four points on \({{{\mathbb {C}}}{{\mathbb {P}}}}^1\). More precisely, if \(z_1, z_2, z_3, z_4 \in W\) are defined (each up to a scalar multiplier) by:
then
depends only on the four points \({{{\mathbb {C}}}}z_{i} \in {{{\mathbb {P}}}}(W)\).
3.3 The \({{\mathfrak {L}}}\)-twist
Let \(L_{1}, \ldots , L_{N-1}\) denote the tautological line bundles over F(W), the fiber of \(L_i\) over the point \(0 = W_{0} \subset W_{1} \subset W_{2} \subset \ldots \subset W_{N-1} \subset W_{N} \equiv W\) being
Similarly, let \({{{\tilde{L}}}}^1, \ldots , {{{\tilde{L}}}}^{N-1}\) denote the tautological line bundles over \(F(W^{*})\), and
be the tautological line bundles over \({{{\mathbb {P}}}}(W),\, {{{\mathbb {P}}}}(W^{*})\), respectively. We note that
All these line bundles are GL(W)-equivariant. By abuse of notation, we shall use the same notations for the pull-backs of the aforementioned line bundles to \({\mathcal {X}}\) of (65) under the natural projections. The line bundles \({{{\tilde{{\mathcal {L}}}}}}^{-1} \otimes {\Lambda }^{a-1}{{{\tilde{W}}}}_{a-1} \otimes ({\Lambda }^{a} W_{a})^{-1}\), \({\Lambda }^{a}{{{\tilde{W}}}}_{a} \otimes {{\mathcal {L}}}^{-1} \otimes ({\Lambda }^{a-1} W_{a-1})^{-1}\), \({\Lambda }^{i} {\tilde{W}}_{i} \otimes ({\Lambda }^{i} W_{i})^{-1}\), and \({{{\tilde{{\mathcal {L}}}}}}^{-1} \otimes {{\mathcal {L}}}^{-1}\) on \({\mathcal {X}}\) are H-invariant (and those with \(a< N\) are actually GL(W)-invariant). Furthermore, each factor in formula (75) can be viewed as a holomorphic section of one of those line bundles. For example,
is a holomorphic section of \(\mathrm{det} ( {{{\tilde{W}}}}_{a}) \otimes {{\mathcal {L}}}^{-1} \otimes \mathrm{det}(W_{a-1})^{-1}\). Its zeroes determine the locus in \({\mathcal {X}}\) where the plane \(W_{a-1}\), the line \({{{\mathbb {C}}}}z\), and the plane \({{{\tilde{W}}}}_{a}^{\perp } \subset W\) are not in general position, i.e., their linear span does not coincide with the entire W. Let \(\Sigma \subset {{\mathcal {X}}}^{\circ }\) denote the union of vanishing loci of \({{{\tilde{\Pi }}}}^{a} ( Z \wedge {\Pi }_{a-1} ),\, ({{{\tilde{Z}}}} \wedge {{{\tilde{\Pi }}}}^{a-1}) ({\Pi }_{a}),\, {\tilde{\Pi }}^{i}({\Pi }_{i})\) for \(a = 1, \ldots , N\) and \(i = 1, \ldots , N-1\).
For \({\vec n}, {\vec {{{\tilde{n}}} }} \in {{{\mathbb {C}}}}^{N},\ {\vec \gamma } \in {{{\mathbb {C}}}}^{N-1}\), consider the tensor product of “complex powers of line bundles”
defined on any simply-connected open domain \({{\mathcal {U}}} \subset \left( {{\mathcal {X}}}^{\circ }\backslash \Sigma \right) /H\). Here, the complex numbers \({\mathrm{m}}, {{\tilde{\mathrm{m}}}} \in {{{\mathbb {C}}}}\) and the vectors \({\vec \nu }, {\vec {{{\tilde{\nu }}}}} \in {{{\mathbb {C}}}}^{N-1}\) are defined via:
Our main result is:
Theorem 3.1
The operators \({{{\hat{h}}}}^{\mathrm{bps}}_{0}, {{{\hat{h}}}}^{\mathrm{bps}}_{1}\) of (56) coincide with the operators \({{{\hat{h}}}}^{\mathrm{cft}}_{0}, {{{\hat{h}}}}^{\mathrm{cft}}_{1}\), which are \({{{\hat{h}}}}_{0}, {{{\hat{h}}}}_{1}\) of (66), viewed now as the differential operators on \({{\mathcal {X}}}^{\circ }/H\), twisted by the “line bundle” \({{\mathfrak {L}}}\):
where
is the holomorphic section of \({\mathfrak {L}}\) on \({{\mathcal {U}}}\). The parameters \(\vec {n}, \vec {{{\tilde{n}}}}, \vec {\gamma }\) are related to the parameters \(\vec {\mu }, \delta {\vec \mu }\) and \(\vec {\alpha }\) (which encode the mass parameters \({\mathbf{m }}\) and the Coulomb parameters \({\mathbf{a }}\) via (17, 36) and (18), respectively) as follows:
for \(b=1, \ldots , N\) and \(i = 1, \ldots , N-1\).
For future use, let us record the relation between the parameters of the gauge theory and the parameters \({\vec \nu }, {\vec {{{\tilde{\nu }}}}}, {\mathrm{m}} , {{\tilde{\mathrm{m}}}}\) of (87):
where we used (18, 36) and the second formula of (35).
3.4 Proof of Theorem 3.1
The vector fields \(V^{a}_{b}, {{{\tilde{V}}}}^{a}_{b}\) can be explicitly written in the homogeneous coordinates \(({\mathrm{z}}^1:{\mathrm{z}}^2: \cdots :{\mathrm{z}}^N)\) on \({{{\mathbb {P}}}}(W)\) and \(({{\tilde{\mathrm{z}}}}_{1}:{{\tilde{\mathrm{z}}}}_{2} : \cdots : {{\tilde{\mathrm{z}}}}_{N})\) on \({{{\mathbb {P}}}}(W^{*})\):
so that \({{{\hat{h}}}}_{1}\) of (66) is explicitly given by:
where
The minus sign in (92) in the formula for \(V_{a}^{b}\) does match the commutation relations (64). This minus sign is due to the fact that the vector space of polynomials in \(z^a\)’s is the symmetric algebra built on \(W^*\), while that of polynomials in \({{\tilde{z}}}_a\)’s is built on W. Thus, (92) is the infinitesimal version of the group action, where \(h \in GL(W)\) acts on \(f = f({\mathrm{z}}), {{{\tilde{f}}}} = {{{\tilde{f}}}}({{\tilde{\mathrm{z}}}})\) via \(f \mapsto f^{h}, {{{\tilde{f}}}} \mapsto {{{\tilde{f}}}}^{h}\):
As for \(J^{a}_{b}, {{{\tilde{J}}}}^{a}_{b}\), let us first recall the quiver description of the flag varieties \(F(W), F(W^{*})\). Let \(F_{1}, {{{\tilde{F}}}}_{1}, \ldots , F_{N-1}, {{{\tilde{F}}}}_{N-1}\) be the sequence of complex vector spaces with \(\mathrm{dim}\, F_{i} = \mathrm{dim}\, {{{\tilde{F}}}}_{i} = i\). Consider the vector spaces of linear maps:
where we set \(F_{N}=W\) and \({{{\tilde{F}}}}_{N}=W\). Consider the groups
of linear transformations of the respective vector spaces. The groups \({\mathcal {G}}\), \({{{\tilde{{\mathcal {G}}}}}}\) act on \({{\mathcal {A}}}, {{{\tilde{{\mathcal {A}}}}}}\), respectively, in the natural way:
where \(g_{i} \in GL(F_{i}),\, U_{i}:F_{i} \rightarrow F_{i+1},\, {{{\tilde{g}}}}_{i} \in GL({{{\tilde{F}}}}_{i}),\, {{{\tilde{U}}}}_{i}:{{{\tilde{F}}}}_{i+1} \rightarrow {{{\tilde{F}}}}_{i}\), and \(g_N, {{{\tilde{g}}}}_{N}\) are vacuous. Then, the flag variety F(W) is the quotient of the open subvariety \({\mathcal {A}}^{s}\) of \({\mathcal {A}}\), consisting of the collections \(\left( U_{i} \right) _{i=1}^{N-1}\) for which the composition \(U_{N-1} U_{N-2} \cdots U_{i}:F_{i} \rightarrow W\) has no kernel for any \(i=1,\dots ,N-1\), by the free action of \({{\mathcal {G}}}\):
We can represent the \(\pi _i\)’s of (72), in coordinates, as:
Here, \(\Big [ U_{N-1}U_{N-2}\cdots U_{i} \Big ]^{a_k}_{\ell }\) denote the matrix coefficients of the corresponding linear operator with respect to some bases \(\{{{\varepsilon }}_{\ell }^{(i)}\}_{\ell =1}^i\) in \(F_i\) and the chosen basis \(\{e_a\}_{a=1}^N\) in W. Note that the group \({\mathcal {G}}\) acts on \({{\mathcal {A}}}^{s}\) by the changes of bases \(\{{{\varepsilon }}_{\ell }^{(i)}\}_{\ell =1}^i\) in each \(F_i\): \({{\varepsilon }}_{\ell }^{(i)} \mapsto \sum _{m=1}^{i} g_{i | \ell }^{m} {{\varepsilon }}_{m}^{(i)}\). This results in \(U_{N-1}U_{N-2}\cdots U_{i}\) being multiplied on the right by \(g_{i}^{-1}\); hence, according to (101), the \({\pi }_{i}\)’s are transformed via:
thus justifying the \(\mathrm{det}(W_{i})^{-1}\) factor in (73). The group GL(W) acts on \({\mathcal {A}}\) via:
This GL(W)-action preserves \({{\mathcal {A}}}^s\subset {\mathcal {A}}\) and also commutes with the \({\mathcal {G}}\)-action. The resulting action of GL(W) on \({{\mathcal {A}}}^s/{{\mathcal {G}}}\) clearly coincides with the natural action of GL(W) on \(F(W)={{\mathcal {A}}}^s/{{\mathcal {G}}}\). Accordingly, the GL(W)-action on functions on F(W) is given by:
This means that the vector field \(J_{a}^{b}\in Vect(F(W))\) representing the action of the element \(T_a^b=e_{a} \otimes {\tilde{e}}^{b}\in \mathfrak {gl}(W)\) on functions on F(W) is given by (cf. the first formula of (92)):
where \(U_{N-1|m}^{a}\) are the matrix coefficients of \(U_{N-1}:F_{N-1} \rightarrow W\) defined via:
Up to a compensating infinitesimal \(g_i\)-transformation, the vector field \(J_{a}^{b}\) acts on \({\pi }_{i}\) (more precisely, on functions of \(\pi _i\) viewed as functions on F(W)) by:
To clarify, the right-hand side of (105) should be viewed as a descent of the \({\mathcal {G}}\)-equivariant vector field on \({{\mathcal {A}}}^s\), given by the same formula, to the quotient space \({{\mathcal {A}}}^s/{{\mathcal {G}}}=F(W)\). The attentive reader will be content to see that the minus sign in (105) is needed to match the commutation relations (64).
Likewise, the flag variety \(F(W^*)\) admits the quotient realization:
where the open subvariety \({{{\tilde{{\mathcal {A}}}}}}^{s}\) of \({{{\tilde{{\mathcal {A}}}}}}\) consists of the collections \(\left( {{{\tilde{U}}}}_{i} \right) _{i=1}^{N-1}\) for which the composition \({{{\tilde{U}}}}_{i} {{{\tilde{U}}}}_{i+1} \cdots {{{\tilde{U}}}}_{N-1}:W \rightarrow {{{\tilde{F}}}}_{i}\) has no cokernel (i.e., has the maximal rank) for any \(i=1,\dots ,N-1\), and the action of \({{{\tilde{{\mathcal {G}}}}}}\) on \({{{\tilde{{\mathcal {A}}}}}}^{s}\) is free. We can represent the \({{{\tilde{\pi }}}}^i\)’s of (72), in coordinates, as:
Here, \(\left[ {{{\tilde{U}}}}_{i}{{{\tilde{U}}}}_{i+1}\cdots {{{\tilde{U}}}}_{N-2} {{{\tilde{U}}}}_{N-1} \right] ^{\ell }_{a_{k}}\) denote the matrix coefficients of the corresponding linear operator with respect to some bases \(\{{{{\tilde{{\varepsilon }}}}}_{\ell }^{(i)}\}_{\ell =1}^i\) in \({\tilde{F}}_i\) and the bases \(\{e_a\}_{a=1}^N\) in W which is dual to the chosen basis \(\{ {{{\tilde{e}}}}^a\}_{a=1}^N\) in \(W^{*}\). Note that the group \({{{\tilde{{\mathcal {G}}}}}}\) acts on \({{{\tilde{{\mathcal {A}}}}}}^{s}\) by the changes of bases \(\{{{{\tilde{{\varepsilon }}}}}_{\ell }^{(i)}\}_{\ell =1}^i\) in each \({\tilde{F}}_i\): \({{{\tilde{{\varepsilon }}}}}_{\ell }^{(i)} \mapsto \sum _{m=1}^{i} {\tilde{g}}_{i | \ell }^{m} {{{\tilde{{\varepsilon }}}}}_{m}^{(i)}\). This results in \({\tilde{U}}_{i}{{{\tilde{U}}}}_{i+1}\cdots {{{\tilde{U}}}}_{N-2} {{{\tilde{U}}}}_{N-1}\) being multiplied on the left by \({{{\tilde{g}}}}_{i}\); hence, according to (109), the \({{{\tilde{\pi }}}}^{i}\)’s are transformed via:
thus justifying the \(\mathrm{det}({{{\tilde{W}}}}_{i})\) factor in (73). The group GL(W) acts on \({{{\tilde{{\mathcal {A}}}}}}\) via:
This action preserves \({{{\tilde{{\mathcal {A}}}}}}^s\subset {{{\tilde{{\mathcal {A}}}}}}\) and also commutes with the \({{{\tilde{{\mathcal {G}}}}}}\)-action. The resulting action of GL(W) on \({{{\tilde{{\mathcal {A}}}}}}^s/{{{\tilde{{\mathcal {G}}}}}}\) clearly coincides with the natural action of GL(W) on \(F(W^*)={{{\tilde{{\mathcal {A}}}}}}^s/{{{\tilde{{\mathcal {G}}}}}}\), see (108). Therefore, the vector field \({{{\tilde{J}}}}_{a}^{b}\in Vect(F(W^*))\) representing the action of the element \(T_a^b=e_{a} \otimes {{{\tilde{e}}}}^{b}\in \mathfrak {gl}(W)\) on \(F(W^*)\) is given by (cf. the second formula of (92)):
where \({{{\tilde{U}}}}_{N-1|a}^{m}\) are the matrix coefficients of \({{{\tilde{U}}}}_{N-1}:W\rightarrow {{{\tilde{F}}}}_{N-1}\) defined via:
To clarify, the right-hand side of (112) should be viewed as a descent of the \({{{\tilde{{\mathcal {G}}}}}}\)-equivariant vector field on \({{{\tilde{{\mathcal {A}}}}}}^s\), given by the same formula, to the quotient space \({{{\tilde{{\mathcal {A}}}}}}^s/{{{\tilde{{\mathcal {G}}}}}}=F(W^*)\). The attentive reader will be content to see that the commutation relations (64) are obeyed by \({{{\tilde{J}}}}_{a}^{b}\) of (112).
3.5 End of proof of Theorem 3.1
It remains to compute the action of the operators \({\Upsilon }^{-1} {{{\hat{h}}}}_{p} {\Upsilon }\) in the coordinates \(v_i\), and then to compare formulas (264, 265) in Appendix B to formulas (57, 58). We leave this straightforward computation to the interested reader.
4 Representation theory
Let us now explain the representation-theoretic meaning of the main Theorem 3.1. Namely, we identify the function \(\Phi \), given by
for any \({\mathfrak {q}}\), with the \(\mathfrak {sl}_N\)-invariant in the completed tensor product
of four irreducible infinite-dimensional representations \(\{V_i\}_{i=1}^{4}\) of the Lie algebra \(\mathfrak {sl}_N\).
We shall actually define \(V_i\)’s as representations of \(\mathfrak {gl}_N\). Let us denote the generators of \(\mathfrak {gl}_N\) by \({\mathbf{J}}_{a}^{b}\), with \(a, b = 1, \ldots , N\). These obey the commutation relations (64):
Notation 4.1
For a Lie algebra \({\mathfrak {g}}\), its element \(\xi \in {\mathfrak {g}}\), and a representation R of \({\mathfrak {g}}\), we denote by \(T_{R}({\xi }) \in \mathrm{End}(R)\) the linear operator in R, corresponding to \(\xi \).
It is well known that (116) implies that the Casimir operators
commute with all generators \({\mathbf{J}}^{a}_{b}\), so that in every irreducible \(\mathfrak {gl}_N\)-representation R the operator \({{\mathcal {C}}}_{k}\) acts via a multiplication by a scalar \(c_{k}(R)\), also commonly known as the k-th Casimir of R:
Notation 4.2
The Lie algebra \(\mathfrak {sl}_N\) is a subalgebra of \(\mathfrak {gl}_N\) with a basis consisting of \({\mathbf{J}}_{a}^{b}\), with \(a \ne b\), and
Notation 4.3
The Chevalley generators of \(\mathfrak {sl}_N\) are formed by \({{{\mathfrak {h}}}}_i\)’s, and
also for \(i = 1, \ldots , N-1\).
The elements \({{{\mathfrak {e}}}}_{i}\) generate, via commutators, the Lie subalgebra \({\mathfrak {n}}_{+}\) of \(\mathfrak {sl}_N\). As a vector space, \({\mathfrak {n}}_{+}\) has a basis consisting of \({\mathbf{J}}_{a}^{b}\) with \(b > a\). Likewise, the elements \({{{\mathfrak {f}}}}_{i}\) generate the Lie subalgebra \({\mathfrak {n}}_{-}\) which, as a vector space, has a basis consisting of \({\mathbf{J}}_{a}^{b}\) with \(b < a\).
Remark 4.1
With a slight abuse of notation, when this does not lead to a confusion, below we shall also denote by \({{{\mathfrak {h}}}}_{i}, {{{\mathfrak {f}}}}_i, {{{\mathfrak {e}}}}_i\) the corresponding operators
in a \(\mathfrak {gl}_N\)-module R.
4.1 Verma modules
4.1.1 Lowest weight module
For a generic \({\vec \nu } \in {{{\mathbb {C}}}}^{N-1}\), the lowest weight Verma \(\mathfrak {sl}_N\)-module \({{\mathcal {V}}}_{\vec \nu }\) is defined, algebraically, as follows. There is a vector \({\Omega }_{\vec \nu } \in {{\mathcal {V}}}_{\vec \nu }\), which obeys:
and:
and which generates \({{\mathcal {V}}}_{\vec \nu }\), i.e., \({{\mathcal {V}}}_{\vec \nu }\) is spanned by polynomials in \({\mathbf{J}}^{a}_{b}\), with \(a > b\), acting on \({\Omega }_{\vec \nu }\). Geometrically, \({{\mathcal {V}}}_{\vec \nu }\) can be realized as the space of analytic functions \({\Psi }\) of \(\left( U_{i} \right) _{i=1}^{N-1}\), obeying:
where \(g_N\) is vacuous and \({{\mathcal {G}}}^{\mathrm{formal}}\) denotes the group of formal exponents \(g_{i} = {\exp } \, h {\xi }_{i}\) with \({\xi }_{i} \in \mathrm{End}(F_{i})\) and h being a nilpotent parameter.
Remark 4.2
For \({\vec \nu } \in {{{\mathbb {Z}}}}^{N-1}\), the equation (124) makes sense for \((g_{i})_{i=1}^{N-1} \in {{\mathcal {G}}}\). For \({\vec \nu } \in {{{\mathbb {Z}}}}^{N-1}_{\ge 0}\), the polynomial solutions to the equation (124) are in one-to-one correspondence with the holomorphic sections of the following line bundle on the complete flag variety F(W):
For our chosen basis \(\{e_{a}\}_{a=1}^N\) of W, consider the i-form \({{{\tilde{\pi }}}}^{i}_{0}\) defined via:
Then,
(here, the index b runs through the labels of the first i basis vectors \(e_b\) in W, while the index a runs through the labels of a basis \({{\varepsilon }}_{a}^{(i)}\) in \(F_i\)) clearly satisfies (124). Furthermore, using \({{{\tilde{\pi }}}}^{i}_{0} \left( e_{a} \wedge {{{\tilde{e}}}}^{b} {\pi }_{i} \right) = 0\) unless \(i\ge a\) and \(b>i\) for \(a\ne b\), we get (122) and (123), due to (107).
The Lie algebra \(\mathfrak {gl}_N\) acts on the space of analytic functions \(\Psi =\Psi [U_i]\) by vector fields, viewed as the first-order differential operators, via (105):
We can easily compute the first two Casimirs of \({{\mathcal {V}}}_{\vec \nu }\):
Now, obviously \(\Omega _{\vec \nu }\) is not well-defined for arbitrary \(U_i\)’s. We need first to impose:
On the open set of \(U_i\)’s obeying (130) \(\Omega _{\vec \nu }\) is not single-valued. We can, however, view it as an analytic function in the neighborhood \(F(W)^{\circ }\) of the point where, in some \({{\mathcal {G}}}\)-gauge, \({\pi }_{i} = {\pi }_{i}^{0}\) with the i-polyvector \({\pi }_{i}^{0}\) defined via:
To parametrize \(F(W)^{\circ }\), we use:
where \(a_{m} = m\) for \(m \ne k\) while \(a_{k} = i+1\), so that the vectors
form the unique basis in \(W_i = \mathrm{Im}\left( U_{N-1}U_{N-2} \cdots U_{i} \right) \), \( i = 1, \ldots , N-1\), obeying:
with \(e^{(N)}_{a} : = e_{a}\). Therefore, we have:
with \({{\mathbf{U }}}^{i | j}_{\ell }\) polynomial in \(u^{(m)}_{k}\), \(m \ge i\), nonzero only for \(1\le j \le N-i, 1 \le \ell \le i\). Explicitly,
Invoking (134) and the first equality of (135), we obtain the following analogue of (132):
Since the local coordinates \(u_{k}^{(i)}\) are \({\mathcal {G}}\)-invariant, the general solution to (124) can be written as:
with some analytic functions \({\psi }\). We amend the definition of \({{\mathcal {V}}}_{\vec \nu }\) given prior to Remark 4.2 by rather defining \({{\mathcal {V}}}_{\vec \nu }\) as the space of analytic functions \(\Psi \), obeying (124), such that the corresponding functions \(\psi \) (138) are polynomials in \(u^{(i)}_{k}\)’s. Using the equality (based on (137))
the generators \({\mathbf{J}}_{b}^{a}\) can be expressed as the first-order differential operators in \(u_{k}^{(i)}\):
with polynomial in \(u^{(i)}_{k}\)’s coefficients. In particular, the Cartan generators of \(\mathfrak {gl}_N\) act by:
hence, the Cartan generators of \(\mathfrak {sl}_N\) act by:
With the natural definition of the order on the weights, it is not difficult to show that the positive degree polynomials in \(u_{k}^{(i)}\)’s have higher weights than the vacuum, the state \({\psi } = 1\). According to (140), the generators \({{{\mathfrak {f}}}}_{i} = {\mathbf{J}}_{i+1}^{i}\) act by:
thus annihilating the vacuum, the state \({\psi }=1\), as they should. Likewise, according to (140), the generators \({{{\mathfrak {e}}}}_{i} = {\mathbf{J}}_{i}^{i+1}\) act by:
which generate the whole module, as we can see using \([{{{\mathfrak {e}}}}_{i}, {{{\mathfrak {e}}}}_{i+1}] = {\mathbf{J}}^{i+2}_{i}\), etc.
4.1.2 Highest weight module
For a generic \(\vec {{{\tilde{\nu }}}}\in {{\mathbb {C}}}^{N-1}\), the highest weight Verma \(\mathfrak {sl}_N\)-module \({{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}}\) is defined similarly, so we’d be brief. Algebraically, \({{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}}\) is generated by a vector \({{{\tilde{\Omega }}}}_{\vec {{{\tilde{\nu }}}}}\), obeying:
and:
Geometrically, \({{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}}\) can be realized in the space of analytic functions \({{{\tilde{\Psi }}}}\) of \(\left( {\tilde{U}}_{i} \right) _{i=1}^{N-1}\), obeying:
where \({{{\tilde{g}}}}_N\) is vacuous and \({{{\tilde{{\mathcal {G}}}}}}^{\mathrm{formal}}\) denotes the group of formal exponents \({{{\tilde{g}}}}_{i} = {\exp }\, h {{{\tilde{\xi }}}}_{i}\) with \({{{\tilde{\xi }}}}_{i} \in \mathrm{End}({\tilde{F}}_{i})\) and h being a nilpotent parameter. Again, we take:
which clearly satisfies (145, 146). Then, \({{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}}\) is realized in the space of analytic functions \({{{\tilde{\Psi }}}}\), obeying (147), of the form \({{{\tilde{\Psi }}}}[{{{\tilde{U}}}}_i]={{{\tilde{\psi }}}}[{\tilde{u}}_{(i)}^{k}]\cdot {{{\tilde{\Omega }}}}_{\vec {{{\tilde{\nu }}}}}\) with \({{{\tilde{\psi }}}}\) polynomial in the \(\tilde{{\mathcal {G}}}\)-invariant coordinates
on the open domain \(F(W^{*})^{\circ }\), where \({{{\tilde{\pi }}}}^{i}({\pi }^{0}_{i}) \ne 0\) for \(i=1,\ldots , N-1\).
Remark 4.3
The identification of the vector space of representation \({{\mathcal {V}}}_{\vec \nu }\) with the space of polynomials in \(u^{(i)}_{k}\)’s, and similarly for \({{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}}\), is known mathematically under the name of the Poincare–Birkhoff–Witt theorem [3, 45, 48] (apparently proven in the case of our interest by A. Capelli).
Remark 4.4
The genericity assumption on \(\vec \nu \in {{{\mathbb {C}}}}^{N-1}\) (resp. \({\vec {{{\tilde{\nu }}}}} \in {{{\mathbb {C}}}}^{N-1}\)) guarantees that the Verma \(\mathfrak {sl}_N\)-module \({{\mathcal {V}}}_{\vec \nu }\) (resp. \({{{\tilde{{\mathcal {V}}}}}}_{\tilde{\vec \nu }}\)) is irreducible, and thus is the unique lowest (resp. highest) weight module of the given lowest (resp. highest) weight, up to an isomorphism.
4.2 Twisted HW-modules
For generic \({\mathbf{n }} = (n_{1}, \ldots , n_{N}) \in {{{\mathbb {C}}}}^{N}\) and \({{{\tilde{\mathbf{n }}}}} = ({{{\tilde{n}}}}_{1}, \ldots , {{{\tilde{n}}}}_{N}) \in {{{\mathbb {C}}}}^{N}\), let us define the HW-modules \(H_{\mathbf{n }}\) and \({{{\tilde{H}}}}_{{{\tilde{\mathbf{n }}}}}\) of \(\mathfrak {gl}_N\) (for W. Heisenberg and H. Weyl) by making \({\mathbf{J}}_{a}^{b}\) act via the first-order differential operators in N complex variables. In other words, the generators of GL(N) in its defining N-dimensional representation W or its dual \(W^{*}\) act on the space of appropriately twisted functions on \(\mathrm{Hom}({\texttt {F}}, W)\) or \(\mathrm{Hom}(W, {{{\tilde{\texttt {F}}}}})\), where \({\texttt {F}} \approx {{{\mathbb {C}}}}\), \({{{\tilde{\texttt {F}}}}} \approx {{{\mathbb {C}}}}\) denote complex lines.
Explicitly, let \(\left( {\mathrm{z}}^{a} \right) _{a=1}^{N}\) and \(\left( {{\tilde{\mathrm{z}}}}_{a} \right) _{a=1}^{N}\) denote the coordinates on \(\mathrm{Hom}({\texttt {F}}, W)\) and \(\mathrm{Hom}(W, {{{\tilde{\texttt {F}}}}})\), respectively, in the dual bases \((e_{a})_{a=1}^{N}\), \(({{{\tilde{e}}}}^{a})_{a=1}^{N}\) of \(W,W^*\) we used in the previous section and in the dual bases \({\texttt {e}} \in {\texttt {F}}, {{{\tilde{\texttt {e}}}}} \in {\texttt {F}}^*\). Then, the underlying vector spaces \(H_{\mathbf{n }}\), \({{{\tilde{H}}}}_{{{\tilde{\mathbf{n }}}}}\) of the HW-modules are the spaces of homogeneous (i.e., degree zero) Laurent polynomials in \(\{{\mathrm{z}}^{a}\}, \{{{\tilde{\mathrm{z}}}}_{a}\}\), respectively:
while the generators of \(\mathfrak {gl}_N\) are represented by the following differential operators:
and
with
Remark 4.5
For \({{{\tilde{\mathbf{n }}}}}=(s,\ldots ,s)\), the module \({{{\tilde{H}}}}_{{{\tilde{\mathbf{n }}}}}\) coincides with \(V_s\) of [7, Sect. 1], as \(\mathfrak {sl}_N\)-modules.
In general, \({{{\tilde{H}}}}_{{{\tilde{\mathbf{n }}}}}\) is a twisted version of \(V_{({{{\tilde{n}}}}_1+\ldots +{{{\tilde{n}}}}_N)/N}\), with underlying vector spaces being isomorphic. We thus shall use the following notation:
Notation 4.4
For \(\mathrm{m}\in {{\mathbb {C}}}\) and \(\vec \mu \in {{{\mathbb {C}}}}^{N-1}\), define:
with
The action of \(\mathfrak {gl}_N\) on \({{\mathcal {H}}}_{\mathrm{m}}^{\vec \mu }\) is represented by the ordinary vector fields:
Notation 4.5
For \({{{\tilde{\mathrm{m}}}}} \in {{\mathbb {C}}}\) and \(\vec {{{\tilde{\mu }}}} \in {{{\mathbb {C}}}}^{N-1}\), define:
with
The action of \(\mathfrak {gl}_N\) on \({{{\tilde{{\mathcal {H}}}}}}_{{{\tilde{\mathrm{m}}}}}^{\vec {{{\tilde{\mu }}}}}\) is represented by the ordinary vector fields:
Remark 4.6
-
(a)
It is clear that the Casimirs \(c_k \left( {{\mathcal {H}}}_{\mathrm{m}}^{\vec \mu } \right) \) and \(c_k \left( {{{\tilde{{\mathcal {H}}}}}}_{{{\tilde{\mathrm{m}}}}}^{\vec {{{\tilde{\mu }}}}} \right) \), defined by (118), depend only on \(\mathrm{m}\) and \({{\tilde{\mathrm{m}}}}\), respectively.
-
(b)
The \(\mathfrak {gl}_N\)-weight subspaces, i.e., the joint eigenspaces of a commuting family \(\{{\mathbf{J}}_{a}^{a}\}_{a=1}^N\), of \({{\mathcal {H}}}_{\mathrm{m}}^{\vec \mu }\) and \({{{\tilde{{\mathcal {H}}}}}}_{{{\tilde{\mathrm{m}}}}}^{\vec {{{\tilde{\mu }}}}}\) are all one-dimensional, the corresponding sets of weights being \(-{\mathbf{n }}+{\Lambda }_{0}\subset {{\mathbb {C}}}^N\) and \({{{\tilde{\mathbf{n }}}}} + {\Lambda }_{0} \subset {{{\mathbb {C}}}}^{N}\), respectively, where \({\Lambda }_{0}\) denotes the lattice \({\Lambda }_{0}=\left\{ (r_1, \ldots , r_N)\in {{\mathbb {Z}}}^N\Big |\sum _{i=1}^N r_i = 0\right\} \).
-
(c)
The vectors \({\Omega }_{{{\mathcal {H}}}_{\mathrm{m}}^{\vec \mu }} := {\omega }_{\mathbf{n }} \in {{\mathcal {H}}}_{\mathrm{m}}^{\vec \mu }\), \({{{\tilde{\Omega }}}}_{{{{\tilde{{\mathcal {H}}}}}}_{{{\tilde{\mathrm{m}}}}}^{\vec {{{\tilde{\mu }}}}}} := {{{\tilde{\omega }}}}_{{{\tilde{\mathbf{n }}}}} \in {{{\tilde{{\mathcal {H}}}}}}_{{{\tilde{\mathrm{m}}}}}^{\vec {{{\tilde{\mu }}}}}\) have the following \(\mathfrak {sl}_N\)-weights:
$$\begin{aligned} {{{\mathfrak {h}}}}_{i} \cdot {\Omega }_{{{\mathcal {H}}}_{\mathrm{m}}^{\vec \mu }} = - {\mu }_{i} \cdot {\Omega }_{{{\mathcal {H}}}_{\mathrm{m}}^{\vec \mu }}\, , \qquad {{{\mathfrak {h}}}}_{i} \cdot {{{\tilde{\Omega }}}}_{{{{\tilde{{\mathcal {H}}}}}}_{{{\tilde{\mathrm{m}}}}}^{\vec {{{\tilde{\mu }}}}}} = {{{\tilde{\mu }}}}_{i} \cdot {{{\tilde{\Omega }}}}_{{{{\tilde{{\mathcal {H}}}}}}_{{{\tilde{\mathrm{m}}}}}^{\vec {{{\tilde{\mu }}}}}} \, , \qquad i=1,\dots ,N-1 \, . \end{aligned}$$(160)
4.3 Vermas and HW-modules in the \(N=2\) case
The generators \({{{\mathfrak {e}}}} \equiv {{{\mathfrak {e}}}}_{1}, {{{\mathfrak {f}}}} \equiv {{{\mathfrak {f}}}}_{1}, {{{\mathfrak {h}}}} \equiv {{{\mathfrak {h}}}}_{1}\) of \(\mathfrak {sl}_2\), see (119, 120), obey the standard relations:
For \(a,s \in {{{\mathbb {C}}}}\) and \(i\in \{-1,0,1\}\), consider the differential operators:
obeying the commutation relations:
The assignments
or
represent \(\mathfrak {sl}_2\) by the first-order differential operators on a line.
The modules we defined in the general N case can be described quite explicitly. Specifically, the highest/lowest weight Verma and the twisted HW \(\mathfrak {sl}_2\)-modules are all realized in the spaces of the twisted tensors:
with f(z) being a single-valued function of \(z \in {{{\mathbb {C}}}}^{\times }\), so that the operators (162) are the infinitesimal fractional linear transformations:
To make this relation precise, let us start with the geometric descriptions of the Verma modules.
In the geometric realization of the lowest weight Verma modules, we have a two-component vector
which is acted upon by the gauge \({{{\mathbb {C}}}}^{\times }\)-symmetry via \((u^1, u^2) \mapsto (t^{-1} u^{1}, t^{-1} u^{2})\). We look at the space of the locally defined functions \(\Psi =\Psi (u^{1}, u^{2})\) which transform with weight \(-\nu \) under the Lie algebra of the gauge \({{{\mathbb {C}}}}^{\times }\)-symmetry. More precisely, following (138) and the succeeding discussion, we look at \(\Psi \) of the form:
where \(\psi \) is a polynomial and \(z = u^2/u^1\) is the only coordinate \(u^{(1)}_{1}\) (132) in the present setting. One can perceive the right-hand side of (169) as the local section of a complex power of a line bundle \({\mathcal {O}}(1)\) over a neighborhood of \(z =0\) in \({{{\mathbb {C}}}{{\mathbb {P}}}}^{1}\), defined near the slice \(u^1 = 1\). The generators of \(\mathfrak {sl}_2\) act via:
where the differential operators in the middle act on \(\Psi \) while the rightmost ones act on \(\psi =\psi (z)\). The vacuum is:
corresponding to \(\psi =1\), and the lowest weight Verma module is:
The weight (eigenvalue of \({{\mathfrak {h}}}\)) of the state \(z^{n}\) is \(2n-\nu \). Note that the fractional linear transformation (167) transforms \((u^1, u^2) \mapsto (Cu^2 + Du^1, A u^2 + B u^1)\), hence it maps the vacuum to (again, we are working infinitesimally):
The formula (173) allows us to match:
Thus, the lowest weight Verma module \({{\mathcal {V}}}_{\nu }\) corresponds to the realization (165, 166) with:
and with polynomial f in (166).
In the geometric realization of the highest weight Verma modules, we have a two-component covector
which is acted upon by the gauge \({{\mathbb {C}}}^\times \)-symmetry via \((v_1, v_2) \mapsto (t v_1, tv_2 )\). We are looking at the space of locally defined functions \({{\tilde{\Psi }}} = {{\tilde{\Psi }}}(v_{1}, v_{2})\), which transform with weight \({{\tilde{\nu }}}\) under the Lie algebra of the gauge \({{{\mathbb {C}}}}^{\times }\)-symmetry. More precisely, following (148, 149), we look at \({{\tilde{\Psi }}}\) of the form:
where \({{{\tilde{\psi }}}}\) is a polynomial and \({{{\tilde{z}}}} = v_{2}/v_{1}\) is the only coordinate \({\tilde{u}}^{1}_{(1)}\) (149) in the present setting. The generators of \(\mathfrak {sl}_2\) act via:
where the differential operators in the middle act on \({{\tilde{\Psi }}}\), while the rightmost ones act on \({{\tilde{\psi }}}={{\tilde{\psi }}}({{{\tilde{z}}}})\). The vacuum is:
corresponding to \({{\tilde{\psi }}}=1\), and the highest weight Verma module is:
The weight of the state \({{{\tilde{z}}}}^{n}\) is \(- 2n + {{{\tilde{\nu }}}}\). Note that under the \(SL(2,{{{\mathbb {C}}}})\) fractional linear transformation (167) the covector \((v_1, v_2)\) transforms via \((v_1, v_2) \mapsto (-B v_2 + A v_1, D v_2 - C v_1)\) with \(AD-BC=1\), so that the pairing \({{{\tilde{U}}}}_1\cdot U_1 = v \cdot u \equiv u^1 v_1 + u^2 v_2\) is invariant, leading to:
Thus, the vacuum \({{\tilde{\Omega }}}_{{{\tilde{\nu }}}}\) is transformed via:
which allows us to match:
Hence, the highest weight Verma module \({{{\tilde{{\mathcal {V}}}}}}_{{{\tilde{\nu }}}}\) corresponds to the realization (164, 166) with:
and with polynomial f in (166).
We note that the transformations (167) and (181) are related via \({{{\tilde{z}}}}z = -1\), so that we get an equivalent representation (165, 166) with:
Finally, to describe the twisted HW-modules \(H_{\mathbf{n }}\), \({\tilde{H}}_{{{\tilde{\mathbf{n }}}}}\) with \({\mathbf{n }}= (n_{1}, n_{2})\), \({{{\tilde{\mathbf{n }}}}} = ({{{\tilde{n}}}}_{1}, {{{\tilde{n}}}}_{2})\), we recall the notation (153):
The vector space underlying \(H_{\mathbf{n }}\) is the space of Laurent polynomials \(\psi \) in \(z={\mathrm{z}}^{2}/{\mathrm{z}}^{1}\). Analogously, the vector space underlying \({{{\tilde{H}}}}_{{{\tilde{\mathbf{n }}}}}\) is the space of Laurent polynomials \({{\tilde{\psi }}}\) in \({\tilde{z}}={{\tilde{\mathrm{z}}}}_{2}/{{\tilde{\mathrm{z}}}}_{1}\).
In the first case, the generators of \(\mathfrak {sl}_2\) act via:
Thus, the twisted HW-module \(H_{\mathbf{n }} \sim {{\mathcal {H}}}_{2s}^{2(s+a)}\) corresponds to the realization (165, 166) with:
In the second case, analogously, the generators of \(\mathfrak {sl}_2\) act via:
Thus, the twisted HW-module \({\tilde{H}}_{{{\tilde{\mathbf{n }}}}} \sim {{{\tilde{{\mathcal {H}}}}}}_{2s}^{2(s+a)}\) corresponds to the realization (164, 166) with:
4.4 Tensor products and invariants
Let us recall the following \(SL(2,{{{\mathbb {C}}}})\)-invariants (under the fractional linear action) on the configurations of 2, 3, and 4 points on \({{{\mathbb {C}}}{{\mathbb {P}}}}^1\):
is an invariant \((1,0) \otimes (1,0)\)–form on \({{{\mathbb {C}}}{{\mathbb {P}}}}^1 \times {{{\mathbb {C}}}{{\mathbb {P}}}}^1\),
is an invariant \(0 \otimes 0 \otimes (1,0)\)–form on \({{{\mathbb {C}}}{{\mathbb {P}}}}^1 \times {{{\mathbb {C}}}{{\mathbb {P}}}}^1 \times {{{\mathbb {C}}}{{\mathbb {P}}}}^1\), and finally, the cross-ratio
is an invariant meromorphic function on \({{{\mathbb {C}}}{{\mathbb {P}}}}^1 \times {{{\mathbb {C}}}{{\mathbb {P}}}}^1 \times {{{\mathbb {C}}}{{\mathbb {P}}}}^1 \times {{{\mathbb {C}}}{{\mathbb {P}}}}^1\).
Thus,
is an \(\mathfrak {sl}_2\)-invariant element in the completed tensor product \({{\mathcal {V}}}_{\nu } {{{\hat{\otimes }}}} {{{\tilde{{\mathcal {V}}}}}}_{\nu }\). More precisely, we need to view (194) as a power series in \(z_1, {\tilde{z}}_2 = - z_{2}^{-1}\) in the domain \(z_1\rightarrow 0, z_2\rightarrow \infty \):
For another domain of convergence, e.g., \(z_1 \rightarrow \infty , z_2 \rightarrow 0\), the expression (194) would define an invariant in the completed tensor product \({{{\tilde{{\mathcal {V}}}}}}_{\nu } {{{\hat{\otimes }}}} {{\mathcal {V}}}_{\nu }\) instead:
Finally, invoking (171, 174, 179, 183), we can express \(I_{\nu }^{(2)}\) (194) in terms of \(U_1, {\tilde{U}}_1\) (168, 176):
The benefit of formula (197) is that it admits a natural generalization to the general N:
Remark 4.7
In coordinates, we have:
Remark 4.8
The formula (198) determines the unique \(\mathfrak {gl}_N\)-invariant bilinear pairing:
such that
One can present \((\cdot ,\cdot )_{\vec \nu }\) as an integral over F(W), but the quicker way is the following: the matrix \(G_{\vec n, {\vec {{{\tilde{n}}}}}}\) inverse to
is given by the coefficients of the expansion
Let us now similarly produce an \(\mathfrak {sl}_2\)-invariant in the completed tensor product of three \(\mathfrak {sl}_2\)-representations: the lowest weight and the highest weight Vermas, as well as the twisted HW-module. To this end, we consider:
By invoking (175, 185, 188) and expanding (204) in the region \(|z_1|\ll |z_2| \ll |z_3|\), we arrive at the following interpretation:
Finally, in the \(\left( u^1,u^2\right) , \left( {\mathrm{z}}^1,{\mathrm{z}}^2\right) , \left( v_1,v_2\right) \)-realizations, this invariant takes the following form:
with
where we matched \(z_1 \sim z,\, z_2 \sim {\mathrm{z}}^2/{\mathrm{z}}^{1},\, z_3 \sim -1/{{{\tilde{z}}}}\). We note that the last two factors in (206) are \(\mathfrak {gl}_2\)-invariant, while the first one is only \(\mathfrak {sl}_2\)-invariant.
The formula (206) admits a natural generalization to the general N, with the triple \({\nu }_{1}, {\nu }_{2}, {\nu }_{3}\) being replaced with \({\vec \nu }_{1}, {\vec \nu }_{3} \in {{{\mathbb {C}}}}^{N-1},\, {\nu }_2 \in {{{\mathbb {C}}}}\). In this case, we have a unique invariant (cf. (68)):
where the vector \({\mathbf{n }} = (n_{1}, \ldots , n_{N}) \in {{{\mathbb {C}}}}^{N}\) is determined from
and
Similarly to the \(N=2\) case, the factor \({{{\tilde{\pi }}}}^{N} \left( {\pi }_{N-1} \wedge {\mathbf{z }}\right) ^{n_N}\) is only \(\mathfrak {sl}_N\)-invariant, while all other factors in (208) are naturally \(\mathfrak {gl}_N\)-invariant.
Another generalization of (206) is the invariant
where the vector \({{{\tilde{\mathbf{n }}}}} = ({{{\tilde{n}}}}_{1}, \ldots , {\tilde{n}}_{N}) \in {{{\mathbb {C}}}}^{N}\) is determined from
and
Remark 4.9
The examples (208, 211) demonstrate the need for twists in the definition of the HW-modules in Sect. 4.2.
To prove that \(I^{(2)}\) of (198), \(I^{(3)}\) of (208), and \({{{\tilde{I}}}}^{(3)}\) of (211) are the only invariants in the corresponding (completed) tensor products of 2 and 3 modules of \(\mathfrak {sl}_N\), see Corollary 4.9, let us recall the realization of the corresponding spaces of invariants as the weight subspaces.
Notation 4.6
For an \(\mathfrak {sl}_N\)-module W and \({\vec \lambda } \in {{{\mathbb {C}}}}^{N-1}\), we denote by \(W [ {\vec \lambda } ]\) the weight \(\vec \lambda \) subspace:
Remark 4.10
We have (cf. Remark 4.6):
To Verma modules \({{\mathcal {V}}}_{\vec \nu }, {{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}}\) defined in Sects. 4.1.1 and 4.1.2, we associate the restricted dual modules \({{\mathcal {V}}}_{\vec \nu }^{*}, {{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}}^{*}\). These are defined as the submodules of \(\mathrm{Hom}_{{{\mathbb {C}}}}({{\mathcal {V}}}_{\vec \nu }\, , \, {{{\mathbb {C}}}})\), \(\mathrm{Hom}_{{{\mathbb {C}}}}({{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}}\, , \, {{{\mathbb {C}}}})\), respectively, whose underlying vector spaces are direct sums of the spaces, dual to the \(\mathfrak {sl}_N\)-weight subspaces of \({{\mathcal {V}}}_{\vec \nu }, {{{\tilde{{\mathcal {V}}}}}}_{\tilde{\vec \nu }}\). The following is well known:
Lemma 4.7
If \({{\mathcal {V}}}_{\vec \nu }\) (resp. \({{\tilde{{\mathcal {V}}}}}_{\vec {{{\tilde{\nu }}}}}\)) is an irreducible \(\mathfrak {sl}_N\)-module, then \({{\mathcal {V}}}_{\vec \nu }^*\simeq {{{\tilde{{\mathcal {V}}}}}}_{\vec \nu }\) (resp. \({{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}}^{*} \simeq {{\mathcal {V}}}_{\vec {{{\tilde{\nu }}}}}\)).
For any \(\mathfrak {sl}_N\)-module W, we define the completed tensor products \({{\mathcal {V}}}_{\vec \nu } {\hat{\otimes }} W\) and \({{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}} {\hat{\otimes }} W\) via:
both of which have natural structure of \(\mathfrak {sl}_N\)-modules.
Now we are ready to invoke the standard interpretation of the space of \(\mathfrak {sl}_N\)-invariants in the tensor product, completed in the sense of (216), of \(\mathfrak {sl}_N\)-modules involving both the highest weight and the lowest weight Verma modules (cf. the proof of [7, Proposition 1.1]):
Lemma 4.8
If the lowest weight Verma \({{\mathcal {V}}}_{\vec \nu }\) and the highest weight Verma \({{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}}\) modules of \(\mathfrak {sl}_N\) are irreducible, then the space of \(\mathfrak {sl}_N\)-invariants in \({{\mathcal {V}}}_{\vec \nu } {\hat{\otimes }} W {\hat{\otimes }} {{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}}\) can be described as follows:
Proof
This follows from the following sequence of canonical identifications:
by using the conventions (216), Lemma 4.7, and Frobenius reciprocity. \(\square \)
Remark 4.11
Putting together the identifications (218), we see that the resulting vector space isomorphism
is obtained by pairing an element of \(\left( {{\mathcal {V}}}_{\vec \nu } {\hat{\otimes }} W {\hat{\otimes }} {{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}} \right) ^{\mathfrak {gl}_N}\) with \({\tilde{\Omega }}_{{\vec \nu }} \otimes {\Omega }_{{\vec {{{\tilde{\nu }}}}}} \in {\tilde{{\mathcal {V}}}}_{{\vec \nu }} \otimes {{\mathcal {V}}}_{{\vec {{{\tilde{\nu }}}}}}\) with respect to \((\cdot ,\cdot )_{\vec {\nu }}\) and \((\cdot ,\cdot )_{{\vec {{{\tilde{\nu }}}}}}\) in the first and third tensor factors, cf. Remark 4.8 and Lemma 4.7.
Applying Lemma 4.8 to the trivial and the twisted HW-modules of \(\mathfrak {sl}_N\), we obtain:
Corollary 4.9
-
(a)
For the trivial \(\mathfrak {sl}_N\)-module \(W={{\mathbb {C}}}\), the space of invariants \(\left( {{\mathcal {V}}}_{\vec {\nu }_1} {\hat{\otimes }} {{{\tilde{{\mathcal {V}}}}}}_{\vec {\nu }_2} \right) ^{\mathfrak {sl}_N}\) vanishes if \(\vec {\nu }_1\ne \vec {\nu }_2\), and is one-dimensional (hence, is spanned by \(I^{(2)}_{\vec {\nu }_1}\) of (198)) if \(\vec {\nu }_1=\vec {\nu }_2\).
-
(b)
For the twisted HW-modules \(W={{\mathcal {H}}}_{\nu _2}^{{\vec \mu }}, {{\tilde{{\mathcal {H}}}}}_{\nu _2}^{{\vec {{{\tilde{\mu }}}}}}\), the spaces of invariants \(\left( {{\mathcal {V}}}_{\vec \nu _1} {{{\hat{\otimes }}}} {{\mathcal {H}}}_{\nu _2}^{\vec \mu } {{{\hat{\otimes }}}} {{{\tilde{{\mathcal {V}}}}}}_{\vec \nu _3} \right) ^{\mathfrak {sl}_N}\) and \(\left( {{\mathcal {V}}}_{\vec \nu _1} {{{\hat{\otimes }}}} {{\tilde{{\mathcal {H}}}}}_{\nu _2}^{\vec {{{\tilde{\mu }}}}} {{{\hat{\otimes }}}} {{{\tilde{{\mathcal {V}}}}}}_{\vec \nu _3} \right) ^{\mathfrak {sl}_N}\) are at most one-dimensional, and they vanish if \(\vec \mu +\vec {\nu }_1-\vec {\nu }_3\notin {{\mathbb {Z}}}^{N-1}\), \(\vec {{{\tilde{\mu }}}}+\vec {\nu }_3-\vec {\nu }_1\notin {{\mathbb {Z}}}^{N-1}\), respectively. In particular, the invariants \(I^{(3)}_{{\vec \nu }_{1}, {\nu }_2, {\vec \nu }_{3}}\in \left( {{\mathcal {V}}}_{\vec \nu _1} {{{\hat{\otimes }}}} {{\mathcal {H}}}_{\nu _2}^{{\vec \nu }_{3}-{\vec \nu }_{1}} {{{\hat{\otimes }}}} {{{\tilde{{\mathcal {V}}}}}}_{\vec \nu _3} \right) ^{\mathfrak {sl}_N}\) and \({\tilde{I}}^{(3)}_{{\vec \nu }_{1}, {\nu }_2, {\vec \nu }_{3}}\in \left( {{\mathcal {V}}}_{\vec \nu _1} {{{\hat{\otimes }}}} {{\tilde{{\mathcal {H}}}}}_{\nu _2}^{{\vec \nu }_{1}-{\vec \nu }_{3}} {{{\hat{\otimes }}}} {{{\tilde{{\mathcal {V}}}}}}_{\vec \nu _3} \right) ^{\mathfrak {sl}_N}\) of (208) and (211) are unique, up to scalar multipliers.
4.5 Our quartet
We are now finally ready to relate (89, 114) to the invariants in the completed tensor products of four \(\mathfrak {sl}_N\)-modules: the two Vermas and the two twisted HW-modules.
Let us fix \(\vec \nu , {\vec {{{\tilde{\nu }}}}}, {\vec \gamma } \in {{{\mathbb {C}}}}^{N-1}\), and \({\mathrm{m}}, {{{\tilde{\mathrm{m}}}}} \in {{{\mathbb {C}}}}\). Let us specify four \(\mathfrak {sl}_N\)-representations as follows:
We shall work with the completion
so defined (cf. (216)) that it contains the power series expansion in \(u^{(i)}_{k}, {{{\tilde{u}}}}_{(i)}^{k}, {\mathrm{z}}^a, {{\tilde{\mathrm{z}}}}_a\) of \(\Upsilon \) given by (89).
Let us now apply Lemma 4.8 to the case \(W=V_2\otimes V_3\). Noticing that
with the \(\mathfrak {sl}_N\)-action (151, 152) twisted by the factors (153), we get the following identification:
where the variables \(\eta _i\)’s are defined via:
The above vector space isomorphism \({{{\mathbb {C}}}}\left[ \eta _1^{{\pm } 1},\ldots , \eta _{N-1}^{{\pm } 1} \right] {{\mathop {\longrightarrow }\limits ^{\sim }}}\left( V_1 {\hat{\otimes }} V_2 {\hat{\otimes }} V_3 {\hat{\otimes }} V_4 \right) ^{\mathfrak {sl}_N}\) is constructive. Explicitly, given \(\vec {r}=(r_{1},\ldots ,r_{N-1})\in {{\mathbb {Z}}}^{N-1}\), define the \(\mathfrak {sl}_N\)-weight \(\vec {\delta }=(\delta _1,\ldots ,\delta _{N-1})\in {{\mathbb {Z}}}^{N-1}\) via \(\delta _i=r_{i-1}-2r_i+r_{i+1}\) with \(r_0=r_N=0\). According to Lemma 4.8, the spaces of invariants \(\left( {{\mathcal {V}}}_{\vec \nu } {{{\hat{\otimes }}}} {{\mathcal {H}}}_{\mathrm{m}}^{\vec \gamma -\vec \nu } {{{\hat{\otimes }}}} {{{\tilde{{\mathcal {V}}}}}}_{\vec \gamma +\vec \delta } \right) ^{\mathfrak {sl}_N}\) and \(\left( {{\mathcal {V}}}_{\vec \gamma +\vec \delta }\, {{{\hat{\otimes }}}} {{\tilde{{\mathcal {H}}}}}_{{{\tilde{\mathrm{m}}}}}^{\vec \gamma -\vec {{{\tilde{\nu }}}}} {{{\hat{\otimes }}}} {{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}} \right) ^{\mathfrak {sl}_N}\) are one-dimensional (for \(\vec {r}=\vec {0}\), they are spanned by \(I^{(3)}_{\vec {\nu },{\mathrm{m}},\vec {\gamma }}\) and \({\tilde{I}}^{(3)}_{\vec {\gamma },{{{\tilde{\mathrm{m}}}}},\vec {{{\tilde{\nu }}}}}\)). Equivalently, there are unique \(\mathfrak {sl}_N\)-module homomorphisms:
such that
cf. (153–155, 157, 158), where we used Lemma 4.7 and the pairing \((\cdot ,\cdot )_{\vec \nu }, (\cdot ,\cdot )_{\vec {{{\tilde{\nu }}}}}\) of Remark 4.8 on the first and second components, respectively. Hence, we get an \(\mathfrak {sl}_N\)-module homomorphism:
Invoking the \(\mathfrak {sl}_N\)-invariant \(I^{(2)}_{\vec \gamma +\vec \delta }\in \left( {{\mathcal {V}}}_{\vec \gamma +\vec \delta }\, {\hat{\otimes }} {{\tilde{{\mathcal {V}}}}}_{\vec \gamma +\vec \delta } \right) ^{\mathfrak {sl}_N}\), we obtain the sought-after \(\mathfrak {sl}_N\)-invariant
which exactly corresponds to \(\eta _1^{r_1}\eta _2^{r_2}\cdots \eta _{N-1}^{r_{N-1}}\) under the identification (222).
Remark 4.12
The realization (222) corresponds to the family (over \({\mathfrak {q}}\)) of maps
which consists, in detail, of restricting to \({\pi }_{i} \rightarrow {\pi }_{i}^{0}\) (131), \({{{\tilde{\pi }}}}^{i} \rightarrow {{{\tilde{\pi }}}}^{i}_{0}\) (126), and dropping the factor
5 Knizhnik–Zamolodchikov equations
5.1 KZ equations
Let us recall the notion of Knizhnik–Zamolodchikov (KZ) equations [22] associated with the following data:
-
(a)
\({\mathfrak {g}}\) –a semisimple Lie algebra,
-
(b)
t –a non-degenerate ad-invariant bilinear form on \({\mathfrak {g}}\), that is:
$$\begin{aligned} t([a,b],c) = t(a, [b,c]) \quad \text {for\ any} \ a,b,c \in {\mathfrak {g}}\, , \end{aligned}$$ -
(c)
\(V_1,\ldots ,V_n\) –representations of \({\mathfrak {g}}\),
-
(d)
\(\kappa \in {{\mathbb {C}}}^\times \)–a nonzero constant.
Define the Casimir tensor \({{{\hat{C}}}}\in {\mathfrak {g}}\otimes {\mathfrak {g}}\) and the Casimir element \(\text {Cas}\in U({\mathfrak {g}})\) via:
and
where \(\{X_A\}_{A\in I}\) is a basis of \({\mathfrak {g}}\), \(\Vert t^{AB} \Vert \) is the matrix inverse to \(\Vert t(X_{A}, X_{B}) \Vert \).
Define the configuration space \(\Sigma _n\subset {{\mathbb {C}}}^n\) via:
A function \(F:\Sigma _n\rightarrow V_1\otimes \dots \otimes V_n\) is said to satisfy the KZ equations [22] if:
where \({{{\hat{C}}}}_{ij}\) denotesFootnote 2 the action of \({{\hat{C}}}\) (230) on the i-th and j-th factors of \(V_1\otimes \dots \otimes V_n\).
Remark 5.1
Note that the KZ equations essentially depend only on the ad-invariant form \(\frac{t}{\kappa }\).
5.2 \({\mathfrak {g}}\)-invariance and \(n=4\) case
A function \(F:\Sigma _n\rightarrow V_1\otimes \dots \otimes V_n\) is called \({\mathfrak {g}}\)-invariant if:
Let \(n=4\). Recall the cross-ratio (193) of 4 points, which can be thought of as a map:
This map can be naturally extended to a map \({\bar{\pi }}:{\bar{\Sigma }}_4 \rightarrow {\mathbb {CP}}^{1}\), where \({\bar{\Sigma }}_4\subset (\mathbb {CP}^1)^4\) is the locus of points with pairwise distinct coordinates. The map \({\bar{\pi }}\) is the quotient map for the natural free action of \(H = SL(2,{{{\mathbb {C}}}})\) on \({\bar{\Sigma }}_4\) (the diagonal action by the fractional linear transformations). In particular, for any \(\mathrm{p}\in \Sigma _4\) the points \(\mathrm{p}=(p_1,p_2,p_3,p_4)\) and \((0, {{\mathfrak {q}}} = [p_1,p_2;p_3,p_4], 1, \infty )\) of \({\bar{\Sigma }}_4\) lie in the same H-orbit. Naturally the four KZ equations (233) on a \({\mathfrak {g}}\)-invariant function F reduce to a single equation on a \((V_1\otimes V_2 \otimes V_3 \otimes V_4)^{\mathfrak {g}}\)-valued function of \({\mathfrak {q}}\):
Proposition 5.1
Assume that the Casimir element \(\text {Cas}\) (231) acts on \(V_i\) as a multiplication by \({\Delta }_{i}\in {{{\mathbb {C}}}}\) for any \(1\le i\le 4\). Choose constants \(\{\mathrm{{d}}_{ij}\, \vert \, 1\le i\ne j\le 4\}\) so that \(\mathrm{{d}}_{ij}=\mathrm{{d}}_{ij}\) and \(\sum _{j\ne i} \mathrm{{d}}_{ij} = \Delta _i\) for any \(1\le i\le 4\).Footnote 3 Then, \(F:\Sigma _4\rightarrow (V_1\otimes V_2\otimes V_3\otimes V_4)^{\mathfrak {g}}\) satisfies all four KZ equations (233) if and only if Footnote 4
with \(\Phi :{{{\mathbb {C}}}}^\times \backslash \{1\}\rightarrow (V_1\otimes V_2\otimes V_3\otimes V_4)^{\mathfrak {g}}\) satisfying the following equation:
The proof of this result is elementary.
5.3 Our KZ setup
Let us now apply the above discussion to \({\mathfrak {g}}=\mathfrak {sl}_N\) endowed with an ad-invariant bilinear form \(t(a,b)=\text {tr}_{{{{\mathbb {C}}}}^{N}}(ab)\), and the \(n=4\) modules \(V_i\ (1\le i\le 4)\) as in (220):
According to Lemma 4.8 and the identification (222), we have:
with \(\eta _i\)’s defined in (223). Hence, functions F and \(\Phi \) of Proposition 5.1 can be thought of as:
Our next goal is to rewrite Eq. (236) on \(\Phi \) as a differential equation in \({{\mathfrak {q}}},\eta _1,\ldots ,\eta _{N-1}\).
5.4 The differential operator \({\widehat{H}}^{\text {KZ}}\)
Choose the basis \(\{X_A\}\) of \({\mathfrak {g}}=\mathfrak {sl}_N\) as follows:
Then, the Casimir tensor (230) has the following form:
where \(\Vert C^{ij} \Vert \) is the matrix inverse to the Cartan matrix \(\Vert (2\delta _{i}^{j}-\delta _{i}^{j+1}-\delta _i^{j-1} ) \Vert \) of \(\mathfrak {sl}_N\). To simplify the calculations, it is convenient to consider a natural embedding \(\iota :\mathfrak {sl}_N\hookrightarrow \mathfrak {gl}_N\), so that:
where \({{\mathcal {C}}}_1=\sum _{a=1}^N {\mathbf{J}}_{a}^a\in \mathfrak {gl}_N\) is the first Casimir operator (117). Similarly, the image of the Casimir element \(\text {Cas}\) (231) under the induced embedding \(\iota :U(\mathfrak {sl}_N)\hookrightarrow U(\mathfrak {gl}_N)\) is given by:
Define
The operators
coincide with \({{{\hat{h}}}}^{\mathrm{cft}}_{0}, {{{\hat{h}}}}^{\mathrm{cft}}_{1}\) of (88), respectively, which in turn coincide with \({{{\hat{h}}}}^{\mathrm{bps}}_{0}, {{{\hat{h}}}}^{\mathrm{bps}}_{1}\) of (56), according to Theorem 3.1. This concludes the proof of our main result: the vacuum expectation value \(\langle \, {{\mathcal {S}}} \, \rangle \) of the surface defect obeys the Knizhnik–Zamolodchikov equation [22], specifically the equation obeyed by the \(\left( {\widehat{\mathfrak {sl}}}_{N} \right) _{k}\) current algebra conformal block
with the vertex operators at 0 and \(\infty \) corresponding to the generic lowest weight \({{\mathcal {V}}}_{\vec \nu }\) and highest weight \({{\tilde{{\mathcal {V}}}}}_{\vec {{{\tilde{\nu }}}}}\) Verma modules, while the vertex operators at \({\mathfrak {q}}\) and 1 correspond to the twisted HW-modules \({{\mathcal {H}}}_{\mathrm{m}}^{\vec \mu } \ \text {and} \ {{\tilde{{\mathcal {H}}}}}_{{\tilde{\mathrm{m}}}}^{\vec {{{\tilde{\mu }}}}}\).
6 Conclusions and further directions
In this paper, we established that the vacuum expectation value of the regular surface defect in SU(N) gauge theory in four dimensions with \({{\mathcal {N}}}=2\) supersymmetry, with 2N fundamental hypermultiplets, obeys the analytical continuation of Knizhnik–Zamolodchikov equation for the four-point conformal block \(\left\langle V_1 V_2 V_3 V_4 \right\rangle \) of the two-dimensional \(\mathfrak {sl}_N\) current algebra at the level
The surprising feature we discovered is the need to twist the irreducible representations corresponding to the middle vertex operators \(V_2\) and \(V_3\).
Our result has been anticipated for many years, see [29, 30]. In particular, in the specific limit \(m_{i} \rightarrow \infty \), \({{\mathfrak {q}}} \rightarrow 0\), with
the equation (56) becomes the non-stationary version of the periodic Toda equation:
where
It was shown in [9] that the equation (246) is obeyed by the J-function of the affine flag variety, which in [29, 30] was interpreted as the vev of the surface defect in the pure \({{\mathcal {N}}}=2\) super-Yang–Mills theory with SU(N) gauge group. However, the method of [9] does not generalize to the theories with matter. In [32,33,34,35,36,37] the equations, obeyed by the surface defects of certain quiver gauge theories, were derived.
In the limit \({\varepsilon }_1 \rightarrow 0\) and/or \({\varepsilon }_2 \rightarrow 0\), the differential operator (56) becomes the equation describing certain Lagrangian submanifolds in the complex symplectic manifolds, which are related to the moduli spaces [46] of vacua of the four-dimensional gauge theory we started with, compactified on a circle. These moduli spaces can be also identified with the moduli space of solutions of some partial differential equations, describing monopoles and instantons in some auxiliary gauge theory [11, 44].
In this paper, we studied the simplest case of the asymptotically conformal \({{\mathcal {N}}}=2\) gauge theory, corresponding to the \(A_1\)-type quiver. There exist various quiver generalizations, whose Seiberg–Witten geometry can be exactly computed [39]. The orbifold surface defects of the \(A_r\)-generalizations conjecturally obey the KZ equations corresponding to the \(r+3\)-point conformal blocks of the \(\widehat{\mathfrak {su}(N)}_{k}\) current algebra, with two Verma modules and \(r+1\) twisted HW-modules. One can also study the intersecting surface defects. For example, in the companion paper [17] a 5-point conformal block corresponding to the infinite-dimensional modules \({{\mathcal {V}}}_{\vec \nu }, {{\mathcal {H}}}^{\vec \mu }_{\mathrm{m}}, {{{\tilde{{\mathcal {H}}}}}}^{\vec {{{\tilde{\mu }}}}}_{{{\tilde{\mathrm{m}}}}}, {{{\tilde{{\mathcal {V}}}}}}_{\vec {{{\tilde{\nu }}}}}\), and the N-dimensional standard representation is associated with the intersecting surface defect of the orbifold type studied in this paper, and the orthogonal surface defect corresponding to the Q-observable of gauge theory [32,33,34,35,36,37, 40]
Perhaps the most interesting continuation of our work would be a translation of the connection between the conformal blocks of two-dimensional current algebra \(\left( {\widehat{\mathfrak {sl}_N}}\right) _{k}\) to the surface defect partition function of four-dimensional gauge theory that we firmly established, to the \(A_{N-1}\) (0, 2)-theory in six dimensions.
For integral level k and the weights \(\vec \nu , \vec {{{\tilde{\nu }}}}, {\mathrm{m}}, {{{\tilde{\mathrm{m}}}}}\) the current algebra conformal blocks have a familiar Chern–Simons interpretation. It can be represented as the path integral in the SU(N) gauge theory on a three-ball \(B^3\) with the action
with the gauge fields having a curvature singularity along an embedded graph \(\Gamma \), as in Fig. 1. The edges of the graph are labelled by the conjugacy classes of the monodromy of connection around the small loop linking the edge. We need an extension, or an analytic continuation, to the case of complex levels and weights. The paper [50] offers such a continuation for the Chern–Simons level. The analytic continuation of Chern–Simons theory in the representation parameters of Wilson and ’t Hooft lines is not yet available, but our results strongly suggest it should be possible. We are familiar with the Wilson line operators \(W_{R}(C)\), associated with the representation of the gauge group G and its representation R,
More generally, a tri-valent orientation graph \(\Gamma \), with oriented edges e labelled by representations \(R_{e}\), with the understanding that the change of the orientation flips the representation \(R_{{{\bar{e}}}} = R_{e}^{*}\), and vertices labelled by the invariants
with the edges \(e_1, e_2, e_3\) coming out of the vertex v, corresponds to the Wilson graph observable
where l labels the loops, i.e., the edges with coinciding ends.
In the case the graph has tails, i.e., 1-valent vertices, which are placed at the boundary \({\partial } B\), the path integral takes values in the Hilbert space obtained by quantizing the moduli space of flat G-connections on \({\Sigma }^2 = {\partial }B^3\) with singularities at the end-points, with fixed conjugacy classes of monodromies around those. In the case of \(B^3, {\Sigma }^2 \approx S^2\) this Hilbert space is isomorphic to the space of invariants in the tensor product of representations attached to the edges ending at the tails. For the graph \(\Gamma \) in Fig. 1, this would be
Having the invariants \(I_{1} \in \left( R_{1} \otimes R_{2} \otimes R \right) ^{G}\), \(I_{2} \in \left( R^{*} \otimes R_{3} \otimes R_{4} \right) ^{G}\) at the two internal vertices of \(\Gamma \) identifies the conformal block with the channel of the tensor product decomposition (252) corresponding to the intermediate representation \(R \in R_{3} \otimes R_{4}\), \(R^{*} \in R_{1} \otimes R_{2}\).
All this, to a limited extent, generalizes to the infinite-dimensional \({\mathfrak {g}}\)-representations, although the expression (251) does not literally make sense. Nevertheless, the form
of our basic invariant \(\Upsilon \) (89), and moreover, the \({{\mathfrak {q}}} \rightarrow 0\) asymptotics of the surface defect partition function (44), which can be analyzed [23] rather explicitly, are suggestive of some sort of three-dimensional interpretation with the graph \(\Gamma \), with some intermediate \(\mathfrak {sl}_N\)-module with the highest/lowest/middle weight \({\vec \gamma }\).
It does not seem to be possible to analytically continue (251) as a line operator in the analytically continued Chern–Simons theory, as in [50]. However, it might be possible to analytically continue the S-dual ’t Hooft operator, as a surface defect in the topologically twisted \({{\mathcal {N}}}=4\) theory on a four-dimensional manifold with corners, which locally looks like \(B^3 \times I\).
On the other hand, the surface defect in four dimensions can be related [42] to boundary conditions in the two-dimensional sigma model valued in the moduli space of vacua of the theory, compactified on a circle, which in the present case is believed to be the moduli space \({{\mathcal {M}}}_{N} \left( S^{2} \backslash 4\, \mathrm{pts} \, ; {\vec \nu }, {\mathrm{m}}, {{{\tilde{\mathrm{m}}}}}, {\vec {{{\tilde{\nu }}}}} \right) \) of SU(N) Higgs pairs on a 4-punctured sphere with the regular punctures at 0 and \(\infty \), and the minimal punctures at \({\mathfrak {q}}\) and 1, see Fig. 2. The homotopy between these two representatives of a cohomology class of an intrinsic operator in the six-dimensional theory proceeds by viewing the two-dimensional sigma model, with the worldsheet C as a long distance limit of the four-dimensional \({{\mathcal {N}}}=2\) \(\Omega \)-deformed theory compactified on a two-torus \(T^2\) as in [42], which, in turn, is a limit of the \(A_{N-1}\) (0, 2)-theory compactified on \(\left( S^{2} \backslash 4\, \mathrm{pts} \right) \times T^2\), which, finally, can be reinterpreted, as the \({{\mathcal {N}}}=4\) theory on \(C \times \left( S^{2} \backslash 4\, \mathrm{pts} \right) \). As in [42], the canonical parameter [19] \(\Psi \) (not to be confused with the vev of our surface defect) is identified with the ratio \(\kappa \) of the \(\Omega \)-deformation parameters. With C having the topology of the corner \({{{\mathbb {R}}}}_{+}^{2}\), as in Fig. 2, the \({{\mathcal {N}}}=4\) theory on \(C \times \left( S^{2} \backslash 4\, \mathrm{pts} \right) \) looks very much like a gradient flow theory of the analytically continued Chern–Simons theory on \({{{\mathbb {R}}}}_{+} \times \left( S^{2} \backslash 4\, \mathrm{pts} \right) \), with certain boundary conditions. We plan to discuss this duality in detail elsewhere.
Notes
One can show that this power series converges when all \(|{{\mathfrak {q}}}_{\omega }|< 1\), uniformly on compact sets in the complex domain \({{{\tilde{a}}}}_{b} - {{{\tilde{a}}}}_{c} + i{{\varepsilon }}_{1} + j{\tilde{\varepsilon }}_{2} \ne 0\) for all \(i,j \ge 1\).
A more pedantic notation would be:
$$\begin{aligned} {{{\hat{C}}}}_{ij} \, = \sum _{A,B \in I} \, t^{AB} \, \mathbf{1}_{V_{1}} \otimes \dots \otimes T_{V_{i}}(X_{A}) \otimes \dots \otimes T_{V_{j}} (X_{B}) \otimes \dots \otimes \mathbf{1}_{V_{n}}\, . \end{aligned}$$Such \(\{{\textsf {d}}_{ij}\}\) exist and are unique for an arbitrary choice of \({\textsf {d}}_{12}\) and \({\textsf {d}}_{13}\).
On any simply connected region in \(\left( {\mathbb {CP}}^{1} \right) ^{4} \backslash \, \{\mathrm{diagonals}\}\).
References
Alday, L., Gaiotto, D., Tachikawa, Y.: Liouville correlation functions from four-dimensional gauge theories. Lett. Math. Phys. 91(2), 167–197 (2010). https://doi.org/10.1007/s11005-010-0369-5. [arXiv:0906.3219 [hep-th]]
Alday, L., Tachikawa, Y.: Affine \(SL(2)\) conformal blocks from 4d gauge theories. Lett. Math. Phys. 94(1), 87–114 (2010). https://doi.org/10.1007/s11005-010-0422-4. [arXiv:1005.4469 [hep-th]]
Birkhoff, G.D.: Representability of Lie algebras and Lie groups by matrices. Ann. Math. 38(2), 526–532 (1937)
Bullimore, M., Kim, H., Koroteev, P.: Defects and quantum Seiberg-Witten geometry. JHEP 05, 095 (2015). https://doi.org/10.1007/JHEP05(2015)095. [arXiv:1412.6081 [hep-th]]
Drukker, N., Gomis, J., Okuda, T., Teschner, J.: Gauge theory loop operators and Liouville theory. JHEP 1002, 057 (2010). https://doi.org/10.1007/JHEP02(2010)057. [arXiv:0909.1105 [hep-th]]
Enriquez, B., Nekrasov, N.: unpublished (1992)
Etingof, P.: Quantum integrable systems and representations of Lie algebras. J. Math. Phys. 36(6), 2636–2651 (1995). [arXiv:hep-th/9311132]
Feigin, B., Frenkel, E., Reshetikhin, N.: Gaudin model, Bethe ansatz and critical level. Commun. Math. Phys. 166(1), 27–62 (1994). https://doi.org/10.1007/BF02099300. [arXiv:hep-th/9402022]
Felder, G., Müller-Lennert, M.: Analyticity of Nekrasov partition functions. Commun. Math. Phys. 364(2), 683–718 (2018). https://doi.org/10.1007/s00220-018-3270-1. [arXiv:1709.05232 [math]]
Frenkel, E., Gukov, S., Teschner, J.: Surface operators and separation of variables. JHEP 1601(1), 179 (2016). https://doi.org/10.1007/JHEP01(2016)179. [arXiv:1506.07508 [hep-th]]
Gaiotto, D.: N=2 dualities. JHEP 1208, 034 (2012). https://doi.org/10.1007/JHEP08(2012)034. [arXiv:0904.2715 [hep-th]
Gerasimov, A.: In the case of the WZWN theory with the simple real Lie group \(G\) it can be understood using the free field representations, unpublished remarks (1991)
Gukov, S., Witten, E.: Branes and quantization. Adv. Theor. Math. Phys. 13(5), 1445–1518 (2009). https://doi.org/10.4310/ATMP.2009.v13.n5.a5. [arXiv:0809.0305 [hep-th]]
Haouzi, N., Kozçaz, C.: Supersymmetric Wilson Loops, Instantons, and Deformed \(W\)-Algebras. [arXiv:1907.0383] [hep-th]]
Haouzi, N., Oh, J.: On the Quantization of Seiberg-Witten Geometry. JHEP. 1, (2021). https://doi.org/10.1007/JHEP01(2021)184
Jeong, S., Nekrasov, N.: Riemann-Hilbert Correspondence and Blown Up Surface Defects. JHEP. 12, (2020). 10.1007/JHEP12(2020)006
Jeong, S., Lee, N., Nekrasov, N.: Intersecting Defects in Gauge Theory, Quantum Spin Chains, and Knizhnik-Zamolodchikov Equation. JHEP. 10, (2021). https://doi.org/10.1007/JHEP10(2021)120
Kanno, H., Tachikawa, Y.: Instanton Counting with a Surface Operator and the Chain-Saw Quiver. [arXiv:1105.0357 [hep-th]]
Kapustin, A., Witten, E.: Electric-magnetic duality and the geometric langlands program. Commun. Num. Theor. Phys. 1, 1–236 (2007). https://doi.org/10.4310/CNTP.2007.v1.n1.a1. [arXiv:hep-th/0604151]
Kimura, T., Pestun, V.: Quiver W-algebras. Lett. Math. Phys. 108(6), 1351–1381 (2018). https://doi.org/10.1007/s11005-018-1072-1. [arXiv:1512.08533 [hep-th]]
Klemm, A., Lerche, W., Mayr, P., Vafa, C., Warner, N.P.: Selfdual strings and N = 2 supersymmetric field theory. Nucl. Phys. B 477(3), 746–764 (1996). https://doi.org/10.1016/0550-3213(96)00353-7. [arXiv:hep-th/9604034]
Knizhnik, V., Zamolodchikov, A.: Current algebra and Wess-Zumino model in two-dimensions. Nucl. Phys. B 247(1), 83–103 (1984). https://doi.org/10.1016/0550-3213(84)90374-2
Lee, N., Nekrasov, N.: Quantum spin systems and supersymmetric gauge theories. Part I. JHEP 03, 093 (2021). https://doi.org/10.1007/JHEP03(2021)093. [arXiv:2009.11199 [hep-th]]
Losev, A.S., Marshakov, A., Nekrasov, N.A.: Small instantons, little strings and free fermions. In: Shifman, M., et al. (eds.) From Fields to Strings, Ian Kogan Memorial, vol. 1, pp. 581–621. [arXiv:hep-th/0302191]
Nekrasov, N.: Holomorphic bundles and many body systems. Commun. Math. Phys. 180(3), 587–603 (1996). https://doi.org/10.1007/BF02099624. [arXiv:hep-th/9503157]
Nekrasov, N.A.: Seiberg-Witten prepotential from instanton counting. Adv. Theor. Math. Phys. 7(5), 831–864 (2003). https://doi.org/10.4310/ATMP.2003.v7.n5.a4. [arXiv:hep-th/0206161]
Nekrasov, N., Okounkov, A.: Seiberg-Witten theory and random partitions. Prog. Math. 244, 525–596 (2006). https://doi.org/10.1007/0-8176-4467-9_15. [arXiv:hep-th/0306238]
Nekrasov, N.: Localizing gauge theories. In: Zambrini, J.C. (ed.) 14th International Congress on Mathematical Physics 2003, pp. 645–654. World Scientific, Lisbon (2005)
Nekrasov, N.: On the BPS/CFT correspondence, Lecture at the University of Amsterdam string theory group seminar, Feb. 3 (2004)
Nekrasov, N.: 2d CFT-type equations from 4d gauge theory, Lecture at the “Langlands Program and Physics" conference, IAS, Princeton, March 8–10 (2004)
Nekrasov, N.: À la recherche de la \(M\)-theorie perdue. Z theory: Chasing \(m/f\)-theory. C. R. Phys. 6(2), 261–269 (2005). [arXiv:hep-th/0412021]
Nekrasov, N.: Non-perturbative Schwinger-Dyson equations: from BPS/CFT correspondence to the novel symmetries of quantum field theory. Phys.-Usp. 57, 133–149 (2014). https://doi.org/10.1142/9789814616850_0008
Nekrasov, N.: BPS/CFT correspondence: non-perturbative Dyson-Schwinger equations and \(qq\)-characters. JHEP 1603, 181 (2016). [arXiv:1512.05388 [hep-th]]
Nekrasov, N.: BPS/CFT correspondence II: instantons at crossroads, moduli and compactness theorem. Adv. Theor. Math. Phys. 21(2), 503–583 (2017). [arXiv:1608.07272 [hep-th]]
Nekrasov, N.: BPS/CFT Correspondence III: Gauge Origami partition function and \(qq\)-characters. Commun. Math. Phys. 358(3), 863–894 (2018). [arXiv:1701.00189 [hep-th]]
Nekrasov, N.: BPS/CFT correspondence IV: sigma models and defects in gauge theory. Lett. Math. Phys. 109(3), 579–622 (2019). [arXiv:1711.11011 [hep-th]]
Nekrasov, N.: BPS/CFT correspondence V: BPZ and KZ equations from \(qq\)-characters. [arXiv:1711.11582 [hep-th]]
Nekrasov, N.: Blowups in BPS/CFT correspondence, and Painlevé VI. [arXiv:2007.03646 [hep-th]]
Nekrasov, N., Pestun, V.: Seiberg-Witten geometry of four dimensional \({\cal{N}}=2\) quiver gauge theories. [arXiv:1211.2240 [hep-th]]
Nekrasov, N., Pestun, V., Shatashvili, S.: Quantum geometry and quiver gauge theories. Commun. Math. Phys. 357(2), 519–567 (2018). https://doi.org/10.1007/s00220-017-3071-y. [arXiv:1312.6689 [hep-th]]
Nekrasov, N., Rosly, A., Shatashvili, S.: Darboux coordinates, Yang-Yang functional, and gauge theory. Nucl. Phys. Proc. Suppl. 216, 69–93 (2011). https://doi.org/10.1016/j.nuclphysbps.2011.04.150. [arXiv:1103.3919 [hep-th]]
Nekrasov, N., Witten, E.: The omega deformation, branes, integrability, and Liouville theory. JHEP 1009, 092 (2010). https://doi.org/10.1007/JHEP09(2010)092. [arXiv:1002.0888 [hep-th]]
Olshanetsky, M., Perelomov, A.: Explicit solution of the Calogero model in the classical case and geodesic flows on symmetric spaces of zero curvature. Lett. Nuovo Cimento (2) 16(11), 333–339 (1976)
Pestun, V.: Localization of gauge theory on a four-sphere and supersymmetric Wilson loops. Commun. Math. Phys. 313(1), 71–129 (2012). https://doi.org/10.1007/s00220-012-1485-0. [arXiv:0712.2824 [hep-th]]
Poincaré, H.: Sur les groupes continus. Trans. Cambr. Philos. Soc. 18, 220–225 (1900)
Seiberg, N., Witten, E.: Gauge dynamics and compactification to three-dimensions. [arXiv:hep-th/9607163]
Vafa, C., Witten, E.: A strong coupling test of S-duality. Nucl. Phys. B 431(1–2), 3–77 (1994). https://doi.org/10.1016/0550-3213(94)90097-3. [arXiv:hep-th/9408074]
Witt, E.: Treue Darstellung Liescher Ringe. J. Reine Angew. Math. 177, 152–160 (1937)
Witten, E.: Solutions of four-dimensional field theories via \(M\)-theory. Nucl. Phys. B 500(1–3), 3–42 (1997). https://doi.org/10.1016/S0550-3213(97)00416-1. [arXiv:hep-th/9703166]
Witten, E.: Analytic continuation of Chern–Simons theory, Chern-Simons gauge theory: 20 years after, 347–446, AMS/IP Stud. Adv. Math., 50, Amer. Math. Soc., Providence, RI (2011). [arXiv:1001.2933 [hep-th]]
Witten, E.: Fivebranes and Knots. Quantum. Topol. 3, (2012), 1–137. https://doi.org/10.4171/QT/26
Zamolodchikov, A.B.: Infinite extra symmetries in two-dimensional conformal quantum field theory. Theor. Math. Phys. 65(3), 347–359 (1985). (in Russian)
Zamolodchikov, A.B., Zamolodchikov, A.B.: Conformal bootstrap in Liouville field theory. Nucl. Phys. B 477(2), 577–605 (1996). https://doi.org/10.1016/0550-3213(96)00351-3. [arXiv:hep-th/9506136]
Acknowledgements
N.N. is grateful to A. Okounkov, A. Rosly, and A. Zamolodchikov for discussions. A.T. gratefully acknowledges support from the Simons Center for Geometry and Physics and is extremely grateful to IHES (Bures-sur-Yvette, France) for the hospitality and wonderful working conditions, where some parts of the research for this paper were performed. The work of A.T. was partially supported by the NSF Grants DMS-1502497 and DMS-2037602.
Author information
Authors and Affiliations
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A: Analyticity properties
In this Appendix, we provide proofs of the regularity properties from Sect. 2.
Proof of Proposition 2.1
By inspecting the right-hand side of (12), we see that, for generic \({\mathbf{a }}, {\varepsilon }_1,{\varepsilon }_2\), and for any \({\overline{\lambda }}\in {{\mathcal {P}}}^N\), the rational functions \(\left( Y(x)\,\vert _{{\overline{\lambda }}} \right) ^{{\pm } 1}\) have only simple poles in x. Moreover, all the poles of \(Y(x+{{\varepsilon }})\,\vert _{{\overline{\lambda }}}\) and \(\left( Y(x)\,\vert _{{\overline{\lambda }}} \right) ^{-1}\) belong to the set
Hence, to prove the regularity of \(\left\langle \, {\mathcal {X}}(x)\, \right\rangle _{\mu }\), it suffices to verify that it has no poles at the above locus (254). Fix \(1\le b\le N,\, i\ge 0,\, j\ge 0\), and set
The function \(Y(x+{\varepsilon })\,\vert _{{\overline{\lambda }}}\) has a pole at \(x=x_0\) iff \(\square =(i+1,j+1)\in {\partial }_{-}{\lambda }^{({b})}\), while the function \(\left( Y(x)\,\vert _{{\overline{\lambda }}}\right) ^{-1}\) has a pole at \(x=x_0\) iff \({\square }=(i+1,j+1)\in {\partial }_{+}{\lambda }^{({b})}\). Note that
(where \(\square ^{b}_{(i+1,j+1)}\) denotes the (i, j)-th box in the b-th Young diagram) establishes a bijection between the loci of \({\overline{\lambda }}\) satisfying the first condition and the loci of \({\overline{\lambda }}'\) satisfying the second condition. Finally, for any \({\overline{\lambda }}\) from the first locus, a straightforward computation shows that:
This completes our proof of the proposition. \(\square \)
This result admits the following multi-parameter generalization [32,33,34,35,36,37]:
Proposition A.1
For arbitrary parameters \(\varvec{\nu }=(\nu _1,\ldots ,\nu _m)\in {{\mathbb {C}}}^m\), define the \({{\mathbb {C}}}(x)\)-valued observable \({\mathcal {X}}(x;\varvec{\nu }):{{\mathcal {P}}}^N\rightarrow {{\mathbb {C}}}(x)\) via:
where \(R(z)=\frac{(z-{\varepsilon }_1)(z-{\varepsilon }_2)}{z(z-{\varepsilon }_1-{\varepsilon }_2)}\). Then, the average \(\left\langle \, {\mathcal {X}}(x;\varvec{\nu })\, \right\rangle _\mu \) is a regular function of x.
As for \(m=1\) and \(\nu _1=0\), we have \({\mathcal {X}}(x;0)={\mathcal {X}}(x)\), this result generalizes Proposition 2.1.
Proof of Proposition A.1
The proof is similar to the previous one. For generic \((\varvec{\nu }, {\mathbf{a }}, {\varepsilon }_1,{\varepsilon }_2)\), each summand of (258) is a rational function in x with simple poles, all belonging to the set
Moreover, for a fixed quadruple \((b,r,i,j)\in \{1,\ldots ,N\}\times \{1,\ldots ,m\}\times {{\mathbb {Z}}}_{\ge 0}\times {{\mathbb {Z}}}_{\ge 0}\) as in (259), the (I, J)-th summand of \({\mathcal {X}}(x;\varvec{\nu })\,\vert _{{\overline{\lambda }}}\) (258) has a pole at
iff either of the following two conditions hold:
-
(I)
\(r\in I\) and \(\square =(i+1,j+1)\in {\partial }_{-}\lambda ^{(b)}\),
-
(II)
\(r\in J\) and \(\square =(i+1,j+1)\in {\partial }_{+} \lambda ^{(b)}\).
Clearly, the map
establishes a bijection between the loci of \({\overline{\lambda }}\) satisfying the first condition (I) and the loci of those satisfying the second condition (II), while a straightforward computation shows that:
The regularity of \(\left\langle \, {\mathcal {X}}(x;\varvec{\nu })\, \right\rangle _\mu \) follows. \(\square \)
Finally, let us prove the analyticity in the orbifold/colored setup.
Proof of Proposition 2.2
It follows immediately from the proof of Proposition 2.1 presented above. The key observation is that, while each non-colored residue of \(Y(x+{\varepsilon })\,\vert _{{\overline{\lambda }}}\) and \(\frac{P(x)}{Y(x)\,\vert _{{\overline{\lambda }}'}}\) at \(x=x_0\) (255) is a product of elements from the lattice \(\Lambda \) (7) and their inverses, the corresponding colored residues of \(Y_{{\omega }+1}(x+{\varepsilon })\,\vert _{{\overline{\lambda }}}\) and \(\frac{P_{\omega }(x)}{Y_{\omega }(x)\,\vert _{{\overline{\lambda }}'}}\) at \(x=x_0\) are zero unless \({\mathfrak {S}}_{x_0}={\omega }\), while in the latter case they are obtained from their non-colored counterparts by disregarding all factors from \(\Lambda \) with a nonzero \({{\mathbb {Z}}}_N\)-grading. Likewise, all elements of the lattice \(\Lambda \) that appear in \(\mu ^{\mathrm{orb}}\,\vert _{{\overline{\lambda }}}\) (23) are obtained from those that appear in \(\mu \,\vert _{{\overline{\lambda }}}\) (11) by disregarding all factors from \(\Lambda \) with a nonzero \({{\mathbb {Z}}}_N\)-grading.
Therefore, for each pair \(({\overline{\lambda }},{\overline{\lambda }}')\) from the proof of Proposition 2.1, see (256), we get (cf. (257)):
The regularity of \(\left\langle \, {\mathcal {X}}_{\omega }(x) \,\right\rangle _{\mu ^{\mathrm{orb}}}\) follows. \(\square \)
Appendix B: Some technical computations
The following equations are used in the proof of Theorem 3.1:
and
with \(u_i\)’s defined in (77) and satisfying the equality \(v_{i+1}=u_i-u_{i+1}\) of loc.cit.
Rights and permissions
About this article
Cite this article
Nekrasov, N., Tsymbaliuk, A. Surface defects in gauge theory and KZ equation. Lett Math Phys 112, 28 (2022). https://doi.org/10.1007/s11005-022-01511-8
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11005-022-01511-8
Keywords
- KZ equation
- Surface defect
- qq-character
- BPS/CFT correspondence
- Supersymmetric gauge theory
- Supersymmetry and duality