Abstract
We study the spectra of general \(N\times N\) Toeplitz matrices given by symbols in the Wiener Algebra perturbed by small complex Gaussian random matrices, in the regime \(N\gg 1\). We prove an asymptotic formula for the number of eigenvalues of the perturbed matrix in smooth domains. We show that these eigenvalues follow a Weyl law with probability sub-exponentially close to 1, as \(N\gg 1\), in particular that most eigenvalues of the perturbed Toeplitz matrix are close to the curve in the complex plane given by the symbol of the unperturbed Toeplitz matrix.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction and main result
Let \(a_{\nu }\in \mathbf{C}\), for \(\nu \in \mathbf{Z}\) and assume that
where \(m:\mathbf{Z}\rightarrow ]0,+\infty [\) satisfies
and
Let
act on complex valued functions on \(\mathbf{Z}\). Here \(\tau \) denotes translation by 1 unit to the right: \(\tau u(j)=u(j-1)\), \(j\in \mathbf{Z}\). By (1.2) we know that \(p(\tau )={{\mathcal {O}}}(1):\ell ^2(\mathbf{Z})\rightarrow \ell ^2(\mathbf{Z})\). Indeed, for the corresponding operator norm, we have
From the identity, \(\tau (e^{ik\xi })=e^{-i\xi }e^{ik\xi }\), we define the symbol of \(p(\tau )\) by
It is an element of the Wiener algebra [4] and by (1.2) in \(C^1(S^1)\).
We are interested in the Toeplitz matrix
acting on \({\mathbf {C}}^N \simeq \ell ^2([0,N[)\), for \(1\ll N<\infty \). Furthermore, we frequently identify \(\ell ^2([0,N[)\) with the space \(\ell ^2_{[0,N[}(\mathbf{Z})\) of functions \(u\in \ell ^2(\mathbf{Z})\) with support in [0, N[.
The spectra of such Toeplitz matrices have been studied thoroughly, see [4] for an overview. Let \(P_\infty \) denote \(p(\tau )\) as an operator \(\ell ^2(\mathbf{Z})\rightarrow \ell ^2(\mathbf{Z})\). It is a normal operator and by Fourier series expansions, we see that the spectrum of \(P_\infty \) is given by
The restriction \(P_{{\mathbf {N}}}=P_{\infty }|_{\ell ^2({\mathbf {N}})}\) of \(P_{\infty }\) to \(\ell ^2({\mathbf {N}})\) is in general no longer normal, except for specific choices of the coefficients \(a_\nu \). The essential spectrum of the Toeplitz operator \(P_{{\mathbf {N}}}\) is given by \(p(S^1)\) and we have pointspectrum in all loops of \(p(S^1)\) with nonzero winding number, i.e.,
By a result of Krein [4, Theorem 1.15], the winding number of \(p(S^1)\) around the point \(z\not \in p(S^1)\) is related to the Fredholm index of \(P_{{\mathbf {N}}}-z\): \(\mathrm {Ind}(P_{{\mathbf {N}}}-z) = - \mathrm {ind}_{p(S^1)}(z)\).
The spectrum of the Toeplitz matrix \(P_N\) is contained in a small neighborhood of the spectrum of \(P_{{\mathbf {N}}}\). More precisely, for every \(\epsilon >0\),
for \(N>0\) sufficiently large, where D(z, r) denotes the open disc of radius r, centered at z. Moreover, the limit of \(\sigma (P_{N})\) as \(N\rightarrow \infty \) is contained in a union of analytic arcs inside \( \sigma (P_{{\mathbf {N}}})\), see [4, Theorem 5.28].
We show in Theorem 1.1 that after adding a small random perturbation to \(P_N\), most of its eigenvalues will be close to the curve \(p(S^1)\) with probability very close to 1. See Fig. 1 for a numerical illustration.
1.1 Small Gaussian perturbation
Consider the random matrix
with complex Gaussian law
where L denotes the Lebesgue measure on \({\mathbf {C}}^{N\times N}\). The entries \(q_{j,k}\) of \(Q_{\omega }\) are independent and identically distributed complex Gaussian random variables with expectation 0, and variance 1, i.e., \(q_{j,k\sim {\mathcal {N}}_{{\mathbf {C}}}(0,1)}\).
We recall that the probability distribution of a complex Gaussian random variable \(\alpha \sim {\mathcal {N}}_{{\mathbf {C}}}(0,1)\) is given by
where \(L(d\alpha )\) denotes the Lebesgue measure on \({\mathbf {C}}\). If \(\mathbb {E}\) denotes the expectation with respect to the probability measure \(\mathbb {P}\), then
We are interested in studying the spectrum of the random perturbations of the matrix \(P_N^0=P_N\):
1.2 Eigenvalue asymptotics in smooth domains
Let \(\Omega \Subset {\mathbf {C}}\) be an open simply connected set with smooth boundary \(\partial \Omega \), which is independent of N, satisfying
-
(1)
\(\partial \Omega \) intersects \(p(S^1)\) in at most finitely many points;
-
(2)
\(p(S^1)\) does not self-intersect at these points of intersection;
-
(3)
these points of intersection are non-critical, i.e.,
$$\begin{aligned} d p \ne 0 \hbox { on } p^{-1}(\partial \Omega \cap p(S^1) ); \end{aligned}$$ -
(4)
\(\partial \Omega \) and \(p(S^1)\) are transversal at every point of the intersection.
Theorem 1.1
Let p be as in (1.6) and let \(P_N^{\delta }\) be as in (1.12). Let \(\Omega \) be as above, satisfying conditions (1)–(4), pick a \(\delta _0\in ]0,1[\) and let \(\delta _1 >3\). If
then there exists \(\varepsilon _N = o(1)\), as \(N\rightarrow \infty \), such that
with probability
In (1.14), we view p as a map from \(S^1\) to \({\mathbf {C}}\). Theorem 1.1 shows that most eigenvalues of \(P_N^{\delta }\) can be found close to the curve \(p(S^1)\) with probability subexponentially close to 1. This is illustrated in Fig. 1 for two different symbols. The left-hand side of Fig. 1 shows the spectrum of a perturbed Toeplitz matrix with \(N=2000\) and \(\delta =10^{-14}\), given by the symbol \(p = p_0 + p_1\) where
and
The red line shows the curve \(p(S^1)\). The right-hand side of Fig. 1 similarly shows the spectrum of the perturbed Toeplitz matrix given by \(p= p_0 + p_1\) where \(p_1\) is as above and
In our previous work [15], we studied Toeplitz matrices with a finite number of bands, given by symbols of the form
In this case, the symbols are analytic functions on \(S^1\) and we are able to provide in [15, Theorem 2.1] a version of Theorem 1.1 with a much sharper remainder estimate. See also [13, 14], concerning the special cases of large Jordan block matrices \(p(\tau ) = \tau ^{-1}\) and large bi-diagonal matrices \(p(\tau ) = a\tau + b\tau ^{-1}\), \(a,b\in {\mathbf {C}}\). However, Fig. 1 suggests that one could hope for a better remainder estimate in Theorem 1.1 as well.
Theorems 1.1 and 1.2 can be extended to allow for coupling constants with \(\delta _1 >1/2\). Furthermore, one can allow for much more general perturbations, for example perturbations given by random matrices whose entries are iid copies of a centered random variables with bounded fourth moment. However, both extensions require some extra work which we will present in a follow-up paper.
1.3 Convergence of the empirical measure and related results
An alternative way to study the limiting distribution of the eigenvalues of \(P_N^{\delta }\), up to errors of o(N), is to study the empirical measure of eigenvalues, defined by
where the eigenvalues are counted including multiplicity and \(\delta _{\lambda }\) denotes the Dirac measure at \(\lambda \in {\mathbf {C}}\). For any positive monotonically increasing function \(\phi \) on the positive reals and random variable X, Markov’s inequality states that \(\mathbb {P} [ |X| \ge \varepsilon ] \le \phi (\varepsilon )^{-1} \mathbb {E}[ \phi (|X|)]\), assuming that the last quantity is finite. Using \(\phi (x) = \mathrm {e}^{x/C}\), \(x\ge 0\), with a sufficiently large \(C>0\), yields that for \(C_1>0\) large enough
If \(\delta \le N^{-1}\), then (1.5) and the Borel–Cantelli Theorem shows that, almost surely, \(\xi _N\) has compact support for \(N>0\) sufficiently large.
We will show that, almost surely, \(\xi _N\) converges weakly to the push-forward of the uniform measure on \(S^1\) by the symbol p.
Theorem 1.2
Let \(\delta _0\in ]0,1[\), let \(\delta _1 >3\) and let p be as in (1.4). If (1.13) holds, i.e.,
then, almost surely,
weakly, where \(L_{S^1}\) denotes the Lebesgue measure on \(S^1\).
This result generalizes [15, Corollary 2.2] from the case of Toeplitz matrices with a finite number of bands to the general case (1.4).
Similar results to Theorem 1.2 have been proven in various settings. In [2, 3], the authors consider the special case of band Toeplitz matrices, i.e. \(P_N\) with p as in (1.19). In this case, they show that the convergence (1.22) holds weakly in probability for a coupling constant \(\delta = N^{-\gamma }\), with \(\gamma >1/2\). Furthermore, they prove a version of this theorem for Toeplitz matrices with non-constant coefficients in the bands, see [2, Theorem 1.3, Theorem 4.1]. They follow a different approach than we do: They compute directly the \(\log |\det {\mathcal {M}}_N -z|\) by relating it to \(\log |\det M_N(z)|\), where \(M_N(z)\) is a truncation of \(M_N -z\), where the smallest singular values of \(M_N-z\) have been excluded. The level of truncation, however, depends on the strength of the coupling constant and it necessitates a very detailed analysis of the small singular values of \(M_N -z\).
In the earlier work [9], the authors prove that the convergence (1.22) holds weakly in probability for the Jordan bloc matrix \(P_N\) with \(p(\tau ) = \tau ^{-1}\) (1.4) and a perturbation given by a complex Gaussian random matrix whose entries are independent complex Gaussian random variables whose variances vanish (not necessarily at the same speed) polynomially fast, with minimal decay of order \(N^{-1/2+}\). See also [6] for a related result.
In [20], using a replacement principle developed in [18], it was shown that the result of [9] holds for perturbations given by complex random matrices whose entries are independent and identically distributed random complex random variables with expectation 0 and variance 1 and a coupling constant \(\delta = N^{-\gamma }\), with \(\gamma >2 \).
1.4 Notation
We will frequently use the following notation: When we write \(a \ll b\), we mean that \(Ca \le b\) for some sufficiently large constant \(C>0\). The notation \(f = {\mathcal {O}}(N)\) means that there exists a constant \(C>0\) (independent of N) such that \(|f| \le C N\). When we want to emphasize that the constant \(C>0\) depends on some parameter k, then we write \(C_k\), or with the above notation \({\mathcal {O}}_k(N)\).
2 The unperturbed operator
We are interested in the Toeplitz matrix
for \(1\ll N<\infty \), see also (1.7). Here we identify \(\ell ^2([0,N[)\) with the space \(\ell ^2_{[0,N[}(\mathbf{Z})\) of functions \(u\in \ell ^2(\mathbf{Z})\) with support in [0, N[. Sometimes we write \(P_N=P_{[0,N[}\) and identify \(P_N\) with \(P_{I}=1_Ip(\tau )1_I\) where \(I=I_N\) is any interval in \(\mathbf{Z}\) of “length” \(|I|=\# I=N\).
Let \(P_\mathbf{N}=P_{[0,+\infty [}\) and let \(P_{\mathbf{Z}/{\widetilde{N}}{} \mathbf{Z}}\) denote \(P=p(\tau )\), acting on \(\ell ^2(\mathbf{Z}/{\widetilde{N}}{} \mathbf{Z})\) which we identify with the space of \({\widetilde{N}}\)-periodic functions on \(\mathbf{Z}\). Here \({\widetilde{N}}\ge 1\). Using the discrete Fourier transform, we see that
where \(S_{{\widetilde{N}}}\) is the dual of \(\mathbf{Z}/{\widetilde{N}}{\mathbf {Z}}\) and given by
Let
and notice that
We now consider [0, N[ as an interval \(I_N\) in \(\mathbf{Z}/{\widetilde{N}}{} \mathbf{Z}\), \({\widetilde{N}}=N+M\), where \(M\in \{1,2,.. \}\) will be fixed and independent of N. The matrix of \(P_N\), indexed over \(I_N\times I_N\) is then given by
where \({\widetilde{j}},{\widetilde{k}}\in \mathbf{Z}\) are the preimages of j, k under the projection \(\mathbf{Z}\rightarrow \mathbf{Z} /{\widetilde{N}}\mathbf{Z}\) that belong to the interval \([0,N[\subset \mathbf{Z}\).
Let \({\widetilde{P}}_N\) be given by the formula (2.4), with the difference that we now view \(\tau \) as a translation on \(\ell ^2(\mathbf{Z}/{\widetilde{N}}{} \mathbf{Z})\):
The matrix of \({\widetilde{P}}_N\) is given by
Alternatively, if we let \({\widetilde{j}},{\widetilde{k}}\) be the preimages in [0, N[ of \(j,k\in I_N\), then
Recall that the terms in (2.7), (2.8) with \(|\nu |>N\) or \(|{\widehat{j}}-{\widetilde{k}}|>N\) do vanish. This implies that with \({\widetilde{j}}\), \({\widetilde{k}}\) as in (2.8),
Here
Since \({\widetilde{k}}\in [0,N[\), we have for the first term in (2.9) that \(|{\widetilde{j}}-{\widetilde{N}}-{\widetilde{k}}|={\widetilde{k}}+M+(N-{\widetilde{j}})\) with nonnegative terms in the last sum. Similarly for the second term in (2.9), we have \(|{\widetilde{j}}+{\widetilde{N}}-{\widetilde{k}}|={\widetilde{j}}+M+(N-{\widetilde{k}})\) where the terms in the last sum are all \(\ge 0\).
It follows that the trace class norm of \(P_N-{\widetilde{P}}_N\) is bounded from above by
By (1.2), it follows that
uniformly with respect to N. Here \(\Vert A\Vert _{\mathrm {tr}} = \mathrm {tr} (A^*A)^{1/2}\) denotes the Schatten 1-norm for a trace class operator A.
Remark 2.1
To illustrate the difference between \(P_N\) and \({\widetilde{P}}_N\) let \(N\gg 1\), \(M>0\) and consider the example of \(p(\tau ) = \tau ^{n}\), so \(a_{n} =1\), for some fixed \(n\in {\mathbf {N}}\), and \(a_{\nu }=0\) for \(\nu \ne n\). Since \(P_N(j,k) = a^N_{{\widetilde{j}}-{\widetilde{k}}}\), we see that
In other words \(P_N = (J^*)^n\) where J denotes the \(N\times N\) Jordan block matrix. The matrix elements of \({\widetilde{P}}_N\) on the other hand are given by \({\widetilde{P}}_N(j,k) = a^N_{{\widetilde{j}}-{\widetilde{N}}-{\widetilde{k}}} + a^N_{{\widetilde{j}}-{\widetilde{k}}} +a^N_{{\widetilde{j}}+{\widetilde{N}}-{\widetilde{k}}}\), so
So \({\widetilde{P}}_N = P_N + J^{(N+M-n)}\), when \(n\ge M\), otherwise \({\widetilde{P}}_N = P_N\).
3 A Grushin problem for \(P_N-z\)
Let \(K\Subset {\mathbf {C}}\) be an open relatively compact set and let \(z\in K\). Consider
as subsets of \(\mathbf{Z}/(N+M)\mathbf{Z}\) so that
is a partition. Recall (2.3), (2.6) and consider
and write this operator as a \(2\times 2\) matrix
induced by the orthogonal decomposition
The operator \(p_N(\tau )\) is normal and we know by (2.2) that its spectrum is
Replacing \({\widetilde{P}}_N\) in (3.2) by \(P_N\) (2.4), we put
Then, by (2.10),
If \(\epsilon (M)<\mathrm {dist}\,(z,p_N(S_{N+M}))=:d_N(z)\), then \({{\mathcal {P}} }_N(z)\) is bijective and
Write,
Here,
so
Similarly from
we get
In analogy with (3.5), we write
where J, \(I_N\) were defined in (3.1), still viewed as intervals in \(\mathbf{Z}_{N+M}\). From (3.7), we get for the respective operator norms:
4 Second Grushin problem
We begin with a result, which is a generalization of [16, Proposition 3.4] to the case where \(R_{+-}\ne 0\).
Proposition 4.1
Let \({{\mathcal {H}}}_1, {{\mathcal {H}}}_2, {{\mathcal {H}}}_{\pm }, {{\mathcal {S}}}_{\pm }\) be Banach spaces. If
is bijective with bounded inverse
and if
is bijective with bounded inverse
then
is bijective with bounded inverse
Proof
We can essentially follow the proof of [16, Proposition 3.4]. We need to solve
Putting \({\widetilde{v}}_+=R_+u+R_{+-}S_-u_-\), the first equation is equivalent to
and hence to
Therefore, we can replace u by \({\widetilde{v}}_+\) and (4.5) is equivalent to
which can be solved by \({\mathcal {F}}\). Hence, (4.7) is equivalent to
and (4.6) gives the unique solution of (4.5)
\(\square \)
4.1 Grushin problem for \(E_{-+}(z)\)
We want to apply Proposition 4.1 to \({{\mathcal {P}}}={{\mathcal {P}}}(z)={{\mathcal {P}}}_N(z)\) in (3.5) with the inverse \({{\mathcal {E}}}={{\mathcal {E}}}_N(z)\) in (3.10), where we sometimes drop the index N. We begin by constructing an invertible Grushin problem for \(E_{-+}\):
Let \(0\le t_1 \le \dots \le t_M\) denote the singular values of \(E_{-+}(z)\). Let \(e_1, \dots , e_M\) denote an orthonormal basis of eigenvectors of \(E_{-+}^*E_{-+}\) associated to the eigenvalues \(t_1^2 \le \dots \le t_M^2\). Since \(E_{-+}\) is a square matrix, we have that \(\dim {\mathcal {N}}(E_{-+}(z)) = \dim {\mathcal {N}}(E_{-+}^*(z))\)Footnote 1. Using the spectral decomposition \(\ell ^2(J) = {\mathcal {N}}(E_{-+}^*E_{-+}) \oplus _{\perp } {\mathcal {R}}(E_{-+}^*E_{-+})\) together with the fact that \({\mathcal {N}}(E_{-+}^*E_{-+}) = {\mathcal {N}}(E_{-+}) \) and \({\mathcal {R}}(E_{-+}^*) = {\mathcal {N}}(E_{-+})^{\perp }\), it follows that \({\mathcal {R}}(E_{-+}^*) = {\mathcal {R}}(E_{-+}^*E_{-+})\). Similarly, we get that \({\mathcal {R}}(E_{-+}) = {\mathcal {R}}(E_{-+}E_{-+}^*)\). One then easily checks that \(E_{-+}: {\mathcal {R}}(E_{-+}^*E_{-+}) \rightarrow {\mathcal {R}}(E_{-+}E_{-+}^*)\) is a bijection. Similarly, \(E_{-+}^*: {\mathcal {R}}(E_{-+}E_{-+}^*) \rightarrow {\mathcal {R}}(E_{-+}^*E_{-+})\) is a bijection. Let \(f_1,\dots ,f_{M_0}\) denote an orthonormal basis of \({\mathcal {N}}(E_{-+}^*(z))\) and set
Then, \(f_1,\dots ,f_M\) is an orthonormal basis of \(\ell ^2(J)\) comprised of eigenfunctions of \(E_{-+}E_{-+}^*\) associated with the eigenvalues \(t_1^2 \le \dots \le t_M^2\). In particular, \(\sigma (E_{-+}E_{-+}^*) = \sigma (E_{-+}^*E_{-+})\) and
Let \(0\le t_1\le ...\le t_k\) be the singular values of \(E_{-+}(z)\) in the interval \([0,\tau ]\) for \(\tau >0\) small. Let \({{\mathcal {S}}}_+,\, {{\mathcal {S}}}_-\subset \ell ^2(J)\) be the corresponding (sums of) spectral subspaces for \(E_{-+}^*E_{-+}\) and \(E_{-+}E_{-+}^*\), respectively, corresponding to the eigenvalues \(t_1^2\le t_2^2\le ... \le t_k^2\) in \([0,\tau ^2]\). Using (4.8), we see that the restrictions (denoted by the same symbols)
have norms \(\le \tau \). Also,
are bijective with inverses of norm \(\le 1/\tau \).
Let \(S_+\) be the orthogonal projection onto \({{\mathcal {S}}}_+\), viewed as an operator \(\ell ^2(J)\rightarrow {{\mathcal {S}}}_+\), whose adjoint is the inclusion map \({{\mathcal {S}}}_+\rightarrow \ell ^2(J)\). Let \(S_-:{{\mathcal {S}} }_-\rightarrow \ell ^2(J)\) be the inclusion map. Let \({{\mathcal {S}}}\) be the operator in (4.2) with \({{\mathcal {H}}}_\pm =\ell ^2(J)\), corresponding to the problem
for the unknowns \(g\in \ell ^2(J)\), \(g_-\in {{\mathcal {S}}}_-\). Using the orthogonal decompositions,
we write \(g=\sum _1^kg_je_j + g^{\perp }\) and \(h=\sum _1^kh_jf_j + h^{\perp }\). Then, (4.10) is equivalent to
where we also used that \(g_-=\sum _1^kg_-^jf_j\) and \(h_+=\sum _1^kh_+^je_j\). It follows that
Hence, the unique solution to (4.10) is given by
where
Here \(\Pi _{B}\) denotes the orthogonal projection onto the subspace B of A, viewed as a self-adjoint operator \(A\rightarrow A\). Notice that \(F = \Pi _{{\mathcal {S}}_+^{\perp }} F\) and that
Using as well (4.9), we have
4.2 Composing the Grushin problems
From now on we assume that
and the estimates below will be uniformly valid for \(z\in K\setminus \gamma _\alpha \), \(N\gg 1\), where K is some fixed relatively compact open set in \(\mathbf{C}\) and
We apply Proposition 4.1 to \({{\mathcal {P}}_N}\) in (3.5) with the inverse \({\mathcal {E}}_N\) in (3.10), and to \({\mathcal {S}}\) defined in (4.10) with inverse in \({\mathcal {F}}\) in (4.12). Let \(z\in K\backslash \gamma _\alpha \), then
defined as in (4.3), is bijective with the bounded inverse
Since \(S_\pm \) have norms \(\le 1\), we get
uniformly in N, \(\alpha \) and \(z\in K\). Also, since the norms of \(E^N,E_+^N, E_-^N\) are \(\le 2/\alpha \) (uniformly as \(N\rightarrow \infty \)) by (3.11), we get from (4.4), (4.15), that
Proposition 4.2
Let \(K\Subset {\mathbf {C}}\) be an open relatively compact set, let \(z\in K\backslash \gamma _\alpha \), and let \(\tau >0\) be as in the definition of the Grushin problem (4.10). Then, for \(\tau >0\) small enough, depending only on K, we have that \(G_+^N\) is injective and \(G_-^N\) is surjective. Moreover, there exists a constant \(C>0\), depending only on K, such that for all \(z\in K\backslash \gamma _{\alpha }\) the singular values \(s_j^+\) of \(G_+^N\), and \(s_j^-\) of \((G_-^N)^*\) satisfy
Proof
To ease the notation we will omit the sub/superscript N. We begin with the injectivity of \(G_+\). From
we have \(T_+G_++T_{+-}G_{-+}=1\) which we write \(T_+G_+=1-T_{+-}G_{-+}\). Here
where we used that \(\Vert R_{+-} \Vert \le \Vert p(\tau )-z\Vert = {\mathcal {O}}(1)\Vert m\Vert _{\ell ^1}\), thus the error term above only depends on K. Choosing \(\tau >0\) small enough, depending on K but not on N, we get that \(\Vert T_{+-}G_{+-}\Vert \le 1/2\). Then, \(1-T_{+-}G_{-+}\) is bijective with \(\Vert (1-T_{+-}G_{-+})^{-1}\Vert \le 2\) and \(G_+\) has the left inverse
of norm \(\le 2\Vert R_+\Vert ={{\mathcal {O}}}(1)\), depending only on K.
Now we turn to the surjectivity of \(G_-\). From
we get
and as above we then see that \(G_-^*\) has the left inverse \((1-T_{+-}^*G_{-+}^*)^{-1}T_-^*\). Hence, \(G_-\) has the right inverse
of norm \(\le 2\Vert R_-\Vert = {\mathcal {O}}(1)\), depending only on K.
The lower bound on the singular values follows from the estimates on the left inverses of \(G_+\) and \(G_-^*\), and the upper bound follows from (4.21). \(\square \)
5 Determinants
We continue working under the assumptions (4.16), (4.17). Additionally, we fix \(\tau >0\) sufficiently small (depending only on the fixed relatively compact set \(K\Subset {\mathbf {C}}\)) so that \(\Vert T_{+-}G_{-+}\Vert \), \(\Vert G_{-+}T_{+-}\Vert \) (both \(={{\mathcal {O}}}(\tau )\)) are \(\le 1/2\), which implies that \(G_+\) is injective and \(G_+\) is surjective, see Proposition 4.2. Here, we sometimes drop the sub-/superscript N.
From now on, we will work with \(z\in K\backslash \gamma _{\alpha }\). The constructions and estimates in Sect. 3 are then uniform in z for \(N\gg 1\) and the same holds for those in Sect. 4.
Remark 5.1
To get the o(N) error term in Theorem 1.1, we will take \(\alpha >0 \) arbitrarily small, and \(M>1\) large enough (but fixed) so that \(\varepsilon (M) \le \alpha /2\), see (2.10) as well as \(N>1\) sufficiently large. In the following, the error terms will typically depend on \(\alpha \), although we will not always denote this explicitly, however, they will be uniform in \(N>1\) and in \(z\in K\backslash \gamma _{\alpha }\).
5.1 The unperturbed operator
For \(z\in K\setminus \gamma _\alpha \), we have \(d_N(z)\ge \alpha \) and (3.8), (3.9) give
where we also used that
by the second inequality in (4.16). Recall here that \(p_N(\tau )\) acts on \(\ell ^2(\mathbf{Z}/{\widetilde{N}}{} \mathbf{Z})\), \({\widetilde{N}}=N+M\).
By the Schur complement formula, we have
so
Recall from Sect. 4.1 that the singular values of \(E_{-+}\) are denoted by \(0\le t_1\le t_2\le \dots \le t_M\) and that those of \(G_{-+}\) are \(t_1,...,t_k\), where \(k=k(z,N)\) is determined by the condition \(t_k\le \tau <t_{k+1}\). Thus
and we get (since \(\tau \ll 1\))
Since \(\tau >0\) is small, but fixed depending only on K, we have uniformly for \(z\in K\setminus \gamma _\alpha \), \(N\gg 1\):
and by (5.4)
hence
5.2 The perturbed operator
We next extend the estimates to the case of a perturbed operator
where \(Q:\ell ^2(I_N)\rightarrow \ell ^2(I_N)\) satisfies
Proposition 5.2
Let \(K\Subset {\mathbf {C}}\) be an open relatively compact set and suppose that (4.16) hold. Recall (4.17) and (3.5), if \(\delta \Vert Q\Vert \alpha ^{-1} \ll 1\), then for all \(z\in K\backslash \gamma _{\alpha }\)
is bijective with bounded inverse
Recall (4.18), if \(\delta \Vert Q\Vert \alpha ^{-2} \ll 1 \), then for all \(z\in K\backslash \gamma _{\alpha }\)
is bijective with bounded inverse
with
Moreover, \(\Vert {\mathcal {E}}_N^\delta \Vert \le 4/\alpha \), \(\Vert {\mathcal {G}}_N^\delta \Vert \le {\mathcal {O}}(\alpha ^{-2})\), uniformly in \(z\in K\backslash \gamma _{\alpha }\) and \(N>1\).
Proof
We sometimes drop the subscript N. By (3.10),
By (3.11), it follows that \(\Vert E\Vert \le 2/\alpha \), so if \(\delta \Vert Q\Vert \alpha ^{-1} \ll 1\), then by Neumann series argument, the above is invertible and
is a right inverse of \({\mathcal {P}}^\delta \), of norm \(\le 2 \Vert {\mathcal {E}} \Vert \le 4/\alpha \). Since \({\mathcal {P}}^\delta \) is Fredholm of index 0, this is also a left inverse.
The proof for \({\mathcal {T}}_N^\delta \) is similar, using that \(\Vert G\Vert ={\mathcal {O}}(\alpha ^{-2})\) by (4.21), since \(\tau >0\) is fixed. Finally, the expression (5.15) follows easily from expanding (5.16). \(\square \)
We drop the subscript N until further notice. By (5.13), we have
Recall from the text after (2.10) the definition of the Schatten norm \(\Vert \cdot \Vert _{\mathrm {tr}}\). Write,
where
Here, we used that \(\Vert {{\mathcal {T}}}^{-1}\Vert = \Vert {\mathcal {G}} \Vert = {\mathcal {O}}(1)\), by (4.21) and the fact that \(\tau >0\) is fixed. We recall that the estimates here depend on \(\alpha \), yet are uniform in \(z\in K\backslash \gamma _{\alpha }\) and \(N>1\). It follows that
and
Similarly from the identity
(putting \(\delta \) as a subscript whenever convenient), we get
thus
Assume that (uniformly in \(N>1\) and independently of \(\alpha \))
and recall (5.8). Then,
Notice that the error term depends on \(\alpha \). Using also the general identity (cf. (5.3)),
we get
uniformly for \(z\in K\setminus \gamma _\alpha \), \(N\gg 1\).
6 Lower bounds with probability close to 1
We now adapt the discussion in [15, Section 5] to \({{\mathcal {T}} }^\delta \). Let
where \(0\le \delta \ll 1\) and \(q_{j,k}(\omega )\sim {{\mathcal {N}}}(0,1)\) are independent normalized complex Gaussian random variables. Recall from (1.21) that
for some universal constant \(C_1>0\). In the following, we restrict the attention to the case when
and (as before) \(z\in K\setminus \gamma _\alpha \), \(N\gg 1\). We assume that
Then,
and the estimates of the previous sections apply.
Let \({{\mathcal {Q}}}_{C_1N}\) be the set of matrices satisfying (6.3). As in [15, Section 5.3], we study the map (5.15), i.e.,
where
and notice first that by (4.21)
We strengthen the assumption (6.4) to
At the end of Sect. 4, we have established the uniform injectivity and surjectivity respectively for \(G_+\) and \(G_-\). This means that the singular values \(s_j^\pm \) of \(G_\pm \) for \(1\le j\le k(z)=\mathrm {rank}\,(G_-)=\mathrm {rank}\,(G_+) \) satisfy
This corresponds to [15, (5.27)] and the subsequent discussion there carries over to the present situation with the obvious modifications. Similarly to [15, (5.42)], we strengthen the assumption on \(\delta \) to
Notice that assumption (6.10) is stronger than the assumptions on \(\delta \) in Proposition 5.2. The same reasoning as in [15, Section 5.3] leads to the following adaptation of Proposition 5.3 in [15]:
Proposition 6.1
Let \(K\subset \mathbf{C}\) be compact, \(0<\alpha \ll 1\) and choose M so that \(\epsilon (M)\le \alpha /2\). Let \(\delta \) satisfy (6.10). Then, the second Grushin problem with matrix \({{\mathcal {T}} }^\delta \) is well posed with a bounded inverse \({{\mathcal {G}}}^\delta \) introduced in Proposition 5.2. The following holds uniformly for \(z\in K\setminus \gamma _\alpha \), \(N\gg 1\):
There exist positive constants \(C_0\), \(C_2\) such that
when
7 Counting eigenvalues in smooth domains
In this section, we will prove Theorem 1.1. We will begin with a brief outline of the key steps:
We wish to count the zeros of the holomorphic function \(u(z) = \det (P_N^{\delta }-z)\), which depends on the large parameter \(N>0\), in smooth domains \(\Omega \Subset {\mathbf {C}}\) as in Theorem 1.1.
1. We work in some sufficiently large but fixed compact set \(K\Subset {\mathbf {C}}\) containing \(\Omega \). In Sect. 7.1, we begin by showing that u(z) satisfies with probability close to 1 an upper bound of the form
for \(z\in K\). Here, \(0< \varepsilon \ll 1\) and \(\phi (z)\) is some suitable continuous subharmonic function. Next, we will show that u(z) satisfies for any fixed point \(z_0\) in \(K\backslash \Gamma _{\alpha }\) a lower bound of the form
with probability close to 1. Here, \(\Gamma _{\alpha }\) denotes the set \(\gamma _{\alpha }\) suitably enlarged to be a compact set with smooth boundary, see Fig. 2 for an illustration. The function \(\phi \) will be constructed in the following way : Outside \(\Gamma _{\alpha }\) we set \(\phi (z)\) to be \(\ln |\det (p_N(\tau )-z)|\), which in view of (5.25) and Proposition 6.1 yields the estimates (7.1), (7.2) outside \(\Gamma _{\alpha }\). Inside \(\Gamma _{\alpha }\), we set \(\phi \) to be the solution to the Dirichlet problem for the Laplace operator on \(\Gamma _{\alpha }\) with boundary conditions \(\phi \!\upharpoonright _{\partial \Gamma _{\alpha }} = \ln |\det (p_N(\tau )-z)|\!\upharpoonright _{\partial \Gamma _{\alpha }}\). Since \(\ln |u(z)|\) is subharmonic we have that the bound (7.1) holds in all of K.
2. In Sect. 7.2, we will use (7.1), (7.2) and [12, Theorem 1.1] (see also [13, Chapter 12]) to estimate the number of zeros of u in \(\Omega \) and thus the number of eigenvalues of \(P_N^{\delta }\) in \(\Omega \), i.e.,
see (7.22).
3. In Sect. 7.3, we study the measure \(\Delta \phi \) by analyzing the Poisson and Green kernel of \(\Gamma _{\alpha }\). We will use this analysis to give precise error estimates on the asymptotics (7.3) and we will show that \(\frac{N}{2\pi } \Delta \phi \) integrated over \(\Omega \) is, up to a small error, given by the number of eigenvalues \(\lambda _j\) of \(p_N(\tau )\) (3.4) in \(\Omega \), i.e.,
see (7.53). This, in combination with (7.3), see (7.22), will let us conclude Theorem 1.1.
7.1 Estimates on the log-determinant
We work under the assumptions of Proposition 6.1 and from now on we assume that \(\delta \) satisfies (1.13), i.e.,
for some fixed \(\delta _0 \in ]0,1[\) and \(\delta _1 >3\). Notice that (6.10) holds for \(N>1\) sufficiently large (depending on \(\alpha \)). Then with probability \(\ge 1-e^{-N^2}\), we have \(G_{-+}^\delta (z)={{\mathcal {O}}}(1)\) for every \(z\in K\setminus \gamma _\alpha \), hence by (5.25)
On the other hand, by (5.25) and Proposition 6.1, we have for every \(z\in K\setminus \gamma _\alpha \) that
with probability
when
Next we enlarge \(\gamma _{\alpha }\) to \(\Gamma _{\alpha }\), away from a neighborhood of the region \(\partial \Omega \cap \gamma \), so that \(\Gamma _{\alpha }\) has a smooth boundary. More precisely, let \(g \in C^\infty ({\mathbf {C}};{\mathbf {R}})\) be a boundary defining function of \(\Omega \), so that \(g(z)<0\) for \(z\in \Omega \) and \(dg \ne 0\) on \(\partial \Omega \). Then, for \(C>0\) sufficiently large and \(\alpha >0\) sufficiently small, we define
Notice that due to the assumption that the intersection of \(\partial \Omega \) with \(\gamma \) is transversal, the boundary of \(\Gamma ^0_{\alpha }\) may be only Lipschitz near the intersection points
By the assumptions on \(\Omega \), we have that \(q < \infty \). Away from these points, we have that \(\partial \Gamma ^0_{\alpha }\) is smooth. To remedy this lack of regularity, we will slightly deform \(\Gamma ^0_{\alpha }\) in an \(\alpha \)-neighborhood of these points.
Pick \(z_0\in \partial \gamma _{\alpha }\cap \partial G\). Since \(\partial \gamma _{\alpha }\cap D(z_0,\alpha )\) and \(\partial G\cap D(z_0,\alpha )\) are transversal to each other, it follows that there exists new affine coordinates \({\widetilde{z}} = U(z-z_0)\), \({\mathbf {R}}^2\simeq {\mathbf {C}}\ni z=(z^1,z^2)\) being the old coordinates, where U is orthogonal, and smooth functions \(f_1,f_2\) independent of \(\alpha \), such that \(\gamma _{\alpha }\cap D(z_0,\alpha )\) takes the form
and that \(({\mathbf {C}}\backslash \mathring{G})\cap D(z_0,\alpha )\) takes the form
Here, \(f_1\), respectively \(f_2\), is (after translation and rotation) a smooth local parametrization of \(\partial G\), resp. \(\partial \gamma _{\alpha }\), near \(z_0\). Moreover, \(f_2(0) = f_1(0)\) and the transversality assumption yields that \({\widetilde{z}}^1=0\) is the only point in the interval \(]-\alpha ,\alpha [\) where \(f_2({\widetilde{z}}^1) = f_1({\widetilde{z}}^1)\).
Then, \(\Gamma ^0_{\alpha }\cap D(z_0,\alpha )\) takes the form
Continuing, let \(\chi \in C_c^{\infty }({\mathbf {R}};[0,1])\) so that \(\chi =1\) on \([-1/4,1/4]\) and \(\chi =0\) outside \(]-1/2,1/2[\), and let \(C>0\) be sufficiently large. Set
which is a smooth function. Then, let \(\Gamma _{\alpha }^1\) be equal to \(\Gamma ^0_{\alpha }\) outside \(D(z_0,\alpha )\), and equal to
inside \(D(z_0,\alpha )\). Summing up, we have that the boundary of \(\Gamma _{\alpha }^1\) is smooth at \(z_0\) and \(\Gamma _{\alpha }^0 \subset \Gamma _{\alpha }^1\).
Next, we perform the same procedure for \(\Gamma _{\alpha }^1\) at the point \(z_1\) and obtain \(\Gamma _{\alpha }^2\) whose boundary is smooth at \(z_0\) and \(z_1\) and which contains \(\Gamma _{\alpha }^1\). Continuing in this way until \(z_q\), and defining
we have that \(\Gamma _{\alpha }\) has a smooth boundary and it contains \(\Gamma ^0_{\alpha }\) (7.9), and thus \(\gamma _{\alpha }\). Figure 2 presents an illustration of this “fattening” of \(\gamma _{\alpha }\).
Remark 7.1
Notice that the deformation of the boundary of \(\Gamma _{\alpha }^0\) (7.9) has been done in such a way that the rescaled domain \(\frac{1}{\alpha } \Gamma _{\alpha }\) has a smooth boundary which can be locally parametrized by a smooth function f with \(\partial ^{\beta } f = {\mathcal {O}}(1)\), \(\beta \in {\mathbf {N}}\), uniformly in \(\alpha \).
Continuing, we define \(\phi (z)=\phi _N(z)\) by requiring that
and
Here we assume that K is large enough to contain a neighborhood of \(\Gamma _\alpha \). Choose
for some fixed \(\epsilon _0\in ]0,1[\) with \(\delta _0< \varepsilon _0\), see (7.4), (1.13). Then,
and we require from \(\delta \) that
i.e.,
This is fulfilled if \(N\gg 1\) and
i.e.,
and (7.13), (7.14) imply (7.8) when \(N\gg 1\). Notice that (7.4) implies (7.14) for \(N\gg 1\).
Combining (7.6), (7.11), (7.13) and (7.14), we get for each \(z\in K\setminus { \Gamma _\alpha } \) that
with probability
where
Here and in the following, we assume that \(N\ge N(\alpha ,K)\) sufficiently large.
On the other hand, with probability \(\ge 1-e^{-N^2}\), we have by (7.5)
for all \(z\in K\setminus \Gamma _\alpha \). Then, since the left- hand side in (7.18) is subharmonic and the right-hand side is harmonic in \(\Gamma _\alpha \), we see that (7.18) remains valid also in \(\Gamma _\alpha \) and hence in all of K.
7.2 Counting zeros of holomorphic functions with exponential growth
Let \(\Omega \Subset \mathbf{C}\) be as in Theorem 1.1, so that \(\partial \Omega \) intersects \(\gamma \) at finitely many points \({\widetilde{z}}_1,...,{\widetilde{z}}_{k_0}\) which are not critical values of p and where the intersection is transversal. Choose \(z_1,...,z_L\in \partial \Omega \setminus \Gamma _\alpha \) such that with \(r_0=C_0\alpha \), \(C_0\gg 1\), we have
where the \(z_j\) are distributed along the boundary in the positively oriented sense and with the cyclic convention that \(z_{L+1}=z_{1}\). Notice that \(L={{\mathcal {O}}}(1/\alpha )\). Then,
and we can arrange so that \(z_j\not \in \Gamma _\alpha \) and even so that
for \(\alpha >0\) sufficiently small.
Choose K above so that \({\overline{\Omega }}\Subset K\). Combining (7.18) and (7.15), we have that \(\det (P_N^\delta -z)\) satisfies the upper bound (7.18) for all \(z\in K\) and the lower bound (7.15) for \(z = z_1,\dots , z_L\) with probability
Since \(\phi \) is continuous and subharmonic, we can apply [12, Theorem 1.1] (see also [13, Chapter 12]) to the holomorphic function \(\det (P_N^\delta -z)\) and get
with probability (7.21).
Recall that \(L={{\mathcal {O}}}(1/\alpha )\) (hence \({{\mathcal {O}}}(1)\) for every fixed \(\alpha \)). \(\Delta \phi \) is supported in \(\Gamma _\alpha \) and the number of discs \(D(z_j,r_0)\) that intersect \(\Gamma _\alpha \) is \(\le {{\mathcal {O}}}(1)\) uniformly with respect to \(\alpha \). Also \(\ln (|z-z_j|/r_0)={{\mathcal {O}} }(1)\) on the intersection of each such disc with \(\Gamma _\alpha \). Since \(\epsilon _1=N^{\epsilon _0-1}\), we get from (7.22):
7.3 Analysis of the measure \(\Delta \phi \)
By (3.4), we have that
where
and this expression is equal to \(N\phi (z)\) in \(K\setminus \Gamma _\alpha \).
Define
so that \(\psi \) is continuous away from the \(\lambda _j\in \gamma \),
It follows that in \(\Gamma _\alpha \):
where \(G_{\Gamma _\alpha }\) is the Green kernel for \(\Gamma _\alpha \).
\(\phi \) is harmonic away from \(\partial \Gamma _\alpha \), so for \(\phi \) as a distribution on \(\mathbf{C}\), we have \(\mathrm {supp}\,\Delta \phi \subset \partial \Gamma _\alpha \). Now \(\psi -\phi \) is harmonic near \(\partial \Gamma _\alpha \), so \(\Delta \psi =\Delta \phi \) near \(\partial \Gamma _\alpha \). In the interior of \(\Gamma _\alpha \) we have (7.28) and in order to compute \(\Delta \psi \) globally, we let \(v\in C_0^\infty (\mathbf{C})\) and apply Green’s formula to get
Here \(\nu \) is the exterior unit normal and in the last term, it is understood that we apply \(\partial _\nu \) to the restriction of \(\psi \) to \({\mathop {\Gamma }\limits ^{\circ }}_\alpha \) then take the boundary limit. (7.27), (7.28) and (7.29) imply that in the sense of distributions on \(\mathbf{C}\),
where \(L_{\partial \Gamma _\alpha }\) denotes the (Lebesgue) arc length measure supported on \(\partial \Gamma _\alpha \).
By the preceding discussion, we conclude that
Each term in the sum is a nonnegative measure of mass 1:
Before continuing, we will present two technical lemmas.
Lemma 7.2
Let \(X \Subset {\mathbf {C}}\) be an open relatively compact, simply connected domain with smooth boundary. Let \(u\in C^{\infty }({\overline{X}})\) with \(u\!\upharpoonright _{\partial X} =0\). Let \(z_0\in \partial X\) and let \({\widetilde{W}} \Subset W\Subset {\mathbf {C}}\) be two open relatively compact small complex neighborhoods of \(z_0\), so that the closure of \({\widetilde{W}}\) is contained in W. If u is harmonic in \(X\cap W\), then for any \(s\in {\mathbf {N}}\)
Here \(H^s\) are the standard Sobolev spaces.
Proof
The proof is standard, and we present it here for the reader’s convenience.
1. Let \(W_1 \Subset W\Subset {\mathbf {C}}\) be two open relatively compact small complex neighborhoods of \(z_0\), so that the closure of \(W_1\) is contained in W. Let \(\chi \in C^{\infty }_c({\mathbf {C}};[0,1])\) be so that \(\chi =1\) on \(W_1\) and \(\mathrm{supp}\chi \subset W\). Integration by parts then yields that
In the last equality, we used as well that u is harmonic in \(X\cap W\). By the Cauchy–Schwarz inequality
which implies that
Hence,
2. Since W is small, we may pass to new local coordinates y, and we can suppose that \(z_0 =0\) and that locally \(\partial X = \{y_2 = 0\}\). If \(\phi \) is a local diffeomorphism realizing this change of variables, then the Laplacian can be formally written in the new coordinates as
Here, L is an elliptic second-order differential operator, and \(\phi '\) is the Jacobian map associated with the diffeomorphism \(\phi \).
Working from now on in these new coordinates, we proceed by an induction argument: suppose that
holds for some \(s\in {\mathbf {N}}\). Here we write as well \(W,W_1\) for the respective sets in the new coordinates to ease notation. We want to show that we then also have
where \(W_2\Subset W_1\) is a slightly smaller neighborhood of \(z_0=0\), whose closure is contained inside \(W_1\).
Let \(\chi \in C^{\infty }_c({\mathbf {C}};[0,1])\) be so that \(\chi =1\) on \(W_2\) and \(\mathrm{supp}\chi \subset W_1\). Let \(\partial _{t,j}u(y): = t^{-1}(u(y+te_j) - u(y)\), where \(x\in {\mathbf {C}}\simeq {\mathbf {R}}^2\) and \(e_1, e_2\) is the standard orthonormal basis of \({\mathbf {R}}^2\). Then, by the hypothesis (7.36) applied to \(\partial _{t,j}\chi u\), for \(|t| \ll 1\), we get
uniformly in \(|t|\ll 1\). In the last inequality, we used as well that \(\chi \partial _{t,1} u \) and \([\partial _{t,1},\chi ] u = (\partial _{t,1}\chi ) u(\cdot + te_1)\) are supported in \(W_1\) for \(|t|\ll 1\). Performing the limit \(t\rightarrow 0\), we get
Thus, for \(j=1,2\), we have that
By (7.35), it follows that there exists some smooth function \(a\ne 0\), such that
where \({\widetilde{L}}\) is a second-order differential operator with smooth coefficients and which does not contain the derivative \(\partial _{y_2}^2\). Since u is harmonic in \(X\cap W\), it follows that \(L\chi u = [L,\chi ] u\). Since \([L,\chi ]\) is a differential operator of order 1, it follows from (7.40) and (7.39) that
In combination with (7.38), this yields
Thus, by choosing a decreasing sequence of nested compact neighborhoods of \(z_0\), say \({\widetilde{W}} = W_{s+1} \Subset W_{s} \dots \Subset W_0 = W\), we may iterate the estimate (7.36), which then in combination with (7.34) yields (7.33). \(\square \)
Lemma 7.3
There exists a \(C>0\) independent of \(\alpha >0\), such that for any \(1 \le j \le N+M\)
for \(z\in \partial \Gamma _\alpha \cap \mathrm {neigh}\,(\gamma \cap \partial \Omega )\), \(\lambda _j\in \Gamma _\alpha \), \(|z-\lambda _j|\ge \alpha /C\). (7.43) also holds when \(z\in \partial \Gamma _\alpha ,\) \(\lambda _j\in \Gamma _\alpha \), \(|z-\lambda _j|\ge \alpha /C\) and \((z,\lambda _j)\in (\Omega \times (\mathbf{C}\setminus \Omega ))\cup ((\mathbf{C}\setminus \Omega )\times \Omega ) \).
Proof
1. By scaling of the harmonic function \(G_{\Gamma _\alpha }(\cdot ,\lambda _j)\) by a factor \(1/\alpha \), it suffices to show that
for \((z,\lambda _j)\) as after (7.43) with the difference that z now varies in \(\Gamma _{\alpha }\) instead of \(\partial \Gamma _{\alpha }\).
To see this, recall from the construction of \(\Gamma _{\alpha }\) after (7.8) that \(\mathrm {dist}(\partial \Gamma _{\alpha },\lambda _j) \ge \alpha \) and fix a point \(z_0 \in \partial \Gamma _{\alpha }\), let \(C_1>0\) be sufficiently large so that for any \(z\in D( z_0, \alpha /C_1)\cap \Gamma _{\alpha }\) we have that \((z,\lambda _j)\) satisfies the conditions after (7.43) with z varying in \(D( z_0, \alpha /C_1)\cap \Gamma _{\alpha }\) instead of \(\partial \Gamma _{\alpha }\).
Let \(u(z) := G_{\gamma _\alpha }(\alpha z,\lambda _j)\), \(z\in \frac{1}{\alpha } \Gamma _{\alpha }\), be the scaled function, and recall Remark 7.1. Let \(\chi \in C^{\infty }_c({\mathbf {C}};[0,1])\) be so that \(\chi =1\) on \(D( z_0/\alpha , 1/(4C_1))\), \(\mathrm{supp}\chi \subset D( z_0/\alpha , 1/2C_1)=:W'\) and \(\partial ^{\beta } = {\mathcal {O}}(1)\), uniformly in \(\alpha \) for any \(\beta \in {\mathbf {N}}^2\). Moreover, put \(W=D( z_0/\alpha , 1/C_1)\).
Then, \(\chi u \in H^s(\Gamma _{\alpha }\cap W')\) for any \(s>0\). We can find an extension \(v\in H^s({\mathbf {R}}^2)\) of \(\chi u\) so that \(\Vert v\Vert _{H^s} \le {\mathcal {O}}(1) \Vert \chi u\Vert _{H^s(\Gamma _{\alpha }\cap W')}\). Using the Fourier transform, we see that for \(s>2\) and for \(z\in D( z_0/\alpha , 1/(4C_1))\)
By Lemma 7.2 and (7.44), we see that
and
which implies (7.44) after rescaling and potentially slightly increasing the constant \(C>0\).
2. We decompose \(\Gamma _\alpha \) as \(\Gamma ^{int}\cup \Gamma ^{ext}\cup \Gamma _{1,\alpha }\cup ...\cup \Gamma _{T,\alpha }\), where \(\Gamma ^{int}\) and \(\Gamma ^{ext}\) are the enlarged parts of \(\Gamma _\alpha \) with \(\Gamma ^{int}\subset \Omega \), \(\Gamma ^{ext}\subset \mathbf{C}\setminus \Omega \) and \(\Gamma _{1,\alpha },...,\Gamma _{T,\alpha }\) are the regular parts of width \(2\alpha \), corresponding to the segments of \(\gamma \), that intersect \(\partial \Omega \) transversally, see Fig. 2 for an illustration. Here, T is the number of intersections of \(\gamma \) with \(\partial \Omega \), notice that T is finite and independent of \(N,\alpha \).
For simplicity, we assume that \(\Gamma ^{int}\) and \(\Gamma ^{ext}\) are connected and that each segment \(\Gamma _{k,\alpha }\) links \(\Gamma ^{int}\) to \(\Gamma ^{ext}\) and crosses \(\partial \Omega \) once. We may think of \(\Gamma _{\alpha }\) as a graph with the vertices \(\Gamma ^{int}\), \(\Gamma ^{ext}\) and with \(\Gamma _{k,\alpha }\) as the edges.
Let first \(\lambda _j\) belong to \(\Gamma ^{int}\). We apply the first estimate in Proposition 2.2 in [12] or equivalently Proposition 12.2.2 in [13] and see that \(-G_{\Gamma _\alpha }(z,\lambda _j)\le {{\mathcal {O}}}(1)\) for \(z\in \Gamma _\alpha \), \(|z-\lambda _j|\ge 1/{{\mathcal {O}}}(1)\). Here and in the following the constants \({\mathcal {O}}(1)\) are independent of j and \(\alpha \). Furthermore, the notation \(1/{{\mathcal {O}}}(1)\) means 1/C for some sufficiently large constant \(C>0\).
Possibly, after cutting away a piece of \(\Gamma _{k,\alpha }\) and adding it to \(\Gamma ^{int}\), we may assume that \(-G_{\Gamma _\alpha }(z,\lambda _j)\le {{\mathcal {O}}}(1)\) in \(\Gamma _{k,\alpha }\). Consider one of the \(\Gamma _{k,\alpha }\) as a finite band with the two ends given by the closure of the set of \(z\in \partial \Gamma _{k,\alpha }\) with \(\mathrm {dist}\,(z,\partial \Gamma _\alpha )<\alpha \). Let \(G_{\Gamma _{k,\alpha }}\) denote the Green kernel of \(\Gamma _{k,\alpha }\). Then, the second estimate in the quoted proposition applies and we find
Let
where \(\chi \in C^\infty (\Gamma _{k,\alpha };[0,1])\) vanishes near the ends of \(\Gamma _{k,\alpha }\), is equal to 1 away from an \(\alpha \)-neighborhood of these end points and with the property that \(\nabla \chi ={{\mathcal {O}}}(1/\alpha )\), \(\nabla ^2\chi ={{\mathcal {O}}}(1/\alpha ^2)\). Then, \({{u}_\vert }_{\partial \Gamma _{k,\alpha }}=0\) and \(\Delta u={{\mathcal {O}}}(\alpha ^{-2})\) is supported in an \(\alpha \)-neighborhood of the union of the two ends and hence of uniformly bounded \(L^1\)-norm. Now we apply the second estimate in the quoted proposition to \( u=\int G_{\Gamma _{k,\alpha }}(\cdot ,y)\Delta u(y)L(dy) \) and we see that
in \(\{ x\in \Gamma _{k,\alpha };\,\mathrm {dist}\,(x,\partial \Omega \cap \Gamma _{k,\alpha })\le 1/{{\mathcal {O}}}(1) \}\). Here, we also recall that \(\lambda _j \in \gamma \Subset \mathring{\Gamma }_{\alpha }\). Varying k, we get (7.48) in \(\{ x\in \Gamma _\alpha ;\, \mathrm {dist}\,(x,\partial \Omega \cap \Gamma )\le 1/{{\mathcal {O}}}(1) \}\). Applying the maximum principle to the harmonic function \({G_{\Gamma _\alpha }(\cdot ,\lambda _j)}\!\upharpoonright _{(\mathbf{C}\setminus \Omega )\cap \Gamma _\alpha }\), we see that (7.48) holds uniformly in \((\mathbf{C}\setminus \Omega )\cap \Gamma _\alpha \).
Similarly, we have (7.48) uniformly in
when \(\lambda _j\in \Gamma ^{ext}\) and we have shown (7.44), (7.43) when \(\lambda _j\in \Gamma ^{int}\cup \Gamma ^{ext}\). Similarly, we have (7.43) when \(\lambda _j\in \gamma _{k,\alpha }\) is close to one of the ends.
It remains to treat the case when \(\lambda _j\in \gamma _{k,\alpha }\) is at distance \(\ge 1/{{\mathcal {O}}}(1)\) from the ends of \(\gamma _{k,\alpha }\). Defining \(u=\chi {G_{\Gamma _\alpha }(\cdot ,\lambda _j)}\!\upharpoonright _{\gamma _{k,\alpha }}\) as before we now have
where the first term in the right-hand side has its support in an \(\alpha \)-neighborhood of the union of the ends and is \({{\mathcal {O}}}(1)\) in \(L^1\). By the second part of the quoted proposition, we have
away from an \(\alpha \)-neighborhood of \(\mathrm {ends}\,(\gamma _{k,\alpha })\cup \{\lambda _j\}\). Here \(\mathrm {ends}\,(\gamma _{k,\alpha })\) denotes the union of the two ends of \(\gamma _{k,\alpha }\). Since u is harmonic away from \(\lambda _j\) and from \(\alpha \)-neighborhoods of the ends, we get from (7.49) that
which gives (7.43) near \(\partial \Omega \cap \gamma \). By using the maximum principle as before, we can extend the validity of (7.43) to all of \(\partial \Gamma _\alpha \setminus D(\lambda _j,\alpha /{{\mathcal {O}}}(1))\). \(\square \)
Continuing, notice that by (3.4), (7.24)
for \(\eta \subset \gamma \). Since two consecutive points of \({\widehat{S}}_{{\widetilde{N}}}\) differ by an angle of \(2\pi /{\widetilde{N}}\) and by the assumptions (1)-(4) prior to Theorem 1.1, we get that
and also
From (7.43) and (7.31), we get
Combining (7.32) and (7.43), we get when \(\mathrm {dist}\,(\lambda _j,\partial \Omega \cap \gamma )\ge 2r_0\):
We now get
Thus, (7.23) gives
with a probability as in (7.21) which is bounded from below by the probability (1.15) for \(N>1\) sufficiently large. Here and in the next formula, we view \(p_N\) and p as maps from \(S^1\) to \({\mathbf {C}}\). In the second equality, we used that by (7.51)
where we used that \(p_N \rightarrow p\) uniformly on \(S^1\) and where the measure \(L_{S^1}(d\theta )\) in the integral denotes the Lebesgue measure on \(S^{1}\).
Theorem 1.1 follows by taking \(\alpha >0\) in (7.54) arbitrarily small and \(N>1\) sufficiently large.
8 Convergence of the empirical measure
In this section, we present a proof of Theorem 1.2 following the strategy of [15, Section 7.3]. An alternative, and perhaps more direct way, to conclude the weak convergence of the empirical measure from a counting theorem as Theorem 1.2, is presented in [15, Section 7.1].
Recall the definition of the empirical measure \(\xi _N\) (1.20). By (1.21), (1.5) combined with a Borel Cantelli argument, it follows that almost surely
for N sufficiently large. For p as in (1.4), put
which has compact support,
Here, \(\frac{1}{2\pi } L_{S^1}\) denotes the normalized Lebesgue measure on \(S^1\).
We recall [15, Theorem 7.1]:
Theorem 8.1
Let \(K,K'\Subset {\mathbf {C}}\) be open relatively compact sets with \({\overline{K}}\subset K'\), and let \(\{\mu _n\}_{n\in {\mathbf {N}}} \in {\mathcal {P}}({\mathbf {C}})\) be as sequence of random measures so that almost surely
Suppose that for a.e. \(z\in K'\) almost surely
where \(\mu \in {\mathcal {P}}({\mathbf {C}})\) is some probability measure with \(\mathrm{supp}\mu \subset K\). Then, almost surely,
This theorem is a modification of a classical result which allows to deduce the weak convergence of measures from the point-wise convergence of the associated Logarithmic potentials, see for instance [17, Theorem 2.8.3] or [1].
In view of Theorem 8.1, it remains to show that for almost every \(z\in K'\) we have that \(U_{\xi _N}(z) \rightarrow U_{\xi }(z)\) almost surely, where
For \(z\notin \sigma ( P_N^{\delta })\)
For any \(z\in {\mathbf {C}}\) the set \(\Sigma _z = \{ Q \in {\mathbf {C}}^{N\times N}; \det (P_N +\delta Q -z) =0\}\) has Lebesgue measure 0, since \({\mathbf {C}}^{N\times N} \ni Q \mapsto \det (P_N^{\delta } -z)\) is analytic and not constantly 0. Thus, \(\mu _N(\Sigma _z)=0\), where \(\mu _N\) is the Gaussian measure given in after (1.11), and for every \(z\in {\mathbf {C}}\) (8.4) holds almost surely.
Let \(\delta \) satisfy (1.13) for some fixed \(\delta _0\in ]0,1[\) and \(\delta _1>3\). Pick a \(\varepsilon _0 \in ]\delta _0,1[\). Let \(z\in K'\backslash p(S^1)\). Recall (4.17). For \(\alpha >0\) sufficiently small, we have that \(z\in K'\backslash \gamma _{\alpha }\).
Put \(t=N^{\varepsilon _0}\) as in (7.13), which together with (7.14) implies (7.8) when \(N\gg 1\). Since (1.13) implies (7.14), it follows by combining (7.14), (7.5), (7.6) and (7.7) that
with probability \(\ge 1 - \mathrm {e}^{-N^2}- \mathrm {e}^{-N^{\varepsilon _0/4}}\). Here, \(\phi (z) := N^{-1}\ln |\det (p_N(\tau )-z)|\), since \(z\notin \gamma _\alpha \).
Using a Riemann sum argument and the fact that \(p_N \rightarrow p\) uniformly on \(S^1\), we have that
Thus, by (8.5), (8.6), we have for any \(z\in K'\backslash p(S^1)\) that
with probability \(\ge 1 - \mathrm {e}^{-N^2}- \mathrm {e}^{-N^{\varepsilon _0/4}}\). By the Borel–Cantelli theorem, if follows that for every \(z\in K'\backslash p(S^1)\)
Notes
Here \({{\mathcal {N}}}(A)\) and \({{\mathcal {R}}}(A)\) denote the null space and the range of a linear operator A.
References
Bordenave, C., Chafaï, D.: Lecture notes on the circular law. In: Vu, V.H. (ed.) Modern Aspects of Random Matrix Theory, vol. 72, pp. 1–34. American Mathematical Society, London (2013)
Basak, A., Paquette, E., Zeitouni, O.: Regularization of non-normal matrices by gaussian noise-the banded toeplitz and twisted toeplitz cases. Forum Math. Sigma 7, e3 (2019)
Basak, A., Paquette, E., Zeitouni, O.: Spectrum of random perturbations of toeplitz matrices with finite symbols. Trans. Am. Math. Soc. 373(7), 4999–5023 (2020)
Böttcher, A., Silbermann, B.: Introduction to Large Truncated Toeplitz Matrices. Springer, Berlin (1999)
Davies, E.B.: Non-self-adjoint operators and pseudospectra, volume 76 of Proceedings of Symposia in Pure Mathematics, AMS (2007)
Davies, E.B., Hager, M.: Perturbations of Jordan matrices. J. Approx. Theory 156(1), 82–94 (2009)
Dimassi, M., Sjöstrand, J.: Spectral Asymptotics in the Semi-Classical Limit. Cambridge University Press, Cambridge (1999)
Embree, M., Trefethen, L.N.: Spectra and Pseudospectra: The Behavior of Nonnormal Matrices and Operators. Princeton University Press, Princeton (2005)
Guionnet, A., Matchett Wood, P., Zeitouni, O.: Convergence of the spectral measure of non-normal matrices. Proc. AMS 142(2), 667–679 (2014)
Hager, M., Sjöstrand, J.: Eigenvalue asymptotics for randomly perturbed non-selfadjoint operators. Math. Annal. 342, 177–243 (2008)
Kallenberg, O.: Foundations of Modern Probability. Probability and its Applications. Springer, Berlin (1997)
Sjöstrand, J.: Counting zeros of holomorphic functions of exponential growth. J. Pseudodiffer. Oper. Appl. 1(1), 75–100 (2010)
Sjöstrand, J.: Non-Self-Adjoint Differential Operators, Spectral Asymptotics and Random Perturbations Pseudo-Differential Operators Theory and Applications, vol. 14. Birkhäuser, Basel (2019)
Sjöstrand, J., Vogel, M.: Large bi-diagonal matrices and random perturbations. J. Spectral Theory 6(4), 977–1020 (2016)
Sjöstrand, J., Vogel, M.: Toeplitz band matrices with small random perturbations. Indagationes Mathematicae (2020). https://doi.org/10.1016/j.indag.2020.09.001
Sjöstrand, J., Zworski, M.: Elementary linear algebra for advanced spectral problems. Ann. l’Inst. Fourier 57, 2095–2141 (2007)
Tao, T.: Topics in Random Matrix Theory. Graduate Studies in Mathematics, vol. 132. American Mathematical Society, London (2012)
Tao, T., Vu, V., Krishnapur, M.: Random matrices: universality of esds and the circular law. Ann. Probab. 38(5), 2023–2065 (2010)
Vogel, M.: The precise shape of the eigenvalue intensity for a class of Non-Self-Adjoint operators under random perturbations. Ann. Henri Poincaré 18, 435–517 (2017)
Wood, P.M.: Universality of the esd for a fixed matrix plus small random noise: a stability approach. Ann. l’Inst. Henri Poinc. Probab. Stat. 52(4), 1877–1896 (2016)
Acknowledgements
The first author acknowledges support from the 2018 S. Bergman award. The second author was supported by a CNRS Momentum fellowship. We are grateful to Ofer Zeitouni for his interest and a remark which lead to a better presentation of this paper. We are grateful to the referee for pointing out a mistake affecting the range of the exponent \(\delta _1\).
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Jan Derezinski.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Sjöstrand, J., Vogel, M. General Toeplitz Matrices Subject to Gaussian Perturbations. Ann. Henri Poincaré 22, 49–81 (2021). https://doi.org/10.1007/s00023-020-00970-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00023-020-00970-w