1 Introduction

It is known that adding a finite rank perturbation to a large matrix barely changes the global behavior of its spectrum. Nevertheless, some of the eigenvalues, called outliers, can deviate away from the bulk of the spectrum, depending on the strength of the perturbation. This phenomenon, well known as the BBP transition, was first brought to light for empirical covariance matrices by Johnstone in [23] and by Baik et al. in [3], and was then established under various hypotheses in the Hermitian case in [6-9, 13-15, 17, 24, 25, 29, 30]. Non-Hermitian models have also been studied: i.i.d. matrices in [12, 27, 32], elliptic matrices in [28], and matrices from the Single Ring Theorem in [10]. In [10], and more recently in [27], the authors also studied the fluctuations of the outliers and, due to the non-Hermitian structure, obtained unusual results: the distribution of the fluctuations depends heavily on the shape of the Jordan canonical form of the perturbation; in particular, the convergence rate depends on the size of the Jordan blocks. Also, the outliers tend to locate around their limit at the vertices of a regular polygon. Finally, they observed correlations between the fluctuations of outliers at a macroscopic distance from each other.

In this paper, we show that the same kind of phenomenon occurs when we perturb an Hermitian matrix \(\mathbf{H}\) with a non-Hermitian one \(\mathbf{A}\). More precisely, we study finite rank perturbations of Hermitian random matrices \(\mathbf{H}\) whose spectral measure tends to a compactly supported measure \(\mu \), the perturbation \(\mathbf{A}\) being simply a complex matrix of finite rank. Under further assumptions, we prove that outliers of \(\mathbf{H}+\mathbf{A}\) may appear at a macroscopic distance from the bulk and, following the ideas of [10], we show that they fluctuate with convergence rates which depend on the matrix \(\mathbf{A}\) through its Jordan canonical form. Recall that any complex matrix is similar to a block diagonal matrix with diagonal blocks of the type

$$\begin{aligned} \mathbf{R}_p(\theta ) \ := \ \begin{pmatrix} \theta &{} 1 &{} &{} \\ &{} \theta &{} \ddots &{} \\ &{} &{} \ddots &{} 1 \\ &{} &{} &{} \theta \end{pmatrix} \qquad (p \times p), \end{aligned}$$

so that \(\mathbf{A}\sim {{\mathrm{\text {diag}}}}\big (\mathbf{R}_{p_1}(\theta _1),\ldots ,\mathbf{R}_{p_q}(\theta _q) \big )\), this last matrix being called the Jordan Canonical Form of \(\mathbf{A}\) [22, Chapter 3]. We show, under suitable hypotheses, that for any eigenvalue \(\theta \) of \(\mathbf{A}\), if we denote by

$$\begin{aligned} \underbrace{p_1,\ldots ,p_1}_{\beta _1 \text { times}} > \underbrace{p_2, \ldots , p_2}_{\beta _2 \text { times}} > \cdots > \underbrace{p_\alpha , \ldots , p_\alpha }_{\beta _\alpha \text { times}} \end{aligned}$$

the sizes of the blocks associated with \(\theta \) in the Jordan Canonical Form of \(\mathbf{A}\) and introduce the (possibly empty) set

$$\begin{aligned} \mathscr {S}_\theta \ : = \ \left\{ \xi \in {{\mathrm{\mathbb {C}}}}, \ G_\mu (\xi ) = \frac{1}{\theta }\right\} \end{aligned}$$

where \(G_\mu (z) := \displaystyle \int \frac{1}{z-x}\mu (\mathrm{d}x)\) is the Cauchy transform of the measure \(\mu \), then there are exactly \(\beta _1 p_1 + \cdots + \beta _\alpha p_\alpha \) outliers of \(\mathbf{H}+\mathbf{A}\) tending to each element of \(\mathscr {S}_\theta \). We also prove that for each element \(\xi \) of \(\mathscr {S}_\theta \), there are exactly \(\beta _1 p_1\) outliers tending to \(\xi \) at rate \(N^{-1/(2p_1)}\), \(\beta _2 p_2\) outliers tending to \(\xi \) at rate \(N^{-1/(2p_2)}\), etc. (see Fig. 2). Furthermore, the limit joint distribution of the fluctuations is explicit, not necessarily Gaussian, and may exhibit correlations even between outliers at a macroscopic distance from each other. This phenomenon of correlations between the fluctuations of two outliers with distinct limits had already been proved for non-Gaussian Wigner matrices when \(\mathbf{A}\) is Hermitian (see [25]), while in our case, even Gaussian Wigner matrices can have such correlated outliers: indeed, the correlations that we bring to light here are due to the fact that the eigenspaces of \(\mathbf{A}\) are not necessarily orthogonal, or to the fact that one single spike generates several outliers. In particular, we observe that the outliers may outnumber the rank of \(\mathbf{A}\). This had already been noticed in [8, Remark 2.11] when the support of the limit spectral measure of \(\mathbf{H}\) has some “holes,” and in the different model of [5], where the authors study the case where \(\mathbf{A}\) is Hermitian with full rank and invariant in distribution under unitary conjugation. Here, the phenomenon can be proved to occur even when the support of the limit spectral measure of \(\mathbf{H}\) is connected. Finally, if we apply our results in the particular case where \(\mathbf{A}\) is Hermitian, we also see that two outliers at a macroscopic distance from each other are correlated if they are both generated by the same spike (which can occur only if the limit support is disconnected) and are independent otherwise (see Fig. 3). From this point of view, this completes the work of [6], where the fluctuations of outliers lying in “holes” of the limit support had not been studied.

Non-Hermitian deformations of Hermitian random matrices have already been considered in theoretical physics (see [18-21]) in the particular case where \(\mathbf{H}\) is a GOE/GUE matrix and \(\mathbf{A}\) is a nonnegative Hermitian matrix multiplied by \({\text {i}}\) (the square root of \(-1\)). These works prove a weaker version of Theorem 2.3 in this specific case but do not study the fluctuations.

The proofs of this paper rely essentially on the ideas of the paper [10] about outliers in the Single Ring Theorem and on the results proved in [6, 30, 31]. More precisely, the study of the fluctuations follows the outlines of the proofs of [10] as long as the model fulfills some conditions. Thanks to [30, 31], we show that these conditions are satisfied for Wigner matrices. Finally, using [6] and the Weingarten calculus, we show the same for Hermitian matrices invariant in distribution under unitary conjugation. In the appendix, as a tool for the study of the outliers, we prove a result on the fluctuations of the entries of such matrices.

2 General Results

We first formulate the results in a general setting; in the next section, we shall give examples of random matrices to which these results apply.

2.1 Convergence of the Outliers

2.1.1 Setup and Assumptions

For all \(N \ge 1\), let \(\mathbf{H}_N\) be an Hermitian random \(N \times N\) matrix whose empirical spectral measure, as N goes to infinity, converges weakly in probability to a compactly supported measure \(\mu \)

$$\begin{aligned} \mu _N : = \frac{1}{N} \sum _{i=1}^N \delta _{\lambda _i(\mathbf{H}_N)} \ \longrightarrow \ \mu . \end{aligned}$$
(1)

We shall suppose that \(\mu \) is non-trivial in the sense that \(\mu \) is not a single Dirac measure. Also, we suppose that \(\mathbf{H}_N\) does not possess any natural outliers, i.e.,

Assumption 2.1

As N goes to infinity, with probability tending to one,

$$\begin{aligned} \sup _{\lambda \in {\text {Spec}}(\mathbf{H}_N)} {\text {dist}}(\lambda ,{\text {supp}}(\mu ))\longrightarrow & {} 0. \end{aligned}$$

For all \(N \ge 1\), let \(\mathbf{A}_N\) be an \(N \times N\) random matrix independent from \(\mathbf{H}_N\) (which does not necessarily satisfy \(\mathbf{A}_N^* = \mathbf{A}_N\)) whose rank is bounded by an integer r (independent of N). We know that we can write

$$\begin{aligned} \mathbf{A}_N \ = \ \mathbf{U}\begin{pmatrix} \mathbf{A}_0 &{} 0 \\ 0 &{} 0 \end{pmatrix}\mathbf{U}^*, \end{aligned}$$
(2)

where \(\mathbf{U}\) is an \(N\times N\) unitary matrix and \(\mathbf{A}_0\) is a \(2r \times 2r\) matrix. We notice that \(\mathbf{A}_N\) depends only on the first 2r columns of \(\mathbf{U}\), so that we shall write

$$\begin{aligned} \mathbf{A}_N \ := \ \mathbf{U}_{2r} \mathbf{A}_0 \mathbf{U}_{2r}^*, \end{aligned}$$

where the \(N\times 2r\) matrix \(\mathbf{U}_{2r}\) consists of the first 2r columns of \(\mathbf{U}\). We shall assume that \(\mathbf{A}_0\) is deterministic and independent of N. We shall denote by \(\theta _1,\ldots ,\theta _j\) the distinct nonzero eigenvalues of \(\mathbf{A}_0\) and by \(k_1,\ldots ,k_j\) their respective multiplicities (note that \(\sum _{i=1}^jk_i \le r\)).

We consider the additive perturbation

$$\begin{aligned} \widetilde{\mathbf{H}}_N \ := \ \mathbf{H}_N + \mathbf{A}_N. \end{aligned}$$
(3)

We set

$$\begin{aligned} G_\mu (z):= & {} \int \frac{1}{z-x}\mu (\mathrm{d}x) . \end{aligned}$$
(4)

This \(G_\mu \) is the Cauchy transform of the measure \(\mu \). We introduce, for all \(i \in \{1,\ldots ,j\}\), the finite, possibly empty, set

$$\begin{aligned} \mathscr {S}_{\theta _i} \ := \ \left\{ \xi \in {{\mathrm{\mathbb {C}}}}\backslash {\text {supp}}(\mu ), \ G_\mu (\xi ) = \frac{1}{\theta _i} \right\} , \ \text { and } \ m_i \ := \ {\text {Card}}\ \mathscr {S}_{\theta _i}. \end{aligned}$$
(5)
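For instance (a toy example, added for illustration): for \(\mu = \frac{1}{2}(\delta _{-a}+\delta _{a})\) with \(a>0\), one has \(G_\mu (z) = \frac{z}{z^2-a^2}\), so that for any real \(\theta \ne 0\),

$$\begin{aligned} G_\mu (\xi ) = \frac{1}{\theta } \ \Longleftrightarrow \ \xi ^2 - \theta \xi - a^2 = 0 \ \Longleftrightarrow \ \xi = \frac{\theta \pm \sqrt{\theta ^2+4a^2}}{2}, \end{aligned}$$

and both solutions lie outside \({\text {supp}}(\mu ) = \{-a,a\}\), so that \(m_i = 2\): a single spike generates one outlier beyond the atoms and a second one in the gap between them.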

We make the following assumption.

Assumption 2.2

For any \(\delta >0\), as N goes to infinity, we have

$$\begin{aligned} \sup _{{\text {dist}}(z,{\text {supp}}(\mu ))>\delta }\left\| \mathbf{U}_{2r}^* \left( z\mathbf{I}- \mathbf{H}_N \right) ^{-1} \mathbf{U}_{2r} - G_\mu (z) \mathbf{I} \right\| _{{{\mathrm{op}}}} \ {{\mathrm{\overset{(\mathbb {P})}{\longrightarrow }}}}\ 0. \end{aligned}$$

2.1.2 Result

Theorem 2.3

(Convergence of the outliers) For \(\theta _1,\ldots ,\theta _j\), \(k_1,\ldots , k_j\), \(\mathscr {S}_{\theta _1},\ldots ,\mathscr {S}_{\theta _j}\), and \(m_1,\ldots ,m_j\) as defined above, with probability tending to one, \(\widetilde{\mathbf{H}}_N := \mathbf{H}_N + \mathbf{A}_N\) possesses exactly \(\displaystyle \sum \nolimits _{i=1}^j k_i m_i\) eigenvalues at a macroscopic distance from \({\text {supp}}(\mu )\) (outliers). More precisely, for all small enough \(\delta >0\), for all large enough N, for all \(i \in \{1,\ldots ,j\}\), if we set

$$\begin{aligned} \mathscr {S}_{\theta _i} \ = \ \{\xi _{i,1},\ldots ,\xi _{i,m_i}\}, \end{aligned}$$

there are \(m_i\) eigenvalues \(\widetilde{\lambda }_{i,1},\ldots ,\widetilde{\lambda }_{i,m_i}\) of \(\widetilde{\mathbf{H}}_N\) in \(\{z, \ {\text {dist}}(z,{\text {supp}}(\mu ))>\delta \}\) satisfying

$$\begin{aligned} \widetilde{\lambda }_{i,n}= & {} \xi _{i,n} + o \left( 1 \right) , \ \ \ \text { for all } n \in \{1,\ldots ,m_i\}, \end{aligned}$$

after a proper labeling.

Remark 2.4

If all the \(\mathscr {S}_{\theta _i}\)'s are empty, there is possibly no outlier at all. This condition is the analogue of the phase transition condition of [8, Theorem 2.1] in the case where the \(\theta _i\)'s are real, namely: if

$$\begin{aligned} \frac{1}{\theta _i} \not \in \left] \lim _{x \rightarrow a^-}G_{\mu }(x) , \lim _{x \rightarrow b^+}G_{\mu }(x) \right[ \end{aligned}$$

where a (respectively, b) designates the infimum (respectively, the supremum) of the support of \(\mu \), then \(\theta _i\) does not generate any outlier. In our case, if \(|\theta _i|\) is large enough, \(\mathscr {S}_{\theta _i}\) is necessarily non-empty, which means that a strong enough perturbation always creates outliers.

Remark 2.5

We notice that the outliers can outnumber the rank of \(\mathbf{A}\). This phenomenon was already observed in [8] in the case where the limit spectral distribution has a disconnected support (see also [5]). In our case, the phenomenon occurs even for a connected support (see Fig. 1).

Fig. 1

Spectra of two Hermitian matrices with the same limit bulk but different limit spectral densities on this bulk, perturbed by the same matrix: the two do not have the same number of outliers (the blue crosses “\(+\)”). a Spectrum of an Hermitian matrix of size \(N=2000\) whose spectral measure tends to the semicircle law \(\mu _{\mathrm{sc}}(\mathrm{d}x) := \frac{1}{2\pi }\sqrt{4-x^2}\mathbbm {1}{[-2,2]}(\mathrm{d}x)\) (such as a Wigner matrix), with perturbation matrix \(\mathbf{A}= {{\mathrm{\text {diag}}}}(i\sqrt{2},0,\ldots ,0)\). b Spectrum of an Hermitian matrix of size \(N=2000\) whose spectral measure tends to \(\frac{2}{5}\delta _{-1}(\mathrm{d}x)+\frac{2}{5}\delta _1(\mathrm{d}x) + \frac{1}{5}\mu _{\mathrm{sc}}(\mathrm{d}x)\), with the same perturbation matrix \(\mathbf{A}= {{\mathrm{\text {diag}}}}(i\sqrt{2},0,\ldots ,0)\)
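The setup of Fig. 1 is straightforward to reproduce numerically. Here is a minimal sketch (ours, not from the paper; the random seed and the outlier filter threshold are arbitrary choices), assuming NumPy; the rank-one perturbation is put in generic position via a random unit vector:

```python
import numpy as np

rng = np.random.default_rng(0)
N = 2000

def wigner(n):
    # Hermitian Wigner matrix whose spectral measure tends to the
    # semicircle law on [-2, 2] (sigma = 1)
    G = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return (G + G.conj().T) / (2 * np.sqrt(n))

# rank-one perturbation with single nonzero eigenvalue theta = i*sqrt(2),
# in generic position with respect to the eigenbasis of H
u = rng.normal(size=N) + 1j * rng.normal(size=N)
u /= np.linalg.norm(u)
A = 1j * np.sqrt(2) * np.outer(u, u.conj())

# (a) semicircle bulk; (b) (2/5) delta_{-1} + (2/5) delta_1 + (1/5) semicircle
k = N // 5
Ha = wigner(N)
Hb = np.diag(np.concatenate([-np.ones(2 * k), np.ones(2 * k),
                             np.linalg.eigvalsh(wigner(k))])).astype(complex)

for name, H in (("(a)", Ha), ("(b)", Hb)):
    eig = np.linalg.eigvals(H + A)
    print(name, np.round(eig[np.abs(eig.imag) > 0.2], 3))  # crude outlier filter
```

Panel (a) should display the single outlier \(i/\sqrt{2}\) predicted by Theorem 3.2 below, while panel (b) displays more outliers than the rank of \(\mathbf{A}\).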

2.2 Fluctuations of the Outliers

To study the fluctuations, one needs to understand the limit distribution of

$$\begin{aligned} \sqrt{N} \left\| \mathbf{U}_{2r}^* \left( z\mathbf{I}- \mathbf{H}_N \right) ^{-1} \mathbf{U}_{2r} - G_\mu (z) \mathbf{I} \right\| _{{{\mathrm{op}}}}. \end{aligned}$$
(6)

In the particular case where \(\mathbf{H}_N\) is a Wigner matrix, we know from [30] that this quantity is tight but does not necessarily converge. Hence, we shall need additional assumptions.

2.2.1 Setup and Assumptions

As \(\mathbf{A}_N\) is not Hermitian, we need to introduce the Jordan Canonical Form (JCF) to describe the fluctuations. More precisely, we shall consider the JCF of \(\mathbf{A}_0\) which does not depend on N. We know that, in a proper basis, \(\mathbf{A}_0\) is a direct sum of Jordan blocks, i.e., blocks of the form

$$\begin{aligned} \mathbf{R}_p(\theta ) \ := \ \begin{pmatrix} \theta &{} 1 &{} &{} \\ &{} \theta &{} \ddots &{} \\ &{} &{} \ddots &{} 1 \\ &{} &{} &{} \theta \end{pmatrix} \qquad (p \times p). \end{aligned}$$
(7)

Let us denote by \(\theta _1, \ldots , \theta _q\) the distinct eigenvalues of \(\mathbf{A}_0 \) such that \(\mathscr {S}_{\theta _i} \ne \emptyset \) (see (5) for the definition of \(\mathscr {S}_\theta \)). For each \(i=1, \ldots , q\), we introduce a positive integer \(\alpha _i\), some positive integers \(p_{i,1}> \cdots > p_{i,\alpha _i}\) corresponding to the distinct sizes of the blocks relative to the eigenvalue \(\theta _i\), and \(\beta _{i,1}, \ldots , \beta _{i, \alpha _i}\) such that for all j, \(\mathbf{R}_{p_{i,j}}(\theta _i)\) appears \(\beta _{i,j}\) times, so that, for a certain \(2r \times 2r\) non-singular matrix \(\mathbf{Q}\), we have:

$$\begin{aligned} \mathbf{J}= & {} \mathbf{Q}^{-1}\mathbf{A}_0\mathbf{Q}\ = \ \hat{\mathbf{A}}\ \bigoplus \ \bigoplus _{i=1}^q \ \bigoplus _{j=1}^{\alpha _i} \! \underbrace{\begin{pmatrix}\mathbf{R}_{p_{i,j}}(\theta _i)&{}&{}\\ &{}\ddots &{}\\ &{}&{}\mathbf{R}_{p_{i,j}}(\theta _i)\end{pmatrix}}_{ \beta _{{i,j}} \text { blocks}} \end{aligned}$$
(8)

where \(\oplus \) is defined, for square block matrices, by \(\mathbf{M}\oplus \mathbf{N}:= \begin{pmatrix} \mathbf{M}&{} 0 \\ 0 &{} \mathbf{N}\end{pmatrix}\), and \(\hat{\mathbf{A}}\) gathers the Jordan blocks associated with the eigenvalues \(\theta \) such that \(\mathscr {S}_\theta = \emptyset \) or \(\theta = 0\).
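As an illustration of this decomposition (ours, not from the original text), the JCF can be computed symbolically; a minimal sketch assuming SymPy:

```python
import sympy as sp

# a 3 x 3 matrix with the single eigenvalue theta = 2, whose JCF is
# the direct sum of one block R_2(2) and one block R_1(2)
A0 = sp.Matrix([[2, 1, 0],
                [0, 2, 0],
                [0, 0, 2]])
Q, J = A0.jordan_form()   # A0 = Q * J * Q**(-1)
sp.pprint(J)
```

In the notation above, this example has \(q = 1\), \(\alpha _1 = 2\), \(p_{1,1} = 2 > p_{1,2} = 1\) and \(\beta _{1,1} = \beta _{1,2} = 1\).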

The asymptotic orders of the fluctuations of the eigenvalues of \(\mathbf{H}_N+\mathbf{A}_N \) depend on the sizes \(p_{i,j}\) of the blocks. Actually, for each \(\theta _i\) and each \(\xi _{i,n} \in \mathscr {S}_{\theta _i} = \{\xi _{i,1},\ldots ,\xi _{i,m_i}\}\), we know, by Theorem 2.3, that there are \(\sum _{j=1}^{\alpha _i}p_{i,j}\times \beta _{i,j}\) eigenvalues \({\widetilde{\lambda }}\) of \(\mathbf{H}_N+\mathbf{A}_N\) which tend to \(\xi _{i,n}\): we shall write them with a \(\xi _{i,n}\) on the top left corner, as follows:

$$\begin{aligned} {\phantom {{\widetilde{\lambda }}}}^{\xi _{i,n}}{\widetilde{\lambda }}. \end{aligned}$$

Theorem 2.10 below will state that for each block with size \(p_{i,j}\) corresponding to \(\theta _i\) in the JCF of \(\mathbf{A}_0\), there are \(p_{i,j}\) eigenvalues (we shall write them with \(p_{i,j}\) on the bottom left corner: \({\phantom {{\widetilde{\lambda }}}}^{\;\;\xi _{i,n}}_{p_{i,j}}{\widetilde{\lambda }}\)) whose convergence rate will be \(N^{-1/(2p_{i,j})}\). As there are \(\beta _{{i,j}}\) blocks of size \(p_{i,j}\), there are actually \(p_{i,j}\times \beta _{{i,j}}\) eigenvalues tending to \(\xi _{i,n}\) with convergence rate \(N^{-1/(2p_{i,j})}\) (we shall write them \({\phantom {{\widetilde{\lambda }}_{s,t}}}^{\;\;\xi _{i,n}}_{p_{i,j}}{\widetilde{\lambda }}_{s,t}\) with \(s \in \{1,\ldots ,p_{i,j}\}\) and \(t \in \{1,\ldots ,\beta _{{i,j}}\}\)). It will be convenient to denote by \(\Lambda _{i,j,n}\) the vector with size \(p_{i,j}\times \beta _{{i,j}}\) defined by

$$\begin{aligned} \Lambda _{i,j,n} \ := \ \displaystyle \left( N^{1/(2p_{i,j})} \cdot \Big ({\phantom { {\widetilde{\lambda }}_{s,t}}}^{\;\;\xi _{i,n}}_{p_{i,j}} {\widetilde{\lambda }}_{s,t} - \xi _{i,n} \Big ) \right) _{\begin{array}{c} 1\le s\le p_{i,j}\\ 1\le t\le \beta _{i,j} \end{array}}. \end{aligned}$$
(9)

In addition, we make an assumption on the convergence of (6).

Assumption 2.6

(1):

The vector \(\displaystyle \left( \sqrt{N}\mathbf{U}_{2r}^*\left( \left( \xi _{i,n}-\mathbf{H}_N \right) ^{-1} - \frac{1}{\theta _i}\right) \mathbf{U}_{2r}\right) _{\begin{array}{c} 1 \le i\le q \\ 1 \le n \le m_i \end{array}}\) converges in distribution, and none of its entries tends to zero.

(2):

For all \(k \ge 1\), all \(i\in \{1,\ldots ,q\}\), and all \(n \in \{1,\ldots ,m_i\}\),

$$\begin{aligned} \sqrt{N}\mathbf{U}_{2r}^* \left( \big (\xi _{i,n}-\mathbf{H}_N\big )^{-(k+1)} - \int \frac{\mu (\mathrm{d}x)}{(\xi _{i,n}-x)^{k+1}} \right) \mathbf{U}_{2r} \end{aligned}$$

is tight.

or

(0’):

For all \(i\in \{1,\ldots ,q\}\) and all \(n\in \{1,\ldots ,m_i\}\), as N goes to infinity,

$$\begin{aligned} \displaystyle \sqrt{N}\left( \frac{1}{N} {\text {Tr}} \left( \xi _{i,n}-\mathbf{H}_N \right) ^{-1} - \frac{1}{\theta _i}\right) \longrightarrow 0. \end{aligned}$$
(1’):

The vector \(\displaystyle \left( \sqrt{N}\mathbf{U}_{2r}^*\left( \left( \xi _{i,n}-\mathbf{H}_N \right) ^{-1} - \frac{1}{N} {\text {Tr}}\big (\xi _{i,n}-\mathbf{H}_N\big )^{-1}\right) \mathbf{U}_{2r}\right) _{\begin{array}{c} 1 \le i\le q \\ 1 \le n \le m_i \end{array}}\) converges in distribution, and none of its entries tends to zero.

(2’):

For all \(k \ge 1\), all \(i\in \{1,\ldots ,q\}\), and all \(n \in \{1,\ldots ,m_i\}\),

$$\begin{aligned} \sqrt{N}\mathbf{U}_{2r}^* \left( \big (\xi _{i,n}-\mathbf{H}_N\big )^{-(k+1)} - \frac{1}{N} {\text {Tr}}\big (\xi _{i,n}-\mathbf{H}_N\big )^{-(k+1)}\right) \mathbf{U}_{2r} \end{aligned}$$

is tight.

As in [10], we now define the family of random matrices that we shall use to characterize the limit distribution of the \(\Lambda _{i,j,n}\)'s. For each \(i=1, \ldots , q\), let \(I(\theta _i)\) (respectively, \(J(\theta _i)\)) denote the set, with cardinality \(\sum _{j=1}^{\alpha _i}\beta _{i,j}\), of indices in \(\{1, \ldots , r\}\) corresponding to the first (respectively, last) columns of the blocks \(\mathbf{R}_{p_{i,j}}(\theta _i)\) (\(1\le j\le \alpha _i\)) in (8).

Remark 2.7

Note that the columns of \(\mathbf{Q}\) (respectively, of \((\mathbf{Q}^{-1})^*\)) whose index belongs to \(I(\theta _i)\) (respectively, \(J(\theta _i)\)) are eigenvectors of \(\mathbf{A}_0\) (respectively, of \(\mathbf{A}_0^*\)) associated with \(\theta _i\) (respectively, \(\overline{\theta _i}\)). See [10, Remark 2.7].

Now, let

$$\begin{aligned} \left( {m}^{\theta _i,n}_{k,\ell }\right) _{\begin{array}{c} 1 \le i \le q \\ 1 \le n \le m_i \\ (k,\ell )\in J(\theta _i)\times I(\theta _i) \end{array}} \end{aligned}$$
(10)

be the multivariate random variable defined as the limit joint distribution of

$$\begin{aligned}&\left( \sqrt{N}\mathbf{e}_k^* \mathbf{Q}^{-1}\mathbf{U}_{2r}^*\left( \left( \xi _{i,n}-\mathbf{H}_N \right) ^{-1} - \frac{1}{\theta _i}\right) \mathbf{U}_{2r}\mathbf{Q}\mathbf{e}_\ell \right) _{\begin{array}{c} 1 \le i \le q \\ 1 \le n \le m_i \\ (k,\ell )\in J(\theta _i)\times I(\theta _i) \end{array}} \nonumber \\&\quad \underset{\text {jointly}}{{{\mathrm{\overset{(d)}{\longrightarrow }}}}} \ \left( {m}^{\theta _i,n}_{k,\ell }\right) _{\begin{array}{c} 1 \le i \le q \\ 1 \le n \le m_i \\ (k,\ell )\in J(\theta _i)\times I(\theta _i) \end{array}} \end{aligned}$$
(11)

(which does exist by Assumption 2.6), where \(\mathbf{e}_1, \ldots , \mathbf{e}_r\) are the column vectors of the canonical basis of \({{\mathrm{\mathbb {C}}}}^r\).

For each i, j, let K(i,j) (respectively, \(K(i,j)^-\)) be the set, with cardinality \(\beta _{i,j}\) (respectively, \(\sum _{j'=1}^{j-1}\beta _{i,j'}\)), of indices in \(J(\theta _i)\) corresponding to a block of the type \(\mathbf{R}_{p_{i,j}}(\theta _i)\) (respectively, to a block of the type \(\mathbf{R}_{p_{i,j'}}(\theta _i)\) for \(j'<j\)). In the same way, let L(i,j) (respectively, \(L(i,j)^-\)) be the set, with the same cardinality as K(i,j) (respectively, as \(K(i,j)^-\)), of indices in \(I(\theta _i)\) corresponding to a block of the type \(\mathbf{R}_{p_{i,j}}(\theta _i)\) (respectively, to a block of the type \(\mathbf{R}_{p_{i,j'}}(\theta _i)\) for \(j'<j\)). Note that \(K(i,j)^-\) and \(L(i,j)^-\) are empty if \(j=1\). For each \(n \in \{1,\ldots ,m_i\}\), let us define the random matrices

$$\begin{aligned} {\text {M}}^{\theta _i,\mathrm{I}}_{j,n}\ := \ [m^{\theta _i,n}_{k,\ell }]_{\displaystyle ^{k\in K(i,j)^-}_{\ell \in L(i,j)^-}}&\qquad \qquad \qquad&{\text {M}}^{\theta _i,\mathrm{I}\mathrm{I}}_{j,n}\ := \ [m^{\theta _i,n}_{k,\ell }]_{\displaystyle ^{k\in K(i,j)^-}_{\ell \in L(i,j)}} \nonumber \\ {\text {M}}^{\theta _i,\mathrm{I}\mathrm{I}\mathrm{I}}_{j,n}\ := \ [m^{\theta _i,n}_{k,\ell }]_{\displaystyle ^{k\in K(i,j)}_{\ell \in L(i,j)^-}}&\qquad \qquad \qquad&{\text {M}}^{\theta _i,\mathrm{I}\mathrm{V}}_{j,n}\ := \ [m^{\theta _i,n}_{k,\ell }]_{\displaystyle ^{k\in K(i,j)}_{\ell \in L(i,j)}} \end{aligned}$$
(12)

and then let us define the \(\beta _{i,j}\times \beta _{i,j}\) matrix \({\mathbf{M}}^{\theta _i}_{j,n}\) as

$$\begin{aligned} \mathbf{M}^{\theta _i}_{j,n}\ := \ \theta _i \left( {\text {M}}^{\theta _i,\mathrm{I}\mathrm{V}}_{j,n}- {\text {M}}^{\theta _i,\mathrm{I}\mathrm{I}\mathrm{I}}_{j,n} \left( {\text {M}}^{\theta _i,\mathrm{I}}_{j,n} \right) ^{-1} {\text {M}}^{\theta _i,\mathrm{I}\mathrm{I}}_{j,n} \right) \end{aligned}$$
(13)
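In other words (a reformulation for the reader's convenience), up to the factor \(\theta _i\), formula (13) is the Schur complement of the block \({\text {M}}^{\theta _i,\mathrm{I}}_{j,n}\) in the matrix \(\left[ m^{\theta _i,n}_{k,\ell }\right] _{\displaystyle ^{k\in K(i,j)^- \cup K(i,j)}_{\ell \in L(i,j)^- \cup L(i,j)}}\).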

Remark 2.8

It follows from the invertibility of the matrix \(\mathbf{Q}\) that \({\text {M}}^{\theta _i,\mathrm{I}}_{j,n}\) is a.s. invertible, and so is \(\mathbf{M}^{\theta _i}_{j,n}\).

Remark 2.9

In the particular case where \(\mathbf{A}_0\) is Hermitian (which means that \(\mathbf{Q}^{-1} = \mathbf{Q}^*\) and the \(\theta _i\)’s are real), then the matrices \({\mathbf{M}}^{\theta _i}_{j,n}\) are also Hermitian.

Now, we can formulate the result on the fluctuations.

2.2.2 Result

Theorem 2.10

(1):

As N goes to infinity, the random vector

$$\begin{aligned} \displaystyle \left( \Lambda _{i,j,n} \right) _{\begin{array}{c} 1 \le i \le q \\ 1 \le j\le \alpha _i \\ 1 \le n \le m_i \end{array} } \end{aligned}$$

defined at (9) converges in distribution to a random vector

$$\begin{aligned} \displaystyle \left( \Lambda ^\infty _{i,j,n} \right) _{\begin{array}{c} 1 \le i \le q \\ 1 \le j\le \alpha _i \\ 1 \le n \le m_i \end{array} } \end{aligned}$$

whose joint distribution is defined by the fact that for each \(1 \le i \le q\), \(1 \le j \le \alpha _i\) and \(1 \le n \le m_i\), \(\Lambda _{i,j,n}^\infty \) is the collection of the \({p_{i,j}}^{\text {th}}\) roots of the eigenvalues of the random matrix \(\mathbf{M}^{\theta _i}_{j,n}\) defined at (13).

(2):

The distributions of the random matrices \(\mathbf{M}^{\theta _i}_{j,n}\) are absolutely continuous with respect to the Lebesgue measure, and the random vector \(\displaystyle \left( \Lambda ^\infty _{i,j,n} \right) _{\begin{array}{c} 1 \le i \le q \\ 1 \le j\le \alpha _i \\ 1 \le n \le m_i \end{array}} \) has no deterministic coordinate.

Theorem 2.10 is illustrated with an example in Fig. 2, where we clearly see regular polygons appearing.

Fig. 2

Spectrum of a Wigner matrix of size \(N=5000\) with perturbation matrix \(\mathbf{A}={{\mathrm{\text {diag}}}}\left( \mathbf{R}_5(1.5+2{\text {i}}),\mathbf{R}_3(-2+1.5{\text {i}}),0,\ldots ,0\right) \). We see the blue crosses “\(+\)” (outliers) forming, respectively, a regular pentagon and an equilateral triangle around the red dots “\(\bullet \)” (their limits). We also see a significant difference between the two rates of convergence, \(N^{-1/10}\) and \(N^{-1/6}\) (Color figure online)
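The experiment of Fig. 2 can be reproduced in a few lines; a minimal sketch (ours, not from the paper; the seed is arbitrary), assuming NumPy:

```python
import numpy as np

def jordan_block(theta, p):
    # p x p Jordan block R_p(theta): theta on the diagonal, ones above it
    return theta * np.eye(p, dtype=complex) + np.diag(np.ones(p - 1), k=1)

N = 5000
rng = np.random.default_rng(1)
G = rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N))
H = (G + G.conj().T) / (2 * np.sqrt(N))   # sigma = 1, bulk on [-2, 2]

A = np.zeros((N, N), dtype=complex)
A[:5, :5] = jordan_block(1.5 + 2j, 5)     # one block R_5(1.5 + 2i)
A[5:8, 5:8] = jordan_block(-2 + 1.5j, 3)  # one block R_3(-2 + 1.5i)

eig = np.linalg.eigvals(H + A)
for theta in (1.5 + 2j, -2 + 1.5j):
    print("theta =", theta, "=> limit outlier xi =", theta + 1 / theta)
# the 5 (resp. 3) eigenvalues near each xi form a regular pentagon
# (resp. equilateral triangle), at distance of order N**(-1/10) (resp. N**(-1/6))
```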

3 Applications

In this section, we give examples of random matrices which satisfy the assumptions of Theorems 2.3 and 2.10.

3.1 Wigner Matrices

Let \(\mathbf{H}_N = \frac{1}{\sqrt{N}}\mathbf{W}_N\) be a symmetric/Hermitian Wigner matrix with independent entries up to symmetry. More precisely, we assume that

Assumption 3.1

Real symmetric case :

  • \(\big ( \mathbf{W}_N\big )_{i,j}\), \(1 \le i \le j \le N\), are independent,

  • the \((\mathbf{W}_N)_{i,j}\)'s for \(i\ne j\) (respectively, \(i=j\)) are identically distributed,

  • \({{\mathrm{\mathbb {E}}}}(\mathbf{W}_N)_{1,1}={{\mathrm{\mathbb {E}}}}(\mathbf{W}_N)_{1,2} = 0\), \({{\mathrm{\mathbb {E}}}}(\mathbf{W}_N)_{1,1}^2 = 2\sigma ^2\), \({{\mathrm{\mathbb {E}}}}(\mathbf{W}_N)_{1,2}^2 = \sigma ^2\),

  • \(c_3 := {{\mathrm{\mathbb {E}}}}\left| (\mathbf{W}_N)_{1,1}\right| ^3 < \infty \), \(m_5 := {{\mathrm{\mathbb {E}}}}\left| (\mathbf{W}_N)_{1,2}\right| ^5 < \infty \).

Hermitian case :

  • \(\big ({{\mathrm{{\text {Re}}}}}\mathbf{W}_N\big )_{i,j},\big ({{\mathrm{{\text {Im}}}}}\mathbf{W}_N\big )_{i,j}\), \(1 \le i < j \le N\), and \(\big (\mathbf{W}_N\big )_{i,i}\), \(1 \le i \le N\), are independent,

  • the \(({{\mathrm{{\text {Re}}}}}\mathbf{W}_N)_{i,j}\)'s and \(({{\mathrm{{\text {Im}}}}}\mathbf{W}_N)_{i,j}\)'s for \(i\ne j\) (respectively, the \((\mathbf{W}_N)_{i,i}\)'s) are identically distributed,

  • \({{\mathrm{\mathbb {E}}}}(\mathbf{W}_N)_{1,1}={{\mathrm{\mathbb {E}}}}(\mathbf{W}_N)_{1,2} = 0\), \({{\mathrm{\mathbb {E}}}}(\mathbf{W}_N)_{1,1}^2 = \sigma ^2\), \({{\mathrm{\mathbb {E}}}}({{\mathrm{{\text {Re}}}}}\mathbf{W}_N)_{1,2}^2 = \frac{\sigma ^2}{2}\),

  • \(c_3 := {{\mathrm{\mathbb {E}}}}\left| (\mathbf{W}_N)_{1,1}\right| ^3 < \infty \), \(m_5 := {{\mathrm{\mathbb {E}}}}\left| (\mathbf{W}_N)_{1,2}\right| ^5 < \infty \).

In this case, we have the following version of Theorem 2.3.

Theorem 3.2

(Convergence of the outliers for Wigner matrices) Let \(\theta _1,\ldots ,\theta _j\) be the eigenvalues of \(\mathbf{A}_N\) such that \(|\theta _i|>\sigma \). Then, with probability tending to one, for all large enough N, there are exactly j eigenvalues \(\widetilde{\lambda }_1,\ldots ,\widetilde{\lambda }_j\) of \(\widetilde{\mathbf{H}}_N := \frac{1}{\sqrt{N}}\mathbf{W}_N + \mathbf{A}_N\) at a macroscopic distance from \([-2\sigma ,2\sigma ]\) (outliers). More precisely, for all small enough \(\delta >0\), for all large enough N, for all \(i \in \{1,\ldots ,j\}\),

$$\begin{aligned} \widetilde{\lambda }_{i} = \theta _i + \frac{\sigma ^2}{\theta _i} + o \left( 1 \right) , \ \end{aligned}$$

after a proper labeling.

Proof

We just need to check that Assumptions 2.1 and 2.2 are satisfied.

  • As long as the entries of \(\mathbf{W}_N\) have a finite fourth moment, we know (see [2, Theorem 5.2]) that Assumption 2.1 is satisfied.

  • Now, we need to show that for any \(\delta >0\), as N goes to infinity,

    $$\begin{aligned} \sup _{{\text {dist}}(z,{\text {supp}}(\mu ))>\delta }\left\| \mathbf{U}_{2r}^* \left( z\mathbf{I}- \mathbf{H}_N \right) ^{-1} \mathbf{U}_{2r} - G_{\mu _{\mathrm{sc}}}(z) \mathbf{I} \right\| _{{{\mathrm{op}}}} \ {{\mathrm{\overset{(\mathbb {P})}{\longrightarrow }}}}\ 0. \end{aligned}$$

    Since we are dealing with \(2r \times 2r\)-sized matrices, it suffices to prove that for any unit vectors \(\mathbf{u}\), \(\mathbf{v}\) of \({{\mathrm{\mathbb {C}}}}^N\), for any \(\delta >0\) and any \(\eta >0\), as N goes to infinity,

    $$\begin{aligned} {{\mathrm{\mathbb {P}}}}\Big (\sup _{{\text {dist}}(z,{\text {supp}}(\mu ))>\delta }\left| \mathbf{u}^* \big ( \left( z\mathbf{I}- \mathbf{H}_N \right) ^{-1} - G_{\mu _{\mathrm{sc}}}(z) \mathbf{I}\big )\mathbf{v}\right| > \eta \Big ) \ \longrightarrow \ 0. \end{aligned}$$

    Moreover, as both \(G_{\mu _{\mathrm{sc}}}(z)\) and \(\left\| \left( z\mathbf{I}- \mathbf{H}_N \right) ^{-1} \right\| _{{{\mathrm{op}}}}\) go to 0 as |z| goes to infinity, we know there is a large enough constant M such that it suffices to prove that

    $$\begin{aligned} {{\mathrm{\mathbb {P}}}}\Big (\sup _{\begin{array}{c} {\text {dist}}(z,{\text {supp}}(\mu ))>\delta \\ |z| \ \le \ M \end{array}}\left| \mathbf{u}^* \big ( \left( z\mathbf{I}- \mathbf{H}_N \right) ^{-1} - G_{\mu _{\mathrm{sc}}}(z) \mathbf{I}\big )\mathbf{v}\right| > \eta \Big ) \ \longrightarrow \ 0. \end{aligned}$$

    Then, for any \(\eta '>0\), the compact set \(K = \{z, \ {\text {dist}}(z,{\text {supp}}(\mu ))\ge \delta \ \text { and } \ |z| \le M \}\) admits an \(\eta '\)-net, i.e., a finite subset \(\{z_1,\ldots ,z_p\}\) of K such that

    $$\begin{aligned} \forall z \in K, \ \exists i \in \{1,\ldots ,p\}, \ \ \ |z-z_i| < \eta ', \end{aligned}$$

    so that, using the uniform boundedness of the derivatives of \(G_{\mu _{\mathrm{sc}}}(z)\) and \(\mathbf{u}^* \left( z - \mathbf{H}_N \right) ^{-1}\mathbf{v}\) on K, for a small enough \(\eta '\), it suffices to prove that

    $$\begin{aligned} {{\mathrm{\mathbb {P}}}}\Big (\max _{i=1}^p \left| \mathbf{u}^* \big ( \left( z_i\mathbf{I}- \mathbf{H}_N \right) ^{-1} - G_{\mu _{\mathrm{sc}}}(z_i) \mathbf{I}\big )\mathbf{v}\right| > \eta /2 \Big ) \ \longrightarrow \ 0. \end{aligned}$$

    Then, we properly decompose each function \( x \ \mapsto \frac{1}{z_i-x}\) as a sum of a smooth compactly supported function and a function that vanishes on a neighborhood of \([-2\sigma ,2\sigma ]\), and conclude using [30, (ii) Theorem 1.6].

Moreover, in the Wigner case, we have

$$\begin{aligned} G_{\mu _{\mathrm{sc}}}(z)= & {} \frac{z - \sqrt{z^2 - 4\sigma ^2}}{2\sigma ^2}, \end{aligned}$$

where \(\sqrt{z^2 - 4\sigma ^2}\) is the branch of the square root with branch cut \([-2\sigma ,2\sigma ]\) which is equivalent to z at infinity, so that for any z outside \([-2\sigma ,2\sigma ]\), the equation \(G_{\mu _{\mathrm{sc}}}(z) = \frac{1}{\theta }\) possesses a solution if and only if \(|\theta |> \sigma \), and the unique solution is

$$\begin{aligned} \theta + \frac{\sigma ^2}{\theta }, \end{aligned}$$

which means that in the Wigner case, the outliers cannot outnumber the rank of the perturbation, and the phase transition condition is simply \(|\theta |>\sigma \). Actually, in [5] (see Remark 3.2), the authors explain that if \(\mu \) is \(\boxplus \)-infinitely divisible, then the sets \(\mathscr {S}_{\theta _i}\) have at most one element, which means that for Wigner matrices, it is not possible to observe the phenomenon of the outliers outnumbering the rank of \(\mathbf{A}\). \(\square \)
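As a quick sanity check (a direct computation, not part of the original argument): since

$$\begin{aligned} \Big (\theta + \frac{\sigma ^2}{\theta }\Big )^2 - 4\sigma ^2 \ = \ \Big (\theta - \frac{\sigma ^2}{\theta }\Big )^2, \qquad \text {we get} \qquad G_{\mu _{\mathrm{sc}}}\Big (\theta + \frac{\sigma ^2}{\theta }\Big ) \ = \ \frac{\big (\theta + \frac{\sigma ^2}{\theta }\big ) - \big (\theta - \frac{\sigma ^2}{\theta }\big )}{2\sigma ^2} \ = \ \frac{1}{\theta }, \end{aligned}$$

the branch chosen above selecting \(\theta - \frac{\sigma ^2}{\theta }\) (rather than its opposite) precisely when \(|\theta | > \sigma \).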

Remark 3.3

One can find another proof of Theorem 3.2 in [28], as a particular case of Theorem 2.4 there (see [28, Remark 2.5]), due to the fact that a Wigner matrix can be seen as a particular elliptic matrix. Nevertheless, the authors of [28] do not deal with the fluctuations.

To study the fluctuations of the outliers in the Wigner case, we must make an additional assumption on the perturbation \(\mathbf{A}_N\).

Assumption 3.4

The matrix \(\mathbf{A}_N\) has only a finite number (independent of N) of entries which are nonzero.

Remark 3.5

Assumption 3.4 is equivalent to supposing that \(\mathbf{U}_{2r}\) (the first 2r columns of \(\mathbf{U}\)) possesses only a finite number K (independent of N) of nonzero rows. Actually, this assumption is the analogue of the “the eigenvectors of \(\mathbf{A}\) do not spread out” hypothesis corresponding to “case a)” in [14].

Remark 3.6

If \(\mathbf{U}\) is Haar-distributed and independent from \(\mathbf{W}\), we can avoid making Assumption 3.4 (see Sect. 3.2). One can also slightly weaken Assumption 3.4 by assuming that the first 2r columns of \(\mathbf{U}\) consist of the N first coordinates of a collection of non-random vectors \(\mathbf{u}_1,\ldots ,\mathbf{u}_{2r}\) in \(\ell ^2({{\mathrm{\mathbb {N}}}})\) (see [30, Theorem 1.7]).

Theorem 3.7

(Fluctuations for Wigner matrices) Under Assumptions 3.1 and 3.4, Theorem 2.10 holds. Moreover, the distribution of the random vector

$$\begin{aligned} \left( {m}^{\theta _i}_{k,\ell }\right) _{\begin{array}{c} 1 \le i \le q \\ (k,\ell )\in J(\theta _i)\times I(\theta _i) \end{array}}, \end{aligned}$$

defined by (10), is

$$\begin{aligned} \Big ( \mathbf{e}_k^* \mathbf{Q}^{-1} \mathbf{U}_{K,2r}^*\Upsilon (\xi _{i})\mathbf{U}_{K,2r}\mathbf{Q}\mathbf{e}_\ell \Big )_{\begin{array}{c} 1 \le i \le q \\ (k,\ell )\in J(\theta _i)\times I(\theta _i) \end{array}}, \end{aligned}$$

where \(\xi _i : = \theta _i + \displaystyle \frac{\sigma ^2}{\theta _i}\) and where \(\Upsilon (z)\) is a \(K\times K\) random field defined by

$$\begin{aligned} \Upsilon (z) := (G_{\mu _{\mathrm{sc}}}(z))^2 \left( \mathbf{W}^{(K)} + \mathbf{Y}(z) \right) \end{aligned}$$
(14)

where \(\mathbf{W}^{(K)}\) is the \(K \times K\) upper-left corner submatrix of a matrix \(\widetilde{\mathbf{W}}_N\) such that \(\widetilde{\mathbf{W}}_N {{\mathrm{\overset{(d)}{=}}}}\mathbf{W}_N\) and \(\mathbf{Y}(z)\) is a \(K \times K\) Gaussian random field defined by [31, (2.7),(2.8),(2.9),(2.10),(2.11),(2.12)] in the real case and [31, (2.42),(2.43),(2.44),(2.45),(2.46),(2.47)] in the complex case.

Remark 3.8

This provides an example of non-universal fluctuations, in the sense that the \(\left( {m}^{\theta _i}_{k,\ell }\right) \)’s are not necessarily Gaussian. However, when \(\mathbf{H}_N\) is a GOE or GUE matrix, the \(\left( {m}^{\theta _i}_{k,\ell }\right) \)’s are centered Gaussian variables such that

$$\begin{aligned} {{\mathrm{\mathbb {E}}}}\left( {m}^{\theta _i}_{k,\ell } \; {m}^{\theta _{i'}}_{k',\ell '}\right)= & {} \psi _{\mathrm{sc}}(\xi _{i},\xi _{i'}) \; \big (\mathbf{e}_k^* \mathbf{Q}^{-1}(\mathbf{Q}^{-1})^*\mathbf{e}_{k'} \; \mathbf{e}_\ell ^* \mathbf{Q}^* \mathbf{Q}\mathbf{e}_{\ell '}\; + \; \delta _{k,\ell '}\delta _{k',\ell }\big ), \nonumber \\ {{\mathrm{\mathbb {E}}}}\left( {m}^{\theta _i}_{k,\ell } \; \overline{{m}^{\theta _{i'}}_{k',\ell '}}\right)= & {} \psi _{\mathrm{sc}}(\xi _{i},\overline{\xi _{i'}}) \; \big (\mathbf{e}_k^* \mathbf{Q}^{-1}(\mathbf{Q}^{-1})^*\mathbf{e}_{k'} \; \mathbf{e}_\ell ^* \mathbf{Q}^* \mathbf{Q}\mathbf{e}_{\ell '}\; + \; \delta _{k,\ell '}\delta _{k',\ell }\big ), \end{aligned}$$
(15)

for the GOE, and

$$\begin{aligned} {{\mathrm{\mathbb {E}}}}\left( {m}^{\theta _i}_{k,\ell } \; {m}^{\theta _{i'}}_{k',\ell '}\right)= & {} \psi _{\mathrm{sc}}(\xi _{i},\xi _{i'}) \; \delta _{k,\ell '}\delta _{k',\ell }, \nonumber \\ {{\mathrm{\mathbb {E}}}}\left( {m}^{\theta _i}_{k,\ell } \; \overline{{m}^{\theta _{i'}}_{k',\ell '}}\right)= & {} \psi _{\mathrm{sc}}(\xi _{i},\overline{\xi _{i'}}) \;\mathbf{e}_{ k}^*\mathbf{Q}^{-1}(\mathbf{Q}^{-1})^*\mathbf{e}_{ {k'}}\;\mathbf{e}_{ \ell '}^*\mathbf{Q}^*\mathbf{Q}\, \mathbf{e}_{ {\ell }}, \end{aligned}$$
(16)

for the GUE, where

$$\begin{aligned} \psi _{\mathrm{sc}}(z,w):= & {} G^2_{\mu _{\mathrm{sc}}}(z)G^2_{\mu _{\mathrm{sc}}}(w)\big (\sigma ^2 + \sigma ^4 \varphi _{\mathrm{sc}}(z,w)\big ) , \\ \varphi _{\mathrm{sc}}(z,w):= & {} \int \frac{1}{z-x} \frac{1}{w-x} \mu _{\mathrm{sc}}(\mathrm{d}x). \end{aligned}$$

We notice that if \(\mathbf{Q}^{-1} \ne \mathbf{Q}^*\), then we might observe correlations between the fluctuations of outliers at a macroscopic distance from each other. This phenomenon has already been observed in [25] for non-Gaussian Wigner matrices, whereas, here, the phenomenon may occur even for GUE matrices. Actually, (15) and (16) can be simplified using the fact that

$$\begin{aligned} \sigma ^2 G_{\mu _{\mathrm{sc}}}^2(z) - z G_{\mu _{\mathrm{sc}}}(z)+1 \ = \ 0, \end{aligned}$$

so that \(\varphi _{\mathrm{sc}}(z,w) = \displaystyle -\frac{G_{\mu _{\mathrm{sc}}}(z) -G_{\mu _{\mathrm{sc}}}(w) }{z-w}\) satisfies

$$\begin{aligned} \sigma ^2 G_{\mu _{\mathrm{sc}}}(z)G_{\mu _{\mathrm{sc}}}(w) \varphi _{\mathrm{sc}}(z,w) = \varphi _{\mathrm{sc}}(z,w) - G_{\mu _{\mathrm{sc}}}(z)G_{\mu _{\mathrm{sc}}}(w). \end{aligned}$$
(17)

Hence,

$$\begin{aligned}&G^2_{\mu _{\mathrm{sc}}}(\xi _{i})G^2_{\mu _{\mathrm{sc}}}(\xi _{i'})\big (\sigma ^2 + \sigma ^4 \varphi _{\mathrm{sc}}(\xi _{i},\xi _{i'})\big ) \\&\quad = \sigma ^2 G_{\mu _{\mathrm{sc}}}(\xi _{i})G_{\mu _{\mathrm{sc}}}(\xi _{i'})\Big [ G_{\mu _{\mathrm{sc}}} (\xi _{i})G_{\mu _{\mathrm{sc}}}(\xi _{i'}) + \sigma ^2 G_{\mu _{\mathrm{sc}}}(\xi _i)G_{\mu _{\mathrm{sc}}}(\xi _{i'}) \varphi _{\mathrm{sc}}(\xi _i,\xi _{i'})\Big ] \\&\quad = \sigma ^2 G_{\mu _{\mathrm{sc}}}(\xi _{i})G_{\mu _{\mathrm{sc}}}(\xi _{i'}) \varphi _{\mathrm{sc}}(\xi _i,\xi _{i'}) \\&\quad = \varphi _{\mathrm{sc}}(\xi _i,\xi _{i'}) - G_{\mu _{\mathrm{sc}}}(\xi _i)G_{\mu _{\mathrm{sc}}}(\xi _{i'}) \ = \ \Phi _{\mathrm{sc}}(\xi _i,\xi _{i'}) . \end{aligned}$$

and we recover the expression of the covariance for the UCI model (see Sect. 3.2), which is expected since the GUE belongs to the UCI model.

Proof

We show that Assumptions 3.1 and 3.4 imply Assumption 2.6, more precisely its conditions (1) and (2). For (1), we simply use [31, Theorem 2.1/2.5] to show that

$$\begin{aligned} \sqrt{N} \mathbf{U}_{2r}^* \left( \left( z-\mathbf{H}_N \right) ^{-1} - G_{\mu _{\mathrm{sc}}}(z) \mathbf{I}\right) \mathbf{U}_{2r}, \end{aligned}$$

converges weakly (as is done in [30]). The limit distribution is also given by [31, Theorem 2.1/2.5].

Then, for (2), we know by [31, (i) of Theorem 2.3/2.7] (respectively, [31, (iii) of Proposition 2.1]) that, for all \(k \ge 1\), the diagonal entries (respectively, the off-diagonal entries) of the matrix

$$\begin{aligned} \sqrt{N} \big (\left( z-\mathbf{H}_N\right) ^{-k-1} - \int (z-x)^{-k-1}\mu _{\mathrm{sc}}(\mathrm{d}x) \mathbf{I}\big ) \end{aligned}$$

converge in distribution so that

$$\begin{aligned} \sqrt{N} \mathbf{U}_{2r}^* \left( \left( z-\mathbf{H}_N\right) ^{-k-1} - \int (z-x)^{-k-1}\mu _{\mathrm{sc}}(\mathrm{d}x) \mathbf{I}\right) \mathbf{U}_{2r} \end{aligned}$$

is tight. \(\square \)

3.2 Hermitian Matrices Whose Distribution is Invariant by Unitary Conjugation

Let \(\mathbf{H}_N\) be an Hermitian matrix such that for any unitary \(N \times N\) matrix \(\mathbf{U}_N\), we have

$$\begin{aligned} \mathbf{U}_N \mathbf{H}_N \mathbf{U}_N^*&{{\mathrm{\overset{(d)}{=}}}}&\mathbf{H}_N. \end{aligned}$$
(18)

\(\mathbf{H}_N\) can be written \(\mathbf{H}_N = \mathbf{U}_N \mathbf{D}_N \mathbf{U}_N^*\) where \(\mathbf{D}_N\) is diagonal, \(\mathbf{U}_N\) is Haar-distributed, and \(\mathbf{U}_N\) and \(\mathbf{D}_N\) are independent. We also assume that \(\mathbf{H}_N\) satisfies (1) and Assumption 2.1. We shall call such matrices UCI matrices (for unitary conjugation invariance). In this case, we can write

$$\begin{aligned} \widetilde{\mathbf{H}}_N \ = \ \mathbf{H}_N + \mathbf{A}_N \ = \ \mathbf{U}_N \left( \mathbf{D}_N + \mathbf{U}^*_N \mathbf{A}_N \mathbf{U}_N \right) \mathbf{U}_N^*, \end{aligned}$$

so that, without any loss of generality, we can simply assume that \(\mathbf{H}_N\) is a diagonal matrix, and \(\mathbf{A}_N\) is a matrix of the form

$$\begin{aligned} \mathbf{A}_N = \mathbf{U}_{2r} \mathbf{A}_0 \mathbf{U}_{2r}^* \end{aligned}$$

where \(\mathbf{U}_{2r}\) consists of the first 2r columns of a Haar-distributed matrix independent from \(\mathbf{H}_N\).
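For numerical experiments with this model, a UCI matrix is easy to sample (a sketch, ours; the uniform diagonal is an arbitrary illustrative choice of \(\mathbf{D}_N\)), assuming NumPy and the standard QR-based Haar sampler with phase correction:

```python
import numpy as np

def haar_unitary(n, rng):
    # Haar-distributed unitary matrix: QR of a complex Ginibre matrix,
    # with the phases of the diagonal of R divided out
    Z = (rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))) / np.sqrt(2)
    Q, R = np.linalg.qr(Z)
    d = np.diag(R)
    return Q * (d / np.abs(d))

rng = np.random.default_rng(4)
N = 500
D = np.diag(rng.uniform(-1, 1, size=N))  # eigenvalues sampled from some mu
U = haar_unitary(N, rng)
H = U @ D @ U.conj().T                   # UCI: V H V* has the same law as H
```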

Theorem 3.9

(Convergence of the outliers for UCI matrices) If \(\mathbf{H}_N\) is a UCI matrix, then Theorem 2.3 holds.

Remark 3.10

Unlike the Wigner case, Theorem 2.3 does not need to be reformulated: in this case, the phenomenon of the outliers outnumbering the rank of \(\mathbf{A}_N\) can indeed be observed.

Proof

We just need to check that Assumption 2.2 is satisfied. To do so, one can apply a slightly modified version of [6, Lemma 2.2], where we replace all the “\({\text {dist}}(z,[a,b])>\delta \)” by “\({\text {dist}}(z,{\text {supp}}(\mu ))>\delta \),” which does not change the ideas of the proof. \(\square \)

For the fluctuations, we need to assume that for all \(i\in \{1,\ldots ,q\}\) and all \(n\in \{1,\ldots ,m_i\}\), as N goes to infinity,

$$\begin{aligned} \sqrt{N}\left( \frac{1}{N} {\text {Tr}} \left( \xi _{i,n}-\mathbf{H}_N \right) ^{-1} - \frac{1}{\theta _i}\right) \longrightarrow 0. \end{aligned}$$
(19)

Remark 3.11

Actually, in [6], the authors make the same assumption ([6, Hypothesis 3.1]).

Theorem 3.12

(Fluctuations for UCI matrices) If \(\mathbf{H}_N\) is a UCI matrix, then Theorem 2.10 holds. More precisely, the

$$\begin{aligned} \left( {m}^{\theta _i,n}_{k,\ell }\right) _{\begin{array}{c} 1 \le i \le q \\ 1 \le n \le m_i \\ (k,\ell )\in J(\theta _i)\times I(\theta _i) \end{array}}, \end{aligned}$$

defined by (10) are centered Gaussian variables such that

$$\begin{aligned} {{\mathrm{\mathbb {E}}}}\left( {m}^{\theta _i,n}_{k,\ell } \; {m}^{\theta _{i'},n'}_{k',\ell '}\right)= & {} \Phi (\xi _{i,n},\xi _{i',n'}) \; \delta _{k,\ell '}\delta _{k',\ell }, \\ {{\mathrm{\mathbb {E}}}}\left( {m}^{\theta _i,n}_{k,\ell } \; \overline{{m}^{\theta _{i'},n'}_{k',\ell '}}\right)= & {} \Phi (\xi _{i,n},\overline{\xi _{i',n'}}) \;\mathbf{e}_{ k}^*\mathbf{Q}^{-1}(\mathbf{Q}^{-1})^*\mathbf{e}_{ {k'}}\;\mathbf{e}_{ \ell '}^*\mathbf{Q}^*\mathbf{Q}\, \mathbf{e}_{ {\ell }}, \end{aligned}$$

where

$$\begin{aligned} \Phi (z,w):= & {} \int \frac{1}{z-x}\frac{1}{w-x}\mu (\mathrm{d}x) - \int \frac{1}{z-x}\mu (\mathrm{d}x)\int \frac{1}{w-x}\mu (\mathrm{d}x)\\= & {} {\left\{ \begin{array}{ll} - \frac{G_\mu (z)-G_\mu (w)}{z-w} - G_\mu (z)G_\mu (w)&{} : \text { if } z \ne w, \\ - G'_\mu (z) - (G_\mu (z))^2&{} : \text { otherwise.} \\ \end{array}\right. } \\ \end{aligned}$$
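The second expression follows from the first by partial fractions (a one-line check, added here for convenience): for \(z \ne w\),

$$\begin{aligned} \frac{1}{(z-x)(w-x)} \ = \ \frac{1}{z-w}\left( \frac{1}{w-x} - \frac{1}{z-x}\right) , \qquad \text {so that} \qquad \int \frac{\mu (\mathrm{d}x)}{(z-x)(w-x)} \ = \ -\frac{G_\mu (z)-G_\mu (w)}{z-w}, \end{aligned}$$

and the case \(z = w\) follows by letting \(w \rightarrow z\).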

Remark 3.13

Recall that we assumed that \(\mu \) is not a single Dirac measure, so that \(\Phi \) is not identically zero.

Remark 3.14

If \(\mathbf{A}_N\) is Hermitian, all the Jordan blocks have size 1 and the fluctuations are real random variables (see Remark 2.9). We recover the fact that, in the Hermitian case, the fluctuations of outliers at a macroscopic distance from each other are independent (see [6]), except if the two outliers come from the same eigenvalue of \(\mathbf{A}\) (i.e., they both belong to the same set \(\mathscr {S}_\theta \)). In that case, the fluctuations of the outliers belonging to the same set \(\mathscr {S}_\theta \) are all correlated. This phenomenon is illustrated by Fig. 3a, b.

Fig. 3

Correlation between the fluctuations of two outliers \(\xi _1\ne \xi _2\) for a sample of 400 matrices of size 500, in the case where both \(\mathbf{H}\) and \(\mathbf{A}\) are Hermitian and where the support of \(\mu \) is disconnected. Here, \(\mu (\mathrm{d}x)\) is taken equal to \(\frac{1}{2}(\delta _{-1}(\mathrm{d}x)+\mathbbm {1}_{[1,2]}(\mathrm{d}x))\). (a) Uncorrelated case: \(G_\mu (\xi _1)\ne G_\mu (\xi _2)\), which means that \(\xi _1\) and \(\xi _2\) do not belong to the same set \(\mathscr {S}_\theta \). (b) Correlated case: \(G_\mu (\xi _1)= G_\mu (\xi _2)\), which means that \(\xi _1\) and \(\xi _2\) belong to the same set \(\mathscr {S}_\theta \)
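For the measure of Fig. 3, the two limits generated by a single real spike can be located numerically; a sketch (ours; the spike \(\theta =2\) and the root brackets are illustrative choices), assuming NumPy and SciPy:

```python
import numpy as np
from scipy.optimize import brentq

def G(z):
    # Cauchy transform of mu = (1/2) delta_{-1} + (1/2) Uniform([1, 2]),
    # for real z outside supp(mu) = {-1} U [1, 2]
    return 0.5 / (z + 1) + 0.5 * np.log((z - 1) / (z - 2))

theta = 2.0
# G is decreasing on (-1, 1) and on (2, +inf), so each interval contains
# exactly one solution of G(xi) = 1/theta for this value of theta
xi1 = brentq(lambda z: G(z) - 1 / theta, -0.99, 0.99)
xi2 = brentq(lambda z: G(z) - 1 / theta, 2.0001, 50.0)
print(xi1, xi2)  # two outliers generated by the same spike theta
```

By Theorem 3.12, the fluctuations of the outliers around these two (macroscopically distant) limits are correlated, since \(G_\mu (\xi _1) = G_\mu (\xi _2) = 1/\theta \).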

Proof

We just need to check that \(\mathbf{H}_N\) satisfies \((1')\) and \((2')\) of Assumption 2.6 (\((0')\) being precisely assumption (19) above). Actually, for any \(k \ge 1\), any \(i \in \{1,\ldots ,q\}\), and any \(n \in \{1,\ldots ,m_i\}\), the diagonal matrix

$$\begin{aligned} \big (\xi _{i,n}-\mathbf{H}_N\big )^{-(k+1)} - \frac{1}{N} {\text {Tr}}\big (\xi _{i,n}-\mathbf{H}_N\big )^{-(k+1)}, \end{aligned}$$

fulfills the assumptions of Theorem 5.3, so that \((2')\) holds. Then, \((1')\) holds thanks to Theorem 5.5, which also gives us the covariance. \(\square \)

4 Proofs

4.1 Convergence of the Outliers: Proof of Theorem 2.3

In [5], the authors give an interpretation of why the limits are necessarily solutions of \(G_{\mu }(z) = \frac{1}{\theta }\), via the subordination functions of the free additive convolution of measures, in the particular case where one of the measures is \(\delta _0\) (see [5, Example 4.1]). Actually, our definition of the sets \(\mathscr {S}_{\theta _i}\) corresponds to that of the set \(O_\theta \) in [5, Definition 4.1]. A quick (but inaccurate) way to see why the limit is \(G_{\mu }^{-1}(\frac{1}{\theta })\) and to understand the approach of the proof is to write

$$\begin{aligned} \det \left( z - (\mathbf{H}_N+\mathbf{A}_N) \right) \ = \ \det \left( z - \mathbf{H}_N \right) \det \left( \mathbf{I}- \left( z - \mathbf{H}_N \right) ^{-1}\mathbf{A}_N \right) , \end{aligned}$$

then if \( \left( z - \mathbf{H}_N \right) ^{-1} \sim G_\mu (z)\mathbf{I}\), we can write

$$\begin{aligned} \det \left( z - (\mathbf{H}_N+\mathbf{A}_N) \right) \sim \det \left( z - \mathbf{H}_N \right) G_\mu (z)^N \det \left( \frac{1}{G_\mu (z)}\mathbf{I}- \mathbf{A}_N \right) \end{aligned}$$

so that if z is an outlier of \(\mathbf{H}_N+\mathbf{A}_N\), \(\frac{1}{G_\mu (z)}\) must be an eigenvalue of \(\mathbf{A}_N\).

To do it properly, we introduce the following function,

$$\begin{aligned} f(z)= & {} \det \left( \mathbf{I}- \mathbf{U}_{2r}^* \left( z\mathbf{I}- \mathbf{H} \right) ^{-1}\mathbf{U}_{2r}\mathbf{A}_0 \right) \ = \ \frac{\det \left( z - (\mathbf{H}_N+\mathbf{A}_N) \right) }{\det \left( z - \mathbf{H}_N \right) }, \end{aligned}$$
(20)
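Identity (20) is Sylvester's identity \(\det (\mathbf{I}- \mathbf{X}\mathbf{Y}) = \det (\mathbf{I}- \mathbf{Y}\mathbf{X})\) applied to the factorization \(\mathbf{A}_N = \mathbf{U}_{2r}\mathbf{A}_0\mathbf{U}_{2r}^*\); a quick numerical check (ours; sizes and the test point z are arbitrary), assuming NumPy:

```python
import numpy as np

rng = np.random.default_rng(3)
N, s = 50, 4                 # s plays the role of 2r
D = np.diag(rng.normal(size=N)).astype(complex)   # stands for H_N
Z = rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N))
U = np.linalg.qr(Z)[0][:, :s]                     # U_{2r}: orthonormal columns
A0 = rng.normal(size=(s, s)) + 1j * rng.normal(size=(s, s))
A = U @ A0 @ U.conj().T

z = 3.0 + 1.0j
R = np.linalg.inv(z * np.eye(N) - D)
lhs = np.linalg.det(np.eye(s) - U.conj().T @ R @ U @ A0)
rhs = np.linalg.det(z * np.eye(N) - (D + A)) / np.linalg.det(z * np.eye(N) - D)
print(abs(lhs - rhs))        # ~ 1e-13: both sides of (20) agree
```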

The zeros of f are the eigenvalues of \(\widetilde{\mathbf{H}}_N\) which are not eigenvalues of \(\mathbf{H}_N\). Then, we introduce the function

$$\begin{aligned} f_0(z):= & {} \det \left( \mathbf{I}- G_\mu (z) \mathbf{A}_0\right) , \end{aligned}$$
(21)

and the proof of Theorem 2.3 relies on the two following lemmas.

Lemma 4.1

As N goes to infinity, we have

$$\begin{aligned} \sup _{{\text {dist}}(z,{\text {supp}}(\mu ))>\delta } \left| f(z)-f_0(z)\right| \ {{\mathrm{\overset{(\mathbb {P})}{\longrightarrow }}}}\ 0. \end{aligned}$$

Lemma 4.2

Let K be a compact set, and let \(\varepsilon >0\) such that

  • \(\displaystyle {\text {dist}}(K,\bigcup _{i=1}^j \mathscr {S}_{\theta _i})\ge \varepsilon \),

  • \(\displaystyle {\text {dist}}(K,{\text {supp}}(\mu )) \ge \varepsilon \).

Then, with probability tending to one, f does not vanish on K.

If these lemmas are true, the end of the proof goes as follows. We know that, with probability tending to one, there is \(\varepsilon >0\) such that

  • there is a constant \(M>0\) such that \(\mathbf{H}_N+\mathbf{A}_N\) has no eigenvalues in the area \(\{z, \ |z|>M\}\),

  • \({\text {Spec}}(\mathbf{H}_N)\subset \{z, \ {\text {dist}}(z,{\text {supp}}(\mu ))<\varepsilon \}\),

We set

$$\begin{aligned} \mathscr {S} \ := \ \bigcup _{i=1}^j \mathscr {S}_{\theta _i} \end{aligned}$$

and we define

$$\begin{aligned} \mathscr {S}^\varepsilon:= & {} \bigcup _{i=1}^j \bigcup _{\xi \in \mathscr {S}_{\theta _i}} \left\{ z, \ |z-\xi |<\varepsilon \right\} \end{aligned}$$
(22)

with the convention that \({\mathscr {S}}^{\varepsilon } = \emptyset \) if \(\mathscr {S}=\emptyset \). Up to a smaller choice of \(\varepsilon \), we can suppose that the disks centered at the elements of the \(\mathscr {S}_{\theta _i}\)'s with radius \(\varepsilon \) neither intersect each other nor intersect \(\{z, \ {\text {dist}}(z,{\text {supp}}(\mu ))<\varepsilon \}\). Then, using Lemma 4.2, with

$$\begin{aligned} K \ := \ \left\{ z, \ |z| \le M \right\} \backslash \left( {\mathscr {S}}^{\varepsilon } \cup \{z, \ {\text {dist}}(z,{\text {supp}}(\mu ))<\varepsilon \} \right) , \end{aligned}$$

we deduce that all the eigenvalues of \(\widetilde{\mathbf{H}}_N\) are contained in \({\mathscr {S}}^{\varepsilon } \cup \{z, \ {\text {dist}}(z,{\text {supp}}(\mu ))<\varepsilon \}\). Indeed, if z is an eigenvalue of \(\widetilde{\mathbf{H}}_N\) such that \({\text {dist}}(z,{\text {supp}}(\mu ))>\varepsilon \), then z must be a zero of f, which is impossible on K by Lemma 4.2.

Moreover, for each \(i\in \{1,\ldots ,j\}\) and each \(\xi \in \mathscr {S}_{\theta _i}\), we know from Lemma 4.1 that

$$\begin{aligned} \sup _{z, |z-\xi |=\varepsilon } |f(z)-f_0(z)| \ \longrightarrow \ 0,&\quad \text { and } \quad&\inf _{z, |z-\xi |=\varepsilon } |f_0(z)| \ > \ 0. \end{aligned}$$

We deduce by Rouché's Theorem (see [4, p. 131]) that, for all large enough N, f and \(f_0\) have the same number of zeros inside the disk \(\{z, \ |z-\xi |<\varepsilon \}\), for each \(\xi \) in the \(\mathscr {S}_{\theta _i}\)'s.

Now, we just need to prove the two previous lemmas.

Proof of Lemma 4.1

We know that, for some positive constant C,

$$\begin{aligned} \sup _{{\text {dist}}(z,{\text {supp}}(\mu ))>\delta }|f(z) - f_0(z)|\le & {} C \sup _{{\text {dist}}(z,{\text {supp}}(\mu ))>\delta }\left\| \mathbf{U}_{2r}^*\left( \left( z - \mathbf{H}_N \right) ^{-1}-G_\mu (z)\mathbf{I}\right) \mathbf{U}_{2r} \right\| _{{{\mathrm{op}}}}, \end{aligned}$$

and we conclude with Assumption 2.2.

Proof of Lemma 4.2

We write, thanks to Assumption 2.2,

$$\begin{aligned} \det \left( \mathbf{I}- \left( z - \mathbf{H}_N \right) ^{-1} \mathbf{A}_N\right)= & {} \det \left( \mathbf{I}- \mathbf{U}_{2r}^* \left( z-\mathbf{H}_N \right) ^{-1}\mathbf{U}_{2r}\mathbf{A}_0 \right) \\= & {} \det \left( \mathbf{I}- G_{\mu }(z) \mathbf{A}_0 +o \left( 1 \right) \right) \\= & {} \prod _{i=1}^j \big (1 - G_{\mu }(z) \theta _i \big )^{k_i}+o \left( 1 \right) . \end{aligned}$$

Then, since \(z \in K\), it is easy to show that for each i, \(|1 - G_{\mu }(z)\theta _i|>0\). \(\square \)

4.2 Fluctuations

The proof of Theorem 2.10 is the same as that of [10, Theorem 2.10], and all we need to do here is to prove the following analogue of [10, Lemma 5.1].

Lemma 4.3

For all \(j \in \{1,\ldots ,\alpha _i\}\) and all \(n \in \{1,\ldots ,m_i\}\), let \(F^{\theta _i}_{j,n}(z)\) be the rational function defined by

$$\begin{aligned} F^{\theta _i}_{j,n}(z) \ : = \ f\left( \xi _{i,n} + \frac{z}{N^{1/(2p_{i,j})}} \right) . \end{aligned}$$
(23)

Then, there exists a collection of positive constants \((\gamma _{i,j})_{\displaystyle ^{1\le i\le q}_{1\le j\le \alpha _i}}\) and a collection of nonvanishing random variables \((C_{i,j,n})_{\begin{array}{c} 1\le i\le q \\ 1\le j\le \alpha _i \\ 1 \le n \le m_i \end{array}}\) independent of z, such that we have the convergence in distribution (for the topology of the uniform convergence over any compact set)

$$\begin{aligned} \left( N^{\gamma _{i,j}} F^{\theta _i}_{j,n}(\cdot ) \right) _{\begin{array}{c} 1\le i\le q \\ 1\le j\le \alpha _i \\ 1 \le n \le m_i \end{array}}\ \underset{N \rightarrow \infty }{\longrightarrow } \ \left( z\in {{\mathrm{\mathbb {C}}}}\ \mapsto \ z^{\pi _{i,j}}\cdot C_{i,j,n} \cdot \det \left( z^{p_{i,j}}-\mathbf{M}^{\theta _i}_{j,n}\right) \right) _{\begin{array}{c} 1\le i\le q \\ 1\le j\le \alpha _i \\ 1 \le n \le m_i \end{array}} \end{aligned}$$

where \(\mathbf{M}^{\theta _i}_{j,n}\) is the random matrix introduced at (13) and \(\pi _{i,j} \ := \ \sum _{l > j} \beta _{{i,l}}p_{i,l}\).

Once this lemma is proven, Theorem 2.10 follows (see Section 5.1 of [10] for more details). To prove Lemma 4.3, we proceed as in the proof of [10, Lemma 5.1]. First, we fix \(\theta _i\) (\(=\theta \)), \(n\in \{1,\ldots ,m_i\}\) and \(j\in \{1,\ldots ,\alpha _i\}\) (these indices shall be implicit in what follows), and we set \(p := p_{i,j}\). Recalling that \(\mathbf{A}_0 = \mathbf{Q}\mathbf{J}\mathbf{Q}^{-1}\), we write

$$\begin{aligned} F^\theta _{j,n}(z)= & {} \det \left( \mathbf{I}- \left( \xi _n + \frac{z}{N^{1/(2p)}}-\mathbf{H}_N \right) ^{-1} \mathbf{U}_{2r}\mathbf{Q}\mathbf{J}\mathbf{Q}^{-1}\mathbf{U}_{2r}^*\right) \\= & {} \det \left( \mathbf{I}- G_\mu \Big ( \xi _n+\frac{z}{N^{1/(2p)}}\Big )\mathbf{J}- \frac{1}{\sqrt{N}}{ \mathbf{Z}_N\Big ( \xi _n + \frac{z}{N^{1/(2p)}} \Big )}\right) \\= & {} \det \left( \mathbf{I}-\frac{\mathbf{J}}{\theta }-G'_\mu (\xi _n)\frac{z}{N^{1/(2p)}}\big (1+o \left( 1 \right) \big )\mathbf{J}-\frac{1}{\sqrt{N}}{ \mathbf{Z}_N\Big (\xi _n +\frac{z}{ N^{1/(2p)}} \Big )}\right) \end{aligned}$$

where

$$\begin{aligned} \mathbf{Z}_N(z) \ := \ \sqrt{N} \mathbf{Q}^{-1}\mathbf{U}_{2r}^*\left( \left( z-\mathbf{H}_N \right) ^{-1}-G_\mu (z)\mathbf{I}\right) \mathbf{U}_{2r}\mathbf{Q}\mathbf{J}. \end{aligned}$$

Recall that by definition, \(G_\mu (\xi _n) = \theta ^{-1}\). From here, the reasoning to end the proof is exactly the same as in [10, Lemma 5.1]. Nevertheless, we still have to prove that, for all \(\theta \), all n, all compact sets K and all \(z \in K\),

$$\begin{aligned} \mathbf{Z}_N\big (\xi _n+\frac{z}{N^{1/(2p)}}\big ) = \mathbf{Z}_N(\xi _n) + o \left( 1 \right) , \qquad \text { and } \ \ \mathbf{Z}_N(\xi _n) \text { converges weakly. } \end{aligned}$$
(24)

To do so, we write (thanks to (5.1)),

$$\begin{aligned}&\mathbf{Z}_N\left( \xi _n + \frac{z}{N^{1/(2p)}}\right) = \sqrt{N}\mathbf{Q}^{-1}\mathbf{U}_{2r}^*\left( \left( \xi _n-\mathbf{H}_N \right) ^{-1} - \frac{1}{\theta }\right) \mathbf{U}_{2r}\mathbf{Q}\mathbf{J}\\&\quad + \ \sum _{k=1}^p \left( \frac{-z}{N^{1/(2p)}}\right) ^k \sqrt{N}\mathbf{Q}^{-1}\mathbf{U}_{2r}^* \left( \big (\xi _n-\mathbf{H}_N\big )^{-(k+1)} - \int \frac{\mu (\mathrm{d}x)}{(\xi _n-x)^{k+1}} \right) \mathbf{U}_{2r}\mathbf{Q}\mathbf{J}\\&\quad + \ \frac{1}{N^{1/(2p)}} \mathbf{Q}^{-1}\mathbf{U}_{2r}^*\Big ({\xi _n}-\mathbf{H}_N \Big )^{-(p+1)} \left( \xi _n+\frac{z}{N^{1/(2p)}} -\mathbf{H}_N \right) ^{-1}\mathbf{U}_{2r}\mathbf{Q}\mathbf{J}+ o \left( 1 \right) . \end{aligned}$$

The last term is a \(o \left( 1 \right) \) since \({\text {dist}}(\xi _n,{\text {Spec}}(\mathbf{H}_N))> \varepsilon \), and one can conclude if conditions (1) and (2) of Assumption 2.6 are satisfied. If instead \((0'),(1'),(2')\) hold, we write

$$\begin{aligned} \mathbf{Z}_N\left( \xi _n + \frac{z}{N^{1/(2p)}}\right)= & {} \sqrt{N} \mathbf{Q}^{-1}\mathbf{U}_{2r}^* \left( \frac{1}{N} {\text {Tr}} \left( \xi _n-\mathbf{H}_N \right) ^{-1} - \frac{1}{\theta } \right) \mathbf{U}_{2r}\mathbf{Q}\mathbf{J}\\&+ \ \sqrt{N}\mathbf{Q}^{-1}\mathbf{U}_{2r}^*\left( \left( \xi _n-\mathbf{H}_N \right) ^{-1} - \frac{1}{N} {\text {Tr}} \left( \xi _n-\mathbf{H}_N \right) ^{-1} \right) \mathbf{U}_{2r}\mathbf{Q}\mathbf{J}\\&+ \ \sum _{k=1}^p \left( \frac{-z}{N^{1/(2p)}} \right) ^k \sqrt{N}\mathbf{Q}^{-1} \mathbf{U}_{2r}^* \left( \big (\xi _n-\mathbf{H}_N\big )^{-(k+1)} - \frac{1}{N} {\text {Tr}}\big (\xi _n-\mathbf{H}_N\big )^{-(k+1)} \right) \mathbf{U}_{2r}\mathbf{Q}\mathbf{J}\\&+ \ \frac{1}{N^{1/(2p)}} \mathbf{Q}^{-1}\mathbf{U}_{2r}^*\Big ({\xi _n}-\mathbf{H}_{N} \Big )^{-(p+1)} \left( \xi _n+\frac{z}{N^{1/(2p)}} -\mathbf{H}_N \right) ^{-1}\mathbf{U}_{2r}\mathbf{Q}\mathbf{J}+ o \left( 1 \right) . \end{aligned}$$