Limiting Eigenvectors of Outliers for Spiked Information-Plus-Noise Type Matrices

Capitaine, Mireille

doi:10.1007/978-3-319-92420-5_4

Mireille Capitaine¹⁵

Part of the book series: Lecture Notes in Mathematics ((SEMPROBAB,volume 2215))

802 Accesses
6 Citations

Abstract

We consider an Information-Plus-Noise type matrix where the Information matrix is a spiked matrix. When some eigenvalues of the random matrix separate from the bulk, we study how the corresponding eigenvectors project onto those of the spikes. Note that, in an Appendix, we present alternative versions of the earlier results of Bai and Silverstein (Random Matrices Theory Appl 1(1):1150004, 44, 2012) (“noeigenvalue outside the support of the deterministic equivalent measure”) and Capitaine (Indiana Univ Math J 63(6):1875–1910, 2014) (“exact separation phenomenon”) where we remove some technical assumptions that were difficult to handle.

Access provided by CONRICYT-eBooks. Download chapter PDF

Outliers in the Single Ring Theorem

Article 16 May 2015

Complex Outliers of Hermitian Random Matrices

Article 16 April 2016

On the principal components of sample covariance matrices

Article 22 February 2015

Keywords

4.1 Introduction

In this paper, we consider the so-called Information-Plus-Noise type model

$$\displaystyle \begin{aligned} M_N= \varSigma_N \varSigma_N^* \mbox{ ~where ~} \varSigma_N = \sigma \frac{X_N}{\sqrt{N}}+A_N, \end{aligned} $$

defined as follows.

n = n(N), n ≤ N, c _N = n∕N →_N→+∞c ∈ ]0;1].
σ ∈ ]0;+∞[.
X _N = [X _ij]_{1≤i≤n;1≤j≤N} where $\{X_{ij}, i\in \mathbb {N}, j \in \mathbb {N}\}$ is an infinite set of complex random variables such that $\{\Re (X_{ij}), \Im (X_{ij}), i\in \mathbb {N}, j \in \mathbb {N}\}$ are independent centered random variables with variance 1∕2 and satisfy
1. 1.
  There exists K > 0 and a random variable Z with finite fourth moment for which there exists x ₀ > 0 and an integer number n ₀ > 0 such that, for any x > x ₀ and any integer numbers n ₁, n ₂ > n ₀, we have
  $$\displaystyle \begin{aligned}\frac{1}{n_1n_2} \sum_{i\leq n_1,j\leq n_2}P\left( \vert X_{ij}\vert >x\right) \leq KP\left(\vert Z \vert>x\right).\end{aligned} $$
  (4.1)
2. 2.
  $$\displaystyle \begin{aligned}\sup_{(i,j)\in \mathbb{N}^2}\mathbb{E}(\vert X_{ij}\vert^3)<+\infty. \end{aligned} $$
  (4.2)
Let ν be a compactly supported probability measure on $\mathbb {R}$ whose support has a finite number of connected components. Let Θ = {θ ₁;…;θ _J} where θ ₁ > … > θ _J ≥ 0 are J fixed real numbers independent of N which are outside the support of ν. Let k ₁, …, k _J be fixed integer numbers independent of N and $r=\sum _{j=1}^J k_j$. Let β _j(N) ≥ 0, r + 1 ≤ j ≤ n, be such that $\frac {1}{n} \sum _{j=r+1}^{n} \delta _{\beta _j(N)}$ weakly converges to ν and
$$\displaystyle \begin{aligned} \max _{r+1\leq j\leq n} \mathrm{dist}(\beta _j(N),\mathrm{supp}(\nu ))\mathop{\longrightarrow } _{N \rightarrow \infty } 0\end{aligned} $$
(4.3)
where supp(ν) denotes the support of ν.

Let α _j(N), j = 1, …, J, be real nonnegative numbers such that
$$\displaystyle \begin{aligned}\lim_{N \rightarrow +\infty} \alpha_j(N)=\theta_j.\end{aligned}$$
Let A _N be a n × N deterministic matrix such that, for each j = 1, …, J, α _j(N) is an eigenvalue of $A_N A_N^*$ with multiplicity k _j, and the other eigenvalues of $A_N A_N^*$ are the β _j(N), r + 1 ≤ j ≤ n. Note that the empirical spectral measure of ${A_N A_N^*} $ weakly converges to ν.

Remark 4.1

Note that assumption such as (4.1) appears in [14]. It obviously holds if the X _ij’s are identically distributed with finite fourth moment.

For any Hermitian n × n matrix Y , denote by spect(Y ) its spectrum, by

$$\displaystyle \begin{aligned} \lambda_1(Y) \geq \ldots \geq \lambda_n(Y) \end{aligned}$$

the ordered eigenvalues of Y and by μ _Y the empirical spectral measure of Y :

$$\displaystyle \begin{aligned}\mu _{Y} := \frac{1}{n} \sum_{i=1}^n \delta_{\lambda _{i}(Y)}.\end{aligned}$$

For a probability measure τ on $\mathbb {R}$, denote by g _τ its Stieltjes transform defined for $z \in \mathbb {C}\setminus \mathbb {R}$ by

$$\displaystyle \begin{aligned}g_\tau (z) = \int_{\mathbb{R}} \frac{d\tau (x)}{z-x}.\end{aligned}$$

When the X _ij’s are identically distributed, Dozier and Silverstein established in [15] that almost surely the empirical spectral measure $\mu _{M_N}$ of M _N converges weakly towards a nonrandom distribution μ _σ,ν,c which is characterized in terms of its Stieltjes transform which satisfies the following equation: for any $z \in \mathbb {C}^+$,

$$\displaystyle \begin{aligned} g_{\mu_{\sigma,\nu,c}}(z)=\int \frac{1}{(1-\sigma^2cg_{ \mu_{\sigma,\nu,c}}(z))z- \frac{ t}{1- \sigma^2 cg_{ \mu_{\sigma,\nu,c}}(z)} -\sigma^2 (1-c)}d\nu(t).\end{aligned} $$

(4.4)

This result of convergence was extended to independent but non identically distributed random variables by Xie in [30]. (Note that, in [21], the authors in- vestigated the case where σ is replaced by a bounded sequence of real numbers.) In [11], the author carries on with the study of the support of the limiting spectral measure previously investigated in [16] and later in [25, 28] and obtains that there is a one-to-one relationship between the complement of the limiting support and some subset in the complement of the support of ν which is defined in (4.6) below.

Proposition 4.1

Define differentiable functions ω _{σ,
ν,
c} and Φ _{σ,
ν,
c} on respectively $ \mathbb {R}\setminus \mathit{\mbox{supp}}(\mu _{\sigma ,\nu ,c})$ and $ \mathbb {R}\setminus \mathit{\mbox{supp}}(\nu )$ by setting

$$\displaystyle \begin{aligned}\omega_{\sigma,\nu,c} :\begin{array}{ll} \mathbb{R}\setminus \mathit{\mbox{supp}}(\mu_{\sigma,\nu,c}) \rightarrow \mathbb{R}\\ x \mapsto x (1- \sigma^2 c g_{ \mu_{\sigma,\nu,c}}(x))^2 -\sigma^2 (1-c)(1-\sigma^2 c g_{\mu_{\sigma,\nu,c}}(x))\end{array}\end{aligned} $$

(4.5)

and

$$\displaystyle \begin{aligned}\varPhi_{\sigma,\nu,c} :\begin{array}{ll} \mathbb{R}\setminus \mathit{\mbox{supp}}(\nu) \rightarrow \mathbb{R}\\ x \mapsto x (1+c \sigma^2g_{ \nu}(x))^2 + \sigma^2 (1-c) (1+ c \sigma^2 g_\nu(x))\end{array}.\end{aligned}$$

Set

$$\displaystyle \begin{aligned} \mathbb{E}_{\sigma,\nu,c}:=\left\{ x \in \mathbb{R}\setminus \mathit{\mbox{supp}}(\nu), \varPhi_{\sigma,\nu,c}^{\prime}(x) >0, g_\nu(x) >-\frac{1}{\sigma^2c}\right\}.\end{aligned} $$

(4.6)

ω _σ,ν,c is an increasing analytic diffeomorphism with positive derivative from $\mathbb {R}\setminus \mathit{\mbox{supp}}(\mu _{\sigma ,\nu ,c})$ to $\mathbb {E}_{\sigma ,\nu ,c}$ , with inverse Φ _σ,ν,c.

Moreover, extending previous results in [25] and [8] involving the Gaussian case and finite rank perturbations, [11] establishes a one-to-one correspondence between the θ _i’s that belong to the set $\mathbb {E}_{\sigma ,\nu ,c}$ (counting multiplicity) and the outliers in the spectrum of M _N. More precisely, setting

$$\displaystyle \begin{aligned}\varTheta_{\sigma,\nu,c} = \left\{\theta \in \varTheta, \varPhi_{\sigma,\nu,c}^{\prime}(\theta) >0, g_\nu(\theta) >-\frac{1}{\sigma^2c}\right\},\end{aligned} $$

(4.7)

and

$$\displaystyle \begin{aligned}\mathbb{S}=\mbox{ supp } (\mu_{\sigma,\nu,c}) \cup \left\{ \varPhi_{\sigma,\nu,c}({\theta}), \theta \in \varTheta_{\sigma,\nu,c} \right\},\end{aligned} $$

(4.8)

we have the following results.

Theorem 4.1 ([11])

For any 𝜖 > 0,

$$\displaystyle \begin{aligned}\mathbb P[\,\mathit{\mbox{for all large N}}, \mathrm{spect}(M_N) \subset \{x \in \mathbb{R} , \mathit{\mbox{dist}}(x,\mathbb{S})\leq \epsilon \}]=1.\end{aligned}$$

Theorem 4.2 ([11])

Let θ _j be in Θ _σ,ν,c and denote by n _j−1 + 1, …, n _j−1 + k _j the descending ranks of α _j(N) among the eigenvalues of $A_NA_N^*$ . Then the k _j eigenvalues $(\lambda _{n_{j-1}+i}(M_N), \, 1 \leq i \leq k_j)$ converge almost surely outside the support of μ _σ,ν,c towards $\rho _{\theta _j}:=\varPhi _{\sigma ,\nu ,c}(\theta _j)$ . Moreover, these eigenvalues asymptotically separate from the rest of the spectrum since (with the conventions that λ ₀(M _N) = +∞ and λ _N+1(M _N) = −∞) there exists δ ₀ > 0 such that almost surely for all large N,

$$\displaystyle \begin{aligned}\lambda_{n_{j-1}}(M_N) > \rho _{\theta _j} + \delta_0 \, \mathit{\mbox{ and }} \, \lambda_{n_{j-1}+k_j +1}(M_N) < \rho _{\theta _j} - \delta_0 .\end{aligned} $$

(4.9)

Remark 4.2

Note that Theorems 4.1 and 4.2 were established in [11] for A _N as (4.14) below and with $\mathbb {S}\cup \{0\}$ instead of $ \mathbb {S}$ but they hold true as stated above and in the more general framework of this paper. Indeed, these extensions can be obtained sticking to the proof of the corresponding results in [11] but using the new versions of [3] and of the exact separation phenomenon of [11] which are presented in the Appendix 1 of the present paper.

The aim of this paper is to study how the eigenvectors corresponding to the outliers of M _N project onto those corresponding to the spikes θ _i’s. Note that there are some pioneering results investigating the eigenvectors corresponding to the outliers of finite rank perturbations of classical random matricial models: [27] in the real Gaussian sample covariance matrix setting, and [7, 8] dealing with finite rank additive or multiplicative perturbations of unitarily invariant matrices. For a general perturbation, dealing with sample covariance matrices, Péché and Ledoit [23] introduced a tool to study the average behaviour of the eigenvectors but it seems that this did not allow them to focus on the eigenvectors associated with the eigenvalues that separate from the bulk. It turns out that further studies [6, 10] point out that the angle between the eigenvectors of the outliers of the deformed model and the eigenvectors associated to the corresponding original spikes is determined by Biane-Voiculescu’s subordination function. For the model investigated in this paper, such a free interpretation holds but we choose not to develop this free probabilistic point of view in this paper and we refer the reader to the paper [13]. Here is the main result of the paper.

Theorem 4.3

Let θ _j be in Θ _σ,ν,c (defined in (4.7)) and denote by n _j−1 + 1, …, n _j−1 + k _j the descending ranks of α _j(N) among the eigenvalues of $A_NA_N^*$ . Let ξ(j) be a normalized eigenvector of M _N relative to one of the eigenvalues $(\lambda _{n_{j-1}+q}(M_N)$ , 1 ≤ q ≤ k _j). Denote by ∥⋅∥₂ the Euclidean norm on $\mathbb {C}^n$ . Then, almost surely

(i)
$\displaystyle {\lim _{N\rightarrow +\infty }\left \| P_{\mathit{\mbox{Ker }}(\alpha _j(N) I_N-A_NA_N^*)}\xi (j)\right \|{ }^2_2 = \tau (\theta _j)}$

where
$$\displaystyle \begin{aligned}\tau(\theta_j)= \frac{1-\sigma^2c g_{\mu_{\sigma,\nu,c}}(\rho_{\theta_j})}{\omega_{\sigma,\nu,c}^{\prime}(\rho_{\theta_j})}=\frac{ \varPhi_{\sigma,\nu,c}^{\prime}({\theta_j})}{1+ \sigma^2 cg_\nu(\theta_j)} \end{aligned} $$
(4.10)
(ii)
for any θ _i in Θ _σ,ν,c ∖{θ _j},
$$\displaystyle \begin{aligned}\displaystyle{\lim_{N\rightarrow +\infty}\left\| P_{ \mathit{\mbox{Ker }}(\alpha_i(N) I_N-A_NA_N^*)}\xi(j)\right\|{}_2 = 0.}\end{aligned}$$

The sketch of the proof of Theorem 4.3 follows the analysis of [10] as explained in Sect. 4.2. In Sect. 4.3, we prove a universal result allowing to reduce the study to estimating expectations of Gaussian resolvent entries carried on Sect. 4.4. In Sect. 4.5, we explain how to deduce Theorem 4.3 from the previous Sections. In an Appendix 1, we present alternative versions on the one hand of the result in [3] about the lack of eigenvalues outside the support of the deterministic equivalent measure, and, on the other hand, of the result in [11] about the exact separation phenomenon. These new versions deal with random variables whose imaginary and real parts are independent but remove the technical assumptions ((1.10) and “b ₁ > 0” in Theorem 1.1 in [3] and “ω _σ,ν,c(b) > 0” in Theorem 1.2 in [11]). This allows us to claim that Theorem 4.2 holds in our context (see Remark 4.2). Finally, we present, in Appendix 2, some technical lemmas that are used throughout the paper.

4.2 Sketch of the Proof

Throughout the paper, for any m × p matrix B, $(m,p)\in {\mathbb {N}}^2$, we will denote by ∥B∥ the largest singular value of B, and by $\Vert B\Vert _2=\{Tr (BB^*)\}^{\frac {1}{2}}$ its Hilbert-Schmidt norm.

The proof of Theorem 4.3 follows the analysis in two steps of [10].

Step A

First, we shall prove that, for any orthonormal system $(\xi _1,\cdots ,\xi _{k_j})$ of eigenvectors associated to the k _j eigenvalues $\lambda _{n_{j-1}+q}(M_N)$, 1 ≤ q ≤ k _j, the following convergence holds almost surely: ∀l = 1, …, J,

$$\displaystyle \begin{aligned} \sum_{p=1}^{k_j}\left\| P_{\ker(\alpha_l (N) I_N-A_NA_N^*)}\xi_p \right\|{}^2_2 \rightarrow_{N \rightarrow +\infty} \frac{k_j\delta_{jl} (1-\sigma^2 c g_{\mu_{\sigma,\nu,c}}(\rho_{\theta_j}))}{\omega_{\sigma,\nu,c}^{\prime}(\rho_{\theta_j})}. \end{aligned} $$

(4.11)

Note that for any smooth functions h and f on $\mathbb {R}$, if v ₁, …, v _n are eigenvectors associated to $\lambda _1(A_NA_N^*), \ldots ,\lambda _n(A_NA_N^*)$ and w ₁, …, w _n are eigenvectors associated to λ ₁(M _N), …, λ _n(M _N), one can easily check that

$$\displaystyle \begin{aligned} \mathrm{Tr} \left[h(M_N) f(A_NA_N^*)\right] =\sum_{m,p=1}^n h(\lambda_p(M_N)) f(\lambda_m(A_NA_N^*)) \vert \langle v_m,w_p \rangle \vert^2. \end{aligned} $$

(4.12)

Thus, since α _l(N) on one hand and the k _j eigenvalues of M _N in $(\rho _{\theta _j} -\varepsilon ,\rho _{\theta _j}+\varepsilon [ )$ (for 𝜖 small enough) on the other hand, asymptotically separate from the rest of the spectrum of respectively $A_NA_N^*$ and M _N, a fit choice of h and f will allow the study of the restrictive sum $\sum _{p=1}^{k_j}\left \| P_{\ker (\alpha _l(N) I_N-A_NA_N^*)} \xi _p \right \|{ }^2_2$. Therefore proving (4.11) is reduced to the study of the asymptotic behaviour of $\mathrm {Tr}\left [h(M_N)f(A_NA_N^*)\right ]$ for some functions f and h respectively concentrated on a neighborhood of θ _l and $\rho _{\theta _j}$.

Step B

In the second, and final, step, we shall use a perturbation argument identical to the one used in [10] to reduce the problem to the case of a spike with multiplicity one, case that follows trivially from Step A.

Step B closely follows the lines of [10] whereas Step A requires substantial work. We first reduce the investigations to the mean Gaussian case by proving the following.

Proposition 4.2

Let X _N as defined in Sect. 4.1 . Let $\mathbb {G}_N = [\mathbb {G}_{ij}]_{1\leq i\leq n, 1\leq j\leq N}$ be a n × N random matrix with i.i.d. standard complex normal entries. Let h be a function in $\mathbb {C}^\infty (\mathbb {R}, \mathbb {R})$ with compact support, and Γ _N be a n × n Hermitian matrix such that

$$\displaystyle \begin{aligned} \sup_{n,N} \Vert \varGamma_N \Vert<\infty \mathit{\text{ and }} \sup_{n,N} \mathrm{rank} (\varGamma_N) <\infty.\end{aligned} $$

(4.13)

Then almost surely,

$\mathrm {Tr} \left (h\left (\left (\sigma \frac {X_N}{\sqrt {N}}+A_N\right )\left (\sigma \frac {X_N}{\sqrt {N}}+A_N\right )^*\right ) \varGamma _N\right )$

$$\displaystyle \begin{aligned}-\mathbb{E}\left(\mathrm{Tr} \left[h\left(\left(\sigma \frac{\mathbb{G}_N}{\sqrt{N}}+A_N\right)\left(\sigma \frac{\mathbb{G}_N}{\sqrt{N}}+A_N\right)^*\right) \varGamma_N\right] \right)\rightarrow_{N \rightarrow +\infty} 0.\end{aligned}$$

The asymptotic behaviour of $\mathbb {E}\Big (\mathrm {Tr} \Big [h\Big (\Big (\sigma \frac {\mathbb {G}_N}{\sqrt {N}}+A_N\Big )\Big (\sigma \frac {\mathbb {G}_N}{\sqrt {N}}+A_N\Big )^*\Big ) f(A_NA_N^*)\Big ] \Big )$ can be deduced, by using the bi-unitarily invariance of the distribution of $ \mathbb {G}_N$, from the following Proposition 4.3 and Lemma 4.18.

Proposition 4.3

Let $\mathbb {G}_N = [\mathbb {G}_{ij}]_{1\leq i \leq n, 1\leq j\leq N}$ be a n × N random matrix with i.i.d. complex standard normal entries. Assume that A _N is such that

$$\displaystyle \begin{aligned} A_N=\begin{pmatrix} d_1(N) ~~~~~~~~~~~~~~~~~~~~(0)\\ ~~~(0)\\ ~~~~~~~~~~\ddots~~~~~~~~~~~~~( 0)\\ ~(0)~~~~~~~~~~~~~~~~~~~~\\ ~~~~~~~~~~~~~~~~~d_{n}(N)~~~ ( 0 ) \end{pmatrix} \end{aligned} $$

(4.14)

where n = n(N), n ≤ N, c _N = n∕N →_N→+∞c ∈ ]0;1], for i = 1, …, n, $d_i(N) \in \mathbb {C}$ , sup_Nmax_i=1,…,n|d _i(N)| < +∞ and $\frac {1}{n} \sum _{i=1}^n \delta _{\vert d_i(N)\vert ^2}$ weakly converges to a compactly supported probability measure ν on $\mathbb {R}$ when N goes to infinity. Define for all $z\in \mathbb {C}\setminus \mathbb {R}$ ,

$$\displaystyle \begin{aligned}G^{\mathbb{G}}_N(z) =\left (zI - \left(\sigma \frac{\mathbb{ G}_N}{\sqrt{N}}+A_N\right)\left(\sigma \frac{\mathbb{G}_N}{\sqrt{N}}+A_N\right)^*\right)^{-1}.\end{aligned}$$

Define for any q = 1, …, n,

$$\displaystyle \begin{aligned}\gamma_q(N) =(A_NA_N^*)_{qq} =\vert d_q(N)\vert^2. \end{aligned} $$

(4.15)

There is a polynomial P with nonnegative coefficients, a sequence (u _N)_N of nonnegative real numbers converging to zero when N goes to infinity and some nonnegative real number l, such that for any (p, q) in {1, …, n}², for all $z\in \mathbb {C}\setminus \mathbb {R}$ ,

$$\displaystyle \begin{aligned} \mathbb{E} \left(\left( G^{\mathbb{G}}_N(z)\right)_{pq}\right) = \frac{1- \sigma^2 cg_{\mu_{\sigma,\nu,c}}(z)}{\omega_{\sigma, \nu, c}(z) -\gamma_q(N)} \delta_{pq} +\varDelta_{p,q,N}(z), \end{aligned} $$

(4.16)

with

$$\displaystyle \begin{aligned}\left| \varDelta_{p,q,N} (z)\right| \leq (1+\vert z\vert)^l P(\vert \Im z \vert^{-1})u_N.\end{aligned}$$

4.3 Proof of Proposition 4.2

In the following, we will denote by o _C(1) any deterministic sequence of positive real numbers depending on the parameter C and converging for each fixed C to zero when N goes to infinity. The aim of this section is to prove Proposition 4.2.

Define for any C > 0,

(4.17)

Set

$$\displaystyle \begin{aligned}\theta^*=\sup_{(i,j)\in \mathbb{N}^2}\mathbb{E}(\vert X_{ij}\vert^3)<+\infty.\end{aligned}$$

We have

so that

$$\displaystyle \begin{aligned}\sup_{i\geq 1,j\geq 1}\mathbb{E} \left( \vert X_{ij}-Y_{ij}^C\vert^2 \right) \leq \frac{2\theta^*}{C}.\end{aligned}$$

Note that

so that

$$\displaystyle \begin{aligned}\sup_{i\geq 1, j\geq 1}\vert 1 - 2\mathbb{E} \left( \vert \Re Y_{ij}^C\vert^2 \right) \vert \leq \frac{4\theta^*}{C}.\end{aligned}$$

Similarly

$$\displaystyle \begin{aligned}\sup_{i\geq 1,j\geq 1}\vert 1 - 2\mathbb{E} \left( \vert \Im Y_{ij}^C\vert^2 \right) \vert \leq \frac{4\theta^*}{C}.\end{aligned}$$

Let us assume that C > 8θ ^∗. Then, we have

$$\displaystyle \begin{aligned}\mathbb{E} \left( \vert \Re Y_{ij}^C\vert^2 \right)> \frac{1}{4} \; \mbox{and}\; \mathbb{E} \left( \vert \Im Y_{ij}^C\vert^2 \right)> \frac{1}{4}.\end{aligned}$$

Define for any C > 8θ ^∗, $X^C=(X^C_{ij})_{1\leq i \leq n; 1\leq j \leq N},$ where for any 1 ≤ i ≤ n, 1 ≤ j ≤ N,

$$\displaystyle \begin{aligned}{X}_{ij}^C =\frac{\Re Y_{ij}^C}{\sqrt{2\mathbb{E} \left( \vert \Re Y_{ij}^C\vert^2 \right)}} +\mathrm{i} \frac{\Im Y_{ij}^C}{\sqrt{2\mathbb{E} \left( \vert \Im Y_{ij}^C\vert^2 \right)}}.\end{aligned} $$

(4.18)

Let $\mathbb {G} = [\mathbb {G}_{ij}]_{1\leq i\leq n, 1\leq j \leq N}$ be a n × N random matrix with i.i.d. standard complex normal entries, independent from X _N, and define for any α > 0,

$$\displaystyle \begin{aligned}X^{\alpha,C}= \frac{ X^C +\alpha \mathbb{G}}{\sqrt{1+\alpha^2}}.\end{aligned}$$

Now, for any n × N matrix B, let us introduce the (N + n) × (N + n) matrix

$$\displaystyle \begin{aligned}\mathbb{M}_{N+n}(B) =\left( \begin{array}{ll} 0_{n\times n}~~~ B+A_N\\ B^* +A_N^*~~~ 0_{N\times N} \end{array} \right).\end{aligned}$$

Define for any $z\in \mathbb {C}\setminus \mathbb {R}$,

$$\displaystyle \begin{aligned}\tilde G(z) = \left( z I_{N+n} - \mathbb{M}_{N+n}\left(\sigma \frac{X_N}{\sqrt{N}}\right)\right)^{-1},\end{aligned}$$

and

$$\displaystyle \begin{aligned}\tilde G^{\alpha,C}(z) = \left( z I_{N+n} - \mathbb{M}_{N+n}\left(\sigma \frac{X^{\alpha,C}}{\sqrt{N}}\right)\right)^{-1} .\end{aligned}$$

Denote by $\mathbb {U}(n+N)$ the set of unitary (n + N) × (n + N) matrices. We first establish the following approximation result.

Lemma 4.1

There exist some positive deterministic functions u and v on [0, +∞[ such that lim_C→+∞u(C) = 0 and lim_α→0v(α) = 0, and a polynomial P with nonnegative coefficients such that for any α and C > 8θ ^∗, we have that

almost surely, for all large N,
$$\displaystyle \begin{aligned} &\displaystyle{ \sup_{U\in \mathbb{U}(n+N)}\sup_{(i,j)\in \{1,\ldots,n+N\}^2}\sup_{z\in \mathbb{C}\setminus \mathbb{R} } |\Im z |{}^{2} \left| (U^*\tilde{G}^{\alpha,C}(z)U)_{ij}- (U^*\tilde{{G}}(z)U)_{ij}\right|} \\ & \quad \leq u(C)+v(\alpha),{} \end{aligned} $$
(4.19)
for all large N,
$$\displaystyle \begin{aligned} & \displaystyle{ \sup_{U\in \mathbb{U}(n+N)} \sup_{(i,j)\in \{1,\ldots,n+N\}^2} \sup_{z\in \mathbb{C}\setminus \mathbb{R} } \frac{1}{ P(|\Im z |{}^{-1}) }} \\ &\quad \displaystyle{\times\left| \mathbb{E} \left( (U^*\tilde{G}^{\alpha,C}(z)U)_{ij}- (U^*\tilde{{G}}(z)U)_{ij}\right)\right|} \\ & \qquad \leq u(C)+v(\alpha)+ o_{C}(1). {} \end{aligned} $$
(4.20)

Proof

Note that

$$\displaystyle \begin{aligned} \begin{array}{rcl} {X}_{ij}^C-Y_{ij}^C&\displaystyle =&\displaystyle \Re X_{ij}^C \left( 1-\sqrt{2} \mathbb{E} \left( \vert \Re Y_{ij}^C\vert^2 \right)^{1/2}\right) +\mathrm{i} \Im X_{ij}^C \left( 1-\sqrt{2} \mathbb{E} \left( \vert \Im Y_{ij}^C\vert^2 \right)^{1/2}\right) \\&\displaystyle =&\displaystyle \Re X_{ij}^C\frac{1 - 2\mathbb{E} \left( \vert \Re Y_{ij}^C\vert^2 \right) }{1 + \sqrt{2}\mathbb{E} \left( \vert \Re Y_{ij}^C\vert^2 \right)^{1/2}}+ \mathrm{i} \Im X_{ij}^C\frac{1 - 2\mathbb{E} \left( \vert \Im Y_{ij}^C\vert^2 \right) }{1 + \sqrt{2}\mathbb{E} \left( \vert \Im Y_{ij}^C\vert^2 \right)^{1/2}}.\end{array} \end{aligned} $$

Then,

$$\displaystyle \begin{aligned}\left\{\sup_{(i,j)\in \mathbb{N}^2}\mathbb{E} \left( \vert X^C_{ij}-Y_{ij}^C\vert^2 \right)\right\}^{1/2} \leq \frac{4\theta^*}{C}, \mbox{ and }\sup_{(i,j)\in \mathbb{N}^2}\mathbb{E} \left( \vert X_{ij}^C-Y_{ij}^C\vert^3 \right) <\infty.\end{aligned}$$

It is straightforward to see, using Lemma 4.17, that for any unitary (n + N) × (n + N) matrix U,

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle &\displaystyle \left| (U^*\tilde{G}^{\alpha,C}(z)U)_{ij}- (U^*\tilde{{G}}(z)U)_{ij}\right| \\ &\displaystyle &\displaystyle \quad \leq \frac{\sigma}{\vert \Im z \vert^2} \left\| \frac{X_N-{X}^{\alpha,C}}{\sqrt{N}} \right\| \\ &\displaystyle &\displaystyle \quad \leq \frac{\sigma}{\vert \Im z \vert^2} \left\{ \left\| \frac{X_N-Y^C}{\sqrt{N}} \right\| + \left\| \frac{{X}^C-Y^C}{\sqrt{N}} \right\|\right. \\&\displaystyle &\displaystyle \qquad \left. + \left(1- \frac{1}{\sqrt{1+\alpha^2}}\right) \left\| \frac{ X^C}{\sqrt{N}} \right\| +\alpha \left\| \frac{\mathbb{G}}{\sqrt{N}} \right\| \right\} {}. \end{array} \end{aligned} $$

(4.21)

From Bai-Yin’s theorem (Theorem 5.8 in [2]), we have

$$\displaystyle \begin{aligned}\left\| \frac{\mathbb{G}}{\sqrt{N}} \right\|=2+o(1).\end{aligned}$$

Applying Remark 4.3 to the (n + N) × (n + N) matrix $\tilde B= \left ( \begin {array}{ll} 0_{n\times n}~~~ B\\ B^*~~~ 0_{N\times N} \end {array} \right )$ for B ∈{X _N − Y ^C, X ^C − Y ^C, X ^C} (see also Appendix B of [14]), we have that almost surely

$$\displaystyle \begin{aligned}\limsup_{N\rightarrow +\infty}\left\| \frac{{X^C}}{\sqrt{N}} \right\|\leq 2 \sqrt{2} ,~\limsup_{N\rightarrow +\infty}\left\| \frac{{X}^C-Y^C}{\sqrt{N}} \right\|\leq \frac{8\sqrt{2}\theta^*}{C},\;\end{aligned}$$

and

$$\displaystyle \begin{aligned}\limsup_{N\rightarrow +\infty}\left\| \frac{{X_N}-Y^C}{\sqrt{N}} \right\|\leq 4 \sqrt{ \frac{\theta^*}{C}}. \end{aligned}$$

Then, (4.19) readily follows.

Let us introduce

$$\displaystyle \begin{aligned}\varOmega_{N,C}=\left\{ \left\| \frac{\mathbb{G}}{\sqrt{N}} \right\| \leq 4, \left\| \frac{X^C}{\sqrt{N}} \right\| \leq 4, \left\| \frac{X_N-Y^C}{\sqrt{N}} \right\| \leq 8 \sqrt{ \frac{\theta^*}{C}}, \left\| \frac{{X}^C-Y^C}{\sqrt{N}} \right\|\leq \frac{16\theta^*}{C} \right\}. \end{aligned}$$

Using (4.21), we have

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle &\displaystyle \left| \mathbb{E} \left( (U^*\tilde{{G}}^{\alpha,C}(z)U)_{ij}-(U^*\tilde{G}(z)U)_{ij}\right)\right| \\ &\displaystyle &\displaystyle \quad \leq \frac{4\sigma }{\vert \Im z \vert^2} \left[ 2 \sqrt{ \frac{\theta^*}{C}}+ \frac{4\theta^*}{C}+ \alpha +\left( 1-\frac{1}{\sqrt{1+\alpha^2}}\right) \right] \\ &\displaystyle &\displaystyle \qquad + \frac{2}{\vert \Im z \vert} \mathbb{P}(\varOmega_{N,C}^c). \end{array} \end{aligned} $$

Thus (4.20) follows.

Now, Lemmas 4.18, 4.1 and 4.19 readily yields the following approximation lemma.

Lemma 4.2

Let h be in $\mathbb {C}^\infty (\mathbb {R}, \mathbb {R})$ with compact support and $\tilde \varGamma _N$ be a (n + N) × (n + N) Hermitian matrix such that

$$\displaystyle \begin{aligned} \sup_{n,N} \Vert \tilde \varGamma_N \Vert<\infty \mathit{\text{ and }} \sup_{n,N} \mathrm{rank} (\tilde \varGamma_N) <\infty.\end{aligned} $$

(4.22)

Then, there exist some deterministic functions on [0, +∞[, u and v, such that lim_C→+∞u(C) = 0 and lim_α→0v(α) = 0, such that for all C > 0, α > 0, we have almost surely for all large N,

$$\displaystyle \begin{aligned} \left|\mathrm{Tr} \left[ h\left((\mathbb{M}_{N+n}\left(\frac{X^{\alpha,C}}{\sqrt{N}}\right)\right) \tilde \varGamma_N\right] - \mathrm{Tr} \left[ h\left((\mathbb{M}_{N+n}\left(\frac{X_N}{\sqrt{N}}\right)\right) \tilde \varGamma_N\right]\right| \leq a^{(1)}_{C,\alpha},\end{aligned} $$

(4.23)

and for all large N,

$$\displaystyle \begin{aligned} \left|\mathbb{E}\mathrm{Tr} \left[ h\left((\mathbb{M}_{N+n}\left(\frac{X^{\alpha,C}}{\sqrt{N}}\right)\right) \tilde \varGamma_N\right] - \mathbb{E}\mathrm{Tr} \left[ h\left((\mathbb{M}_{N+n}\left(\frac{X_N}{\sqrt{N}}\right)\right) \tilde \varGamma_N\right]\right| \leq a^{(2)}_{C,\alpha,N}, \end{aligned} $$

(4.24)

where

$$\displaystyle \begin{aligned}a^{(1)}_{C,\alpha}= u(C)+v(\alpha),\; a^{(2)}_{C,\alpha,N}= u(C)+v(\alpha) + o_{C}(1).\end{aligned}$$

Note that the distributions of the independent random variables $\Re (X_{ij}^{\alpha ,C})$, $\Im (X_{ij}^{\alpha ,C})$ are all a convolution of a centred Gaussian distribution with some variance v _α, with some law with bounded support in a ball of some radius R _C,α; thus, according to Lemma 4.20, they satisfy a Poincaré inequality with some common constant C _PI(C, α) and therefore so does their product (see Appendix 2). An important consequence of the Poincaré inequality is the following concentration result.

Lemma 4.3

Lemma 4.4.3 and Exercise 4.4.5 in [ 1 ] or Chapter 3 in [ 24 ]. There exists K ₁ > 0 and K ₂ > 0 such that for any probability measure $\mathbb {P}$ on $\mathbb {R^M}$ which satisfies a Poincaré inequality with constant C _PI, and for any Lipschitz function F on $\mathbb {R}^M$ with Lipschitz constant |F|_Lip, we have

$$\displaystyle \begin{aligned}\forall \epsilon> 0, \, \mathbb{P}\left( \vert F-\mathbb{E}_{\mathbb{P}}(F) \vert > \epsilon \right) \leq K_1 \exp\left(-\frac{\epsilon}{K_2 \sqrt{C_{PI}} \vert F \vert_{Lip}}\right).\end{aligned}$$

In order to apply Lemma 4.3, we need the following preliminary lemmas.

Lemma 4.4 (See Lemma 8.2 [10])

Let f be a real $C_{\mathbb {L}}$ -Lipschitz function on $\mathbb {R}$ . Then its extension on the N × N Hermitian matrices is $C_{\mathbb {L}}$ -Lipschitz with respect to the Hilbert-Schmidt norm.

Lemma 4.5

Let $\tilde \varGamma _N $ be a (n + N) × (n + N) matrix and h be a real Lipschitz function on $\mathbb {R}$ . For any n × N matrix B,

$$\displaystyle \begin{aligned} \left\{\left(\Re B(i,j),~ \Im B(i,j)\right)_{1\leq i \leq n, 1 \leq j \leq N}\right\} \mapsto Tr \left[ h\left((\mathbb{ M}_{N+n}\left(B\right)\right) \tilde \varGamma_N\right] \end{aligned}$$

is Lipschitz with constant bounded by $\sqrt {2} \left \| \tilde \varGamma _N \right \|{ }_2 \Vert h \Vert _{Lip}$.

Proof

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle &\displaystyle \left| \mathrm{Tr} \left[h (\mathbb{M}_{N+p}(B)) \tilde \varGamma_N\right]- \mathrm{Tr} \left[ h (\mathbb{M}_{N+p}(B^{\prime}))\tilde \varGamma_N\right]\right| \\ &\displaystyle &\displaystyle \quad \leq \left\| \tilde \varGamma_N \right\|{}_2 \left\| h (\mathbb{M}_{N+p}(B))- h (\mathbb{M}_{N+p}(B^{\prime}))\right\|{}_2 \\ &\displaystyle &\displaystyle \quad \leq \left\| \tilde \varGamma_N \right\|{}_2 \left\|h \right\|{}_{Lip} \left\| \mathbb{M}_{N+p}(B)-\mathbb{M}_{N+p}(B^{\prime}) \right\|{}_2{} \end{array} \end{aligned} $$

(4.25)

where we used Lemma 4.4 in the last line. Now,

$$\displaystyle \begin{aligned}\left\| \mathbb{M}_{N+p}(B)-\mathbb{M}_{N+p}(B^{\prime}) \right\|{}^2_2= 2 \left\| B-B^{\prime}\right\|{}^2_2.{} \end{aligned} $$

(4.26)

Lemma 4.5 readily follows from (4.25) and (4.26).

Lemma 4.6

Let $\tilde \varGamma _N $ be a (n + N) × (n + N) matrix such that $ \sup _{N,n} \left \| \tilde \varGamma _N \right \|{ }_2 \leq K$ . Let h be a real Lipschitz function on $\mathbb {R}$. $F_N=\mathrm {Tr} \left [ h\left ( \mathbb {M}_{N+p}\left ( \frac {X^{\alpha ,C}}{\sqrt {N}}\right ) \right ) \tilde \varGamma _N\right ]$ satisfies the following concentration inequality

$$\displaystyle \begin{aligned}\forall \epsilon> 0, \, \mathbb{P}\left( \vert F_N-\mathbb{E}(F_N) \vert > \epsilon \right) \leq K_1 \exp\left(-\frac{\epsilon \sqrt{ N}}{K_2(\alpha,C) K \Vert h \Vert_{Lip}}\right),\end{aligned}$$

for some positive real numbers K ₁ and K ₂(α, C).

Proof

Lemma 4.6 follows from Lemmas 4.5 and 4.3 and basic facts on Poincaré inequality recalled at the end of Appendix 2.

By Borel-Cantelli’s Lemma, we readily deduce from the above Lemma the following

Lemma 4.7

Let $\tilde \varGamma _N $ be a (n + N) × (n + N) matrix such that $ \sup _{N,n} \left \| \tilde \varGamma _N \right \|{ }_2 \leq K$ . Let h be a real $\mathbb {C}^1$ - function with compact support on $\mathbb {R}$.

$$\displaystyle \begin{aligned} &\mathrm{Tr} \left[ h\left( \mathbb{M}_{N+p}\left( \sigma \frac{X^{\alpha,C}}{\sqrt{N}}\right) \right)\tilde \varGamma_N\right]- \mathbb{E}\left[ \mathrm{Tr} \left[ h\left( \mathbb{M}_{N+p}\left( \sigma \frac{X^{\alpha,C}}{\sqrt{N}}\right) \right) \tilde \varGamma_N\right]\right] \\ & \quad \stackrel{a.s}{\longrightarrow}_{N\rightarrow +\infty}0.{} \end{aligned} $$

(4.27)

Now, we will establish a comparison result with the Gaussian case for the mean values by using the following lemma (which is an extension of Lemma 4.10 below to the non-Gaussian case) as initiated by Khorunzhy et al. [22] in Random Matrix Theory.

Lemma 4.8

Let ξ be a real-valued random variable such that $\mathbb {E}(\vert \xi \vert ^{p+2}) < \infty $ . Let ϕ be a function from $\mathbb {R}$ to $\mathbb {C}$ such that the first p + 1 derivatives are continuous and bounded. Then,

$$\displaystyle \begin{aligned} \mathbb{E} (\xi \phi (\xi )) = \sum_{a=0}^p \frac{\kappa _{a+1}}{a!}\mathbb{E} (\phi ^{(a)}(\xi )) + \epsilon , \end{aligned} $$

(4.28)

where κ _a are the cumulants of ξ, $\vert \epsilon \vert \leq K \sup _t \vert \phi ^{(p+1)}(t)\vert \mathbb {E} (\vert \xi \vert ^{p+2})$ , K only depends on p.

Lemma 4.9

Let $\mathbb {G}_N = [\mathbb {G}_{ij}]_{1\leq i\leq n, 1\leq j\leq N}$ be a n × N random matrix with i.i.d. complex N(0, 1) Gaussian entries. Define

$$\displaystyle \begin{aligned}\tilde G^{{ \mathbb{G}}}(z)= \left( zI_{N+n}- \mathbb{M}_{N+n}\left(\sigma \frac{\mathbb{G}_N}{\sqrt{N}}\right) \right)^{-1}\end{aligned}$$

for any $z\in \mathbb {C}\setminus \mathbb {R}.$ There exists a polynomial P with nonnegative coefficients such that for all large N, for any (i, j) ∈{1, …, n + N}², for any $z\in \mathbb {C}\setminus \mathbb {R}$ , for any unitary (n + N) × (n + N) matrix U,

$$\displaystyle \begin{aligned} \left| \mathbb{E} \left[(U^*\tilde G^{{ \mathbb{G}}}(z)U)_{ij}\right]- \mathbb{E}\left[(U^*\tilde G(z)U)_{ij}\right]\right| \leq \frac{1}{\sqrt{N}}P(\left|\Im z \right|{}^{-1}).\end{aligned} $$

(4.29)

Moreover, for any (N + n) × (N + n) matrix $\tilde \varGamma _N$ such that

$$\displaystyle \begin{aligned} \sup_{n,N} \Vert \tilde \varGamma_N \Vert<\infty \mathit{\text{ and }} \sup_{n,N} \mathrm{rank} (\tilde \varGamma_N) <\infty,\end{aligned} $$

(4.30)

and any function h in $\mathbb {C}^\infty (\mathbb {R}, \mathbb {R})$ with compact support, there exists some constant K > 0 such that, for any large N,

$$\displaystyle \begin{aligned} & \left| \mathbb{E}\left[ \mathrm{Tr} \left[ h\left( \mathbb{M}_{N+n}\left( \sigma \frac{X_N}{\sqrt{N}}\right) \right)\tilde \varGamma_N\right]\right]- \mathbb{E}\left[ \mathrm{Tr} \left[ h\left( \mathbb{M}_{N+n}\left( \sigma \frac{\mathbb{G}_N}{\sqrt{N}}\right) \right)\tilde \varGamma_N\right]\right] \right| \\ & \quad \leq \frac{K}{\sqrt{N}}.{} \end{aligned} $$

(4.31)

Proof

We follow the approach of [26] chapters 18 and 19 consisting in introducing an interpolation matrix $X_N(\alpha )= \cos \alpha X_N + \sin \alpha \mathbb {G}_N$ for any α in $[0;\frac {\pi }{2}]$ and the corresponding resolvent matrix $\tilde G(\alpha ,z)= \left ( zI_{N+n}- \mathbb {M}_{N+n}\left (\sigma \frac {X_N(\alpha )}{\sqrt {N}}\right ) \right )^{-1}$ for any $z\in \mathbb {C}\setminus \mathbb {R}.$ We have, for any (s, t) ∈{1, …, n + N}²,

$$\displaystyle \begin{aligned}\mathbb{E}\tilde G^{{ \mathbb{G}}}_{st}(z)- \mathbb{E} \tilde G_{st}(z)=\int_0^{\frac{\pi}{2}} \mathbb{E} \left( \frac{ \partial }{\partial \alpha} \tilde G_{st}(\alpha,z)\right) d\alpha\end{aligned}$$

with

$$\displaystyle \begin{aligned} \begin{array}{rcl} \frac{ \partial }{\partial \alpha} \tilde G_{st}(\alpha,z)&\displaystyle = &\displaystyle \frac{\sigma}{2\sqrt{N}}\sum_{l=1}^n \sum_{k=n+1}^{n+N }\left\{ \left[ \tilde G_{sl} (\alpha,z) \tilde G_{kt} (\alpha,z)+ \tilde G_{sk}(\alpha,z) \tilde G_{l t}(\alpha,z) \right]\right.\\ &\displaystyle &\displaystyle \times \left[ -\sin \alpha \Re X_{l(k-n)}+\cos \alpha \Re \mathbb{G}_{l(k-n)}\right] \\ &\displaystyle &\displaystyle \left.+i \left[ \tilde G_{sl}(\alpha,z) \tilde G_{kt} (\alpha,z)- \tilde G_{sk} (\alpha,z)\tilde G_{l t}(\alpha,z) \right]\right.\\ &\displaystyle &\displaystyle \left.\times \left[-\sin \alpha \Im X_{l(k-n)} +\cos \alpha \Im \mathbb{G}_{l(k-n)} \right]\right\}. \end{array} \end{aligned} $$

Now, for any l = 1, …, n and k = n + 1, …, n + N, using Lemma 4.8 for p = 1 and for each random variable ξ in the set $\left \{\Re X_{l(k-n)}, \Re \mathbb {G}_{l(k-n)},\Im X_{l(k-n)}, \Im \mathbb {G}_{l(k-n)} \right \} $, and for each ϕ in the set

$$\displaystyle \begin{aligned}\left\{ (U^*\tilde G (\alpha ,z))_{ip}(\tilde G(\alpha,z)U)_{q j} ; (p,q)=(l,k) \mbox{ or }(k,l), (i,j)\in \{1,\ldots,n+N\}^2\right\},\end{aligned}$$

one can easily see that there exists some constant K > 0 such that

$$\displaystyle \begin{aligned}\left| \mathbb{E} (U^* \tilde G^{{ \mathbb{G}}}(z)U)_{ij}- \mathbb{E}(U^*\tilde G(z)U)_{ij} \right| \leq \frac{ K}{N^{3/2}}\sup_{Y \in \mathbb{H}_{n+N}(\mathbb{C})} \sup_{V\in \mathbb{U}(n+N)} S_V(Y)\end{aligned}$$

where $\mathbb {H}_{n+N}(\mathbb {C})$ denotes the set of (n + N) × (n + N) Hermitian matrices and S _V(Y ) is a sum of a finite number independent of N and n of terms of the form

$$\displaystyle \begin{aligned}\sum_{l=1}^n \sum_{k=n+1}^{n+N } \left|\left(U^*R(Y)\right)_{ip_1}\left(R(Y)\right)_{p_2p_3}\left(R(Y)\right)_{p_4p_5}\left(R(Y)U\right)_{p_6j} \right|\end{aligned} $$

(4.32)

with $R(Y)=\left (zI_{N+n}- Y\right )^{-1}$ and {p ₁, …, p ₆} contains exactly three k and three l.

When (p ₁, p ₆) = (k, l) or (l, k), then, using Lemma 4.17,

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle &\displaystyle \sum_{l=1}^n \sum_{k=n+1}^{n+N }\left| \left(U^*R(Y)\right)_{ip_1}\left(R(Y)\right)_{p_2p_3}\left(R(Y)\right)_{p_4p_5}\left(R(Y)U\right)_{p_6j}\right| \\ &\displaystyle &\displaystyle \quad \leq \frac{1}{\vert \Im z \vert^2} \sum_{k,l =1}^{n+N} \left|\left(U^*R(Y)\right)_{il}\left(R(Y)U\right)_{kj}\right| \\ &\displaystyle &\displaystyle \quad \leq \frac{(N+n)}{\vert \Im z \vert^2} \left(\sum_{l =1}^{n+N} \left|\left(U^*R(Y)\right)_{il}\right|{}^2 \right)^{1/2}\left(\sum_{k =1}^{n+N} \left|\left(R(Y)U\right)_{kj}\right|{}^2 \right)^{1/2}\\ &\displaystyle &\displaystyle \quad =\frac{(N+n)}{\vert \Im z \vert^2} \left( \left(U^*R(Y)R(Y)^*U\right)_{ii} \right)^{1/2}\left( \left(U^*R(Y)^*R(Y)U\right)_{jj} \right)^{1/2}\\ &\displaystyle &\displaystyle \quad \leq \frac{(N+n)}{\vert \Im z \vert^4} \end{array} \end{aligned} $$

When p ₁ = p ₆ = k or l, then, using Lemma 4.17,

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle &\displaystyle \sum_{l=1}^n \sum_{k=n+1}^{n+N }\left| \left(U^*R(Y)\right)_{ip_1}\left(R(Y)\right)_{p_2p_3}\left(R(Y)\right)_{p_4p_5}\left(R(Y)U\right)_{p_6j}\right|\\ &\displaystyle &\displaystyle \quad \leq \frac{N+n}{\vert \Im z \vert^2} \sum_{l =1}^{n+N} \left|\left(U^*R(Y)\right)_{il}\left(R(Y)U\right)_{lj}\right| \\ &\displaystyle &\displaystyle \quad \leq \frac{(N+n)}{\vert \Im z \vert^2} \left(\sum_{l =1}^{n+N} \left|\left(U^*R(Y)\right)_{il}\right|{}^2 \right)^{1/2}\left(\sum_{l =1}^{n+N} \left|\left(R(Y)U\right)_{lj}\right|{}^2 \right)^{1/2}\\ &\displaystyle &\displaystyle \quad = \frac{(N+n)}{\vert \Im z \vert^2} \left( \left(U^*R(Y)R(Y)^*U\right)_{ii} \right)^{1/2}\left( \left(U^*R(Y)^*R(Y)U\right)_{jj} \right)^{1/2}\\ &\displaystyle &\displaystyle \quad \leq \frac{(N+n)}{\vert \Im z \vert^4} \end{array} \end{aligned} $$

(4.29) readily follows.

Then by Lemma 4.19, there exists some constant K > 0 such that, for any N and n, for any (i, j) ∈{1, …, n + N}², any unitary (n + N) × (n + N) matrix U,

$$\displaystyle \begin{aligned}\limsup_{y \rightarrow 0^+} \left| \int \left[ \mathbb{E}(U^*\tilde G(t+\mathrm{i} y)U)_{ij}- \mathbb{E}(U^* {\tilde G^{{ \mathbb{G}}}}(t+\mathrm{i} y)U)_{ij}\right] h(t) dt \right| \leq \frac{ K}{\sqrt{N}}.\end{aligned} $$

(4.33)

Thus, using (4.97) and (4.30), we can deduce (4.31) from (4.33).

The above comparison lemmas allow us to establish the following convergence result.

Proposition 4.4

Let h be a function in $\mathbb {C}^\infty (\mathbb {R}, \mathbb {R})$ with compact support and let $\tilde \varGamma _N $ be a (n + N) × (n + N) matrix such that $ \sup _{n,N} \mathrm {rank} (\tilde \varGamma _N) <\infty $ and $ \sup _{n,N} \Vert \tilde \varGamma _N \Vert <\infty $ . Then we have that almost surely

$$\displaystyle \begin{aligned} &Tr \left[ h\left((\mathbb{M}_{N+n}\left(\sigma \frac{X_N}{\sqrt{N}}\right)\right) \tilde \varGamma_N\right]- \mathbb{E}\left[Tr \left[ h\left((\mathbb{M}_{N+n}\left(\sigma \frac{\mathbb{G}_N}{\sqrt{N}}\right)\right) \tilde \varGamma_N\right] \right] \\ & \quad {\longrightarrow}_{N\rightarrow +\infty}0. \end{aligned} $$

(4.34)

Proof

Lemmas 4.2, 4.7 and 4.9 readily yield that there exist some positive deterministic functions u and v on [0, +∞[ with lim_C→+∞u(C) = 0 and lim_α→0v(α) = 0, such that for any C > 0 and any α > 0, almost surely

$$\displaystyle \begin{aligned} &\limsup_{N \rightarrow +\infty} \left| Tr \left[ h\left((\mathbb{M}_{N+n}\left(\sigma \frac{X_N}{\sqrt{N}}\right)\right) \tilde \varGamma_N\right]{-}\, \mathbb{E}\left[Tr \left[ h\left((\mathbb{M}_{N+n}\left(\sigma \frac{\mathbb{G}_N}{\sqrt{N}}\right)\right) \tilde \varGamma_N\right] \right] \right| \\ & \quad \leq u(C) +v(\alpha). \end{aligned} $$

The result follows by letting α go to zero and C go to infinity.

Now, note that, for any N × n matrix B, for any continuous real function h on $\mathbb {R}$, and any n × n Hermitian matrix Γ _N, we have

$$\displaystyle \begin{aligned}Tr \left(h\left((B+A_N)(B+A_N)^*\right) \varGamma_N\right)=Tr \left[\tilde h\left(\mathbb{M}_{N+n}\left(B\right)\right) \tilde \varGamma_N\right]\end{aligned}$$

where $\tilde h(x)=h(x^2)$ and $ \tilde \varGamma _N= \begin {pmatrix} \varGamma _N & (0)\\ (0) & (0) \end {pmatrix}$. Thus, Proposition 4.4 readily yields Proposition 4.2.

4.4 Proof of Proposition 4.3

The aim of this section is to prove Proposition 4.3 which deals with Gaussian random variables.Therefore we assume here that A _N is as (4.14) and set $\gamma _q(N)=(A_N A_N^*)_{qq}$. In this section, we let X stand for $\mathbb {G}_N$, A stands for A _N, G denotes the resolvent of M _N = ΣΣ ^∗ where $\varSigma =\sigma \frac {\mathbb {G}_N}{\sqrt {N}}+A_N$ and g _N denotes the mean of the Stieltjes transform of the spectral measure of M _N, that is

$$\displaystyle \begin{aligned}g_N(z) = \mathbb{E}\left(\frac{1}{n} Tr G(z)\right), \, z \in \mathbb{C}\setminus \mathbb{R}.\end{aligned}$$

4.4.1 Matricial Master Equation

To obtain Eq. (4.35) below, we will use many ideas from [17]. The following Gaussian integration by part formula is the key tool in our approach.

Lemma 4.10 (Lemma 2.4.5 [1])

Let ξ be a real centered Gaussian random variable with variance 1. Let Φ be a differentiable function with polynomial growth of Φ and Φ′. Then,

$$\displaystyle \begin{aligned}\mathbb{E} \left( \xi \varPhi(\xi) \right) = \mathbb{E} \left( \varPhi^{\prime}(\xi) \right).\end{aligned}$$

Proposition 4.5

Let z be in $\mathbb {C}\setminus \mathbb {R}$ . We have for any (p, q) in {1, …, n}²,

(4.35)

where

$$\displaystyle \begin{aligned} \nabla_{pq} = \frac{1}{1- \sigma^2 c_N g_N}\left\{ \frac{\sigma^2}{N} \frac{ \mathbb{E}\left( G_{pq}\right) }{1-\sigma^2 c_N g_N} \varDelta_3 + \varDelta_2(p,q) + \varDelta_1(p,q).\right\},\end{aligned} $$

(4.36)

$$\displaystyle \begin{aligned}\varDelta_{1}(p,q) = {\sigma^2}\mathbb{E}\left\{ \left[\frac{1}{N} Tr G - \mathbb{E}\left(\frac{1}{N} Tr G \right)\right] (G\varSigma \varSigma^*)_{pq} \right\} ,\end{aligned} $$

(4.37)

$$\displaystyle \begin{aligned}\varDelta_{2}(p,q) = \frac{\sigma^2}{N}\mathbb{E}\left\{ Tr(GA\varSigma^* ) \left[G_{pq} - \mathbb{E}\left(G_{pq}\right) \right] \right\} ,\end{aligned} $$

(4.38)

$$\displaystyle \begin{aligned}\varDelta_{3} = {\sigma^2}\mathbb{E}\left\{ \left[\frac{1}{N} Tr G - \mathbb{E}\left(\frac{1}{N} Tr G \right)\right] Tr (\varSigma^*GA ) \right\} .\end{aligned} $$

(4.39)

Proof

Using Lemma 4.10 with $\xi =\Re X_{ij} $ or ξ = ℑX _ij and $\varPhi = G_{pi} \overline {\varSigma _{qj}}$, we obtain that for any j, q, p,

$$\displaystyle \begin{aligned} \begin{array}{rcl} \mathbb{E}\left[ \left( G \frac{\sigma X}{\sqrt{N}}\right)_{pj} \overline{\varSigma_{qj}} \right] &\displaystyle =&\displaystyle \sum_{i=1}^n \mathbb{E}\left[ G_{pi} \frac{\sigma X_{ij} }{\sqrt{N}}\overline{\varSigma_{qj}} \right] \end{array} \end{aligned} $$

(4.40)

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle =&\displaystyle \frac{\sigma^2}{N} \sum_{i=1}^n \mathbb{E}\left[ \left( G \varSigma\right)_{pj} G_{ii} \overline{\varSigma_{qj}} \right] + \frac{\sigma^2}{N} \mathbb{E}(G_{pq}) \end{array} \end{aligned} $$

(4.41)

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle =&\displaystyle \frac{\sigma^2}{N} \mathbb{E}\left[ \left( Tr G\right) \left(G \varSigma\right)_{pj} \overline{\varSigma_{qj}} \right] + \frac{\sigma^2}{N} \mathbb{E}(G_{pq}). {} \end{array} \end{aligned} $$

(4.42)

On the other hand, we have

$$\displaystyle \begin{aligned} \begin{array}{rcl} \mathbb{E}\left[ \left( G A \right)_{pj} \overline{\varSigma_{qj}} \right] &\displaystyle =&\displaystyle \mathbb{E}\left[ \left( G A \right)_{pj} \overline{A_{qj}} \right] +\sum_{i=1}^n \mathbb{E}\left[ G_{pi} A_{ij} \frac{ \sigma \overline{X_{qj}}}{\sqrt{N}} \right] \end{array} \end{aligned} $$

(4.43)

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle =&\displaystyle \mathbb{E}\left[ \left( G A \right)_{pj} \overline{A_{qj}} \right] +\frac{\sigma^2}{N} \mathbb{E}\left[ G_{pq} \left( \varSigma^* G A \right)_{jj} \right] {} \end{array} \end{aligned} $$

(4.44)

where we applied Lemma 4.10 with $\xi =\Re X_{qj} $ or ξ = ℑX _qj and Ψ = G _piA _ij. Summing (4.42) and (4.44) yields

$$\displaystyle \begin{aligned} \begin{array}{rcl} \mathbb{E}\left[ \left( G \varSigma \right)_{pj} \overline{\varSigma_{qj}} \right] &\displaystyle =&\displaystyle \frac{\sigma^2}{N} \mathbb{E}(G_{pq})+ \frac{\sigma^2}{N} \mathbb{E}\left[ \left(Tr G\right) \left(G \varSigma\right)_{pj} \overline{\varSigma_{qj}} \right] \end{array} \end{aligned} $$

(4.45)

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle &\displaystyle +\frac{\sigma^2}{N} \mathbb{E}\left[ G_{pq} \left( \varSigma^* G A \right)_{jj} \right] + \mathbb{E}\left[ \left( G A \right)_{pj} \overline{A_{qj}} \right]. {}\end{array} \end{aligned} $$

(4.46)

Define

$$\displaystyle \begin{aligned}\varDelta_1(j)= \frac{\sigma^2}{N} \mathbb{E}\left[ \left(Tr G\right) \left(G \varSigma\right)_{pj} \overline{\varSigma_{qj}} \right]- \frac{\sigma^2}{N} \mathbb{E}\left[ Tr G \right] \mathbb{E}\left[ \left(G \varSigma\right)_{pj} \overline{\varSigma_{qj}} \right].\end{aligned}$$

From (4.46), we can deduce that

$$\displaystyle \begin{aligned} \begin{array}{rcl} \mathbb{E}\left[ \left( G \varSigma \right)_{pj} \overline{\varSigma_{qj}} \right] &\displaystyle =&\displaystyle \frac{1}{1-\sigma^2 c_N g_N} \left\{ \frac{\sigma^2}{N} \mathbb{E}(G_{pq})+ \frac{\sigma^2}{N} \mathbb{E}\left[ G_{pq} \left( \varSigma^* G A \right)_{jj} \right] \right.\\ &\displaystyle &\displaystyle \left.+ \mathbb{E}\left[ \left( G A \right)_{pj} \overline{A_{qj}} \right] + \varDelta_1(j)\right\}.\end{array} \end{aligned} $$

Then, summing over j, we obtain that

$$\displaystyle \begin{aligned} \mathbb{E}\left[ \left( G \varSigma \varSigma^*\right)_{pq} \right] &=\frac{1}{1-\sigma^2 c_N g_N} \left\{ {\sigma^2} \mathbb{E}(G_{pq})+ \frac{\sigma^2}{N} \mathbb{E}\left[ G_{pq} Tr\left( \varSigma^* G A \right) \right] \right. \\ & \quad \left. + \mathbb{E}\left[ \left( G A A^*\right)_{pq} \right] + \varDelta_1 (p,q)\right\},{} \end{aligned} $$

(4.47)

where Δ ₁(p, q) is defined by (4.37). Applying Lemma 4.10 with $\xi =\Re X_{ij} $ or ℑX _ij and Ψ = (GA)_ij, we obtain that

$$\displaystyle \begin{aligned}\mathbb{E} \left[Tr\left( \frac{\sigma X^*}{\sqrt{N}} G A \right) \right] =\frac{\sigma^2}{N} \mathbb{E}\left[ Tr G ~Tr\left( \varSigma^* G A \right) \right].\end{aligned}$$

Thus,

$$\displaystyle \begin{aligned}\mathbb{E}\left[ Tr\left( \varSigma^* G A \right) \right]=\mathbb{E}\left[ Tr\left( A^* G A \right) \right]+ \sigma^2 c_N g_N \mathbb{E}\left[ Tr\left( \varSigma^* G A \right) \right] + \varDelta_3,\end{aligned}$$

where Δ ₃ is defined by (4.39) and then

$$\displaystyle \begin{aligned} \mathbb{E}\left[ Tr\left( \varSigma^* G A \right) \right]=\frac{1}{1-\sigma^2 c_Ng_N} \left\{ \mathbb{E}\left[ Tr\left( G A A^*\right) \right] + \varDelta_3 \right\}. \end{aligned} $$

(4.48)

(4.48) and (4.38) imply that

$$\displaystyle \begin{aligned}\frac{\sigma^2}{N} \mathbb{E}\left[ G_{pq}Tr\left( \varSigma^* G A \right) \right]= \frac{\sigma^2}{N}\frac{\mathbb{E}(G_{pq})}{1-\sigma^2 c_N g_N} \left\{ \mathbb{E}\left[ Tr\left( G A A^*\right) \right] + \varDelta_3 \right\} + \varDelta_2(p,q), \end{aligned} $$

(4.49)

where Δ ₂(p, q) is defined by (4.38). We can deduce from (4.47) and (4.49) that

$$\displaystyle \begin{aligned} & \mathbb{E}\left[ \left( G \varSigma \varSigma^*\right)_{pq} \right] \\ & \quad =\frac{1}{1-\sigma^2 c_Ng_N} \left\{ {\sigma^2} \mathbb{E}(G_{pq})+ \mathbb{E}\left[ \left( G A A^*\right)_{pq} \right] \right.\\ & \qquad \left. +\frac{\sigma^2}{N} \frac{\mathbb{E}\left[ G_{pq}\right] }{1-\sigma^2 c_N g_N}\mathbb{E}\left[ Tr\left( G AA^* \right) \right]\right. \\ & \qquad \left.+\frac{\sigma^2}{N}\frac{\mathbb{E}(G_{pq})}{1-\sigma^2 c_N g_N} \varDelta_3 + \varDelta_1 (p,q)+ \varDelta_2 (p,q) \right\}. {} \end{aligned} $$

(4.50)

Using the resolvent identity and (4.50), we obtain that

$$\displaystyle \begin{aligned} \begin{array}{rcl} z\mathbb{E}\left( G_{pq} \right)&\displaystyle =&\displaystyle \frac{1}{1-\sigma^2 c_Ng_N} \left\{ {\sigma^2} \mathbb{E}(G_{pq})+ \mathbb{E}\left[ \left( G A A^*\right)_{pq} \right] \right. \\ &\displaystyle &\displaystyle \left. +\frac{\sigma^2}{N} \frac{\mathbb{E}\left[ G_{pq}\right] }{1-\sigma^2 c_Ng_N} \mathbb{E}\left[ Tr\left( G AA^* \right)\right] \right\}+ \delta_{pq}+\nabla_{pq} {}\end{array} \end{aligned} $$

(4.51)

where ∇_pq is defined by (4.36). Taking p = q in (4.51), summing over p and dividing by n, we obtain that

$$\displaystyle \begin{aligned} \begin{array}{rcl} z g_N &\displaystyle =&\displaystyle \frac{\sigma^2 g_N}{1-\sigma^2 c_N g_N} + \frac{ Tr \left[ \mathbb{E} (G) AA^*\right]}{n(1-\sigma^2 c_N g_N)} \end{array} \end{aligned} $$

(4.52)

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle &\displaystyle + \frac{\sigma^2 g_N Tr \left[ \mathbb{E} (G) AA^*\right]}{N(1-\sigma^2 c_N g_N)^2} +1 + \frac{1}{n} \sum_{p=1}^n \nabla_{pp}. \end{array} \end{aligned} $$

(4.53)

It readily follows that

$$\displaystyle \begin{aligned} \frac{ Tr \left[ \mathbb{E} (G) AA^*\right]}{n(1-\sigma^2 c_Ng_N)} \left( \frac{ \sigma^2 c_N g_N}{(1-\sigma^2 c_N g_N)} +1 \right) =\left( z - \frac{ \sigma^2 }{(1-\sigma^2 c_N g_N)} \right) g_N -1 - \frac{1}{n} \sum_{p=1}^n \nabla_{pp}. \end{aligned}$$

Therefore

$$\displaystyle \begin{aligned} \frac{ Tr \left[ \mathbb{E} (G) AA^*\right]}{n(1-\sigma^2 c_N g_N)}=zg_N (1-\sigma^2 c_N g_N) - (1-\sigma^2 c_N g_N) -\sigma^2 g_N - (1-\sigma^2 c_N g_N) \frac{1}{n} \sum_{p=1}^n \nabla_{pp}. \end{aligned} $$

(4.54)

(4.54) and (4.51) yield

$$\displaystyle \begin{aligned} &\mathbb{E}(G_{pq}) \times \left\{ z (1-\sigma^2 c_N g_N) -\frac{\gamma_q}{1-\sigma^2 c_N g_N} -\sigma^2(1-c_N) + \frac{\sigma^2}{N} \sum_{p=1}^n \nabla_{pp}\right\} \\ &\quad =\delta_{pq} + \nabla_{pq}.\end{aligned} $$

Proposition 4.5 follows.

4.4.2 Variance Estimates

In this section, when we state that some quantity Δ _N(z), $z \in \mathbb {C}\setminus \mathbb {R}$, is equal to $O(\frac {1}{N^p})$, this means precisely that there exist some polynomial P with nonnegative coefficients and some positive real number l which are all independent of N such that for any $z \in \mathbb {C}\setminus \mathbb {R}$,

$$\displaystyle \begin{aligned}\vert \varDelta _N(z)\vert \leq \frac{(\vert z\vert+1)^l P( | \Im z |{}^{-1}) }{N^p}.\end{aligned}$$

We present now the different estimates on the variance. They rely on the following Gaussian Poincaré inequality (see Appendix 2). Let Z ₁, …, Z _q be q real independent centered Gaussian variables with variance σ ². For any $\mathbb {C}^1$ function $f: \mathbb {R}^q \rightarrow \mathbb {C}$ such that f and gradf are in $L^2(\mathbb {N}(0, \sigma ^2I_q))$, we have

$$\displaystyle \begin{aligned} \mathbf{V}\left\{f(Z_1,\ldots ,Z_q)\right\}\leq \sigma^2 \mathbb{E} \left(\Vert (\mathrm{grad}f) (Z_1,\ldots,Z_q)\Vert_2^2\right) , \end{aligned} $$

(4.55)

denoting for any random variable a by V(a) its variance $ \mathbb {E}(\vert a-\mathbb {E}(a)\vert ^2)$. Thus, (Z ₁, …, Z _q) satisfies a Poincaré inequality with constant C _PI = σ ².

The following preliminary result will be useful to these estimates.

Lemma 4.11

There exists K > 0 such for all N,

$$\displaystyle \begin{aligned}\mathbb{E} \left( \lambda_1\left( \frac{XX^*}{N} \right) \right) \leq K.\end{aligned}$$

Proof

According to Lemma 7.2 in [19], we have for any t ∈ ]0;N∕2],

$$\displaystyle \begin{aligned}\mathbb{E} \left[\mathrm{Tr} \left(\exp t \frac{XX^*}{N} \right) \right]\leq n \exp \left( (\sqrt{c_N} +1)^2 t +\frac{1}{N} (c_N +1) t^2 \right).\end{aligned}$$

By the Chebychev’s inequality, we have

$$\displaystyle \begin{aligned} \begin{array}{rcl} \exp \left(t \mathbb{E} \left( \lambda_1\left( \frac{XX^*}{N} \right) \right) \right)&\displaystyle \leq&\displaystyle \mathbb{E} \left( \exp t \lambda_1\left( \frac{XX^*}{N} \right)\right)\\ &\displaystyle \leq &\displaystyle \mathbb{E} \left[\mathrm{Tr} \left(\exp t \frac{XX^*}{N} \right) \right]\\&\displaystyle \leq&\displaystyle n \exp \left( (\sqrt{c_N} +1)^2 t +\frac{1}{N} (c_N +1) t^2 \right). \end{array} \end{aligned} $$

It follows that

$$\displaystyle \begin{aligned}\mathbb{E}\left(\lambda_1\left( \frac{XX^*}{N}\right)\right) \leq \frac{1}{t} \log n+ (\sqrt{c_N} +1)^2 + \frac{1}{N} (c_N +1) t.\end{aligned}$$

The result follows by optimizing in t.

Lemma 4.12

There exists C > 0 such that for all large N, for all $z \in \mathbb {C}\setminus \mathbb {R}$ ,

$$\displaystyle \begin{aligned} \mathbb{E}\left( \left| \frac{1}{n}\mathrm{Tr} G - \mathbb{E}(\frac{1}{n}\mathrm{Tr} G)\right|{}^2 \right)\leq \frac{C}{N^2 \vert \Im z \vert^4},\end{aligned} $$

(4.56)

$$\displaystyle \begin{aligned}\forall (p,q)\in \{1,\ldots,n\}^2, \;\mathbb{E}\left( \vert G_{pq} - \mathbb{E}(G_{pq})\vert^2 \right)\leq \frac{C}{N \vert \Im z \vert^4},\end{aligned} $$

(4.57)

$$\displaystyle \begin{aligned}\mathbb{E}\left( \vert \mathrm{Tr} \varSigma^*GA - \mathbb{E}(\mathrm{Tr} \varSigma^* GA)\vert^2 \right)\leq \frac{C(1+\vert z \vert)^2}{ \vert \Im z \vert^4}.\end{aligned} $$

(4.58)

Proof

Let us define $\varPsi : \mathbb {R}^{2(n\times N)} \rightarrow { M }_{n\times N}(\mathbb {C})$ by

$$\displaystyle \begin{aligned}\varPsi:~~\{x_{ij},y_{ij},i=1,\ldots,n,j=1,\ldots,N\}\rightarrow \sum_{i=1,\ldots,n}\sum_{j=1,\ldots,N} \left( x_{ij} +\mathrm{i} y_{ij} \right) e_{ij},\end{aligned}$$

where e _ij stands for the n × N matrix such that for any (p, q) in {1, …, n}×{1, …, N}, (e _ij)_pq = δ _ipδ _jq. Let F be a smooth complex function on ${ M }_{n\times N}(\mathbb {C})$ and define the complex function f on $\mathbb {R}^{2(n\times N)}$ by setting f = F ∘ Ψ. Then,

$$\displaystyle \begin{aligned}\Vert \mathrm{grad} f(u)\Vert_2 = \sup_{V\in { M }_{n\times N}(\mathbb{C}), Tr VV^*=1} \left| \frac{d}{dt} F(\varPsi(u)+tV)_{\vert_{t=0}}\right|.\end{aligned}$$

Now, $X=\varPsi (\Re (X_{ij}), \Im (X_{ij}),1\leq i\leq n,1\leq j\leq N)$ where the distribution of the random variable $(\Re (X_{ij}), \Im (X_{ij}),1\leq i\leq n,1\leq j\leq N)$ is $\mathbb {N}(0, \frac {1}{2}I_{2nN})$.

Hence consider $F:~H \rightarrow \frac {1}{n} \mathrm {Tr} \left (zI_n -\left (\sigma \frac { H}{\sqrt {N}} +A \right )\left (\sigma \frac {H}{\sqrt {N}} +A \right )^*\right )^{-1}$.

Let $ V\in {M }_{n\times N}(\mathbb {C})$ such that TrV V ^∗ = 1.

$$\displaystyle \begin{aligned} &\frac{d}{dt} F(X+tV)\vert_{t=0}\\ & \quad =\frac{1}{n} \left\{\mathrm{Tr} \left(G\sigma\frac{V}{\sqrt{N}} \left(\sigma\frac{X}{\sqrt{N}} +A \right)^* G\right) + \mathrm{Tr}\left(G \left(\sigma\frac{X}{\sqrt{N}} +A \right)\sigma \frac{V^*}{\sqrt{N}} G\right)\right\}.\end{aligned} $$

Moreover using Cauchy-Schwartz’s inequality and Lemma 4.17, we have

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle &\displaystyle \left| \frac{1}{n} \mathrm{Tr} \left(G\sigma\frac{V}{\sqrt{N}} \left(\sigma\frac{X}{\sqrt{N}} +A \right)^* G\right)\right| \\ &\displaystyle &\displaystyle \quad \leq \frac{\sigma}{n} (TrVV^* )^{\frac{1}{2}}\left[\frac{1}{N}Tr (\left(\sigma\frac{X}{\sqrt{N}} +A \right)\left(\sigma\frac{X}{\sqrt{N}} +A \right)^*G^2(G^*)^2)\right]^{\frac{1}{2}}\\ &\displaystyle &\displaystyle \quad \leq \frac{\sigma}{\sqrt{N}\sqrt{n}\vert \Im z\vert^2}\left[\lambda_1\left(\left(\sigma\frac{X}{\sqrt{N}} +A \right)\left(\sigma\frac{X}{\sqrt{N}} +A \right)^*\right)\right]^{\frac{1}{2}}. \end{array} \end{aligned} $$

We get obviously the same bound for $\vert \frac {1}{n} \mathrm {Tr}\left (G \left (\sigma \frac {X}{\sqrt {N}} +A \right ) \sigma \frac {V^*}{\sqrt {N}} G\right )\vert $. Thus

$$\displaystyle \begin{aligned} &\mathbb{E}\left(\Vert \mathrm{grad}f\left(\Re(X_{ij}), \Im(X_{ij}),1\leq i\leq n,1\leq j\leq N\right)\Vert_2^2 \right)\\ &\quad \leq \frac{4 \sigma^2}{\vert \Im z\vert^4 Nn}\mathbb{E}\left[\lambda_1\left(\left(\sigma\frac{X}{\sqrt{N}} +A \right)\left(\sigma\frac{X}{\sqrt{N}} +A \right)^*\right)\right]. {} \end{aligned} $$

(4.59)

(4.56) readily follows from (4.55), (4.59), Theorem A.8 in [2], Lemma 4.11 and the fact that ∥A _N∥ is uniformly bounded. Similarly, considering

$$\displaystyle \begin{aligned}F:~H \rightarrow \mathrm{Tr} \left[\left(zI_N -\left(\sigma \frac{H}{\sqrt{N}} +A \right)\left(\sigma \frac{H}{\sqrt{N}} +A \right)^*\right)^{-1}E_{qp}\right],\end{aligned}$$

where E _qp is the n × n matrix such that (E _qp)_ij = δ _qiδ _pj, we can obtain that, for any $V\in { M }_{n\times N}(\mathbb {C})$ such that TrV V ^∗ = 1,

$$\displaystyle \begin{aligned} &\left| \frac{d}{dt} F(X+tV)_{\vert_{t=0}} \right| \\ &\quad \leq \frac{\sigma}{\sqrt{N}} \left\{\left(\left(GG^*\right)_{pp} \left(G^*\varSigma \varSigma^*G\right)_{qq}\right)^{1/2} + \left( \left(G^*G\right)_{qq} \left(G\varSigma \varSigma^*G^*\right)_{pp}\right)^{1/2}\right\}.\end{aligned} $$

Thus, one can get (4.57) in the same way. Finally, considering

$$\displaystyle \begin{aligned}F:~H \rightarrow \mathrm{Tr} \left[\left(\sigma \frac{H}{\sqrt{N}} +A \right)^*\left(zI_N -\left(\sigma \frac{H}{\sqrt{N}} +A \right)\left(\sigma \frac{H}{\sqrt{N}} +A \right)^*\right)^{-1}A\right],\end{aligned}$$

we can obtain that, for any $V\in { M }_{n\times N}(\mathbb {C})$ such that TrV V ^∗ = 1,

$$\displaystyle \begin{aligned} \begin{array}{rcl}\left| \frac{d}{dt} F(X+tV)_{\vert_{t=0} }\right|&\displaystyle \leq&\displaystyle \sigma\left\{ \left(\frac{1}{N} \mathrm{Tr} \varSigma^* G A \varSigma^* GG^* \varSigma A^* G^*\varSigma \right)^{1/2} \right. \\&\displaystyle &\displaystyle \left.+ \left(\frac{1}{N} \mathrm{Tr} GA \varSigma^* G \varSigma \varSigma^* G^* \varSigma A^* G^* \right)^{1/2} \right. \\&\displaystyle &\displaystyle \left.+ \left(\frac{1}{N} \mathrm{Tr} GA A^* G^* \right)^{1/2}\right\} \end{array} \end{aligned} $$

Using Lemma 4.17 (i), Theorem A.8 in [2], Lemma 4.11, the identity ΣΣ ^∗G = GΣΣ ^∗ = −I + zG, and the fact that ∥A _N∥ is uniformly bounded, the same analysis allows to prove (4.58).

Corollary 4.1

Let Δ ₁(p, q), Δ ₂(p, q), (p, q) ∈{1, …, n}², and Δ ₃ be as defined in Proposition 4.5 . Then there exist a polynomial P with nonnegative coefficients and a nonnegative real number l such that, for all large N, for any $z\in \mathbb {C}\setminus \mathbb {R}$ ,

$$\displaystyle \begin{aligned} \varDelta_3(z)\leq \frac{P(\vert \Im z\vert^{-1}) (1+\vert z \vert )^l}{N},\end{aligned} $$

(4.60)

and for all (p, q) ∈{1, …, n}²,

$$\displaystyle \begin{aligned} \varDelta_1(p,q)(z)\leq \frac{P(\vert \Im z\vert^{-1}) (1+\vert z \vert )^l}{N},\end{aligned} $$

(4.61)

$$\displaystyle \begin{aligned} \varDelta_2(p,q)(z)\leq \frac{P(\vert \Im z\vert^{-1}) (1+\vert z \vert )^l}{N\sqrt{N}}.\end{aligned} $$

(4.62)

Proof

Using the identity

$$\displaystyle \begin{aligned}GM_N = -I + z G,\end{aligned}$$

(4.61) readily follows from Cauchy-Schwartz inequality, Lemma 4.17 and (4.56). (4.62) and (4.60) readily follows from Cauchy-Schwartz inequality and Lemma 4.12.

4.4.3 Estimates of Resolvent Entries

In order to deduce Proposition 4.3 from Proposition 4.5 and Corollary 4.1, we need the two following Lemmas 4.13 and 4.14.

Lemma 4.13

For all $z\in \mathbb {C}\setminus \mathbb {R}$ ,

$$\displaystyle \begin{aligned}\frac{1}{\left|1- \sigma^2 c_N g_N(z)\right|} \leq \frac{\vert z\vert }{\vert \Im z \vert},\end{aligned} $$

(4.63)

$$\displaystyle \begin{aligned}\frac{1}{\left|1- \sigma^2 c g_{\mu_{\sigma,\nu,c}}(z)\right|} \leq \frac{\vert z\vert }{\vert \Im z \vert}.\end{aligned} $$

(4.64)

Proof

Since $\mu _{M_N}$ is supported by [0, +∞[, (4.63) readily follows from

$$\displaystyle \begin{aligned} \frac{1}{\left|1- \sigma^2 c_N g_N(z)\right|} &=\frac{\vert z\vert }{\left|z- \sigma^2 c_N zg_N(z)\right|} \\ &\leq \frac{\vert z\vert }{\big|\Im (z- \sigma^2 c_N zg_N(z))\big|}\,{=}\, \frac{\vert z\vert }{|\Im z |\big( 1+\sigma^2 c_N \mathbb{E} \int \frac{t}{|z-t|{}^2} d\mu_{M_N}(t)\big)}.\end{aligned} $$

(4.64) may be proved similarly.

Corollary 4.1 and Lemma 4.13 yield that, there is a polynomial Q with nonnegative coefficients, a sequence b _N of nonnegative real numbers converging to zero when N goes to infinity and some nonnegative integer number l, such that for any p, q in {1, …, n}, for all $z\in \mathbb {C}\setminus \mathbb {R}$,

$$\displaystyle \begin{aligned}\nabla_{pq} \leq (1+\vert z\vert)^l Q(\vert \Im z \vert^{-1})b_N,\end{aligned} $$

(4.65)

where ∇_pq was defined by (4.36).

Lemma 4.14

There is a sequence v _N of nonnegative real numbers converging to zero when N goes to infinity such that for all $z\in \mathbb {C}\setminus \mathbb {R}$ ,

$$\displaystyle \begin{aligned} \left| g_N(z)-g_{\mu_{\sigma,\nu,c}}(z)\right| \leq \left\{\frac{\vert z\vert^2 +2}{\vert \Im z \vert^{2}}+ \frac{1}{\vert \Im z \vert}\right\}v_N.\end{aligned} $$

(4.66)

Proof

First note that it is sufficient to prove (4.66) for $z\in \mathbb {C}^+:=\{z \in \mathbb {C}; \Im z >0\}$ since $ g_N(\bar z)-g_{\mu _{\sigma ,\nu ,c}} (\bar z)= \overline {g_N(z)-g_{\mu _{\sigma ,\nu ,c}}(z)}$. Fix 𝜖 > 0. According to Theorem A.8 and Theorem 5.11 in [2], and the assumption on A _N, we can choose $K> \max \{ 2/\varepsilon ; x, x \in \mathrm {supp}( \mu _{\sigma ,\nu ,c})\}$ large enough such that $\mathbb {P}\left ( \left \|M_N\right \| >K\right )$ goes to zero as N goes to infinity. Let us write

(4.67)

For any $z \in \mathbb {C}^+$ such that |z| > 2K, we have

Thus, $\forall z \in \mathbb {C}^+,$ such that |z| > 2K, we can deduce that

(4.68)

Now, it is clear that is a sequence of locally bounded holomorphic functions on $\mathbb {C}^+$ which converges towards $g_{\mu _{\sigma ,\nu ,c}}$. Hence, by Vitali’s Theorem, converges uniformly towards $g_{\mu _{\sigma ,\nu ,c}}$ on each compact subset of $\mathbb {C}^+$. Thus, there exists N(𝜖) > 0, such that for any N ≥ N(𝜖), for any $z\in \mathbb {C}^+$, such that |z|≤ 2K and ℑz ≥ ε,

(4.69)

Finally, for any $z\in \mathbb {C}^+$, such that ℑz ∈ ]0;ε[, we have

(4.70)

It readily follows from (4.68), (4.69) and (4.70) that for N ≥ N(𝜖),

Moreover, for N ≥ N′(𝜖) ≥ N(𝜖), $\mathbb {P}\left ( \left \|M_N\right \| >K\right ) \leq \varepsilon .$ Therefore, for N ≥ N′(𝜖), we have for any $z \in \mathbb {C}^+$,

(4.71)

Thus, the proof is complete by setting

$$\displaystyle \begin{aligned}v_N= \sup_{z\in \mathbb{C}^+} \left\{\left|g_N(z)-g_{\mu_{\sigma,\nu,c}}(z) \right| \left(\frac{\vert z\vert^2 +2}{\vert \Im z \vert^{2}}+ \frac{1}{\Im z}\right)^{-1}\right\}.\end{aligned}$$

Now set

$$\displaystyle \begin{aligned}\tau_N= {(1-\sigma^2c_Ng_{ N}(z))z- \frac{ \gamma_q(N)}{1- \sigma^2 c_Ng_{N}(z)} -\sigma^2 (1-c_N)}\end{aligned}$$

and

$$\displaystyle \begin{aligned} \tilde \tau_N={(1-\sigma^2cg_{ \mu_{\sigma,\nu,c}}(z))z- \frac{ \gamma_q(N)}{1- \sigma^2 cg_{ \mu_{\sigma,\nu,c}}(z)} -\sigma^2 (1-c)}.\end{aligned} $$

(4.72)

Lemmas 4.13 and 4.14 yield that there is a polynomial R with nonnegative coefficients, a sequence w _N of nonnegative real numbers converging to zero when N goes to infinity and some nonnegative real number l, such that for all $z\in \mathbb {C}\setminus \mathbb {R}$,

$$\displaystyle \begin{aligned}\left|\tau_N - \tilde \tau_N\right| \leq (1+\vert z\vert)^l R(\vert \Im z \vert^{-1})w_N.\end{aligned} $$

(4.73)

Now, one can easily see that,

$$\displaystyle \begin{aligned} \left|\Im \left\{(1-\sigma^2cg_{ \mu_{\sigma,\nu,c}}(z))z- \frac{ \gamma_q(N)}{1- \sigma^2 cg_{ \mu_{\sigma,\nu,c}}(z)} -\sigma^2 (1-c)\right\}\right| \geq \vert \Im z \vert,\end{aligned} $$

(4.74)

so that

$$\displaystyle \begin{aligned} \left| \frac{1}{\tilde \tau_N}\right| \leq \frac{1}{\vert\Im z \vert}. \end{aligned} $$

(4.75)

Note that

$$\displaystyle \begin{aligned}\frac{1}{ \tilde \tau_N} =\frac{( {1- \sigma^2c g_{\mu_{\sigma,\nu,c}}(z)})}{\omega_{\sigma, \nu, c}(z) -\gamma_q(N)}.\end{aligned} $$

(4.76)

Then, (4.16) readily follows from Proposition 4.5, (4.65), (4.73), (4.75), (4.76), and (ii) Lemma 4.17. The proof of Proposition 4.3 is complete.

4.5 Proof of Theorem 4.3

We follow the two steps presented in Sect. 4.2.

Step A

We first prove (4.11).

Let η > 0 small enough and N large enough such that for any l = 1, …, J, α _l(N) ∈ [θ _l − η, θ _l + η] and [θ _l − 2η, θ _l + 2η] contains no other element of the spectrum of $A_NA_N^*$ than α _l(N). For any l = 1, …, J, choose f _η,l in $\mathbb {C}^\infty (\mathbb {R}, \mathbb {R})$ with support in [θ _l − 2η, θ _l + 2η] such that f _η,l(x) = 1 for any x ∈ [θ _l − η, θ _l + η] and 0 ≤ f _η,l ≤ 1. Let 0 < 𝜖 < δ ₀ where δ ₀ is introduced in Theorem 4.2. Choose h _ε,j in $\mathbb { C}^\infty (\mathbb {R}, \mathbb {R})$ with support in $[\rho _{\theta _j} -\varepsilon ,\rho _{\theta _j}+\varepsilon ]$ such that h _ε,j ≡ 1 on $[\rho _{\theta _j} -\varepsilon /2 ,\rho _{\theta _j}+\varepsilon /2 ]$ and 0 ≤ h _ε,j ≤ 1.

Almost surely for all large N, M _N has k _j eigenvalues in $]\rho _{\theta _j} -\varepsilon /2 ,\rho _{\theta _j}+\varepsilon /2[$. According to Theorem 4.2, denoting by $(\xi _1,\cdots ,\xi _{k_j})$ an orthonormal system of eigenvectors associated to the k _j eigenvalues of M _N in $( \rho _{\theta _j} -\varepsilon /2, \rho _{\theta _j}+\varepsilon /2)$, it readily follows from (4.12) that almost surely for all large N,

$$\displaystyle \begin{aligned}\sum_{n=1}^{k_j}\left\| P_{\ker(\alpha_l(N) I_n-A_NA_N^*)}\xi_n \right\|{}^2= \mathrm{Tr} \left[ h_{\varepsilon ,j}(M_N) f_{\eta,l}(A_NA_N^*)\right].\end{aligned}$$

Applying Proposition 4.2 with $\varGamma _N= f_{\eta ,l}(A_NA_N^*)$ and K = k _l, the problem of establishing (4.11) is reduced to prove that

$$\displaystyle \begin{aligned} & \mathbb{E}\left(\mathrm{Tr} \left[h_{\varepsilon,j} \left(\left(\sigma\frac{\mathbb{G}_N}{\sqrt{N}}+A_N\right)\left(\sigma\frac{\mathbb{ G}_N}{\sqrt{N}}+A_N\right)^*\right) f_{\eta,l}(A_NA_N^*)\right] \right)\\ & \quad \rightarrow_{N \rightarrow +\infty} \frac{k_j\delta_{jl} (1-\sigma^2 c g_{\mu_{\sigma,\nu,c}}(\rho_{\theta_j}))}{\omega_{\sigma,\nu,c}^{\prime}(\rho_{\theta_j})}. \end{aligned} $$

(4.77)

Using a Singular Value Decomposition of A _N and the biunitarily invariance of the distribution of $\mathbb {G}_N$, we can assume that A _N is as (4.14) and such that for any j = 1, …, J,

$$\displaystyle \begin{aligned}(A_NA_N^*)_{ii}=\alpha_j(N) \mbox{ ~ for }i=k_1+\ldots+k_{j-1}+l, l=1,\ldots,k_j.\end{aligned}$$

Now, according to Lemma 4.18,

$$\displaystyle \begin{aligned} & \mathbb{E}\left(\mathrm{Tr} \left[h_{\varepsilon,j} \left(\left(\sigma \frac{\mathbb{G}_N}{\sqrt{N}}+A_N\right)\left(\sigma \frac{\mathbb{ G}_N}{\sqrt{N}}+A_N\right)^*\right) f_{\eta,l}(A_NA_N^*)\right] \right)\\ &\quad = - \lim_{y\rightarrow 0^{+}}\frac{1}{\pi} \int \Im \mathbb{E}\mathrm{Tr} \left[G^{\mathbb{G}}_N(t+\mathrm{i} y) f_{\eta,l}(A_NA_N^*)\right] h_{\varepsilon,j}(t) dt,\end{aligned} $$

with, for all large N,

$$\displaystyle \begin{aligned} \begin{array}{rcl}\mathbb{E}\mathrm{Tr} \left[G^{\mathbb{G}}_N(t+\mathrm{i} y) f_{\eta,l}(A_NA_N^*)\right] &\displaystyle =&\displaystyle \sum_{k=k_1+\cdot+k_{l-1}+1}^{k_1+\cdot+k_{l}} f_{\eta,l} (\alpha_l(N))\mathbb{E}[G^{\mathbb{G}}_N(t+\mathrm{i} y)]_{kk} \\&\displaystyle =&\displaystyle \sum_{k=k_1+\cdot+k_{l-1}+1}^{k_1+\cdot+k_{l}} \mathbb{E}[G^{\mathbb{G}}_N(t+\mathrm{i} y)]_{kk}. \end{array} \end{aligned} $$

Now, by considering

$$\displaystyle \begin{aligned}\tau'={(1-\sigma^2cg_{ \mu_{\sigma,\nu,c}}(z))z- \frac{ \theta_l}{1- \sigma^2 cg_{ \mu_{\sigma,\nu,c}}(z)} -\sigma^2 (1-c)}\end{aligned}$$

instead of dealing with $\tilde \tau _N$ defined in (4.72) at the end of the proof of Proposition 4.3, one can prove that there is a polynomial P with nonnegative coefficients, a sequence (u _N)_N of nonnegative real numbers converging to zero when N goes to infinity and some nonnegative real number s, such that for any k in {k ₁ + … + k _l−1 + 1, …, k ₁ + … + k _l}, for all $z\in \mathbb {C}\setminus \mathbb {R}$,

$$\displaystyle \begin{aligned} \mathbb{E} \left(\left( G^{\mathbb{G}}_N(z)\right)_{kk}\right) = \frac{1- \sigma^2 cg_{\mu_{\sigma,\nu,c}}(z)}{\omega_{\sigma, \nu, c}(z) -\theta_l} +\varDelta_{k,N}(z), \end{aligned} $$

(4.78)

with

$$\displaystyle \begin{aligned}\left| \varDelta_{k,N} (z)\right| \leq (1+\vert z\vert)^s P(\vert \Im z \vert^{-1})u_N.\end{aligned}$$

Thus,

$$\displaystyle \begin{aligned}\mathbb{E}\mathrm{Tr} \left[G^{\mathbb{G}}_N(t+\mathrm{i} y) f_{\eta,l}(A_NA_N^*)\right] = k_l \frac{1- \sigma^2 cg_{\mu,\sigma,\nu}(t+\mathrm{i} y)}{\omega_{\sigma, \nu, c}(z) -\theta_l} + \varDelta_N(t+\mathrm{i} y),\end{aligned}$$

where for all $z \in \mathbb {C} \setminus \mathbb {R}$, $\varDelta _N(z)= \sum _{k=k_1+\cdot +k_{l-1}+1}^{k_1+\cdot +k_{l}} \varDelta _{k,N}(z),$ and $\left | \varDelta _{N} (z)\right | \leq k_l (1+\vert z\vert )^s P(\vert \Im z \vert ^{-1})u_N.$

First let us compute

$$\displaystyle \begin{aligned}\lim_{y\downarrow0}\frac{k_l}{\pi}\int_{\rho_{\theta_j}-\varepsilon}^{\rho_{\theta_j}+\varepsilon} \Im\frac{h_{\varepsilon,j}(t) (1-\sigma^2 c g_{\mu_{\sigma,\nu,c}}(t+\mathrm{i} y))}{\theta_l-\omega_{\sigma,\nu,c}(t+\mathrm{i} y)}\,dt. \end{aligned}$$

The function ω _σ,ν,c satisfies $\omega _{\sigma ,\nu ,c}(\overline {z})=\overline {\omega _{\sigma ,\nu ,c}(z)}$ and $g_{\mu _{\sigma ,\nu ,c}}(\overline {z})=\overline {g_{\mu _{\sigma ,\nu ,c}}(z)}$, so that $\Im \frac { (1-\sigma ^2 c g_{\mu _{\sigma ,\nu ,c}}(t+\mathrm{i} y))}{\theta _l-\omega _{\sigma ,\nu ,c}(t+\mathrm{i} y)}=\frac {1}{2i}[\frac { (1-\sigma ^2 c g_{\mu _{\sigma ,\nu ,c}}(t+\mathrm{i} y))}{\theta _l-\omega _{\sigma ,\nu ,c}(t+\mathrm{i} y)}- \frac { (1-\sigma ^2 c g_{\mu _{\sigma ,\nu ,c}}(t-iy))}{\theta _l-\omega _{\sigma ,\nu ,c}(t-iy)}]$. As in [10], the above integral is split into three pieces, namely $\int _{\rho _{\theta _j}-\varepsilon }^{\rho _{\theta _j}-\varepsilon /2}+ \int _{\rho _{\theta _j}-\varepsilon /2}^{\rho _{\theta _j}+\varepsilon /2}+\int _{\rho _{\theta _j}+\varepsilon /2}^{\rho _{\theta _j}+\varepsilon }$. Each of the first and third integrals are easily seen to go to zero when y ↓ 0 by a direct application of the definition of the functions involved and of the (Riemann) integral. As h _ε,j is constantly equal to one on $[\rho _{\theta _j}-\epsilon /2; \rho _{\theta _j}+\epsilon /2]$, the second (middle) term is simply the integral

$$\displaystyle \begin{aligned}\frac{k_l}{2\pi i}\int_{\rho_{\theta_j}-\varepsilon/2}^{\rho_{\theta_j}+\varepsilon/2}\frac{1-\sigma^2cg_{\mu_{\sigma,\nu,c}}(t+\mathrm{i} y)}{\theta_l-\omega_{\sigma,\nu,c}(t+\mathrm{i} y)}- \frac{1-\sigma^2cg_{\mu_{\sigma,\nu,c}}(t-\mathrm{i} y)}{\theta_l-\omega_{\sigma,\nu,c}(t-\mathrm{i} y)}\,dt. \end{aligned}$$

Completing this to a contour integral on the rectangular with corners $\rho _{\theta _j}\pm \varepsilon /2\pm iy$ and noting that the integrals along the vertical lines tend to zero as y ↓ 0 allows a direct application of the residue theorem for the final result, if l = j,

$$\displaystyle \begin{aligned}\frac{k_j (1-\sigma^2 c g_{\mu_{\sigma,\nu,c}}(\rho_{\theta_j}))}{\omega_{\sigma,\nu,c}^{\prime}(\rho_{\theta_j})}. \end{aligned}$$

If we consider θ _l for some l ≠ j, then $z\mapsto (1-\sigma ^2 cg_{\mu _{\sigma ,\nu ,c}}(z))(\theta _l- \omega _{\sigma ,\nu ,c}(z))^{-1}$ is analytic around $\rho _{\theta _j}$, so its residue at $\rho _{\theta _j}$ is zero, and the above argument provides zero as answer.

Now, according to Lemma 4.19, we have

$$\displaystyle \begin{aligned}\limsup_{y\rightarrow 0^+}~(u_N)^{-1}\left|\int h_{\varepsilon,j} (t)\varDelta_N(t+\mathrm{i} y)dt\right| <+\infty \end{aligned}$$

so that

$$\displaystyle \begin{aligned} \lim_{N\rightarrow + \infty}\limsup_{y \rightarrow 0^+} \left| \int h_{\varepsilon,j} (t) \varDelta_N(t+\mathrm{i} y)dt \right| =0. \end{aligned} $$

(4.79)

This concludes the proof of (4.11).

Step B

In the second, and final, step, we shall use a perturbation argument identical to the one used in [10] to reduce the problem to the case of a spike with multiplicity one, case that follows trivially from Step A. A further property of eigenvectors of Hermitian matrices which are close to each other in the norm will be important in the analysis of the behaviour of the eigenvectors of our matrix models. Given a Hermitian matrix $M\in \ M_N(\mathbb C)$ and a Borel set $S\subseteq \mathbb R$, we denote by E _M(S) the spectral projection of M associated to S. In other words, the range of E _M(S) is the vector space generated by the eigenvectors of M corresponding to eigenvalues in S. The following lemma can be found in [6].

Lemma 4.15

Let M and M ₀ be N × N Hermitian matrices. Assume that $\alpha ,\beta ,\delta \in \mathbb R$ are such that α < β, δ > 0, M and M ₀ has no eigenvalues in [α − δ, α] ∪ [β, β + δ]. Then,

$$\displaystyle \begin{aligned}\|E_{M}((\alpha,\beta))-E_{M_0}((\alpha,\beta))\|<\frac{4(\beta-\alpha+2\delta)}{\pi\delta^2}\|M-M_0\|. \end{aligned}$$

In particular, for any unit vector $\xi \in E_{M_0}((\alpha ,\beta ))(\mathbb C^N)$ ,

$$\displaystyle \begin{aligned}\|(I_N-E_{M}((\alpha,\beta)))\xi\|{}_2<\frac{4(\beta-\alpha+2\delta)}{\pi\delta^2}\|M-M_0\|. \end{aligned}$$

Assume that θ _i is in Θ _σ,ν,c defined in (4.7) and k _i ≠ 1. Let us denote by $V_1(i),\ldots , V_{k_i}(i)$, an orthonormal system of eigenvectors of $A_NA_N^*$ associated with α _i(N). Consider a Singular Value Decomposition A _N = U _ND _NV _N where V _N is a N × N unitary matrix, U _N is a n × n unitary matrix whose k _i first columns are $ V_1(i),\ldots , V_{k_i}(i)$ and D _N is as (4.14) with the first k _i diagonal elements equal to $\sqrt {\alpha _i(N)}$.

Let δ ₀ be as in Theorem 4.2. Almost surely, for all N large enough, there are k _i eigenvalues of M _N in $(\rho _{\theta _i}- \frac {\delta _0}{4}, \rho _{\theta _i}+ \frac {\delta _0}{4})$, namely $\lambda _{n_{i-1}+q}(M_N)$, q = 1, …, k _i (where n _i−1 + 1, …, n _i−1 + k _i are the descending ranks of α _i(N) among the eigenvalues of $A_NA_N^*$), which are moreover the only eigenvalues of M _N in $(\rho _{\theta _i}-\delta _0,\rho _{\theta _i}+\delta _0)$. Thus, the spectrum of M _N is split into three pieces:

$$\displaystyle \begin{aligned}\{\lambda_1(M_N),\dots,\lambda_{n_{i-1}}(M_N)\}\subset (\rho_{\theta_i}+\delta_0,+\infty[,\end{aligned}$$

$$\displaystyle \begin{aligned}\{\lambda_{n_{i-1}+1}(M_N),\dots,\lambda_{n_{i-1}+k_i}(M_N)\} \subset(\rho_{\theta_i}- \frac{\delta_0}{4},\rho_{\theta_i}+ \frac{\delta_0}{4}),\end{aligned}$$

$$\displaystyle \begin{aligned}\{\lambda_{n_{i-1}+k_i+1}(M_N),\dots, \lambda_{N}(M_N)\}\subset [0, \rho_{\theta_i}-\delta_0). \end{aligned}$$

The distance between any of these components is equal to 3δ ₀∕4. Let us fix 𝜖 ₀ such that $0\leq \theta _i ( 2 \epsilon _0 k_i +\epsilon _0^2k_i^2) < dist(\theta _i, \mbox{supp }\nu \cup _{i\neq s}\theta _s )$ and such that $[\theta _i; \theta _i+\theta _i ( 2 \epsilon _0 k_i +\epsilon _0^2k_i^2)] \subset \mathbb {E}_{\sigma , \nu , c}$ defined by (4.6). For any 0 < 𝜖 < 𝜖 ₀, define the matrix A _N(𝜖) as A _N(𝜖) = U _ND _N(𝜖)V _N where

$$\displaystyle \begin{aligned}\left(D_N(\epsilon)\right)_{m, m}=\sqrt{\alpha_{i}(N)} [1 + \epsilon (k_i-m+1)],\text{ ~for } m\in\{1,\ldots,k_i\},\end{aligned}$$

and $\left (D_N(\epsilon )\right )_{pq}=\left (D_N\right )_{pq}$ for any (p, q)∉{(m, m), m ∈{1, …, k _i}}.

Set

$$\displaystyle \begin{aligned}M_N(\epsilon)=\left(\sigma \frac{X_N}{\sqrt{N}} +A_N(\epsilon)\right)\left( \sigma \frac{X_N}{\sqrt{N}} +A_N(\epsilon)\right)^*.\end{aligned}$$

For N large enough, for each m ∈{1, …, k _i}, α _i(N)[1 + 𝜖(k _i − m + 1)]² is an eigenvalue of $A_NA_N^*(\epsilon )$ with multiplicity one. Note that, since sup_N∥A _N∥ < +∞, it is easy to see that there exist some constant C such that for any N and for any 0 < 𝜖 < 𝜖 ₀,

$$\displaystyle \begin{aligned}\left\|M_N(\epsilon)-M_N\right\|\leq C\epsilon \left( \left\| \frac{X_N}{\sqrt{N}}\right\| +1\right) . \end{aligned}$$

Applying Remark 4.3 to the (n + N) × (n + N) matrix $\tilde X_N= \left ( \begin {array}{ll} 0_{n\times n}~~~ X_N\\ X_N^*~~~ 0_{N\times N} \end {array} \right )$ (see also Appendix B of [14]), it readily follows that there exists some constant C′ such that a.s for all large N, for any 0 < 𝜖 < 𝜖 ₀,

$$\displaystyle \begin{aligned} \left\| M_N(\epsilon)-M_N\right\| \leq C' \epsilon.\end{aligned} $$

(4.80)

Therefore, for 𝜖 sufficiently small such that C′𝜖 < δ ₀∕4, by Theorem A.46 [2], there are precisely n _i−1 eigenvalues of M _N(𝜖) in $[0,\rho _{\theta _i}-3\delta _0/4)$, precisely k _i in $(\rho _{\theta _i}-\delta _0/2,\rho _{\theta _i}+\delta _0/2)$ and precisely N − (n _i−1 + k _i) in $(\rho _{\theta _i}+3\delta _0/4,+ \infty [$. All these intervals are again at strictly positive distance from each other, in this case δ ₀∕4.

Let ξ be a normalized eigenvector of M _N relative to $\lambda _{n_{i-1}+q}(M_N)$ for some q ∈{1, …, k _i}. As proved in Lemma 4.15, if E(𝜖) denotes the subspace spanned by the eigenvectors associated to $\{\lambda _{n_{i-1}+1}(M_N(\epsilon )),\dots ,\lambda _{n_{i-1}+k_i} (M_N(\epsilon ))\}$ in $\mathbb C^N$, then there exists some constant C (which depends on δ ₀) such that for 𝜖 small enough, almost surely for large N,

$$\displaystyle \begin{aligned} \left\|P_{ E(\epsilon)^{\bot}}\xi\right\|{}_2\leq C \epsilon .\end{aligned} $$

(4.81)

According to Theorem 4.2, for any j in {1, …, k _i}, for large enough N, $\lambda _{n_{i-1}+j}(M_N(\epsilon ))$ separates from the rest of the spectrum and belongs to a neighborhood of $\varPhi _{\sigma ,\nu ,c} (\theta _i^{(j)}(\epsilon ))$ where

$$\displaystyle \begin{aligned}\theta_i^{(j)}(\epsilon)=\theta_i\left( 1+\epsilon (k_i -j+1) \right)^2.\end{aligned}$$

If ξ _j(𝜖, i) denotes a normalized eigenvector associated to $\lambda _{n_{i-1}+j}(M_N(\epsilon ))$, Step A above implies that almost surely for any p ∈{1, …, k _i}, for any γ > 0, for all large N,

$$\displaystyle \begin{aligned} \left|\left| \langle V_{p}(i),\xi_{j}(\epsilon ,i)\rangle \right|{}^2 - \frac{\delta_{jp}\left(1-\sigma^2 c g_{\mu_{\sigma,\nu,c}}(\varPhi_{\sigma,\nu,c} (\theta_i^{(j)}(\epsilon)))\right)}{ \omega_{\sigma,\nu,c}^{\prime}\left(\varPhi_{\sigma,\nu,c} (\theta_i^{(j)}(\epsilon)) \right)}\right|<\gamma. \end{aligned} $$

(4.82)

The eigenvector ξ decomposes uniquely in the orthonormal basis of eigenvectors of M _N(𝜖) as $\xi =\sum _{j=1}^{k_i}c_j(\epsilon )\xi _j(\epsilon ,i)+\xi (\epsilon )^\perp $, where c _j(𝜖) = 〈ξ|ξ _j(𝜖, i)〉 and $\xi (\epsilon )^\perp =P_{ E(\epsilon )^{\bot }}\xi $; necessarily $\sum _{j=1}^{k_i}|c_j(\epsilon )|{ }^2+\|\xi (\epsilon )^\perp \|{ }_2^2=1$. Moreover, as indicated in relation (4.81), ∥ξ(𝜖)^⊥∥₂ ≤ C𝜖. We have

$$\displaystyle \begin{aligned} \begin{array}{rcl} P_{\ker(\alpha_i(N)I_N-A_NA_N^*)}\xi&\displaystyle =&\displaystyle \sum_{j=1}^{k_i}c_j(\epsilon) P_{\ker(\alpha_i(N)I_N-A_NA_N^*)}\xi_j(\epsilon,i)\\&\displaystyle &\displaystyle +P_{\ker(\alpha_i(N)I_N-A_NA_N^*)}\xi(\epsilon)^\perp\\ &\displaystyle =&\displaystyle \sum_{j=1}^{k_i}c_j(\epsilon) \sum_{l=1}^{k_i}\langle \xi_j(\epsilon,i) | V_{l}(i)\rangle V_{l}(i)\\ &\displaystyle &\displaystyle +P_{\ker(\alpha_i(N)I_N-A_NA_N^*)}\xi(\epsilon)^\perp. \end{array} \end{aligned} $$

Take in the above the scalar product with $\xi =\sum _{j=1}^{k_i}c_j(\epsilon )\xi _j(\epsilon ,i)+\xi (\epsilon )^\perp $ to get

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle &\displaystyle \langle P_{\ker(\alpha_i(N)I_N-A_NA_N^*)}\xi|\xi\rangle \\ &\displaystyle &\displaystyle \quad =\sum_{j,l,s=1}^{k_i}c_j(\epsilon)\langle \xi_j(\epsilon, i) |V_{l}(i) \rangle\overline{c_s(\epsilon)}\langle V_{l}(i)|\xi_s(\epsilon, i)\rangle\\ &\displaystyle &\displaystyle \qquad +\sum_{j=1}^{k_i}c_j(\epsilon) \sum_{l=1}^{k_i}\langle \xi_j(\epsilon,i)| V_{l}(i) \rangle \langle V_{l}(i)|\xi(\epsilon)^\perp\rangle\\ &\displaystyle &\displaystyle \qquad +\langle P_{\ker(\alpha_i(N)I_N-A_NA_N^*)}\xi(\epsilon)^\perp|\xi\rangle. \end{array} \end{aligned} $$

Relation (4.82) indicates that

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle &\displaystyle \sum_{j,l,s=1}^{k_i}c_j(\epsilon)\langle \xi_j(\epsilon, i) | V_{l}(i) \rangle\overline{c_s(\epsilon)}\langle V_{l}(i)|\xi_s(\epsilon, i)\rangle\\ &\displaystyle &\displaystyle \quad = \sum_{j=1}^{k_i}|c_j(\epsilon)|{}^2|\langle V_{j}(i)|\xi_j(\epsilon, i)\rangle|{}^2+\varDelta_1\\ &\displaystyle &\displaystyle \quad = \sum_{j=1}^{k_i}|c_j(\epsilon)|{}^2 \frac{\left(1-\sigma^2 c g_{\mu_{\sigma,\nu,c}}(\varPhi_{\sigma,\nu,c} (\theta_i^{(j)}(\epsilon)))\right)}{ \omega_{\sigma,\nu,c}^{\prime}\left(\varPhi_{\sigma,\nu,c} (\theta_i^{(j)}(\epsilon)) \right)}+\varDelta_1 + \varDelta_2, \end{array} \end{aligned} $$

where for all large N, $\vert \varDelta _1\vert \leq \sqrt { \gamma } k_i^3$ and |Δ ₂|≤ γ. Since ∥ξ(𝜖)^⊥∥₂ ≤ C𝜖,

$$\displaystyle \begin{aligned} &\left|\sum_{j=1}^{k_i}c_j(\epsilon) \sum_{l=1}^{k_i}\langle \xi_j(\epsilon,i) | V_{l}(i) \rangle \langle V_{l}(i)|\xi(\epsilon)^\perp\rangle \right. \\ &\quad \left. +\langle P_{\ker(\alpha_i(N)I_N-A_NA_N^*)}\xi(\epsilon)^\perp|\xi\rangle\right| \leq\left(k_i^2+1\right){C\epsilon}. \end{aligned} $$

Thus, we conclude that almost surely for any γ > 0, for all large N,

$$\displaystyle \begin{aligned}&\left|\langle P_{\ker(\alpha_i(N)I_N-A_NA_N^*)}\xi|\xi\rangle- \sum_{j=1}^{k_i}\frac{|c_j(\epsilon)|{}^2 \left(1-\sigma^2 c g_{\mu_{\sigma,\nu,c}}(\varPhi_{\sigma,\nu,c} (\theta_i^{(j)}(\epsilon)))\right)}{ \omega_{\sigma,\nu,c}^{\prime}\left(\varPhi_{\sigma,\nu,c} (\theta_i^{(j)}(\epsilon)) \right)} \right| \\ & \quad \leq(k_i^2+1)C\epsilon+ \sqrt{\gamma}k_i^3+\gamma .{} \end{aligned} $$

(4.83)

Since we have the identity

$$\displaystyle \begin{aligned}\langle P_{\ker(\alpha_i(N)I_N-A_NA_N^*)}\xi|\xi\rangle=\|P_{\ker( \alpha_i(N)I_N-A_NA_N^*)}\xi\|{}_2^2\end{aligned}$$

and the three obvious convergences $\lim _{\epsilon \to 0}\omega _{\sigma ,\nu ,c}^{\prime }\left (\varPhi _{\sigma ,\nu ,c} (\theta _i^{(j)}(\epsilon )) \right )=\omega _{\sigma ,\nu ,c}^{\prime }(\rho _{\theta _i})$, $\lim _{\epsilon \to 0}g_{\mu _{\sigma ,\nu ,c}}\left (\varPhi _{\sigma ,\nu ,c} (\theta _i^{(j)}(\epsilon )) \right )=g_{\mu _{\sigma ,\nu ,c}}(\rho _{\theta _i})$ and $\lim _{\epsilon \to 0}\sum _{j=1}^{k_i}|c_j(\epsilon )|{ }^2=1$, relation (4.83) concludes Step B and the proof of Theorem 4.3. (Note that we use (2.9) of [11] which is true for any $x\in \mathbb {C}\setminus \mathbb {R}$ to deduce that $1-\sigma ^2 c g_{\mu _{\sigma ,\nu ,c}}(\varPhi _{\sigma ,\nu ,c}(\theta _i))= \frac { 1}{1+ \sigma ^2 cg_\nu (\theta _i)}$ by letting x goes to Φ _σ,ν,c(θ _i)).

References

G. Anderson, A. Guionnet, O. Zeitouni, An Introduction to Random Matrices (Cambridge University Press, Cambridge, 2009)
Google Scholar
Z.D. Bai, J.W. Silverstein, Spectral Analysis of Large-Dimensional Random Matrices. Mathematics Monograph Series, vol. 2 (Science Press, Beijing, 2006)
Google Scholar
Z. Bai, J.W. Silverstein, No eigenvalues outside the support of the limiting spectral distribution of information-plus-noise type matrices. Random Matrices Theory Appl. 1(1), 1150004, 44 (2012)
Google Scholar
J.B. Bardet, N. Gozlan, F. Malrieu, P.-A. Zitt, Functional inequalities for Gaussian convolutions of compactly supported measures: explicit bounds and dimension dependence (2018). Bernoulli 24(1), 333–353 (2018)
Google Scholar
S.T. Belinschi, M. Capitaine, Spectral properties of polynomials in independent Wigner and deterministic matrices. J. Funct. Anal. 273, 3901–3963 (2017). https://doi.org/10.1016/j.jfa.2017.07.010
S.T. Belinschi, H. Bercovici, M. Capitaine, M. Février, Outliers in the spectrum of large deformed unitarily invariant models. Ann. Probab. 45(6A), 3571–3625 (2017)
Google Scholar
F. Benaych-Georges, R.N. Rao, The eigenvalues and eigenvectors of finite, low rank perturbations of large random matrices. Adv. Math. 227(1), 494–521 (2011)
Google Scholar
F. Benaych-Georges, R.N. Rao, The singular values and vectors of low rank perturbations of large rectangular random matrices (2011). J. Multivar. Anal. (111), 120–135 (2012)
Google Scholar
S.G. Bobkov, F. Götze, Exponential integrability and transportation cost related to logarithmic Sobolev inequalities. J. Funct. Anal. 163(1), 1–28 (1999)
Google Scholar
M. Capitaine, Additive/multiplicative free subordination property and limiting eigenvectors of spiked additive deformations of Wigner matrices and spiked sample covariance matrices. J. Theor. Probab. 26(3), 595–648 (2013)
Google Scholar
M. Capitaine, Exact separation phenomenon for the eigenvalues of large Information-Plus-Noise type matrices. Application to spiked models. Indiana Univ. Math. J. 63(6), 1875–1910 (2014)
Google Scholar
M. Capitaine, C. Donati-Martin, Strong asymptotic freeness for Wigner and Wishart matrices. Indiana Univ. Math. J. 56(2), 767–803 (2007)
Google Scholar
F. Benaych-Georges, C. Bordenave, M. Capitaine, C. Donati-Martin, A. Knowles, Spectrum of deformed random matrices and free probability, in Advanced Topics in Random Matrices, ed. by F. Benaych-Georges, D. Chafaï, S. Péché, B. de Tiliére. Panoramas et syntheses, vol. 53 (2018)
Google Scholar
R. Couillet, J.W. Silverstein, Z. Bai, M. Debbah, Eigen-inference for energy estimation of multiple sources. IEEE Trans. Inf. Theory 57(4), 2420–2439 (2011)
Google Scholar
R.B. Dozier, J.W. Silverstein, On the empirical distribution of eigenvalues of large dimensional information-plus-noise type matrices. J. Multivar. Anal. 98(4), 678–694 (2007)
Google Scholar
R.B. Dozier, J.W. Silverstein, Analysis of the limiting spectral distribution of large dimensional information-plus-noise type matrices. J. Multivar. Anal. 98(6), 1099–1122 (2007)
Google Scholar
J. Dumont, W. Hachem, S. Lasaulce, Ph. Loubaton, J. Najim, On the capacity achieving covariance matrix for Rician MIMO channels: an asymptotic approach. IEEE Trans. Inf. Theory 56(3), 1048–1069 (2010)
Google Scholar
A. Guionnet, B. Zegarlinski, Lectures on logarithmic Sobolev inequalities, in Séminaire de Probabilités, XXXVI. Lecture Notes in Mathematics, vol. 1801 (Springer, Berlin, 2003)
Google Scholar
U. Haagerup, S. Thorbjørnsen, Random matrices with complex Gaussian entries. Expo. Math. 21, 293–337 (2003)
Google Scholar
U. Haagerup, S. Thorbjørnsen, A new application of random matrices: $\mathrm {Ext}(C^*_{\mathrm { red}}(F_2))$ is not a group. Ann. Math. (2) 162(2), 711–775 (2005)
Google Scholar
W. Hachem, P. Loubaton, J. Najim, Deterministic Equivalents for certain functionals of large random matrices. Ann. Appl. Probab. 17(3), 875–930 (2007)
Google Scholar
A.M. Khorunzhy, B.A. Khoruzhenko, L.A. Pastur, Asymptotic properties of large random matrices with independent entries. J. Math. Phys. 37(10), 5033–5060 (1996)
Google Scholar
O. Ledoit, S. Péché, Eigenvectors of some large sample covariance matrix ensembles. Probab. Theory Relat. Fields 151, 233–264 (2011)
Google Scholar
M. Ledoux, The Concentration of Measure Phenomenon (American Mathematical Society, Providence, 2001)
Google Scholar
P. Loubaton, P. Vallet, Almost sure localization of the eigenvalues in a Gaussian information-plus-noise model. Application to the spiked models. Electron. J. Probab. 16, 1934–1959 (2011)
Google Scholar
L.A. Pastur, M. Shcherbina, Eigenvalue Distribution of Large Random Matrices. Mathematical Surveys and Monographs (American Mathematical Society, Providence, 2011)
Google Scholar
D. Paul, Asymptotics of sample eigenstructure for a large dimensional spiked covariance model. Stat. Sin. 17(4), 1617–1642 (2007)
MathSciNet MATH Google Scholar
P. Vallet, P. Loubaton, X. Mestre, Improved subspace estimation for multivariate observations of high dimension: the deterministic signal case. IEEE Trans. Inf. Theory 58(2), 1043–1068 (2012)
Article MathSciNet Google Scholar
D.V. Voiculescu, K. Dykema, A. Nica, Free Random Variables: A Noncommutative Probability Approach to Free Products with Applications to Random Matrices, Operator Algebras and Harmonic Analysis on Free Groups. CRM Monograph Series, vol. 1 (American Mathematical Society, Providence, 1992). ISBN 0-8218-6999-X
Google Scholar
J.-s. Xie, The convergence on spectrum of sample covariance matrices for information-plus-noise type data. Appl. Math. J. Chinese Univ. Ser. B 27(2), 181191 (2012)
Google Scholar

Download references

Acknowledgements

The author is very grateful to Charles Bordenave and Serban Belinschi for several fruitful discussions and thanks Serban Belinschi for pointing out Lemma 4.14. The author also wants to thank an anonymous referee who provided a much simpler proof of Lemma 4.13 and encouraged the author to establish the results for non diagonal perturbations, which led to an overall improvement of the paper.

Author information

Authors and Affiliations

CNRS, Institut de Mathématiques de Toulouse, Université Paul Sabatier, Toulouse Cedex, France
Mireille Capitaine

Authors

Mireille Capitaine
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mireille Capitaine .

Editor information

Editors and Affiliations

Laboratoire de Mathématiques de Versailles, Université Versailles Saint-Quentin, Versailles, France
Catherine Donati-Martin
Institut Elie Cartan de Lorraine, Vandoeuvre-les-Nancy, France
Antoine Lejay
Laboratoire de Mathématiques de Versailles, Université Versailles Saint-Quentin, Versailles, France
Alain Rouault

Appendices

Appendix 1

We present alternative versions on the one hand of the result in [3] about the lack of eigenvalues outside the support of the deterministic equivalent measure, and on the other hand of the result in [11] about the exact separation phenomenon. These new versions (Theorems 4.5 and 4.6 below) deal with random variables whose imaginary and real parts are independent, but remove the technical assumptions ((1.10) and “b ₁ > 0” in Theorem 1.1 in [3] and “ω _σ,ν,c(b) > 0” in Theorem 1.2 in [11]). The proof of Theorem 4.5 is based on the results of [5]. The arguments of the proof of Theorem 1.2 in [11] and Theorem 4.5 lead to the proof of Theorem 4.6.

Theorem 4.4

Consider

$$\displaystyle \begin{aligned}M_N=( \sigma \frac{ X_N}{\sqrt{N}}+A_N)(\sigma \frac{ X_N}{\sqrt{N}}+A_N)^*,\end{aligned} $$

(4.84)

and assume that

1.
X _N = [X _ij]_{1≤i≤n,1≤j≤N} is a n × N random matrix such that [X _ij]_i≥1,j≥1 is an infinite array of random variables which satisfy (4.1) and (4.2) and such that $ \Re (X_{ij})$ , ℑ(X _ij), $(i,j)\in \mathbb {N}^2$ , are independent, centered with variance 1∕2.
2.
A _N is an n × N nonrandom matrix such that ∥A _N∥ is uniformly bounded.
3.
n ≤ N and, as N tends to infinity, c _N = n∕N → c ∈ ]0, 1].
4.
[x, y], x < y, is such that there exists δ > 0 such that for all large N, $ ]x-\delta ; y+\delta [ \subset \mathbb {R}\setminus \mathrm {supp} (\mu _{\sigma ,\mu _{A_N A_N^*},c_N})$ where $\mu _{\sigma ,\mu _{A_N A_N^*},c_N}$ is the nonrandom distribution which is characterized in terms of its Stieltjes transform which satisfies Eq.(4.4) where we replace c by c _N and ν by $\mu _{A_N A_N^*}.$

Then, we have

$$\displaystyle \begin{aligned}\mathbb P[\,\mathit{\mbox{for all large N}}, \mathrm{spect}(M_N) \subset \mathbb{R} \setminus [x,y] ]=1.\end{aligned} $$

Since, in the proof of Theorem 4.4, we will use tools from free probability theory, for the reader’s convenience, we recall the following basic definitions from free probability theory. For a thorough introduction to free probability theory, we refer to [29].

A $\mathbb {C}^*$-probability space is a pair $\left (\mathbb {A}, \tau \right )$ consisting of a unital $ \mathbb {C}^*$-algebra $\mathbb {A}$ and a state τ on $\mathbb {A}$ i.e. a linear map $\tau : \mathbb {A}\rightarrow \mathbb {C}$ such that $\tau (1_{\mathbb { A}})=1$ and τ(aa ^∗) ≥ 0 for all $a \in \mathbb {A}$. τ is a trace if it satisfies τ(ab) = τ(ba) for every $(a,b)\in \mathbb {A}^2$. A trace is said to be faithful if τ(aa ^∗) > 0 whenever a ≠ 0. An element of $\mathbb {A}$ is called a noncommutative random variable.
The noncommutative ⋆ -distribution of a family a = (a ₁, …, a _k) of noncommutative random variables in a $\mathbb { C}^*$-probability space $\left (\mathbb {A}, \tau \right )$ is defined as the linear functional μ _a : P↦τ(P(a, a ^∗)) defined on the set of polynomials in 2k noncommutative indeterminates, where (a, a ^∗) denotes the 2k-uple $(a_1,\ldots ,a_k,a_1^*,\ldots ,a_k^*)$. For any selfadjoint element a ₁ in $\mathbb {A}$, there exists a probability measure $\nu _{a_1}$ on $\mathbb {R}$ such that, for every polynomial P, we have
$$\displaystyle \begin{aligned}\mu_{a_1}(P)=\int P(t) \mathrm{d}\nu_{a_1}(t).\end{aligned} $$
Then we identify $\mu _{a_1}$ and $\nu _{a_1}$. If τ is faithful then the support of $\nu _{a_1}$ is the spectrum of a ₁ and thus $\|a_1\| = \sup \{|z|, z\in \mathrm {support} (\nu _{a_1})\}$.
A family of elements (a _i)_{i ∈ I} in a $\mathbb {C}^*$-probability space $\left (\mathbb {A}, \tau \right )$ is free if for all $k\in \mathbb {N}$ and all polynomials p ₁, …, p _k in two noncommutative indeterminates, one has
$$\displaystyle \begin{aligned} \tau(p_1(a_{i_1},a_{i_1}^*)\cdots p_k (a_{i_k},a_{i_k}^*))=0\end{aligned} $$
(4.85)

whenever i ₁ ≠ i ₂, i ₂ ≠ i ₃, …, i _k−1 ≠ i _k, (i ₁, …i _k) ∈ I ^k, and $\tau (p_l(a_{i_l},a_{i_l}^*))=0$ for l = 1, …, k.
A noncommutative random variable x in a $\mathbb {C}^*$-probability space $\left (\mathbb {A}, \tau \right )$ is a standard semicircular random variable if x = x ^∗ and for any $k\in \mathbb {N}$,
$$\displaystyle \begin{aligned}\tau(x^k)= \int t^k d\mu_{sc}(t)\end{aligned} $$
where is the semicircular standard distribution.
Let k be a nonnull integer number. Denote by $\mathbb {P}$ the set of polynomials in 2k noncommutative indeterminates. A sequence of families of variables (a _n)_n≥1 = (a ₁(n), …, a _k(n))_n≥1 in C ^∗-probability spaces $\left (\mathbb {A}_n, \tau _n\right )$ converges in ⋆ -distribution, when n goes to infinity, to some k-tuple of noncommutative random variables a = (a ₁, …, a _k) in a $\mathbb {C}^*$-probability space $\left (\mathbb {A}, \tau \right )$ if the map $P\in \mathbb {P} \mapsto \tau _n( P(a_n,a_n^*))$ converges pointwise towards $P\in \mathbb {P} \mapsto \tau ( P(a,a^*))$.
k noncommutative random variables a ₁(n), …, a _k(n), in C ^∗-probability spaces $\left (\mathbb {A}_n, \tau _n\right )$, n ≥ 1, are said asymptotically free if (a ₁(n), …, a _k(n)) converges in ⋆ -distribution, as n goes to infinity, to some noncommutative random variables (a ₁, …, a _k) in a $\mathbb {C}^*$-probability space $\left (\mathbb {A}, \tau \right )$ where a ₁, …, a _k are free.

We will also use the following well known result on asymptotic freeness of random matrices. Let $\mathbb {A}_n$ be the algebra of n × n matrices with complex entries and endow this algebra with the normalized trace defined for any $M\in \mathbb {A}_n$ by $\tau _n(M) =\frac {1}{n}\mathrm {Tr}(M)$. Let us consider a n × n so-called standard G.U.E matrix, i.e. a random Hermitian matrix $\mathbb {G}_n = [\mathbb {G}_{jk}]_{j,k=1}^n$, where $\mathbb {G}_{ii}$, $\sqrt {2} \Re (\mathbb {G}_{ij})$, $\sqrt {2} \Im (\mathbb {G}_{ij})$, i < j are independent centered Gaussian random variables with variance 1. For a fixed real number t independent from n, let $H_n^{(1)}, \ldots , H_n^{(t)}$ be deterministic n × n Hermitian matrices such that $\max _{i=1}^t\sup _n \Vert H_n^{(i)} \Vert < +\infty $ and $(H_n^{(1)}, \ldots , H_n^{(t)})$, as a t-tuple of noncommutative random variables in $(\mathbb {A}_n, \tau _n)$, converges in distribution when n goes to infinity. Then, according to Theorem 5.4.5 in [1], $ \frac {\mathbb {G}_n}{\sqrt {n}}$ and $(H_n^{(1)}, \ldots , H_n^{(t)})$ are almost surely asymptotically free i.e. almost surely, for any polynomial P in t+1 noncommutative indeterminates,

$$\displaystyle \begin{aligned}\tau_n\left\{ P\left({ H_n^{(1)}},\ldots,{ H_n^{(t)}},\frac{\mathbb{G}_n}{\sqrt{n}}\right)\right\} \rightarrow_{n\rightarrow +\infty} \tau \left( P(h_1,\ldots,h_t,s)\right)\end{aligned} $$

(4.86)

where h ₁, …, h _t and s are noncommutative random variables in some $\mathbb {C}^*$-probability space $(\mathbb {A}, \tau )$ such that (h ₁, …, h _t) and s are free, s is a standard semi-circular noncommutative random variable and the distribution of (h ₁, …, h _t) is the limiting distribution of $(H_n^{(1)}, \ldots , H_n^{(t)})$.

Finally, the proof of Theorem 4.4 is based on the following result which can be established by following the proof of Theorem 1.1 in [5]. First, note that the algebra of polynomials in non-commuting indeterminates X ₁, …, X _k, becomes a ⋆ -algebra by anti-linear extension of $(X_{i_1}X_{i_2}\ldots X_{i_m})^*=X_{i_m}\ldots X_{i_2}X_{i_1}$.

Theorem 4.5

Let us consider three independent infinite arrays of random variables, $ [W^{(1)}_{ij}]_{i\geq 1,j\geq 1}$ , $ [W^{(2)}_{ij}]_{i\geq 1,j\geq 1}$ and [X _ij]_i≥1,j≥1 where

for l = 1, 2, $W^{(l)}_{ii}$ , $\sqrt {2}\Re (W^{(l)}_{ij})$ , $\sqrt {2} \Im (W^{(l)}_{ij}), i<j$ , are i.i.d centered and bounded random variables with variance 1 and $W^{(l)}_{ji}=\overline {W^{(l)}_{ij}}$ ,
$\{\Re (X_{ij}), \Im (X_{ij}), i\in \mathbb {N}, j \in \mathbb {N}\}$ are independent centered random variables with variance 1∕2 and satisfy (4.1) and (4.2).

For any $(N,n)\in \mathbb {N}^2$ , define the (n + N) × (n + N) matrix:

$$\displaystyle \begin{aligned}W_{n+N}=\begin{pmatrix} W_n^{(1)} & X_N \\ X_N^* & W_N^{(2)} \end{pmatrix}\end{aligned} $$

(4.87)

where $X_N=[X_{ij}]_{ \begin {array}{ll}1\leq i\leq n{,}1 \leq j\leq N\end {array}}, \; W^{(1)}_n= [W^{(1)}_{ij}]_{1\leq i,j\leq n},\; W^{(2)}_N= [W^{(2)}_{ij}]_{1\leq i,j\leq N}$.

Assume that n = n(N) and $\lim _{N\rightarrow +\infty }\frac {n}{N}=c \in ]0,1].$

Let t be a fixed integer number and P be a selfadjoint polynomial in t + 1 noncommutative indeterminates.

For any $N \in \mathbb {N}^2$ , let $(B_{n+N}^{(1)},\ldots ,B_{n+N}^{(t)})$ be a t −tuple of (n + N) × (n + N) deterministic Hermitian matrices such that for any u = 1, …, t, $ \sup _{N} \Vert B_{n+N}^{(u)} \Vert < \infty $ . Let $(\mathbb {A}, \tau )$ be a C ^∗-probability space equipped with a faithful tracial state and s be a standard semi-circular noncommutative random variable in $(\mathbb {A}, \tau )$ . Let $b_{n+N}=(b_{n+N}^{(1)},\ldots ,b_{n+N}^{(t)})$ be a t-tuple of noncommutative selfadjoint random variables which is free from s in $(\mathbb {A},\tau )$ and such that the distribution of b _n+N in $(\mathbb {A},\tau )$ coincides with the distribution of $(B_{n+N}^{(1)},\ldots , B_{n+N}^{(t)})$ in $({ M}_{n+N}(\mathbb {C}), \frac {1}{n+N}\mathrm {Tr})$.

Let [x, y] be a real interval such that there exists δ > 0 such that, for any large N, [x − δ, y + δ] lies outside the support of the distribution of the noncommutative random variable $ P\left (s, b_{n+N}^{(1)},\ldots ,b_{n+N}^{(t)}\right )$ in $(\mathbb {A},\tau )$ . Then, almost surely, for all large N,

$$\displaystyle \begin{aligned}\mathrm{spect}P\left(\frac{{ W}_{n+N}}{\sqrt{n+N}}, B_{n+N}^{(1)},\ldots,B_{n+N}^{(t)})\right) \subset \mathbb{R} \setminus [x,y].\end{aligned}$$

Proof

We start by checking that a truncation and Gaussian convolution procedure as in Section 2 of [5] can be handled for such a matrix as defined by (4.87), to reduce the problem to a fit framework where,

(H)
for any N, (W _n+N)_ii, $\sqrt {2}\Re ((W_{n+N})_{ij})$, $\sqrt {2} \Im ((W_{n+N})_{ij}), i<j, i \leq n+N,\; j\leq n+N$, are independent, centered random variables with variance 1, which satisfy a Poincaré inequality with common fixed constant C _PI.

Note that, according to Corollary 3.2 in [24], (H) implies that for any $p\in \mathbb {N}$,

$$\displaystyle \begin{aligned} \sup_{N\geq 1} \sup_{1\leq i,j\leq n+N} \mathbb{E}\left(\vert (W_{n+N})_{ij}\vert^p\right) <+\infty.\end{aligned} $$

(4.88)

Remark 4.3

Following the proof of Lemma 2.1 in [5], one can establish that, if (V _ij)_i≥1,j≥1 is an infinite array of random variables such that $\{\Re (V_{ij}), \Im (V_{ij}), i\in \mathbb {N}, j \in \mathbb {N}\}$ are independent centered random variables which satisfy (4.1) and (4.2), then almost surely we have

$$\displaystyle \begin{aligned}\limsup_{N\rightarrow +\infty} \left\| \frac{Z_{n+N}}{\sqrt{N+n}}\right\| \leq 2\sigma^*\end{aligned}$$

where

$$\displaystyle \begin{aligned}Z_{n+N}=\begin{pmatrix} (0) & V_N \\ V_N^* & (0) \end{pmatrix} \; \mbox{ with}\; V_N=[V_{ij}]_{ \begin{array}{ll}1\leq i\leq n{,}1 \leq j\leq N\end{array}} \mbox{and}\; \sigma^*=\left\{\sup_{(i,j) \in \mathbb{N}^2}\mathbb{E}(\vert V_{ij}\vert^2)\right\}^{1/2}.\end{aligned}$$

Then, following the rest of the proof of Section 2 in [5], one can prove that for any polynomial P in 1 + t noncommutative variables, there exists some constant L > 0 such that the following holds. Set $\theta ^*=\sup _{i,j}\mathbb {E}\left (\left |X_{ij}\right |{ }^3\right )$. For any 0 < 𝜖 < 1, there exist C _𝜖 > 8θ ^∗ (such that $C_\epsilon >\max _{l=1,2} \vert W^{(l)}_{11}\vert $ a.s.) and δ _𝜖 > 0 such that almost surely for all large N,

$$\displaystyle \begin{aligned} \left\| P\left(\frac{W_{n+N}}{\sqrt{n+N}},B_{n+N}^{(1)},\ldots,B_{n+N}^{(t)}\right)- P\left(\frac{\tilde W_{n+N}^{C_\epsilon,\delta_\epsilon}}{\sqrt{n+N}}, B_{n+N}^{(1)},\ldots,B_{n+N}^{(t)}\right)\right\|\leq L \epsilon, \end{aligned} $$

(4.89)

where, for any C > 8θ ^∗ such that $C>\max _{l=1,2} \vert W^{(l)}_{11}\vert $ a.s., and for any δ > 0, $\tilde W_{N+n}^{C,\delta }$ is a (n + N) × (n + N) matrix which is defined as follows. Let $(\mathbb {G}_{ij})_{i\geq 1, j\geq 1}$ be an infinite array which is independent of $\{X_{ij}, W^{(1)}_{ij}, W^{(2)}_{ij}, (i,j)\in \mathbb {N}^2\}$ and such that $\sqrt {2} \Re \mathbb {G}_{ij}$, $ \sqrt {2} \Im \mathbb {G}_{ij}$, i < j, $\mathbb {G}_{ii}$, are independent centred standard real gaussian variables and ${\mathbb {G}}_{ij}=\overline {\mathbb {G}}_{ji}$. Set $\mathbb {G}_{n+N}= [\mathbb {G}_{ij}]_{1\leq i,j \leq n+N }$ and define $X_N^C=[X_{ij}^C]_{ \begin {array}{ll}1\leq i\leq n{,}1 \leq j\leq N\end {array}}$as in (4.18) . Set

$$\displaystyle \begin{aligned}\tilde W_{n+N}^C=\begin{pmatrix} W_n^{(1)} & X_N^C \\ (X_N^C)^* & W_N^{(2)} \end{pmatrix}\; \mbox{and} \; \tilde W_{N+n}^{C,\delta}= \frac{ \tilde W_{n+N}^C +\delta \mathbb{G}_{n+N}}{\sqrt{1+\delta^2}}.\end{aligned}$$

$\tilde W_{N+n}^{C,\delta }$ satisfies (H) (see the end of Section 2 in [5]). (4.89) readily yields that it is sufficient to prove Theorem 4.5 for $\tilde W_{N+n}^{C,\delta }$.

Therefore, assume now that W _N+n satisfies (H). As explained in Section 6.2 in [5], to establish Theorem 4.5, it is sufficient to prove that for all $m \in \mathbb {N}$, all self-adjoint matrices γ, α, β ₁, …, β _t of size m × m and all 𝜖 > 0, almost surely, for all large N, we have

$$\displaystyle \begin{aligned} &spect(\gamma \otimes I_{n+N} + \alpha\otimes \frac{W_{n+N}}{\sqrt{n+N}}+ \sum_{u=1}^t \beta_u \otimes B_{n+N}^{(u)}) \\ &\quad \subset spect(\gamma \otimes 1_{\mathbb{A}} + \alpha \otimes s+ \sum_{u=1}^t \beta_u \otimes b_{n+N}^{(u)}) + ]-\epsilon, \epsilon[. {} \end{aligned} $$

(4.90)

((4.90) is the analog of Lemma 1.3 for r = 1 in [5]). Finally, one can prove (4.90) by following Section 5 in [5].

We will need the following lemma in the proof of Theorem 4.4.

Lemma 4.16

Let A _N and c _N be defined as in Theorem 4.4 . Define the following (n + N) × (n + N) matrices: $P=\begin {pmatrix} I_n & (0) \\ (0) & (0) \end {pmatrix} Q=\begin {pmatrix} (0) & (0) \\ (0) & I_N\end {pmatrix}$ and $\mathbf {A}=\begin {pmatrix} (0) & A_N \\ (0) & (0) \end {pmatrix}$ . Let s, p _N, q _N, a _N be noncommutative random variables in some $\mathbb {C}^*$ -probability space $\left ( \mathbb {A}, \tau \right )$ such that s is a standard semi-circular variable which is free with (p _N, q _N, a _N) and the ⋆ -distribution of (A, P, Q) in $\left (M_{N+n}(\mathbb {C}),\frac {1}{N+n} \mathrm {Tr}\right )$ coincides with the ⋆ -distribution of (a _N, p _N, q _N) in $\left ( \mathbb {A}, \tau \right ). $ Then, for any 𝜖 ≥ 0, the distribution of $ ({\sqrt {1+c_N}}\sigma p_N s q_N+ {\sqrt {1+c_N}}\sigma q_N s p_N + {\mathbf {a}}_N+ \mathbf { a}_N^*)^2 +\epsilon p_N$ is $\frac {n}{N+n} T_\epsilon \star \mu _{\sigma , \mu _{A_NA_N^*}, c_N} +\frac {n}{N+n} \mu _{\sigma , \mu _{A_NA_N^*}, c_N}+\frac {N-n}{N+n} \delta _{0}$ where ${T_\epsilon } {\star } \mu _{\sigma , \mu _{A_NA_N^*}, c_N}$ is the pushforward of $ \mu _{\sigma , \mu _{A_NA_N^*}, c_N}$ by the map z↦z + 𝜖.

Proof

Here N and n are fixed. Let k ≥ 1 and C _k be the k × k matrix defined by

Define the k(n + N) × k(n + N) matrices

$$\displaystyle \begin{aligned}\hat A_k= C_k\otimes \mathbf{A},\; \hat P_k=I_k\otimes P, \; \hat Q_k= I_k\otimes Q.\end{aligned}$$

For any k ≥ 1, the ⋆ -distributions of $(\hat A_k, \hat P_k, \hat Q_k)$ in $( M_{k(N+n)}(\mathbb {C}), \frac {1}{k(N+n)}\mathrm {Tr})$ and (A, P, Q) in $( M_{(N+n)}(\mathbb {C}), \frac {1}{(N+n)}\mathrm {Tr})$ respectively, coincide. Indeed, let $\mathbb {K}$ be a noncommutative monomial in $\mathbb {C}\langle X_1,X_2,X_3,X_4\rangle $ and denote by q the total number of occurrences of X ₃ and X ₄ in $\mathbb {K}$. We have

$$\displaystyle \begin{aligned}\mathbb{K}(\hat P_k, \hat Q_k, \hat A_k, \hat A_k^*)=C_k^q \otimes \mathbb{K}(P,Q,\mathbf{A}, {\mathbf{A}}^*),\end{aligned}$$

so that

$$\displaystyle \begin{aligned}\frac{1}{k(n+N)} \mathrm{Tr} \left[\mathbb{K}(\hat P_k, \hat Q_k, \hat A_k, \hat A_k^*)\right]= \frac{1}{k}\mathrm{Tr} (C_k^q) \frac{1}{(n+N)}\mathrm{Tr} \left[\mathbb{K}(P,Q,\mathbf{A}, {\mathbf{A}}^*)\right].\end{aligned}$$

Note that if q is even then $C_k^q=I_k$ so that

$$\displaystyle \begin{aligned}\frac{1}{k(n+N)} \mathrm{Tr} \left[\mathbb{K}(\hat P_k, \hat Q_k, \hat A_k, \hat A_k^*)\right]=\frac{1}{(n+N)}\mathrm{Tr}\left[ \mathbb{K}(P,Q,\mathbf{A}, {\mathbf{A}}^*)\right].\end{aligned} $$

(4.91)

Now, assume that q is odd. Note that PQ = QP = 0, A Q = A, Q A = 0, A P = 0 and P A = A (and then Q A ^∗ = A ^∗, A ^∗Q = 0, P A ^∗ = 0 and A ^∗P = A ^∗). Therefore, if at least one of the terms X ₁X ₂, X ₂X ₁, X ₂X ₃, X ₃X ₁, X ₄X ₂ or X ₁X ₄ appears in the noncommutative product in $\mathbb {K}$, then $ \mathbb {K}(P,Q,\mathbf {A}, {\mathbf {A}}^*)=0,$ so that (4.91) still holds. Now, if none of the terms X ₁X ₂, X ₂X ₁, X ₂X ₃, X ₃X ₁, X ₄X ₂ or X ₁X ₄ appears in the noncommutative product in $\mathbb {K}$, then we have ${\mathbb {K}}(P,Q,\mathbf {A}, {\mathbf {A}}^*)=\tilde {\mathbb {K}}(\mathbf {A}, {\mathbf {A}}^*)$ for some noncommutative monomial $\tilde {\mathbb { K}}\in \mathbb {C} \langle X,Y\rangle $ with degree q. Either the noncommutative product in $\tilde {\mathbb {K}}$ contains a term such as X ^p or Y ^p for some p ≥ 2 and then, since A ² = (A ^∗)² = 0, we have $\tilde {\mathbb {K}}(\mathbf {A}, {\mathbf {A}}^*)=0$, or $\tilde {\mathbb {K}}(X,Y)$ is one of the monomials $ (XY)^{\frac {q-1}{2}}X$ or $Y(XY)^{\frac {q-1}{2}}$. In both cases, we have $\mathrm {Tr} \tilde {\mathbb {K}}(\mathbf {A}, {\mathbf {A}}^*)=0$ and (4.91) still holds.

Now, define the k(N + n) × k(N + n) matrices

$$\displaystyle \begin{aligned}\tilde P_k=\begin{pmatrix} I_{kn} & (0) \\ (0) & (0) \end{pmatrix}, \; \;\tilde Q_k= \begin{pmatrix} (0) & (0) \\ (0) & I_{kN}\end{pmatrix},\; \tilde A_k=\begin{pmatrix} (0) & \check{A} \\ (0) & (0) \end{pmatrix}\end{aligned}$$

where $\check { A}$ is the kn × kN matrix defined by

It is clear that there exists a real orthogonal k(N + n) × k(N + n) matrix O such that $\tilde P_k=O\hat P_k O^*$, $\tilde Q_k=O\hat Q_k O^*$ and $\tilde A_k=O\hat A_k O^*$. This readily yields that the noncommutative ⋆ -distributions of $(\hat A_k, \hat P_k, \hat Q_k)$ and $({\tilde A}_k, \tilde P_k, \tilde Q_k)$ in $( M_{k(N+n)}(\mathbb {C}), \frac {1}{k(N+n)}\mathrm {Tr})$ coincide. Hence, for any k ≥ 1, the distribution of $({\tilde A}_k, \tilde P_k, \tilde Q_k)$ in $( M_{k(N+n)}(\mathbb {C}), \frac {1}{k(N+n)}\mathrm {Tr})$ coincides with the distribution of (a _N, p _N, q _N) in $\left ( \mathbb {A}, \tau \right ). $ By Theorem 5.4.5 in [1], it readily follows that the distribution of $({\sqrt {1+c_N}}\sigma p_N s q_N+ {\sqrt {1+c_N}}\sigma q_N s p_N + {\mathbf {a}}_N+ {\mathbf {a}}_N^*)^2 +\epsilon p_N$ is the almost sure limiting distribution, when k goes to infinity, of $({\sqrt {1+c_N}}\sigma \tilde P_k\frac {\mathbb {G} }{\sqrt {k(N+n)}}\tilde Q_k+ {\sqrt {1+c_N}} \sigma \tilde Q_k \frac {\mathbb {G} }{\sqrt {k(N+n)}}\tilde P_k+\tilde A_k+\tilde A_k^*)^2+\epsilon \tilde P_k$ in $( M_{k(N+n)}(\mathbb {C}), \frac {1}{k(N+n)}\mathrm {Tr})$, where $\mathbb {G}$ is a k(N + n) × k(N + n) GUE matrix with entries with variance 1. Now, note that

$$\displaystyle \begin{aligned}&\left[{\sqrt{1+c_N}}\sigma\left\{\tilde P_k \frac{\mathbb{G} }{\sqrt{k(N+n)}} \tilde Q_k +\tilde Q_k \frac{\mathbb{G} }{\sqrt{k(N+n)}}\tilde P_k\right\} +\tilde A_k +\tilde A_k^*\right]^2 +\epsilon \tilde P_k \\ & \quad = \begin{pmatrix} (\sigma \frac{\mathbb{G}_{kn\times kN}}{\sqrt{kN}}+\check{ A})(\sigma \frac{\mathbb{G}_{kn\times kN}}{\sqrt{kN}}+\check{ A})^*+\epsilon I_{kn} &(0)\\ (0)& (\sigma \frac{\mathbb{G}_{kn\times kN}}{\sqrt{kN}}+\check{A})^*(\sigma \frac{\mathbb{G}_{kn\times kN}}{\sqrt{kN}}+\check{A}) \end{pmatrix}\end{aligned} $$

where $\mathbb {G}_{kn\times kN}$ is the upper right kn × kN corner of $ \mathbb {G}$. Thus, noticing that $\mu _{\check { A}\check { A}^*}=\mu _{A_NA_N^*}$, the lemma follows from [15].

Proof of Theorem 4.4

Let W be a (n + N) × (n + N) matrix as defined by (4.87) in Theorem 4.5. Note that, with the notations of Lemma 4.16, for any 𝜖 ≥ 0,

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle &\displaystyle \begin{pmatrix} (\sigma \frac{X_N}{\sqrt{N}}+A_N)(\sigma \frac{X_N}{\sqrt{N}}+A_N)^*+\epsilon I_n &\displaystyle (0)\\ (0)&\displaystyle (\sigma \frac{X_N}{\sqrt{N}}+A_N)^*(\sigma \frac{X_N}{\sqrt{N}}+A_N) \end{pmatrix}\\ &\displaystyle &\displaystyle \quad = \begin{pmatrix} (0)&\displaystyle (\sigma \frac{X_N}{\sqrt{N}}+A_N) \\ (\sigma \frac{X_N}{\sqrt{N}}+A_N)^* &\displaystyle (0) \end{pmatrix}^2 +\epsilon P \\ &\displaystyle &\displaystyle \quad = \left({\sqrt{1+c_N}}P\frac{\sigma W}{\sqrt{N+n}}Q+ {\sqrt{1+c_N}}Q\frac{\sigma W}{\sqrt{N+n}}P+\mathbf{A}+{\mathbf{A}}^*\right)^2+\epsilon P. \end{array} \end{aligned} $$

Thus, for any 𝜖 ≥ 0,

$$\displaystyle \begin{aligned} & \mathrm{spect}\left\{(\sigma \frac{X_N}{\sqrt{N}}+A)(\sigma \frac{X_N}{\sqrt{N}}+A)^*+\epsilon I_n\right\}\\ &\quad \subset \mathrm{spect}\left\{\left({\sqrt{1+c_N}}P\frac{\sigma W}{\sqrt{N+n}}Q+ {\sqrt{1+c_N}}Q\frac{\sigma W}{\sqrt{N+n}}P+\mathbf{A}+{\mathbf{A}}^*\right)^2+\epsilon P\right\}. {}\end{aligned} $$

(4.92)

Let [x, y] be such that there exists δ > 0 such that for all large N, $ ]x-\delta ; y+\delta [ \subset \mathbb {R}\setminus \mathrm {supp} (\mu _{\sigma ,\mu _{A_N A_N^*},c_N})$.

(i)
Assume x > 0. Then, according to Lemma 4.16 with 𝜖 = 0, there exists δ′ > 0 such that for all large n, ]x − δ′;y + δ′[ is outside the support of the distribution of $ ({\sqrt {1+c_N}}\sigma p_N s q_N+ {\sqrt {1+c_N}}\sigma q_N s p_N + {\mathbf {a}}_N+ {\mathbf {a}}_N^*)^2 $. We readily deduce that almost surely for all large N, according to Theorem 4.5, there is no eigenvalue of $({\sqrt {1+c_N}}P\frac {\sigma W}{\sqrt {N+n}}Q+ {\sqrt {1+c_N}}Q\frac {\sigma W}{\sqrt {N+n}}P+\mathbf {A}+{\mathbf {A}}^*)^2 $ in [x, y]. Hence, by (4.92) with 𝜖 = 0, almost surely for all large N, there is no eigenvalue of M _N in [x, y].
(ii)
Assume x = 0 and y > 0. There exists 0 < δ′ < y such that [0, 3δ′] is for all large N outside the support of $\mu _{\sigma , \mu _{A_NA_N^*}, c_N}$. Hence, according to Lemma 4.16, [δ′∕2, 3δ′] is outside the support of the distribution of $ ({\sqrt {1+c_N}}\sigma p_N s q_N+ {\sqrt {1+c_N}}\sigma q_N s p_N + {\mathbf {a}}_N+ {\mathbf {a}}_N^*)^2 +\delta ' p_N$. Then, almost surely for all large N, according to Theorem 4.5, there is no eigenvalue of $({\sqrt {1+c_N}}P\frac {\sigma W}{\sqrt {N+n}}Q+ {\sqrt {1+c_N}}Q\frac {\sigma W}{\sqrt {N+n}}P+\mathbf {A}+{\mathbf {A}}^*)^2 +\delta ' P $ in [δ′, 2δ′] and thus, by (4.92), no eigenvalue of $ (\sigma \frac {X}{\sqrt {N}}+A_N)(\sigma \frac {X_N}{\sqrt {N}}+A_N)^*+\delta ' I_n$ in [δ′, 2δ′]. It readily follows that, almost surely for all large N, there is no eigenvalue of $ (\sigma \frac {X_N}{\sqrt {N}}+A_N)(\sigma \frac {X_N}{\sqrt {N}}+A_N)^*$ in [0, δ′]. Since moreover, according to (i), almost surely for all large N, there is no eigenvalue of $ (\sigma \frac {X_N}{\sqrt {N}}+A_N)(\sigma \frac {X_N}{\sqrt {N}}+A_N)^*$ in [δ′, y], we can conclude that there is no eigenvalue of M _N in [x, y].

The proof of Theorem 4.4 is now complete. □

We are now in a position to establish the following exact separation phenomenon.

Theorem 4.6

Let M _n as in (4.84) with assumptions [1–4] of Theorem 4.4 . Assume moreover that the empirical spectral measure $\mu _{A_NA_N^*}$ of $A_NA_N^*$ converges weakly to some probability measure ν. Then for N large enough,

$$\displaystyle \begin{aligned}\omega_{{\sigma,\nu,c}}([x,y])=[\omega_{{\sigma,\nu,c}}(x);\omega_{{\sigma,\nu,c}}(y)] \subset \mathbb{R} \setminus \mathit{\mbox{supp}}(\mu _{A_N A_N^*}),\end{aligned} $$

(4.93)

where ω _σ,ν,c is defined in (4.5). With the convention that $\lambda _0(M_N)=\lambda _0(A_NA_N^*)=+\infty $ and $\lambda _{n+1}(M_N)=\lambda _{n+1}(A_NA_N^*)=-\infty $ , for N large enough, let i _N ∈{0, …, n} be such that

$$\displaystyle \begin{aligned}\lambda_{i_N+1}(A_N A_N^*) <\omega_{{\sigma,\nu,c}}(x) \mathit{\mbox{ ~ and ~}} \lambda_{i_N}(A_N A_N^*) > \omega_{{\sigma,\nu ,c}}(y).\end{aligned} $$

(4.94)

Then

$$\displaystyle \begin{aligned}P[\,\mathit{\mbox{for all large N}}, \lambda_{i_N+1}(M_N) <x\mathit{\mbox{ and}} ~ \lambda_{i_N}(M_N)>y] = 1.\end{aligned} $$

(4.95)

Remark 4.4

Since $\mu _{\sigma ,\mu _{A_N A_N^*},c_N}$ converges weakly towards μ _σ,ν,c assumption 4. implies that ∀0 < τ < δ, $[x-\tau ; y+\tau ] \subset \mathbb {R} \setminus \mathrm {supp}~ \mu _{\sigma ,\nu ,c}$.

Proof of Theorem 4.4

(4.93) is proved in Lemma 3.1 in [11].

If ω _σ,ν,c(x) < 0, then i _N = n in (4.94) and moreover we have, for all large N, $\omega _{{\sigma ,\mu _{A_N A_N^*},c_N}}(x)<0$. According to Lemma 2.7 in [11], we can deduce that, for all large N, [x, y] is on the left hand side of the support of $\mu _{\sigma ,\mu _{A_N A_N^*},c_N}$ so that ] −∞;y + δ] is on the left hand side of the support of $\mu _{\sigma ,\mu _{A_N A_N^*},c_N}$. Since [−|y|− 1, y] satisfies the assumptions of Theorem 4.4, we readily deduce that almost surely, for all large N, λ _n(M _N) > y. Hence (4.95) holds true.
If ω _σ,ν,c(x) ≥ 0, we first explain why it is sufficient to prove (4.95) for x such that ω _σ,ν,c(x) > 0. Indeed, assume for a while that (4.95) is true whenever ω _σ,ν,c(x) > 0. Let us consider any interval [x, y] satisfying condition 4. of Theorem 4.4 and such that ω _σ,ν,c(x) = 0; then i _N = n in (4.94). According to Proposition 4.1, $\omega _{{\sigma ,\nu ,c}}(\frac {x+y}{2})> 0$ and then almost surely for all large N, λ _n(M _N) > y. Finally, sticking to the proof of Theorem 1.2 in [11] leads to (4.95) for x such that ω _σ,ν,c(x) > 0.

Appendix 2

We first recall some basic properties of the resolvent (see [12, 22]).

Lemma 4.17

For a N × N Hermitian matrix M, for any $z \in \mathbb {C}\setminus \mathrm {spect}(M)$ , we denote by G(z) := (zI _N − M)⁻¹ the resolvent of M.

Let $z \in \mathbb {C}\setminus \mathbb {R}$ ,

(i)
∥G(z)∥≤|ℑz|⁻¹.
(ii)
|G(z)_ij|≤|ℑz|⁻¹ for all i, j = 1, …, N.
(iii)
G(z)M = MG(z) = −I _N + zG(z).

Moreover, for any N × N Hermitian matrices M ₁ and M ₂,

$$\displaystyle \begin{aligned}(zI_N-M_1)^{-1}-(zI_N-M_2)^{-1}=(zI_N-M_1)^{-1}(M_1-M_2)(zI_N-M_2)^{-1}.\end{aligned}$$

The following technical lemmas are fundamental in the approach of the present paper.

Lemma 4.18 (Lemma 4.4 in [6])

Let $h: \mathbb {R}\rightarrow \mathbb {R}$ be a continuous function with compact support. Let B _N be a N × N Hermitian matrix and C _N be a N × N matrix. Then

$$\displaystyle \begin{aligned}\mathrm{Tr} \left[h(B_N) C_N\right]= - \lim_{y\rightarrow 0^{+}}\frac{1}{\pi} \int \Im \mathrm{Tr} \left[(t+\mathrm{i} y-B_N)^{-1}C_N\right] h(t) dt. \end{aligned} $$

(4.96)

Moreover, if B _N is random, we also have

$$\displaystyle \begin{aligned}\mathbb{E}\mathrm{Tr} \left[h(B_N) C_N\right]= - \lim_{y\rightarrow 0^{+}}\frac{1}{\pi} \int \Im \mathbb{E}\mathrm{ Tr} \left[(t+\mathrm{i} y-B_N)^{-1}C_N\right] h(t) dt. \end{aligned} $$

(4.97)

Lemma 4.19

Let f be an analytic function on $\mathbb {C}\setminus \mathbb {R}$ such that there exist some polynomial P with nonnegative coefficients, and some positive real number α such that

$$\displaystyle \begin{aligned} \forall z \in \mathbb{C}\setminus \mathbb{R},~~\vert f(z)\vert \leq (\vert z\vert +1)^\alpha P(\vert \Im z\vert ^{-1}). \end{aligned}$$

Then, for any h in $\mathbb {C}^\infty (\mathbb {R}, \mathbb {R})$ with compact support, there exists some constant τ depending only on h, α and P such that

$$\displaystyle \begin{aligned}\limsup _{y\rightarrow 0^+}\vert \int _{\mathbb{R}} h (x)f(x+\mathrm{i} y)dx\vert < \tau.\end{aligned} $$

We refer the reader to the Appendix of [12] where it is proved using the ideas of [20].

Finally, we recall some facts on Poincaré inequality. A probability measure μ on $\mathbb {R}$ is said to satisfy the Poincaré inequality with constant C _PI if for any $\mathbb {C}^1$ function $f: \mathbb {R}\rightarrow \mathbb {C}$ such that f and f′ are in L ²(μ),

$$\displaystyle \begin{aligned}\mathbf{V}(f)\leq C_{PI}\int \vert f' \vert^2 d\mu ,\end{aligned} $$

with $\mathbf {V}(f) = \int \vert f-\int f d\mu \vert ^2 d\mu $.

We refer the reader to [9] for a characterization of the measures on $\mathbb {R}$ which satisfy a Poincaré inequality.

If the law of a random variable X satisfies the Poincaré inequality with constant C _PI then, for any fixed α ≠ 0, the law of αX satisfies the Poincaré inequality with constant α ²C _PI.

Assume that probability measures μ ₁, …, μ _M on $\mathbb {R}$ satisfy the Poincaré inequality with constant C _PI(1), …, C _PI(M) respectively. Then the product measure μ ₁ ⊗⋯ ⊗ μ _M on $\mathbb {R}^M$ satisfies the Poincaré inequality with constant $\displaystyle {C_{PI}^*=\max _{i\in \{1,\ldots ,M\}}C_{PI}(i)}$ in the sense that for any differentiable function f such that f and its gradient gradf are in L ²(μ ₁ ⊗⋯ ⊗ μ _M),

$$\displaystyle \begin{aligned}\mathbf{V}(f)\leq C_{PI}^* \int \Vert \mathrm{grad} f \Vert_2 ^2 d\mu_1\otimes \cdots \otimes \mu_M\end{aligned}$$

with $\mathbf {V}(f) = \int \vert f-\int f d\mu _1\otimes \cdots \otimes \mu _M \vert ^2 d\mu _1\otimes \cdots \otimes \mu _M$ (see Theorem 2.5 in [18]) .

Lemma 4.20 (Theorem 1.2 in [4])

Assume that the distribution of a random variable X is supported in [−C;C] for some constant C > 0. Let g be an independent standard real Gaussian random variable. Then X + δg satisfies a Poincaré inequality with constant $C_{PI}\leq \delta ^2 \exp \left ( 4C^2/\delta ^2\right )$.

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Capitaine, M. (2018). Limiting Eigenvectors of Outliers for Spiked Information-Plus-Noise Type Matrices. In: Donati-Martin, C., Lejay, A., Rouault, A. (eds) Séminaire de Probabilités XLIX. Lecture Notes in Mathematics(), vol 2215. Springer, Cham. https://doi.org/10.1007/978-3-319-92420-5_4

Download citation

DOI: https://doi.org/10.1007/978-3-319-92420-5_4
Published: 08 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-92419-9
Online ISBN: 978-3-319-92420-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

Limiting Eigenvectors of Outliers for Spiked Information-Plus-Noise Type Matrices

Abstract

Similar content being viewed by others

Outliers in the Single Ring Theorem

Complex Outliers of Hermitian Random Matrices

On the principal components of sample covariance matrices

Keywords

4.1 Introduction

Remark 4.1

Proposition 4.1

Theorem 4.1 ([11])

Theorem 4.2 ([11])

Remark 4.2

Theorem 4.3

4.2 Sketch of the Proof

Step A

Step B

4.3 Proof of Proposition 4.2

Lemma 4.1

Proof

Lemma 4.2

Lemma 4.3

Lemma 4.4 (See Lemma 8.2 [10])

Lemma 4.5

Proof

Lemma 4.6

Proof

Lemma 4.7

Lemma 4.8

Lemma 4.9

Proof

Proposition 4.4

Proof

4.4 Proof of Proposition 4.3

4.4.1 Matricial Master Equation

Lemma 4.10 (Lemma 2.4.5 [1])

Proposition 4.5

Proof

4.4.2 Variance Estimates

Lemma 4.11

Proof

Lemma 4.12

Proof

Corollary 4.1

Proof

4.4.3 Estimates of Resolvent Entries

Lemma 4.13

Proof

Lemma 4.14

Proof

4.5 Proof of Theorem 4.3

Step A

Step B

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Appendices

Appendix 1

Theorem 4.4

Theorem 4.5

Proof

Lemma 4.16

Proof

Proof of Theorem 4.4

Theorem 4.6

Remark 4.4

Proof of Theorem 4.4

Appendix 2

Lemma 4.17

Lemma 4.18 (Lemma 4.4 in [6])

Lemma 4.19

Lemma 4.20 (Theorem 1.2 in [4])

Rights and permissions

Copyright information

About this chapter

Cite this chapter